[UPDATE] A list of resources, articles, and opinion pieces relating to large language models & robotics

A black keyboard at the bottom of the picture has an open book on it, with red words in labels floating on top, with a letter A balanced on top of them. The perspective makes the composition form a kind of triangle from the keyboard to the capital A. The AI filter makes it look like a messy, with a kind of cartoon style. Teresa Berndtsson / Better Images of AI / Letter Word Text Taxonomy / Licenced by CC-BY 4.0.

We’ve collected some of the articles, opinion pieces, videos and resources relating to large language models (LLMs). Some of these links also cover other generative models. We will periodically update this list to add any further resources of interest. This article represents the third in the series. (The previous versions are here: v1 | v2.)

What LLMs are and how they work

What are Generative AI models?, Kate Soule, video from IBM Technology.
Introduction to Large Language Models, John Ewald, video from Google Cloud Tech.
What is GPT-4 and how does it differ from ChatGPT?, Alex Hern, The Guardian.
What Is ChatGPT Doing … and Why Does It Work?, Stephen Wolfram.
Understanding Large Language Models — A Transformative Reading List, Sebastian Raschka.
How ChatGPT is Trained, video by Ari Seff.
ChatGPT – what is it? How does it work? Should we be excited? Or scared?, Deep Dhillon, The Radical AI podcast.
Everything you need to know about ChatGPT, Joanna Dungate, Turing Institute Blog.
Turing video lecture series on foundation models: Session 1 | Session 2 | Session 3 | Session 4.
Bard: What is Google’s Bard and how is it different to ChatGPT?, BBC.
Bard FAQs, Google.
Large Language Models from scratch | Large Language Models: Part 2, videos from Graphics in 5 minutes.
What are Large Language Models (LLMs)?, video from Google for Developers.
Risks of Large Language Models (LLM), Phaedra Boinodiris, video from IBM Technology.
How ChatGPT and Other LLMs Work—and Where They Could Go Next, David Nield, Wired.
What are Large Language Models, Machine Learning Mastery.
How To Delete Your Data From ChatGPT, Matt Burgess, Wired.
5 Ways ChatGPT Can Improve, Not Replace, Your Writing, David Nield, Wired.
AI prompt engineering: learn how not to ask a chatbot a silly question, Callum Bains, The Guardian.

Journal, conference, arXiv, and other articles

Scientists’ Perspectives on the Potential for Generative AI in their Fields, Meredith Ringel Morris, arXiv.
LaMDA: Language Models for Dialog Applications, Romal Thoppilan et al, arXiv.
What Language Model to Train if You Have One Million GPU Hours?, Teven Le Scao et al, arXiv.
Alpaca: A Strong, Replicable Instruction-Following Model, Rohan Taori et al.
Process for Adapting Language Models to Society (PALMS) with Values-Targeted Datasets, Irene Solaiman, Christy Dennison, NeurIPS 2021.
On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? , Emily Bender, Timnit Gebru, Angelina McMillan-Major, Shmargaret Shmitchell, FAccT 2021.
A Survey of Large Language Models, Wayne Xin Zhao et al, arXiv.
A Watermark for Large Language Models, John Kirchenbauer, Jonas Geiping, Yuxin Wen, Jonathan Katz, Ian Miers, Tom Goldstein, arXiv.
Between Subjectivity and Imposition: Power Dynamics in Data Annotation for Computer Vision, Milagros Miceli, Martin Schuessler, Tianling Yang, Proceedings of the ACM on Human-Computer Interaction.
AI classifier for indicating AI-written text, OpenAI.
Pythia: A Suite for Analyzing Large Language Models Across Training and Scaling, Stella Biderman et al, arXiv.
GPT-4 Technical Report, OpenAI, arXiv.
GPT-4 System Card, OpenAI.
BloombergGPT: A Large Language Model for Finance, Shijie Wu et al, arXiv.
Evading Watermark based Detection of AI-Generated Content, Zhengyuan Jiang, Jinghuai Zhang, Neil Zhenqiang Gong, arXiv.
PaLM 2 Technical Report, Google.
Large language models (LLM) and ChatGPT: what will the impact on nuclear medicine be?, Ian L. Alberts, Lorenzo Mercolli, Thomas Pyka, George Prenosil, Kuangyu Shi, Axel Rominger, and Ali Afshar-Oromieh, Eur J Nucl Med Mol Imaging.
Ethics of large language models in medicine and medical research, Hanzhou Li, John T Moon, Saptarshi Purkayastha, Leo Anthony Celi, Hari Trivedi and Judy W Gichoya, The Lancet.
Science in the age of large language models, Abeba Birhane, Atoosa Kasirzadeh, David Leslie & Sandra Wachter, Nature.
Standardizing chemical compounds with language models, Miruna T Cretu, Alessandra Toniato, Amol Thakkar, Amin A Debabeche, Teodoro Laino and Alain C Vaucher, Machine Learning: Science and Technology.
How to keep text private? A systematic review of deep learning methods for privacy-preserving natural language processing, Samuel Sousa & Roman Kern, Artificial Intelligence Review.
Material transformers: deep learning language models for generative materials design, Nihang Fu, Lai Wei, Yuqi Song, Qinyang Li, Rui Xin, Sadman Sadeed Omee, Rongzhi Dong, Edirisuriya M Dilanga Siriwardane and Jianjun Hu, Machine Learning: Science and Technology.
Large language models encode clinical knowledge, Karan Singhal et al, Nature.
SELFormer: molecular representation learning via SELFIES language models, Atakan Yüksel, Erva Ulusoy, Atabey Ünlü and Tunca Doğan, Machine Learning: Science and Technology.
GPT-4 + Stable-Diffusion = ?: Enhancing Prompt Understanding of Text-to-Image Diffusion Models with Large Language Models, Long Lian, Boyi Li, Adam Yala, and Trevor Darrell, BAIR blog.

[UPDATE] A list of resources, articles, and opinion pieces relating to large language models & robotics

What LLMs are and how they work

Journal, conference, arXiv, and other articles

Newspaper, magazine, University website, and blogpost articles

Reports

Podcasts and video discussions

Focus on LLMs and education

Relating to art and other creative processes

Pertaining to robotics

Misinformation, fake news and the impact on journalism

Regulation and policy

AIhub