Deep Dive into LLMs like ChatGPT

The best introduction in the world to how large language models (LLMs) like ChatGPT work. It covers the three main stages of their training: pre-training on vast amounts of internet text, supervised fine-tuning to become helpful assistants, and reinforcement learning to improve problem-solving skills. The video also discusses LLM psychology, including why they hallucinate, how they use tools, and their limitations. Finally, it looks at future capabilities like multimodality and agent-like behavior.

Visit Website

Introduction

This is a general audience deep dive into the Large Language Model (LLM) AI technology that powers ChatGPT and related products. It covers the full training stack of how the models are developed, along with mental models of how to think about their "psychology", and how to get the best use of them in practical applications. I have one "Intro to LLMs" video already from about a year ago, but that is just a re-recording of a random talk, so I wanted to loop around and do a much more comprehensive version.

Instructor

Andrej was a founding member at OpenAI (2015) and then Sr. Director of AI at Tesla (2017-2022), and is now a founder at Eureka Labs, which is building an AI-native school. His goal in this video is to raise knowledge and understanding of the state of the art in AI, and empower people to effectively use the latest and greatest in their work. Find more at https://karpathy.ai/ and https://x.com/karpathy

Chapters

00:00:00 introduction
00:01:00 pretraining data (internet)
00:07:47 tokenization
00:14:27 neural network I/O
00:20:11 neural network internals
00:26:01 inference
00:31:09 GPT-2: training and inference
00:42:52 Llama 3.1 base model inference
00:59:23 pretraining to post-training
01:01:06 post-training data (conversations)
01:20:32 hallucinations, tool use, knowledge/working memory
01:41:46 knowledge of self
01:46:56 models need tokens to think
02:01:11 tokenization revisited: models struggle with spelling
02:04:53 jagged intelligence
02:07:28 supervised finetuning to reinforcement learning
02:14:42 reinforcement learning
02:27:47 DeepSeek-R1
02:42:07 AlphaGo
02:48:26 reinforcement learning from human feedback (RLHF)
03:09:39 preview of things to come
03:15:15 keeping track of LLMs
03:18:34 where to find LLMs
03:21:46 grand summary

Information

Website: www.youtube.com
Authors: Andrej Karpathy
Published date: 2025/02/06

More Items

Anthropic's Interactive Prompt Engineering Tutorial

2024

Anthropic

An interactive prompt engineering tutorial released by Anthropic. The GitHub repository provides a step-by-step course (9 chapters + appendix) with lessons and hands-on exercises for building and troubleshooting prompts for Claude. It uses Claude 3 Haiku for examples, includes example playgrounds and an answer key, and is targeted at people who want to learn practical prompt design and common failure modes.

anthropic claude tutorial course LLM+2

openai-cookbook

2022

OpenAI

The OpenAI Cookbook is an open-source GitHub repository from OpenAI that provides example code, guides, and recipes for using the OpenAI API. It contains practical examples covering prompt engineering, text generation, embeddings, retrieval-augmented generation (RAG), image generation, fine-tuning, integrations, and more. Most examples are in Python and designed to help developers learn and integrate the API quickly.

openai ai-api github tutorial ai-coding+2

Hands-On Large Language Models

2024

Jay Alammar, Maarten Grootendorst +1

Official code repository for the O'Reilly book "Hands-On Large Language Models" by Jay Alammar and Maarten Grootendorst. It provides runnable notebooks, visual explanations, and practical examples across chapters covering tokens and embeddings, transformer internals, text classification, semantic search, fine-tuning, multimodal models, and more. Recommended to run in Google Colab for easy setup.

book llm LLM github tutorial+5