Building LLMs from the Ground Up: A 3-hour Coding Workshop

DESCRIPTION:
This tutorial is aimed at coders interested in understanding the building blocks of large language models (LLMs), how LLMs work, and how to code them from the ground up in PyTorch. We will kick off with an introduction to LLMs, recent milestones, and their use cases. Then we will code a small GPT-like LLM ourselves, including its data input pipeline, core architecture components, and pretraining code. After understanding how everything fits together and how to pretrain an LLM, we will learn how to load pretrained weights and finetune LLMs using open-source libraries.

---

OUTLINE:
0:00 – Workshop overview
2:17 – Part 1: Intro to LLMs
9:14 – Workshop materials
10:48 – Part 2: Understanding LLM input data
23:25 – A simple tokenizer class
41:03 – Part 3: Coding an LLM architecture
45:01 – GPT-2 and Llama 2
1:07:11 – Part 4: Pretraining
1:29:37 – Part 5.1: Loading pretrained weights
1:45:12 – Part 5.2: Pretrained weights via LitGPT
1:53:09 – Part 6.1: Instruction finetuning
2:08:21 – Part 6.2: Instruction finetuning via LitGPT
2:26:45 – Part 6.3: Benchmark evaluation
2:36:55 – Part 6.4: Evaluating conversational performance
2:42:40 – Conclusion

Comments

Dude, I keep accidentally running into your content while learning this material. The other day I was firing off weirdly specific Google searches while trying to build intuition on how self-attention works, and I found a year-old comment you wrote on Reddit that nailed what I was having trouble with. Just bought your book MEAP, you've been doing an amazing job, keep it up!

joneskin

Dear Sebastian,

I hope you are doing well. I am writing to express my deepest gratitude for your incredible effort and dedication to teaching on the online platform. Your generosity in sharing your knowledge for free has made a profound impact on so many of us.

Your classes have been a beacon of light in these challenging times, providing not only education but also inspiration and hope. The clarity with which you explain complex topics and your unwavering patience in addressing our questions have been truly remarkable.

Thank you for your time, energy, and passion for teaching. You've made a significant difference in my learning journey, and I am immensely grateful for the knowledge and wisdom you've imparted.

Wishing you all the best in your future endeavors. 😊

Warm regards,
Hari

haribhauhud

Sebastian, I want to sincerely thank you for providing such good material. I cannot express my gratitude enough! I admire your desire to share this content with such clarity and human touch! Thanks a lot!

paolodragol

This is a gem for me as an MSc AI student. Thank you for making this.

atlasflare

Your deep learning series got me through STAT 453 at UW Madison, and now this workshop has been the perfect transition into LLMs! Great video, Sebastian!

masonholcombe

23 mins in. This is by far the best tutorial I have seen on building LLMs from scratch. I have followed you for a while, Sebastian, for all the great contributions you have made over the years, but you have outdone yourself once again. Well done, man, and thank you.

devtest

Just finished the book, extremely pedagogical and valuable. Great job as always Sebastian!

Alexander-jeqc

Thank you for such an amazing book, such an invaluable source for a beginner like me!
I watched the 4-hour lecture by Karpathy and initially thought that your content could hardly be as impressive. However, I am wowed reading through every single chapter of your book.

taido

Thanks a lot, Sebastian! Coding from scratch made most concepts crystal clear for me.

hokage

Sebastian, I like your in-depth content. We appreciate the time you put into this.

maysammansor

Super helpful. Thanks for sharing.
Looking forward to more such videos on LLMs.
Keep it up!!

nish

@SebastianRaschka - I just bought the book (How to build a LLM from scratch). Thank you for all your great effort! :) I look forward to your new content soon. :)

thehard-coder

Mr. Sebastian, I found your channel yesterday. So grateful to you for such top-notch education.

prashlovessamosa

Make more videos, professor! Your knowledge is enlightening me a lot!

shreyaskatiyar

Thank you for such an awesome contribution towards democratizing LLM research.

bosepukur

What a time to be alive haha, love your book.

thefatcat-hdze

Thank you. I recently got your book and this stuff is invaluable. So much stuff out there, and it's not all organized in a way that's easy to digest. Your books / videos are great!

hasaniqbal

Just finished the video, thank you very much for the detailed explanation. Next step is reading your book :) 🙂

parthsarthisharma

Thank you! I love your work, Sebastian. 😊
I hope my small token of appreciation will motivate you further to create more content like this.
By the way, I already own most of your books. My favorite is your recent one - Build a Large Language Model (from Scratch). 📚

iamsnglrty

I have been following your blog for a very long time. I have already purchased your new LLM book. I have also purchased your machine learning books. Please upload more content like this.

amitabhachakraborty
