GPT (nanoGPT) from a beginner’s perspective (Part 2 Final)

In this video I went through Karpathy's nanoGPT codebase, explaining the final part: self-attention.

nanoGPT is a character-level implementation of a GPT (Generative Pre-trained Transformer) based on the Transformer architecture from the "Attention Is All You Need" paper.
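The core of the final part is causal (masked) self-attention. A minimal single-head NumPy sketch of the idea (my own illustration, not Karpathy's actual code; `Wq`, `Wk`, `Wv` are hypothetical projection matrices):

```python
import numpy as np

def causal_self_attention(x, Wq, Wk, Wv):
    """Single-head causal self-attention over a (T, C) sequence."""
    T, C = x.shape
    q, k, v = x @ Wq, x @ Wk, x @ Wv           # project to queries, keys, values
    scores = (q @ k.T) / np.sqrt(k.shape[-1])  # (T, T) scaled dot-product affinities
    mask = np.tril(np.ones((T, T), dtype=bool))
    scores = np.where(mask, scores, -np.inf)   # triangular mask: no peeking at future tokens
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # row-wise softmax
    return weights @ v                         # weighted sum of value vectors

rng = np.random.default_rng(0)
T, C, H = 4, 8, 16
x = rng.normal(size=(T, C))
Wq, Wk, Wv = (rng.normal(size=(C, H)) for _ in range(3))
out = causal_self_attention(x, Wq, Wk, Wv)
print(out.shape)  # (4, 16)
```

Because of the triangular mask, position 0 can only attend to itself, so its output is exactly its own value vector `x[0] @ Wv`.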

Karpathy is my role model in the field of AI research; he is a cofounder of OpenAI and former Director of AI at Tesla.

My GPT Practice Repo

References

#gpt #nanogpt #karpathy #ai #nlp #llm #openai #google
Comments

For optimizing computation: during inference, shouldn't we drop the triangular mask if we only want to predict the single last token? It seems like unnecessary computation, since in non-training mode we take only the last token's logits, which predict the next token after the given input sequence.

madragonse

Where can I find your gpt_dev.ipynb?

rpraver