Llama - EXPLAINED!

ABOUT ME

RESOURCES

PLAYLISTS FROM MY CHANNEL

MATH COURSES (7-day free trial)

OTHER RELATED COURSES (7-day free trial)

#chatgpt #deeplearning #machinelearning #bert #gpt
Comments

Would you like to see more videos on Llama? Let me know. Have a wonderful day :)

CodeEmporium

Yes, please do more deep dives into the code! Super valuable video because of that part.

jeswer

The more I watch your videos, the more I understand the subject. This is probably because I can now see it from different angles and perspectives. I now have a better intuition for transformer architectures and can code one from scratch. Thank you.

share

Hey, thanks a lot for your videos. Your video on the transformer paper "Attention Is All You Need" helped me build an intuition back before transformers were really cool. It's lovely to see your video on Llama, as I actively get to fine-tune Llama on a day-to-day basis :) Much love.

pipinstallyp

Great video! Looking forward to a deep dive into the Llama code.

danar

Clear, informative, well presented. Great video!

dollarscholar

Thanks for the great video and a GREAT way of presenting data and showing the code!

steel-r_ua

Amazing work, man. One of my favourite deep learning creators!

naevan

Beautifully explained. Thank you. Yes, I want to know more about its architecture too.

prasadraavi

Would love a deep dive into stuff like LoRA and quantization (the bitsandbytes library) as well. Perhaps doing it from scratch in PyTorch!

aurkom

Thank you for such an insightful video. Would definitely love a deep-dive video on the architecture and code of Llama 2. Could you please also do an implementation of BERT or RoBERTa fine-tuning (with the training process optimized via DeepSpeed)?
Thanks again!!

abhijitnayak

Yes, please: a deep dive into the architecture and a code walkthrough, if possible.
Thanks a lot for the video. May God's blessings be with you.

gopalakrishna

Commenting for the algorithm. Very well explained. You have a talent!

dinoscheidt

Thank you so much for explaining, brother!
It would be really great if you could make a code walkthrough video as well!

YashVerma-iilx

Good explanation with proper understanding!

spydeyftw

I have not implemented the code for a decoder-only model, so I have 3 questions:

1. Does it use the triangular mask? I have heard from two sources that it does, but I don't get it: since we only feed in the inputs and not the outputs (unlike the original transformer), how does a triangular mask on the input data make sense?

2. Why is it called `decoder only`? The architecture seems much closer to the encoder part of the original transformer model than to its decoder part, especially when the mask also seems no different from the original encoder's.

3. Is it autoregressive, or can it still act as an autoencoder and produce the outputs in one pass?
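
For reference, here is a minimal sketch of the triangular (causal) mask being asked about, written as single-head PyTorch attention; the function name, random weights, and sizes are illustrative assumptions, not Llama's actual code:

```python
import torch
import torch.nn.functional as F

def causal_self_attention(x, w_q, w_k, w_v):
    """Single-head self-attention with a triangular (causal) mask.

    x: (seq_len, d_model) input embeddings. The mask lets position i
    attend only to positions <= i, which is what makes a decoder-only
    model autoregressive even though we feed it only the input sequence.
    """
    seq_len, d_model = x.shape
    q, k, v = x @ w_q, x @ w_k, x @ w_v              # each (seq_len, d_model)

    scores = (q @ k.T) / d_model ** 0.5              # (seq_len, seq_len)

    # Lower-triangular boolean mask: True where attention is allowed.
    allowed = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))
    scores = scores.masked_fill(~allowed, float("-inf"))

    weights = F.softmax(scores, dim=-1)              # each row sums to 1 over the past
    return weights @ v

# Tiny usage example with random weights (hypothetical sizes).
seq_len, d_model = 5, 8
x = torch.randn(seq_len, d_model)
w_q, w_k, w_v = (torch.randn(d_model, d_model) for _ in range(3))
print(causal_self_attention(x, w_q, w_k, w_v).shape)  # torch.Size([5, 8])
```

The mask makes sense on the input because, at training time, the targets are just the inputs shifted one position to the left, so each position must be prevented from seeing the very token it is supposed to predict.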

popamaji

Very informative!! Would be sick if you could dive deeper.

dikshyakasaju

Would you be interested in making a guide on fine-tuning Llama 2, or do you think it is oversaturated?

naevan

Please make a video about how the generative capability works and how reinforcement learning is used in language models.

popamaji

Is this a decoder in simplified form, or is it an encoder with a decoder mask?
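
For reference, a minimal sketch of exactly this idea: PyTorch's real nn.TransformerEncoderLayer has the same structure as a decoder-only block (self-attention plus feed-forward, no cross-attention) and behaves causally once you pass it a triangular mask. The sizes below are illustrative assumptions:

```python
import torch
import torch.nn as nn

# Structurally, a decoder-only block is an encoder block run with a
# causal mask: same self-attention + feed-forward, no cross-attention.
layer = nn.TransformerEncoderLayer(d_model=16, nhead=4, batch_first=True)

seq_len = 5
# True above the diagonal = those (future) positions may NOT be attended.
causal_mask = torch.triu(torch.ones(seq_len, seq_len, dtype=torch.bool), diagonal=1)

x = torch.randn(2, seq_len, 16)      # (batch, seq_len, d_model)
y = layer(x, src_mask=causal_mask)   # encoder layer + decoder-style mask
print(y.shape)                       # torch.Size([2, 5, 16])
```

Both views describe the same computation; "decoder-only" refers to the causal mask and autoregressive use, not to the presence of cross-attention.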

popamaji