Code Llama Paper Explained

In this video we dive deep into the research paper behind Code Llama, the new family of large language models for code by Meta AI, created by specializing the Llama 2 model for code.

👍 Please like & subscribe if you enjoy this content.

The Code Llama family contains three types of models: foundation models called Code Llama, Python specialization models called Code Llama - Python, and instruction-following models called Code Llama - Instruct.
We review the Code Llama training pipeline to create each of these models.

We then take a thorough look at the interesting self-instruct method used to fine-tune the Code Llama - Instruct model.
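The self-instruct idea can be sketched roughly as follows: prompt a model to generate a programming question, generate unit tests for it, sample candidate solutions, and keep the first solution that passes the tests. The sketch below stubs out the model calls with canned text (`model_generate` is a hypothetical placeholder, not the paper's actual prompts or API):

```python
def model_generate(prompt):
    # Hypothetical stand-in for an LLM call; a real pipeline would query
    # Llama 2 here. Returns canned text purely for illustration.
    canned = {
        "question": "Write a function add(a, b) that returns a + b.",
        "tests": "assert add(2, 3) == 5\nassert add(-1, 1) == 0",
        "solution": "def add(a, b):\n    return a + b",
    }
    return canned[prompt]

def self_instruct_example():
    """One iteration of a self-instruct-style loop (a sketch, not the
    paper's exact recipe):
      1. ask the model for a programming question,
      2. ask it for unit tests,
      3. sample candidate solutions,
      4. keep the first solution that passes the tests."""
    question = model_generate("question")
    tests = model_generate("tests")
    for _ in range(3):  # sample a few candidate solutions
        solution = model_generate("solution")
        namespace = {}
        try:
            exec(solution, namespace)  # define the candidate function
            exec(tests, namespace)     # run the generated unit tests
        except Exception:
            continue                   # discard failing candidates
        return {"question": question, "solution": solution}
    return None  # no candidate passed the tests

pair = self_instruct_example()
```

Pairs that survive the test filter become instruction-tuning data; failed candidates are simply discarded.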

We also explain how Code Llama supports its useful code infilling capability in addition to code completion.
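Infilling training works by rearranging documents so a left-to-right model learns to predict a missing middle span given both the prefix and the suffix. A minimal sketch of that transformation is below; the sentinel strings are illustrative placeholders, not Code Llama's actual tokenizer tokens:

```python
import random

# Illustrative sentinel markers; the real special tokens differ.
PRE, SUF, MID, EOT = "<PRE>", "<SUF>", "<MID>", "<EOT>"

def to_infilling_example(document, rng=random.Random(0)):
    """Split a document at two random points and emit it in
    prefix-suffix-middle order, so an autoregressive model trained on the
    result learns to generate the middle conditioned on both sides."""
    i, j = sorted(rng.sample(range(len(document) + 1), 2))
    prefix, middle, suffix = document[:i], document[i:j], document[j:]
    return f"{PRE}{prefix}{SUF}{suffix}{MID}{middle}{EOT}"

example = to_infilling_example("def add(a, b):\n    return a + b\n")
```

At inference time the same format lets the model fill a hole in an editor: the user's code before the cursor becomes the prefix, the code after it becomes the suffix, and the model generates the middle.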

Throughout the video we also review several tables and charts from the paper to understand how the models perform compared to other models.

----------------------------------------------------------------------------------
----------------------------------------------------------------------------------
Chapters:
0:00 Introducing Code Llama
0:49 Code Llama Training Pipeline
2:13 Long Context Fine-tuning
4:32 Self-Instruct
6:05 Code Infilling
7:08 Results
Comments

Please keep these coming!!! Thank you for all these papers. Has become a weekly ritual of mine.

jayaraopratik

❤ Very clear explanation as always, great job!

ympeng

I want to know how they vary epoch counts. Like, how do they train some portion of the data for 4 epochs and another part for just 0.1 epochs?

kalilinux
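The paper excerpt above doesn't spell out the mechanism behind fractional epochs, but one common way to realize a mixture like "4 epochs of one dataset, 0.1 epochs of another" is to build a weighted training pool: whole epochs are full copies, and a fractional epoch is a random subsample of that size. This is an assumed sketch, not the paper's confirmed method, and `build_mixture` is a hypothetical helper:

```python
import random

def build_mixture(datasets, epochs, rng=random.Random(0)):
    """Assumed reading of fractional epochs: for each dataset, include
    floor(e) full copies plus a random subsample covering the remaining
    fraction (e - floor(e)), then shuffle the combined pool."""
    pool = []
    for name, examples in datasets.items():
        e = epochs[name]
        pool += examples * int(e)                # whole passes over the data
        k = round((e - int(e)) * len(examples))  # size of the partial pass
        pool += rng.sample(examples, k)          # random subsample, no repeats
    rng.shuffle(pool)
    return pool

mix = build_mixture(
    {"code": list(range(100)), "natural_language": list(range(100, 200))},
    {"code": 4.0, "natural_language": 0.1},
)
# "code" contributes 4 full passes (400 items); "natural_language"
# contributes a 10% subsample (10 items).
```

In streaming setups the same effect is achieved with per-dataset sampling probabilities rather than a materialized pool.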