filmov
tv
Fine-Tuning Llama 2 70B on Consumer Hardware(QLora): A Step-by-Step Guide
Показать описание
In this video, I take you through a detailed tutorial on the recent update to the FineTune LLMs repo. This tutorial covers the process of fine-tuning Llama 70B on consumer-grade hardware. Specifically, I highlight the vital role of recent innovations like QLora and FlashAttention 2 in enabling such fine-tuning.
The tutorial also addresses the challenge of using the pad token ID in fine-tuning LLM models, and I present a neat trick using rare, unused tokens.
Finally, I demonstrate some runs using the model I trained and to answer some prompts, showing successful fine-tuning.
Access the complete video for insights into how I fine-tune LLMs and be sure to check out my other videos on the same. Remember to subscribe, share, and click the notification bell to stay updated!
#FineTuneLLMs #LLAMA70B #FineTuning #SoftwareTutorial #CodeTutorial #ProgrammingTutorial #Python #QLORA #FlashAttention2 #MachineLearning #DataScience #ComputerScience #AI #LanguageModel #NLP.
Timestamps:
00:00 - Intro
00:56 - Summary Of Qlora and Flash Attention
02:02 - Setting Up Software
05:14 - Getting A Dataset
05:47 - Examining The Software
12:37 - Running The Software
13:58 - Software Performance Analysis
15:13 - Training Results And Shared Model
16:16 - Running Instructions On Model
17:30 - Custom Datasets And Models
17:56 - Outro
The tutorial also addresses the challenge of using the pad token ID in fine-tuning LLM models, and I present a neat trick using rare, unused tokens.
Finally, I demonstrate some runs using the model I trained and to answer some prompts, showing successful fine-tuning.
Access the complete video for insights into how I fine-tune LLMs and be sure to check out my other videos on the same. Remember to subscribe, share, and click the notification bell to stay updated!
#FineTuneLLMs #LLAMA70B #FineTuning #SoftwareTutorial #CodeTutorial #ProgrammingTutorial #Python #QLORA #FlashAttention2 #MachineLearning #DataScience #ComputerScience #AI #LanguageModel #NLP.
Timestamps:
00:00 - Intro
00:56 - Summary Of Qlora and Flash Attention
02:02 - Setting Up Software
05:14 - Getting A Dataset
05:47 - Examining The Software
12:37 - Running The Software
13:58 - Software Performance Analysis
15:13 - Training Results And Shared Model
16:16 - Running Instructions On Model
17:30 - Custom Datasets And Models
17:56 - Outro
Комментарии