Finetuning Open-Source LLMs


This video offers a quick dive into the world of finetuning Large Language Models (LLMs). It covers:

- common usage scenarios for pretrained LLMs
- parameter-efficient finetuning
- a hands-on guide to using the Lit-GPT open-source repository for LLM finetuning
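To illustrate the parameter-efficient finetuning idea listed above, here is a minimal NumPy sketch of a LoRA-style low-rank update. All sizes and names here are hypothetical (real implementations such as Lit-GPT use PyTorch modules); the point is only that the large pretrained weight stays frozen while two small matrices are trained:

```python
import numpy as np

rng = np.random.default_rng(0)

d_in, d_out, r = 16, 16, 4  # hypothetical layer size and LoRA rank

# Frozen pretrained weight: never updated during finetuning.
W = rng.normal(size=(d_out, d_in))

# LoRA adapters: A starts small and random, B starts at zero,
# so the initial update B @ A is zero and training begins from
# the unmodified pretrained model.
A = rng.normal(scale=0.01, size=(r, d_in))
B = np.zeros((d_out, r))
alpha = 8.0  # common scaling hyperparameter

def lora_forward(x):
    """Forward pass with the low-rank update merged into W."""
    return x @ (W + (alpha / r) * (B @ A)).T

x = rng.normal(size=(2, d_in))
# Before any training, the adapted model matches the frozen one.
assert np.allclose(lora_forward(x), x @ W.T)
```

Only `A` and `B` (here 2 * 4 * 16 = 128 values) would receive gradients, versus 256 values in `W`; at realistic model sizes this gap is what makes the approach parameter-efficient.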

#FineTuning #LargeLanguageModels #LLMs #OpenAI #DeepLearning

Useful links to resources discussed in this video:


Comments

Thanks for sharing, especially about Lit-GPT (I'm always interested in more tutorials as my journey with fine-tuning and LLMs needs all the help it can get). Thanks again.

kenchang

Very much appreciate this video. Fine-tuning seemed like a somewhat amorphous concept to me for some time, but the diagrams you showed really made it easier to understand how people finetune.

Dom-zyqy

I recently listened to your latest videos. And now this one was recommended by Perplexity for my specific use case ;-) Coincidence?

mulderbm

One of the approaches I have experimented with, which is manual-labor-, time-, and compute-intensive but more reliable, is as follows:
- Use an LLM to query for outputs. Use RAG and prompt engineering to get the best possible results.
- Generate chat logs for each query. The log should include everything: the prompt, the retrieved info (if any), and the model output. Any special symbols, such as those denoting the system prompt, should also be left in, because LLMs are text-generation models with no concept of chat.
- Manually update the model outputs to better reflect the expected output. This is a data-creation task.
- Fine-tune a copy of the same LLM with PEFT on the updated chat logs.

This can also be done iteratively, as long as the chat logs are initially generated by a model which hasn't been fine-tuned yet, like a sort of A/B experiment: some use cases are served by the original model, which generates the data for fine-tuning, while the others are served by the fine-tuned model, whose outputs are not used for any further fine-tuning.
Expensive, but over time your model works better for realistic inputs.
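The data-creation step described above can be sketched as follows. This is only an illustration of the idea, not any particular library's API; the field names (`raw_prompt`, `retrieved_context`, `corrected_output`) are hypothetical. Everything the model originally saw, including retrieval context and special formatting tokens, becomes the input, and the manually corrected text becomes the target:

```python
import json

def build_finetune_record(log):
    """Turn one corrected chat log into a supervised finetuning example."""
    # Keep the full raw text the model saw, including retrieved
    # context and any special tokens, since the model has no notion
    # of "chat" beyond the literal text it was given.
    prompt = log["raw_prompt"]
    if log.get("retrieved_context"):
        prompt += "\n" + log["retrieved_context"]
    return {
        "input": prompt,
        # The target is the human-corrected output, not the model's.
        "output": log["corrected_output"],
    }

logs = [
    {
        "raw_prompt": "<|system|>You are a helpful assistant.<|user|>What is LoRA?",
        "retrieved_context": "[doc] LoRA adds low-rank adapter matrices...",
        "corrected_output": "LoRA finetunes a model by training small low-rank adapters.",
    }
]

dataset = [build_finetune_record(entry) for entry in logs]
# Serialize as JSONL, a common input format for finetuning scripts.
jsonl = "\n".join(json.dumps(rec) for rec in dataset)
```

The resulting records could then be fed to a PEFT finetuning run; keeping the corrected outputs out of later training rounds, as the comment suggests, avoids the model reinforcing its own mistakes.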

MayurGarg

Thanks for the video, very helpful for understanding the different kinds of finetuning. BTW, which kind of finetuning does Hugging Face use?

zjffdu

Nice to see you here on YT! Hope you remember me!

muhammadanas

I really wish people would stop putting their X link and start sharing something like Mastodon or Threads. As a free user, X is where you go to feel like a second-class citizen.

PtYt