Llama 2 in LangChain — FIRST Open Source Conversational Agent!

Llama 2 is the best-performing open-source Large Language Model (LLM) to date. In this video, we discover how to use the 70B parameter model fine-tuned for chat (Llama 2 70B Chat) using Hugging Face transformers and LangChain. We will see how to apply Llama 2 as a conversational agent within LangChain.
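
For orientation, here is a minimal sketch of the overall flow, assuming access to the gated meta-llama/Llama-2-70b-chat-hf weights on Hugging Face; the model ID and generation parameters are illustrative, not necessarily the video's exact values:

```python
import transformers
from langchain.llms import HuggingFacePipeline
from langchain.chains import ConversationChain

model_id = "meta-llama/Llama-2-70b-chat-hf"

tokenizer = transformers.AutoTokenizer.from_pretrained(model_id)
# In practice the 70B model needs quantization to fit in GPU memory;
# see the sketch under the chapter list below.
model = transformers.AutoModelForCausalLM.from_pretrained(
    model_id,
    device_map="auto",  # spread layers across the available GPUs
)

generate = transformers.pipeline(
    task="text-generation",
    model=model,
    tokenizer=tokenizer,
    do_sample=True,
    temperature=0.1,
    max_new_tokens=512,
    repetition_penalty=1.1,
)

# Wrap the pipeline so LangChain chains and agents can call it like any LLM
llm = HuggingFacePipeline(pipeline=generate)
chain = ConversationChain(llm=llm)
print(chain.run("Explain quantization in one sentence."))
```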

📌 Code Notebook

🌲 Subscribe for Latest Articles and Videos:

👋🏼 AI Consulting:

👾 Discord:

00:00 Llama 2 Model
02:55 Getting Access to Llama 2
06:12 Initializing Llama 2 70B with Hugging Face
08:17 Quantization and GPU Memory Requirements
11:14 Loading Llama 2
13:05 Stopping Criteria
15:17 Initializing Text Generation Pipeline
16:25 Loading Llama 2 in LangChain
17:08 Creating Llama 2 Conversational Agent
19:46 Prompt Engineering with Llama 2 Chat
22:16 Llama 2 Conversational Agent
24:14 Future of Open Source LLMs
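
The 08:17 and 13:05 chapters are the fiddly parts, so here is a rough sketch of both; the stop strings are illustrative and the exact config in the video may differ. On memory: 70B parameters at 2 bytes each (fp16) is roughly 140 GB of weights, while 4-bit quantization cuts that to roughly 35 GB.

```python
import torch
import transformers

model_id = "meta-llama/Llama-2-70b-chat-hf"

# 4-bit quantization via bitsandbytes
bnb_config = transformers.BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",              # normalized-float 4-bit
    bnb_4bit_use_double_quant=True,         # also quantize the quantization constants
    bnb_4bit_compute_dtype=torch.bfloat16,  # dequantize to bf16 for the matmuls
)

tokenizer = transformers.AutoTokenizer.from_pretrained(model_id)
model = transformers.AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",
)

# Custom stopping criteria: halt generation when the model emits a string
# that marks the end of its turn (these stop strings are illustrative).
stop_list = ["\nHuman:", "\nUser:"]
stop_token_ids = [
    torch.LongTensor(tokenizer(s, add_special_tokens=False)["input_ids"]).to(model.device)
    for s in stop_list
]

class StopOnTokens(transformers.StoppingCriteria):
    def __call__(self, input_ids, scores, **kwargs) -> bool:
        # True if the most recent tokens match any of the stop sequences
        return any(
            torch.eq(input_ids[0, -len(ids):], ids).all()
            for ids in stop_token_ids
        )

stopping = transformers.StoppingCriteriaList([StopOnTokens()])
# Pass to the text-generation pipeline via stopping_criteria=stopping
```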

#artificialintelligence #nlp #opensource #huggingface #langchain
Comments

Amazing work! I like how you break down all the nuances, including memory usage. Great job, James!

CMAZZONI

Playing with the 13B-Chat version, I've found that with careful prompting it reliably outputs useful JSON. I haven't had time to stress-test it, but I'm super impressed compared to all the other models I've tried. Nothing else has come close to showing this much usefulness out of the box.
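
For anyone curious what that kind of prompting looks like, here is a hypothetical example using Llama 2's [INST]/<<SYS>> chat format; the system prompt and JSON schema are made up for illustration, not taken from the comment:

```python
# Hypothetical prompt nudging Llama-2-13B-Chat toward strict JSON output.
prompt = """<s>[INST] <<SYS>>
You are an assistant that replies ONLY with a single JSON object, no prose.
Use exactly these keys: "answer" (string) and "confidence" (number 0-1).
<</SYS>>

What is the capital of France? [/INST]"""

# Expected completion if the prompting holds:
# {"answer": "Paris", "confidence": 0.98}
```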

jolieriskin
Автор

I tried it, and it works with the 13B model too, which runs on the free Colab tier. The prompt engineering is great!

LeoAr

I love how informative you are about memory utilization and the various parameters. It's great to have your perspective on the engineering trade-offs.

goobtron

James, your content is fantastic! I'd love to see a video implementing FlashAttention-2 (with Llama or another model) to get a larger usable context window!

tfgidhr

Cheers, James. Nice vid. I’ve really been enjoying this model recently. The future is looking exciting!

bigpickles

Halfway through the video I subscribed to this channel. I love the way you simplified the details.

BestowTechs

You, sir, are a lifesaver. I almost gave up on LLMs because I couldn't find a single coherent tutorial about interfacing an LLM with an external environment that wasn't marketing BS. And then the YouTube gods put this in my recommendations.

This is amazing. I was trying to figure out how to do this from scratch for about a week straight, and I managed to teach my "character" to say certain tags like [[time]] instead of replying with an arbitrary made-up date/time. I got stuck at that point, thinking there was something special about how commercial AIs do it, but you have just confirmed my intuition and restored my hopes :)

Correct me if I got it wrong, but it seems LangChain is basically just a library with a JSON interface, vaguely related to AI, and the actual "connection" between the model and LangChain is done by teaching the model to "speak" in JSON and intercepting/redirecting the output?
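
That intuition is roughly right for the conversational agent shown in the video: the model is prompted to reply with a JSON blob naming an "action" and an "action_input", and the framework parses that blob and routes it to a tool or back to the user. A stripped-down illustration of the loop, not LangChain's actual implementation:

```python
import json

# Stripped-down illustration of the "speak JSON, intercept, redirect" idea;
# LangChain's real agent executor is more involved, but this is the gist.
def run_agent_step(model_output: str, tools: dict) -> str:
    blob = json.loads(model_output)       # the model was prompted to emit JSON
    if blob["action"] == "Final Answer":  # the agent replies to the user directly
        return blob["action_input"]
    # otherwise redirect the output to the named tool
    return tools[blob["action"]](blob["action_input"])

tools = {"Calculator": lambda expr: str(eval(expr))}  # toy tool for the demo
print(run_agent_step('{"action": "Calculator", "action_input": "2 + 2"}', tools))
# -> 4; the tool result is then fed back to the model as an observation
```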

staviq

Awesome, thanks. Great explanation too.

Sulayman.

Thanks, man. I was considering trying out Llama for an agent use case, and you pushed me over the edge. Cheers!

traviskassab

Rock-solid goodness right there! James, thanks for taking the time to spread the knowledge.

creatorsgear

Thanks, James. I deeply appreciate your tutorials! Keep them coming.

gkennedy_aiforsocialbenefit

The first thing I try is a simple coding task: finding duplicate files (by their content) under a directory. So far only GPT-4 can complete this task. GPT-3.5, Claude 2, and WizardLM-WizardCoder-1.0-GGML (8-bit) come close. All the other LLMs produce useless output; not even Bard can do it. It looks like the only open-source model that can do coding and technical reasoning is still WizardCoder. I'm waiting for the next WizardCoder version to come out, or for a Llama 2 fine-tuned for coding and technical reasoning tasks. That would be a really good open-source LLM, finally usable for offline work.
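
For reference, the benchmark task itself is a short script; a minimal sketch in Python that groups files under a directory by a hash of their contents:

```python
import hashlib
from collections import defaultdict
from pathlib import Path

def find_duplicates(root: str) -> list[list[Path]]:
    """Group files under `root` whose contents hash identically."""
    by_hash = defaultdict(list)
    for path in Path(root).rglob("*"):
        if path.is_file():
            digest = hashlib.sha256(path.read_bytes()).hexdigest()
            by_hash[digest].append(path)
    return [paths for paths in by_hash.values() if len(paths) > 1]

for group in find_duplicates("."):
    print(" == ".join(map(str, group)))
```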

ViktorFerenczi

Thanks! It was very informative and on point.

spongebobsquarepants

Thanks for explaining the quantization; I understood the logic, but it was great to see it put into code with an example ^^

TeamUpWithAI

Would be great if you could make a video about hosting Llama 2 13B and 70B for production. It's easy to use the free Inference API from HF, but there are very few resources out there covering the actual costs of running this, the trade-offs of VMs with different specs (e.g. generation speed in tokens per second), etc. Great videos, James, thank you!

dylanramirez

Nice work, and a great example for me to build an application from! Thanks so much.

Bamboo_gong

I found this video really compelling. I believe it would be incredibly fascinating to leverage a CSV connection to answer data-specific questions. It reminds me of an article I read titled 'Talk To Your CSV: How To Visualize Your Data With Langchain And Streamlit'.

simonmoyajimenez

Just subscribed. I enjoyed the content a lot; I liked that you clearly understand where people will have questions, and you answer them just as clearly.

scottmiller

Love your content, James. I used MPT-30B for RetrievalQA. I just have one question: how did you get the “Explain Text” field when you select certain text? It seems very convenient and resourceful.

kunal