Run Llama 3 on CPU using Ollama

Discover how to effortlessly run the new LLaMA 3 language model on a CPU with Ollama, a no-code tool that ensures impressive speeds even on less powerful hardware.
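
Ollama's usual flow is ollama pull llama3 to download the model, then ollama run llama3 for an interactive prompt. For anyone who wants to script the same thing, here is a minimal Python sketch that talks to the local Ollama server's REST API at its default address; the prompt text is just an illustrative placeholder.

    import requests  # plain HTTP client; Ollama exposes a local REST API

    # Ask the locally running Ollama server to generate a completion with the
    # llama3 model on this machine (the model must already be pulled).
    resp = requests.post(
        "http://localhost:11434/api/generate",
        json={
            "model": "llama3",          # pulled beforehand with: ollama pull llama3
            "prompt": "Explain what Ollama does in one sentence.",
            "stream": False,            # return the full answer as one JSON object
        },
        timeout=300,                     # CPU-only generation can take a while
    )
    resp.raise_for_status()
    print(resp.json()["response"])       # the generated text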

Don't forget to like, comment, and subscribe for more tutorials like this!

Join this channel to get access to perks:

To further support the channel, you can contribute via the following methods:

Bitcoin Address: 32zhmo5T9jvu8gJDGW3LTuKBM1KPMHoCsW

#llama3 #llama #ai
Comments

I think you are one of the best channels to learn about AI. Thanks for keeping us up to speed with such fast moving tech!

johnbarros

Please do mention the specs of your machine, or add them to your video description. Thanks for posting your vids.

laalbujhakkar

To those who are getting very delayed responses: Llama 3 usually runs on a GPU or a well-specced CPU.
These LLMs generate text at a speed set by the hardware's compute power, which is why computation power matters for generation regardless of which LLM you use.
So it may be better to call the model through a hosted API instead of downloading it and running it locally.

pragyantiwari
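
For anyone following this suggestion to use a hosted API instead of a local download: most providers expose an OpenAI-compatible endpoint, so a minimal sketch can use the openai Python client. The base URL, environment variable, and model name below are placeholders for whichever provider you pick.

    import os
    from openai import OpenAI  # pip install openai (v1+ client)

    # Point the OpenAI-compatible client at a hosted provider instead of local Ollama.
    # The base_url, env var, and model name are placeholders, not real provider values.
    client = OpenAI(
        base_url="https://api.example-provider.com/v1",   # hypothetical endpoint
        api_key=os.environ["PROVIDER_API_KEY"],           # hypothetical env var
    )

    reply = client.chat.completions.create(
        model="llama3-70b",   # whatever name your provider uses for the model
        messages=[{"role": "user",
                   "content": "Summarize why GPUs speed up LLM inference."}],
    )
    print(reply.choices[0].message.content)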

I have tried the model but it does not respond as quickly as shown in the video. Nevertheless, it keeps responding. Thank you for sharing your knowledge and congratulations.

CesarVegaL

Is there any way to do RAG on a CPU really fast? Maybe not as fast as Groq, but a couple of seconds would be fine. I want to use at most a 3-billion-parameter model, since I don't think it will be fast enough with a 7-billion-parameter model.

Cingku
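
A common way to keep CPU-only RAG within a couple of seconds is to retrieve with a small embedding model and generate with a model under ~3B parameters, both served by a local Ollama instance. The sketch below assumes those models have already been pulled (nomic-embed-text and gemma:2b are used only as examples) and uses a toy in-memory corpus.

    import requests
    import numpy as np

    OLLAMA = "http://localhost:11434"
    EMBED_MODEL = "nomic-embed-text"   # example embedding model, pulled beforehand
    GEN_MODEL = "gemma:2b"             # example small (<3B) generator, pulled beforehand

    def embed(text: str) -> np.ndarray:
        # Ollama's embeddings endpoint returns a single vector for the prompt.
        r = requests.post(f"{OLLAMA}/api/embeddings",
                          json={"model": EMBED_MODEL, "prompt": text}, timeout=120)
        r.raise_for_status()
        return np.array(r.json()["embedding"])

    # Toy corpus; in practice these would be chunks of your documents.
    docs = [
        "Ollama runs large language models locally and exposes a REST API.",
        "Quantized small models are usually fast enough for CPU-only inference.",
        "Groq serves models on custom hardware with very low latency.",
    ]
    doc_vecs = [embed(d) for d in docs]

    def answer(question: str) -> str:
        q = embed(question)
        # Cosine similarity to pick the single most relevant chunk.
        sims = [float(q @ v / (np.linalg.norm(q) * np.linalg.norm(v))) for v in doc_vecs]
        context = docs[int(np.argmax(sims))]
        prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
        r = requests.post(f"{OLLAMA}/api/generate",
                          json={"model": GEN_MODEL, "prompt": prompt, "stream": False},
                          timeout=300)
        r.raise_for_status()
        return r.json()["response"]

    print(answer("How can I run a model locally?"))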

I run summarization in batches of text up to 4K with LangChain, and this model is quite slow on my machine. gemma:2b takes 1/6 of the time to summarize the same amount.
So, while I like Llama 3 for local inference, it is a bit too slow for actual work.
Perhaps if somebody produced a 2B-scale version of it, it would be a competitor to gemma:2b, but it should also be said that Gemma models are made to run on low-spec hardware, and were trained accordingly I think, while Llama 3 is a more general-purpose model.

aldotanca
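
Speed gaps like the one described above are easy to measure on your own machine; this small sketch times the same summarization prompt against two locally pulled Ollama models, with the sample text and model names as placeholders.

    import time
    import requests

    OLLAMA = "http://localhost:11434"
    TEXT = "Paste the passage you want summarized here."   # placeholder text

    def summarize_seconds(model: str, text: str) -> float:
        """Return wall-clock seconds for one summarization call to the local Ollama server."""
        start = time.perf_counter()
        r = requests.post(f"{OLLAMA}/api/generate",
                          json={"model": model,
                                "prompt": f"Summarize the following text:\n{text}",
                                "stream": False},
                          timeout=600)
        r.raise_for_status()
        return time.perf_counter() - start

    for model in ("llama3", "gemma:2b"):   # both must already be pulled
        print(model, f"{summarize_seconds(model, TEXT):.1f} s")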

Can you build a project where the user provides a sentence and a word, and the LLM returns a full dictionary-style lookup for the word in the context of that sentence?

atulanand
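
A project like this is mostly prompt design. Below is a rough sketch of how it could look against a local Ollama model; the model name and the exact prompt wording are assumptions, not anything shown in the video.

    import requests

    def contextual_lookup(sentence: str, word: str, model: str = "llama3") -> str:
        """Ask a local Ollama model for a dictionary-style entry for `word` as used in `sentence`."""
        prompt = (
            f"Sentence: {sentence}\n"
            f"Word: {word}\n\n"
            "Give a dictionary-style entry for the word as it is used in this sentence: "
            "part of speech, the sense that fits the sentence, a short definition, "
            "two example sentences, and common synonyms."
        )
        r = requests.post("http://localhost:11434/api/generate",
                          json={"model": model, "prompt": prompt, "stream": False},
                          timeout=300)
        r.raise_for_status()
        return r.json()["response"]

    print(contextual_lookup("The bank approved the loan yesterday.", "bank"))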

How do we integrate a private PDF into it? I would be very happy if you could make the simplest possible video about creating a chat-with-PDF app using Llama 3 with Ollama on CPU. I went through your previous video but was not able to make it work on Windows.

prestocranius
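
As a starting point for chat-with-PDF on CPU, here is a minimal sketch: it extracts text with the pypdf package and sends the text plus a question to a local Ollama model. The file name and model are placeholders, and a real app would chunk the document and retrieve only the relevant pieces rather than putting the whole text in one prompt.

    import requests
    from pypdf import PdfReader   # pip install pypdf

    # Extract raw text from a local PDF (placeholder file name).
    reader = PdfReader("my_private_document.pdf")
    pdf_text = "\n".join(page.extract_text() or "" for page in reader.pages)

    question = "What is the main conclusion of this document?"

    # Send the document plus the question to the local Ollama chat endpoint.
    r = requests.post(
        "http://localhost:11434/api/chat",
        json={
            "model": "llama3",
            "messages": [
                {"role": "system", "content": "Answer only from the provided document."},
                {"role": "user", "content": f"Document:\n{pdf_text}\n\nQuestion: {question}"},
            ],
            "stream": False,
        },
        timeout=600,
    )
    r.raise_for_status()
    print(r.json()["message"]["content"])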

Sir, we want a video on fine-tuning Gemini Pro 1.5 and on using RAG with it.

velugucharan

What are the full specs of your PC? How many cores does your CPU have?

elikyals

Tell me about the GPU side, bhai; it's really slow on CPU.

marufhoque

I think it's not running locally, bro. I would like to do it, but if I disconnect my laptop from the internet, it stops working.

CarlosGomez-fjbz

Do you think I can run the Llama 3 70B Q8 model if I have 128 GB of RAM and a 3060 with 12 GB of VRAM?

simerosaitora
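
A rough back-of-the-envelope estimate, assuming a bit over one byte per parameter for 8-bit quantization plus a few gigabytes of KV cache and runtime overhead: the 70B weights alone are roughly 70-75 GB, which fits in 128 GB of system RAM but not in 12 GB of VRAM, so most layers would be offloaded to the CPU and generation would be slow.

    # Rough memory estimate for a 70B model at ~8-bit quantization (assumptions, not measurements).
    params = 70e9               # parameter count
    bytes_per_param = 1.06      # Q8_0-style quantization is a bit over 1 byte per weight
    overhead_gb = 4             # KV cache and runtime buffers (rough guess)

    weights_gb = params * bytes_per_param / 1e9
    total_gb = weights_gb + overhead_gb
    print(f"~{weights_gb:.0f} GB weights, ~{total_gb:.0f} GB total")
    print("Fits in 128 GB RAM:", total_gb < 128)   # yes, but CPU-bound
    print("Fits in 12 GB VRAM:", total_gb < 12)    # no, most layers stay on the CPU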

To all CPU users: don't fall for it. It will work, but slowly, and it isn't worth doing. Just use online APIs, Groq, or Colab.

shivpawar

Please, it's not "LAInux" and it's not "LYnux"; it's "LInux".

joseaugustodossantossilva