How to run a local AI chatbot on Windows in 5 min, no cuts, no edits, with Ollama, LM Studio, OpenAI

Running a local AI chatbot on Windows in just 5 minutes, no cuts, no edits, with Ollama and LM Studio.
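
Both Ollama and LM Studio expose an OpenAI-compatible HTTP server on localhost, so the standard openai Python package can talk to a local model with only the base URL changed. A minimal sketch, assuming Ollama is running and a model has already been pulled (the model name below is an example):

```python
# Minimal chat against a local model via Ollama's OpenAI-compatible endpoint.
# Assumes `ollama serve` is running and a model has been pulled, e.g.:
#   ollama pull llama3
from openai import OpenAI

# Ollama listens on port 11434 by default; LM Studio's local server uses 1234.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

response = client.chat.completions.create(
    model="llama3",  # assumption: use whatever model name you pulled
    messages=[{"role": "user", "content": "Why is the sky blue?"}],
)
print(response.choices[0].message.content)
```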

Comments

I like your experiments Scott... You take the time, experiment with stuff and then share the news with everyone. These local model experiments are also very cool!

CodingAdventures

Nice video, Scott. I have both Ollama and LM Studio running on my laptop. An integrated graphics card is more than enough for 7B-parameter models smaller than 4 GB, in my experience. The output is not that fast, but it's still faster than I can read.
A PC with an NVIDIA GPU, for instance, is definitely faster, but it is doable on a normal laptop.

PinheiroJaime
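
For the small-model setups described above, Ollama's native REST API is an alternative to the OpenAI-compatible endpoint. A minimal sketch, assuming a small quantized model (the name below is an example) has been pulled first:

```python
# Calling Ollama's native REST API directly, no client library required.
# Assumes a small quantized model has been pulled first, e.g. `ollama pull phi`.
import requests

resp = requests.post(
    "http://localhost:11434/api/generate",
    json={
        "model": "phi",  # assumption: any small model you have pulled
        "prompt": "In one sentence, why do quantized 7B models fit in ~4 GB?",
        "stream": False,  # return a single JSON object instead of a stream
    },
    timeout=300,
)
print(resp.json()["response"])
```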

One of the great things about the Llama ecosystem is that you can actually run it without a graphics card if you don't have latency requirements. In LM Studio, if you uncheck "GPU Offload" in the right pane, it will just use the CPU and RAM. I was running Mixtral 8x7B Q5_K_M (32 GB) with a GTX 1070 FTW, 128 GB RAM @ 3200 MHz, and an i7-8700K, and it actually ran faster without the GPU enabled (like 3 vs 3.5 tokens/second).

It might be interesting to go over AutoGen too. My planned use case is a couple of bots processing tasks together while I do other work, so slow token generation is totally fine.

Frostbain
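
To reproduce a tokens-per-second comparison like the one above, you can stream a completion and time the chunks. A rough sketch against LM Studio's local server, assuming it is running on its default port; chunk counts only approximate token counts:

```python
# Rough tokens-per-second estimate against LM Studio's local server.
# Assumes the server is started in LM Studio (default port 1234).
import time

from openai import OpenAI

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")

start = time.time()
chunks = 0
stream = client.chat.completions.create(
    model="local-model",  # LM Studio serves whichever model is loaded
    messages=[{"role": "user", "content": "Write a short paragraph about VCRs."}],
    stream=True,
)
for chunk in stream:
    # Each streamed chunk carries roughly one token of output.
    if chunk.choices and chunk.choices[0].delta.content:
        chunks += 1
elapsed = time.time() - start
print(f"~{chunks / elapsed:.1f} tokens/second")
```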

The "uncensored" models seem to give the best results. No bias training applied and not refusing to give factual results.

webluke

Wow, I didn't know Scott Hanselman had a YT channel... instant subscribe.

jazzweather

I think you explain this technique so well because you have exceptional knowledge of Windows hardware and software, plus Linux skills.
I immediately understood the connections and how to work through this independently. If possible, more AI hardware and compute tips for running various LLMs locally would be great.
Thank you for the explanation.

jayhu

Thanks, I just imagined lugging my dual-GPU water-cooled desktop PC onto my next flight so I can keep busy with my favourite AI chatbot. Hope I can get it through security!

rabidtommy

The audio could use a little boost. Thanks for the vid!

mtranchi

Thanks for the tutorial, Scott! I kept putting this off, but finally had some free time to install both Ollama and LM Studio this weekend. My aging computer struggles with it, but it's still workable. I guess it's a good excuse to buy a new one 🤣🤣

JosuaBatubara

"Eject a tape from a VCR, if you're old." You crack me up!

Siderite

Thank you, Scott! Would love to see more videos related to running models locally.

DiegoAguilera

Very helpful video, thank you! 'Hallucinate' is definitely a more fun term to describe weird model responses than 'not grounded in reality' ;)

zandorachan

If anyone stumbles on this with a more recent version of LM Studio, the GPU Acceleration option is now inside the Advanced Settings section in the right panel. Instead of typing in "-1" you can just click the [max] button.

rudyMents

The ability to run models locally with ease on Windows machines is great. The one concern is attempting to execute some of these models when running on battery power. Plan on your battery dying quickly.

zbart

This is sooo cool. Makes me want to get a machine with a dedicated gpu.

dlonicholas

Good video, perfect for when your security team has locked down the Azure OpenAI resource group and no one can do prototypes.

coderider

I'm running LM Studio on a $200 laptop with 32 GB of RAM and a crappy integrated video card. It runs. It takes 20 seconds to answer... but it can be done. I'm not doing anything special, just LM Studio out of the box. A GPU is better, no doubt.

EricRohlfs

Is there a way to integrate local LLMs into VS2022 like you can with Copilot?

TheVideoGameVault

This is so insanely accessible, I had no idea!

darkenaxe

Great job showing how easy it can be. Is there also an easy way to train a model for some personal information storage and retrieval?

ryanoc
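
On the personal storage-and-retrieval question above: the usual approach is retrieval-augmented generation rather than training. You embed your notes once, find the ones closest to each question, and include them in the prompt. A minimal sketch using Ollama, assuming an embedding model (nomic-embed-text here, as an example) and a chat model have both been pulled:

```python
# Minimal retrieval-augmented lookup over personal notes with Ollama.
# Assumes an embedding model and a chat model have been pulled, e.g.:
#   ollama pull nomic-embed-text
#   ollama pull llama3
import requests

OLLAMA = "http://localhost:11434"

def embed(text: str) -> list[float]:
    # Ollama's embeddings endpoint returns one vector per prompt.
    r = requests.post(f"{OLLAMA}/api/embeddings",
                      json={"model": "nomic-embed-text", "prompt": text})
    return r.json()["embedding"]

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = (sum(x * x for x in a) ** 0.5) * (sum(y * y for y in b) ** 0.5)
    return dot / norm

# The "personal information storage" is just embedded text snippets.
notes = [
    "The router admin password is on a sticky note in the desk drawer.",
    "Car insurance renews every March through Acme Insurance.",
]
vectors = [embed(n) for n in notes]

question = "When does my car insurance renew?"
qvec = embed(question)
best = max(range(len(notes)), key=lambda i: cosine(qvec, vectors[i]))

# Feed the best-matching note to the chat model as context.
r = requests.post(f"{OLLAMA}/api/generate", json={
    "model": "llama3",  # assumption: substitute your chat model
    "prompt": f"Context: {notes[best]}\n\nQuestion: {question}\nAnswer briefly.",
    "stream": False,
})
print(r.json()["response"])
```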