Graph RAG with Ollama - Save $$$ with Local LLMs

Показать описание

Learn how to set up a local model using OLAMA and the Groq API for Project GraphRAG. We will also look at the limitation of using smaller local open weight models.

LINKS:

💻 RAG Beyond Basics Course:

Let's Connect:

Signup for Newsletter, localgpt:

TIMESTAMPS
00:00 Introduction to Project GraphRAG
00:30 Setting Up OLAMA and Choosing Models
01:14 Configuring GraphRAG with OLAMA
02:32 Using Groq API with GraphRAG
04:35 Running Local Indexing and Entity Extraction
07:15 Testing and Comparing Models
08:15 Importance of LLM Choice in GraphRAG
10:58 Optimizing Prompts for Different LLMs

All Interesting Videos:

Рекомендации по теме

Комментарии

Definitely waiting for fully customisation of graph RAG using open source models

programwithpradhan

Great discussion.

I am tackling things differently, and it seems to work pretty well.

I use ChatGPT to manually construct subject-predicate-target statements from a document. Instructions to infer root names from prepositional references, etc...

I feed this into a graph database.

User-entered text is passed to a local Llama3 instance to construct a graph query - this query is executed against the graph database.

This result set is added to the initial user text and passed to an appropriate LLM.

JohnBoen

Hey! Cool video. I actually built a full local solution using Ollama, no need for LM Studio at all. Here's what I did: I created a proxy that translates between OpenAI API embeddings and Ollama's format, both ways.

The cool thing is, it works flawlessly for both global and local queries. I'd be happy to share the script with you if you're interested!

maxs

ollama embeddings proxy on git hub. This script bridges the gap between OpenAI's embedding API and Ollama, making it compatible with the current version of Graphrag

maxs

Yeah! Prompting is key. Llama3 is very good at step by step instructions. Print this, write that, do this and combine 12:08 that, finally this. The good thing is that larger models are also able to understand this, but most of the time it does not work in the other direction.

MeinDeutschkurs

Hey, great video, A couple questions

1. Can I create the Graph using Llama3-70B and then use a different LLM (which doesn't have a rate limit) to answer RAG queries ?
2. Can I create a partial Graph, and update it as more data comes in (rather than create the Graph all over again) ?

jasmeetsingh

Yea I tried using llama-3 in LM Studio and using OpenAI embeddings since its cheap. GraphRAG detonated (after waiting an hour of course). It seems like it did all the LLM stuff OK and embeddings OK, but at the end when trying to put everything together it just went to crap. Too specific to using OpenAI for everything. Even using 4o it was more expensive than a six pack of beer and I ain't giving that up.

BrandonFoltz

May work with English-only and not even work with vLLM due to buggy parameters !

nguyenanhnguyen

Does this RAG provides better results compared to semantic chunking?

MrAhsan

what about Lamini fine-tuning ? this might just be the best of both worlds. Would be really interesting to see comparisons between traditional RAG (optimized techniques), graphRAG and fine-tuning (lamini).

awakenwithoutcoffee

For embedding model as well can't we use ollama? I see that mixedbread mxbai is available.

shameekm

Thanks for the video, but are you, by chance, sharing the code modifications? I did not see any links.

meelanc

Definitely have to use this with local models. I burned $46.00 in tokens. I waited 10 minutes before I could test the application. Your text files have to be 8-bit clean, or the app will blow up. Graph Rag is not a gift from microsoft.

mrchongnoi

Could you use gemini Flash or Claude Haiku or DeepSpeed V2 to keep costs down??

Sceptic

if my data size is larger(say 1000 pdfs) then the embedding cost will be too high, even if we use local models the time taken is too high right now. what do you think about using gemini pro model for doing that, as google charges no cost up to 300$ for AI projects. Maybe you can suggest your views and make a video on this in future?

takshitmathur

actually, when going with entity extraction, people can extract Noun and adj as a text fiile then make a cluster, using it in struct to send a request to bigger model such as chat GPT , the struct is well defined , accoding 5 topic of science so it always return a value you want to looking and decreasing the complexity . my structure is simple , define , formulate, it s shortcut of which ? ( looking for enpoint ), middle point ( by tracking respond then sending again), .. the structure will depend on user . Let define "Noun", is that Noun using for approximating a thing as a word ? . so when we now Noun and define for that word, we know the relevant topic around it , and because synonym is limited, so just respond over time , it would get the converge information, and luckily , we could use a little trick to get 1000 account for free . but my method is still limit when i dont define math formulate, and other language except english, some define still not unclear enough and fragment information only sense when take big sample .
your method is great too, why have to stick in one idea, the more would be better . thanks

TranKiet-pjmw

Have you tried to use it with an existing neo4j graph ?

brucewayne

So compared to GPTs, his search generation effect will be better?

xinzhang

If Graphrag can’t match the quality of Diffbot while being cheaper, it’s currently not useful.

readmarketings

Doesn't work on PC. I'll try on Mac. I haven't found a video that actually makes it work on PC locally.

lesptitsoiseaux

Graph RAG with Ollama - Save $$$ with Local LLMs

Graph RAG with Ollama - Save $$$ with Local LLMs

GraphRAG with Ollama - Install Local Models for RAG - Easiest Tutorial

GraphRAG Ollama: 100% Local Setup, Keeping your Data Private

Easy GraphRAG with Neo4j Visualisation Locally

GraphRAG Ollama UI - Gradio Interface for Microsoft GraphRAG

Graph RAG: Improving RAG with Knowledge Graphs

Ollama Python Library Released! How to implement Ollama RAG?

RAG with a Neo4j Knowledge Graph: How it Works and How to Set It Up

Rag Evaluation with Milvus, Ragas, Langchain, Ollama, Llama3

Building Corrective RAG from scratch with open-source, local LLMs

How I Build Local AI Agents with LangGraph & Ollama

Realtime Powerful RAG Pipeline using Neo4j(Knowledge Graph Db) and Langchain #rag

Llama3 Full Rag - API with Ollama, LangChain and ChromaDB with Flask API and PDF upload

Enhancing RAG: LlamaIndex and Ollama for On-Premise Data Extraction

Ollama Llama 3 - RAG: How to create a local RAG system with LLAMA 3 using OLLAMA

GraphRAG: LLM-Derived Knowledge Graphs for RAG

ERAG with Ollama - RAG Tool with Lexical, Semantic, Knowledge Graph Searches

Knowledge Graph Construction Demo from raw text using an LLM

Install GraphRAG Locally - Build RAG Pipeline with Local and Global Search

Reliable, fully local RAG agents with LLaMA3

Mistral 7B RAG Tutorial: Build RAG Application Easily

GraphRAG with Llama.cpp Locally with Groq

Unleash the power of Local LLM's with Ollama x AnythingLLM

You Need Better Knowledge Graphs for Your Graph RAG