Chatbot with RAG Using OCI Generative AI Agents

preview_player
Показать описание
In today’s technology landscape, we can tap into research and statistics, pulling in data feeds for analysis and drawing insights to make decisions in real time. However, new information can be hard to parse and contextualize, even for the most robust analytics solutions. This is where retrieval-augmented generation (RAG) is useful, allowing you to augment the knowledge of a large language model without retraining it when new information is available. This updates your model with current knowledge, making it more intelligent, with minimal effort.

Oracle Cloud Infrastructure (OCI) Generative AI Agents allows us to do just that. In this example, we’ll upload our documents, process this data, put it into a vector store (via OCI Search with OpenSearch), create a Redis cluster for caching purposes, and provide users with a way to consume the data through a chatbot.

For the infrastructure, we’ll have the following OCI services present:

• OCI Cache with Redis for caching user-agent interactions (so we can give some context to the model)
• OCI Search with OpenSearch cluster for vector similarity search (vector database) and storing indexes with data
• OCI Compute for connecting to the OpenSearch cluster securely (through OCI private subnet routing)
• OCI Generative AI Agents for communicating and interacting with the data in our cluster
Рекомендации по теме