Building a Multimodal RAG App for Medical Applications

preview_player
Показать описание
In this comprehensive guide, I delve into the fascinating world of multimodal Retrieval Augmented Generation (RAG) applications, tailored for medical use cases. This tutorial is a treasure trove of information for anyone interested in Gen AI and its applications in the healthcare sector.

Throughout this video, I demonstrate how to build a cutting-edge multimodal RAG app using a synergy of advanced technologies. The core of this application is the powerful GPT-4V, a large language model known for its versatility and depth in handling complex tasks. To enhance its capabilities, I integrate an embedding model and a vector database, which collectively work to enrich the app's understanding and response accuracy.

A key feature of this tutorial is the use of FastAPI, a modern, fast web framework for building APIs with Python. I walk you through the nuances of utilizing FastAPI to construct a robust and efficient backend for our application.

Moreover, I showcase the use of the unstructured library, a vital tool for handling tables and images in PDF documents. This functionality is crucial for the medical field, where information is often encapsulated in various formats within documents.

The highlight of this app is its user interaction capability. End users can pose queries, and the app, leveraging its multimodal capabilities, generates precise answers accompanied by relevant images extracted from documents. This feature exemplifies the practical application of Gen AI in providing enhanced user experiences and efficient information retrieval.

This tutorial is not just about building an app; it's about exploring the potential of Gen AI in revolutionizing how we interact with technology, especially in critical sectors like healthcare.

I encourage you to watch, learn, and embark on this journey with me. Your feedback is invaluable, so please feel free to leave your thoughts in the comments section. If you find this video helpful, don't forget to hit the 'Like' button and subscribe to my channel for more content on Gen AI and its transformative applications.

Don't forget to like, comment, and subscribe for more insightful and innovative Gen AI tutorials!"

Join this channel to get access to perks:

#ai #generativeai #doctor
Рекомендации по теме
Комментарии
Автор

The only thing I regret is not finding your YouTube channel earlier. Thank you for sharing your knowledge and putting this together for us. Lol. Don’t be alarmed. If you see me spending hours and hours on your videos, I promise I’m not a weirdo you just have a great valuable information and the way you explain it, is perfect for me. I am a big opponent against black box solutions.

Pure_Science_and_Technology
Автор

Can't wait for the local version, awesome content. i am using LM studio and autogen, looking into langchain integration.

Демократът
Автор

Thank you for the video - I will test it and let you know - your previous dermatologic app was exellent:)

micbab-vgmu
Автор

Thankkk youu sooo muchh!!!
Please get the next llava video soon....need for a presentation tomorrow 🙏🏻

placement
Автор

this was super useful, understood the concept and implementation was smooth. This can definitely help in building our own rag apps

soulfuljourney
Автор

I get the following error while trying to extract elements.

OSError: No such file or directory:

Would anyone have a sense of this? I tried installing nltk but get the same result.

Thanks

Gopikamaturi
Автор

Hi bro i watch your videos regularly i am big fan of your work 💯

JrTech-rwwj
Автор

Loved you video, nice detailed explanation and you don't spare important details. The only issue is that you're a Man Utd fan

MoBahri
Автор

Thanks for working on medical & healthcare modcals highly appreciated bro🎉 keep it up .. please add some UX/Ui design 🙏 explanation to make it easy for us to understand the road map of your work 🎉 highly appreciated 👍 good job ( 👏

UnitedhealthcareCompany
Автор

Can you please provide a ollama-llava version of it? How to use ollama llava instead of gpt 4o as multi modal RAG ? It would be of great help.

shaonlipal
Автор

is there any way we can implement this using complete open source models i don't want to make any API calls?

avinashnair
Автор

Is it the open source version of this uploaded? cant seem to find it

Instinctz_
Автор

Hi, you have not uploaded the LLAVA video

madniali
Автор

Pls share the links of other two videos related to AWS Ec2 deployment and llava use case. Thank You.

AdarshMamidpelliwar
Автор

Part 2 with Llava and deployment tooo
...Taras gyi😭😭😭😭🙏🙏🙏

saumyajaiswal
Автор

it is paid api service from openai, totally wasted, can we do using free apis like huggingface or gemini ???

pradip_patil
Автор

Great Video! Did you have the chance to do the video with LLAVA? Many thanks.

hhbounce
Автор

Thank you for the awesome video. Can you please tell the best approach where we have a multiple pdf chatbot.The Pdfs can have text, images, tables.The answer should contain text, images and tables(or get answers from them) from the pdfs itself wherever required.

saumyajaiswal
Автор

hello sir i got this error :- PDFInfoNotInstalledError: Unable to get page count. Is poppler installed and in PATH? any solution

HiteshSharma-sc
Автор

Great content👍 Can U show also to use fastapi with gradio or streamlit instead an HTML template? Maybe use Langserve (which also use Fastapi)

henkhbit