Easiest Way to Convert a PDF to JSON using LangChain Output Parsers

This video tutorial demonstrates how to convert a PDF to JSON using LangChain's Output Parsers and GPT.

A task like this used to be complicated, but it can now be done in a few minutes thanks to advances in AI and LLMs.

--
Timeline:

00:00 Introduction
01:24 Setting up a virtual environment
02:03 Installing LangChain
02:19 Importing libraries and ingesting PDF
04:13 Defining Pydantic object with fields to extract
06:26 Using JsonOutputParser with the Pydantic object
07:03 Creating a PromptTemplate
09:28 Importing ChatOpenAI and putting everything together
10:33 Creating and invoking the chain
12:33 Running the application
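
The steps in the timeline can be sketched roughly as follows. This is a minimal sketch, not the code from the video: the `DocumentInfo` fields, the prompt wording, and the `gpt-4o-mini` model name are hypothetical placeholders. It assumes recent versions of `langchain-core`, `langchain-community`, `langchain-openai`, `pydantic` and `pypdf` are installed, and that `OPENAI_API_KEY` is set in the environment.

```python
import json
import sys

# Prompt wording is an assumption; the parser fills in {format_instructions}.
PROMPT_TEMPLATE = (
    "Extract the requested fields from the document below.\n"
    "{format_instructions}\n\n"
    "Document:\n{document}\n"
)


def build_chain(model_name: str = "gpt-4o-mini"):
    """Build a prompt | model | parser chain that returns a Python dict."""
    # Third-party imports are local so this file loads without them installed.
    from langchain_core.output_parsers import JsonOutputParser
    from langchain_core.prompts import PromptTemplate
    from langchain_openai import ChatOpenAI
    from pydantic import BaseModel, Field

    # Hypothetical fields: replace with whatever you want to pull from the PDF.
    class DocumentInfo(BaseModel):
        title: str = Field(description="Title of the document")
        author: str = Field(description="Author of the document, if stated")
        summary: str = Field(description="Two-sentence summary of the content")

    # The parser derives JSON format instructions from the Pydantic schema.
    parser = JsonOutputParser(pydantic_object=DocumentInfo)
    prompt = PromptTemplate(
        template=PROMPT_TEMPLATE,
        input_variables=["document"],
        partial_variables={
            "format_instructions": parser.get_format_instructions()
        },
    )
    llm = ChatOpenAI(model=model_name, temperature=0)
    return prompt | llm | parser


def pdf_text(path: str) -> str:
    """Load a PDF and join the text of all pages into one string."""
    from langchain_community.document_loaders import PyPDFLoader

    pages = PyPDFLoader(path).load()
    return "\n".join(page.page_content for page in pages)


def pdf_to_json(path: str) -> dict:
    """Run the whole pipeline: PDF text in, parsed JSON (as a dict) out."""
    return build_chain().invoke({"document": pdf_text(path)})


if __name__ == "__main__" and len(sys.argv) > 1:
    print(json.dumps(pdf_to_json(sys.argv[1]), indent=2))
```

Saved as, say, `pdf_to_json.py` (a hypothetical filename), it would be run with `python pdf_to_json.py your_file.pdf`. The whole PDF text is sent in a single prompt here, so a very long document may exceed the model's context window.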
Comments

This is the only video on PDF to JSON extraction. Thank you.

shankar

All the best Jeff, a stranger coming from reddit to support you!

vshalson

Great insights, thanks for sharing. Excited to see what you have in store for us with your upcoming videos. Keep up the awesome work. Cheers!

CFabzdigital

Love this. Thank you Jeff. Very clear.

danielscottpoynter

Simple and easy to follow, thank you. I do have a question, though: is there a PDF size limit for parsing and processing the data?

SarmadBatt

How do you extract multiple entities from a large PDF? And how do you share examples while extracting?

shankar

Hi, is there any way I can do this without using an OpenAI LLM?

rc

GPT is great, but extraktAI is better tho imo

Payton-Prescott