How to Scrape and Extract Data with Langchain GPT Function Calling

Description
the extractooor

#gpt4 #gpt3 #ai #python #webscraping
Comments

Love the videos! Enjoying the straight-to-the-point and fun commentary. Very honest and very helpful!

fortestingpurposesonly

Just wanted to drop a comment to say thank you for creating and sharing this insightful video on how to use LangChain and ChatGPT for web scraping and data extraction. The step-by-step demonstration using Python, Beautiful Soup, and Playwright was clear and extremely easy to follow.
Keep up the excellent work and I'm looking forward to your future content. Thanks again!

CalmCascade.

Great video. I learn so much just from reading other people's code.

jsfnnyc

I accidentally landed here and now I'm subscribed... you're lit 🔥

SigmaScorpion

I am getting a NotImplementedError when running the async Playwright function. Unable to figure out why.

uiucdsc
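
A minimal sketch of one likely fix, assuming the NotImplementedError above comes from running async Playwright on Windows under a selector event loop (the default inside Jupyter), which cannot spawn the browser subprocess; fetch_html and the example URL are placeholders, not code from the video:

```python
# Not from the video: force the proactor event loop policy on Windows before any
# loop is created, then run as a plain script instead of inside the notebook.
import sys
import asyncio
from playwright.async_api import async_playwright

if sys.platform == "win32":
    # The proactor loop supports subprocesses; the selector loop does not.
    asyncio.set_event_loop_policy(asyncio.WindowsProactorEventLoopPolicy())

async def fetch_html(url: str) -> str:
    async with async_playwright() as p:
        browser = await p.chromium.launch(headless=True)
        page = await browser.new_page()
        await page.goto(url)
        html = await page.content()
        await browser.close()
        return html

if __name__ == "__main__":
    print(len(asyncio.run(fetch_html("https://example.com"))))
```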

Hey, you don't have to re-declare the function in each cell!
Also, I would like to see if generating the schemas can also be done using the OpenAI API.

qnskjyp
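
On the second question, one reading is passing an extraction schema straight to the OpenAI function-calling API instead of going through LangChain. A sketch under that assumption (openai>=1.0 Python SDK; the extract_items function and its fields are illustrative, not the video's schema):

```python
# Not from the video: call the chat completions API directly with a hand-written
# function schema and force the model to return structured arguments.
import json
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

extraction_fn = {
    "name": "extract_items",
    "description": "Extract structured records from raw page text.",
    "parameters": {
        "type": "object",
        "properties": {
            "items": {
                "type": "array",
                "items": {
                    "type": "object",
                    "properties": {
                        "name": {"type": "string"},
                        "summary": {"type": "string"},
                    },
                    "required": ["name"],
                },
            }
        },
        "required": ["items"],
    },
}

def extract(text: str) -> dict:
    response = client.chat.completions.create(
        model="gpt-3.5-turbo",
        messages=[{"role": "user", "content": f"Extract the items from:\n{text}"}],
        functions=[extraction_fn],
        function_call={"name": "extract_items"},  # force the structured output
    )
    return json.loads(response.choices[0].message.function_call.arguments)

print(extract("Acme Widget - a rugged widget for outdoor use."))
```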

I tried using function calling for invoice data extraction, but when the schema content and descriptions got big I noticed a weird regression where GPT returns a weird {text: nonsense} instead of the valid schema. For reference, I was using GPT-3.5 1106.

MohamedJemai-pwgn

How can we use a vector store as input for the LangChain extraction chain?

aadhilimam
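
A hedged sketch of one way to do this, assuming an older LangChain release that still exposes create_extraction_chain, plus faiss installed; the schema and sample texts are illustrative only:

```python
# Not from the video: retrieve relevant chunks from a vector store, then run the
# extraction chain over each chunk's text.
from langchain.chains import create_extraction_chain
from langchain.chat_models import ChatOpenAI
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import FAISS

schema = {
    "properties": {
        "company": {"type": "string"},
        "total": {"type": "string"},
    },
    "required": ["company"],
}

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0)
chain = create_extraction_chain(schema, llm)

# These texts would normally come from your scraped pages or loaded documents.
texts = ["Invoice from Acme Corp, total due $1,200.", "Receipt: Globex, $98.50"]
store = FAISS.from_texts(texts, OpenAIEmbeddings())

# Pull only the chunks relevant to the query, then extract from their content.
docs = store.similarity_search("invoice totals", k=2)
results = [chain.run(doc.page_content) for doc in docs]
print(results)
```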

Rate limit exceeded error from LangChain after several tries. What do you recommend, Tyler?

mertzorlu
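
A sketch of one common mitigation for rate-limit errors: retry with exponential backoff and keep requests small and sequential. tenacity is an extra dependency here, and the schema is illustrative, not the one from the video:

```python
# Not from the video: wrap the extraction call so any rate-limit error triggers
# a wait-and-retry with exponential backoff; max_retries on ChatOpenAI is another knob.
from tenacity import retry, stop_after_attempt, wait_exponential
from langchain.chat_models import ChatOpenAI
from langchain.chains import create_extraction_chain

llm = ChatOpenAI(model="gpt-3.5-turbo", temperature=0, max_retries=6)
schema = {"properties": {"title": {"type": "string"}}}  # illustrative schema
chain = create_extraction_chain(schema, llm)

@retry(wait=wait_exponential(multiplier=1, min=2, max=60), stop=stop_after_attempt(5))
def extract_with_backoff(text: str):
    # Any exception raised inside (including RateLimitError) triggers a retry.
    return chain.run(text)

print(extract_with_backoff("Example page text about an article titled 'Hello'."))
```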

Also, one needs to fiddle with Selenium or Playwright instead of BS4 to navigate to/from pages.

qnskjyp

You could also add an option to save the results to a CSV file.

shivamkumar-qpjm
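
A sketch of that option using only the standard library; the field names, sample records, and file name are illustrative:

```python
# Not from the video: dump the extraction chain's list of dicts to a CSV file.
import csv

extracted = [
    {"name": "Widget A", "price": "$10"},
    {"name": "Widget B", "price": "$12"},
]

with open("extracted.csv", "w", newline="", encoding="utf-8") as f:
    writer = csv.DictWriter(f, fieldnames=["name", "price"])
    writer.writeheader()
    writer.writerows(extracted)
```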

output = await run_player("url") fails for me; don't know why. I even installed asyncio and tried. If I remove the await it does not give correct output. Everything else is fine, so why is this happening?

hccuwwi
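
This usually comes down to where the code runs: top-level await only works where an event loop is already running (Jupyter/IPython), while in a plain script it is a syntax error, and calling the coroutine without await just returns a coroutine object rather than the result. A sketch with run_player stubbed out as a placeholder for the video's async scraping function:

```python
# Not from the video: run_player below is a stand-in for the real async function,
# which would launch Playwright and return page content.
import asyncio

async def run_player(url: str) -> str:
    await asyncio.sleep(0)          # placeholder for real async work
    return f"scraped {url}"

async def main():
    output = await run_player("https://example.com")
    print(output)

if __name__ == "__main__":
    # In a script there is no running event loop, so a bare "await" fails;
    # asyncio.run creates the loop and drives the coroutine to completion.
    asyncio.run(main())
```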

Bro. 😂 I came for copper but I found G O L D.

MarxOrx

Love your videos. Have you thought about making the code available through Google Colab?

nicolasmartinez

Is this a better method than gpt-engineer?

ronm

Any possibility to use an open source LLM to achieve similar results?

whackojaco

So just to be sure I understand this correctly...
- It will only scrape one page at a time, it won't do a full directory (say, a shared folder of Google docs)
- You have to know in advance what information you want; it looks for that specifically and generates output based on your query
- You cannot have it scan a number of pages/documents and *then* ask various questions about the content
- The info that it scrapes is not persistent from one query to the next, much less from one session to the next
- The scraped data is private to you, it does not get fed back into the model

Is that right?

Backstory: I'm an author. I'm looking for a way to feed all my manuscripts and copious notes, timelines, plot outlines, etc. into an LLM and then be able to ask it questions about the content. Sort of a virtual-assistant dynamic story bible that helps me keep all my details straight without having to take time to dig for the info myself. (Like "What color are Karen's eyes?" or "In which book did Joe meet Captain Huffer for the first time?")
I'm thinking GPT4All is my best bet for now, but boy all the demos I've seen of it are horrendously slow. I haven't yet found an online-hosted model that will
1) take that much data (we're talking multiple 100k-word novels, plus notes) and
2) keep it private to me, not feed it back into the model. (If you know of one, please tell! :) )

carriebartkowiak
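
For the backstory question above, a sketch of the usual retrieval pattern: index the manuscripts once in a persistent local vector store, then ask questions against it, so the data survives between sessions. This assumes an older LangChain release plus chromadb installed; book1.txt, the chunk sizes, and the model are placeholders. Note that with OpenAI models the retrieved excerpts are still sent to the API, so fully private use would require a locally hosted model instead:

```python
# Not from the video: build a persistent "story bible" index and query it.
from langchain.chat_models import ChatOpenAI
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import Chroma
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.document_loaders import TextLoader
from langchain.chains import RetrievalQA

# Load and chunk one manuscript (repeat or glob for more files).
docs = TextLoader("book1.txt", encoding="utf-8").load()
chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=150).split_documents(docs)

# persist_directory keeps the index on disk, so it survives between sessions.
store = Chroma.from_documents(chunks, OpenAIEmbeddings(), persist_directory="story_bible_db")

qa = RetrievalQA.from_chain_type(
    llm=ChatOpenAI(model="gpt-3.5-turbo", temperature=0),
    retriever=store.as_retriever(search_kwargs={"k": 4}),
)
print(qa.run("What color are Karen's eyes?"))
```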