GPT-4 Vision API + Puppeteer = Easy Web Scraping

Показать описание

In today's video I do some experimentation with the new GPT-4 Vision API and try to scrape information from web pages using it.

00:00 Intro
01:04 Basic usage of GPT-4 Vision API
05:50 Test GPT-4 Vision with image from Unsplash
07:23 Taking a screenshot with Puppeteer
12:35 Test GPT-4 Vision with Wikipedia screenshot
18:14 Test GPT-4 Vision with Google weather info
19:29 Automating URL generation + screenshot taking
33:24 Handling timeouts and retries and making it conversational
44:30 Summarizing BBC news
45:33 Fixing slow loading pages
49:18 Asking for weather information
50:24 Tweaking system message
54:03 Asking for Tesla stock price
56:00 Outro

Unconventional Coding

Рекомендации по теме

Комментарии

Didn’t expect a coding video to be this entertaining. Love the frank display of your thought process.

Lewis

Your tutorial helps with the excitement and anxiety as a fellow dev. I knew I could do this myself but keep procrastinating and eventually some tasks end up as a mental block in WFH mode. Just forcing myself to watch a fella do something like this really helps, thank you!

arclife

I love how much of the process of programming he includes in the demo

dustinsoodak

A fabulous video that has been of great help in orienting our new collaborators. Your generosity is highly valued!

Autoscraping

Its interesting that this is exactly what I was looking for. Llast night i spent a few hours asking copilot how to implement the same libraries. Thanks for the tutorial

PostMeridianLyf

I just wanted to tell you that you are doing great and I really like your format.

Salfie

This guy has superpowers. He can talk and code at the same time!

mooktakim

This was super cool! Don't mind the long format at all. Would love to see you evolve this concept in another video.

fuba

Use the retry library and set a low timeout; you can use a simple decorator. If the timeout needs to be high and this isn't very pleasant, consider running multiple requests concurrently and waiting only on the first result.

thecount

Legend has it, he’s still trying to find out what the weather is like in Alaska…

gaming_for_sanity

This is so cool and nerdy! Maybe the best site to follow and learn more and more on OpenAI API. Difficult but entertaining to follow.

reunac

This is awesome. I love your videos. Please keep these videos going specially this one. I learned so much

cutecute

So for cookies you just need to know what cookie is being set, in many cases it’s likely just a matter of causing the same effect in puppeteer, one way is to add to the cookie store directly (I’m sure puppeteer has a way to do this), and an alternative is specifying a “user directory” for puppeteer so you can actually agree to things like cookies, in many ways consent popups are easy to “locate” using standard html locators simply because it is often set to a priority load event and is often a div/container with a name/id containing the word consent or cookie etc, so regex can be used to find these reasonably easy. Use puppeteer to locate the “Ok” button and click it and then having that reusable user directory means you only check for any site if you have or haven’t accepted consent, if not click it if so just scrape it

grant_vine

Really appreciate your information and style. Learning much!

gmichael

Great video dude. Im gonna rewatch later. I got a project this might help on.

ScootLogix

Seriously impressive. I'm a NodeJS API engineer and you're writing that JS code faster than me!

robbennett

I'd like to see a video from you about navigating websites with Puppeteer. Now that you ask, I'd like a tutorial on how it follows links, fills out data, crawls four or more links deep into a website, how to handle session cookies, automate and run loops, etc. :-)

digitalcivilulydighed

very interesting, thanks for sharing!

albertwang

Great Video! Can these libraries handle auth like azure oauth flow in order to browse to the page?

yoyartube

i was wondering how this is different from the web-search capapblilty of chatgpt-plus right now .
in other words, if i asked gpt to look for an answer on the web will it struggle to do so ?,
is this a hack way to use a better websearch via an api like method because it's not enabled yet in the openai dev tools .
any way i really like the video, can we use selenuim to do so also ?

mohamedbasueny

GPT-4 Vision API + Puppeteer = Easy Web Scraping

Web Scraping with GPT-4 Vision AI + Puppeteer is Mind-Blowingly EASY!

GPT-4 Vision API + Puppeteer = Easy Web Scraping

GPT-4-Vision and Puppeteer

GPT4V + Puppeteer = AI agent browse web like human? 🤖

GPT-4 Vision API :10 NEW MINDBLOWING Abilities + Examples

WEB SCRAPPING Using CHATGPT | How To Use GPT 4 Vision API To Automate Web Scrapping | Simplilearn

NEW GPT-4o Vision API: Best Way to Copy Text from Image (OCR in Python)

GPT-4 Vision API+Puppeteer=Easy Web Scraping#airevolution #technology #artificialintelligence #ai

Generate Apps from Sketches or Screenshots with OpenAI GPT-4 Vision API (6 mins quick demo)

GPT-4 Vision Browsing Part 2: Following links with Puppeteer

How to Build GPT-4 Vision AI Agents with AutoGen

Industrial-scale Web Scraping with AI & Proxy Networks

OpenAI's Vision API is a game changer

5 Use Cases for GPT-4 Vision API (and DALL-E 3)

Will AI Kill Traditional Web Scraping? (GPT4V + Mistral Medium Project)

i made gpt4 browse before chatgpt

Using GPT-4-Vision in a RAG Chat App

Web Scraping with ChatGPT is mind blowing 🤯

How To Use ChatGPT To Fully Automate Web Scraping

This GPT-4o Automation Changes Everything

Simple question 👀

Scrape any website with OpenAI Functions & LangChain

Tech Takeover: When Computers Start To Control Themselves with GPT4-V and Playwright!

OpenAI Secrets Unveiled: What You Need to Know About Function Calling