Build a Web Scraper API with Puppeteer

preview_player
Показать описание


Ethical scraping encouraged!

🧰 Resources

#colbyfayock #webscraping #puppeteer #nextjs #api #webdevelopment
Рекомендации по теме
Комментарии
Автор

Oh man! Thank you so much for this video!! I was struggling all day to get puppeteer running with nextjs, hetzner hosting and coolify! But with help of your video it works now without problems!

ThomasWapps
Автор

Hi, I have used it in different way, but this video is very useful for me

THATIPAMULAVENKATESH-lpwb
Автор

At 2:21 you can instead create a file named “api/scraper/…” and it will create the folders for you.

Also, running puppeteer on a serverless env seems kinda inefficient. Fetching that big S3 file, starting up the browser, then closing it again, etc. I would probably expose it as a separate service.

Anyways, great video! Was a pleasure to watch!

bthero
Автор

Hi colby, thank you for this tutorial, this was super helpful as I was stuck with the executable path issue on production, I tried the hosting the tar file to s3 bucket and using the path and it worked, but now It throws this error ```/tmp/chromium: error while loading shared libraries: libnss3.so: cannot open shared object file: No such file or directory``` in netlify serverless functions. would you know any solution or point me to the right resource to get past this issue? many thanks!

techhguy
Автор

Does this implementation work within Netlify?

RagnarSwanson
Автор

Puppeteer guys if selenium gotcha mad for hours MAN this thing powerful it can pull things that I gave up on for a while ago

OpiumSmoke