Render Dynamic Pages - Web Scraping Product Links with Python

Показать описание

Thanks to Stuart for sending this site in! I enjoyed this scraping challenge.

This video will show a simple method that can help with dynamically loaded content. I use the requestes-html library to render the page in the background quickly and efficiently, and scrape all the product links from the html DIV using the XPATH selector. I loop through each link to get all the product information.

Coming in part 2 - pagination and functions to tidy up the code.
-------------------------------------
-------------------------------------
Disclaimer: These are affiliate links and as an Amazon Associate I earn from qualifying purchases

Рекомендации по теме

Комментарии

Keyboard too loud? I've been using my mech kb again.. Is it too distracting?

JohnWatsonRooney

i'm going through ALL of your videos and just finished this one! learning so much it's incredible!

xilllllix

THANK YOU for this video and all the others. I am learning web scraping to gather data for my PhD thesis and you have helped me make such great progress in just a few days. :)

schlotto

Amazing explanation skills! Everything was clear. One of the greatest video for web scraping so far! Good job, Good luck!!

ottomanasina

I can get data from static websites using scrapy with relative ease, but I always come unstuck when I try the same with dynamic websites; I might give "html_requests' a go instead of my usual scrapy-selenium combo...Thanks for the video! 👊👊👊

edcoughlan

Man this is some amazing content. So glad i found your channel! Definitely earned a subscribe.

kewl

this was super useful! I have a project rn that needs to scrape on many pages that need renderer. This looks much more lightweight than what I'm using rn (selenium)

mia_bobia_

Lifesaver! Thank you so much! Wish you the best of luck with your channel!

agsantiago

Very clearly explained. May I ask if there is a GitHub repo containing the code that you used in the video?

neginbabaiha

When I use Xpath, in products (on a different site, but same principles) terminal keeps returning 'None', the site is gwt based, would that affect xpath from working?

Aaron-qngu

You are a great and creative person...keep going champ.

dobcs

Awesome!, I was searching for such type of scraping, and I found

farhadkhan

Hi, I tried your code on other website, but when I arrived at print(products) part, it returns 'NoneType' object. The code get no url. What should I do?. I tried to use the user-agent, but also return nothing

bagia

Nice video - minus the try/catch with no specific exception. I know this is a tutorial, but that’s a bad habit to share. Regardless, thank you for the content.

Nope-

Hi John and everyone, I'm having trouble with the html.render() method, I'd appreciate any help.
First time the method runs, it downloads chromium. After I ran it, 3 red lines were printed (Downloading Chromium & stuff I can't remember), I felt like it took too long (more than 10 minutes), so I stopped the program.
Now when I try to run a the method, the script just get stucked, I mean, it is running, but never continues to the lines after the html.render method. No errors are raising, the script simply never finishes to run.
I tried to pip uninstall requests-html and reinstall it but I'm getting the same not indicative result.
How can I troubleshoot this problem? I'm excite to work with requests-HTML and letting for of Selenium for standard rendering needs, but I can't.
Thanks a lot for anyone who cares enough to give it a try.

royteicher

Hello John,
if i add command r.html.render(sleep=1) the output be "Cannot use HTMLSession within an existing event loop. Use AsyncHTMLSession instead.", i am anything on google, no clue, any idea?

charisthawhite

You are a truly life saver. great great video. thanks mate

kavehyarohi

John: when I follow your code, @ "for item in products.absolute_links:, although I specify, e.g. 'div.product-subtext', the iteration only returns the item.text, (the link text of item) and not the sub-text of the item. This is true of price, name, and so-forth. Can you explain this behavior?

justinames

You missed an explanation: what circumstances should you use xpath v div.<classname>?

Dome

Thank you so much. Your video is going to help me a lot in a project that I'm going to start. One question if you don't mind, when I want to gather text but there is a part of the text is appearing and there is a[ click for more] ~>hyperlink, that prevents the text from being fully copied to the csv file. Do you have a hint or suggestions? I appreciate your help in advance

Mr.AIFella

Render Dynamic Pages - Web Scraping Product Links with Python

Render Dynamic Pages - Web Scraping Product Links with Python

10 Rendering Patterns for Web Apps

Creating HTML Page using JavaScript dynamically rendered content

Web Scraping With Selenium Python: Delayed JavaScript Rendering

Next.js Explained: Static vs. Dynamic rendering

Static vs Dynamic Websites - What's the Difference?

Render HTML Dynamically Using AJAX - JavaScript Tutorial

Rendering Dynamic Pages 2! - Web Scraping ALL products with Python

Angular 18 Template expression Syntax and Custom Directives Complete Guide - LIVE -

Dynamic Websites vs Static Pages vs Single Page Apps (SPAs)

Dynamic Rendering for JavaScript web apps - JavaScript SEO

Scraping Data from JavaScript rendered tables with Python

Static vs Dynamic Rendering in NextJs 13

Dynamically rendering components #javascript #code #programming #web #typescript

What are Server Side Rendering (SSR) & Client Side Rendering (CSR) | Pros + Cons

Creating an Umbraco 8 website: Render dynamic values

This Next.js function simplifies dynamic rendering

Next.js 14 Tutorial - 53 - Dynamic Rendering

Rendering Dynamic Content using plain HTML, CSS and JavaScript

Choosing between SSR, SSG, and dynamic rendering in Astro

Static vs Dynamic websites | Server-Side Rendering vs API Empowered Websites | .NET Core #2

Day7: Rendering Dynamic Data in Umbraco CMS

Ryan Seddon: So how does the browser actually render a website | JSConf EU 2015

How to Scrape JavaScript Websites (Delayed Rendering) - Web Scraping With Python #selenium #python