Parsing HTML document with Python and Regular Expressions for web scraping purposes

preview_player
Показать описание
Hey what's up guys, before anything else I would like to let you know that I'm aware of BeautifulSoup library and Scrapy framework - I have numerous tutorials on these topics located in the corresponding playlists on this channel, so the reason of creating this tutorial is more like for fun rather then for production.

In this video we'll compose a regular expression to parse the HTML document by matching starting tag and the corresponding textual node and then we'll create a python list of dictionaries containing tag's data, e.g. name, attributes, text.

All you need to follow this tutorial is web browser and bare python interactive console!

Building Tiny Scraper (web scraping library based on urllib and this HTML parser)

Be careful if you'd like to try it at home)
Рекомендации по теме
Комментарии
Автор

Thanks alot brother, aprreciate your sunday work

ali_rauf
Автор

Very useful content. Keep it up, man. Kudos to you

akuamtau
Автор

doing a Python online course thank you King this makes me think to a hamburger and in a short while it starts England Scotland for soccer Euro 2021... hi😀)

ugiampaolo
Автор

I want to start freelancing with web scraping, any advise!!!

rahalmehdiabdelaziz