Scraping IMDB for Simpsons Reviews | Python Web Scraping Walkthrough

preview_player
Показать описание
I'll be building a Python script to scrape series data from IMDB 🎬.

This is part 1 of 2, in a larger project where I want to collect a bunch of data on my favourite series from IMDB, then analyse it and create some cool visuals using Python 🐍.

-----------------
🎞️ Video Timestamps
----------------
00:00 - Intro
01:00 - Backgound
02:20 - Request Data
06:20 - Parse Response
14:18 - Make DataFrame
18:23 - Outro

-----------------
👨‍💻 About
----------------
My name is Matt, I'm a self-taught programmer and new YouTuber! 😊

I'm keen on tech, art and lately I've been getting into filming stuff - Hope to see you guys around!
Рекомендации по теме
Комментарии
Автор

Hey! Just wanted to raise a quick improvement to the code. When you're trying to isolate the score and votes text, you don't need to go down the html layer-by-layer as I did in the video.
You can actually get the same result by just targeting the final span tags directly:
episode_score = episode.find('span', attrs={'class':
Still good practice though, I suppose 😊 Happy coding! 👨🏽‍💻

mattbarty
Автор

You saved me days of hard work (I was struggling with "load more" buttons on IMDB reviews & scrapy)... I learned more in 10 minutes with you than the past 3 days or so. Thank you <3

Thaisdonega
Автор

yessss matt another video!! and the finished scraped data table looked so clean 😍

angelpapi
Автор

Matt, very clear, clean and organized content. New fan, keep it up...

tomasatb
Автор

Very nice video as a cs student and golden die hard simpson fan

asdfufu
Автор

Legend, I learnt so much please keep making project videos like these. Easy to understand and it felt more like listening to a friend rather than a lecture.

feedback: Can you post the final code in the video description or provide a link :)

dinodon
Автор

Great video with clear walkthrough ! Cheers !

unaveenj
Автор

hey matt!
amazing work done but I had 1 question
What if page numbers or season is not given ? only next and previous is mentioned

rohitshegokar
Автор

I usually use scrapy, but bs4 has great benefits to be able to visualize each line of code in Jupyter

DatabricksLATAM
Автор

Hey Matt, I'm not able to move forward because I'm getting <Response [403]>. Can you tell me what might be the problem ?

aroanrobert
Автор

Hi Matt great Video.. really breaks the concept down for easy understanding.. Can you also share the .ipnyb file for this.. It will be really helpful

manishtripathi
visit shbcf.ru