Simple and Fast Web Scraping using Python | Requests_HTML | IMDb movies Scraping | Data Analytics

Python is a powerful programming language with libraries for a wide variety of tasks, such as working with APIs, data analysis, and machine learning. [1]
Anaconda is an open-source distribution of the Python and R programming languages for data science that aims to simplify package management and deployment.
The Jupyter Notebook is the original web application for creating and sharing computational documents.
It offers a simple, streamlined, document-centric experience. [2]
The Requests-HTML library (imported as requests_html) intends to make parsing HTML (e.g. scraping the web) as simple and intuitive as possible. [3]

In this video, we will be learning:
• Different web scraping libraries in Python
• How to import libraries and run Python scripts in Jupyter Notebook
• How to scrape multiple data points from one website using the requests_html library
• How to scrape multiple websites using the requests_html library
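As a rough idea of what scraping several data points from one page can look like, here is a minimal sketch. It is not the script from the video. The XPath strings in MOVIE_XPATHS and the helper names fetch_page and extract_fields are made up for illustration, so copy real XPaths from your browser's inspector. The only library calls used are HTMLSession().get(), the .html attribute of the response, and .xpath(selector, first=True).

```python
def fetch_page(url):
    """Download a page and return its parsed HTML (needs network access)."""
    # Third-party dependency, imported lazily: pip install requests-html
    from requests_html import HTMLSession

    session = HTMLSession()
    response = session.get(url)
    return response.html


def extract_fields(page, xpaths):
    """Return {field name: text} for each XPath, or None when nothing matches."""
    record = {}
    for name, selector in xpaths.items():
        # first=True gives back a single element, or None if there is no match
        element = page.xpath(selector, first=True)
        record[name] = element.text.strip() if element is not None else None
    return record


# Hypothetical XPaths, for illustration only; real pages will need their own.
MOVIE_XPATHS = {
    "title": "//h1",
    "rating": "//span[@class='rating']",
}

# Typical use in a notebook cell (commented out because it needs the network):
# page = fetch_page("https://www.imdb.com/chart/top/")
# print(extract_fields(page, MOVIE_XPATHS))
```

Keeping the XPaths in a dictionary means a new data point is one extra entry rather than another block of code. Some sites reject requests that use the default headers, so a live run may need extra settings.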

Timeline:
• Web scraping libraries - 0:27
• IMDb website scraping scope - 1:28
• Jupyter Notebook and libraries - 2:30
• Web scraping Python script - 4:34
• Extracting the XPath of different data points from a website - 6:30
• Extracting IMDb movie link XPaths from a Python set - 13:15
• How to work with scraped data, tuples and DataFrames - 18:05
• Web scraping with a Python for loop - 20:00
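The later steps in the timeline (filtering a set of links, looping over pages, and collecting tuples for a DataFrame) could be sketched roughly as below. This is an illustration under assumptions, not the code shown in the video. The helpers movie_links, scrape_all and to_dataframe are hypothetical names, and scrape_one stands for whatever function you write to scrape a single page. The input set could be, for example, the absolute_links attribute that requests_html exposes on a parsed page.

```python
import re

# IMDb title pages have the form https://www.imdb.com/title/tt<digits>/
TITLE_LINK = re.compile(r"^https://www\.imdb\.com/title/tt\d+/")


def movie_links(links):
    """Keep only title-page URLs from a set of links, trimmed and de-duplicated."""
    found = set()
    for link in links:
        match = TITLE_LINK.match(link)
        if match:
            found.add(match.group(0))  # drops query strings such as ?ref_=...
    return sorted(found)


def scrape_all(urls, scrape_one):
    """Call scrape_one(url) for every URL and collect one tuple per movie."""
    rows = []
    for url in urls:
        try:
            rows.append((url, *scrape_one(url)))
        except Exception as error:  # keep going if a single page fails
            print(f"skipped {url}: {error}")
    return rows


def to_dataframe(rows, columns):
    """Turn the list of tuples into a table (needs: pip install pandas)."""
    import pandas as pd

    return pd.DataFrame(rows, columns=columns)
```

Passing scrape_one in as an argument keeps the loop independent of any particular site, so the same loop can be reused for other websites.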

How can web scraping help you?
• E-commerce - product details extraction, product price scraping, pricing analysis
• Business intelligence - extract data to support business decisions
• Personal use - scrape any website using this method to get the data into Excel, then filter it by different fields instead of spending time scrolling and checking
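One simple way to get scraped rows into Excel is to write them to a CSV file, which Excel opens directly. The sketch below uses only the standard library. The function name save_rows and the file name in the usage line are assumptions for illustration.

```python
import csv


def save_rows(path, header, rows):
    """Write a header plus one line per scraped record to a CSV file."""
    # utf-8-sig adds a byte order mark so Excel detects the encoding;
    # newline="" prevents blank lines between rows on Windows.
    with open(path, "w", newline="", encoding="utf-8-sig") as handle:
        writer = csv.writer(handle)
        writer.writerow(header)
        writer.writerows(rows)


# Example use with hypothetical data:
# save_rows("movies.csv", ["title", "rating"], [("Movie A", "8.1")])
```

Once the file is open in Excel, the header row can be used with the built-in filter to narrow the list by any field.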

* The information in the videos and the resources available for download or viewing through this YouTube channel are for educational and informational purposes only.

Source:
Comments

I prefer Scrapy for scraping, but it's interesting to see Jupyter used for scraping. The "Your Office license is not genuine" part was priceless... smh

TimoTalksTech