Web Scraping Using Python (Step-By-Step Tutorial)

preview_player
Показать описание


If you’re just getting started with web scraping, using Python is a great choice, as it’s an object-oriented programming language. Compared to other languages, Python’s classes and objects are notably easier to use. In addition, there are multiple libraries that can make building your scraper hassle-free.

This web scraping using python tutorial lays out everything you need to know before building your first scraping tool in Python. During the first step of the video, we show how to set up the coding environment, including the IDE installation and the project structure creation.

Secondly, we go through coding the main scraper functionality, which includes importing required libraries, downloading and setting up the correct webdriver, and coding the logic required for the scraper to function.

Then, we optimize the script and code the functionality of saving results to the file.

Finally, in this python web scraping tutorial, we expand the scraper’s functionality to save more than just one data point.

Watch these related videos:
Web scraping tips and tricks:
How to scrape multiple website URLs with Python:

Join over a thousand businesses that use Oxylabs proxies:
Residential Proxies:
Shared Datacenter Proxies:
Dedicated Datacenter Proxies
SOCKS5 Proxies:

In this video, we answer the following questions:
0:00 Web scraping with python tutorial
0:26 Step 1: setting up the coding environment
2:31 Step 2: coding the core functionality of the scraper
8:34 Step 3: optimizing the code and coding the functionality of saving results to file
9:57 Step 4: expanding the Python scraping functionality

© 2022 Oxylabs. All rights reserved.

#Oxylabs #webscraping #Python
Рекомендации по теме
Комментарии
Автор

Thanks for watching! Do you have any questions about this topic?

oxylabs
Автор

Please do more examples like this, with the same video length ❤️🙏🏻🙏🏻, really amazing work

saqerbassim
Автор

I keep getting an error, "element has no attribute find". Even if I was to copy and paste correct code, same error . I don't like beautifulsoup library, it must have another library.

kestonsmith
Автор

Great video, next time zoom in a bit so the code is easier to read

piercetaylor
Автор

Hey, at the moment my code is working, i am just getting the error "executable_path has been deprecated, please pass in a Service object"
Any idea on why this might happen?

MrConfusionTheEditor
Автор

Dude you rule, I have been trying to find a bs4 tutorial, is there a place I can tip your for the tutorial? I appreciate the info but I don't see any tip or bitcoin link. Thank you!

jaybee
Автор

thanks bro couldnt find much tutorial using pycharm

raiden
Автор

my web browser is up to date but doesn't show up

ayushmalhotra
Автор

I am getting : AttributeError: 'NoneType' object has no attribute 'text'

anne-sophieandersson
Автор

PIP install not working for me, any ideas?

kingjaffah
Автор

Hello, what do I do if the chromedriver isn't available? I just downloaded the latest Chrome browser today and I can't see it in any sites. Thank you for your help.

Samantha-jiyj
Автор

can you help im stuck after line 7 where you did the driver. get when i did that nothing would pop up Its not taking me to the page and it doesn't have any errors only says process finished with exit code 0

AbdulMajid-ywks
Автор

Thanks for the video! I need to do the same but for phones at footers. I would appreciate any help!

tatianabauer
Автор

This is my first time programming in 50 years, I learned Fortran and Cobol back in the day. So I apologize if this is a stupid question. I am getting the following error: "ModuleNotFoundError: No module named 'Selenium'" I am guessing Selenium is a library I that allows my chrome driver to work. I had the import commands at the top, tried pip install in the terminal, and even manually tried through the settings icon. No luck. What might I be doing wrong?

mitch
Автор

Great Video! However, for some reason, my google chrome does not open! there aren't any errors because it finishes with code 0. I can't figure it out, can you guys help, please.

brandondill
Автор

did not display the web page instead only displaying the output screen with "process finished with exit code 0"

koushik
Автор

when i type install beatiful soup it gives me error

kurtradler
Автор

How do you get around the "load more" button

EddieRenton
Автор

So I execute the program that takes our data from the website and creates a csv. But my name.csv file has no data stored in it from the website after I execute. When I print results I can see the title names appear so I assume the program is partially working.

IvanSanchez-ylcg
Автор

What to do if the href is specified as a direct hyperlink? I can't basically use ".find('h2')" as there's no h2 specified but just the href= "http/..."

vladislavkorobeinikov