Web Scraping (Downloading) Google Images using Python + Selenium Webdriver

preview_player
Показать описание
What's up, y'all! In this video we'll write a Python script from scratch that uses Selenium web automation and testing library to web scrape and download images from Google Images. We'll dive deep into understanding the (very simple) code behind it. Also, this video is made to be as approachable for beginners as possible. So, if it's your first introduction to web scraping - fear not - we'll go through the whole process starting from installing the Selenium library in Python all the way to having a working Python script. The video also has time stamps, so feel free to jump around. Hope you enjoy!

⏳Time stamps⏳
00:00 Intro
01:36 YOLOv5 Series
02:50 Installing Selenium and Beautiful Soup (bs4)
05:55 Launching automated google chrome browser
09:02 Is web scraping legal?
13:55 How the web scraper is going to work
20:30 Clicking on image containers
29:45 Waiting for high resolution images to load
39:44 Code to download the images
43:27 Testing the script
47:31 Outro

▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
It would mean the world to me, if you decided to support me and the channel =)

►You may consider watching ads that show up on the videos

Making these videos takes a lot of time and effort, so If you decide to support me, please don't hesitate get in touch with me as I'd like to thank you personally!
▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬▬
Thanks for watching!

Disclaimer: this video is for educational purposes and does not encourage anybody to violate any copyright laws.
Рекомендации по теме
Комментарии
Автор

Came for getting the script, stayed for the tutorial!! Thanks man.

mukilankrishnakumar
Автор

Thank you for the nice tutorial!
As mentioned before, the div boxes and selenium code have to be updated.

MrStgg
Автор

Google has changed the div boxes and there are some changes in the selenium updates but I have been able to download images by changing according to div boxes and updates. Thank you for your explanation it helped a lot while modifying the code. (Спасибо)

daviduday
Автор

I was saving image urls with pyautogui, this way is a miracle for me :D Thank you soo much!

vehimii
Автор

One of the most complete tutorial with really good explanation. Thanks for the same.

shrishailgajbhar
Автор

Hey man really interesting videos you got there ! I can't believe that u haven't more subscribers, I was wondering where are u from? And did you already did some AI with unity ? I could be a very nice video to watch tho

laurentk
Автор

this was my first python project. thank you so much! it was fun figuring it out with you

ezekiel
Автор

I cant thank you enough..!! This video helped alotttt... Keep up the good work man.. Thank you so much

harithasivan
Автор

It was awesome!!! Thanks man.. It will help me with my college project

devanshkaushik
Автор

been looking for this tutorial long time, awesome tutorial!!!

chetansinghkachhawa
Автор

finally, the man is back with web scraping

hussainbergthesalafi
Автор

learnt a lot to with regards to web scraping, thanksthanksthanks ! ! !

cyriljchan
Автор

Great tutorial and explanation thank you!

willyhorizont
Автор

this is a very useful and well-made tutorial thanks for the video

Frostbyte-Game-Studio
Автор

Awesome man, you've saved me a lot of time. Thanks!

mateuscoutinhomarim
Автор

Very helpful tutorial. Is it possible to thread the code in order to make it faster? Because it still seems slow by a computer's standard... I want to download images

zdtuzwz
Автор

I am having a problem @ 21:36 where I am supposed to click the image.
My code is:
driver.find_element(by=By.XPATH,
but the image is not being clicked. the xpath is correct and since Selenium changed the find_element_by_xpath i used the new find element method.
How can I solve this problem??

jemimahgreen
Автор

Thanks for your great video
I have 2 questions:
-How Can I open chrome browser with default profile?
-Your code works with normal links but It don`t work with blob links.Please tell me hot to download those links

yasserahmed
Автор

got this error:

error why: Traceback (most recent call last):
File "C:\Users\Charles\Desktop\Hack\DB\Hex\QR\search.py", line 1, in <module>
import bs4
File
_qbz5n2kfra8p0\LocalCache\local-packages\Python310\site-packages\bs4\__init__.py", l
ine 175
except Exception, e:

SyntaxError: multiple exception types must be parenthesized

why

dtory
Автор

Hey Ivan, Thank you so much for making this video. Several of your videos have helped me immensely so far. I would appreciate some advice.

It appears that when I run the scraper it thinks it's reached the end of the page when it gets to an ad block embedded within the search results. It's a bit like the "related searches" problem you mentioned. I can't seem to figure out how to make it ignore these ad blocks. The way it is now it only lets me download about 20 images if these ad blocks show up.

Any help would be wonderful. Thank you!

christianhoole