How to scrape PDF files using Python + Requests and BeautifulSoup

preview_player
Показать описание
Hey what's up guys, I know you get used to watch me scraping various data sources on this channel but this time I'm scraping something for my own purposes)

Source code

Scraped files (for those interested)
Рекомендации по теме
Комментарии
Автор

Hello teacher, thank you for this tutorial.
here is my suggestion to get href(s):

pdf_links = [
item['href']
for item in
content.find_all('a', href=True)
if item['href'][-4:] == '.pdf'
]

faresjewelry
Автор

Thanks. I'm exactly now self learning Python and Web scraping now. This was exactly what I needed. Simple, with commentary and working.

homeoffice
Автор

Hi Man.
Thanks for sharing.
It worked perfectly.
I love it when We can optimize time.

alfonsochang
Автор

Other question. What text editor do you use? Thanks in advance

investigacioneseconometric
Автор

If I want extract pdf that containt a specific word ¿? What I can do? Thanks in advance

investigacioneseconometric
Автор

Web scraping is working when any other website urls i have put... It's is working... But it is not storing the pdf in pdf folder... Can anyone help

investorKidd
join shbcf.ru