Talk - Josh Weissbock: Distributed Web Scraping in Python

preview_player
Показать описание
Web scraping is easy to do in Python, but it quickly becomes tedious when routinely running large batch scraping jobs. This talk looks at how to build a distributed web scraper to reduce batch scraping job times and improve durability of your code as well as lessons learned & stories along the way.

Рекомендации по теме
Комментарии
Автор

I'm surprised to not see Scrapy mentioned. How is this approach different or better?

Ashton