Multithreaded Java web scraper with output written to CSV, JSON, and MySQL database

preview_player
Показать описание

This AmazonWebScraper is created using Java/Jsoup.
Scraping is multithreaded - meaning each category will be scraped simultaneously

This demo will scrape Top 100 books from Amazon for 3 categories:
- new-releases
- movers-and-shakers
- bestsellers

Input is a CSV file containing the list of URLs.

Output is written to:
- Log
- CSV file(s)
- JSON file(s)
- Database entries (MySQL)
Рекомендации по теме
Комментарии
Автор

can you please provide the whole code of your JsoupAmazonScraper.java file ?

arjunsharma
Автор

this is very interesting, I been writing web scrapers for about a year...I can't get my scraper to run that think you would be willing to help me out?

genelarose
join shbcf.ru