Scrapy Crawler with MySQL & Python | Web Scraping (part 2)

Показать описание

#webscraping #scrapy #sql #mariadb

This demo / tutorial shows how to use Python code to Crawl an entire site with Scrapy, yet only save the 'interesting' links to the mariadb SQL database.

⦿ Web scraping without using any CSS or XPATH! Nice!

I search book titles to find any titles that contain the 'keywords' that I specify in a Python list. (Line 43 in the code).

You will see how I use:

⦿ list comprehension,
⦿ "ANY" function
⦿ SQL commands such as DESC table, TRUNCATE, and INSERT

This is more simple than using Scrapy pipelines, and the code in this video could be applied to other projects outside of Scrapy.

timings:-
0:00 Introduction
5:00 parsing the item
8:00 using the "ANY" function
13:06 mariadb (MySQL)
16:23 Scrapy populates the MySQL database

*Geany didn't display underscores at certain zoom levels, but if you see the GitHub code you'll see that they are present. I suspect this could also be due to me using a VM, which can't access the proper hardware/graphics driver. One day when I have more money I'll get a new PC and run Linux natively!

# Scrapy Crawler Code on GitHub:

# Install the "Connector"
pip install mysql-connector-python

# Install the Database:
sudo apt update
sudo apt install mariadb-server
sudo mysql_secure_installation

# Install phpmyadmin :
sudo apt-get install phpmyadmin

P360

Рекомендации по теме

Комментарии

Thanks man, this is exactly what I have been looking for - can you do a "part 3" with more data and show phpmyadmin and/or export data to csv and use Pandas?

stupidsoft

Bro ı can't see anything in my table

yusufhandogan

Scrapy Crawler with MySQL & Python | Web Scraping (part 2)

Scrapy Crawler with MySQL & Python | Web Scraping (part 2)

Scrapy CRAWLER | MySQL and Python | minimalist version for Web Scraping 'books.toscrape.com&apo...

How to Web Scrape Amazon using Python, Scrapy and MySQL | View output in phpmyadmin

How To Web Scrape To Multiple Tables | Part 3 of Scrapy Crawler + MySQL series

How To Add a Database to your Scrapy Project

Web Scraping News Sites (part 3) | Scrapy items.py, pipelines.py, MySQL (MariaDB)

SQLAlchemy Integration With Python Scrapy Mysql Database integration

Python Scrapy Tutorial - 17 - Storing data in MySQL Database

Web Scraping News Sites to a database with Python, MySQL and Scrapy

Saving Data To MySQL & Postgres - Python Scrapy Beginner Series (Part 3)

Scrapy Course – Python Web Scraping for Beginners

Flask with SQL | Display data from Scrapy spider stored in Postgres

Board crawler with Django and Scrapy

Downloading Files Using Scrapy

MySQL : Python Scrapy - populate start_urls from mysql

Web Scraping News Sites | part 4 | Scrapy MySQL - xpath examples, troubleshooting, completed project

Web Scraping Amazon with Scrapy, Python, and an SQL Database | How To Use Scrapy Pipelines

Scrapy Tutorial: Learn How to Build a Web Crawler with Examples

Sending Data to XLSX from Scrapy

Can you Scrape two sites in one Scrapy spider?

Python Scrapy Tutorial- 7 - Creating our first spider ( web crawler )

How I Use Scrapy Shell When Creating Web Scraping Projects

Scrapy Item Loaders - Populate Item Fields With Processors

Scraper for www.empirepro.com on Python with Scrapy framework