Redfin Analytics | Python ETL Pipeline with Airflow | Data Engineering Project | Snowpipe | Snowflake

1. Extraction

Data is extracted from Redfin using a Python script.
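As a rough illustration of this step, the sketch below downloads a file over HTTP and decompresses it if it is gzip data. The description does not say which Redfin URL or file format the script uses, so the URL is left as a parameter and the gzip handling is an assumption.

```python
import gzip
import urllib.request


def fetch_bytes(url: str, timeout: float = 60.0) -> bytes:
    """Download the raw bytes at `url` (hypothetical: supply the real data URL)."""
    with urllib.request.urlopen(url, timeout=timeout) as response:
        return response.read()


def decompress_if_gzip(payload: bytes) -> bytes:
    """Return decompressed bytes if `payload` is gzip data, else return it unchanged."""
    if payload[:2] == b"\x1f\x8b":  # gzip magic number
        return gzip.decompress(payload)
    return payload


def extract(url: str) -> bytes:
    """Fetch the source file and return its uncompressed contents."""
    return decompress_if_gzip(fetch_bytes(url))
```

Keeping the download and the decompression in separate functions lets the decompression logic be checked without network access.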
2. Load Raw Data

The extracted raw data is loaded into an Amazon S3 bucket.
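A minimal sketch of the raw load, assuming a boto3 S3 client is passed in. The bucket name, key layout, and helper names are hypothetical and not taken from the project; only `put_object` is the real boto3 call.

```python
from datetime import date


def raw_object_key(prefix: str, run_date: date, filename: str) -> str:
    """Build a date-partitioned object key (hypothetical layout)."""
    return f"{prefix.strip('/')}/{run_date:%Y/%m/%d}/{filename}"


def upload_raw(s3_client, bucket: str, key: str, body: bytes) -> None:
    """Write `body` to s3://<bucket>/<key> using a boto3 S3 client."""
    s3_client.put_object(Bucket=bucket, Key=key, Body=body)


# Possible usage (requires boto3 and configured AWS credentials):
#   import boto3
#   client = boto3.client("s3")
#   key = raw_object_key("raw", date.today(), "redfin.tsv")
#   upload_raw(client, "my-raw-bucket", key, payload)
```

Passing the client in as an argument, rather than creating it inside the function, makes it possible to exercise the upload logic with a stub object.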
3. Transform/Load

A Python script transforms the raw data for analysis.
The transformed data is loaded into another Amazon S3 bucket.
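The description does not list the actual transformations, so the following is only a sketch of what such a script might do: keep a few columns, drop incomplete rows, and emit CSV. The column names are assumptions, and the project may well use a library such as pandas rather than the standard library.

```python
import csv
import io

# Hypothetical column names; the real Redfin schema may differ.
KEEP_COLUMNS = ["period_begin", "city", "state", "median_sale_price"]


def transform(raw_tsv: str) -> str:
    """Keep selected columns, drop incomplete rows, and return CSV text."""
    reader = csv.DictReader(io.StringIO(raw_tsv), delimiter="\t")
    out = io.StringIO()
    writer = csv.DictWriter(out, fieldnames=KEEP_COLUMNS, lineterminator="\n")
    writer.writeheader()
    for row in reader:
        # Missing columns come back as None, so normalise them to "".
        selected = {name: (row.get(name) or "").strip() for name in KEEP_COLUMNS}
        if all(selected.values()):  # skip rows with any missing value
            writer.writerow(selected)
    return out.getvalue()
```

The returned CSV text could then be written to the second bucket with the same kind of `put_object` call used for the raw file.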
4. Trigger

Apache Airflow, running on an EC2 instance, triggers the transfer of the transformed data into Snowflake.
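One way to read this step is that an Airflow DAG orchestrates the scripts, and its final task writes the transformed file to the bucket that Snowflake ingests from. The sketch below assumes Airflow 2.4 or later. The DAG id, task names, schedule, and placeholder callables are all hypothetical.

```python
from datetime import datetime


def extract_task() -> str:
    """Placeholder: run the extraction and return the raw object key."""
    return "raw/redfin.tsv"


def transform_task(raw_key: str) -> str:
    """Placeholder: transform the raw file and return the new object key."""
    stem = raw_key.replace("raw/", "transformed/", 1).rsplit(".", 1)[0]
    return stem + ".csv"


def build_dag():
    """Assemble the DAG; Airflow is imported here so the helpers load without it."""
    from airflow import DAG
    from airflow.operators.python import PythonOperator

    with DAG(
        dag_id="redfin_analytics",  # hypothetical name
        start_date=datetime(2024, 1, 1),
        schedule="@weekly",  # assumed cadence
        catchup=False,
    ) as dag:
        extract = PythonOperator(task_id="extract", python_callable=extract_task)
        transform = PythonOperator(
            task_id="transform",
            python_callable=transform_task,
            # op_args is templated, so the key is pulled from XCom at run time.
            op_args=["{{ ti.xcom_pull(task_ids='extract') }}"],
        )
        extract >> transform
    return dag
```

In a real DAG file, the result of `build_dag()` would be assigned to a module-level variable (for example `dag = build_dag()`) so that the scheduler can discover it.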
5. Snowpipe

The transformed data is automatically ingested into Snowflake using Snowpipe.
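Snowpipe is configured with a `CREATE PIPE` statement that wraps a `COPY INTO` command. The helper below builds such a statement as a string; the pipe, table, stage, and file format names are placeholders, since the description does not give the real ones.

```python
import re

_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_$.]*$")


def _check(name: str) -> str:
    """Reject anything that is not a plain (optionally dotted) identifier."""
    if not _IDENTIFIER.match(name):
        raise ValueError(f"unsafe identifier: {name!r}")
    return name


def create_pipe_sql(pipe: str, table: str, stage: str, file_format: str) -> str:
    """Return a CREATE PIPE statement with auto-ingest enabled."""
    return (
        f"CREATE OR REPLACE PIPE {_check(pipe)}\n"
        f"  AUTO_INGEST = TRUE\n"
        f"AS\n"
        f"  COPY INTO {_check(table)}\n"
        f"  FROM @{_check(stage)}\n"
        f"  FILE_FORMAT = (FORMAT_NAME = '{_check(file_format)}');"
    )
```

For auto-ingest from S3, the bucket also needs an event notification that targets the pipe's notification channel; that setup happens on the AWS side and is not shown here.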
6. Snowflake

The data is stored in Snowflake for analytical purposes.
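As an example of the kind of analysis this enables, the sketch below runs an aggregate query through any Python DB-API cursor, such as one obtained from the Snowflake connector. The table and column names are assumptions carried over for illustration only.

```python
# Hypothetical table and column names.
QUERY = """
SELECT city, AVG(median_sale_price) AS avg_price
FROM redfin_sales
GROUP BY city
ORDER BY avg_price DESC
"""


def average_price_by_city(cursor):
    """Return (city, average price) pairs, highest average first."""
    cursor.execute(QUERY)
    return [(city, float(avg)) for city, avg in cursor.fetchall()]
```

Because the function only relies on `execute` and `fetchall`, it can be tried against a local database before pointing it at Snowflake.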
7. Visualization

Power BI, connected to Snowflake, is used to visualize and analyze the data.