Data Pipelines: How to make them better

There are so many tools that claim ETL and data pipelines are easy. But what do you actually need to think about when designing a data pipeline? From auditing and logging to data storage, data lakes, and data warehouses, all of these belong in the design of your pipeline.

Comments

Steps to make a pipeline better:
1. Good auditing and logging: error handling (first sketch after this list)
2. Repeatable: rerunning a load should produce identical results (second sketch below)
3. Self-healing: find a way to detect the delta; keep log files and compare, add a data lake before the data warehouse, add hashes or watermarks before comparing (third sketch below)
4. Decouple EL and T: land in raw format, transform into the DWH, keep reporting tables clean (fourth sketch below)
5. Always available: a truncate-and-load refresh is faster than updates, or build a semantic layer (fifth sketch below)
6. CI/CD: coded, connected to git, versioned, with rollbacks
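
The sketches below are editorial illustrations, not from the video. First, a minimal sketch of step 1 in Python, assuming a SQLite audit table; the pipeline_audit layout and the extract_orders step are hypothetical names. Each step run writes one audit row with its status and row count, and failures are both logged and audited before being re-raised.

```python
import logging
import sqlite3
from datetime import datetime, timezone

logging.basicConfig(level=logging.INFO,
                    format="%(asctime)s %(levelname)s %(message)s")
log = logging.getLogger("pipeline")

def audit_run(conn, step, status, rows, error=None):
    # One audit row per step run, so failures and row counts stay queryable.
    conn.execute("CREATE TABLE IF NOT EXISTS pipeline_audit "
                 "(run_at TEXT, step TEXT, status TEXT, rows INTEGER, error TEXT)")
    conn.execute("INSERT INTO pipeline_audit VALUES (?, ?, ?, ?, ?)",
                 (datetime.now(timezone.utc).isoformat(), step, status, rows, error))
    conn.commit()

def extract_orders(path):
    # Hypothetical extract step: one order id per line.
    with open(path) as f:
        return [line.strip() for line in f if line.strip()]

with open("orders.txt", "w") as f:  # sample input so the sketch runs end to end
    f.write("1001\n1002\n")

conn = sqlite3.connect(":memory:")
try:
    rows = extract_orders("orders.txt")
    log.info("extracted %d rows", len(rows))
    audit_run(conn, "extract_orders", "success", len(rows))
except Exception as exc:
    log.exception("extract_orders failed")
    audit_run(conn, "extract_orders", "failed", 0, str(exc))
    raise
```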
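
A minimal sketch of step 2, assuming a partition-per-run-date load into SQLite; the table and column names are hypothetical. Delete-then-insert for one date inside a single transaction means a rerun for that date leaves the table in exactly the same state.

```python
import sqlite3

def load_daily_sales(conn, run_date, rows):
    # Delete-then-insert for one partition: rerunning the same run_date
    # is safe and produces identical results.
    conn.execute("CREATE TABLE IF NOT EXISTS daily_sales "
                 "(run_date TEXT, product TEXT, amount REAL)")
    with conn:  # one transaction: readers never see a half-loaded day
        conn.execute("DELETE FROM daily_sales WHERE run_date = ?", (run_date,))
        conn.executemany("INSERT INTO daily_sales VALUES (?, ?, ?)",
                         [(run_date, p, a) for p, a in rows])

conn = sqlite3.connect(":memory:")
load_daily_sales(conn, "2024-01-15", [("widget", 9.5), ("gadget", 4.0)])
load_daily_sales(conn, "2024-01-15", [("widget", 9.5), ("gadget", 4.0)])  # rerun
print(conn.execute("SELECT COUNT(*) FROM daily_sales").fetchone())  # (2,)
```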
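
A minimal sketch of step 3's watermark idea, assuming the source table has an updated_at column; the watermarks table and extract_delta helper are hypothetical. Each run pulls only rows newer than the stored high watermark and then advances it, so a failed run can simply be rerun to pick up the missing delta.

```python
import sqlite3

def get_watermark(conn, source):
    conn.execute("CREATE TABLE IF NOT EXISTS watermarks "
                 "(source TEXT PRIMARY KEY, last_updated_at TEXT)")
    row = conn.execute("SELECT last_updated_at FROM watermarks WHERE source = ?",
                       (source,)).fetchone()
    return row[0] if row else "1970-01-01T00:00:00"

def extract_delta(conn, source):
    # Pull only rows changed since the last successful run, then advance
    # the stored high watermark to the newest updated_at seen.
    wm = get_watermark(conn, source)
    rows = conn.execute("SELECT id, updated_at FROM orders "
                        "WHERE updated_at > ? ORDER BY updated_at",
                        (wm,)).fetchall()
    if rows:
        conn.execute("INSERT INTO watermarks (source, last_updated_at) VALUES (?, ?) "
                     "ON CONFLICT(source) DO UPDATE SET "
                     "last_updated_at = excluded.last_updated_at",
                     (source, rows[-1][1]))
        conn.commit()
    return rows

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, updated_at TEXT)")
conn.executemany("INSERT INTO orders VALUES (?, ?)",
                 [(1, "2024-01-01T10:00:00"), (2, "2024-01-02T10:00:00")])
print(len(extract_delta(conn, "orders")))  # 2: full pull on the first run
print(len(extract_delta(conn, "orders")))  # 0: nothing new since the watermark
```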
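
A minimal sketch of step 4, assuming a file-based lake under lake/raw/; the paths and field names are hypothetical. EL lands the source payload untouched, and T is a separate step that reads the landing zone and builds a clean reporting extract, so either half can be rerun independently.

```python
import csv
import json
from pathlib import Path

LAKE = Path("lake/raw/orders")

def extract_load(records, run_date):
    # EL: land the payload in raw JSON, partitioned by date, untransformed.
    target = LAKE / run_date
    target.mkdir(parents=True, exist_ok=True)
    (target / "orders.json").write_text(json.dumps(records))

def transform(run_date, out_path):
    # T: read the raw landing file and build a clean reporting table.
    records = json.loads((LAKE / run_date / "orders.json").read_text())
    with open(out_path, "w", newline="") as f:
        writer = csv.DictWriter(f, fieldnames=["id", "amount"])
        writer.writeheader()
        for r in records:
            writer.writerow({"id": r["id"], "amount": round(float(r["amount"]), 2)})

extract_load([{"id": 1, "amount": "9.50"}], "2024-01-15")
transform("2024-01-15", "reporting_orders.csv")
```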
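
A minimal sketch of step 5's truncate-and-load idea, assuming SQLite; table names are hypothetical. The refresh builds a staging table and swaps it in with renames inside one transaction, so readers see either the old or the new reporting table, never an empty one mid-refresh.

```python
import sqlite3

def refresh_report(conn, fresh_rows):
    cur = conn.cursor()
    cur.execute("CREATE TABLE IF NOT EXISTS report (product TEXT, total REAL)")
    cur.execute("DROP TABLE IF EXISTS report_staging")
    cur.execute("CREATE TABLE report_staging (product TEXT, total REAL)")
    cur.executemany("INSERT INTO report_staging VALUES (?, ?)", fresh_rows)
    # Swap inside one transaction: the reporting table stays available
    # to readers for the whole refresh.
    with conn:
        cur.execute("ALTER TABLE report RENAME TO report_old")
        cur.execute("ALTER TABLE report_staging RENAME TO report")
        cur.execute("DROP TABLE report_old")

conn = sqlite3.connect(":memory:")
refresh_report(conn, [("widget", 120.0), ("gadget", 75.5)])
print(conn.execute("SELECT * FROM report").fetchall())
```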

georgechristy

Thanks for the video. Do you have an example of a pipeline built from scratch following the best practices mentioned in the video? Text/book or course-based, it doesn't matter.

MrHaste

Great video, thanks for your effort. Could you make more videos about building pipelines with open-source tools? That would greatly benefit people who are just getting started in this field, before they jump directly into the world of cloud.

hoblwop