Event-driven Data Pipelines with Apache Airflow - Airflow Summit 2024
Presented by John Jackson at Airflow Summit 2024
Airflow is all about schedules. We use cron strings and Timetables to define them, and the Airflow Scheduler component manages those timetables, among much else, to ensure that DAGs and tasks run according to those schedules.
But what do you do if your data isn’t available on a schedule? What if data is coming from many sources, at varying times, and your job is to make sure it’s all as up-to-date as possible? An event-driven data pipeline may be the answer.
An event-driven architecture (EDA) is an architecture pattern that uses events to decouple an application’s components. It relies on external events, not an internal schedule, to create loosely coupled data pipelines that determine when to take action and what actions to take. In this session, we will discuss the design considerations when using Airflow in an EDA and the tools Airflow provides to make this happen, including Datasets, the REST API, Dynamic Task Mapping, custom Timetables, Sensors, and queues.
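As one illustration of the REST API option mentioned above, the sketch below builds the HTTP request an external system could send to start a DAG run when an event arrives. It is not taken from the talk. It assumes the Airflow 2.x stable REST API path `/api/v1/dags/{dag_id}/dagRuns`; the base URL, the DAG id `process_orders`, and the event payload are hypothetical, and authentication is left out.

```python
import json
import urllib.parse
import urllib.request


def build_trigger_request(base_url: str, dag_id: str, event: dict) -> urllib.request.Request:
    """Build (but do not send) a POST asking Airflow to start a DAG run.

    Assumes the Airflow 2.x stable REST API; the event is passed to the
    DAG run as its `conf` dictionary.
    """
    safe_dag_id = urllib.parse.quote(dag_id, safe="")
    url = f"{base_url.rstrip('/')}/api/v1/dags/{safe_dag_id}/dagRuns"
    body = json.dumps({"conf": event}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        method="POST",
        headers={"Content-Type": "application/json"},
    )


# Hypothetical event, e.g. emitted when a file lands in object storage.
event = {"source": "s3://example-bucket/incoming/orders.csv"}
request = build_trigger_request("http://localhost:8080", "process_orders", event)
print(request.get_method(), request.full_url)
```

Sending the request (for example with `urllib.request.urlopen(request)`) would additionally require credentials accepted by the Airflow deployment. Inside the DAG, the event would then be available through the run's `conf`.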
Presentation on Kafka Event Driven Data Pipelines
Architecture: Event Driven Change Data Capture Pattern using Apache Pulsar
From Batch to Real-Time: Tips for Streaming Data Pipelines with Apache Kafka ft. Danica Fine
What is Data Pipeline? | Why Is It So Popular?
Event Driven Architectures with Apache Geode and Spring Integration
Event-Driven Change Data Capture using Apache Pulsar by Mary Grygleski
Implementing Event Based DAGs with Airflow
Event-driven applications: Apache Kafka and Python | DevNation Day 2021
Back to Basics: Building an Event Driven Serverless ETL Pipeline on AWS
Manning Introduces: Streaming Data Pipelines with Apache Kafka
Designing a Horizontally Scalable Event Driven Big Data Architecture w/ Apache Spark Ricardo Fanjul
Robin Moffatt — The Changing Face Of ETL: Event-driven Architectures for Data Engineers |Øredev 201...
Dynamic Event-driven Workflows with Prefect Cloud
Building the Foundations of an Intelligent, Event-Driven Data Platform at EFSA
Streaming Data Pipelines with Kafka - First Chapter Summary
Event-Driven Architecture: RabbitMQ and Apache Kafka
Jay Kreps | Kafka Summit 2018 Keynote (The Death and Rebirth of the Event-Driven Architecture)
Staging Reactive Data Pipelines Using Kafka
Brian Ritchie - Building Event-Driven Systems with Apache Kafka - Code on the Beach 2016
Real-Time Data Pipeline with Kafka on MovieLens data
Apache Kafka and KSQL in Action : Lets Build a Streaming Data Pipeline! by Viktor Gamov
Deliver Data Insights Faster With Dynamic Event Pipelines
code.talks 2019 - The Changing Face of ETL: Event-Driven Architectures for Data Engineers