A Deep Dive into Stateful Stream Processing in Structured Streaming 2018 Part 2 (Tathagata Das)

Показать описание

Tathagata Das is an Apache Spark committer and a member of the PMC. He's the lead developer behind Spark Streaming and currently develops Structured Streaming.

Stateful processing is one of the most challenging aspects of distributed, fault-tolerant stream processing. The DataFrame APIs in Structured Streaming make it very easy for the developer to express their stateful logic, either implicitly (streaming aggregations) or explicitly (mapGroupsWithState). However, there are a number of moving parts under the hood which makes all the magic possible. In this talk, I am going to dive deeper into how stateful processing works in Structured Streaming.

About: Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.

Connect with us:

Рекомендации по теме

Комментарии

How can we do deduplication and keep the last record instead of first (based on timestamp field in dataframe)? Current implementation for dropDuplicates keep the first occurrence and ignores all subsequent occurrences for that key, how can we tell Spark to update the state and keep the most recent value based on timestamp field.

AashishOla

A Deep Dive into Stateful Stream Processing in Structured Streaming 2018 Part 2 (Tathagata Das)

Deep Dive into Stateful Stream Processing in Structured Streaming with Tathagata Das (Databricks)

Stateful Workloads in Kubernetes: A Deep Dive - Kaslin Fields & Michelle Au, Google

Deep Dive into Stateful Stream Processing in Structured Streaming - Tathagata Das

Stateless vs. Stateful | Flutter Basics

Deep Dive into Stateful Stream Processing in Structured Streaming with Tathagata Das Continued

Stateful vs Stateless Firewalls - You NEED to know the difference

A Deep Dive into Stateful Stream Processing in Structured Streaming 2018 Part 2 (Tathagata Das)

Deep Dive in Stateful Widget | Stateful Widget in Flutter | Lifecycle of Stateful Widget in Flutter📱...

What Is Stateful IPS? - SecurityFirstCorp.com

Google SWE teaches systems design | EP40: Flink in 15 Minutes, Stateful Stream Processing!

Stateful vs Stateless Protocol Explained (with Examples, Advantages & Disadvantages)

Stateful NFTs - A Technical Deep Dive

Understand Stateful and Stateless protocols in 10 Minutes

(25) Spark Streaming : Stateless Vs Stateful operations Explained

Stateful Functions with Stephan Ewen | Whiteboard Walkthrough

How Stateful Hot Reloading of Flutter(or Dart) works? #Explained

Stateful vs Stateless Architecture | System Design

ElasticON EMEA: From Stateful to Stateless — and why it matters

🔒 Stateless vs Stateful Firewall 🔥 | Explained Simply | Palo Alto Deep Dive

Stateful vs Stateless Firewalls Explained: A Deep Dive with Ashok Sharma

State is Hard: An SDK for Building Stateful Applications

🔒 Deep Dive: AWS Security Groups Explained - Mastering Stateful Firewalls

A deep dive into Flink SQL - Jark Wu, Kurt Young

How Do Stateless Firewalls Compare to Stateful Firewalls in Terms of OSI Layers?