Philly ETE 2016 #8 - Demystifying Stream Processing with Apache Kafka - Ewen Cheslack-Postava

The concept of stream processing has been around for a while and most software systems operate as simple stream processors at their core: they read data in, process it, and maybe emit some data out. So why are there so many stream processing frameworks, all with different terminology, and why does it seem so complex to get up and running? What benefits does each stream processing system provide, and more importantly, what are they missing?

This presentation will start by abstracting away the individual frameworks and describe the key features and benefits that stream processing frameworks provide. These core features include scalability and parallelism through data partitioning, fault tolerance and event processing order guarantees, support for stateful stream processing, and handy stream processing primitives such as windowing. Understanding these features will enable you to map practical data problems to stream processing, write applications that process streams of data at scale, and understand how the different frameworks fit into the stream processing framework design space.
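As a rough illustration of two of the features named above (data partitioning and windowing over local state), here is a minimal, framework-independent sketch. The function names `partition_for` and `tumbling_window_counts`, the sample events, the CRC32 hash, and the 60-second window are all assumptions made for this example; they are not taken from the talk or from any particular framework.

```python
from collections import defaultdict
import zlib


def partition_for(key: str, num_partitions: int) -> int:
    """Map a key to a partition with a stable hash, so equal keys always land together."""
    return zlib.crc32(key.encode("utf-8")) % num_partitions


def tumbling_window_counts(events, window_size_ms: int):
    """Count events per (key, window start). The dict is the processor's local state."""
    counts = defaultdict(int)
    for key, timestamp_ms in events:
        # Align the timestamp down to the start of its fixed-size window.
        window_start = timestamp_ms - (timestamp_ms % window_size_ms)
        counts[(key, window_start)] += 1
    return dict(counts)


# Hypothetical (key, timestamp in ms) events.
events = [("alice", 1_000), ("bob", 1_500), ("alice", 4_000), ("alice", 61_000)]

# Route each event to a partition by key; per-key order is preserved within a partition.
partitions = defaultdict(list)
for key, ts in events:
    partitions[partition_for(key, 4)].append((key, ts))

# Each partition can be processed independently, for example by a separate worker.
results = {p: tumbling_window_counts(evts, 60_000) for p, evts in partitions.items()}
```

Because all events for a given key land in the same partition, the per-partition results can be merged without any cross-worker coordination, which is what makes this style of processing straightforward to parallelize.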

Next, we’ll describe Kafka’s new stream processing library, Kafka Streams, and the design decisions and tradeoffs it makes. Kafka Streams represents a new design point in the stream processing space. Where most frameworks provide a service for running stream processing applications, Kafka Streams emphasizes low-overhead development that feels more like developing any other application. This trades off the benefits of a centrally managed stream processing infrastructure for an easier adoption path and easy integration with your existing deployment tooling. Kafka Streams is also designed to work solely with Kafka. This limits its use to data that is already in Kafka and requires additional tools to import and export data from other systems, but it allows Kafka Streams to leverage unique Kafka features such as consumer groups to keep implementation complexity low and get scalability and fault tolerance nearly for free. Combined, these decisions represent a new design point for stream processing applications that we believe addresses use cases not well served by today’s popular frameworks.
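To make the consumer-group idea more concrete, the following toy model shows how spreading partitions over whichever application instances are alive yields both scaling and failover. It is only a sketch: `assign_partitions` and the instance names are hypothetical, and the round-robin rule is a simplification, not Kafka's actual assignment protocol.

```python
def assign_partitions(partitions, instances):
    """Spread partitions round-robin across the sorted live instances (toy model)."""
    live = sorted(instances)
    if not live:
        raise ValueError("at least one live instance is required")
    assignment = {name: [] for name in live}
    for i, partition in enumerate(sorted(partitions)):
        assignment[live[i % len(live)]].append(partition)
    return assignment


partitions = range(6)

# Three instances of the same application share the six partitions.
before = assign_partitions(partitions, ["app-1", "app-2", "app-3"])

# If app-2 crashes, its partitions are redistributed to the survivors.
after = assign_partitions(partitions, ["app-1", "app-3"])
```

In this model, adding an instance reduces the number of partitions each one handles, and losing an instance hands its partitions to the others, without a separate cluster manager deciding where work runs.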
