How to build stream data pipeline with Apache Kafka and Spark Structured Streaming - PyCon SG 2019

Speaker: Takanori Aoki, Data Scientist, HOOQ

Objective: The main purpose of this session is to familiarize the audience with developing stream data processing applications using Apache Kafka and Spark Structured Streaming, and to encourage them to start experimenting with these technologies.

Description: In the Big Data era, massive amounts of data are generated at high speed by many types of devices. Stream processing technology plays an important role in making such data available to real-time applications. In this talk, Takanori will show how to implement a stream data pipeline, and applications on top of it, using Apache Kafka and Spark Structured Streaming with Python. He will focus on how to develop the applications rather than on system architecture, so that the audience becomes familiar with implementing stream processing in Python. Takanori will walk through example applications that use Tweet data and pseudo-data from mobile devices, and will also explain how to integrate streaming data with other data stores such as Apache Cassandra and Elasticsearch.

Note: The Python code for these applications will be uploaded to GitHub.
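The kind of pipeline the talk describes can be sketched in PySpark as below. This is not the speaker's code (that is on GitHub); it is a minimal illustration assuming a hypothetical Kafka topic `tweets` on `localhost:9092` carrying JSON records with `user` and `text` fields. Running the returned query also requires the spark-sql-kafka connector on the Spark classpath, so pyspark is imported lazily and the query is built but not started.

```python
def build_tweet_stream_query(kafka_servers="localhost:9092", topic="tweets"):
    """Build (but do not start) a Structured Streaming query that reads
    JSON tweet events from a Kafka topic and counts tweets per user.

    The broker address, topic name, and record schema are illustrative
    assumptions, not taken from the talk's actual code.
    """
    # Lazy imports: pyspark (and the Kafka connector) are only needed
    # when the query is actually built and started.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import from_json, col
    from pyspark.sql.types import StructType, StructField, StringType

    spark = SparkSession.builder.appName("tweet-stream").getOrCreate()

    # Schema of the JSON payload carried in each Kafka record's value.
    schema = StructType([
        StructField("user", StringType()),
        StructField("text", StringType()),
    ])

    # Kafka source: each row has binary key/value plus topic metadata.
    raw = (spark.readStream
           .format("kafka")
           .option("kafka.bootstrap.servers", kafka_servers)
           .option("subscribe", topic)
           .load())

    # Kafka delivers bytes; cast the value to a string and parse the JSON.
    tweets = (raw.selectExpr("CAST(value AS STRING) AS json")
              .select(from_json(col("json"), schema).alias("t"))
              .select("t.user", "t.text"))

    # A simple streaming aggregation: running tweet count per user.
    counts = tweets.groupBy("user").count()

    # Console sink for demonstration; to persist results, swap in a
    # Cassandra or Elasticsearch sink as the talk discusses.
    return (counts.writeStream
            .outputMode("complete")
            .format("console"))
```

Starting the returned writer with `.start()` (with a Kafka broker running and the connector package loaded) would begin the streaming job; calling `.awaitTermination()` on the resulting query keeps it running.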

About the speaker:

Produced by Engineers.SG
Comments:

Thx for the presentation. Can I find the source code somewhere?

rezahamzeh

Amazing presentation, how can I run the application?

youssefsassi

Thank you for uploading this and thanks to Takanori for amazing content

onewithsixonewithsix