What is Stream Processing? | Batch vs Stream Processing | Data Pipelines | Real-Time Data Processing

preview_player
Показать описание
In this tutorial we are going to cover stream processing. What it is and how it differs from Batch Processing. So far, we have been processing data with Batch processing approach. Batch processing is a method of running repetitive, high-volume data jobs in a group on ad-hoc or schedule basis.

Stream processing is the method of synching data from a source to a destination as the transactions are taking place at the source. We process data as it happens in the source. The latency is very low usually in seconds. So this is real-time data processing. In the next session we will start covering Stream Processing using PySpark and Apache Kafka.

#streamprocessing #realtimedata #pyspark

💥Subscribe to our channel:

📌 Links
-----------------------------------------
#️⃣ Follow me on social media! #️⃣

-----------------------------------------

Topics covered in this video:
0:00 - Introduction to Stream Processing
0:13 - Batch Processing
1:34 - Stream Processing
2:55 - Stream Processing Advantages
3:10 - Stream vs Batch Processing Comparison
4:38- Batch Processing Example
4:56 - Stream Processing Example
5:33 - Stream Processing Recap
5:57 - Stream Processing Coming Soon
Рекомендации по теме
Комментарии
Автор

Can’t wait to watch the next video on a real steam processing project. As always, great content

ML_Enthusiast
Автор

Really nice that you are covering now streaming also
Waiting for the Kafka video

Achilles
Автор

Question: what is the con of stream processing?

nikkinic
visit shbcf.ru