Understanding that Kafka Topic Partitions Still Drive Parallelism in Faust

Показать описание

Let's explore whether Kafka Topic Partitions Still Drive Parallelism in Faust or not ..

Prerequisite:
--------------------------
Kafka Topic Partitions & Producers using Python
Parallel Processing in Kafka Consumer

Reference Documentation:
-----------------------------------------------

Code:
--------------
Start Zookeeper:
------------------------------

Start Broker:
------------------------------

Start Kafka Topic:
------------------------------

Parallel Processing:
----------------------
from time import sleep
from json import dumps
from kafka import KafkaProducer

def custom_partitioner(key, all_partitions, available):
"""
Customer Kafka partitioner to get the partition corresponding to key
:param key: partitioning key
:param all_partitions: list of all partitions sorted by partition ID
:param available: list of available partitions in no particular order
:return: one of the values from all_partitions or available
"""
print("The key is : {}".format(key))
print("All partitions : {}".format(all_partitions))

producer = KafkaProducer(bootstrap_servers=['localhost:9092'],partitioner=custom_partitioner)
topic_name='hello_world5'

for e in range(1000):
data = {'number' : e}
print(data)
sleep(0.5)

Consumer:
--------------
import faust

app=faust.App('demo-streaming',broker='localhost:9092')

async def processor(stream):
async for message in stream:
print(f'Received {message}')

Start instances:
------------------
faust -A main worker -l info
faust -A main worker -l info --web-port=6067

Check this playlist for more Data Engineering related videos:

Apache Kafka form scratch

Snowflake Complete Course from scratch with End-to-End Project with in-depth explanation--

🙏🙏🙏🙏🙏🙏🙏🙏
YOU JUST NEED TO DO
3 THINGS to support my channel
LIKE
SHARE
&
SUBSCRIBE
TO MY YOUTUBE CHANNEL

Рекомендации по теме

Комментарии

If you run only one worker, but set the concurrency to 2 for faust agent and make it listen to a topic having two partitions, will the application run two concurrent instances of listener with each instance ingesting from one partition ?

aks

Understanding that Kafka Topic Partitions Still Drive Parallelism in Faust

Apache Kafka 101: Partitioning (2023)

Topics, Partitions and Offsets: Apache Kafka Tutorial #2

Understanding that Kafka Topic Partitions Still Drive Parallelism in Faust

Apache Kafka in 6 minutes

Kafka Topics, Partitions and Offsets Explained

What is Kafka Topic Partitions? | Apache Kafka Tutorial | Kafka Tutorial for Beginners

Part 6 - What is Kafka topic partition | Kafka for beginners

Apache Kafka - Brokers & Topic Replication Explanation

3. Apache Kafka Fundamentals | Apache Kafka Fundamentals

Kafka Topics and Partitions

5.2 Complete Kafka Training - What are Kafka Partitions [ Explained ]

Apache Kafka 101: Partitioning (Hands On - 2023)

Kafka Tutorial - Core Concepts

Introduction to Apache Kafka (understanding Topics and Partitions)

Apache Kafka 101: Consumers (2023)

Beginners guide to balance your data across Apache Kafka partitions By Olena Kutsenko

What is Kafka topic? | Apache Kafka Tutorial | Kafka Tutorial for Beginners

'In the Land of the Sizing, the One-Partition Kafka Topic is King' by Ricardo Ferreira

Strategies for Kafka Topic Partitioning when key=null

Kafka Topic Partitions Explained

What is a Kafka Consumer and How does it work?

Apache Kafka Architecture

How to choose the No. of Partitions for a Kafka Topic? Horizontal Scaling for Kafka Consumer

Understanding Kafka partition assignment strategies with in-depth intuition & Practical using Py...