KSQL Tutorial For Beginners : Streaming SQL for Apache Kafka

preview_player
Показать описание
This KSQL Tutorial For Beginners: Streaming SQL for Apache Kafka Video Explain:
1. All the fundamentals of KSQL
2.Create Streams
3.Tables
4.Generate Data with KSQL-datagen
5.Advance Operation: Joins, Windowing, Aggregation, Geospatial
6. Deal with all kind of Data Format like CSV, JSON, Avro.
7.Create and Use UDF, UDAF
8.Moving KSQL to production with tips

What is KSQL?
StreamSQL is a query language that extends SQL with the ability to process real-time data streams. SQL is primarily intended for manipulating relations, which are finite bags of tuples. StreamSQL adds the ability to manipulate streams, which are infinite sequences of tuples that are not all available at the same time

What Is KSQL? KSQL is the streaming SQL engine for Apache Kafka®. It provides an easy-to-use yet powerful interactive SQL interface for stream processing on Kafka, without the need to write code in a programming language such as Java or Python. KSQL is scalable, elastic, fault-tolerant, and real-time.
Рекомендации по теме
Комментарии
Автор

Hi,
Really, it is crystal clear about KSQL internals. I appreciate your fantastic effort. However, I have few questions as
1) Does KSQL has feature to create partition, index, bucket etc like Hive to process the huge data? For example 300 GB
data is pushed to Kafka per day. is KSQL well enough to handle 1 week of data(2100GB) in this case?
2) Please also let me know the broker's configuration, no. of brokers is needed in this situation.
3) In this case, should KSQL run on different machine or clusters. Currently, I have following servers
a) 1 zookeeper server
b) 3 brokers for Kafka cluster
c) 1 server for Kafka Connect cluster.

Waiting for your kind response.

mdshabbirhassan
Автор

its simple and clear explanation ....its very good about ksql
I have question here - ksql not present in apache kafka ?

sureshvkarri