Apache Hudi

preview_player
Показать описание
Hudi is a rich platform to build streaming data lakes with incremental data pipelines
on a self-managing database layer, while being optimized for lake engines and regular batch processing.

credit : Vinoth chandar
Рекомендации по теме
Комментарии
Автор

At 17:18, Read optimized table is showing latest data, but ideally rt table shows realtime data and ro table can have some delays.

AbhinavTyagi-gw
Автор

At 3:29, did you want to say the objects in S3 or hdfs are “immutable” ?

prasadBoyane
Автор

At 17:05 to 17:07 you say there are no changes to file 2 whereas I can see that E changes to E’ and A’ to A””. Can you please clarify?

dwivedys