Demystifying Delta Lake. Data Brew | Season 1 Episode 3

Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake provides ACID transactions and scalable metadata handling, and it unifies streaming and batch data processing. It runs on top of your existing data lake and is fully compatible with Apache Spark APIs. For our “Demystifying Delta Lake” session, we will interview Michael Armbrust, a committer and PMC member of Apache Spark™ and the original creator of Spark SQL. He currently leads the team at Databricks that designed and built Structured Streaming and Delta Lake.

Data Brew is a new video / podcast series where we explore and debate the evolution of Data + AI. No hype, no spin, just a straight shot of strong opinions from some really smart people.

This is our first season where we will be focusing on lakehouses – combining the key features of data warehouses, such as ACID transactions, with the scalability of data lakes, directly against low-cost object stores.

More on the vidcast here:
Comments

Great interview. This is an unusual and informative perspective on some of the fundamental elements of Parquet files.

FnordFandango

You are doing great. When's the next video coming out? Thumbs Up 👍

ethan-youtubetips