Demystifying Delta Lake. Data Brew | Season 1 Episode 3

Delta Lake is an open source storage layer that brings reliability to data lakes. It provides ACID transactions and scalable metadata handling, and it unifies streaming and batch data processing. It runs on top of your existing data lake and is fully compatible with Apache Spark APIs. For our “Demystifying Delta Lake” session, we interview Michael Armbrust, a committer and PMC member of Apache Spark™ and the original creator of Spark SQL. He currently leads the team at Databricks that designed and built Structured Streaming and Delta Lake.

Data Brew is a new video / podcast series where we explore and debate the evolution of Data + AI. No hype, no spin, just a straight shot of strong opinions from some really smart people.

In this first season, we focus on lakehouses, which combine the key features of data warehouses, such as ACID transactions, with the scalability of data lakes, directly against low-cost object stores.

More on the vidcast here: