Delta Lake: Reliability and Data Quality for Data Lakes and Apache Spark by Michael Armbrust

preview_player
Показать описание
Delta Lake is an open source storage layer that brings reliability to data lakes. Delta Lake offers ACID transactions, scalable metadata handling, and unifies streaming and batch data processing. It runs on top of your existing data lake and is fully compatible with Apache Spark APIs.

#BIGTH19 #BigData #MachineLearning

Session presented at Big Things Conference 2019 by Michael Armbrust, Principal Engineer at Databricks

20th November 2019
Kinépolis, Madrid

Рекомендации по теме