Common Strategies for Improving Performance on Your Delta Lakehouse

preview_player
Показать описание
The Delta Architecture pattern has made the lives of data engineers much simpler, but what about improving query performance for data analysts? What are some common places to look at for tuning query performance? In this session we will cover some common techniques to apply to our delta tables to make them perform better for data analysts queries. We will look at a few examples of how you can analyze a query, and determine what to focus on to deliver better performance results.

About:
Databricks provides a unified data analytics platform, powered by Apache Spark™, that accelerates innovation by unifying data science, engineering and business.

Connect with us:
Рекомендации по теме
Комментарии
Автор

thanks for this brief but focused walk through the topic 🙂

bernardputersznit
Автор

Thanks Franco for the awesome insights and performance tips

dhruvsingh
Автор

Hi, how would you optimize for SELECT DISTINCT queries?

PilotMusicc
Автор

On 28:24 mark, should it be milliseconds instead of nanoseconds?

toniolora
Автор

Your analysts would have to be using the same cluster that you etl job is running to leverage the delta cache from those jobs, correct?

gardnmi