Data modeling, the secret sauce of building & managing a large scale data warehouse | Citus Con 2022

Video of a talk delivered by Min Wei at Citus Con: An Event for Postgres. Abstract: You are tasked with building a large-scale data warehouse. After much reading and listening, you pick the best tech and load it up with data. Fun and profit from now on? In this talk, I will reflect on my journey with VeniceDB and on how to scale a data system over the years from a data modeling perspective, using PostgreSQL and Citus on Azure. I will cover the types of data and various query design techniques used to meet the business requirements of the Windows Telemetry team at Microsoft.

Min Wei is a Principal Software Engineer at Microsoft working on the Windows Telemetry Metrics Store team. Min has spent most of his career in the data space, including Exchange content management, Hadoop/Hive systems, Postgres, and ClickHouse. For the past 5 years he has been passionate about building and operating very large-scale data warehouses on PostgreSQL.

► Video bookmarks:
⏩ 00:00 Introduction
⏩ 01:19 Windows telemetry data challenge
⏩ 04:13 Data schema of Venice DB
⏩ 06:28 Analytical queries at petabyte scale
⏩ 16:40 Venice DB data storage & organization
⏩ 20:30 Why Citus?

#CitusCon #PostgreSQL #Citus