pg_duckdb: Postgres analytics just got faster with DuckDB

Показать описание

Postgres analytics 10x faster with just an extension?! 🤯

In August, we announced pg_duckdb, a PostgreSQL extension that integrates DuckDB's analytics engine directly into Postgres. It's open-source and represents a joint partnership between Hydra and MotherDuck.

Two months later, we are happy to announce its first release and highlight many features, including the ability to read and write over object storage with Parquet and CSV, as well as Apache Iceberg (currently read-only), and the capability to query from MotherDuck without leaving Postgres.

Note :
We ingested the TPC-DS datasets into PostgreSQL without indexes for two main reasons:
2. While indexes are common in real-world PostgreSQL scenarios, optimizing them for specific analytic queries can be complicated and bring extra overhead. Considering this, we believe there is value in looking at the performance of queries without any indexes.

📓 Resources

➡️ Follow Us

0:00 Intro
1:33 Postgres extension ecosystem
2:35 Getting started with pg_duckdb
6:20 Query data lake / lakehouse
8:54 Scaling to the cloud with MotherDuck
13:37 Moving forward

MotherDuck

Рекомендации по теме

Комментарии

We realized we hadn’t explained clearly why we didn't add indexes when using the TPC-DS datasets in PostgreSQL. So, here's why:
2. While indexes are common in real-world PostgreSQL scenarios, optimizing them for specific analytic queries can be complicated and bring extra overhead. Considering this, we believe there is value in looking at the performance of queries without any indexes.
In the meantime, keep quacking and keep coding... with your favorite elephant and your favorite duck!

motherduckdb