filmov
tv
pg_duckdb: Postgres analytics just got faster with DuckDB
Показать описание
Postgres analytics 10x faster with just an extension?! 🤯
In August, we announced pg_duckdb, a PostgreSQL extension that integrates DuckDB's analytics engine directly into Postgres. It's open-source and represents a joint partnership between Hydra and MotherDuck.
Two months later, we are happy to announce its first release and highlight many features, including the ability to read and write over object storage with Parquet and CSV, as well as Apache Iceberg (currently read-only), and the capability to query from MotherDuck without leaving Postgres.
Note :
We ingested the TPC-DS datasets into PostgreSQL without indexes for two main reasons:
2. While indexes are common in real-world PostgreSQL scenarios, optimizing them for specific analytic queries can be complicated and bring extra overhead. Considering this, we believe there is value in looking at the performance of queries without any indexes.
📓 Resources
➡️ Follow Us
0:00 Intro
1:33 Postgres extension ecosystem
2:35 Getting started with pg_duckdb
6:20 Query data lake / lakehouse
8:54 Scaling to the cloud with MotherDuck
13:37 Moving forward
In August, we announced pg_duckdb, a PostgreSQL extension that integrates DuckDB's analytics engine directly into Postgres. It's open-source and represents a joint partnership between Hydra and MotherDuck.
Two months later, we are happy to announce its first release and highlight many features, including the ability to read and write over object storage with Parquet and CSV, as well as Apache Iceberg (currently read-only), and the capability to query from MotherDuck without leaving Postgres.
Note :
We ingested the TPC-DS datasets into PostgreSQL without indexes for two main reasons:
2. While indexes are common in real-world PostgreSQL scenarios, optimizing them for specific analytic queries can be complicated and bring extra overhead. Considering this, we believe there is value in looking at the performance of queries without any indexes.
📓 Resources
➡️ Follow Us
0:00 Intro
1:33 Postgres extension ecosystem
2:35 Getting started with pg_duckdb
6:20 Query data lake / lakehouse
8:54 Scaling to the cloud with MotherDuck
13:37 Moving forward
pg_duckdb: Postgres analytics just got faster with DuckDB
Using DuckDB to analyze the data quality of Apache Parquet files
PostgreSQL in 100 Seconds
DuckDB and SQL - for Data Analysis and Processing
Data Engineering with DuckDb Tutorial | PySpark | SQL | Postgres | Python | ETL Data processing
In-Process Analytical Data Management with DuckDB - posit::conf(2023)
DuckDB Tutorial - DuckDB course for beginners
15 futuristic databases you’ve never heard of
Querying DuckDB with PRQL
8 PostgreSQL Extensions You Need To Know About
Practical Applications for DuckDB (with Simon Aubury & Ned Letcher)
Splicing Elephant & Duck DNA | Scaling Postgres 330
Vector databases are so hot right now. WTF are they?
DuckDB - Overview by Hannes Mühleisen
DuckDB An Embeddable Analytical Database
Implementing Hardware-Friendly Databases (with DuckDB co-creator, Hannes Mühleisen)
DuckDB An Embeddable Analytical Database
Data-intensive PostgreSQL: Three ways to scale | POSETTE 2024
77 Times Faster In Postgres 17 | Scaling Postgres 337
DuckDB and PostGIS: Your geospatial super duo
An introduction to Apache Parquet
DuckDB – Overview and latest developments (DuckCon #5, Seattle, 2024)
Own your business intelligence reports with evidence.dev
Unleashing the Power of DuckDB for Interactive SQL Notebooks
Комментарии