How to use DuckDB: A fast, self-contained analytics database

preview_player
Показать описание
Analytics databases, or column-oriented DBs, speed up analytics jobs on big datasets. But most such DBs are big products with a lot of setup and management overhead. DuckDB bucks that trend: it's a single, self-contained executable and library that brings SQL-query-powered analytics DB functionality to most any programming language, workflow, or data trove. This video shows how you can use DuckDB with Python to work with big, multi-megabyte JSON or CSV datasets, and query the data using conventional SQL syntax.

------------------------------­----
Рекомендации по теме
Комментарии
Автор

I needed this about a decade ago when someone asked me to read in & do stuff with a multigb csv file. I tried powershell's CSV tool and it never came back. They wanted an answer by 5, so I never did find a solution.

pythonlibrarian
Автор

Thank you for the quick intro video. I am wondering if DuckDb supports a lot of the advanced Tableau Calculations as part of query. Any production large scale deployments of this?

neeleshkamkolkar