EuroSciPy 2023 - Keynote: Polars

Показать описание

Polars is the "relatively" new fast dataframe implementation that redefines what DataFrames are able to do on a single machine, both in regard to performance and dataset size.
In this talk, we will dive into polars and see what makes them so efficient. It will touch on technologies like Arrow, Rust, parallelism, data structures, query optimization and more.

EuroSciPy

Рекомендации по теме

Комментарии

I have no knowledge or time to do benchmarking but, I was using pandas' "append" to combine about 8000 CSV files (about 10 GB in total) and it was taking almost an hour and a half, i decided to try polars, according to stack overflow i could use, concat, vstack, or extend, i randomly chose "vstack", and it did the same workload in less than 1 minute, same computer, same python version, same everything, all i had to do was modify the script a little bit, for example remove "index = False" when exporting the resulting (huge) dataframe to CSV.

iutubtivi

The API is very similar to lpyspark. In fact I don't think it would be a hassle to convert existing pipelines to polars.

Molox

EuroSciPy 2023 - Keynote: Polars

EuroSciPy 2023 - Keynote: Polars

EuroSciPy 2023 - DataFrame-agnostic code: are we there yet?

EuroSciPy 2023 - Accelerating your Python code - a systematic overview

EuroSciPy 2023 - Ibis: A fast, flexible, and portable tool for data analytics

What polars does for you — Ritchie Vink

Seamless Value Selection in Python with Polars: Mastering DataFrame Operations

Ritchie Vink Polars; done the fast, now the scale PyCon 2023

Polars DataFrame

Effortless DataFrame Calculations with Polars: Unleash Data Insights

Nico Kreiling: Raised by Pandas, striving for more: An opinionated introduction to Polars

Polars: The main alternative to pandas in Python!?

Learning Polars for Data Analysis? Start Here!

Polars vs Pandas | detailed test with explained results

Similarity Search w/ Polars

Thomas Bierhance: Polars - make the switch to lightning-fast dataframes

Tutorials - Matt Harrison: Getting Started with Polars

how to update mass data using Polars DataFrame

Juan Luis- Expressive and fast dataframes in Python with polars | PyData NYC 2022

Is the great dataframe showdown finally over? Enter: polars - Luca Baggi

Introduction to Polars: A Python Library for Data Analysis and Visualization

Polars: A highly optimized dataframe library | Matt Harrison | Conf42 Machine Learning 2023

Polars vs Pandas - what's the difference? — Cheuk Ting Ho

Polars: The Super Fast Dataframe Library for Python ... bye bye Pandas?

Polars: Working with Data Larger than RAM memory