Polars or Pandas -- Which is Faster?

preview_player
Показать описание
Sat down with Chad to visualize the performance difference between Polars and Pandas. You'll be surprised.

Charming Data:

Chad's LinkedIn:

Live App:
(Because it is hosted for free, the app might take a minute to turn on and load)

GitHub code:

Article:

Video Layout:
00:00 - Introduction
01:39 - Dash app with Polars
02:41 - Cheat sheet
03:51 - Speed comparison

************************************************************************
👉 Your support keeps Charming Data running, which is proudly a 100% member-supported educational channel:
Рекомендации по теме
Комментарии
Автор

a trick also to make plotly graph faster when using polars is calculate in polars before putting into plotly.express. For example, instead of using px.histogram(df, ...). you can" val_counts = df.value_counts()
fig = px.bar(
x=val_counts[target],
y=val_counts["count"],
text_auto=True,
)"
this significantly decrease the plot generation time as well for large dataset.

PengZhang-lsim
Автор

What version of Panda (and polars) were used?

Radioguy
Автор

Thank you for nice tutorial on comparison. May I know where we can see the code for above

karthikb.s.k.
Автор

Everyone will still use pandas because of 3rd-party libraries, the problem is not performance, it's an ecosystem.

ordinarygg