Bulk read multiple CSVs in Python - Pandas, Dask, File I/O, MMAP

preview_player
Показать описание
In this Python Tutorial, We're going through different ways - specifically 5 different ways to bulk read Multiple CSVs in Python. Our solution includes two ways of reading from Pandas, Dask, Python's in-built File I/O and MMAP which is the most efficient. We also look at time taken for each code. All the code snippets are taken from the Twitter thread shared by Abhishek Thakur

This video answers the following questions:
How to import multiple CSVs in Python?
How to Bulk Read CSVs in Python?
Easy and Efficient Solution to read CSVs in Python?

Рекомендации по теме
Комментарии
Автор

I have the same exact problem but with YAML files. I don't believe Pandas can concat yaml files together with "---". Any other solutions come to mind?

aram
Автор

Awsome, Did you ever compare the csv file vs pickle format with Dask ? I may give that a try. Surprised DASK had such an expensive result. This is very Insightful, thank you.

SolidBuildersInc