Data Science with Python! Analyzing File Types from Avro to Stata

A tutorial comparing file types using Python and pandas. Learn the differences between text and binary files, then compare read/write times and on-disk sizes across file types.
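As a rough illustration of the kind of comparison described above, the sketch below times writing and reading a DataFrame in three formats that pandas supports without extra packages (CSV, JSON, pickle) and records each file's size. It is not the notebook from the video: the helper name `compare_formats`, the column names, and the random data are assumptions made for this example.

```python
import os
import tempfile
import time

import numpy as np
import pandas as pd


def compare_formats(df, directory):
    """Time write/read and record on-disk size for a few pandas formats.

    Hypothetical helper for illustration; not taken from the video's notebook.
    """
    # extension -> (writer, reader); all three work with a plain pandas install
    formats = {
        "csv": (lambda d, p: d.to_csv(p, index=False), pd.read_csv),
        "json": (lambda d, p: d.to_json(p), pd.read_json),
        "pkl": (lambda d, p: d.to_pickle(p), pd.read_pickle),
    }
    results = {}
    for ext, (write, read) in formats.items():
        path = os.path.join(directory, f"data.{ext}")

        start = time.perf_counter()
        write(df, path)
        write_s = time.perf_counter() - start

        start = time.perf_counter()
        loaded = read(path)
        read_s = time.perf_counter() - start

        results[ext] = {
            "write_s": write_s,
            "read_s": read_s,
            "bytes": os.path.getsize(path),
            "rows": len(loaded),
        }
    return results


# Made-up data standing in for the stock prices used in the video.
rng = np.random.default_rng(0)
df = pd.DataFrame({
    "open": rng.random(10_000),
    "close": rng.random(10_000),
    "volume": rng.integers(0, 1_000_000, 10_000),
})

with tempfile.TemporaryDirectory() as tmp:
    results = compare_formats(df, tmp)

for ext, stats in results.items():
    print(f"{ext}: write {stats['write_s']:.4f}s, "
          f"read {stats['read_s']:.4f}s, {stats['bytes']:,} bytes")
```

A single run like this is noisy, so for real measurements it would be worth repeating each write and read several times. Formats such as Parquet, Feather, Avro, or Stata could be added to the `formats` dictionary the same way, provided their supporting packages are installed.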

The notebook can be found in the "Data Science with Python" folder of the repo below.


|-Video Chapters-|
0:00 - Intro
0:04 - Installing required packages
0:47 - Overview of the file types
2:44 - Text vs binary files
7:39 - Corrupted text vs binary files
10:41 - Reading stock data into a pandas DataFrame
11:22 - Timing the write times of different file types
19:39 - Timing the read times of different file types
23:59 - Analyzing the size of different file types
28:24 - Timing the write/read times as the number of rows grows
35:11 - Timing the write/read times as the number of columns grows
37:26 - Issues with writing to SAS
38:59 - References and additional learning
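For the "Text vs binary files" chapter, a minimal standard-library sketch of the idea: the same number can be stored as human-readable characters or as a fixed-width binary encoding. The value and variable names are assumptions chosen for illustration, not taken from the video.

```python
import struct

value = 1234567.891

# Text: the digits are stored as characters, one byte each in UTF-8.
text_bytes = repr(value).encode("utf-8")

# Binary: the value is stored as an 8-byte little-endian IEEE 754 double.
binary_bytes = struct.pack("<d", value)

print(text_bytes)                          # readable in any text editor
print(binary_bytes.hex())                  # opaque without knowing the layout
print(len(text_bytes), len(binary_bytes))  # 11 bytes as text, 8 as binary

# Both representations decode back to the same float.
assert float(text_bytes.decode("utf-8")) == value
assert struct.unpack("<d", binary_bytes)[0] == value
```

The text form can be inspected and edited by hand, while the binary form has a fixed size per value regardless of how many digits it takes to print.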
Comments

Really amazing stuff here and exactly what I was looking for when I searched "file types data science". Thanks for the presentation and your hard work!

TheEighthAct