Be kind to yourself! Spend (less) time on Data Exploration - Willem Hendriks | PyData Eindhoven 2021

Is data exploration the most important step in a data project? It is odd that this crucial phase almost never gets the credit it deserves.
I will make an attempt here, a little ode to the exploration phase, and try to make it cool again by using some modern packages. Any tool that helps us be effective increases the chance that we find that gem during data exploration. I believe any insights gained during the data exploration phase pay back at least double later on in the journey.

Willem Hendriks: Studied mathematics and has been working with data since graduation, from small statistical sets to big data tooling. Currently at Big Data Republic.

PyData Eindhoven 2021

===

PyData is an educational program of NumFOCUS, a 501(c)(3) non-profit organization in the United States. PyData provides a forum for the international community of users and developers of data analysis tools to share ideas and learn from each other. The global PyData network promotes discussion of best practices, new approaches, and emerging technologies for data management, processing, analytics, and visualization. PyData communities approach data science using many languages, including (but not limited to) Python, Julia, and R.

PyData conferences aim to be accessible and community-driven, with novice to advanced level presentations. PyData tutorials and talks bring attendees the latest project features along with cutting-edge use cases.

00:00 Welcome!
00:08 Introduction
02:05 What is Data Exploration?
02:26 "You have to get to know your data!"
11:50 Pandas Profiling
14:07 SweetVIZ
15:59 DABL
17:58 dTreeViz
19:56 dtale
22:35 Conclusions
25:18 Q&A
