1 0 EDA databricks (How to do exploratory data analysis in PySpark)

PySpark Exploratory Data Analysis
Demonstration of EDA in Databricks using PySpark & Seaborn
0:00 Introduction
3:11 Demo
30:24 Wrap up
In this demonstration, I will show how to run visualizations in PySpark.
Mostly, this involves converting a PySpark DataFrame to a Pandas DataFrame and then using the seaborn library to plot the data.

Comments

Could plotting only a sample of the data be misleading, since it doesn't show all the data?

M_le-nr