Exploratory Data Analysis with PySpark using Diabetes Dataset

preview_player
Показать описание
Exploratory Data Analysis refers to the critical process of performing initial investigations on data so as to discover patterns ,to spot anomalies, to test hypothesis and to check assumptions with the help of summary statistics and graphical representations.

The datasets consists of several medical predictor variables and one target variable, Outcome. Predictor variables includes the number of pregnancies the patient has had, their BMI, insulin level, age, and so on.

Рекомендации по теме
Комментарии
Автор

Such quality content! Hope you get a million subscribers!

RaviPrakash-dzfm
Автор

Thank you so much sir...Love from India..

saravanajogan
Автор

could you provide the link of dataset, thanks

LienNguyen-pzhi
Автор

Why it's hard to build ml model in pyspark ?

BlueSkyGoldSun
Автор

diabetes.csv isn't presented in the repo.

jairajsahgal
join shbcf.ru