How to Remove Outliers From Data (Spotify Song Popularity Prediction) - Data Every Day #127

preview_player
Показать описание
Hi guys, welcome back to Data Every Day!
On today's episode, we are looking at a dataset of Spotify songs from 1921-2020 and trying to predict the popularity of a given song. We will be using a linear regression model to make our predictions, but first we will focus on removing outliers from the data.

Note: I am not affiliated with Spotify in any way.

Here is a link to the Kaggle dataset:

And here is a link to my notebook from the video:

Thanks so much for watching! If you enjoyed today's episode, be sure to subscribe and hit the bell for more content!

See you all tomorrow! :)

----------

Рекомендации по теме
Комментарии
Автор

Please can you do how to extract original data(attributes ) from spotify for popularity prediction?
Thanks for the tutorial!

letaendashew
Автор

Nice vedio but I have a question can we handle outliers in categorical data if yes so how we handle any solution you have ?

adarshjamwal
Автор

How do you think z scores compare in enhancing performance versus compressive scalers like log based? Surely scaler approach is a lot less work.
Your tutorial was instructive but perhaps to much so I now have a severe headache :)

jamespaladin
Автор

Hi,

How can we remove multivariate outliers? please help on this.

guptaaman