How You Should Split Your Datasets in Machine Learning

preview_player
Показать описание
Today we learn how to split datasets in machine learning properly using the stratified shuffle split.

◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾◾
📚 Programming Books & Merch 📚

🌐 Social Media & Contact 🌐

Рекомендации по теме
Комментарии
Автор

Excellent video, I didn't know the shuffle split existed. Really appreciate your expertise!

michaelmecham
Автор

Thank you for this awesome video!
I had problems of index using loc so I used iloc instead and now is working like a charm!
Really appreciate.

andreasdesousa
Автор

started learning ml today. Few hours later your video comes out, what a coincidence

scurge
Автор

That was a concise presentation. Nice!

ali-omuv
Автор

Superb. I can say I can now split my datasets in ML.

joguns
Автор

That's a great way! Didn't know about it. Thanks!

philip
Автор

thank you! yay for more data science videos

Caledonia_night
Автор

This is a good video, simple and up to a point.

rahulmore
Автор

Does using cut or qcut matter for situations like this?

jacksun
Автор

Why not use a constant seed while splitting the dataset to make them deterministic?

rahulmore
Автор

Dude do you think the CUT also is usefull for the X's (features)?
or maybe that's just useful when plotting or dashing in POWERbi

MrMadmaggot
Автор

In the application of your data prep, is it true that the time series are, while splitting, are actually broken? Or are the data and test set in time order?

miketan
Автор

Hi, are you just starting out learning Data Science?
And how much did you get the book?

amazing-graceolutomilayo