Machine Learning with Python video 8 :How to split the dataset into Test and Train

preview_player
Показать описание
In this video i will tell you how you can split your database into two sections that is test and train ,i have also explained what are dependent and independent variable we will be using sklearn's train_test_split package to split our dataset
* train data : it does what the name suggests it trains our machine learning algorithm
* test data : after the model is trained test data is used to test its efficiency


related video title :
Future of AI/ML | Rise Of Artificial Intelligence & Machine Learning | AI and ML Training | Edureka
Machine Learning Tutorial Python - 7: Training and Testing Data
Machine Learning with Python | Part 4 | Creating Train & Test Dataset & evaluating model with MSE
5 Python Projects for Beginners
Python Projects For Beginners | Python Projects Examples | Python Tutorial | Edureka
15 Python Projects in Under 15 Minutes (Code Included)
Python Tutorial for Beginners [Full Course] 2019
Create your own Chatbot using Python #1
Python Tutorial For Beginners | Python Full Course From Scratch | Python Programming | Edureka
Python Tutorials for Beginners - Learn Python Online
Data Science In 5 Minutes | Data Science For Beginners | What Is Data Science? | Simplilearn
What Can You Do with Python? - The 3 Main Applications

tags : #test_train_split #sklearn #i_know_python
Рекомендации по теме
Комментарии
Автор

I APPRECIATE YOUR WORK AND ONLY SOME PEOPLE ARE MAKING THIS QUARANTINE PRODUCTIVE AND ARE GRINDING SO SHOUTOUT TO ALL.THANX FOR VIDEOS.

ANAS-exru
Автор

I find the iloc code too restrictive, especially if you adjust the data later in the game. Instead I prefer to use code like:

column_names= df.columns
features = column_names[1:]
label = column_names[0]
display(features, label)

so at any point I can ask/query/return df[label] or df[features] without grid numerical references. This way for future iterations I can add or drop a column and only this section needs to be updated.

dependent = df[label].values
independent = df[features].values

cboyda
Автор

Doing god's work providing such knowledge for free♥️🔥

vnmhckng
Автор

sir for splitting the data set where have to store this data set sir. nice video and easy to understand. give your mail id sir

abiramit
Автор

What causes the error found input variables inconsistent with samples [7, 5000]

katersam
Автор

If i have 1000 rows in dataset. Then how can select first 200 rows for testing and last 800 rows for training instead of select randomly in splitting?

hasnain-khan
Автор

How to split image dataset with xtrain n ytrain

PadminiMansingh
Автор

What's random_state exactly for? Why it's equal to 5

rakeshkumarranjan
join shbcf.ru