Real-World Python Machine Learning Tutorial w/ Scikit Learn (sklearn basics, NLP, classifiers, etc)

preview_player
Показать описание
Practice your Python Pandas data science skills with problems on StrataScratch!

In this video we walk through a real world python machine learning project using the sci-kit learn library. In it we work our way to building a model that automatically classifies text as either having a positive or negative sentiment. We do this by using amazon reviews as our training data. Full video timeline in the comments!

Link to Code & Data:

Raw Data download:

Sci-kit learn documentation:

Make sure you have sci-kit learn downloaded! To do this either run "pip install sklearn" or use python through Anaconda.

Join the Python Army to get access to perks!

---------------------------
Follow me on social media!

To get one of the cool shirts I was wearing:

---------------------------

Video outline!
0:00 - What we will be doing!
3:40 - Sci-Kit Learn Overview
6:38 - How do we find training data?
9:33 - Download data
11:45 - Load our data into Jupyter Notebook
16:38 - Cleaning our code a bit (building data class)
20:13 - Using Enums
22:50 - Converting text to numerical vectors, bag of words (BOW) explanation
25:45 - Training/Test Split (make sure to "pip install sklearn" !)
33:45 - Bag of words in sklearn (CountVectorizer)
40:05 - fit_transform, fit, transform methods
42:05 - Model Selection (SVM, Decision Tree, Naive Bayes, Logistic Regression) & Classification
47:50 - predict method
56:58 - F1 score
1:01:01 - Improving our model (evenly distributing positive & negative examples and loading in more data)
1:20:36 - Let's see our model in action! (qualitative testing)
1:22:24 - Tfidf Vectorizer
1:25:40 - GridSearchCv to automatically find the best parameters
1:31:30 - Further NLP improvement opportunities
1:32:50 - Saving our model (Pickle) and reloading it later
1:36:37 - Category Classifier
1:39:14 - Confusion Matrix

---------------------

*I use affiliate links on the products that I recommend. I may earn a purchase commission or a referral bonus from the usage of these links.
Рекомендации по теме
Комментарии
Автор

Video outline!
0:20 - What we will be doing!
3:40 - Sci-Kit Learn Overview
6:38 - How do we find training data?
9:33 - Download data
11:45 - Load our data into Jupyter Notebook
16:38 - Cleaning our code a bit (building data class)
20:13 - Using Enums
22:50 - Converting text to numerical vectors, bag of words (BOW) explanation
25:45 - Training/Test Split (make sure to "pip install sklearn" !)
33:45 - Bag of words in sklearn (CountVectorizer)
40:05 - fit_transform, fit, transform methods
42:05 - Model Selection (SVM, Decision Tree, Naive Bayes, Logistic Regression) & Classification
47:50 - predict method
53:35 - Analysis & Evaluation (using clf.score() method)
56:58 - F1 score
1:01:01 - Improving our model (evenly distributing positive & negative examples and loading in more data)
1:20:36 - Let's see our model in action! (qualitative testing)
1:22:24 - Tfidf Vectorizer
1:25:40 - GridSearchCv to automatically find the best parameters
1:31:30 - Further NLP improvement opportunities
1:32:50 - Saving our model (Pickle) and reloading it later
1:36:37 - Category Classifier
1:39:14 - Confusion Matrix


Thank you for watching! Make sure to like & subscribe if you enjoyed :)

KeithGalli
Автор

you're the reason that I've got an internship in a great company :) well.. I'm broke now :D but when I earn tons of money( I hope we all do :D ) I'll donate you Keith !

mucahitugurlu
Автор

Keith, this is incredibly helpful. Your teaching style is to be commended. I look forward to more like this for ML.

Locke
Автор

This video is super helpful! I have struggled in making my model using sklearning for several days and you just make my day! Thanks!

alexq
Автор

You are so good, explaining the hardest things in common language and makes it easy to understand to even my grandma.... Thanks so much for making this simple!

hollmanbaez
Автор

This is by far the most useful tutorial that I have ever seen. You are an amazing teacher.

mohitkishore
Автор

He not only teaches the good stuff but also teach how to google things and get the job done.


Keep going brother!. You are Awesome.

ManishSharma-xqbe
Автор

Wow Keith, you're an absolute legend! I can't wait to get through your other videos and see your future work :D

aligh
Автор

Please keep uploading you're one of the best tutorial channels.

BennyHarassi
Автор

I like it when you showed us how you would use online resources, all the Googling and documentation stuff, so that we are not afraid to actually go online ourselves and explore more new functions :) Thanks Keith!! Stay healthy! :)

jenn
Автор

i always am being directed back and stay at Keith's video... just awesome...

Max-myrk
Автор

This one is just one heck of tutorial. Thanks a ton Keith. I am a Java Architect with 17 years of extensive experience, looking to shift to ML/Data Science. It took me 3 hours to cover this video. I must say first one hour was realy easy to follow but probably you covered a lot of things in the last 40 minutes.

saptarshisanyal
Автор

I was waiting for this! You sir, are a legend

FraserMyersMusic
Автор

Phew, finally finished watching this one:) A lot to take in, but super helpful and interesting! Thanks, Keith! :) Gonna start your real-world task with Pandas tomorrow!

jenn
Автор

thanks man, i'm watching your whole data science video series and you are awesome!

asafrozali
Автор

I have implemented my first ml model with the help of you please upload more content you are amazing well done !

merkol
Автор

Dude you are an excellent educator, thank you so much for this well structured, well explained video!!

ninjaduck
Автор

Thank you for this video! This saved me so much time digging through documentation to try to understand how to implement these libraries!

johnhutchinson
Автор

Really appreciate your efforts. I did not understand from my class teacher anything. Keith taught it very nicely. Thanks a lot

somshridhar
Автор

Another great video. Really appreciate minimal slides paired with the 'live' coding feel.

gannoncondon