K Nearest Neighbors Application - Practical Machine Learning Tutorial with Python p.14

In the last part we introduced Classification, which is a supervised form of machine learning, and explained the intuition behind the K Nearest Neighbors algorithm. In this tutorial, we'll apply the algorithm to a simple example using Scikit-Learn, and in the subsequent tutorials we'll build our own implementation to learn more about how it works under the hood.
To exemplify classification, we're going to use a Breast Cancer Dataset, which is a dataset donated to the University of California, Irvine (UCI) collection from the University of Wisconsin-Madison. UCI has a large Machine Learning Repository.
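
As a rough sketch of the workflow described above, the snippet below trains Scikit-Learn's `KNeighborsClassifier` and classifies one new sample. It does not load the UCI file: the feature values are synthetic stand-ins generated on the spot, and the labels 2 (benign) and 4 (malignant) are assumed to follow the dataset's class coding. The sample in `example_measures` is the one quoted in the comments.

```python
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.neighbors import KNeighborsClassifier

# Synthetic stand-in for the nine cell measurements (NOT the real UCI data).
rng = np.random.default_rng(0)
benign = rng.integers(1, 4, size=(20, 9))      # low scores, labelled 2
malignant = rng.integers(7, 11, size=(20, 9))  # high scores, labelled 4
X = np.vstack([benign, malignant])
y = np.array([2] * 20 + [4] * 20)

# Hold out 20% of the rows for testing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0
)

clf = KNeighborsClassifier()  # default n_neighbors=5
clf.fit(X_train, y_train)
accuracy = clf.score(X_test, y_test)

# predict() expects a 2D array: one row per sample.
example_measures = np.array([[4, 2, 1, 1, 1, 2, 3, 2, 1]])
prediction = clf.predict(example_measures)
print(accuracy, prediction)
```

The two synthetic groups are deliberately far apart, so this toy run says nothing about the accuracy you would get on the real data.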

Comments

Thank you so much Edward Snowden. 😂 I took 6 Machine Learning courses in uni. First I failed Machine Learning, then failed Natural Language Processing, then failed Bioinformatics, which also included some Machine Learning. The other three courses were Data Mining, Machine Learning (again) and... I can't remember the 6th lol. Not once did I understand code that clearly. Good job.

ahmadjamalmughal

Note that cross_validation is deprecated. Use model_selection, which interestingly gets me a higher accuracy.

josiahls

Hey guys, if you are having the error ValueError: labels ['class'] not contained in axis be sure to save your file as ANSI ENCODING, that was what solved my problems here!

daniloktz

For more recent followers: train_test_split has been moved. To fix, use:
from sklearn.model_selection import train_test_split

barrettbryson
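
If the same script has to run on both old and new Scikit-Learn installs, one possible approach is to try the new import location first and fall back to the old module. This is only a sketch: the fallback branch assumes a release old enough to still ship `sklearn.cross_validation`, and the arrays are placeholder data.

```python
import numpy as np

try:
    # Current location of the helper.
    from sklearn.model_selection import train_test_split
except ImportError:
    # Assumed fallback for old releases that still have this module.
    from sklearn.cross_validation import train_test_split

# Placeholder data: 10 samples with 2 features each.
X = np.arange(20).reshape(10, 2)
y = np.array([2, 4] * 5)

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
print(X_train.shape, X_test.shape)
```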

Hey Mate! I don't know if it works for others, but the deprecation warning is fixed (for me) by simply providing a 2-dimensional list for the np.array declaration. This way you don't have to use reshape.

example_measures = np.array( [ [4, 2, 1, 1, 1, 2, 3, 2, 1] ] )

When you provided a 2nd set of data you didn't need to reshape there either, because you naturally made it 2-dimensional.
But this isn't to say that different versions of everything won't have different effects. For me, if I get the warning in question, the algorithm returns no prediction, whereas yours appeared to still run the prediction properly anyway.

Loving your videos!

nintendo_dringus
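
The shape difference behind that warning can be checked directly with NumPy. The sketch below compares a flat array, the same array after `reshape(1, -1)`, and the nested-list form. The second row in `two_samples` is only an illustrative extra sample.

```python
import numpy as np

# A flat list gives a 1D array: nine values, no notion of "rows".
flat = np.array([4, 2, 1, 1, 1, 2, 3, 2, 1])
print(flat.shape)

# reshape(1, -1) turns it into one row with as many columns as needed.
reshaped = flat.reshape(1, -1)

# An extra pair of brackets builds the same 2D array from the start.
nested = np.array([[4, 2, 1, 1, 1, 2, 3, 2, 1]])
print(reshaped.shape, nested.shape)

# Two samples written as a list of lists are already 2D.
two_samples = np.array([[4, 2, 1, 1, 1, 2, 3, 2, 1],
                        [4, 2, 1, 2, 2, 2, 3, 2, 1]])
print(two_samples.shape)
```

Scikit-Learn estimators expect input of shape (n_samples, n_features), which is why either the reshape or the extra brackets resolves the complaint.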

id, clump_thickness, unif_cell_size, unif_cell_shape, marg_adhesion, single_epith_cell_size, bare_nuclei, bland_chrom, norm_nucleoli, mitoses, class

TheHimanshu

Found out what the reshape was moaning about ... just needed an extra set of square brackets (i.e. no need to reshape)

example_measures = np.array([[4, 2, 1, 1, 1, 2, 3, 2, 1]])
#example_measures = example_measures.reshape(1, -1);

LOVED the video - thank you!

duncancarr

For those who get the errors labels ['class'] not contained in axis or labels ['id'] not contained in axis, remember to insert "id, clump_thickness, unif_cell_size, unif_cell_shape, marg_adhesion, single_epith_cell_size, bare_nuclei, bland_chrom, norm_nucleoli, mitoses, class" as the first line of your data file.

jimiyu

Load data, replace missing values and drop 'id' column in one step: df = pd.read_csv('breast_cancer_wisconsin.csv', header=0).replace('?', -99999).drop('id', axis=1). Method chaining keeps the code short and concise. Great tutorial series by the way :)

stephan
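
A self-contained sketch of that method chain is below. It reads from an in-memory string instead of the real file, the three rows are made up for illustration, and the sentinel `-99999` for missing values is an assumption. The sentinel is written as a string and the column is cast to `int` afterwards, which is an extra step not in the comment, so that the classifier would receive numbers rather than text.

```python
import io
import pandas as pd

# Made-up rows in the same layout as the data file (header line included).
csv_text = """id,clump_thickness,unif_cell_size,unif_cell_shape,marg_adhesion,single_epith_cell_size,bare_nuclei,bland_chrom,norm_nucleoli,mitoses,class
1001,5,1,1,1,2,1,3,1,1,2
1002,8,10,10,8,7,?,9,7,1,4
1003,3,1,1,1,2,2,3,1,1,2
"""

df = (
    pd.read_csv(io.StringIO(csv_text), header=0)
    .replace('?', '-99999')          # mark missing values with a sentinel
    .astype({'bare_nuclei': int})    # the '?' had forced this column to text
    .drop(columns=['id'])            # the id carries no predictive signal
)
print(df.head())
```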

You got me interested in machine learning and now I'm using it for my MA thesis. Thank you.

aryanyekrangi

Hi,

I just did this tutorial. This is awesome !
Your explanation is so clear and straightforward, it makes the entire ML look so easy !

Thanks a lot for this (and all other) tutorials !

Moving on to next video.

Regards
Prasad

gprasas

I don't know if much has changed between the making of this video and when I'm writing this, but it's incredible to see that when I follow along the model predicts with 99% accuracy at times. Kinda crazy.

isaiahtoro

You are a superb teacher and an exceptional programmer. Keep up the good work!!!!

profmo

If anyone has trouble importing cross_validation from sklearn: it has been moved to model_selection, so the split function used here can be imported as: from sklearn.model_selection import train_test_split

arycloud

Can't wait for episode 15! I have to do this exact application of rewriting KNN in Java.

levyroth

Your ML tutorials are the best on the internet 👍

nemesis_rc

If you use "example_measures=np.array([[4, 2, 1, 1, 1, 2, 3, 2, 1], ])", you don't need to reshape the array. What scikit-learn needs in this instance is just a 2D array.

shuhangzhan

U r superb man! Your Tutorials made machine learning a "learn with fun tutorial"... Thanks!

shivambhirud

Better explained than when I was a student in a university.

alexmattheis

If you have the error "ValueError: labels ['class'] not contained in axis", the following steps will solve your issue:
1. Save the downloaded file without column names like id, class, clump_thickness. In other words, just save the file as it is.
2. Assign the column names like this:
df.columns = ["id", "clump_thickness", "unif_cell_size", "unif_cell_shape", "marg_adhesion",
"single_epith_cell_size", "bare_nuclei", "bland_chrom", "norm_nucleoli", "mitoses", "class"]
This line should come after df = pd.read_csv(). Pass header=None to read_csv so the first data row is not consumed as the header.

joshsato
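
One caveat with a headerless file: by default `pd.read_csv` treats the first line as the header, so assigning `df.columns` afterwards silently costs one data row. A possible alternative, sketched below with two made-up rows in an in-memory string, is to pass the names through the `names` parameter.

```python
import io
import pandas as pd

columns = ["id", "clump_thickness", "unif_cell_size", "unif_cell_shape",
           "marg_adhesion", "single_epith_cell_size", "bare_nuclei",
           "bland_chrom", "norm_nucleoli", "mitoses", "class"]

# Made-up rows with no header line, like the raw downloaded file.
raw_text = """1001,5,1,1,1,2,1,3,1,1,2
1002,8,10,10,8,7,?,9,7,1,4
"""

# names= supplies the header, so every line is kept as data.
df = pd.read_csv(io.StringIO(raw_text), names=columns)

# For comparison: the default read uses the first row as the header.
lossy = pd.read_csv(io.StringIO(raw_text))
lossy.columns = columns

print(len(df), len(lossy))
```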
