SAS Tutorial | K-means Clustering Algorithm

preview_player
Показать описание
In this SAS How To Tutorial, Cat Truxillo explores using the k-means clustering algorithm. In SAS, there are lots of ways that you can perform k-means clustering. You can write a program in PROC FASTCLUS, PROC KCLUS, PROC CAS, python, or R; Point and click in Visual Statistics, Enterprise Guide, Enterprise Miner, JMP, Model Studio, and SAS Studio. You can profile your clusters with graphs, save cluster scores to data sets, and much more. In this video, Cat will demonstrate k-means clustering two ways; using SAS Visual Statistics and using SAS Studio.

Download Data Files

Chapters
0:00 – What is k-means clustering and How does k-means clustering work
2:55 – How to do k-means clustering in SAS Visual Statistics
9:39 – How to do k-means clustering in SAS University Edition or SAS Studio

Learn more about SAS Software
◉ Ask the Expert: Cluster Analysis: Introduction to Best Practices –

SUBSCRIBE TO THE SAS USERS YOUTUBE CHANNEL #SASUsers #LearnSAS

ABOUT SAS
SAS is a trusted analytics powerhouse for organizations seeking immediate value from their data. A deep bench of analytics solutions and broad industry knowledge keep our customers coming back and feeling confident. With SAS®, you can discover insights from your data and make sense of it all. Identify what’s working and fix what isn’t. Make more intelligent decisions. And drive relevant change.

CONNECT WITH SAS
Рекомендации по теме
Комментарии
Автор

Thank you, searched the web for a while, only to find very few instructions on K-Means clustering for SAS. Your video is very informative and helpful!

yvesyuxuanfan
Автор

This is wonderful! I don't know how I missed seeing this when it was released. This is one to watch take notes! I am looking forward to applying it. Thanks, Cat!

rogerward
Автор

I really liked the idea of using PCA along with clustering.

K-means is in such wide use that I wish people would understand that k-means will produce clusters whether or not the data are naturally clustered. As such, it is really a *partitioning* method. As an example, you can create 4 random uniform variables between 0 and 1 and cluster them with k-means, and you'll get an answer. I'm glad you at least cover the CCC, and I think value could be added by discussing whether the summary stats justify the division of the data into clusters.

AS it pertains to this video, the IRIS example shows clusters 2 and 3, but I'm not convinced, without seeing a rotation that separates them, that they are really separate.

michaeltuchman
Автор

Hey, join the conversation! How are you using k-means clustering right now? Leave a comment and let me know.

catherinetruxillo
Автор

As always, excellent presentation and demonstration Dr. Cat!

chrisblankenship
Автор

Very informative. Keep up the good work!

rhodesfrances
Автор

Thank you, such an insightful lecture, really enjoy it!

tingtingchen
Автор

Is there an approach or algorithm to figure out what the optimum number of groups might be given the data?

LaSupp
Автор

You didn't show how to do cluster observation in University Edition

gagofure
Автор

If I have an ID variable that identifies each observation.
Is there a way to assign each ID with a new variable (Cluster) that shows the cluster it belongs to, in Visual analytics?

dritans