Seaborn clustermap | How the clustermap works and what is the clustermap?

preview_player
Показать описание
This seaborn clustermap tutorial video shows you how to create a clustermap using Python seaborn. First we talk about what the clustermap is: a matrix plot with a heatmap and two clustering dendrograms. Next I walk you through some basic clustermap code for a toy data set so that you can understand what the clustermap does. I then explain the algorithm behind the clustermap: hierarchical agglomerative clustering. We discuss how the clustermap algorithm works and how it makes a dendrogram. Then I show you how to build a clustermap for more advanced data and how you can use the seaborn clustermap standard_scale to scale your columns. Then we move on to two important parameters: the distance metric and the linkage. I walk you through what each of these are with example visuals, and we also see these in the seaborn clustermap code as the metric and method keyword arguments. Finally, I show you how to use the seaborn clustermap row_colors option to tag your rows by color based on a categorical column of your dataset.

0:00 What is the clustermap?
0:24 Basic clustermap code
3:34 How clustermap works (HAC)
5:16 Clustermap for penguins
8:05 Metric and Linkage
10:34 Clustermap row colors
12:24 Final tip

Github CODE:

Related VIDEOS:

#seaborn #dataviz
Рекомендации по теме
Комментарии
Автор

I'm gonna use this a lot in my Classification related notebooks. Thanks for the lucid and in-depth explaination of this plot. Keep going :)

PhotoninDark
Автор

I am preparing my data mining exam for wednesday and this helped me understanding how the heatmap can be used. The video is incredibly clear, thank you so much!

dadi
Автор

Honestly I didn't find even a decent enough explanation of clustermap elswhere. This saved me. Infact I realized that I had understood dendrograms wrongly. I could see quite a lot of technical detail in your videos.

jagan
Автор

Very clear super understandable and in-depth

syedshaulhameed
Автор

Thank You, You are the Best in Seaborn!

AkaExcel
Автор

Useful and straight to the point, thanks! To your knowledge is there a way to have an OnHover functionality to display the x, y label? With many items it becomes hard to use the axis labels.

est
Автор

you are professional keep the work you gonna be big YouTuber

Itay
Автор

Thank you for a great video with suggested clustered heatmap interpretation! Subscribed to your channel rightaway! :)

gikis
Автор

Thank you soo much for making this video! I just had a question regarding what the default linkage method is for seaborn. I was also wondering whether we can numerically calculate the distance between columns or rows.

kaivalyaabburi
Автор

This was nice and simple to follow! do more do more! 😄

paulocaldas
Автор

Thanks for this vídeo. Good explanation

julianorocha
Автор

Hi Kimberly, great video. What would you suggest if there are some missing values (NA) in my matrix and I don't want to drop any samples out?

jeffreychiu
Автор

Great video! In the penguin example, for each penguin, can we say what are the most important features from the dendogram?

hianjana
Автор

I didnt know that one exists, looks pretty interesting, thank you very much! I followed your tutorials for quite a while and recently i saw some videos about altair. The simple syntax reminded me about seaborn and it seems to be even more simple because you can also aggregate within altair and not use pandas before like you would do in seaborn. I also liked the hole idea behind it with the grammer for the visualisation. I curious what you think about altair?

KauleisteClever
Автор

Thank you for this... Can you explain how to calculate the cluster diatances

dhanujakumar