Stable Diffusion - How to build amazing images with AI

This video is about Stable Diffusion, the AI method to build amazing images from a prompt.

If you like this material, check out LLM University from Cohere!

Get the Grokking Machine Learning book!
Discount code (40%): serranoyt
(Use the discount code on checkout)

0:00 Introduction
1:27 How does Stable Diffusion work?
2:55 Embeddings
12:55 Diffusion Model
15:00 Numerical Example
17:39 Embedding Example
19:37 Image Generator Example
28:37 The Sigmoid Function
34:39 Diffusion Model Example
41:03 Summary
Comments

I am a fan of your work. I read your "Grokking Machine Learning". It's awesome. I am totally impressed. I stopped watching other AI videos and now follow you for most of this material. Simple and practical explanations. Thanks a lot, and I'm grateful to you for spreading the knowledge.

krajanna

These videos are always incredibly helpful, informative, and understandable. Very grateful

thebigFIDDLES

Serrano, you are a genius, bro. Your channel is so underrated.

shafiqahmed

Always impressed with how understandable yet detailed your videos are. Thank you!

jasekraft

Amazing, I hope to truly understand the mechanism of stable diffusion through this video!

wanggogo

Amazing!! Thanks for this high level overview. It was really helpful and fun 👍

kyn-sskc

excellent explanation - thank you so much

anthonymalagutti

Superb, such an elegant explanation. Big thanks, sir!

avijitsen

Really incredible job of stepping through the HELLO WORLD of image generation, especially how the video compresses the key output into a 4x4 pixel grid and clearly hand-computes each step of the way!

MikeTon

Great video. It gives good intuition for deep network architecture. Thanks.

skytoin

Really amazing work, easy to understand and grasp. You are doing a great deal for the community. Thanks a lot.

abhaymishra-ujjp

You are the best explainer ever. You are amazing.

NigusBasicEnglish

Thank you for such wonderful visualization that conveys an overview of complex mathematical concepts.
Can you please do a video detailing the underlying architecture of the neural network that forms the diffusion model?
Also, are Generative Adversarial Networks (GANs) not used anymore for image generation?

AravindUkrd

Thanks for teaching Mr Luis! I still remember fondly you teaching me machine learning basics over drinks in SF

olesik

An amazing job of dismantling complex structures in depth. That's real ML/AI democratization.

samirelzein

In the intermediate result, it is said that after the sigmoid we will not get a sharp image of the ball and bat. How can there be fractional pixel values? Since the image is monochromatic, each pixel should be either 0 or 1, right? Rounding to the nearest integer would give the same result as before the sigmoid. Even if it's not monochrome, pixels can't be fractions, right?

aswinosbalaji
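
Editor's note on the question above: a minimal sketch, assuming (as is common, though not confirmed by the video) that a sigmoid output is read as a grayscale intensity between 0 and 1 and scaled to an 8-bit value for storage. The scores are made up for illustration. Fractional values then stand for shades of gray, and thresholding at 0.5 gives back a strictly black-and-white image.

```python
import math


def sigmoid(x: float) -> float:
    """Squash any real number into the open interval (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


# Hypothetical pre-sigmoid scores for a 2x2 patch (not taken from the video).
scores = [[3.0, -3.0], [0.5, -0.5]]

# Each score becomes a fractional intensity strictly between 0 and 1.
intensities = [[sigmoid(s) for s in row] for row in scores]

# Assumed storage convention: scale intensities to 8-bit gray levels (0..255).
gray_8bit = [[round(255 * v) for v in row] for row in intensities]

# Thresholding at 0.5 yields a binary image; this is the same as
# checking whether the original score is non-negative.
binary = [[1 if v >= 0.5 else 0 for v in row] for row in intensities]

print(intensities)
print(gray_8bit)
print(binary)
```

Under these assumptions, rounding the sigmoid output does discard information: a score of 0.5 and a score of 3.0 both round to 1, but they map to different gray levels.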

So can we just use the diffusion model to denoise low quality or night time shots?

olesik

Thank you for your amazing educational videos!
I have a question, though: are any transformers (plus an attention mechanism) involved in the text-to-image generator (the diffusion model)?
If not, then how are the semantics of the text captured?

hamidalavi

Could it be that the diffusion model is trained to learn how much noise has to be removed from the input image, instead of the image with less noise? That is what I understood from other sources, because they say that is easier for the model. Thank you, and good video, very enlightening.

ASdASd-krft
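
Editor's note on the question above: many diffusion implementations do train the network to predict the noise rather than the cleaner image. The sketch below only contrasts the two training targets. It assumes simple additive Gaussian noise, and the 4-pixel image and noise scale are hypothetical. No network is trained here.

```python
import random

random.seed(0)  # fixed seed so the example is repeatable

# Hypothetical flattened 4-pixel "clean" image (illustrative values only).
clean = [1.0, 0.0, 0.0, 1.0]

# Assumed noise model: independent Gaussian noise with standard deviation 0.3.
noise = [random.gauss(0.0, 0.3) for _ in clean]

# The model's input is the noisy image.
noisy = [c + n for c, n in zip(clean, noise)]

# Option A: train the model to output the cleaner image directly.
target_image = clean

# Option B: train the model to output the noise that was added.
target_noise = noise

# With a perfect noise prediction, subtracting it from the input
# recovers the clean image, so both targets carry the same information.
recovered = [x - n for x, n in zip(noisy, target_noise)]

print(noisy)
print(recovered)
```

Both options describe the same denoising step. They differ only in which quantity the network is asked to output.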

Hi @Louis. Your videos are very informative and I love them. Thank you so much for sharing your knowledge with us.
I wanted to know if "Fourier Transforms in AI" is in your pipeline. I request you to please give some intuitions around that in a video. Thanks in advance.

abhishek-zmtx