controlnet paper explained - Adding Conditional Control to Text-to-Image Diffusion Models

preview_player
Показать описание
ControlNets is the first paper to enable precise spatial control of the generated outputs of image generation models. It won the best prize in the prestigious ICCV 2023 conference.

This video covers the architecture of ControlNets, the idea of classifier-free guidance, and how it has been modified for resolution reweighting. It also covers the qualitative results and ablation studies.

⌚️ ⌚️ ⌚️ TIMESTAMPS ⌚️ ⌚️ ⌚️
0:00 Introduction to ControlNet
1:45 Neural Network Blocks
2:04 ControlNet Architecture
3:02 ControlNet with Stable Diffusion
5:05 ControlNet Training
6:39 Classifier-free Guidance Resolution Weighting
6:56 Classifier Guidance
8:58 Classifier-free Guidance
9:46 Classifier-free Guidance Resolution Weighting
11:08 Ablation Studies

🛠 🛠 🛠 MY SOFTWARE TOOLS 🛠 🛠 🛠

📚 📚 📚 BOOKS I HAVE READ, REFER AND RECOMMEND 📚 📚 📚

MY KEY LINKS

WHO AM I?
I am a Machine Learning researcher/practitioner who has seen the grind of academia and start-ups equally. I started my career as a software engineer 15 years ago. Because of my love for Mathematics (coupled with a glimmer of luck), I graduated with a Master's in Computer Vision and Robotics in 2016 when the now happening AI revolution just started. Life has changed for the better ever since.

#machinelearning #deeplearning #aibites
Рекомендации по теме
Комментарии
Автор

Great work. Interesting paper read indeed.
At 7:27 ; Bayes theorem is incorrect. P(X/Y) = P(Y/X).P(X) / P(Y) ; The rest of the math that follows is fine.

abcd
Автор

great video but is not clear how one train it, one needs to have pairs of controlnet input - image output right?

frazuppi