Diffusion Models - Live Coding Tutorial

This is my (mostly) live coding video, where I implement from scratch a diffusion model that generates 32 x 32 RGB images. The tutorial assumes a basic knowledge of deep learning and Python.

Links:

Sources:

Timestamps:
0:00 Introduction
0:32 Theoretical background
13:13 Live Coding - Forward diffusion
41:29 Live Coding - Training loop
1:00:05 Live Coding - Overfitting one batch
1:03:36 Live Coding - Reverse diffusion
1:13:40 Live Coding - Training on the CIFAR-10 dataset
1:17:24 Live Coding - Result evaluation
1:19:40 (Bonus) Quick explanation of the UNet architecture used in the tutorial
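
For reference, the heart of the forward-diffusion segment is the closed-form sampling of q(x_t | x_0). Below is a minimal sketch assuming a standard DDPM linear beta schedule with 300 steps; the exact constants and variable names in the video may differ.

```python
import torch

T = 300  # number of diffusion steps (an assumption, not necessarily the video's value)

# Linear beta schedule and the cumulative products used by the closed form
betas = torch.linspace(1e-4, 0.02, T)
alphas_cumprod = torch.cumprod(1.0 - betas, dim=0)

def forward_diffusion(x0: torch.Tensor, t: torch.Tensor):
    """Sample x_t ~ q(x_t | x_0) in a single step.

    x0: batch of images, shape (B, 3, 32, 32), scaled to [-1, 1]
    t:  integer timesteps, shape (B,)
    """
    noise = torch.randn_like(x0)
    sqrt_ac = alphas_cumprod[t].sqrt().view(-1, 1, 1, 1)
    sqrt_om = (1.0 - alphas_cumprod[t]).sqrt().view(-1, 1, 1, 1)
    # x_t = sqrt(alpha_bar_t) * x0 + sqrt(1 - alpha_bar_t) * noise
    return sqrt_ac * x0 + sqrt_om * noise, noise
```

The network is then trained to predict `noise` from (x_t, t), typically with an MSE loss.
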
Comments

Thanks man, I really appreciate your work

adeolaogunleye

I have looked at almost every video on this subject, and this is by far the best approach: it's simple enough to be well understood, but it gives you all the tools to build more advanced models. I wish you could do a remake of this one, because sometimes the code snippet is out of frame and sometimes it's hard to read because of the font size. Thanks a lot for this upload!

outroutono

Thanks for sharing your work with us, I appreciate it!

bbbaaa

Good tutorial, I just wish we could see the whole screen while you're coding, as most of the new lines you added were off-screen :/ Keep it up!

danielfirebanks

Thanks a lot! I really appreciate it.
This tutorial explains things clearly. Awesome!
Hope to see more tutorial videos on your YouTube channel, thanks.

chichi

Great tutorial. Thanks for sharing.
Please make slightly more advanced tutorials, like conditional (image or text) generation of images using diffusion.
I see that there are very few advanced tutorials by any YouTuber.

kanakraj

Better font, but I still can't read it, not only on my phone, which is my main content-consuming device, but even on my 13-inch MacBook. Thank God I have a 55-inch TV I can watch it on. Even with such struggles I will continue to watch such a diamond of a video!
Thanks for the video! Great content!

VitaliyHAN

Thanks for this video. Can you make a video about applying higher resolutions to this project?

duyquangnguyen

You should have zoomed in on the screen more so that it's properly visible. Still, I appreciate your efforts! Nice vid.

paneercheeseparatha

Hi, thanks for the video. But can you explain the part about how you introduce the positional encoding to the network? Also, can this model work with a feed-forward neural network rather than a U-Net?

anshumansinha
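
For readers with the same question: diffusion UNets commonly inject the timestep through a sinusoidal embedding (as in the Transformer paper), which is passed through a small MLP and added to the feature maps of each block. A minimal sketch of such an embedding follows; this is the common pattern, not necessarily the exact code from the video.

```python
import math
import torch

def timestep_embedding(t: torch.Tensor, dim: int) -> torch.Tensor:
    """Map integer timesteps of shape (B,) to sinusoidal features of shape (B, dim)."""
    half = dim // 2
    # Geometrically spaced frequencies, as in the original Transformer encoding
    freqs = torch.exp(
        -math.log(10000.0) * torch.arange(half, dtype=torch.float32) / half
    )
    args = t.float()[:, None] * freqs[None, :]
    return torch.cat([torch.sin(args), torch.cos(args)], dim=-1)
```

As for the second question: the training objective itself does not require a U-Net; in principle any network that maps (x_t, t) to a noise estimate can be used, though a plain feed-forward network tends to perform much worse on images.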

Thanks for the tutorial.
Why is posterior_variance_t = betas_t? Shouldn't it be equal to betas_t * (1 - alphas_cumprod_t_minus_1) / (1 - alphas_cumprod_t), according to [Lil' Log]?

jflimnl
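
On the question above: the DDPM paper (Ho et al., 2020) reports that both choices, sigma_t^2 = beta_t and the "true" posterior variance beta_tilde_t = beta_t * (1 - alpha_bar_{t-1}) / (1 - alpha_bar_t), gave similar results in practice, so using betas_t is a common simplification. A sketch of the latter, assuming the same schedule tensors as in the forward-diffusion sketch above:

```python
import torch
import torch.nn.functional as F

T = 300
betas = torch.linspace(1e-4, 0.02, T)
alphas_cumprod = torch.cumprod(1.0 - betas, dim=0)

# alpha_bar_{t-1}, padded with the convention alpha_bar_0 = 1
alphas_cumprod_prev = F.pad(alphas_cumprod[:-1], (1, 0), value=1.0)

# "True" posterior variance beta_tilde_t from the DDPM paper
posterior_variance = betas * (1.0 - alphas_cumprod_prev) / (1.0 - alphas_cumprod)
```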

Is there a difference between `result = alpha_hat.gather(-1, t)` and `result = alpha_hat[t]`?

brunokemmer
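
For what it's worth, the two expressions are equivalent when `alpha_hat` is a 1-D schedule tensor and `t` is a 1-D batch of timestep indices; `gather` mainly matters for higher-dimensional inputs. A quick check under those assumptions:

```python
import torch

alpha_hat = torch.cumprod(1.0 - torch.linspace(1e-4, 0.02, 300), dim=0)
t = torch.randint(0, 300, (8,))  # a batch of timestep indices

a = alpha_hat.gather(-1, t)  # gather along the last (and only) dimension
b = alpha_hat[t]             # plain advanced indexing
assert torch.equal(a, b)     # identical results for this 1-D case
```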

Coming from a programming background, I always find it very strange to name variables with generic Greek letters or just X and Y. I am not criticizing your video specifically; it is a pattern that is very widespread. But, for example, you name the first parameter of the forward_diffusion function "x0". Is it to save space? Is it because you think it is easier to relate it to the mathematical formulas?
In my mind it would be much clearer if "x0" were named "image". Or am I perhaps misunderstanding your explanation?
As I mentioned, I don't think your video is bad. I'm just curious why it is so common for machine-learning code to be so generically named.

nqvst

Can you make an image-to-image tutorial?

chiscoduran

Can you say why the output was not as fascinating, and what can be done from here to make the output clearer? @dtransposed79

playmaker

Thanks man, I really appreciate your work

utxuebc