How to Train Textual Inversion - Stable Diffusion AI | Embeddings and Low Memory

preview_player
Показать описание
Did you know that you can use Stable Diffusion to create unlimited professional looking photos of yourself?

This video follows the procedures outlined in the white paper cited below describing the technique of textual inversion:
Gal, R., Alaluf, Y., Atzmon, Y., Patashnik, O., Bermano, A. H., Chechik, G., & Cohen-Or, D. (2022, August 2).

Using the stable-diffusion-webui to train for high-resolution image synthesis with latent diffusion models, to create stable diffusion embeddings, it is recommended to use stable diffusion 1.5 models with diffusers and transformers from the automatic1111 webui.

Training observed using an NVidia Tesla M40 with 24gb of VRAM and an RTX3070 with 8gb of VRAM.

🔴 RECOMMENDED VIDEOS
I have created this stable diffusion tutorial series to help you get started
Part 5 in the Stable Diffusion Tutorial Series:

🎥 How to Install - Windows, Waifu ckpt

🎥 How to Install - MacOS, 2.1 model 768 ckpt

🎥 How to Install - Ubuntu Linux, 2.1 model 768 ckpt

🎥 How to Prompt

🎥 How to img2mg and Denoising and Update

🎥 How to Inpainting

5. YOU ARE HERE

🎥 How to Outpaint

#stablediffusiontutorial #stablediffusion #stablediffusiontextualinversion #textualinversion

Fast Image Resizer -

Prompt Template Files (Human Subjects) -

Audio : Hackers by Karl Casey White Bat Audio

When a statistical model fits exactly against training data, generative algorithms struggle to perform accurately against unseen data; they will have difficulty predicting if the model becomes overfitted and will be unable to generate new data.

Good luck trainers! Don't overtrain, you will deep fry your images.

Learn more about data overfitting here:

Here's my website:

Table of Contents:
Skip Ahead for the smarty pants.

0:00 Introduction and learning objectives.
1:00 Creating an Embedding: Section 1
1:32 Preprocessing Images: Section 2
2:57 Check Filenames For Accuracy.
3:10 Training Tab: Section 3
3:49 Style Filewords
4:45 Low VRAM 8GB Solution
6:09 Data overfitting Checking your work. Section 4
7:12 Recap Learning Objectives
Рекомендации по теме
Комментарии
Автор

So glad I clicked on this video! I couldn't find anything on embedding and here it was. Also had the out of memory error when trying to train a hypernetwork last night and the answer for that was here too! I'm going to try this out asap. I do have a question about the memory thing though. If you set that field to zero, how do you know what the training looks like? Does it just generate one image at the final step you set?

stolencoats
Автор

In my 10 years of watching YouTube, I've never seen a better tutorial. Is not too slow like most tutorials and the explanation is spot on. You are amazing! Thank you.

daqem
Автор

Ive spent all day following tutorials and waiting on 6 hrs of training to generate meh results. I wish id have found your video sooner! Already getting better results

nicholasdacek
Автор

Great tutorial. Those prompt template files really improved my training. Very much appreciated.

deeko
Автор

Thanks for making your video so straightforward. It was very easy to follow. I think sometimes some of the guys who are making these end up getting a little too 'clever' for their own good! lol

judpratt
Автор

Good vid, one of the more straight forward ones I've found on the subject. Thanks.

foot
Автор

My god you went for a Tesla M4...you are a hero!!! I wanted to do this but was too scared of having to deal with the setup and all!!

swannschilling
Автор

How couldn't I find your video even though it's 1 month already since upload!! I will try it now.

Maulana-Al-Bakrichod
Автор

You are an excellent teacher and will grow a user following, best of luck and thank you for explaining a rather tricky subject well. Look forward to more on stable diffusion from you, cheers.

criddyla
Автор

Fantastic tutorial, short, to the point and helpful!

mitchelllamper
Автор

Could this work on something like hands? Normally the AI doesn't do hands well, but if you fed it a bunch of pictures of just hands could it end up actually representing them properly through text to image?

JonnyCrackers
Автор

Great video. How to train special clothes like puffer coats. Maybe you can make another trainig video about clothes and textures

paskmoe
Автор

Thank-you for this tutorial! There's just one thing I need clarity on, in the templates, do we need to change the name in [name] to our embedding name?

lisagryffyths
Автор

I don't understand what I'm doing wrong, I followed the tutorial exactly but when training I'm noticing the previews look nothing like the images I'm using to train. It seems like its just taking the blip caption and outputting a random picture with that prompt, none of the pictures look like the person in the photos. Could someone help please?

kizahi
Автор

Ty for this vid. Have to slow down to follow along ;-)

martinkaiser
Автор

So I'm about to try for the 3rd time making an embedding of my face. First time with this video. Is the initialization text the prompt that will be used? Can it be different then the name of the embedding?

All the preview images from the video look pretty good as its training while mine all look like I'm an old man looking for my lost chromosome. The templet I was using from another video was just (name) so I'm guessing thats my problem?

I used the base steps and 900 to save while this video was 1000 and 10. That a setting to play with? And how many old man previews should I go through before starting over?

Thanks I've basically just been bashing my face against the keyboard and seeing what results I get.

mikedonahue
Автор

What GPU are you using for the video output?

mikealbert
Автор

It's kinda insane how fast this whole AI topic evolves. 2 weeks ago I would have tried to do what's explained in this video (which was very well explained btw), and today I'm using a google colab version of TheLastBen's implementation of fast Dreambooth to achieve the same or even better result in less time and no high-end graphics card, which I have discovered just a week ago 🤯

YVZSTUDIOS
Автор

Thanks for this video! I just don't understand what to put for [filewords] in the txt. I grabbed 20 images converted them to png, 512 512 but I don't have captions like you do. The captions show up as a txt document beside the picture. What can I put for filewords?

iamnotamoose
Автор

So weird question do you know if something broke with SD and training embeddings? I followed your video and it worked but now when I try and do it none of the pictures look anything like the person in my pictures. Just can't seem to figure out if a patch or something broke it lol first time worked flawlessly but then just kind of stopped working.

west