Multiclass Image Segmentation using UNETR in TensorFlow | Vision Transformer for Image Segmentation

preview_player
Показать описание
📺 Video Description: In this video, we are going to train the UNEt TRansformers (UNETR) architecture on the Landmark Guided Face Parsing dataset (LaPa) dataset for Multiclass Image Segmentation.

UNETR, or UNet Transformer, is a specialized architecture for medical image segmentation. It uses a pure transformer as the encoder, focusing on learning sequence representations for the input volume to capture global multi-scale information. The encoder connects directly to a decoder through skip connections, forming a U-Net-like structure and producing the ultimate semantic segmentation output.

🕒 Timeline:
00:00 - Introduction
00:59 - Landmark Guided Face Parsing dataset (LaPa) dataset.
02:46 - UNETR Architecture
04:35 - Training the UNETR
16:23 - Testing the UNETR
31:36 - Conclusion

💡Support:

🌐 Connect with Me:
Instagram: instagram/nikhilroxtomar

Immerse yourself in the world of UNETR and revolutionize your understanding of image segmentation. Subscribe, code along, and let's embark on this transformative journey together!
Рекомендации по теме
Комментарии
Автор

Nice❤ If you could post a video of attention Unet using pre-trained encoder as ResNet50 or any other one, it would be appreciated

AbrarMr
Автор

Hey this code could be beneficial for my research work, the thing is I do not have rgb_codes for my mask images.
I also have 11 classes including background, and I have converted the pixel values to 1, 2, 3, 4..., 10 for all my classes, how do I assign rgb_codes in my case. Was I able to explain my problem? Please let me know

puranjitsingh
Автор

Please i want to use this method but my dataset does'nt have the color code and and labels. it has just the images and their respective mask. could you help on the approach on how to accurately get this done? Thanks for your work.

Mind__Relaxation
Автор

How to calculate the dice coefficient value for this multiclass segmentor?

project-fd
Автор

Can you please upload the Google colab code

SethmiyaAbeyrathna
Автор

The 'sigmoid' activation function is employed in the final layer of the Unet2d code, which seems erroneous for a multi-class segmentation task. Instead, the 'softmax' function should be utilized.

puranjitsingh
Автор

Can this model works good on medical images? And how can i train this model for binary class segmentation 0 is my background and 1 is my mask region

desmondsamuel
Автор

What are the evaluation metrics that can be evaluated from this? IoU for each class accuracy etc. can be evaluated?

sidharthpisharody
join shbcf.ru