Multiclass Image Segmentation using UNETR in TensorFlow | Vision Transformer for Image Segmentation

Показать описание

📺 Video Description: In this video, we are going to train the UNEt TRansformers (UNETR) architecture on the Landmark Guided Face Parsing dataset (LaPa) dataset for Multiclass Image Segmentation.

UNETR, or UNet Transformer, is a specialized architecture for medical image segmentation. It uses a pure transformer as the encoder, focusing on learning sequence representations for the input volume to capture global multi-scale information. The encoder connects directly to a decoder through skip connections, forming a U-Net-like structure and producing the ultimate semantic segmentation output.

🕒 Timeline:
00:00 - Introduction
00:59 - Landmark Guided Face Parsing dataset (LaPa) dataset.
02:46 - UNETR Architecture
04:35 - Training the UNETR
16:23 - Testing the UNETR
31:36 - Conclusion

💡Support:

🌐 Connect with Me:
Instagram: instagram/nikhilroxtomar

Immerse yourself in the world of UNETR and revolutionize your understanding of image segmentation. Subscribe, code along, and let's embark on this transformative journey together!

Рекомендации по теме

Комментарии

Nice❤ If you could post a video of attention Unet using pre-trained encoder as ResNet50 or any other one, it would be appreciated

AbrarMr

Hey this code could be beneficial for my research work, the thing is I do not have rgb_codes for my mask images.
I also have 11 classes including background, and I have converted the pixel values to 1, 2, 3, 4..., 10 for all my classes, how do I assign rgb_codes in my case. Was I able to explain my problem? Please let me know

puranjitsingh

Please i want to use this method but my dataset does'nt have the color code and and labels. it has just the images and their respective mask. could you help on the approach on how to accurately get this done? Thanks for your work.

Mind__Relaxation

How to calculate the dice coefficient value for this multiclass segmentor?

project-fd

Can you please upload the Google colab code

SethmiyaAbeyrathna

The 'sigmoid' activation function is employed in the final layer of the Unet2d code, which seems erroneous for a multi-class segmentation task. Instead, the 'softmax' function should be utilized.

puranjitsingh

Can this model works good on medical images? And how can i train this model for binary class segmentation 0 is my background and 1 is my mask region

desmondsamuel

What are the evaluation metrics that can be evaluated from this? IoU for each class accuracy etc. can be evaluated?

sidharthpisharody

Multiclass Image Segmentation using UNETR in TensorFlow | Vision Transformer for Image Segmentation

Multiclass Image Segmentation using UNETR in TensorFlow | Vision Transformer for Image Segmentation

PyTorch Image Segmentation Tutorial with U-NET: everything from scratch baby

Multiclass Segmentation using UNET in TensorFlow | Crowd Instance-level Human Parsing (CHIP) Dataset

UNET Transformers: UNETR Implementation for 2D Segmentation in TensorFlow

UNETR Implementation for 2D Segmentation in PyTorch | UNTER = Vision Transformer + CNN Decoder

Dr. Asifullah Khan | MaxViT-UNet: Multi Axis Attention for Medical Image Segmentation

Implementing MultiResUNET in TensorFlow | Semantic Segmentation | Computer Vision | Deep Learning

Unet++ Model for Image Quality Detection: Model and Python Code Explained

Coding a U-net Convolutional Neural Network in Tensorflow

Python Image Segmentation Tutorial (2022)

Hair Segmentation with Vision Transformers (UNETR) in TensorFlow

I Trained UNETR to Segment Faces Into 11 Classes – U-net Transformer tutorial

UNet++ with Attention Mechanism for Hippocampus Segmentation

Geospatial deep learning with TensorFlow Keras: Train neural network model for semantic segmentation

Instance Segmentation of Image using Swin Transformer

AFTer-UNet: Axial Fusion Transformer UNet for Medical Image Segmentation

How To Train SegFormer on a Custom Dataset for Computer Vision

Image segmentation with pytorch using unet with imagenet weights as autoencoder

[P070] Semantic Segmentation of 3D Medical Images Through a Kaleidoscope

229 - Smooth blending of patches for semantic segmentation of large images (using U-Net)

The U-Net Model

PyTorch and Monai for AI Healthcare Imaging - Python Machine Learning Course

SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers

A Flexible 2.5D Medical Image Segmentation Approach with In-Slice and Cross-Slice Attent