Segment Anything Paper Explained: New Foundation Model From Meta AI Is Impressive!

Meta AI just released the Segment Anything Model (SAM), an important step toward the first foundation model for image segmentation. I have read the paper and played with the code over the past few days, and I would like to share some insights about this model. I've aimed to be concise and informative, providing you with a brief but comprehensive overview.

⭐ SUPPORT ⭐ ──────────────────
- Subscribe!

CHAPTERS
____________________
00:00 - Intro
00:22 - Demo
01:20 - Foundation Model
01:53 - Data Engine
03:39 - Promptable Architecture
05:10 - Zero-Shot Evaluation
05:35 - Discussion

PAPERS
____________________

USEFUL LINKS
____________________

TRY SAM LOCALLY
____________________
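A minimal sketch of what running SAM locally can look like with Meta's segment-anything package and a single point prompt; the image path, checkpoint filename, and click coordinates are placeholders to swap for your own:

# Minimal sketch: segment one object with a point prompt.
# Assumes the package is installed (pip install git+https://github.com/facebookresearch/segment-anything.git)
# and a ViT-H checkpoint such as sam_vit_h_4b8939.pth has been downloaded from the repo.
import cv2
import numpy as np
import torch
from segment_anything import sam_model_registry, SamPredictor

device = "cuda" if torch.cuda.is_available() else "cpu"
sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")
sam.to(device=device)

image = cv2.cvtColor(cv2.imread("example.jpg"), cv2.COLOR_BGR2RGB)  # SAM expects RGB

predictor = SamPredictor(sam)
predictor.set_image(image)  # computes the image embedding once, reused for every prompt

# One foreground click (label 1) at a placeholder pixel location.
masks, scores, _ = predictor.predict(
    point_coords=np.array([[500, 375]]),
    point_labels=np.array([1]),
    multimask_output=True,  # returns three candidate masks with quality scores
)
best_mask = masks[np.argmax(scores)]  # boolean HxW array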

********************
#ai #meta #computervision #airesearch
COMMENTS

Thank you for watching! Feel free to ask any questions about SAM, the paper, or how to run it locally.
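On the "run it locally" point: besides the prompted mode shown in the description snippet, the repo also exposes an automatic "segment everything" generator; a minimal sketch, reusing the sam model and image loaded there:

# Automatic ("segment everything") mode, reusing `sam` and `image`
# from the snippet in the description above.
from segment_anything import SamAutomaticMaskGenerator

mask_generator = SamAutomaticMaskGenerator(sam)
masks = mask_generator.generate(image)  # list of dicts, one per mask

# Each entry carries the binary mask plus quality metadata.
for m in masks[:3]:
    print(m["bbox"], m["area"], m["predicted_iou"], m["stability_score"])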

botsknowbest

An academic paper in the thumbnail always lets me know that the video is likely well researched, nice

yoavsnake

As someone in technology, I know this channel will gain followers; you really go into detail.

PA-eofs

Thanks for the wonderful vid! I am interested in *labeled* masks; have you seen the work on the hybrid mode of Grounded-DINO + SAM? I'm curious how I can use a labeled dataset I have (of sea objects) to train the model to detect not only a boat/ship but also to identify the name of the marine vessel.
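For context on that Grounded-DINO + SAM combination: the usual pattern is to let a text-prompted detector produce labeled boxes and hand those boxes to SAM as box prompts, since SAM itself only returns class-agnostic masks. A rough sketch; detect_boxes is a hypothetical placeholder for Grounding DINO or any detector fine-tuned on your sea-object labels:

# Rough sketch of the detector-plus-SAM pattern for labeled masks.
# detect_boxes is a hypothetical placeholder: a text-prompted detector
# (e.g. Grounding DINO) would return (xyxy_box, label, score) tuples here.
import numpy as np

def detect_boxes(image, text_prompt):
    raise NotImplementedError  # placeholder, not a real API

labeled_masks = []
for box, label, score in detect_boxes(image, "boat . ship . marine vessel"):
    masks, _, _ = predictor.predict(   # SamPredictor from the snippet in the description
        box=np.array(box),             # XYXY box in pixel coordinates
        multimask_output=False,
    )
    labeled_masks.append({"label": label, "mask": masks[0], "det_score": float(score)})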

kobic

Great explanation. What I can't get my head around is how the training data for SAM is generated by a model itself. Wouldn't you get a transfer of bias (e.g. the bias of the model that generates the training set is reflected in what SAM learns)?

I mean, if that bias is low, it can work, but conceptually that's a fairly odd thing to do in the field, right?

ColorfullHD

Very good explanation... can SAM work for medical images?

ashwiniyadav

Can these models key/roto video hair strands as well as a human compositor?

Take your video as an example. It is more or less acceptable for YT, but it is unacceptable even for a short film. You can see the despilled edges. You probably kept those because you wanted to preserve edge details; if you wanted to get rid of them, you would lose detail. To do both at the same time you need the more advanced keying techniques that pro VFX artists use, not just picking a color and playing with balance and blur.

If an AI model isn't as good as that, it can be used on social media and for people to have fun. But if you want to use it in movies to actually make it believable, to allow more people to make movies more easily, to really take advantage of it, and to save a ton of time and money, that will require some precision.

In films you can't really tell if a scene used a green/blue screen even if you zoomed in 400x. The edge transitions are so clean that even shown side by side, you really can't tell. I would love to see an example where this is achieved with AI.

Now, all of this is chroma keying (green/blue screen). Rotoscoping, which doesn't rely on keying a single color, depends entirely on precision. VFX artists can also do that perfectly, but it is a much harder task. And doing it at 24 frames a second seamlessly, without any flickering or changing edges, is even harder.

I would love to see an example where this is achieved.

berkertaskiran