TrackFormer: Multi-Object Tracking with Transformers

preview_player
Показать описание
Following DETR's approach for object detection using transformers, TrackFormer employs them for multi-object tracking given an input video. This has the potential to be beneficial in surveillance, autonomous vehicles, sports analysis, and other applications. In this YouTube video, I present how DETR detects objects and how TrackFormer extends the idea for the task of multi-object tracking.

Table of Content:
00:00 Introduction
00:15 Previous Attempts
02:17 DETR
08:07 TrackFormer
13:47 Bipartite Matching
21:51 Set Prediction Loss
23:31 Track Augmentation
26:39 Result

Рекомендации по теме
Комментарии
Автор

This is pure gold! Huge! Thanks for share. invaluable to a project I am working on

HLAI
Автор

the index and the sigma discussion at 19:00at minimum cost mapping is confusing to me, I'll have to rewatch it later

samanthaqiu
Автор

The loss functions were definitely a bit tricky to get around. But that was a really cool video tho!
One thing you could've also touched upon is the usage of deformable detr in place of detr. I can see the trackformer code does incorporate it but wanted to know what changes in trackformer when you switch from detr to deformable one?

subramanyabhat
Автор

Hi, I am currently reading this paper, and I don't know where do the sigma_NMS and the other sigma threshold come from since the output according to this paper is just bounding box and classification result.

Where do we get those sigma value from the output of TrackFormer ?
Or the sigma value is the front 3 value of the classification score vector ?

蔡嘉祥-qm