Все публикации

Ishan Dave - Towards Label Efficiency and Privacy Preservation in Video Understanding

P. Tirupattur-Video Action Understanding: Action Classification, Temporal Localization and Detection

Lecture 22 - FuseCap: Leveraging Large Language Models for Enriched Fused Image Captions

Lecture 21 - Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection

Lecture 20 - OWLv2: Scaling Open-Vocabulary Object Detection

Lecture 19 - CM3Leon: Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

Presentation - Adapting Pretrained Vision Language Foundational Models to Medical Imaging Domains

Presentation - Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital

Presentation - MedKLIP - Medical Knowledge Enhanced Language-Image Pre -Training

Presentation - Consistency-Preserving Visual Question Answering in Medical Imaging

Presentation - Intra-class Contrastive Learning Improves Computer Aided Diagnosis of Breast Cancer

Presentation - Is PET all you need - A multi-modal study for Alzheimer's disease using 3D CNNs

Presentation - CLIP-Driven Universal Model for Organ Segmentation and Tumor Detection

Presentation - A controllable and Simultaneous Synthesizer of Images and Annotations with Minimal Hu

Presentation - A Robust Volumetric Transformer for Accurate 3D Tumor Segmentation

Presentation - Closing the Generalization Gap of Cross-silo Federated Medical Image Segmentation

Presentation - Self-Supervised Pre-Training of Swim Transformers for 3D Medical Image Analysis

Presentation - DiRA -Discriminative, Restorative, and Adversarial Learning for Self supervised Medic

Presentation - Diffusion Models for Medical Anomaly Detection

Presentation - Rethinking Breast Lesion Segmentation in Ultrasound - New Video Dataset and Baseline

Presentation - Scribble 2D5 -Weakly Supervised Volumetric Image Segmentation via scribble Annotation

Presentation - UNETR++ Delving into Efficient and Accurate 3D Medical Image Segmentation

Presentation - mmFormer Multimodal Medical Transformer for Incomplete Multimodal Learning of Brain T

Presentation - GaitForeMer Self Supervised Pretraining of Transformers via Human Motion Forecasting