Meta-Transformer: A Unified Framework for Multimodal Learning

The paper proposes Meta-Transformer, a framework for multimodal learning that processes multiple modalities without requiring paired multimodal training data. It performs unified learning across 12 modalities and reports promising results on a range of benchmarks. Code is available at the provided link.
