Meta-Transformer: A Unified Framework for Multimodal Learning #ai #aiengineer #computervision

preview_player
Показать описание
In this video 📝 we are going to take a look at the new meta-transformer model for multiple inputs. Meta-transformer is a unified framework for multimodal learning and can take both images, video, text, audio, sensor data and so on as inputs. We are going to go through their project page, github repo and look at the model architecture and results.

If you’re looking for courses and to extend your knowledge even more, check out this link here:

If you enjoyed this video, be sure to press the 👍 button so that I know what content you guys like to see.

_______________________________________________________________

🧑🏻‍💻 My AI and Computer Vision Courses⭐:

_______________________________________________________________

📞 Connect with Me:

_______________________________________________________________
tags:
#transformer #metatransformer #multimodal
Рекомендации по теме
Комментарии
Автор

Can't wait for a practical example. Thanks for your videos 😊

zakariaabderrahmanesadelao