Accelerate Transformer inference with AWS Inferentia

In this video, I show you how to accelerate Transformer inference with AWS Inferentia, a custom chip designed by AWS.

⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️

Interested in hardware acceleration for Transformers? Check out my other videos: