Accelerate Transformer inference with AWS Inferentia

preview_player

Показать описание

In this video, I show you how to accelerate Transformer inference with AWS Inferentia, a custom chip designed by AWS.

⭐️⭐️⭐️ Don't forget to subscribe to be notified of future videos ⭐️⭐️⭐️

Interested in hardware acceleration for Transformers? Check out my other videos :

Рекомендации по теме