Quantization in Neural Networks - Basics Explained | Affine and Symmetric Quantization

preview_player
Показать описание
This tutorial explains the basics behind different quantization approaches explaining the math and the intuitions. Explains how the mapping is done from float32 precision to int8 precision.

----------------------------------------------------------------------------------------------------------------

Reference materials for further reading.
----------------------------------------------------------------------------------------------------------------

BGM Credits
🔻
Song: "Sappheiros - Falling (Ft. eSoreni) [Chill]" is under a Creative Commons license (CC-BY)
🔺
Рекомендации по теме
Комментарии
Автор

Great tutorial! Could you share the third reference "Nvidia docs on Quantisation Basics"? The page not found. Thanks!

zhou
Автор

Cool! If you increased font size and showed actual use I think that'd add a lot of visibility to this video. The content and explanations are great.

jeffr_ac
Автор

I would prefer without the background music 🥲

ThePdcaster