What is LLM Quantization?

In this video, we learn about LLM quantization.

In particular, we learn about 5 things:

(1) How are LLM weights represented?
(2) What is quantization?
(3) Benefits and limits of quantization
(4) Mathematics of quantization
(5) What is the GGUF file format?
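
As a rough illustration of topics (2) and (4), here is a minimal sketch of one common scheme, symmetric "absmax" 8-bit quantization. It is not necessarily the method shown in the video. The function names and the example weights are hypothetical and chosen only for illustration.

```python
def quantize_absmax_int8(weights):
    """Map floats to integers in [-127, 127] using a single scale factor."""
    max_abs = max(abs(w) for w in weights)
    # Guard against an all-zero list, where any scale would work.
    scale = max_abs / 127 if max_abs > 0 else 1.0
    # Round to the nearest integer and clamp to the signed 8-bit range.
    quantized = [max(-127, min(127, round(w / scale))) for w in weights]
    return quantized, scale


def dequantize(quantized, scale):
    """Recover approximate floats from the stored integers."""
    return [q * scale for q in quantized]


weights = [0.5, -1.27, 0.003, 1.0]  # hypothetical example weights
q, scale = quantize_absmax_int8(weights)
restored = dequantize(q, scale)

# The round-trip error per weight is at most about half of one step (scale / 2).
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(q, scale, max_error)
```

Each weight is stored as a small integer plus one shared scale, which is where the memory savings come from. The rounding error is where the accuracy cost comes from.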

============================================================

(4) ML Teach by Doing playlist link:

============================================================
Comments

Thanks for the nice explanation. Please go into more depth on quantization, e.g. GGUF vs AWQ vs QLoRA, QAT vs PTQ, and symmetric vs asymmetric.

nikhiliyer