What is LLM Quantization?

In this video, we learn about LLM quantization.

In particular, we learn about 5 things:

(1) How are LLM weights represented?
(2) What is quantization?
(3) Benefits and limits of quantization
(4) Mathematics of quantization
(5) What is the GGUF file format?
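
As a rough illustration of topics (2) and (4), here is a minimal sketch of symmetric 8-bit quantization in plain Python. It is not taken from the video: the function names `quantize_symmetric` and `dequantize` and the sample weights are assumptions made for this example, and real schemes such as those used in GGUF files typically quantize weights in small blocks rather than with one scale for everything.

```python
def quantize_symmetric(weights, num_bits=8):
    """Map floats to signed integers using a single shared scale factor."""
    qmax = 2 ** (num_bits - 1) - 1            # 127 for 8 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to [-qmax, qmax].
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate floats from the integers and the scale."""
    return [v * scale for v in q]


# Hypothetical weights, chosen only for illustration.
weights = [0.5, -1.27, 0.003, 1.0]
q, scale = quantize_symmetric(weights)
restored = dequantize(q, scale)

for w, qi, r in zip(weights, q, restored):
    print(f"{w:+.4f} -> {qi:+4d} -> {r:+.4f}")
```

Each weight is stored as one small integer plus a shared scale, which is where the memory saving comes from. The small value 0.003 collapses to 0, which shows the kind of precision loss that limits how far quantization can go.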

============================================================

(4) ML Teach by Doing playlist link:

============================================================

Vizuara

Comments

Thanks for the nice explanation. Please go into more depth on quantization, such as GGUF vs AWQ vs QLoRA, QAT vs PTQ, and symmetric vs asymmetric.

nikhiliyer