ICLR Paper: Learn Step Size Quantization

Показать описание

As deep networks are increasingly deployed in memory-constrained and throughput-critical systems, there is a need to create AI models that can maintain accuracy – and, as a result, trust – while also consuming fewer resources. Researchers at IBM’s Almaden Research Laboratory have reached a new milestone in AI precision and developed an algorithm that matches the inference accuracy of a 32-bit network while using only three bits.

The researchers achieved this level of energy efficiency using a new process called “learned step size quantization,” which improves parameter change estimates in a low-precision network during training, to produce better performance. The research also uncovered evidence that AI systems seeking to optimize performance on a given system might run with as few as 2 bits. This advance means AI systems are steadily coming closer to the low levels of energy consumed by the human brain, while maintaining performance.

Рекомендации по теме

ICLR Paper: Learn Step Size Quantization

ICLR Paper: Learn Step Size Quantization

Grokking: Generalization beyond Overfitting on small algorithmic datasets (Paper Explained)

Quantization step size

Fast and Slow Learning of Recurrent Independent Mechanisms (Machine Learning Paper Explained)

BackPACK for pyTorch

ICLR 2021 Keynote - 'Geometric Deep Learning: The Erlangen Programme of ML' - M Bronstein

Guillem Cucurull explains his ICLR 2018 paper

ICLR 2016 Best Paper Award: Deep Compression by Song Han

Reinforcement Learning with sparse rewards

Learning Rate Grafting: Transferability of Optimizer Tuning (Machine Learning Research Paper Review)

ICLR Paper: CLEVRER: Collision Events for Video Representation and Reasoning

[ICLR 2024 Outstanding Paper Winner] Protein Discovery with Discrete Walk-Jump Sampling

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale (Paper Explained)

Progressive Distillation for Fast Sampling of Diffusion Models (paper sumary)

Mamba: Linear-Time Sequence Modeling with Selective State Spaces (Paper Explained)

EigenGame PCA as a Nash Equilibrium | Outstanding Paper Award | ICLR 2021

What is multiset-equivariance? [ICLR 2022]

INR2Vec (ICLR 2023) with Luca De Luigi on Talking papers

Transfer Learning in GANs

Trends in Machine Learning at ICLR 2022 - Part 1,

Emtiyaz Khan - The Bayesian Learning Rule for Adaptive AI

Effect of Step Size on Accuracy | NUMERICAL SOLUTION for CE Problems: CDD Derivative Approximation

Trends in Machine Learning at ICLR 2022 - Brief Overview

Grammarly AI-NLP Club #11 - On the Stability of Fine-Tuning BERT