Accelerating Deep Learning by Focusing on the Biggest Losers

What if you could cut your network's training time by training only on the hard examples? This paper proposes selecting samples with high loss and training only on those in order to speed up training.

Abstract:
This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward pass to decide whether to use that example to compute gradients and update parameters, or to skip immediately to the next example. By reducing the number of computationally-expensive backpropagation steps performed, Selective-Backprop accelerates training. Evaluation on CIFAR10, CIFAR100, and SVHN, across a variety of modern image models, shows that Selective-Backprop converges to target error rates up to 3.5x faster than with standard SGD and between 1.02--1.8x faster than a state-of-the-art importance sampling approach. Further acceleration of 26% can be achieved by using stale forward pass results for selection, thus also skipping forward passes of low priority examples.
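
For a rough sense of the mechanism, here is a minimal PyTorch-style sketch (not the authors' implementation): a cheap no-grad forward pass scores every example in the batch, and the expensive forward/backward pass is run only on the highest-loss subset. The function name, the keep_frac parameter, and the simple top-k selection are illustrative assumptions; the paper instead selects each example probabilistically based on the percentile of its loss over recent history.

```python
import torch
import torch.nn.functional as F

def selective_backprop_step(model, optimizer, images, labels, keep_frac=0.25):
    """One training step that backpropagates only the hardest examples (sketch)."""
    # Cheap scoring pass: per-example loss without building an autograd graph.
    with torch.no_grad():
        losses = F.cross_entropy(model(images), labels, reduction="none")

    # Keep the highest-loss examples (simplified stand-in for the paper's
    # percentile-based probabilistic selection).
    k = max(1, int(keep_frac * images.size(0)))
    hard_idx = torch.topk(losses, k).indices

    # Expensive forward + backward only on the selected subset.
    optimizer.zero_grad()
    loss = F.cross_entropy(model(images[hard_idx]), labels[hard_idx])
    loss.backward()
    optimizer.step()
    return loss.item()
```

The "stale forward pass" variant mentioned in the abstract would additionally reuse loss scores from earlier passes, so even the scoring forward pass can be skipped for low-priority examples.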

Authors: Angela H. Jiang, Daniel L.-K. Wong, Giulio Zhou, David G. Andersen, Jeffrey Dean, Gregory R. Ganger, Gauri Joshi, Michael Kaminsky, Michael Kozuch, Zachary C. Lipton, Padmanabhan Pillai

Comments

There are so many ML papers these days that authors have to resort to click-baity titles.

What a time to be alive.

herp_derpingson

Aside from the hard example selection, is this identical to the RevNet technique for saving memory needed for backprop?

connor-shorten

This actually seems a lot like intrinsically motivated AI. The only difference is that those agents act to seek out inputs with high loss (or a large decrease in loss), instead of selecting neurons or examples within a batch during training.

simleek

I think this will be difficult for multi-GPU training, because each GPU runs a forward pass and then has to sync results across the whole node's batch for forward and backward, so it becomes a trade-off between the extra forward-pass and sync time and the backward-pass time saved by skipping samples.

guanfuchen

Won't it just overfit to the selected hard examples and underfit the easy ones?

sehbanomer

What resources do you recommend for starting with DL? Anything in R?

superkhoy

This approach seems like a derivative of boosting.

DrAhdol

Great paper, though why hasn't anyone thought about this before?

herp_derpingson