Lecture 14 - Distributed Training and Gradient Compression (Part II) | MIT 6.S965

Lecture 14 covers the two communication bottlenecks of distributed training: bandwidth and latency. It introduces gradient compression, including gradient pruning and gradient quantization, to address the bandwidth bottleneck, and delayed gradient averaging to alleviate the latency problem.
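As a rough illustration of the gradient-pruning idea behind Deep Gradient Compression, the sketch below keeps only the largest-magnitude gradient entries and accumulates the rest locally for later rounds. It is a minimal, hedged example, not the lecture's reference implementation; the sparsity ratio and the residual (error-accumulation) buffer here are assumptions made for clarity.

```python
# Minimal sketch of top-k gradient sparsification (gradient pruning).
# Assumptions: a 99% sparsity target and a local residual buffer that
# accumulates the entries not transmitted in this round.
import torch

def sparsify_gradient(grad: torch.Tensor, residual: torch.Tensor, sparsity: float = 0.99):
    """Keep only the largest-magnitude entries; accumulate the rest locally.

    Returns (values, indices) to transmit and the updated residual buffer.
    """
    # Add back the locally accumulated (previously unsent) gradient.
    accumulated = grad + residual
    k = max(1, int(accumulated.numel() * (1.0 - sparsity)))
    # Select the top-k entries by magnitude.
    _, indices = torch.topk(accumulated.abs().flatten(), k)
    values = accumulated.flatten()[indices]
    # Everything not transmitted stays in the residual for the next round.
    new_residual = accumulated.flatten().clone()
    new_residual[indices] = 0.0
    return values, indices, new_residual.view_as(grad)

# Usage sketch: compress a fake gradient before communicating it.
grad = torch.randn(1_000_000)
residual = torch.zeros_like(grad)
values, indices, residual = sparsify_gradient(grad, residual, sparsity=0.99)
print(f"sending {values.numel()} of {grad.numel()} entries "
      f"({100 * values.numel() / grad.numel():.1f}% of the gradient)")
```

In practice the transmitted values can additionally be quantized to fewer bits, which is the gradient-quantization half of the bandwidth story covered in the lecture.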

Keywords: Distributed Training, Bandwidth, Latency, Deep Gradient Compression, Delayed Gradient Averaging
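For the latency side, a hedged sketch of the delayed gradient averaging idea follows: each worker updates with its local gradient immediately (so computation never waits on the network) and applies a correction once the stale, globally averaged gradient arrives a few steps later. The delay D, the toy parameters, and the simulated all-reduce are assumptions so the example runs standalone.

```python
# Sketch of delayed gradient averaging: update with the local gradient now,
# correct with the (stale) averaged gradient D steps later.
import collections
import torch

D = 4                                    # communication delay in steps (assumed)
lr = 0.1
param = torch.zeros(10)                  # toy parameter vector
in_flight = collections.deque()          # gradients awaiting the averaged result

def fake_all_reduce(local_grad: torch.Tensor) -> torch.Tensor:
    # Stand-in for an asynchronous all-reduce across workers; here the
    # "average" is just a noisy copy so the example is self-contained.
    return local_grad + 0.01 * torch.randn_like(local_grad)

for step in range(20):
    local_grad = torch.randn(10)         # pretend backward pass
    # 1) Update immediately with the local gradient (no waiting on the network).
    param -= lr * local_grad
    # 2) Launch the averaging for this step's gradient; the result is delayed.
    in_flight.append((local_grad, fake_all_reduce(local_grad)))
    # 3) Once the result from D steps ago is available, swap the old local
    #    contribution for the averaged one via a correction term.
    if len(in_flight) > D:
        old_local, old_avg = in_flight.popleft()
        param -= lr * (old_avg - old_local)
```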

------------------------------------------------------------------------------------

TinyML and Efficient Deep Learning Computing

Instructors:

Have you found it difficult to deploy neural networks on mobile and IoT devices? Have you ever found it too slow to train neural networks? This course is a deep dive into efficient machine learning techniques that enable powerful deep learning applications on resource-constrained devices. Topics cover efficient inference techniques, including model compression, pruning, quantization, neural architecture search, and distillation; efficient training techniques, including gradient compression and on-device transfer learning; application-specific model optimization techniques for videos, point clouds, and NLP; and efficient quantum machine learning. Students will get hands-on experience implementing deep learning applications on microcontrollers, mobile phones, and quantum machines through an open-ended design project related to mobile AI.

Website: