High-Performance Communication Strategies in Parallel and Distributed Deep Learning

Recorded talk [best effort].

Speaker: Torsten Hoefler
Conference: DFN Webinar
Abstract: Deep Neural Networks (DNNs) are becoming an important tool in modern computing applications. Accelerating their training is a major challenge, and techniques range from distributed algorithms to low-level circuit design. In this talk, we describe the problem from a theoretical perspective, followed by approaches for its parallelization.

Specifically, we present trends in DNN architectures and the resulting implications for parallelization strategies. We discuss the different types of concurrency in DNNs; synchronous and asynchronous stochastic gradient descent; distributed system architectures; communication schemes; and performance modeling. Based on these approaches, we extrapolate potential directions for parallelism in deep learning.

Scalable Parallel Computing Lab, SPCL @ ETH Zurich
