Distributed Training with PyTorch on Piz Daint - Day 3

preview_player
Показать описание
The Piz Daint supercomputer at CSCS provides an ideal platform for supporting intensive deep learning workloads as it comprises thousands of Tesla GPU compute nodes communicating through a high-speed interconnect. In this three-day course, we will look at how to run distributed deep learning workloads with PyTorch on Piz Daint. This course is an update from last years’s workshop Efficient and Distributed Training with TensorFlow on Piz Daint. In this year's edition we will be using the PyTorch software ecosystem.

The main focus of the course will be training modules with PyTorch taking advantage of Piz Daint's setup of multiple single-gpu nodes. As a consequence, this course is addressed to scientists who are planning or are already engaged in intensive machine learning workloads and wish to start using PyTorch on Piz Daint.
Рекомендации по теме