High Performance Hardware for Distributed Deep Learning

In this video from Switzerland HPC Conference, Gaurav Kaul from Intel presents: High Performance Hardware for Distributed Deep Learning – System Benchmarking, Performance Optimization and Architecture for Scalable Systems.
With the recent success of deep learning and related techniques, we are beginning to see new specialized hardware, or extensions to existing architectures, dedicated to making training and inference computations faster, more energy efficient, or both. These technologies use either traditional CMOS on conventional von Neumann architectures such as CPUs, accelerators such as DSPs, GPUs, FPGAs, and ASICs, or novel technologies still in the research phase, such as neuromorphic computing. The overarching goal is to address a specific tradeoff in mapping machine learning algorithms in general, and deep learning in particular, to a specific underlying hardware technology. Conversely, there has been considerable empirical effort to devise deep network architectures that can be implemented efficiently on these novel hardware platforms. This also has implications for choosing appropriate hardware for inference, where energy and latency are the primary design goals. These efforts are gaining traction in the computer architecture community, which is examining effective building blocks for mapping deep neural networks to appropriate processing elements (existing or new) and code optimization techniques for existing architectures. This talk aims to tie together the seemingly disparate themes of co-design, neural network architecture, algorithms, and system architecture, and to bring together researchers at the interface of machine learning, hardware implementation, and systems to discuss the state of the art and the state of the possible.
The talk will focus on the following themes and present related work:
● How deep learning computations and algorithms are mapped to, and co-designed with, new processing and interconnect technologies such as the Intel Xeon Phi and the Intel Omni-Path fabric
● The tradeoffs in accuracy, computational complexity, hardware cost, energy efficiency, and application throughput currently being investigated in these approaches
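Systems of the kind the first theme describes, such as Xeon Phi nodes connected by an Omni-Path fabric, typically train in a data-parallel fashion: each node computes gradients on its shard of the data, then an allreduce collective averages them so every node applies the same update. As an illustrative sketch (not taken from the talk; plain Python stands in for the fabric's collective operation, which in practice would be MPI or a similar communication library):

```python
def allreduce_average(worker_grads):
    """Average per-worker gradient vectors, mimicking what an allreduce
    collective computes across all ranks in one data-parallel step."""
    n_workers = len(worker_grads)
    dim = len(worker_grads[0])
    return [sum(g[i] for g in worker_grads) / n_workers for i in range(dim)]

# Four simulated workers, each holding the local gradient from its data shard.
local_grads = [
    [0.25, 1.0],
    [0.75, 3.0],
    [0.50, 2.0],
    [1.00, 4.0],
]
avg = allreduce_average(local_grads)  # every worker ends up with [0.625, 2.5]
```

On a real fabric the averaging is done in-network by a ring or tree allreduce, so no single node gathers all gradients; the interconnect's bandwidth and latency therefore bound how well training scales, which is one of the tradeoffs the talk examines.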
Gaurav Kaul is a systems architect at Intel Corporation. He works extensively with life science customers in Europe and the Middle East on the design and deployment of computing infrastructure for the analysis of genomic data.