Model vs Data Parallelism in Machine Learning

preview_player
Показать описание

Рекомендации по теме
Комментарии
Автор

That was great, i specially liked it that you showed some simple code snippets.

moeinhasani
Автор

The first code sample I show is an example of model parallelism
The second code sample I show is an example of data parallelism

I want to discuss pipelining in another video

marksaroufim
Автор

Are the graphs correct? It looks like when the data or model size is increased the time converges to some value. It would mean that we can double data or model size without any increase in time.

adhithadias
Автор

Thanks, Mark! Can you tell what's being exchanged across nodes in either of these cases? My understanding is that in data parallelism the weights and biases are shared and eventually their averages are shared across all the nodes. Not sure what's being shared between nodes in model parallelism.

VivekSharma-tmkd