Distributed TensorFlow (TensorFlow Dev Summit 2017)

TensorFlow gives you the flexibility to scale up to hundreds of GPUs, train models with a huge number of parameters, and customize every last detail of the training process. In this talk, Derek Murray gives you a bottom-up introduction to Distributed TensorFlow, showing all the tools available for harnessing this power.
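
The talk centers on the TF 1.x distributed runtime, so here is a minimal, hedged sketch of the pieces it walks through (ClusterSpec, Server, replica_device_setter, MonitoredTrainingSession); the host names, ports, and model are made up for illustration:

```python
import tensorflow as tf

# Hypothetical cluster: one parameter-server task and two worker tasks.
cluster = tf.train.ClusterSpec({
    "ps":     ["ps0.example.com:2222"],
    "worker": ["worker0.example.com:2222", "worker1.example.com:2222"],
})

# Each process starts an in-process server for its own task.
server = tf.train.Server(cluster, job_name="worker", task_index=0)

# replica_device_setter pins variables to the ps job and ops to this worker.
with tf.device(tf.train.replica_device_setter(
        cluster=cluster, worker_device="/job:worker/task:0")):
    w = tf.get_variable("w", shape=[784, 10])
    # ... model, loss, and train_op would be built here ...

# MonitoredTrainingSession handles initialization and, on the chief, checkpointing.
with tf.train.MonitoredTrainingSession(master=server.target, is_chief=True) as sess:
    pass  # sess.run(train_op) in a training loop
```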

Comments

Contents
Objectives: 3:41
Intro: 4:00
DistBelief inspiration: 5:51
Replication: 7:55
In-graph replication: 8:21
Between-graph replication: 9:54
Variable placement: 11:17
Device placement summary: 14:39
Sessions and servers: 15:14
Fault tolerance: 18:51
High-level APIs: 25:08

AmilaManoj

22:10, how can the chief worker restore the failed PS tasks?
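
A hedged note on the fault-tolerance section around 18:51: as described there, the chief does not repair the PS process itself; once the PS task is restarted, the PS-hosted variables are restored from the last checkpoint. A minimal sketch, reusing the hypothetical server, task_index, and train_op from the setup above and a made-up checkpoint_dir:

```python
import tensorflow as tf

# Only the chief (task 0) writes checkpoints; after a PS failure, recovery
# re-initializes the PS-hosted variables from the latest checkpoint here.
with tf.train.MonitoredTrainingSession(
        master=server.target,
        is_chief=(task_index == 0),
        checkpoint_dir="/tmp/train_logs",
        save_checkpoint_secs=60) as sess:
    while not sess.should_stop():
        sess.run(train_op)
```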

xdxn

Nice talk. Any pointers to the presentation charts?

raghkripa

10:38
Is there any difference in the subgraph that each of these 2 tasks gets? Isn't it the same subgraph (output = ..., loss = ...) in both? Or how is it transformed into 2 (or maybe more) subgraphs?
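
A hedged sketch of the between-graph replication idea from 9:54: every worker runs the same client program, so the subgraph (output, loss, train_op) is built identically in each process; what differs is the worker_device string, and the variables behind the graph exist only once, on the ps tasks. FLAGS and cluster here are hypothetical:

```python
import tensorflow as tf

worker_device = "/job:worker/task:%d" % FLAGS.task_index  # differs per worker

with tf.device(tf.train.replica_device_setter(
        cluster=cluster, worker_device=worker_device)):
    x = tf.placeholder(tf.float32, [None, 784])
    y = tf.placeholder(tf.float32, [None, 10])
    w = tf.get_variable("w", [784, 10])   # placed on a /job:ps task, shared
    b = tf.get_variable("b", [10])        # likewise shared across workers
    output = tf.matmul(x, w) + b          # placed on this worker
    loss = tf.reduce_mean(
        tf.nn.softmax_cross_entropy_with_logits(labels=y, logits=output))
    train_op = tf.train.GradientDescentOptimizer(0.5).minimize(loss)
```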

redfishleo

What if I have images as input data for between-graph training on multiple nodes? Do I need to put the image database on each of those workers? Please guide me.
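
Not covered in the talk, but one common answer: keep the images on storage every worker can reach (NFS, GCS, HDFS) and have each worker read a disjoint shard, rather than copying the database to every node. A hedged sketch using the tf.data API from later TF 1.x releases; the paths and FLAGS names are made up:

```python
import tensorflow as tf

files = tf.data.Dataset.list_files("/shared/images/train-*.tfrecord")
dataset = (files
           .shard(FLAGS.num_workers, FLAGS.task_index)  # disjoint files per worker
           .interleave(tf.data.TFRecordDataset, cycle_length=4)
           .shuffle(buffer_size=10000)
           .batch(32))
next_batch = dataset.make_one_shot_iterator().get_next()
```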

thesawatdatta

A question: does Distributed TensorFlow use a round-robin algorithm?
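
Per the variable-placement section around 11:17: tf.train.replica_device_setter places variables across the ps tasks round-robin by default, and a load-balancing strategy can be passed instead. A hedged sketch using the TF 1.x contrib names, with a hypothetical two-ps cluster:

```python
import tensorflow as tf

# Greedy placement by variable byte size instead of the default round-robin.
strategy = tf.contrib.training.GreedyLoadBalancingStrategy(
    2, tf.contrib.training.byte_size_load_fn)

with tf.device(tf.train.replica_device_setter(
        cluster=cluster,            # hypothetical cluster with 2 ps tasks
        ps_strategy=strategy)):     # omit ps_strategy to keep round-robin
    w = tf.get_variable("w", [1000, 256])
    b = tf.get_variable("b", [256])
```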

jugsma

A clear explanation of multi-node, multi-GPU training with TensorFlow. I recommend watching it together with Akiba-san's ChainerMN explanation.

ryonakamura

Awesome stuff :)

Wisdom of Mycroft Holmes(Mark Gattis)

utkarsh_dubey

Just saying, dist-keras uses Spark too... sorry, just wanted to leave that here.

chefboyrdee