Distributed Training of Deep Neural Networks: Theoretical and Practical Limits of Parallel Scalability

Franz-Josef Pfreundt; Janis Keuper

arxiv: 1609.06870 · v4 · pith:I7DTJFI4new · submitted 2016-09-22 · 💻 cs.CV

Distributed Training of Deep Neural Networks: Theoretical and Practical Limits of Parallel Scalability

Janis Keuper , Franz-Josef Pfreundt This is my paper

classification 💻 cs.CV

keywords trainingpracticaldeepdistributednetworksscalabilitytheoreticaladdition

0 comments

read the original abstract

This paper presents a theoretical analysis and practical evaluation of the main bottlenecks towards a scalable distributed solution for the training of Deep Neuronal Networks (DNNs). The presented results show, that the current state of the art approach, using data-parallelized Stochastic Gradient Descent (SGD), is quickly turning into a vastly communication bound problem. In addition, we present simple but fixed theoretic constraints, preventing effective scaling of DNN training beyond only a few dozen nodes. This leads to poor scalability of DNN training in most practical scenarios.

This paper has not been read by Pith yet.

Distributed Training of Deep Neural Networks: Theoretical and Practical Limits of Parallel Scalability

discussion (0)