GoSGD: Distributed Optimization for Deep Learning with Gossip Exchange

David Picard; Matthieu Cord; Michael Blot

arxiv: 1804.01852 · v2 · pith:REQM6TSZnew · submitted 2018-04-04 · 💻 cs.LG · stat.ML


keywords: distributed, gosgd, gossip, gradient, method, optimization, threads, adapted


We address the issue of speeding up the training of convolutional neural networks by studying a distributed method adapted to stochastic gradient descent (SGD). Our parallel optimization setup uses several threads, each performing its own gradient descent steps on a local variable. We propose a new way of sharing information between threads, based on gossip algorithms, which show good consensus convergence properties. Our method, called GoSGD, has the advantage of being fully asynchronous and decentralized.
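
The abstract does not spell out the exchange rule, so the following is only a minimal sketch of the general idea, assuming a push-sum style weighted gossip average. Workers are simulated sequentially in one process rather than run as real threads, and every name here (`gossip_sgd`, `noisy_quadratic_grad`, `p_exchange`, the mixing weights `w`) is a hypothetical choice made for illustration, not taken from the paper.

```python
import random


def gossip_sgd(grad, dim, workers=4, steps=2000, lr=0.1, p_exchange=0.2, seed=0):
    """Simulate local SGD steps with random pairwise gossip exchanges.

    Assumption: a push-sum style weighted average is used for mixing.
    This is an illustrative stand-in, not the paper's exact update.
    """
    rng = random.Random(seed)
    # Each worker keeps its own local variable x[i] and a mixing weight w[i].
    x = [[rng.uniform(-1.0, 1.0) for _ in range(dim)] for _ in range(workers)]
    w = [1.0 / workers] * workers

    for _ in range(steps):
        # One worker wakes up at a time (a stand-in for asynchrony).
        i = rng.randrange(workers)

        # Local stochastic gradient step on the worker's own variable.
        g = grad(x[i], rng)
        x[i] = [xi - lr * gi for xi, gi in zip(x[i], g)]

        # With some probability, push (x[i], half of w[i]) to a random peer.
        if rng.random() < p_exchange:
            j = rng.choice([k for k in range(workers) if k != i])
            w[i] /= 2.0
            total = w[j] + w[i]
            # The receiver mixes the incoming variable into its own.
            x[j] = [(w[j] * xj + w[i] * xi) / total for xj, xi in zip(x[j], x[i])]
            w[j] = total

    return x, w


def noisy_quadratic_grad(x, rng):
    """Noisy gradient of 0.5 * ||x - 1||^2 (a toy objective)."""
    return [xi - 1.0 + rng.gauss(0.0, 0.01) for xi in x]


if __name__ == "__main__":
    xs, ws = gossip_sgd(noisy_quadratic_grad, dim=3)
    for i, xi in enumerate(xs):
        print(i, [round(v, 3) for v in xi])
```

In this sketch no central parameter server appears: each exchange involves only a sender and one randomly chosen receiver, and the sender never waits for a reply. The mixing weights always sum to one, which is what keeps the weighted average across workers well defined.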
