Beyond Gradient Descent for Regularized Segmentation Losses

Dmitrii Marin; Ismail Ben Ayed; Meng Tang; Yuri Boykov

arxiv: 1809.02322 · v2 · pith:NHPWD5QZnew · submitted 2018-09-07 · 💻 cs.LG · stat.ML

Beyond Gradient Descent for Regularized Segmentation Losses

Dmitrii Marin , Meng Tang , Ismail Ben Ayed , Yuri Boykov This is my paper

classification 💻 cs.LG stat.ML

keywords losssegmentationarchitecturesdescentfunctiongradientnetworkoptimization

0 comments

read the original abstract

The simplicity of gradient descent (GD) made it the default method for training ever-deeper and complex neural networks. Both loss functions and architectures are often explicitly tuned to be amenable to this basic local optimization. In the context of weakly-supervised CNN segmentation, we demonstrate a well-motivated loss function where an alternative optimizer (ADM) achieves the state-of-the-art while GD performs poorly. Interestingly, GD obtains its best result for a "smoother" tuning of the loss function. The results are consistent across different network architectures. Our loss is motivated by well-understood MRF/CRF regularization models in "shallow" segmentation and their known global solvers. Our work suggests that network design/training should pay more attention to optimization methods.

This paper has not been read by Pith yet.

Beyond Gradient Descent for Regularized Segmentation Losses

discussion (0)