Optimisation of Overparametrized Sum-Product Networks

Franz Pernkopf; Martin Trapp; Robert Peharz

arXiv: 1905.08196 · v2 · pith:C6SPX25Onew · submitted 2019-05-20 · cs.LG · stat.ML

Martin Trapp, Robert Peharz, Franz Pernkopf

keywords: networks, sum-product, deep, optimisation, compared, learning, parameter, shallow

It seems to be a pearl of conventional wisdom that parameter learning in deep sum-product networks is surprisingly fast compared to shallow mixture models. This paper examines the effects of overparameterization in sum-product networks on the speed of parameter optimisation. Using theoretical analysis and empirical experiments, we show that deep sum-product networks exhibit an implicit acceleration compared to their shallow counterparts. In fact, gradient-based optimisation in deep tree-structured sum-product networks is equivalent to gradient ascent with adaptive and time-varying learning rates and additional momentum terms.
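
The claim in the abstract can be illustrated with a small numerical sketch. It is not taken from the paper and rests on several assumptions: a two-level tree with unnormalised sum weights, fixed component densities, and hypothetical names (`collapse`, `deep_step`, `induced_first_order_update`). A root sum node with weights `a[i]` sits above two child sum nodes with weights `b[i, j]`, so the tree collapses to a shallow mixture with weights `w[i, j] = a[i] * b[i, j]`. One gradient-ascent step on `(a, b)` then moves `w` by the shallow gradient rescaled by terms that depend on the current weights, which is one way to read "adaptive learning rates". The sketch covers a single step to first order only; it does not reproduce the time-varying or momentum terms the abstract mentions.

```python
import numpy as np

def collapse(a, b):
    """Effective shallow mixture weights w[i, j] = a[i] * b[i, j]."""
    return a[:, None] * b

def shallow_log_likelihood(w, comp):
    """sum_n log sum_ij w[i, j] * comp[n, i, j]; comp holds fixed component densities."""
    return np.log((comp * w).sum(axis=(1, 2))).sum()

def deep_log_likelihood(a, b, comp):
    """Same quantity, evaluated bottom-up through the two-level tree."""
    child = (comp * b).sum(axis=2)   # (N, 2): value of each child sum node
    root = (child * a).sum(axis=1)   # (N,):   value of the root sum node
    return np.log(root).sum()

def grad_w(w, comp):
    """Gradient of the shallow log-likelihood with respect to w."""
    mix = (comp * w).sum(axis=(1, 2))
    return (comp / mix[:, None, None]).sum(axis=0)

def deep_step(a, b, comp, lr):
    """One gradient-ascent step on the deep parameters, using the chain rule."""
    g = grad_w(collapse(a, b), comp)
    g_a = (g * b).sum(axis=1)        # dL/da_i  = sum_j b_ij * dL/dw_ij
    g_b = a[:, None] * g             # dL/db_ij = a_i * dL/dw_ij
    return a + lr * g_a, b + lr * g_b

def induced_first_order_update(a, b, comp, lr):
    """First-order change in w caused by deep_step: a rescaled shallow gradient."""
    g = grad_w(collapse(a, b), comp)
    return lr * (a[:, None] ** 2 * g + b * (g * b).sum(axis=1, keepdims=True))

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    comp = rng.uniform(0.1, 1.0, size=(20, 2, 2))  # made-up densities, illustration only
    a = rng.uniform(0.5, 1.0, size=2)
    b = rng.uniform(0.5, 1.0, size=(2, 2))
    a_new, b_new = deep_step(a, b, comp, lr=1e-5)
    actual = collapse(a_new, b_new) - collapse(a, b)
    predicted = induced_first_order_update(a, b, comp, lr=1e-5)
    print("max gap between actual and first-order update:", np.abs(actual - predicted).max())
```

A plain gradient step on the shallow weights would move `w` by `lr * grad_w(w, comp)`. Here each entry of that gradient is instead scaled by `a[i] ** 2` and mixed with the other gradients of the same child through `b`, so the effective step size changes as the weights change.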
