Generalised Mixability, Constant Regret, and Bayesian Updating

Mark D. Reid; Rafael M. Frongillo; Robert C. Williamson

arxiv: 1403.2433 · v1 · pith:JINHR4JZnew · submitted 2014-03-10 · 💻 cs.LG · stat.ML

Generalised Mixability, Constant Regret, and Bayesian Updating

Mark D. Reid , Rafael M. Frongillo , Robert C. Williamson This is my paper

classification 💻 cs.LG stat.ML

keywords mixabilityconstantdivergenceregretaggregatingalgorithmboundsgeneralised

0 comments

read the original abstract

Mixability of a loss is known to characterise when constant regret bounds are achievable in games of prediction with expert advice through the use of Vovk's aggregating algorithm. We provide a new interpretation of mixability via convex analysis that highlights the role of the Kullback-Leibler divergence in its definition. This naturally generalises to what we call $\Phi$-mixability where the Bregman divergence $D_\Phi$ replaces the KL divergence. We prove that losses that are $\Phi$-mixable also enjoy constant regret bounds via a generalised aggregating algorithm that is similar to mirror descent.

This paper has not been read by Pith yet.

Generalised Mixability, Constant Regret, and Bayesian Updating

discussion (0)