arxiv: 1403.2433 · v1 · pith:JINHR4JZnew · submitted 2014-03-10 · 💻 cs.LG · stat.ML

Generalised Mixability, Constant Regret, and Bayesian Updating

classification 💻 cs.LG stat.ML
keywords: mixability, constant, divergence, regret, aggregating, algorithm, bounds, generalised

Mixability of a loss is known to characterise when constant regret bounds are achievable in games of prediction with expert advice through the use of Vovk's aggregating algorithm. We provide a new interpretation of mixability via convex analysis that highlights the role of the Kullback-Leibler divergence in its definition. This naturally generalises to what we call $\Phi$-mixability where the Bregman divergence $D_\Phi$ replaces the KL divergence. We prove that losses that are $\Phi$-mixable also enjoy constant regret bounds via a generalised aggregating algorithm that is similar to mirror descent.
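
As a concrete illustration of the classical setting the abstract starts from, the sketch below runs exponential-weights aggregation for binary log loss, a standard example of a mixable loss (with learning rate 1). In that special case the aggregated prediction is simply the weighted mixture of the experts' predictions, the weight update coincides with Bayes' rule, and the learner's regret against the best of n experts is at most ln n regardless of the number of rounds. This is not code from the paper: the function name `aggregate_log_loss`, the data layout, and the synthetic experts are all assumptions made for illustration, and the $\Phi$-mixable generalisation is not implemented.

```python
import math
import random


def aggregate_log_loss(expert_probs, outcomes):
    """Exponential-weights aggregation for binary log loss (learning rate 1).

    expert_probs[t][i] is the probability expert i assigns to outcome 1 in
    round t (assumed strictly between 0 and 1); outcomes[t] is 0 or 1.
    Returns (learner_loss, expert_losses). Hypothetical helper, not from the paper.
    """
    n = len(expert_probs[0])
    log_w = [-math.log(n)] * n  # uniform prior over experts, kept in log space
    learner_loss = 0.0
    expert_losses = [0.0] * n

    for probs, y in zip(expert_probs, outcomes):
        # Normalise the weights stably by subtracting the largest log-weight.
        m = max(log_w)
        w = [math.exp(lw - m) for lw in log_w]
        z = sum(w)
        w = [wi / z for wi in w]

        # For log loss with learning rate 1, the mixture of the experts'
        # predictions attains the mix loss exactly.
        p = sum(wi * pi for wi, pi in zip(w, probs))
        learner_loss += -math.log(p if y == 1 else 1.0 - p)

        # Bayesian update: multiply each weight by the expert's likelihood,
        # i.e. subtract its log loss from the log-weight.
        for i, pi in enumerate(probs):
            loss = -math.log(pi if y == 1 else 1.0 - pi)
            expert_losses[i] += loss
            log_w[i] -= loss

    return learner_loss, expert_losses


if __name__ == "__main__":
    rng = random.Random(0)
    rounds, n_experts = 200, 5
    outcomes = [rng.randint(0, 1) for _ in range(rounds)]
    expert_probs = [
        [rng.uniform(0.05, 0.95) for _ in range(n_experts)] for _ in range(rounds)
    ]
    total, per_expert = aggregate_log_loss(expert_probs, outcomes)
    regret = total - min(per_expert)
    print(f"regret = {regret:.4f}, bound ln(n) = {math.log(n_experts):.4f}")
```

The log-space weights are a design choice to avoid underflow over many rounds; the regret guarantee itself does not depend on it.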
