A new Hedging algorithm and its application to inferring latent random variables

Daniel Hsu; Yoav Freund

arxiv: 0806.4802 · v1 · submitted 2008-06-30 · 💻 cs.GT · cs.AI

A new Hedging algorithm and its application to inferring latent random variables

Yoav Freund , Daniel Hsu This is my paper

classification 💻 cs.GT cs.AI

keywords algorithmexpertscumulativediscountedgaininferringlatentlearning

0 comments

read the original abstract

We present a new online learning algorithm for cumulative discounted gain. This learning algorithm does not use exponential weights on the experts. Instead, it uses a weighting scheme that depends on the regret of the master algorithm relative to the experts. In particular, experts whose discounted cumulative gain is smaller (worse) than that of the master algorithm receive zero weight. We also sketch how a regret-based algorithm can be used as an alternative to Bayesian averaging in the context of inferring latent random variables.

This paper has not been read by Pith yet.

A new Hedging algorithm and its application to inferring latent random variables

discussion (0)