pith. sign in

arxiv: 0806.4802 · v1 · submitted 2008-06-30 · 💻 cs.GT · cs.AI

A new Hedging algorithm and its application to inferring latent random variables

classification 💻 cs.GT cs.AI
keywords algorithmexpertscumulativediscountedgaininferringlatentlearning
0
0 comments X
read the original abstract

We present a new online learning algorithm for cumulative discounted gain. This learning algorithm does not use exponential weights on the experts. Instead, it uses a weighting scheme that depends on the regret of the master algorithm relative to the experts. In particular, experts whose discounted cumulative gain is smaller (worse) than that of the master algorithm receive zero weight. We also sketch how a regret-based algorithm can be used as an alternative to Bayesian averaging in the context of inferring latent random variables.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.