Laplace's rule of succession in information geometry

Yann Ollivier

arxiv: 1503.04304 · v1 · pith:7AQ3T4N2new · submitted 2015-03-14 · 💻 cs.IT · math.IT· math.ST· stat.TH

Laplace's rule of succession in information geometry

Yann Ollivier This is my paper

classification 💻 cs.IT math.ITmath.STstat.TH

keywords bayesianrulecorrespondsinformationlaplacelikelihoodmaximumobserved

0 comments

read the original abstract

Laplace's "add-one" rule of succession modifies the observed frequencies in a sequence of heads and tails by adding one to the observed counts. This improves prediction by avoiding zero probabilities and corresponds to a uniform Bayesian prior on the parameter. The canonical Jeffreys prior corresponds to the "add-one-half" rule. We prove that, for exponential families of distributions, such Bayesian predictors can be approximated by taking the average of the maximum likelihood predictor and the \emph{sequential normalized maximum likelihood} predictor from information theory. Thus in this case it is possible to approximate Bayesian predictors without the cost of integrating or sampling in parameter space.

This paper has not been read by Pith yet.

Laplace's rule of succession in information geometry

discussion (0)