pith. sign in

arxiv: 1901.09665 · v1 · pith:LJZOKYJVnew · submitted 2019-01-28 · 🧮 math.ST · stat.TH

Exact Good-Turing characterization of the two-parameter Poisson-Dirichlet superpopulation model

classification 🧮 math.ST stat.TH
keywords good-turingbayesiannonparametricpoisson-dirichletsuperpopulationtwo-parameterestimationestimator
0
0 comments X
read the original abstract

Large sample size equivalence between the celebrated {\it approximated} Good-Turing estimator of the probability to discover a species already observed a certain number of times (Good, 1953) and the modern Bayesian nonparametric counterpart has been recently established by virtue of a particular smoothing rule based on the two-parameter Poisson-Dirichlet model. Here we improve on this result showing that, for any finite sample size, when the population frequencies are assumed to be selected from a superpopulation with two-parameter Poisson-Dirichlet distribution, then Bayesian nonparametric estimation of the discovery probabilities corresponds to Good-Turing {\it exact} estimation. Moreover under general superpopulation hypothesis the Good-Turing solution admits an interpretation as a modern Bayesian nonparametric estimator under partial information.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.