pith. sign in

arxiv: cs/0607014 · v1 · pith:5WA6RSAInew · submitted 2006-07-05 · 💻 cs.IT · math.IT

Strong Consistency of the Good-Turing Estimator

classification 💻 cs.IT math.IT
keywords totalblockdistributionestimatorgood-turinglengthprobabilitystring
0
0 comments X
read the original abstract

We consider the problem of estimating the total probability of all symbols that appear with a given frequency in a string of i.i.d. random variables with unknown distribution. We focus on the regime in which the block length is large yet no symbol appears frequently in the string. This is accomplished by allowing the distribution to change with the block length. Under a natural convergence assumption on the sequence of underlying distributions, we show that the total probabilities converge to a deterministic limit, which we characterize. We then show that the Good-Turing total probability estimator is strongly consistent.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.