pith. sign in

arxiv: 1401.3760 · v1 · pith:F2VXXOJInew · submitted 2014-01-15 · 💻 cs.IT · math.IT· stat.ME

Large Alphabet Compression and Predictive Distributions through Poissonization and Tilting

classification 💻 cs.IT math.ITstat.ME
keywords distributionsclassalphabetconditionlargemethodoptimalsize
0
0 comments X
read the original abstract

This paper introduces a convenient strategy for coding and predicting sequences of independent, identically distributed random variables generated from a large alphabet of size $m$. In particular, the size of the sample is allowed to be variable. The employment of a Poisson model and tilting method simplifies the implementation and analysis through independence. The resulting strategy is optimal within the class of distributions satisfying a moment condition, and is close to optimal for the class of all i.i.d distributions on strings of a given length. Moreover, the method can be used to code and predict strings with a condition on the tail of the ordered counts. It can also be applied to distributions in an envelope class.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.