
arxiv: cond-mat/0203467 · v1 · pith:FBTMJ7EFnew · submitted 2002-03-22 · ❄️ cond-mat.stat-mech

Guessing probability distributions from small samples

classification ❄️ cond-mat.stat-mech
keywords: distribution, method, probability, sample, yields, approximated, approximation, calculation

We propose a new method for calculating statistical properties, such as the entropy, of unknown generators of symbolic sequences. The probability distribution p(k) of the elements k of a population can be approximated by the frequencies f(k) of a sample, provided the sample is long enough that each element k occurs many times. Our method yields an approximation even when this precondition does not hold. For a given f(k), we recalculate the Zipf-ordered probability distribution by optimizing the parameters of a guessed distribution. We demonstrate that our method yields reliable results.
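The idea of fitting a guessed distribution to Zipf-ordered frequencies can be illustrated with a minimal Python sketch. It is not the procedure from the paper, whose details the abstract does not give. The sketch assumes a one-parameter power-law family p(k) ∝ k^(-alpha) over m symbols, estimates the expected rank-ordered frequencies of a sample of length n by Monte Carlo simulation, and picks alpha from a grid by least squares. All function names (zipf_probs, rank_frequencies, expected_rank_frequencies, fit_alpha, entropy) and all parameter values are hypothetical.

```python
import math
import random
from collections import Counter


def zipf_probs(alpha, m):
    """Assumed guess family: p(k) proportional to k**(-alpha), k = 1..m."""
    weights = [k ** -alpha for k in range(1, m + 1)]
    total = sum(weights)
    return [w / total for w in weights]


def rank_frequencies(sample, m):
    """Zipf-ordered (descending) relative frequencies, zero-padded to length m."""
    counts = sorted(Counter(sample).values(), reverse=True)
    counts += [0] * (m - len(counts))
    n = len(sample)
    return [c / n for c in counts]


def expected_rank_frequencies(alpha, m, n, trials, rng):
    """Monte Carlo estimate of the mean rank-ordered frequencies for samples of length n."""
    probs = zipf_probs(alpha, m)
    acc = [0.0] * m
    for _ in range(trials):
        sample = rng.choices(range(m), weights=probs, k=n)
        for i, f in enumerate(rank_frequencies(sample, m)):
            acc[i] += f
    return [a / trials for a in acc]


def fit_alpha(observed, m, n, alphas, trials=200, seed=0):
    """Grid search: choose the alpha whose simulated rank frequencies best match the observed ones."""
    best_alpha, best_err = None, float("inf")
    for alpha in alphas:
        rng = random.Random(seed)  # same seed per candidate keeps the comparison less noisy
        expected = expected_rank_frequencies(alpha, m, n, trials, rng)
        err = sum((e - o) ** 2 for e, o in zip(expected, observed))
        if err < best_err:
            best_alpha, best_err = alpha, err
    return best_alpha


def entropy(probs):
    """Shannon entropy in nats; zero-probability terms contribute nothing."""
    return -sum(p * math.log(p) for p in probs if p > 0)


if __name__ == "__main__":
    m, n, true_alpha = 50, 40, 1.0  # illustrative values: sample shorter than the alphabet
    rng = random.Random(1)
    sample = rng.choices(range(m), weights=zipf_probs(true_alpha, m), k=n)
    observed = rank_frequencies(sample, m)
    alpha_hat = fit_alpha(observed, m, n, alphas=[0.25 * i for i in range(9)])
    print("plug-in entropy from f(k):", entropy(observed))
    print("entropy of fitted guess:  ", entropy(zipf_probs(alpha_hat, m)))
    print("entropy of true generator:", entropy(zipf_probs(true_alpha, m)))
```

The plug-in entropy computed directly from f(k) can never exceed log(n), which is why short samples tend to underestimate the entropy of a generator with many symbols. Comparing in rank order avoids having to know which symbol corresponds to which probability, at the cost of requiring a simulation (or an analytic expression) for the expected ordered frequencies.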
