pith. machine review for the scientific record.

arxiv: physics/0307138 · v2 · submitted 2003-07-29 · ⚛️ physics.data-an · physics.comp-ph

Entropy Estimates from Insufficient Samplings

keywords: estimators, biases, entropy, points, regime, sampling, they, advantage

We present a detailed derivation of some estimators of Shannon entropy for discrete distributions. They hold for finite samples of N points distributed into M "boxes", with N and M → ∞ but N/M < ∞. In the high sampling regime (>> 1 points in each box) they have exponentially small biases. In the low sampling regime the errors increase but are still much smaller than for most other estimators. One advantage is that our main estimators are given analytically, with explicitly known analytical formulas for the biases.
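
To make the two sampling regimes in the abstract concrete, here is a minimal Python sketch of the bias problem that such estimators address. It does not implement the paper's estimators. It uses the standard naive plug-in estimate and the well-known Miller-Madow first-order correction as stand-ins, and the uniform test distribution, the function names, and the values of M, N, and the trial count are all illustrative assumptions.

```python
import math
import random
from collections import Counter


def plugin_entropy(counts, n):
    """Naive plug-in estimate -sum (n_i/N) ln(n_i/N), in nats."""
    return -sum((c / n) * math.log(c / n) for c in counts if c > 0)


def miller_madow_entropy(counts, n):
    """Plug-in estimate plus the first-order correction (K - 1) / (2N),
    where K is the number of occupied boxes. Not the paper's estimator."""
    k = sum(1 for c in counts if c > 0)
    return plugin_entropy(counts, n) + (k - 1) / (2 * n)


def mean_estimates(m, n, trials=200, seed=0):
    """Average both estimates over repeated samples of n points
    drawn uniformly from m boxes (an assumed test distribution)."""
    rng = random.Random(seed)
    naive = corrected = 0.0
    for _ in range(trials):
        counts = list(Counter(rng.randrange(m) for _ in range(n)).values())
        naive += plugin_entropy(counts, n)
        corrected += miller_madow_entropy(counts, n)
    return naive / trials, corrected / trials


M = 100                          # number of boxes (illustrative)
true_h = math.log(M)             # exact entropy of the uniform distribution
low = mean_estimates(M, n=50)    # low sampling: N/M = 0.5
high = mean_estimates(M, n=5000) # high sampling: N/M = 50

print(f"true entropy           : {true_h:.4f}")
print(f"low  (naive, corrected): {low[0]:.4f}, {low[1]:.4f}")
print(f"high (naive, corrected): {high[0]:.4f}, {high[1]:.4f}")
```

With N/M = 0.5 the plug-in estimate cannot exceed ln N, so it falls well short of ln M, and the simple correction recovers only part of the gap. With N/M = 50 both estimates sit close to the true value. The low sampling case is where the choice of estimator matters most.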

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. SENECA: Small-Sample Discrete Entropy Estimation via Self-Consistent Missing Mass

    cs.IT · 2026-05 · unverdicted · novelty 7.0

    SENECA uses a novel self-consistent missing mass calculation to improve discrete entropy estimates in small-sample regimes and outperforms alternatives in numerical tests.