Minimax rates of entropy estimation on large alphabets via best polynomial approximation

Pengkun Yang; Yihong Wu

arxiv: 1407.0381 · v3 · pith:ZJJ2PYHSnew · submitted 2014-07-01 · 💻 cs.IT · math.IT· math.ST· stat.TH

Minimax rates of entropy estimation on large alphabets via best polynomial approximation

Yihong Wu , Pengkun Yang This is my paper

classification 💻 cs.IT math.ITmath.STstat.TH

keywords fracentropyminimaxapproximationbestconsistentconstantestimation

0 comments

read the original abstract

Consider the problem of estimating the Shannon entropy of a distribution over $k$ elements from $n$ independent samples. We show that the minimax mean-square error is within universal multiplicative constant factors of $$\Big(\frac{k }{n \log k}\Big)^2 + \frac{\log^2 k}{n}$$ if $n$ exceeds a constant factor of $\frac{k}{\log k}$; otherwise there exists no consistent estimator. This refines the recent result of Valiant-Valiant \cite{VV11} that the minimal sample size for consistent entropy estimation scales according to $\Theta(\frac{k}{\log k})$. The apparatus of best polynomial approximation plays a key role in both the construction of optimal estimators and, via a duality argument, the minimax lower bound.

This paper has not been read by Pith yet.

Minimax rates of entropy estimation on large alphabets via best polynomial approximation

discussion (0)