pith. sign in

arxiv: 1504.08070 · v2 · pith:4D74DRNRnew · submitted 2015-04-30 · 💻 cs.IT · math.IT· math.ST· stat.TH

Universal Compression of Power-Law Distributions

classification 💻 cs.IT math.ITmath.STstat.TH
keywords distributionszipfexpectedredundancyalphadistributionhencenatural
0
0 comments X
read the original abstract

English words and the outputs of many other natural processes are well-known to follow a Zipf distribution. Yet this thoroughly-established property has never been shown to help compress or predict these important processes. We show that the expected redundancy of Zipf distributions of order $\alpha>1$ is roughly the $1/\alpha$ power of the expected redundancy of unrestricted distributions. Hence for these orders, Zipf distributions can be better compressed and predicted than was previously known. Unlike the expected case, we show that worst-case redundancy is roughly the same for Zipf and for unrestricted distributions. Hence Zipf distributions have significantly different worst-case and expected redundancies, making them the first natural distribution class shown to have such a difference.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.