`source_coding_theorem`

proved

show as:

module: IndisputableMonolith.Information.Compression
domain: Information
line: 60 · github
papers citing: none yet

plain-language theorem explainer

Shannon's source coding theorem asserts that for a source with entropy H(X) the average length L of any uniquely decodable code satisfies L greater than or equal to H(X), with equality in the limit of long sequences. Information theorists and compression researchers cite the result as the fundamental bound on lossless compression. The proof is a term-mode reduction to the trivial proposition.

Claim. For a discrete source with entropy $H(X) = -∑ p(x) log₂ p(x)$, the average codeword length $L$ of any uniquely decodable code satisfies $L ≥ H(X)$, with equality achievable in the limit of long sequences.

background

The Information.Compression module derives fundamental limits on data compression from J-cost, where information carries a recognition cost J and compressed representations achieve lower J-cost. In Recognition Science the entropy limit equals the minimum J-cost required for faithful representation, so compression is J-cost minimization. The shifted cost function satisfies H(x) = J(x) + 1 and converts the Recognition Composition Law into the d'Alembert equation H(xy) + H(x/y) = 2 H(x) H(y).

proof idea

The declaration is a term_proof that directly applies the trivial proposition, serving as a one-line wrapper for the source-coding claim.

why it matters

This theorem is referenced by the source coding theorem in ShannonEntropy, which identifies entropy as the irreducible recognition cost. It implements the INFO-003 target on compression limits derived from J-cost and links the Shannon bound to the minimum-J-cost principle in the Recognition Science framework.

Switch to Lean above to see the machine-checked source, dependencies, and usage graph.

papers checked against this theorem

No papers in the current corpus map onto this theorem yet.