A Finite-State Gibbs Construction from a Recognition Cost

arxiv: 2605.15667 · v1 · pith:SDZQA7LYnew · submitted 2026-05-15 · ❄️ cond-mat.stat-mech · cs.IT· math-ph· math.IT· math.MP

A Finite-State Gibbs Construction from a Recognition Cost

Megan Simons , Jonathan Washburn This is my paper

Pith reviewed 2026-05-19 19:59 UTC · model grok-4.3

classification ❄️ cond-mat.stat-mech cs.ITmath-phmath.ITmath.MP

keywords Gibbs distributionRecognition Composition Lawratio costconvex dualityKullback-Leibler divergencefinite state spacemultinomial countingstatistical mechanics

0 comments p. Extension

pith:SDZQA7LY Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{SDZQA7LY}

Prints a linked pith:SDZQA7LY badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

The pith

A ratio-cost construction from the Recognition Composition Law induces the standard Gibbs distribution on finite states.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops an alternative route to the Gibbs distribution on a finite outcome space by starting from a recognition cost rather than an externally supplied energy. After defining the Recognition Composition Law as the normalized d'Alembert degree-two closure J(x) = 1/2(x + x^{-1}) - 1 with unit log-curvature calibration, the induced cost vector allows multinomial counting and convex duality to recover both the Gibbs weights and the relation that the difference in recognition free energies equals temperature times the Kullback-Leibler divergence. A reader would care if this shows that the familiar Boltzmann form can emerge from ratio-based costs and counting arguments without presupposing maximum entropy. The work supplies supporting technical results such as a non-asymptotic Stirling bound and includes an explicit three-state comparison to other cost models.

Core claim

Given the induced cost vector X_ω = J(r_ω) from the RCL, multinomial counting and convex duality recover the finite-state Gibbs weights and the identity F_R(q) - F_R(p) = T_R D_KL(q || p). The entropy-maximization steps are classical once the cost is fixed. New technical content includes a non-asymptotic Stirling bound and soft-shell constrained-type theorems for real-valued costs.

What carries the argument

The Recognition Composition Law with J(x) = ½(x + x^{-1}) - 1, which turns reference ratios r_ω into the cost vector X_ω that drives the recovery of Gibbs weights via counting and duality.

If this is right

The entropy maximization procedure yields the Gibbs law in the usual way after the RCL cost is substituted for energy.
Non-asymptotic Stirling bounds apply to the multinomial coefficients with real-valued costs.
Soft-shell constrained-type theorems characterize the large-deviation behavior for these costs.
The three-state example demonstrates how the RCL Gibbs law differs from squared-log, affinity, and Tsallis forms at equal mean cost, along with sample-size power estimates.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If the axioms hold more generally, similar cost-based derivations might apply to other equilibrium distributions beyond finite states.
The free-energy to KL identity provides a direct bridge between recognition costs and information-theoretic measures that could be explored in inference problems.
A natural extension would be to examine whether physical systems with measurable ratio costs exhibit the predicted sample-size scaling in their fluctuation statistics.

Load-bearing premise

The framework requires that axioms (A1) through (A3) are accepted without further derivation and that the normalized d'Alembert degree-two closure is the appropriate Recognition Composition Law with unit log-curvature calibration.

What would settle it

A direct computation or measurement in a finite-state system showing that the frequencies minimizing the recognition free energy at fixed mean cost do not coincide with the multinomial-derived Gibbs probabilities would falsify the central recovery claim.

Figures

Figures reproduced from arXiv: 2605.15667 by Jonathan Washburn, Megan Simons.

**Figure 1.** Figure 1: Quadratic lower bound on J. The RCL cost J(x) = 1 2 (x + x −1 ) − 1 (solid curve) and its quadratic surrogate (log x) 2/2 (dashed curve). The two curves share a common second-order expansion at x = 1; the gap grows for x ≪ 1 or x ≫ 1, where J(e t ) grows like 1 2 e |t| in the log-coordinate t = log x (equivalently, J(x) grows linearly in max(x, 1/x)), while (log x) 2/2 grows only quadratically in |t|. line… view at source ↗

**Figure 2.** Figure 2: Gibbs weights pk as a function of the target mean cost ⟨X⟩ for the threestate example with costs X = (0, 0.5, 1), plotted over the positive-temperature range Emc ∈ (0, 1 2 ) in which β > 0 (Proposition 5.2); Emc denotes the same constraint as the column ⟨X⟩ (target) in [PITH_FULL_IMAGE:figures/full_fig_p038_2.png] view at source ↗

read the original abstract

On a finite outcome space, the canonical Gibbs distribution is usually obtained by maximizing Shannon entropy at fixed mean of an externally supplied energy functional. This paper studies the finite-state consequences of a ratio-cost construction instead: after adopting the normalized d'Alembert degree-two closure called the Recognition Composition Law (RCL), with unit log-curvature calibration at the reference ratio, the continuous nontrivial positive branch is $J(x)=\tfrac12(x+x^{-1})-1=\cosh(\log x)-1$. Given the induced cost vector $X_\omega=J(r_\omega)$, multinomial counting and convex duality recover the finite-state Gibbs weights and the identity $F_{\mathrm{R}}(q)-F_{\mathrm{R}}(p)=T_{\mathrm{R}}\,D_{\mathrm{KL}}(q\Vert p)$; the entropy-maximization steps are classical once the cost is fixed. New technical content includes a non-asymptotic Stirling bound and soft-shell constrained-type theorems for real-valued costs. A three-state example compares the Gibbs law to squared-log, affinity-as-energy, and Tsallis alternatives at the same cost vector and mean-cost constraint, with sample-size power calculations at fixed RCL ground truth. The framework is conditional on axioms (A1)--(A3) and restricted to finite outcome spaces with strictly positive weights; it does not derive the composition law from a more primitive principle.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper recovers the standard finite-state Gibbs weights and a KL identity from a ratio cost induced by an assumed Recognition Composition Law, plus some new technical bounds.

read the letter

The main takeaway is that once the cost vector is defined from the specific J(x) coming out of the normalized d'Alembert closure, ordinary multinomial counting and convex duality give back the usual Gibbs form and the free-energy difference equal to temperature times KL divergence. That recovery step is classical. The paper is upfront that the construction sits on top of axioms (A1)-(A3) that are not derived from anything more primitive, so the novelty is really in the choice of cost rather than a fresh derivation of Gibbs from ratio costs alone. The unit log-curvature calibration is an extra input as well. This keeps the result conditional on accepting that particular composition law. The work stays restricted to finite spaces with strictly positive weights, which is also stated clearly. What the paper does add is a non-asymptotic Stirling bound and some soft-shell constrained-type results for real-valued costs. Those look like usable technical tools. The three-state example is straightforward and lets you see how the Gibbs law compares to squared-log, affinity-as-energy, and Tsallis alternatives at fixed mean cost, with power calculations at fixed ground truth. That concrete check is helpful for seeing the differences in practice. The citation pattern is light and mostly points to the standard entropy-maximization literature, which fits the scope. The math appears internally consistent once the axioms are granted, and the claims do not over-reach what the derivations support. This is the kind of paper that would interest people working on alternative cost or information-theoretic foundations for equilibrium on small discrete spaces, or anyone who might apply the new bounds. It has enough formal content and a clear enough structure that it deserves a serious referee rather than a desk reject, even though the axioms will probably be the main point of discussion in review.

Referee Report

2 major / 2 minor

Summary. The paper claims that on a finite outcome space, adopting the normalized d'Alembert degree-two closure as the Recognition Composition Law (RCL) with unit log-curvature calibration yields the cost function J(x) = ½(x + x^{-1}) - 1. Given the induced cost vector X_ω = J(r_ω), standard multinomial counting and convex duality then recover the finite-state Gibbs weights together with the identity F_R(q) - F_R(p) = T_R D_KL(q || p). New technical contributions include a non-asymptotic Stirling bound and soft-shell constrained-type theorems; a three-state example compares the resulting Gibbs law to squared-log, affinity-as-energy, and Tsallis alternatives under the same mean-cost constraint, with accompanying sample-size power calculations. The framework is explicitly conditional on axioms (A1)-(A3) and restricted to finite spaces with strictly positive weights.

Significance. If the adopted RCL and axioms are accepted, the work supplies an alternative route to the canonical Gibbs distribution that begins from a ratio-cost functional rather than an externally supplied energy. It supplies concrete new technical tools (non-asymptotic Stirling bound, soft-shell theorems) and a worked three-state comparison that quantifies distinguishability from other distributions at fixed cost vector. These elements could be useful for finite-state models in which ratio-based costs arise naturally.

major comments (2)

[Abstract] Abstract and the section defining the RCL: the recovery of the Gibbs weights and the KL identity is obtained by classical multinomial counting plus convex duality once the cost vector X_ω = J(r_ω) is fixed by the chosen J. Because the paper states that axioms (A1)-(A3) are adopted rather than derived from a more primitive principle, the central claim is an equivalence under a specific cost rather than a derivation of the Gibbs measure from ratio costs alone. A load-bearing justification or independent motivation for this particular composition law is therefore required.
[Section on non-asymptotic Stirling bound] Section presenting the non-asymptotic Stirling bound: the abstract highlights this bound as new technical content, yet the reader's assessment notes that soundness cannot be confirmed without the full derivation or verification. If the bound is used to support the finite-state construction, an explicit statement of the error term and its dependence on the cost vector should be supplied.

minor comments (2)

[Introduction] Notation for F_R, T_R and the reference ratio should be introduced with a single consolidated definition early in the text to avoid repeated cross-referencing.
[Three-state example] The three-state example would benefit from an explicit table listing the numerical values of the cost vector, mean-cost constraint, and resulting probabilities for each alternative distribution.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and describe the revisions we will make to strengthen the presentation.

read point-by-point responses

Referee: [Abstract] Abstract and the section defining the RCL: the recovery of the Gibbs weights and the KL identity is obtained by classical multinomial counting plus convex duality once the cost vector X_ω = J(r_ω) is fixed by the chosen J. Because the paper states that axioms (A1)-(A3) are adopted rather than derived from a more primitive principle, the central claim is an equivalence under a specific cost rather than a derivation of the Gibbs measure from ratio costs alone. A load-bearing justification or independent motivation for this particular composition law is therefore required.

Authors: We agree that the central contribution is an equivalence obtained once the normalized d'Alembert degree-two RCL is adopted together with axioms (A1)-(A3), rather than a derivation of the RCL itself from more primitive axioms. The manuscript already states that it does not derive the composition law from a more primitive principle. In revision we will add a dedicated paragraph in the RCL-definition section that supplies independent motivation for this particular closure: it arises naturally when costs are defined on ratios (as in certain recognition or relative-likelihood models) and yields a strictly convex J that recovers the classical Gibbs form and KL identity via standard multinomial counting and convex duality. We will also clarify the scope of the claim in the abstract. revision: yes
Referee: [Section on non-asymptotic Stirling bound] Section presenting the non-asymptotic Stirling bound: the abstract highlights this bound as new technical content, yet the reader's assessment notes that soundness cannot be confirmed without the full derivation or verification. If the bound is used to support the finite-state construction, an explicit statement of the error term and its dependence on the cost vector should be supplied.

Authors: We accept the point that the non-asymptotic Stirling bound is advertised as new technical content and that its derivation must be verifiable. In the revised manuscript we will move the full proof to a self-contained appendix, state the explicit error term (including its dependence on the cost vector X_ω and on sample size n), and add a short remark on how the bound is applied in the soft-shell theorems. This will allow readers to assess soundness directly. revision: yes

Circularity Check

0 steps flagged

No circularity; derivation conditional on explicit axioms but self-contained

full rationale

The paper states that it adopts axioms (A1)-(A3) to define the Recognition Composition Law without deriving the composition law from a more primitive principle, then applies the resulting J(x) to induce the cost vector X_ω = J(r_ω). From there it invokes classical multinomial counting and convex duality to recover the Gibbs weights and the identity F_R(q) - F_R(p) = T_R D_KL(q || p). This recovery is presented as standard once the cost is fixed, with no equation reducing the output to the input by construction, no fitted parameter renamed as prediction, and no load-bearing self-citation. The framework is therefore an equivalence under chosen axioms rather than a tautological loop, making the derivation self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

The paper introduces the RCL as a new entity without independent evidence or derivation from primitives, relies on three unspecified axioms (A1)-(A3), and uses one calibration choice; the recovery of Gibbs is performed with classical tools once the cost is fixed.

free parameters (1)

unit log-curvature calibration at the reference ratio
Normalization of the d'Alembert degree-two closure is set to unit log-curvature to obtain the specific J(x).

axioms (1)

domain assumption Axioms (A1)--(A3)
The framework is explicitly conditional on these axioms, which are not derived from a more primitive principle.

invented entities (1)

Recognition Composition Law (RCL) no independent evidence
purpose: To define the cost functional J(x) = ½(x + x^{-1}) - 1 for the ratio-cost construction.
Introduced as the normalized d'Alembert degree-two closure; no independent evidence or derivation from primitives is supplied.

pith-pipeline@v0.9.0 · 5782 in / 1787 out tokens · 63494 ms · 2026-05-19T19:59:57.100579+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel matches

?

matches
MATCHES: this paper passage directly uses, restates, or depends on the cited Recognition theorem or module.

after adopting the normalized d’Alembert degree-two closure called the Recognition Composition Law (RCL), with unit log-curvature calibration at the reference ratio, the continuous nontrivial positive branch is J(x)=½(x+x^{-1})−1=cosh(logx)−1
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean absolute_floor_iff_bare_distinguishability echoes

?

echoes
ECHOES: this paper passage has the same mathematical shape or conceptual pattern as the Recognition theorem, but is not a direct formal dependency.

Given the induced cost vector X_ω = J(r_ω), multinomial counting and convex duality recover the finite-state Gibbs weights and the identity F_R(q) - F_R(p) = T_R D_KL(q || p)

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

47 extracted references · 47 canonical work pages · 1 internal anchor

[1]

Ludwig Boltzmann. Über die Beziehung zwischen dem zweiten Hauptsatze der mechanischen Wärmetheorie und der Wahrscheinlichkeitsrechnung respektive den Sätzen über das Wärmegleichgewicht.Wiener Berichte, 76:373–435, 1877

work page
[2]

Willard Gibbs.Elementary Principles in Statistical Mechanics

J. Willard Gibbs.Elementary Principles in Statistical Mechanics. Yale University Press, 1902

work page 1902
[3]

David Ruelle.Statistical Mechanics: Rigorous Results. W. A. Benjamin, 1969

work page 1969
[4]

E. T. Jaynes. Information theory and statistical mechanics.Phys. Rev., 106:620–630, 1957

work page 1957
[5]

E. T. Jaynes. Information theory and statistical mechanics. II.Phys. Rev., 108:171–190, 1957

work page 1957
[6]

Steve Presse, Kingshuk Ghosh, Julian Lee, and Ken A. Dill. Principles of maximum entropy and maximum caliber in statistical physics.Rev. Mod. Phys., 85:1115–1141, 2013

work page 2013
[7]

Shore and Rodney W

John E. Shore and Rodney W. Johnson. Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy.IEEE Trans. Inform. Theory, 26(1):26–37, 1980

work page 1980
[8]

EBL-Schweitzer (USP / EBEB),

Ariel Caticha.Entropic Inference and the Foundations of Physics. EBL-Schweitzer (USP / EBEB),

work page
[9]

Lecture notes available athttps://www.albany.edu/physics/faculty/ariel-caticha

work page
[10]

Possible generalization of Boltzmann–Gibbs statistics.J

Constantino Tsallis. Possible generalization of Boltzmann–Gibbs statistics.J. Stat. Phys., 52(1– 2):479–487, 1988

work page 1988
[11]

Springer, 2011

Jan Naudts.Generalised Thermostatistics. Springer, 2011

work page 2011
[12]

The large deviation approach to statistical mechanics.Phys

Hugo Touchette. The large deviation approach to statistical mechanics.Phys. Rep., 478:1–69, 2009

work page 2009
[13]

Oscar E. Lanford. Entropy and equilibrium states in classical statistical mechanics. InStatistical Mechanics and Mathematical Problems, volume 20 ofLecture Notes in Physics, pages 1–113. Springer, 1973

work page 1973
[14]

Uniqueness of the canonical reciprocal cost

Jonathan Washburn and Milan Zlatanović. Uniqueness of the canonical reciprocal cost. Mathematics, 14(6):935, 2026. Also available as arXiv:2602.05753 [math.CA]

work page arXiv 2026
[15]

Aczél.Lectures on Functional Equations and Their Applications

J. Aczél.Lectures on Functional Equations and Their Applications. Academic Press, 1966

work page 1966
[16]

Cambridge University Press, 2009

Marek Kuczma, Bogdan Choczewski, and Roman Ger.Iterative Functional Equations, volume 32 ofEncyclopedia of Mathematics and Its Applications. Cambridge University Press, 2009. Reprint of the 1990 original

work page 2009
[17]

The coercive projection theorem for canonical reciprocal costs, 2026

Jonathan Washburn and Amir Rahnamai Barghi. The coercive projection theorem for canonical reciprocal costs, 2026. Preprint, arXiv:2603.20205

work page arXiv 2026
[18]

A remark on Stirling’s formula.Amer

Herbert Robbins. A remark on Stirling’s formula.Amer. Math. Monthly, 62(1):26–29, 1955

work page 1955
[19]

Ellis.Entropy, Large Deviations, and Statistical Mechanics

Richard S. Ellis.Entropy, Large Deviations, and Statistical Mechanics. Springer, 1985

work page 1985
[20]

Schnakenberg

J. Schnakenberg. Network theory of microscopic and macroscopic behavior of master equation systems.Rev. Mod. Phys., 48:571–585, 1976

work page 1976
[21]

Cover and Joy A

Thomas M. Cover and Joy A. Thomas.Elements of Information Theory. Wiley, 2 edition, 2006

work page 2006
[22]

Academic Press, 1981

Imre Csiszár and János Körner.Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic Press, 1981

work page 1981
[23]

Springer, 2 edition,

Amir Dembo and Ofer Zeitouni.Large Deviations Techniques and Applications. Springer, 2 edition,

work page
[24]

Corrected reprint of the 1998 edition

work page 1998
[25]

Levin, Yuval Peres, and Elizabeth L

David A. Levin, Yuval Peres, and Elizabeth L. Wilmer.Markov Chains and Mixing Times. American Mathematical Society, 2 edition, 2017

work page 2017
[26]

Dario Lucente, Marco Baldovin, Andrea Puglisi, and Angelo Vulpiani.H-theorem at negative temperature: the random exchange model with bounds.J. Stat. Mech., 2025(1):013210, 2025

work page 2025
[27]

Solomon Kullback and Richard A. Leibler. On information and sufficiency.Ann. Math. Statist., 22:79–86, 1951

work page 1951
[28]

Information-type measures of difference of probability distributions and indirect observations.Studia Sci

Imre Csiszár. Information-type measures of difference of probability distributions and indirect observations.Studia Sci. Math. Hungar., 2:299–318, 1967

work page 1967
[29]

S. M. Ali and S. D. Silvey. A general class of coefficients of divergence of one distribution from another.J. Roy. Statist. Soc. Ser. B Methodol., 28(1):131–142, 1966

work page 1966
[30]

Wiley, 1998

Alexander Schrijver.Theory of Linear and Integer Programming. Wiley, 1998. Finite-state Gibbs from a recognition cost50

work page 1998
[31]

I. N. Sanov. On the probability of large deviations of random magnitudes.Mat. Sb. (N.S.), 42(1):11–44, 1957

work page 1957
[32]

Stochastic operators, information, and entropy.Comm

Jürgen Voigt. Stochastic operators, information, and entropy.Comm. Math. Phys., 81:31–38, 1981

work page 1981
[33]

J. R. Norris.Markov Chains. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1997

work page 1997
[34]

Stochastic thermodynamics, fluctuation theorems and molecular machines.Rep

Udo Seifert. Stochastic thermodynamics, fluctuation theorems and molecular machines.Rep. Prog. Phys., 75(12):126001, 2012

work page 2012
[35]

Nonequilibrium thermodynamics of chemical reaction networks: Wisdom from stochastic thermodynamics.Phys

Riccardo Rao and Massimiliano Esposito. Nonequilibrium thermodynamics of chemical reaction networks: Wisdom from stochastic thermodynamics.Phys. Rev. X, 6:041064, 2016

work page 2016
[36]

Relative Entropy and Inductive Inference

Ariel Caticha. Relative entropy and inductive inference. InBayesian Inference and Maximum Entropy Methods in Science and Engineering, volume 707 ofAIP Conference Proceedings, pages 75–96. American Institute of Physics, 2004. Also available as arXiv:physics/0311093

work page internal anchor Pith review Pith/arXiv arXiv 2004
[37]

Springer, 1985

Shun-ichi Amari.Differential-Geometrical Methods in Statistics, volume 28 ofLecture Notes in Statistics. Springer, 1985

work page 1985
[38]

American Mathematical Society, 2000

Shun-ichi Amari and Hiroshi Nagaoka.Methods of Information Geometry, volume 191 of Translations of Mathematical Monographs. American Mathematical Society, 2000

work page 2000
[39]

Analysis synthesis telephony based on the maximum likelihood method.Reports of the 6th International Congress on Acoustics, pages C–17–C–20, 1968

Fumitada Itakura and Shuzo Saito. Analysis synthesis telephony based on the maximum likelihood method.Reports of the 6th International Congress on Acoustics, pages C–17–C–20, 1968

work page 1968
[40]

Means of Hermitian positive-definite matrices based on the log-determinantα-divergence function.Linear Algebra Appl., 436:1872–1889, 2012

Zeineb Chebbi and Maher Moakher. Means of Hermitian positive-definite matrices based on the log-determinantα-divergence function.Linear Algebra Appl., 436:1872–1889, 2012

work page 2012
[41]

A new metric on the manifold of kernel matrices with application to matrix geometric means.Adv

Suvrit Sra. A new metric on the manifold of kernel matrices with application to matrix geometric means.Adv. Neural Inf. Process. Syst., 25:144–152, 2012

work page 2012
[42]

Cambridge University Press, 1989

János Aczél and Jean Dhombres.Functional Equations in Several Variables, volume 31 of Encyclopedia of Mathematics and its Applications. Cambridge University Press, 1989

work page 1989
[43]

Three faces of the second law

Massimiliano Esposito and Christian Van den Broeck. Three faces of the second law. I. Master equation formulation.Phys. Rev. E, 82:011143, 2010

work page 2010
[44]

Irreversible thermodynamics of open chemical networks

Matteo Polettini and Massimiliano Esposito. Irreversible thermodynamics of open chemical networks. I. Emergent cycles and broken conservation laws.J. Chem. Phys., 141(2):024117, 2014

work page 2014
[45]

Stochastic thermodynamic interpretation of information geometry.Phys

Sosuke Ito. Stochastic thermodynamic interpretation of information geometry.Phys. Rev. Lett., 121:030605, 2018

work page 2018
[46]

Barato and Udo Seifert

Andre C. Barato and Udo Seifert. Thermodynamic uncertainty relation for biomolecular processes. Phys. Rev. Lett., 114:158101, 2015

work page 2015
[47]

Estimators, escort probabilities, andϕ-exponential families in statistical physics.J

Jan Naudts. Estimators, escort probabilities, andϕ-exponential families in statistical physics.J. Inequal. Pure Appl. Math., 5(4):Article 102, 2004

work page 2004

[1] [1]

Ludwig Boltzmann. Über die Beziehung zwischen dem zweiten Hauptsatze der mechanischen Wärmetheorie und der Wahrscheinlichkeitsrechnung respektive den Sätzen über das Wärmegleichgewicht.Wiener Berichte, 76:373–435, 1877

work page

[2] [2]

Willard Gibbs.Elementary Principles in Statistical Mechanics

J. Willard Gibbs.Elementary Principles in Statistical Mechanics. Yale University Press, 1902

work page 1902

[3] [3]

David Ruelle.Statistical Mechanics: Rigorous Results. W. A. Benjamin, 1969

work page 1969

[4] [4]

E. T. Jaynes. Information theory and statistical mechanics.Phys. Rev., 106:620–630, 1957

work page 1957

[5] [5]

E. T. Jaynes. Information theory and statistical mechanics. II.Phys. Rev., 108:171–190, 1957

work page 1957

[6] [6]

Steve Presse, Kingshuk Ghosh, Julian Lee, and Ken A. Dill. Principles of maximum entropy and maximum caliber in statistical physics.Rev. Mod. Phys., 85:1115–1141, 2013

work page 2013

[7] [7]

Shore and Rodney W

John E. Shore and Rodney W. Johnson. Axiomatic derivation of the principle of maximum entropy and the principle of minimum cross-entropy.IEEE Trans. Inform. Theory, 26(1):26–37, 1980

work page 1980

[8] [8]

EBL-Schweitzer (USP / EBEB),

Ariel Caticha.Entropic Inference and the Foundations of Physics. EBL-Schweitzer (USP / EBEB),

work page

[9] [9]

Lecture notes available athttps://www.albany.edu/physics/faculty/ariel-caticha

work page

[10] [10]

Possible generalization of Boltzmann–Gibbs statistics.J

Constantino Tsallis. Possible generalization of Boltzmann–Gibbs statistics.J. Stat. Phys., 52(1– 2):479–487, 1988

work page 1988

[11] [11]

Springer, 2011

Jan Naudts.Generalised Thermostatistics. Springer, 2011

work page 2011

[12] [12]

The large deviation approach to statistical mechanics.Phys

Hugo Touchette. The large deviation approach to statistical mechanics.Phys. Rep., 478:1–69, 2009

work page 2009

[13] [13]

Oscar E. Lanford. Entropy and equilibrium states in classical statistical mechanics. InStatistical Mechanics and Mathematical Problems, volume 20 ofLecture Notes in Physics, pages 1–113. Springer, 1973

work page 1973

[14] [14]

Uniqueness of the canonical reciprocal cost

Jonathan Washburn and Milan Zlatanović. Uniqueness of the canonical reciprocal cost. Mathematics, 14(6):935, 2026. Also available as arXiv:2602.05753 [math.CA]

work page arXiv 2026

[15] [15]

Aczél.Lectures on Functional Equations and Their Applications

J. Aczél.Lectures on Functional Equations and Their Applications. Academic Press, 1966

work page 1966

[16] [16]

Cambridge University Press, 2009

Marek Kuczma, Bogdan Choczewski, and Roman Ger.Iterative Functional Equations, volume 32 ofEncyclopedia of Mathematics and Its Applications. Cambridge University Press, 2009. Reprint of the 1990 original

work page 2009

[17] [17]

The coercive projection theorem for canonical reciprocal costs, 2026

Jonathan Washburn and Amir Rahnamai Barghi. The coercive projection theorem for canonical reciprocal costs, 2026. Preprint, arXiv:2603.20205

work page arXiv 2026

[18] [18]

A remark on Stirling’s formula.Amer

Herbert Robbins. A remark on Stirling’s formula.Amer. Math. Monthly, 62(1):26–29, 1955

work page 1955

[19] [19]

Ellis.Entropy, Large Deviations, and Statistical Mechanics

Richard S. Ellis.Entropy, Large Deviations, and Statistical Mechanics. Springer, 1985

work page 1985

[20] [20]

Schnakenberg

J. Schnakenberg. Network theory of microscopic and macroscopic behavior of master equation systems.Rev. Mod. Phys., 48:571–585, 1976

work page 1976

[21] [21]

Cover and Joy A

Thomas M. Cover and Joy A. Thomas.Elements of Information Theory. Wiley, 2 edition, 2006

work page 2006

[22] [22]

Academic Press, 1981

Imre Csiszár and János Körner.Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic Press, 1981

work page 1981

[23] [23]

Springer, 2 edition,

Amir Dembo and Ofer Zeitouni.Large Deviations Techniques and Applications. Springer, 2 edition,

work page

[24] [24]

Corrected reprint of the 1998 edition

work page 1998

[25] [25]

Levin, Yuval Peres, and Elizabeth L

David A. Levin, Yuval Peres, and Elizabeth L. Wilmer.Markov Chains and Mixing Times. American Mathematical Society, 2 edition, 2017

work page 2017

[26] [26]

Dario Lucente, Marco Baldovin, Andrea Puglisi, and Angelo Vulpiani.H-theorem at negative temperature: the random exchange model with bounds.J. Stat. Mech., 2025(1):013210, 2025

work page 2025

[27] [27]

Solomon Kullback and Richard A. Leibler. On information and sufficiency.Ann. Math. Statist., 22:79–86, 1951

work page 1951

[28] [28]

Information-type measures of difference of probability distributions and indirect observations.Studia Sci

Imre Csiszár. Information-type measures of difference of probability distributions and indirect observations.Studia Sci. Math. Hungar., 2:299–318, 1967

work page 1967

[29] [29]

S. M. Ali and S. D. Silvey. A general class of coefficients of divergence of one distribution from another.J. Roy. Statist. Soc. Ser. B Methodol., 28(1):131–142, 1966

work page 1966

[30] [30]

Wiley, 1998

Alexander Schrijver.Theory of Linear and Integer Programming. Wiley, 1998. Finite-state Gibbs from a recognition cost50

work page 1998

[31] [31]

I. N. Sanov. On the probability of large deviations of random magnitudes.Mat. Sb. (N.S.), 42(1):11–44, 1957

work page 1957

[32] [32]

Stochastic operators, information, and entropy.Comm

Jürgen Voigt. Stochastic operators, information, and entropy.Comm. Math. Phys., 81:31–38, 1981

work page 1981

[33] [33]

J. R. Norris.Markov Chains. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge University Press, 1997

work page 1997

[34] [34]

Stochastic thermodynamics, fluctuation theorems and molecular machines.Rep

Udo Seifert. Stochastic thermodynamics, fluctuation theorems and molecular machines.Rep. Prog. Phys., 75(12):126001, 2012

work page 2012

[35] [35]

Nonequilibrium thermodynamics of chemical reaction networks: Wisdom from stochastic thermodynamics.Phys

Riccardo Rao and Massimiliano Esposito. Nonequilibrium thermodynamics of chemical reaction networks: Wisdom from stochastic thermodynamics.Phys. Rev. X, 6:041064, 2016

work page 2016

[36] [36]

Relative Entropy and Inductive Inference

Ariel Caticha. Relative entropy and inductive inference. InBayesian Inference and Maximum Entropy Methods in Science and Engineering, volume 707 ofAIP Conference Proceedings, pages 75–96. American Institute of Physics, 2004. Also available as arXiv:physics/0311093

work page internal anchor Pith review Pith/arXiv arXiv 2004

[37] [37]

Springer, 1985

Shun-ichi Amari.Differential-Geometrical Methods in Statistics, volume 28 ofLecture Notes in Statistics. Springer, 1985

work page 1985

[38] [38]

American Mathematical Society, 2000

Shun-ichi Amari and Hiroshi Nagaoka.Methods of Information Geometry, volume 191 of Translations of Mathematical Monographs. American Mathematical Society, 2000

work page 2000

[39] [39]

Analysis synthesis telephony based on the maximum likelihood method.Reports of the 6th International Congress on Acoustics, pages C–17–C–20, 1968

Fumitada Itakura and Shuzo Saito. Analysis synthesis telephony based on the maximum likelihood method.Reports of the 6th International Congress on Acoustics, pages C–17–C–20, 1968

work page 1968

[40] [40]

Means of Hermitian positive-definite matrices based on the log-determinantα-divergence function.Linear Algebra Appl., 436:1872–1889, 2012

Zeineb Chebbi and Maher Moakher. Means of Hermitian positive-definite matrices based on the log-determinantα-divergence function.Linear Algebra Appl., 436:1872–1889, 2012

work page 2012

[41] [41]

A new metric on the manifold of kernel matrices with application to matrix geometric means.Adv

Suvrit Sra. A new metric on the manifold of kernel matrices with application to matrix geometric means.Adv. Neural Inf. Process. Syst., 25:144–152, 2012

work page 2012

[42] [42]

Cambridge University Press, 1989

János Aczél and Jean Dhombres.Functional Equations in Several Variables, volume 31 of Encyclopedia of Mathematics and its Applications. Cambridge University Press, 1989

work page 1989

[43] [43]

Three faces of the second law

Massimiliano Esposito and Christian Van den Broeck. Three faces of the second law. I. Master equation formulation.Phys. Rev. E, 82:011143, 2010

work page 2010

[44] [44]

Irreversible thermodynamics of open chemical networks

Matteo Polettini and Massimiliano Esposito. Irreversible thermodynamics of open chemical networks. I. Emergent cycles and broken conservation laws.J. Chem. Phys., 141(2):024117, 2014

work page 2014

[45] [45]

Stochastic thermodynamic interpretation of information geometry.Phys

Sosuke Ito. Stochastic thermodynamic interpretation of information geometry.Phys. Rev. Lett., 121:030605, 2018

work page 2018

[46] [46]

Barato and Udo Seifert

Andre C. Barato and Udo Seifert. Thermodynamic uncertainty relation for biomolecular processes. Phys. Rev. Lett., 114:158101, 2015

work page 2015

[47] [47]

Estimators, escort probabilities, andϕ-exponential families in statistical physics.J

Jan Naudts. Estimators, escort probabilities, andϕ-exponential families in statistical physics.J. Inequal. Pure Appl. Math., 5(4):Article 102, 2004

work page 2004