On the partition function of a class of Mallows model
Pith reviewed 2026-05-07 14:25 UTC · model grok-4.3
The pith
The partition function of a class of Mallows models converges to a Fredholm determinant after exponential rescaling.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The author establishes Pal's conjecture by showing that for twice continuously differentiable c, the limit lim_{n→∞} e^{-n Γ_0} L_n is equal to the Fredholm determinant of the integral operator associated with the entropy-regularized optimal transport problem.
What carries the argument
Fredholm determinant of the integral operator associated with the entropy-regularized optimal transport problem
If this is right
- If the claim holds, then L_n is asymptotically equivalent to the determinant times e^{n Γ_0}.
- The connection between the Mallows model and entropy-regularized transport is made rigorous at the level of the partition function.
- No further regularity on the operator is needed beyond the twice differentiability of c.
- Exact asymptotics become available for this class of models on the symmetric group.
Where Pith is reading between the lines
- The result suggests that similar Fredholm determinants may appear in asymptotics of other discrete models with continuous relaxations.
- Practical computation of the determinant could approximate L_n for large n in applications.
- Extensions might include higher-order terms in the asymptotic expansion or other cost functions.
Load-bearing premise
The twice continuous differentiability of the cost function c is sufficient to complete the proof of the limit without additional conditions.
What would settle it
Direct computation of lim e^{-n Γ_0} L_n for increasing n using Monte Carlo or exact methods for a chosen c, compared against numerical approximation of the Fredholm determinant.
read the original abstract
Let $\Sym{n}$ denote the set of all permutations on $n$ labels. Let $c:[0, 1]^2\to [0, \infty)$ be a twice continuously differentiable function. A subfamily of the Mallows model is the Gibbs probability measures on $\Sym{n}$ such that $\mathbb{P}(X=\sigma)=L_n^{-1} \prod_{i=1}^{n}\exp(-c(i/n, \sigma(i)/n))$. Mukherjee [Ann. Stat., Vol. 44(2), pp 853--875 (2016)] computed the limit of the log partition function and showed that $\lim_{n\to \infty}\frac{1}{n}\log L_n=-\Gamma_0$ where $\Gamma_0$ is the optimal cost associated with an entropy regularized optimal transport problem. In the KRP Memorial Volume of the Indian Journal of Pure and Applied Math, Pal conjectured an exact value for the limit $\lim_{n\to \infty} e^{-n\Gamma_0}L_n$ in terms of the Fredholm determinant of an integral operator and provided a partial proof. We give a complete proof of Pal's conjecture.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims to provide a complete proof of Pal's conjecture: for a twice continuously differentiable cost c:[0,1]^2 → [0,∞), the partition function L_n of the associated Gibbs measure on S_n satisfies lim_{n→∞} e^{-n Γ_0} L_n = det(I+K), where Γ_0 is the entropy-regularized OT cost from Mukherjee's theorem and K is the integral operator on L^2[0,1] whose kernel is constructed from the optimal plan (or its derivatives) for that OT problem.
Significance. If the proof holds, the result supplies the precise multiplicative constant in the asymptotic expansion of the Mallows partition function, completing the large-deviation analysis of Mukherjee and the partial argument of Pal. The explicit identification with a Fredholm determinant from entropy-regularized OT is a concrete analytic advance that may permit further exact asymptotics or extensions to other permutation models.
major comments (2)
- [§3] §3 (completion of Pal's argument): the claim that the integral operator K is trace-class (hence that the Fredholm determinant is defined without further conditions) rests only on the C^2 regularity of c. A C^2 kernel yields a Hilbert-Schmidt operator, but trace-class membership requires ∑|λ_i| < ∞, which does not follow automatically. The manuscript must supply an explicit verification—either by exploiting the special structure of the OT plan or by deriving a stronger decay estimate on the eigenvalues—otherwise the identification with det(I+K) remains formally incomplete.
- [Definition of K] Definition of K (immediately after the statement of the main theorem): the kernel is described as 'associated with' the entropy-regularized OT problem, but the precise formula (involving the optimal density or its derivatives) is not written out. Without this explicit expression it is impossible to check the trace-class property or to confirm that the operator is independent of auxiliary choices in the OT formulation.
minor comments (2)
- [Introduction] The sign convention for the Fredholm determinant (I+K versus I-K) should be stated explicitly and matched to the standard literature on regularized OT.
- [§2] A short remark on the domain of the operator (L^2[0,1] with Lebesgue measure) and the precise normalization of the kernel would improve readability.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive suggestions. The comments highlight two points where additional explicit details will strengthen the manuscript. We address each below and will incorporate the necessary clarifications in the revised version.
read point-by-point responses
-
Referee: [§3] §3 (completion of Pal's argument): the claim that the integral operator K is trace-class (hence that the Fredholm determinant is defined without further conditions) rests only on the C^2 regularity of c. A C^2 kernel yields a Hilbert-Schmidt operator, but trace-class membership requires ∑|λ_i| < ∞, which does not follow automatically. The manuscript must supply an explicit verification—either by exploiting the special structure of the OT plan or by deriving a stronger decay estimate on the eigenvalues—otherwise the identification with det(I+K) remains formally incomplete.
Authors: We agree that C^2 regularity of the cost alone yields a Hilbert-Schmidt operator and that an explicit verification of trace-class membership is required. The kernel K is constructed from the optimal entropy-regularized transport plan, whose density and derivatives inherit infinite smoothness from the C^2 cost and the strictly convex entropy term. This structure produces a kernel whose eigenvalues decay exponentially, which we will verify in a new lemma in §3 by combining the analyticity of the optimal plan with standard trace-class criteria for integral operators on L^2[0,1]. revision: yes
-
Referee: [Definition of K] Definition of K (immediately after the statement of the main theorem): the kernel is described as 'associated with' the entropy-regularized OT problem, but the precise formula (involving the optimal density or its derivatives) is not written out. Without this explicit expression it is impossible to check the trace-class property or to confirm that the operator is independent of auxiliary choices in the OT formulation.
Authors: We accept that the current description is insufficiently explicit. Immediately after the main theorem we will insert the precise formula for the kernel K(x,y) in terms of the optimal density ρ^* and its first derivatives arising from the entropy-regularized OT problem (as obtained from the Euler-Lagrange equation for the dual potentials). This explicit expression makes the independence from auxiliary choices manifest and supplies the starting point for the trace-class argument. revision: yes
Circularity Check
Derivation completes Pal's conjecture via external analytic tools without self-referential reduction
full rationale
The paper's central result is a completion of Pal's partial proof for the exact limit of the normalized partition function as a Fredholm determinant, building directly on Mukherjee's independently established limit for (1/n) log L_n = -Γ_0. No step in the provided abstract or described chain defines the target determinant or the limit value in terms of quantities fitted or constructed inside this manuscript; the argument invokes standard properties of entropy-regularized OT and C^2 regularity to close the operator analysis left open by Pal. This is a standard mathematical completion relying on external results rather than any self-definitional, fitted-input, or self-citation load-bearing reduction. The derivation chain therefore remains self-contained against the cited benchmarks.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption The entropy-regularized optimal transport problem admits a unique optimal plan whose cost defines Γ_0
- standard math The integral operator arising from the model is trace-class so that its Fredholm determinant is well-defined and finite
Reference graph
Works this paper leans on
- [1]
-
[2]
Itai Benjamini, Noam Berger, Christopher Hoffman, and Elchanan Mossel, Mixing times of the biased card shuffling and the asymmetric exclusion process, Transactions of the American Mathematical Society 357 (2005), no. 8, 3013--3029
work page 2005
-
[3]
Csisz\'ar, I -divergence geometry of probability distributions and minimization problems , Ann
I. Csisz\'ar, I -divergence geometry of probability distributions and minimization problems , Ann. Probability 3 (1975), 146--158. 365798
work page 1975
-
[4]
11, Institute of Mathematical Statistics, Hayward, CA, 1988
Persi Diaconis, Group representations in probability and statistics, Institute of Mathematical Statistics Lecture Notes---Monograph Series, vol. 11, Institute of Mathematical Statistics, Hayward, CA, 1988. 964069
work page 1988
-
[5]
Persi Diaconis and Arun Ram, Analysis of systematic scan metropolis algorithms using iwahori-hecke algebra techniques., Michigan Mathematical Journal 48 (2000), no. 1, 157--190
work page 2000
-
[6]
Christian L \'e onard, From the S chr\"odinger problem to the M onge- K antorovich problem , J. Funct. Anal. 262 (2012), no. 4, 1879--1920. 2873864
work page 2012
-
[7]
thesis, University of Washington, Seattle, 2017
Avi William Levy, Novel uses of the mallows model in coloring and matching, Ph.D. thesis, University of Washington, Seattle, 2017
work page 2017
-
[8]
C. L. Mallows, Non-null ranking models. I , Biometrika 44 (1957), no. 1/2, 114--130
work page 1957
-
[9]
Peter McCullagh, An asymptotic approximation for the permanent of a doubly stochastic matrix, Journal of Statistical Computation and Simulation 84 (2014), no. 2, 404--414
work page 2014
-
[10]
Sumit Mukherjee, Estimation in exponential families on permutations, Ann. Statist. 44 (2016), no. 2, 853--875. 3476619
work page 2016
-
[11]
Soumik Pal, Limiting partition function for the mallows model: a conjecture and partial evidence, To appear in Indian Journal of Pure and Applied Mathematics (2024)
work page 2024
-
[12]
uschendorf and W. Thomsen, Note on the S chr\
L. R\"uschendorf and W. Thomsen, Note on the S chr\"odinger equation and I -projections , Statist. Probab. Lett. 17 (1993), no. 5, 369--375. 1237783
work page 1993
-
[13]
120, American Mathematical Society, Providence, RI, 2005
Barry Simon, Trace ideals and their applications, second ed., Mathematical Surveys and Monographs, vol. 120, American Mathematical Society, Providence, RI, 2005. 2154153
work page 2005
-
[14]
58, American Mathematical Soc., 2021
C \'e dric Villani, Topics in optimal transportation, vol. 58, American Mathematical Soc., 2021
work page 2021
-
[15]
201, Cambridge University Press, Cambridge, 2022
Jan van Neerven, Functional analysis, Cambridge Studies in Advanced Mathematics, vol. 201, Cambridge University Press, Cambridge, 2022. 4425670
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.