The non-tightness of the reconstruction threshold of a 4 states symmetric model with different in-block and out-block mutations
Pith reviewed 2026-05-25 17:56 UTC · model grok-4.3
The pith
Conditions exist under which the reconstruction threshold of a 4-state symmetric model is not tight.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In the 4-state symmetric model with distinct in-block and out-block transition probabilities, the reconstruction threshold on regular trees is not tight whenever the parameters satisfy the inequalities established in the paper.
What carries the argument
The 4-state symmetric model with different in-block and out-block transition probabilities, which produces distinct decay rates for information inside versus outside blocks.
If this is right
- The corresponding 4-community stochastic block model possesses a hybrid-hard phase under the identified conditions.
- Root information can be recovered from the leaves in some parameter regimes but not others.
- The non-tightness result extends earlier tightness proofs from symmetric two-state models to this four-state asymmetric case.
Where Pith is reading between the lines
- Similar non-tightness may appear in other multi-state models once in-block and out-block rates are allowed to differ.
- Clustering algorithms for four-community graphs may need separate procedures for the hybrid-hard region identified here.
Load-bearing premise
The established connection between tree reconstruction and stochastic block model clustering continues to hold for this 4-state model.
What would settle it
An explicit computation of the root-to-leaf mutual information that decays exactly at the usual Kesten-Stigum threshold rather than strictly inside the claimed non-tight region would falsify the result.
read the original abstract
The tree reconstruction problem is to collect and analyze massive data at the $n$th level of the tree, to identify whether there is non-vanishing information of the root, as $n$ goes to infinity. Its connection to the clustering problem in the setting of the stochastic block model, which has wide applications in machine learning and data mining, has been well established. For the stochastic block model, an "information-theoretically-solvable-but-computationally-hard" region, or say "hybrid-hard phase", appears whenever the reconstruction bound is not tight of the corresponding reconstruction on the tree problem. Although it has been studied in numerous contexts, the existing literature with rigorous reconstruction thresholds established are very limited, and it becomes extremely challenging when the model under investigation has $4$ states (the stochastic block model with $4$ communities). In this paper, inspired by the newly proposed $q_1+q_2$ stochastic block model, we study a $4$ states symmetric model with different in-block and out-block transition probabilities, and rigorously give the conditions for the non-tightness of the reconstruction threshold.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript examines the tree reconstruction problem on a 4-state symmetric channel whose transition probabilities differ between in-block and out-block pairs. It claims to derive rigorous conditions under which the reconstruction threshold is not tight, thereby identifying an information-theoretically solvable but computationally hard regime in the associated 4-community stochastic block model.
Significance. A correct derivation would enlarge the small collection of models for which non-tightness of the reconstruction threshold has been proved, thereby clarifying the location of the hybrid-hard phase for community detection with four groups. The extension from symmetric q-ary channels to the asymmetric in/out mutation structure is a non-trivial technical step whose validity determines the result's reach.
major comments (2)
- [Introduction / Model definition] The equivalence between root reconstruction on the tree and community detection in the corresponding SBM must be re-established for the new transition matrix. Standard second-moment or fixed-point arguments were derived for symmetric channels; the differing in-block and out-block probabilities alter the relevant stability analysis and the paper must show explicitly that the non-tightness criterion still follows from the same mutual-information or reconstruction threshold expressions.
- [Main result / Theorem statement] The main theorem stating the conditions for non-tightness (presumably in §4 or §5) relies on the claim that the 4-state model inherits the known tree-SBM connection. If the proof invokes prior results for q-ary symmetric channels without re-deriving the Kesten-Stigum threshold or the second-moment bound for the asymmetric case, the rigor of the stated conditions is not secured.
minor comments (1)
- [Abstract] The abstract asserts that 'rigorous conditions are given' but does not state what those conditions are; a one-sentence summary of the main theorem would improve readability.
Simulated Author's Rebuttal
We thank the referee for the careful reading and constructive comments on the equivalence between tree reconstruction and the SBM for our 4-state model with differing in-block and out-block transitions. We address each major comment below and will revise the manuscript accordingly.
read point-by-point responses
-
Referee: [Introduction / Model definition] The equivalence between root reconstruction on the tree and community detection in the corresponding SBM must be re-established for the new transition matrix. Standard second-moment or fixed-point arguments were derived for symmetric channels; the differing in-block and out-block probabilities alter the relevant stability analysis and the paper must show explicitly that the non-tightness criterion still follows from the same mutual-information or reconstruction threshold expressions.
Authors: We agree that the equivalence must be re-established explicitly for the new transition matrix. In the revised manuscript we will insert a dedicated subsection (in the model definition or introduction) that adapts the second-moment method and fixed-point analysis to the case of distinct in-block and out-block probabilities, confirming that the non-tightness criterion continues to follow from the corresponding mutual-information and reconstruction-threshold expressions. revision: yes
-
Referee: [Main result / Theorem statement] The main theorem stating the conditions for non-tightness (presumably in §4 or §5) relies on the claim that the 4-state model inherits the known tree-SBM connection. If the proof invokes prior results for q-ary symmetric channels without re-deriving the Kesten-Stigum threshold or the second-moment bound for the asymmetric case, the rigor of the stated conditions is not secured.
Authors: We acknowledge the concern. The current proof sketch may rely too heavily on prior symmetric-channel results. In the revision we will expand the proof of the main theorem to contain a self-contained derivation of the Kesten-Stigum threshold and the second-moment bound that accounts for the asymmetric in/out-block transition structure, thereby securing the rigor of the stated non-tightness conditions. revision: yes
Circularity Check
No circularity; tree-SBM connection cited as externally established
full rationale
The abstract states the tree reconstruction to SBM clustering connection 'has been well established' and claims to 'rigorously give the conditions for the non-tightness' for the new 4-state model. No self-citations, fitted parameters renamed as predictions, or definitional reductions are visible. The derivation is presented as building on prior external results with new analysis for the asymmetric transition probabilities, satisfying the criteria for a self-contained result.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Main Theorem. If |λ1|≠|λ3| and 0<|λ2|<max{|λ1|,|λ3|}, the Kesten-Stigum bound is not tight for every d...
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
the connection between the reconstruction on the tree problem and the clustering problem in the setting of the stochastic block model
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
- [1]
-
[2]
J. Bernussou and J.-L. Abatut , Point mapping stability , Pergamon, 1977. 20 W. LIU AND N. NING
work page 1977
-
[3]
S. Bhamidi, R. Rajagopal, and S. Roch , Network delay inference from additive metrics , Random Structures & Algorithms, 37 (2010), pp. 176–203
work page 2010
-
[4]
P. M. Bleher, J. Ruiz, and V. A. Zagrebnov , On the purity of the limiting Gibbs state for the Ising model on the bethe lattice , Journal of Statistical Physics, 79 (1995), pp. 473–482
work page 1995
- [5]
-
[6]
G. Brito, I. Dumitriu, S. Ganguly, C. Hoffman, and L. V. Tran , Recovery and rigidity in a regular stochastic block model , in Proceedings of the twenty-seventh annual ACM-SIAM symposium on Discrete algorithms, Society for Industrial a nd Applied Mathematics, 2016, pp. 1589–1601
work page 2016
- [7]
-
[8]
C. Daskalakis, E. Mossel, and S. Roch , Optimal phylogenetic reconstruction, in Proceedings of the thirty-eighth annual ACM symposium on Theory of compu ting, ACM, 2006, pp. 159– 168
work page 2006
- [9]
-
[10]
H. Kesten and B. P. Stigum , Additional limit theorems for indecomposable multidimens ional galton-watson processes, The Annals of Mathematical Statistics, 37 (1966), pp. 1463 –1481
work page 1966
-
[11]
H. Kesten and B. P. Stigum , Limit theorems for decomposable multi-dimensional galton - watson processes, Journal of Mathematical Analysis and Applications, 17 (19 67), pp. 309– 338
-
[12]
M. Kimura , A simple method for estimating evolutionary rates of base su bstitutions through comparative studies of nucleotide sequences , Journal of molecular evolution, 16 (1980), pp. 111–120
work page 1980
-
[13]
W. Liu, S. R. Jammalamadaka, and N. Ning , The tightness of the Kesten-Stigum reconstruc- tion bound of symmetric model with multiple mutations , Journal of Statistical Physics, 170 (2018), pp. 617–641
work page 2018
- [14]
- [15]
-
[16]
M. M ´ezard and A. Montanari , Reconstruction on trees and spin glass transition , Journal of statistical physics, 124 (2006), pp. 1317–1350
work page 2006
-
[17]
E. Mossel , Reconstruction on trees: beating the second eigenvalue , Annals of Applied Proba- bility, (2001), pp. 285–300
work page 2001
-
[18]
E. Mossel , Phase transitions in phylogeny , Transactions of the American Mathematical Soci- ety, 356 (2004), pp. 2379–2404
work page 2004
-
[19]
E. Mossel , Survey: information flow on trees , DIMACS series in discrete mathematics and theoretical computer science, 63 (2004), pp. 155–170
work page 2004
-
[20]
Deep Learning and Hierarchal Generative Models
E. Mossel , Deep learning and hierarchal generative models , arXiv preprint arXiv:1612.09057, (2016)
work page internal anchor Pith review Pith/arXiv arXiv 2016
- [21]
- [22]
-
[23]
Non-Reconstructability in the Stochastic Block Model
J. Neeman and P. Netrapalli , Non-reconstructability in the stochastic block model , arXiv preprint arXiv:1404.6304, (2014)
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[24]
F. Ricci-Tersenghi, G. Semerjian, and L. Zdeborov ´a, Typology of phase transitions in bayesian inference problems , Physical Review E, 99 (2019), p. 042109
work page 2019
-
[25]
Sly , Reconstruction for the Potts model , The Annals of Probability, 39 (2011), pp
A. Sly , Reconstruction for the Potts model , The Annals of Probability, 39 (2011), pp. 1365– 1406
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.