Attributed Network Alignment: Statistical Limits and Efficient Algorithm
Pith reviewed 2026-05-10 20:12 UTC · model grok-4.3
The pith
Node features and edges together set sharp thresholds for recovering hidden vertex mappings between graphs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In the featured correlated Gaussian Wigner model, the optimal information-theoretic thresholds for exact recovery and partial recovery of the latent mapping are characterized. QPAlign, an algorithm based on a quadratic programming relaxation, is proposed and shown to achieve strong empirical performance on synthetic and real datasets while carrying theoretical guarantees.
What carries the argument
The featured correlated Gaussian Wigner model, which couples two graphs through an unknown vertex permutation with node features jointly distributed under the same permutation.
If this is right
- Exact recovery of the full mapping succeeds with high probability once the combined edge and feature signal exceeds an explicit threshold.
- Partial recovery of a positive fraction of the mapping is possible between a lower threshold and the exact-recovery threshold.
- QPAlign returns a reliable solution with convergence guarantees whenever the model parameters place the instance above the relevant threshold.
- The same algorithm outperforms purely edge-based methods once node features carry additional correlation information.
Where Pith is reading between the lines
- The same quadratic-programming relaxation could be adapted to other correlation models beyond the Gaussian Wigner case.
- Social-network de-anonymization pipelines that already use profile attributes would gain a statistically grounded performance guarantee by switching to QPAlign.
- Large-scale experiments on graphs with millions of vertices would test whether the current relaxation remains computationally tractable without further approximation.
Load-bearing premise
Node features must be correlated under the same unknown permutation as the edges, and the joint distributions must obey the Gaussian Wigner structure.
What would settle it
A simulation drawn exactly from the featured correlated Gaussian Wigner model in which the mapping is recovered with high probability below the derived information-theoretic threshold would falsify the claimed optimality of those thresholds.
Figures
read the original abstract
This paper studies the problem of recovering a hidden vertex correspondence between two correlated graphs when both edge weights and node features are observed. While most existing work on graph alignment relies primarily on edge information, many real-world applications provide informative node features in addition to graph topology. To capture this setting, we introduce the featured correlated Gaussian Wigner model, where two graphs are coupled through an unknown vertex permutation, and the node features are correlated under the same permutation. We characterize the optimal information-theoretic thresholds for exact recovery and partial recovery of the latent mapping. On the algorithmic side, we propose QPAlign, an algorithm based on a quadratic programming relaxation, and demonstrate its strong empirical performance on both synthetic and real datasets. Moreover, we also derive theoretical guarantees for the proposed procedure, supporting its reliability and providing convergence guarantees.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the featured correlated Gaussian Wigner model for attributed network alignment, where two graphs are coupled through an unknown vertex permutation with jointly Gaussian correlated edge weights and node features. It characterizes the optimal information-theoretic thresholds for exact recovery and partial recovery of the latent mapping. The authors propose QPAlign, a quadratic programming relaxation algorithm, demonstrate its empirical performance on synthetic and real datasets, and derive theoretical guarantees for the procedure.
Significance. If the characterizations and guarantees hold under the stated model, the work is significant for extending graph alignment theory to include node features, providing both sharp information-theoretic limits and a practical algorithm with supporting analysis. The explicit model definition means the stress-test concern about the Gaussian Wigner structure does not land as an internal inconsistency; the claims are scoped to this setting and the derivations follow from it in a standard manner for such models.
minor comments (3)
- Abstract: the claim of 'strong empirical performance' is stated without any quantitative metrics, baselines, or dataset sizes; a single sentence summarizing key results (e.g., recovery accuracy or runtime) would improve the summary.
- Model section: the joint distribution parameters (edge correlation rho and feature correlation) are introduced but their precise roles in the threshold expressions could be highlighted with a short table or remark for quick reference.
- Experiments: more detail on how the real-world datasets are preprocessed to fit the Gaussian Wigner assumption (or how deviations are handled) would help readers evaluate practical relevance.
Simulated Author's Rebuttal
We thank the referee for the careful and positive review of our manuscript. We are pleased that the significance of the featured correlated Gaussian Wigner model, the information-theoretic thresholds, and the QPAlign algorithm with its guarantees has been recognized. The recommendation for minor revision is appreciated. No major comments were raised in the report.
Circularity Check
No circularity: model defined first, thresholds and algorithm derived from it
full rationale
The paper introduces the featured correlated Gaussian Wigner model explicitly, then derives information-theoretic thresholds for exact/partial recovery and provides guarantees for QPAlign. No quoted steps reduce a claimed prediction or first-principles result to its own inputs by construction, self-definition, or load-bearing self-citation. Derivations rely on standard analysis of the stated joint Gaussian structure rather than tautological renaming or fitted parameters presented as predictions. The derivation chain is self-contained.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Edge weights and node features follow jointly Gaussian distributions under the unknown permutation
invented entities (1)
-
featured correlated Gaussian Wigner model
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
featured correlated Gaussian Wigner model G(n,d,ρ,r) ... nlog(1/(1-ρ²))+dlog(1/(1-r²))≥(4+ε)log n ... S_π(G1,G2)=φ(ρ)∑βe(G1)βπ(e)(G2)+φ(r)∑xvyπ(v)
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
QPAlign ... quadratic programming relaxation ... projected gradient descent on Birkhoff polytope
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Exact random graph matching with multiple graphs
[AH24] Taha Ameen and Bruce Hajek. Exact random graph matching with multiple graphs. arXiv preprint arXiv:2405.12293,
-
[2]
Detecting correlation between multiple unlabeled Gaussian networks.arXiv preprint arXiv:2504.16279,
[AH25] Taha Ameen and Bruce Hajek. Detecting correlation between multiple unlabeled Gaussian networks.arXiv preprint arXiv:2504.16279,
-
[3]
Exact alignment recovery for correlated Erd˝ os- R´ enyi graphs.arXiv preprint arXiv:1711.06783,
[CK17] Daniel Cullina and Negar Kiyavash. Exact alignment recovery for correlated Erd˝ os- R´ enyi graphs.arXiv preprint arXiv:1711.06783,
-
[4]
Gaussian database alignment and gaussian planted matching.arXiv preprint arXiv:2307.02459,
[DCK23] Osman Emre Dai, Daniel Cullina, and Negar Kiyavash. Gaussian database alignment and gaussian planted matching.arXiv preprint arXiv:2307.02459,
-
[5]
[DL23] Jian Ding and Zhangsong Li. A polynomial-time iterative algorithm for random graph matching with non-vanishing correlation.arXiv preprint arXiv:2306.00266,
-
[6]
Optimal recovery of correlated Erd˝ os-R´ enyi graphs.arXiv preprint arXiv:2502.12077,
[Du25] Hang Du. Optimal recovery of correlated Erd˝ os-R´ enyi graphs.arXiv preprint arXiv:2502.12077,
-
[7]
[GL24] Shuyang Gong and Zhangsong Li. The Umeyama algorithm for matching corre- lated Gaussian geometric models in the low-dimensional regime.arXiv preprint arXiv:2402.15095,
work page internal anchor Pith review Pith/arXiv arXiv
-
[8]
Impossibility of partial re- covery in the graph alignment problem
[GML21] Luca Ganassali, Laurent Massouli´ e, and Marc Lelarge. Impossibility of partial re- covery in the graph alignment problem. InConference on Learning Theory, pages 2080–2102. PMLR,
work page 2080
-
[9]
[HY25] Dong Huang and Pengkun Yang. Sample complexity of correlation detection in the Gaussian Wigner model.arXiv preprint arXiv:2505.14138,
-
[10]
Strong recovery of geometric planted matchings
[KNW22] Dmitriy Kunisky and Jonathan Niles-Weed. Strong recovery of geometric planted matchings. InProceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 834–876. SIAM,
work page 2022
-
[11]
[MS24] Andrea Muratori and Guilhem Semerjian. Faster algorithms for the alignment of sparse correlated Erd˝ os-R´ enyi random graphs.Journal of Statistical Mechanics: The- ory and Experiment, 2024(11):113405,
work page 2024
-
[12]
[MWZ24] Cheng Mao, Alexander S. Wein, and Shenduo Zhang. Information-theoretic thresh- olds for planted dense cycles.arXiv preprint arXiv:2402.00305,
-
[13]
[PSSZ22] Giovanni Piccioli, Guilhem Semerjian, Gabriele Sicuro, and Lenka Zdeborov´ a. Align- ing random graphs with a sub-tree similarity message-passing algorithm.Journal of Statistical Mechanics: Theory and Experiment, 2022(6):063401,
work page 2022
-
[14]
Independence testing for inhomoge- neous random graphs.arXiv preprint arXiv:2304.09132,
[SPT23] Yukun Song, Carey E Priebe, and Minh Tang. Independence testing for inhomoge- neous random graphs.arXiv preprint arXiv:2304.09132,
-
[15]
Nonparametric graphon estimation.arXiv preprint arXiv:1309.5936,
51 [WO13] Patrick J Wolfe and Sofia C Olhede. Nonparametric graphon estimation.arXiv preprint arXiv:1309.5936,
-
[16]
[Zha18] Yuan Zhang. Consistent polynomial-time unseeded graph matching for Lipschitz graphons.arXiv preprint arXiv:1807.11027,
-
[17]
PARROT: Position- aware regularized optimal transport for network alignment
52 [ZZXT23] Zhichen Zeng, Si Zhang, Yinglong Xia, and Hanghang Tong. PARROT: Position- aware regularized optimal transport for network alignment. InProceedings of the ACM Web Conference 2023, pages 372–382. Association for Computing Machinery,
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.