SIGMA: An Efficient Heterophilous Graph Neural Network with Fast Global Aggregation
Pith reviewed 2026-05-24 08:42 UTC · model grok-4.3
The pith
SIGMA integrates SimRank into graph neural network aggregation to capture distant global similarities in heterophilous graphs with linear complexity.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
SIGMA integrates the structural similarity measurement SimRank into GNN aggregation. Its theoretical analysis shows that this inherently captures distant global similarity even under heterophily, something conventional approaches achieve only after multiple iterative aggregations. The method requires only one-time computation with complexity linear in the node set size O(n).
What carries the argument
The SimRank integration into the aggregation step, which computes structural similarities across the graph once and uses them to distinguish nodes under heterophily.
If this is right
- SIGMA achieves state-of-the-art performance on heterophilous graph datasets.
- It delivers superior aggregation and overall efficiency compared to baselines.
- It obtains 5 times acceleration on the large-scale heterophily dataset pokec with over 30 million edges.
- The linear O(n) complexity enables application to graphs with millions of nodes without iterative overhead.
Where Pith is reading between the lines
- Replacing iterative global updates with a one-shot similarity computation could generalize to other graph tasks that need long-range information.
- Future work might test whether different structural similarity measures yield similar efficiency gains under heterophily.
- Large-scale graph learning pipelines could adopt this pattern to reduce memory and time costs when scaling beyond current limits.
Load-bearing premise
That adding SimRank to the aggregation step preserves the global similarity capture and linear complexity without introducing hidden iterative costs or reducing expressivity on heterophilous data.
What would settle it
A direct comparison on a heterophilous graph with tens of millions of edges showing whether SIGMA's accuracy after one pass equals that of an iterative baseline while using far less time and memory.
Figures
read the original abstract
Graph neural networks (GNNs) realize great success in graph learning but suffer from performance loss when meeting heterophily, i.e. neighboring nodes are dissimilar, due to their local and uniform aggregation. Existing attempts of heterophilous GNNs incorporate long-range or global aggregations to distinguish nodes in the graph. However, these aggregations usually require iteratively maintaining and updating full-graph information, which limits their efficiency when applying to large-scale graphs. In this paper, we propose SIGMA, an efficient global heterophilous GNN aggregation integrating the structural similarity measurement SimRank. Our theoretical analysis illustrates that SIGMA inherently captures distant global similarity even under heterophily, that conventional approaches can only achieve after iterative aggregations. Furthermore, it enjoys efficient one-time computation with a complexity only linear to the node set size $\mathcal{O}(n)$. Comprehensive evaluation demonstrates that SIGMA achieves state-of-the-art performance with superior aggregation and overall efficiency. Notably, it obtains $5\times$ acceleration on the large-scale heterophily dataset pokec with over 30 million edges compared to the best baseline aggregation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes SIGMA, a heterophilous GNN that integrates SimRank into the aggregation step to enable global structural similarity capture. It claims that this yields inherent distant-node similarity under heterophily (unlike local iterative baselines), with a one-time O(n) computation, SOTA accuracy, and 5× speedup on the 30M-edge pokec dataset.
Significance. If the O(n) reduction of SimRank is rigorously shown to preserve the similarity guarantee on general heterophilous graphs, the work would meaningfully advance scalable global aggregation for heterophily, directly addressing the efficiency barrier noted for prior long-range methods.
major comments (2)
- [theoretical analysis] The central O(n) claim for global aggregation (abstract and theoretical analysis) rests on an unshown reduction of SimRank; standard SimRank requires iterative fixed-point computation or quadratic matrix operations, so the manuscript must exhibit the exact closed-form or sampling equations that achieve strict linearity while retaining the distant-similarity guarantee under heterophily.
- [efficiency experiments] § on efficiency and experiments: the reported 5× acceleration versus the best baseline on pokec must be accompanied by explicit per-component timing (SimRank pre-computation vs. GNN forward pass) and confirmation that no hidden per-iteration or per-pair costs remain in the claimed one-time O(n) procedure.
minor comments (2)
- Notation for the SimRank-augmented aggregation operator should be defined explicitly (e.g., how the similarity matrix is folded into the message-passing rule) to allow reproduction.
- The abstract states “comprehensive evaluation” but the manuscript should add a table comparing wall-clock time and memory against all cited heterophilous baselines on the same hardware.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback. We address each major comment below and will revise the manuscript to improve clarity on the theoretical reduction and experimental details.
read point-by-point responses
-
Referee: [theoretical analysis] The central O(n) claim for global aggregation (abstract and theoretical analysis) rests on an unshown reduction of SimRank; standard SimRank requires iterative fixed-point computation or quadratic matrix operations, so the manuscript must exhibit the exact closed-form or sampling equations that achieve strict linearity while retaining the distant-similarity guarantee under heterophily.
Authors: Section 3.2 of the manuscript presents the SimRank integration via a one-pass structural similarity computation that avoids full iteration or quadratic operations by leveraging the graph's adjacency matrix in a linearized form. The distant-similarity guarantee under heterophily follows from the SimRank fixed-point properties applied globally in a single aggregation step. To address the request for explicitness, we will add the precise closed-form equations and the linearity derivation as a dedicated subsection in the revision. revision: yes
-
Referee: [efficiency experiments] § on efficiency and experiments: the reported 5× acceleration versus the best baseline on pokec must be accompanied by explicit per-component timing (SimRank pre-computation vs. GNN forward pass) and confirmation that no hidden per-iteration or per-pair costs remain in the claimed one-time O(n) procedure.
Authors: We will expand the efficiency experiments section with a breakdown table for the pokec dataset, explicitly reporting wall-clock times for the one-time SimRank pre-computation versus the subsequent GNN forward and backward passes. This will also include a statement confirming that all similarity values are materialized once with no per-iteration or per-pair recomputation during training or inference. revision: yes
Circularity Check
No significant circularity; derivation integrates external SimRank measure
full rationale
The paper's central claims rest on integrating the established SimRank similarity measure into GNN aggregation, supported by a theoretical analysis of global similarity capture under heterophily and an asserted O(n) one-time computation. No self-definitional equations, fitted parameters renamed as predictions, or load-bearing self-citations that reduce the result to its own inputs appear in the abstract or described derivation. The approach treats SimRank as an independent input rather than deriving it from the model's own outputs or prior self-referential results, making the chain self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Theorem III.2. ... bZu = ∑ℓ=1^∞ c^ℓ ∑v ←→P(u,v|t(2ℓ)) · Hv
-
IndisputableMonolith/Foundation/DimensionForcing.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Lemma III.5 ... O(d² / c(1-c)²ϵ) time ... top-k pruning ... O(knf)
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Semi-supervised classifica- tion with graph convolutional networks,
M. Welling and T. N. Kipf, “Semi-supervised classifica- tion with graph convolutional networks,” in J. Interna- tional Conference on Learning Representations (ICLR 2017), 2016
work page 2017
-
[2]
Inductive representation learning on large graphs,
W. Hamilton, Z. Ying, and J. Leskovec, “Inductive representation learning on large graphs,” vol. 30, 2017
work page 2017
-
[3]
Learning-based approaches for graph problems: a survey,
K. S. Yow and S. Luo, “Learning-based approaches for graph problems: a survey,” arXiv preprint arXiv:2204.01057, 2022
-
[4]
Spiking graph convolutional networks,
Z. Zhu, J. Peng, J. Li, L. Chen, Q. Yu, and S. Luo, “Spiking graph convolutional networks,” arXiv preprint arXiv:2205.02767, 2022
-
[5]
Fusing global domain information and local semantic information to classify financial documents,
M. Fan, D. Cheng, F. Yang, S. Luo, Y . Luo, W. Qian, and A. Zhou, “Fusing global domain information and local semantic information to classify financial documents,” in Proceedings of the 29th ACM International Conference on Information & Knowledge Management , 2020, pp. 2413–2420
work page 2020
-
[6]
Approximate graph propagation,
H. Wang, M. He, Z. Wei, S. Wang, Y . Yuan, X. Du, and J.- R. Wen, “Approximate graph propagation,” inProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , 2021, pp. 1686–1696
work page 2021
-
[7]
Scalable graph neural networks via bidirectional propagation,
M. Chen, Z. Wei, B. Ding, Y . Li, Y . Yuan, X. Du, and J.-R. Wen, “Scalable graph neural networks via bidirectional propagation,” Advances in neural information processing systems, vol. 33, pp. 14 556–14 566, 2020
work page 2020
-
[8]
P. Veliˇckovi´c, G. Cucurull, A. Casanova, A. Romero, P. Lio, and Y . Bengio, “Graph attention networks,” arXiv preprint arXiv:1710.10903, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[9]
Predict then propagate: Graph neural networks meet personal- ized pagerank,
J. Klicpera, A. Bojchevski, and S. Günnemann, “Predict then propagate: Graph neural networks meet personal- ized pagerank,” International Conference on Learning Representations, 2018
work page 2018
-
[10]
Beyond homophily in graph neural networks: Current limitations and effective designs,
J. Zhu, Y . Yan, L. Zhao, M. Heimann, L. Akoglu, and D. Koutra, “Beyond homophily in graph neural networks: Current limitations and effective designs,” Advances in Neural Information Processing Systems, vol. 33, pp. 7793– 7804, 2020
work page 2020
-
[11]
Mixhop: Higher-order graph convolutional architectures via sparsified neighborhood mixing,
S. Abu-El-Haija, B. Perozzi, A. Kapoor, N. Alipour- fard, K. Lerman, H. Harutyunyan, G. Ver Steeg, and A. Galstyan, “Mixhop: Higher-order graph convolutional architectures via sparsified neighborhood mixing,” in international conference on machine learning . PMLR, 2019, pp. 21–29
work page 2019
-
[12]
Geom-gcn: Geometric graph convolutional networks,
H. Pei, B. Wei, K. C.-C. Chang, Y . Lei, and B. Yang, “Geom-gcn: Geometric graph convolutional networks,” International Conference on Learning Representations , 2020
work page 2020
-
[13]
T. Yang, Y . Wang, Z. Yue, Y . Yang, Y . Tong, and J. Bai, “Graph pointer neural networks.” Proceedings of the ... AAAI Conference on Artificial Intelligence , 2022
work page 2022
-
[14]
Non-local graph neural networks,
M. Liu, Z. Wang, and S. Ji, “Non-local graph neural networks,” IEEE Transactions on Pattern Analysis and Machine Intelligence, 2021
work page 2021
-
[15]
Universal graph convolutional networks,
D. Jin, Z. Yu, C. Huo, R. Wang, X. Wang, D. He, and J. Han, “Universal graph convolutional networks,” Advances in Neural Information Processing Systems , vol. 34, pp. 10 654–10 664, 2021
work page 2021
-
[16]
Finding global homophily in graph neural networks when meeting heterophily,
X. Li, R. Zhu, Y . Cheng, C. Shan, S. Luo, D. Li, and W. Qian, “Finding global homophily in graph neural networks when meeting heterophily,” International Con- ference on Machine Learning , 2022
work page 2022
-
[17]
Large scale learning on non- homophilous graphs: New benchmarks and strong simple methods,
D. Lim, F. Hohne, X. Li, S. L. Huang, V . Gupta, O. Bhalerao, and S. N. Lim, “Large scale learning on non- homophilous graphs: New benchmarks and strong simple methods,” Advances in Neural Information Processing Systems, vol. 34, pp. 20 887–20 902, 2021
work page 2021
-
[18]
S. Suresh, V . Budde, J. Neville, P. Li, and J. Ma, “Breaking the limit of graph neural networks by improving the assortativity of graphs with local mixing patterns,” Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining , 2021
work page 2021
-
[19]
Simrank: a measure of structural- context similarity,
G. Jeh and J. Widom, “Simrank: a measure of structural- context similarity,” in Proceedings of the eighth ACM SIGKDD international conference on Knowledge discov- ery and data mining , 2002, pp. 538–543
work page 2002
-
[20]
Cast: a correlation-based adaptive spectral clustering algorithm on multi-scale data,
X. Li, B. Kao, C. Shan, D. Yin, and M. Ester, “Cast: a correlation-based adaptive spectral clustering algorithm on multi-scale data,” in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discov- ery & Data Mining , 2020, pp. 439–449
work page 2020
-
[21]
Two sides of the same coin: Heterophily and oversmoothing in graph convolutional neural networks
Y . Yan, M. Hashemi, K. Swersky, Y . Yang, and D. Koutra, “Two sides of the same coin: Heterophily and over- smoothing in graph convolutional neural networks,” arXiv preprint arXiv:2102.06462, 2021
-
[22]
Graph Neural Networks Beyond Compromise Between Attribute and Topology,
L. Yang, W. Zhou, W. Peng, B. Niu, J. Gu, C. Wang, X. Cao, and D. He, “Graph Neural Networks Beyond Compromise Between Attribute and Topology,” in Pro- ceedings of the ACM Web Conference 2022 . Virtual Event, Lyon France: ACM, Apr. 2022, pp. 1127–1135
work page 2022
-
[23]
Exact single-source simrank computation on large graphs,
H. Wang, Z. Wei, Y . Yuan, X. Du, and J.-R. Wen, “Exact single-source simrank computation on large graphs,” in Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data , 2020, pp. 653–663
work page 2020
-
[24]
Representation learning on graphs with jumping knowledge networks,
K. Xu, C. Li, Y . Tian, T. Sonobe, K.-i. Kawarabayashi, and S. Jegelka, “Representation learning on graphs with jumping knowledge networks,” in Proceedings of the 35th International Conference on Machine Learning , ser. Proceedings of Machine Learning Research, J. Dy and A. Krause, Eds., vol. 80. PMLR, 10–15 Jul 2018, pp. 5453–5462
work page 2018
-
[25]
Efficient top-k simrank-based simi- larity join,
W. Tao and G. Li, “Efficient top-k simrank-based simi- larity join,” in Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data , 2014, pp. 1603–1604
work page 2014
-
[26]
Efficient simrank track- ing in dynamic graphs,
Y . Wang, X. Lian, and L. Chen, “Efficient simrank track- ing in dynamic graphs,” in 2018 IEEE 34th international conference on data engineering (ICDE) . IEEE, 2018, pp. 545–556
work page 2018
-
[27]
Gsim: A graph neural network based rele- vance measure for heterogeneous graphs,
L. Luo, Y . Fang, M. Lu, X. Cao, X. Zhang, and 13 W. Zhang, “Gsim: A graph neural network based rele- vance measure for heterogeneous graphs,” arXiv preprint arXiv:2208.06144, 2022
-
[28]
Massively parallel single- source simranks in o(logn) rounds,
S. Luo and Z. Zhu, “Massively parallel single- source simranks in o(logn) rounds,” arXiv preprint arXiv:2304.04015, 2023
-
[29]
Cluster-gcn: An efficient algorithm for training deep and large graph convolutional networks,
W.-L. Chiang, X. Liu, S. Si, Y . Li, S. Bengio, and C.-J. Hsieh, “Cluster-gcn: An efficient algorithm for training deep and large graph convolutional networks,” in Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining , 2019, pp. 257–266
work page 2019
-
[30]
Adaptive universal generalized pagerank graph neural network,
E. Chien, J. Peng, P. Li, and O. Milenkovic, “Adaptive universal generalized pagerank graph neural network,” in International Conference on Learning Representations ,
-
[31]
Available: https://openreview.net/forum? id=n6jl7fLxrP
[Online]. Available: https://openreview.net/forum? id=n6jl7fLxrP
-
[32]
Dif- fusion Improves Graph Learning,
J. Gasteiger, S. Weißenberger, and S. Günnemann, “Dif- fusion Improves Graph Learning,” in 32nd Advances in Neural Information Processing Systems , 2019
work page 2019
-
[33]
Scaling graph neural networks with approximate pagerank,
A. Bojchevski, J. Gasteiger, B. Perozzi, A. Kapoor, M. Blais, B. Rózemberczki, M. Lukasik, and S. Günne- mann, “Scaling graph neural networks with approximate pagerank,” in Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, 2020, pp. 2464–2473
work page 2020
-
[34]
Do transformers really perform badly for graph representation?
C. Ying, T. Cai, S. Luo, S. Zheng, G. Ke, D. He, Y . Shen, and T.-Y . Liu, “Do transformers really perform badly for graph representation?” Advances in neural information processing systems, vol. 34, pp. 28 877–28 888, 2021
work page 2021
-
[35]
Simple and deep graph convolutional networks,
M. Chen, Z. Wei, Z. Huang, B. Ding, and Y . Li, “Simple and deep graph convolutional networks,” in International Conference on Machine Learning . PMLR, 2020, pp. 1725–1735
work page 2020
-
[36]
Simplifying graph convolutional networks,
F. Wu, A. Souza, T. Zhang, C. Fifty, T. Yu, and K. Wein- berger, “Simplifying graph convolutional networks,” in Proceedings of the 36th International Conference on Machine Learning, K. Chaudhuri and R. Salakhutdinov, Eds., vol. 97, 6 2019, pp. 6861–6871
work page 2019
-
[37]
Gbk-gnn: Gated bi-kernel graph neural networks for mod- eling both homophily and heterophily,
L. Du, X. Shi, Q. Fu, X. Ma, H. Liu, S. Han, and D. Zhang, “Gbk-gnn: Gated bi-kernel graph neural networks for mod- eling both homophily and heterophily,” in Proceedings of the ACM Web Conference 2022 , 2022, pp. 1550–1558
work page 2022
-
[38]
T. Wang, D. Jin, R. Wang, D. He, and Y . Huang, “Powerful graph convolutional networks with adaptive propagation mechanism for homophily and heterophily,” in Proceedings of the AAAI conference on artificial intelligence, vol. 36, no. 4, 2022, pp. 4210–4218
work page 2022
-
[39]
Similarity-navigated graph neural networks for node classification,
M. Zou, Z. Gan, R. Cao, C. Guan, and S. Leng, “Similarity-navigated graph neural networks for node classification,” Information Sciences , vol. 633, pp. 41– 69, 2023
work page 2023
-
[40]
Predicting global label relationship matrix for graph neural networks under heterophily,
L. Liang, X. Hu, Z. Xu, Z. Song, and I. King, “Predicting global label relationship matrix for graph neural networks under heterophily,” Advances in Neural Information Processing Systems, vol. 36, pp. 10 909–10 921, 2023
work page 2023
-
[41]
Scara: Scalable graph neural networks with feature-oriented optimization,
N. Liao, D. Mo, S. Luo, X. Li, and P. Yin, “Scara: Scalable graph neural networks with feature-oriented optimization,” Proceedings of the VLDB Endowment , vol. 15, no. 11, pp. 3240–3248, 2022
work page 2022
-
[42]
LD2: Scalable heterophilous graph neural network with decoupled em- bedding,
N. Liao, S. Luo, X. Li, and J. Shi, “LD2: Scalable heterophilous graph neural network with decoupled em- bedding,” in Advances in Neural Information Processing Systems, vol. 36, Dec 2023
work page 2023
-
[43]
Scalable decoupling graph neural networks with feature-oriented optimization,
N. Liao, D. Mo, S. Luo, X. Li, and P. Yin, “Scalable decoupling graph neural networks with feature-oriented optimization,” The VLDB Journal , vol. 33, no. 3, pp. 667–683, May 2024. [Online]. Available: https: //link.springer.com/article/10.1007/s00778-023-00829-6
-
[44]
Bird: Efficient approximation of bidirectional hidden personalized pagerank,
H. Liu and S. Luo, “Bird: Efficient approximation of bidirectional hidden personalized pagerank,” Proceedings of the VLDB Endowment , vol. 17, no. 9, pp. 2255–2268, 2024
work page 2024
-
[45]
Agenda: Robust personalized pager- anks in evolving graphs,
D. Mo and S. Luo, “Agenda: Robust personalized pager- anks in evolving graphs,” in Proceedings of the 30th ACM International Conference on Information & Knowledge Management, 2021, pp. 1315–1324
work page 2021
-
[46]
Topology- monitorable contrastive learning on dynamic graphs,
Z. Zhu, K. Wang, H. Liu, J. Li, and S. Luo, “Topology- monitorable contrastive learning on dynamic graphs,” in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining , 2024, pp. 4700– 4711
work page 2024
-
[47]
S. Luan, C. Hua, Q. Lu, J. Zhu, M. Zhao, S. Zhang, X.-W. Chang, and D. Precup, “Is heterophily a real nightmare for graph neural networks to do node classification?” arXiv preprint arXiv:2109.05641, 2021
-
[48]
Multi- scale attributed node embedding,
B. Rozemberczki, C. Allen, and R. Sarkar, “Multi- scale attributed node embedding,” Journal of Complex Networks, vol. 9, no. 2, p. cnab014, 2021
work page 2021
-
[49]
Deep gaussian embedding of graphs: Unsupervised inductive learning via ranking,
A. Bojchevski and S. Günnemann, “Deep gaussian embedding of graphs: Unsupervised inductive learning via ranking,” in International Conference on Learning Representations, 2018. [Online]. Available: https:// openreview.net/forum?id=r1ZdKJ-0W
work page 2018
-
[50]
Collective classification in network data,
P. Sen, G. Namata, M. Bilgic, L. Getoor, B. Galligher, and T. Eliassi-Rad, “Collective classification in network data,” AI magazine, vol. 29, no. 3, pp. 93–93, 2008
work page 2008
-
[51]
Query-driven active surveying for collective classification,
G. Namata, B. London, L. Getoor, B. Huang, and U. Edu, “Query-driven active surveying for collective classification,” in 10th International Workshop on Mining and Learning with Graphs, vol. 8, 2012, p. 1
work page 2012
-
[52]
On the use of arxiv as a dataset,
C. B. Clement, M. Bierbaum, K. P. O’Keeffe, and A. A. Alemi, “On the use of arxiv as a dataset,” 2019
work page 2019
-
[53]
Snap datasets: Stanford large network dataset collection,
Jure Leskovec and Andrej Krevl, “Snap datasets: Stanford large network dataset collection,” http://snap.stanford.edu/ data, 2014. 14
work page 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.