Informative Graph Structure Learning
Pith reviewed 2026-05-19 21:04 UTC · model grok-4.3
The pith
InGSL reduces redundant edges in graph structure learning by balancing similarity and diversity through a mutual-information strategy.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that similarity-based edge construction creates substantial structure redundancy and that a mutual-information-guided strategy which jointly accounts for similarity and diversity can produce more informative graphs. InGSL implements this strategy as a plug-in module that integrates into existing GSL frameworks, resulting in measurable accuracy gains while using a smaller total number of edges on representative benchmarks.
What carries the argument
The mutual-information-guided learning strategy that scores candidate edges by their contribution to both similarity and diversity, thereby pruning redundant connections.
If this is right
- GNN training and inference can run with lower memory and time costs when the input graph is constructed by InGSL.
- Existing graph structure learners gain accuracy simply by adding the mutual-information term rather than redesigning their core objective.
- Downstream tasks that rely on graph connectivity benefit from graphs that retain both local similarity and global diversity.
- The approach suggests that edge budgets in learned graphs can be treated as an explicit optimization target rather than an after-effect.
Where Pith is reading between the lines
- The same balance of similarity and diversity could be applied to dynamic or temporal graphs where edge redundancy grows over time.
- If the mutual-information term proves stable across domains, it may reduce the need for heavy post-processing or regularization in graph construction pipelines.
- Practitioners working with very large graphs might test whether the method scales by substituting cheaper approximations for the mutual-information calculation.
Load-bearing premise
Mutual information can be computed from embeddings in a way that reliably picks informative edges without losing important connections or adding enough overhead to erase the savings.
What would settle it
Running InGSL on the same six GSL baselines and finding either no reduction in edge count or no accuracy improvement on standard node-classification and link-prediction tasks would disprove the central claim.
Figures
read the original abstract
The quality of graph-structured data is fundamental to the success of modern graph analysis techniques such as Graph Neural Networks (GNNs). However, real-world graph data is often suboptimal, suffering from issues such as noise and incomplete connections. Graph Structure Learning (GSL) has emerged as a promising technique that adaptively optimizes node connections. However, we observe that the effectiveness of GSL often comes at the cost of a dramatic expansion in edge count, resulting in significant storage and computational overhead. In this work, we reveal that this limitation stems from the prevalent use of similarity-based edge construction, which predominantly connects highly similar neighbors based on their embeddings, introducing substantial structure redundancy. To address this, we propose a novel Informative Graph Structure Learning method (InGSL), which jointly considers both similarity and diversity in edge construction by incorporating a mutual-information-guided learning strategy. Notably, InGSL serves as a plug-in module that can be seamlessly integrated into existing GSL frameworks. Through extensive experiments on six representative GSL methods, we demonstrate that InGSL achieves significant performance improvements at a reduced number of edges.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Informative Graph Structure Learning (InGSL) as a plug-in module for existing Graph Structure Learning (GSL) frameworks. It argues that similarity-based edge construction introduces substantial structure redundancy, inflating edge counts and computational costs. InGSL jointly optimizes similarity and diversity via a mutual-information-guided strategy to produce more informative, sparser graphs, claiming significant performance gains with fewer edges across six representative GSL methods.
Significance. If the empirical results hold with proper controls, InGSL could meaningfully improve the practicality of GSL by addressing a common overhead issue while preserving or enhancing downstream task performance. The plug-in design is a clear strength for integration into existing pipelines. However, the absence of quantitative metrics, ablations, and overhead analysis in the reported experiments limits evaluation of net savings and generalizability.
major comments (3)
- Abstract: the central claim of 'significant performance improvements at a reduced number of edges' is presented without any quantitative metrics, error bars, edge-count deltas, or statistical tests, preventing verification that gains are attributable to the MI term rather than baseline weaknesses or hyperparameter choices.
- Experiments section (six-method evaluation): no ablation isolating the mutual-information component, no wall-clock overhead measurements, and no reporting of how the joint similarity-diversity objective affects critical edge retention on graphs where similarity already encodes label information; this undermines attribution of the claimed net savings.
- Method description: the complexity and implementation details of the MI estimator (e.g., whether computed on raw features, embeddings, or via sampling) are not analyzed, leaving open whether the added term offsets the claimed edge reductions in practice.
minor comments (2)
- Clarify the precise formulation of the mutual-information term and its weighting relative to the similarity objective.
- Ensure all figures and tables include explicit edge-count comparisons and runtime measurements for the six baselines with and without InGSL.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive review of our manuscript on Informative Graph Structure Learning (InGSL). The comments highlight important areas for clarification and strengthening, particularly around quantitative reporting and experimental rigor. We address each major comment below and will incorporate the suggested revisions to improve the paper's clarity and verifiability.
read point-by-point responses
-
Referee: Abstract: the central claim of 'significant performance improvements at a reduced number of edges' is presented without any quantitative metrics, error bars, edge-count deltas, or statistical tests, preventing verification that gains are attributable to the MI term rather than baseline weaknesses or hyperparameter choices.
Authors: We agree that the abstract would benefit from quantitative support to make the central claims immediately verifiable. In the revised manuscript, we will update the abstract to report specific average performance gains (e.g., accuracy improvements across the six GSL methods) and edge-count reductions (with deltas relative to baselines), along with references to the corresponding experimental tables. To address attribution, our experiments already fix hyperparameters and compare directly against the original GSL baselines; we will add a brief statement in the abstract noting that the observed gains are due to the added MI-guided diversity term under these controlled settings. We will also include error bars from multiple runs where space permits. revision: yes
-
Referee: Experiments section (six-method evaluation): no ablation isolating the mutual-information component, no wall-clock overhead measurements, and no reporting of how the joint similarity-diversity objective affects critical edge retention on graphs where similarity already encodes label information; this undermines attribution of the claimed net savings.
Authors: We acknowledge these gaps in the current experimental presentation. We will add a dedicated ablation study comparing the full InGSL objective against a similarity-only variant to isolate the contribution of the mutual-information diversity term. Wall-clock overhead measurements (training time with and without InGSL) will be included to quantify any added cost against the benefits of reduced edges. For the point on edge retention where similarity encodes label information, we will expand the discussion in the experiments section with analysis on datasets such as Cora, showing that the MI term still promotes informative diversity without discarding label-relevant edges; this will be supported by additional metrics on retained edge quality. revision: yes
-
Referee: Method description: the complexity and implementation details of the MI estimator (e.g., whether computed on raw features, embeddings, or via sampling) are not analyzed, leaving open whether the added term offsets the claimed edge reductions in practice.
Authors: We appreciate this observation on implementation details. The MI estimator in InGSL is applied to learned node embeddings using a sampling-based approximation for scalability (as outlined in the method), but we agree an explicit analysis is warranted. In the revised method section, we will provide the precise formulation of the MI estimator, clarify that it operates on embeddings with mini-batch sampling, and include a complexity breakdown (showing the added term scales as O(B log B) per batch where B is batch size). We will also discuss how this overhead is offset by downstream savings from fewer edges in GNN training, supported by the new wall-clock results mentioned above. revision: yes
Circularity Check
No significant circularity; empirical plug-in with external validation
full rationale
The paper introduces InGSL as a plug-in module that augments existing GSL frameworks with a mutual-information term to balance similarity and diversity in edge selection. No load-bearing derivation, equation, or uniqueness claim reduces by construction to a fitted parameter or prior self-citation; the central mechanism is presented as an additive empirical strategy whose performance is asserted via experiments on six independent methods rather than internal re-derivation of its own inputs. The abstract and method description contain no self-referential fitting loops or renamed known results that would trigger any of the enumerated circularity patterns.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
we reveal that this limitation stems from the prevalent use of similarity-based edge construction, which predominantly connects highly similar neighbors based on their embeddings, introducing substantial structure redundancy... InGSL employs a learnable selection procedure, guided by mutual information between the selected neighbors and all candidate neighbors
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Thomas N. Kipf and Max Welling. Semi-supervised classification with graph convolutional networks. InInternational Conference on Learning Representations, Red Hook; NY; United States, 2017. Curran Associates, Inc
work page 2017
-
[2]
Neural message passing for quantum chemistry
Justin Gilmer, Samuel S Schoenholz, Patrick F Riley, Oriol Vinyals, and George E Dahl. Neural message passing for quantum chemistry. InInternational Conference on Machine Learning, pages 1263–1272, Sydney NSW Australia, 2017. PMLR, JMLR.org
work page 2017
-
[3]
Michaël Defferrard, Xavier Bresson, and Pierre Vandergheynst. Convolutional neural networks on graphs with fast localized spectral filtering.Advances in Neural Information Processing Systems, 29, 2016
work page 2016
-
[4]
Yu Cui, Feng Liu, Jiawei Chen, Canghong Jin, Xingyu Lou, Changwang Zhang, Jun Wang, Yuegang Sun, and Can Wang. Hatllm: Hierarchical attention mask- ing for enhanced collaborative modeling in llm-based recommendation.CoRR, abs/2510.10955, 2025
-
[5]
Simplifying graph convolutional networks
Felix Wu, Amauri Souza, Tianyi Zhang, Christopher Fifty, Tao Yu, and Kilian Weinberger. Simplifying graph convolutional networks. InInternational Confer- ence on Machine Learning, pages 6861–6871, Long Beach, California, USA, 2019. PMLR, Curran Associates, Inc
work page 2019
-
[6]
Adaptive universal generalized pagerank graph neural network
Eli Chien, Jianhao Peng, Pan Li, and Olgica Milenkovic. Adaptive universal generalized pagerank graph neural network. In9th International Conference on Learning Representations, ICLR 2021, Virtual Event, Austria, May 3-7, 2021. OpenReview.net, 2021
work page 2021
-
[7]
Polynormer: Polynomial-expressive graph transformer in linear time
Chenhui Deng, Zichao Yue, and Zhiru Zhang. Polynormer: Polynomial-expressive graph transformer in linear time. InThe Twelfth International Conference on Learning Representations, Red Hook; NY; United States, 2024. Curran Associates Inc
work page 2024
-
[8]
Scalable and effective implicit graph neural networks on large graphs
Juncheng Liu, Bryan Hooi, Kenji Kawaguchi, Yiwei Wang, Chaosheng Dong, and Xiaokui Xiao. Scalable and effective implicit graph neural networks on large graphs. InThe Twelfth International Conference on Learning Representations, Red Hook; NY; United States, 2024. Curran Associates, Inc
work page 2024
-
[9]
OpenGSL: A comprehensive benchmark for graph structure learning
Zhiyao Zhou, Sheng Zhou, Bochao Mao, Xuanyi Zhou, Jiawei Chen, Qiaoyu Tan, Daochen Zha, Yan Feng, Chun Chen, and Can Wang. OpenGSL: A comprehensive benchmark for graph structure learning. InThirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track, Red Hook, NY, USA, 2023. Curran Associates Inc
work page 2023
-
[10]
GSLB: The graph structure learning benchmark
Zhixun Li, Liang Wang, Xin Sun, Yifan Luo, Yanqiao Zhu, Dingshuo Chen, Yingtao Luo, Xiangxin Zhou, Qiang Liu, Shu Wu, Liang Wang, and Jeffrey Xu Yu. GSLB: The graph structure learning benchmark. InThirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track, Red Hook; NY; United States, 2023. Curran Associates Inc
work page 2023
-
[11]
Learning discrete structures for graph neural networks
Luca Franceschi, Mathias Niepert, Massimiliano Pontil, and Xiao He. Learning discrete structures for graph neural networks. InInternational Conference on Machine Learning, pages 1972–1982, Long Beach, California, USA, 2019. PMLR, Curran Associates Inc
work page 1972
-
[12]
Graph structure learning for robust graph neural networks
Wei Jin, Yao Ma, Xiaorui Liu, Xianfeng Tang, Suhang Wang, and Jiliang Tang. Graph structure learning for robust graph neural networks. InProceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data mining, pages 66–74, New York, NY, USA, 2020. Association for Computing Machinery
work page 2020
-
[13]
Prose: Graph structure learning via progressive strategy
Huizhao Wang, Yao Fu, Tao Yu, Linghui Hu, Weihao Jiang, and Shiliang Pu. Prose: Graph structure learning via progressive strategy. InProceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 2337–2348, New York, NY, USA, 2023. Association for Computing Machinery
work page 2023
-
[14]
Self-guided robust graph structure refinement
Yeonjun In, Kanghoon Yoon, Kibum Kim, Kijung Shin, and Chanyoung Park. Self-guided robust graph structure refinement. InProceedings of the ACM on Web Conference 2024, pages 697–708, New York, NY, USA, 2024. Association for Computing Machinery
work page 2024
-
[15]
Towards scalable and efficient graph structure learning
Siqi Shen, Wentao Zhang, Chengshuo Du, Chong Chen, Fangcheng Fu, Yingxia Shao, and Bin Cui. Towards scalable and efficient graph structure learning. In2025 IEEE 41st International Conference on Data Engineering (ICDE), pages 1759–1772. IEEE Computer Society, 2025
work page 2025
-
[16]
Graph structure refinement with energy-based contrastive learning
Xianlin Zeng, Yufeng Wang, Yuqi Sun, Guodong Guo, Wenrui Ding, and Baochang Zhang. Graph structure refinement with energy-based contrastive learning. In Toby Walsh, Julie Shah, and Zico Kolter, editors,AAAI-25, Sponsored by the Association for the Advancement of Artificial Intelligence, February 25 - March 4, 2025, Philadelphia, PA, USA, pages 22326–22335...
work page 2025
-
[17]
Graph-revised convolutional network
Donghan Yu, Ruohong Zhang, Zhengbao Jiang, Yuexin Wu, and Yiming Yang. Graph-revised convolutional network. InMachine Learning and Knowledge Discovery in Databases: European Conference, ECML PKDD 2020, Ghent, Belgium, September 14–18, 2020, Proceedings, Part III, pages 378–393, Berlin, Heidelberg,
work page 2020
-
[18]
Springer, Springer-Verlag
-
[19]
How powerful are k-hop message passing graph neural networks
Jiarui Feng, Yixin Chen, Fuhai Li, Anindya Sarkar, and Muhan Zhang. How powerful are k-hop message passing graph neural networks. In Sanmi Koyejo, S. Mohamed, A. Agarwal, Danielle Belgrave, K. Cho, and A. Oh, editors,Advances in Neural Information Processing Systems 35: Annual Conference on Neural Infor- mation Processing Systems 2022, NeurIPS 2022, New O...
work page 2022
-
[20]
Ho- momorphism expressivity of spectral invariant graph neural networks
Jingchu Gai, Yiheng Du, Bohang Zhang, Haggai Maron, and Liwei Wang. Ho- momorphism expressivity of spectral invariant graph neural networks. InThe Thirteenth International Conference on Learning Representations, ICLR 2025, Sin- gapore, April 24-28, 2025. OpenReview.net, 2025
work page 2025
-
[21]
Towards unsupervised deep graph structure learning
Yixin Liu, Yu Zheng, Daokun Zhang, Hongxu Chen, Hao Peng, and Shirui Pan. Towards unsupervised deep graph structure learning. InProceedings of the ACM Web Conference 2022, pages 1392–1403, New York, NY, USA, 2022. Association for Computing Machinery
work page 2022
-
[22]
Dongcheng Zou, Hao Peng, Xiang Huang, Renyu Yang, Jianxin Li, Jia Wu, Chun- yang Liu, and Philip S Yu. Se-gsl: A general and effective graph structure learning framework through structural entropy optimization. InProceedings of the ACM Web Conference 2023, pages 499–510, New York, NY, USA, 2023. Association for Computing Machinery
work page 2023
-
[23]
Yu Chen and Mohammed Zaki. Iterative deep graph learning for graph neural networks: Better and robust node embeddings.Advances in Neural Information Processing Systems, 33:19314–19326, 2020
work page 2020
-
[24]
Compact graph structure learning via mutual information compression
Nian Liu, Xiao Wang, Lingfei Wu, Yu Chen, Xiaojie Guo, and Chuan Shi. Compact graph structure learning via mutual information compression. InProceedings of the ACM Web Conference 2022, pages 1601–1610, New York, NY, USA, 2022. Association for Computing Machinery
work page 2022
-
[25]
Jeongmin Bae, Donghyoung Han, and Min-Soo Kim. Flexgnn: A high- performance, large-scale full-graph gnn system with best-effort training plan optimization. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2, KDD ’25, page 59–70, New York, NY, USA, 2025. Association for Computing Machinery
work page 2025
-
[26]
Progressive stacking for scalable graph condensation
Yibing Bai, Min Gao, Zongwei Wang, Xinyi Gao, and Wentao Li. Progressive stacking for scalable graph condensation. InProceedings of the 31st ACM SIGKDD Conference acronym ’XX, June 03–05, 2018, Woodstock, NY Shen Han, et al. Conference on Knowledge Discovery and Data Mining V.2, KDD ’25, page 83–94, New York, NY, USA, 2025. Association for Computing Machinery
work page 2018
-
[27]
Contrastive graph condensation: Advancing data versatility through self- supervised learning
Xinyi Gao, Yayong Li, Tong Chen, Guanhua Ye, Wentao Zhang, and Hongzhi Yin. Contrastive graph condensation: Advancing data versatility through self- supervised learning. InProceedings of the 31st ACM SIGKDD Conference on Knowledge Discovery and Data Mining V.2, KDD ’25, page 661–672, New York, NY, USA, 2025. Association for Computing Machinery
work page 2025
-
[28]
Paul Pu Liang, Zihao Deng, Martin Q. Ma, James Y. Zou, Louis-Philippe Morency, and Ruslan Salakhutdinov. Factorized contrastive learning: Going beyond multi- view redundancy. In Alice Oh, Tristan Naumann, Amir Globerson, Kate Saenko, Moritz Hardt, and Sergey Levine, editors,Advances in Neural Information Pro- cessing Systems 36: Annual Conference on Neura...
work page 2023
-
[29]
Collective classification in network data.AI magazine, 29(3):93–93, 2008
Prithviraj Sen, Galileo Namata, Mustafa Bilgic, Lise Getoor, Brian Galligher, and Tina Eliassi-Rad. Collective classification in network data.AI magazine, 29(3):93–93, 2008
work page 2008
-
[30]
Open graph benchmark: Datasets for machine learning on graphs
Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. In Hugo Larochelle, Marc’Aurelio Ranzato, Raia Hadsell, Maria-Florina Balcan, and Hsuan-Tien Lin, editors,Advances in Neural Information Processing Systems 33: Annual Conference on ...
work page 2020
-
[31]
Label informed attributed network em- bedding
Xiao Huang, Jundong Li, and Xia Hu. Label informed attributed network em- bedding. InProceedings of the tenth ACM International Conference on Web Search and Data Mining, pages 731–739, New York, NY, USA, 2017. Association for Computing Machinery
work page 2017
-
[32]
Oleg Platonov, Denis Kuznedelev, Michael Diskin, Artem Babenko, and Liudmila Prokhorenkova. A critical look at the evaluation of gnns under heterophily: Are we really making progress? InThe Eleventh International Conference on Learning Representations, Kigali, Rwanda, 2023. Curran Associates, Inc
work page 2023
-
[33]
Uncertainty-aware graph structure learn- ing
Shen Han, Zhiyao Zhou, Jiawei Chen, Zhezheng Hao, Sheng Zhou, Gang Wang, Yan Feng, Chun Chen, and Can Wang. Uncertainty-aware graph structure learn- ing. InProceedings of the ACM on Web Conference 2025, pages 4863–4874, 2025
work page 2025
-
[34]
Graph structure estimation neural networks
Ruijia Wang, Shuai Mou, Xiao Wang, Wanpeng Xiao, Qi Ju, Chuan Shi, and Xing Xie. Graph structure estimation neural networks. InProceedings of the Web Conference 2021, pages 342–353, New York, NY, USA, 2021. Association for Computing Machinery
work page 2021
-
[35]
Graph lottery ticket automated
Guibin Zhang, Kun Wang, Wei Huang, Yanwei Yue, Yang Wang, Roger Zimmer- mann, Aojun Zhou, Dawei Cheng, Jin Zeng, and Yuxuan Liang. Graph lottery ticket automated. InThe Twelfth International Conference on Learning Represen- tations, ICLR 2024, Vienna, Austria, May 7-11, 2024. OpenReview.net, 2024
work page 2024
-
[36]
Adaptive universal generalized pagerank graph neural network.arXiv preprint arXiv:2006.07988,
Eli Chien, Jianhao Peng, Pan Li, and Olgica Milenkovic. Adaptive universal generalized pagerank graph neural network.ArXiv preprint arXiv:2006.07988, 2020
-
[37]
Com- bining neural networks with personalized pagerank for classification on graphs
Johannes Gasteiger, Aleksandar Bojchevski, and Stephan Günnemann. Com- bining neural networks with personalized pagerank for classification on graphs. InInternational Conference on Learning Representations, Red Hook; NY; United States, 2019. Curran Associates Inc
work page 2019
-
[38]
Yiwei Sun, Suhang Wang, Xianfeng Tang, Tsung-Yu Hsieh, and Vasant Honavar. Adversarial attacks on graph neural networks via node injections: A hierarchical reinforcement learning approach. InProceedings of the Web Conference 2020, pages 673–683, 2020
work page 2020
-
[39]
Spectral net- works and locally connected networks on graphs
Joan Bruna, Wojciech Zaremba, Arthur Szlam, and Yann LeCun. Spectral net- works and locally connected networks on graphs. In2nd International Conference on Learning Representations, ICLR 2014, Red Hook; NY; United States, 2013. Curran Associates Inc
work page 2014
-
[40]
Qimai Li, Xiaotong Zhang, Han Liu, Quanyu Dai, and Xiao-Ming Wu. Dimen- sionwise separable 2-d graph convolution for unsupervised and semi-supervised learning on graphs. InProceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 953–963, New York, NY, USA, 2021. Association for Computing Machinery
work page 2021
-
[41]
Mingguo He, Zhewei Wei, and Ji-Rong Wen. Convolutional neural networks on graphs with chebyshev approximation, revisited.Advances in Neural Information Processing Systems, 35:7264–7276, 2022
work page 2022
-
[42]
Generating graphs via spectral diffusion
Giorgia Minello, Alessandro Bicciato, Luca Rossi, Andrea Torsello, and Luca Cosmo. Generating graphs via spectral diffusion. InThe Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025
work page 2025
-
[43]
Sketch-gnn: Scalable graph neural networks with sublinear training complexity
Mucong Ding, Tahseen Rabbani, Bang An, Evan Wang, and Furong Huang. Sketch-gnn: Scalable graph neural networks with sublinear training complexity. Advances in Neural Information Processing Systems, 35:2930–2943, 2022
work page 2022
-
[44]
Musegnn: Forming scalable, convergent GNN layers that minimize a sampling-based energy
Haitian Jiang, Renjie Liu, Zengfeng Huang, Yichuan Wang, Xiao Yan, Zhenkun Cai, Minjie Wang, and David Wipf. Musegnn: Forming scalable, convergent GNN layers that minimize a sampling-based energy. InThe Thirteenth International Conference on Learning Representations, ICLR 2025, Singapore, April 24-28, 2025. OpenReview.net, 2025
work page 2025
-
[45]
Graph mixture of experts: learning on large-scale graphs with explicit diversity modeling
Haotao Wang, Ziyu Jiang, Yuning You, Yan Han, Gaowen Liu, Jayanth Srinivasa, Ramana Rao Kompella, and Zhangyang Wang. Graph mixture of experts: learning on large-scale graphs with explicit diversity modeling. InProceedings of the 37th International Conference on Neural Information Processing Systems, NIPS ’23, Red Hook, NY, USA, 2023. Curran Associates Inc
work page 2023
-
[46]
Fug: Feature-universal graph contrastive pre-training for graphs with diverse node features
Jitao Zhao, Di Jin, Meng Ge, Lianze Shan, Xin Wang, Dongxiao He, and Zhiyong Feng. Fug: Feature-universal graph contrastive pre-training for graphs with diverse node features. In A. Globerson, L. Mackey, D. Belgrave, A. Fan, U. Paquet, J. Tomczak, and C. Zhang, editors,Advances in Neural Information Processing Systems, volume 37, pages 4003–4034. Curran A...
work page 2024
-
[47]
Gecia Bravo Hermsdorff and Lee M. Gunderson. A unifying framework for spectrum-preserving graph sparsification and coarsening. In Hanna M. Wallach, Hugo Larochelle, Alina Beygelzimer, Florence d’Alché-Buc, Emily B. Fox, and Roman Garnett, editors,Advances in Neural Information Processing Systems 32: Annual Conference on Neural Information Processing Syste...
work page 2019
-
[48]
A unified lottery ticket hypothesis for graph neural networks
Tianlong Chen, Yongduo Sui, Xuxi Chen, Aston Zhang, and Zhangyang Wang. A unified lottery ticket hypothesis for graph neural networks. InInternational conference on machine learning, pages 1695–1706. PMLR, 2021
work page 2021
-
[49]
Haoran You, Zhihan Lu, Zijian Zhou, Yonggan Fu, and Yingyan Lin. Early-bird gcns: Graph-network co-optimization towards more efficient GCN training and inference via drawing early-bird lottery tickets. InThirty-Sixth AAAI Conference on Artificial Intelligence, AAAI 2022, Thirty-Fourth Conference on Innovative Appli- cations of Artificial Intelligence, IAA...
work page 2022
-
[50]
Dslr: Diversity enhancement and structure learning for rehearsal-based graph continual learning
Seungyoon Choi, Wonjoong Kim, Sungwon Kim, Yeonjun In, Sein Kim, and Chanyoung Park. Dslr: Diversity enhancement and structure learning for rehearsal-based graph continual learning. InProceedings of the ACM Web Confer- ence 2024, WWW ’24, page 733–744, New York, NY, USA, 2024. Association for Computing Machinery
work page 2024
-
[51]
Beyond redundancy: Information- aware unsupervised multiplex graph structure learning
Zhixiang Shen, Shuo Wang, and Zhao Kang. Beyond redundancy: Information- aware unsupervised multiplex graph structure learning. In Amir Globersons, Lester Mackey, Danielle Belgrave, Angela Fan, Ulrich Paquet, Jakub M. Tomczak, and Cheng Zhang, editors,Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing ...
work page 2024
-
[52]
Rajendra Bhatia and Chandler Davis. A cauchy-schwarz inequality for operators with applications.Linear algebra and its applications, 223:119–129, 1995
work page 1995
-
[53]
Using the triangle inequality to accelerate k-means
Charles Elkan. Using the triangle inequality to accelerate k-means. InProceedings of the 20th international conference on Machine Learning (ICML-03), pages 147–153, 2003
work page 2003
-
[54]
On the Properties of the Softmax Function with Application in Game Theory and Reinforcement Learning
Bolin Gao and Lacra Pavel. On the properties of the softmax function with applica- tion in game theory and reinforcement learning.arXiv preprint arXiv:1704.00805, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[55]
Cross-entropy loss functions: Theoretical analysis and applications
Anqi Mao, Mehryar Mohri, and Yutao Zhong. Cross-entropy loss functions: Theoretical analysis and applications. InInternational conference on Machine learning, pages 23803–23828. pmlr, 2023. Informative Graph Structure Learning Conference acronym ’XX, June 03–05, 2018, Woodstock, NY A Proof A.1 Proof of Lemma 1 To prove Lemma 1, we first introduce the Cauc...
work page 2023
-
[56]
By the triangle inequality, ∥Z (𝑙) 𝑖 W𝑐 −Z (𝑙+1) 𝑖 W𝑐 ∥2 =∥ ∑︁ 𝑣 𝑗 ∈ N (𝑣𝑖 ) S𝑖 𝑗 (Z (𝑙) 𝑖 −Z (𝑙) 𝑗 )W𝑐 ∥2 (20) ≤ ∑︁ 𝑗∈ N (𝑖) S𝑖 𝑗 ∥ (Z(𝑙) 𝑖 −Z (𝑙) 𝑗 ) ∥2 · ∥W𝑐 ∥2. (21) ∀𝑣 𝑗 ∈ N (𝑣 𝑖 ),the cosine similarity cos(Z(𝑙) 𝑖 , Z(𝑙) 𝑗 ) ≥𝜖 and ∥𝑍 (𝑙) 𝑖 ∥ ≤ 𝐵, then ∥Z (𝑙) 𝑖 −Z (𝑙) 𝑗 ∥2 ≤𝐵 √︃ 2−2 cos(Z (𝑙) 𝑖 ,Z (𝑙) 𝑗 ) ≤𝐵 √ 2−2𝜖,(22) thus |L (Z(𝑙+1) 𝑖 W𝑐,Y 𝑖 ) − L...
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.