S2Aligner: Pair-Efficient and Transferable Pre-Training for Sparse Text-Attributed Graphs
Pith reviewed 2026-05-20 12:45 UTC · model grok-4.3
The pith
S2Aligner decouples semantic alignment from structural modeling for reliable pre-training on sparse text-attributed graphs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
S2Aligner decomposes graph-text representations into semantic and structural components, uses structure-oriented reconstruction with consistency control to inject reliable topology cues into text representations, and suppresses inconsistent structural signals under textual sparsity. It introduces sparsity-aware cross-domain risk balancing, which calibrates domain risks through a global-domain density ratio and downweights unreliable sparse samples via graph reliability estimation. Theoretical analysis shows that this objective reduces cross-domain generalization gaps by controlling domain risk discrepancy.
What carries the argument
The decomposition of graph-text representations into independent semantic and structural components that allows topology-aware signals to be injected via reconstruction without contaminating the shared semantic space.
If this is right
- Pre-training succeeds on graphs where textual anchors are missing or noisy.
- Cross-domain transfer bias from sparsity is reduced through risk calibration.
- Structural information enhances alignment while semantic space remains clean.
- Generalization gaps shrink as domain risk discrepancies are controlled.
Where Pith is reading between the lines
- The decoupling approach may apply to other sparse multimodal learning problems beyond graphs.
- Reliability estimation could help in selecting which nodes to annotate in active learning for graphs.
- The framework suggests structure acts as a stabilizer rather than a direct supervisor in low-text regimes.
Load-bearing premise
Graph representations can be cleanly decomposed into independent semantic and structural components such that topology-aware signals can be injected via reconstruction without contaminating the shared semantic space.
What would settle it
An experiment where the structure reconstruction term is ablated and performance under high sparsity remains the same or improves would falsify the claim that the decomposition prevents contamination and enables better alignment.
Figures
read the original abstract
Pre-training on text-attributed graphs (TAGs) is central to building transferable graph foundation models, where LLM-as-Aligner methods align graph and text representations through the semantic knowledge of large language models. However, these methods usually assume that node texts provide sufficient and reliable supervision, an assumption often violated in real-world sparse TAGs. When textual anchors are missing, noisy, or uneven across domains, graph structures must be aligned with weak semantic evidence, leading to unreliable structure-semantics correspondence and sparsity-induced transfer bias. This paper presents S2Aligner, a sparsity-aware and structure-enhanced LLM-as-Aligner framework for graph-text pre-training on sparse TAGs. The key idea is to decouple semantic alignment from structural modeling, allowing topology-aware signals to enhance alignment without contaminating the shared semantic space. Specifically, S2Aligner decomposes graph-text representations into semantic and structural components, uses structure-oriented reconstruction with consistency control to inject reliable topology cues into text representations, and suppresses inconsistent structural signals under textual sparsity. Moreover, S2Aligner introduces sparsity-aware cross-domain risk balancing, which calibrates domain risks through a global-domain density ratio and downweights unreliable sparse samples via graph reliability estimation. Theoretical analysis shows that this objective reduces cross-domain generalization gaps by controlling domain risk discrepancy. Extensive experiments across diverse graph domains, sparsity levels, and downstream tasks demonstrate that S2Aligner consistently outperforms existing baselines.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents S2Aligner, a sparsity-aware and structure-enhanced LLM-as-Aligner framework for pre-training on sparse text-attributed graphs (TAGs). It decouples semantic alignment from structural modeling by decomposing graph-text representations into semantic and structural components, employs structure-oriented reconstruction with consistency control to inject topology cues into text representations, and introduces sparsity-aware cross-domain risk balancing that uses a global-domain density ratio and graph reliability estimation to downweight unreliable sparse samples. Theoretical analysis is claimed to show that this objective reduces cross-domain generalization gaps by controlling domain risk discrepancy. Extensive experiments across diverse graph domains, sparsity levels, and downstream tasks are reported to demonstrate consistent outperformance over existing baselines.
Significance. If the decomposition into independent components holds without leakage and the theoretical bound is non-circular, the work could meaningfully advance transferable graph foundation models for real-world sparse TAGs where textual supervision is missing or noisy. It directly targets sparsity-induced transfer bias and cross-domain generalization, which are practical bottlenecks in applications such as social networks and knowledge graphs.
major comments (2)
- [Abstract] Abstract (key idea paragraph): The central premise that representations can be cleanly decomposed into independent semantic and structural components such that structure-oriented reconstruction injects reliable topology cues 'without contaminating the shared semantic space' is load-bearing for both the theoretical bound and the empirical claims, yet no explicit mechanism (orthogonal projection, separate encoders, or invariance proof) is provided to guarantee separation under sparsity-induced noise and missing textual anchors.
- [Theoretical analysis] Theoretical analysis (sparsity-aware cross-domain risk balancing): The objective calibrates domain risks via a global-domain density ratio and graph reliability estimation; it is unclear whether these quantities are computed independently of the downstream evaluation data. If they are fitted to the same samples used for measuring generalization gaps, the claimed reduction in domain risk discrepancy becomes circular and does not constitute a valid bound.
minor comments (2)
- [Abstract] The abstract asserts outperformance and theoretical reduction in gaps but supplies no equations, implementation details, error bars, or rules for excluding sparse samples, which hinders immediate verification of the stated claims.
- [Experiments] In the experimental section, confirm that all reported results include standard error bars over multiple runs and explicitly state the criteria used to define and handle varying sparsity levels across domains.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback and for identifying key areas where additional clarity would strengthen the manuscript. We address each major comment below with point-by-point responses, providing explanations grounded in the current manuscript while noting where revisions will improve presentation.
read point-by-point responses
-
Referee: [Abstract] Abstract (key idea paragraph): The central premise that representations can be cleanly decomposed into independent semantic and structural components such that structure-oriented reconstruction injects reliable topology cues 'without contaminating the shared semantic space' is load-bearing for both the theoretical bound and the empirical claims, yet no explicit mechanism (orthogonal projection, separate encoders, or invariance proof) is provided to guarantee separation under sparsity-induced noise and missing textual anchors.
Authors: We thank the referee for this observation. The decomposition is realized in the manuscript through distinct modeling pathways detailed in Sections 3.1 and 3.2: semantic alignment is performed via LLM-driven contrastive objectives on available text attributes, while structural components are processed by a dedicated graph encoder operating on topology and node features independently of the semantic path. The structure-oriented reconstruction objective (Equation 4) combined with the consistency control term (Equation 6) and the sparsity-aware suppression mechanism explicitly down-weights or excludes structural signals that conflict with semantic evidence, thereby limiting leakage into the shared space. We acknowledge that an explicit statement of this separation (e.g., via separate encoders and consistency gating) is not highlighted in the abstract. In the revision we will add a concise clause to the abstract and a short clarifying paragraph in Section 3.2 that references the separate encoders and the role of consistency control under noise. revision: partial
-
Referee: [Theoretical analysis] Theoretical analysis (sparsity-aware cross-domain risk balancing): The objective calibrates domain risks via a global-domain density ratio and graph reliability estimation; it is unclear whether these quantities are computed independently of the downstream evaluation data. If they are fitted to the same samples used for measuring generalization gaps, the claimed reduction in domain risk discrepancy becomes circular and does not constitute a valid bound.
Authors: We appreciate the referee's concern about potential circularity. Both the global-domain density ratio and the graph reliability estimation are computed exclusively from the pre-training corpus: the density ratio uses aggregate statistics across source-domain training splits, and reliability scores are derived from intrinsic graph properties (degree distributions and textual sparsity) observed only during pre-training. These quantities are treated as fixed parameters when deriving the generalization bound in Theorem 1 (Section 4). No downstream evaluation data or test splits are used in their estimation. To eliminate any ambiguity, we will insert explicit statements in the revised Section 4 clarifying the data sources and independence from downstream tasks. revision: yes
Circularity Check
No significant circularity in derivation chain
full rationale
The paper defines a sparsity-aware objective that decouples semantic alignment from structural modeling via decomposition into components, structure-oriented reconstruction, and cross-domain risk balancing using density ratios and reliability estimation. The theoretical claim that this reduces generalization gaps by controlling domain risk discrepancy is presented as following from the objective definition. No equations, self-citations, or steps in the provided abstract and description reduce any prediction or bound to its inputs by construction, nor rename fitted quantities as independent results. The derivation remains self-contained with independent content from the proposed method.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Martin Arjovsky, Léon Bottou, Ishaan Gulrajani, and David Lopez-Paz. Invariant risk mini- mization.arXiv preprint arXiv:1907.02893, 2019
work page internal anchor Pith review Pith/arXiv arXiv 1907
-
[2]
ConGraT: Self-supervised contrastive pretraining for joint graph and text embeddings
William Brannon, Wonjune Kang, Suyash Fulay, Hang Jiang, Brandon Roy, Deb Roy, and Jad Kabbara. ConGraT: Self-supervised contrastive pretraining for joint graph and text embeddings. InProceedings of TextGraphs-17: Graph-based Methods for Natural Language Processing, pages 19–39, Bangkok, Thailand, August 2024. Association for Computational Linguistics
work page 2024
-
[3]
Multi-level graph convolutional networks for cross-platform anchor link prediction
Hongxu Chen, Hongzhi Yin, Xiangguo Sun, Tong Chen, Bogdan Gabrys, and Katarzyna Musial. Multi-level graph convolutional networks for cross-platform anchor link prediction. InProceedings of the 26th ACM SIGKDD international conference on knowledge discovery & data mining, pages 1503–1511, 2020
work page 2020
-
[4]
Brainnet: Epileptic wave detection from seeg with hierarchical graph diffusion learning
Junru Chen, Yang Yang, Tao Yu, Yingying Fan, Xiaolong Mo, and Carl Yang. Brainnet: Epileptic wave detection from seeg with hierarchical graph diffusion learning. InProceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining, pages 2741–2751, 2022
work page 2022
-
[5]
Llaga: Large language and graph assistant.arXiv preprint arXiv:2402.08170, 2024
Runjin Chen, Tong Zhao, Ajay Jaiswal, Neil Shah, and Zhangyang Wang. Llaga: Large language and graph assistant.arXiv preprint arXiv:2402.08170, 2024
-
[6]
Zhikai Chen, Haitao Mao, Jingzhe Liu, Yu Song, Bingheng Li, Wei Jin, Bahare Fatemi, Anton Tsitsulin, Bryan Perozzi, Hui Liu, et al. Text-space graph foundation models: Comprehensive benchmarks and new insights.Advances in Neural Information Processing Systems, 37:7464– 7492, 2024
work page 2024
-
[7]
Jiarui Feng, Hao Liu, Lecheng Kong, Mingfang Zhu, Yixin Chen, and Muhan Zhang. Taglas: An atlas of text-attributed graph datasets in the era of large graph and language models.arXiv preprint arXiv:2406.14683, 2024
-
[8]
Domain-adversarial training of neural networks
Yaroslav Ganin, Evgeniya Ustinova, Hana Ajakan, Pascal Germain, Hugo Larochelle, François Laviolette, Mario March, and Victor Lempitsky. Domain-adversarial training of neural networks. Journal of machine learning research, 17(59):1–35, 2016
work page 2016
-
[9]
Explanations as features: Llm-based features for text-attributed graphs.CoRR, abs/2305.19523, 2023
Xiaoxin He, Xavier Bresson, Thomas Laurent, and Bryan Hooi. Explanations as features: Llm-based features for text-attributed graphs.CoRR, abs/2305.19523, 2023
-
[10]
Graphmae: Self-supervised masked graph autoencoders
Zhenyu Hou, Xiao Liu, Yukuo Cen, Yuxiao Dong, Hongxia Yang, Chunjie Wang, and Jie Tang. Graphmae: Self-supervised masked graph autoencoders. InProceedings of the 28th ACM SIGKDD conference on knowledge discovery and data mining, pages 594–604, 2022
work page 2022
-
[11]
Open graph benchmark: Datasets for machine learning on graphs
Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. Advances in neural information processing systems, 33:22118–22133, 2020
work page 2020
-
[12]
Can gnn be good adapter for llms? InProceedings of the ACM web conference 2024, pages 893–904, 2024
Xuanwen Huang, Kaiqiao Han, Yang Yang, Dezheng Bao, Quanjin Tao, Ziwei Chai, and Qi Zhu. Can gnn be good adapter for llms? InProceedings of the ACM web conference 2024, pages 893–904, 2024. 10
work page 2024
-
[13]
Out-of-distribution generalization via risk ex- trapolation (rex)
David Krueger, Ethan Caballero, Joern-Henrik Jacobsen, Amy Zhang, Jonathan Binas, Dinghuai Zhang, Remi Le Priol, and Aaron Courville. Out-of-distribution generalization via risk ex- trapolation (rex). InInternational conference on machine learning, pages 5815–5826. PMLR, 2021
work page 2021
-
[14]
Haoyang Li, Ziwei Zhang, Xin Wang, and Wenwu Zhu. Learning invariant graph representations for out-of-distribution generalization.Advances in Neural Information Processing Systems, 35:11828–11841, 2022
work page 2022
-
[15]
Yichuan Li, Kaize Ding, and Kyumin Lee. GRENADE: Graph-centric language model for self-supervised representation learning on text-attributed graphs. In Houda Bouamor, Juan Pino, and Kalika Bali, editors,Findings of the Association for Computational Linguistics: EMNLP 2023, pages 2745–2757, Singapore, December 2023. Association for Computational Linguistics
work page 2023
-
[16]
Zerog: Investigating cross- dataset zero-shot transferability in graphs
Yuhan Li, Peisong Wang, Zhixun Li, Jeffrey Xu Yu, and Jia Li. Zerog: Investigating cross- dataset zero-shot transferability in graphs. InProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 1725–1735, 2024
work page 2024
-
[17]
Hao Liu, Jiarui Feng, Lecheng Kong, Ningyue Liang, Dacheng Tao, Yixin Chen, and Muhan Zhang. One for all: Towards training one graph model for all classification tasks.arXiv preprint arXiv:2310.00149, 2023
-
[18]
Jiawei Liu, Cheng Yang, Zhiyuan Lu, Junze Chen, Yibo Li, Mengmei Zhang, Ting Bai, Yuan Fang, Lichao Sun, Philip S. Yu, and Chuan Shi. Towards graph foundation models: A survey and beyond.CoRR, abs/2310.11829, 2023
-
[19]
Yuhang Liu, Minglai Shao, Zengyi Wo, Yunlong Chu, Bing Hao, Shengzhong Liu, Ruijie Wang, and Jianxin Li. Learning noise-resilient and transferable graph-text alignment via dynamic quality assessment.arXiv preprint arXiv:2510.19384, 2025
-
[20]
Learning transferable features with deep adaptation networks
Mingsheng Long, Yue Cao, Jianmin Wang, and Michael Jordan. Learning transferable features with deep adaptation networks. InInternational conference on machine learning, pages 97–105. PMLR, 2015
work page 2015
-
[21]
Decoupled Weight Decay Regularization
Ilya Loshchilov and Frank Hutter. Decoupled weight decay regularization.arXiv preprint arXiv:1711.05101, 2017
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[22]
Wiki-cs: A wikipedia-based benchmark for graph neural networks.arXiv preprint arXiv:2007.02901, 2020
Péter Mernyei and C˘at˘alina Cangea. Wiki-cs: A wikipedia-based benchmark for graph neural networks.arXiv preprint arXiv:2007.02901, 2020
-
[23]
Jonas Peters, Peter Bühlmann, and Nicolai Meinshausen. Causal inference by using invariant prediction: identification and confidence intervals.Journal of the Royal Statistical Society Series B: Statistical Methodology, 78(5):947–1012, 2016
work page 2016
-
[24]
Learning transferable visual models from natural language supervision
Alec Radford, Jong Wook Kim, Chris Hallacy, Aditya Ramesh, Gabriel Goh, Sandhini Agarwal, Girish Sastry, Amanda Askell, Pamela Mishkin, Jack Clark, et al. Learning transferable visual models from natural language supervision. InInternational conference on machine learning, pages 8748–8763. PmLR, 2021
work page 2021
-
[25]
Ladislav Rampášek, Michael Galkin, Vijay Prakash Dwivedi, Anh Tuan Luu, Guy Wolf, and Dominique Beaini. Recipe for a general, powerful, scalable graph transformer.Advances in Neural Information Processing Systems, 35:14501–14515, 2022
work page 2022
-
[26]
Sentence-bert: Sentence embeddings using siamese bert- networks
Nils Reimers and Iryna Gurevych. Sentence-bert: Sentence embeddings using siamese bert- networks. InProceedings of the 2019 conference on empirical methods in natural language processing and the 9th international joint conference on natural language processing (EMNLP- IJCNLP), pages 3982–3992, 2019
work page 2019
-
[27]
A survey of large lan- guage models for graphs
Xubin Ren, Jiabin Tang, Dawei Yin, Nitesh Chawla, and Chao Huang. A survey of large lan- guage models for graphs. InProceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 6616–6626, 2024. 11
work page 2024
-
[28]
Shiori Sagawa, Pang Wei Koh, Tatsunori B Hashimoto, and Percy Liang. Distributionally robust neural networks for group shifts: On the importance of regularization for worst-case generalization.arXiv preprint arXiv:1911.08731, 2019
work page internal anchor Pith review Pith/arXiv arXiv 1911
-
[29]
Collective classification in network data.AI magazine, 29(3):93–93, 2008
Prithviraj Sen, Galileo Namata, Mustafa Bilgic, Lise Getoor, Brian Galligher, and Tina Eliassi- Rad. Collective classification in network data.AI magazine, 29(3):93–93, 2008
work page 2008
-
[30]
Deep coral: Correlation alignment for deep domain adaptation
Baochen Sun and Kate Saenko. Deep coral: Correlation alignment for deep domain adaptation. InEuropean conference on computer vision, pages 443–450. Springer, 2016
work page 2016
-
[31]
Graphgpt: Graph instruction tuning for large language models
Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Lixin Su, Suqi Cheng, Dawei Yin, and Chao Huang. Graphgpt: Graph instruction tuning for large language models. InProceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 491–500, 2024
work page 2024
-
[32]
Bootstrapped representation learning on graphs
Shantanu Thakoor, Corentin Tallec, Mohammad Gheshlaghi Azar, Rémi Munos, Petar Veliˇckovi´c, and Michal Valko. Bootstrapped representation learning on graphs. InICLR 2021 workshop on geometrical and topological representation learning, pages 1–14. OpenReview. net, 2021
work page 2021
-
[33]
Visualizing data using t-sne.Journal of Machine Learning Research, 9(86):2579–2605, 2008
Laurens van der Maaten and Geoffrey Hinton. Visualizing data using t-sne.Journal of Machine Learning Research, 9(86):2579–2605, 2008
work page 2008
-
[34]
Deep graph infomax.stat, 1050:21, 2018
Petar Velickovic, William Fedus, William L Hamilton, Pietro Liò, Yoshua Bengio, and R Devon Hjelm. Deep graph infomax.stat, 1050:21, 2018
work page 2018
-
[35]
Text Embeddings by Weakly-Supervised Contrastive Pre-training
Liang Wang, Nan Yang, Xiaolong Huang, Binxing Jiao, Linjun Yang, Daxin Jiang, Rangan Majumder, and Furu Wei. Text embeddings by weakly-supervised contrastive pre-training. arXiv preprint arXiv:2212.03533, 2022
work page internal anchor Pith review Pith/arXiv arXiv 2022
-
[36]
Wenhui Wang, Furu Wei, Li Dong, Hangbo Bao, Nan Yang, and Ming Zhou. Minilm: Deep self-attention distillation for task-agnostic compression of pre-trained transformers.Advances in neural information processing systems, 33:5776–5788, 2020
work page 2020
-
[37]
Augmenting low-resource text classification with graph-grounded pre-training and prompting
Zhihao Wen and Yuan Fang. Augmenting low-resource text classification with graph-grounded pre-training and prompting. InProceedings of the 46th International ACM SIGIR Conference on Research and Development in Information Retrieval, pages 506–516, 2023
work page 2023
-
[38]
Gina Wong, Joshua Gleason, Rama Chellappa, Yoav Wald, and Anqi Liu. Weighted risk invari- ance: Domain generalization under invariant feature shift.arXiv preprint arXiv:2407.18428, 2024
-
[39]
Qitian Wu, Hengrui Zhang, Junchi Yan, and David Wipf. Handling distribution shifts on graphs: An invariance perspective.arXiv preprint arXiv:2202.02466, 2022
-
[40]
Hao Yan, Chaozhuo Li, Ruosong Long, Chao Yan, Jianan Zhao, Wenwen Zhuang, Jun Yin, Peiyan Zhang, Weihao Han, Hao Sun, et al. A comprehensive study on text-attributed graphs: Benchmarking and rethinking.Advances in Neural Information Processing Systems, 36:17238– 17264, 2023
work page 2023
-
[41]
An Yang, Anfeng Li, Baosong Yang, Beichen Zhang, Binyuan Hui, Bo Zheng, Bowen Yu, Chang Gao, Chengen Huang, Chenxu Lv, et al. Qwen3 technical report.arXiv preprint arXiv:2505.09388, 2025
work page internal anchor Pith review Pith/arXiv arXiv 2025
-
[42]
Deep graph contrastive representation learning.arXiv preprint arXiv:2006.04131, 2020
Yanqiao Zhu, Yichen Xu, Feng Yu, Qiang Liu, Shu Wu, and Liang Wang. Deep graph contrastive representation learning.arXiv preprint arXiv:2006.04131, 2020
-
[43]
Graphclip: Enhancing transferability in graph foundation models for text-attributed graphs
Yun Zhu, Haizhou Shi, Xiaotang Wang, Yongchao Liu, Yaoke Wang, Boci Peng, Chuntao Hong, and Siliang Tang. Graphclip: Enhancing transferability in graph foundation models for text-attributed graphs. InProceedings of the ACM on Web Conference 2025, pages 2183–2197, 2025. 12 A Appendix Overview The appendix is structured as follows: • Section B discusses lim...
work page 2025
-
[44]
Feature decomposition: inputs consist of invariantZand domain-specificV e
-
[45]
Conditional independence:Y⊥ ⊥V e |Z
-
[46]
Invariant label condition:p ei(Y|Z) =p ej(Y|Z)for alle i, ej ∈ E
-
[47]
Non-degenerate density:p e(Z)>0for alle∈ E. Proposition 1If the assumptions hold and the predictor f depends only on Z, then the weighted risks are equal across all domains. Proof.The weighted risk for domaineis: Re = ZZZ YZV e pe(Z, Ve, Y)·ℓ(f(Z), Y)·r e(Z) dVedZdY. Marginalizing outV e: Re = ZZ YZ ℓ(f(Z), Y)·r e(Z)·p e(Z, Y) dZdY. Substitutingp e(Z, Y) ...
work page 2023
-
[48]
Text-only Language Models:These approaches independently process the raw sentences of each entity while completely ignoring topological connections. • SBERT[ 26]: We incorporate standard sentence embedding architectures to produce dense seman- tic representations, specifically evaluating theall-MiniLM-L6-v2andmulti-qa-distilbert-cos-v1 variants. • Qwen3 (...
-
[49]
Text-Attributed Graph (TAG) & LLM-based Methods:These recent strategies strive to fuse structural patterns with the semantic comprehension capabilities of language models. • GraphGPT[ 31]: This architecture maps topological properties into discrete tokens and employs a dual-stage instruction fine-tuning process to synchronize GNN outputs with an LLM’s sem...
-
[50]
Graph Self-Supervised Learning (SSL) Models:Focusing primarily on topology and dense features, these methodologies leverage traditional graph neural networks. • DGI[ 34]: A foundational self-supervised strategy that maximizes mutual information by distin- guishing authentic node-graph representations from artificially corrupted counterparts. 16 • GRACE[ 4...
-
[51]
• GraphCLIP[ 43]: This framework relies heavily on contrastive alignment objectives
State-of-the-Art Graph-Text Aligners:Serving as our primary zero-shot competitors, these methods focus explicitly on synchronizing semantic and structural spaces. • GraphCLIP[ 43]: This framework relies heavily on contrastive alignment objectives. It synthesizes subgraph summaries to align embedding spaces, providing strong zero-shot graph–text alignment ...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.