GraphReAct: Reasoning and Acting for Multi-step Graph Inference

Chang Zhou; Hong Cheng; Renhe Jiang; Xikun Zhang; Xingtong Yu; Xinming Zhang; Xuanting Xie; Yuan Fang; Zhongwei Kuai

arxiv: 2605.07357 · v2 · submitted 2026-05-08 · 💻 cs.AI

GraphReAct: Reasoning and Acting for Multi-step Graph Inference

Xingtong Yu , Zhongwei Kuai , Chang Zhou , Xuanting Xie , Renhe Jiang , Xikun Zhang , Hong Cheng , Xinming Zhang

show 1 more author

Yuan Fang

This is my paper

Pith reviewed 2026-05-12 04:15 UTC · model grok-4.3

classification 💻 cs.AI

keywords graph reasoningreasoning-actinglarge language modelsmulti-step inferencetopological retrievalsemantic retrievalcontext refinementgraph learning

0 comments

The pith

GraphReAct enables large language models to perform multi-step inference on graph data by interleaving reasoning with topological retrieval, semantic retrieval, and context refinement actions.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces a framework called GraphReAct that adapts reasoning-acting methods from general language model use to graphs, where information sits in node connections and hidden representations rather than plain text. It defines three specific actions: one that pulls local structural links, one that finds relevant evidence through semantic similarity, and one that distills the growing context into a tighter form. This setup lets the model expand its view step by step and then compress it, addressing the distributed nature of graph information. A sympathetic reader would care because many practical problems involve networks or relational data, and a single general method could avoid building separate models for each domain. Experiments across six benchmarks show consistent gains over prior approaches.

Core claim

GraphReAct designs a graph-based action space with topological retrieval to capture local structural dependencies, semantic retrieval to access non-local relevant evidence, and context refinement to distill accumulated information into compact representations. By interleaving these actions with reasoning steps, the framework supports progressive transitions from context expansion to compression during multi-step inference over graph-structured data.

What carries the argument

Graph-based action space of topological retrieval, semantic retrieval, and context refinement actions that expand then compress reasoning context.

If this is right

The approach yields higher accuracy than prior graph learning methods on six standard benchmarks.
Multi-step inference benefits from dynamic context expansion followed by compression.
Reasoning-acting can be extended to structured data beyond plain text by adding graph-specific actions.
Progressive refinement supports longer inference chains without context overload.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same action pattern might transfer to other relational tasks such as knowledge base querying or molecular property prediction.
It points toward hybrid systems that combine LLM flexibility with explicit graph traversal rules.
Scalability tests on graphs much larger than the benchmarks would clarify whether repeated retrieval steps remain efficient.
Removing the need for task-specific fine-tuning could simplify deployment across different network datasets.

Load-bearing premise

The language model can execute the retrieval and refinement actions reliably without adding excessive noise or needing dataset-specific tuning.

What would settle it

Run the same six benchmark evaluations after disabling the context refinement action or injecting noise into the retrieval outputs and check whether outperformance over baselines disappears.

Figures

Figures reproduced from arXiv: 2605.07357 by Chang Zhou, Hong Cheng, Renhe Jiang, Xikun Zhang, Xingtong Yu, Xinming Zhang, Xuanting Xie, Yuan Fang, Zhongwei Kuai.

**Figure 2.** Figure 2: Overall framework of GRAPHREACT. an input graph, it first produces representations HV = GE(X, G; Θ), and then maps them into the token embedding space of the LLM via a projection function: Htok = Proj(H; ϕ), (3) where ϕ is the learnable parameter. The graph embedding tokens Htok share the same embedding dimension as LLM tokens and are incorporated into an instruction template for the LLM. 4 Proposed Approa… view at source ↗

**Figure 3.** Figure 3: Impact of inference steps in GRAPHREACT [PITH_FULL_IMAGE:figures/full_fig_p009_3.png] view at source ↗

**Figure 5.** Figure 5: Numbers of Wiki entities for Search. G Effect of the Number of Retrieved Entities in Textual Search We further analyze the impact of the number of retrieved Wikipedia entities S in the text-based Search action, as shown in [PITH_FULL_IMAGE:figures/full_fig_p017_5.png] view at source ↗

read the original abstract

Reasoning-acting frameworks enhance large language models (LLMs) by interleaving reasoning with actions for dynamic information acquisition. However, extending this paradigm to graph learning remains underexplored. Graph data is inherently structured, with information distributed across nodes and edges and encoded through both topology and latent representations. As a result, effective reasoning over graphs requires not only retrieving informative evidence from the graph, but also progressively refining the accumulated context during multi-step inference. In this work, we propose GraphReAct, a graph reasoning-acting framework that enables step-by-step inference over graph-structured data. Specifically, we design a graph-based action space with two complementary retrieval actions: topological retrieval, which captures local structural dependencies, and semantic retrieval, which accesses non-local but relevant evidence in the representation space. These actions dynamically expand the reasoning context. To further support multi-step reasoning, we introduce another type of action, context refinement, which distills and reorganizes accumulated information into a compact representation. By interleaving reasoning with both retrieval and refinement actions, our framework enables a progressive transition from context expansion to compression. Extensive experiments on six benchmark datasets demonstrate that GraphReAct consistently outperforms state-of-the-art methods, validating the effectiveness of reasoning-acting for graph learning.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The paper introduces GraphReAct, a reasoning-acting framework for multi-step inference over graph-structured data. It defines a graph-based action space with topological retrieval (local structural dependencies), semantic retrieval (non-local relevant evidence via embeddings), and context refinement (distilling accumulated information into compact representations). These actions are interleaved with reasoning steps to enable progressive context expansion followed by compression. The central claim is that this yields consistent outperformance over state-of-the-art methods on six benchmark datasets, validating the reasoning-acting paradigm for graph learning.

Significance. If the empirical claims hold with proper controls, the work would be significant for bridging LLM-based reasoning-acting frameworks with graph data, offering a parameter-free way to handle structured information through dynamic retrieval and refinement without model fine-tuning. It addresses an underexplored extension of ReAct-style methods to graphs and could influence hybrid LLM-graph systems.

major comments (2)

[Abstract and Experiments] Abstract and experimental results section: The claim that GraphReAct 'consistently outperforms state-of-the-art methods' on six benchmarks is presented without any details on the specific baselines used, evaluation metrics, statistical significance tests, error bars, or experimental controls (e.g., prompt variations or retrieval accuracy). This directly undermines assessment of the central empirical support, as the reported gains could stem from better prompting rather than the proposed action design.
[Method] Method section (action execution): The framework assumes the base LLM can reliably execute topological retrieval (exact neighbor lists), semantic retrieval (embedding-based non-local nodes), and context refinement (lossless compression of structural invariants) without introducing unquantified noise or requiring dataset-specific tuning. No quantitative diagnostics (e.g., retrieval precision/recall or ablation on action fidelity) are provided to validate this assumption, which is load-bearing for multi-step trajectories where errors compound.

minor comments (1)

[Abstract and Method] The abstract and method descriptions use terms like 'progressive transition from context expansion to compression' without defining how refinement is prompted or measured for fidelity.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their insightful comments, which have helped us identify areas for improvement in our manuscript. We address each major comment below and indicate the revisions we will make to strengthen the paper.

read point-by-point responses

Referee: [Abstract and Experiments] Abstract and experimental results section: The claim that GraphReAct 'consistently outperforms state-of-the-art methods' on six benchmarks is presented without any details on the specific baselines used, evaluation metrics, statistical significance tests, error bars, or experimental controls (e.g., prompt variations or retrieval accuracy). This directly undermines assessment of the central empirical support, as the reported gains could stem from better prompting rather than the proposed action design.

Authors: We agree that providing more details would strengthen the presentation of our empirical results. The abstract is intentionally high-level, but we will revise it to include a brief overview of the baselines and metrics used. In the experiments section, we will add error bars, report statistical significance, and include additional controls for prompt variations and retrieval accuracy to demonstrate that the performance gains stem from the proposed graph-based action space rather than prompting differences. revision: yes
Referee: [Method] Method section (action execution): The framework assumes the base LLM can reliably execute topological retrieval (exact neighbor lists), semantic retrieval (embedding-based non-local nodes), and context refinement (lossless compression of structural invariants) without introducing unquantified noise or requiring dataset-specific tuning. No quantitative diagnostics (e.g., retrieval precision/recall or ablation on action fidelity) are provided to validate this assumption, which is load-bearing for multi-step trajectories where errors compound.

Authors: We appreciate this point on validating the core assumptions of our framework. The method relies on the LLM's ability to perform these actions as instructed, but we recognize the need for empirical validation of action reliability. We will add quantitative diagnostics, including precision and recall for semantic and topological retrieval, as well as an ablation on context refinement fidelity. These additions will be included in a new subsection or appendix to show that action execution errors are minimal and do not significantly compound in multi-step inference. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical framework proposal with no derivations or self-referential reductions

full rationale

The paper introduces GraphReAct as a new action space (topological retrieval, semantic retrieval, context refinement) interleaved with LLM reasoning for graph inference. The central claim rests on experimental outperformance across six benchmarks rather than any first-principles derivation or prediction. No equations, fitted parameters renamed as predictions, self-citation load-bearing uniqueness theorems, or ansatz smuggling appear in the provided text. The design choices are presented as novel contributions justified by empirical results, not by reduction to prior inputs or self-citations. This is a standard non-circular empirical ML framework paper.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are stated. The framework description implies standard LLM capabilities and graph embedding assumptions but does not introduce new fitted constants or postulates.

pith-pipeline@v0.9.0 · 5541 in / 1145 out tokens · 51196 ms · 2026-05-12T04:15:27.787394+00:00 · methodology

Review history (2 revisions) →

discussion (0)

Reference graph

Works this paper leans on

49 extracted references · 49 canonical work pages

[1]

Chain-of-thought prompting elicits reasoning in large language models

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V Le, Denny Zhou, et al. Chain-of-thought prompting elicits reasoning in large language models. In NeurIPS, volume 35, pages 24824–24837, 2022

work page 2022
[2]

Towards revealing the mystery behind chain of thought: a theoretical perspective

Guhao Feng, Bohang Zhang, Yuntian Gu, Haotian Ye, Di He, and Liwei Wang. Towards revealing the mystery behind chain of thought: a theoretical perspective. volume 36, pages 70757–70798, 2023

work page 2023
[3]

Automatic chain of thought prompting in large language models

Zhuosheng Zhang, Aston Zhang, Mu Li, and Alex Smola. Automatic chain of thought prompting in large language models. InICLR, 2023

work page 2023
[4]

React: Synergizing reasoning and acting in language models

Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik R Narasimhan, and Yuan Cao. React: Synergizing reasoning and acting in language models. InICLR, 2022

work page 2022
[5]

Satori: Reinforcement learning with chain-of-action-thought enhances llm reasoning via autoregressive search

Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory W Wornell, Subhro Das, David Daniel Cox, and Chuang Gan. Satori: Reinforcement learning with chain-of-action-thought enhances llm reasoning via autoregressive search. In ICML, 2025

work page 2025
[6]

Preact: Prediction enhances agent’s planning ability

Dayuan Fu, Jianzhao Huang, Siyuan Lu, Guanting Dong, Yejie Wang, Keqing He, and Weiran Xu. Preact: Prediction enhances agent’s planning ability. InACL, pages 1–16, 2025

work page 2025
[7]

Graph learning: A survey.IEEE Transactions on Artificial Intelligence, 2(2):109–127, 2021

Feng Xia, Ke Sun, Shuo Yu, Abdul Aziz, Liangtian Wan, Shirui Pan, and Huan Liu. Graph learning: A survey.IEEE Transactions on Artificial Intelligence, 2(2):109–127, 2021

work page 2021
[8]

John Wiley & Sons, 2006

Diane J Cook and Lawrence B Holder.Mining graph data. John Wiley & Sons, 2006

work page 2006
[9]

Semi-supervised classification with graph convolutional networks

Thomas N Kipf and Max Welling. Semi-supervised classification with graph convolutional networks. InICLR, 2017

work page 2017
[10]

Graph attention networks

Petar Veliˇckovi´c, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. Graph attention networks. InICLR, 2018

work page 2018
[11]

How powerful are graph neural networks? InICLR, 2019

Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. How powerful are graph neural networks? InICLR, 2019

work page 2019
[12]

Inductive representation learning on large graphs

Will Hamilton, Zhitao Ying, and Jure Leskovec. Inductive representation learning on large graphs. InNeurIPS, 2017

work page 2017
[13]

Graph contrastive learning with augmentations

Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, and Yang Shen. Graph contrastive learning with augmentations. InNeurIPS, volume 33, pages 5812–5823, 2020

work page 2020
[14]

Deep graph infomax

Petar Veliˇckovi´c, William Fedus, William L Hamilton, Pietro Liò, Yoshua Bengio, and R Devon Hjelm. Deep graph infomax. InICLR, 2018

work page 2018
[15]

GraphPrompt: Unifying pre-training and downstream tasks for graph neural networks

Zemin Liu, Xingtong Yu, Yuan Fang, and Xinming Zhang. GraphPrompt: Unifying pre-training and downstream tasks for graph neural networks. InWWW, pages 417–428, 2023

work page 2023
[16]

Generalized graph prompt: Toward a unification of pre-training and downstream tasks on graphs.IEEE TKDE, 36(11):6237– 6250, 2023

Xingtong Yu, Zhenghao Liu, Yuan Fang, Zemin Liu, Sihong Chen, and Xinming Zhang. Generalized graph prompt: Toward a unification of pre-training and downstream tasks on graphs.IEEE TKDE, 36(11):6237– 6250, 2023

work page 2023
[17]

Xingtong Yu, Chang Zhou, Zhongwei Kuai, Xinming Zhang, and Yuan Fang

Xingtong Yu, Chang Zhou, Zhongwei Kuai, Xinming Zhang, and Yuan Fang. GCoT: Chain-of- thought prompt learning for graphs.arXiv preprint arXiv:2502.08092, 2025

work page arXiv 2025
[18]

Gft: Graph foundation model with transferable tree vocabulary.NeurIPS, 37:107403–107443, 2024

Zehong Wang, Zheyuan Zhang, Nitesh V Chawla, Chuxu Zhang, and Yanfang Ye. Gft: Graph foundation model with transferable tree vocabulary.NeurIPS, 37:107403–107443, 2024

work page 2024
[19]

Unigraph2: Learning a unified embedding space to bind multimodal graphs

Yufei He, Yuan Sui, Xiaoxin He, Yue Liu, Yifei Sun, and Bryan Hooi. Unigraph2: Learning a unified embedding space to bind multimodal graphs. InWWW 2025, pages 1759–1770, 2025

work page 2025
[20]

One for all: Towards training one graph model for all classification tasks

Hao Liu, Jiarui Feng, Lecheng Kong, Ningyue Liang, Dacheng Tao, Yixin Chen, and Muhan Zhang. One for all: Towards training one graph model for all classification tasks. InICLR, 2024. 10

work page 2024
[21]

Graver: Generative graph vocabularies for robust graph foundation models fine-tuning

Haonan Yuan, Qingyun Sun, Junhua Shi, Xingcheng Fu, Bryan Hooi, Jianxin Li, and Philip S Yu. Graver: Generative graph vocabularies for robust graph foundation models fine-tuning. NeurIPS, 2025

work page 2025
[22]

Graphgpt: Graph instruction tuning for large language models

Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Lixin Su, Suqi Cheng, Dawei Yin, and Chao Huang. Graphgpt: Graph instruction tuning for large language models. InSIGIR, pages 491–500, 2024

work page 2024
[23]

Higpt: Heterogeneous graph language model

Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Long Xia, Dawei Yin, and Chao Huang. Higpt: Heterogeneous graph language model. InSIGKDD, pages 2842–2853, 2024

work page 2024
[24]

Llaga: Large language and graph assistant.ICML, 2024

Runjin Chen, Tong Zhao, Ajay Jaiswal, Neil Shah, and Zhangyang Wang. Llaga: Large language and graph assistant.ICML, 2024

work page 2024
[25]

Least-to-most prompting enables complex reasoning in large language models.ICLR, 2023

Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, et al. Least-to-most prompting enables complex reasoning in large language models.ICLR, 2023

work page 2023
[26]

Self-consistency improves chain of thought reasoning in language models.ICLR, 2023

Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, and Denny Zhou. Self-consistency improves chain of thought reasoning in language models.ICLR, 2023

work page 2023
[27]

Tree of thoughts: Deliberate problem solving with large language models.NeurIPS, 36:11809–11822, 2023

Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Tom Griffiths, Yuan Cao, and Karthik Narasimhan. Tree of thoughts: Deliberate problem solving with large language models.NeurIPS, 36:11809–11822, 2023

work page 2023
[28]

Llms as zero-shot graph learners: Alignment of gnn representations with llm token embeddings.NeurIPS, 37:5950–5973, 2024

Duo Wang, Yuan Zuo, Fengzhi Li, and Junjie Wu. Llms as zero-shot graph learners: Alignment of gnn representations with llm token embeddings.NeurIPS, 37:5950–5973, 2024

work page 2024
[29]

Breadth-first search

Alan Bundy and Lincoln Wallen. Breadth-first search. InCatalogue of artificial intelligence tools, pages 13–13. 1984

work page 1984
[30]

Learning similarity with cosine similarity ensemble

Peipei Xia, Li Zhang, and Fanzhang Li. Learning similarity with cosine similarity ensemble. Information sciences, 307:39–52, 2015

work page 2015
[31]

Is homophily a necessity for graph neural networks? InICLR, 2022

Yao Ma, Xiaorui Liu, Neil Shah, and Jiliang Tang. Is homophily a necessity for graph neural networks? InICLR, 2022

work page 2022
[32]

Revisiting heterophily for graph neural networks.NeurIPS, pages 1362–1375, 2022

Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Mingde Zhao, Shuyuan Zhang, Xiao-Wen Chang, and Doina Precup. Revisiting heterophily for graph neural networks.NeurIPS, pages 1362–1375, 2022

work page 2022
[33]

Open graph benchmark: Datasets for machine learning on graphs

Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. InNeurIPS, 2020

work page 2020
[34]

Harnessing explanations: Llm-to-lm interpreter for enhanced text-attributed graph representation learning

Xiaoxin He, Xavier Bresson, Thomas Laurent, Adam Perold, Yann LeCun, and Bryan Hooi. Harnessing explanations: Llm-to-lm interpreter for enhanced text-attributed graph representation learning. InICLR, 2024

work page 2024
[35]

Augmenting low-resource text classification with graph-grounded pre-training and prompting

Zhihao Wen and Yuan Fang. Augmenting low-resource text classification with graph-grounded pre-training and prompting. InSIGIR, 2023

work page 2023
[36]

A comprehensive study on text-attributed graphs: Benchmarking and rethinking.NeurIPS, 36:17238–17264, 2023

Hao Yan, Chaozhuo Li, Ruosong Long, Chao Yan, Jianan Zhao, Wenwen Zhuang, Jun Yin, Peiyan Zhang, Weihao Han, Hao Sun, et al. A comprehensive study on text-attributed graphs: Benchmarking and rethinking.NeurIPS, 36:17238–17264, 2023

work page 2023
[37]

Multilayer perceptron (mlp)

Hind Taud and Jean-Franccois Mas. Multilayer perceptron (mlp). InGeomatic approaches for modeling land change scenarios, pages 451–455. 2017

work page 2017
[38]

Nodeformer: A scalable graph structure learning transformer for node classification.NeurIPS, 35:27387–27401, 2022

Qitian Wu, Wentao Zhao, Zenan Li, David P Wipf, and Junchi Yan. Nodeformer: A scalable graph structure learning transformer for node classification.NeurIPS, 35:27387–27401, 2022

work page 2022
[39]

DIFFormer: Scalable (graph) transformers induced by energy constrained diffusion

Qitian Wu, Chenxiao Yang, Wentao Zhao, Yixuan He, David Wipf, and Junchi Yan. DIFFormer: Scalable (graph) transformers induced by energy constrained diffusion. InInternational Confer- ence on Learning Representations, 2023. 11

work page 2023
[40]

Geometric knowledge distillation: Topology compression for graph neural networks.NeurIPS, 35:29761–29775, 2022

Chenxiao Yang, Qitian Wu, and Junchi Yan. Geometric knowledge distillation: Topology compression for graph neural networks.NeurIPS, 35:29761–29775, 2022

work page 2022
[41]

Graph-less neural networks: Teaching old mlps new tricks via distillation

Shichang Zhang, Yozen Liu, Yizhou Sun, and Neil Shah. Graph-less neural networks: Teaching old mlps new tricks via distillation. InICLR, 2022

work page 2022
[42]

This subgraph xxx, so the category might be xxx

Wei-Lin Chiang, Zhuohan Li, Ziqing Lin, Ying Sheng, Zhanghao Wu, Hao Zhang, Lianmin Zheng, Siyuan Zhuang, Yonghao Zhuang, Joseph E Gonzalez, et al. Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality.See https://vicuna. lmsys. org (accessed 14 April 2023), 2(3):6, 2023. 12 A Alogrithm We summarize the overall procedure of GRAPHREACTi...

work page 2023
[43]

Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences

Text-based inference: Analyze the titles and descriptions of all neighbors as a whole. Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences. Give more weight to recurring elements shared across neighbors, like magical creatures, vehicle types, specific emotions,...

work page
[46]

This subgraph xxx

Output constraint: Output exactly ONE sentence. Use the fixed format exactly as specified. Do not add explanations, bullet points, or extra text. The phrase "This subgraph xxx" should concisely describe the overall thematic focus of the children’s books in the subgraph. Here are correct examples: - "This subgraph features stories about fairies and magical...

work page
[47]

Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences

Text-based inference: Analyze the titles and descriptions of all nodes as a whole. Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences. Give more weight to recurring elements shared across the node set, like magical creatures, vehicle types, specific emotions, ...

work page
[48]

has written over one hundred books

Noise filtering: Ignore non-informative promotional content, including but not limited to: bestseller status, author biography details (e.g., "has written over one hundred books"), series accolades, review quotes, and generic publisher blurbs. Focus only on information that reflects the book’s core story, subject matter, characters, or intended educationa...

work page
[49]

Category constraint: The final category MUST be chosen from the following list and cannot be invented or modified: {categories}

work page
[50]

This node set xxx

Output constraint: Output exactly ONE sentence. Use the fixed format exactly as specified. Do not add explanations, bullet points, or extra text. The phrase "This node set xxx" should concisely describe the overall thematic focus of the children’s books in the set. Here are correct examples: - "This node set features stories about fairies and magical adve...

work page
[51]

• Depending on the country in which research is conducted, IRB approval (or equivalent) may be required for any human subjects research

Institutional review board (IRB) approvals or equivalent for research with human subjects Question: Does the paper describe potential risks incurred by study participants, whether such risks were disclosed to the subjects, and whether Institutional Review Board (IRB) approvals (or an equivalent approval/review based on the requirements of your country or ...

work page

[1] [1]

Chain-of-thought prompting elicits reasoning in large language models

Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Fei Xia, Ed Chi, Quoc V Le, Denny Zhou, et al. Chain-of-thought prompting elicits reasoning in large language models. In NeurIPS, volume 35, pages 24824–24837, 2022

work page 2022

[2] [2]

Towards revealing the mystery behind chain of thought: a theoretical perspective

Guhao Feng, Bohang Zhang, Yuntian Gu, Haotian Ye, Di He, and Liwei Wang. Towards revealing the mystery behind chain of thought: a theoretical perspective. volume 36, pages 70757–70798, 2023

work page 2023

[3] [3]

Automatic chain of thought prompting in large language models

Zhuosheng Zhang, Aston Zhang, Mu Li, and Alex Smola. Automatic chain of thought prompting in large language models. InICLR, 2023

work page 2023

[4] [4]

React: Synergizing reasoning and acting in language models

Shunyu Yao, Jeffrey Zhao, Dian Yu, Nan Du, Izhak Shafran, Karthik R Narasimhan, and Yuan Cao. React: Synergizing reasoning and acting in language models. InICLR, 2022

work page 2022

[5] [5]

Satori: Reinforcement learning with chain-of-action-thought enhances llm reasoning via autoregressive search

Maohao Shen, Guangtao Zeng, Zhenting Qi, Zhang-Wei Hong, Zhenfang Chen, Wei Lu, Gregory W Wornell, Subhro Das, David Daniel Cox, and Chuang Gan. Satori: Reinforcement learning with chain-of-action-thought enhances llm reasoning via autoregressive search. In ICML, 2025

work page 2025

[6] [6]

Preact: Prediction enhances agent’s planning ability

Dayuan Fu, Jianzhao Huang, Siyuan Lu, Guanting Dong, Yejie Wang, Keqing He, and Weiran Xu. Preact: Prediction enhances agent’s planning ability. InACL, pages 1–16, 2025

work page 2025

[7] [7]

Graph learning: A survey.IEEE Transactions on Artificial Intelligence, 2(2):109–127, 2021

Feng Xia, Ke Sun, Shuo Yu, Abdul Aziz, Liangtian Wan, Shirui Pan, and Huan Liu. Graph learning: A survey.IEEE Transactions on Artificial Intelligence, 2(2):109–127, 2021

work page 2021

[8] [8]

John Wiley & Sons, 2006

Diane J Cook and Lawrence B Holder.Mining graph data. John Wiley & Sons, 2006

work page 2006

[9] [9]

Semi-supervised classification with graph convolutional networks

Thomas N Kipf and Max Welling. Semi-supervised classification with graph convolutional networks. InICLR, 2017

work page 2017

[10] [10]

Graph attention networks

Petar Veliˇckovi´c, Guillem Cucurull, Arantxa Casanova, Adriana Romero, Pietro Lio, and Yoshua Bengio. Graph attention networks. InICLR, 2018

work page 2018

[11] [11]

How powerful are graph neural networks? InICLR, 2019

Keyulu Xu, Weihua Hu, Jure Leskovec, and Stefanie Jegelka. How powerful are graph neural networks? InICLR, 2019

work page 2019

[12] [12]

Inductive representation learning on large graphs

Will Hamilton, Zhitao Ying, and Jure Leskovec. Inductive representation learning on large graphs. InNeurIPS, 2017

work page 2017

[13] [13]

Graph contrastive learning with augmentations

Yuning You, Tianlong Chen, Yongduo Sui, Ting Chen, Zhangyang Wang, and Yang Shen. Graph contrastive learning with augmentations. InNeurIPS, volume 33, pages 5812–5823, 2020

work page 2020

[14] [14]

Deep graph infomax

Petar Veliˇckovi´c, William Fedus, William L Hamilton, Pietro Liò, Yoshua Bengio, and R Devon Hjelm. Deep graph infomax. InICLR, 2018

work page 2018

[15] [15]

GraphPrompt: Unifying pre-training and downstream tasks for graph neural networks

Zemin Liu, Xingtong Yu, Yuan Fang, and Xinming Zhang. GraphPrompt: Unifying pre-training and downstream tasks for graph neural networks. InWWW, pages 417–428, 2023

work page 2023

[16] [16]

Generalized graph prompt: Toward a unification of pre-training and downstream tasks on graphs.IEEE TKDE, 36(11):6237– 6250, 2023

Xingtong Yu, Zhenghao Liu, Yuan Fang, Zemin Liu, Sihong Chen, and Xinming Zhang. Generalized graph prompt: Toward a unification of pre-training and downstream tasks on graphs.IEEE TKDE, 36(11):6237– 6250, 2023

work page 2023

[17] [17]

Xingtong Yu, Chang Zhou, Zhongwei Kuai, Xinming Zhang, and Yuan Fang

Xingtong Yu, Chang Zhou, Zhongwei Kuai, Xinming Zhang, and Yuan Fang. GCoT: Chain-of- thought prompt learning for graphs.arXiv preprint arXiv:2502.08092, 2025

work page arXiv 2025

[18] [18]

Gft: Graph foundation model with transferable tree vocabulary.NeurIPS, 37:107403–107443, 2024

Zehong Wang, Zheyuan Zhang, Nitesh V Chawla, Chuxu Zhang, and Yanfang Ye. Gft: Graph foundation model with transferable tree vocabulary.NeurIPS, 37:107403–107443, 2024

work page 2024

[19] [19]

Unigraph2: Learning a unified embedding space to bind multimodal graphs

Yufei He, Yuan Sui, Xiaoxin He, Yue Liu, Yifei Sun, and Bryan Hooi. Unigraph2: Learning a unified embedding space to bind multimodal graphs. InWWW 2025, pages 1759–1770, 2025

work page 2025

[20] [20]

One for all: Towards training one graph model for all classification tasks

Hao Liu, Jiarui Feng, Lecheng Kong, Ningyue Liang, Dacheng Tao, Yixin Chen, and Muhan Zhang. One for all: Towards training one graph model for all classification tasks. InICLR, 2024. 10

work page 2024

[21] [21]

Graver: Generative graph vocabularies for robust graph foundation models fine-tuning

Haonan Yuan, Qingyun Sun, Junhua Shi, Xingcheng Fu, Bryan Hooi, Jianxin Li, and Philip S Yu. Graver: Generative graph vocabularies for robust graph foundation models fine-tuning. NeurIPS, 2025

work page 2025

[22] [22]

Graphgpt: Graph instruction tuning for large language models

Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Lixin Su, Suqi Cheng, Dawei Yin, and Chao Huang. Graphgpt: Graph instruction tuning for large language models. InSIGIR, pages 491–500, 2024

work page 2024

[23] [23]

Higpt: Heterogeneous graph language model

Jiabin Tang, Yuhao Yang, Wei Wei, Lei Shi, Long Xia, Dawei Yin, and Chao Huang. Higpt: Heterogeneous graph language model. InSIGKDD, pages 2842–2853, 2024

work page 2024

[24] [24]

Llaga: Large language and graph assistant.ICML, 2024

Runjin Chen, Tong Zhao, Ajay Jaiswal, Neil Shah, and Zhangyang Wang. Llaga: Large language and graph assistant.ICML, 2024

work page 2024

[25] [25]

Least-to-most prompting enables complex reasoning in large language models.ICLR, 2023

Denny Zhou, Nathanael Schärli, Le Hou, Jason Wei, Nathan Scales, Xuezhi Wang, Dale Schuurmans, Claire Cui, Olivier Bousquet, Quoc Le, et al. Least-to-most prompting enables complex reasoning in large language models.ICLR, 2023

work page 2023

[26] [26]

Self-consistency improves chain of thought reasoning in language models.ICLR, 2023

Xuezhi Wang, Jason Wei, Dale Schuurmans, Quoc Le, Ed Chi, Sharan Narang, Aakanksha Chowdhery, and Denny Zhou. Self-consistency improves chain of thought reasoning in language models.ICLR, 2023

work page 2023

[27] [27]

Tree of thoughts: Deliberate problem solving with large language models.NeurIPS, 36:11809–11822, 2023

Shunyu Yao, Dian Yu, Jeffrey Zhao, Izhak Shafran, Tom Griffiths, Yuan Cao, and Karthik Narasimhan. Tree of thoughts: Deliberate problem solving with large language models.NeurIPS, 36:11809–11822, 2023

work page 2023

[28] [28]

Llms as zero-shot graph learners: Alignment of gnn representations with llm token embeddings.NeurIPS, 37:5950–5973, 2024

Duo Wang, Yuan Zuo, Fengzhi Li, and Junjie Wu. Llms as zero-shot graph learners: Alignment of gnn representations with llm token embeddings.NeurIPS, 37:5950–5973, 2024

work page 2024

[29] [29]

Breadth-first search

Alan Bundy and Lincoln Wallen. Breadth-first search. InCatalogue of artificial intelligence tools, pages 13–13. 1984

work page 1984

[30] [30]

Learning similarity with cosine similarity ensemble

Peipei Xia, Li Zhang, and Fanzhang Li. Learning similarity with cosine similarity ensemble. Information sciences, 307:39–52, 2015

work page 2015

[31] [31]

Is homophily a necessity for graph neural networks? InICLR, 2022

Yao Ma, Xiaorui Liu, Neil Shah, and Jiliang Tang. Is homophily a necessity for graph neural networks? InICLR, 2022

work page 2022

[32] [32]

Revisiting heterophily for graph neural networks.NeurIPS, pages 1362–1375, 2022

Sitao Luan, Chenqing Hua, Qincheng Lu, Jiaqi Zhu, Mingde Zhao, Shuyuan Zhang, Xiao-Wen Chang, and Doina Precup. Revisiting heterophily for graph neural networks.NeurIPS, pages 1362–1375, 2022

work page 2022

[33] [33]

Open graph benchmark: Datasets for machine learning on graphs

Weihua Hu, Matthias Fey, Marinka Zitnik, Yuxiao Dong, Hongyu Ren, Bowen Liu, Michele Catasta, and Jure Leskovec. Open graph benchmark: Datasets for machine learning on graphs. InNeurIPS, 2020

work page 2020

[34] [34]

Harnessing explanations: Llm-to-lm interpreter for enhanced text-attributed graph representation learning

Xiaoxin He, Xavier Bresson, Thomas Laurent, Adam Perold, Yann LeCun, and Bryan Hooi. Harnessing explanations: Llm-to-lm interpreter for enhanced text-attributed graph representation learning. InICLR, 2024

work page 2024

[35] [35]

Augmenting low-resource text classification with graph-grounded pre-training and prompting

Zhihao Wen and Yuan Fang. Augmenting low-resource text classification with graph-grounded pre-training and prompting. InSIGIR, 2023

work page 2023

[36] [36]

A comprehensive study on text-attributed graphs: Benchmarking and rethinking.NeurIPS, 36:17238–17264, 2023

Hao Yan, Chaozhuo Li, Ruosong Long, Chao Yan, Jianan Zhao, Wenwen Zhuang, Jun Yin, Peiyan Zhang, Weihao Han, Hao Sun, et al. A comprehensive study on text-attributed graphs: Benchmarking and rethinking.NeurIPS, 36:17238–17264, 2023

work page 2023

[37] [37]

Multilayer perceptron (mlp)

Hind Taud and Jean-Franccois Mas. Multilayer perceptron (mlp). InGeomatic approaches for modeling land change scenarios, pages 451–455. 2017

work page 2017

[38] [38]

Nodeformer: A scalable graph structure learning transformer for node classification.NeurIPS, 35:27387–27401, 2022

Qitian Wu, Wentao Zhao, Zenan Li, David P Wipf, and Junchi Yan. Nodeformer: A scalable graph structure learning transformer for node classification.NeurIPS, 35:27387–27401, 2022

work page 2022

[39] [39]

DIFFormer: Scalable (graph) transformers induced by energy constrained diffusion

Qitian Wu, Chenxiao Yang, Wentao Zhao, Yixuan He, David Wipf, and Junchi Yan. DIFFormer: Scalable (graph) transformers induced by energy constrained diffusion. InInternational Confer- ence on Learning Representations, 2023. 11

work page 2023

[40] [40]

Geometric knowledge distillation: Topology compression for graph neural networks.NeurIPS, 35:29761–29775, 2022

Chenxiao Yang, Qitian Wu, and Junchi Yan. Geometric knowledge distillation: Topology compression for graph neural networks.NeurIPS, 35:29761–29775, 2022

work page 2022

[41] [41]

Graph-less neural networks: Teaching old mlps new tricks via distillation

Shichang Zhang, Yozen Liu, Yizhou Sun, and Neil Shah. Graph-less neural networks: Teaching old mlps new tricks via distillation. InICLR, 2022

work page 2022

[42] [42]

This subgraph xxx, so the category might be xxx

Wei-Lin Chiang, Zhuohan Li, Ziqing Lin, Ying Sheng, Zhanghao Wu, Hao Zhang, Lianmin Zheng, Siyuan Zhuang, Yonghao Zhuang, Joseph E Gonzalez, et al. Vicuna: An open-source chatbot impressing gpt-4 with 90%* chatgpt quality.See https://vicuna. lmsys. org (accessed 14 April 2023), 2(3):6, 2023. 12 A Alogrithm We summarize the overall procedure of GRAPHREACTi...

work page 2023

[43] [43]

Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences

Text-based inference: Analyze the titles and descriptions of all neighbors as a whole. Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences. Give more weight to recurring elements shared across neighbors, like magical creatures, vehicle types, specific emotions,...

work page

[44] [46]

This subgraph xxx

Output constraint: Output exactly ONE sentence. Use the fixed format exactly as specified. Do not add explanations, bullet points, or extra text. The phrase "This subgraph xxx" should concisely describe the overall thematic focus of the children’s books in the subgraph. Here are correct examples: - "This subgraph features stories about fairies and magical...

work page

[45] [47]

Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences

Text-based inference: Analyze the titles and descriptions of all nodes as a whole. Identify the dominant themes, genres, or target age group, such as fantasy, animals, educational concepts, activities, or everyday life experiences. Give more weight to recurring elements shared across the node set, like magical creatures, vehicle types, specific emotions, ...

work page

[46] [48]

has written over one hundred books

Noise filtering: Ignore non-informative promotional content, including but not limited to: bestseller status, author biography details (e.g., "has written over one hundred books"), series accolades, review quotes, and generic publisher blurbs. Focus only on information that reflects the book’s core story, subject matter, characters, or intended educationa...

work page

[47] [49]

Category constraint: The final category MUST be chosen from the following list and cannot be invented or modified: {categories}

work page

[48] [50]

This node set xxx

Output constraint: Output exactly ONE sentence. Use the fixed format exactly as specified. Do not add explanations, bullet points, or extra text. The phrase "This node set xxx" should concisely describe the overall thematic focus of the children’s books in the set. Here are correct examples: - "This node set features stories about fairies and magical adve...

work page

[49] [51]

• Depending on the country in which research is conducted, IRB approval (or equivalent) may be required for any human subjects research

Institutional review board (IRB) approvals or equivalent for research with human subjects Question: Does the paper describe potential risks incurred by study participants, whether such risks were disclosed to the subjects, and whether Institutional Review Board (IRB) approvals (or an equivalent approval/review based on the requirements of your country or ...

work page