RAGA: Reading-And-Graph-building-Agent for Autonomous Knowledge Graph Construction and Retrieval-Augmented Generation

Chengrui Han; Zesheng Cheng

arxiv: 2605.17072 · v1 · pith:TYCWOPVHnew · submitted 2026-05-16 · 💻 cs.AI · cs.CL

RAGA: Reading-And-Graph-building-Agent for Autonomous Knowledge Graph Construction and Retrieval-Augmented Generation

Chengrui Han , Zesheng Cheng This is my paper

Pith reviewed 2026-05-20 15:25 UTC · model grok-4.3

classification 💻 cs.AI cs.CL

keywords knowledge graph constructionretrieval-augmented generationLLM agentsReAct frameworkhybrid retrievalprovenanceautonomous agents

0 comments

The pith

An LLM agent builds knowledge graphs by cycling through read-search-verify-construct steps and keeps them synchronized with vector search.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces RAGA to fix problems in existing LLM knowledge graph pipelines that miss cross-chunk relations and lack interpretability. It equips the agent with tools for creating, reading, updating and deleting graph entries inside a ReAct loop that follows a fixed cognitive sequence. Evidence from each entry stays linked to its original text, and the graph stays aligned with vector embeddings for combined retrieval. A sympathetic reader would care because the design aims to make extracted knowledge more traceable and the retrieval step more accurate on documents where facts span multiple sections.

Core claim

RAGA is an LLM-based framework that supplies an atomic toolset for full knowledge graph CRUD operations and embeds a Read-Search-Verify-Construct constraint inside a ReAct tool loop. A KG-vector synchronization mechanism supports hybrid symbolic-vector retrieval, and every knowledge entry remains anchored to its source text for auditable provenance. Preliminary tests on a QASPER subset show that the resulting fusion retrieval improves both answer quality and evidence quality over zero-shot baselines.

What carries the argument

Read-Search-Verify-Construct cognitive constraint embedded in a ReAct tool loop, which forces the agent to follow sequential verification before constructing or updating graph entries while supporting atomic CRUD operations.

If this is right

Hybrid symbolic-vector retrieval outperforms zero-shot baselines on answer and evidence quality.
Evidence-anchored verification supplies auditable provenance for every knowledge entry.
Atomic CRUD tools allow the agent to maintain the full lifecycle of the knowledge graph without external orchestration.
The framework design provides a concrete reference point for other agent-driven autonomous knowledge graph construction efforts.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same agent loop might be applied to streaming document collections so the graph updates continuously as new text arrives.
Cross-document reasoning tasks could benefit if the verification step is extended to check consistency across multiple source files at once.
Replacing the underlying LLM with a more capable model could reduce the rate at which cross-chunk relations are missed during the search step.

Load-bearing premise

An LLM agent can reliably carry out the Read-Search-Verify-Construct sequence at scale without introducing hallucinations or overlooking relations that cross text chunks.

What would settle it

Process a complete set of related scientific documents with the agent, then compare the constructed knowledge graph against human annotations for missing cross-chunk relations and for any entries that cannot be traced to source text.

read the original abstract

Existing LLM-driven knowledge graph (KG) construction methods predominantly employ stateless batch processing pipelines, exhibiting structural deficiencies in cross-chunk semantic relation capture, entity disambiguation, and construction process interpretability. These limitations undermine KG quality, retrieval precision, and deployment trust in high-stakes domains. We propose RAGA (Reading And Graph-building Agent), an LLM-based autonomous KG construction and retrieval fusion framework. RAGA provides an atomic toolset supporting full KG lifecycle CRUD operations and embeds a Read-Search-Verify-Construct cognitive constraint into a ReAct tool loop. A KG-vector synchronization mechanism enables hybrid symbolic-vector retrieval, while evidence-anchored verification links every knowledge entry to its source text for auditable provenance. Preliminary experiments on a subset of the QASPER scientific QA dataset indicate that RAGA's fusion retrieval outperforms zero-shot baselines, with KG integration providing measurable gains in both answer and evidence quality. The framework design and experimental baseline serve as a reference for agent-driven autonomous KG construction.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

RAGA bundles a Read-Search-Verify-Construct loop into a ReAct agent plus KG-vector sync and provenance links, but the QASPER subset tests give almost no numbers or failure analysis to back the reliability claims.

read the letter

Hi, the main thing to know about this RAGA paper is that it wraps a ReAct agent around a four-step cognitive constraint for turning text into a knowledge graph, adds CRUD tools, keeps the graph and vector store in sync for hybrid lookup, and ties every triple back to its source sentence. That package is the concrete contribution. It does a clear job naming the usual problems with batch KG pipelines—missed cross-chunk edges, poor disambiguation, and zero traceability—and the design choices look like a reasonable engineering response to those issues. The provenance anchoring and the hybrid retrieval step are the parts that could actually help downstream RAG quality in practice. The experiments, though, are the soft spot. They ran on only a subset of QASPER, report only that answer and evidence quality improved over zero-shot baselines, and give no numbers, no other agent baselines, and no breakdown of errors such as missed relations or wrong source links. Without that, it is hard to judge whether the Read-Search-Verify-Construct loop actually reduces hallucinations or scales past the easy cases. The paper is aimed at people already working on agentic knowledge construction and retrieval-augmented systems; they might pick up the toolset and the constraint pattern as a starting point even if they end up changing the details. I would send it to peer review so the authors can add proper metrics, error analysis, and comparisons—the framework is coherent enough to be worth referee time, but the current evidence is too thin to stand on its own.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes RAGA, an LLM-based autonomous agent framework for knowledge graph (KG) construction and retrieval-augmented generation. It introduces an atomic toolset for full KG lifecycle CRUD operations, embeds a Read-Search-Verify-Construct cognitive constraint into a ReAct tool loop, adds a KG-vector synchronization mechanism for hybrid symbolic-vector retrieval, and uses evidence-anchored verification to link every knowledge entry to its source text for provenance. Preliminary experiments on a QASPER subset indicate that the fusion retrieval outperforms zero-shot baselines with measurable gains in answer and evidence quality.

Significance. If the central claims hold under rigorous validation, the work could advance agent-driven KG construction by improving cross-chunk relation capture, interpretability, and deployment trust in scientific domains. The combination of cognitive constraints, hybrid retrieval, and auditable provenance represents a constructive step beyond stateless batch pipelines.

major comments (2)

[Experimental evaluation] Experimental evaluation: The central claims of outperformance and reliable KG construction rest on preliminary experiments on a QASPER subset that provide only qualitative statements of gains in answer and evidence quality. No quantitative metrics, specific baselines, error analysis, or scaling behavior are reported, which directly undermines support for the robustness of the Read-Search-Verify-Construct constraint and hallucination reduction.
[Framework description] Framework description: The assumption that an LLM agent can reliably execute the Read-Search-Verify-Construct sequence at scale without introducing hallucinations or missing cross-chunk relations is load-bearing for the claimed advantages over existing methods, yet it is supported only by the preliminary subset results rather than systematic validation or failure-mode analysis.

minor comments (1)

[Method] Clarify the exact definition and implementation details of the KG-vector synchronization mechanism to avoid ambiguity in how symbolic and vector stores are kept consistent during CRUD operations.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive feedback. The comments accurately note the preliminary character of the reported experiments and the need for stronger validation of the framework assumptions. We address each major comment below and indicate the revisions planned for the next manuscript version.

read point-by-point responses

Referee: Experimental evaluation: The central claims of outperformance and reliable KG construction rest on preliminary experiments on a QASPER subset that provide only qualitative statements of gains in answer and evidence quality. No quantitative metrics, specific baselines, error analysis, or scaling behavior are reported, which directly undermines support for the robustness of the Read-Search-Verify-Construct constraint and hallucination reduction.

Authors: We agree that the experimental evaluation section is currently limited and requires expansion to better substantiate the claims. The reported results are explicitly described as preliminary and rely on qualitative observations from a QASPER subset. In the revised manuscript we will add quantitative metrics (e.g., answer accuracy, evidence precision/recall, and KG construction F1), explicit baseline comparisons (standard zero-shot RAG and batch KG pipelines), and a concise error analysis highlighting cases of missed cross-chunk relations or residual hallucinations. Scaling behavior will be discussed on the basis of additional runs performed on larger subsets; full large-scale scaling will be noted as future work given resource constraints. revision: yes
Referee: Framework description: The assumption that an LLM agent can reliably execute the Read-Search-Verify-Construct sequence at scale without introducing hallucinations or missing cross-chunk relations is load-bearing for the claimed advantages over existing methods, yet it is supported only by the preliminary subset results rather than systematic validation or failure-mode analysis.

Authors: The Read-Search-Verify-Construct loop is presented as a cognitive constraint intended to reduce hallucinations and improve cross-chunk relation capture through iterative tool use and evidence anchoring. The preliminary results provide initial support for this design choice. We acknowledge that systematic validation and explicit failure-mode analysis are needed. In the revision we will add a dedicated paragraph discussing observed failure modes (e.g., verification-step errors) and how the ReAct-style loop with provenance links mitigates them. We will also clarify that while the current experiments demonstrate feasibility, broader multi-dataset validation remains planned future work. revision: partial

Circularity Check

0 steps flagged

No circularity: framework proposal without derivations or fitted quantities

full rationale

The paper describes an LLM agent framework (RAGA) for KG construction and RAG, including toolsets, a Read-Search-Verify-Construct constraint in a ReAct loop, KG-vector synchronization, and evidence-anchored verification. No equations, mathematical derivations, parameter fitting, or predictive reductions appear in the provided abstract or description. Claims rest on preliminary experiments versus zero-shot baselines on a QASPER subset rather than any self-referential definitions or self-citation chains that collapse results to inputs by construction. The work is self-contained as an engineering proposal and empirical reference.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The framework rests on standard assumptions about LLM agent capabilities rather than new mathematical axioms or fitted parameters; the main addition is the postulated agent architecture itself.

axioms (1)

domain assumption LLM agents following ReAct-style loops can be constrained to perform reliable Read-Search-Verify-Construct operations for KG construction
Invoked in the description of the cognitive constraint and tool loop without further justification or empirical bounds.

invented entities (1)

RAGA agent with Read-Search-Verify-Construct constraint no independent evidence
purpose: To enable autonomous, auditable KG construction and hybrid retrieval
New agent design introduced by the paper; no independent evidence outside the framework description is provided.

pith-pipeline@v0.9.0 · 5703 in / 1367 out tokens · 61434 ms · 2026-05-20T15:25:02.606340+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

embeds a Read–Search–Verify–Construct cognitive constraint into a ReAct tool loop

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

35 extracted references · 35 canonical work pages

[1]

World Wide Web , volume =

Zhu, Yuqi and Wang, Xiaohan and Chen, Jing and Qiao, Shuofei and Ou, Yixin and Yao, Yunzhi and Deng, Shumin and Chen, Huajun and Zhang, Ningyu , title =. World Wide Web , volume =. 2024 , doi =

work page 2024
[2]

and Gardner, Matt , title =

Dasigi, Pradeep and Lo, Kyle and Beltagy, Iz and Cohan, Arman and Smith, Noah A. and Gardner, Matt , title =. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =. 2021 , doi =

work page 2021
[3]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , pages =

Zhang, Bowen and Soh, Harold , title =. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , pages =. 2024 , doi =

work page 2024
[4]

Web Information Systems Engineering ---

Lairgi, Yassir and Moncla, Ludovic and Cazabet, R\'. Web Information Systems Engineering ---. 2024 , doi =

work page 2024
[5]

Retrieval-Augmented Generation for Knowledge-Intensive

Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and K\". Retrieval-Augmented Generation for Knowledge-Intensive. Advances in Neural Information Processing Systems 33 (NeurIPS 2020) , pages =. 2020 , doi =

work page 2020
[6]

2023 , eprint =

Gao, Yunfan and Xiong, Yun and Gao, Xinyu and Jia, Kangxiang and Pan, Jinliu and Bi, Yuxi and Dai, Yi and Sun, Jiawei and Wang, Meng and Wang, Haofen , title =. 2023 , eprint =

work page 2023
[7]

2024 , eprint =

Edge, Darren and Trinh, Ha and Cheng, Newman and Bradley, Joshua and Chao, Alex and Mody, Apurva and Truitt, Steven and Metropolitansky, Dasha and Ness, Robert Osazuwa and Larson, Jonathan , title =. 2024 , eprint =

work page 2024
[8]

Findings of the Association for Computational Linguistics: EMNLP 2025 , pages =

Guo, Zirui and Xia, Lianghao and Yu, Yanhua and Ao, Tu and Huang, Chao , title =. Findings of the Association for Computational Linguistics: EMNLP 2025 , pages =. 2025 , doi =

work page 2025
[9]

Companion Proceedings of the ACM on Web Conference 2025 (WWW 2025 Companion) , pages =

Liang, Lei and Bo, Zhongpu and Gui, Zhengke and Zhu, Zhongshu and Zhong, Ling and Zhao, Peilong and Sun, Mengshu and Zhang, Zhiqiang and Zhou, Jun and Chen, Wenguang and Zhang, Wen and Chen, Huajun , title =. Companion Proceedings of the ACM on Web Conference 2025 (WWW 2025 Companion) , pages =. 2025 , doi =

work page 2025
[10]

and Cao, Yuan , title =

Yao, Shunyu and Zhao, Jeffrey and Yu, Dian and Du, Nan and Shafran, Izhak and Narasimhan, Karthik R. and Cao, Yuan , title =. The Eleventh International Conference on Learning Representations (ICLR) , year =

work page
[11]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Jiang, Jinhao and Zhou, Kun and Zhao, Wayne Xin and Song, Yang and Zhu, Chen and Zhu, Hengshu and Wen, Ji-Rong , title =. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2025 , doi =

work page 2025
[12]

2024 , eprint =

Anokhin, Petr and Semenov, Nikita and Sorokin, Artyom and Evseev, Dmitry and Kravchenko, Andrey and Burtsev, Mikhail and Burnaev, Evgeny , title =. 2024 , eprint =

work page 2024
[13]

Advances in Neural Information Processing Systems 37 (NeurIPS 2024) , volume =

Ning, Yansong and Liu, Hao , title =. Advances in Neural Information Processing Systems 37 (NeurIPS 2024) , volume =. 2024 , doi =

work page 2024
[14]

Proceedings of the 5th ACM International Conference on AI in Finance (ICAIF 2024) , pages =

Sarmah, Bhaskarjit and Mehta, Dhagash and Hall, Benika and Rao, Rohan and Patel, Sunil and Pasquali, Stefano , title =. Proceedings of the 5th ACM International Conference on AI in Finance (ICAIF 2024) , pages =. 2024 , doi =

work page 2024
[15]

, title =

Sarthi, Parth and Abdullah, Salman and Tuli, Aditi and Khanna, Shubh and Goldie, Anna and Manning, Christopher D. , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page
[16]

and Lin, Kevin and Wooders, Sarah and Stoica, Ion and Gonzalez, Joseph E

Packer, Charles and Fang, Vivian and Patil, Shishir G. and Lin, Kevin and Wooders, Sarah and Stoica, Ion and Gonzalez, Joseph E. , title =. 2023 , eprint =

work page 2023
[17]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Zhong, Wanjun and Guo, Lianghong and Gao, Qiqi and Ye, He and Wang, Yanlin , title =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2024 , doi =

work page 2024
[18]

2026 , eprint =

Huo, Yupeng and Lu, Yaxi and Zhang, Zhong and Chen, Haotian and Lin, Yankai , title =. 2026 , eprint =

work page 2026
[19]

2025 , eprint =

Xu, Wujiang and Liang, Zujie and Mei, Kai and Gao, Hang and Tan, Juntao and Zhang, Yongfeng , title =. 2025 , eprint =

work page 2025
[20]

2026 , eprint =

Lv, Can and Chang, Heng and Guo, Yuchen and Tao, Shengyu and Zhou, Shiji , title =. 2026 , eprint =

work page 2026
[21]

The Twelfth International Conference on Learning Representations (ICLR) , year =

Asai, Akari and Wu, Zeqiu and Wang, Yizhong and Sil, Avirup and Hajishirzi, Hannaneh , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page
[22]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , pages =

Jeong, Soyeong and Baek, Jinheon and Cho, Sukmin and Hwang, Sung Ju and Park, Jong , title =. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , pages =. 2024 , doi =

work page 2024
[23]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Trivedi, Harsh and Balasubramanian, Niranjan and Khot, Tushar and Sabharwal, Ashish , title =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2023 , doi =

work page 2023
[24]

and Shum, Heung-Yeung and Guo, Jian , title =

Sun, Jiashuo and Xu, Chengjin and Tang, Lumingyuan and Wang, Saizhuo and Lin, Chen and Gong, Yeyun and Ni, Lionel M. and Shum, Heung-Yeung and Guo, Jian , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page
[25]

The Twelfth International Conference on Learning Representations (ICLR) , year =

Luo, Linhao and Li, Yuan-Fang and Haffari, Gholamreza and Pan, Shirui , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page
[26]

Findings of the Association for Computational Linguistics: ACL 2025 , pages =

Mavromatis, Costas and Karypis, George , title =. Findings of the Association for Computational Linguistics: ACL 2025 , pages =. 2025 , doi =

work page 2025
[27]

and Laurent, Thomas and LeCun, Yann and Bresson, Xavier and Hooi, Bryan , title =

He, Xiaoxin and Tian, Yijun and Sun, Yifei and Chawla, Nitesh V. and Laurent, Thomas and LeCun, Yann and Bresson, Xavier and Hooi, Bryan , title =. Advances in Neural Information Processing Systems 37 (NeurIPS 2024) , volume =. 2024 , doi =

work page 2024
[28]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Dong, Ge and Jin, Jiajie and Li, Xinyu and Zhu, Yutao and Dou, Zhicheng and Wen, Jirong , title =. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2025 , doi =

work page 2025
[29]

Advances in Neural Information Processing Systems 38 (NeurIPS 2024) , pages =

Gutierrez, Bernal Jimenez and Shu, Yiheng and Gu, Yu and Yasunaga, Michihiro and Su, Yu , title =. Advances in Neural Information Processing Systems 38 (NeurIPS 2024) , pages =. 2024 , doi =

work page 2024
[30]

and Walker, Steve and Jones, Susan and Hancock-Beaulieu, Micheline M

Robertson, Stephen E. and Walker, Steve and Jones, Susan and Hancock-Beaulieu, Micheline M. and Gatford, Mike , title =. Overview of the Third Text REtrieval Conference (TREC-3) , series =

work page
[31]

Journal of Documentation , volume =

Sparck Jones, Karen , title =. Journal of Documentation , volume =. 1972 , doi =

work page 1972
[32]

2025 , eprint =

Wang, Jinyu and Fu, Jingjing and Wang, Rui and Song, Lei and Bian, Jiang , title =. 2025 , eprint =

work page 2025
[33]

The Thirteenth International Conference on Learning Representations (ICLR) , year =

Li, Zhuoqun and Chen, Xuanang and Yu, Haiyang and Lin, Hongyu and Lu, Yaojie and Tang, Qiaoyu and Huang, Fei and Han, Xianpei and Sun, Le and Li, Yongbin , title =. The Thirteenth International Conference on Learning Representations (ICLR) , year =

work page
[34]

and Rangwala, Huzefa and Faloutsos, Christos , title =

Lee, Meng-Chieh and Zhu, Qi and Mavromatis, Costas and Han, Zhen and Adeshina, Soji and Ioannidis, Vassilis N. and Rangwala, Huzefa and Faloutsos, Christos , title =. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2025 , doi =

work page 2025
[35]

2025 , eprint =

Luo, Haoran and E, Haihong and Chen, Guanting and Lin, Qika and Guo, Yikai and Xu, Fangzhi and Kuang, Zemin and Song, Meina and Wu, Xiaobao and Zhu, Yifan and Tuan, Luu Anh , title =. 2025 , eprint =

work page 2025

[1] [1]

World Wide Web , volume =

Zhu, Yuqi and Wang, Xiaohan and Chen, Jing and Qiao, Shuofei and Ou, Yixin and Yao, Yunzhi and Deng, Shumin and Chen, Huajun and Zhang, Ningyu , title =. World Wide Web , volume =. 2024 , doi =

work page 2024

[2] [2]

and Gardner, Matt , title =

Dasigi, Pradeep and Lo, Kyle and Beltagy, Iz and Cohan, Arman and Smith, Noah A. and Gardner, Matt , title =. Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies , pages =. 2021 , doi =

work page 2021

[3] [3]

Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , pages =

Zhang, Bowen and Soh, Harold , title =. Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing , pages =. 2024 , doi =

work page 2024

[4] [4]

Web Information Systems Engineering ---

Lairgi, Yassir and Moncla, Ludovic and Cazabet, R\'. Web Information Systems Engineering ---. 2024 , doi =

work page 2024

[5] [5]

Retrieval-Augmented Generation for Knowledge-Intensive

Lewis, Patrick and Perez, Ethan and Piktus, Aleksandra and Petroni, Fabio and Karpukhin, Vladimir and Goyal, Naman and K\". Retrieval-Augmented Generation for Knowledge-Intensive. Advances in Neural Information Processing Systems 33 (NeurIPS 2020) , pages =. 2020 , doi =

work page 2020

[6] [6]

2023 , eprint =

Gao, Yunfan and Xiong, Yun and Gao, Xinyu and Jia, Kangxiang and Pan, Jinliu and Bi, Yuxi and Dai, Yi and Sun, Jiawei and Wang, Meng and Wang, Haofen , title =. 2023 , eprint =

work page 2023

[7] [7]

2024 , eprint =

Edge, Darren and Trinh, Ha and Cheng, Newman and Bradley, Joshua and Chao, Alex and Mody, Apurva and Truitt, Steven and Metropolitansky, Dasha and Ness, Robert Osazuwa and Larson, Jonathan , title =. 2024 , eprint =

work page 2024

[8] [8]

Findings of the Association for Computational Linguistics: EMNLP 2025 , pages =

Guo, Zirui and Xia, Lianghao and Yu, Yanhua and Ao, Tu and Huang, Chao , title =. Findings of the Association for Computational Linguistics: EMNLP 2025 , pages =. 2025 , doi =

work page 2025

[9] [9]

Companion Proceedings of the ACM on Web Conference 2025 (WWW 2025 Companion) , pages =

Liang, Lei and Bo, Zhongpu and Gui, Zhengke and Zhu, Zhongshu and Zhong, Ling and Zhao, Peilong and Sun, Mengshu and Zhang, Zhiqiang and Zhou, Jun and Chen, Wenguang and Zhang, Wen and Chen, Huajun , title =. Companion Proceedings of the ACM on Web Conference 2025 (WWW 2025 Companion) , pages =. 2025 , doi =

work page 2025

[10] [10]

and Cao, Yuan , title =

Yao, Shunyu and Zhao, Jeffrey and Yu, Dian and Du, Nan and Shafran, Izhak and Narasimhan, Karthik R. and Cao, Yuan , title =. The Eleventh International Conference on Learning Representations (ICLR) , year =

work page

[11] [11]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Jiang, Jinhao and Zhou, Kun and Zhao, Wayne Xin and Song, Yang and Zhu, Chen and Zhu, Hengshu and Wen, Ji-Rong , title =. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2025 , doi =

work page 2025

[12] [12]

2024 , eprint =

Anokhin, Petr and Semenov, Nikita and Sorokin, Artyom and Evseev, Dmitry and Kravchenko, Andrey and Burtsev, Mikhail and Burnaev, Evgeny , title =. 2024 , eprint =

work page 2024

[13] [13]

Advances in Neural Information Processing Systems 37 (NeurIPS 2024) , volume =

Ning, Yansong and Liu, Hao , title =. Advances in Neural Information Processing Systems 37 (NeurIPS 2024) , volume =. 2024 , doi =

work page 2024

[14] [14]

Proceedings of the 5th ACM International Conference on AI in Finance (ICAIF 2024) , pages =

Sarmah, Bhaskarjit and Mehta, Dhagash and Hall, Benika and Rao, Rohan and Patel, Sunil and Pasquali, Stefano , title =. Proceedings of the 5th ACM International Conference on AI in Finance (ICAIF 2024) , pages =. 2024 , doi =

work page 2024

[15] [15]

, title =

Sarthi, Parth and Abdullah, Salman and Tuli, Aditi and Khanna, Shubh and Goldie, Anna and Manning, Christopher D. , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page

[16] [16]

and Lin, Kevin and Wooders, Sarah and Stoica, Ion and Gonzalez, Joseph E

Packer, Charles and Fang, Vivian and Patil, Shishir G. and Lin, Kevin and Wooders, Sarah and Stoica, Ion and Gonzalez, Joseph E. , title =. 2023 , eprint =

work page 2023

[17] [17]

Proceedings of the AAAI Conference on Artificial Intelligence , volume =

Zhong, Wanjun and Guo, Lianghong and Gao, Qiqi and Ye, He and Wang, Yanlin , title =. Proceedings of the AAAI Conference on Artificial Intelligence , volume =. 2024 , doi =

work page 2024

[18] [18]

2026 , eprint =

Huo, Yupeng and Lu, Yaxi and Zhang, Zhong and Chen, Haotian and Lin, Yankai , title =. 2026 , eprint =

work page 2026

[19] [19]

2025 , eprint =

Xu, Wujiang and Liang, Zujie and Mei, Kai and Gao, Hang and Tan, Juntao and Zhang, Yongfeng , title =. 2025 , eprint =

work page 2025

[20] [20]

2026 , eprint =

Lv, Can and Chang, Heng and Guo, Yuchen and Tao, Shengyu and Zhou, Shiji , title =. 2026 , eprint =

work page 2026

[21] [21]

The Twelfth International Conference on Learning Representations (ICLR) , year =

Asai, Akari and Wu, Zeqiu and Wang, Yizhong and Sil, Avirup and Hajishirzi, Hannaneh , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page

[22] [22]

Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , pages =

Jeong, Soyeong and Baek, Jinheon and Cho, Sukmin and Hwang, Sung Ju and Park, Jong , title =. Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers) , pages =. 2024 , doi =

work page 2024

[23] [23]

Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Trivedi, Harsh and Balasubramanian, Niranjan and Khot, Tushar and Sabharwal, Ashish , title =. Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2023 , doi =

work page 2023

[24] [24]

and Shum, Heung-Yeung and Guo, Jian , title =

Sun, Jiashuo and Xu, Chengjin and Tang, Lumingyuan and Wang, Saizhuo and Lin, Chen and Gong, Yeyun and Ni, Lionel M. and Shum, Heung-Yeung and Guo, Jian , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page

[25] [25]

The Twelfth International Conference on Learning Representations (ICLR) , year =

Luo, Linhao and Li, Yuan-Fang and Haffari, Gholamreza and Pan, Shirui , title =. The Twelfth International Conference on Learning Representations (ICLR) , year =

work page

[26] [26]

Findings of the Association for Computational Linguistics: ACL 2025 , pages =

Mavromatis, Costas and Karypis, George , title =. Findings of the Association for Computational Linguistics: ACL 2025 , pages =. 2025 , doi =

work page 2025

[27] [27]

and Laurent, Thomas and LeCun, Yann and Bresson, Xavier and Hooi, Bryan , title =

He, Xiaoxin and Tian, Yijun and Sun, Yifei and Chawla, Nitesh V. and Laurent, Thomas and LeCun, Yann and Bresson, Xavier and Hooi, Bryan , title =. Advances in Neural Information Processing Systems 37 (NeurIPS 2024) , volume =. 2024 , doi =

work page 2024

[28] [28]

Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =

Dong, Ge and Jin, Jiajie and Li, Xinyu and Zhu, Yutao and Dou, Zhicheng and Wen, Jirong , title =. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2025 , doi =

work page 2025

[29] [29]

Advances in Neural Information Processing Systems 38 (NeurIPS 2024) , pages =

Gutierrez, Bernal Jimenez and Shu, Yiheng and Gu, Yu and Yasunaga, Michihiro and Su, Yu , title =. Advances in Neural Information Processing Systems 38 (NeurIPS 2024) , pages =. 2024 , doi =

work page 2024

[30] [30]

and Walker, Steve and Jones, Susan and Hancock-Beaulieu, Micheline M

Robertson, Stephen E. and Walker, Steve and Jones, Susan and Hancock-Beaulieu, Micheline M. and Gatford, Mike , title =. Overview of the Third Text REtrieval Conference (TREC-3) , series =

work page

[31] [31]

Journal of Documentation , volume =

Sparck Jones, Karen , title =. Journal of Documentation , volume =. 1972 , doi =

work page 1972

[32] [32]

2025 , eprint =

Wang, Jinyu and Fu, Jingjing and Wang, Rui and Song, Lei and Bian, Jiang , title =. 2025 , eprint =

work page 2025

[33] [33]

The Thirteenth International Conference on Learning Representations (ICLR) , year =

Li, Zhuoqun and Chen, Xuanang and Yu, Haiyang and Lin, Hongyu and Lu, Yaojie and Tang, Qiaoyu and Huang, Fei and Han, Xianpei and Sun, Le and Li, Yongbin , title =. The Thirteenth International Conference on Learning Representations (ICLR) , year =

work page

[34] [34]

and Rangwala, Huzefa and Faloutsos, Christos , title =

Lee, Meng-Chieh and Zhu, Qi and Mavromatis, Costas and Han, Zhen and Adeshina, Soji and Ioannidis, Vassilis N. and Rangwala, Huzefa and Faloutsos, Christos , title =. Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages =. 2025 , doi =

work page 2025

[35] [35]

2025 , eprint =

Luo, Haoran and E, Haihong and Chen, Guanting and Lin, Qika and Guo, Yikai and Xu, Fangzhi and Kuang, Zemin and Song, Meina and Wu, Xiaobao and Zhu, Yifan and Tuan, Luu Anh , title =. 2025 , eprint =

work page 2025