arxiv: 2604.17458 · v2 · submitted 2026-04-19 · 💻 cs.AI

Recognition: unknown

EHRAG: Bridging Semantic Gaps in Lightweight GraphRAG via Hybrid Hypergraph Construction and Retrieval

Yifan Song , Xingjian Tao , Zhicheng Yang , Yihong Luo , Jing Tang

Authors on Pith no claims yet

Pith reviewed 2026-05-10 05:39 UTC · model grok-4.3

classification 💻 cs.AI

keywords GraphRAGhypergraphlightweight RAGsemantic retrievalhybrid diffusionentity embeddingspersonalized PageRankmulti-hop reasoning

0 comments

The pith

EHRAG builds a hybrid hypergraph from sentence co-occurrences and embedding clusters to retrieve across semantic gaps that pure structure misses.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces EHRAG to fix a gap in lightweight GraphRAG methods. Those methods turn a corpus into a graph using only named-entity co-occurrences, so entities that are related but never appear together stay disconnected. EHRAG adds a second set of hyperedges formed by clustering the embeddings of entity text, so the same graph now holds both explicit structure and latent semantic links. Retrieval then runs a combined diffusion process that scores documents by both kinds of edges before refining the top results with personalized PageRank. If the claim holds, multi-hop questions can be answered more accurately while the entire indexing step stays linear and uses no language-model tokens.

Core claim

EHRAG constructs structural hyperedges from sentence-level co-occurrence after lightweight entity extraction and semantic hyperedges by clustering entity text embeddings. Retrieval then applies structure-semantic hybrid diffusion with topic-aware scoring followed by personalized PageRank refinement to return the top-k documents. On four datasets this produces higher performance than prior lightweight GraphRAG baselines while preserving linear indexing complexity and zero token consumption during construction.

What carries the argument

The hybrid hypergraph that joins sentence-level co-occurrence hyperedges with embedding-cluster hyperedges, which supplies the retrieval mechanism that links both structural and semantic signals.

If this is right

Retrieval accuracy rises on the four evaluated datasets relative to prior lightweight baselines.
Indexing cost remains linear in the size of the corpus.
Hypergraph construction consumes no language-model tokens.
Hybrid diffusion plus PPR selects documents that better support multi-hop reasoning.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same two-layer hyperedge construction could be inserted into other graph-based retrieval pipelines that currently rely only on co-occurrence.
Zero-token construction makes the method practical for large private corpora where sending text to an external model is undesirable.
Replacing the embedding cluster step with alternative grouping methods might further tighten the semantic connections captured by the hypergraph.

Load-bearing premise

That the specific pairing of sentence co-occurrence hyperedges, embedding-cluster hyperedges, hybrid diffusion, and PPR refinement yields a retrieval distribution measurably superior to earlier lightweight GraphRAG methods on the chosen metrics.

What would settle it

An ablation on the same four datasets that removes either the semantic hyperedges or the PPR refinement step and shows the remaining system matches or falls below baseline scores.

Figures

Figures reproduced from arXiv: 2604.17458 by Jing Tang, Xingjian Tao, Yifan Song, Yihong Luo, Zhicheng Yang.

**Figure 2.** Figure 2: The overall framework of EHRAG. The process is divided into two phases: (1) Offline Construction: We extract entities using lightweight models to build structural hyperedges, while simultaneously clustering entity embeddings (e.g., BIRCH) to form semantic hyperedges. (2) Online Retrieval: User queries activate anchor nodes, initiating a structure-semantic hybrid diffusion process that propagates scores thr… view at source ↗

**Figure 3.** Figure 3: Efficiency comparison on 2WikiMultiHop. We report the Indexing Time, Token Consumption, and [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Parameter sensitivity analysis on 2WikiMultiHop (2Wiki) and HotpotQA (Hotpot) datasets. [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

read the original abstract

Graph-based Retrieval-Augmented Generation (GraphRAG) enhances LLMs by structuring corpus into graphs to facilitate multi-hop reasoning. While recent lightweight approaches reduce indexing costs by leveraging Named Entity Recognition (NER), they rely strictly on structural co-occurrence, failing to capture latent semantic connections between disjoint entities. To address this, we propose EHRAG, a lightweight RAG framework that constructs a hypergraph capturing both structure and semantic level relationships, employing a hybrid structural-semantic retrieval mechanism. Specifically, EHRAG constructs structural hyperedges based on sentence-level co-occurrence with lightweight entity extraction and semantic hyperedges by clustering entity text embeddings, ensuring the hypergraph encompasses both structural and semantic information. For retrieval, EHRAG performs a structure-semantic hybrid diffusion with topic-aware scoring and personalized pagerank (PPR) refinement to identify the top-k relevant documents. Experiments on four datasets show that EHRAG outperforms state-of-the-art baselines while maintaining linear indexing complexity and zero token consumption for construction. Code is available at https://github.com/yfsong00/EHRAG.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

EHRAG's hybrid hypergraph plus diffusion/PPR retrieval is a new combo for lightweight GraphRAG, but the experiments do not isolate what actually drives the reported gains.

read the letter

The main thing to know about EHRAG is that it builds a hypergraph with both sentence co-occurrence hyperedges and additional ones from clustering entity embeddings, then retrieves with a blended structure-semantic diffusion step plus topic-aware PPR. That specific mix is not described in the prior lightweight GraphRAG papers they cite, and the approach keeps indexing linear with zero token cost for construction, which is a practical constraint they respect throughout.

Referee Report

2 major / 2 minor

Summary. The paper proposes EHRAG, a lightweight GraphRAG framework that builds a hybrid hypergraph with structural hyperedges (sentence-level entity co-occurrence via lightweight NER) and semantic hyperedges (clustering of entity embeddings). Retrieval uses structure-semantic hybrid diffusion, topic-aware scoring, and PPR refinement to select top-k documents. The central claim is that this outperforms SOTA baselines on four datasets while preserving linear indexing complexity and zero token consumption during construction; open-source code is provided.

Significance. If the empirical claims hold after proper controls, the work would be moderately significant for the GraphRAG literature: it targets a recognized limitation of purely structural lightweight methods (missing latent semantics) without increasing indexing cost, and the public code supports reproducibility. However, the absence of isolating ablations means the significance cannot yet be assessed beyond the level of an incremental engineering contribution.

major comments (2)

[Experiments] Experiments section: the manuscript reports outperformance on four datasets but provides no ablation studies that disable semantic hyperedges, replace hybrid diffusion with pure structural or pure semantic propagation, or remove PPR refinement. Without these controls it is impossible to attribute gains to the proposed hybrid mechanism rather than implementation details or dataset artifacts, directly undermining the central claim that the combination 'bridges semantic gaps' in a meaningful way.
[§3.2 and §3.3] §3.2 and §3.3: the description of hybrid diffusion and topic-aware scoring does not include a formal definition or pseudocode for how structural and semantic signals are combined (e.g., weighting scheme, normalization), making it difficult to verify that the retrieval distribution is produced by the claimed hybrid process rather than by post-hoc tuning.

minor comments (2)

[Abstract] Abstract and §4: quantitative results (exact metrics, baselines, error bars, statistical significance) are summarized but not tabulated in the abstract; a concise results table should be added for quick assessment.
[§3.1] The paper states 'zero token consumption for construction' but does not clarify whether this excludes the cost of the initial embedding model used for semantic hyperedges; a short clarification would improve precision.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive and detailed feedback on our manuscript. The comments highlight important opportunities to strengthen the empirical validation and formal clarity of EHRAG. We address each major comment below and will revise the manuscript accordingly to improve reproducibility and attribution of results.

read point-by-point responses

Referee: [Experiments] Experiments section: the manuscript reports outperformance on four datasets but provides no ablation studies that disable semantic hyperedges, replace hybrid diffusion with pure structural or pure semantic propagation, or remove PPR refinement. Without these controls it is impossible to attribute gains to the proposed hybrid mechanism rather than implementation details or dataset artifacts, directly undermining the central claim that the combination 'bridges semantic gaps' in a meaningful way.

Authors: We agree that dedicated ablation studies are necessary to rigorously isolate the contribution of the hybrid components and support the central claim. While the existing comparisons to strong baselines (which lack semantic hyperedges or hybrid diffusion) provide indirect evidence, they do not fully control for each element. In the revised manuscript we will add a new ablation subsection in Experiments that includes: (1) disabling semantic hyperedges, (2) replacing hybrid diffusion with pure structural or pure semantic propagation, and (3) removing PPR refinement. These results will be reported on the same four datasets to directly attribute performance gains. revision: yes
Referee: [§3.2 and §3.3] §3.2 and §3.3: the description of hybrid diffusion and topic-aware scoring does not include a formal definition or pseudocode for how structural and semantic signals are combined (e.g., weighting scheme, normalization), making it difficult to verify that the retrieval distribution is produced by the claimed hybrid process rather than by post-hoc tuning.

Authors: We acknowledge that the current prose description in §§3.2–3.3 leaves the precise combination of signals underspecified. In the revised manuscript we will insert formal mathematical definitions together with pseudocode for the structure-semantic hybrid diffusion and topic-aware scoring. These additions will explicitly detail the weighting scheme, normalization steps, and integration of the two signal types, enabling direct verification that the retrieval distribution follows the claimed hybrid process. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical method with independent algorithmic steps and dataset evaluation

full rationale

The paper introduces EHRAG as an algorithmic construction: sentence-level co-occurrence hyperedges plus embedding-cluster hyperedges, followed by structure-semantic hybrid diffusion, topic-aware scoring, and PPR refinement. These steps are specified directly in the method without equations that equate outputs to inputs by definition, without fitted parameters relabeled as predictions, and without load-bearing self-citations that would make the hybrid mechanism reduce to prior author work. The central claims rest on empirical outperformance across four datasets rather than any tautological derivation, rendering the contribution self-contained.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no explicit free parameters, axioms, or invented entities are stated in the provided text.

pith-pipeline@v0.9.0 · 5496 in / 1178 out tokens · 35570 ms · 2026-05-10T05:39:18.341956+00:00 · methodology

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Prune-OPD: Efficient and Reliable On-Policy Distillation for Long-Horizon Reasoning
cs.LG 2026-05 unverdicted novelty 6.0

Prune-OPD dynamically prunes unreliable teacher rewards in on-policy distillation by monitoring prefix drift via top-k overlap, reducing training time 37.6-68% on AMC/AIME/HMMT while preserving or improving performance.

Reference graph

Works this paper leans on

300 extracted references · 125 canonical work pages · cited by 1 Pith paper · 6 internal anchors

[1]

arXiv preprint arXiv:2402.02389 , year=

Kicgpt: Large language model with knowledge in context for knowledge graph completion , author=. arXiv preprint arXiv:2402.02389 , year=

work page arXiv
[2]

arXiv preprint arXiv:2410.02811 , year=

Sac-kg: Exploiting large language models as skilled automatic constructors for domain knowledge graphs , author=. arXiv preprint arXiv:2410.02811 , year=

work page arXiv
[3]

Edward J Hu and yelong shen and Phillip Wallis and Zeyuan Allen-Zhu and Yuanzhi Li and Shean Wang and Lu Wang and Weizhu Chen , booktitle =. Lo
[4]

2020 , publisher=

spaCy: Industrial-strength natural language processing in python , author=. 2020 , publisher=

2020
[5]

Constructing a multi-hop qa dataset for comprehensive evaluation of reasoning steps,

Constructing a multi-hop qa dataset for comprehensive evaluation of reasoning steps , author=. arXiv preprint arXiv:2011.01060 , year=

work page arXiv 2011
[6]

2023 , journal =

GPT-4 Technical Report , author=. 2023 , journal =

2023
[7]

arXiv e-prints , pages=

The llama 3 herd of models , author=. arXiv e-prints , pages=
[8]

IEEE Transactions on Knowledge and Data Engineering , year=

Next-generation database interfaces: A survey of llm-based text-to-sql , author=. IEEE Transactions on Knowledge and Data Engineering , year=
[9]

Knapsack Optimization-based Schema Linking for LLM-based Text-to-SQL Generation

Knapsack optimization-based schema linking for llm-based Text-to-SQL generation , author=. arXiv preprint arXiv:2502.12911 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[10]

2025 , eprint=

Reliable Reasoning Path: Distilling Effective Guidance for LLM Reasoning with Knowledge Graphs , author=. 2025 , eprint=

2025
[11]

2025 , eprint=

GraphRAG-Bench: Challenging Domain-Specific Reasoning for Evaluating Graph Retrieval-Augmented Generation , author=. 2025 , eprint=

2025
[12]

2025 , eprint=

LAG: Logic-Augmented Generation from a Cartesian Perspective , author=. 2025 , eprint=

2025
[13]

arXiv preprint arXiv:2508.07690 , year=

LoSemB: Logic-Guided Semantic Bridging for Inductive Tool Retrieval , author=. arXiv preprint arXiv:2508.07690 , year=

work page arXiv
[14]

arXiv preprint arXiv:2407.00341 , year=

Iterative data generation with large language models for aspect-based sentiment analysis , author=. arXiv preprint arXiv:2407.00341 , year=

work page arXiv
[15]

Transactions of the Association for Computational Linguistics , volume=

MuSiQue: Multi-hop Questions via Single-hop Question Composition , author=. Transactions of the Association for Computational Linguistics , volume=
[16]

condition

AutoSchemaKG: Autonomous Knowledge Graph Construction through Dynamic Schema Induction from Web-Scale Corpora , author=. arXiv preprint arXiv:2505.23628 , year=

work page arXiv
[17]

Towards a unified view of parameter-efficient transfer learning , year =

He, Junxian and Zhou, Chunting and Ma, Xuezhe and Berg-Kirkpatrick, Taylor and Neubig, Graham , booktitle =. Towards a unified view of parameter-efficient transfer learning , year =
[18]

Gao, D., Wang, H., Li, Y., et al

Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? , author=. arXiv preprint arXiv:2405.05904 , year=

work page arXiv
[19]

Proceedings of the 27th ACM international conference on information and knowledge management , pages=

Acekg: A large-scale knowledge graph for academic data mining , author=. Proceedings of the 27th ACM international conference on information and knowledge management , pages=
[20]

Proceedings of the 1st workshop on LREC (Language Resources and Technologies for the Legal Knowledge Graph) workshop , pages=

Lynx: building the legal knowledge graph for smart compliance services in multilingual Europe , author=. Proceedings of the 1st workshop on LREC (Language Resources and Technologies for the Legal Knowledge Graph) workshop , pages=
[21]

Conference on Parsimony and Learning , pages=

Investigating the catastrophic forgetting in multimodal large language model fine-tuning , author=. Conference on Parsimony and Learning , pages=
[22]

Briefings in bioinformatics , volume=

Enriching contextualized language model from knowledge graph for biomedical information extraction , author=. Briefings in bioinformatics , volume=. 2021 , publisher=

2021
[23]

IEEE Transactions on Knowledge and Data Engineering , volume=

Unifying large language models and knowledge graphs: A roadmap , author=. IEEE Transactions on Knowledge and Data Engineering , volume=. 2024 , publisher=

2024
[24]

arXiv preprint arXiv:2010.12688 , year=

Knowledge graph based synthetic corpus generation for knowledge-enhanced language model pre-training , author=. arXiv preprint arXiv:2010.12688 , year=

work page arXiv 2010
[25]

arXiv preprint arXiv:2310.06671 , year=

Making large language models perform better in knowledge graph completion , author=. arXiv preprint arXiv:2310.06671 , year=

work page arXiv
[26]

IEEE Access , volume=

Language model guided knowledge graph embeddings , author=. IEEE Access , volume=. 2022 , publisher=

2022
[27]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

K-bert: Enabling language representation with knowledge graph , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=
[28]

arXiv preprint arXiv:2111.08546 , year=

Interpreting language models through knowledge graph extraction , author=. arXiv preprint arXiv:2111.08546 , year=

work page arXiv
[29]

Think-on-Graph: Deep and Responsible Reasoning of Large Language Model on Knowledge Graph , author=
[30]

arXiv preprint arXiv:2306.11489 , year=

Chatgpt is not enough: Enhancing large language models with knowledge graphs for fact-aware language modeling , author=. arXiv preprint arXiv:2306.11489 , year=

work page arXiv
[31]

Think-on-graph 2.0: Deep and interpretable large language model reasoning with knowledge graph-guided retrieval , author=
[32]

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

Toolllm: Facilitating large language models to master 16000+ real-world apis , author=. arXiv preprint arXiv:2307.16789 , year=

work page internal anchor Pith review Pith/arXiv arXiv
[33]

arXiv preprint arXiv:2302.11509 , year=

Construction of knowledge graphs: State and challenges , author=. arXiv preprint arXiv:2302.11509 , year=

work page arXiv
[34]

arXiv preprint arXiv:2306.04136 , year=

Knowledge-augmented language model prompting for zero-shot knowledge graph question answering , author=. arXiv preprint arXiv:2306.04136 , year=

work page arXiv
[35]

arXiv preprint arXiv:2310.08975 , year=

Chatkbqa: A generate-then-retrieve framework for knowledge base question answering with fine-tuned large language models , author=. arXiv preprint arXiv:2310.08975 , year=

work page arXiv
[36]

arXiv preprint arXiv:2311.09278 , year=

Symbol-LLM: Towards foundational symbol-centric interface for large language models , author=. arXiv preprint arXiv:2311.09278 , year=

work page arXiv
[37]

arXiv preprint arXiv:2305.13269 , year=

Chain-of-knowledge: Grounding large language models via dynamic knowledge adapting over heterogeneous sources , author=. arXiv preprint arXiv:2305.13269 , year=

work page arXiv
[38]

arXiv preprint arXiv:2308.10173 , year=

Foodgpt: A large language model in food testing domain with incremental pre-training and knowledge graph prompt , author=. arXiv preprint arXiv:2308.10173 , year=

work page arXiv
[39]

Mindmap: Knowledge graph prompting sparks graph of thoughts in large language models,

Mindmap: Knowledge graph prompting sparks graph of thoughts in large language models , author=. arXiv preprint arXiv:2308.09729 , year=

work page arXiv
[40]

InProceedings of the Twenty- Seventh International Joint Conference on Artificial Intelligence, pages 4396–4402

Two Heads Are Better Than One: Integrating Knowledge from Knowledge Graphs and Large Language Models for Entity Alignment , author=. arXiv preprint arXiv:2401.16960 , year=

work page arXiv
[41]

Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=

Generated Knowledge Prompting for Commonsense Reasoning , author=. Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) , pages=
[42]

arXiv preprint arXiv:2306.06427 , year=

Boosting language models reasoning with chain-of-knowledge prompting , author=. arXiv preprint arXiv:2306.06427 , year=

work page arXiv
[43]

An enhanced prompt- based llm reasoning scheme via knowledge graph-integrated collaboration

An enhanced prompt-based LLM reasoning scheme via knowledge graph-integrated collaboration , author=. arXiv preprint arXiv:2402.04978 , year=

work page arXiv
[44]

arXiv preprint arXiv:2402.14293 , year=

Leveraging Large Language Models for Concept Graph Recovery and Question Answering in NLP Education , author=. arXiv preprint arXiv:2402.14293 , year=

work page arXiv
[45]

Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval , pages=

Answering complex questions by joining multi-document evidence with quasi knowledge graphs , author=. Proceedings of the 42nd international ACM SIGIR conference on research and development in information retrieval , pages=
[46]

arXiv preprint arXiv:2402.13415 , year=

Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text , author=. arXiv preprint arXiv:2402.13415 , year=

work page arXiv
[47]

arXiv preprint arXiv:2406.18027 , year=

Automated Clinical Data Extraction with Knowledge Conditioned LLMs , author=. arXiv preprint arXiv:2406.18027 , year=

work page arXiv
[48]

2023 IEEE International Conference on Big Data (BigData) , pages=

AutoKG: Efficient automated knowledge graph generation for language models , author=. 2023 IEEE International Conference on Big Data (BigData) , pages=. 2023 , organization=

2023
[49]

ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation

ArchRAG: Attributed Community-based Hierarchical Retrieval-Augmented Generation , author=. arXiv preprint arXiv:2502.09891 , year=

work page internal anchor Pith review arXiv
[50]

arXiv preprint arXiv:2311.06414 , year=

Knowledge Graphs are not Created Equal: Exploring the Properties and Structure of Real KGs , author=. arXiv preprint arXiv:2311.06414 , year=

work page arXiv
[51]

arXiv preprint arXiv:2311.04524 , year=

Validating ChatGPT Facts through RDF Knowledge Graphs and Sentence Similarity , author=. arXiv preprint arXiv:2311.04524 , year=

work page arXiv
[52]

Material Property Prediction with Joint Reasoning based on Large Language Models and Knowledge Graphs for Lithium Batteries , author=
[53]

" Semantic Rules from Text: Automating Common-Sense Knowledge Curation with LLM, s , author=
[54]

AIREG: Enhanced Educational Recommender System with Large Language Models and Knowledge Graphs , author=
[55]

AAAI Conference on Artificial Intelligence , volume=

Mitigating large language model hallucinations via autonomous knowledge graph-based retrofitting , author=. AAAI Conference on Artificial Intelligence , volume=
[56]

arXiv preprint arXiv:2308.06374 , year=

Large language models and knowledge graphs: Opportunities and challenges , author=. arXiv preprint arXiv:2308.06374 , year=

work page arXiv
[57]

arXiv preprint arXiv:2308.10168 , year=

Head-to-tail: How knowledgeable are large language models (llm)? AKA will llms replace knowledge graphs? , author=. arXiv preprint arXiv:2308.10168 , year=

work page arXiv
[58]

arXiv preprint arXiv:2308.14321 , year=

Leveraging a medical knowledge graph into large language models for diagnosis prediction , author=. arXiv preprint arXiv:2308.14321 , year=

work page arXiv
[59]

arXiv preprint arXiv:2309.01538 , year=

Chatrule: Mining logical rules with large language models for knowledge graph reasoning , author=. arXiv preprint arXiv:2309.01538 , year=

work page arXiv
[60]

Enhancing knowl- edge graph construction using large language models,

Enhancing knowledge graph construction using large language models , author=. arXiv preprint arXiv:2305.04676 , year=

work page arXiv
[61]

ACM Transactions on Asian and Low-Resource Language Information Processing , volume=

Codekgc: Code language model for generative knowledge graph construction , author=. ACM Transactions on Asian and Low-Resource Language Information Processing , volume=. 2024 , publisher=

2024
[62]

Proceedings of the 1st Workshop on Natural Language Reasoning and Structured Explanations (NLRSE) , pages=

Knowledge graph-augmented language models for complex question answering , author=. Proceedings of the 1st Workshop on Natural Language Reasoning and Structured Explanations (NLRSE) , pages=
[63]

arXiv preprint arXiv:2408.03910 , year=

Codexgraph: Bridging large language models and code repositories via code graph databases , author=. arXiv preprint arXiv:2408.03910 , year=

work page arXiv
[64]

arXiv preprint arXiv:2311.07914 , year=

Can knowledge graphs reduce hallucinations in llms?: A survey , author=. arXiv preprint arXiv:2311.07914 , year=

work page arXiv
[65]

Proceedings of the AAAI Conference on Artificial Intelligence , volume=

CyberQ: Generating Questions and Answers for Cybersecurity Education Using Knowledge Graph-Augmented LLMs , author=. Proceedings of the AAAI Conference on Artificial Intelligence , volume=
[66]

T -RAG: Lessons from the LLM trenches,

T-RAG: lessons from the LLM trenches , author=. arXiv preprint arXiv:2402.07483 , year=

work page arXiv
[67]

Applied Sciences , volume=

Evaluating Complex Entity Knowledge Propagation for Knowledge Editing in LLMs , author=. Applied Sciences , volume=. 2024 , publisher=

2024
[68]

arXiv preprint arXiv:2406.06621 , year=

LinkQ: An LLM-Assisted Visual Interface for Knowledge Graph Question-Answering , author=. arXiv preprint arXiv:2406.06621 , year=

work page arXiv
[69]

arXiv preprint arXiv:2309.11206 , year=

Retrieve-rewrite-answer: A kg-to-text enhanced llms framework for knowledge graph question answering , author=. arXiv preprint arXiv:2309.11206 , year=

work page arXiv
[70]

Findings of the Association for Computational Linguistics: NAACL 2024 , pages=

Bring your own kg: Self-supervised program synthesis for zero-shot kgqa , author=. Findings of the Association for Computational Linguistics: NAACL 2024 , pages=

2024
[71]

arXiv preprint arXiv:2402.16567 , year=

Aligning Large Language Models to a Domain-specific Graph Database , author=. arXiv preprint arXiv:2402.16567 , year=

work page arXiv
[72]

Computers in Industry , volume=

Construction contract risk identification based on knowledge-augmented language models , author=. Computers in Industry , volume=. 2024 , publisher=

2024
[73]

Thread: A logic-based data organization paradigm for how-to question answering with retrieval augmented generation.arXiv preprint arXiv:2406.13372, 2024

Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation , author=. arXiv preprint arXiv:2406.13372 , year=

work page arXiv
[74]

arXiv preprint arXiv:2404.19234 , year=

Multi-hop Question Answering over Knowledge Graphs using Large Language Models , author=. arXiv preprint arXiv:2404.19234 , year=

work page arXiv
[75]

Intelligent Computing Technology and Automation , pages=

A Methodology for Generating and Optimizing Chain-of-Thought Based on Knowledge Graphs , author=. Intelligent Computing Technology and Automation , pages=. 2024 , publisher=

2024
[76]

arXiv preprint arXiv:2312.15883 , year=

Think and retrieval: A hypothesis knowledge graph enhanced medical large language models , author=. arXiv preprint arXiv:2312.15883 , year=

work page arXiv
[77]

Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems , pages=

Personalized quest and dialogue generation in role-playing games: A knowledge graph-and language model-based approach , author=. Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems , pages=

2023
[78]

arXiv preprint arXiv:2305.01157 , year=

Complex logical reasoning over knowledge graphs using large language models , author=. arXiv preprint arXiv:2305.01157 , year=

work page arXiv
[79]

European Semantic Web Conference , pages=

Knowledge injection to counter large language model (LLM) hallucination , author=. European Semantic Web Conference , pages=. 2023 , organization=

2023
[80]

International Semantic Web Conference , pages=

Text2kgbench: A benchmark for ontology-driven knowledge graph generation from text , author=. International Semantic Web Conference , pages=. 2023 , organization=

2023

Showing first 80 references.