Preprint, arXiv:2603.00511

Multimodal adaptive retrieval augmented generation through internal representation learning · 2026 · arXiv 2603.00511

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

GAPD: Gold-Action Policy Distillation for Agentic Reinforcement Learning in Knowledge Base Question Answering

cs.CL · 2026-05-28 · unverdicted · novelty 6.0

GAPD adds dense token-level guidance from gold actions to outcome-based RL for KBQA via mid-anchor matching and outperforms SOTA on WebQSP, GrailQA, and GraphQ.

citing papers explorer

Showing 1 of 1 citing paper after filters.

GAPD: Gold-Action Policy Distillation for Agentic Reinforcement Learning in Knowledge Base Question Answering · cs.CL · 2026-05-28 · unverdicted · none · ref 1
GAPD adds dense token-level guidance from gold actions to outcome-based RL for KBQA via mid-anchor matching and outperforms SOTA on WebQSP, GrailQA, and GraphQ.
