BELLE: A bi-level multi-agent reasoning framework for multi-hop question answering

Taolin Zhang, Dongyang Li, Qizhou Chen, Chengyu Wang, Xiaofeng He · 2025 · DOI 10.18653/v1/2025.acl-long.211

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

ECHO: Learning Epistemically Adaptive Language Agents with Turn-Level Credit

cs.MA · 2026-06-29 · unverdicted · novelty 7.0

ECHO is a clipped policy-gradient method that uses posterior-sensitive rewards to give turn-level epistemic credit in multi-turn information-seeking tasks, outperforming trajectory-level GRPO on a new Clue Selector Game benchmark.

Improving General Role-Playing Agents via Psychology-Grounded Reasoning and Role-Aware Policy Optimization

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

Psy-CoT decomposes reasoning into Interaction Perception, Psychological Empathy, and Logical Construction while RAPO asymmetrically weights role-specific tokens during policy optimization, outperforming prior CoT and GRPO baselines on role-playing benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

ECHO: Learning Epistemically Adaptive Language Agents with Turn-Level Credit cs.MA · 2026-06-29 · unverdicted · none · ref 76
ECHO is a clipped policy-gradient method that uses posterior-sensitive rewards to give turn-level epistemic credit in multi-turn information-seeking tasks, outperforming trajectory-level GRPO on a new Clue Selector Game benchmark.
Improving General Role-Playing Agents via Psychology-Grounded Reasoning and Role-Aware Policy Optimization cs.CL · 2026-06-25 · unverdicted · none · ref 136
Psy-CoT decomposes reasoning into Interaction Perception, Psychological Empathy, and Logical Construction while RAPO asymmetrically weights role-specific tokens during policy optimization, outperforming prior CoT and GRPO baselines on role-playing benchmarks.

BELLE: A bi-level multi-agent reasoning framework for multi-hop question answering

fields

years

verdicts

representative citing papers

citing papers explorer