In-Place Feedback: Reliable Refinement for Multi-Turn Expert-LLM Collaboration

Chaehyeon Chung; Dongwoo Kim; Minjong Lee; Moonjeong Park; Saemi Moon; Seunghyuk Cho; Youngbin Choi

arxiv: 2510.00777 · v2 · pith:YC7B7DIFnew · submitted 2025-10-01 · 💻 cs.LG

In-Place Feedback: Reliable Refinement for Multi-Turn Expert-LLM Collaboration

Youngbin Choi , Minjong Lee , Saemi Moon , Seunghyuk Cho , Chaehyeon Chung , MoonJeong Park , Dongwoo Kim This is my paper

classification 💻 cs.LG

keywords feedbackin-placemulti-turncollaborationdirectlyerrorsexpert-llmllm-generated

0 comments

read the original abstract

LLM-generated drafts often contain subtle factual or logical errors, yet prior work shows that models struggle to reliably integrate multi-turn feedback aimed at fixing them. We propose in-place feedback, an interaction paradigm in which the user directly edits the model's previous response and the model continues generation from the edited context. In-place feedback consistently outperforms standard multi-turn feedback across five reasoning-intensive benchmarks while requiring fewer tokens, and our fine-grained analysis shows that it applies corrections more reliably and propagates them to subsequent reasoning. A user study with domain experts refining LLM-generated summaries corroborates these findings: participants report higher final-output satisfaction and substantially lower fatigue with in-place feedback, and a mixed strategy combining in-place and multi-turn feedback scores highest on every measured dimension. These results suggest that editing errors directly is a more effective paradigm for expert-LLM collaboration.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Rewarding Beliefs, Not Actions: Consistency-Guided Credit Assignment for Long-Horizon Agents
cs.CL 2026-05 unverdicted novelty 6.0

ReBel uses belief-consistency supervision and belief-aware grouping to improve credit assignment in long-horizon RL for LLM agents, achieving up to 20.4 percentage points higher success and 2.1x better sample efficien...