When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Chiyu Wu; Haoming Xu; Jin Shang; Mengru Wang; Shumin Deng; Weihong Xu; Yu Gong; Yunzhi Yao; Zongrui Li

arxiv: 2605.30219 · v1 · pith:UHE23H3Knew · submitted 2026-05-28 · 💻 cs.AI · cs.CL· cs.LG

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

Haoming Xu , Weihong Xu , Zongrui Li , Mengru Wang , Yunzhi Yao , Chiyu Wu , Jin Shang , Yu Gong

show 1 more author

Shumin Deng

This is my paper

classification 💻 cs.AI cs.CLcs.LG

keywords beliefmodelsfailedfailuresstatewhenacrossbelief-state

0 comments

read the original abstract

Long-horizon interactions require language models to manage accumulating information: when to update their state, when to preserve their state, and what to ignore. We study this challenge as \textbf{Contextual Belief Management (CBM)}: maintaining a predicted belief state aligned with formal evidence while isolating task-irrelevant noise. To make CBM measurable, we introduce BeliefTrack, a closed-world benchmark spanning Rule Discovery and Circuit Diagnosis, where a finite belief space and symbolic verifiers enable exact turn-level evaluation. BeliefTrack diagnoses three failures: Failed Stay, Failed Update, and Failed Isolation. Across multiple LLMs, vanilla models exhibit severe CBM failures, while explicit belief-tracking prompts provide limited gains. In contrast, reinforcement learning with belief-state rewards reduces failure rates by 70.9\% on average. Further probing reveals latent belief-state dynamics behind these failures, and representation-level steering reduces failure rates by 46.1\% across two tasks\footnote{Code is coming soon at https://github.com/zjunlp/CBM.

This paper has not been read by Pith yet.

When Should Models Change Their Minds? Contextual Belief Management in Large Language Models

discussion (0)