Learning from Disagreement: Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care

2026 · cs.LG · arXiv 2604.28010

1 Pith paper cites this work. Polarity classification is still indexing.

abstract

We reframe clinician overrides of clinical AI recommendations as implicit preference data: the same signal structure exploited by reinforcement learning from human feedback (RLHF), but richer, because the annotator is a domain expert, the alternatives carry real consequences, and downstream outcomes are observable. We present a formal framework extending standard preference learning with three contributions: a five-category override taxonomy mapping override types to distinct model update targets; a preference formulation conditioned on patient state s, organizational context c, and clinician capability kappa, where kappa decomposes into execution capability kappa-exec and alignment capability kappa-align; and a dual learning architecture that jointly trains a reward model and a capability model via alternating optimization, preventing a failure mode we term suppression bias, the systematic suppression of correct-but-difficult recommendations when clinician capability falls below the execution threshold. We argue that chronic disease management under outcome-based payment contracts produces override data with uniquely favorable properties (longitudinal density, a concentrated decision space, outcome labels, and natural capability variation), and that training environments combining longitudinal outcome measurement with aligned financial incentives are a necessary condition for learning a reward model aligned with patient trajectory rather than with encounter economics. This framework emerged from operational work to improve clinician capability in a live value-based care deployment.
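
The suppression-bias failure mode described in the abstract can be illustrated with a toy sketch. This is not the paper's dual learning architecture: it assumes a Bradley-Terry preference model, a single scalar reward gap, and a hard execution threshold on kappa-exec, none of which the abstract specifies. The names `fit_reward_gap`, `EXEC_THRESHOLD`, and the event data are hypothetical and invented for illustration only.

```python
import math


def sigmoid(x: float) -> float:
    return 1.0 / (1.0 + math.exp(-x))


def fit_reward_gap(events, weight_fn, steps=500, lr=0.1):
    """Fit d = r(recommended) - r(alternative) by gradient ascent on a
    weighted Bradley-Terry log-likelihood.

    events: list of (overridden, kappa_exec) pairs.
    weight_fn: hypothetical map from kappa_exec to a non-negative weight.
    """
    d = 0.0
    for _ in range(steps):
        grad = 0.0
        for overridden, kappa_exec in events:
            # y = 1 when the clinician accepted the recommendation,
            # y = 0 when the clinician overrode it.
            y = 0.0 if overridden else 1.0
            grad += weight_fn(kappa_exec) * (y - sigmoid(d))
        d += lr * grad / len(events)
    return d


# Invented toy data: three overrides by clinicians with low execution
# capability, one acceptance by a clinician with high execution capability.
events = [(True, 0.2), (True, 0.2), (True, 0.2), (False, 0.9)]
EXEC_THRESHOLD = 0.5  # assumed value, not taken from the paper

# Naive preference learning treats every override as a vote against the
# recommendation, so the fitted gap turns negative.
naive_gap = fit_reward_gap(events, lambda k: 1.0)

# A capability-aware variant (assumed hard gating) ignores events from
# clinicians below the execution threshold, so the gap stays positive.
aware_gap = fit_reward_gap(
    events, lambda k: 1.0 if k >= EXEC_THRESHOLD else 0.0
)

print(f"naive gap: {naive_gap:.2f}")
print(f"capability-aware gap: {aware_gap:.2f}")
```

In this sketch the naive fit scores the recommendation below the alternative purely because most observed clinicians could not execute it, which is the pattern the abstract calls suppression bias. The hard gate stands in for the learned capability model; a jointly trained capability estimate would replace the fixed `kappa_exec` values and threshold.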

representative citing papers

Learning to Compress Time-to-Control: A Reinforcement Learning Framework for Chronic Disease Management

cs.LG · 2026-05-10 · unverdicted · novelty 5.0

A new RL framework for chronic disease management compresses time-to-control using clinician capability weighting and action intensity constraints, yielding gains of 15 percentage points over standard offline RL on synthetic type 2 diabetes simulations.

citing papers explorer

Showing 1 of 1 citing paper.

Learning to Compress Time-to-Control: A Reinforcement Learning Framework for Chronic Disease Management

cs.LG · 2026-05-10 · unverdicted · none · ref 22 · internal anchor
