Dual static CVaR decompositions suffer from a CVaR evaluation gap caused by empty intersections in risk-assignment consistency constraints, and no single policy can be optimal across all risk levels in some MDPs.
Guidelines for reinforcement learning in healthcare
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.LG 2roles
background 1polarities
background 1representative citing papers
A new RL framework for chronic disease management compresses time-to-control using clinician capability weighting and action intensity constraints, yielding 15 percentage point gains on synthetic type 2 diabetes simulations over standard offline RL.
citing papers explorer
-
On the Fundamental Limitations of Dual Static CVaR Decompositions in Markov Decision Processes
Dual static CVaR decompositions suffer from a CVaR evaluation gap caused by empty intersections in risk-assignment consistency constraints, and no single policy can be optimal across all risk levels in some MDPs.
-
Learning to Compress Time-to-Control: A Reinforcement Learning Framework for Chronic Disease Management
A new RL framework for chronic disease management compresses time-to-control using clinician capability weighting and action intensity constraints, yielding 15 percentage point gains on synthetic type 2 diabetes simulations over standard offline RL.