A Reinforcement Learning Approach to Weaning of Mechanical Ventilation in Intensive Care Units

A reinforcement learning approach to weaning of mechanical ventilation in intensive care units , author= · 2017 · cs.AI · arXiv 1704.06300

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open full Pith review browse 5 citing papers arXiv PDF

abstract

The management of invasive mechanical ventilation, and the regulation of sedation and analgesia during ventilation, constitutes a major part of the care of patients admitted to intensive care units. Both prolonged dependence on mechanical ventilation and premature extubation are associated with increased risk of complications and higher hospital costs, but clinical opinion on the best protocol for weaning patients off of a ventilator varies. This work aims to develop a decision support tool that uses available patient information to predict time-to-extubation readiness and to recommend a personalized regime of sedation dosage and ventilator support. To this end, we use off-policy reinforcement learning algorithms to determine the best action at a given patient state from sub-optimal historical ICU data. We compare treatment policies from fitted Q-iteration with extremely randomized trees and with feedforward neural networks, and demonstrate that the policies learnt show promise in recommending weaning protocols with improved outcomes, in terms of minimizing rates of reintubation and regulating physiological stability.

representative citing papers

An adaptive variance estimator for relative sparsity

stat.ME · 2026-05-04 · unverdicted · novelty 6.0

A new adaptive variance estimator for relative sparsity coefficients is introduced that fully utilizes the prior asymptotic normality theorem and incorporates variable selection effects.

VentAgent: When LLMs Learn to Breathe -- Multi-Objective Arbitration for ARDS Ventilation

cs.LG · 2026-06-03 · unverdicted · novelty 5.0

VentAgent uses LLMs in a three-stage Perception-Planning-Orchestration hierarchy to perform multi-objective arbitration for mechanical ventilation in ARDS, outperforming RL baselines on a simulator while producing human-readable reasoning.

On Safer Reinforcement Learning for Sedation and Analgesia in Intensive Care

cs.LG · 2026-01-30 · unverdicted · novelty 5.0

Offline RL for ICU sedation shows that adding 30-day mortality to the objective yields policies whose clinician agreement correlates negatively with mortality, unlike pain-only versions.

Deep Reinforcement Learning for Clinical Decision Support: A Brief Survey

cs.LG · 2019-07-22 · unverdicted · novelty 2.0

This survey compiles deep reinforcement learning algorithms for clinical decision support, reviews case studies, and offers guidance on algorithm selection for medical applications.

Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems

cs.LG · 2020-05-04 · unverdicted · novelty 2.0

Offline RL promises to extract high-utility policies from static datasets but faces fundamental challenges that current methods only partially address.

citing papers explorer

Showing 5 of 5 citing papers after filters.

An adaptive variance estimator for relative sparsity stat.ME · 2026-05-04 · unverdicted · none · ref 209
A new adaptive variance estimator for relative sparsity coefficients is introduced that fully utilizes the prior asymptotic normality theorem and incorporates variable selection effects.
VentAgent: When LLMs Learn to Breathe -- Multi-Objective Arbitration for ARDS Ventilation cs.LG · 2026-06-03 · unverdicted · none · ref 31 · internal anchor
VentAgent uses LLMs in a three-stage Perception-Planning-Orchestration hierarchy to perform multi-objective arbitration for mechanical ventilation in ARDS, outperforming RL baselines on a simulator while producing human-readable reasoning.
On Safer Reinforcement Learning for Sedation and Analgesia in Intensive Care cs.LG · 2026-01-30 · unverdicted · none · ref 18 · internal anchor
Offline RL for ICU sedation shows that adding 30-day mortality to the objective yields policies whose clinician agreement correlates negatively with mortality, unlike pain-only versions.
Deep Reinforcement Learning for Clinical Decision Support: A Brief Survey cs.LG · 2019-07-22 · unverdicted · none · ref 11 · internal anchor
This survey compiles deep reinforcement learning algorithms for clinical decision support, reviews case studies, and offers guidance on algorithm selection for medical applications.
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems cs.LG · 2020-05-04 · unverdicted · none · ref 171
Offline RL promises to extract high-utility policies from static datasets but faces fundamental challenges that current methods only partially address.

A Reinforcement Learning Approach to Weaning of Mechanical Ventilation in Intensive Care Units

fields

years

verdicts

representative citing papers

citing papers explorer