LUME-DBN: Full Bayesian Learning of DBNs from Incomplete data in Intensive Care
Pith reviewed 2026-05-18 01:02 UTC · model grok-4.3
The pith
A Gibbs sampling procedure learns dynamic Bayesian networks from incomplete intensive care data by sampling missing values as Gaussian parameters.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A Gibbs sampler jointly infers DBN parameters and missing data by modeling each unobserved entry as a Gaussian random variable whose full conditional is directly available, thereby performing principled imputation and uncertainty quantification inside the same iterative procedure used for network learning.
What carries the argument
Gibbs sampling over the joint posterior of DBN parameters and missing values, with each missing observation treated as an unknown Gaussian parameter that is resampled from its full conditional at every iteration.
Load-bearing premise
Missing values can be treated as unknown parameters that follow a Gaussian distribution whose full conditional can be sampled directly within the Gibbs procedure for DBNs.
What would settle it
Mask known values in a complete longitudinal dataset generated from a known DBN, run the sampler, and check whether the recovered posterior means and credible intervals for the masked entries are well calibrated against the true hidden values.
Figures
read the original abstract
Dynamic Bayesian networks (DBNs) are increasingly used in healthcare due to their ability to model complex temporal relationships in patient data while maintaining interpretability, an essential feature for clinical decision-making. However, existing approaches to handling missing data in longitudinal clinical datasets are largely derived from static Bayesian networks literature, failing to properly account for the temporal nature of the data. This gap limits the ability to quantify uncertainty over time, which is particularly critical in settings such as intensive care, where understanding the temporal dynamics is fundamental for model trustworthiness and applicability across diverse patient groups. Despite the potential of DBNs, a full Bayesian framework that integrates missing data handling remains underdeveloped. In this work, we propose a novel Gibbs sampling-based method for learning DBNs from incomplete data. Our method treats each missing value as an unknown parameter following a Gaussian distribution. At each iteration, the unobserved values are sampled from their full conditional distributions, allowing for principled imputation and uncertainty estimation. We evaluate our method on both simulated datasets and real-world intensive care data from critically ill patients. Compared to standard model-agnostic techniques such as MICE, our Bayesian approach demonstrates superior reconstruction accuracy and convergence properties. These results highlight the clinical relevance of incorporating full Bayesian inference in temporal models, providing more reliable imputations and offering deeper insight into model behavior. Our approach supports safer and more informed clinical decision-making, particularly in settings where missing data are frequent and potentially impactful.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes LUME-DBN, a Gibbs sampling procedure for full Bayesian learning of Dynamic Bayesian Networks from incomplete longitudinal data. Missing entries are treated as unknown parameters drawn from Gaussian distributions and sampled from their full conditionals inside the DBN Gibbs sweep; the method is evaluated on simulated data and real ICU records, with the central claim that it yields superior reconstruction accuracy and convergence relative to model-agnostic imputation such as MICE.
Significance. A sound implementation would advance principled uncertainty quantification for temporal clinical models by folding imputation directly into DBN parameter learning. The approach correctly emphasizes the temporal structure that static BN methods ignore, and the provision of a joint posterior over structure, parameters, and imputations is a genuine technical contribution if the distributional assumptions can be justified.
major comments (2)
- [Abstract] Abstract: the assertion that the Bayesian approach 'demonstrates superior reconstruction accuracy and convergence properties' is unsupported by any numerical results, dataset sizes, missingness fractions, statistical tests, or ablation details, rendering the central empirical claim unverifiable from the manuscript text.
- [Method] Method section (Gibbs sampling for missing values): the full conditional for each missing entry is derived under an unconditional Gaussian model. ICU variables routinely include binary flags, integer counts, bounded lab values, and censored observations; sampling from a Gaussian produces probability mass outside valid support, so the imputed draws are not from the true posterior and the comparison with MICE (which admits per-variable conditional models) is not on equal footing.
minor comments (2)
- [Abstract] The abstract states that the method 'supports safer and more informed clinical decision-making' without any downstream clinical-task evaluation (e.g., prediction of outcomes or treatment effects) that would substantiate this claim.
- [Method] Notation for the DBN transition and emission distributions is not introduced before the Gibbs sweep is described, making it difficult to follow how the full conditionals are obtained.
Simulated Author's Rebuttal
We thank the referee for their constructive and insightful comments on our manuscript. We have addressed each major comment point by point below, making revisions where appropriate to improve clarity, rigor, and transparency.
read point-by-point responses
-
Referee: [Abstract] Abstract: the assertion that the Bayesian approach 'demonstrates superior reconstruction accuracy and convergence properties' is unsupported by any numerical results, dataset sizes, missingness fractions, statistical tests, or ablation details, rendering the central empirical claim unverifiable from the manuscript text.
Authors: We agree that the abstract would be strengthened by including concrete details to support the empirical claims. In the revised manuscript, we have updated the abstract to reference specific quantitative results from the experiments, including reconstruction accuracy metrics, dataset sizes (number of patients and time steps), missingness fractions, and the statistical comparisons performed against MICE. These additions make the central claims directly verifiable while preserving the abstract's conciseness. revision: yes
-
Referee: [Method] Method section (Gibbs sampling for missing values): the full conditional for each missing entry is derived under an unconditional Gaussian model. ICU variables routinely include binary flags, integer counts, bounded lab values, and censored observations; sampling from a Gaussian produces probability mass outside valid support, so the imputed draws are not from the true posterior and the comparison with MICE (which admits per-variable conditional models) is not on equal footing.
Authors: We appreciate this observation on the distributional assumptions. Our derivation of the full conditionals for missing values assumes Gaussian distributions, which is appropriate for the continuous physiological variables (e.g., heart rate, blood pressure) that form the primary focus of our ICU datasets and evaluations. We acknowledge that this assumption does not enforce valid support for binary, count-based, or censored variables and that MICE can employ variable-specific models. In the revision, we have added explicit discussion of this modeling choice and its limitations in the Methods and Limitations sections, clarified that comparisons were restricted to continuous variables for fairness, and outlined extensions to mixed-type models as future work. revision: partial
Circularity Check
No significant circularity detected
full rationale
The paper describes a Gibbs sampling procedure that treats missing values as Gaussian parameters whose full conditionals are sampled inside the DBN learning sweep. No equations, derivations, or self-citations are shown that reduce the claimed reconstruction accuracy or convergence properties to a quantity fitted from the same data by construction. The method is presented as a direct application of standard conditional sampling, and performance claims rest on empirical comparisons to MICE on simulated and real ICU data rather than tautological redefinitions or imported uniqueness results. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Missing values follow a Gaussian distribution and can be sampled from their full conditional given the current DBN parameters.
Reference graph
Works this paper leans on
-
[1]
DynamicBayesianNetworkstopredictsequencesoforgan failuresinpatientsadmittedtoICU
M.Sandrietal.“DynamicBayesianNetworkstopredictsequencesoforgan failuresinpatientsadmittedtoICU”.In:Journal of Biomedical Informatics 48 (2014), pp. 106–113
work page 2014
-
[2]
Q. Chen et al. “Dynamic Bayesian network for predicting physiological changes, organ dysfunctions and mortality risk in critical trauma patients”. In:BMC Medical Informatics and Decision Making22.1 (2022), p. 119. LUME-DBN: full Bayesian DBN learning from incomplete data 13
work page 2022
-
[3]
Hard and Soft EM in Bayesian Network Learning from Incomplete Data
A. Ruggieri et al. “Hard and Soft EM in Bayesian Network Learning from Incomplete Data”. In:Algorithms13.12 (2020), p. 329
work page 2020
-
[4]
Being Bayesian about learning Gaussian Bayesian net- works from incomplete data
M. Grzegorczyk. “Being Bayesian about learning Gaussian Bayesian net- works from incomplete data”. In:International Journal of Approximate Reasoning147 (2023), pp. 1–22
work page 2023
-
[5]
Learning dynamic Bayesian networks
Z. Ghahramani. “Learning dynamic Bayesian networks”. In:Lecture Notes in Computer Science1387 (1998), pp. 168–197
work page 1998
-
[6]
Modelling Non-homogeneous Dynamic Bayesian Networks with Piecewise Linear Regression Models
M. Grzegorczyk and D. Husmeier. “Modelling Non-homogeneous Dynamic Bayesian Networks with Piecewise Linear Regression Models”. In:Hand- book of Statistical Genomics. Ed. by D. J. Balding, I. Moltke, and J. Mar- ioni. Vol. 2. John Wiley & Sons, 2019, pp. 899–931
work page 2019
-
[7]
C. M. Bishop.Pattern Recognition and Machine Learning. Springer, 2006
work page 2006
-
[8]
Inference from iterative simulation using multiple sequences
A. Gelman and D. B. Rubin. “Inference from iterative simulation using multiple sequences”. In:Statistical Science7.4 (1992), pp. 457–472
work page 1992
-
[9]
T. Saito and M. Rehmsmeier. “The Precision–Recall Plot Is More Informa- tive than the ROC Plot When Evaluating Binary Classifiers on Imbalanced Datasets”. In:PLOS ONE10.3 (2015), e0118432
work page 2015
-
[10]
A. L. Goldberger et al.PhysioNet 2012 Challenge Dataset. 2012
work page 2012
-
[11]
The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure
V. Jean-Louis et al. “The SOFA (Sepsis-related Organ Failure Assessment) score to describe organ dysfunction/failure”. In:Intensive Care Medicine 22.7 (1996), pp. 707–710
work page 1996
-
[12]
D. B. Rubin. “Inference and Missing Data”. In:Biometrika63.3 (1976), pp. 581–592
work page 1976
-
[13]
Temporal Clustering of Critical Illness Events on Medical Wards
Y. Zhao et al. “Temporal Clustering of Critical Illness Events on Medical Wards”. In:JAMA Internal Medicine179.9 (2019), pp. 1237–1244
work page 2019
-
[14]
Effective Hemodynamic Monitoring
M. R. Pinsky et al. “Effective Hemodynamic Monitoring”. In:Critical Care 26 (2022)
work page 2022
-
[15]
J.Kreit.“Respiratory–CardiovascularInteractionsDuringMechanicalVen- tilation: Physiology and Clinical Implications”. In:Comprehensive Physi- ology12.3 (2022), pp. 3425–3448
work page 2022
-
[16]
Learning Bayesian Networks with the bnlearn R Package
M. Scutari. “Learning Bayesian Networks with the bnlearn R Package”. In: Statistical Software35 (Oct. 2010)
work page 2010
-
[17]
Greedy Structure Learning from Data that Contain Systematic Missing Values
Y. Liu and A. C. Constantinou. “Greedy Structure Learning from Data that Contain Systematic Missing Values”. In:Machine Learning111.10 (2022), pp. 3867–3896
work page 2022
-
[18]
Causal discovery for linear mixed data
Y. Zeng et al. “Causal discovery for linear mixed data”. In:Proceedings of the First Conference on Causal Learning and Reasoning (CLeaR 2022). Vol. 177. Proceedings of Machine Learning Research. 2022, pp. 994–1009
work page 2022
-
[19]
K. Shafiee et al. “Partially non-homogeneous dynamic Bayesian networks based on Bayesian regression models with partitioned design matrices”. In: Bioinformatics35.12 (2019), pp. 2108–2117
work page 2019
-
[20]
Constructing a Gene Regulatory Network Based on a Nonhomogeneous Dynamic Bayesian Network
J. Zhang, C. Hu, and Q. Zhang. “Constructing a Gene Regulatory Network Based on a Nonhomogeneous Dynamic Bayesian Network”. In:Electronics 11.18 (2022), p. 2936. 14 F. Pirola, F. Stella and M. Grzegorczyk Appendix A Full Conditional Distributions of Missing Data We consider three configurations where exactly one value is missing at time t= 2:x 2 1[M IS],x...
work page 2022
-
[21]
sharing a common child. Deriving the FCD forX t 3 in this setting illustrates the procedure in case of more complex Markov blankets, in the direction of demonstrating the tractability of the FCD in the broader case. P(x t 3[M IS] | ·)∝P(X t 3 |X t−1 1 , Xt−1 2 )·P(X t+1 4 |X t 3)·P(X t+1 5 |X t 3, Xt 4) ∝ N(x t 3[M IS]|µ3 =β (3) 0 +β (3) 1 xt−1 1 +β (3) 2...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.