Identification In Missing Data Models Represented By Directed Acyclic Graphs
Pith reviewed 2026-05-25 12:26 UTC · model grok-4.3
The pith
Missing data models on directed acyclic graphs contain identifiable target distributions that existing algorithms miss.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The most general identification strategies proposed so far retain a significant gap in that they fail to identify a wide class of identifiable distributions; a new algorithm that significantly generalizes the types of manipulations used in the ID algorithm recovers these distributions whenever they are identifiable under the missing data DAG.
What carries the argument
A generalized manipulation algorithm that extends the ID algorithm's operations to missing data mechanisms represented by a factorization with respect to a directed acyclic graph.
If this is right
- More target distributions become recoverable without requiring parametric restrictions on the missingness mechanism.
- Inference procedures can be applied to a larger collection of missing data problems represented by DAGs.
- The gap between what is identifiable and what prior algorithms could identify is narrowed.
- Identification results carry over directly to estimation once the functional is obtained.
Where Pith is reading between the lines
- The approach may suggest similar generalizations for identification in other graphical missing data settings beyond standard DAGs.
- Practical implementations could be tested by constructing synthetic examples where identifiability holds but earlier methods fail.
- Connections to causal effect identification under missingness could allow joint handling of both problems in the same graph.
Load-bearing premise
The missing data mechanism is correctly represented by a factorization with respect to the given directed acyclic graph.
What would settle it
A concrete missing data DAG together with an explicit target distribution that is identifiable from the observed law but is not recovered by the new algorithm, or a distribution the algorithm returns that is in fact not a function of the observed data alone.
Figures
read the original abstract
Missing data is a pervasive problem in data analyses, resulting in datasets that contain censored realizations of a target distribution. Many approaches to inference on the target distribution using censored observed data, rely on missing data models represented as a factorization with respect to a directed acyclic graph. In this paper we consider the identifiability of the target distribution within this class of models, and show that the most general identification strategies proposed so far retain a significant gap in that they fail to identify a wide class of identifiable distributions. To address this gap, we propose a new algorithm that significantly generalizes the types of manipulations used in the ID algorithm, developed in the context of causal inference, in order to obtain identification.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper considers identifiability of a target distribution in missing-data models whose observed-data law factorizes according to a given DAG. It argues that existing general-purpose identification procedures (including extensions of the causal ID algorithm) leave a non-trivial gap, failing to recover the target even when it is identifiable under the DAG. The authors introduce a new algorithm that enlarges the set of allowed manipulations beyond those in the standard ID algorithm and claim that the resulting procedure recovers the target whenever it is identifiable.
Significance. If the soundness claim holds, the result would close a documented gap in the graphical identification literature for missing data and would allow routine application of a single algorithm to a strictly larger class of identifiable problems than was previously possible. The work directly extends a well-studied causal-inference primitive (the ID algorithm) rather than starting from scratch, which increases its immediate utility.
major comments (2)
- [§4] §4, Algorithm 1, lines 12–18: the generalized ‘missingness intervention’ operation is defined by replacing the conditional distribution of the missingness indicator with a fixed value; the manuscript must supply an explicit inductive argument showing that each such step preserves the observed-data law when the target is identifiable under the input DAG. Without this argument the completeness claim rests on the examples alone.
- [Example 3] Example 3 (the three-variable chain with MNAR missingness on the middle variable): the paper asserts that prior ID-based procedures return ‘unidentified’ while the new algorithm returns the correct functional. The derivation of the functional should be written out in full (including the explicit expression for the recovered density) so that readers can verify it does not rely on an implicit parametric assumption.
minor comments (2)
- [§2–3] Notation for the observed-data law versus the full-data law is introduced inconsistently between §2 and §3; a single table of symbols would eliminate repeated parenthetical clarifications.
- [Figures 1–3] The running example graphs would be easier to follow if every node were explicitly labeled as fully observed, partially observed, or missingness indicator.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive review. The comments highlight areas where additional rigor and clarity will improve the manuscript. We address each major comment below and will incorporate the requested material in the revision.
read point-by-point responses
-
Referee: [§4] §4, Algorithm 1, lines 12–18: the generalized ‘missingness intervention’ operation is defined by replacing the conditional distribution of the missingness indicator with a fixed value; the manuscript must supply an explicit inductive argument showing that each such step preserves the observed-data law when the target is identifiable under the input DAG. Without this argument the completeness claim rests on the examples alone.
Authors: We agree that an explicit inductive argument is required to establish that each generalized missingness intervention preserves the observed-data law. In the revised manuscript we will insert a formal inductive proof in §4 that proceeds by induction on the number of interventions, showing preservation at each step under the assumption that the target is identifiable from the input DAG. This will place the completeness claim on a rigorous footing rather than relying primarily on examples. revision: yes
-
Referee: [Example 3] Example 3 (the three-variable chain with MNAR missingness on the middle variable): the paper asserts that prior ID-based procedures return ‘unidentified’ while the new algorithm returns the correct functional. The derivation of the functional should be written out in full (including the explicit expression for the recovered density) so that readers can verify it does not rely on an implicit parametric assumption.
Authors: We will expand Example 3 to contain a complete, line-by-line derivation of the recovered functional. The expanded example will explicitly state each algebraic step and the final expression for the target density, making clear that the derivation uses only the graphical structure and the definition of the new operations, without any parametric restrictions. revision: yes
Circularity Check
No significant circularity
full rationale
The paper proposes a new identification algorithm for missing-data models on DAGs by generalizing manipulations from the causal ID algorithm. The derivation chain consists of defining the class of models via DAG factorization, exhibiting a gap in prior methods via counterexamples, and presenting generalized operations whose soundness is argued directly from the graphical structure rather than by fitting parameters or reducing to self-citations. No equation equates a claimed prediction to an input by construction, no uniqueness theorem is imported solely from overlapping prior work as an external fact, and the central result does not rename a known empirical pattern. The reference to the ID algorithm functions as an external foundation from causal inference, not a load-bearing loop internal to this manuscript. The derivation is therefore self-contained against the stated graphical assumptions.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
new algorithm that significantly generalizes the types of manipulations used in the ID algorithm... fixing operations according to a partial order... fix sets of variables jointly
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
nested factorization... fixing operator φV
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Pearl’s calcu- lus of interventions is complete
Yimin Huang and Marco V altorta. Pearl’s calcu- lus of interventions is complete. In Twenty Sec- ond Conference On Uncertainty in Artificial Intel- ligence, 2006
work page 2006
- [2]
-
[3]
Graphical models for recovering probabilistic and causal queries from missing data
Karthika Mohan and Judea Pearl. Graphical models for recovering probabilistic and causal queries from missing data. In Advances in Neural Information Processing Systems, pages 1520–1528. 2014
work page 2014
-
[4]
Graph- ical models for inference with missing data
Karthika Mohan, Judea Pearl, and Jin Tian. Graph- ical models for inference with missing data. In Ad- vances in Neural Information Processing Systems , pages 1277–1285, 2013
work page 2013
-
[5]
Probabilistic Reasoning in Intelligent Systems
Judea Pearl. Probabilistic Reasoning in Intelligent Systems. Morgan and Kaufmann, San Mateo, 1988
work page 1988
-
[6]
Causality: Models, Reasoning, and Inference
Judea Pearl. Causality: Models, Reasoning, and Inference. Cambridge University Press, 2 edition, 2009
work page 2009
-
[7]
Thomas S. Richardson, Robin J. Evans, James M. Robins, and Ilya Shpitser. Nested Markov properties for acyclic directed mixed graphs. arXiv:1701.06686v2, 2017. Working paper
-
[8]
James M. Robins. A new approach to causal in- ference in mortality studies with sustained expo- sure periods – application to control of the healthy worker survivor effect. Mathematical Modeling , 7:1393–1512, 1986
work page 1986
-
[9]
James M. Robins. Non-response models for the analysis of non-monotone non-ignorable missing data. Statistics in Medicine , 16:21–37, 1997
work page 1997
-
[10]
D. B. Rubin. Causal inference and missing data (with discussion). Biometrika, 63:581–592, 1976
work page 1976
-
[11]
Mauricio Sadinle and Jerome P . Reiter. Item- wise conditionally independent nonresponse mod- elling for incomplete multivariate data. Biometrika, 104(1):207–220, 2017
work page 2017
-
[12]
Consistent estimation of functions of data missing non-monotonically and not at random
Ilya Shpitser. Consistent estimation of functions of data missing non-monotonically and not at random. In Advances in Neural Information Processing Sys- tems, pages 3144–3152, 2016
work page 2016
-
[13]
Missing data as a causal and probabilistic prob- lem
Ilya Shpitser, Karthika Mohan, and Judea Pearl. Missing data as a causal and probabilistic prob- lem. In Proceedings of the Thirty First Conference on Uncertainty in Artificial Intelligence (UAI-15) , pages 802–811. AUAI Press, 2015
work page 2015
-
[14]
Identification of joint interventional distributions in recursive semi- Markovian causal models
Ilya Shpitser and Judea Pearl. Identification of joint interventional distributions in recursive semi- Markovian causal models. In Proceedings of the Twenty-First National Conference on Artificial In- telligence (AAAI-06). AAAI Press, 2006
work page 2006
-
[15]
Tchetgen Tchetgen, Linbo Wang, and BaoLuo Sun
Eric J. Tchetgen Tchetgen, Linbo Wang, and BaoLuo Sun. Discrete choice models for non- monotone nonignorable missing data: Identifica- tion and inference. Statistica Sinica , 28(4):2069– 2088, 2018
work page 2069
-
[16]
A general identification condition for causal effects
Jin Tian and Judea Pearl. A general identification condition for causal effects. In Eighteenth National Conference on Artificial Intelligence , pages 567– 573, 2002
work page 2002
-
[17]
Semiparametric Theory and Missing Data
Anastasios Tsiatis. Semiparametric Theory and Missing Data. Springer-V erlag New Y ork, 1st edi- tion edition, 2006
work page 2006
-
[18]
Y an Zhou, Roderick J. A. Little, and John D. Kalbfleisch. Block-conditional missing at ran- dom models for missing data. Statistical Science , 25(4):517–532, 2010. 7 APPENDIX A. Proofs Proposition 1 Given a DAG G(X(1), R, O, X), the distribution p(Ri|paG(Ri))|paG(Ri)∩ R=1 is identifiable from p(R, O, X) if there exists (i) Z⊆ X(1)∪ R∪ O, (ii) an equivalenc...
work page 2010
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.