Causal discovery under mean independence and linearity
Pith reviewed 2026-05-08 17:52 UTC · model grok-4.3
The pith
One-sided mean independence of disturbances allows generic identification of source nodes in linear acyclic models
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The authors define the Linear Mean-Independent Acyclic Model in which each disturbance satisfies a one-sided mean-independence restriction with respect to its parents. They prove that the finite-order consequences of these restrictions generically identify the source nodes. From any such source the graph can be reduced by removing it and its outgoing edges, and the process repeats. The proof is constructive, yielding the DirectLiMIAM algorithm that sequentially finds sources by testing mean-independence on candidate residuals.
What carries the argument
Finite-order consequences of one-sided mean-independence restrictions, used to test and identify source nodes recursively
If this is right
- DirectLiMIAM recovers causal order by iteratively identifying and removing source nodes based on residual mean-independence tests
- Performance exceeds LiNGAM methods in simulations where disturbances are mean-independent but otherwise dependent
- Empirical results on oil market variables produce an ordering consistent with economic intuition, unlike independence-based methods
- The framework shows that dependence alone does not prevent causal discovery if mean independence holds
Where Pith is reading between the lines
- If mean-independence can be verified on data, the method offers a practical alternative when full independence is implausible
- Extensions could explore whether similar finite-order conditions work for non-linear or non-Gaussian settings
- The recursive structure suggests compatibility with other identifiability results that rely on source detection
Load-bearing premise
The one-sided mean-independence restrictions on disturbances hold and their finite-order consequences are sufficient to identify source nodes generically
What would settle it
Observing a linear acyclic system with mean-independent disturbances in which no source node satisfies the finite-order identifiability conditions, or empirical data where the recovered order is demonstrably wrong despite the restrictions
Figures
read the original abstract
Causal discovery methods such as LiNGAM identify causal structure from observational data by assuming mutually independent disturbances. This assumption is fragile: shared volatility, common scale effects, or other forms of dependence can cause the methods to recover the wrong causal order, even with infinite data. We introduce the Linear Mean-Independent Acyclic Model (LiMIAM), which replaces full independence with weaker one-sided mean-independence restrictions on the disturbances. Under finite-order consequences of these restrictions, source nodes are generically identifiable, and hence a compatible causal order can be recovered recursively. Our proof is constructive and leads to DirectLiMIAM, a sequential residual-based algorithm for causal discovery under dependent noise. In simulations with mean-independent but dependent disturbances, DirectLiMIAM outperforms LiNGAM methods. A large-scale empirical application to the oil market highlights the implausibility of the independence assumption and the ability of DirectLiMIAM to recover a realistic causal ordering, from policy to production and from prices to inflation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript introduces the Linear Mean-Independent Acyclic Model (LiMIAM), which replaces the mutual independence assumption on disturbances in linear acyclic models (as in LiNGAM) with weaker one-sided mean-independence restrictions. It claims that finite-order consequences of these restrictions generically identify source nodes, enabling recursive recovery of a compatible causal order via the constructive DirectLiMIAM algorithm. Simulations with mean-independent but dependent disturbances show outperformance over LiNGAM, and an empirical application to oil market data illustrates recovery of a realistic ordering.
Significance. If the identifiability and recursion results hold, the work is significant for causal discovery because it relaxes a strong and frequently violated assumption (full independence), addressing fragility to shared volatility or other dependence. The constructive proof leading to a sequential residual-based algorithm and the large-scale empirical example are strengths that could improve robustness in applications.
major comments (2)
- [§3] §3 (generic identifiability of source nodes): the argument that finite-order consequences of one-sided mean-independence suffice for generic identification of sources does not explicitly verify that these consequences (and the mean-independence restrictions themselves) are preserved in the reduced system after conditioning on or removing an identified source. Because disturbances are permitted to be dependent, removal can induce new statistical dependence in the residuals that violates the moment conditions used for identification, which is load-bearing for the recursive recovery claim.
- [§4] §4 (DirectLiMIAM algorithm and recursion): the sequential procedure assumes that after identifying a source the remaining variables continue to satisfy the same class of finite-order mean-independence restrictions with respect to their own disturbances and parents, but no lemma or step demonstrates this propagation under one-sided (rather than two-sided) dependence.
minor comments (2)
- [Abstract and §2] The abstract and §2 introduce 'finite-order consequences' without specifying the maximal order used in practice or how it is selected from data; this affects reproducibility of DirectLiMIAM.
- [§2] Notation for the mean-independence restriction (e.g., E[ε_i | parents] = 0 or similar) could be stated as an explicit equation rather than described in prose to avoid ambiguity with standard conditional mean independence.
Simulated Author's Rebuttal
We thank the referee for their thorough review and valuable feedback on our manuscript. The major comments raise important points regarding the completeness of our identifiability and recursion arguments. We address each comment in turn below.
read point-by-point responses
-
Referee: [§3] §3 (generic identifiability of source nodes): the argument that finite-order consequences of one-sided mean-independence suffice for generic identification of sources does not explicitly verify that these consequences (and the mean-independence restrictions themselves) are preserved in the reduced system after conditioning on or removing an identified source. Because disturbances are permitted to be dependent, removal can induce new statistical dependence in the residuals that violates the moment conditions used for identification, which is load-bearing for the recursive recovery claim.
Authors: We appreciate the referee highlighting this potential gap in our presentation. The manuscript establishes generic identifiability of source nodes based on the finite-order consequences of the one-sided mean-independence assumptions. However, we acknowledge that the preservation of these conditions in the reduced model after source removal is not explicitly verified, which is essential for the recursive identification. We will revise the manuscript by adding a lemma in §3 that demonstrates this preservation. Specifically, we will show that under the linear acyclic structure and the original mean-independence restrictions, the residuals in the subsystem satisfy analogous finite-order mean-independence conditions with respect to their parents, even when disturbances are dependent. This will be done by carefully tracking the induced dependencies and verifying the moment conditions hold. revision: yes
-
Referee: [§4] §4 (DirectLiMIAM algorithm and recursion): the sequential procedure assumes that after identifying a source the remaining variables continue to satisfy the same class of finite-order mean-independence restrictions with respect to their own disturbances and parents, but no lemma or step demonstrates this propagation under one-sided (rather than two-sided) dependence.
Authors: We agree with the referee that the DirectLiMIAM algorithm's recursive validity relies on the propagation of the assumptions. The current text assumes this without a dedicated proof step. We will update §4 to include an explicit argument or lemma establishing that after removing an identified source, the remaining variables satisfy the required mean-independence restrictions under one-sided dependence. This revision will provide the missing justification for the sequential procedure. revision: yes
Circularity Check
No circularity: constructive identification argument is independent of inputs
full rationale
The paper's central derivation introduces the LiMIAM model with one-sided mean-independence on disturbances and claims generic identifiability of source nodes from finite-order consequences, enabling recursive order recovery via a constructive proof. This does not reduce to self-definition, fitted parameters renamed as predictions, or load-bearing self-citations; the abstract and described proof strategy treat the restrictions as primitive assumptions whose consequences are derived forward without circular reduction. No quoted equations or steps exhibit the target result being presupposed in the inputs.
Axiom & Free-Parameter Ledger
axioms (2)
- domain assumption Disturbances satisfy one-sided mean-independence restrictions
- ad hoc to paper Finite-order consequences of mean-independence restrictions allow generic identifiability of source nodes
Lean theorems connected to this paper
-
Cost.FunctionalEquation (J(x) = ½(x + x⁻¹) − 1, washburn_uniqueness_aczel)washburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
S^{(d)}_{ij} = E[X_i X_j^{d-1}] E[X_j^2] − E[X_i X_j] E[X_j^d]; Theorem 3.6 (Generic identification from a pair of moments).
-
Foundation/* — RS forcing chain has no opinion on econometric SVAR identificationreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Application to oil-market SVAR with 5–6 variables and 24 lags; ordering from OPEC surprises to production to prices to CPI.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Samuel and Drton, Mathias , title =
Wang, Y. Samuel and Drton, Mathias , title =. Biometrika , volume =. 2020 , doi =
2020
-
[2]
Sims , journal =
Christopher A. Sims , journal =. Macroeconomics and Reality , volume =
-
[3]
Essays in Honor of Joon Y
Lutz Kilian and Xiaoqing Zhou , title=. Essays in Honor of Joon Y. Park: Econometric Methodology in Empirical Applications , chapter=. 2023 , month=
2023
-
[4]
Topics in Advanced Econometrics, Volume II Linear and Nonlinear Simultaneous Equations , author =
-
[5]
Quantitative Economics , volume =
Hoesch, Lukas and Lee, Adam and Mesters, Geert , title =. Quantitative Economics , volume =
-
[6]
2017 , publisher=
Structural Vector Autoregressive Analysis , author=. 2017 , publisher=
2017
-
[7]
American Economic Review , Volume =
Kilian, Lutz , Title =. American Economic Review , Volume =. 2009 , Month =. doi:10.1257/aer.99.3.1053 , URL =
-
[8]
Känzig, Diego R. , Title =. American Economic Review , Volume =. 2021 , Month =. doi:10.1257/aer.20190964 , URL =
-
[9]
Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =
Pfister, Niklas and Bühlmann, Peter and Schölkopf, Bernhard and Peters, Jonas , title =. Journal of the Royal Statistical Society Series B: Statistical Methodology , volume =. 2018 , month =. doi:10.1111/rssb.12235 , url =
-
[10]
and Heskes, Tom , title =
Mooij, Joris M. and Heskes, Tom , title =. Proceedings of the 29th Annual Conference on Uncertainty in Artificial Intelligence (UAI 2013) , year =
2013
-
[11]
and Magliacane, Sara and Claassen, Tom , title =
Mooij, Joris M. and Magliacane, Sara and Claassen, Tom , title =. Journal of Machine Learning Research , year =
-
[12]
, title =
Ramey, Valerie A. , title =. Handbook of Macroeconomics , editor =. 2016 , publisher =
2016
-
[13]
Distribution theory of spherical distributions and a location-scale parameter generalization , author=. Sankhy. 1970 , publisher=
1970
-
[14]
Journal of the American Statistical Association , volume =
Christoph Schultheiss and Peter Bühlmann and Ming Yuan , title =. Journal of the American Statistical Association , volume =
-
[15]
2007 , author =
Chapter 76 Large Sample Sieve Estimation of Semi-Nonparametric Models , editor =. 2007 , author =
2007
-
[16]
2001 , author =
Goodness-of-fit tests for kernel regression with an application to option implied volatilities , journal =. 2001 , author =
2001
-
[17]
Consistent Model Specification Tests: Omitted Variables and Semiparametric Functional Forms , volume =
Yanqin Fan and Qi Li , journal =. Consistent Model Specification Tests: Omitted Variables and Semiparametric Functional Forms , volume =
-
[18]
and Crump, R
Cattaneo, Matias D. and Crump, R. and Farrel, M. and Feng, Yingjie , title =. American Economic Review , year =
-
[19]
and Jordan, Michael I
Bach, Francis R. and Jordan, Michael I. , title =. Journal of Machine Learning Research , year =
-
[20]
Estimation of a Structural Vector Autoregression Model Using Non-
Hyv. Estimation of a Structural Vector Autoregression Model Using Non-. Journal of Machine Learning Research , year =
-
[21]
Pairwise Likelihood Ratios for Estimation of Non-
Hyv. Pairwise Likelihood Ratios for Estimation of Non-. Journal of Machine Learning Research , year =
-
[22]
and Nolan, Garry P
Sachs, Karen and Perez, Omar and Pe'er, Dana and Lauffenburger, Douglas A. and Nolan, Garry P. , title =. Science , volume =
-
[23]
and Miller, Karla L
Smith, Stephen M. and Miller, Karla L. and Salimi-Khorshidi, Gholamreza and Webster, Matthew and Beckmann, Christian F. and Nichols, Thomas E. and Ramsey, Joseph D. and Woolrich, Mark W. , title =. NeuroImage , volume =
-
[24]
Annals of the Institute of Statistical Mathematics , volume =
The Calculation of Cumulants via Conditioning , author =. Annals of the Institute of Statistical Mathematics , volume =
-
[25]
2009 , publisher =
Causality , author =. 2009 , publisher =
2009
-
[26]
2000 , publisher =
Causation, Prediction, and Search , author =. 2000 , publisher =
2000
-
[27]
Identification of Structural Vector Autoregressions via Non-
Lanne, Markku and Meitz, Mika and Saikkonen, Pentti , journal =. Identification of Structural Vector Autoregressions via Non-
-
[28]
Structural Vector Autoregressions With Nonnormal Residuals , journal =
Lanne, Markku and L. Structural Vector Autoregressions With Nonnormal Residuals , journal =. 2010 , volume =
2010
-
[29]
The Econometrics of Complex Systems , pages =
Pseudo-Maximum Likelihood Independent Component Analysis , author =. The Econometrics of Complex Systems , pages =. 2017 , publisher =
2017
-
[30]
Statistical Inference for Independent Component Analysis: Application to Structural
Gouri. Statistical Inference for Independent Component Analysis: Application to Structural. Journal of Econometrics , year =
-
[31]
Science , volume =
Decorrelated Neuronal Firing in Monkey Visual Cortex , author =. Science , volume =
-
[32]
Nature Reviews Genetics , volume =
Computational and Analytical Challenges in Single-Cell Transcriptomics , author =. Nature Reviews Genetics , volume =
-
[33]
and Casale, F
Buettner, Florian and Natarajan, Kedar N. and Casale, F. Paolo and Proserpio, Valentina and Scialdone, Antonio and Theis, Fabian J. and Teichmann, Sarah A. and Marioni, John C. and Stegle, Oliver , title =. Nature Biotechnology , year =
-
[34]
International conference on machine learning , pages=
Linear causal disentanglement via interventions , author=. International conference on machine learning , pages=. 2023 , organization=
2023
-
[35]
SIAM Journal on Matrix Analysis and Applications , volume =
Jacobi Angles for Simultaneous Diagonalization , author =. SIAM Journal on Matrix Analysis and Applications , volume =
-
[36]
Signal Processing , year =
Comon, Pierre , title =. Signal Processing , year =
-
[37]
The Annals of Statistics , volume =
CAM: Causal Additive Models, High-Dimensional Order Search and Penalized Regression , author =. The Annals of Statistics , volume =
-
[38]
Samuel and Drton, Mathias , journal =
Wang, Y. Samuel and Drton, Mathias , journal =. Causal Discovery with Unobserved Confounding and Non-
-
[39]
Samuel and Drton, Mathias , title =
Wang, Y. Samuel and Drton, Mathias , title =. Biometrika , volume =
-
[40]
Statistical Causal Discovery:
Shimizu, Shohei , year =. Statistical Causal Discovery:
-
[41]
The Review of Economic Studies , volume =
Harvey, Andrew and Ruiz, Esther and Shephard, Neil , title =. The Review of Economic Studies , volume =
-
[42]
, title =
Engle, Robert F. , title =. Econometrica , volume =
-
[43]
Journal of Econometrics , volume =
Bollerslev, Tim , title =. Journal of Econometrics , volume =
-
[44]
Science , volume =
Arieli, Amos and Sterkin, Avital and Grinvald, Amiram and Aertsen, Ad , title =. Science , volume =
-
[45]
and Kohn, Adam , title =
Cohen, Marlene R. and Kohn, Adam , title =. Nature Neuroscience , volume =
-
[46]
and Goris, Robbe L
Rabinowitz, Neil C. and Goris, Robbe L. T. and Cohen, Marlene R. and Simoncelli, Eero P. , title =. Nature , volume =
-
[47]
and Levine, Arnold J
Elowitz, Michael B. and Levine, Arnold J. and Siggia, Eric D. and Swain, Peter S. , title =. Science , volume =
-
[48]
Accounting for Technical Noise in Single-Cell
Brennecke, Philip and Anders, Simon and Kim, Jong Kyoung and Ko. Accounting for Technical Noise in Single-Cell. Nature Methods , volume =
-
[49]
Shimizu, Shohei and Hoyer, Patrik O. and Hyv. A Linear Non-. Journal of Machine Learning Research , volume =. 2006 , url =
2006
-
[50]
Journal of Machine Learning Research , volume =
Shimizu, Shohei and Inazumi, Takanori and Sogawa, Yasuhiro and Hyv. Journal of Machine Learning Research , volume =. 2011 , url =
2011
-
[51]
and Shimizu, Shohei and Kerminen, Antti and Palviainen, Markus , title =
Hoyer, Patrik O. and Shimizu, Shohei and Kerminen, Antti and Palviainen, Markus , title =. Proceedings of the 24th Conference on Uncertainty in Artificial Intelligence , pages =
-
[52]
The Annals of Statistics , volume =
Non-Independent Component Analysis , author =. The Annals of Statistics , volume =
-
[53]
Orthogonal Eigenvectors and Singular Vectors of Tensors , year =
Ribot,. Orthogonal Eigenvectors and Singular Vectors of Tensors , year =. 2506.19009 , archiveprefix =
-
[54]
Beyond Independent Component Analysis: Identifiability and Algorithms , year =
Ribot,. Beyond Independent Component Analysis: Identifiability and Algorithms , year =. 2510.07525 , archiveprefix =
-
[55]
IEEE Transactions on Neural Networks , volume =
Fast and Robust Fixed-Point Algorithms for Independent Component Analysis , author =. IEEE Transactions on Neural Networks , volume =
-
[56]
Neural Computation , volume =
A Fast Fixed-Point Algorithm for Independent Component Analysis , author =. Neural Computation , volume =
-
[57]
Linear Algebra and its Applications , volume =
The Number of Eigenvalues of a Tensor , author =. Linear Algebra and its Applications , volume =
-
[58]
Advances in neural information processing systems , volume=
Dags with no tears: Continuous optimization for structure learning , author=. Advances in neural information processing systems , volume=
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.