Causal Discovery in Structural VAR Models Under Equal Noise Variance
Pith reviewed 2026-05-22 05:05 UTC · model grok-4.3
The pith
In structural VAR models with equal noise variances, multiple causal structures induce the same observed process and are related by orthogonal transformations plus global scale.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
In linear Gaussian structural VAR models under the equal noise variance assumption, the set of structural parameterizations that induce the same stationary observed process law forms an equivalence class characterized by orthogonal transformations of the structural equations together with a global positive scale. This characterization yields the observational alignment discrepancy, which compares models only after accounting for transformations that leave the observed law unchanged. Building on the theory, the ENVAR procedure searches the equivalence class for a sparse normalized structural representative.
What carries the argument
Observational equivalence class characterized by orthogonal transformations of the structural equations plus a global positive scale factor, which groups all structural VAR parameterizations that produce identical stationary observed processes.
If this is right
- Causal discovery returns an equivalence class rather than a single graph, so downstream users must account for the remaining ambiguity.
- The ENVAR algorithm can be run on synthetic or real multivariate time series to produce a sparse normalized representative inside the equivalence class.
- The same observational law can arise from distinct contemporaneous causal structures provided they are related by rotation and scaling.
- The method directly handles non-acyclic contemporaneous effects, which are realistic when sampling intervals are coarse relative to system dynamics.
Where Pith is reading between the lines
- The orthogonal-plus-scale characterization may suggest analogous equivalence results for other linear time-series models that impose moment or variance constraints.
- In fMRI or other neuroimaging pipelines, the equivalence class could be used to generate multiple candidate causal graphs that are then tested against interventional data.
- Relaxing the equal-variance assumption would likely enlarge the equivalence class or restore point identification, offering a natural direction for sensitivity analysis.
Load-bearing premise
All structural noise terms share exactly the same variance.
What would settle it
A concrete counter-example consisting of two structural VAR parameterizations that induce the same observed process law but cannot be related by any orthogonal transformation plus global scale, or conversely an example where equal-variance models are point-identified.
Figures
read the original abstract
Causal discovery from multivariate time series is challenging when causal effects may occur both across time and within the same sampling interval. This issue is especially important in applications such as neuroscience, where the sampling rate may be coarse relative to the underlying dynamics and contemporaneous effects need not form an acyclic graph. We study causal discovery in linear Gaussian structural VAR models under an equal noise variance assumption, meaning that the structural noise terms have a common variance. Unlike the DAG-based cross-sectional equal noise variance setting, the time-series setting considered here does not generally yield point identification of a unique causal graph. Instead, multiple structural VAR parameterizations can induce the same stationary observed process law. We introduce a notion of observational equivalence tailored to this setting and show that the corresponding equivalence class is characterized by orthogonal transformations of the structural equations together with a global positive scale. This characterization leads to an equivalence-aware model discrepancy, the observational alignment discrepancy, which compares structural models modulo transformations that preserve the observed law. Building on this theory, we propose ENVAR, a sparsity-based procedure that searches over the induced observational equivalence class for a sparse normalized structural representative. We evaluate the proposed methodology on synthetic structural VAR data and on an fMRI dataset.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. This paper addresses causal discovery in linear Gaussian structural VAR models with equal noise variances. It introduces a tailored notion of observational equivalence and demonstrates that the equivalence class consists of models related by orthogonal transformations of the structural equations along with a global positive scale. Based on this, it defines an observational alignment discrepancy and develops the ENVAR procedure to identify a sparse normalized representative within the class. The methodology is assessed through experiments on synthetic structural VAR data and an fMRI dataset.
Significance. If the central theoretical result holds, the paper offers a meaningful advance in handling identifiability issues in time-series causal models. The equal variance assumption provides a realistic middle ground between full identification and complete non-identification, leading to a well-defined equivalence class that can be searched efficiently for sparse solutions. This is particularly relevant for applications with coarse sampling, such as fMRI. The direct connection to linear algebra properties of the model is a strength, and the empirical evaluations provide practical validation.
major comments (2)
- §3: The characterization of the equivalence class by orthogonal transformations and global scale is derived from the condition A Σ A^T = σ² I. However, the manuscript should explicitly verify that the transformation preserves the reduced-form coefficients for the lagged terms in higher-order VAR(p) models with p > 1, as this is load-bearing for the general claim.
- §5.2: In the fMRI application, the choice of the sparsity level and the normalization procedure within the equivalence class should be justified more clearly, as small changes could affect the recovered graph structure.
minor comments (2)
- Abstract: The abstract mentions 'synthetic and fMRI evaluations' but could briefly note the key performance metrics or baselines used.
- Introduction: Some equations use notation that is introduced later; consider moving the model definition earlier for better flow.
Simulated Author's Rebuttal
We thank the referee for the positive and constructive feedback on our manuscript. We address the major comments below and will incorporate the suggested clarifications in the revised version.
read point-by-point responses
-
Referee: §3: The characterization of the equivalence class by orthogonal transformations and global scale is derived from the condition A Σ A^T = σ² I. However, the manuscript should explicitly verify that the transformation preserves the reduced-form coefficients for the lagged terms in higher-order VAR(p) models with p > 1, as this is load-bearing for the general claim.
Authors: We appreciate this important point. The equivalence transformations are defined on the structural parameters such that the observed reduced-form process is preserved. Specifically, for a VAR(p) model, applying an orthogonal transformation Q to the contemporaneous matrix A and correspondingly to the lagged coefficient matrices ensures invariance of the reduced-form coefficients. To address the referee's concern explicitly, we will add a verification subsection in §3 demonstrating algebraically that the lagged terms remain unchanged in the reduced-form representation for p > 1. This will include the relevant matrix equations. revision: yes
-
Referee: §5.2: In the fMRI application, the choice of the sparsity level and the normalization procedure within the equivalence class should be justified more clearly, as small changes could affect the recovered graph structure.
Authors: We agree that additional justification would improve clarity. The sparsity level was selected via a grid search minimizing the observational alignment discrepancy on held-out data, and the normalization follows directly from fixing the global scale to unity as defined in the equivalence class. In the revision, we will elaborate on this procedure in §5.2, including a brief sensitivity analysis to demonstrate robustness of the recovered graph to small perturbations in the sparsity parameter. revision: yes
Circularity Check
No significant circularity; equivalence class follows from linear algebra on reduced-form parameters
full rationale
The paper's central derivation establishes that, under equal noise variance, observational equivalence in structural VAR models is characterized by orthogonal transformations of the structural equations plus a global positive scale. This follows directly from the relation between structural parameters (A, lagged coefficients) and the observed stationary law (reduced-form AR matrices and innovation covariance satisfying A Sigma A^T = sigma^2 I). The characterization is obtained by algebraic manipulation of the model equations without fitting parameters to target data, without self-citation load-bearing steps, and without renaming or smuggling ansatzes. The subsequent sparsity search and observational alignment discrepancy are downstream applications of this independent mathematical result.
Axiom & Free-Parameter Ledger
free parameters (1)
- global positive scale
axioms (1)
- domain assumption The data-generating process is a linear Gaussian structural VAR model in which all structural noise terms share a common variance.
Reference graph
Works this paper leans on
- [1]
-
[2]
Chen, Wenyu and Drton, Mathias and Wang, Y. Samuel , title =. Biometrika , year =
-
[3]
Granger, Clive W. J. , title =. Econometrica , year =
- [4]
-
[5]
New Introduction to Multiple Time Series Analysis , publisher =
L. New Introduction to Multiple Time Series Analysis , publisher =. 2005 , doi =
work page 2005
-
[6]
Causal Search in Structural Vector Autoregressive Models , booktitle =
Moneta, Alessio and Chla. Causal Search in Structural Vector Autoregressive Models , booktitle =. 2011 , url =
work page 2011
-
[7]
Estimation of a Structural Vector Autoregression Model Using Non-
Hyv. Estimation of a Structural Vector Autoregression Model Using Non-. Journal of Machine Learning Research , year =
-
[8]
Causal Inference on Time Series Using Restricted Structural Equation Models , booktitle =
Peters, Jonas and Janzing, Dominik and Sch. Causal Inference on Time Series Using Restricted Structural Equation Models , booktitle =. 2013 , url =
work page 2013
-
[9]
Malinsky, Daniel and Spirtes, Peter , title =. Proceedings of the 2018. 2018 , url =
work page 2018
-
[10]
Runge, Jakob and Nowack, Peer and Kretschmer, Marlene and Flaxman, Seth and Sejdinovic, Dino , title =. Science Advances , year =
-
[11]
Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence , series =
Runge, Jakob , title =. Proceedings of the 36th Conference on Uncertainty in Artificial Intelligence , series =. 2020 , url =
work page 2020
-
[12]
Pamfil, Roxana and Sriwattanaworachai, Nisara and Desai, Shaan and Pilgerstorfer, Philip and Georgatzis, Konstantinos and Beaumont, Paul and Aragam, Bryon , title =. Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics , series =. 2020 , url =
work page 2020
-
[13]
and Devijver, Emilie and Gaussier, Eric , title =
Assaad, Charles K. and Devijver, Emilie and Gaussier, Eric , title =. Journal of Artificial Intelligence Research , year =
-
[14]
and Harrison, Lee and Penny, Will , title =
Friston, Karl J. and Harrison, Lee and Penny, Will , title =. NeuroImage , year =
-
[15]
Smith, Stephen M. and Miller, Karla L. and Salimi-Khorshidi, Gholamreza and Webster, Matthew and Beckmann, Christian F. and Nichols, Thomas E. and Ramsey, Joseph D. and Woolrich, Mark W. , title =. NeuroImage , year =
-
[16]
Seth, Anil K. and Barrett, Adam B. and Barnett, Lionel , title =. Journal of Neuroscience , year =
-
[17]
Ritter, Petra and Schirner, Michael and McIntosh, Anthony R. and Jirsa, Viktor K. , title =. Brain Connectivity , year =
-
[18]
Arab, Fahimeh and Ghassami, AmirEmad and Jamalabadi, Hamidreza and Peters, Megan A. K. and Nozari, Erfan , title =. Network Neuroscience , year =
-
[19]
The WU-Minn human connectome project: an overview , author=. Neuroimage , volume=. 2013 , publisher=
work page 2013
-
[20]
The minimal preprocessing pipelines for the Human Connectome Project , author=. Neuroimage , volume=. 2013 , publisher=
work page 2013
-
[21]
Function in the human connectome: task-fMRI and individual differences in behavior , author=. Neuroimage , volume=. 2013 , publisher=
work page 2013
-
[22]
Towards a consensus regarding global signal regression for resting state functional connectivity MRI , author=. Neuroimage , volume=. 2017 , publisher=
work page 2017
-
[23]
Local-global parcellation of the human cerebral cortex from intrinsic functional connectivity MRI , author=. Cerebral cortex , volume=. 2018 , publisher=
work page 2018
-
[24]
Topographic organization of the human subcortex unveiled with functional connectivity gradients , author=. Nature neuroscience , volume=. 2020 , publisher=
work page 2020
-
[25]
Nature Biomedical Engineering , year=
Macroscopic resting-state brain dynamics are best described by linear models , author=. Nature Biomedical Engineering , year=
- [26]
-
[27]
Magnetic resonance imaging , volume=
Investigating directed cortical interactions in time-resolved fMRI data using vector autoregressive modeling and Granger causality mapping , author=. Magnetic resonance imaging , volume=. 2003 , publisher=
work page 2003
-
[28]
Large-scale automated synthesis of human functional neuroimaging data , author=. Nature methods , volume=. 2011 , publisher=
work page 2011
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.