Conditional independence testing with a single realization of a multivariate nonstationary nonlinear time series
Pith reviewed 2026-05-22 18:00 UTC · model grok-4.3
The pith
A framework enables conditional independence testing from a single realization of a nonstationary nonlinear multivariate time series.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The central claim is that conditional independence between two components of a multivariate time series, given a third, can be tested using only a single realization even when the series is nonstationary and the dependence structure is nonlinear. The test is constructed by estimating time-varying nonlinear regression functions to produce error processes, forming statistics from local long-run covariance matrices of products of these errors, and applying a distribution-uniform strong Gaussian approximation to control the test.
What carries the argument
Time-varying nonlinear regression combined with estimation of local long-run covariance matrices of error products and a distribution-uniform strong Gaussian approximation.
If this is right
- Tests for unconditional independence between two processes become available under the same single-realization nonstationary nonlinear conditions.
- Causal discovery algorithms that rely on conditional independence tests can now be applied to single observed trajectories of nonlinear nonstationary series.
- Variable selection procedures for multivariate time series can operate without requiring multiple independent realizations or linearity.
Where Pith is reading between the lines
- The same machinery could be used to test conditional independence in single long recordings from neuroscience or finance where stationarity cannot be assumed.
- If the Gaussian approximation holds uniformly, the framework might extend to constructing confidence intervals for measures of nonlinear dependence in nonstationary settings.
Load-bearing premise
The method assumes that time-varying nonlinear regression functions can be estimated reliably enough from a single realization to yield error processes whose local long-run covariances admit a consistent Gaussian approximation.
What would settle it
A simulation experiment on a known nonstationary nonlinear process in which conditional independence holds but the proposed test rejects at rates far above the nominal level across many sample sizes would falsify the claim.
Figures
read the original abstract
Identifying relationships among stochastic processes is a core objective in many fields, such as economics. While the standard toolkit for multivariate time series analysis has many advantages, it can be difficult to capture nonlinear dynamics using linear vector autoregressive models. This difficulty has motivated the development of methods for causal discovery and variable selection for nonlinear time series, which routinely employ tests for conditional independence. In this paper, we introduce the first framework for conditional independence testing that works with a single realization of a nonstationary nonlinear process. We also show how our framework can be used to test for independence. The key technical ingredients of our framework are time-varying nonlinear regression, estimation of local long-run covariance matrices of products of error processes, and a distribution-uniform strong Gaussian approximation.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper introduces the first framework for conditional independence testing (and independence testing) that operates on a single realization of a multivariate nonstationary nonlinear time series. The method combines time-varying nonlinear regression, local long-run covariance estimation of products of error processes, and a distribution-uniform strong Gaussian approximation to construct valid test statistics and critical values.
Significance. If the technical claims hold, the result would fill an important gap by extending conditional independence testing beyond the stationary or multi-realization settings that dominate the literature. This is relevant for empirical work in economics and related fields where only one observed path of a nonlinear nonstationary process is available. The paper explicitly builds on standard regression and Gaussian-approximation tools rather than introducing entirely new primitives, which strengthens its potential impact if the uniformity conditions are rigorously established.
major comments (1)
- [§3.2–3.3, Theorem 4.1] §3.2–3.3 and Theorem 4.1: The distribution-uniform strong Gaussian approximation for the local long-run covariance estimators of residual products is presented as holding under the single-realization nonstationary nonlinear regime. However, the effective local sample size is governed by the bandwidth, which shrinks with sample size, while the regression function itself is time-varying; the uniformity over the function class therefore rests on moment and dependence conditions whose verification is not fully detailed for this regime. Because the critical values for the conditional-independence test are obtained directly from this approximation, any gap here is load-bearing for the central claim.
minor comments (2)
- [Notation] Notation for the time-varying regression estimator and the local covariance estimator should be made consistent between the main text and the appendix; currently the same symbol is used for both the population and estimated versions in several places.
- [Simulation study] The simulation section would benefit from an explicit statement of the bandwidth selection rule used in the reported experiments, as this choice directly affects the local sample size and therefore the finite-sample behavior of the Gaussian approximation.
Simulated Author's Rebuttal
We thank the referee for the careful and constructive report. The concern about the distribution-uniform strong Gaussian approximation is well taken, as it underpins the validity of the critical values. We respond to the major comment below and will strengthen the exposition in the revision.
read point-by-point responses
-
Referee: [§3.2–3.3, Theorem 4.1] §3.2–3.3 and Theorem 4.1: The distribution-uniform strong Gaussian approximation for the local long-run covariance estimators of residual products is presented as holding under the single-realization nonstationary nonlinear regime. However, the effective local sample size is governed by the bandwidth, which shrinks with sample size, while the regression function itself is time-varying; the uniformity over the function class therefore rests on moment and dependence conditions whose verification is not fully detailed for this regime. Because the critical values for the conditional-independence test are obtained directly from this approximation, any gap here is load-bearing for the central claim.
Authors: We agree that the uniformity of the Gaussian approximation over the function class is load-bearing and that the interaction between the shrinking bandwidth and the time-varying regression requires careful justification in the single-realization setting. Theorem 4.1 is stated under Assumptions 3.1–3.4, which impose moment and dependence conditions that are formulated to be uniform over the relevant function class and to accommodate local estimation with bandwidth h_n → 0. The appendix proof combines maximal inequalities for nonstationary triangular arrays with a blocking argument that explicitly accounts for the effective local sample size of order n h_n. Nevertheless, we acknowledge that sections 3.2–3.3 present these steps at a relatively high level. In the revised manuscript we will add a dedicated paragraph in §3.3 that walks through the verification of uniformity under the stated assumptions, highlighting how the time-varying regression is handled via local smoothing and how the moment conditions scale with the local sample size. This clarification will make the route from assumptions to the critical values fully explicit without changing the theorem statement itself. revision: partial
Circularity Check
No significant circularity detected in derivation chain
full rationale
The paper presents a framework for conditional independence testing based on time-varying nonlinear regression, estimation of local long-run covariance matrices of error products, and a distribution-uniform strong Gaussian approximation. These components are described as key technical ingredients drawn from standard statistical tools without any quoted reduction showing that a central result equals its inputs by construction, a fitted parameter renamed as a prediction, or a load-bearing premise justified solely by overlapping self-citation. The abstract and described method indicate an independent construction that does not collapse to tautology.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The key technical ingredients of our framework are time-varying nonlinear regression, estimation of local long-run covariance matrices of products of error processes, and a distribution-uniform strong Gaussian approximation.
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We quantify temporal dependence using the functional dependence measure of Wu [Wu05] ... total variation-type nonstationarity condition of Mies and Steland [MS23].
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
The generalization ability of online algorithms for dependent data
[AD12] Alekh Agarwal and John C. Duchi. “The generalization ability of online algorithms for dependent data”. In:IEEE Transactions on Information Theory 59.1 (2012), pp. 573–587. [ALW13] Pierre Alquier, Xiaoyin Li, and Olivier Wintenberger. “Prediction of time series by statistical learning: general losses and fast rates”. In:Dependence Modeling 1 (2013),...
-
[2]
Inference for parameters defined by moment inequalities: a recommended moment selection procedure
[AB12] Donald W.K. Andrews and Panle Jia Barwick. “Inference for parameters defined by moment inequalities: a recommended moment selection procedure”. In: Econometrica: Journal of the Econometric Society80.6 (2012), pp. 2805–
work page 2012
-
[3]
[AG09] Donald W.K. Andrews and Patrik Guggenberger. “Validity of subsampling and plug-in asymptotic inference for parameters defined by moment inequalities”. In: Econometric Theory25.3 (2009), pp. 669–709. [AS10] Donald W.K. Andrews and Gustavo Soares. “Inference for parameters defined by moment inequalities using generalized moment selection”. In:Econome...
-
[4]
Robust inference with knockoffs
[BCS20] Rina Foygel Barber, Emmanuel J. Candés, and Richard J. Samworth. “Robust inference with knockoffs”. In:The Annals of Statistics48.3 (2020), pp. 1409–
work page 2020
-
[5]
Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling
[BJ22] Rina Foygel Barber and Lucas Janson. “Testing goodness-of-fit and conditional independence with approximate co-sufficient sampling”. In:The Annals of Statistics 50.5 (2022), pp. 2514–2544. 120 [BM15] Sumanta Basu and George Michailidis. “Regularized estimation in sparse high-dimensional time series models”. In:Annals of Statistics43.4 (2015), pp. 1...
work page 2022
-
[6]
Some new asymptotic theory for least squares series: pointwise and uniform results
[Bel+15] Alexandre Belloni, Victor Chernozhukov, Denis Chetverikov, and Kengo Kato. “Some new asymptotic theory for least squares series: pointwise and uniform results”. In:Journal of Econometrics186.2 (2015), pp. 345–366. [BH95] Yoav Benjamini and Yosef Hochberg. “Controlling the false discovery rate: a practical and powerful approach to multiple testing...
-
[7]
Gaussian approxima- tion for non-stationary time series with optimal rate and explicit construction
121 [BKW24] Soham Bonnerjee, Sayar Karmakar, and Wei Biao Wu. “Gaussian approxima- tion for non-stationary time series with optimal rate and explicit construction”. In: The Annals of Statistics52.5 (2024), pp. 2293–2317. [BRT12] Taoufik Bouezmarni, Jeroen V.K. Rombouts, and Abderrahim Taamouti. “Non- parametric copula-based test for conditional independen...
work page 2024
-
[8]
Panning for gold: model-X knockoffs for high dimensional controlled variable selection
13140/RG.2.2.36779.31523/1. [Can+18] Emmanuel J. Candés, Yingying Fan, Lucas Janson, and Jinchi Lv. “Panning for gold: model-X knockoffs for high dimensional controlled variable selection”. In: Journal of the Royal Statistical Society Series B: Statistical Methodology 80.3 (2018), pp. 551–577. [CZK24] Abhinav Chakraborty, Jeffrey Zhang, and Eugene Katsevi...
-
[9]
Cross validation for locally stationary processes
[DR19] Rainer Dahlhaus and Stefan Richter. “Cross validation for locally stationary processes”. In:Annals of Statistics47.4 (2019), pp. 2145–2173. 123 [DRW19] Rainer Dahlhaus, Stefan Richter, and Wei Biao Wu. “Towards a general theory for nonlinear locally stationary processes”. In:Bernoulli 25.2 (2019), pp. 1013–
work page 2019
-
[10]
Partial association measures and an application to qualitative regression
[Dau80] J. J. Daudin. “Partial association measures and an application to qualitative regression”. In:Biometrika 67.3 (1980), pp. 581–590. [DN20] Richard A. Davis and Mikkel S. Nielsen. “Modeling of time series using random forests: theoretical developments”. In:Electronic Journal of Statistics(2020), pp. 3644–3671. [DT22] Philip A. Dawid and Ambuj Tewari...
work page 1980
-
[11]
[DZ25] Xiucai Ding and Zhou Zhou. “On the partial autocorrelation function for locally stationary time series: characterization, estimation and inference”. In: Biometrika 112.2 (2025). [DZ21] Xiucai Ding and Zhou Zhou. “Simultaneous sieve inference for time- inhomogeneousnonlineartimeseriesregression”.arXivpreprintarXiv:2112.08545
-
[12]
A permutation-based kernel conditional independence test
[Dor+14] Gary Doran, Krikamol Muandet, Kun Zhang, and Bernhard Schölkopf. “A permutation-based kernel conditional independence test”. In:Proceedings of the 124 Thirtieth Conference on Uncertainty in Artificial Intelligence(2014), pp. 132–
work page 2014
-
[13]
Graphical modelling of multivariate time series
[Eic12] Michael Eichler. “Graphical modelling of multivariate time series”. In:Proba- bility Theory and Related Fields153 (2012), pp. 233–268. [FFX20] Jianqing Fan, Yang Feng, and Lucy Xia. “A projection-based conditional dependencemeasurewithapplicationstohigh-dimensionalundirectedgraphical models”. In:Journal of Econometrics218.1 (2020), pp. 119–139. [F...
work page 2012
-
[14]
Cyclostationarity: half a century of research
[GNP06] William A. Gardner, Antonio Napolitano, and Luigi Paura. “Cyclostationarity: half a century of research”. In:Signal Processing86.4 (2006), pp. 639–697. [Goe20] Benjamin Goehry. “Random forests for time-dependent processes”. In:ESAIM: Probability and Statistics24 (2020), pp. 801–826. [GMR93] Christian Gourieroux, Alain Monfort, and Eric Renault. “I...
work page 2006
-
[15]
Confidence intervals for partially identified parameters
[IM04] Guido W. Imbens and Charles F. Manski. “Confidence intervals for partially identified parameters”. In:Econometrica: Journal of the Econometric Society 72.6 (2004), pp. 1845–1857. [KKR24] Iden Kalemaj, Shiva Kasiviswanathan, and Aaditya Ramdas. “Differentially private conditional independence testing”. In:International Conference on Artificial Intel...
work page 2004
-
[16]
Rates of uniform convergence of empirical means with mixing processes
[KV02] Rajeeva L. Karandikar and Mathukumalli Vidyasagar. “Rates of uniform convergence of empirical means with mixing processes”. In:Statistics and Probability Letters58.3 (2002), pp. 297–307. [KW20] Sayar Karmakar and Wei Biao Wu. “Optimal Gaussian approximation for multiple time series”. In: 30.3 (2020), pp. 1399–1417. [Kas18] Maximilian Kasy. “Uniform...
work page 2002
-
[17]
[KKK22] Arun K. Kuchibhotla, John E. Kolassa, and Todd A. Kuffner. “Post-selection inference”. In:Annual Review of Statistics and Its Application9.1 (2022), pp. 505–527. [KBW23] Arun Kumar Kuchibhotla, Sivaraman Balakrishnan, and Larry Wasserman. “Median regularity and honest inference”. In:Biometrika 110.3 (2023), pp. 831–
work page 2022
-
[18]
Generalization bounds for non- stationary mixing processes
[KM17] Vitaly Kuznetsov and Mehryar Mohri. “Generalization bounds for non- stationary mixing processes”. In:Machine Learning 106.1 (2017), pp. 93–
work page 2017
-
[19]
Generalization bounds for time series prediction with non-stationary processes
[KM14] Vitaly Kuznetsov and Mehryar Mohri. “Generalization bounds for time series prediction with non-stationary processes”. In:Algorithmic Learning Theory25 (2014), pp. 260–274. [KM15] Vitaly Kuznetsov and Mehryar Mohri. “Learning theory and algorithms for forecasting non-stationary time series”. In:Advances in Neural Information Processing Systems28 (20...
work page 2014
-
[20]
Honest confidence regions for nonparametric regression
[Li89] Ker-Chau Li. “Honest confidence regions for nonparametric regression”. In: The Annals of Statistics17.3 (1989), pp. 1001–1008. [Li+11] Lingling Li, Eric Tchetgen Tchetgen, Aad van der Vaart, and James M. Robins. “Higher order inference on a treatment effect under low regularity conditions”. In: Statistics and Probability Letters81.7 (2011), pp. 821...
-
[21]
Learning the structure of a nonstationary vector autoregression
129 [MS19] Daniel Malinsky and Peter Spirtes. “Learning the structure of a nonstationary vector autoregression”. In:International Conference on Artificial Intelligence and Statistics89 (2019), pp. 2986–2994. [Man+24] Georg Manten, Cecilia Casolo, Emilio Ferrucci, Søren Wengel Mogensen, Cristo- pher Salvi, and Niki Kilbertus. “Signature kernel conditional ...
-
[22]
Distribution-free learning of Bayesian network structure in continuous domains
[Mar05] Dimitris Margaritis. “Distribution-free learning of Bayesian network structure in continuous domains”. In:AAAI 5 (2005), pp. 825–830. [McF89] Daniel McFadden. “A method of simulated moments for estimation of discrete response models without numerical integration”. In:Econometrica: Journal of the Econometric Society57.5 (1989), pp. 995–1026. [Mie24...
-
[23]
Cross-Fitting and Fast Remainder Rates for Semiparametric Estimation
[MS23] Fabian Mies and Ansgar Steland. “Sequential Gaussian approximation for nonstationary time series in high dimensions”. In:Bernoulli 29.4 (2023), pp. 3114–3140. [MK20] Mehryar Mohri and Vitaly Kuznetsov. “Discrepancy-based theory and algo- rithms for forecasting non-stationary time series”. In:Annals of Mathematics and Artificial Intelligence88.4 (20...
work page internal anchor Pith review Pith/arXiv arXiv 2023
-
[24]
Minimax optimal conditional independence testing
[NBW21] Matey Neykov, Sivaraman Balakrishnan, and Larry Wasserman. “Minimax optimal conditional independence testing”. In:The Annals of Statistics49.4 (2021), pp. 2151–2177. [Niu+24] Ziang Niu, Abhinav Chakraborty, Oliver Dukes, and Eugene Katsevich. “Rec- onciling model-X and doubly robust approaches to conditional independence testing”. In:The Annals of...
work page 2021
-
[25]
Lasso regression in sparse linear model with phi-mixing errors
[PZZ23] Ling Peng, Yan Zhu, and Wenxuan Zhong. “Lasso regression in sparse linear model with phi-mixing errors”. In:Metrika 86.1 (2023), pp. 1–26. [Pet+14] Jonas Peters, Joris M. Mooij, Dominik Janzing, and Bernhard Schölkopf. “Causal discovery with continuous additive noise models”. In:Journal of Machine Learning Research15.58 (2014), pp. 2009–2053. [Pis...
work page 2023
-
[26]
A Scalable Conditional Independence Test for Nonlinear, Non-Gaussian Data
[QKC15] Christopher J. Quinn, Negar Kiyavash, and Todd P. Coleman. “Directed information graphs”. In:IEEE Transactions on Information Theory61.12 (2015), pp. 6887–6909. [Ram14] Joseph D. Ramsey. “A scalable conditional independence test for nonlinear, non-gaussian data”. arXiv preprint arXiv:1401.5031
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[27]
Central limit theorems for local martingales
131 [Reb80] Rolando Rebolledo. “Central limit theorems for local martingales”. In: Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete51.3 (1980), pp. 269–286. [RGR22] Nicolas-Domenic Reiter, Andreas Gerhardus, and Jakob Runge. “Causal infer- ence for temporal patterns”. arXiv preprint arXiv:2205.15149
-
[28]
Improving the bandwidth-free inference methods by prewhitening
[RS13] Yeonwoo Rho and Xiaofeng Shao. “Improving the bandwidth-free inference methods by prewhitening”. In:Journal of Statistical Planning and Inference 143.11 (2013), pp. 1912–1922. [RWG19] Alessandro Rinaldo, Larry Wasserman, and Max G’Sell. “Bootstrapping and sample splitting for high-dimensional, assumption-lean inference”. In:The Annals of Statistics...
work page 2013
-
[29]
Causal network reconstruction from time series: from theoretical assumptions to practical estimation
[Run18a] Jakob Runge. “Causal network reconstruction from time series: from theoretical assumptions to practical estimation”. In:Chaos: An Interdisciplinary Journal of Nonlinear Science28.7 (2018). [Run18b] Jakob Runge. “Conditional independence testing based on a nearest-neighbor estimator of conditional mutual information”. In:International Conference o...
work page 2018
-
[30]
Nonparametric tests of conditional indepen- dence for time series
134 [SW21] Xiaojun Song and Haoyu Wei. “Nonparametric tests of conditional indepen- dence for time series”. arXiv preprint arXiv:2110.04847
-
[31]
[SA09] Ingo Steinwart and Marian Anghel. “Consistency of support vector machines for forecasting the evolution of an unknown ergodic dynamical system from observations with unknown noise”. In:Annals of Statistics37.2 (2009), pp. 841–
work page 2009
-
[32]
Learning from dependent obser- vations
[SHS09] Ingo Steinwart, Don Hush, and Clint Scovel. “Learning from dependent obser- vations”. In:Journal of Multivariate Analysis100.1 (2009), pp. 175–194. [SW07] Liangjun Su and Halbert White. “A consistent characteristic function-based test for conditional independence”. In:Journal of Econometrics141.2 (2007), pp. 807–834. [SW08] Liangjun Su and Halbert...
-
[33]
Lasso guarantees for beta- mixing heavy-tailed time series
[WLT20] Kam Chung Wong, Zifan Li, and Ambuj Tewari. “Lasso guarantees for beta- mixing heavy-tailed time series”. In:The Annals of Statistics48.2 (2020), pp. 1124–1142. [Wu11] Wei Biao Wu. “Asymptotic theory for stationary processes”. In:Statistics and its Interface4.2 (2011), pp. 207–226. [Wu05] Wei Biao Wu. “Nonlinear system theory: another look at depe...
work page 2020
-
[34]
Bootstrapping High Dimensional Time Series
Elsevier, 2012, pp. 187–209. [WW16] Wei-Biao Wu and Ying Nian Wu. “Performance bounds for parameter estimates of high-dimensional linear models with correlated errors”. In:Electronic Journal of Statistics10.1 (2016), pp. 352–379. [WZ24] Weichi Wu and Zhou Zhou. “Multiscale jump testing and estimation under complex temporal dynamics”. In:Bernoulli 30.3 (20...
work page internal anchor Pith review Pith/arXiv arXiv 2012
-
[35]
Another look at bandwidth-free inference: a sample splitting approach
[ZS24] Yi Zhang and Xiaofeng Shao. “Another look at bandwidth-free inference: a sample splitting approach”. In:Journal of the Royal Statistical Society Series B: Statistical Methodology86.1 (2024), pp. 246–272. [ZZZ22] Yeqing Zhou, Yaowu Zhang, and Liping Zhu. “A Projective Approach to Conditional Independence Test for Dependent Processes”. In:Journal of ...
work page 2024
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.