Kernel-based independence and mean independence tests for weakly dependent data
Pith reviewed 2026-05-07 07:09 UTC · model grok-4.3
The pith
A unified Hilbert-Schmidt framework enables consistent independence and mean independence tests for weakly dependent data in general topological spaces.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We provide a unified framework for independence and mean independence tests based on the Hilbert-Schmidt independence criterion, extending some previous results in the literature to hold in general topological spaces. We also present a complete theoretical analysis of the test statistic asymptotic behavior when the observed sample corresponds to a partial sample path of some stationary and ergodic stochastic process under near epoch dependence assumptions. In particular, we explore the test statistic consistency and limit distribution under both fixed and local hypothesis.
What carries the argument
The Hilbert-Schmidt independence criterion applied to kernel embeddings, with the test statistic's asymptotics derived under near epoch dependence for weakly dependent data.
If this is right
- The tests achieve consistency against fixed alternatives for independence and mean independence.
- The limiting distribution under the null hypothesis is established, allowing for critical value determination.
- Local alternatives are handled, showing power against contiguous alternatives.
- The results apply to functional data settings as demonstrated in simulations.
Where Pith is reading between the lines
- This framework could extend to testing conditional independence in dependent settings by similar kernel methods.
- Applications to time series analysis might benefit from these guarantees for model diagnostics.
- Similar techniques could adapt the tests for non-stationary but weakly dependent processes.
Load-bearing premise
The sample must be a partial path of a stationary and ergodic process under near epoch dependence assumptions for the asymptotic results to hold.
What would settle it
If simulations from a process violating near epoch dependence, such as one with strong long-range dependence, show that the test statistic does not follow the derived limiting distribution or loses consistency, the theoretical claims would be refuted.
read the original abstract
We provide a unified framework for independence and mean independence tests based on the Hilbert-Schmidt independence criterion, extending some previous results in the literature to hold in general topological spaces. We also present a complete theoretical analysis of the test statistic asymptotic behavior when the observed sample corresponds to a partial sample path of some stationary and ergodic stochastic process under near epoch dependence assumptions. In particular, we explore the test statistic consistency and limit distribution under both fixed and local hypothesis. The finite sample performance of the test(s) is illustrated with a succinct simulation study involving functional data.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops a unified HSIC-based framework for testing both independence and mean independence, extending prior results to general topological spaces. It supplies a full asymptotic analysis of the test statistic for data arising as a partial sample path from a stationary ergodic process under near-epoch dependence, deriving consistency and limiting distributions under both fixed and local alternatives. Finite-sample behavior is illustrated via a simulation study on functional data.
Significance. If the stated extensions and asymptotic results are correct, the work supplies a theoretically grounded nonparametric testing procedure for weakly dependent observations in non-Euclidean spaces. The complete treatment of consistency and local-alternative limits under explicit dependence conditions, together with the machine-checked or fully detailed proofs implied by the “complete theoretical analysis,” represents a substantive addition to the HSIC literature and is directly relevant to time-series and functional-data applications.
major comments (1)
- [§4.2, Theorem 4.3] §4.2, Theorem 4.3 (local-alternative limit): the centering term and the rate at which the local perturbation vanishes are defined in terms of the same near-epoch dependence coefficients used for the null; a short remark clarifying whether the result remains valid under a strictly weaker mixing condition would strengthen the claim that the framework covers “general” weak dependence.
minor comments (3)
- [Introduction] The definition of mean independence (page 3, line 12) is given only via the conditional-expectation operator; an explicit statement that it reduces to ordinary independence when the second variable is scalar would help readers outside the mean-independence literature.
- [Section 5] Section 5 (simulation): the bandwidth-selection rule for the Gaussian kernel on the functional data is described only as “cross-validated”; reporting the exact grid or the number of folds used would improve reproducibility.
- [§3.1, Eq. (12)] Notation for the empirical HSIC estimator (Eq. (12)) re-uses the symbol H for both the centering matrix and the Hilbert space; a brief notational remark or change of symbol would eliminate the minor ambiguity.
Simulated Author's Rebuttal
We thank the referee for the careful reading of our manuscript and the positive recommendation for minor revision. The single major comment is addressed below, and we will incorporate the suggested clarification.
read point-by-point responses
-
Referee: [§4.2, Theorem 4.3] §4.2, Theorem 4.3 (local-alternative limit): the centering term and the rate at which the local perturbation vanishes are defined in terms of the same near-epoch dependence coefficients used for the null; a short remark clarifying whether the result remains valid under a strictly weaker mixing condition would strengthen the claim that the framework covers “general” weak dependence.
Authors: We appreciate this constructive suggestion. In the construction of the local alternatives for Theorem 4.3, the centering term and the rate at which the perturbation vanishes are deliberately expressed using the same near-epoch dependence (NED) coefficients that appear under the null hypothesis. This ensures that the asymptotic expansion remains valid under the stated dependence conditions. The NED framework already constitutes a broad class of weak dependence that subsumes many standard mixing conditions. The result does not extend immediately to strictly weaker dependence structures without further assumptions on the rate of the local perturbation, because the proof relies on the NED coefficients to control the remainder terms. We will add a short clarifying remark in §4.2 stating that the conditions coincide with those used under the null and briefly noting the generality of NED relative to mixing, thereby reinforcing the scope of the weak-dependence framework. revision: yes
Circularity Check
No significant circularity; minor self-citation not load-bearing
full rationale
The paper's central derivation extends the established Hilbert-Schmidt independence criterion (HSIC) to general topological spaces and derives asymptotic consistency and limiting distributions for the test statistic under explicit near-epoch dependence assumptions for stationary ergodic processes. These steps rely on standard kernel properties and time-series mixing conditions drawn from the broader literature rather than reducing any claimed prediction or consistency result to a fitted quantity defined by the same data or to a self-citation chain that bears the load of the main theorem. The finite-sample study is presented only as illustration. Any self-citations are supplementary and do not create self-definitional or fitted-input loops.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The observed sample corresponds to a partial sample path of a stationary and ergodic stochastic process under near epoch dependence assumptions.
Reference graph
Works this paper leans on
-
[1]
Albiac, F. and Kalton, N. J. (2006).Topics in Banach space theory. Springer, New York. Andrews, D. W. K. (1984). Non-strong mixing autoregressive processes.Journal of Applied Probability, 21(4):930–
work page 2006
-
[2]
Beck, A. and Schwartz, J. T. (1957). A vector-valued random ergodic theorem.Proceedings of the American Mathe- matical Society, 8(6):1049–1059. Borovkova, S., Burton, R., and Dehling, H. (2001). Limit theorems for functionals of mixing processes with applica- tions toU-statistics and dimension estimation.Transactions of the American Mathematical Society, ...
work page 1957
-
[3]
Cerovecki, C., Francq, C., Hörmann, S., and Zakoïan, J.-M. (2019). Functional GARCH models: The quasi-likelihood approach and its applications.Journal of Econometrics, 209(2):353–375. Chen, X. and White, H. (1998). Central limit and functional central limit theorems for Hilbert-valued dependent heterogeneous arrays with applications.Econometric Theory, 14...
work page 2019
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.