pith. sign in

arxiv: 2506.17463 · v3 · submitted 2025-06-20 · 🧮 math.ST · stat.ME· stat.TH

Testing Separability of High-Dimensional Covariance Matrices

Pith reviewed 2026-05-19 07:53 UTC · model grok-4.3

classification 🧮 math.ST stat.MEstat.TH
keywords separability testinghigh-dimensional covariancecore covariance matrixsphericity testmatrix-variate datainvariant testsasymptotic spectral equivalence
0
0 comments X

The pith

Testing separability of a high-dimensional covariance matrix is equivalent to testing sphericity of its core component.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that checking whether the covariance matrix Σ for matrix-variate data is separable is the same as checking whether its core component is spherical. This equivalence produces test statistics that remain well-defined when the dimension is large relative to sample size and whose distributions under the null do not depend on unknown parameters. Researchers can therefore simulate the null distribution exactly and control error rates precisely. The authors also prove that the sample covariance and its core are asymptotically equivalent in spectrum when the ratio of dimension to sample size converges to a positive constant, which justifies the use of these tests in the high-dimensional regime. Numerical comparisons indicate that the resulting procedures detect non-separability more often than earlier methods.

Core claim

We show that testing separability of Σ is equivalent to testing sphericity of its core component. With this insight, we construct test statistics that are well-defined in high-dimensional settings and have distributions that are invariant under the null hypothesis of separability, allowing for exact simulation of null distributions. We establish the asymptotic properties of some test statistics by proving the asymptotic spectral equivalence between the sample covariance matrix and its core in a p/n→γ∈(0,∞) regime.

What carries the argument

The core covariance matrix, a complementary object to a separable covariance matrix, carries the argument by converting separability testing into an equivalent sphericity test whose null distribution is parameter-free.

If this is right

  • Test statistics become invariant under the null, so their critical values can be obtained by direct simulation without estimating nuisance parameters.
  • The tests remain valid when the ratio of variables to observations converges to any positive finite limit.
  • The same equivalence supplies a route to other high-dimensional tests by replacing separability with other structural hypotheses on the core.
  • Numerical evidence indicates these procedures detect departures from separability more reliably than prior methods for the same sample sizes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The reduction to a sphericity problem on the core may simplify the derivation of tests for related covariance structures such as Kronecker or factor models.
  • Applied researchers working with matrix-valued observations could use the exact null simulation to set precise significance levels in settings where parameter-dependent approximations have been unreliable.
  • Finite-sample refinements that exploit the exact invariance might further improve performance when the dimension-to-sample ratio is moderate rather than large.

Load-bearing premise

The core covariance matrix is defined so that separability of the original matrix holds exactly if and only if the core is spherical, and the asymptotic spectral equivalence between sample and population versions is valid when dimension over sample size approaches a positive constant.

What would settle it

A Monte Carlo experiment in which the proposed test statistic under the null of separability produces a distribution that still depends on unknown parameters, or in which the tests do not show higher power than existing procedures, would falsify the central claims.

Figures

Figures reproduced from arXiv: 2506.17463 by Bongjung Sung, Peter D. Hoff.

Figure 1
Figure 1. Figure 1: Empirical power of the separability tests under Np1×p2 (0, C1,w) as functions of n across each w = 0.2, 0.4, 0.6, 0.8, and ˆγ := ˆγ1γˆ2 = p1p2/n. The tests ϕ1, ϕ2, and ϕ3 are those proposed in this article. LRT and PTCLT denote the tests by [41, 34] and [3], respectively. LRT is not present when ˆγ > 1, since its test statistic is not well-defined in this case [PITH_FULL_IMAGE:figures/full_fig_p018_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Empirical power of the separability tests under Np1×p2 (0, C2,w) as functions of n across each w = 0.2, 0.4, 0.6, 0.8, and ˆγ := ˆγ1γˆ2 = p1p2/n. For the notations of ϕ1, ϕ2, ϕ3, LRT, and PTCLT, see [PITH_FULL_IMAGE:figures/full_fig_p018_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Empirical power of the separability tests under Np1×p2 (0, C1,w) as functions of w across (ˆγ1, γˆ2). Here n = 256 and (p1, p2) is determined according to the value of (ˆγ1, γˆ2). For the notations of ϕ1, ϕ2, ϕ3, LRT, and PTCLT, see [PITH_FULL_IMAGE:figures/full_fig_p019_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Empirical power of the separability tests under Np1×p2 (0, C1,w) as functions of w across (ˆγ1, γˆ2). Here n = 256 and (p1, p2) is determined according to the value of (ˆγ1, γˆ2). For the notations of ϕ1, ϕ2, ϕ3, LRT, and PTCLT, see [PITH_FULL_IMAGE:figures/full_fig_p019_4.png] view at source ↗
read the original abstract

Due to their parsimony, separable covariance models have been popular in modeling matrix-variate data. However, the inference from such a model may be misleading if the population covariance matrix $\Sigma$ is actually non-separable, motivating the use of statistical tests of separability. The existing separability tests suffer mainly from two issues: 1) test statistics that are not well-defined in high-dimensional settings, 2) low power for small sample sizes and null distributions that depend on unknown parameters, preventing exact error rate control. To address these issues, we propose novel invariant tests using the core covariance matrix, a complementary notion to a separable covariance matrix. We show that testing separability of $\Sigma$ is equivalent to testing sphericity of its core component. With this insight, we construct test statistics that are well-defined in high-dimensional settings and have distributions that are invariant under the null hypothesis of separability, allowing for exact simulation of null distributions. We establish the asymptotic properties of some test statistics by proving the asymptotic spectral equivalence between the sample covariance matrix and its core in a $p/n\rightarrow\gamma\in(0,\infty)$ regime. The large power of our proposed tests relative to existing procedures is demonstrated numerically.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes a new approach to testing separability of high-dimensional covariance matrices Σ by introducing the complementary notion of a core covariance matrix. It claims that testing separability of Σ is exactly equivalent to testing sphericity of the core component. This equivalence is used to construct invariant test statistics that remain well-defined when p/n → γ ∈ (0, ∞), possess null distributions free of unknown parameters (hence exactly simulable), and satisfy asymptotic spectral equivalence between the sample covariance and the core. Numerical experiments are presented to illustrate higher power relative to existing procedures.

Significance. If the claimed exact equivalence and the asymptotic spectral equivalence hold under the stated high-dimensional regime, the work supplies a principled route to parameter-free, simulable null distributions for separability testing. This would address two documented shortcomings of prior tests (ill-defined statistics in high dimensions and parameter-dependent nulls) and could be useful in matrix-variate applications where Kronecker structure is assumed but must be verified.

major comments (2)
  1. [Abstract and §2] Abstract and §2 (definition of core): the central claim that separability of Σ is equivalent to sphericity of the core holds only if the core is constructed so that any separable Σ maps precisely to a multiple of the identity and any non-separable Σ maps to a non-spherical matrix, without implicit normalizations or additional eigenvalue-separation assumptions. The manuscript must exhibit the explicit mapping and verify that no hidden parameters remain.
  2. [§4] §4 (asymptotic spectral equivalence): the transfer of null-distribution properties to the proposed test statistics relies on proving that the sample covariance and its core are asymptotically spectrally equivalent when p/n → γ. The proof must state the precise moment and eigenvalue conditions under which this equivalence is unconditional; otherwise the invariance and exact simulation claims become conditional.
minor comments (2)
  1. [§2] Notation for the core matrix should be introduced with a clear symbol (e.g., Σ_core) and distinguished from the separable factors at first use.
  2. [Numerical experiments] The numerical section would benefit from reporting the exact simulation size used for critical values and the number of Monte Carlo replications.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate how we will revise the paper accordingly.

read point-by-point responses
  1. Referee: [Abstract and §2] Abstract and §2 (definition of core): the central claim that separability of Σ is equivalent to sphericity of the core holds only if the core is constructed so that any separable Σ maps precisely to a multiple of the identity and any non-separable Σ maps to a non-spherical matrix, without implicit normalizations or additional eigenvalue-separation assumptions. The manuscript must exhibit the explicit mapping and verify that no hidden parameters remain.

    Authors: We agree that the equivalence requires explicit verification. In the revised manuscript we will expand Section 2 to include the full construction of the core component and a direct proof that Σ is separable if and only if its core is a scalar multiple of the identity. The argument will rely solely on the given definitions and will confirm the absence of hidden normalizations or eigenvalue-separation assumptions. revision: yes

  2. Referee: [§4] §4 (asymptotic spectral equivalence): the transfer of null-distribution properties to the proposed test statistics relies on proving that the sample covariance and its core are asymptotically spectrally equivalent when p/n → γ. The proof must state the precise moment and eigenvalue conditions under which this equivalence is unconditional; otherwise the invariance and exact simulation claims become conditional.

    Authors: We accept the need for explicit conditions. We will revise the statement of the main theorem in Section 4 to list the precise assumptions—finite fourth moments of the entries and eigenvalues of Σ bounded away from zero and infinity—under which the asymptotic spectral equivalence holds unconditionally. This will make clear that the invariance and exact-simulation properties are valid under these stated conditions. revision: yes

Circularity Check

1 steps flagged

Equivalence of separability testing to core sphericity is by construction via complementary definition of the core

specific steps
  1. self definitional [Abstract]
    "we propose novel invariant tests using the core covariance matrix, a complementary notion to a separable covariance matrix. We show that testing separability of Σ is equivalent to testing sphericity of its core component."

    The core is introduced as the complementary notion to separability; the claimed equivalence therefore holds exactly by how the core is defined (any separable Σ produces a spherical core and vice versa). The 'show that' step is thus a restatement of the definition rather than a derived result from matrix properties or external theorems.

full rationale

The paper introduces the core covariance matrix as a 'complementary notion to a separable covariance matrix' and then states that testing separability of Σ is equivalent to testing sphericity of the core. This equivalence is load-bearing for the subsequent construction of invariant test statistics and the claim of exact null distribution simulation. Because the core is explicitly positioned as complementary, the mapping (separable Σ maps to spherical core, non-separable maps to non-spherical) is built into the definition rather than independently derived from first principles or external properties. The asymptotic spectral equivalence result between sample covariance and core is presented as a separate technical contribution, but the foundational population-level equivalence reduces to the definitional choice. This produces partial circularity: the central insight is self-definitional while later asymptotic and simulation steps retain independent content.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the definition of the core covariance matrix as a complementary notion to separability and on standard assumptions for covariance matrices in high-dimensional settings. No free parameters are mentioned. The core matrix itself functions as an invented entity whose independent evidence is not provided in the abstract.

axioms (1)
  • domain assumption Covariance matrices are symmetric positive semi-definite and the observations follow a distribution permitting the stated spectral equivalences.
    Standard background assumption invoked for the asymptotic results in the p/n → γ regime.
invented entities (1)
  • core covariance matrix no independent evidence
    purpose: Complementary notion that reduces separability testing to sphericity testing and enables invariant statistics.
    Introduced in the paper as the key device for constructing the new tests.

pith-pipeline@v0.9.0 · 5742 in / 1379 out tokens · 38503 ms · 2026-05-19T07:53:23.535072+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

76 extracted references · 76 canonical work pages

  1. [1]

    Allen and Robert Tibshirani

    Genevera I. Allen and Robert Tibshirani. Inference with transposable data: modelling the effects of row and column correlations. J. R. Stat. Soc., B: Stat. Methodol. , 74(4):721–743, 2012

  2. [2]

    An Introduction to Multivariate Statistical Analysis

    Theodore Wilbur Anderson. An Introduction to Multivariate Statistical Analysis. Wiley, 2003

  3. [3]

    John A. D. Aston, Davide Pigoli, and Shahin Tavakoli. Tests for separability in nonparametric covariance operators of random surfaces. Ann. Stat., 45(4):1431–1461, 2017

  4. [4]

    On sample eigenvalues in a generalized spiked population model

    Zhidong Bai and Jianfeng Yao. On sample eigenvalues in a generalized spiked population model. J. Multivar. Anal. , 106:167–177, 2012. 22

  5. [5]

    Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices

    Jinho Baik, G´ erard Ben Arous, and Sandrine P´ ech’e. Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices. Ann. Probab., 33(5):1643–1697, 2005

  6. [6]

    Silverstein

    Jinho Baik and Jack W. Silverstein. Eigenvalues of large sample covariance matrices of spiked population models. J. Multivar. Anal. , 97(6):1382–1408, 2006

  7. [7]

    Universality for the largest eigenvalue of sample covariance matrices with general population

    Zhigang Bao, Guangmin Pan, and Wang Zhou. Universality for the largest eigenvalue of sample covariance matrices with general population. Ann. Stat., 43(1):382–421, 2015

  8. [8]

    Tony Cai, Xiao Han, and Guangming Pan

    T. Tony Cai, Xiao Han, and Guangming Pan. Limiting laws for divergent spiked eigenvalues and largest non-spiked eigenvalue of sample covariance matrices. Ann. Stat., 48(3):1255–1280, 2020

  9. [9]

    Testing separability of space–time functional processes

    Panayiotis Constantinou, Piotr Kokoszka, and Matthew Reimherr. Testing separability of space–time functional processes. Biometrika, 104(2):425–437, 2017

  10. [10]

    Maximum likelihood estimation for matrix normal models via quiver representations

    Harm Derksen and Visu Makam. Maximum likelihood estimation for matrix normal models via quiver representations. SIAM J. Appl. Algebra Geom. , 5(2):338–365, 2021

  11. [11]

    Maximum likelihood estimation for tensor normal models via castling transforms

    Harm Derksen, Visu Makam, and Michael Walter. Maximum likelihood estimation for tensor normal models via castling transforms. Forum math. Sigma., 10(50), 2022

  12. [12]

    Some sphericity tests for high dimensional data based on ratio of the traces of sample covariance matrices

    Xue Ding. Some sphericity tests for high dimensional data based on ratio of the traces of sample covariance matrices. Stat. Probab. Lett., 156:108613, 2020

  13. [13]

    Rational maximum likelihood estimators of kronecker covariance matrices

    Mathias Drton, Alexandros Grosdos, and Andrew McCormack. Rational maximum likelihood estimators of kronecker covariance matrices. Algebr. Stat., 15(1):144–163, 2024

  14. [14]

    Existence and uniqueness of the kronecker covariance mle

    Mathias Drton, Satoshi Kuriki, and Peter Hoff. Existence and uniqueness of the kronecker covariance mle. Ann. Stat., 49(5):2721–2754, 2021

  15. [15]

    Fisher, Xiaoqian Sun, and Colin M

    Thomas J. Fisher, Xiaoqian Sun, and Colin M. Gallagher. A new test for sphericity of the covariance matrix for high dimensional data. J. Multivar. Anal. , 101(10):2554–2570, 2010

  16. [16]

    A higher-order lq decomposition for separable covariance models

    David Gerard and Peter Hoff. A higher-order lq decomposition for separable covariance models. Linear Algebra Appl., 505(15):57–84, 2016

  17. [17]

    Testing the first- order separability hypothesis for spatio-temporal point patterns

    Mohammad Ghorbani, Nafiseh Vafae, Jiˇ r´ ı Dvoˇ r´ ak, and Mari Myllym¨ aki. Testing the first- order separability hypothesis for spatio-temporal point patterns. Comput. Stat. Data. Anal. , 161:107245, 2021

  18. [18]

    Nonseparable, stationary covariance functions for space–time data

    Tilmann Gneiting. Nonseparable, stationary covariance functions for space–time data. J. Am. Stat. Assoc., 97(458):590–600, 2002

  19. [19]

    Genton, and Peter Guttorp

    Tilmann Gneiting, Marc G. Genton, and Peter Guttorp. Geostatistical space–time models, stationarity, separability, and full symmetry. Monogr. Stat. Appl. Probab., 107:151–175, 2007

  20. [20]

    Rungang Han, Pixu Shi, and Anru R. Zhang. Guaranteed functional tensor singular value decomposition. J. Am. Stat. Assoc. , 119(546):995–1007, 2024

  21. [21]

    Asymptotic independence of point process and frobe- nius norm of a large sample covariance matrix

    Johannes Heiny and Carolin Kleemann. Asymptotic independence of point process and frobe- nius norm of a large sample covariance matrix. arXiv preprint arXiv:2302.13914 , 2023. 23

  22. [22]

    Peter Hoff, Andrew Mccormack, and Anru R. Zhang. Core shrinkage covariance estimation for matrix-variate data. J. R. Stat. Soc. Ser. B Methodol. , 85(5):1659–1679, 2023

  23. [23]

    Peter D. Hoff. Separable covariance arrays via the tucker product, with applications to mul- tivariate relational data. Bayesian Anal., 6(2):179–196, 2011

  24. [24]

    Testing stationarity of functional time series

    Lajos Horv´ ath, Piotr Kokoszka, and Gregory Rice. Testing stationarity of functional time series. J. Econom., 179(1):66–82, 2014

  25. [25]

    Generalized four moment theorem and an application to clt for spiked eigenvalues of high-dimensional covariance matrices.Bernoulli, 27(1):274–294, 2021

    Dandan Jiang and Zhidong Bai. Generalized four moment theorem and an application to clt for spiked eigenvalues of high-dimensional covariance matrices.Bernoulli, 27(1):274–294, 2021

  26. [26]

    Johnstone

    Iain M. Johnstone. On the distribution of the largest eigenvalue in principal components analysis. Ann. Stat., 29(2):295–327, 2001

  27. [27]

    Probability Theory

    Achim Klenke. Probability Theory. Springer-Verlag, 2008

  28. [28]

    Tracy-widom distribution for the largest eigenvalue of real sample covariance matrices with general population

    Ji Oon Lee and Kevin Schnelli. Tracy-widom distribution for the largest eigenvalue of real sample covariance matrices with general population. Ann. Appl. Probab., 26(6):3786—-3839, 2016

  29. [29]

    Lehmann and Joseph P

    E.L. Lehmann and Joseph P. Romano. Testing statistical hypotheses. Springer, 2022

  30. [30]

    Testing the sphericity of a covariance matrix when the dimension is much larger than the sample size

    Zeng Li and Jianfeng Yao. Testing the sphericity of a covariance matrix when the dimension is much larger than the sample size. Electron. J. Stat., 10(2):2973–3010, 2016

  31. [31]

    Lindquist

    Martin A. Lindquist. The statistical analysis of fmri data. Stat. Sci., 23(4):439–464, 2008

  32. [32]

    Linton and Haihan Tang

    Oliver B. Linton and Haihan Tang. Estimation of the kronecker covariance model by quadratic form. Econom. Theory, 38(5):1014–1067, 2022

  33. [33]

    Van Loan

    Charles F. Van Loan. The ubiquitous kronecker product. J. Comput. Appl. Math. , 123(1):85– 100, 2000

  34. [34]

    Zimmerman

    Nelson Lu and Dale L. Zimmerman. The likelihood ratio test for a separable covariance matrix. Stat. Probab. Lett., 73(4):449–457, 2005

  35. [35]

    Simultaneous singular value decomposition

    Takanori Maehara and Kazuo Murota. Simultaneous singular value decomposition. Linear Algebra Appl., 435(1):106–116, 2011

  36. [36]

    Manceur and Pierre Dutilleul

    A.M. Manceur and Pierre Dutilleul. Unbiased modified likelihood ratio tests for simple and double separability of a variance–covariance structure.Stat. Probab. Lett., 83(2):631–636, 2013

  37. [37]

    Manceur and Pierre Dutilleul

    Ameur M. Manceur and Pierre Dutilleul. Maximum likelihood estimation for the tensor normal distribution: Algorithm, minimum sample size, and empirical bias and dispersion. J. Comput. Appl. Math., 239(1):37–49, 2013

  38. [38]

    The eigenvalue distribution in some ensembles of random matrices

    Volodymyr Marchenko and Leonid Pastur. The eigenvalue distribution in some ensembles of random matrices. Math. USSR Sbornik , 1:457–483, 1967

  39. [39]

    Panaretos

    Tomas Masak and Victor M. Panaretos. Random surface covariance estimation by shifted partial tracing. J. Am. Stat. Assoc. , 118(544):2562–2574, 2023

  40. [40]

    Panaretos

    Tomas Masak, Soham Sarkar, and Victor M. Panaretos. Separable expansions for covariance estimation via the partial inner product. Biometrika, 110(1):225–247, 2023. 24

  41. [41]

    Mitchell, Marc G

    Matthew W. Mitchell, Marc G. Genton, and Marcia L. Gumpertz. A likelihood ratio test for separability of covariances. J. Multivar. Anal. , 97(5):1025–1043, 2006

  42. [42]

    Davide Pigoli, John A. D. Aston, Ian L. Dryden, and Piercesare Secchi. Distances and inference for covariance operators. Biometrika, 101(2):409–422, 2014

  43. [43]

    Dimension-free structured covariance estimation

    Nikita Puchkin and Maxim Rakhuba. Dimension-free structured covariance estimation. Proc. Mach. Learn. Res., 247:1–31, 2024

  44. [44]

    de Mucnk, and Mathisca C.M

    Beata Ro´ s, Fetsje Bijma, Jan C. de Mucnk, and Mathisca C.M. de Gunst. Existence and uniqueness of the maximum likelihood estimator for models with a kronecker product covari- ance structure. J. Multivar. Anal. , 143:345–361, 2016

  45. [45]

    Hanson-wright inequality and sub-gaussian concentra- tion

    Mark Rudelson and Roman Vershynin. Hanson-wright inequality and sub-gaussian concentra- tion. Electron. Commun. Probab., 18(82):1–9, 2013

  46. [46]

    Silverstein and Bai Z

    Jack W. Silverstein and Bai Z. D. On the empirical distribution of eigenvalues of a class of large dimensional random matrices,. J. Multivar. Anal. , 54(2):175–192, 1995

  47. [47]

    Simpson, Lloyd J

    Sean L. Simpson, Lloyd J. Edwards, Martin A. Styner, and Keith E. Muller. Separability tests for high-dimensional, low-sample size multivariate repeated measures data. J. Appl. Stat. , 41(11):2450–2461, 2014

  48. [48]

    Soloveychik and D

    I. Soloveychik and D. Trushin. Gaussian and robust kronecker product covariance estimation: Existence and uniqueness. J. Multivar. Anal. , 149:92–113, 2016

  49. [49]

    covsep: Tests for Determining if the Covariance Structure of 2-Dimensional Data is Separable, 2018

    Shahin Tavakoli. covsep: Tests for Determining if the Covariance Structure of 2-Dimensional Data is Separable, 2018. R package version 1.1.0

  50. [50]

    Tracy and Harold Widom

    Craig A. Tracy and Harold Widom. Level-spacing distributions and the airy kernel. Commun. Math. Phys., 159(1):151–174, 1994

  51. [51]

    Hero III, and Shuheng Zhou

    Theodoros Tsiligkaridis, Alfred O. Hero III, and Shuheng Zhou. On convergence of kronecker graphical lasso algorithms. IEEE Trans. Signal Process., 61(7):1743–1755, 2013

  52. [52]

    Hero III

    Theodoros Tsiligkaridis and Alfred O. Hero III. Covariance estimation in high dimensions via kronecker product expansions. IEEE Trans. Signal Process., 61(21):5347–5360, 2013

  53. [53]

    High-dimensional probability: An introduction with applications in data science, volume 47

    Roman Vershynin. High-dimensional probability: An introduction with applications in data science, volume 47. Cambridge university press, 2018

  54. [54]

    On the sphericity test with large-dimensional observations

    Qinwen Wang and Jianfeng Yao. On the sphericity test with large-dimensional observations. Electron. J. Stat., 7:2164–2192, 2013

  55. [55]

    High-dimensional sphericity test by extended likelihood ratio

    Zhendong Wang and Xingzhong Xu. High-dimensional sphericity test by extended likelihood ratio. Metrika, 84:1169—-1212, 2021

  56. [56]

    On estimation of covariance matrices with kronecker product structure

    Karl Werner, Magnus Jansson, and Petre Stoica. On estimation of covariance matrices with kronecker product structure. IEEE Trans. Signal Process., 56(2):478–491, 2008

  57. [57]

    Testing kronecker product covariance matrices for high-dimensional matrix-variate data

    Long Yu, Jiahui Xie, and Wang Zhou. Testing kronecker product covariance matrices for high-dimensional matrix-variate data. Biometrika, 110(3):799––814, 2023. 25 Appendix In Appendix, we provide the proofs of the theoretical results and additional theoretical results in the main text (Section A), miscellaneous figures and tables supporting the results and...

  58. [58]

    For a core covariance matrix with a rank−r partial isotropic structure, assume p1 = p2r without loss of generality

    Lastly, if C has a rank −2 partial isotropic structure with p1 = p2, using β = det(A⊤A) in (24) and noting that tr( A⊤A) = p2 1(1 − λ), the result follows. For a core covariance matrix with a rank−r partial isotropic structure, assume p1 = p2r without loss of generality. Let ¯A = [vec( ¯A1), . . . ,vec( ¯Ar)] for ¯Ai in the proof of Proposition 2 for a co...

  59. [59]

    If (j, k) = (u, v), j = k, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤a)2] = E   p1X i1,i2=1 a2 i1a2 i2   = p1E[a4 1] + p1(p1 − 1)E[a2 1]E[a2 2] = p1ν4 + p1(p1 − 1)

  60. [60]

    If j = k, u = v, and j ̸= u, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤a)(b⊤b)] = E[a⊤a]E(b⊤b)] = E2[a⊤a] = p1X i=1 E[a2 i ] !2 = p2 1

  61. [61]

    If (j, k) = (u, v), j ̸= k, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)2] = E   p1X i1,i2=1 ai1bi1ai2bi2   = p1E[a2 1b2 1] + p1(p1 − 1)E[a1b1a2b2] = p1. 38

  62. [62]

    For instance, if j = u and (j − k)(j − v)(k − v) ̸= 0, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)(a⊤c)] = E   p1X i1,i2=1 ai1bi1ai2ci2   = nX i1,i2=1 E [ai1ai2] E[bi1]E[ci2] = 0

    Otherwise, using the independence of entries of a, b, c, d yields that E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = 0. For instance, if j = u and (j − k)(j − v)(k − v) ̸= 0, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)(a⊤c)] = E   p1X i1,i2=1 ai1bi1ai2ci2   = nX i1,i2=1 E [ai1ai2] E[bi1]E[ci2] = 0. One can compute the expectation for other remaining cases using the independence. A...

  63. [63]

    If (j, k) = (u, v), j = k, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)2] = p1

  64. [64]

    If j = k, u = v, and j ̸= u, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)(c⊤d)] = E[a⊤b]E[c⊤d] = 0

  65. [65]

    If (j, k) = (u, v), j ̸= k, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)2] = p1

  66. [66]

    To prove the second assertion, partition Ip as Ip = diag(Ip1,

    Otherwise, as an analogy to the item 4), one can verify that E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = 0. To prove the second assertion, partition Ip as Ip = diag(Ip1, . . . , Ip1 | {z } p2 ). Then R(Ip) =   W1 ... Wp2   ⇒ R(Ip)R(Ip)⊤ =   W1W ⊤ 1 · · · W1W ⊤ p2 ... ... ... Wp2W ⊤ 1 · · · Wp2W ⊤ p2   , where the row of Wi ∈ Rp2×p2 1 is vec(Ip1) on ith row and ...

  67. [67]

    To bound W11,11, (I) = nVar (a⊤a)2 = nVar   p1X i=1 a4 i + 2 X 1≤i<j≤p1 a2 i a2 j   ≤ 2nVar p1X i=1 a4 i ! + 16nVar   X 1≤i<j≤p1 a2 i a2 j   ≲ np1ν8 + 16nE[ X i1<j1,i2<j2 a2 i1a2 j1a2 i2a2 j2] ≲ np1ν8 + np4 1 = O(np4 1). (34) 41 Since j = k = u = v = 1, we have that (II) + (III) = 2n(n − 1)Var (a⊤b)2 = 2n(n − 1)Var   p1X i=1 a2 i b2 i + 2 X 1≤i<...

  68. [68]

    To bound W11,22, (I) = nVar (a⊤a)(b⊤b) ≤ nE[(a⊤a)2(b⊤b)2] = nE[(a⊤a)2]E[(b⊤b)2] = O(np4 1), (II) + (III) = 2n(n − 1)Var (a⊤b)(c⊤d) ≤ n(n − 1)E[(a⊤b)2(c⊤d)2] = 2n(n − 1)E2[(a⊤b)2] ≤ n2 E[ p1X i1=1 a2 i1b2 i1] !2 = O(n2p2 1). (37) 42 On the other hand, (IV) = 4n(n − 1)Cov (a⊤a)(b⊤b), (a⊤c)(b⊤d) = 4n(n − 1) E[(a⊤a)(b⊤b)(a⊤c)(b⊤d)] − E[(a⊤a)(b⊤b)]E[(a⊤c)]E[(b...

  69. [69]

    (39) As an analogy to (38), one may see that (III) = (IV) = (V) = 0

    To bound W11,12, (I) = nVar (a⊤a)(a⊤b) ≤ nE[(a⊤a)2(a⊤b)2] = nE[ p1X i1,i2,i3,i4=1 a2 i1a2 i2ai3bi3ai4bi4] = nE[ p1X i1,i2,i3=1 a2 i1a2 i2a2 i3b2 i3] = O(np3 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) ≤ n(n − 1)E[(a⊤b)2(a⊤c)2] ≤ n2E[ p1X i1,i2,i3,i4=1 ai1bi1ai2bi2ai3ci3ai4ci4] = n2E[ p1X i1,i3=1 a2 i1b2 i1a2 i3c2 i3] = O(n2p2 1). (39) As an analogy to (38), one may...

  70. [70]

    Again one may observe that (III) = (IV) = (V) = 0

    To bound W11,23, (I) = nVar (a⊤a)(b⊤c) = nE[(a⊤a)2(b⊤c)2] = nE[(a⊤a)2]E[(b⊤c)2] = O(np3 1), (II) = n(n − 1)Var (a⊤b)(c⊤d) = O(n2p2 1), (40) where the computation of (II) follows from (37). Again one may observe that (III) = (IV) = (V) = 0. Hence, from (40), W11,23 = O(np3 1 + n2p2 1). 43

  71. [71]

    (41) Also, (36) implies that (III) = n(n − 1)Cov (a⊤b)2, (c⊤d)2 = 0, (IV) = 4n(n − 1)Cov (a⊤b)2, (a⊤c)2 = O(n2p2 1), (V) = 4n(n − 1)(n − 2)Cov (a⊤b)2, (a⊤c)2 = O(n3p2 1)

    To bound W12,12, deduce from (35) and (39) that (I) = nVar (a⊤b)2 = O(np2 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) = O(n2p2 1). (41) Also, (36) implies that (III) = n(n − 1)Cov (a⊤b)2, (c⊤d)2 = 0, (IV) = 4n(n − 1)Cov (a⊤b)2, (a⊤c)2 = O(n2p2 1), (V) = 4n(n − 1)(n − 2)Cov (a⊤b)2, (a⊤c)2 = O(n3p2 1). (42) From (41) and (42), we have that W12,12 = O(n3p2 1)

  72. [72]

    (43) As the same with the items 3) and 4), one can verify that (III) = (IV) = (V) = 0

    To bound W12,13, (I) = nVar (a⊤b)(a⊤c) = O(np2 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) = O(n2p2 1). (43) As the same with the items 3) and 4), one can verify that (III) = (IV) = (V) = 0. Hence, from (43), W12,13 = O(n2p2 1)

  73. [73]

    (44) Again we have that (III) = (IV) = (V) = 0

    To bound W12,34, one can deduce from (37) that (I) = nVar (a⊤b)(c⊤d) = O(np2 1), (II) = n(n − 1)Var (a⊤b)(c⊤d) = O(n2p2 1). (44) Again we have that (III) = (IV) = (V) = 0. Thus, (44) implies that W12,34 = O(n2p2 1). Therefore, we verified (30) from the items 1)–7). Using the results from Lemma 4–5, we prove Theorem 5 and Corollary 3. Proof of Theorem 5 No...

  74. [74]

    47 A.5 Singular Values of R(C) In this subsection, we provide some results on the singular values of R(C)

    log 12− c n3/2δ K2 ! ≲ 2 exp n log 12 − c n3/2δ K2 ! ≲ 2 exp −˜cn3/2δ , where the first ≲ holds because (A1) implies that p2 1 + p2 2 ≍ n, and the second ≲ holds for any constant ˜c ∈ (0, c/K2). 47 A.5 Singular Values of R(C) In this subsection, we provide some results on the singular values of R(C). As discussed in Section 4.3, we show that σ1(R(C)) scal...

  75. [75]

    We examine whether the above result holds when Σ is a core covariance matrix with a rank −r partial isotropic structure, focusing on r = 1 , 2

    derived the sample eigenvalue bias, as follows (see Theorem 1.1–1.2 of [6]): for j ∈ J i, ℓj a.s.→ ( σ2 ai + γai ai−σ2 , if ai > 1 + √γ, σ2(1 + √γ)2, o.t. . We examine whether the above result holds when Σ is a core covariance matrix with a rank −r partial isotropic structure, focusing on r = 1 , 2. To align with the assumption of Σ, we set λ = 1/(1 + rc/...

  76. [76]

    Here ( p1, p2, n) = (20, 20, 1600)

    in Proposition 3. Here ( p1, p2, n) = (20, 20, 1600). We conclude this section with the figures that numerically verify the results of Corollary 2 and Theorem 5, as well as the table that demonstrates the consistency of T3. To verify Corollary 2, we generate the test statistic T1 with (p1, p2, n) = (72, 72, 1296) with 1000 simulations, and assess whether ...