Testing Separability of High-Dimensional Covariance Matrices

Bongjung Sung; Peter D. Hoff

arxiv: 2506.17463 · v3 · submitted 2025-06-20 · 🧮 math.ST · stat.ME· stat.TH

Testing Separability of High-Dimensional Covariance Matrices

Bongjung Sung , Peter D. Hoff This is my paper

Pith reviewed 2026-05-19 07:53 UTC · model grok-4.3

classification 🧮 math.ST stat.MEstat.TH

keywords separability testinghigh-dimensional covariancecore covariance matrixsphericity testmatrix-variate datainvariant testsasymptotic spectral equivalence

0 comments

The pith

Testing separability of a high-dimensional covariance matrix is equivalent to testing sphericity of its core component.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that checking whether the covariance matrix Σ for matrix-variate data is separable is the same as checking whether its core component is spherical. This equivalence produces test statistics that remain well-defined when the dimension is large relative to sample size and whose distributions under the null do not depend on unknown parameters. Researchers can therefore simulate the null distribution exactly and control error rates precisely. The authors also prove that the sample covariance and its core are asymptotically equivalent in spectrum when the ratio of dimension to sample size converges to a positive constant, which justifies the use of these tests in the high-dimensional regime. Numerical comparisons indicate that the resulting procedures detect non-separability more often than earlier methods.

Core claim

We show that testing separability of Σ is equivalent to testing sphericity of its core component. With this insight, we construct test statistics that are well-defined in high-dimensional settings and have distributions that are invariant under the null hypothesis of separability, allowing for exact simulation of null distributions. We establish the asymptotic properties of some test statistics by proving the asymptotic spectral equivalence between the sample covariance matrix and its core in a p/n→γ∈(0,∞) regime.

What carries the argument

The core covariance matrix, a complementary object to a separable covariance matrix, carries the argument by converting separability testing into an equivalent sphericity test whose null distribution is parameter-free.

If this is right

Test statistics become invariant under the null, so their critical values can be obtained by direct simulation without estimating nuisance parameters.
The tests remain valid when the ratio of variables to observations converges to any positive finite limit.
The same equivalence supplies a route to other high-dimensional tests by replacing separability with other structural hypotheses on the core.
Numerical evidence indicates these procedures detect departures from separability more reliably than prior methods for the same sample sizes.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The reduction to a sphericity problem on the core may simplify the derivation of tests for related covariance structures such as Kronecker or factor models.
Applied researchers working with matrix-valued observations could use the exact null simulation to set precise significance levels in settings where parameter-dependent approximations have been unreliable.
Finite-sample refinements that exploit the exact invariance might further improve performance when the dimension-to-sample ratio is moderate rather than large.

Load-bearing premise

The core covariance matrix is defined so that separability of the original matrix holds exactly if and only if the core is spherical, and the asymptotic spectral equivalence between sample and population versions is valid when dimension over sample size approaches a positive constant.

What would settle it

A Monte Carlo experiment in which the proposed test statistic under the null of separability produces a distribution that still depends on unknown parameters, or in which the tests do not show higher power than existing procedures, would falsify the central claims.

Figures

Figures reproduced from arXiv: 2506.17463 by Bongjung Sung, Peter D. Hoff.

**Figure 1.** Figure 1: Empirical power of the separability tests under Np1×p2 (0, C1,w) as functions of n across each w = 0.2, 0.4, 0.6, 0.8, and ˆγ := ˆγ1γˆ2 = p1p2/n. The tests ϕ1, ϕ2, and ϕ3 are those proposed in this article. LRT and PTCLT denote the tests by [41, 34] and [3], respectively. LRT is not present when ˆγ > 1, since its test statistic is not well-defined in this case [PITH_FULL_IMAGE:figures/full_fig_p018_1.png] view at source ↗

**Figure 2.** Figure 2: Empirical power of the separability tests under Np1×p2 (0, C2,w) as functions of n across each w = 0.2, 0.4, 0.6, 0.8, and ˆγ := ˆγ1γˆ2 = p1p2/n. For the notations of ϕ1, ϕ2, ϕ3, LRT, and PTCLT, see [PITH_FULL_IMAGE:figures/full_fig_p018_2.png] view at source ↗

**Figure 3.** Figure 3: Empirical power of the separability tests under Np1×p2 (0, C1,w) as functions of w across (ˆγ1, γˆ2). Here n = 256 and (p1, p2) is determined according to the value of (ˆγ1, γˆ2). For the notations of ϕ1, ϕ2, ϕ3, LRT, and PTCLT, see [PITH_FULL_IMAGE:figures/full_fig_p019_3.png] view at source ↗

**Figure 4.** Figure 4: Empirical power of the separability tests under Np1×p2 (0, C1,w) as functions of w across (ˆγ1, γˆ2). Here n = 256 and (p1, p2) is determined according to the value of (ˆγ1, γˆ2). For the notations of ϕ1, ϕ2, ϕ3, LRT, and PTCLT, see [PITH_FULL_IMAGE:figures/full_fig_p019_4.png] view at source ↗

read the original abstract

Due to their parsimony, separable covariance models have been popular in modeling matrix-variate data. However, the inference from such a model may be misleading if the population covariance matrix $\Sigma$ is actually non-separable, motivating the use of statistical tests of separability. The existing separability tests suffer mainly from two issues: 1) test statistics that are not well-defined in high-dimensional settings, 2) low power for small sample sizes and null distributions that depend on unknown parameters, preventing exact error rate control. To address these issues, we propose novel invariant tests using the core covariance matrix, a complementary notion to a separable covariance matrix. We show that testing separability of $\Sigma$ is equivalent to testing sphericity of its core component. With this insight, we construct test statistics that are well-defined in high-dimensional settings and have distributions that are invariant under the null hypothesis of separability, allowing for exact simulation of null distributions. We establish the asymptotic properties of some test statistics by proving the asymptotic spectral equivalence between the sample covariance matrix and its core in a $p/n\rightarrow\gamma\in(0,\infty)$ regime. The large power of our proposed tests relative to existing procedures is demonstrated numerically.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The core covariance reduction turns separability testing into sphericity testing, which cleanly delivers invariant high-dimensional statistics with simulatable nulls.

read the letter

The main thing to know is that this paper defines a core covariance matrix so separability of the full Sigma becomes equivalent to sphericity of the core. That move lets them build test statistics that stay invariant under the null and whose distributions can be simulated exactly without estimating any parameters. It also gives asymptotic spectral equivalence results between the sample covariance and the core when p/n goes to a positive constant. This directly targets the two problems called out in the abstract: statistics that fall apart in high dimensions and nulls that depend on unknowns. The numerical comparisons show higher power than prior methods, which is useful if the tests hold up. The reduction itself looks like a genuine step beyond the cited separability literature. The soft spot is exactly where the stress-test note points: the equivalence is only as strong as the core definition. If that definition requires hidden normalizations, eigenvalue separation, or extra moment conditions to map separable matrices exactly to spherical ones and non-separable ones to non-spherical ones, then both the exact invariance and the high-dimensional asymptotics become conditional rather than unconditional. The abstract states the equivalence and the spectral result, but the proofs will need to be checked for any such restrictions. This is aimed at statisticians working with matrix-variate data who need to validate the separability assumption before fitting Kronecker models. Readers who care about exact error control in high dimensions or who already use sphericity tests will see the most direct value. It deserves peer review because the reduction is new, the invariance property is practically attractive, and the technical claims are specific enough to be refereeable even if some conditions need tightening.

Referee Report

2 major / 2 minor

Summary. The paper proposes a new approach to testing separability of high-dimensional covariance matrices Σ by introducing the complementary notion of a core covariance matrix. It claims that testing separability of Σ is exactly equivalent to testing sphericity of the core component. This equivalence is used to construct invariant test statistics that remain well-defined when p/n → γ ∈ (0, ∞), possess null distributions free of unknown parameters (hence exactly simulable), and satisfy asymptotic spectral equivalence between the sample covariance and the core. Numerical experiments are presented to illustrate higher power relative to existing procedures.

Significance. If the claimed exact equivalence and the asymptotic spectral equivalence hold under the stated high-dimensional regime, the work supplies a principled route to parameter-free, simulable null distributions for separability testing. This would address two documented shortcomings of prior tests (ill-defined statistics in high dimensions and parameter-dependent nulls) and could be useful in matrix-variate applications where Kronecker structure is assumed but must be verified.

major comments (2)

[Abstract and §2] Abstract and §2 (definition of core): the central claim that separability of Σ is equivalent to sphericity of the core holds only if the core is constructed so that any separable Σ maps precisely to a multiple of the identity and any non-separable Σ maps to a non-spherical matrix, without implicit normalizations or additional eigenvalue-separation assumptions. The manuscript must exhibit the explicit mapping and verify that no hidden parameters remain.
[§4] §4 (asymptotic spectral equivalence): the transfer of null-distribution properties to the proposed test statistics relies on proving that the sample covariance and its core are asymptotically spectrally equivalent when p/n → γ. The proof must state the precise moment and eigenvalue conditions under which this equivalence is unconditional; otherwise the invariance and exact simulation claims become conditional.

minor comments (2)

[§2] Notation for the core matrix should be introduced with a clear symbol (e.g., Σ_core) and distinguished from the separable factors at first use.
[Numerical experiments] The numerical section would benefit from reporting the exact simulation size used for critical values and the number of Monte Carlo replications.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful reading and constructive comments on our manuscript. We address each major comment below and indicate how we will revise the paper accordingly.

read point-by-point responses

Referee: [Abstract and §2] Abstract and §2 (definition of core): the central claim that separability of Σ is equivalent to sphericity of the core holds only if the core is constructed so that any separable Σ maps precisely to a multiple of the identity and any non-separable Σ maps to a non-spherical matrix, without implicit normalizations or additional eigenvalue-separation assumptions. The manuscript must exhibit the explicit mapping and verify that no hidden parameters remain.

Authors: We agree that the equivalence requires explicit verification. In the revised manuscript we will expand Section 2 to include the full construction of the core component and a direct proof that Σ is separable if and only if its core is a scalar multiple of the identity. The argument will rely solely on the given definitions and will confirm the absence of hidden normalizations or eigenvalue-separation assumptions. revision: yes
Referee: [§4] §4 (asymptotic spectral equivalence): the transfer of null-distribution properties to the proposed test statistics relies on proving that the sample covariance and its core are asymptotically spectrally equivalent when p/n → γ. The proof must state the precise moment and eigenvalue conditions under which this equivalence is unconditional; otherwise the invariance and exact simulation claims become conditional.

Authors: We accept the need for explicit conditions. We will revise the statement of the main theorem in Section 4 to list the precise assumptions—finite fourth moments of the entries and eigenvalues of Σ bounded away from zero and infinity—under which the asymptotic spectral equivalence holds unconditionally. This will make clear that the invariance and exact-simulation properties are valid under these stated conditions. revision: yes

Circularity Check

1 steps flagged

Equivalence of separability testing to core sphericity is by construction via complementary definition of the core

specific steps

self definitional [Abstract]
"we propose novel invariant tests using the core covariance matrix, a complementary notion to a separable covariance matrix. We show that testing separability of Σ is equivalent to testing sphericity of its core component."

The core is introduced as the complementary notion to separability; the claimed equivalence therefore holds exactly by how the core is defined (any separable Σ produces a spherical core and vice versa). The 'show that' step is thus a restatement of the definition rather than a derived result from matrix properties or external theorems.

full rationale

The paper introduces the core covariance matrix as a 'complementary notion to a separable covariance matrix' and then states that testing separability of Σ is equivalent to testing sphericity of the core. This equivalence is load-bearing for the subsequent construction of invariant test statistics and the claim of exact null distribution simulation. Because the core is explicitly positioned as complementary, the mapping (separable Σ maps to spherical core, non-separable maps to non-spherical) is built into the definition rather than independently derived from first principles or external properties. The asymptotic spectral equivalence result between sample covariance and core is presented as a separate technical contribution, but the foundational population-level equivalence reduces to the definitional choice. This produces partial circularity: the central insight is self-definitional while later asymptotic and simulation steps retain independent content.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 1 invented entities

The central claim rests on the definition of the core covariance matrix as a complementary notion to separability and on standard assumptions for covariance matrices in high-dimensional settings. No free parameters are mentioned. The core matrix itself functions as an invented entity whose independent evidence is not provided in the abstract.

axioms (1)

domain assumption Covariance matrices are symmetric positive semi-definite and the observations follow a distribution permitting the stated spectral equivalences.
Standard background assumption invoked for the asymptotic results in the p/n → γ regime.

invented entities (1)

core covariance matrix no independent evidence
purpose: Complementary notion that reduces separability testing to sphericity testing and enables invariant statistics.
Introduced in the paper as the key device for constructing the new tests.

pith-pipeline@v0.9.0 · 5742 in / 1379 out tokens · 38503 ms · 2026-05-19T07:53:23.535072+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel (J-cost uniqueness) unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We show that testing separability of Σ is equivalent to testing sphericity of its core component... Σ = K^{1/2} C K^{1/2,⊤} ... C = I_p
IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Kronecker-core decomposition (KCD) ... core map c : S_p^+ → C_{p1,p2}^+

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

76 extracted references · 76 canonical work pages

[1]

Allen and Robert Tibshirani

Genevera I. Allen and Robert Tibshirani. Inference with transposable data: modelling the effects of row and column correlations. J. R. Stat. Soc., B: Stat. Methodol. , 74(4):721–743, 2012

work page 2012
[2]

An Introduction to Multivariate Statistical Analysis

Theodore Wilbur Anderson. An Introduction to Multivariate Statistical Analysis. Wiley, 2003

work page 2003
[3]

John A. D. Aston, Davide Pigoli, and Shahin Tavakoli. Tests for separability in nonparametric covariance operators of random surfaces. Ann. Stat., 45(4):1431–1461, 2017

work page 2017
[4]

On sample eigenvalues in a generalized spiked population model

Zhidong Bai and Jianfeng Yao. On sample eigenvalues in a generalized spiked population model. J. Multivar. Anal. , 106:167–177, 2012. 22

work page 2012
[5]

Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices

Jinho Baik, G´ erard Ben Arous, and Sandrine P´ ech’e. Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices. Ann. Probab., 33(5):1643–1697, 2005

work page 2005
[6]

Silverstein

Jinho Baik and Jack W. Silverstein. Eigenvalues of large sample covariance matrices of spiked population models. J. Multivar. Anal. , 97(6):1382–1408, 2006

work page 2006
[7]

Universality for the largest eigenvalue of sample covariance matrices with general population

Zhigang Bao, Guangmin Pan, and Wang Zhou. Universality for the largest eigenvalue of sample covariance matrices with general population. Ann. Stat., 43(1):382–421, 2015

work page 2015
[8]

Tony Cai, Xiao Han, and Guangming Pan

T. Tony Cai, Xiao Han, and Guangming Pan. Limiting laws for divergent spiked eigenvalues and largest non-spiked eigenvalue of sample covariance matrices. Ann. Stat., 48(3):1255–1280, 2020

work page 2020
[9]

Testing separability of space–time functional processes

Panayiotis Constantinou, Piotr Kokoszka, and Matthew Reimherr. Testing separability of space–time functional processes. Biometrika, 104(2):425–437, 2017

work page 2017
[10]

Maximum likelihood estimation for matrix normal models via quiver representations

Harm Derksen and Visu Makam. Maximum likelihood estimation for matrix normal models via quiver representations. SIAM J. Appl. Algebra Geom. , 5(2):338–365, 2021

work page 2021
[11]

Maximum likelihood estimation for tensor normal models via castling transforms

Harm Derksen, Visu Makam, and Michael Walter. Maximum likelihood estimation for tensor normal models via castling transforms. Forum math. Sigma., 10(50), 2022

work page 2022
[12]

Some sphericity tests for high dimensional data based on ratio of the traces of sample covariance matrices

Xue Ding. Some sphericity tests for high dimensional data based on ratio of the traces of sample covariance matrices. Stat. Probab. Lett., 156:108613, 2020

work page 2020
[13]

Rational maximum likelihood estimators of kronecker covariance matrices

Mathias Drton, Alexandros Grosdos, and Andrew McCormack. Rational maximum likelihood estimators of kronecker covariance matrices. Algebr. Stat., 15(1):144–163, 2024

work page 2024
[14]

Existence and uniqueness of the kronecker covariance mle

Mathias Drton, Satoshi Kuriki, and Peter Hoff. Existence and uniqueness of the kronecker covariance mle. Ann. Stat., 49(5):2721–2754, 2021

work page 2021
[15]

Fisher, Xiaoqian Sun, and Colin M

Thomas J. Fisher, Xiaoqian Sun, and Colin M. Gallagher. A new test for sphericity of the covariance matrix for high dimensional data. J. Multivar. Anal. , 101(10):2554–2570, 2010

work page 2010
[16]

A higher-order lq decomposition for separable covariance models

David Gerard and Peter Hoff. A higher-order lq decomposition for separable covariance models. Linear Algebra Appl., 505(15):57–84, 2016

work page 2016
[17]

Testing the first- order separability hypothesis for spatio-temporal point patterns

Mohammad Ghorbani, Nafiseh Vafae, Jiˇ r´ ı Dvoˇ r´ ak, and Mari Myllym¨ aki. Testing the first- order separability hypothesis for spatio-temporal point patterns. Comput. Stat. Data. Anal. , 161:107245, 2021

work page 2021
[18]

Nonseparable, stationary covariance functions for space–time data

Tilmann Gneiting. Nonseparable, stationary covariance functions for space–time data. J. Am. Stat. Assoc., 97(458):590–600, 2002

work page 2002
[19]

Genton, and Peter Guttorp

Tilmann Gneiting, Marc G. Genton, and Peter Guttorp. Geostatistical space–time models, stationarity, separability, and full symmetry. Monogr. Stat. Appl. Probab., 107:151–175, 2007

work page 2007
[20]

Rungang Han, Pixu Shi, and Anru R. Zhang. Guaranteed functional tensor singular value decomposition. J. Am. Stat. Assoc. , 119(546):995–1007, 2024

work page 2024
[21]

Asymptotic independence of point process and frobe- nius norm of a large sample covariance matrix

Johannes Heiny and Carolin Kleemann. Asymptotic independence of point process and frobe- nius norm of a large sample covariance matrix. arXiv preprint arXiv:2302.13914 , 2023. 23

work page arXiv 2023
[22]

Peter Hoff, Andrew Mccormack, and Anru R. Zhang. Core shrinkage covariance estimation for matrix-variate data. J. R. Stat. Soc. Ser. B Methodol. , 85(5):1659–1679, 2023

work page 2023
[23]

Peter D. Hoff. Separable covariance arrays via the tucker product, with applications to mul- tivariate relational data. Bayesian Anal., 6(2):179–196, 2011

work page 2011
[24]

Testing stationarity of functional time series

Lajos Horv´ ath, Piotr Kokoszka, and Gregory Rice. Testing stationarity of functional time series. J. Econom., 179(1):66–82, 2014

work page 2014
[25]

Generalized four moment theorem and an application to clt for spiked eigenvalues of high-dimensional covariance matrices.Bernoulli, 27(1):274–294, 2021

Dandan Jiang and Zhidong Bai. Generalized four moment theorem and an application to clt for spiked eigenvalues of high-dimensional covariance matrices.Bernoulli, 27(1):274–294, 2021

work page 2021
[26]

Johnstone

Iain M. Johnstone. On the distribution of the largest eigenvalue in principal components analysis. Ann. Stat., 29(2):295–327, 2001

work page 2001
[27]

Probability Theory

Achim Klenke. Probability Theory. Springer-Verlag, 2008

work page 2008
[28]

Tracy-widom distribution for the largest eigenvalue of real sample covariance matrices with general population

Ji Oon Lee and Kevin Schnelli. Tracy-widom distribution for the largest eigenvalue of real sample covariance matrices with general population. Ann. Appl. Probab., 26(6):3786—-3839, 2016

work page 2016
[29]

Lehmann and Joseph P

E.L. Lehmann and Joseph P. Romano. Testing statistical hypotheses. Springer, 2022

work page 2022
[30]

Testing the sphericity of a covariance matrix when the dimension is much larger than the sample size

Zeng Li and Jianfeng Yao. Testing the sphericity of a covariance matrix when the dimension is much larger than the sample size. Electron. J. Stat., 10(2):2973–3010, 2016

work page 2016
[31]

Lindquist

Martin A. Lindquist. The statistical analysis of fmri data. Stat. Sci., 23(4):439–464, 2008

work page 2008
[32]

Linton and Haihan Tang

Oliver B. Linton and Haihan Tang. Estimation of the kronecker covariance model by quadratic form. Econom. Theory, 38(5):1014–1067, 2022

work page 2022
[33]

Van Loan

Charles F. Van Loan. The ubiquitous kronecker product. J. Comput. Appl. Math. , 123(1):85– 100, 2000

work page 2000
[34]

Zimmerman

Nelson Lu and Dale L. Zimmerman. The likelihood ratio test for a separable covariance matrix. Stat. Probab. Lett., 73(4):449–457, 2005

work page 2005
[35]

Simultaneous singular value decomposition

Takanori Maehara and Kazuo Murota. Simultaneous singular value decomposition. Linear Algebra Appl., 435(1):106–116, 2011

work page 2011
[36]

Manceur and Pierre Dutilleul

A.M. Manceur and Pierre Dutilleul. Unbiased modified likelihood ratio tests for simple and double separability of a variance–covariance structure.Stat. Probab. Lett., 83(2):631–636, 2013

work page 2013
[37]

Manceur and Pierre Dutilleul

Ameur M. Manceur and Pierre Dutilleul. Maximum likelihood estimation for the tensor normal distribution: Algorithm, minimum sample size, and empirical bias and dispersion. J. Comput. Appl. Math., 239(1):37–49, 2013

work page 2013
[38]

The eigenvalue distribution in some ensembles of random matrices

Volodymyr Marchenko and Leonid Pastur. The eigenvalue distribution in some ensembles of random matrices. Math. USSR Sbornik , 1:457–483, 1967

work page 1967
[39]

Panaretos

Tomas Masak and Victor M. Panaretos. Random surface covariance estimation by shifted partial tracing. J. Am. Stat. Assoc. , 118(544):2562–2574, 2023

work page 2023
[40]

Panaretos

Tomas Masak, Soham Sarkar, and Victor M. Panaretos. Separable expansions for covariance estimation via the partial inner product. Biometrika, 110(1):225–247, 2023. 24

work page 2023
[41]

Mitchell, Marc G

Matthew W. Mitchell, Marc G. Genton, and Marcia L. Gumpertz. A likelihood ratio test for separability of covariances. J. Multivar. Anal. , 97(5):1025–1043, 2006

work page 2006
[42]

Davide Pigoli, John A. D. Aston, Ian L. Dryden, and Piercesare Secchi. Distances and inference for covariance operators. Biometrika, 101(2):409–422, 2014

work page 2014
[43]

Dimension-free structured covariance estimation

Nikita Puchkin and Maxim Rakhuba. Dimension-free structured covariance estimation. Proc. Mach. Learn. Res., 247:1–31, 2024

work page 2024
[44]

de Mucnk, and Mathisca C.M

Beata Ro´ s, Fetsje Bijma, Jan C. de Mucnk, and Mathisca C.M. de Gunst. Existence and uniqueness of the maximum likelihood estimator for models with a kronecker product covari- ance structure. J. Multivar. Anal. , 143:345–361, 2016

work page 2016
[45]

Hanson-wright inequality and sub-gaussian concentra- tion

Mark Rudelson and Roman Vershynin. Hanson-wright inequality and sub-gaussian concentra- tion. Electron. Commun. Probab., 18(82):1–9, 2013

work page 2013
[46]

Silverstein and Bai Z

Jack W. Silverstein and Bai Z. D. On the empirical distribution of eigenvalues of a class of large dimensional random matrices,. J. Multivar. Anal. , 54(2):175–192, 1995

work page 1995
[47]

Simpson, Lloyd J

Sean L. Simpson, Lloyd J. Edwards, Martin A. Styner, and Keith E. Muller. Separability tests for high-dimensional, low-sample size multivariate repeated measures data. J. Appl. Stat. , 41(11):2450–2461, 2014

work page 2014
[48]

Soloveychik and D

I. Soloveychik and D. Trushin. Gaussian and robust kronecker product covariance estimation: Existence and uniqueness. J. Multivar. Anal. , 149:92–113, 2016

work page 2016
[49]

covsep: Tests for Determining if the Covariance Structure of 2-Dimensional Data is Separable, 2018

Shahin Tavakoli. covsep: Tests for Determining if the Covariance Structure of 2-Dimensional Data is Separable, 2018. R package version 1.1.0

work page 2018
[50]

Tracy and Harold Widom

Craig A. Tracy and Harold Widom. Level-spacing distributions and the airy kernel. Commun. Math. Phys., 159(1):151–174, 1994

work page 1994
[51]

Hero III, and Shuheng Zhou

Theodoros Tsiligkaridis, Alfred O. Hero III, and Shuheng Zhou. On convergence of kronecker graphical lasso algorithms. IEEE Trans. Signal Process., 61(7):1743–1755, 2013

work page 2013
[52]

Hero III

Theodoros Tsiligkaridis and Alfred O. Hero III. Covariance estimation in high dimensions via kronecker product expansions. IEEE Trans. Signal Process., 61(21):5347–5360, 2013

work page 2013
[53]

High-dimensional probability: An introduction with applications in data science, volume 47

Roman Vershynin. High-dimensional probability: An introduction with applications in data science, volume 47. Cambridge university press, 2018

work page 2018
[54]

On the sphericity test with large-dimensional observations

Qinwen Wang and Jianfeng Yao. On the sphericity test with large-dimensional observations. Electron. J. Stat., 7:2164–2192, 2013

work page 2013
[55]

High-dimensional sphericity test by extended likelihood ratio

Zhendong Wang and Xingzhong Xu. High-dimensional sphericity test by extended likelihood ratio. Metrika, 84:1169—-1212, 2021

work page 2021
[56]

On estimation of covariance matrices with kronecker product structure

Karl Werner, Magnus Jansson, and Petre Stoica. On estimation of covariance matrices with kronecker product structure. IEEE Trans. Signal Process., 56(2):478–491, 2008

work page 2008
[57]

Testing kronecker product covariance matrices for high-dimensional matrix-variate data

Long Yu, Jiahui Xie, and Wang Zhou. Testing kronecker product covariance matrices for high-dimensional matrix-variate data. Biometrika, 110(3):799––814, 2023. 25 Appendix In Appendix, we provide the proofs of the theoretical results and additional theoretical results in the main text (Section A), miscellaneous figures and tables supporting the results and...

work page 2023
[58]

For a core covariance matrix with a rank−r partial isotropic structure, assume p1 = p2r without loss of generality

Lastly, if C has a rank −2 partial isotropic structure with p1 = p2, using β = det(A⊤A) in (24) and noting that tr( A⊤A) = p2 1(1 − λ), the result follows. For a core covariance matrix with a rank−r partial isotropic structure, assume p1 = p2r without loss of generality. Let ¯A = [vec( ¯A1), . . . ,vec( ¯Ar)] for ¯Ai in the proof of Proposition 2 for a co...

work page
[59]

If (j, k) = (u, v), j = k, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤a)2] = E   p1X i1,i2=1 a2 i1a2 i2   = p1E[a4 1] + p1(p1 − 1)E[a2 1]E[a2 2] = p1ν4 + p1(p1 − 1)

work page
[60]

If j = k, u = v, and j ̸= u, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤a)(b⊤b)] = E[a⊤a]E(b⊤b)] = E2[a⊤a] = p1X i=1 E[a2 i ] !2 = p2 1

work page
[61]

If (j, k) = (u, v), j ̸= k, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)2] = E   p1X i1,i2=1 ai1bi1ai2bi2   = p1E[a2 1b2 1] + p1(p1 − 1)E[a1b1a2b2] = p1. 38

work page
[62]

For instance, if j = u and (j − k)(j − v)(k − v) ̸= 0, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)(a⊤c)] = E   p1X i1,i2=1 ai1bi1ai2ci2   = nX i1,i2=1 E [ai1ai2] E[bi1]E[ci2] = 0

Otherwise, using the independence of entries of a, b, c, d yields that E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = 0. For instance, if j = u and (j − k)(j − v)(k − v) ̸= 0, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)(a⊤c)] = E   p1X i1,i2=1 ai1bi1ai2ci2   = nX i1,i2=1 E [ai1ai2] E[bi1]E[ci2] = 0. One can compute the expectation for other remaining cases using the independence. A...

work page
[63]

If (j, k) = (u, v), j = k, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)2] = p1

work page
[64]

If j = k, u = v, and j ̸= u, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)(c⊤d)] = E[a⊤b]E[c⊤d] = 0

work page
[65]

If (j, k) = (u, v), j ̸= k, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)2] = p1

work page
[66]

To prove the second assertion, partition Ip as Ip = diag(Ip1,

Otherwise, as an analogy to the item 4), one can verify that E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = 0. To prove the second assertion, partition Ip as Ip = diag(Ip1, . . . , Ip1 | {z } p2 ). Then R(Ip) =   W1 ... Wp2   ⇒ R(Ip)R(Ip)⊤ =   W1W ⊤ 1 · · · W1W ⊤ p2 ... ... ... Wp2W ⊤ 1 · · · Wp2W ⊤ p2   , where the row of Wi ∈ Rp2×p2 1 is vec(Ip1) on ith row and ...

work page
[67]

To bound W11,11, (I) = nVar (a⊤a)2 = nVar   p1X i=1 a4 i + 2 X 1≤i<j≤p1 a2 i a2 j   ≤ 2nVar p1X i=1 a4 i ! + 16nVar   X 1≤i<j≤p1 a2 i a2 j   ≲ np1ν8 + 16nE[ X i1<j1,i2<j2 a2 i1a2 j1a2 i2a2 j2] ≲ np1ν8 + np4 1 = O(np4 1). (34) 41 Since j = k = u = v = 1, we have that (II) + (III) = 2n(n − 1)Var (a⊤b)2 = 2n(n − 1)Var   p1X i=1 a2 i b2 i + 2 X 1≤i<...

work page
[68]

To bound W11,22, (I) = nVar (a⊤a)(b⊤b) ≤ nE[(a⊤a)2(b⊤b)2] = nE[(a⊤a)2]E[(b⊤b)2] = O(np4 1), (II) + (III) = 2n(n − 1)Var (a⊤b)(c⊤d) ≤ n(n − 1)E[(a⊤b)2(c⊤d)2] = 2n(n − 1)E2[(a⊤b)2] ≤ n2 E[ p1X i1=1 a2 i1b2 i1] !2 = O(n2p2 1). (37) 42 On the other hand, (IV) = 4n(n − 1)Cov (a⊤a)(b⊤b), (a⊤c)(b⊤d) = 4n(n − 1) E[(a⊤a)(b⊤b)(a⊤c)(b⊤d)] − E[(a⊤a)(b⊤b)]E[(a⊤c)]E[(b...

work page
[69]

(39) As an analogy to (38), one may see that (III) = (IV) = (V) = 0

To bound W11,12, (I) = nVar (a⊤a)(a⊤b) ≤ nE[(a⊤a)2(a⊤b)2] = nE[ p1X i1,i2,i3,i4=1 a2 i1a2 i2ai3bi3ai4bi4] = nE[ p1X i1,i2,i3=1 a2 i1a2 i2a2 i3b2 i3] = O(np3 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) ≤ n(n − 1)E[(a⊤b)2(a⊤c)2] ≤ n2E[ p1X i1,i2,i3,i4=1 ai1bi1ai2bi2ai3ci3ai4ci4] = n2E[ p1X i1,i3=1 a2 i1b2 i1a2 i3c2 i3] = O(n2p2 1). (39) As an analogy to (38), one may...

work page
[70]

Again one may observe that (III) = (IV) = (V) = 0

To bound W11,23, (I) = nVar (a⊤a)(b⊤c) = nE[(a⊤a)2(b⊤c)2] = nE[(a⊤a)2]E[(b⊤c)2] = O(np3 1), (II) = n(n − 1)Var (a⊤b)(c⊤d) = O(n2p2 1), (40) where the computation of (II) follows from (37). Again one may observe that (III) = (IV) = (V) = 0. Hence, from (40), W11,23 = O(np3 1 + n2p2 1). 43

work page
[71]

(41) Also, (36) implies that (III) = n(n − 1)Cov (a⊤b)2, (c⊤d)2 = 0, (IV) = 4n(n − 1)Cov (a⊤b)2, (a⊤c)2 = O(n2p2 1), (V) = 4n(n − 1)(n − 2)Cov (a⊤b)2, (a⊤c)2 = O(n3p2 1)

To bound W12,12, deduce from (35) and (39) that (I) = nVar (a⊤b)2 = O(np2 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) = O(n2p2 1). (41) Also, (36) implies that (III) = n(n − 1)Cov (a⊤b)2, (c⊤d)2 = 0, (IV) = 4n(n − 1)Cov (a⊤b)2, (a⊤c)2 = O(n2p2 1), (V) = 4n(n − 1)(n − 2)Cov (a⊤b)2, (a⊤c)2 = O(n3p2 1). (42) From (41) and (42), we have that W12,12 = O(n3p2 1)

work page
[72]

(43) As the same with the items 3) and 4), one can verify that (III) = (IV) = (V) = 0

To bound W12,13, (I) = nVar (a⊤b)(a⊤c) = O(np2 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) = O(n2p2 1). (43) As the same with the items 3) and 4), one can verify that (III) = (IV) = (V) = 0. Hence, from (43), W12,13 = O(n2p2 1)

work page
[73]

(44) Again we have that (III) = (IV) = (V) = 0

To bound W12,34, one can deduce from (37) that (I) = nVar (a⊤b)(c⊤d) = O(np2 1), (II) = n(n − 1)Var (a⊤b)(c⊤d) = O(n2p2 1). (44) Again we have that (III) = (IV) = (V) = 0. Thus, (44) implies that W12,34 = O(n2p2 1). Therefore, we verified (30) from the items 1)–7). Using the results from Lemma 4–5, we prove Theorem 5 and Corollary 3. Proof of Theorem 5 No...

work page
[74]

47 A.5 Singular Values of R(C) In this subsection, we provide some results on the singular values of R(C)

log 12− c n3/2δ K2 ! ≲ 2 exp n log 12 − c n3/2δ K2 ! ≲ 2 exp −˜cn3/2δ , where the first ≲ holds because (A1) implies that p2 1 + p2 2 ≍ n, and the second ≲ holds for any constant ˜c ∈ (0, c/K2). 47 A.5 Singular Values of R(C) In this subsection, we provide some results on the singular values of R(C). As discussed in Section 4.3, we show that σ1(R(C)) scal...

work page
[75]

We examine whether the above result holds when Σ is a core covariance matrix with a rank −r partial isotropic structure, focusing on r = 1 , 2

derived the sample eigenvalue bias, as follows (see Theorem 1.1–1.2 of [6]): for j ∈ J i, ℓj a.s.→ ( σ2 ai + γai ai−σ2 , if ai > 1 + √γ, σ2(1 + √γ)2, o.t. . We examine whether the above result holds when Σ is a core covariance matrix with a rank −r partial isotropic structure, focusing on r = 1 , 2. To align with the assumption of Σ, we set λ = 1/(1 + rc/...

work page
[76]

Here ( p1, p2, n) = (20, 20, 1600)

in Proposition 3. Here ( p1, p2, n) = (20, 20, 1600). We conclude this section with the figures that numerically verify the results of Corollary 2 and Theorem 5, as well as the table that demonstrates the consistency of T3. To verify Corollary 2, we generate the test statistic T1 with (p1, p2, n) = (72, 72, 1296) with 1000 simulations, and assess whether ...

work page

[1] [1]

Allen and Robert Tibshirani

Genevera I. Allen and Robert Tibshirani. Inference with transposable data: modelling the effects of row and column correlations. J. R. Stat. Soc., B: Stat. Methodol. , 74(4):721–743, 2012

work page 2012

[2] [2]

An Introduction to Multivariate Statistical Analysis

Theodore Wilbur Anderson. An Introduction to Multivariate Statistical Analysis. Wiley, 2003

work page 2003

[3] [3]

John A. D. Aston, Davide Pigoli, and Shahin Tavakoli. Tests for separability in nonparametric covariance operators of random surfaces. Ann. Stat., 45(4):1431–1461, 2017

work page 2017

[4] [4]

On sample eigenvalues in a generalized spiked population model

Zhidong Bai and Jianfeng Yao. On sample eigenvalues in a generalized spiked population model. J. Multivar. Anal. , 106:167–177, 2012. 22

work page 2012

[5] [5]

Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices

Jinho Baik, G´ erard Ben Arous, and Sandrine P´ ech’e. Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices. Ann. Probab., 33(5):1643–1697, 2005

work page 2005

[6] [6]

Silverstein

Jinho Baik and Jack W. Silverstein. Eigenvalues of large sample covariance matrices of spiked population models. J. Multivar. Anal. , 97(6):1382–1408, 2006

work page 2006

[7] [7]

Universality for the largest eigenvalue of sample covariance matrices with general population

Zhigang Bao, Guangmin Pan, and Wang Zhou. Universality for the largest eigenvalue of sample covariance matrices with general population. Ann. Stat., 43(1):382–421, 2015

work page 2015

[8] [8]

Tony Cai, Xiao Han, and Guangming Pan

T. Tony Cai, Xiao Han, and Guangming Pan. Limiting laws for divergent spiked eigenvalues and largest non-spiked eigenvalue of sample covariance matrices. Ann. Stat., 48(3):1255–1280, 2020

work page 2020

[9] [9]

Testing separability of space–time functional processes

Panayiotis Constantinou, Piotr Kokoszka, and Matthew Reimherr. Testing separability of space–time functional processes. Biometrika, 104(2):425–437, 2017

work page 2017

[10] [10]

Maximum likelihood estimation for matrix normal models via quiver representations

Harm Derksen and Visu Makam. Maximum likelihood estimation for matrix normal models via quiver representations. SIAM J. Appl. Algebra Geom. , 5(2):338–365, 2021

work page 2021

[11] [11]

Maximum likelihood estimation for tensor normal models via castling transforms

Harm Derksen, Visu Makam, and Michael Walter. Maximum likelihood estimation for tensor normal models via castling transforms. Forum math. Sigma., 10(50), 2022

work page 2022

[12] [12]

Some sphericity tests for high dimensional data based on ratio of the traces of sample covariance matrices

Xue Ding. Some sphericity tests for high dimensional data based on ratio of the traces of sample covariance matrices. Stat. Probab. Lett., 156:108613, 2020

work page 2020

[13] [13]

Rational maximum likelihood estimators of kronecker covariance matrices

Mathias Drton, Alexandros Grosdos, and Andrew McCormack. Rational maximum likelihood estimators of kronecker covariance matrices. Algebr. Stat., 15(1):144–163, 2024

work page 2024

[14] [14]

Existence and uniqueness of the kronecker covariance mle

Mathias Drton, Satoshi Kuriki, and Peter Hoff. Existence and uniqueness of the kronecker covariance mle. Ann. Stat., 49(5):2721–2754, 2021

work page 2021

[15] [15]

Fisher, Xiaoqian Sun, and Colin M

Thomas J. Fisher, Xiaoqian Sun, and Colin M. Gallagher. A new test for sphericity of the covariance matrix for high dimensional data. J. Multivar. Anal. , 101(10):2554–2570, 2010

work page 2010

[16] [16]

A higher-order lq decomposition for separable covariance models

David Gerard and Peter Hoff. A higher-order lq decomposition for separable covariance models. Linear Algebra Appl., 505(15):57–84, 2016

work page 2016

[17] [17]

Testing the first- order separability hypothesis for spatio-temporal point patterns

Mohammad Ghorbani, Nafiseh Vafae, Jiˇ r´ ı Dvoˇ r´ ak, and Mari Myllym¨ aki. Testing the first- order separability hypothesis for spatio-temporal point patterns. Comput. Stat. Data. Anal. , 161:107245, 2021

work page 2021

[18] [18]

Nonseparable, stationary covariance functions for space–time data

Tilmann Gneiting. Nonseparable, stationary covariance functions for space–time data. J. Am. Stat. Assoc., 97(458):590–600, 2002

work page 2002

[19] [19]

Genton, and Peter Guttorp

Tilmann Gneiting, Marc G. Genton, and Peter Guttorp. Geostatistical space–time models, stationarity, separability, and full symmetry. Monogr. Stat. Appl. Probab., 107:151–175, 2007

work page 2007

[20] [20]

Rungang Han, Pixu Shi, and Anru R. Zhang. Guaranteed functional tensor singular value decomposition. J. Am. Stat. Assoc. , 119(546):995–1007, 2024

work page 2024

[21] [21]

Asymptotic independence of point process and frobe- nius norm of a large sample covariance matrix

Johannes Heiny and Carolin Kleemann. Asymptotic independence of point process and frobe- nius norm of a large sample covariance matrix. arXiv preprint arXiv:2302.13914 , 2023. 23

work page arXiv 2023

[22] [22]

Peter Hoff, Andrew Mccormack, and Anru R. Zhang. Core shrinkage covariance estimation for matrix-variate data. J. R. Stat. Soc. Ser. B Methodol. , 85(5):1659–1679, 2023

work page 2023

[23] [23]

Peter D. Hoff. Separable covariance arrays via the tucker product, with applications to mul- tivariate relational data. Bayesian Anal., 6(2):179–196, 2011

work page 2011

[24] [24]

Testing stationarity of functional time series

Lajos Horv´ ath, Piotr Kokoszka, and Gregory Rice. Testing stationarity of functional time series. J. Econom., 179(1):66–82, 2014

work page 2014

[25] [25]

Generalized four moment theorem and an application to clt for spiked eigenvalues of high-dimensional covariance matrices.Bernoulli, 27(1):274–294, 2021

Dandan Jiang and Zhidong Bai. Generalized four moment theorem and an application to clt for spiked eigenvalues of high-dimensional covariance matrices.Bernoulli, 27(1):274–294, 2021

work page 2021

[26] [26]

Johnstone

Iain M. Johnstone. On the distribution of the largest eigenvalue in principal components analysis. Ann. Stat., 29(2):295–327, 2001

work page 2001

[27] [27]

Probability Theory

Achim Klenke. Probability Theory. Springer-Verlag, 2008

work page 2008

[28] [28]

Tracy-widom distribution for the largest eigenvalue of real sample covariance matrices with general population

Ji Oon Lee and Kevin Schnelli. Tracy-widom distribution for the largest eigenvalue of real sample covariance matrices with general population. Ann. Appl. Probab., 26(6):3786—-3839, 2016

work page 2016

[29] [29]

Lehmann and Joseph P

E.L. Lehmann and Joseph P. Romano. Testing statistical hypotheses. Springer, 2022

work page 2022

[30] [30]

Testing the sphericity of a covariance matrix when the dimension is much larger than the sample size

Zeng Li and Jianfeng Yao. Testing the sphericity of a covariance matrix when the dimension is much larger than the sample size. Electron. J. Stat., 10(2):2973–3010, 2016

work page 2016

[31] [31]

Lindquist

Martin A. Lindquist. The statistical analysis of fmri data. Stat. Sci., 23(4):439–464, 2008

work page 2008

[32] [32]

Linton and Haihan Tang

Oliver B. Linton and Haihan Tang. Estimation of the kronecker covariance model by quadratic form. Econom. Theory, 38(5):1014–1067, 2022

work page 2022

[33] [33]

Van Loan

Charles F. Van Loan. The ubiquitous kronecker product. J. Comput. Appl. Math. , 123(1):85– 100, 2000

work page 2000

[34] [34]

Zimmerman

Nelson Lu and Dale L. Zimmerman. The likelihood ratio test for a separable covariance matrix. Stat. Probab. Lett., 73(4):449–457, 2005

work page 2005

[35] [35]

Simultaneous singular value decomposition

Takanori Maehara and Kazuo Murota. Simultaneous singular value decomposition. Linear Algebra Appl., 435(1):106–116, 2011

work page 2011

[36] [36]

Manceur and Pierre Dutilleul

A.M. Manceur and Pierre Dutilleul. Unbiased modified likelihood ratio tests for simple and double separability of a variance–covariance structure.Stat. Probab. Lett., 83(2):631–636, 2013

work page 2013

[37] [37]

Manceur and Pierre Dutilleul

Ameur M. Manceur and Pierre Dutilleul. Maximum likelihood estimation for the tensor normal distribution: Algorithm, minimum sample size, and empirical bias and dispersion. J. Comput. Appl. Math., 239(1):37–49, 2013

work page 2013

[38] [38]

The eigenvalue distribution in some ensembles of random matrices

Volodymyr Marchenko and Leonid Pastur. The eigenvalue distribution in some ensembles of random matrices. Math. USSR Sbornik , 1:457–483, 1967

work page 1967

[39] [39]

Panaretos

Tomas Masak and Victor M. Panaretos. Random surface covariance estimation by shifted partial tracing. J. Am. Stat. Assoc. , 118(544):2562–2574, 2023

work page 2023

[40] [40]

Panaretos

Tomas Masak, Soham Sarkar, and Victor M. Panaretos. Separable expansions for covariance estimation via the partial inner product. Biometrika, 110(1):225–247, 2023. 24

work page 2023

[41] [41]

Mitchell, Marc G

Matthew W. Mitchell, Marc G. Genton, and Marcia L. Gumpertz. A likelihood ratio test for separability of covariances. J. Multivar. Anal. , 97(5):1025–1043, 2006

work page 2006

[42] [42]

Davide Pigoli, John A. D. Aston, Ian L. Dryden, and Piercesare Secchi. Distances and inference for covariance operators. Biometrika, 101(2):409–422, 2014

work page 2014

[43] [43]

Dimension-free structured covariance estimation

Nikita Puchkin and Maxim Rakhuba. Dimension-free structured covariance estimation. Proc. Mach. Learn. Res., 247:1–31, 2024

work page 2024

[44] [44]

de Mucnk, and Mathisca C.M

Beata Ro´ s, Fetsje Bijma, Jan C. de Mucnk, and Mathisca C.M. de Gunst. Existence and uniqueness of the maximum likelihood estimator for models with a kronecker product covari- ance structure. J. Multivar. Anal. , 143:345–361, 2016

work page 2016

[45] [45]

Hanson-wright inequality and sub-gaussian concentra- tion

Mark Rudelson and Roman Vershynin. Hanson-wright inequality and sub-gaussian concentra- tion. Electron. Commun. Probab., 18(82):1–9, 2013

work page 2013

[46] [46]

Silverstein and Bai Z

Jack W. Silverstein and Bai Z. D. On the empirical distribution of eigenvalues of a class of large dimensional random matrices,. J. Multivar. Anal. , 54(2):175–192, 1995

work page 1995

[47] [47]

Simpson, Lloyd J

Sean L. Simpson, Lloyd J. Edwards, Martin A. Styner, and Keith E. Muller. Separability tests for high-dimensional, low-sample size multivariate repeated measures data. J. Appl. Stat. , 41(11):2450–2461, 2014

work page 2014

[48] [48]

Soloveychik and D

I. Soloveychik and D. Trushin. Gaussian and robust kronecker product covariance estimation: Existence and uniqueness. J. Multivar. Anal. , 149:92–113, 2016

work page 2016

[49] [49]

covsep: Tests for Determining if the Covariance Structure of 2-Dimensional Data is Separable, 2018

Shahin Tavakoli. covsep: Tests for Determining if the Covariance Structure of 2-Dimensional Data is Separable, 2018. R package version 1.1.0

work page 2018

[50] [50]

Tracy and Harold Widom

Craig A. Tracy and Harold Widom. Level-spacing distributions and the airy kernel. Commun. Math. Phys., 159(1):151–174, 1994

work page 1994

[51] [51]

Hero III, and Shuheng Zhou

Theodoros Tsiligkaridis, Alfred O. Hero III, and Shuheng Zhou. On convergence of kronecker graphical lasso algorithms. IEEE Trans. Signal Process., 61(7):1743–1755, 2013

work page 2013

[52] [52]

Hero III

Theodoros Tsiligkaridis and Alfred O. Hero III. Covariance estimation in high dimensions via kronecker product expansions. IEEE Trans. Signal Process., 61(21):5347–5360, 2013

work page 2013

[53] [53]

High-dimensional probability: An introduction with applications in data science, volume 47

Roman Vershynin. High-dimensional probability: An introduction with applications in data science, volume 47. Cambridge university press, 2018

work page 2018

[54] [54]

On the sphericity test with large-dimensional observations

Qinwen Wang and Jianfeng Yao. On the sphericity test with large-dimensional observations. Electron. J. Stat., 7:2164–2192, 2013

work page 2013

[55] [55]

High-dimensional sphericity test by extended likelihood ratio

Zhendong Wang and Xingzhong Xu. High-dimensional sphericity test by extended likelihood ratio. Metrika, 84:1169—-1212, 2021

work page 2021

[56] [56]

On estimation of covariance matrices with kronecker product structure

Karl Werner, Magnus Jansson, and Petre Stoica. On estimation of covariance matrices with kronecker product structure. IEEE Trans. Signal Process., 56(2):478–491, 2008

work page 2008

[57] [57]

Testing kronecker product covariance matrices for high-dimensional matrix-variate data

Long Yu, Jiahui Xie, and Wang Zhou. Testing kronecker product covariance matrices for high-dimensional matrix-variate data. Biometrika, 110(3):799––814, 2023. 25 Appendix In Appendix, we provide the proofs of the theoretical results and additional theoretical results in the main text (Section A), miscellaneous figures and tables supporting the results and...

work page 2023

[58] [58]

For a core covariance matrix with a rank−r partial isotropic structure, assume p1 = p2r without loss of generality

Lastly, if C has a rank −2 partial isotropic structure with p1 = p2, using β = det(A⊤A) in (24) and noting that tr( A⊤A) = p2 1(1 − λ), the result follows. For a core covariance matrix with a rank−r partial isotropic structure, assume p1 = p2r without loss of generality. Let ¯A = [vec( ¯A1), . . . ,vec( ¯Ar)] for ¯Ai in the proof of Proposition 2 for a co...

work page

[59] [59]

If (j, k) = (u, v), j = k, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤a)2] = E   p1X i1,i2=1 a2 i1a2 i2   = p1E[a4 1] + p1(p1 − 1)E[a2 1]E[a2 2] = p1ν4 + p1(p1 − 1)

work page

[60] [60]

If j = k, u = v, and j ̸= u, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤a)(b⊤b)] = E[a⊤a]E(b⊤b)] = E2[a⊤a] = p1X i=1 E[a2 i ] !2 = p2 1

work page

[61] [61]

If (j, k) = (u, v), j ̸= k, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)2] = E   p1X i1,i2=1 ai1bi1ai2bi2   = p1E[a2 1b2 1] + p1(p1 − 1)E[a1b1a2b2] = p1. 38

work page

[62] [62]

For instance, if j = u and (j − k)(j − v)(k − v) ̸= 0, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)(a⊤c)] = E   p1X i1,i2=1 ai1bi1ai2ci2   = nX i1,i2=1 E [ai1ai2] E[bi1]E[ci2] = 0

Otherwise, using the independence of entries of a, b, c, d yields that E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = 0. For instance, if j = u and (j − k)(j − v)(k − v) ̸= 0, E[(z⊤ 1jz1k)(z⊤ 1uz1v)] = E[(a⊤b)(a⊤c)] = E   p1X i1,i2=1 ai1bi1ai2ci2   = nX i1,i2=1 E [ai1ai2] E[bi1]E[ci2] = 0. One can compute the expectation for other remaining cases using the independence. A...

work page

[63] [63]

If (j, k) = (u, v), j = k, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)2] = p1

work page

[64] [64]

If j = k, u = v, and j ̸= u, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)(c⊤d)] = E[a⊤b]E[c⊤d] = 0

work page

[65] [65]

If (j, k) = (u, v), j ̸= k, E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = E[(a⊤b)2] = p1

work page

[66] [66]

To prove the second assertion, partition Ip as Ip = diag(Ip1,

Otherwise, as an analogy to the item 4), one can verify that E[(z⊤ 1jz2k)(z⊤ 1uz2v)] = 0. To prove the second assertion, partition Ip as Ip = diag(Ip1, . . . , Ip1 | {z } p2 ). Then R(Ip) =   W1 ... Wp2   ⇒ R(Ip)R(Ip)⊤ =   W1W ⊤ 1 · · · W1W ⊤ p2 ... ... ... Wp2W ⊤ 1 · · · Wp2W ⊤ p2   , where the row of Wi ∈ Rp2×p2 1 is vec(Ip1) on ith row and ...

work page

[67] [67]

To bound W11,11, (I) = nVar (a⊤a)2 = nVar   p1X i=1 a4 i + 2 X 1≤i<j≤p1 a2 i a2 j   ≤ 2nVar p1X i=1 a4 i ! + 16nVar   X 1≤i<j≤p1 a2 i a2 j   ≲ np1ν8 + 16nE[ X i1<j1,i2<j2 a2 i1a2 j1a2 i2a2 j2] ≲ np1ν8 + np4 1 = O(np4 1). (34) 41 Since j = k = u = v = 1, we have that (II) + (III) = 2n(n − 1)Var (a⊤b)2 = 2n(n − 1)Var   p1X i=1 a2 i b2 i + 2 X 1≤i<...

work page

[68] [68]

To bound W11,22, (I) = nVar (a⊤a)(b⊤b) ≤ nE[(a⊤a)2(b⊤b)2] = nE[(a⊤a)2]E[(b⊤b)2] = O(np4 1), (II) + (III) = 2n(n − 1)Var (a⊤b)(c⊤d) ≤ n(n − 1)E[(a⊤b)2(c⊤d)2] = 2n(n − 1)E2[(a⊤b)2] ≤ n2 E[ p1X i1=1 a2 i1b2 i1] !2 = O(n2p2 1). (37) 42 On the other hand, (IV) = 4n(n − 1)Cov (a⊤a)(b⊤b), (a⊤c)(b⊤d) = 4n(n − 1) E[(a⊤a)(b⊤b)(a⊤c)(b⊤d)] − E[(a⊤a)(b⊤b)]E[(a⊤c)]E[(b...

work page

[69] [69]

(39) As an analogy to (38), one may see that (III) = (IV) = (V) = 0

To bound W11,12, (I) = nVar (a⊤a)(a⊤b) ≤ nE[(a⊤a)2(a⊤b)2] = nE[ p1X i1,i2,i3,i4=1 a2 i1a2 i2ai3bi3ai4bi4] = nE[ p1X i1,i2,i3=1 a2 i1a2 i2a2 i3b2 i3] = O(np3 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) ≤ n(n − 1)E[(a⊤b)2(a⊤c)2] ≤ n2E[ p1X i1,i2,i3,i4=1 ai1bi1ai2bi2ai3ci3ai4ci4] = n2E[ p1X i1,i3=1 a2 i1b2 i1a2 i3c2 i3] = O(n2p2 1). (39) As an analogy to (38), one may...

work page

[70] [70]

Again one may observe that (III) = (IV) = (V) = 0

To bound W11,23, (I) = nVar (a⊤a)(b⊤c) = nE[(a⊤a)2(b⊤c)2] = nE[(a⊤a)2]E[(b⊤c)2] = O(np3 1), (II) = n(n − 1)Var (a⊤b)(c⊤d) = O(n2p2 1), (40) where the computation of (II) follows from (37). Again one may observe that (III) = (IV) = (V) = 0. Hence, from (40), W11,23 = O(np3 1 + n2p2 1). 43

work page

[71] [71]

(41) Also, (36) implies that (III) = n(n − 1)Cov (a⊤b)2, (c⊤d)2 = 0, (IV) = 4n(n − 1)Cov (a⊤b)2, (a⊤c)2 = O(n2p2 1), (V) = 4n(n − 1)(n − 2)Cov (a⊤b)2, (a⊤c)2 = O(n3p2 1)

To bound W12,12, deduce from (35) and (39) that (I) = nVar (a⊤b)2 = O(np2 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) = O(n2p2 1). (41) Also, (36) implies that (III) = n(n − 1)Cov (a⊤b)2, (c⊤d)2 = 0, (IV) = 4n(n − 1)Cov (a⊤b)2, (a⊤c)2 = O(n2p2 1), (V) = 4n(n − 1)(n − 2)Cov (a⊤b)2, (a⊤c)2 = O(n3p2 1). (42) From (41) and (42), we have that W12,12 = O(n3p2 1)

work page

[72] [72]

(43) As the same with the items 3) and 4), one can verify that (III) = (IV) = (V) = 0

To bound W12,13, (I) = nVar (a⊤b)(a⊤c) = O(np2 1), (II) = n(n − 1)Var (a⊤b)(a⊤c) = O(n2p2 1). (43) As the same with the items 3) and 4), one can verify that (III) = (IV) = (V) = 0. Hence, from (43), W12,13 = O(n2p2 1)

work page

[73] [73]

(44) Again we have that (III) = (IV) = (V) = 0

To bound W12,34, one can deduce from (37) that (I) = nVar (a⊤b)(c⊤d) = O(np2 1), (II) = n(n − 1)Var (a⊤b)(c⊤d) = O(n2p2 1). (44) Again we have that (III) = (IV) = (V) = 0. Thus, (44) implies that W12,34 = O(n2p2 1). Therefore, we verified (30) from the items 1)–7). Using the results from Lemma 4–5, we prove Theorem 5 and Corollary 3. Proof of Theorem 5 No...

work page

[74] [74]

47 A.5 Singular Values of R(C) In this subsection, we provide some results on the singular values of R(C)

log 12− c n3/2δ K2 ! ≲ 2 exp n log 12 − c n3/2δ K2 ! ≲ 2 exp −˜cn3/2δ , where the first ≲ holds because (A1) implies that p2 1 + p2 2 ≍ n, and the second ≲ holds for any constant ˜c ∈ (0, c/K2). 47 A.5 Singular Values of R(C) In this subsection, we provide some results on the singular values of R(C). As discussed in Section 4.3, we show that σ1(R(C)) scal...

work page

[75] [75]

We examine whether the above result holds when Σ is a core covariance matrix with a rank −r partial isotropic structure, focusing on r = 1 , 2

derived the sample eigenvalue bias, as follows (see Theorem 1.1–1.2 of [6]): for j ∈ J i, ℓj a.s.→ ( σ2 ai + γai ai−σ2 , if ai > 1 + √γ, σ2(1 + √γ)2, o.t. . We examine whether the above result holds when Σ is a core covariance matrix with a rank −r partial isotropic structure, focusing on r = 1 , 2. To align with the assumption of Σ, we set λ = 1/(1 + rc/...

work page

[76] [76]

Here ( p1, p2, n) = (20, 20, 1600)

in Proposition 3. Here ( p1, p2, n) = (20, 20, 1600). We conclude this section with the figures that numerically verify the results of Corollary 2 and Theorem 5, as well as the table that demonstrates the consistency of T3. To verify Corollary 2, we generate the test statistic T1 with (p1, p2, n) = (72, 72, 1296) with 1000 simulations, and assess whether ...

work page