pith. sign in

arxiv: 2505.19925 · v2 · pith:MATR5NMOnew · submitted 2025-05-26 · 📊 stat.ME · cs.LG

Cellwise and Casewise Robust Covariance in High Dimensions

classification 📊 stat.ME cs.LG
keywords covarianceoutliersrobustcasewisecellwisecellrcovdatamatrix
0
0 comments X
read the original abstract

The sample covariance matrix is a cornerstone of multivariate statistics, but it is highly sensitive to outliers. These can be casewise outliers, such as cases belonging to a different population, or cellwise outliers, which are deviating cells (entries) of the data matrix. Recently some robust covariance estimators have been developed that can handle both types of outliers, but their computation is only feasible up to at most 20 dimensions. To remedy this we propose the cellRCov method, a robust covariance estimator that simultaneously handles casewise outliers, cellwise outliers, and missing data. It relies on a decomposition of the covariance on principal and orthogonal subspaces, leveraging recent work on robust PCA. It also employs a ridge-type regularization to stabilize the estimated covariance matrix. We establish some theoretical properties of cellRCov, including its casewise and cellwise influence functions as well as consistency and asymptotic normality. A simulation study demonstrates the superior performance of cellRCov in contaminated and missing data scenarios. Furthermore, its practical utility is illustrated in a real-world application to anomaly detection. We also construct and illustrate the cellRCCA method for robust and regularized canonical correlation analysis.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Cellwise and Casewise Robust Multivariate Regression with Inference

    stat.ME 2026-05 unverdicted novelty 7.0

    cellMR provides robust multivariate regression for casewise and cellwise outliers, missing data, and high dimensions, with cellBoot delivering asymptotically valid robust inference via indirect bootstrap.

  2. Explainable Outlier Detection for Multivariate Functional Data

    stat.ME 2026-05 unverdicted novelty 6.0

    A framework connects separable-covariance functional data to matrix-variate MMCD estimation and linear-complexity Shapley decompositions to deliver robust and explainable outlier detection.

  3. Cellwise Outliers

    stat.ME 2026-03 unverdicted novelty 2.0

    Cellwise outliers can contaminate over half the cases even at low proportions, necessitating specialized robust techniques for location, covariance, regression, PCA, and tensor data that differ from casewise approaches.