The Fragility of Sparsity
Pith reviewed 2026-05-24 06:09 UTC · model grok-4.3
The pith
Sparsity-based linear regression estimates shift by two standard errors or more when the coding of categorical controls changes, while ordinary least squares estimates remain unchanged.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Estimates predicated on the assumption of sparsity are fragile in two ways. Different choices of the regressor matrix which do not impact ordinary least squares estimates, such as the choice of baseline category with categorical controls, can move sparsity-based estimates by two standard errors or more. Two tests of the sparsity assumption, developed by comparing sparsity-based estimators with OLS, tend to reject the sparsity assumption in all three applications. Unless the number of regressors is comparable to or exceeds the sample size, OLS yields more robust inference at little efficiency cost.
What carries the argument
Two tests that compare sparsity-based estimators to OLS, together with documented sensitivity of those estimators to arbitrary choices in the construction of the regressor matrix such as baseline-category coding.
If this is right
- Sparsity-based estimates can change materially with coding decisions that leave OLS unaffected.
- The sparsity assumption is rejected by the new tests in each of the three examined applications.
- Ordinary least squares supplies more robust inference than sparsity-based methods whenever the number of regressors is much smaller than the sample size.
- The efficiency advantage of sparsity methods is small relative to the gain in robustness from using OLS in typical settings.
Where Pith is reading between the lines
- Applied researchers using sparse methods may need to document stability across multiple plausible codings of categorical variables.
- The same fragility pattern could appear in other regularized estimators that rely on implicit selection among many regressors.
- Routine reporting of both sparse and OLS results side by side would let readers assess sensitivity directly.
Load-bearing premise
The two tests that compare sparsity-based estimators to OLS correctly detect when the sparsity assumption is violated.
What would settle it
Finding new data sets in which the two tests fail to reject sparsity or in which sparsity-based estimates remain stable across different codings of the same categorical controls would undermine the claim of general fragility.
read the original abstract
We show, using three empirical applications, that linear regression estimates predicated on the assumption of sparsity are fragile in two ways. First, we document that different choices of the regressor matrix which do not impact ordinary least squares (OLS) estimates, such as the choice of baseline category with categorical controls, can move sparsity-based estimates by two standard errors or more. Second, we develop two tests of the sparsity assumption by comparing sparsity-based estimators with OLS. The tests tend to reject the sparsity assumption in all three applications. Unless the number of regressors is comparable to or exceeds the sample size, OLS yields more robust inference at little efficiency cost.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper claims that linear regression estimates based on sparsity assumptions are fragile in two ways, documented via three empirical applications. Different choices of the regressor matrix (e.g., baseline category for categorical controls) that leave OLS estimates unchanged can shift sparsity-based estimates by two or more standard errors. Two new tests comparing sparsity estimators to OLS reject the sparsity assumption in the applications. OLS is recommended for robust inference unless the number of regressors is comparable to or exceeds sample size.
Significance. If the tests are valid detectors of sparsity violations, the results would illustrate concrete fragility of penalized estimators to innocuous design choices and provide evidence against sparsity in typical settings, favoring OLS with minimal efficiency cost. This would contribute to the high-dimensional regression literature by emphasizing robustness considerations in applied work.
major comments (2)
- [Section on the two tests of the sparsity assumption] The section developing the two tests: provides no analytic derivation of asymptotic size under the exact null (sparsity holds and design satisfies the paper's conditions). There is also no Monte Carlo evidence that the tests control size under regressor matrices with categorical controls or normalizations that the paper shows affect the penalized estimators. This is load-bearing for reading the application rejections as evidence against sparsity.
- [Empirical applications] Empirical applications: the abstract describes the strategy and rejections but supplies no details on test construction, exact specifications, or handling of multiple testing, so the degree to which the data support the central claim cannot be verified from the provided information.
minor comments (1)
- Some figures or tables could benefit from clearer labeling of the exact regressor-matrix variations being compared.
Simulated Author's Rebuttal
We thank the referee for the detailed and constructive report. The comments highlight important issues regarding the validation of our proposed tests and the presentation of the empirical results. We respond to each major comment below and indicate planned revisions.
read point-by-point responses
-
Referee: [Section on the two tests of the sparsity assumption] The section developing the two tests: provides no analytic derivation of asymptotic size under the exact null (sparsity holds and design satisfies the paper's conditions). There is also no Monte Carlo evidence that the tests control size under regressor matrices with categorical controls or normalizations that the paper shows affect the penalized estimators. This is load-bearing for reading the application rejections as evidence against sparsity.
Authors: We agree that formal size validation is important for interpreting the rejections. The manuscript does not contain an analytic derivation of asymptotic size under the exact null, as deriving the limiting distribution of the proposed tests (which compare penalized estimators to OLS) under sparsity is technically involved and depends on the specific penalty and design conditions. However, we will add Monte Carlo evidence in the revision demonstrating that the tests control size at conventional levels under regressor matrices that include categorical controls and the normalizations highlighted in the applications. This will directly address the concern for the designs used in the paper. revision: yes
-
Referee: Empirical applications: the abstract describes the strategy and rejections but supplies no details on test construction, exact specifications, or handling of multiple testing, so the degree to which the data support the central claim cannot be verified from the provided information.
Authors: The abstract is intentionally concise and focuses on the high-level strategy and findings. Full details on test construction, the exact specifications (including all controls, normalizations, and penalty choices), and any handling of multiple testing across applications are provided in Sections 3, 4, and 5 of the manuscript. To improve accessibility, we will revise the abstract to briefly reference the test approach and direct readers to the main text for specifications. revision: partial
Circularity Check
No circularity; derivation and tests are self-contained
full rationale
The paper introduces two new tests that compare sparsity-based estimators to OLS and applies them directly to three external empirical datasets. The documented fragility arises from explicit sensitivity checks on regressor matrix choices (e.g., baseline categories) that leave OLS unchanged. No load-bearing step reduces by construction to a fitted parameter, self-definition, or self-citation chain; the tests are presented as original contributions without invoking prior author work as an unverified uniqueness theorem or ansatz. The central claims rest on observable empirical differences rather than tautological equivalence to inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Standard linear regression assumptions hold so that OLS serves as a reliable benchmark
Forward citations
Cited by 1 Pith paper
-
Variance or Standard Deviation? Shell Geometry and Global-Scale Priors in High-Dimensional Shrinkage
Under a radial-power benchmark, the SD-flat prior has a one-unit asymptotic risk advantage near the origin over the variance-flat prior, with crossover in the critical regime and second-order equivalence for strong signals.
Reference graph
Works this paper leans on
-
[1]
http://jmlr.org/papers/v15/javanmard14a.html Jochmans, K. (2022). Heteroscedasticity-robust inference in linear regression models with many covariates. Journal of the American Statistical Association , 117 (538), 887–896. https://doi.org/10.1080/01621459.2020.1831924 Kline, P., Saggio, R., & Sølvsten, M. (2020). Leave-out estimation of variance components...
-
[2]
https://doi.org/10.1111/rssb.12026 45
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.