On the Curse of Dimensionality in Private Sparse Covariance Estimation and PCA

Kevin Tian; Purnamrita Sarkar; Shourya Pandey; Syamantak Kumar

arxiv: 2606.21951 · v1 · pith:W754BME2new · submitted 2026-06-20 · 💻 cs.LG · cs.CR· cs.DS· math.ST· stat.TH

On the Curse of Dimensionality in Private Sparse Covariance Estimation and PCA

Syamantak Kumar , Shourya Pandey , Purnamrita Sarkar , Kevin Tian This is my paper

Pith reviewed 2026-06-26 12:10 UTC · model grok-4.3

classification 💻 cs.LG cs.CRcs.DSmath.STstat.TH

keywords differential privacysparse covariance estimationprincipal component analysishigh-dimensional statisticssample complexitycurse of dimensionalityrow-column sparsity

0 comments

The pith

Differentially private sparse covariance estimation and PCA require polynomially many samples in dimension even under row-column sparsity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that differential privacy imposes a polynomial dependence on dimension d for sparse covariance estimation and PCA under the k-row-column sparsity model, in contrast to the non-private setting where poly(k, log d) samples suffice. An upper bound of poly(k, log d) samples works for PCA when the leading eigenvector is also sparse. Complementary lower bounds of poly(d) samples hold for both problems under DP, yielding an exponential separation when k is polylog in d. The techniques also strengthen lower bounds for ordinary DP PCA without any sparsity assumptions.

Core claim

Under k-row-column sparsity of the covariance matrix, non-private algorithms achieve poly(k, log d) sample complexity for covariance estimation and PCA in operator norm, but standard DP requires Omega(d) samples; the paper proves that poly(d) lower bounds are necessary for both tasks under DP when k = polylog(d), while poly(k, log d) upper bounds remain possible for PCA if the leading eigenvector is additionally sparse.

What carries the argument

k-row-column sparsity (k-RCS) model of the covariance matrix together with differential privacy constraints on the output.

If this is right

Sparse covariance estimation under DP requires poly(d) samples even with k-RCS.
Sparse PCA under DP requires poly(d) samples even with k-RCS.
If the leading eigenvector is sparse, PCA under DP recovers poly(k, log d) sample complexity.
The same poly(d) lower bounds apply to standard DP PCA without sparsity assumptions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Sparsity of the matrix alone is insufficient to escape the curse of dimensionality under privacy; eigenvector sparsity is also needed for the upper bound.
The separation may extend to other high-dimensional sparse estimation tasks where privacy must be enforced on the output.
Practical high-dimensional private learning may need to assume stronger structural conditions than row-column sparsity.

Load-bearing premise

The covariance matrix exactly satisfies the k-row-column sparsity pattern and the differential privacy parameters match the stated definitions.

What would settle it

A concrete algorithm that achieves sub-polynomial(d) sample complexity for k-RCS covariance estimation or PCA while satisfying the same DP guarantee would falsify the lower bounds.

read the original abstract

We study high-dimensional differentially private (DP) covariance estimation in the operator norm, and principal component analysis (PCA), under $k$-row-column sparsity ($k$-RCS) of the covariance matrix. In the non-private setting, it is known that $\mathsf{poly}(k, \log d)$ samples suffice to solve both of these problems. However, the only comparable result known under DP (Wang et al. 2021) requires $\Omega(d)$ samples under standard parameterizations of the problem. We investigate when this curse of dimensionality is inherent for sparse covariance estimation tasks under DP. On the upper bound front, we show that a $\mathsf{poly}(k, \log d)$ sample complexity for PCA is possible under DP, if we also posit sparsity of the leading eigenvector. We complement this result with $\mathsf{poly}(d)$ lower bounds under DP for both sparse covariance estimation and PCA, establishing an exponential gap between the private and non-private variants of these problems when $k = \mathsf{polylog}(d)$. To our knowledge, no such separation has previously been demonstrated for any sparse estimation problems in private high-dimensional statistics. Our techniques are flexible enough that they imply stronger lower bounds even for the well-studied problem of standard DP PCA, without sparsity assumptions.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper shows poly(k, log d) DP-PCA samples are possible under k-RCS plus eigenvector sparsity, paired with poly(d) lower bounds for standard k-RCS covariance and PCA, but the claimed exponential gap does not yet apply to identical settings.

read the letter

The headline result is an exponential sample gap under DP for these sparse problems when k is polylog d. Non-private methods get poly(k, log d) for both covariance estimation and PCA under k-row-column sparsity. The paper gives a matching upper bound for DP-PCA once the leading eigenvector is also assumed sparse, then shows poly(d) lower bounds for the standard k-RCS versions of both problems. The techniques also strengthen the lower bound for ordinary DP-PCA without sparsity. That is the concrete advance over Wang et al. 2021.

The upper bound is a clean positive statement: the extra eigenvector sparsity lets you dodge the dimensional dependence. The lower bounds look like standard packing or information arguments and are stated for the usual k-RCS model. No obvious circularity or invented quantities.

The soft spot is the mismatch the stress-test note flags. The upper bound explicitly needs eigenvector sparsity; the lower bounds are proved for k-RCS covariance and PCA without that extra condition. Nothing in the abstract indicates whether the hard instances used for the lower bounds happen to have sparse leading eigenvectors. If they do not, the poly(d) lower bound does not rule out poly(k, log d) samples in the exact regime where the upper bound is proved. The separation therefore sits between two slightly different problem classes rather than a single tight pair.

This is worth a serious referee for anyone working on private high-dimensional estimation. The separation, if tightened to matching assumptions, would matter for algorithm design. Right now the gap needs clarification on the eigenvector condition before the exponential claim is fully supported. I would send it to review and ask the authors to address whether the lower-bound constructions satisfy the extra sparsity used in the upper bound.

Referee Report

2 major / 2 minor

Summary. The paper studies differentially private covariance estimation and PCA under the k-row-column sparsity (k-RCS) model. It shows that poly(k, log d) samples suffice for DP-PCA when the leading eigenvector is additionally sparse, while poly(d) lower bounds hold for both sparse covariance estimation and PCA under (standard) k-RCS; this yields an exponential separation from the non-private sample complexity when k = polylog(d). Stronger lower bounds are also claimed for ordinary DP-PCA without sparsity assumptions.

Significance. If correct, the results establish the first exponential private/non-private gap for any sparse estimation task in high-dimensional DP statistics. The upper bound demonstrates that an extra eigenvector-sparsity assumption suffices to recover the non-private sample complexity under DP, while the lower-bound techniques are noted to extend to the well-studied non-sparse DP-PCA setting.

major comments (2)

[Abstract and lower-bound section for PCA] The claimed exponential gap between private and non-private regimes requires that the hard instances used for the poly(d) lower bound on DP-PCA (under k-RCS) also satisfy the leading-eigenvector sparsity assumption used for the poly(k, log d) upper bound. If those instances do not have a sparse leading eigenvector, the separation applies only to a strictly larger problem class and does not directly compare the two regimes under identical modeling assumptions. The abstract explicitly qualifies the upper bound with the extra sparsity condition but does not state whether the lower-bound constructions meet it.
[Lower bound for sparse covariance estimation] §4 (or wherever the lower-bound reduction for sparse covariance estimation appears): the reduction must be checked to confirm it produces instances whose leading eigenvector is not sparse; otherwise the poly(d) lower bound does not rule out the poly(k, log d) regime that the upper bound targets.

minor comments (2)

[Introduction / Preliminaries] Notation for the k-RCS model and the additional eigenvector sparsity condition should be introduced with a single consistent definition early in the paper rather than piecemeal.
[Conclusion or lower-bound section] The statement that the techniques 'imply stronger lower bounds even for the well-studied problem of standard DP PCA' would benefit from an explicit corollary or theorem number that isolates the non-sparse case.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their thoughtful comments, which help clarify the scope of our results. Below we respond to the major comments point by point.

read point-by-point responses

Referee: [Abstract and lower-bound section for PCA] The claimed exponential gap between private and non-private regimes requires that the hard instances used for the poly(d) lower bound on DP-PCA (under k-RCS) also satisfy the leading-eigenvector sparsity assumption used for the upper bound. If those instances do not have a sparse leading eigenvector, the separation applies only to a strictly larger problem class and does not directly compare the two regimes under identical modeling assumptions. The abstract explicitly qualifies the upper bound with the extra sparsity condition but does not state whether the lower-bound constructions meet it.

Authors: The lower bounds apply to the k-RCS model without the additional leading eigenvector sparsity assumption. Our constructions for the lower bounds are designed such that the leading eigenvector is dense and does not satisfy the sparsity condition. This ensures there is no contradiction with the upper bound, which requires the additional assumption to achieve the improved sample complexity. The exponential gap is established for the standard k-RCS model. We will revise the manuscript to explicitly state that the lower bound instances do not have sparse leading eigenvectors, and clarify the distinction in the abstract and introduction. revision: yes
Referee: [Lower bound for sparse covariance estimation] §4 (or wherever the lower-bound reduction for sparse covariance estimation appears): the reduction must be checked to confirm it produces instances whose leading eigenvector is not sparse; otherwise the poly(d) lower bound does not rule out the poly(k, log d) regime that the upper bound targets.

Authors: We have checked the reduction for the sparse covariance estimation lower bound. The resulting instances have dense leading eigenvectors that do not satisfy the sparsity assumption used in the upper bound for PCA. Therefore, the poly(d) lower bound correctly applies to the standard k-RCS model and does not conflict with the poly(k, log d) upper bound under the additional assumption. We will add a note in the relevant section to confirm this property of the hard instances. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper's upper bound for DP-PCA is an algorithmic construction under k-RCS plus leading eigenvector sparsity, while the poly(d) lower bounds for covariance estimation and PCA are stated to rest on standard information-theoretic reductions under the k-RCS model. No quoted equations or steps reduce a claimed result to a fitted parameter, self-defined quantity, or load-bearing self-citation by construction. The derivation chain is self-contained against external benchmarks and does not match any enumerated circularity pattern.

Axiom & Free-Parameter Ledger

0 free parameters · 2 axioms · 0 invented entities

The central claims rest on the standard definition of differential privacy and the k-row-column sparsity model for the covariance matrix; these are domain assumptions drawn from prior literature rather than new postulates.

axioms (2)

standard math Standard definition of differential privacy
The entire analysis is conducted in the DP framework as stated in the problem setup.
domain assumption k-row-column sparsity (k-RCS) model for the covariance matrix
This sparsity structure is the key modeling choice enabling the poly(k, log d) non-private bounds and the claimed private separation.

pith-pipeline@v0.9.1-grok · 5784 in / 1567 out tokens · 38821 ms · 2026-06-26T12:10:32.301408+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

65 extracted references · 2 canonical work pages

[1]

The Annals of Statistics , number =

Zongming Ma , title =. The Annals of Statistics , number =. 2013 , doi =

2013
[2]

, title =

Spielman, Daniel A. , title =
[3]

2008 IEEE international symposium on information theory , pages=

High-dimensional analysis of semidefinite relaxations for sparse principal components , author=. 2008 IEEE international symposium on information theory , pages=. 2008 , organization=

2008
[4]

2012 , publisher=

Matrix analysis , author=. 2012 , publisher=

2012
[5]

Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024 , year =

Hilal Asi and Daogao Liu and Kevin Tian , title =. Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024 , year =

2024
[6]

The Annals of Probability , volume =

Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices , author =. The Annals of Probability , volume =. 2005 , doi =

2005
[7]

Samworth , journal =

Tengyao Wang and Quentin Berthet and Richard J. Samworth , journal =. STATISTICAL AND COMPUTATIONAL TRADE-OFFS IN ESTIMATION OF SPARSE PRINCIPAL COMPONENTS , urldate =
[8]

Conference on learning theory , pages=

Complexity theoretic lower bounds for sparse principal component detection , author=. Conference on learning theory , pages=. 2013 , organization=

2013
[9]

Bertsimas, Dimitris and Kitane, Driss Lahlou , title =. J. Mach. Learn. Res. , month = jan, articleno =. 2023 , issue_date =

2023
[10]

arXiv: Statistics Theory , year=

Covariance regularization by thresholding , author=. arXiv: Statistics Theory , year=
[11]

Advances in Neural Information Processing Systems , volume=

Sparse PCA from sparse linear regression , author=. Advances in Neural Information Processing Systems , volume=
[12]

Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=

Fingerprinting codes and the price of approximate differential privacy , author=. Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=
[13]

Tony and Ma, Zongming and Wu, Yihong , title =

Cai, T. Tony and Ma, Zongming and Wu, Yihong , title =. The Annals of Statistics , volume =. 2013 , doi =

2013
[14]

Tony Cai and Cun-Hui Zhang and Harrison H

T. Tony Cai and Cun-Hui Zhang and Harrison H. Zhou , journal =. OPTIMAL RATES OF CONVERGENCE FOR COVARIANCE MATRIX ESTIMATION , urldate =
[15]

Journal of the American Statistical Association , volume=

Adaptive thresholding for sparse covariance matrix estimation , author=. Journal of the American Statistical Association , volume=. 2011 , publisher=

2011
[16]

Tony Cai and Harrison H

T. Tony Cai and Harrison H. Zhou , journal =. OPTIMAL RATES OF CONVERGENCE FOR SPARSE COVARIANCE MATRIX ESTIMATION , urldate =
[17]

2024 , eprint=

Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices , author=. 2024 , eprint=

2024
[18]

The Journal of Machine Learning Research , volume=

A near-optimal algorithm for differentially-private principal components , author=. The Journal of Machine Learning Research , volume=. 2013 , publisher=

2013
[19]

Ullman , title =

Albert Cheu and Jonathan R. Ullman , title =
[20]

Advances in neural information processing systems , volume=

A direct formulation for sparse PCA using semidefinite programming , author=. Advances in neural information processing systems , volume=
[21]

Journal of Machine Learning Research , volume=

Sparse PCA via covariance thresholding , author=. Journal of Machine Learning Research , volume=
[22]

Foundations and Trends

The algorithmic foundations of differential privacy , author=. Foundations and Trends. 2014 , publisher=

2014
[23]

An Iterative Algorithm for Differentially Private \ k\ -

Johanna D. An Iterative Algorithm for Differentially Private \ k\ -. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=
[24]

Advances in Neural Information Processing Systems , volume=

Practical differentially private top-k selection with pay-what-you-get composition , author=. Advances in Neural Information Processing Systems , volume=
[25]

Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=

Analyze gauss: optimal bounds for privacy-preserving principal component analysis , author=. Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=
[26]

Journal of the Royal Statistical Society: Series B , volume=

Large covariance estimation by thresholding principal orthogonal complements , author=. Journal of the Royal Statistical Society: Series B , volume=
[27]

2022 , editor =

Tsfadia, Eliad and Cohen, Edith and Kaplan, Haim and Mansour, Yishay and Stemmer, Uri , booktitle =. 2022 , editor =

2022
[28]

Frobenius, Georg and Frobenius, Ferdinand Georg and Frobenius, Ferdinand Georg and Frobenius, Ferdinand Georg and Mathematician, Germany , year=
[29]

International Conference on Artificial Intelligence and Statistics , pages=

Minimax-optimal privacy-preserving sparse pca in distributed systems , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=

2018
[30]

IEEE Transactions on Information Theory , volume=

Lower bounds for constant weight codes , author=. IEEE Transactions on Information Theory , volume=. 2003 , publisher=

2003
[31]

arXiv preprint arXiv:2108.02534 , year=

Existence and polynomial time construction of biregular, bipartite Ramanujan graphs of all degrees , author=. arXiv preprint arXiv:2108.02534 , year=

arXiv
[32]

Proceedings of the forty-second ACM symposium on Theory of computing , pages=

On the geometry of differential privacy , author=. Proceedings of the forty-second ACM symposium on Theory of computing , pages=
[33]

Advances in neural information processing systems , volume=

The noisy power method: A meta algorithm with applications , author=. Advances in neural information processing systems , volume=
[34]

Johnstone and Arthur Yu Lu , title =

Iain M. Johnstone and Arthur Yu Lu , title =. Journal of the American Statistical Association , volume =. 2009 , publisher =. doi:10.1198/jasa.2009.0121 , note =

work page doi:10.1198/jasa.2009.0121 2009
[35]

Principal Component Analysis , author =
[36]

Conference on Learning Theory , pages=

Privately learning high-dimensional distributions , author=. Conference on Learning Theory , pages=. 2019 , organization=

2019
[37]

Conference on Learning Theory , pages=

Private mean estimation of heavy-tailed distributions , author=. Conference on Learning Theory , pages=. 2020 , organization=

2020
[38]

Proceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms , pages=

On differentially private low rank approximation , author=. Proceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms , pages=. 2013 , organization=

2013
[39]

9th Innovations in Theoretical Computer Science Conference (ITCS 2018) , pages =

Karwa, Vishesh and Vadhan, Salil , title =. 9th Innovations in Theoretical Computer Science Conference (ITCS 2018) , pages =. 2018 , volume =. doi:10.4230/LIPIcs.ITCS.2018.44 , annote =

work page doi:10.4230/lipics.itcs.2018.44 2018
[40]

Lee and Kobbi Nissim and Sofya Raskhodnikova and Adam D

Shiva Prasad Kasiviswanathan and Homin K. Lee and Kobbi Nissim and Sofya Raskhodnikova and Adam D. Smith , title =. 2011 , url =

2011
[41]

The Annals of Statistics , number =

Vladimir Koltchinskii and Karim Lounici , title =. The Annals of Statistics , number =. 2017 , doi =

2017
[42]

Bernoulli , pages=

Concentration inequalities and moment bounds for sample covariance operators , author=. Bernoulli , pages=. 2017 , publisher=

2017
[43]

Advances in Neural Information Processing Systems , volume=

Oja's algorithm for streaming sparse PCA , author=. Advances in Neural Information Processing Systems , volume=
[44]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Private Geometric Median in Nearly-Linear Time , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=
[45]

Mathematics , volume=

Differentially Private Sparse Covariance Matrix Estimation under Lower-Bounded Moment Assumption , author=. Mathematics , volume=. 2023 , publisher=

2023
[46]

Advances in neural information processing systems , volume=

DP-PCA: Statistically optimal and differentially private PCA , author=. Advances in neural information processing systems , volume=
[47]

Advances in Neural Information Processing Systems , volume=

Faster algorithms for user-level private stochastic convex optimization , author=. Advances in Neural Information Processing Systems , volume=
[48]

2024 , eprint =

Narayanan, Shyam , title =. 2024 , eprint =

2024
[49]

Kobbi Nissim and Chao Yan , title =. J. Priv. Confidentiality , volume =
[50]

Statistica Sinica , pages=

Asymptotics of sample eigenstructure for a large dimensional spiked covariance model , author=. Statistica Sinica , pages=. 2007 , publisher=

2007
[51]

Differentially Private

Acharya, Jayadev and Sun, Ziteng and Zhang, Huanyu , booktitle =. Differentially Private. 2021 , editor =

2021
[52]

International Conference on Machine Learning , pages=

Oneshot differentially private top-k selection , author=. International Conference on Machine Learning , pages=. 2021 , organization=

2021
[53]

Biometrika , volume =

Gradient-based sparse principal component analysis with extensions to online learning , author =. Biometrika , volume =. 2023 , month = jun, doi =

2023
[54]

Journal of the American Statistical Association , volume=

Generalized thresholding of large covariance matrices , author=. Journal of the American Statistical Association , volume=. 2009 , publisher=

2009
[55]

Conference on learning theory , pages=

Interactive fingerprinting codes and the hardness of preventing false discovery , author=. Conference on learning theory , pages=. 2015 , organization=

2015
[56]

2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) , pages=

Tight lower bounds for differentially private selection , author=. 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) , pages=. 2017 , organization=

2017
[57]

2018 , publisher=

High-dimensional probability: An introduction with applications in data science , author=. 2018 , publisher=

2018
[58]

arXiv preprint arXiv:1011.3027 , year=

Introduction to the non-asymptotic analysis of random matrices , author=. arXiv preprint arXiv:1011.3027 , year=

Pith/arXiv arXiv
[59]

The Annals of Statistics , volume=

Minimax sparse principal subspace estimation in high dimensions , author=. The Annals of Statistics , volume=
[60]

2019 , publisher=

High-dimensional statistics: A non-asymptotic viewpoint , author=. 2019 , publisher=

2019
[61]

Theoretical Computer Science , volume=

Differentially private high dimensional sparse covariance matrix estimation , author=. Theoretical Computer Science , volume=. 2021 , publisher=

2021
[62]

Biometrika , volume=

A useful variant of the Davis--Kahan theorem for statisticians , author=. Biometrika , volume=. 2015 , publisher=

2015
[63]

The Journal of Machine Learning Research , volume=

Truncated power method for sparse eigenvalue problems , author=. The Journal of Machine Learning Research , volume=. 2013 , publisher=

2013
[64]

Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016 , pages =

Zeyuan Allen Zhu and Yuanzhi Li , title =. Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016 , pages =

2016
[65]

Journal of computational and graphical statistics , volume=

Sparse principal component analysis , author=. Journal of computational and graphical statistics , volume=. 2006 , publisher=

2006

[1] [1]

The Annals of Statistics , number =

Zongming Ma , title =. The Annals of Statistics , number =. 2013 , doi =

2013

[2] [2]

, title =

Spielman, Daniel A. , title =

[3] [3]

2008 IEEE international symposium on information theory , pages=

High-dimensional analysis of semidefinite relaxations for sparse principal components , author=. 2008 IEEE international symposium on information theory , pages=. 2008 , organization=

2008

[4] [4]

2012 , publisher=

Matrix analysis , author=. 2012 , publisher=

2012

[5] [5]

Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024 , year =

Hilal Asi and Daogao Liu and Kevin Tian , title =. Advances in Neural Information Processing Systems 38: Annual Conference on Neural Information Processing Systems 2024 , year =

2024

[6] [6]

The Annals of Probability , volume =

Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices , author =. The Annals of Probability , volume =. 2005 , doi =

2005

[7] [7]

Samworth , journal =

Tengyao Wang and Quentin Berthet and Richard J. Samworth , journal =. STATISTICAL AND COMPUTATIONAL TRADE-OFFS IN ESTIMATION OF SPARSE PRINCIPAL COMPONENTS , urldate =

[8] [8]

Conference on learning theory , pages=

Complexity theoretic lower bounds for sparse principal component detection , author=. Conference on learning theory , pages=. 2013 , organization=

2013

[9] [9]

Bertsimas, Dimitris and Kitane, Driss Lahlou , title =. J. Mach. Learn. Res. , month = jan, articleno =. 2023 , issue_date =

2023

[10] [10]

arXiv: Statistics Theory , year=

Covariance regularization by thresholding , author=. arXiv: Statistics Theory , year=

[11] [11]

Advances in Neural Information Processing Systems , volume=

Sparse PCA from sparse linear regression , author=. Advances in Neural Information Processing Systems , volume=

[12] [12]

Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=

Fingerprinting codes and the price of approximate differential privacy , author=. Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=

[13] [13]

Tony and Ma, Zongming and Wu, Yihong , title =

Cai, T. Tony and Ma, Zongming and Wu, Yihong , title =. The Annals of Statistics , volume =. 2013 , doi =

2013

[14] [14]

Tony Cai and Cun-Hui Zhang and Harrison H

T. Tony Cai and Cun-Hui Zhang and Harrison H. Zhou , journal =. OPTIMAL RATES OF CONVERGENCE FOR COVARIANCE MATRIX ESTIMATION , urldate =

[15] [15]

Journal of the American Statistical Association , volume=

Adaptive thresholding for sparse covariance matrix estimation , author=. Journal of the American Statistical Association , volume=. 2011 , publisher=

2011

[16] [16]

Tony Cai and Harrison H

T. Tony Cai and Harrison H. Zhou , journal =. OPTIMAL RATES OF CONVERGENCE FOR SPARSE COVARIANCE MATRIX ESTIMATION , urldate =

[17] [17]

2024 , eprint=

Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices , author=. 2024 , eprint=

2024

[18] [18]

The Journal of Machine Learning Research , volume=

A near-optimal algorithm for differentially-private principal components , author=. The Journal of Machine Learning Research , volume=. 2013 , publisher=

2013

[19] [19]

Ullman , title =

Albert Cheu and Jonathan R. Ullman , title =

[20] [20]

Advances in neural information processing systems , volume=

A direct formulation for sparse PCA using semidefinite programming , author=. Advances in neural information processing systems , volume=

[21] [21]

Journal of Machine Learning Research , volume=

Sparse PCA via covariance thresholding , author=. Journal of Machine Learning Research , volume=

[22] [22]

Foundations and Trends

The algorithmic foundations of differential privacy , author=. Foundations and Trends. 2014 , publisher=

2014

[23] [23]

An Iterative Algorithm for Differentially Private \ k\ -

Johanna D. An Iterative Algorithm for Differentially Private \ k\ -. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

[24] [24]

Advances in Neural Information Processing Systems , volume=

Practical differentially private top-k selection with pay-what-you-get composition , author=. Advances in Neural Information Processing Systems , volume=

[25] [25]

Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=

Analyze gauss: optimal bounds for privacy-preserving principal component analysis , author=. Proceedings of the forty-sixth annual ACM symposium on Theory of computing , pages=

[26] [26]

Journal of the Royal Statistical Society: Series B , volume=

Large covariance estimation by thresholding principal orthogonal complements , author=. Journal of the Royal Statistical Society: Series B , volume=

[27] [27]

2022 , editor =

Tsfadia, Eliad and Cohen, Edith and Kaplan, Haim and Mansour, Yishay and Stemmer, Uri , booktitle =. 2022 , editor =

2022

[28] [28]

Frobenius, Georg and Frobenius, Ferdinand Georg and Frobenius, Ferdinand Georg and Frobenius, Ferdinand Georg and Mathematician, Germany , year=

[29] [29]

International Conference on Artificial Intelligence and Statistics , pages=

Minimax-optimal privacy-preserving sparse pca in distributed systems , author=. International Conference on Artificial Intelligence and Statistics , pages=. 2018 , organization=

2018

[30] [30]

IEEE Transactions on Information Theory , volume=

Lower bounds for constant weight codes , author=. IEEE Transactions on Information Theory , volume=. 2003 , publisher=

2003

[31] [31]

arXiv preprint arXiv:2108.02534 , year=

Existence and polynomial time construction of biregular, bipartite Ramanujan graphs of all degrees , author=. arXiv preprint arXiv:2108.02534 , year=

arXiv

[32] [32]

Proceedings of the forty-second ACM symposium on Theory of computing , pages=

On the geometry of differential privacy , author=. Proceedings of the forty-second ACM symposium on Theory of computing , pages=

[33] [33]

Advances in neural information processing systems , volume=

The noisy power method: A meta algorithm with applications , author=. Advances in neural information processing systems , volume=

[34] [34]

Johnstone and Arthur Yu Lu , title =

Iain M. Johnstone and Arthur Yu Lu , title =. Journal of the American Statistical Association , volume =. 2009 , publisher =. doi:10.1198/jasa.2009.0121 , note =

work page doi:10.1198/jasa.2009.0121 2009

[35] [35]

Principal Component Analysis , author =

[36] [36]

Conference on Learning Theory , pages=

Privately learning high-dimensional distributions , author=. Conference on Learning Theory , pages=. 2019 , organization=

2019

[37] [37]

Conference on Learning Theory , pages=

Private mean estimation of heavy-tailed distributions , author=. Conference on Learning Theory , pages=. 2020 , organization=

2020

[38] [38]

Proceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms , pages=

On differentially private low rank approximation , author=. Proceedings of the twenty-fourth annual ACM-SIAM symposium on Discrete algorithms , pages=. 2013 , organization=

2013

[39] [39]

9th Innovations in Theoretical Computer Science Conference (ITCS 2018) , pages =

Karwa, Vishesh and Vadhan, Salil , title =. 9th Innovations in Theoretical Computer Science Conference (ITCS 2018) , pages =. 2018 , volume =. doi:10.4230/LIPIcs.ITCS.2018.44 , annote =

work page doi:10.4230/lipics.itcs.2018.44 2018

[40] [40]

Lee and Kobbi Nissim and Sofya Raskhodnikova and Adam D

Shiva Prasad Kasiviswanathan and Homin K. Lee and Kobbi Nissim and Sofya Raskhodnikova and Adam D. Smith , title =. 2011 , url =

2011

[41] [41]

The Annals of Statistics , number =

Vladimir Koltchinskii and Karim Lounici , title =. The Annals of Statistics , number =. 2017 , doi =

2017

[42] [42]

Bernoulli , pages=

Concentration inequalities and moment bounds for sample covariance operators , author=. Bernoulli , pages=. 2017 , publisher=

2017

[43] [43]

Advances in Neural Information Processing Systems , volume=

Oja's algorithm for streaming sparse PCA , author=. Advances in Neural Information Processing Systems , volume=

[44] [44]

The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

Private Geometric Median in Nearly-Linear Time , author=. The Thirty-ninth Annual Conference on Neural Information Processing Systems , year=

[45] [45]

Mathematics , volume=

Differentially Private Sparse Covariance Matrix Estimation under Lower-Bounded Moment Assumption , author=. Mathematics , volume=. 2023 , publisher=

2023

[46] [46]

Advances in neural information processing systems , volume=

DP-PCA: Statistically optimal and differentially private PCA , author=. Advances in neural information processing systems , volume=

[47] [47]

Advances in Neural Information Processing Systems , volume=

Faster algorithms for user-level private stochastic convex optimization , author=. Advances in Neural Information Processing Systems , volume=

[48] [48]

2024 , eprint =

Narayanan, Shyam , title =. 2024 , eprint =

2024

[49] [49]

Kobbi Nissim and Chao Yan , title =. J. Priv. Confidentiality , volume =

[50] [50]

Statistica Sinica , pages=

Asymptotics of sample eigenstructure for a large dimensional spiked covariance model , author=. Statistica Sinica , pages=. 2007 , publisher=

2007

[51] [51]

Differentially Private

Acharya, Jayadev and Sun, Ziteng and Zhang, Huanyu , booktitle =. Differentially Private. 2021 , editor =

2021

[52] [52]

International Conference on Machine Learning , pages=

Oneshot differentially private top-k selection , author=. International Conference on Machine Learning , pages=. 2021 , organization=

2021

[53] [53]

Biometrika , volume =

Gradient-based sparse principal component analysis with extensions to online learning , author =. Biometrika , volume =. 2023 , month = jun, doi =

2023

[54] [54]

Journal of the American Statistical Association , volume=

Generalized thresholding of large covariance matrices , author=. Journal of the American Statistical Association , volume=. 2009 , publisher=

2009

[55] [55]

Conference on learning theory , pages=

Interactive fingerprinting codes and the hardness of preventing false discovery , author=. Conference on learning theory , pages=. 2015 , organization=

2015

[56] [56]

2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) , pages=

Tight lower bounds for differentially private selection , author=. 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS) , pages=. 2017 , organization=

2017

[57] [57]

2018 , publisher=

High-dimensional probability: An introduction with applications in data science , author=. 2018 , publisher=

2018

[58] [58]

arXiv preprint arXiv:1011.3027 , year=

Introduction to the non-asymptotic analysis of random matrices , author=. arXiv preprint arXiv:1011.3027 , year=

Pith/arXiv arXiv

[59] [59]

The Annals of Statistics , volume=

Minimax sparse principal subspace estimation in high dimensions , author=. The Annals of Statistics , volume=

[60] [60]

2019 , publisher=

High-dimensional statistics: A non-asymptotic viewpoint , author=. 2019 , publisher=

2019

[61] [61]

Theoretical Computer Science , volume=

Differentially private high dimensional sparse covariance matrix estimation , author=. Theoretical Computer Science , volume=. 2021 , publisher=

2021

[62] [62]

Biometrika , volume=

A useful variant of the Davis--Kahan theorem for statisticians , author=. Biometrika , volume=. 2015 , publisher=

2015

[63] [63]

The Journal of Machine Learning Research , volume=

Truncated power method for sparse eigenvalue problems , author=. The Journal of Machine Learning Research , volume=. 2013 , publisher=

2013

[64] [64]

Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016 , pages =

Zeyuan Allen Zhu and Yuanzhi Li , title =. Advances in Neural Information Processing Systems 29: Annual Conference on Neural Information Processing Systems 2016 , pages =

2016

[65] [65]

Journal of computational and graphical statistics , volume=

Sparse principal component analysis , author=. Journal of computational and graphical statistics , volume=. 2006 , publisher=

2006