arxiv: 1504.06729 · v4 · pith:UUAYZAXGnew · submitted 2015-04-25 · 💻 cs.DS

Optimal Principal Component Analysis in Distributed and Streaming Models

classification 💻 cs.DS
keywords distributed, streaming, times, analysis, component, epsilon, matrix, models

We study the Principal Component Analysis (PCA) problem in the distributed and streaming models of computation. Given a matrix $A \in \mathbb{R}^{m \times n}$, a rank parameter $k < \operatorname{rank}(A)$, and an accuracy parameter $0 < \epsilon < 1$, we want to output an $m \times k$ orthonormal matrix $U$ for which $$ \| A - U U^T A \|_F^2 \le (1 + \epsilon) \cdot \| A - A_k \|_F^2, $$ where $A_k \in \mathbb{R}^{m \times n}$ is the best rank-$k$ approximation to $A$. This paper provides improved algorithms for distributed PCA and streaming PCA.
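
To make the guarantee in the abstract concrete, here is a minimal NumPy sketch that checks whether a candidate $U$ meets the $(1 + \epsilon)$ bound. It is not one of the paper's distributed or streaming algorithms: it computes everything centrally with an exact SVD, and the helper names (`pca_error`, `best_rank_k_error`, `satisfies_guarantee`) and the random test matrix are assumptions made for illustration only.

```python
import numpy as np


def pca_error(A, U):
    """Squared Frobenius error ||A - U U^T A||_F^2 of projecting A onto span(U)."""
    return float(np.linalg.norm(A - U @ (U.T @ A), "fro") ** 2)


def best_rank_k_error(A, k):
    """||A - A_k||_F^2, which equals the sum of squared singular values past the k-th."""
    s = np.linalg.svd(A, compute_uv=False)
    return float(np.sum(s[k:] ** 2))


def satisfies_guarantee(A, U, k, eps):
    """True if U meets the (1 + eps) relative-error bound from the abstract."""
    return pca_error(A, U) <= (1.0 + eps) * best_rank_k_error(A, k)


# Illustrative data (hypothetical): a random 50 x 30 matrix, k = 5, eps = 0.1.
rng = np.random.default_rng(0)
A = rng.standard_normal((50, 30))
k, eps = 5, 0.1

# The top-k left singular vectors attain the optimum, so they meet the bound.
U_exact = np.linalg.svd(A, full_matrices=False)[0][:, :k]

# An arbitrary orthonormal m x k matrix, for comparison; it can do no better
# than the optimum and is not expected to meet the bound in general.
U_random = np.linalg.qr(rng.standard_normal((A.shape[0], k)))[0]

print(satisfies_guarantee(A, U_exact, k, eps))
print(pca_error(A, U_random) / best_rank_k_error(A, k))
```

The second printed value is the ratio of the random subspace's error to the optimal error; any $U$ with ratio at most $1 + \epsilon$ is an acceptable output under the definition above.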
