The local convexity of solving systems of quadratic equations

Chris D. White; Rachel Ward; Sujay Sanghavi

arxiv: 1506.07868 · v5 · pith:IYJVFOQUnew · submitted 2015-06-25 · 🧮 math.NA · math.OC · stat.ML



keywords: convexity · initialization · local · orthogonal · quadratic · times · descent · function


This paper considers the recovery of a rank $r$ positive semidefinite matrix $X X^T\in\mathbb{R}^{n\times n}$ from $m$ scalar measurements of the form $y_i := a_i^T X X^T a_i$ (i.e., quadratic measurements of $X$). Such problems arise in a variety of applications, including covariance sketching of high-dimensional data streams, quadratic regression, and quantum state tomography. A natural approach to this problem is to minimize the loss function $f(U) = \sum_i (y_i - a_i^TUU^Ta_i)^2$, which has an entire manifold of solutions given by $\{XO\}_{O\in\mathcal{O}_r}$, where $\mathcal{O}_r$ is the group of $r\times r$ orthogonal matrices. This loss is non-convex in the $n\times r$ matrix $U$, but methods like gradient descent are simple and easy to implement (as compared to semidefinite relaxation approaches). In this paper we show that once we have $m \geq C nr \log^2(n)$ samples from isotropic Gaussian $a_i$, with high probability (a) this function admits a dimension-independent region of local strong convexity on lines perpendicular to the solution manifold, and (b) with an additional polynomial factor of $r$ samples, a simple spectral initialization will land within the region of convexity with high probability. Together, these results imply that gradient descent with initialization (but no re-sampling) will converge linearly to the correct $X$, up to an orthogonal transformation. We believe that this general technique (local convexity reachable by spectral initialization) should prove applicable to a broader class of nonconvex optimization problems.
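The two-stage recipe in the abstract (spectral initialization, then plain gradient descent on $f$) can be illustrated with a small NumPy sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the initializer uses the Gaussian identity $\mathbb{E}[y\,a a^T] = 2XX^T + \mathrm{tr}(XX^T)\,I$, the loss is rescaled to $f(U)/(4m)$ (which only changes the step size), the step size is a heuristic, and the function names, the problem sizes, and the choice of an $X$ with orthonormal columns are hypothetical.

```python
import numpy as np

def spectral_init(A, y, r):
    """Hypothetical spectral initializer (an assumption, not the paper's exact scheme).

    For Gaussian a_i, E[y a a^T] = 2 X X^T + tr(X X^T) I and E[y] = tr(X X^T),
    so subtracting mean(y) * I leaves an estimate of 2 X X^T.
    """
    m, n = A.shape
    S = (A.T * y) @ A / m - y.mean() * np.eye(n)   # estimate of 2 X X^T
    vals, vecs = np.linalg.eigh(S)                 # eigenvalues in ascending order
    top_vals = np.clip(vals[-r:], 0.0, None) / 2.0 # top-r eigenvalues of X X^T estimate
    return vecs[:, -r:] * np.sqrt(top_vals)        # scale each eigenvector column

def gradient(U, A, y):
    """Gradient of the rescaled loss (1 / (4m)) * sum_i (y_i - a_i^T U U^T a_i)^2."""
    AU = A @ U                                     # row i holds a_i^T U
    resid = y - np.sum(AU**2, axis=1)              # y_i - a_i^T U U^T a_i
    return -(A.T @ (resid[:, None] * AU)) / A.shape[0]

def recover(A, y, r, step_scale=0.1, iters=500):
    """Spectral initialization followed by fixed-step gradient descent."""
    U = spectral_init(A, y, r)
    step = step_scale / np.linalg.norm(U, "fro") ** 2  # heuristic step size (assumption)
    for _ in range(iters):
        U = U - step * gradient(U, A, y)
    return U

# Synthetic test problem; sizes are illustrative assumptions.
rng = np.random.default_rng(0)
n, r, m = 10, 2, 4000
X, _ = np.linalg.qr(rng.standard_normal((n, r)))   # ground truth with orthonormal columns
A = rng.standard_normal((m, n))                    # rows are isotropic Gaussian a_i
y = np.sum((A @ X) ** 2, axis=1)                   # y_i = a_i^T X X^T a_i

U = recover(A, y, r)
# Compare U U^T with X X^T, since X is only identifiable up to an orthogonal transformation.
rel_err = np.linalg.norm(U @ U.T - X @ X.T) / np.linalg.norm(X @ X.T)
print(rel_err)
```

The error is measured on $UU^T$ rather than on $U$ because any $XO$ with $O\in\mathcal{O}_r$ gives the same measurements. The same measurements are reused at every iteration, in line with the "no re-sampling" setting described in the abstract.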


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Universality in Learning from Linear Measurements
math.ST · 2019-06 · unverdicted · novelty 7.0

The number of linear measurements for perfect structured signal recovery depends only on first and second moments of the measurement distribution, reducing analysis to the Gaussian case and yielding 3n measurements fo...