Tensor decompositions for learning latent variable models

Anima Anandkumar; Daniel Hsu; Matus Telgarsky; Rong Ge; Sham M. Kakade

arxiv: 1210.7559 · v4 · pith:P7DQL47Onew · submitted 2012-10-29 · 💻 cs.LG · math.NA· stat.ML

Tensor decompositions for learning latent variable models

Anima Anandkumar , Rong Ge , Daniel Hsu , Sham M. Kakade , Matus Telgarsky This is my paper

classification 💻 cs.LG math.NAstat.ML

keywords tensordecompositionlatentmodelsestimationmatricesvariableapproaches

0 comments

read the original abstract

This work considers a computationally and statistically efficient parameter estimation method for a wide class of latent variable models---including Gaussian mixture models, hidden Markov models, and latent Dirichlet allocation---which exploits a certain tensor structure in their low-order observable moments (typically, of second- and third-order). Specifically, parameter estimation is reduced to the problem of extracting a certain (orthogonal) decomposition of a symmetric tensor derived from the moments; this decomposition can be viewed as a natural generalization of the singular value decomposition for matrices. Although tensor decompositions are generally intractable to compute, the decomposition of these specially structured tensors can be efficiently obtained by a variety of approaches, including power iterations and maximization approaches (similar to the case of matrices). A detailed analysis of a robust tensor power method is provided, establishing an analogue of Wedin's perturbation theorem for the singular vectors of matrices. This implies a robust and computationally tractable estimation approach for several popular latent variable models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Language Models as Knowledge Bases?
cs.CL 2019-09 accept novelty 7.0

BERT stores relational knowledge extractable via cloze queries without fine-tuning and matches supervised baselines on open-domain QA tasks.