Convergence Rates of Latent Topic Models Under Relaxed Identifiability Conditions

Yining Wang

arxiv: 1710.11070 · v2 · pith:CWNOCEZMnew · submitted 2017-10-30 · 📊 stat.ML · cs.LG

Convergence Rates of Latent Topic Models Under Relaxed Identifiability Conditions

Yining Wang This is my paper

classification 📊 stat.ML cs.LG

keywords convergenceratelatentmodelstopicallocationanandkumarassuming

0 comments

read the original abstract

In this paper we study the frequentist convergence rate for the Latent Dirichlet Allocation (Blei et al., 2003) topic models. We show that the maximum likelihood estimator converges to one of the finitely many equivalent parameters in Wasserstein's distance metric at a rate of $n^{-1/4}$ without assuming separability or non-degeneracy of the underlying topics and/or the existence of more than three words per document, thus generalizing the previous works of Anandkumar et al. (2012, 2014) from an information-theoretical perspective. We also show that the $n^{-1/4}$ convergence rate is optimal in the worst case.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning Mixtures of Nonparametric and Convolutional Measures on Effectively Low-dimensional Affine Spaces
math.ST 2026-04 unverdicted novelty 6.0

Mixtures of convolutional measures on low-dimensional affine spaces admit unique identifiability in semi-parametric settings and posterior contraction rates under convex polytope support assumptions in a well-specifie...