Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Erik B. Sudderth; Finale Doshi-Velez; Gabriel Hope; Leah Weiner; Michael C. Hughes; Roy H. Perlis; Thomas H. McCoy Jr.

arxiv: 1707.07341 · v1 · pith:D6HB3XGKnew · submitted 2017-07-23 · 📊 stat.ML · cs.AI· cs.LG

Prediction-Constrained Training for Semi-Supervised Mixture and Topic Models

Michael C. Hughes , Leah Weiner , Gabriel Hope , Thomas H. McCoy Jr. , Roy H. Perlis , Erik B. Sudderth , Finale Doshi-Velez This is my paper

classification 📊 stat.ML cs.AIcs.LG

keywords modelsdatatopiclabelslearningmixturesemi-supervisedtraining

0 comments

read the original abstract

Supervisory signals have the potential to make low-dimensional data representations, like those learned by mixture and topic models, more interpretable and useful. We propose a framework for training latent variable models that explicitly balances two goals: recovery of faithful generative explanations of high-dimensional data, and accurate prediction of associated semantic labels. Existing approaches fail to achieve these goals due to an incomplete treatment of a fundamental asymmetry: the intended application is always predicting labels from data, not data from labels. Our prediction-constrained objective for training generative models coherently integrates loss-based supervisory signals while enabling effective semi-supervised learning from partially labeled data. We derive learning algorithms for semi-supervised mixture and topic models using stochastic gradient descent with automatic differentiation. We demonstrate improved prediction quality compared to several previous supervised topic models, achieving predictions competitive with high-dimensional logistic regression on text sentiment analysis and electronic health records tasks while simultaneously learning interpretable topics.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

An adaptive variance estimator for relative sparsity
stat.ME 2026-05 unverdicted novelty 6.0

A new adaptive variance estimator for relative sparsity coefficients is introduced that fully utilizes the prior asymptotic normality theorem and incorporates variable selection effects.