pith. sign in

arxiv: 1712.00499 · v1 · pith:L2QS5XBEnew · submitted 2017-12-01 · 💻 cs.LG · stat.ML

Prediction-Constrained Topic Models for Antidepressant Recommendation

classification 💻 cs.LG stat.ML
keywords datalabelsmodelstopicgenerativegoalsinterpretablepredicting
0
0 comments X
read the original abstract

Supervisory signals can help topic models discover low-dimensional data representations that are more interpretable for clinical tasks. We propose a framework for training supervised latent Dirichlet allocation that balances two goals: faithful generative explanations of high-dimensional data and accurate prediction of associated class labels. Existing approaches fail to balance these goals by not properly handling a fundamental asymmetry: the intended task is always predicting labels from data, not data from labels. Our new prediction-constrained objective trains models that predict labels from heldout data well while also producing good generative likelihoods and interpretable topic-word parameters. In a case study on predicting depression medications from electronic health records, we demonstrate improved recommendations compared to previous supervised topic models and high- dimensional logistic regression from words alone.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.