pith. machine review for the scientific record. sign in

arxiv: 1301.3570 · v1 · submitted 2013-01-16 · 📊 stat.ML

Recognition: unknown

A Nested HDP for Hierarchical Topic Models

Authors on Pith no claims yet
classification 📊 stat.ML
keywords hierarchicalnestedtopicncrpnhdpprocessaccordingalgorithm
0
0 comments X
read the original abstract

We develop a nested hierarchical Dirichlet process (nHDP) for hierarchical topic modeling. The nHDP is a generalization of the nested Chinese restaurant process (nCRP) that allows each word to follow its own path to a topic node according to a document-specific distribution on a shared tree. This alleviates the rigid, single-path formulation of the nCRP, allowing a document to more easily express thematic borrowings as a random effect. We demonstrate our algorithm on 1.8 million documents from The New York Times.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.