Evaluating topic coherence measures

Alexander Hinneburg; Andreas Both; Frank Rosner; Martin Nettling; Michael Röder

arxiv: 1403.6397 · v1 · pith:FNMSN3G6new · submitted 2014-03-25 · cs.LG · cs.CL · cs.IR

Frank Rosner, Alexander Hinneburg, Michael Röder, Martin Nettling, Andreas Both

classification cs.LG · cs.CL · cs.IR

keywords coherence · measures · topic · topics · word · pairs · score · annotations

Topic models extract representative word sets, called topics, from word counts in documents without requiring any semantic annotations. Topics are not guaranteed to be readily interpretable; coherence measures have therefore been proposed to distinguish good topics from bad ones. Studies of topic coherence have so far been limited to measures that score pairs of individual words. For the first time, we include coherence measures from scientific philosophy that score pairs of more complex word subsets, and we apply them to topic scoring.
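
To make the idea of a measure that scores pairs of individual words concrete, the following is a minimal sketch, not the method evaluated in the paper. It assumes pointwise mutual information (PMI) as the pair score, estimates probabilities from Boolean document co-occurrence, and averages over all word pairs of a topic. The function name `pairwise_pmi_coherence`, the smoothing constant `eps`, and the toy corpus are all hypothetical choices made for illustration.

```python
import math
from itertools import combinations


def pairwise_pmi_coherence(topic_words, documents, eps=1e-12):
    """Average PMI over all unordered pairs of topic words.

    Assumption: probabilities are document frequencies, i.e. the fraction
    of documents that contain a word (or both words of a pair).
    """
    doc_sets = [set(doc) for doc in documents]
    n_docs = len(doc_sets)

    def prob(*words):
        # Fraction of documents containing every word in `words`.
        return sum(all(w in d for w in words) for d in doc_sets) / n_docs

    scores = []
    for w1, w2 in combinations(topic_words, 2):
        p1, p2, p12 = prob(w1), prob(w2), prob(w1, w2)
        # eps keeps the logarithm finite when the pair never co-occurs.
        scores.append(math.log((p12 + eps) / (p1 * p2 + eps)))
    return sum(scores) / len(scores)


# Toy corpus (illustrative only): each document is a list of tokens.
docs = [
    ["cat", "dog", "pet"],
    ["cat", "dog"],
    ["stock", "market"],
    ["stock", "market", "trade"],
]

good_score = pairwise_pmi_coherence(["cat", "dog"], docs)
bad_score = pairwise_pmi_coherence(["cat", "market"], docs)
```

In this toy corpus the words of the first topic always co-occur while those of the second never do, so `good_score` is higher than `bad_score`. A measure over word subsets, as described in the abstract, would instead compare sets of words rather than single words on each side of the pair.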
