Interactive Semantic Featuring for Text Classification
Classification: cs.CL, stat.ML
Keywords: features, dictionary, classification, human-comprehensible, models, text, built, called
In text classification, dictionaries can be used to define human-comprehensible features. We propose an improvement to dictionary features called smoothed dictionary features. These features recognize document contexts instead of n-grams. We describe a principled methodology to solicit dictionary features from a teacher, and present results showing that models built using these human-comprehensible features are competitive with models trained with Bag of Words features.
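To make the starting point concrete, here is a minimal sketch of a plain (unsmoothed) dictionary feature: a binary indicator that fires when any term from a human-chosen word list appears in a document. It illustrates only the baseline that the abstract sets out to improve, not the smoothed dictionary features themselves, whose construction the abstract does not specify. The function names, the simple regex tokenizer, and the example dictionaries are all hypothetical.

```python
import re

# Hypothetical, hand-written dictionaries; in the paper's setting a teacher
# would supply these. Names and terms are illustrative only.
DICTIONARIES = {
    "sports": {"goal", "match", "team", "score"},
    "finance": {"stock", "market", "bond", "dividend"},
}


def tokenize(document):
    """Lowercase the text and split it into simple word tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9']+", document.lower())


def dictionary_feature(document, terms):
    """Return 1 if any dictionary term occurs as a token in the document, else 0."""
    return int(any(token in terms for token in tokenize(document)))


def featurize(document, dictionaries):
    """Map a document to one human-readable binary feature per dictionary."""
    return {
        name: dictionary_feature(document, terms)
        for name, terms in dictionaries.items()
    }


features = featurize("The team scored a late goal.", DICTIONARIES)
print(features)
```

Each feature is readable because it is named after its word list, but it matches exact tokens only: "scored" does not match "score" here. That brittleness is one plausible motivation for features that look at document context rather than the n-gram itself.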