Evaluating Informal-Domain Word Representations With UrbanDictionary

Adam Lopez; Naomi Saphra

arXiv: 1606.08270 · v1 · pith:F3XWSC6Znew · submitted 2016-06-27 · cs.CL

keywords: domains, evaluation, informal, metric, spelling, urbandictionary, collected, comment

Abstract

Existing corpora for intrinsic evaluation are not targeted towards tasks in informal domains such as Twitter or news comment forums. We want to test whether a representation of informal words fulfills the promise of eliding explicit text normalization as a preprocessing step. One possible evaluation metric for such domains is the proximity of spelling variants. We propose how such a metric might be computed and how a spelling variant dataset can be collected using UrbanDictionary.
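
The abstract does not spell out how the proximity of spelling variants would be scored, so the following is only a minimal sketch of one plausible reading, not the authors' method. It assumes that each informal variant is paired with a standard spelling, that proximity is measured by cosine similarity between word vectors, and that the summary statistic is the mean reciprocal rank of the standard spelling among the variant's neighbors. The function names, the toy vectors, and the example pairs are all hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


def variant_rank(embeddings, variant, standard):
    """1-based rank of `standard` among all other words, ordered by similarity to `variant`."""
    target = cosine(embeddings[variant], embeddings[standard])
    closer = sum(
        1
        for word, vec in embeddings.items()
        if word not in (variant, standard)
        and cosine(embeddings[variant], vec) > target
    )
    return closer + 1


def mean_reciprocal_rank(embeddings, pairs):
    """Average of 1/rank over the (variant, standard) pairs covered by the vocabulary."""
    scored = [(v, s) for v, s in pairs if v in embeddings and s in embeddings]
    if not scored:
        return 0.0
    return sum(1.0 / variant_rank(embeddings, v, s) for v, s in scored) / len(scored)


# Hypothetical 2-dimensional vectors, chosen by hand for illustration only.
toy_embeddings = {
    "tonight": [1.0, 0.0],
    "2nite": [0.9, 0.1],
    "because": [0.0, 1.0],
    "cuz": [0.6, 0.8],
    "banana": [0.5, 0.5],
}

# Hypothetical (variant, standard spelling) pairs, standing in for a collected dataset.
toy_pairs = [("2nite", "tonight"), ("cuz", "because")]

score = mean_reciprocal_rank(toy_embeddings, toy_pairs)
print(f"mean reciprocal rank: {score:.2f}")
```

Under this reading, a score near 1 would suggest that the representation places variants next to their standard spellings, which is the behavior needed to skip explicit normalization. Rank-based scoring is used here because raw cosine values are hard to compare across embedding models; that choice is also an assumption.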
