pith. sign in

arxiv: 1606.05467 · v2 · pith:YCTPNPMTnew · submitted 2016-06-17 · 💻 cs.CL · cs.SI

Gender Inference using Statistical Name Characteristics in Twitter

classification 💻 cs.CL cs.SI
keywords namesgendernametwittercharacteristicsill-formedinferenceusers
0
0 comments X
read the original abstract

Much attention has been given to the task of gender inference of Twitter users. Although names are strong gender indicators, the names of Twitter users are rarely used as a feature; probably due to the high number of ill-formed names, which cannot be found in any name dictionary. Instead of relying solely on a name database, we propose a novel name classifier. Our approach extracts characteristics from the user names and uses those in order to assign the names to a gender. This enables us to classify international first names as well as ill-formed names.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.