Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging
classification
💻 cs.CL
keywords
disparatedistantlow-resourcepart-of-speechsourcessupervisionaccessannotated
read the original abstract
We introduce DsDs: a cross-lingual neural part-of-speech tagger that learns from disparate sources of distant supervision, and realistically scales to hundreds of low-resource languages. The model exploits annotation projection, instance selection, tag dictionaries, morphological lexicons, and distributed representations, all in a uniform framework. The approach is simple, yet surprisingly effective, resulting in a new state of the art without access to any gold annotated data.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.