Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging

Barbara Plank; \v{Z}eljko Agi\'c

arxiv: 1808.09733 · v1 · pith:6BBLVEXMnew · submitted 2018-08-29 · 💻 cs.CL

Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging

Barbara Plank , \v{Z}eljko Agi\'c This is my paper

classification 💻 cs.CL

keywords disparatedistantlow-resourcepart-of-speechsourcessupervisionaccessannotated

0 comments

read the original abstract

We introduce DsDs: a cross-lingual neural part-of-speech tagger that learns from disparate sources of distant supervision, and realistically scales to hundreds of low-resource languages. The model exploits annotation projection, instance selection, tag dictionaries, morphological lexicons, and distributed representations, all in a uniform framework. The approach is simple, yet surprisingly effective, resulting in a new state of the art without access to any gold annotated data.

This paper has not been read by Pith yet.

Distant Supervision from Disparate Sources for Low-Resource Part-of-Speech Tagging

discussion (0)