Joining Hands: Exploiting Monolingual Treebanks for Parsing of Code-mixed Data

Dipti Misra Sharma; Irshad Ahmad Bhat; Manish Shrivastava; Riyaz Ahmad Bhat

arxiv: 1703.10772 · v1 · pith:7PFHJYQInew · submitted 2017-03-31 · 💻 cs.CL

Irshad Ahmad Bhat, Riyaz Ahmad Bhat, Manish Shrivastava, Dipti Misra Sharma

keywords: data, annotated, code-mixed, hindi, monolingual, parsing, strategies, annotations

In this paper, we propose efficient and less resource-intensive strategies for parsing code-mixed data. These strategies are not constrained by in-domain annotations; rather, they leverage pre-existing monolingual annotated resources for training. We show that these methods can produce significantly better results than an informed baseline. We also present a data set of 450 Hindi and English code-mixed tweets from Hindi multilingual speakers for evaluation. The data set is manually annotated with Universal Dependencies.
