
arxiv: 1603.06270 · v2 · pith:OWIV4QONnew · submitted 2016-03-20 · 💻 cs.CL · cs.LG

Multi-Task Cross-Lingual Sequence Tagging from Scratch

classification 💻 cs.CL cs.LG
keywords model, cross-lingual, multi-task, sequence, tagging, deep, independent, joint

We present a deep hierarchical recurrent neural network for sequence tagging. Given a sequence of words, our model employs deep gated recurrent units on both character and word levels to encode morphology and context information, and applies a conditional random field layer to predict the tags. Our model is task independent, language independent, and feature engineering free. We further extend our model to multi-task and cross-lingual joint training by sharing the architecture and parameters. Our model achieves state-of-the-art results in multiple languages on several benchmark tasks including POS tagging, chunking, and NER. We also demonstrate that multi-task and cross-lingual joint training can improve the performance in various cases.
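The abstract states that a conditional random field layer predicts the tags but gives no detail on decoding. As a minimal sketch, assuming a standard linear-chain CRF decoded with the Viterbi algorithm (an assumption, not something the abstract confirms), the final step could look like the following. The names `viterbi_decode`, `emissions`, and `transitions` are hypothetical; in the described model the per-position scores would presumably come from the word-level recurrent encoder.

```python
from typing import List, Sequence


def viterbi_decode(
    emissions: Sequence[Sequence[float]],
    transitions: Sequence[Sequence[float]],
) -> List[int]:
    """Return the highest-scoring tag sequence for a linear-chain CRF.

    emissions[t][j]   : score of tag j at position t (hypothetical input,
                        assumed to come from an upstream encoder).
    transitions[i][j] : score of moving from tag i to tag j.
    """
    if not emissions:
        return []
    num_tags = len(emissions[0])
    # best[j] holds the best score of any path ending in tag j so far.
    best = list(emissions[0])
    backpointers: List[List[int]] = []
    for t in range(1, len(emissions)):
        new_best = []
        pointers = []
        for j in range(num_tags):
            # Choose the previous tag that maximizes the score of entering tag j.
            prev = max(range(num_tags), key=lambda i: best[i] + transitions[i][j])
            new_best.append(best[prev] + transitions[prev][j] + emissions[t][j])
            pointers.append(prev)
        best = new_best
        backpointers.append(pointers)
    # Trace back from the best final tag to recover the full path.
    last = max(range(num_tags), key=lambda j: best[j])
    path = [last]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    path.reverse()
    return path
```

With all-zero transition scores the decoder simply picks the best tag at each position; adding a penalty for switching tags shows how the transition matrix lets neighbouring predictions influence one another, which is the reason to use a CRF layer rather than independent per-token classification.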



Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks

    cs.CL 2019-07 unverdicted novelty 6.0

    DropAttention regularizes attention weights in fully-connected self-attention networks to reduce overfitting and improve performance.