pith. machine review for the scientific record. sign in

arxiv: 1809.01301 · v2 · pith:ROXNHI4Vnew · submitted 2018-09-05 · 💻 cs.CL

BPE and CharCNNs for Translation of Morphology: A Cross-Lingual Comparison and Analysis

classification 💻 cs.CL
keywords charcnnbeenbestcross-lingualdatalanguageslow-resourcesparsity
0
0 comments X
read the original abstract

Neural Machine Translation (NMT) in low-resource settings and of morphologically rich languages is made difficult in part by data sparsity of vocabulary words. Several methods have been used to help reduce this sparsity, notably Byte-Pair Encoding (BPE) and a character-based CNN layer (charCNN). However, the charCNN has largely been neglected, possibly because it has only been compared to BPE rather than combined with it. We argue for a reconsideration of the charCNN, based on cross-lingual improvements on low-resource data. We translate from 8 languages into English, using a multi-way parallel collection of TED transcripts. We find that in most cases, using both BPE and a charCNN performs best, while in Hebrew, using a charCNN over words is best.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.