Learning to Segment Inputs for NMT Favors Character-Level Processing

· 2018 · cs.CL · arXiv 1810.01480

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

open full Pith review browse 1 citing papers arXiv PDF

abstract

Most modern neural machine translation (NMT) systems rely on presegmented inputs. Segmentation granularity importantly determines the input and output sequence lengths, hence the modeling depth, and source and target vocabularies, which in turn determine model size, computational costs of softmax normalization, and handling of out-of-vocabulary words. However, the current practice is to use static, heuristic-based segmentations that are fixed before NMT training. This begs the question whether the chosen segmentation is optimal for the translation task. To overcome suboptimal segmentation choices, we present an algorithm for dynamic segmentation based on the Adaptative Computation Time algorithm (Graves 2016), that is trainable end-to-end and driven by the NMT objective. In an evaluation on four translation tasks we found that, given the freedom to navigate between different segmentation levels, the model prefers to operate on (almost) character level, providing support for purely character-level NMT models from a novel angle.

representative citing papers

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges

cs.CL · 2019-07-11 · unverdicted · novelty 5.0

A single multilingual NMT model for 103 languages trained on 25B examples demonstrates transfer learning benefits for low-resource languages.

citing papers explorer

Showing 1 of 1 citing paper.

Massively Multilingual Neural Machine Translation in the Wild: Findings and Challenges cs.CL · 2019-07-11 · unverdicted · none · ref 11 · internal anchor
A single multilingual NMT model for 103 languages trained on 25B examples demonstrates transfer learning benefits for low-resource languages.

Learning to Segment Inputs for NMT Favors Character-Level Processing

fields

years

verdicts

representative citing papers

citing papers explorer