pith. sign in

Fully Character-Level Neural Machine Translation without Explicit Segmentation

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it
abstract

Most existing machine translation systems operate at the level of words, relying on explicit segmentation to extract tokens. We introduce a neural machine translation (NMT) model that maps a source character sequence to a target character sequence without any segmentation. We employ a character-level convolutional network with max-pooling at the encoder to reduce the length of source representation, allowing the model to be trained at a speed comparable to subword-level models while capturing local regularities. Our character-to-character model outperforms a recently proposed baseline with a subword-level encoder on WMT'15 DE-EN and CS-EN, and gives comparable performance on FI-EN and RU-EN. We then demonstrate that it is possible to share a single character-level encoder across multiple languages by training a model on a many-to-one translation task. In this multilingual setting, the character-level encoder significantly outperforms the subword-level encoder on all the language pairs. We observe that on CS-EN, FI-EN and RU-EN, the quality of the multilingual character-level translation even surpasses the models specifically trained on that language pair alone, both in terms of BLEU score and human judgment.

years

2019 2

verdicts

UNVERDICTED 2

representative citing papers

Learning to Reformulate the Queries on the WEB

cs.IR · 2019-07-02 · unverdicted · novelty 5.0

An unsupervised character-level CNN encoder with attention-based RNN decoder, trained on Clueweb09 anchor phrases, generates query reformulations that improve retrieval on TREC collections.

citing papers explorer

Showing 2 of 2 citing papers.

  • Learning to Reformulate the Queries on the WEB cs.IR · 2019-07-02 · unverdicted · none · ref 30 · internal anchor

    An unsupervised character-level CNN encoder with attention-based RNN decoder, trained on Clueweb09 anchor phrases, generates query reformulations that improve retrieval on TREC collections.

  • Hierarchical Sequence to Sequence Voice Conversion with Limited Data eess.AS · 2019-07-15 · unverdicted · none · ref 53 · internal anchor

    Hierarchical seq2seq model for parallel voice conversion pretrained as autoencoder on single-speaker data then adapted to limited multispeaker data, using mel spectrograms converted via wavenet vocoder.