Neural Machine Transliteration: Preliminary Results

Amir H. Jadidinejad

arxiv: 1609.04253 · v1 · pith:WSB557KDnew · submitted 2016-09-14 · 💻 cs.CL

Neural Machine Transliteration: Preliminary Results

Amir H. Jadidinejad This is my paper

classification 💻 cs.CL

keywords sequenceneuralrecurrenttargettransliterationdecoderencoderencoder-decoder

0 comments

read the original abstract

Machine transliteration is the process of automatically transforming the script of a word from a source language to a target language, while preserving pronunciation. Sequence to sequence learning has recently emerged as a new paradigm in supervised learning. In this paper a character-based encoder-decoder model has been proposed that consists of two Recurrent Neural Networks. The encoder is a Bidirectional recurrent neural network that encodes a sequence of symbols into a fixed-length vector representation, and the decoder generates the target sequence using an attention-based recurrent neural network. The encoder, the decoder and the attention mechanism are jointly trained to maximize the conditional probability of a target sequence given a source sequence. Our experiments on different datasets show that the proposed encoder-decoder model is able to achieve significantly higher transliteration quality over traditional statistical models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning to Reformulate the Queries on the WEB
cs.IR 2019-07 unverdicted novelty 5.0

An unsupervised character-level CNN encoder with attention-based RNN decoder, trained on Clueweb09 anchor phrases, generates query reformulations that improve retrieval on TREC collections.