pith. sign in

arxiv: 1704.04154 · v2 · pith:GFOREOFNnew · submitted 2017-04-13 · 💻 cs.CL

Learning Joint Multilingual Sentence Representations with Neural Machine Translation

classification 💻 cs.CL
keywords differentrepresentationssentencesentencesclosejointlanguagesmachine
0
0 comments X
read the original abstract

In this paper, we use the framework of neural machine translation to learn joint sentence representations across six very different languages. Our aim is that a representation which is independent of the language, is likely to capture the underlying semantics. We define a new cross-lingual similarity measure, compare up to 1.4M sentence representations and study the characteristics of close sentences. We provide experimental evidence that sentences that are close in embedding space are indeed semantically highly related, but often have quite different structure and syntax. These relations also hold when comparing sentences in different languages.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Improving Zero-shot Translation with Language-Independent Constraints

    cs.CL 2019-06 unverdicted novelty 4.0

    Language-independent constraints and regularization in multilingual Transformer NMT yield a 2.23 BLEU average gain on zero-shot pairs from the IWSLT 2017 dataset.