pith. sign in

arxiv: 1803.07416 · v1 · pith:LCAZFOOLnew · submitted 2018-03-16 · 💻 cs.LG · cs.CL· stat.ML

Tensor2Tensor for Neural Machine Translation

classification 💻 cs.LG cs.CLstat.ML
keywords machineneuraltensor2tensortranslationdeepimplementationincludeslearning
0
0 comments X
read the original abstract

Tensor2Tensor is a library for deep learning models that is well-suited for neural machine translation and includes the reference implementation of the state-of-the-art Transformer model.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 4 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Hindi Visual Genome: A Dataset for Multimodal English-to-Hindi Machine Translation

    cs.CL 2019-07 unverdicted novelty 7.0

    The paper releases the first multimodal English-Hindi machine translation dataset of 31,525 segments with images and a challenge test set of 1,400 segments selected via embedding similarity for image-resolvable ambiguities.

  2. HuggingFace's Transformers: State-of-the-art Natural Language Processing

    cs.CL 2019-10 accept novelty 6.0

    Hugging Face releases an open-source Python library that supplies a unified API and pretrained weights for major Transformer architectures used in natural language processing.

  3. Widening the Representation Bottleneck in Neural Machine Translation with Lexical Shortcuts

    cs.CL 2019-06 conditional novelty 6.0

    Gated lexical shortcut connections added to the transformer yield 0.9 BLEU average gains on five WMT directions while lowering the lexical content stored in hidden states.

  4. Comparing Semi-Parametric Model Learning Algorithms for Dynamic Model Estimation in Robotics

    stat.ML 2019-06 unverdicted novelty 5.0

    Semi-parametric Gaussian process regression yields the most accurate inverse dynamics models in most tested robotic scenarios compared to parametric, non-parametric, and other semi-parametric baselines.