pith. sign in

arxiv: 1811.00119 · v2 · pith:VCT3665Xnew · submitted 2018-10-31 · 💻 cs.CL

A task in a suit and a tie: paraphrase generation with semantic augmentation

classification 💻 cs.CL
keywords transformersaugmentationfindgainsgenerationincorporatingparaphraseparaphrases
0
0 comments X
read the original abstract

Paraphrasing is rooted in semantics. We show the effectiveness of transformers (Vaswani et al. 2017) for paraphrase generation and further improvements by incorporating PropBank labels via a multi-encoder. Evaluating on MSCOCO and WikiAnswers, we find that transformers are fast and effective, and that semantic augmentation for both transformers and LSTMs leads to sizable 2-3 point gains in BLEU, METEOR and TER. More importantly, we find surprisingly large gains on human evaluations compared to previous models. Nevertheless, manual inspection of generated paraphrases reveals ample room for improvement: even our best model produces human-acceptable paraphrases for only 28% of captions from the CHIA dataset (Sharma et al. 2018), and it fails spectacularly on sentences from Wikipedia. Overall, these results point to the potential for incorporating semantics in the task while highlighting the need for stronger evaluation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Decomposable Neural Paraphrase Generation

    cs.CL 2019-06 unverdicted novelty 6.0

    DNPG is a multi-component Transformer model that disentangles paraphrase generation across granularity levels and supports unsupervised domain adaptation, claiming competitive in-domain and superior cross-domain performance.