Neural Paraphrase Generation with Stacked Residual LSTM Networks

Aaditya Prakash; Ashequl Qadir; Joey Liu; Kathy Lee; Oladimeji Farri; Sadid A. Hasan; Vivek Datla

arxiv: 1610.03098 · v3 · pith:LHIAYG6Enew · submitted 2016-10-10 · 💻 cs.CL

Neural Paraphrase Generation with Stacked Residual LSTM Networks

Aaditya Prakash , Sadid A. Hasan , Kathy Lee , Vivek Datla , Ashequl Qadir , Joey Liu , Oladimeji Farri This is my paper

classification 💻 cs.CL

keywords generationlstmdeeplearningmodelsparaphraseresidualmodel

0 comments

read the original abstract

In this paper, we propose a novel neural approach for paraphrase generation. Conventional para- phrase generation methods either leverage hand-written rules and thesauri-based alignments, or use statistical machine learning principles. To the best of our knowledge, this work is the first to explore deep learning models for paraphrase generation. Our primary contribution is a stacked residual LSTM network, where we add residual connections between LSTM layers. This allows for efficient training of deep LSTMs. We evaluate our model and other state-of-the-art deep learning models on three different datasets: PPDB, WikiAnswers and MSCOCO. Evaluation results demonstrate that our model outperforms sequence to sequence, attention-based and bi- directional LSTM models on BLEU, METEOR, TER and an embedding-based sentence similarity metric.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Decomposable Neural Paraphrase Generation
cs.CL 2019-06 unverdicted novelty 6.0

DNPG is a multi-component Transformer model that disentangles paraphrase generation across granularity levels and supports unsupervised domain adaptation, claiming competitive in-domain and superior cross-domain performance.