pith. machine review for the scientific record. sign in

arxiv: 1902.07110 · v1 · submitted 2019-02-19 · 💻 cs.CL

Recognition: unknown

A novel repetition normalized adversarial reward for headline generation

Authors on Pith no claims yet
classification 💻 cs.CL
keywords repetitionadversarialrewardgeneratinggenerationincoherentmodelnormalized
0
0 comments X
read the original abstract

While reinforcement learning can effectively improve language generation models, it often suffers from generating incoherent and repetitive phrases \cite{paulus2017deep}. In this paper, we propose a novel repetition normalized adversarial reward to mitigate these problems. Our repetition penalized reward can greatly reduce the repetition rate and adversarial training mitigates generating incoherent phrases. Our model significantly outperforms the baseline model on ROUGE-1\,(+3.24), ROUGE-L\,(+2.25), and a decreased repetition-rate (-4.98\%).

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.