Point-less: More Abstractive Summarization with Pointer-Generator Networks

David Rau; Eelco van der Wel; Freek Boutkan; Jorn Ranzijn

arxiv: 1905.01975 · v1 · pith:USY2WEIBnew · submitted 2019-04-18 · 💻 cs.CL

Point-less: More Abstractive Summarization with Pointer-Generator Networks

Freek Boutkan , Jorn Ranzijn , David Rau , Eelco van der Wel This is my paper

classification 💻 cs.CL

keywords abstractivedropoutrougeattentionhoweverlossmultiheadpointer-generator

0 comments

read the original abstract

The Pointer-Generator architecture has shown to be a big improvement for abstractive summarization seq2seq models. However, the summaries produced by this model are largely extractive as over 30% of the generated sentences are copied from the source text. This work proposes a multihead attention mechanism, pointer dropout, and two new loss functions to promote more abstractive summaries while maintaining similar ROUGE scores. Both the multihead attention and dropout do not improve N-gram novelty, however, the dropout acts as a regularizer which improves the ROUGE score. The new loss function achieves significantly higher novel N-grams and sentences, at the cost of a slightly lower ROUGE score.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Optimizing Abstractive Summarization With Fine-Tuned PEGASUS
cs.CL 2026-06 unverdicted novelty 2.0

Fine-tuned PEGASUS achieves state-of-the-art ROUGE scores on XL-Sum English corpus with 4.04% ROUGE-1, 15.25% ROUGE-2, and 3.39% ROUGE-L gains over mT5 baseline.