pith. sign in

arxiv: 1808.10792 · v2 · pith:5LLLNXINnew · submitted 2018-08-31 · 💻 cs.CL · cs.AI· cs.LG

Bottom-Up Abstractive Summarization

classification 💻 cs.CL cs.AIcs.LG
keywords contentselectorabstractivebottom-upfluentotherphrasesselection
0
0 comments X
read the original abstract

Neural network-based methods for abstractive summarization produce outputs that are more fluent than other techniques, but which can be poor at content selection. This work proposes a simple technique for addressing this issue: use a data-efficient content selector to over-determine phrases in a source document that should be part of the summary. We use this selector as a bottom-up attention step to constrain the model to likely phrases. We show that this approach improves the ability to compress text, while still generating fluent summaries. This two-step process is both simpler and higher performing than other end-to-end content selection models, leading to significant improvements on ROUGE for both the CNN-DM and NYT corpus. Furthermore, the content selector can be trained with as little as 1,000 sentences, making it easy to transfer a trained summarizer to a new domain.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Enriching and Controlling Global Semantics for Text Summarization

    cs.CL 2021-09 unverdicted novelty 5.0

    A normalizing-flow neural topic model plus control mechanism are added to Transformer summarizers to supply and regulate global semantics, with reported gains over prior models on five benchmarks.