pith. sign in

arxiv: 1805.06266 · v2 · pith:MSSN4D6Hnew · submitted 2018-05-16 · 💻 cs.CL

A Unified Model for Extractive and Abstractive Summarization using Inconsistency Loss

classification 💻 cs.CL
keywords modelabstractiveattentionextractiveinconsistencylesslossreadable
0
0 comments X
read the original abstract

We propose a unified model combining the strength of extractive and abstractive summarization. On the one hand, a simple extractive model can obtain sentence-level attention with high ROUGE scores but less readable. On the other hand, a more complicated abstractive model can obtain word-level dynamic attention to generate a more readable paragraph. In our model, sentence-level attention is used to modulate the word-level attention such that words in less attended sentences are less likely to be generated. Moreover, a novel inconsistency loss function is introduced to penalize the inconsistency between two levels of attentions. By end-to-end training our model with the inconsistency loss and original losses of extractive and abstractive models, we achieve state-of-the-art ROUGE scores while being the most informative and readable summarization on the CNN/Daily Mail dataset in a solid human evaluation.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.