
arxiv: 1509.00685 · v2 · pith:ZPDTBANJnew · submitted 2015-09-02 · 💻 cs.CL · cs.AI

A Neural Attention Model for Abstractive Sentence Summarization

keywords: model, abstractive, sentence, summarization, amount, approach, attention, attention-based

Summarization based on text extraction is inherently limited, but generation-style abstractive methods have proven challenging to build. In this work, we propose a fully data-driven approach to abstractive sentence summarization. Our method utilizes a local attention-based model that generates each word of the summary conditioned on the input sentence. While the model is structurally simple, it can easily be trained end-to-end and scales to a large amount of training data. The model shows significant performance gains on the DUC-2004 shared task compared with several strong baselines.
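To make the idea of generating each summary word "conditioned on the input sentence" concrete, here is a minimal sketch of generic soft attention in plain Python. It is an illustration under stated assumptions, not the paper's architecture: the dot-product scoring, the toy vectors, and the names `softmax` and `attention_context` are all hypothetical choices made for this example.

```python
import math


def softmax(scores):
    """Turn raw scores into a probability distribution (numerically stable)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_context(input_vectors, query):
    """Weight each input-word vector by its relevance to the query.

    Assumption: relevance is a plain dot product between an input-word
    vector and a query vector standing in for the summary generated so far.
    """
    scores = [sum(x_i * q_i for x_i, q_i in zip(x, query)) for x in input_vectors]
    weights = softmax(scores)
    dim = len(input_vectors[0])
    # The context is the attention-weighted average of the input vectors.
    context = [sum(w * x[d] for w, x in zip(weights, input_vectors)) for d in range(dim)]
    return weights, context


# Toy input sentence of three "words", each a 2-dimensional vector (made-up values).
input_vectors = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
# Hypothetical query vector representing the current decoding context.
query = [0.0, 2.0]

weights, context = attention_context(input_vectors, query)
print(weights)
print(context)
```

In a full model, a context vector like this would feed a layer that scores the next summary word, and the weights would be recomputed at every generation step.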



Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Learning to summarize from human feedback

    cs.CL 2020-09 conditional novelty 7.0

    Reinforcement learning on a reward model trained from human summary comparisons produces summaries humans prefer over supervised fine-tuning or human references on TL;DR and transfers to CNN/DM.

  2. CodeBERT: A Pre-Trained Model for Programming and Natural Languages

    cs.CL 2020-02 unverdicted novelty 6.0

    CodeBERT pre-trains a bimodal model on code and text pairs plus unimodal data to achieve state-of-the-art results on natural language code search and code documentation generation.

  3. CTRL: A Conditional Transformer Language Model for Controllable Generation

    cs.CL 2019-09 unverdicted novelty 6.0

    CTRL is a large conditional transformer language model that uses naturally occurring control codes to steer text generation style and content.

  4. Ranking sentences from product description & bullets for better search

    cs.IR 2019-07 unverdicted novelty 4.0

    Two RL-based extractive summarization models rank sentences from product fields by leveraging titles and click-through logs to improve search relevance.

  5. Saliency Maps Generation for Automatic Text Summarization

    cs.LG 2019-07 unverdicted novelty 4.0

    LRP saliency maps on a seq2seq summarization model sometimes reflect actual input feature usage and sometimes do not, requiring quantitative counterfactual validation.