pith. sign in

arxiv: 1904.08455 · v3 · pith:C6HF3TWEnew · submitted 2019-04-17 · 💻 cs.CL

Headline Generation: Learning from Decomposable Document Titles

classification 💻 cs.CL
keywords titlesheadlinesarticlesdecomposabledocumentdocument-titlemodelnews
0
0 comments X
read the original abstract

We propose a novel method for generating titles for unstructured text documents. We reframe the problem as a sequential question-answering task. A deep neural network is trained on document-title pairs with decomposable titles, meaning that the vocabulary of the title is a subset of the vocabulary of the document. To train the model we use a corpus of millions of publicly available document-title pairs: news articles and headlines. We present the results of a randomized double-blind trial in which subjects were unaware of which titles were human or machine-generated. When trained on approximately 1.5 million news articles, the model generates headlines that humans judge to be as good or better than the original human-written headlines in the majority of cases.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.