
arxiv: 1810.02851 · v1 · pith:FL4KNAWYnew · submitted 2018-10-05 · 💻 cs.CL

Learning to Encode Text as Human-Readable Summaries using Generative Adversarial Networks

classification 💻 cs.CL
keywords: generator, input, output, text, data, human-readable, representation, abstractive

Auto-encoders compress input data into a latent-space representation and reconstruct the original data from the representation. This latent representation is not easily interpreted by humans. In this paper, we propose training an auto-encoder that encodes input text into human-readable sentences, and unpaired abstractive summarization is thereby achieved. The auto-encoder is composed of a generator and a reconstructor. The generator encodes the input text into a shorter word sequence, and the reconstructor recovers the generator input from the generator output. To make the generator output human-readable, a discriminator restricts the output of the generator to resemble human-written sentences. By taking the generator output as the summary of the input text, abstractive summarization is achieved without document-summary pairs as training data. Promising results are shown on both English and Chinese corpora.
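The data flow described in the abstract (generator, reconstructor, discriminator) can be sketched as below. This is a minimal illustration, not the authors' implementation: the abstract does not give the loss functions, so the position-wise mismatch loss, the `1 - score` adversarial term, the `adv_weight` parameter, and all function names are assumptions made here for illustration. The three components are passed in as plain callables so that any model could stand in for them.

```python
from typing import Callable, Dict, List

Tokens = List[str]


def reconstruction_loss(original: Tokens, recovered: Tokens) -> float:
    """Assumed loss: fraction of original positions the recovered text gets wrong."""
    if not original:
        return 0.0
    mismatches = sum(
        1
        for i, tok in enumerate(original)
        if i >= len(recovered) or recovered[i] != tok
    )
    return mismatches / len(original)


def generator_objective(
    text: Tokens,
    generator: Callable[[Tokens], Tokens],
    reconstructor: Callable[[Tokens], Tokens],
    discriminator: Callable[[Tokens], float],
    adv_weight: float = 1.0,  # hypothetical weighting, not from the paper
) -> Dict[str, object]:
    """Run one pass: text -> short summary -> recovered text, plus a realism score."""
    summary = generator(text)            # encode input into a shorter word sequence
    recovered = reconstructor(summary)   # recover the generator input from its output
    rec = reconstruction_loss(text, recovered)
    # The discriminator returns a score in [0, 1]: how human-written the summary looks.
    adv = 1.0 - discriminator(summary)
    return {
        "summary": summary,
        "reconstruction": rec,
        "adversarial": adv,
        "total": rec + adv_weight * adv,
    }


# Toy stand-ins (assumptions): keep the first three words, pad with unknowns,
# and use a discriminator that is always undecided.
text = "the cat sat on the mat".split()
result = generator_objective(
    text,
    generator=lambda t: t[:3],
    reconstructor=lambda s: s + ["<unk>"] * 3,
    discriminator=lambda s: 0.5,
)
```

In this sketch the generator is penalized both when the reconstructor cannot recover the input from the summary and when the discriminator judges the summary unlike human-written text, which mirrors the two pressures the abstract describes; no document-summary pairs appear anywhere in the objective.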

