A Simple, Fast Diverse Decoding Algorithm for Neural Generation

Jiwei Li , Will Monroe , Dan Jurafsky

Authors on Pith no claims yet

classification 💻 cs.CL

keywords decodingalgorithmdiversegenerationneuraldiversityfastfurther

read the original abstract

In this paper, we propose a simple, fast decoding algorithm that fosters diversity in neural generation. The algorithm modifies the standard beam search algorithm by adding an inter-sibling ranking penalty, favoring choosing hypotheses from diverse parents. We evaluate the proposed model on the tasks of dialogue response generation, abstractive summarization and machine translation. We find that diverse decoding helps across all tasks, especially those for which reranking is needed. We further propose a variation that is capable of automatically adjusting its diversity decoding rates for different inputs using reinforcement learning (RL). We observe a further performance boost from this RL technique. This paper includes material from the unpublished script "Mutual Information and Diverse Decoding Improve Neural Machine Translation" (Li and Jurafsky, 2016).

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The Curious Case of Neural Text Degeneration
cs.CL 2019-04 accept novelty 7.0

Nucleus sampling draws from the smallest set of tokens whose cumulative probability exceeds threshold p, yielding more human-like diversity and coherence than beam search or full-distribution sampling.