A Simple, Fast Diverse Decoding Algorithm for Neural Generation
In this paper, we propose a simple, fast decoding algorithm that fosters diversity in neural generation. The algorithm modifies the standard beam search algorithm by adding an inter-sibling ranking penalty, favoring choosing hypotheses from diverse parents. We evaluate the proposed model on the tasks of dialogue response generation, abstractive summarization and machine translation. We find that diverse decoding helps across all tasks, especially those for which reranking is needed. We further propose a variation that is capable of automatically adjusting its diversity decoding rates for different inputs using reinforcement learning (RL). We observe a further performance boost from this RL technique. This paper includes material from the unpublished script "Mutual Information and Diverse Decoding Improve Neural Machine Translation" (Li and Jurafsky, 2016).
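As a rough illustration of the inter-sibling ranking penalty described in the abstract, one beam search step might look like the sketch below. The function name `diverse_beam_step`, the `expand` callback, and the penalty weight `gamma` are hypothetical names chosen for this example. The sketch also assumes that the penalty is a linear `gamma * rank` term used only for selecting hypotheses, with the unpenalized score carried forward; the paper itself should be consulted for the exact formulation.

```python
def diverse_beam_step(beams, expand, beam_size, gamma):
    """One step of beam search with an inter-sibling ranking penalty (sketch).

    beams:     list of (tokens, score) pairs, score = cumulative log-probability
    expand:    hypothetical callback, tokens -> {next_token: log_probability}
    beam_size: number of hypotheses kept after the step
    gamma:     assumed penalty weight; gamma = 0 gives standard beam search
    """
    candidates = []
    for tokens, score in beams:
        # Rank the children of this parent (the "siblings") by log-probability.
        siblings = sorted(expand(tokens).items(),
                          key=lambda kv: kv[1], reverse=True)[:beam_size]
        for rank, (token, logp) in enumerate(siblings, start=1):
            new_score = score + logp
            # Lower-ranked siblings are penalized more, so the best children
            # of other parents become competitive with them.
            penalized = new_score - gamma * rank
            candidates.append((penalized, new_score, tokens + [token]))

    # Select across all parents by penalized score; keep the raw score.
    candidates.sort(key=lambda c: c[0], reverse=True)
    return [(tokens, score) for _, score, tokens in candidates[:beam_size]]
```

With `gamma = 0` the two best continuations of a single strong parent can fill the whole beam; a positive `gamma` pushes the selection toward children of different parents, which is the effect the abstract describes.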
Forward citations
Cited by 1 Pith paper
- The Curious Case of Neural Text Degeneration
Nucleus sampling draws from the smallest set of tokens whose cumulative probability exceeds threshold p, yielding more human-like diversity and coherence than beam search or full-distribution sampling.
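The nucleus construction summarized above can be sketched in a few lines. The function names `nucleus` and `sample_nucleus` and the dictionary representation of the next-token distribution are assumptions made for this example, and the cutoff is implemented as "cumulative probability reaches or exceeds p".

```python
import random


def nucleus(probs, p):
    """Return the smallest top-ranked set of (token, prob) pairs whose
    cumulative probability reaches the threshold p (sketch)."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept, total = [], 0.0
    for token, prob in ranked:
        kept.append((token, prob))
        total += prob
        if total >= p:
            break
    return kept


def sample_nucleus(probs, p, rng=random):
    """Draw one token from the nucleus, weighting by the original
    probabilities (random.choices renormalizes the weights)."""
    kept = nucleus(probs, p)
    tokens = [token for token, _ in kept]
    weights = [prob for _, prob in kept]
    return rng.choices(tokens, weights=weights, k=1)[0]
```

For a toy distribution such as `{"a": 0.5, "b": 0.3, "c": 0.15, "d": 0.05}` and `p = 0.9`, the nucleus keeps `a`, `b`, and `c`, so the low-probability tail token `d` can never be sampled.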