Pith: machine review for the scientific record.

arxiv: 1611.08562 · v2 · submitted 2016-11-25 · 💻 cs.CL

Recognition: unknown

A Simple, Fast Diverse Decoding Algorithm for Neural Generation

keywords: decoding, algorithm, diverse, generation, neural, diversity, fast, further

In this paper, we propose a simple, fast decoding algorithm that fosters diversity in neural generation. The algorithm modifies the standard beam search algorithm by adding an inter-sibling ranking penalty, favoring choosing hypotheses from diverse parents. We evaluate the proposed model on the tasks of dialogue response generation, abstractive summarization and machine translation. We find that diverse decoding helps across all tasks, especially those for which reranking is needed. We further propose a variation that is capable of automatically adjusting its diversity decoding rates for different inputs using reinforcement learning (RL). We observe a further performance boost from this RL technique. This paper includes material from the unpublished manuscript "Mutual Information and Diverse Decoding Improve Neural Machine Translation" (Li and Jurafsky, 2016).
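The inter-sibling ranking penalty described in the abstract can be illustrated with a minimal sketch of a single beam search step. This is an illustration under stated assumptions, not the authors' implementation: the function name `diverse_beam_step`, the penalty weight `gamma`, the linear form of the penalty (`gamma * rank`), and the choice to carry the unpenalized score forward are all hypothetical details chosen for the example.

```python
from typing import Dict, List, Tuple

# A hypothesis is (tokens so far, cumulative log-probability).
Hypothesis = Tuple[List[str], float]


def diverse_beam_step(
    beam: List[Hypothesis],
    next_log_probs: List[Dict[str, float]],
    beam_size: int,
    gamma: float,
) -> List[Hypothesis]:
    """One beam search step with a sibling-rank penalty (illustrative sketch).

    next_log_probs[i] maps each candidate token to its log-probability
    given the i-th hypothesis in the beam. gamma = 0 gives ordinary
    beam search.
    """
    candidates = []
    for parent_idx, (tokens, score) in enumerate(beam):
        # Rank this parent's children from most to least likely.
        children = sorted(
            next_log_probs[parent_idx].items(),
            key=lambda kv: kv[1],
            reverse=True,
        )[:beam_size]
        for rank, (token, log_p) in enumerate(children, start=1):
            true_score = score + log_p
            # Assumption: the penalty grows linearly with the sibling rank,
            # so lower-ranked children of the same parent are pushed down.
            penalized = true_score - gamma * rank
            candidates.append((penalized, true_score, tokens + [token]))

    # Select across all parents using the penalized score.
    candidates.sort(key=lambda c: c[0], reverse=True)
    # Assumption: the unpenalized score is what gets carried forward.
    return [(tokens, true_score) for _, true_score, tokens in candidates[:beam_size]]


if __name__ == "__main__":
    beam = [(["a"], -0.1), (["b"], -1.0)]
    next_log_probs = [
        {"x": -0.1, "y": -0.2, "z": -3.0},
        {"x": -0.3, "y": -2.0},
    ]
    print(diverse_beam_step(beam, next_log_probs, beam_size=2, gamma=0.0))
    print(diverse_beam_step(beam, next_log_probs, beam_size=2, gamma=2.0))
```

With `gamma=0.0` both surviving hypotheses in this toy input descend from the same parent; with `gamma=2.0` the second-ranked sibling is penalized enough that a child of the other parent takes its place, which is the "diverse parents" effect the abstract describes.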

This paper has not been read by Pith yet.


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. The Curious Case of Neural Text Degeneration

    cs.CL 2019-04 accept novelty 7.0

    Nucleus sampling draws from the smallest set of tokens whose cumulative probability exceeds threshold p, yielding more human-like diversity and coherence than beam search or full-distribution sampling.
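The nucleus sampling rule summarized above can be sketched in a few lines. This is a minimal illustration over a toy next-token distribution, not code from the cited paper: the helper names `nucleus` and `sample_nucleus` and the example vocabulary are hypothetical, and the cutoff is taken as the first point where the cumulative probability reaches or exceeds `p`.

```python
import random
from typing import Dict, Optional


def nucleus(probs: Dict[str, float], p: float) -> Dict[str, float]:
    """Return the renormalized smallest top-probability set with mass >= p."""
    ranked = sorted(probs.items(), key=lambda kv: kv[1], reverse=True)
    kept = []
    cumulative = 0.0
    for token, prob in ranked:
        kept.append((token, prob))
        cumulative += prob
        if cumulative >= p:
            break  # the nucleus is complete; drop the remaining tail
    total = sum(prob for _, prob in kept)
    return {token: prob / total for token, prob in kept}


def sample_nucleus(
    probs: Dict[str, float], p: float, rng: Optional[random.Random] = None
) -> str:
    """Draw one token from the renormalized nucleus."""
    rng = rng or random.Random()
    dist = nucleus(probs, p)
    tokens = list(dist.keys())
    weights = list(dist.values())
    return rng.choices(tokens, weights=weights, k=1)[0]


if __name__ == "__main__":
    toy = {"the": 0.5, "a": 0.3, "cat": 0.15, "zebra": 0.05}
    print(nucleus(toy, p=0.7))         # keeps only "the" and "a"
    print(sample_nucleus(toy, p=0.7))  # one of "the" or "a"
```

Truncating the low-probability tail before sampling is what separates this from full-distribution sampling, while drawing at random within the nucleus is what separates it from beam search.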