Learning to Write with Cooperative Discriminators

Antoine Bosselut; Ari Holtzman; David Golub; Jan Buys; Maxwell Forbes; Yejin Choi

arxiv: 1805.06087 · v1 · pith:RSUBHMWUnew · submitted 2018-05-16 · 💻 cs.CL

Learning to Write with Cooperative Discriminators

Ari Holtzman , Jan Buys , Maxwell Forbes , Antoine Bosselut , David Golub , Yejin Choi This is my paper

classification 💻 cs.CL

keywords modelstextgeneratedlanguagelearningoverallusedamounts

0 comments

read the original abstract

Recurrent Neural Networks (RNNs) are powerful autoregressive sequence models, but when used to generate natural language their output tends to be overly generic, repetitive, and self-contradictory. We postulate that the objective function optimized by RNN language models, which amounts to the overall perplexity of a text, is not expressive enough to capture the notion of communicative goals described by linguistic principles such as Grice's Maxims. We propose learning a mixture of multiple discriminative models that can be used to complement the RNN generator and guide the decoding process. Human evaluation demonstrates that text generated by our system is preferred over that of baselines by a large margin and significantly enhances the overall coherence, style, and information content of the generated text.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Decision Transformer: Reinforcement Learning via Sequence Modeling
cs.LG 2021-06 accept novelty 8.0

Decision Transformer casts RL as autoregressive sequence modeling conditioned on desired returns, past states and actions, matching or exceeding offline RL baselines on Atari, Gym and Key-to-Door tasks.