Comparison of Diverse Decoding Methods from Conditional Language Models

Daphne Ippolito; Reno Kriz; João Sedoc; Maria Kustikova; Chris Callison-Burch

2019 ACL ACL 2019

Comparison of Diverse Decoding Methods from Conditional Language Models

Abstract

AbstractWhile conditional language models have greatly improved in their ability to output high quality natural language, many NLP applications benefit from being able to generate a diverse set of candidate sequences. Diverse decoding strategies aim to, within a given-sized candidate list, cover as much of the space of high-quality outputs as possible, leading to improvements for tasks that rerank and combine candidate outputs. Standard decoding methods, such as beam search, optimize for generating high likelihood sequences rather than diverse ones, though recent work has focused on increasing diversity in these methods. In this work, we perform an extensive survey of decoding-time strategies for generating diverse outputs from a conditional language model. In addition, we present a novel method where we over-sample candidates, then use clustering to remove similar sequences, thus achieving high diversity without sacrificing quality.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — diverse decoding

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Daphne Ippolito , Reno Kriz , João Sedoc , Maria Kustikova , Chris Callison-Burch

Topics

Machine Learning > Core Methods > Clustering Machine Learning > Core Methods > Representation Learning Natural Language Processing > Generation > Text Generation Deep Learning > Learning Types > Generative Models Deep Learning > Models > Language Models

Keywords

text generation sequence clustering beam search conditional language model candidate generation diverse decoding

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019