A Retrieve-and-Edit Framework for Predicting Structured Outputs

Tatsunori B Hashimoto; Kelvin Guu; Yonatan Oren; Percy Liang

2018 NIPS NeurIPS 2018

A Retrieve-and-Edit Framework for Predicting Structured Outputs

Abstract

For the task of generating complex outputs such as source code, editing existing outputs can be easier than generating complex outputs from scratch. With this motivation, we propose an approach that first retrieves a training example based on the input (e.g., natural language description) and then edits it to the desired output (e.g., code). Our contribution is a computationally efficient method for learning a retrieval model that embeds the input in a task-dependent way without relying on a hand-crafted metric or incurring the expense of jointly training the retriever with the editor. Our retrieve-and-edit framework can be applied on top of any base model. We show that on a new autocomplete task for GitHub Python code and the Hearthstone cards benchmark, retrieve-and-edit significantly boosts the performance of a vanilla sequence-to-sequence model on both tasks.

🌱 Topic Pioneer — Retrieval-Augmented Generation

🌉 Interdisciplinary Bridge — Computer Science and Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Retrieval-Augmented Generation

🧭 Keyword Pioneer — retrieval augmented generation

🐣 Hot Topic Early Bird — retrieval augmented generation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Tatsunori B Hashimoto , Kelvin Guu , Yonatan Oren , Percy Liang

Topics

Machine Learning > Core Methods > Embedding Learning Natural Language Processing > Generation > Text Generation Computer Science > Applications > Information Retrieval Machine Learning > Learning Types > Retrieval-Augmented Generation Deep Learning > Learning Types > Representation Learning

Keywords

retrieval augmented generation information retrieval code generation structured output sequence-to-sequence model retrieval model

Download PDF

Related papers

Maximum Causal Tsallis Entropy Imitation Learning 2018

Recurrent World Models Facilitate Policy Evolution 2018

Bandit Learning in Concave N-Person Games 2018

Algorithmic Assurance: An Active Approach to Algorithmic Testing using Bayesian Optimisation 2018

PAC-Bayes bounds for stable algorithms with instance-dependent priors 2018