Language Models are Few-Shot Butlers

Vincent Micheli; Francois Fleuret

2021 EMNLP EMNLP 2021

Language Models are Few-Shot Butlers

Abstract

AbstractPretrained language models demonstrate strong performance in most NLP tasks when fine-tuned on small task-specific datasets. Hence, these autoregressive models constitute ideal agents to operate in text-based environments where language understanding and generative capabilities are essential. Nonetheless, collecting expert demonstrations in such environments is a time-consuming endeavour. We introduce a two-stage procedure to learn from a small set of demonstrations and further improve by interacting with an environment. We show that language models fine-tuned with only 1.2% of the expert demonstrations and a simple reinforcement learning algorithm achieve a 51% absolute improvement in success rate over existing methods in the ALFWorld environment.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing and Reinforcement Learning

🐣 Hot Topic Early Bird — agent system

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Vincent Micheli , Francois Fleuret

Topics

Artificial Intelligence > Core AI > Agent Systems Artificial Intelligence > Learning Paradigms > Few-Shot Learning Natural Language Processing > Generation > Language Modeling Reinforcement Learning > Methods > Deep RL Machine Learning > Learning Paradigms > Few-Shot Learning Natural Language Processing > Applications > Dialogue Systems Deep Learning > Models > Large Language Models Deep Learning > Learning Types > Reinforcement Learning Deep Learning > Learning Types > Few-Shot Learning

Keywords

reinforcement learning few-shot learning embodied ai language model pretrained language model agent system text-based environment embodied artificial intelligence

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021