Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game

Philipp Sadler; Sherzod Hakimov; David Schlangen

2024 ACL ACL 2024

Learning Communication Policies for Different Follower Behaviors in a Collaborative Reference Game

Abstract

AbstractIn this work, we evaluate the adaptability of neural agents towards assumed partner behaviors in a collaborative reference game. In this game, success is achieved when a knowledgeable guide can verbally lead a follower to the selection of a specific puzzle piece among several distractors. We frame this language grounding and coordination task as a reinforcement learning problem and measure to which extent a common reinforcement training algorithm (PPO) is able to produce neural agents (the guides) that perform well with various heuristic follower behaviors that vary along the dimensions of confidence and autonomy. We experiment with a learning signal that in addition to the goal condition also respects an assumed communicative effort. Our results indicate that this novel ingredient leads to communicative strategies that are less verbose (staying silent in some of the steps) and that with respect to that the guide’s strategies indeed adapt to the partner’s level of confidence and autonomy.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing and Reinforcement Learning

🧭 Keyword Pioneer — collaborative reference game

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Philipp Sadler , Sherzod Hakimov , David Schlangen

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems Machine Learning > Learning Types > Self-Supervised Learning Reinforcement Learning > Methods > Deep RL Natural Language Processing > Applications > Dialogue Systems

Keywords

reinforcement learning language grounding neural agent proximate policy optimization multi-agent system communication policy collaborative game collaborative reference game

Download PDF

Related papers

Reinforcement Learning-Driven LLM Agent for Automated Attacks on LLMs 2024

EtymoLink: A Structured English Etymology Dataset 2024

Turkish Delights: A Dataset on Turkish Euphemisms 2024

Subjectivity Detection in English News using Large Language Models 2024

Does DetectGPT Fully Utilize Perturbation? Bridging Selective Perturbation to Fine-tuned Contrastive Learning Detector would be Better 2024