A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication

Maryam Zare; Alan Wagner; Rebecca Passonneau

2022 EMNLP EMNLP 2022

A POMDP Dialogue Policy with 3-way Grounding and Adaptive Sensing for Learning through Communication

Abstract

AbstractAgents to assist with rescue, surgery, and similar activities could collaborate better with humans if they could learn new strategic behaviors through communication. We introduce a novel POMDP dialogue policy for learning from people. The policy has 3-way grounding of language in the shared physical context, the dialogue context, and persistent knowledge. It can learn distinct but related games, and can continue learning across dialogues for complex games. A novel sensing component supports adaptation to information-sharing differences across people. The single policy performs better than oracle policies customized to specific games and information behavior.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Natural Language Processing and Reinforcement Learning and Robotics

📈 Trend Setter — Human-Robot Interaction

🧭 Keyword Pioneer — pomdp dialogue policy

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio