On-line Dialogue Policy Learning with Companion Teaching

Lu Chen; Runzhe Yang; Cheng Chang; Zihao Ye; Xiang Zhou; Kai Yu

EACL 2017

Abstract

On-line dialogue policy learning is key to building evolvable conversational agents in real-world scenarios. A poor initial policy can easily lead to a bad user experience and consequently fail to attract enough users for policy training. A novel framework, companion teaching, is proposed to include a human teacher in the dialogue policy training loop to address this cold start problem. Here, the dialogue policy is trained using not only the user's reward but also the teacher's example actions and an estimated immediate reward at the turn level. Simulation experiments showed that, with a small number of human teaching dialogues, the proposed approach can effectively improve user experience at the beginning and smoothly lead to good performance with more user interaction data.
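The three training signals named in the abstract (the user's reward, the teacher's example action, and an extra turn-level reward) can be illustrated with a toy sketch. The abstract does not specify the learning algorithm, so this example substitutes plain tabular Q-learning; the class `CompanionTaughtAgent`, its parameters, and the state and action names are all hypothetical and are not taken from the paper.

```python
import random
from collections import defaultdict


class CompanionTaughtAgent:
    """Toy tabular Q-learning agent with an optional teacher in the loop.

    Illustrative assumption only: this is not the authors' implementation.
    """

    def __init__(self, actions, alpha=0.5, gamma=0.9, epsilon=0.1, seed=0):
        self.actions = list(actions)
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon
        self.q = defaultdict(float)  # (state, action) -> estimated value
        self.rng = random.Random(seed)

    def greedy(self, state):
        # Action with the highest current value estimate in this state.
        return max(self.actions, key=lambda a: self.q[(state, a)])

    def act(self, state, teacher_action=None):
        # Teaching by example: a teacher-supplied action replaces the agent's choice.
        if teacher_action is not None:
            return teacher_action
        # Otherwise fall back to epsilon-greedy exploration.
        if self.rng.random() < self.epsilon:
            return self.rng.choice(self.actions)
        return self.greedy(state)

    def update(self, state, action, user_reward, next_state, done, extra_reward=0.0):
        # The turn-level extra reward (assumed here to be simply added)
        # supplements the user's reward before the usual Q-learning update.
        reward = user_reward + extra_reward
        if done:
            target = reward
        else:
            best_next = max(self.q[(next_state, a)] for a in self.actions)
            target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])


# One taught turn: the teacher demonstrates an action and the turn earns an extra reward.
agent = CompanionTaughtAgent(["request_area", "inform", "bye"])
action = agent.act("greeting", teacher_action="request_area")
agent.update("greeting", action, user_reward=0.0,
             next_state="area_known", done=False, extra_reward=1.0)
```

In this sketch a single taught turn is enough to make the demonstrated action the greedy choice in that state, which is the intuition behind using a small number of teaching dialogues to ease the cold start. How the real system estimates the turn-level reward and combines the signals is left open here.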

Topics

Artificial Intelligence > Core AI > Agent Systems
Natural Language Processing > Generation > Dialogue Systems
Reinforcement Learning > Methods > Policy Learning
Machine Learning > Learning Types > Reinforcement Learning
Natural Language Processing > Applications > Dialogue Systems

Keywords

reinforcement learning, online learning, conversational agent, reward estimation, cold start problem, dialogue policy, companion teaching, cold start
