Actor-Double-Critic: Incorporating Model-Based Critic for Task-Oriented Dialogue Systems

Yen-chen Wu; Bo-Hsiang Tseng; Milica Gasic

EMNLP 2020

Abstract

To improve the sample efficiency of deep reinforcement learning (DRL), we implemented the imagination-augmented agent (I2A) in spoken dialogue systems (SDS). Although I2A achieves a higher success rate than the baselines by feeding predicted futures into the policy network, its complicated architecture introduces unwanted instability. In this work, we propose actor-double-critic (ADC) to improve the stability and overall performance of I2A. ADC simplifies the I2A architecture, removing excess parameters and hyper-parameters. More importantly, a separate model-based critic shares parameters between actions and makes back-propagation explicit. In our experiments on the Cambridge Restaurant Booking task, ADC improves success rates considerably and is robust to imperfect environment models. ADC is also more stable and more sample-efficient: it significantly reduces the standard deviation of success rates relative to the baseline and reaches an 80% success rate with half the training data.
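The abstract does not spell out how the model-based critic is built. One plausible reading of "shares parameters between actions" is that a single value function scores the imagined next state for every candidate action, rather than each action having its own rollout encoder. The sketch below illustrates that reading only; it is not the authors' implementation. The class names, the linear "networks", the dimensions, and the discount factor are all hypothetical.

```python
import random

GAMMA = 0.99  # discount factor (assumed value, not from the paper)


def make_linear(n_in, n_out, rng):
    """Random weight matrix standing in for a trained network (hypothetical)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]


def apply_linear(weights, x):
    """Matrix-vector product: one output per weight row."""
    return [sum(w * xi for w, xi in zip(row, x)) for row in weights]


class ToyEnvModel:
    """Hypothetical learned environment model: (state, action) -> (next_state, reward)."""

    def __init__(self, state_dim, n_actions, rng):
        self.n_actions = n_actions
        in_dim = state_dim + n_actions
        self.w_state = make_linear(in_dim, state_dim, rng)
        self.w_reward = make_linear(in_dim, 1, rng)

    def predict(self, state, action):
        # Encode the action as a one-hot vector and append it to the state.
        one_hot = [1.0 if i == action else 0.0 for i in range(self.n_actions)]
        x = list(state) + one_hot
        return apply_linear(self.w_state, x), apply_linear(self.w_reward, x)[0]


class ModelBasedCritic:
    """Scores every action with ONE shared value function over imagined next states."""

    def __init__(self, env_model, state_dim, rng):
        self.env_model = env_model
        self.w_value = make_linear(state_dim, 1, rng)  # shared across all actions

    def value(self, state):
        return apply_linear(self.w_value, state)[0]

    def q_values(self, state):
        # One-step imagination: Q(s, a) = predicted reward + GAMMA * V(predicted next state).
        qs = []
        for action in range(self.env_model.n_actions):
            next_state, reward = self.env_model.predict(state, action)
            qs.append(reward + GAMMA * self.value(next_state))
        return qs


rng = random.Random(0)
env_model = ToyEnvModel(state_dim=4, n_actions=3, rng=rng)
critic = ModelBasedCritic(env_model, state_dim=4, rng=rng)
q_mb = critic.q_values([0.5, -0.2, 0.1, 0.9])  # one Q estimate per action
```

Because the same `w_value` weights evaluate every imagined next state, the gradient from each action's estimate flows through one shared set of parameters, which is one way the "explicit back-propagation" claim could be realised. How ADC combines this critic with its second critic and the actor is not stated in the abstract and is left out here.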

Topics

Natural Language Processing > Generation > Dialogue Systems; Reinforcement Learning > Methods > Deep RL; Reinforcement Learning > Methods > Policy Learning; Natural Language Processing > Applications > Dialogue Systems; Deep Learning > Learning Types > Reinforcement Learning

Keywords

deep reinforcement learning; sample efficiency; task-oriented dialogue; model-based reinforcement learning; dialogue system; task-oriented dialogue system; model-based critic; imagination augmented agent
