Deep Reinforcement Learning via Past-Success Directed Exploration

Xiaoming Liu; Zhixiong Xu; Lei Cao; Xiliang Chen; Kai Kang

2019 AAAI AAAI 2019

Deep Reinforcement Learning via Past-Success Directed Exploration

Abstract

Abstract The balance between exploration and exploitation has always been a core challenge in reinforcement learning. This paper proposes “past-success exploration strategy combined with Softmax action selection”(PSE-Softmax) as an adaptive control method for taking advantage of the characteristics of the online learning process of the agent to adapt exploration parameters dynamically. The proposed strategy is tested on OpenAI Gym with discrete and continuous control tasks, and the experimental results show that PSE-Softmax strategy delivers better performance than deep reinforcement learning algorithms with basic exploration strategies.

🚀 Conference Pioneer — AAAI 2019

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — softmax action selection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Xiaoming Liu , Zhixiong Xu , Lei Cao , Xiliang Chen , Kai Kang

Topics

Machine Learning > Optimization & Theory > Neural Network Optimization Reinforcement Learning > Methods > Deep RL

Keywords

deep reinforcement learning online learning adaptive control exploration strategy softmax action selection

Download PDF

Related papers

Cooperative Multimodal Approach to Depression Detection in Twitter 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 2019

Community Detection in Social Networks Considering Topic Correlations 2019

Session-Based Recommendation with Graph Neural Networks 2019

Blameworthiness in Multi-Agent Settings 2019