2019
AAAI
AAAI 2019
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search
Abstract
Abstract In this paper, we propose an actor ensemble algorithm, named ACE, for continuous control with a deterministic policy in reinforcement learning. In ACE, we use actor ensemble (i.e., multiple actors) to search the global maxima of the critic. Besides the ensemble perspective, we also formulate ACE in the option framework by extending the option-critic architecture with deterministic intra-option policies, revealing a relationship between ensemble and options. Furthermore, we perform a look-ahead tree search with those actors and a learned value prediction model, resulting in a refined value estimation. We demonstrate a significant performance boost of ACE over DDPG and its variants in challenging physical robot simulators.
🚀
Conference Pioneer
— AAAI 2019
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Deep Learning and Reinforcement Learning
🧭
Keyword Pioneer
— deterministic policy
🐣
Hot Topic Early Bird
— tree search
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio