Deep Residual Reinforcement Learning (Extended Abstract)

Shangtong Zhang; Wendelin Boehmer; Shimon Whiteson

IJCAI 2021

Abstract

We revisit residual algorithms in both model-free and model-based reinforcement learning settings. We propose the bidirectional target network technique to stabilize residual algorithms, yielding a residual version of DDPG that significantly outperforms vanilla DDPG in commonly used benchmarks. Moreover, we find the residual algorithm an effective approach to the distribution mismatch problem in model-based planning. Compared with the existing TD(k) method, our residual-based method makes weaker assumptions about the model and yields a greater performance boost.
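As a rough illustration of what a residual algorithm is, the sketch below applies one update of the classic residual algorithm to a linear value function. It is not the deep, DDPG-based method of the paper and it does not include the bidirectional target network. The function name `residual_update`, the mixing weight `eta`, and the feature vectors are assumptions made for this example only. With `eta = 0` the update reduces to ordinary semi-gradient TD(0); with `eta = 1` it becomes the full residual-gradient update.

```python
def residual_update(w, phi_s, phi_next, reward, gamma=0.99, alpha=0.1, eta=0.5):
    """One residual-algorithm step for a linear value function v(s) = w . phi(s).

    Hypothetical helper for illustration only (not from the paper).
    eta mixes the two update directions:
      eta = 0 -> semi-gradient TD(0)
      eta = 1 -> residual gradient (true gradient of the squared TD error)
    """
    # Current and next-state value estimates under the linear model.
    v_s = sum(wi * xi for wi, xi in zip(w, phi_s))
    v_next = sum(wi * xi for wi, xi in zip(w, phi_next))

    # Temporal-difference error for the transition (s, reward, s').
    delta = reward + gamma * v_next - v_s

    # Mixed update direction: phi(s) - eta * gamma * phi(s').
    return [
        wi + alpha * delta * (xs - eta * gamma * xn)
        for wi, xs, xn in zip(w, phi_s, phi_next)
    ]


# Usage: a single transition with one-hot features for two states.
w0 = [0.0, 0.0]
w_td = residual_update(w0, [1.0, 0.0], [0.0, 1.0], reward=1.0, gamma=0.5, eta=0.0)
w_rg = residual_update(w0, [1.0, 0.0], [0.0, 1.0], reward=1.0, gamma=0.5, eta=1.0)
```

In this toy transition the semi-gradient update changes only the weight of the current state, while the residual-gradient update also moves the weight of the next state in the opposite direction.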

Topics

Reinforcement Learning > Methods > Deep RL
Reinforcement Learning > Applications > Robotics
Deep Learning > Learning Types > Reinforcement Learning

Keywords

deep reinforcement learning, model-based planning, target network, residual learning, distribution mismatch, residual algorithm
