Attention-Privileged Reinforcement Learning

Sasha Salter; Dushyant Rao; Markus Wulfmeier; Raia Hadsell; Ingmar Posner

2020 CORL CoRL 2020

Attention-Privileged Reinforcement Learning

Abstract

Image-based Reinforcement Learning is known to suffer from poor sample efficiency and generalisation to unseen visuals such as distractors (task-independent aspects of the observation space). Visual domain randomisation encourages transfer by training over visual factors of variation that may be encountered in the target domain. This increases learning complexity, can negatively impact learning rate and performance, and requires knowledge of potential variations during deployment. In this paper, we introduce Attention-Privileged Reinforcement Learning (APRiL) which uses a self-supervised attention mechanism to significantly alleviate these drawbacks: by focusing on task-relevant aspects of the observations, attention provides robustness to distractors as well as significantly increased learning efficiency. APRiL trains two attention-augmented actor-critic agents: one purely based on image observations, available across training and transfer domains; and one with access to privileged information (such as environment states) available only during training. Experience is shared between both agents and their attention mechanisms are aligned. The image-based policy can then be deployed without access to privileged information. We experimentally demonstrate accelerated and more robust learning on a diverse set of domains, leading to improved final performance for environments both within and outside the training distribution.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Reinforcement Learning

🧭 Keyword Pioneer — visual domain randomization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sasha Salter , Dushyant Rao , Markus Wulfmeier , Raia Hadsell , Ingmar Posner

Topics

Machine Learning > Learning Types > Self-Supervised Learning Machine Learning > Application Areas > Domain Generalization Reinforcement Learning > Methods > Deep RL Artificial Intelligence > Core AI > Robotics Deep Learning > Techniques > Self-Supervised Learning Deep Learning > Learning Types > Reinforcement Learning

Keywords

deep reinforcement learning reinforcement learning sample efficiency domain generalization attention mechanism domain randomization privileged information visual domain randomization visual domain

Download PDF

Related papers

Augmenting GAIL with BC for sample efficient imitation learning 2020

Neuro-Symbolic Program Search for Autonomous Driving Decision Module Design 2020

LiRaNet: End-to-End Trajectory Prediction using Spatio-Temporal Radar Fusion 2020

DROGON: A Trajectory Prediction Model based on Intention-Conditioned Behavior Reasoning 2020

CAMPs: Learning Context-Specific Abstractions for Efficient Planning in Factored MDPs 2020