Contrastive Inductive Bias Controlling Networks for
 Reinforcement Learning

DONGXU LI; Shaochen Wang; Kang Chen; Bin Li

2022 ACML ACML 2022

Contrastive Inductive Bias Controlling Networks for Reinforcement Learning

Abstract

Effective learning in an visual-based environment is essential for reinforcement learning (RL) agent, while it has been empirically observed that learning from high dimensional observations such as raw pixels is sample-inefficient. For common practice, RL algorithms for image input often use encoders composed of CNNs to extract useful features from high dimensional observations. Recent studies have shown that CNNs have strong inductive bias towards image styles rather than content (i.e. agent shapes), while content is the information that RL algorithms should focus on. Inspired by this, we suggest reducing the intrinsic style bias of CNNs by proposing Contrastive Inductive Bias Controlling Networks for RL. It can help RL algorithms effectively focus on truly noteworthy information like agents’ own characteristics. Our approach incorporates two transfer networks and feature encoder with contrastive learning methods, guiding RL algorithms to learn more efficiently with sampling. Extensive experiments show that the extended framework greatly enhances the performance of existing model-free methods (i.e. SAC), enabling it to reach state-of-the-art performance on the DeepMind control suite benchmark.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

DONGXU LI , Shaochen Wang , Kang Chen , Bin Li

Topics

Machine Learning > Learning Types > Contrastive Learning Reinforcement Learning > Methods > Deep RL

Keywords

contrastive learning reinforcement learning sample efficiency convolutional neural network inductive bia

Download PDF

Related papers

When to Classify Events in Open Times Series? 2022

Noisy Riemannian Gradient Descent for Eigenvalue Computation with Application to Inexact Stochastic Recursive Gradient Algorithm 2022

A Self-improving Skin Lesions Diagnosis Framework Via Pseudo-labeling and Self-distillation 2022

Towards Data-Free Domain Generalization 2022

SNAIL: Semi-Separated Uncertainty Adversarial Learning for Universal Domain Adaptation 2022