Towards Generalization and Simplicity in Continuous Control

Aravind Rajeswaran; Kendall Lowrey; Emanuel V. Todorov; Sham M. Kakade

2017 NIPS NeurIPS 2017

Towards Generalization and Simplicity in Continuous Control

Abstract

The remarkable successes of deep learning in speech recognition and computer vision have motivated efforts to adapt similar techniques to other problem domains, including reinforcement learning (RL). Consequently, RL methods have produced rich motor behaviors on simulated robot tasks, with their success largely attributed to the use of multi-layer neural networks. This work is among the first to carefully study what might be responsible for these recent advancements. Our main result calls this emerging narrative into question by showing that much simpler architectures -- based on linear and RBF parameterizations -- achieve comparable performance to state of the art results. We not only study different policy representations with regard to performance measures at hand, but also towards robustness to external perturbations. We again find that the learned neural network policies --- under the standard training scenarios --- are no more robust than linear (or RBF) policies; in fact, all three are remarkably brittle. Finally, we then directly modify the training scenarios in order to favor more robust policies, and we again do not find a compelling case to favor multi-layer architectures. Overall, this study suggests that multi-layer architectures should not be the default choice, unless a side-by-side comparison to simpler architectures shows otherwise. More generally, we hope that these results lead to more interest in carefully studying the architectural choices, and associated trade-offs, for training generalizable and robust policies.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Reinforcement Learning

📈 Trend Setter — Domain Generalization

🧭 Keyword Pioneer — linear policy

🐣 Hot Topic Early Bird — domain generalization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Aravind Rajeswaran , Kendall Lowrey , Emanuel V. Todorov , Sham M. Kakade

Topics

Machine Learning > Optimization & Theory > Neural Network Optimization Machine Learning > Application Areas > Domain Generalization Reinforcement Learning > Applications > Robotics Deep Learning > Learning Types > Reinforcement Learning Deep Learning > Optimization & Theory > Theory

Keywords

reinforcement learning domain generalization continuous control linear policy policy architecture policy representation neural network

Download PDF

Related papers

High-Order Attention Models for Visual Question Answering 2017

Breaking the Nonsmooth Barrier: A Scalable Parallel Method for Composite Optimization 2017

Premise Selection for Theorem Proving by Deep Graph Embedding 2017

Neural Program Meta-Induction 2017

Safe and Nested Subgame Solving for Imperfect-Information Games 2017