Learning to Actively Reduce Memory Requirements for Robot Control Tasks

Meghan Booker; Anirudha Majumdar

2021 L4DC L4DC 2021

Learning to Actively Reduce Memory Requirements for Robot Control Tasks

Abstract

Robots equipped with rich sensing modalities (e.g., RGB-D cameras) performing long-horizon tasks motivate the need for policies that are highly memory-efficient. State-of-the-art approaches for controlling robots often use memory representations that are excessively rich for the task or rely on handcrafted tricks for memory efficiency. Instead, this work provides a general approach for jointly synthesizing memory representations and policies; the resulting policies actively seek to reduce memory requirements. Specifically, we present a reinforcement learning framework that leverages an implementation of the group LASSO regularization to synthesize policies that employ low-dimensional and task-centric memory representations. We demonstrate the efficacy of our approach with simulated examples including navigation in discrete and continuous spaces as well as vision-based indoor navigation set in a photo-realistic simulator. The results on these examples indicate that our method is capable of finding policies that rely only on low-dimensional memory representations, improving generalization, and actively reducing memory requirements.

🌉 Interdisciplinary Bridge — Machine Learning and Reinforcement Learning and Robotics

🧭 Keyword Pioneer — low-dimensional memory

🐣 Hot Topic Early Bird — memory efficiency

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Meghan Booker , Anirudha Majumdar

Topics

Reinforcement Learning > Methods > Deep RL Robotics > Capabilities > Manipulation Machine Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning group lasso memory efficiency policy synthesis low-dimensional memory

Download PDF

Related papers

Abstraction-based branch and bound approach to Q-learning for hybrid optimal control 2021

Data-driven design of switching reference governors for brake-by-wire applications 2021

Learning local modules in dynamic networks 2021

Certainty Equivalent Perception-Based Control 2021

Sample Complexity of Linear Quadratic Gaussian (LQG) Control for Output Feedback Systems 2021