Error-Aware Policy Learning: Zero-Shot Generalization in Partially Observable Dynamic Environments

Visak C V Kumar; Sehoon Ha; C. Karen Liu

RSS 2021

Abstract

Simulation provides a safe and efficient way to generate useful data for learning complex robotic tasks. However, matching simulation and real-world dynamics can be quite challenging, especially for systems that have a large number of unobserved or unmeasurable parameters, which may lie in the robot dynamics itself or in the environment with which the robot interacts. We introduce a novel approach to tackle such a sim-to-real problem by developing policies capable of adapting to new environments in a zero-shot manner. Key to our approach is an error-aware policy (EAP) that is explicitly made aware of the effect of unobservable factors during training. An EAP takes as input the predicted future state error in the target environment, which is provided by an error-prediction function trained simultaneously with the EAP. We validate our approach on an assistive walking device trained to help the human user recover from external pushes. We show that a trained EAP for a hip-torque assistive device can be transferred to different human agents with unseen biomechanical characteristics. In addition, we show that our method can be applied to other standard RL control tasks.
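
To make the data flow described in the abstract concrete, the following is a minimal sketch, not the authors' implementation. It only illustrates the idea that an error-prediction function produces an estimate of the future state error and that the policy receives this estimate alongside its observation. The dimensions, the random linear weights, and the names `predict_error` and `error_aware_policy` are all assumptions introduced for illustration; the abstract does not specify the network architectures, the exact inputs of the error-prediction function, or the training procedure.

```python
import numpy as np

# Hypothetical sizes chosen for illustration, not taken from the paper.
OBS_DIM, ERR_DIM, ACT_DIM = 4, 4, 2

rng = np.random.default_rng(0)

# Random linear weights standing in for trained networks (assumption).
W_err = rng.normal(scale=0.1, size=(ERR_DIM, OBS_DIM))
W_pi = rng.normal(scale=0.1, size=(ACT_DIM, OBS_DIM + ERR_DIM))


def predict_error(obs):
    """Stand-in error-prediction function: estimates the future state error."""
    return W_err @ obs


def error_aware_policy(obs, predicted_error):
    """Stand-in EAP: acts on the observation augmented with the predicted error."""
    policy_input = np.concatenate([obs, predicted_error])
    # tanh keeps each action component within [-1, 1].
    return np.tanh(W_pi @ policy_input)


obs = rng.normal(size=OBS_DIM)
predicted_error = predict_error(obs)
action = error_aware_policy(obs, predicted_error)
print(action.shape)
```

In this sketch the predicted error is simply concatenated to the observation, which is one plausible way to condition a policy on it; the paper may use a different conditioning scheme.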

Authors

Visak C V Kumar, Sehoon Ha, C. Karen Liu

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning
Machine Learning > Learning Types > Zero-Shot Learning
Reinforcement Learning > Applications > Robotics

Keywords

zero-shot learning, sim-to-real transfer, transfer learning, partially observable environment, policy adaptation, error prediction
