Adversarial Actor-Critic Method for Task and Motion Planning Problems Using Planning Experience

Beomjoon Kim; Leslie Pack Kaelbling; Tomás Lozano-Pérez

AAAI 2019

Abstract

We propose an actor-critic algorithm that uses past planning experience to improve the efficiency of solving robot task-and-motion planning (TAMP) problems. TAMP planners search for goal-achieving sequences of high-level operator instances specified by both discrete and continuous parameters. Our algorithm learns a policy for selecting the continuous parameters during search, using a small training set generated from the search trees of previously solved instances. We also introduce a novel fixed-length vector representation for world states with varying numbers of objects with different shapes, based on a set of key robot configurations. We demonstrate experimentally that our method learns more efficiently from less data than standard reinforcement-learning approaches and that using a learned policy to guide a planner improves planning efficiency.
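To make the idea of a fixed-length state vector concrete, here is a minimal sketch under simplifying assumptions that are not taken from the abstract: the robot is a disc moving in the plane, objects are axis-aligned boxes, and each entry of the vector is a collision flag for one key configuration. The names `Box`, `in_collision`, and `encode_state` are hypothetical, and the paper's actual encoding may differ. The point is only that the vector length depends on the number of key configurations, not on how many objects are in the scene.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple

# Assumption: a robot configuration is a 2D base position (x, y).
Config = Tuple[float, float]


@dataclass(frozen=True)
class Box:
    """Axis-aligned box standing in for an object (illustrative only)."""
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def in_collision(config: Config, box: Box, robot_radius: float) -> bool:
    """Return True if a disc-shaped robot centered at `config` overlaps `box`."""
    x, y = config
    # Closest point on the box to the robot center.
    cx = min(max(x, box.x_min), box.x_max)
    cy = min(max(y, box.y_min), box.y_max)
    return (x - cx) ** 2 + (y - cy) ** 2 <= robot_radius ** 2


def encode_state(
    key_configs: Sequence[Config],
    obstacles: Sequence[Box],
    robot_radius: float = 0.5,
) -> List[int]:
    """One collision flag per key configuration; length is len(key_configs)."""
    return [
        int(any(in_collision(q, box, robot_radius) for box in obstacles))
        for q in key_configs
    ]


key_configs = [(0.0, 0.0), (2.0, 0.0), (4.0, 0.0)]
one_obstacle = [Box(1.8, -0.2, 2.2, 0.2)]
three_obstacles = one_obstacle + [Box(3.9, 0.3, 4.5, 1.0), Box(10.0, 10.0, 11.0, 11.0)]

vec_small = encode_state(key_configs, one_obstacle)
vec_large = encode_state(key_configs, three_obstacles)
print(vec_small, vec_large)
```

Both scenes map to vectors of length three even though they contain different numbers of objects, so a policy network with a fixed input size could consume either one.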

Topics

Artificial Intelligence > Core AI > Planning

Machine Learning > Learning Types > Adversarial Learning

Reinforcement Learning > Methods > Deep RL

Reinforcement Learning > Applications > Robotics

Robotics > Capabilities > Motion Planning

Keywords

reinforcement learning, adversarial learning, policy learning, robot planning, actor-critic method, task and motion planning, search guidance, search tree, robotics planning, planning experience
