Policy Search for Multi-Robot Coordination under Uncertainty

Christopher Amato; George Konidaris; Ariel Anders; Gabriel Cruz; Jonathan How; Leslie Kaelbling

2015 RSS RSS 2015

Policy Search for Multi-Robot Coordination under Uncertainty

Abstract

We introduce a principled method for multi-robot coordination based on a generic model (termed a MacDec-POMDP) of multi-robot cooperative planning in the presence of stochasticity, uncertain sensing and communication limitations. We present a new MacDec-POMDP planning algorithm that searches over policies represented as finite-state controllers, rather than the existing policy tree representation. Finite-state controllers can be much more concise than trees, are much easier to interpret, and can operate over an infinite horizon. The resulting policy search algorithm requires a substantially simpler simulator that models only the outcomes of executing a given set of motor controllers, not the details of the executions themselves and can to solve significantly larger problems than existing MacDec-POMDP planners. We demonstrate significantly improved performance over previous methods and application to a cooperative multi-robot bartending task, showing that our method can be used for actual multi-robot systems.

📈 Trend Setter — Multi-Agent Systems

🧭 Keyword Pioneer — finite-state controller

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Reinforcement Learning, Robotics

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Reinforcement Learning and Robotics

🐣 Hot Topic Early Bird — partially observable markov decision process

Authors

Christopher Amato , George Konidaris , Ariel Anders , Gabriel Cruz , Jonathan How , Leslie Kaelbling

Topics

Artificial Intelligence > Core AI > Multi-Agent Systems Reinforcement Learning > Methods > Policy Learning Reinforcement Learning > Methods > Multi-Agent Systems Reinforcement Learning > Applications > Robotics Robotics > Capabilities > Manipulation Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Multi-Agent Systems Artificial Intelligence > Core AI > Robotics

Keywords

policy search partially observable markov decision process finite-state controller multi-robot coordination

Download PDF

Related papers

Guidance and Navigation for UAV Airborne Docking 2015

IMU Preintegration on Manifold for Efficient Visual-Inertial Maximum-a-Posteriori Estimation 2015

Direct Loss Minimization Inverse Optimal Control 2015

A New Perspective and Extension of the Gaussian Filter 2015

Get Out of My Lab: Large-scale, Real-Time Visual-Inertial Localization 2015