Learning A Risk-Aware Trajectory Planner From Demonstrations Using Logic Monitor

Xiao Li; Jonathan DeCastro; Sertac Karaman; Daniela Rus; Cristian-Ioan Vasile

CoRL 2021

Abstract

Risk awareness is an important factor to consider when deploying policies on robots in the real world, but defining the right set of risk metrics can be difficult. In this work, we use a differentiable logic monitor that keeps track of the environmental agents’ behaviors and provides a risk metric that the controlled agent can incorporate during planning. We introduce LogicRiskNet, a learning structure that can be constructed from temporal logic formulas describing the rules that govern a safe agent’s behavior. The network’s parameters can be learned from demonstration data. By using temporal logic, the network provides an interpretable architecture that can explain which risk metrics are important to the human. We integrate LogicRiskNet into an inverse optimal control (IOC) framework and show that we can learn to generate trajectory plans that accurately mimic the expert’s risk-handling behaviors solely from demonstration data. We evaluate our method on a real-world driving dataset.
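
To make the idea of a differentiable logic monitor more concrete, the sketch below scores a single rule, "always keep more than d_safe distance from another agent", with a smooth (soft-min) robustness value and maps it to a risk score. This is an illustration only, not the authors' LogicRiskNet: the rule, the function names, the temperature parameter, and the logistic mapping from robustness to risk are all assumptions introduced here.

```python
import math

def soft_min(values, temperature=10.0):
    """Smooth lower bound on min(values): -(1/k) * log(sum(exp(-k * v)))."""
    k = temperature
    m = min(values)  # shift by the true minimum for numerical stability
    return m - math.log(sum(math.exp(-k * (v - m)) for v in values)) / k

def always_keeps_distance(distances, d_safe, temperature=10.0):
    """Smooth robustness of 'always (distance > d_safe)' over a trajectory.

    Positive when every step clears d_safe, negative when some step violates it.
    """
    margins = [d - d_safe for d in distances]
    return soft_min(margins, temperature)

def risk(distances, d_safe, temperature=10.0):
    """Hypothetical risk score in (0, 1): high when robustness is low."""
    rho = always_keeps_distance(distances, d_safe, temperature)
    return 1.0 / (1.0 + math.exp(rho))

# Distances (in meters, assumed) to another agent along two candidate plans.
safe_plan = [5.0, 4.0, 6.0]
unsafe_plan = [5.0, 1.0, 6.0]
print(risk(safe_plan, d_safe=2.0), risk(unsafe_plan, d_safe=2.0))
```

Because the soft-min is smooth in its inputs, a score like this could in principle be differentiated with respect to the planned trajectory or to parameters such as d_safe, which is the property that lets a monitor of this kind sit inside a learned planner.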

Authors

Xiao Li, Jonathan DeCastro, Cristian-Ioan Vasile, Sertac Karaman, Daniela Rus

Topics

Artificial Intelligence > Core AI > Interpretability
Artificial Intelligence > Core AI > Trajectory Prediction
Reinforcement Learning > Applications > Robotics

Keywords

imitation learning, inverse optimal control, temporal logic, trajectory planning, risk metric
