Learning to Plan with Logical Automata

Brandon Araki; Kiran Vodrahalli; Thomas Leech; Cristian Ioan Vasile; Mark Donahue; Daniela Rus

RSS 2019

Abstract

This paper introduces the Logic-based Value Iteration Network (LVIN) framework, which combines imitation learning and logical automata to enable agents to learn complex behaviors from demonstrations. We address two problems with learning from expert knowledge: (1) how to generalize learned policies for a task to larger classes of tasks, and (2) how to account for erroneous demonstrations. Our LVIN model solves finite gridworld environments by instantiating a recurrent, convolutional neural network as a value iteration procedure over a learned Markov Decision Process (MDP) that factors into two MDPs: a small finite state automaton (FSA) corresponding to logical rules, and a larger MDP corresponding to motions in the environment. The parameters of LVIN (value function, reward map, FSA transitions, large MDP transitions) are approximately learned from expert trajectories. Since the model represents the learned rules as an FSA, the model is interpretable; since the FSA is integrated into planning, the behavior of the agent can be manipulated by modifying the FSA transitions. We demonstrate these abilities in several domains of interest, including a lunchbox-packing manipulation task and a driving domain.
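The factored planning idea in the abstract can be illustrated with a minimal sketch. It runs plain tabular value iteration over the product of a small FSA and a gridworld. It is not the LVIN architecture: nothing here is learned, there is no recurrent convolutional network, and the grid, the FSA, the reward of 1 for reaching the goal FSA state, the discount factor, and all function names are illustrative assumptions.

```python
# Toy sketch only: the grid, FSA, rewards, and all names are illustrative assumptions.
GRID = ["a..",
        "...",
        "..b"]
H, W = len(GRID), len(GRID[0])
MOVES = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
N_FSA, GOAL, GAMMA = 3, 2, 0.9

# Rule "visit a, then b": FSA state 0 --a--> 1 --b--> 2 (goal).
VISIT_A_THEN_B = {(0, "a"): 1, (1, "b"): 2}
# Modified rule "visit b": FSA state 0 --b--> 2 (goal).
VISIT_B_ONLY = {(0, "b"): 2}


def step(fsa, f, r, c, move):
    """Deterministic transition of the product state (FSA state, grid cell)."""
    nr = min(max(r + move[0], 0), H - 1)  # a move into a wall leaves the agent in place
    nc = min(max(c + move[1], 0), W - 1)
    nf = fsa.get((f, GRID[nr][nc]), f)    # FSA advances on the symbol of the new cell
    return nf, nr, nc


def q_value(fsa, V, f, r, c, move):
    """One-step lookahead value of taking `move` from product state (f, r, c)."""
    nf, nr, nc = step(fsa, f, r, c, move)
    reward = 1.0 if nf == GOAL else 0.0   # reward only for entering the goal FSA state
    return reward + GAMMA * V[nf][nr][nc]


def value_iteration(fsa, n_iters=50):
    """Tabular value iteration; V is indexed as V[fsa_state][row][col]."""
    V = [[[0.0] * W for _ in range(H)] for _ in range(N_FSA)]
    for _ in range(n_iters):
        new_V = [[[0.0] * W for _ in range(H)] for _ in range(N_FSA)]
        for f in range(N_FSA):
            if f == GOAL:
                continue  # the goal FSA state is terminal, so its value stays 0
            for r in range(H):
                for c in range(W):
                    new_V[f][r][c] = max(q_value(fsa, V, f, r, c, m) for m in MOVES)
        V = new_V
    return V


def greedy_rollout(fsa, V, start, max_steps=20):
    """Follow the greedy policy from `start` in FSA state 0; return (final FSA state, path)."""
    f, (r, c) = 0, start
    path = [(r, c)]
    while f != GOAL and len(path) <= max_steps:
        best = max(MOVES, key=lambda m: q_value(fsa, V, f, r, c, m))
        f, r, c = step(fsa, f, r, c, best)
        path.append((r, c))
    return f, path
```

Calling `greedy_rollout(VISIT_A_THEN_B, value_iteration(VISIT_A_THEN_B), (1, 1))` yields a path that passes through the `a` cell before ending on the `b` cell. Swapping in `VISIT_B_ONLY` changes the behavior without touching the gridworld dynamics, which is the kind of manipulation through FSA transitions that the abstract describes.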

Topics

Artificial Intelligence > Core AI > Interpretability
Artificial Intelligence > Core AI > Planning
Artificial Intelligence > Core AI > Reasoning
Robotics > Capabilities > Manipulation
Robotics > Capabilities > Motion Planning
Machine Learning > Learning Types > Imitation Learning

Keywords

motion planning, imitation learning, Markov decision process, value iteration, recurrent neural network, finite state automaton, logical automaton, interpretable planning, gridworld environment, logical automata
