Optimize Planning Heuristics to Rank, not to Estimate Cost-to-Goal

Leah Chrestien; Stefan Edelkamp; Antonin Komenda; Tomas Pevny

NeurIPS 2023

Abstract

In imitation learning for planning, parameters of heuristic functions are optimized against a set of solved problem instances. This work revisits the necessary and sufficient conditions of strictly optimally efficient heuristics for forward search algorithms, mainly A* and greedy best-first search, which expand only states on the returned optimal path. It then proposes a family of loss functions based on ranking tailored for a given variant of the forward search algorithm. Furthermore, from a learning theory point of view, it discusses why optimizing cost-to-goal h* is unnecessarily difficult. The experimental comparison on a diverse set of problems unequivocally supports the derived theory.
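The contrast the abstract draws between ranking and estimating cost-to-goal can be illustrated with a small sketch. This is not the paper's loss: the pairwise logistic form, the function names, and the toy numbers are all assumptions chosen for illustration. The point is only that a heuristic can order states on the solution path ahead of the others, which is what the search needs in order to expand them first, while still being a poor estimate of h*.

```python
import math

def softplus(x):
    # Numerically stable log(1 + exp(x)).
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))

def ranking_loss(f_on_path, f_off_path):
    """Mean pairwise logistic loss (an assumed, illustrative form).

    Small when every on-path state has a lower priority value than
    every off-path state, regardless of the absolute values.
    """
    pairs = [(p, q) for p in f_on_path for q in f_off_path]
    if not pairs:
        return 0.0
    return sum(softplus(p - q) for p, q in pairs) / len(pairs)

def misordered_pairs(f_on_path, f_off_path):
    # Counts pairs where an off-path state would be preferred (or tied).
    return sum(1 for p in f_on_path for q in f_off_path if p >= q)

def regression_loss(h_pred, h_star):
    # Mean squared error against the true cost-to-goal.
    return sum((a - b) ** 2 for a, b in zip(h_pred, h_star)) / len(h_pred)

# Hypothetical toy values: predicted heuristic for two states on the
# optimal path and two states off it, plus made-up true costs-to-goal.
h_on, h_off = [1.0, 0.5], [5.0, 6.0]
h_star = [2.0, 1.0, 3.0, 4.0]

print("misordered pairs:", misordered_pairs(h_on, h_off))
print("ranking loss:    ", ranking_loss(h_on, h_off))
print("regression loss: ", regression_loss(h_on + h_off, h_star))
```

In this toy case the ordering is perfect while the squared error against h* is large. For greedy best-first search the values compared would be the heuristic alone, whereas for A* they would combine path cost and heuristic; how the paper tailors the loss to each variant is not specified on this page.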

Topics

Artificial Intelligence > Core AI > Planning
Machine Learning > Optimization & Theory > Learning Theory
Machine Learning > Application Areas > Domain Adaptation
Machine Learning > Learning Types > Imitation Learning

Keywords

imitation learning, heuristic search, planning algorithm, ranking loss, A* search, optimal efficiency, planning heuristics, cost-to-goal estimation
