Learning to Branch with Tree MDPs

Lara Scavuzzo; Feng Chen; Didier Chételat; Maxime Gasse; Andrea Lodi; Neil Yorke-Smith; Karen Aardal

NeurIPS 2022

Abstract

State-of-the-art Mixed Integer Linear Programming (MILP) solvers combine systematic tree search with a plethora of hard-coded heuristics, such as branching rules. While approaches to learn branching strategies have received increasing attention and have shown very promising results, most of the literature focuses on learning fast approximations of the strong branching rule. Instead, we propose to learn branching rules from scratch with Reinforcement Learning (RL). We revisit the work of Etheve et al. (2020) and propose a generalization of Markov Decision Processes (MDPs), which we call tree MDPs, that provides a more suitable formulation of the branching problem. We derive a policy gradient theorem for tree MDPs that exhibits better credit assignment than its temporal counterpart. We demonstrate through computational experiments that this new framework is suitable for tackling the learning-to-branch problem in MILP, and that it improves learning convergence.
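The credit-assignment contrast mentioned in the abstract can be illustrated with a small sketch. It is not taken from the paper: it assumes a reward of -1 per processed branch-and-bound node, so that a temporal return counts every node visited afterwards, while a tree-structured return counts only the nodes in the subtree below a decision. The function names `temporal_returns` and `tree_returns`, the toy tree, and the reward convention are all hypothetical choices made for illustration.

```python
from typing import Dict, List


def temporal_returns(visit_order: List[int]) -> Dict[int, int]:
    """Return-to-go when nodes are treated as a time sequence.

    Assumption: reward is -1 per processed node, so the return at the
    i-th visited node is minus the number of nodes from i to the end.
    """
    n = len(visit_order)
    return {node: -(n - i) for i, node in enumerate(visit_order)}


def tree_returns(children: Dict[int, List[int]], root: int) -> Dict[int, int]:
    """Return when credit follows the tree structure.

    Assumption: reward is -1 per node, so the return at a node is minus
    the size of the subtree rooted at that node (the node included).
    """
    returns: Dict[int, int] = {}
    # Iterative post-order traversal: children are resolved before parents.
    stack = [(root, False)]
    while stack:
        node, expanded = stack.pop()
        kids = children.get(node, [])
        if expanded:
            returns[node] = -1 + sum(returns[c] for c in kids)
        else:
            stack.append((node, True))
            for c in kids:
                stack.append((c, False))
    return returns


# Toy search tree: 0 branches into 1 and 2; 1 branches into 3 and 4.
toy_children = {0: [1, 2], 1: [3, 4]}
toy_visit_order = [0, 1, 3, 4, 2]  # depth-first processing order

temporal = temporal_returns(toy_visit_order)
tree = tree_returns(toy_children, root=0)
```

In this toy tree, node 3 is charged for nodes 4 and 2 under the temporal view even though the branching decision at node 3 could not have influenced them; under the tree view it is charged only for its own subtree.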

Topics

Artificial Intelligence > Core AI > Planning
Machine Learning > Optimization & Theory > Optimization
Mathematics & Optimization > Optimization > Combinatorial Optimization
Machine Learning > Learning Types > Reinforcement Learning
Deep Learning > Learning Types > Reinforcement Learning

Keywords

reinforcement learning; policy gradient; Markov decision process; branch and bound; mixed integer programming; credit assignment; mixed integer linear programming; tree MDP; branching rule
