Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning

Jian Wu; Saul Toscano-Palmerin; Peter I. Frazier; Andrew Gordon Wilson

2019 UAI UAI 2019

Practical Multi-fidelity Bayesian Optimization for Hyperparameter Tuning

Abstract

Bayesian optimization is popular for optimizing time-consuming black-box objectives. Nonetheless, for hyperparameter tuning in deep neural networks, the time required to evaluate the validation error for even a few hyperparameter settings remains a bottleneck. Multi-fidelity optimization promises relief using cheaper proxies to such objectives — for example, validation error for a network trained using a subset of the training points or fewer iterations than required for convergence. We propose a highly flexible and practical approach to multi-fidelity Bayesian optimization, focused on efficiently optimizing hyperparameters for iteratively trained supervised learning models. We introduce a new acquisition function, the trace-aware knowledge-gradient, which efficiently leverages both multiple continuous fidelity controls and trace observations — values of the objective at a sequence of fidelities, available when varying fidelity using training iterations. We provide a provably convergent method for optimizing our acquisition function and show it outperforms state-of-the-art alternatives for hyperparameter tuning of deep neural networks and large-scale kernel learning.

🚀 Conference Pioneer — UAI 2019

🧭 Keyword Pioneer — multi-fidelity optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

Authors

Jian Wu , Saul Toscano-Palmerin , Peter I. Frazier , Andrew Gordon Wilson

Topics

Machine Learning > Optimization & Theory > Bayesian Inference Artificial Intelligence > Bayesian & Probabilistic > Bayesian Inference Machine Learning > Learning Types > Hyperparameter Optimization

Keywords

bayesian optimization hyperparameter tuning knowledge gradient multi-fidelity optimization trace observation neural network

Download PDF

Related papers

Fisher-Bures Adversary Graph Convolutional Networks 2019

Augmenting and Tuning Knowledge Graph Embeddings 2019

Learning Factored Markov Decision Processes with Unawareness 2019

Expressive Priors in Bayesian Neural Networks: Kernel Combinations and Periodic Functions 2019

Countdown Regression: Sharp and Calibrated Survival Predictions 2019