Mix and Match: An Optimistic Tree-Search Approach for Learning Models from Mixture Distributions

Matthew Faw; Rajat Sen; Karthikeyan Shanmugam; Constantine Caramanis; Sanjay Shakkottai

2020 NIPS NeurIPS 2020

Mix and Match: An Optimistic Tree-Search Approach for Learning Models from Mixture Distributions

Abstract

We consider a covariate shift problem where one has access to several different training datasets for the same learning problem and a small validation set which possibly differs from all the individual training distributions. The distribution shift is due, in part, to \emph{unobserved} features in the datasets. The objective, then, is to find the best mixture distribution over the training datasets (with only observed features) such that training a learning algorithm using this mixture has the best validation performance. Our proposed algorithm, \textsf{Mix&Match}, combines stochastic gradient descent (SGD) with optimistic tree search and model re-use (evolving partially trained models with samples from different mixture distributions) over the space of mixtures, for this task. We prove a novel high probability bound on the final SGD iterate without relying on a global gradient norm bound, and use it to show the advantages of model re-use. Additionally, we provide simple regret guarantees for our algorithm with respect to recovering the optimal mixture, given a total budget of SGD evaluations. Finally, we validate our algorithm on two real-world datasets.

🧭 Keyword Pioneer — optimistic tree search

🐣 Hot Topic Early Bird — covariate shift

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Matthew Faw , Rajat Sen , Karthikeyan Shanmugam , Constantine Caramanis , Sanjay Shakkottai

Topics

Machine Learning > Learning Types > Unsupervised Learning Machine Learning > Optimization & Theory > Optimization Machine Learning > Application Areas > Domain Adaptation Machine Learning > Learning Types > Distribution Shift Machine Learning > Learning Paradigms > Domain Adaptation

Keywords

stochastic gradient descent domain adaptation covariate shift mixture distribution model reuse optimistic tree search model re-use

Download PDF

Related papers

Higher-Order Spectral Clustering of Directed Graphs 2020

Self-Supervised MultiModal Versatile Networks 2020

Multi-Robot Collision Avoidance under Uncertainty with Probabilistic Safety Barrier Certificates 2020

Causal Intervention for Weakly-Supervised Semantic Segmentation 2020

Taming Discrete Integration via the Boon of Dimensionality 2020