An LP for Sequential Learning Under Budgets

Joseph Wang; Kirill Trapeznikov; Venkatesh Saligrama

AISTATS 2014

Abstract

We present a convex framework for learning sequential decisions and apply it to the problem of learning under a budget. We consider the structure proposed in [1], where sensor measurements are acquired in a sequence. After each new measurement is acquired, the goal is to decide whether to stop and classify or to pay the cost of using the next sensor in the sequence. We introduce a novel formulation of an empirical risk objective for the multistage sequential decision problem. This objective naturally lends itself to a non-convex multilinear formulation. Nevertheless, we derive a novel perspective that leads to a tight convex objective. This is accomplished by expressing the empirical risk as a linear superposition of indicator functions. We then derive an LP formulation by using hinge loss surrogates. Our LP matches or exceeds the empirical performance of the non-convex alternating algorithm, which requires a large number of random initializations. Consequently, the LP has the advantages of guaranteed convergence, global optimality, repeatability, and computational efficiency.
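As a rough illustration of the stop-or-continue setting described in the abstract, the sketch below evaluates the empirical risk of a fixed sequential policy in plain Python. It is not the paper's formulation: the function names, the thresholded stop rule, and the additive combination of classification error and acquisition cost are all assumptions made for this example. The `hinge` helper only shows the kind of convex surrogate that upper-bounds a 0/1 indicator.

```python
from typing import Sequence


def hinge(z: float) -> float:
    """Hinge surrogate max(0, 1 - z); it upper-bounds the indicator 1[z <= 0]."""
    return max(0.0, 1.0 - z)


def sequential_risk(
    stop_scores: Sequence[Sequence[float]],
    correct: Sequence[Sequence[bool]],
    costs: Sequence[float],
    threshold: float = 0.0,
) -> float:
    """Average loss of a stop-or-continue policy over a sensor sequence.

    Hypothetical inputs (assumptions for this sketch):
      stop_scores[i][k]: score of the stage-k stop rule for example i;
                         the example stops at stage k if the score exceeds
                         `threshold`. The last stage always stops.
      correct[i][k]:     whether the stage-k classifier labels example i correctly.
      costs[k]:          cumulative acquisition cost paid when stopping at stage k.
    """
    total = 0.0
    for scores, ok in zip(stop_scores, correct):
        last = len(ok) - 1
        for k in range(len(ok)):
            # Stop here if this is the final stage or the stop rule fires.
            if k == last or scores[k] > threshold:
                # Assumed loss: 0/1 classification error plus cost paid so far.
                total += (0.0 if ok[k] else 1.0) + costs[k]
                break
    return total / len(correct)


# Two-stage toy data: three examples, one stop rule before the second sensor.
stop_scores = [[1.0], [-1.0], [-0.5]]
correct = [[True, True], [False, True], [False, False]]
costs = [0.0, 0.25]
risk = sequential_risk(stop_scores, correct, costs)
```

In this toy data the first example stops early at no cost, while the other two pay for the second sensor. Each example's loss is a sum of indicator-style terms (did it stop at stage k, was it misclassified there), which is the kind of quantity a hinge surrogate can bound from above.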

Topics

Machine Learning > Learning Types > Active Learning
Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

budgeted learning, linear programming, sequential decision, empirical risk, sensor selection
