Learning Adaptive Value of Information for Structured Prediction

David J Weiss; Ben Taskar

2013 NIPS NeurIPS 2013

Learning Adaptive Value of Information for Structured Prediction

Abstract

Discriminative methods for learning structured models have enabled wide-spread use of very rich feature representations. However, the computational cost of feature extraction is prohibitive for large-scale or time-sensitive applications, often dominating the cost of inference in the models. Significant efforts have been devoted to sparsity-based model selection to decrease this cost. Such feature selection methods control computation statically and miss the opportunity to fine-tune feature extraction to each input at run-time. We address the key challenge of learning to control fine-grained feature extraction adaptively, exploiting non-homogeneity of the data. We propose an architecture that uses a rich feedback loop between extraction and prediction. The run-time control policy is learned using efficient value-function approximation, which adaptively determines the value of information of features at the level of individual variables for each input. We demonstrate significant speedups over state-of-the-art methods on two challenging datasets. For articulated pose estimation in video, we achieve a more accurate state-of-the-art model that is simultaneously 4$\times$ faster while using only a small fraction of possible features, with similar results on an OCR task.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

🧭 Keyword Pioneer — adaptive feature extraction

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

📈 Trend Setter — Model Compression

🐣 Hot Topic Early Bird — structured prediction

Authors

David J Weiss , Ben Taskar

Topics

Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Active Learning Machine Learning > Application Areas > Efficient Computing Computer Vision > Analysis > Human Pose Estimation Machine Learning > Application Areas > Model Compression Machine Learning > Learning Types > Reinforcement Learning Machine Learning > Learning Types > Representation Learning Machine Learning > Core Methods > Feature Learning Machine Learning > Core Methods > Multi-Task Learning Artificial Intelligence > Core AI > Computer Vision Machine Learning > Learning Types > Optimization Computer Vision > Analysis > Pose Estimation

Keywords

feature extraction structured prediction feature selection value function approximation articulated pose estimation value of information adaptive feature extraction adaptive computation

Download PDF

Related papers

Latent Structured Active Learning 2013

On Flat versus Hierarchical Classification in Large-Scale Taxonomies 2013

Generalized Method-of-Moments for Rank Aggregation 2013

Third-Order Edge Statistics: Contour Continuation, Curvature, and Cortical Connections 2013

Accelerated Mini-Batch Stochastic Dual Coordinate Ascent 2013