Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

Congcong Li; Adarsh Kowdle; Ashutosh Saxena; Tsuhan Chen

2010 NIPS NeurIPS 2010

Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models

Abstract

In many machine learning domains (such as scene understanding), several related sub-tasks (such as scene categorization, depth estimation, object detection) operate on the same raw data and provide correlated outputs. Each of these tasks is often notoriously hard, and state-of-the-art classifiers already exist for many sub-tasks. It is desirable to have an algorithm that can capture such correlation without requiring to make any changes to the inner workings of any classifier. We propose Feedback Enabled Cascaded Classification Models (FE-CCM), that maximizes the joint likelihood of the sub-tasks, while requiring only a ‘black-box’ interface to the original classifier for each sub-task. We use a two-layer cascade of classifiers, which are repeated instantiations of the original ones, with the output of the first layer fed into the second layer as input. Our training method involves a feedback step that allows later classifiers to provide earlier classifiers information about what error modes to focus on. We show that our method significantly improves performance in all the sub-tasks in two different domains: (i) scene understanding, where we consider depth estimation, scene categorization, event categorization, object detection, geometric labeling and saliency detection, and (ii) robotic grasping, where we consider grasp point detection and object classification.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

🧭 Keyword Pioneer — feedback mechanism

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

🌱 Topic Pioneer — Multi-Task Learning

📈 Trend Setter — Robotics

🐣 Hot Topic Early Bird — multi-task learning

Authors

Congcong Li , Adarsh Kowdle , Ashutosh Saxena , Tsuhan Chen

Topics

Machine Learning > Core Methods > Classification Computer Vision > Analysis > Scene Understanding Machine Learning > Learning Types > Multi-Task Learning Artificial Intelligence > Core AI > Robotics Computer Vision > Core AI > Computer Vision Artificial Intelligence > Core AI > Multi-Task Learning

Keywords

multi-task learning scene understanding object detection depth estimation robotic grasping cascaded classification saliency detection feedback mechanism

Download PDF

Related papers

Link Discovery using Graph Feature Tracking 2010

Trading off Mistakes and Don't-Know Predictions 2010

A Novel Kernel for Learning a Neuron Model from Spike Train Data 2010

Decomposing Isotonic Regression for Efficiently Solving Large Problems 2010

Learning Kernels with Radiuses of Minimum Enclosing Balls 2010