Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video

Yang Yang; Guang Shu; Mubarak Shah

2013 CVPR CVPR 2013

Semi-supervised Learning of Feature Hierarchies for Object Detection in a Video

Abstract

We propose a novel approach to boost the performance of generic object detectors on videos by learning videospecific features using a deep neural network. The insight behind our proposed approach is that an object appearing in different frames of a video clip should share similar features, which can be learned to build better detectors. Unlike many supervised detector adaptation or detection-bytracking methods, our method does not require any extra annotations or utilize temporal correspondence. We start with the high-confidence detections from a generic detector, then iteratively learn new video-specific features and refine the detection scores. In order to learn discriminative and compact features, we propose a new feature learning method using a deep neural network based on auto encoders. It differs from the existing unsupervised feature learning methods in two ways: first it optimizes both discriminative and generative properties of the features simultaneously, which gives our features better discriminative ability; second, our learned features are more compact, while the unsupervised feature learning methods usually learn a redundant set of over-complete features. Extensive experimental results on person and horse detection show that significant performance improvement can be achieved with our proposed method.

🚀 Conference Pioneer — CVPR 2013

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning and Machine Learning

📈 Trend Setter — Semi-Supervised Learning

🧭 Keyword Pioneer — auto encoder

🐣 Hot Topic Early Bird — deep neural network

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yang Yang , Guang Shu , Mubarak Shah

Topics

Machine Learning > Learning Types > Semi-Supervised Learning Deep Learning > Architectures > Autoencoders Computer Vision > Analysis > Object Detection Deep Learning > Learning Types > Semi-Supervised Learning

Keywords

semi-supervised learning feature learning object detection video understanding feature hierarchy deep neural network discriminative feature auto encoder

Download PDF

Related papers

Nonlinearly Constrained MRFs: Exploring the Intrinsic Dimensions of Higher-Order Cliques 2013

An Approach to Pose-Based Action Recognition 2013

Modeling Actions through State Changes 2013

A Convex Regularizer for Reducing Color Artifact in Color Image Recovery 2013

Deformable Spatial Pyramid Matching for Fast Dense Correspondences 2013