Multi-fold MIL Training for Weakly Supervised Object Localization

Ramazan Gokberk Cinbis; Jakob Verbeek; Cordelia Schmid

2014 CVPR CVPR 2014

Multi-fold MIL Training for Weakly Supervised Object Localization

Abstract

Object category localization is a challenging problem in computer vision. Standard supervised training requires bounding box annotations of object instances. This time-consuming annotation process is sidestepped in weakly supervised learning. In this case, the supervised information is restricted to binary labels that indicate the absence/presence of object instances in the image, without their locations. We follow a multiple-instance learning approach that iteratively trains the detector and infers the object locations in the positive training images. Our main contribution is a multi-fold multiple instance learning procedure, which prevents training from prematurely locking onto erroneous object locations. This procedure is particularly important when high-dimensional representations, such as the Fisher vectors, are used. We present a detailed experimental evaluation using the PASCAL VOC 2007 dataset. Compared to state-of-the-art weakly supervised detectors, our approach better localizes objects in the training images, which translates into improved detection performance.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

📈 Trend Setter — Multi-Instance Learning

🐣 Hot Topic Early Bird — object localization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ramazan Gokberk Cinbis , Jakob Verbeek , Cordelia Schmid

Topics

Machine Learning > Learning Types > Weakly Supervised Learning Computer Vision > Analysis > Object Detection Machine Learning > Learning Types > Multi-Instance Learning Machine Learning > Learning Paradigms > Weakly Supervised Learning

Keywords

feature extraction object detection weakly supervised learning object localization multiple instance learning bounding box annotation

Download PDF

Related papers

Efficient Nonlinear Markov Models for Human Motion 2014

Occlusion Geodesics for Online Multi-Object Tracking 2014

A Principled Approach for Coarse-to-Fine MAP Inference 2014

Locally Optimized Product Quantization for Approximate Nearest Neighbor Search 2014

Fast and Accurate Image Matching with Cascade Hashing for 3D Reconstruction 2014