Scene Parsing with Object Instances and Occlusion Ordering

Joseph Tighe; Marc Niethammer; Svetlana Lazebnik

2014 CVPR CVPR 2014

Scene Parsing with Object Instances and Occlusion Ordering

Abstract

This work proposes a method to interpret a scene by assigning a semantic label at every pixel and inferring the spatial extent of individual object instances together with their occlusion relationships. Starting with an initial pixel labeling and a set of candidate object masks for a given test image, we select a subset of objects that explain the image well and have valid overlap relationships and occlusion ordering. This is done by minimizing an integer quadratic program either using a greedy method or a standard solver. Then we alternate between using the object predictions to refine the pixel labels and vice versa. The proposed system obtains promising results on two challenging subsets of the LabelMe and SUN datasets, the largest of which contains 45,676 images and 232 classes.

🧭 Keyword Pioneer — object instance detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Joseph Tighe , Marc Niethammer , Svetlana Lazebnik

Topics

Computer Vision > Analysis > Object Detection Computer Vision > Analysis > Scene Understanding Computer Vision > Analysis > Semantic Segmentation Computer Vision > Processing > Image Segmentation

Keywords

semantic segmentation scene parsing object instance detection object instance pixel labeling integer quadratic programming occlusion ordering scene parsing method occlusion ordering inference

Download PDF

Related papers

Efficient Nonlinear Markov Models for Human Motion 2014

Occlusion Geodesics for Online Multi-Object Tracking 2014

A Principled Approach for Coarse-to-Fine MAP Inference 2014

Locally Optimized Product Quantization for Approximate Nearest Neighbor Search 2014

Fast and Accurate Image Matching with Cascade Hashing for 3D Reconstruction 2014