Seamless Scene Segmentation

Lorenzo Porzi; Samuel Rota Bulò; Aleksander Colovic; Peter Kontschieder

CVPR 2019

Abstract

In this work we introduce a novel CNN-based architecture that can be trained end-to-end to deliver seamless scene segmentation results. Our goal is to predict consistent semantic segmentation and detection results by means of a panoptic output format, going beyond the simple combination of independently trained segmentation and detection models. The proposed architecture takes advantage of a novel segmentation head that seamlessly integrates multi-scale features generated by a Feature Pyramid Network with contextual information conveyed by a lightweight DeepLab-like module. As an additional contribution, we review the panoptic metric and propose an alternative that overcomes its limitations when evaluating non-instance categories. Our proposed network architecture yields state-of-the-art results on three challenging street-level datasets: Cityscapes, Indian Driving Dataset, and Mapillary Vistas.
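
For readers unfamiliar with the panoptic metric the abstract refers to, the following is a minimal sketch of the commonly used panoptic quality (PQ) score for a single class. It is not the alternative metric proposed in this paper, which the abstract does not specify. The function name `panoptic_quality` and the representation of segments as Python sets of pixel indices are assumptions made for illustration only. A predicted segment counts as a true positive when its intersection-over-union (IoU) with a ground-truth segment exceeds 0.5.

```python
from typing import List, Set


def panoptic_quality(pred_segments: List[Set[int]],
                     gt_segments: List[Set[int]]) -> float:
    """Sketch of PQ for one class (hypothetical helper, not the paper's code).

    Each segment is a set of pixel indices. Segments within one list are
    assumed not to overlap, so an IoU > 0.5 match is unique.
    """
    matched_gt = set()   # indices of ground-truth segments already matched
    iou_sum = 0.0        # sum of IoU over true-positive pairs
    true_pos = 0

    for pred in pred_segments:
        for j, gt in enumerate(gt_segments):
            if j in matched_gt:
                continue
            union = len(pred | gt)
            iou = len(pred & gt) / union if union else 0.0
            if iou > 0.5:
                matched_gt.add(j)
                iou_sum += iou
                true_pos += 1
                break

    false_pos = len(pred_segments) - true_pos  # unmatched predictions
    false_neg = len(gt_segments) - true_pos    # unmatched ground truth
    denom = true_pos + 0.5 * false_pos + 0.5 * false_neg
    return iou_sum / denom if denom else 0.0


# Toy example: one good match, one spurious prediction, one missed segment.
gt = [set(range(0, 10)), set(range(10, 20))]
pred = [set(range(0, 8)), set(range(30, 35))]
pq = panoptic_quality(pred, gt)
```

In this toy example there is one true positive, one false positive, and one false negative, so the single matched IoU is divided by 2. For non-instance ("stuff") categories each image has at most one segment per class, so the 0.5 IoU threshold decides whether that whole class counts as matched or missed.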

Topics

Computer Vision > Analysis > Object Detection
Computer Vision > Analysis > Scene Understanding
Computer Vision > Processing > Semantic Segmentation
Computer Vision > Analysis > Object Segmentation
Deep Learning > Learning Types > Multi-Task Learning

Keywords

semantic segmentation, multi-task learning, object detection, instance segmentation, convolutional neural network, panoptic segmentation, feature pyramid network
