Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation

Anirud Thyagharajan; Benjamin Ummenhofer; Prashant Laddha; Om Ji Omer; Sreenivas Subramoney

2022 CVPR CVPR 2022

Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation

Abstract

3D semantic segmentation is a fundamental building block for several scene understanding applications such as autonomous driving, robotics and AR/VR. Several state-of-the-art semantic segmentation models suffer from the part-misclassification problem, wherein parts of the same object are labelled incorrectly. Previous methods have utilized hierarchical, iterative methods to fuse semantic and instance information, but they lack learnability in context fusion, and are computationally complex and heuristic driven. This paper presents Segment-Fusion, a novel attention-based method for hierarchical fusion of semantic and instance information to address the part misclassifications. The presented method includes a graph segmentation algorithm for grouping points into segments that pools point-wise features into segment-wise features, a learnable attention-based network to fuse these segments based on their semantic and instance features, and followed by a simple yet effective connected component labelling algorithm to convert segment features to instance labels. Segment-Fusion can be flexibly employed with any network architecture for semantic/instance segmentation. It improves the qualitative and quantitative performance of several semantic segmentation backbones by upto 5% on the ScanNet and S3DIS datasets.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🐣 Hot Topic Early Bird — 3d semantic segmentation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Anirud Thyagharajan , Benjamin Ummenhofer , Prashant Laddha , Om Ji Omer , Sreenivas Subramoney

Topics

Deep Learning > Architectures > Graph Neural Networks Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Semantic Segmentation Computer Vision > Processing > Semantic Segmentation

Keywords

attention mechanism point cloud instance segmentation 3d semantic segmentation graph neural network

Download PDF

Related papers

UniCoRN: A Unified Conditional Image Repainting Network 2022

Why Discard if You Can Recycle?: A Recycling Max Pooling Module for 3D Point Cloud Analysis 2022

All-in-One Image Restoration for Unknown Corruption 2022

Stability-Driven Contact Reconstruction From Monocular Color Images 2022

Forecasting Characteristic 3D Poses of Human Actions 2022