Multi-Space Alignments Towards Universal LiDAR Segmentation

Youquan Liu; Lingdong Kong; Xiaoyang Wu; Runnan Chen; Xin Li; Liang Pan; Ziwei Liu; Yuexin Ma

CVPR 2024

Abstract

A unified and versatile LiDAR segmentation model with strong robustness and generalizability is desirable for safe autonomous driving perception. This work presents M3Net, a one-of-a-kind framework for fulfilling multi-task, multi-dataset, multi-modality LiDAR segmentation in a universal manner using just a single set of parameters. To better exploit data volume and diversity, we first combine large-scale driving datasets acquired by different types of sensors from diverse scenes and then conduct alignments in three spaces, namely the data, feature, and label spaces, during training. As a result, M3Net is capable of taming heterogeneous data for training state-of-the-art LiDAR segmentation models. Extensive experiments on twelve LiDAR segmentation datasets verify its effectiveness. Notably, using a shared set of parameters, M3Net achieves 75.1%, 83.1%, and 72.4% mIoU scores, respectively, on the official benchmarks of SemanticKITTI, nuScenes, and Waymo Open.
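The scores above are reported in mIoU (mean intersection-over-union), the standard per-class overlap metric for semantic segmentation. As a reading aid, here is a minimal sketch of how mIoU can be computed from per-point predictions and ground-truth labels. It is not the authors' evaluation code: the function name `mean_iou`, the `ignore_index` convention, and the toy labels are assumptions made for illustration, and each official benchmark defines its own class list and ignored points.

```python
from typing import Sequence


def mean_iou(pred: Sequence[int], target: Sequence[int], num_classes: int,
             ignore_index: int = -1) -> float:
    """Average the per-class IoU over classes present in pred or target.

    Hypothetical helper for illustration; official benchmarks may differ
    in which classes and points they exclude.
    """
    if len(pred) != len(target):
        raise ValueError("pred and target must have the same length")

    intersection = [0] * num_classes  # points where pred == target == c
    pred_count = [0] * num_classes    # points predicted as class c
    target_count = [0] * num_classes  # points labelled as class c

    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabelled points are excluded from the metric
        pred_count[p] += 1
        target_count[t] += 1
        if p == t:
            intersection[t] += 1

    ious = []
    for c in range(num_classes):
        # |A union B| = |A| + |B| - |A intersect B|
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:  # skip classes absent from both pred and target
            ious.append(intersection[c] / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: six points, three classes (made-up labels).
pred = [0, 0, 1, 1, 2, 2]
target = [0, 1, 1, 1, 2, 0]
score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.3f}")
```

In this toy case the per-class IoUs are 1/3, 2/3, and 1/2, so the mean is 0.5. In practice the counts are usually accumulated over an entire validation or test split before the per-class ratios are taken, rather than averaged scan by scan.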

Topics

Computer Vision > Analysis > 3D Vision
Computer Vision > Domain-Specific > Autonomous Driving
Deep Learning > Learning Types > Multi-Modal Learning
Deep Learning > Learning Types > Multi-Task Learning

Keywords

semantic segmentation, multi-task learning, domain adaptation, 3D vision, multi-modal learning, multi-modality learning, LiDAR segmentation
