DiffuBox: Refining 3D Object Detection with Point Diffusion

Xiangyu Chen; Zhenzhen Liu; Katie Z Luo; Siddhartha Datta; Adhitya Polavaram; Yan Wang; Yurong You; Boyi Li; Marco Pavone; Wei-Lun Chao; Mark Campbell; Bharath Hariharan; Kilian Q. Weinberger

2024 NIPS NeurIPS 2024

DiffuBox: Refining 3D Object Detection with Point Diffusion

Abstract

Ensuring robust 3D object detection and localization is crucial for many applications in robotics and autonomous driving. Recent models, however, face difficulties in maintaining high performance when applied to domains with differing sensor setups or geographic locations, often resulting in poor localization accuracy due to domain shift. To overcome this challenge, we introduce a novel diffusion-based box refinement approach. This method employs a domain-agnostic diffusion model, conditioned on the LiDAR points surrounding a coarse bounding box, to simultaneously refine the box's location, size, and orientation. We evaluate this approach under various domain adaptation settings, and our results reveal significant improvements across different datasets, object classes and detectors. Our PyTorch implementation is available at https://github.com/cxy1997/DiffuBox.

🌉 Interdisciplinary Bridge — Computer Vision and Deep Learning

🧭 Keyword Pioneer — point diffusion

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Xiangyu Chen , Zhenzhen Liu , Katie Z Luo , Siddhartha Datta , Adhitya Polavaram , Yan Wang , Yurong You , Boyi Li , Marco Pavone , Wei-Lun Chao , Mark Campbell , Bharath Hariharan , Kilian Q. Weinberger

Topics

Deep Learning > Models > Diffusion Models Computer Vision > Analysis > 3D Vision Computer Vision > Domain-Specific > Autonomous Driving

Keywords

domain adaptation 3d object detection bounding box point diffusion

Download PDF

Related papers

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers 2024

Training for Stable Explanation for Free 2024

NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks 2024

Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch 2024

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence 2024