VILAM: Infrastructure-assisted 3D Visual Localization and Mapping for Autonomous Driving

Jiahe Cui; Shuyao Shi; Yuze He; Jianwei Niu; Guoliang Xing; Zhenchao Ouyang

2024 NSDI NSDI 2024

VILAM: Infrastructure-assisted 3D Visual Localization and Mapping for Autonomous Driving

Abstract

Visual Simultaneous Localization and Mapping (SLAM) presents a promising avenue for fulfilling the essential perception and localization tasks in autonomous driving systems using cost-effective visual sensors. Nevertheless, existing visual SLAM frameworks often suffer from substantial cumulative errors and performance degradation in complicated driving scenarios. In this paper, we propose VILAM, a novel framework that leverages intelligent roadside infrastructures to realize high-precision and globally consistent localization and mapping on autonomous vehicles. The key idea of VILAM is to utilize the precise scene measurement from the infrastructure as global references to correct errors in the local map constructed by the vehicle. To overcome the unique deformation in the 3D local map to align it with the infrastructure measurement, VILAM proposes a novel elastic point cloud registration method that enables independent optimization of different parts of the local map. Moreover, VILAM adopts a lightweight factor graph construction and optimization to first correct the vehicle trajectory, and thus reconstruct the consistent global map efficiently. We implement the VILAM end-to-end on a real-world smart lamppost testbed in multiple road scenarios. Extensive experiment results show that VILAM can achieve decimeter-level localization and mapping accuracy with consumer-level onboard cameras and is robust under diverse road scenarios. A video demo of VILAM on our real-world testbed is available at https://youtu.be/lTlqDNipDVE.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics

Authors

Jiahe Cui , Shuyao Shi , Yuze He , Jianwei Niu , Guoliang Xing , Zhenchao Ouyang

Topics

Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Depth Estimation Computer Vision > Domain-Specific > Autonomous Driving

Keywords

point cloud registration visual slam factor graph 3d mapping

Download PDF

Related papers

Accelerating Skewed Workloads With Performance Multipliers in the TurboDB Distributed Database 2024

Efficient Exposure of Partial Failure Bugs in Distributed Systems with Inferred Abstract States 2024

Making Kernel Bypass Practical for the Cloud with Junction 2024

Horus: Granular In-Network Task Scheduler for Cloud Datacenters 2024

Fast Vector Query Processing for Large Datasets Beyond GPU Memory with Reordered Pipelining 2024