ROMAN: Open-Set Object Map Alignment for Robust View-Invariant Global Localization

Mason Peterson; Yixuan Jia; Yulun Tian; Annika Thomas; Jonathan P. How

RSS 2025

Abstract

Global localization is a fundamental capability required for long-term and drift-free robot navigation. However, current methods fail to relocalize when faced with significantly different viewpoints. We present ROMAN (Robust Object Map Alignment Anywhere), a global localization method capable of localizing in challenging and diverse environments by creating and aligning maps of open-set and view-invariant objects. ROMAN formulates and solves a registration problem between object submaps using a unified graph-theoretic global data association approach with a novel incorporation of a gravity direction prior and object shape and semantic similarity. This work's open-set object mapping and information-rich object association algorithm enables global localization, even in instances when maps are created from robots traveling in opposite directions. Through a set of challenging global localization experiments in indoor, urban, and unstructured/forested environments, we demonstrate that ROMAN achieves higher relative pose estimation accuracy than other image-based pose estimation methods or segment-based registration methods. Additionally, we evaluate ROMAN as a loop closure module in large-scale multi-robot SLAM and show a 35% improvement in trajectory estimation error compared to standard SLAM systems using visual features for loop closures. Code and videos can be found at https://acl.mit.edu/roman.
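To illustrate why a gravity direction prior helps when aligning object maps, here is a minimal sketch, not ROMAN's implementation. It assumes object correspondences between two submaps are already known (the abstract's data association step is not reproduced), that each object is reduced to a 3D centroid, and that both maps have their z axis aligned with gravity, so only yaw and translation remain to be estimated. The function name `align_yaw_translation` and the demo values are hypothetical.

```python
import numpy as np


def align_yaw_translation(src, dst):
    """Least-squares yaw + translation mapping src onto dst.

    src, dst: (N, 3) arrays of matched object centroids, with the z axis
    of both maps assumed to be aligned with gravity (an assumption of
    this sketch).
    """
    src = np.asarray(src, dtype=float)
    dst = np.asarray(dst, dtype=float)
    src_mean = src.mean(axis=0)
    dst_mean = dst.mean(axis=0)
    a = src - src_mean  # centered source centroids
    b = dst - dst_mean  # centered target centroids

    # Closed-form yaw from the centered x-y coordinates.
    sin_term = np.sum(a[:, 0] * b[:, 1] - a[:, 1] * b[:, 0])
    cos_term = np.sum(a[:, 0] * b[:, 0] + a[:, 1] * b[:, 1])
    yaw = np.arctan2(sin_term, cos_term)

    c, s = np.cos(yaw), np.sin(yaw)
    R = np.array([[c, -s, 0.0],
                  [s, c, 0.0],
                  [0.0, 0.0, 1.0]])
    t = dst_mean - R @ src_mean
    return R, t, yaw


# Hypothetical demo: the second map is seen from a nearly opposite heading.
rng = np.random.default_rng(0)
map_a = rng.uniform(-10.0, 10.0, size=(6, 3))
yaw_true = np.deg2rad(170.0)
R_true = np.array([[np.cos(yaw_true), -np.sin(yaw_true), 0.0],
                   [np.sin(yaw_true), np.cos(yaw_true), 0.0],
                   [0.0, 0.0, 1.0]])
map_b = map_a @ R_true.T + np.array([3.0, -2.0, 0.5])

R_est, t_est, yaw_est = align_yaw_translation(map_a, map_b)
aligned = map_a @ R_est.T + t_est  # map_a expressed in map_b's frame
```

Restricting the rotation to yaw reduces the problem from six to four degrees of freedom, which is one plausible way a gravity prior can make alignment better conditioned when few objects are matched.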

Topics

Machine Learning > Application Areas > Domain Adaptation
Computer Vision > Analysis > 3D Vision
Computer Vision > Domain-Specific > Autonomous Driving

Keywords

point cloud registration, object detection, global localization, map alignment
