OmniCity: Omnipotent City Understanding With Multi-Level and Multi-View Images

Weijia Li; Yawen Lai; Linning Xu; Yuanbo Xiangli; Jinhua Yu; Conghui He; Gui-Song Xia; Dahua Lin

2023 CVPR CVPR 2023

OmniCity: Omnipotent City Understanding With Multi-Level and Multi-View Images

Abstract

This paper presents OmniCity, a new dataset for omnipotent city understanding from multi-level and multi-view images. More precisely, OmniCity contains multi-view satellite images as well as street-level panorama and mono-view images, constituting over 100K pixel-wise annotated images that are well-aligned and collected from 25K geo-locations in New York City. To alleviate the substantial pixel-wise annotation efforts, we propose an efficient street-view image annotation pipeline that leverages the existing label maps of satellite view and the transformation relations between different views (satellite, panorama, and mono-view). With the new OmniCity dataset, we provide benchmarks for a variety of tasks including building footprint extraction, height estimation, and building plane/instance/fine-grained segmentation. Compared with existing multi-level and multi-view benchmarks, OmniCity contains a larger number of images with richer annotation types and more views, provides more benchmark results of state-of-the-art models, and introduces a new task for fine-grained building instance segmentation on street-level panorama images. Moreover, OmniCity provides new problem settings for existing tasks, such as cross-view image matching, synthesis, segmentation, detection, etc., and facilitates the developing of new methods for large-scale city understanding, reconstruction, and simulation. The OmniCity dataset as well as the benchmarks will be released at https://city-super.github.io/omnicity/.

🧭 Keyword Pioneer — street-view image

🐣 Hot Topic Early Bird — satellite imagery

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Weijia Li , Yawen Lai , Linning Xu , Yuanbo Xiangli , Jinhua Yu , Conghui He , Gui-Song Xia , Dahua Lin

Topics

Computer Vision > Analysis > 3D Vision Computer Vision > Analysis > Object Detection Computer Vision > Analysis > Semantic Segmentation Computer Vision > Processing > Image Segmentation Computer Vision > Domain-Specific > Remote Sensing

Keywords

semantic segmentation instance segmentation cross-view matching multi-view image satellite imagery height estimation street-view image building extraction building footprint extraction street-level panorama city reconstruction building segmentation city understanding

Download PDF

Related papers

CORA: Adapting CLIP for Open-Vocabulary Detection With Region Prompting and Anchor Pre-Matching 2023

3DAvatarGAN: Bridging Domains for Personalized Editable Avatars 2023

Physics-Driven Diffusion Models for Impact Sound Synthesis From Videos 2023

Transductive Few-Shot Learning With Prototype-Based Label Propagation by Iterative Graph Refinement 2023

EXIF As Language: Learning Cross-Modal Associations Between Images and Camera Metadata 2023