Cross-View Completion Models are Zero-shot Correspondence Estimators

Honggyu An; Jin Hyeon Kim; Seonghoon Park; Jaewoo Jung; Jisang Han; Sunghwan Hong; Seungryong Kim

2025 CVPR CVPR 2025

Cross-View Completion Models are Zero-shot Correspondence Estimators

Abstract

In this work, we analyze new aspects of cross-view completion, mainly through the analogy of cross-view completion and traditional self-supervised correspondence learning algorithms. Based on our analysis, we reveal that the cross-attention map of Croco-v2, best reflects this correspondence information compared to other correlations from the encoder or decoder features. We further verify the effectiveness of the cross-attention map by evaluating on both zero-shot and supervised dense geometric correspondence and multi-frame depth estimation.

🧭 Keyword Pioneer — zero-shot correspondence

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Deep Learning, Machine Learning

Authors

Honggyu An , Jin Hyeon Kim , Seonghoon Park , Jaewoo Jung , Jisang Han , Sunghwan Hong , Seungryong Kim

Topics

Machine Learning > Learning Types > Self-Supervised Learning Machine Learning > Learning Types > Zero-Shot Learning

Keywords

cross-attention map cross-view completion zero-shot correspondence self-supervised correspondence learning dense geometric correspondence

Download PDF

Related papers

AnyCam: Learning to Recover Camera Poses and Intrinsics from Casual Videos 2025

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding 2025

FADE: Frequency-Aware Diffusion Model Factorization for Video Editing 2025

Fast and Accurate Gigapixel Pathological Image Classification with Hierarchical Distillation Multi-Instance Learning 2025

Reversible Decoupling Network for Single Image Reflection Removal 2025