OT-VP: Optimal Transport-Guided Visual Prompting for Test-Time Adaptation

Yunbei Zhang; Akshay Mehra; Jihun Hamm

2025 WACV WACV 2025

OT-VP: Optimal Transport-Guided Visual Prompting for Test-Time Adaptation

Abstract

Vision Transformers (ViTs) have demonstrated remarkable capabilities in learning representations but their performance is compromised when applied to unseen domains. Previous methods either engage in prompt learning during the training phase or modify model parameters at test time through entropy minimization. The former often overlooks unlabeled target data while the latter doesn't fully address domain shifts. In this work our approach Optimal Transport-guided Test-Time Visual Prompting (OT-VP) handles these problems by leveraging prompt learning at test time to align the target and source domains without accessing the training process or altering pre-trained model parameters. This method involves learning a universal visual prompt for the target domain by optimizing the Optimal Transport distance. With only four learned prompt tokens OT-VP exceeds state-of-the-art performance across three stylistic datasets--PACS VLCS OfficeHome and one corrupted dataset ImageNet-C. Additionally OT-VP operates efficiently both in terms of memory and computation and is adaptable for extension to online settings. The code is available at https://github.com/zybeich/OT-VP.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yunbei Zhang , Akshay Mehra , Jihun Hamm

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Application Areas > Domain Adaptation Deep Learning > Architectures > Transformers

Keywords

vision transformer optimal transport test-time adaptation domain shift visual prompting prompt token

Download PDF

Related papers

Neural Graph Map: Dense Mapping with Efficient Loop Closure Integration 2025

ELMGS: Enhancing Memory and Computation Scalability through Compression for 3D Gaussian Splatting 2025

Feature Fusion Transferability Aware Transformer for Unsupervised Domain Adaptation 2025

Uncertainty-Aware Online Extrinsic Calibration: A Conformal Prediction Approach 2025

Disentangling Spatio-Temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video 2025