Probabilistic Feature Matching for Fast Scalable Visual Prompting

Thomas Frick; Cezary Skura; Filip M. Janicki; Roy Assaf; Niccolo Avogaro; Daniel Caraballo; Yagmur G. Cinar; Brown Ebouky; Ioana Giurgiu; Takayuki Katsuki; Piotr Kluska; Cristiano Malossi; Haoxiang Qiu; Tomoya Sakai; Florian Scheidegger; Andrej Simeski; Daniel Yang; Andrea Bartezzaghi; Mattia Rigotti

2024 IJCAI IJCAI 2024

Probabilistic Feature Matching for Fast Scalable Visual Prompting

Abstract

In this work, we propose a novel framework for image segmentation guided by visual prompting which leverages the power of vision foundation models. Inspired by recent advancements in computer vision, our approach integrates multiple large-scale pretrained models to address the challenges of segmentation tasks with limited and sparsely annotated data interactively provided by a user. Our method combines a frozen feature extraction backbone with a scalable and efficient probabilistic feature correspondence (soft matching) procedure derived from Optimal Transport to couple pixels between reference and target images. Moreover, a pretrained segmentation model is harnessed to translate user scribbles into reference masks and matched target pixels into output target segmentation masks. This results in a framework that we name Softmatcher, a versatile and fast training-free architecture for image segmentation by visual prompting. We demonstrate the efficiency and scalability of Softmatcher for real-time interactive image segmentation by visual prompting and showcase it in diverse visual domains including technical visual inspection use cases.

🌉 Interdisciplinary Bridge — Computer Vision and Machine Learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Thomas Frick , Cezary Skura , Filip M. Janicki , Roy Assaf , Niccolo Avogaro , Daniel Caraballo , Yagmur G. Cinar , Brown Ebouky , Ioana Giurgiu , Takayuki Katsuki , Piotr Kluska , Cristiano Malossi , Haoxiang Qiu , Tomoya Sakai , Florian Scheidegger , Andrej Simeski , Daniel Yang , Andrea Bartezzaghi , Mattia Rigotti

Topics

Machine Learning > Optimization & Theory > Bayesian Inference Machine Learning > Optimization & Theory > Optimization Machine Learning > Application Areas > Domain Adaptation Computer Vision > Analysis > Object Detection Computer Vision > Processing > Image Segmentation

Keywords

semantic segmentation image segmentation optimal transport feature matching foundation model visual prompting interactive segmentation

Download PDF

Related papers

Langshaw: Declarative Interaction Protocols Based on Sayso and Conflict 2024

A Successful Strategy for Multichannel Iterated Prisoner’s Dilemma 2024

Bring Metric Functions into Diffusion Models 2024

Fast One-Stage Unsupervised Domain Adaptive Person Search 2024

FreqFormer: Frequency-aware Transformer for Lightweight Image Super-resolution 2024