← Applications

Computer Vision › Applications ›

Computer Vision

329 directly classified papers

Papers per year

Papers

Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models ACL 2025

MICE: Mixture of Image Captioning Experts Augmented e-Commerce Product Attribute Value Extraction ACL 2025

TinySAM: Pushing the Envelope for Efficient Segment Anything Model AAAI 2025

What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations ACL 2025

CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model ACL 2025

WinSpot: GUI Grounding Benchmark with Multimodal Large Language Models ACL 2025

ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters AAAI 2025

QuARF: Quality-Adaptive Receptive Fields for Degraded Image Perception AAAI 2025

Noisy Correspondence Rectification via Asymmetric Similarity Learning AAAI 2025

Risk Controlled Image Retrieval AAAI 2025

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia ACL 2025

SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction ACL 2025

Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans? CVPR 2025

IMOL: Incomplete-Modality-Tolerant Learning for Multi-Domain Fake News Video Detection ACL 2025

Leveraging Asynchronous Spiking Neural Networks for Ultra Efficient Event-Based Visual Processing AAAI 2025

VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering AAAI 2025

FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments AAAI 2025

Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images AAAI 2025

RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone WACV 2025

MYOPIA: Protecting Face Privacy from Malicious Personalized Text-to-Image Synthesis via Unlearnable Examples AAAI 2025

END^2: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions AAAI 2025

Inheriting Generalized Learngene for Efficient Knowledge Transfer across Multiple Tasks AAAI 2025

Fair Domain Generalization with Heterogeneous Sensitive Attributes Across Domains WACV 2025

Object-level Geometric Structure Preserving for Natural Image Stitching AAAI 2025

IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web ACL 2025