Research Explorer
Papers
Conferences
Authors
Topics
Keywords
Trends
Achievements
Explore
← Applications
Computer Vision
›
Applications
›
Computer Vision
329 directly classified papers
Papers per year
2006: 3
2007: 2
2010: 3
2011: 5
2012: 2
2013: 9
2014: 9
2015: 16
2016: 14
2017: 9
2018: 16
2019: 29
2020: 30
2021: 33
2022: 36
2023: 30
2024: 49
2025: 34
Papers
Evaluation Agent: Efficient and Promptable Evaluation Framework for Visual Generative Models
ACL 2025
MICE: Mixture of Image Captioning Experts Augmented e-Commerce Product Attribute Value Extraction
ACL 2025
TinySAM: Pushing the Envelope for Efficient Segment Anything Model
AAAI 2025
What Is That Talk About? A Video-to-Text Summarization Dataset for Scientific Presentations
ACL 2025
CoachMe: Decoding Sport Elements with a Reference-Based Coaching Instruction Generation Model
ACL 2025
WinSpot: GUI Grounding Benchmark with Multimodal Large Language Models
ACL 2025
ReMask-Animate: Refined Character Image Animation Using Mask-Guided Adapters
AAAI 2025
QuARF: Quality-Adaptive Receptive Fields for Degraded Image Perception
AAAI 2025
Noisy Correspondence Rectification via Asymmetric Similarity Learning
AAAI 2025
Risk Controlled Image Retrieval
AAAI 2025
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia
ACL 2025
SHuBERT: Self-Supervised Sign Language Representation Learning via Multi-Stream Cluster Prediction
ACL 2025
Dual-view X-ray Detection: Can AI Detect Prohibited Items from Dual-view X-ray Images like Humans?
CVPR 2025
IMOL: Incomplete-Modality-Tolerant Learning for Multi-Domain Fake News Video Detection
ACL 2025
Leveraging Asynchronous Spiking Neural Networks for Ultra Efficient Event-Based Visual Processing
AAAI 2025
VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering
AAAI 2025
FreeCap: Hybrid Calibration-Free Motion Capture in Open Environments
AAAI 2025
Neural Assembler: Learning to Generate Fine-Grained Robotic Assembly Instructions from Multi-View Images
AAAI 2025
RapidNet: Multi-Level Dilated Convolution Based Mobile Backbone
WACV 2025
MYOPIA: Protecting Face Privacy from Malicious Personalized Text-to-Image Synthesis via Unlearnable Examples
AAAI 2025
END^2: Robust Dual-Decoder Watermarking Framework Against Non-Differentiable Distortions
AAAI 2025
Inheriting Generalized Learngene for Efficient Knowledge Transfer across Multiple Tasks
AAAI 2025
Fair Domain Generalization with Heterogeneous Sensitive Attributes Across Domains
WACV 2025
Object-level Geometric Structure Preserving for Natural Image Stitching
AAAI 2025
IW-Bench: Evaluating Large Multimodal Models for Converting Image-to-Web
ACL 2025
<
1
2
3
4
5
…
14
>