Computer Vision
329 directly classified papers
Papers per year
2006: 3
2007: 2
2010: 3
2011: 5
2012: 2
2013: 9
2014: 9
2015: 16
2016: 14
2017: 9
2018: 16
2019: 29
2020: 30
2021: 33
2022: 36
2023: 30
2024: 49
2025: 34
Papers
Re-assembling the past: The RePAIR dataset and benchmark for real world 2D and 3D puzzle solving
NIPS 2024
Knowledge Guided Semi-supervised Learning for Quality Assessment of User Generated Videos
AAAI 2024
MuST: Robust Image Watermarking for Multi-Source Tracing
AAAI 2024
Image as a Language: Revisiting Scene Text Recognition via Balanced, Unified and Synchronized Vision-Language Reasoning Network
AAAI 2024
Web-Scale Visual Entity Recognition: An LLM-Driven Data Approach
NIPS 2024
BirdCollect: A Comprehensive Benchmark for Analyzing Dense Bird Flock Attributes
AAAI 2024
Cloud Object Detector Adaptation by Integrating Different Source Knowledge
NIPS 2024
Post-trained Convolution Networks for Single Image Super-resolution (Abstract Reprint)
AAAI 2024
Learnability Matters: Active Learning for Video Captioning
NIPS 2024
MineObserver 2.0: A Deep Learning & In-Game Framework for Assessing Natural Language Descriptions of Minecraft Imagery
AAAI 2024
Gaze-Based Interaction Adaptation for People with Involuntary Head Movements (Student Abstract)
AAAI 2024
Virtual Try-On: Real-Time Interactive Hybrid Network with High-Fidelity
AAAI 2024
Mitigating Open-Vocabulary Caption Hallucinations
EMNLP 2024
Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding
EMNLP 2024
Gloss2Text: Sign Language Gloss translation using LLMs and Semantically Aware Label Smoothing
EMNLP 2024
G3: An Effective and Adaptive Framework for Worldwide Geolocalization Using Large Multi-Modality Models
NIPS 2024
CuReD: Deep Learning Optical Character Recognition for Cuneiform Text Editions and Legacy Materials
ACL 2024
Efficient Scene Recovery Using Luminous Flux Prior
CVPR 2024
Language-only Training of Zero-shot Composed Image Retrieval
CVPR 2024
SeeSR: Towards Semantics-Aware Real-World Image Super-Resolution
CVPR 2024
Fully Geometric Panoramic Localization
CVPR 2024
VideoGUI: A Benchmark for GUI Automation from Instructional Videos
NIPS 2024
Unleashing Unlabeled Data: A Paradigm for Cross-View Geo-Localization
CVPR 2024
ODGS: 3D Scene Reconstruction from Omnidirectional Images with 3D Gaussian Splattings
NIPS 2024
DDR: Exploiting Deep Degradation Response as Flexible Image Descriptor
NIPS 2024