Research Explorer
Core AI
63 directly classified papers
Subtopics
Computer Vision (1562)
Multimodal Learning (1257)
Efficient Computing (179)
Interpretability (74)
Foundation Models (35)
Multi-Modal Learning (29)
Papers per year
2007: 2
2009: 1
2010: 4
2011: 3
2012: 1
2013: 2
2014: 3
2016: 2
2020: 2
2021: 1
2023: 10
2024: 21
2025: 11
Papers
Inference-Scale Complexity in ANN-SNN Conversion for High-Performance and Low-Power Applications
CVPR 2025
MambaOut: Do We Really Need Mamba for Vision?
CVPR 2025
Building Vision Models upon Heat Conduction
CVPR 2025
EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality
CVPR 2025
Neural LightRig: Unlocking Accurate Object Normal and Material Estimation with Multi-Light Diffusion
CVPR 2025
EdgeDiff: Edge-aware Diffusion Network for Building Reconstruction from Point Clouds
CVPR 2025
Associative Transformer
CVPR 2025
MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining
CVPR 2025
Closest Neighbors are Harmful for Lightweight Masked Auto-encoders
CVPR 2025
CTRL-O: Language-Controllable Object-Centric Visual Representation Learning
CVPR 2025
DA-VPT: Semantic-Guided Visual Prompt Tuning for Vision Transformers
CVPR 2025
DIMAT: Decentralized Iterative Merging-And-Training for Deep Learning Models
CVPR 2024
Groupwise Query Specialization and Quality-Aware Multi-Assignment for Transformer-based Visual Relationship Detection
CVPR 2024
BioCLIP: A Vision Foundation Model for the Tree of Life
CVPR 2024
Connecting Joint-Embedding Predictive Architecture with Contrastive Self-supervised Learning
NeurIPS 2024
You Only Need Less Attention at Each Stage in Vision Transformers
CVPR 2024
Mean-Shift Feature Transformer
CVPR 2024
Neural Clustering based Visual Representation Learning
CVPR 2024
LUWA Dataset: Learning Lithic Use-Wear Analysis on Microscopic Images
CVPR 2024
FakeInversion: Learning to Detect Images from Unseen Text-to-Image Models by Inverting Stable Diffusion
CVPR 2024
From Activation to Initialization: Scaling Insights for Optimizing Neural Fields
CVPR 2024
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks
CVPR 2024
Viewpoint-Aware Visual Grounding in 3D Scenes
CVPR 2024
Learning Vision from Models Rivals Learning Vision from Data
CVPR 2024
When does perceptual alignment benefit vision representations?
NeurIPS 2024