Papers

2,653 papers found
Understanding the Visual Projection Space of Multimodal LLMs
Sungheon Jeong, Yoojeong Song, Hyungjoon Kim
2026 WACV
Zero-Shot Audio-Visual Editing via Cross-Modal Delta Denoising
Yan-Bo Lin, Kevin Lin, Zhengyuan Yang et al.
2026 WACV
VOCAL: Visual Odometry via ContrAstive Learning
Chi-Yao Huang, Zeel Bhatt, Yezhou Yang
2026 WACV
CLIP's Visual Embedding Projector is a Few-shot Cornucopia
Mohammad Fahes, Tuan-Hung Vu, Andrei Bursuc et al.
2026 WACV
2026 WACV
2026 WACV
ChartQA-X: Generating Explanations for Visual Chart Reasoning
Shamanthak Hegde, Pooyan Fazli, Hasti Seifi
2026 WACV
A Computational Approach to Visual Metonymy
Saptarshi Ghosh, Linfeng Liu, Tianyu Jiang
2026 EACL
Progressive Visual Refinement for Multi-modal Summarization
Ye Xiong, Hidetaka Kamigaito, Soichiro Murakami et al.
2026 EACL
DRIVINGVQA: A Dataset for Interleaved Visual Chain-of-Thought in Real-World Driving Scenarios
Charles Corbière, Simon Roburin, Syrielle Montariol et al.
2026 EACL
Open-World Object Counting in Videos
Niki Amini-Naieni, Andrew Zisserman
2026 AAAI
2026 AAAI
VMChill: A Dataset for Fine-Grained Visual-Musical Synergy
Xiaowei Chi, Zeyue Tian, Jialiang Chen et al.
2026 AAAI
Primary Visual Cortex Inspired Point Cloud Analysis Framework
Jisheng Dang, Delin Deng, Bimei Wang et al.
2026 AAAI
SCAN: Self-Calibrated AutoregressioN for High-Quality Visual Generation
Zhanzhou Feng, Qingpei Guo, Jingdong Chen et al.
2026 AAAI
AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection
Bin-Bin Gao, Yue Zhou, Jiangtao Yan et al.
2026 AAAI
2026 AAAI
2026 AAAI
2026 AAAI
Next Patch Prediction for AutoRegressive Visual Generation
Yatian Pang, Peng Jin, Shuo Yang et al.
2026 AAAI
2026 AAAI