Papers
3,673 papers found
Annotation-Free Audio-Visual Segmentation
Jinxiang Liu, Yu Wang, Chen Ju et al.
Fixed Pattern Noise Removal for Multi-View Single-Sensor Infrared Camera
Arnaud Barral, Pablo Arias, Axel Davy
Neural Image Compression Using Masked Sparse Visual Representation
Wei Jiang, Wei Wang, Yue Chen
MVAD: A Multiple Visual Artifact Detector for Video Streaming
Chen Feng, Duolikun Danier, Fan Zhang et al.
Data-Efficient 3D Visual Grounding via Order-Aware Referring
Tung-Yu Wu, Sheng-Yu Huang, Yu-Chiang Frank Wang
Fine-Grained Spatial and Verbal Losses for 3D Visual Grounding
Sombit Dey, Ozan Unal, Christos Sakaridis et al.
VHS: High-Resolution Iterative Stereo Matching with Visual Hull Priors
Markus Plack, Hannah Dröge, Leif Van Holland et al.
Temporally Streaming Audio-Visual Synchronization for Real-World Videos
Jordan G Voas, Wei-Cheng Tseng, Layne Berry et al.
Optimizing Dense Visual Predictions Through Multi-Task Coherence and Prioritization
Maxime Fontana, Michael Spratling, Miaojing Shi
SensorFlow: Sensor and Image Fused Video Stabilization
Jiyang Yu, Tianhao Zhang, Fuhao Shi et al.
When Visual State Space Model Meets Backdoor Attacks
Sankalp Nagaonkar, Achyut Mani Tripathi, Ashish Mishra
PTQ4VM: Post-Training Quantization for Visual Mamba
Younghyun Cho, Changhun Lee, Seonggon Kim et al.
WiGNet: Windowed Vision Graph Neural Network
Gabriele Spadaro, Marco Grangetto, Attilio Fiandrotti et al.
From Visual Explanations to Counterfactual Explanations with Latent Diffusion
Tung Luu, Nam Le, Duc Le et al.
Scene-LLM: Extending Language Model for 3D Visual Reasoning
Rao Fu, Jingyu Liu, Xilun Chen et al.
CusConcept: Customized Visual Concept Decomposition with Diffusion Models
Zhi Xu, Shaozhe Hao, Kai Han
SUM: Saliency Unification through Mamba for Visual Attention Modeling
Alireza Hosseini, Amirhossein Kazerouni, Saeed Akhavan et al.
Make VLM Recognize Visual Hallucination on Cartoon Character Image with Pose Information
Bumsoo Kim, Wonseop Shin, Kyuchul Lee et al.
Adaptive Deviation Learning for Visual Anomaly Detection with Data Contamination
Anindya Sundar Das, Guansong Pang, Monowar Bhuyan
Dance Any Beat: Blending Beats with Visuals in Dance Video Generation
Xuanchen Wang, Heng Wang, Dongnan Liu et al.
Enhancing Visual Classification using Comparative Descriptors
Hankyeol Lee, Gawon Seo, Wonseok Choi et al.
Dense Depth from Event Focal Stack
Kenta Horikawa, Mariko Isogawa, Hideo Saito et al.
VG-SSL: Benchmarking Self-Supervised Representation Learning Approaches for Visual Geo-Localization
Jiuhong Xiao, Gao Zhu, Giuseppe Loianno
BIV-Priv-Seg: Locating Private Content in Images Taken by People with Visual Impairments
Yu-Yun Tseng, Tanusree Sharma, Lotus Zhang et al.
Semantic Clustering of Image Retrieval Databases used for Visual Localization
Henry Hölzemann, Torsten Fiolka