Papers
4,428 papers found
Alignment and Distillation: A Robust Framework for Multimodal Domain Generalizable Human Action Recognition
Hyeonbin Ji, Juyeob Lee, Eunil Park
Align Video Diffusion Model with Online Video-Centric Preference Optimization
Jiacheng Zhang, Jie Wu, Weifeng Chen et al.
A Little More Like This: Text-to-Image Retrieval with Vision-Language Models Using Relevance Feedback
Bulat Khaertdinov, Mirela Popa, Nava Tintarev
A Multi-Agent Diffusion Approach for MRI Anomaly Segmentation via Modality-Specific LoRA Specialization
Wafa Al Ghallabi, Muhammad Zaigham Zaheer, Ritesh Thawkar et al.
Analysis of Text Accuracy and Visual Alignment in Vision-Language Models for Artistic Text Generation
Fatima Alderazi, Motaz Alfarraj
Anatomically-guided Masked Autoencoder Pre-training for Aneurysm Detection
Alberto M. Ceballos Arroyo, Jisoo Kim, Chu-Hsuan Lin et al.
Anatomy-VLM: A Fine-grained Vision-Language Model for Medical Interpretation
Difei Gu, Yunhe Gao, Mu Zhou et al.
An Efficient Multi-Rater Setup Towards Personalized and Diversified Medical Image Segmentation
Sajed Almorsy, Ayman Khalafallah, Marwan Torki
An improved architecture for part-based animal re-identification through semantic segmentation distillation
EugĂȘnio Dias Ribeiro Neto, Marc Chaumont, GĂ©rard Subsol et al.
A Novel Metric for Detecting Memorization in Generative Models for Brain MRI Synthesis
Antonio Scardace, Lemuel Puglisi, Francesco Guarnera et al.
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM
Sunghyun Ahn, Youngwan Jo, Kijung Lee et al.
AnyBald: Toward Realistic Diffusion-Based Hair Removal In-The-Wild
Yongjun Choi, Seungoh Han, Soomin Kim et al.
Any Detector Can Detect Anything
Thomas E. Huang, Siyuan Li, Martin Danelljan et al.
AortaDiff: A Unified Multitask Diffusion Framework for Contrast-Free AAA Imaging
Yuxuan Ou, Ning Bi, Jiazhen Pan et al.
ArchitectHead: Continuous Level of Detail Control for 3D Gaussian Head Avatars
Peizhi Yan, Rabab Ward, Qiang Tang et al.
Are All Marine Species Created Equal? Performance Disparities in Underwater Object Detection
Melanie Wille, Tobias Fischer, Scarlett Raine
ART-ASyn: Anatomy-aware Realistic Texture-based Anomaly Synthesis Framework for Chest X-Rays
Qinyi Cao, Jianan Fan, Weidong Cai
ASC: Learning Augmentation Severity-Consistent Representations Improves Generalization via Augmentation Search
Amirhossein Alamdar, Hossein Jafarinia, Mahdi Noori et al.
ATM: Enhanced Alignment for Text-to-Motion Generation
Ke Han, Yueming Lyu, Weichen Yu et al.
AugMapNet: Improving Spatial Latent Structure via BEV Grid Augmentation for Enhanced Vectorized Online HD Map Construction
Thomas Monninger, Md Zafar Anwar, Stanislaw Antol et al.
Augmenting with NeRFs: Fast Relocalization on Densified Datasets
Michael Tomadakis, Rebecca Borissova, Yuxuan Zhang et al.
A Unified Diffusion-Based Framework for Multi-Agent Trajectory Prediction Integrating Structured Multi-Modal Representations
Chenxi Yang, Suyang Xi, Hong Ding et al.
A Universal Self-Attention Enhancement for Bridging Low-bit Quantization and Vision Transformers
Jiahe Qian, Peisong Wang, Zhengyang Zhuge et al.
AusSmoke meets MultiNatSmoke: a fully-labelled diverse smoke segmentation dataset
Weihao Li, Hongjin Zhao, Gao Zhu et al.