Papers
2,899 papers found
StreamKV: Streaming Video Question-Answering with Segment-based KV Cache Retrieval and Compression
Yilong Chen, Xiang Bai, Zhibin Wang et al.
Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Yu Fei, Quan Deng, Shengeng Tang et al.
APVR: Hour-Level Long Video Understanding with Adaptive Pivot Visual Information Retrieval
Hong Gao, Yiming Bao, Xuezhen Tu et al.
Adaptive Evidential Learning for Temporal-Semantic Robustness in Moment Retrieval
Haojian Huang, Kaijing Ma, Jin Chen et al.
GranAlign: Granularity-Aware Alignment Framework for Zero-shot Video Moment Retrieval
Mingyu Jeon, Sunjae Yoon, Jonghee Kim et al.
Fine-Grained Image Retrieval via Dual-Vision Adaptation
Xin Jiang, Meiqi Cao, Hao Tang et al.
Dual-Teacher Interactive Knowledge Distillation Network for Text-to-Visible & Infrared Person Retrieval
Chenglong Li, Zhengyu Chen, Yifei Deng et al.
Modality and Task Adaptation for Enhanced Zero-shot Composed Image Retrieval
Haiwen Li, Delong Liu, Zhaohui Hou et al.
SSR-SAM: Retrieval-Style Segment Anything Model for Semi-Supervised Ultra-High-Resolution Image Segmentation
Shijie Li, Yiming Chen, Zhineng Chen et al.
Hyperbolic Hierarchical Alignment Reasoning Network for Text-3D Retrieval
Wenrui Li, Yidan Lu, Yeyu Chai et al.
RegionRAG: Region-level Retrieval-Augmented Generation for Visual Document Understanding
Yinglu Li, Zhiying Lu, Zhihang Liu et al.
HABIT: Chrono-Synergia Robust Progressive Learning Framework for Composed Image Retrieval
Zixu Li, Yupeng Hu, Zhiwei Chen et al.
Object-Centric Framework for Video Moment Retrieval
Zongyao Li, Yongkang Wong, Satoshi Yamazaki et al.
Discretization Is Not Always Better: Rethinking Deep Quantization for Asymmetric Image Retrieval
Xinze Liu, Dayan Wu, Hengjie Zhu et al.
Unlearning in Cross-Modal Retrieval via Prior-Prototype Guided Partitioned Dampening
Yi Lu, Shu Li, Yurong Qian
Appearance-Motion Decomposed Alignment for Text-Video Retrieval
Meng Meng, Zichang Tan, Yong Zhang et al.
PMPGuard: Catching Pseudo-Matched Pairs in Remote Sensing Image–Text Retrieval
Pengxiang Ouyang, Qing Ma, Zheng Wang et al.
Organ-Aware Routing Mixture-of-Retrieval Augmented Generation for Fetal Ultrasound Reporting
Bin Pu, Siyu Wang, Rongbin Li et al.
WaveC2R: Wavelet-Driven Coarse-to-Refined Hierarchical Learning for Radar Retrieval
Chunlei Shi, Han Xu, Yinghao Li et al.
Meta-Guided Sample Reweighting for Robust Cross-Modal Hashing Retrieval with Noisy Labels
Ziang Tan, Weitao An, Erkun Yang
Heterogeneous Uncertainty-Guided Composed Image Retrieval with Fine-Grained Probabilistic Learning
Haomiao Tang, Jinpeng Wang, Minyi Zhao et al.
Manipulation Intention Understanding for Zero-Shot Composed Image Retrieval
Yuanmin Tang, Jing Yu, Keke Gai et al.
DreamRunner: Fine-Grained Compositional Story-to-Video Generation with Retrieval-Augmented Motion Adaptation
Zun Wang, Jialu Li, Han Lin et al.
Retrieval-driven Reasoning for Deliberative Visual Classification
Jianye Xie, Lianyong Qi, Fan Wang et al.