Papers
Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-Shot Learning
Shivam Chandhok, Vineeth N Balasubramanian
Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation
Tianqi Tang, Xin Yu, Xuanyi Dong et al.
Improving Video Captioning With Temporal Composition of a Visual-Syntactic Embedding
Jesus Perez-Martin, Benjamin Bustos, Jorge Perez
MART: Motion-Aware Recurrent Neural Network for Robust Visual Tracking
Heng Fan, Haibin Ling
TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships
Gal S. Kenigsfield, Ran El-Yaniv
Breaking Shortcuts by Masking for Robust Visual Reasoning
Keren Ye, Mingda Zhang, Adriana Kovashka
Seeing Through Your Skin: Recognizing Objects With a Novel Visuotactile Sensor
Francois R. Hogan, Michael Jenkin, Sahand Rezaei-Shoshtari et al.
Visual Speech Enhancement Without a Real Visual Stream
Sindhu B. Hegde, K.R. Prajwal, Rudrabha Mukhopadhyay et al.
Structured Visual Search via Composition-Aware Learning
Mert Kilickaya, Arnold W.M. Smeulders
Self-Supervised Visual-LiDAR Odometry With Flip Consistency
Bin Li, Mu Hu, Shuling Wang et al.
Fusion Learning Using Semantics and Graph Convolutional Network for Visual Food Recognition
Heng Zhao, Kim-Hui Yap, Alex Chichung Kot
TracKlinic: Diagnosis of Challenge Factors in Visual Tracking
Heng Fan, Fan Yang, Peng Chu et al.
Meta Module Network for Compositional Visual Reasoning
Wenhu Chen, Zhe Gan, Linjie Li et al.
Single Image Human Proxemics Estimation for Visual Social Distancing
Maya Aghaei, Matteo Bustreo, Yiming Wang et al.
S-VVAD: Visual Voice Activity Detection by Motion Segmentation
Muhammad Shahid, Cigdem Beyan, Vittorio Murino
Deep Poisoning: Towards Robust Image Data Sharing Against Visual Disclosure
Hao Guo, Brian Dolhansky, Eric Hsin et al.
Efficient Video Annotation With Visual Interpolation and Frame Selection Guidance
Alina Kuznetsova, Aakrati Talati, Yiwen Luo et al.
From Node To Graph: Joint Reasoning on Visual-Semantic Relational Graph for Zero-Shot Detection
Hui Nie, Ruiping Wang, Xilin Chen
VCSeg: Virtual Camera Adaptation for Road Segmentation
Gong Cheng, James H. Elder
Fair Visual Recognition in Limited Data Regime Using Self-Supervision and Self-Distillation
Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri
Billion-Scale Pretraining With Vision Transformers for Multi-Task Visual Representations
Josh Beal, Hao-Yu Wu, Dong Huk Park et al.
Visualizing Paired Image Similarity in Transformer Networks
Samuel Black, Abby Stylianou, Robert Pless et al.
V-SlowFast Network for Efficient Visual Sound Separation
Lingyu Zhu, Esa Rahtu
Learned Event-Based Visual Perception for Improved Space Object Detection
Nikolaus Salvatore, Justin Fletcher
SEGA: Semantic Guided Attention on Visual Prototype for Few-Shot Learning
Fengyuan Yang, Ruiping Wang, Xilin Chen