Papers
2,653 papers found
PoliTo at SemEval-2023 Task 1: CLIP-based Visual-Word Sense Disambiguation Based on Back-Translation
Lorenzo Vaiani, Luca Cagliero, Paolo Garza
SLT at SemEval-2023 Task 1: Enhancing Visual Word Sense Disambiguation through Image Text Retrieval using BLIP
Mohammadreza Molavi, Hossein Zeinali
Ebhaam at SemEval-2023 Task 1: A CLIP-Based Approach for Comparing Cross-modality and Unimodality in Visual Word Sense Disambiguation
Zeinab Taghavi, Parsa Haghighi Naeini, Mohammad Ali Sadraei Javaheri et al.
UAlberta at SemEval-2023 Task 1: Context Augmentation and Translation for Multilingual Visual Word Sense Disambiguation
Michael Ogezi, Bradley Hauer, Talgat Omarov et al.
SUT at SemEval-2023 Task 1: Prompt Generation for Visual Word Sense Disambiguation
Omid Ghahroodi, Seyed Arshan Dalili, Sahel Mesforoush et al.
SemEval-2023 Task 1: Visual Word Sense Disambiguation
Alessandro Raganato, Iacer Calixto, Asahi Ushio et al.
ML Mob at SemEval-2023 Task 1: Probing CLIP on Visual Word-Sense Disambiguation
Clifton Poth, Martin Hentschel, Tobias Werner et al.
Enabling Unsupervised Neural Machine Translation with Word-level Visual Representations
Chengpeng Fu, Xiaocheng Feng, Yichong Huang et al.
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis
Hengshun Zhou, Jun Du, Gongzhen Zou et al.
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting
Haotian Wang, Jun Du, Hengshun Zhou et al.
ML Mob at SemEval-2023 Task 1: Probing CLIP on Visual Word-Sense Disambiguation
Clifton Poth, Martin Hentschel, Tobias Werner et al.
Obtaining referential word meanings from visual and distributional information: Experiments on object naming
Sina Zarrieß, David Schlangen
ViCo: Word Embeddings From Visual Co-Occurrences
Tanmay Gupta, Alexander Schwing, Derek Hoiem
VCWE: Visual Character-Enhanced Word Embeddings
Chi Sun, Xipeng Qiu, Xuanjing Huang
Visual Grounding Helps Learn Word Meanings in Low-Data Regimes
Chengxu Zhuang, Evelina Fedorenko, Jacob Andreas
Sub-Word Level Lip Reading With Visual Attention
K R Prajwal, Triantafyllos Afouras, Andrew Zisserman
Bridging by Word: Image Grounded Vocabulary Construction for Visual Captioning
Zhihao Fan, Zhongyu Wei, Siyuan Wang et al.
Visual Grounding in Video for Unsupervised Word Translation
Gunnar A. Sigurdsson, Jean-Baptiste Alayrac, Aida Nematzadeh et al.
Learning Word-Like Units from Joint Audio-Visual Analysis
David Harwath, James Glass
Visual Grounding of Inter-lingual Word-Embeddings
Wafaa Mohammed, Hassan Shahmohammadi, Hendrik P. A. Lensch et al.
Visual Word2Vec (vis-w2v): Learning Visually Grounded Word Embeddings Using Abstract Scenes
Satwik Kottur, Ramakrishna Vedantam, Jose M. F. Moura et al.
Graph-Based Visual Saliency
Jonathan Harel, Christof Koch, Pietro Perona
A Nonparametric Approach to Bottom-Up Visual Saliency
Wolf Kienzle, Felix A. Wichmann, Matthias O. Franz et al.
Sparse deep belief net model for visual area V2
Honglak Lee, Chaitanya Ekanadham, Andrew Y. Ng