SUT at SemEval-2023 Task 1: Prompt Generation for Visual Word Sense Disambiguation

Omid Ghahroodi; Seyed Arshan Dalili; Sahel Mesforoush; Ehsaneddin Asgari

2023 SEMEVAL SemEval 2023

SUT at SemEval-2023 Task 1: Prompt Generation for Visual Word Sense Disambiguation

Abstract

AbstractVisual Word Sense Disambiguation (V-WSD) identifies the correct visual sense of a multi-sense word in a specific context. This can be challenging as images may need to provide additional context and words may have multiple senses. A proper V-WSD system can benefit applications like image retrieval and captioning. This paper proposes a Prompt Generation approach to solve this challenge. This approach improves the robustness of language-image models like CLIP to contextual ambiguities and helps them better correlate between textual and visual contexts of different senses of words.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Computer Vision

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy

Authors

Omid Ghahroodi , Seyed Arshan Dalili , Sahel Mesforoush , Ehsaneddin Asgari

Topics

Artificial Intelligence > Core AI > Multimodal Learning Computer Vision > Generation > Image Generation Artificial Intelligence > Learning Paradigms > Zero-Shot Learning

Keywords

prompt generation clip model language-image model visual word sense disambiguation contextual ambiguity

Download PDF

Related papers

Coco at SemEval-2023 Task 10: Explainable Detection of Online Sexism 2023

ZBL2W at SemEval-2023 Task 9: A Multilingual Fine-tuning Model with Data Augmentation for Tweet Intimacy Analysis 2023

MLModeler5 at SemEval-2023 Task 3: Detecting the Category and the Framing Techniques in Online News in a Multi-lingual Setup 2023

OPI at SemEval-2023 Task 9: A Simple But Effective Approach to Multilingual Tweet Intimacy Analysis 2023

NLP-LISAC at SemEval-2023 Task 12: Sentiment Analysis for Tweets expressed in African languages via Transformer-based Models 2023