2025
ACL
ACL 2025
A Couch Potato is not a Potato on a Couch: Prompting Strategies, Image Generation, and Compositionality Prediction for Noun Compounds
Abstract
AbstractWe explore the role of the visual modality and of vision transformers in predicting the compositionality of English noun compounds. Crucially, we contribute a framework to address the challenge of obtaining adequate images that represent non-compositional compounds (such as “couch potato”), making it relevant for any image-based approach targeting figurative language. Our method uses prompting strategies and diffusion models to generate images. Comparing and combining our approach with a state-of-the-art text-based approach reveals complementary contributions regarding features as well as degrees of abstractness in compounds.
🌉
Interdisciplinary Bridge
— Computer Vision and Machine Learning
🧭
Keyword Pioneer
— figurative language
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio
Topics
Machine Learning > Learning Types > Self-Supervised Learning
Computer Vision > Generation > Image Generation
Natural Language Processing > Understanding > Semantic Analysis
Interdisciplinary > Linguistics > Computational Linguistics
Computer Vision > Core AI > Multimodal Learning
Deep Learning > Learning Types > Multi-Modal Learning