Easy as PIE? Identifying Multi-Word Expressions with LLMs

Kai Golan Hashiloni; Ofri Hefetz; Kfir Bar

2025 EMNLP EMNLP 2025

Easy as PIE? Identifying Multi-Word Expressions with LLMs

Abstract

AbstractWe investigate the identification of idiomatic expressions—a semantically non-compositional subclass of multiword expressions (MWEs)—in running text using large language models (LLMs) without any fine-tuning. Instead, we adopt a prompt-based approach and evaluate a range of prompting strategies, including zero-shot, few-shot, and chain-of-thought variants, across multiple languages, datasets, and model types. Our experiments show that, with well-crafted prompts, LLMs can perform competitively with supervised models trained on annotated data. These findings highlight the potential of prompt-based LLMs as a flexible and effective alternative for idiomatic expression identification.

❓ The Questioner

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kai Golan Hashiloni , Ofri Hefetz , Kfir Bar

Topics

Machine Learning > Learning Types > Zero-Shot Learning Natural Language Processing > Applications > Text Classification Natural Language Processing > Resources & Methods > Large Language Models Natural Language Processing > Resources & Methods > Lexical Semantics Machine Learning > Learning Types > Few-Shot Learning Deep Learning > Learning Types > In-Context Learning Natural Language Processing > Understanding > Lexical Semantics

Keywords

zero-shot learning few-shot learning in-context learning lexical semantics prompt engineering semantic analysis prompt-based learning multi-word expression idiomatic expression idiom detection large language model

Download PDF

Related papers

Bit-Flip Error Resilience in LLMs: A Comprehensive Analysis and Defense Framework 2025

VoiceCraft-X: Unifying Multilingual, Voice-Cloning Speech Synthesis and Speech Editing 2025

Model-based Large Language Model Customization as Service 2025

ZoomEye: Enhancing Multimodal LLMs with Human-Like Zooming Capabilities through Tree-Based Image Exploration 2025

SlideCoder: Layout-aware RAG-enhanced Hierarchical Slide Generation from Design 2025