2025
EMNLP
EMNLP 2025
Multimodal Neural Machine Translation: A Survey of the State of the Art
Abstract
AbstractMultimodal neural machine translation (MNMT) has received increasing attention due to its widespread applications in various fields such as cross-border e-commerce and cross-border social media platforms. The task aims to integrate other modalities, such as the visual modality, with textual data to enhance translation performance. We survey the major milestones in MNMT research, providing a comprehensive overview of relevant datasets and recent methodologies, and discussing key challenges and promising research directions.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Computer Vision and Deep Learning and Natural Language Processing
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Multimodal Learning
Natural Language Processing > Applications > Machine Translation
Natural Language Processing > Generation > Machine Translation
Computer Vision > Core AI > Multimodal Learning
Deep Learning > Learning Types > Multi-Modal Learning
Artificial Intelligence > Core AI > Multi-Modal Learning