CUNI System for the WMT18 Multimodal Translation Task

Jindřich Helcl; Jindřich Libovický; Dušan Variš

2018 EMNLP EMNLP 2018

CUNI System for the WMT18 Multimodal Translation Task

Abstract

AbstractWe present our submission to the WMT18 Multimodal Translation Task. The main feature of our submission is applying a self-attentive network instead of a recurrent neural network. We evaluate two methods of incorporating the visual features in the model: first, we include the image representation as another input to the network; second, we train the model to predict the visual features and use it as an auxiliary objective. For our submission, we acquired both textual and multimodal additional data. Both of the proposed methods yield significant improvements over recurrent networks and self-attentive textual baselines.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — self attention

🐣 Hot Topic Early Bird — visual feature

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Jindřich Helcl , Jindřich Libovický , Dušan Variš

Topics

Artificial Intelligence > Core AI > Multimodal Learning Machine Learning > Application Areas > Domain Adaptation Deep Learning > Architectures > Transformers Deep Learning > Models > Generative Models Natural Language Processing > Applications > Machine Translation Deep Learning > Models > Transformers Deep Learning > Learning Types > Multi-Modal Learning

Keywords

neural machine translation image representation visual feature self attention multimodal translation auxiliary objective neural network self-attentive network

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018