Towards Bidirectional Hierarchical Representations for Attention-based Neural Machine Translation

Baosong Yang; Derek F. Wong; Tong Xiao; Lidia S. Chao; Jingbo Zhu

2017 EMNLP EMNLP 2017

Towards Bidirectional Hierarchical Representations for Attention-based Neural Machine Translation

Abstract

AbstractThis paper proposes a hierarchical attentional neural translation model which focuses on enhancing source-side hierarchical representations by covering both local and global semantic information using a bidirectional tree-based encoder. To maximize the predictive likelihood of target words, a weighted variant of an attention mechanism is used to balance the attentive information between lexical and phrase vectors. Using a tree-based rare word encoding, the proposed model is extended to sub-word level to alleviate the out-of-vocabulary (OOV) problem. Empirical results reveal that the proposed model significantly outperforms sequence-to-sequence attention-based and tree-based neural translation models in English-Chinese translation tasks.

🌉 Interdisciplinary Bridge — Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — sub-word encoding

🐣 Hot Topic Early Bird — hierarchical attention

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Baosong Yang , Derek F. Wong , Tong Xiao , Lidia S. Chao , Jingbo Zhu

Topics

Deep Learning > Architectures > Neural Networks Natural Language Processing > Applications > Machine Translation Natural Language Processing > Generation > Machine Translation Deep Learning > Techniques > Attention

Keywords

attention mechanism neural machine translation hierarchical attention out-of-vocabulary problem sub-word encoding bidirectional representation tree-based encoder

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017