Speculative Beam Search for Simultaneous Translation

Renjie Zheng; Mingbo Ma; Baigong Zheng; Liang Huang

2019 IJCNLP IJCNLP 2019

Speculative Beam Search for Simultaneous Translation

Abstract

AbstractBeam search is universally used in (full-sentence) machine translation but its application to simultaneous translation remains highly non-trivial, where output words are committed on the fly. In particular, the recently proposed wait-k policy (Ma et al., 2018) is a simple and effective method that (after an initial wait) commits one output word on receiving each input word, making beam search seemingly inapplicable. To address this challenge, we propose a new speculative beam search algorithm that hallucinates several steps into the future in order to reach a more accurate decision by implicitly benefiting from a target language model. This idea makes beam search applicable for the first time to the generation of a single word in each step. Experiments over diverse language pairs show large improvement compared to previous work.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — output word

🐣 Hot Topic Early Bird — simultaneous translation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Deep Learning, Interdisciplinary, Machine Learning, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Renjie Zheng , Mingbo Ma , Baigong Zheng , Liang Huang

Topics

Machine Learning > Optimization & Theory > Optimization Natural Language Processing > Applications > Machine Translation

Keywords

simultaneous translation speculative beam search wait-k policy target language model output word

Download PDF

Related papers

Fine-grained Knowledge Fusion for Sequence Labeling Domain Adaptation 2019

Exploiting Monolingual Data at Scale for Neural Machine Translation 2019

Distributionally Robust Language Modeling 2019

Unsupervised Domain Adaptation of Contextualized Embeddings for Sequence Labeling 2019

ARAML: A Stable Adversarial Training Framework for Text Generation 2019