INarIG: Iterative Non-autoregressive Instruct Generation Model For Word-Level Auto Completion

Hengchao Shang; Zongyao Li; Daimeng Wei; Jiaxin Guo; Minghan Wang; Xiaoyu Chen; Lizhi Lei; Hao Yang

EMNLP 2023

Abstract

Computer-aided translation (CAT) aims to improve the efficiency of human translators and remains important in scenarios where machine translation cannot meet quality requirements. One fundamental task in this field is Word-Level Auto Completion (WLAC), which predicts a target word given a source sentence, the translation context, and a human-typed character sequence. Previous works either employ word classification models to exploit contextual information from both sides of the target word, or directly disregard the dependencies on the right-side context. Furthermore, the key information, i.e. the human-typed sequence, is used only as a prefix constraint in the decoding module. In this paper, we propose the INarIG (Iterative Non-autoregressive Instruct Generation) model, which builds the human-typed sequence into an Instruction Unit and employs iterative decoding with subwords to fully utilize the input information given in the task. Our model is more competent in dealing with low-frequency words (the core scenario of this task), and achieves state-of-the-art results on the WMT22 and benchmark datasets, with a maximum increase of over 10% in prediction accuracy.
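To make the task setup concrete, the following is a minimal sketch of a WLAC input and of the prefix-constraint style of decoding that the abstract attributes to earlier work. It is not the INarIG model: the `WlacExample` class, the `complete_with_prefix` function, the example sentence, and the candidate scores are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass
class WlacExample:
    """One WLAC input (hypothetical container, not from the paper)."""
    source: str         # source sentence
    left_context: str   # translation context to the left of the target word
    right_context: str  # translation context to the right of the target word
    typed: str          # characters the translator has typed so far


def complete_with_prefix(example: WlacExample,
                         scored_candidates: Dict[str, float]) -> Optional[str]:
    """Return the best-scoring candidate that starts with the typed characters.

    The scores stand in for the output of some word prediction model
    (assumed, not specified here). The typed sequence is used only as a
    filter at decoding time, i.e. as a prefix constraint.
    """
    matching = {
        word: score
        for word, score in scored_candidates.items()
        if word.startswith(example.typed)
    }
    if not matching:
        return None
    return max(matching, key=matching.get)


example = WlacExample(
    source="Das Wetter ist heute schön.",
    left_context="The weather is",
    right_context="today.",
    typed="ni",
)
# Made-up scores for illustration only.
scores = {"nice": 0.6, "good": 0.9, "night": 0.1, "beautiful": 0.8}
print(complete_with_prefix(example, scores))
```

In this sketch the typed characters never influence the scores themselves, which is the limitation the abstract points out; the proposed model instead feeds the typed sequence into the model as an Instruction Unit.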

Topics

Natural Language Processing > Generation > Text Generation
Natural Language Processing > Applications > Machine Translation
Machine Learning > Learning Types > Supervised Learning
Deep Learning > Models > Large Language Models
Deep Learning > Learning Types > Generative Models

Keywords

machine translation, iterative decoding, non-autoregressive model, non-autoregressive generation, word prediction, instruction generation, word-level completion, computer-aided translation, auto completion, word-level auto completion
