Adapting Translation Models for Transcript Disfluency Detection

Qianqian Dong; Feng Wang; Zhen Yang; Wei Chen; Shuang Xu; Bo Xu

2019 AAAI AAAI 2019

Adapting Translation Models for Transcript Disfluency Detection

Abstract

Abstract Transcript disfluency detection (TDD) is an important component of the real-time speech translation system, which arouses more and more interests in recent years. This paper presents our study on adapting neural machine translation (NMT) models for TDD. We propose a general training framework for adapting NMT models to TDD task rapidly. In this framework, the main structure of the model is implemented similar to the NMT model. Additionally, several extended modules and training techniques which are independent of the NMT model are proposed to improve the performance, such as the constrained decoding, denoising autoencoder initialization and a TDD-specific training object. With the proposed training framework, we achieve significant improvement. However, it is too slow in decoding to be practical. To build a feasible and production-ready solution for TDD, we propose a fast non-autoregressive TDD model following the non-autoregressive NMT model emerged recently. Even we do not assume the specific architecture of the NMT model, we build our TDD model on the basis of Transformer, which is the state-of-the-art NMT model. We conduct extensive experiments on the publicly available set, Switchboard, and in-house Chinese set. Experimental results show that the proposed model significantly outperforms previous state-ofthe-art models.

🚀 Conference Pioneer — AAAI 2019

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing and Speech & Audio

📈 Trend Setter — Speech Enhancement

🧭 Keyword Pioneer — transcript disfluency detection

🐣 Hot Topic Early Bird — non-autoregressive model

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Qianqian Dong , Feng Wang , Zhen Yang , Wei Chen , Shuang Xu , Bo Xu

Topics

Machine Learning > Application Areas > Domain Adaptation Deep Learning > Architectures > Transformers Natural Language Processing > Applications > Machine Translation Speech & Audio > Processing > Speech Enhancement

Keywords

neural machine translation speech processing denoising autoencoder non-autoregressive translation non-autoregressive model transcript disfluency detection

Download PDF

Related papers

Cooperative Multimodal Approach to Depression Detection in Twitter 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 2019

Community Detection in Social Networks Considering Topic Correlations 2019

Session-Based Recommendation with Graph Neural Networks 2019

Blameworthiness in Multi-Agent Settings 2019