Disfluency Correction using Unsupervised and Semi-supervised Learning

Nikhil Saini; Drumil Trivedi; Shreya Khare; Tejas Dhamecha; Preethi Jyothi; Samarth Bharadwaj; Pushpak Bhattacharyya

2021 EACL EACL 2021

Disfluency Correction using Unsupervised and Semi-supervised Learning

Abstract

AbstractSpoken language is different from the written language in its style and structure. Disfluencies that appear in transcriptions from speech recognition systems generally hamper the performance of downstream NLP tasks. Thus, a disfluency correction system that converts disfluent to fluent text is of great value. This paper introduces a disfluency correction model that translates disfluent to fluent text by drawing inspiration from recent encoder-decoder unsupervised style-transfer models for text. We also show considerable benefits in performance when utilizing a small sample of 500 parallel disfluent-fluent sentences in a semi-supervised way. Our unsupervised approach achieves a BLEU score of 79.39 on the Switchboard corpus test set, with further improvement to a BLEU score of 85.28 with semi-supervision. Both are comparable to two competitive fully-supervised models.

🌉 Interdisciplinary Bridge — Machine Learning and Speech & Audio

🧭 Keyword Pioneer — disfluency correction

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Nikhil Saini , Drumil Trivedi , Shreya Khare , Tejas Dhamecha , Preethi Jyothi , Samarth Bharadwaj , Pushpak Bhattacharyya

Topics

Machine Learning > Learning Types > Semi-Supervised Learning Machine Learning > Learning Types > Unsupervised Learning Speech & Audio > Analysis > Clinical Speech Analysis

Keywords

unsupervised learning semi-supervised learning style transfer encoder-decoder model disfluency correction

Download PDF

Related papers

Joint Coreference Resolution and Character Linking for Multiparty Conversation 2021

Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering 2021

Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO 2021

Representations for Question Answering from Documents with Tables and Text 2021

Gender and Racial Fairness in Depression Research using Social Media 2021