Sub-Band Knowledge Distillation Framework for Speech Enhancement

Xiang Hao; Shixue Wen; Xiangdong Su; Yun Liu; Guanglai Gao; Xiaofei Li

INTERSPEECH 2020

Abstract

In single-channel speech enhancement, methods based on full-band spectral features have been widely studied, while only a few methods pay attention to non-full-band spectral features. In this paper, we explore a knowledge distillation framework based on sub-band spectral mapping for single-channel speech enhancement. First, we divide the full frequency band into multiple sub-bands and pre-train an elite-level sub-band enhancement model (teacher model) for each sub-band. Each teacher model is dedicated to processing its own sub-band. Next, under the teacher models’ guidance, we train a general sub-band enhancement model (student model) that works for all sub-bands. The student model’s performance improves without any increase in the number of model parameters or in computational complexity. To evaluate the proposed method, we conducted extensive experiments on an open-source data set. The final experimental results show that the guidance from the elite-level teacher models dramatically improves the student model’s performance, which exceeds that of the full-band model while employing fewer parameters.
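The training scheme outlined in the abstract can be illustrated with a minimal sketch. The abstract does not give the band layout, the network architectures, or the loss, so everything here is an assumption made for illustration: equal-width contiguous sub-bands, a mean-squared-error objective, a hypothetical weighting factor `alpha`, and scalar gains standing in for the teacher and student networks.

```python
import numpy as np

def split_into_sub_bands(spec, num_bands):
    """Split a (freq_bins, frames) spectrogram into contiguous sub-bands.

    Equal-width contiguous bands are an assumption of this sketch.
    """
    return np.array_split(spec, num_bands, axis=0)

def distillation_loss(student_out, teacher_out, clean_target, alpha=0.5):
    """Weighted sum of two MSE terms (hypothetical form, not from the paper).

    (1 - alpha) weights the distance to the clean target;
    alpha weights the distance to the teacher's output.
    """
    target_term = np.mean((student_out - clean_target) ** 2)
    teacher_term = np.mean((student_out - teacher_out) ** 2)
    return (1.0 - alpha) * target_term + alpha * teacher_term

# Toy magnitude spectrograms: 256 frequency bins, 10 frames.
rng = np.random.default_rng(0)
noisy = rng.random((256, 10))
clean = rng.random((256, 10))

num_bands = 4
noisy_bands = split_into_sub_bands(noisy, num_bands)
clean_bands = split_into_sub_bands(clean, num_bands)

# Stand-ins for trained networks: one dedicated teacher per sub-band,
# and a single student shared by all sub-bands.
teachers = [lambda x, g=g: g * x for g in (0.9, 0.8, 0.7, 0.6)]
student = lambda x: 0.75 * x

# The shared student is guided by the teacher that owns each sub-band.
total_loss = sum(
    distillation_loss(student(band), teacher(band), target)
    for band, teacher, target in zip(noisy_bands, teachers, clean_bands)
)
print(f"total distillation loss over {num_bands} sub-bands: {total_loss:.4f}")
```

The point of the sketch is the structure: the teachers are used only to compute the training loss, so at inference time only the single student runs, which is why the guidance adds no parameters or computation to the deployed model.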

Authors

Xiang Hao; Shixue Wen; Xiangdong Su; Yun Liu; Guanglai Gao; Xiaofei Li

Topics

Machine Learning > Application Areas > Knowledge Distillation

Speech & Audio > Synthesis > Speech Enhancement

Speech & Audio > Analysis > Speech Enhancement

Machine Learning > Learning Types > Knowledge Distillation

Deep Learning > Techniques > Knowledge Distillation

Keywords

model compression; knowledge distillation; speech enhancement; teacher-student framework; parameter efficiency; sub-band spectral feature; full frequency band; teacher student learning; sub-band spectral mapping
