Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation

Yun Liu; Hui Zhang; Xueliang Zhang

2018 INTERSPEECH INTERSPEECH 2018

Using Shifted Real Spectrum Mask as Training Target for Supervised Speech Separation

Abstract

Deep learning-based speech separation has been widely studied in recent years. Most of these kind approaches focus on recovering the magnitude spectrum of the target speech, but ignore the phase estimation. Recently, a method called shifted real spectrum (SRS) is proposed. Unlike the short-time Fourier transform (STFT), the SRS contains only real components which encode the phase information. In this paper, we propose several SRS-based masks and use them as the training target of deep neural networks. Experimental results show that the proposed target outperforms the commonly used masks computed on STFT in general.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🧭 Keyword Pioneer — shifted real spectrum

🐣 Hot Topic Early Bird — speech separation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Yun Liu , Hui Zhang , Xueliang Zhang

Topics

Machine Learning > Core Methods > Classification Machine Learning > Optimization & Theory > Optimization Deep Learning > Models > Generative Models

Keywords

speech separation deep neural network phase estimation shifted real spectrum spectral mask

Download PDF

Related papers

HoloCompanion: An MR Friend for EveryOne 2018

Estimation of the Vocal Tract Length of Vowel Sounds Based on the Frequency of the Significant Spectral Valley 2018

Deep Learning Techniques for Koala Activity Detection 2018

An Exploration of Local Speaking Rate Variations in Mandarin Read Speech 2018

Acoustic Analysis of Whispery Voice Disguise in Mandarin Chinese 2018