Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression

Pavlos Papadopoulos; Ruchir Travadi; Shrikanth S. Narayanan

2017 INTERSPEECH INTERSPEECH 2017

Global SNR Estimation of Speech Signals for Unknown Noise Conditions Using Noise Adapted Non-Linear Regression

Abstract

The performance of speech technologies deteriorates in the presence of noise. Additionally, we need these technologies to be able to operate across a variety of noise levels and conditions. SNR estimation can guide the design and operation of such technologies or can be used as a pre-processing tool in database creation (e.g. identify/discard noisy signals). We propose a new method to estimate the global SNR of a speech signal when prior information about the noise that corrupts the signal, and speech boundaries within the signal, are not available. To achieve this goal, we train a neural network that performs non-linear regression to estimate the SNR. We use energy ratios as features, as well as ivectors to provide information about the noise that corrupts the signal. We compare our method against others in the literature, using the Mean Absolute Error (MAE) metric, and show that our method outperforms them consistently.

🧭 Keyword Pioneer — non-linear regression

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Pavlos Papadopoulos , Ruchir Travadi , Shrikanth S. Narayanan

Topics

Machine Learning > Core Methods > Regression Machine Learning > Optimization & Theory > Neural Network Optimization

Keywords

signal-to-noise ratio noise estimation speech signal neural network non-linear regression energy ratio

Download PDF

Related papers

Description of the Munich-Passau Snore Sound Corpus (MPSSC) 2017

A Study on Replay Attack and Anti-Spoofing for Automatic Speaker Verification 2017

Binaural Reverberant Speech Separation Based on Deep Neural Networks 2017

Building Audio-Visual Phonetically Annotated Arabic Corpus for Expressive Text to Speech 2017

A Comparison of Danish Listeners’ Processing Cost in Judging the Truth Value of Norwegian, Swedish, and English Sentences 2017