End-to-End Optimization of Source Models for Speech and Audio Coding Using a Machine Learning Framework

Tom Bäckström

2019 INTERSPEECH INTERSPEECH 2019

End-to-End Optimization of Source Models for Speech and Audio Coding Using a Machine Learning Framework

Abstract

Speech coding is the most commonly used application of speech processing. Accumulated layers of improvements have however made codecs so complex that optimization of individual modules becomes increasingly difficult. This work introduces machine learning methodology to speech and audio coding, such that we can optimize quality in terms of overall entropy. We can then use conventional quantization, coding and perceptual models without modification such that the codec adheres to conventional requirements on algorithmic complexity, latency and robustness to packet loss. Experiments demonstrate that end-to-end optimization of quantization accuracy of the spectral envelope can be used for a lossless reduction in bitrate of 0.4 kbits/s.

🧭 Keyword Pioneer — spectral envelope

🐝 Cross-Pollinator — Data Science & Analytics, Deep Learning, Machine Learning, Speech & Audio

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Speech & Audio

Authors

Tom Bäckström

Topics

Machine Learning > Optimization & Theory > Optimization Speech & Audio > Synthesis > Speech Enhancement Machine Learning > Core Methods > Optimization Deep Learning > Optimization & Theory > Optimization

Keywords

spectral envelope speech coding audio coding end-to-end optimization bitrate reduction quantization accuracy perceptual modeling

Download PDF

Related papers

Using Real-Time Visual Biofeedback for Second Language Instruction 2019

VAE-Based Regularization for Deep Speaker Embedding 2019

End-to-End SpeakerBeam for Single Channel Target Speech Recognition 2019

Attention-Enhanced Connectionist Temporal Classification for Discrete Speech Emotion Recognition 2019

Attentive to Individual: A Multimodal Emotion Recognition Network with Personalized Attention Profile 2019