2024 INTERSPEECH INTERSPEECH 2024

Speech quality evaluation of neural audio codecs

Abstract

This paper presents speech quality results to characterize the state of the art and technological advance of recent neural audio codecs targeting low bitrates. Audio quality was evaluated in one clean speech experiment (in French). Degradation Mean Opinion Score (DMOS) results are reported and discussed for neural audio codecs (LPCNet, Lyra V2, EnCodec, AudioCraft, AudioDec, Descript Audio Codec) – traditional codecs (Opus, EVS) are also included as performance yardsticks. We also discuss observed codec complexity to complement subjective test results.

🧭 Keyword Pioneer — degradation mean opinion score
🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio