2016
INTERSPEECH
INTERSPEECH 2016
Complex Linear Projection (CLP): A Discriminative Approach to Joint Feature Extraction and Acoustic Modeling
Abstract
State-of-the-art automatic speech recognition (ASR) systems typically rely on pre-processed features. This paper studies the time-frequency duality in ASR feature extraction methods and proposes extending the standard acoustic model with a complex-valued linear projection layer to learn and optimize features that minimize standard cost functions such as cross-entropy. The proposed Complex Linear Projection (CLP) features achieve superior performance compared to pre-processed Log Mel features.
π
Conference Pioneer
β INTERSPEECH 2016
π
Interdisciplinary Bridge
β Deep Learning and Machine Learning and Speech & Audio
π§
Keyword Pioneer
β complex linear projection
π
Cross-Pollinator
β Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio
π£
Hot Topic Early Bird
β model architecture
Authors
Topics
Machine Learning > Core Methods > Representation Learning
Deep Learning > Techniques > Model Architecture
Speech & Audio > Recognition > Automatic Speech Recognition
Speech & Audio > Recognition > Speech Recognition
Machine Learning > Core Methods > Feature Learning
Deep Learning > Learning Types > Representation Learning