Learning Label Embeddings for Nearest-Neighbor Multi-class Classification with an Application to Speech Recognition

Natasha Singh-miller; Michael Collins

2009 NIPS NeurIPS 2009

Learning Label Embeddings for Nearest-Neighbor Multi-class Classification with an Application to Speech Recognition

Abstract

We consider the problem of using nearest neighbor methods to provide a conditional probability estimate, P(y|a), when the number of labels y is large and the labels share some underlying structure. We propose a method for learning error-correcting output codes (ECOCs) to model the similarity between labels within a nearest neighbor framework. The learned ECOCs and nearest neighbor information are used to provide conditional probability estimates. We apply these estimates to the problem of acoustic modeling for speech recognition. We demonstrate an absolute reduction in word error rate (WER) of 0.9% (a 2.5% relative reduction in WER) on a lecture recognition task over a state-of-the-art baseline GMM model.

🌉 Interdisciplinary Bridge — Machine Learning and Speech & Audio

📈 Trend Setter — Speech Recognition

🧭 Keyword Pioneer — error-correcting output code

🐣 Hot Topic Early Bird — speech recognition

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Natasha Singh-miller , Michael Collins

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Metric Learning Machine Learning > Core Methods > Embedding Learning Speech & Audio > Recognition > Speech Recognition

Keywords

speech recognition acoustic modeling nearest neighbor multi-class classification conditional probability label embedding nearest neighbor classification error-correcting output code

Download PDF

Related papers

Solving Stochastic Games 2009

Bilinear classifiers for visual recognition 2009

Zero-shot Learning with Semantic Output Codes 2009

Matrix Completion from Power-Law Distributed Samples 2009

Heavy-Tailed Symmetric Stochastic Neighbor Embedding 2009