2018
INTERSPEECH
INTERSPEECH 2018
Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks
Abstract
Time Delay Neural Networks (TDNNs), also known as one-dimensional Convolutional Neural Networks (1-d CNNs), are an efficient and well-performing neural network architecture for speech recognition. We introduce a factored form of TDNNs (TDNN-F) which is structurally the same as a TDNN whose layers have been compressed via SVD, but is trained from a random start with one of the two factors of each matrix constrained to be semi-orthogonal. This gives substantial improvements over TDNNs and performs about as well as TDNN-LSTM hybrids.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning
🧭
Keyword Pioneer
— semi-orthogonal constraint
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing
🐣
Hot Topic Early Bird
— singular value decomposition
Authors
Topics
Machine Learning > Optimization & Theory > Optimization
Deep Learning > Architectures > Neural Networks
Speech & Audio > Recognition > Automatic Speech Recognition
Machine Learning > Application Areas > Model Compression
Machine Learning > Core Methods > Matrix Factorization
Deep Learning > Optimization & Theory > Model Compression