2020
AAAI
AAAI 2020
VECA: A Method for Detecting Overfitting in Neural Networks (Student Abstract)
Abstract
Abstract Despite their widespread applications, deep neural networks often tend to overfit the training data. Here, we propose a measure called VECA (Variance of Eigenvalues of Covariance matrix of Activation matrix) and demonstrate that VECA is a good predictor of networks' generalization performance during the training process. Experiments performed on fully-connected networks and convolutional neural networks trained on benchmark image datasets show a strong correlation between test loss and VECA, which suggest that we can calculate the VECA to estimate generalization performance without sacrificing training data to be used as a validation set.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning
🧭
Keyword Pioneer
— eigenvalue variance
🐣
Hot Topic Early Bird
— training dynamics
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Machine Learning > Optimization & Theory > Neural Network Optimization
Machine Learning > Optimization & Theory > Theory
Deep Learning > Architectures > Neural Networks
Deep Learning > Optimization & Theory > Neural Network Optimization
Deep Learning > Optimization & Theory > Theory
Machine Learning > Learning Types > Regularization