2021
NIPS
NeurIPS 2021
Representation Costs of Linear Neural Networks: Analysis and Design
Abstract
For different parameterizations (mappings from parameters to predictors), we study the regularization cost in predictor space induced by $l_2$ regularization on the parameters (weights). We focus on linear neural networks as parameterizations of linear predictors. We identify the representation cost of certain sparse linear ConvNets and residual networks. In order to get a better understanding of how the architecture and parameterization affect the representation cost, we also study the reverse problem, identifying which regularizers on linear predictors (e.g., $l_p$ norms, group norms, the $k$-support-norm, elastic net) can be the representation cost induced by simple $l_2$ regularization, and designing the parameterizations that do so.
🌉
Interdisciplinary Bridge
— Deep Learning and Machine Learning
🧭
Keyword Pioneer
— weight norm
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Vision, Deep Learning, Machine Learning, Mathematics & Optimization, Reinforcement Learning
Authors
Topics
Machine Learning > Core Methods > Representation Learning
Machine Learning > Optimization & Theory > Neural Network Optimization
Machine Learning > Optimization & Theory > Theory
Deep Learning > Architectures > Neural Networks
Deep Learning > Optimization & Theory > Theory
Machine Learning > Optimization & Theory > Regularization