Neural Kernels Without Tangents

Vaishaal Shankar; Alex Fang; Wenshuo Guo; Sara Fridovich-Keil; Jonathan Ragan-Kelley; Ludwig Schmidt; Benjamin Recht

ICML 2020

Abstract

We investigate the connections between neural networks and simple building blocks in kernel space. In particular, using well-established feature space tools such as direct sum, averaging, and moment lifting, we present an algebra for creating "compositional" kernels from bags of features. We show that these operations correspond to many of the building blocks of neural tangent kernels (NTKs). Experimentally, we show that test error is correlated between neural network architectures and their associated kernels. We construct a simple neural network architecture that uses only 3x3 convolutions, 2x2 average pooling, and ReLU, is optimized with SGD and MSE loss, and achieves 96% accuracy on CIFAR10; its corresponding compositional kernel achieves 90% accuracy. We also use our constructions to investigate the relative performance of neural networks, NTKs, and compositional kernels in the small dataset regime. In particular, we find that compositional kernels outperform NTKs, and that neural networks outperform both kernel methods.
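The three feature space tools named in the abstract each have a standard kernel-side counterpart, which the following minimal sketch checks numerically with a plain linear kernel. It is an illustration under stated assumptions, not the paper's construction: the helper names (`direct_sum`, `average`, `moment_lift`, `demo`) are hypothetical, the moment lift is restricted to degree 2, and the random vectors stand in for real features.

```python
import numpy as np

# Hypothetical helpers for illustration; none of these names come from the paper.

def direct_sum(x1, x2):
    """Direct sum of two feature vectors: concatenation."""
    return np.concatenate([x1, x2])

def average(bag):
    """Average a bag of feature vectors (rows) into one vector."""
    return np.mean(bag, axis=0)

def moment_lift(x):
    """Degree-2 moment lift: flattened outer product of x with itself."""
    return np.outer(x, x).ravel()

def demo(seed=0):
    """Return (feature-space value, kernel-space value) pairs for each operation."""
    rng = np.random.default_rng(seed)
    a, b, c, d = rng.normal(size=(4, 5))

    # Direct sum of features corresponds to a sum of kernels.
    sum_pair = (direct_sum(a, b) @ direct_sum(c, d), a @ c + b @ d)

    # Averaging features corresponds to averaging the kernel over all pairs.
    bag_x = rng.normal(size=(3, 5))
    bag_y = rng.normal(size=(4, 5))
    avg_pair = (average(bag_x) @ average(bag_y), (bag_x @ bag_y.T).mean())

    # Degree-2 moment lifting corresponds to squaring the kernel.
    lift_pair = (moment_lift(a) @ moment_lift(c), (a @ c) ** 2)

    return {"direct_sum": sum_pair, "average": avg_pair, "moment_lift": lift_pair}

if __name__ == "__main__":
    for name, (feature_side, kernel_side) in demo().items():
        print(name, np.isclose(feature_side, kernel_side))
```

In each pair the first value is computed from explicit features and the second from kernel evaluations alone, which is why such operations can be composed without ever materializing the feature vectors.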

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Optimization & Theory > Theory
Deep Learning > Architectures > Neural Networks
Machine Learning > Core Methods > Kernel Methods
Deep Learning > Learning Types > Representation Learning

Keywords

neural tangent kernel, feature learning, feature space, convolutional neural network, compositional kernel, kernel methods, SGD optimization
