Convex Deep Learning via Normalized Kernels

Özlem Aslan; Xinhua Zhang; Dale Schuurmans

2014 NIPS NeurIPS 2014

Convex Deep Learning via Normalized Kernels

Abstract

Deep learning has been a long standing pursuit in machine learning, which until recently was hampered by unreliable training methods before the discovery of improved heuristics for embedded layer training. A complementary research strategy is to develop alternative modeling architectures that admit efficient training methods while expanding the range of representable structures toward deep models. In this paper, we develop a new architecture for nested nonlinearities that allows arbitrarily deep compositions to be trained to global optimality. The approach admits both parametric and nonparametric forms through the use of normalized kernels to represent each latent layer. The outcome is a fully convex formulation that is able to capture compositions of trainable nonlinear layers to arbitrary depth.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning

🧭 Keyword Pioneer — nested nonlinearities

🐣 Hot Topic Early Bird — representation learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Özlem Aslan , Xinhua Zhang , Dale Schuurmans

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Optimization & Theory > Optimization Deep Learning > Architectures > Neural Networks Machine Learning > Core Methods > Kernel Methods Deep Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Convex Optimization

Keywords

representation learning convex optimization deep learning nested nonlinearities global optimality normalized kernel kernel methods neural network

Download PDF

Related papers

Information-based learning by agents in unbounded state spaces 2014

Stochastic Gradient Descent, Weighted Sampling, and the Randomized Kaczmarz algorithm 2014

Partition-wise Linear Models 2014

Active Regression by Stratification 2014

Cone-Constrained Principal Component Analysis 2014