Input-Output Equivalence of Unitary and Contractive RNNs

Melikasadat Emami; Mojtaba Sahraee Ardakan; Sundeep Rangan; Alyson K. Fletcher

NeurIPS 2019

Abstract

Unitary recurrent neural networks (URNNs) have been proposed as a method to overcome the vanishing and exploding gradient problem in modeling data with long-term dependencies. A basic question is how restrictive the unitary constraint is on the possible input-output mappings of such a network. This work shows that for any contractive RNN with ReLU activations, there is a URNN with at most twice the number of hidden states and the identical input-output mapping. Hence, with ReLU activations, URNNs are as expressive as general RNNs. In contrast, for certain smooth activations, it is shown that the input-output mapping of an RNN cannot be matched with a URNN, even with an arbitrary number of states. The theoretical results are supported by experiments on modeling of slowly-varying dynamical systems.
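The abstract does not describe how the URNN is constructed, so the following is only an illustrative sketch, not the paper's proof. It shows a standard linear-algebra fact that makes a doubling of the state dimension plausible: a scalar contraction `w` with |w| <= 1 can be embedded as the top-left entry of a 2x2 orthogonal (real unitary) matrix. The helper names `dilate`, `is_orthogonal`, `matvec`, and `norm` are hypothetical and chosen for this example; ReLU activations, biases, and inputs are deliberately left out.

```python
import math


def dilate(w):
    """Embed a scalar contraction |w| <= 1 in a 2x2 orthogonal matrix.

    Assumption for illustration: this is the scalar case of a unitary
    dilation, not necessarily the construction used in the paper.
    """
    if abs(w) > 1.0:
        raise ValueError("w must satisfy |w| <= 1")
    s = math.sqrt(1.0 - w * w)
    # The top-left entry is w itself; the other entries complete the
    # matrix so that its columns are orthonormal.
    return [[w, s], [s, -w]]


def is_orthogonal(m, tol=1e-12):
    """Check that the columns of m are orthonormal, i.e. m^T m = I."""
    n = len(m)
    for i in range(n):
        for j in range(n):
            dot = sum(m[k][i] * m[k][j] for k in range(n))
            expected = 1.0 if i == j else 0.0
            if abs(dot - expected) > tol:
                return False
    return True


def matvec(m, v):
    """Multiply matrix m (list of rows) by vector v."""
    return [sum(a * b for a, b in zip(row, v)) for row in m]


def norm(v):
    """Euclidean norm of v."""
    return math.sqrt(sum(x * x for x in v))


if __name__ == "__main__":
    U = dilate(0.6)
    h = [3.0, 4.0]
    print("orthogonal:", is_orthogonal(U))
    print("norm before:", norm(h), "norm after:", norm(matvec(U, h)))
```

A one-dimensional contractive recurrence becomes a two-dimensional norm-preserving one here, which matches the "at most twice the number of hidden states" bound in spirit only; matching the full input-output mapping with ReLU activations is the nontrivial part the paper addresses.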

Topics

Machine Learning > Optimization & Theory > Theory
Deep Learning > Architectures > Neural Networks

Keywords

dynamical system, hidden state, unitary recurrent neural network, contractive activation, input-output mapping
