The Complexity of Explaining Neural Networks Through (group) Invariants

Danielle Ensign; Scott Neville; Arnab Paul; Suresh Venkatasubramanian

ALT 2017

Abstract

Ever since the work of Minsky and Papert, it has been thought that neural networks derive their effectiveness by finding representations of the data that are invariant with respect to the task. In other words, the representations eliminate components of the data that vary in a way that is irrelevant. These invariants are naturally expressed with respect to group operations, and thus an understanding of these groups is key to explaining the effectiveness of the neural network. Moreover, a line of work in deep learning has shown that explicit knowledge of group invariants can lead to more effective training results. In this paper, we investigate the difficulty of discovering anything about these implicit invariants. Unfortunately, our main results are negative: we show that a variety of questions around investigating invariant representations are NP-hard, even in approximate settings. Moreover, these results do not depend on the kind of architecture used: in fact, our results follow as soon as the network architecture is powerful enough to be universal. The key idea behind our results is that if we can find the symmetries of a problem then we can solve it.
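To make the notion of a group invariant concrete, the following is a minimal sketch, not the paper's construction. It checks by brute force whether a Boolean function is unchanged when a permutation group acts on its input coordinates. The helper names (`apply_perm`, `is_invariant`) and the two example functions are hypothetical and chosen only for illustration.

```python
from itertools import product

def apply_perm(perm, x):
    """Permute the coordinates of x: output position i takes x[perm[i]]."""
    return tuple(x[p] for p in perm)

def is_invariant(f, generators, n):
    """Brute-force check that f(g.x) == f(x) for every generator g and every
    binary input x of length n. Checking generators is enough, because
    invariance under g and h implies invariance under their composition."""
    return all(
        f(apply_perm(g, x)) == f(x)
        for x in product((0, 1), repeat=n)
        for g in generators
    )

n = 4
cyclic_shift = (1, 2, 3, 0)    # generates the cyclic group C_4
swap_first_two = (1, 0, 2, 3)  # together with the shift, generates S_4

def parity(x):
    """Depends only on the number of ones, so any permutation preserves it."""
    return sum(x) % 2

def adjacent_ones(x):
    """1 if two neighbouring positions on the cycle are both 1, else 0."""
    return int(any(x[i] and x[(i + 1) % len(x)] for i in range(len(x))))

print(is_invariant(parity, [cyclic_shift, swap_first_two], n))  # True
print(is_invariant(adjacent_ones, [cyclic_shift], n))           # True
print(is_invariant(adjacent_ones, [swap_first_two], n))         # False
```

The check enumerates all 2^n inputs, so it is only feasible for very small n. This illustrates why verifying an invariant naively is expensive, but it is not evidence for the hardness results themselves.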

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Optimization & Theory > Learning Theory

Keywords

representation learning, computational complexity, neural network, group invariant
