Sparse PCA via Bipartite Matchings

Megasthenis Asteris; Dimitris Papailiopoulos; Anastasios Kyrillidis; Alexandros G Dimakis

2015 NIPS NeurIPS 2015

Sparse PCA via Bipartite Matchings

Abstract

We consider the following multi-component sparse PCA problem:given a set of data points, we seek to extract a small number of sparse components with \emph{disjoint} supports that jointly capture the maximum possible variance.Such components can be computed one by one, repeatedly solving the single-component problem and deflating the input data matrix, but this greedy procedure is suboptimal.We present a novel algorithm for sparse PCA that jointly optimizes multiple disjoint components. The extracted features capture variance that lies within a multiplicative factor arbitrarily close to $1$ from the optimal.Our algorithm is combinatorial and computes the desired components by solving multiple instances of the bipartite maximum weight matching problem.Its complexity grows as a low order polynomial in the ambient dimension of the input data, but exponentially in its rank.However, it can be effectively applied on a low-dimensional sketch of the input data.We evaluate our algorithm on real datasets and empirically demonstrate that in many cases it outperforms existing, deflation-based approaches.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐣 Hot Topic Early Bird — combinatorial optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Megasthenis Asteris , Dimitris Papailiopoulos , Anastasios Kyrillidis , Alexandros G Dimakis

Topics

Machine Learning > Core Methods > Representation Learning Mathematics & Optimization > Mathematics > Graph Theory Mathematics & Optimization > Optimization > Combinatorial Optimization Machine Learning > Core Methods > Dimensionality Reduction Machine Learning > Learning Types > Sparse Learning

Keywords

combinatorial optimization dimensionality reduction feature extraction matrix factorization bipartite matching sparse principal component analysis variance maximization sparse pca matrix deflation

Download PDF

Related papers

Data Generation as Sequential Decision Making 2015

A Recurrent Latent Variable Model for Sequential Data 2015

Combinatorial Cascading Bandits 2015

Accelerated Mirror Descent in Continuous and Discrete Time 2015

Matrix Completion with Noisy Side Information 2015