Sparse and Unique Nonnegative Matrix Factorization Through Data Preprocessing

Nicolas Gillis

2012 JMLR JMLR 2012

Sparse and Unique Nonnegative Matrix Factorization Through Data Preprocessing

Abstract

Nonnegative matrix factorization (NMF) has become a very popular technique in machine learning because it automatically extracts meaningful features through a sparse and part-based representation. However, NMF has the drawback of being highly ill-posed, that is, there typically exist many different but equivalent factorizations. In this paper, we introduce a completely new way to obtaining more well-posed NMF problems whose solutions are sparser. Our technique is based on the preprocessing of the nonnegative input data matrix, and relies on the theory of M-matrices and the geometric interpretation of NMF. This approach provably leads to optimal and sparse solutions under the separability assumption of Donoho and Stodden (2003), and, for rank-three matrices, makes the number of exact factorizations finite. We illustrate the effectiveness of our technique on several image data sets. [abs] [ pdf ][ bib ] © JMLR 2012. (edit, beta)

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🧭 Keyword Pioneer — separability assumption

🐣 Hot Topic Early Bird — feature extraction

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Nicolas Gillis

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Optimization & Theory > Optimization Mathematics & Optimization > Optimization > Continuous Optimization

Keywords

feature extraction sparse representation nonnegative matrix factorization separability assumption

Download PDF

Related papers

Plug-in Approach to Active Learning 2012

An Active Learning Algorithm for Ranking from Pairwise Preferences with an Almost Optimal Query Complexity 2012

Eliminating Spammers and Ranking Annotators for Crowdsourced Labeling Tasks 2012

GPLP: A Local and Parallel Computation Toolbox for Gaussian Process Regression 2012

Query Strategies for Evading Convex-Inducing Classifiers 2012