On Socially Fair Low-Rank Approximation and Column Subset Selection

Zhao Song; Ali Vakilian; David P. Woodruff; Samson Zhou

2024 NIPS NeurIPS 2024

On Socially Fair Low-Rank Approximation and Column Subset Selection

Abstract

Low-rank approximation and column subset selection are two fundamental and related problems that are applied across a wealth of machine learning applications. In this paper, we study the question of socially fair low-rank approximation and socially fair column subset selection, where the goal is to minimize the loss over all sub-populations of the data. We show that surprisingly, even constant-factor approximation to fair low-rank approximation requires exponential time under certain standard complexity hypotheses. On the positive side, we give an algorithm for fair low-rank approximation that, for a constant number of groups and constant-factor accuracy, runs in $2^{\text{poly}(k)}$ rather than the naive $n^{\text{poly}(k)}$, which is a substantial improvement when the dataset has a large number $n$ of observations. We then show that there exist bicriteria approximation algorithms for fair low-rank approximation and fair column subset selection that runs in polynomial time.

🌉 Interdisciplinary Bridge — Machine Learning and Mathematics & Optimization

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning

🧭 Keyword Pioneer — sub-population loss

Authors

Zhao Song , Ali Vakilian , David P. Woodruff , Samson Zhou

Topics

Machine Learning > Application Areas > Fairness Mathematics & Optimization > Mathematics > Linear Algebra Mathematics & Optimization > Optimization > Combinatorial Optimization Machine Learning > Core Methods > Matrix Factorization Machine Learning > Core Methods > Optimization Mathematics & Optimization > Optimization > Approximation Algorithms Machine Learning > Optimization & Theory > Approximation Algorithms

Keywords

matrix factorization low-rank approximation matrix approximation approximation algorithm fair machine learning column subset selection bicriteria approximation fair optimization sub-population loss

Download PDF

Related papers

SPIQA: A Dataset for Multimodal Question Answering on Scientific Papers 2024

Training for Stable Explanation for Free 2024

NeuralSolver: Learning Algorithms For Consistent and Efficient Extrapolation Across General Tasks 2024

Expectation Alignment: Handling Reward Misspecification in the Presence of Expectation Mismatch 2024

MicroAdam: Accurate Adaptive Optimization with Low Space Overhead and Provable Convergence 2024