Learning Adversarially Fair and Transferable Representations

David Madras; Elliot Creager; Toniann Pitassi; Richard Zemel

ICML 2018

Abstract

In this paper, we advocate for representation learning as the key to mitigating unfair prediction outcomes downstream. Motivated by a scenario where learned representations are used by third parties with unknown objectives, we propose and explore adversarial representation learning as a natural method of ensuring those parties act fairly. We connect group fairness (demographic parity, equalized odds, and equal opportunity) to different adversarial objectives. Through worst-case theoretical guarantees and experimental validation, we show that the choice of this objective is crucial to fair prediction. Furthermore, we present the first in-depth experimental demonstration of fair transfer learning and demonstrate empirically that our learned representations admit fair predictions on new tasks while maintaining utility, an essential goal of fair representation learning.
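
The three group fairness criteria named in the abstract can be made concrete as gaps between group-wise prediction rates. The following is a minimal sketch, not the authors' code: the function names, the binary 0/1 encoding of the sensitive attribute `a`, the toy data, and the convention of summing the two per-label gaps for equalized odds are all assumptions made for illustration.

```python
import numpy as np


def positive_rate(y_hat, mask):
    """Fraction of positive predictions among the examples selected by mask.

    Assumes the mask selects at least one example.
    """
    return float(np.mean(y_hat[mask]))


def demographic_parity_gap(y_hat, a):
    """|P(y_hat=1 | a=0) - P(y_hat=1 | a=1)|."""
    return abs(positive_rate(y_hat, a == 0) - positive_rate(y_hat, a == 1))


def equal_opportunity_gap(y_hat, y, a):
    """Gap in positive prediction rate between groups, among true positives only."""
    pos = y == 1
    return abs(
        positive_rate(y_hat, (a == 0) & pos)
        - positive_rate(y_hat, (a == 1) & pos)
    )


def equalized_odds_gap(y_hat, y, a):
    """Sum of the between-group gaps conditioned on y=0 and on y=1.

    Summing the two gaps is one possible convention, assumed here.
    """
    total = 0.0
    for label in (0, 1):
        sel = y == label
        total += abs(
            positive_rate(y_hat, (a == 0) & sel)
            - positive_rate(y_hat, (a == 1) & sel)
        )
    return total


# Hypothetical toy data: binary predictions, labels, and sensitive attribute.
y_hat = np.array([1, 1, 0, 0, 1, 0, 0, 0])
y = np.array([1, 0, 1, 0, 1, 1, 0, 0])
a = np.array([0, 0, 0, 0, 1, 1, 1, 1])

print(demographic_parity_gap(y_hat, a))
print(equal_opportunity_gap(y_hat, y, a))
print(equalized_odds_gap(y_hat, y, a))
```

On this toy data the predictor satisfies equal opportunity while violating both demographic parity and equalized odds, which illustrates that the three criteria are distinct targets.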

Topics

Machine Learning > Learning Types > Adversarial Learning
Machine Learning > Application Areas > Fairness
Machine Learning > Learning Types > Representation Learning

Keywords

representation learning, adversarial learning, domain adaptation, group fairness, fair transfer learning
