Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

Francisco Vargas; Ryan Cotterell

EMNLP 2020

Abstract

Bolukbasi et al. (2016) present one of the first gender bias mitigation techniques for word embeddings. Their method takes pre-trained word embeddings as input and attempts to isolate a linear subspace that captures most of the gender bias in the embeddings. As judged by an analogical evaluation task, their method virtually eliminates gender bias in the embeddings. However, an implicit and untested assumption of their method is that the bias subspace is actually linear. In this work, we generalize their method to a kernelized, non-linear version. We take inspiration from kernel principal component analysis and derive a non-linear bias isolation technique. We discuss and overcome some of the practical drawbacks of our method for non-linear gender bias mitigation in word embeddings and analyze empirically whether the bias subspace is actually linear. Our analysis shows that gender bias is in fact well captured by a linear subspace, justifying the assumption of Bolukbasi et al. (2016).
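To make the linear case concrete, the following is a minimal sketch of isolating a bias subspace with principal component analysis and then projecting it out of a word vector. It is an illustration under stated assumptions, not the authors' implementation: the function names `bias_subspace` and `neutralize`, the toy 8-dimensional vectors, and the synthetic "gender direction" are all hypothetical, and the kernelized variant proposed in the paper is not shown.

```python
import numpy as np


def bias_subspace(pairs, k=1):
    """Return the top-k principal directions of pair-centered embeddings.

    `pairs` is a list of (vector_a, vector_b) tuples, e.g. embeddings of
    hypothetical defining pairs such as ("he", "she").
    """
    centered = []
    for a, b in pairs:
        mean = (a + b) / 2.0
        # Centering each pair removes what the two words share and keeps
        # only the direction along which they differ.
        centered.extend([a - mean, b - mean])
    X = np.stack(centered)
    # Rows of vt are orthonormal directions sorted by singular value.
    _, _, vt = np.linalg.svd(X, full_matrices=False)
    return vt[:k]


def neutralize(w, basis):
    """Remove the component of w inside the subspace, then renormalize."""
    projection = basis.T @ (basis @ w)
    residual = w - projection
    return residual / np.linalg.norm(residual)


# Toy data (assumption): the bias lives entirely along the first axis.
rng = np.random.default_rng(0)
dim = 8
gender_dir = np.zeros(dim)
gender_dir[0] = 1.0

pairs = []
for _ in range(5):
    base = rng.normal(size=dim)
    pairs.append((base + gender_dir, base - gender_dir))

basis = bias_subspace(pairs, k=1)

# A hypothetical "neutral" word that leaks some of the bias direction.
word = rng.normal(size=dim) + 0.5 * gender_dir
debiased = neutralize(word, basis)
```

After neutralization, `basis @ debiased` is numerically zero, which is the sense in which the bias component has been removed. A non-linear version would replace the SVD step with a kernel PCA in feature space, which is the generalization the abstract describes.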

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Application Areas > Fairness
Natural Language Processing > Resources & Methods > Text Representation
Mathematics & Optimization > Mathematics > Linear Algebra
Machine Learning > Core Methods > Dimensionality Reduction
Machine Learning > Learning Types > Fairness
Deep Learning > Learning Types > Representation Learning
Machine Learning > Core Methods > Interpretability

Keywords

principal component analysis, kernel principal component analysis, bias mitigation, word embedding, linear subspace, gender bias, gender bias mitigation, kernel methods, analogical evaluation
