OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings

Sunipa Dev; Tao Li; Jeff M Phillips; Vivek Srikumar

2021 EMNLP EMNLP 2021

OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings

Abstract

AbstractLanguage representations are known to carry stereotypical biases and, as a result, lead to biased predictions in downstream tasks. While existing methods are effective at mitigating biases by linear projection, such methods are too aggressive: they not only remove bias, but also erase valuable information from word embeddings. We develop new measures for evaluating specific information retention that demonstrate the tradeoff between bias removal and information retention. To address this challenge, we propose OSCaR (Orthogonal Subspace Correction and Rectification), a bias-mitigating method that focuses on disentangling biased associations between concepts instead of removing concepts wholesale. Our experiments on gender biases show that OSCaR is a well-balanced approach that ensures that semantic information is retained in the embeddings and bias is also effectively mitigated.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

📈 Trend Setter — Fairness

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Sunipa Dev , Tao Li , Jeff M Phillips , Vivek Srikumar

Topics

Machine Learning > Core Methods > Representation Learning Machine Learning > Application Areas > Fairness Machine Learning > Learning Types > Representation Learning Artificial Intelligence > Core AI > Fairness Deep Learning > Techniques > Representation Learning Deep Learning > Learning Types > Fairness

Keywords

orthogonal projection semantic information bias mitigation word embedding orthogonal subspace gender bia

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021