Reducing Gender Bias in Word-Level Language Models with a Gender-Equalizing Loss Function

Yusu Qian; Urwa Muaz; Ben Zhang; Jae Won Hyun

ACL 2019

Abstract

Gender bias exists in natural language datasets, and neural language models tend to learn it, resulting in biased text generation. In this research, we propose a debiasing approach based on modifying the loss function. We introduce a new term to the loss function that attempts to equalize the probabilities of male and female words in the output. Using an array of bias evaluation metrics, we provide empirical evidence that our approach successfully mitigates gender bias in language models without increasing perplexity. Compared with the existing debiasing strategies of data augmentation and word embedding debiasing, our method performs better in several respects, especially in reducing gender bias in occupation words. Finally, we introduce a combination of data augmentation and our approach and show that it outperforms existing strategies on all bias evaluation metrics.
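The abstract does not spell out the form of the added loss term. As a minimal sketch, assuming the term penalizes the absolute log-ratio between the predicted probabilities of paired female and male words and is added to the usual cross-entropy with a weight, it might look like the following. The function names, the `gender_pairs` index list, and the weight `lam` are hypothetical and introduced here only for illustration.

```python
import math


def softmax(logits):
    """Convert raw scores over the vocabulary into probabilities."""
    peak = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def gender_equalizing_penalty(probs, gender_pairs):
    """Mean absolute log-ratio over (female_idx, male_idx) vocabulary pairs.

    Assumed form: zero when every paired word is equally likely,
    growing as the model favours one gender over the other.
    """
    ratios = [abs(math.log(probs[f] / probs[m])) for f, m in gender_pairs]
    return sum(ratios) / len(ratios)


def total_loss(logits, target_idx, gender_pairs, lam=1.0):
    """Cross-entropy for the target word plus the weighted penalty."""
    probs = softmax(logits)
    cross_entropy = -math.log(probs[target_idx])
    return cross_entropy + lam * gender_equalizing_penalty(probs, gender_pairs)


# Toy vocabulary: index 0 = "she", 1 = "he", 2 and 3 = other words.
pairs = [(0, 1)]
balanced = total_loss([1.0, 1.0, 0.0, 0.0], target_idx=2, gender_pairs=pairs)
skewed = total_loss([2.0, 0.0, 0.0, 0.0], target_idx=2, gender_pairs=pairs)
```

In this toy setup the penalty vanishes when "she" and "he" receive the same score, so a model can only lower the added term by moving the paired probabilities toward each other. Setting `lam` to zero recovers plain cross-entropy.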

Authors

Yusu Qian; Urwa Muaz; Ben Zhang; Jae Won Hyun

Topics

Machine Learning > Optimization & Theory > Loss Functions
Machine Learning > Application Areas > Fairness
Natural Language Processing > Generation > Language Modeling
Artificial Intelligence > Core AI > Fairness
Natural Language Processing > Applications > Text Generation
Machine Learning > Learning Types > Fairness
Deep Learning > Learning Types > Representation Learning

Keywords

data augmentation, text generation, loss function, language model, word embedding, gender bias
