Measuring Bias in Contextualized Word Representations

Keita Kurita; Nidhi Vyas; Ayush Pareek; Alan W Black; Yulia Tsvetkov

2019 ACL ACL 2019

Measuring Bias in Contextualized Word Representations

Abstract

AbstractContextual word embeddings such as BERT have achieved state of the art performance in numerous NLP tasks. Since they are optimized to capture the statistical properties of training data, they tend to pick up on and amplify social stereotypes present in the data as well. In this study, we (1) propose a template-based method to quantify bias in BERT; (2) show that this method obtains more consistent results in capturing social biases than the traditional cosine based method; and (3) conduct a case study, evaluating gender bias in a downstream task of Gender Pronoun Resolution. Although our case study focuses on gender bias, the proposed technique is generalizable to unveiling other biases, including in multiclass settings, such as racial and religious biases.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — social stereotype

🐣 Hot Topic Early Bird — social bia

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Keita Kurita , Nidhi Vyas , Ayush Pareek , Alan W Black , Yulia Tsvetkov

Topics

Machine Learning > Application Areas > Fairness Natural Language Processing > Resources & Methods > Text Representation Machine Learning > Learning Types > Representation Learning Natural Language Processing > Applications > Natural Language Inference Artificial Intelligence > Core AI > Fairness

Keywords

social bia gender bia social stereotype stereotype amplification contextual word embedding gender pronoun resolution

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019