Demographic-Aware Language Model Fine-tuning as a Bias Mitigation Technique

Aparna Garimella; Rada Mihalcea; Akhash Amarnath

2022 AACL AACL 2022

Demographic-Aware Language Model Fine-tuning as a Bias Mitigation Technique

Abstract

AbstractBERT-like language models (LMs), when exposed to large unstructured datasets, are known to learn and sometimes even amplify the biases present in such data. These biases generally reflect social stereotypes with respect to gender, race, age, and others. In this paper, we analyze the variations in gender and racial biases in BERT, a large pre-trained LM, when exposed to different demographic groups. Specifically, we investigate the effect of fine-tuning BERT on text authored by historically disadvantaged demographic groups in comparison to that by advantaged groups. We show that simply by fine-tuning BERT-like LMs on text authored by certain demographic groups can result in the mitigation of social biases in these LMs against various target groups.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning

🐣 Hot Topic Early Bird — demographic bia

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Aparna Garimella , Rada Mihalcea , Akhash Amarnath

Topics

Artificial Intelligence > Core AI > Responsible AI Machine Learning > Application Areas > Domain Adaptation Machine Learning > Application Areas > Fairness Artificial Intelligence > Core AI > Fairness Machine Learning > Learning Types > Fairness Deep Learning > Learning Types > Transfer Learning

Keywords

bias mitigation language model language model fine-tuning social bia gender bia racial bia demographic bia social stereotype

Download PDF

Related papers

A Japanese Corpus of Many Specialized Domains for Word Segmentation and Part-of-Speech Tagging 2022

Enhancing Tabular Reasoning with Pattern Exploiting Training 2022

Re-contextualizing Fairness in NLP: The Case of India 2022

Adversarially Improving NMT Robustness to ASR Errors with Confusion Sets 2022

Promoting Pre-trained LM with Linguistic Features on Automatic Readability Assessment 2022