Stepmothers are mean and academics are pretentious: What do pretrained language models learn about you?

Rochelle Choenni; Ekaterina Shutova; Robert Van Rooij

2021 EMNLP EMNLP 2021

Stepmothers are mean and academics are pretentious: What do pretrained language models learn about you?

Abstract

AbstractIn this paper, we investigate what types of stereotypical information are captured by pretrained language models. We present the first dataset comprising stereotypical attributes of a range of social groups and propose a method to elicit stereotypes encoded by pretrained language models in an unsupervised fashion. Moreover, we link the emergent stereotypes to their manifestation as basic emotions as a means to study their emotional effects in a more generalized manner. To demonstrate how our methods can be used to analyze emotion and stereotype shifts due to linguistic experience, we use fine-tuning on news sources as a case study. Our experiments expose how attitudes towards different social groups vary across models and how quickly emotions and stereotypes can shift at the fine-tuning stage.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — stereotype analysis

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rochelle Choenni , Ekaterina Shutova , Robert Van Rooij

Topics

Artificial Intelligence > Core AI > Interpretability Artificial Intelligence > Core AI > Fairness Natural Language Processing > Resources & Methods > Language Modeling Machine Learning > Learning Types > Fairness Deep Learning > Learning Types > Fine-Tuning

Keywords

unsupervised learning bias detection emotion detection pretrained language model emotion analysis social stereotype stereotype analysis social group

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021