A Computational Exploration of Pejorative Language in Social Media

Liviu P. Dinu; Ioan-Bogdan Iordache; Ana Sabina Uban; Marcos Zampieri

2021 EMNLP EMNLP 2021

A Computational Exploration of Pejorative Language in Social Media

Abstract

AbstractIn this paper we study pejorative language, an under-explored topic in computational linguistics. Unlike existing models of offensive language and hate speech, pejorative language manifests itself primarily at the lexical level, and describes a word that is used with a negative connotation, making it different from offensive language or other more studied categories. Pejorativity is also context-dependent: the same word can be used with or without pejorative connotations, thus pejorativity detection is essentially a problem similar to word sense disambiguation. We leverage online dictionaries to build a multilingual lexicon of pejorative terms for English, Spanish, Italian, and Romanian. We additionally release a dataset of tweets annotated for pejorative use. Based on these resources, we present an analysis of the usage and occurrence of pejorative words in social media, and present an attempt to automatically disambiguate pejorative usage in our dataset.

🌉 Interdisciplinary Bridge — Interdisciplinary and Natural Language Processing

🧭 Keyword Pioneer — pejorative language

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Liviu P. Dinu , Ioan-Bogdan Iordache , Ana Sabina Uban , Marcos Zampieri

Topics

Natural Language Processing > Understanding > Sentiment Analysis Interdisciplinary > Linguistics > Computational Linguistics Interdisciplinary > Social > Social Media Analysis Natural Language Processing > Applications > Sentiment Analysis Natural Language Processing > Understanding > Lexical Semantics

Keywords

sentiment analysis word sense disambiguation lexical semantics computational linguistics social media pejorative language

Download PDF

Related papers

Continual Learning in Multilingual NMT via Language-Specific Embeddings 2021

MultiDoc2Dial: Modeling Dialogues Grounded in Multiple Documents 2021

Efficient Multi-Task Auxiliary Learning: Selecting Auxiliary Data by Feature Similarity 2021

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings 2021

Semantics-Preserved Data Augmentation for Aspect-Based Sentiment Analysis 2021