Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company’s Reputation

Nikolay Babakov; Varvara Logacheva; Olga Kozlova; Nikita Semenov; Alexander Panchenko

2021 EACL EACL 2021

Detecting Inappropriate Messages on Sensitive Topics that Could Harm a Company’s Reputation

Abstract

AbstractNot all topics are equally “flammable” in terms of toxicity: a calm discussion of turtles or fishing less often fuels inappropriate toxic dialogues than a discussion of politics or sexual minorities. We define a set of sensitive topics that can yield inappropriate and toxic messages and describe the methodology of collecting and labelling a dataset for appropriateness. While toxicity in user-generated data is well-studied, we aim at defining a more fine-grained notion of inappropriateness. The core of inappropriateness is that it can harm the reputation of a speaker. This is different from toxicity in two respects: (i) inappropriateness is topic-related, and (ii) inappropriate message is not toxic but still unacceptable. We collect and release two datasets for Russian: a topic-labelled dataset and an appropriateness-labelled dataset. We also release pre-trained classification models trained on this data.

🧭 Keyword Pioneer — inappropriate message detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Nikolay Babakov , Varvara Logacheva , Olga Kozlova , Nikita Semenov , Alexander Panchenko

Topics

Machine Learning > Core Methods > Classification Machine Learning > Application Areas > Fairness

Keywords

text classification toxicity classification inappropriate message detection topic-sensitive classification reputation harm detection

Download PDF

Related papers

Joint Coreference Resolution and Character Linking for Multiparty Conversation 2021

Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering 2021

Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO 2021

Representations for Question Answering from Documents with Tables and Text 2021

Gender and Racial Fairness in Depression Research using Social Media 2021