Subversive Toxicity Detection using Sentiment Information

Eloi Brassard-Gourdeau; Richard Khoury

2019 ACL ACL 2019

Subversive Toxicity Detection using Sentiment Information

Abstract

AbstractThe presence of toxic content has become a major problem for many online communities. Moderators try to limit this problem by implementing more and more refined comment filters, but toxic users are constantly finding new ways to circumvent them. Our hypothesis is that while modifying toxic content and keywords to fool filters can be easy, hiding sentiment is harder. In this paper, we explore various aspects of sentiment detection and their correlation to toxicity, and use our results to implement a toxicity detection tool. We then test how adding the sentiment information helps detect toxicity in three different real-world datasets, and incorporate subversion to these datasets to simulate a user trying to circumvent the system. Our results show sentiment information has a positive impact on toxicity detection.

🌉 Interdisciplinary Bridge — Interdisciplinary and Machine Learning

🐣 Hot Topic Early Bird — toxicity detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Eloi Brassard-Gourdeau , Richard Khoury

Topics

Machine Learning > Core Methods > Classification Interdisciplinary > Social > Social Media Analysis

Keywords

sentiment analysis natural language processing text classification toxicity detection content moderation

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019