Predictive Embeddings for Hate Speech Detection on Twitter

Rohan Kshirsagar; Tyrus Cukuvac; Kathy McKeown; Susan McGregor

2018 EMNLP EMNLP 2018

Predictive Embeddings for Hate Speech Detection on Twitter

Abstract

AbstractWe present a neural-network based approach to classifying online hate speech in general, as well as racist and sexist speech in particular. Using pre-trained word embeddings and max/mean pooling from simple, fully-connected transformations of these embeddings, we are able to predict the occurrence of hate speech on three commonly used publicly available datasets. Our models match or outperform state of the art F1 performance on all three datasets using significantly fewer parameters and minimal feature preprocessing compared to previous methods.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🐣 Hot Topic Early Bird — hate speech detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rohan Kshirsagar , Tyrus Cukuvac , Kathy McKeown , Susan McGregor

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Embedding Learning Natural Language Processing > Applications > Text Classification

Keywords

text classification word embedding hate speech detection neural network max pooling

Download PDF

Related papers

Speeding Up Neural Machine Translation Decoding by Cube Pruning 2018

Limitations in learning an interpreted language with recurrent models 2018

Results of the sixth edition of the BioASQ Challenge 2018

Neural Segmental Hypergraphs for Overlapping Mention Recognition 2018

Hybrid Neural Attention for Agreement/Disagreement Inference in Online Debates 2018