2023
ACL
ACL 2023
A Weakly Supervised Classifier and Dataset of White Supremacist Language
Abstract
AbstractWe present a dataset and classifier for detecting the language of white supremacist extremism, a growing issue in online hate speech. Our weakly supervised classifier is trained on large datasets of text from explicitly white supremacist domains paired with neutral and anti-racist data from similar domains. We demonstrate that this approach improves generalization performance to new domains. Incorporating anti-racist texts as counterexamples to white supremacist language mitigates bias.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing
🧭
Keyword Pioneer
— white supremacist language
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Responsible AI
Machine Learning > Core Methods > Classification
Machine Learning > Learning Types > Weakly Supervised Learning
Natural Language Processing > Applications > Text Classification
Natural Language Processing > Applications > Sentiment Analysis
Deep Learning > Learning Types > Weakly Supervised Learning