HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation

Naquee Rizwan; Seid Muhie Yimam; Daryna Dementieva; Dr. Florian Skupin; Tim Fischer; Daniil Moskovskiy; Aarushi Ajay Borkar; Robert Geislinger; Punyajoy Saha; Sarthak Roy; Martin Semmann; Alexander Panchenko; Chris Biemann; Animesh Mukherjee

2025 ACL ACL 2025

HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation

Abstract

AbstractDespite regulations imposed by nations and social media platforms, e.g. (Government of India, 2021; European Parliament and Council of the European Union, 2022), inter alia, hateful content persists as a significant challenge. Existing approaches primarily rely on reactive measures such as blocking or suspending offensive messages, with emerging strategies focusing on proactive measurements like detoxification and counterspeech. In our work, which we call HATEPRISM, we conduct a comprehensive examination of hate speech regulations and strategies from three perspectives: country regulations, social platform policies, and NLP research datasets. Our findings reveal significant inconsistencies in hate speech definitions and moderation practices across jurisdictions and platforms, alongside a lack of alignment with research efforts. Based on these insights, we suggest ideas and research direction for further exploration of a unified framework for automated hate speech moderation incorporating diverse strategies.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning

🧭 Keyword Pioneer — proactive mitigation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Naquee Rizwan , Seid Muhie Yimam , Daryna Dementieva , Dr. Florian Skupin , Tim Fischer , Daniil Moskovskiy , Aarushi Ajay Borkar , Robert Geislinger , Punyajoy Saha , Sarthak Roy , Martin Semmann , Alexander Panchenko , Chris Biemann , Animesh Mukherjee

Topics

Artificial Intelligence > Core AI > Responsible AI Machine Learning > Application Areas > Fairness

Keywords

content moderation hate speech detection proactive mitigation

Download PDF

Graphically Speaking: Unmasking Abuse in Social Media with Conversation Insights 2025

CodeTool: Enhancing Programmatic Tool Invocation of LLMs via Process Supervision 2025

Structural Deep Encoding for Table Question Answering 2025

Vision-aided Unsupervised Constituency Parsing with Multi-MLLM Debating 2025

HatePRISM: Policies, Platforms, and Research Integration. Advancing NLP for Hate Speech Proactive Mitigation

Abstract

Authors

Topics

Keywords

Related papers