Mind Your Language: Abuse and Offense Detection for Code-Switched Languages

Raghav Kapoor; Yaman Kumar; Kshitij Rajput; Rajiv Ratn Shah; Ponnurangam Kumaraguru; Roger Zimmermann

2019 AAAI AAAI 2019

Mind Your Language: Abuse and Offense Detection for Code-Switched Languages

Abstract

Abstract In multilingual societies like the Indian subcontinent, use of code-switched languages is much popular and convenient for the users. In this paper, we study offense and abuse detection in the code-switched pair of Hindi and English (i.e, Hinglish), the pair that is the most spoken. The task is made difficult due to non-fixed grammar, vocabulary, semantics and spellings of Hinglish language. We apply transfer learning and make a LSTM based model for hate speech classification. This model surpasses the performance shown by the current best models to establish itself as the state-of-the-art in the unexplored domain of Hinglish offensive text classification. We also release our model and the embeddings trained for research purposes.

🚀 Conference Pioneer — AAAI 2019

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🧭 Keyword Pioneer — offense detection

🐣 Hot Topic Early Bird — offense detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Raghav Kapoor , Yaman Kumar , Kshitij Rajput , Rajiv Ratn Shah , Ponnurangam Kumaraguru , Roger Zimmermann

Topics

Natural Language Processing > Understanding > Sentiment Analysis Natural Language Processing > Resources & Methods > Multilingual NLP Natural Language Processing > Applications > Sentiment Analysis Deep Learning > Learning Types > Transfer Learning Artificial Intelligence > Core AI > Natural Language Processing

Keywords

transfer learning sentiment analysis multilingual nlp abuse detection long short-term memory hate speech detection offense detection

Download PDF

Related papers

Cooperative Multimodal Approach to Depression Detection in Twitter 2019

Learning to Align Question and Answer Utterances in Customer Service Conversation with Recurrent Pointer Networks 2019

Community Detection in Social Networks Considering Topic Correlations 2019

Session-Based Recommendation with Graph Neural Networks 2019

Blameworthiness in Multi-Agent Settings 2019