ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

Badr AlKhamissi; Faisal Ladhak; Srinivasan Iyer; Veselin Stoyanov; Zornitsa Kozareva; Xian Li; Pascale Fung; Lambert Mathias; Asli Celikyilmaz; Mona Diab

2022 EMNLP EMNLP 2022

ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

Abstract

AbstractHate speech detection is complex; it relies on commonsense reasoning, knowledge of stereotypes, and an understanding of social nuance that differs from one culture to the next. It is also difficult to collect a large-scale hate speech annotated dataset. In this work, we frame this problem as a few-shot learning task, and show significant gains with decomposing the task into its “constituent” parts. In addition, we see that infusing knowledge from reasoning datasets (e.g. ATOMIC2020) improves the performance even further. Moreover, we observe that the trained models generalize to out-of-distribution datasets, showing the superiority of task decomposition and knowledge infusion compared to previously used methods. Concretely, our method outperforms the baseline by 17.83% absolute gain in the 16-shot case.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Natural Language Processing

🐣 Hot Topic Early Bird — task decomposition

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Badr AlKhamissi , Faisal Ladhak , Srinivasan Iyer , Veselin Stoyanov , Zornitsa Kozareva , Xian Li , Pascale Fung , Lambert Mathias , Asli Celikyilmaz , Mona Diab

Topics

Artificial Intelligence > Learning Paradigms > Few-Shot Learning Natural Language Processing > Applications > Text Classification Deep Learning > Techniques > Knowledge Distillation Artificial Intelligence > Core AI > Knowledge Distillation

Keywords

few-shot learning task decomposition commonsense reasoning hate speech detection knowledge infusion

Download PDF

Generative Entity Typing with Curriculum Learning 2022

Towards Reinterpreting Neural Topic Models via Composite Activations 2022

Weakly Supervised Headline Dependency Parsing 2022

Cross-modal Transfer Between Vision and Language for Protest Detection 2022

ToKen: Task Decomposition and Knowledge Infusion for Few-Shot Hate Speech Detection

Abstract

Authors

Topics

Keywords

Related papers