Fine-tuning for multi-domain and multi-label uncivil language detection

Kadir Bulut Ozler; Kate Kenski; Steve Rains; Yotam Shmargad; Kevin Coe; Steven Bethard

2020 EMNLP EMNLP 2020

Fine-tuning for multi-domain and multi-label uncivil language detection

Abstract

AbstractIncivility is a problem on social media, and it comes in many forms (name-calling, vulgarity, threats, etc.) and domains (microblog posts, online news comments, Wikipedia edits, etc.). Training machine learning models to detect such incivility must handle the multi-label and multi-domain nature of the problem. We present a BERT-based model for incivility detection and propose several approaches for training it for multi-label and multi-domain datasets. We find that individual binary classifiers outperform a joint multi-label classifier, and that simply combining multiple domains of training data outperforms other recently-proposed fine tuning strategies. We also establish new state-of-the-art performance on several incivility detection datasets.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Multi-Label Classification

🧭 Keyword Pioneer — incivility detection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Kadir Bulut Ozler , Kate Kenski , Steve Rains , Yotam Shmargad , Kevin Coe , Steven Bethard

Topics

Machine Learning > Core Methods > Classification Natural Language Processing > Applications > Text Classification Deep Learning > Models > Transformers Deep Learning > Learning Types > Multi-Label Classification Deep Learning > Learning Types > Multi-Domain Learning

Keywords

text classification multi-label classification multi-domain learning bert-based model incivility detection

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020