Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset

Che-Wei Tsai; Yen-Hao Huang; Tsu-Keng Liao; Didier Fernando Salazar Estrada; Retnani Latifah; Yi-Shin Chen

EMNLP 2024

Abstract

In multi-person communications, conflicts often arise because each individual may have a different perspective. Additionally, commonly referenced offensive language datasets frequently neglect contextual information and are primarily constructed with a focus on intended offenses. This study suggests that conflicts are pivotal in revealing a broader range of human interactions, including instances of unintended offensive language. This paper proposes a conflict-based data collection method that utilizes inter-conflict cues in multi-person communications. By focusing on specific cue posts within conversation threads, the proposed approach effectively identifies relevant instances for analysis. Detailed analyses show that the proposed approach efficiently gathers data on subtly offensive content. The experimental results indicate that incorporating elements of conflict into data collection not only significantly enhances the comprehensiveness and accuracy of offensive language detection but also enriches our understanding of conflict dynamics in digital communication.
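The idea of collecting data around cue posts can be illustrated with a minimal sketch. This is not the authors' pipeline: the `Post` structure, the cue phrases in `CUE_PHRASES`, and the rule of pairing each cue post with its direct parent are all assumptions made for illustration, since the abstract does not specify the actual cues or the thread format.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Post:
    post_id: str
    parent_id: Optional[str]  # None for the root post of a thread
    text: str


# Hypothetical cue phrases signalling a conflict; the abstract does not list
# the real cues, so these are placeholders.
CUE_PHRASES = ("that's offensive", "i didn't mean", "no offense")


def is_cue_post(post: Post) -> bool:
    """Return True if the post text contains any assumed cue phrase."""
    lowered = post.text.lower()
    return any(phrase in lowered for phrase in CUE_PHRASES)


def collect_candidates(thread: list[Post]) -> list[tuple[Post, Post]]:
    """Pair each cue post with the parent post it replies to.

    The parent is kept as conversational context and as a candidate
    for containing (possibly unintended) offensive content.
    """
    by_id = {post.post_id: post for post in thread}
    pairs = []
    for post in thread:
        if is_cue_post(post) and post.parent_id in by_id:
            pairs.append((by_id[post.parent_id], post))
    return pairs


# Toy thread, invented for this example.
thread = [
    Post("1", None, "Anyone who likes that show has no taste."),
    Post("2", "1", "Wow, that's offensive. I love that show."),
    Post("3", "2", "Sorry, I didn't mean it that way."),
    Post("4", "1", "What time does it air?"),
]
candidates = collect_candidates(thread)
for parent, cue in candidates:
    print(f"candidate: {parent.text!r} <- cue: {cue.text!r}")
```

Keeping the parent post alongside the cue post preserves the context that, according to the abstract, existing offensive language datasets often neglect. A real collection method would likely need richer cue detection than substring matching.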

Topics

Machine Learning > Application Areas > Domain Adaptation
Natural Language Processing > Understanding > Sentiment Analysis
Natural Language Processing > Applications > Text Classification
Interdisciplinary > Social > Social Media Analysis
Natural Language Processing > Applications > Sentiment Analysis
Machine Learning > Learning Types > Classification
Data Science & Analytics > Applications > Social Media Analysis

Keywords

text classification, social media analysis, offensive language detection, social media, conflict analysis, data collection method, unintended offense
