LifeTox: Unveiling Implicit Toxicity in Life Advice

Minbeom Kim; Jahyun Koo; Hwanhee Lee; Joonsuk Park; Hwaran Lee; Kyomin Jung

2024 NAACL NAACL 2024

LifeTox: Unveiling Implicit Toxicity in Life Advice

Abstract

AbstractAs large language models become increasingly integrated into daily life, detecting implicit toxicity across diverse contexts is crucial. To this end, we introduce LifeTox, a dataset designed for identifying implicit toxicity within a broad range of advice-seeking scenarios. Unlike existing safety datasets, LifeTox comprises diverse contexts derived from personal experiences through open-ended questions. Our experiments demonstrate that RoBERTa fine-tuned on LifeTox matches or surpasses the zero-shot performance of large language models in toxicity classification tasks. These results underscore the efficacy of LifeTox in addressing the complex challenges inherent in implicit toxicity. We open-sourced the dataset and the LifeTox moderator family; 350M, 7B, and 13B.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Minbeom Kim , Jahyun Koo , Hwanhee Lee , Joonsuk Park , Hwaran Lee , Kyomin Jung

Topics

Natural Language Processing > Understanding > Sentiment Analysis Natural Language Processing > Applications > Text Classification

Keywords

sentiment analysis text classification toxicity detection implicit toxicity

Download PDF

Related papers

Working Alliance Transformer for Psychotherapy Dialogue Classification 2024

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences 2024

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 2024

TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation 2024

Extractive Summarization with Text Generator 2024