MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification

Chadi Helwe; Tom Calamai; Pierre-Henri Paris; Chloé Clavel; Fabian Suchanek

2024 NAACL NAACL 2024

MAFALDA: A Benchmark and Comprehensive Study of Fallacy Detection and Classification

Abstract

AbstractWe introduce MAFALDA, a benchmark for fallacy classification that merges and unites previous fallacy datasets. It comes with a taxonomy that aligns, refines, and unifies existing classifications of fallacies. We further provide a manual annotation of a part of the dataset together with manual explanations for each annotation. We propose a new annotation scheme tailored for subjective NLP tasks, and a new evaluation method designed to handle subjectivity. We then evaluate several language models under a zero-shot learning setting and human performances on MAFALDA to assess their capability to detect and classify fallacies.

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Chadi Helwe , Tom Calamai , Pierre-Henri Paris , Chloé Clavel , Fabian Suchanek

Topics

Machine Learning > Core Methods > Classification Machine Learning > Learning Types > Zero-Shot Learning

Keywords

benchmark evaluation natural language processing text classification language model fallacy detection

Download PDF

Related papers

Working Alliance Transformer for Psychotherapy Dialogue Classification 2024

Named Entity Recognition Under Domain Shift via Metric Learning for Life Sciences 2024

Assessing Logical Puzzle Solving in Large Language Models: Insights from a Minesweeper Case Study 2024

TelME: Teacher-leading Multimodal Fusion Network for Emotion Recognition in Conversation 2024

Extractive Summarization with Text Generator 2024