MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation

Arkadiusz Modzelewski; Giovanni Da San Martino; Pavel Savov; Magdalena Anna Wilczyńska; Adam Wierzbicki

2024 EMNLP EMNLP 2024

MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation

Abstract

AbstractThis study presents a novel corpus of 15,356 Polish web articles, including articles identified as containing disinformation. Our dataset enables a multifaceted understanding of disinformation. We present a distinctive multilayered methodology for annotating disinformation in texts. What sets our corpus apart is its focus on uncovering hidden intent and manipulation in disinformative content. A team of experts annotated each article with multiple labels indicating both disinformation creators’ intents and the manipulation techniques employed. Additionally, we set new baselines for binary disinformation detection and two multiclass multilabel classification tasks: manipulation techniques and intention types classification.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — manipulation technique

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Arkadiusz Modzelewski , Giovanni Da San Martino , Pavel Savov , Magdalena Anna Wilczyńska , Adam Wierzbicki

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Representation Learning Natural Language Processing > Applications > Text Classification Machine Learning > Learning Types > Multi-Label Classification

Keywords

text classification multi-label classification intent classification corpus linguistics multilabel classification disinformation detection corpus annotation manipulation detection intention classification manipulation technique

Download PDF

Related papers

EmbodiedBERT: Cognitively Informed Metaphor Detection Incorporating Sensorimotor Information 2024

Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation 2024

Learning to Extract Structured Entities Using Language Models 2024

Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis 2024

CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages 2024