2021 EMNLP EMNLP 2021

SWEAT: Scoring Polarization of Topics across Different Corpora

Abstract

AbstractUnderstanding differences of viewpoints across corpora is a fundamental task for computational social sciences. In this paper, we propose the Sliced Word Embedding Association Test (SWEAT), a novel statistical measure to compute the relative polarization of a topical wordset across two distributional representations. To this end, SWEAT uses two additional wordsets, deemed to have opposite valence, to represent two different poles. We validate our approach and illustrate a case study to show the usefulness of the introduced measure.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Data Science & Analytics and Interdisciplinary and Machine Learning and Natural Language Processing
🧭 Keyword Pioneer — topic polarization
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio