2018 ACL ACL 2018

Obtaining Reliable Human Ratings of Valence, Arousal, and Dominance for 20,000 English Words

Abstract

AbstractWords play a central role in language and thought. Factor analysis studies have shown that the primary dimensions of meaning are valence, arousal, and dominance (VAD). We present the NRC VAD Lexicon, which has human ratings of valence, arousal, and dominance for more than 20,000 English words. We use Best–Worst Scaling to obtain fine-grained scores and address issues of annotation consistency that plague traditional rating scale methods of annotation. We show that the ratings obtained are vastly more reliable than those in existing lexicons. We also show that there exist statistically significant differences in the shared understanding of valence, arousal, and dominance across demographic variables such as age, gender, and personality.

🌉 Interdisciplinary Bridge — Data Science & Analytics and Interdisciplinary and Machine Learning and Natural Language Processing
🧭 Keyword Pioneer — lexicon construction
🐣 Hot Topic Early Bird — affective computing
🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors