ConStance: Modeling Annotation Contexts to Improve Stance Classification

Kenneth Joseph; Lisa Friedland; William Hobbs; David Lazer; Oren Tsur

2017 EMNLP EMNLP 2017

ConStance: Modeling Annotation Contexts to Improve Stance Classification

Abstract

AbstractManual annotations are a prerequisite for many applications of machine learning. However, weaknesses in the annotation process itself are easy to overlook. In particular, scholars often choose what information to give to annotators without examining these decisions empirically. For subjective tasks such as sentiment analysis, sarcasm, and stance detection, such choices can impact results. Here, for the task of political stance detection on Twitter, we show that providing too little context can result in noisy and uncertain annotations, whereas providing too strong a context may cause it to outweigh other signals. To characterize and reduce these biases, we develop ConStance, a general model for reasoning about annotations across information conditions. Given conflicting labels produced by multiple annotators seeing the same instances with different contexts, ConStance simultaneously estimates gold standard labels and also learns a classifier for new instances. We show that the classifier learned by ConStance outperforms a variety of baselines at predicting political stance, while the model’s interpretable parameters shed light on the effects of each context.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

📈 Trend Setter — Interpretability

🧭 Keyword Pioneer — annotation bia

🐣 Hot Topic Early Bird — probabilistic modeling

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Kenneth Joseph , Lisa Friedland , William Hobbs , David Lazer , Oren Tsur

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Core Methods > Classification Natural Language Processing > Understanding > Sentiment Analysis Natural Language Processing > Applications > Text Classification Machine Learning > Bayesian & Probabilistic > Probabilistic Modeling Machine Learning > Learning Types > Transfer Learning Natural Language Processing > Applications > Sentiment Analysis Machine Learning > Learning Types > Classification Machine Learning > Learning Types > Interpretability

Keywords

probabilistic modeling text classification context modeling twitter datum stance classification annotation bia political stance detection annotation context interpretable parameter subjective task classification

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017