Categorizing Comparative Sentences

Alexander Panchenko; Alexander Bondarenko; Mirco Franzek; Matthias Hagen; Chris Biemann

2019 ACL ACL 2019

Categorizing Comparative Sentences

Abstract

AbstractWe tackle the tasks of automatically identifying comparative sentences and categorizing the intended preference (e.g., “Python has better NLP libraries than MATLAB” → Python, better, MATLAB). To this end, we manually annotate 7,199 sentences for 217 distinct target item pairs from several domains (27% of the sentences contain an oriented comparison in the sense of “better” or “worse”). A gradient boosting model based on pre-trained sentence embeddings reaches an F1 score of 85% in our experimental evaluation. The model can be used to extract comparative sentences for pro/con argumentation in comparative / argument search engines or debating technologies.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — preference extraction

🐣 Hot Topic Early Bird — gradient boosting

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Alexander Panchenko , Alexander Bondarenko , Mirco Franzek , Matthias Hagen , Chris Biemann

Topics

Machine Learning > Core Methods > Classification Machine Learning > Core Methods > Representation Learning Natural Language Processing > Applications > Sentiment Analysis Machine Learning > Learning Types > Classification

Keywords

sentiment analysis text classification gradient boosting sentence embedding preference extraction comparative sentence

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019