SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research

Dimosthenis Antypas; Asahi Ushio; Francesco Barbieri; Leonardo Neves; Kiamehr Rezaee; Luis Espinosa-Anke; Jiaxin Pei; Jose Camacho-Collados

2023 EMNLP EMNLP 2023

SuperTweetEval: A Challenging, Unified and Heterogeneous Benchmark for Social Media NLP Research

Abstract

AbstractDespite its relevance, the maturity of NLP for social media pales in comparison with general-purpose models, metrics and benchmarks. This fragmented landscape makes it hard for the community to know, for instance, given a task, which is the best performing model and how it compares with others. To alleviate this issue, we introduce a unified benchmark for NLP evaluation in social media, SuperTweetEval, which includes a heterogeneous set of tasks and datasets combined, adapted and constructed from scratch. We benchmarked the performance of a wide range of models on SuperTweetEval and our results suggest that, despite the recent advances in language modelling, social media remains challenging.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Data Science & Analytics and Interdisciplinary and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — natural language benchmark

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Dimosthenis Antypas , Asahi Ushio , Francesco Barbieri , Leonardo Neves , Kiamehr Rezaee , Luis Espinosa-Anke , Jiaxin Pei , Jose Camacho-Collados

Topics

Natural Language Processing > Applications Natural Language Processing > Applications > Text Classification Natural Language Processing > Resources & Methods > Large Language Models Interdisciplinary > Social > Social Media Analysis Machine Learning > Optimization & Theory > Evaluation Artificial Intelligence > Core AI > Natural Language Processing Data Science & Analytics > Applications > Social Media Analysis

Keywords

benchmark evaluation unified benchmark text classification multilingual nlp language model evaluation social media nlp natural language benchmark heterogeneous benchmark

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023