Superlim: A Swedish Language Understanding Evaluation Benchmark

Aleksandrs Berdicevskis; Gerlof Bouma; Robin Kurtz; Felix Morger; Joey Öhman; Yvonne Adesam; Lars Borin; Dana Dannélls; Markus Forsberg; Tim Isbister; Anna Lindahl; Martin Malmsten; Faton Rekathati; Magnus Sahlgren; Elena Volodina; Love Börjeson; Simon Hengchen; Nina Tahmasebi

EMNLP 2023

Abstract

We present Superlim, a multi-task NLP benchmark and analysis platform for evaluating Swedish language models, a counterpart to the English-language (Super)GLUE suite. We describe the dataset, the tasks, and the leaderboard, and report the baseline results yielded by a reference implementation. The tested models do not approach ceiling performance on any of the tasks, which suggests that Superlim is truly difficult, a desirable quality for a benchmark. We address methodological challenges, such as mitigating Anglocentric bias when creating datasets for a less-resourced language, choosing the most appropriate measures, documenting the datasets, and making the leaderboard convenient and transparent. We also highlight other potential uses of the dataset, such as the evaluation of cross-lingual transfer learning.

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning
Machine Learning > Optimization & Theory > Statistical Learning
Natural Language Processing > Resources & Methods > Multilingual NLP

Keywords

cross-lingual transfer; resource-scarce language; nlp benchmark; swedish language model; superlim benchmark
