ALUE: Arabic Language Understanding Evaluation

Haitham Seelawi; Ibraheem Tuffaha; Mahmoud Gzawi; Wael Farhan; Bashar Talafha; Riham Badawi; Zyad Sober; Oday Al-Dweik; Abed Alhakim Freihat; Hussein Al-Natsheh

2021 EACL EACL 2021

ALUE: Arabic Language Understanding Evaluation

Abstract

AbstractThe emergence of Multi-task learning (MTL)models in recent years has helped push thestate of the art in Natural Language Un-derstanding (NLU). We strongly believe thatmany NLU problems in Arabic are especiallypoised to reap the benefits of such models. Tothis end we propose the Arabic Language Un-derstanding Evaluation Benchmark (ALUE),based on 8 carefully selected and previouslypublished tasks. For five of these, we providenew privately held evaluation datasets to en-sure the fairness and validity of our benchmark. We also provide a diagnostic dataset to helpresearchers probe the inner workings of theirmodels.Our initial experiments show thatMTL models outperform their singly trainedcounterparts on most tasks. But in order to en-tice participation from the wider community,we stick to publishing singly trained baselinesonly. Nonetheless, our analysis reveals thatthere is plenty of room for improvement inArabic NLU. We hope that ALUE will playa part in helping our community realize someof these improvements. Interested researchersare invited to submit their results to our online,and publicly accessible leaderboard.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — arabic language understanding

🐣 Hot Topic Early Bird — language model evaluation

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Haitham Seelawi , Ibraheem Tuffaha , Mahmoud Gzawi , Wael Farhan , Bashar Talafha , Riham Badawi , Zyad Sober , Oday Al-Dweik , Abed Alhakim Freihat , Hussein Al-Natsheh

Topics

Artificial Intelligence > Learning Paradigms > Transfer Learning Machine Learning > Learning Types > Semi-Supervised Learning Natural Language Processing > Applications > Text Classification

Keywords

multi-task learning sentiment analysis text classification language model evaluation natural language understanding arabic language understanding

Download PDF

Related papers

Joint Coreference Resolution and Character Linking for Multiparty Conversation 2021

Progressively Pretrained Dense Corpus Index for Open-Domain Question Answering 2021

Crisscrossed Captions: Extended Intramodal and Intermodal Semantic Similarity Judgments for MS-COCO 2021

Representations for Question Answering from Documents with Tables and Text 2021

Gender and Racial Fairness in Depression Research using Social Media 2021