Feature Optimization for Predicting Readability of Arabic L1 and L2

Hind Saddiki; Nizar Habash; Violetta Cavalli-Sforza; Muhamed Al Khalil

2018 ACL ACL 2018

Feature Optimization for Predicting Readability of Arabic L1 and L2

Abstract

AbstractAdvances in automatic readability assessment can impact the way people consume information in a number of domains. Arabic, being a low-resource and morphologically complex language, presents numerous challenges to the task of automatic readability assessment. In this paper, we present the largest and most in-depth computational readability study for Arabic to date. We study a large set of features with varying depths, from shallow words to syntactic trees, for both L1 and L2 readability tasks. Our best L1 readability accuracy result is 94.8% (75% error reduction from a commonly used baseline). The comparable results for L2 are 72.4% (45% error reduction). We also demonstrate the added value of leveraging L1 features for L2 readability prediction.

🌉 Interdisciplinary Bridge — Interdisciplinary and Machine Learning and Natural Language Processing

📈 Trend Setter — Classification

🧭 Keyword Pioneer — feature optimization

🐣 Hot Topic Early Bird — arabic language

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio

Authors

Hind Saddiki , Nizar Habash , Violetta Cavalli-Sforza , Muhamed Al Khalil

Topics

Machine Learning > Core Methods > Classification Natural Language Processing > Understanding > Syntax Natural Language Processing > Applications > Text Classification Interdisciplinary > Linguistics > Computational Linguistics Machine Learning > Application Areas > Classification

Keywords

readability assessment feature optimization morphological analysis arabic language syntactic tree second language learning automatic readability assessment arabic language processing

Download PDF

Related papers

Economic Event Detection in Company-Specific News Text 2018

Investigating Effective Parameters for Fine-tuning of Word Embeddings Using Only a Small Corpus 2018

SemAxis: A Lightweight Framework to Characterize Domain-Specific Word Semantics Beyond Sentiment 2018

Fighting Offensive Language on Social Media with Unsupervised Text Style Transfer 2018

Affordances in Grounded Language Learning 2018