2020
CONLL
CoNLL 2020
Processing effort is a poor predictor of cross-linguistic word order frequency
Abstract
AbstractSome have argued that word orders which are more difficult to process should be rarer cross-linguistically. Our current study fails to replicate the results of Maurits, Navarro, and Perfors (2010), who used an entropy-based Uniform Information Density (UID) measure to moderately predict the Greenbergian typology of transitive word orders. We additionally report an inability of three measures of processing difficulty — entropy-based UID, surprisal-based UID, and pointwise mutual information — to correctly predict the correct typological distribution, using transitive constructions from 20 languages in the Universal Dependencies project (version 2.5). However, our conclusions are limited by data sparsity.
🌉
Interdisciplinary Bridge
— Mathematics & Optimization and Natural Language Processing
🧭
Keyword Pioneer
— processing difficulty
🐝
Cross-Pollinator
— Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Speech & Audio