The Linearity of the Effect of Surprisal on Reading Times across Languages

Weijie Xu; Jason Chon; Tianran Liu; Richard Futrell

2023 EMNLP EMNLP 2023

The Linearity of the Effect of Surprisal on Reading Times across Languages

Abstract

AbstractIn psycholinguistics, surprisal theory posits that the amount of online processing effort expended by a human comprehender per word positively correlates with the surprisal of that word given its preceding context. In addition to this overall correlation, more importantly, the specific quantitative form taken by the processing effort as a function of surprisal offers insights into the underlying cognitive mechanisms of language processing. Focusing on English, previous studies have looked into the linearity of surprisal on reading times. Here, we extend the investigation by examining eyetracking corpora of seven languages: Danish, Dutch, English, German, Japanese, Mandarin, and Russian. We find evidence for superlinearity in some languages, but the results are highly sensitive to which language model is used to estimate surprisal.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Interdisciplinary and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — cognitive mechanism

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Weijie Xu , Jason Chon , Tianran Liu , Richard Futrell

Topics

Machine Learning > Optimization & Theory > Statistical Learning Interdisciplinary > Linguistics > Computational Linguistics Interdisciplinary > Cognitive Science > Cognitive Modeling Interdisciplinary > Cognitive Science > Perception Natural Language Processing > Resources & Methods > Language Modeling Machine Learning > Learning Types > Evaluation Artificial Intelligence > Core AI > Language

Keywords

cross-linguistic analysis language model reading time language processing surprisal theory eye-tracking corpus cognitive mechanism cognitive processing

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023