Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection

Maria Corkery; Yevgen Matusevych; Sharon Goldwater

2019 ACL ACL 2019

Are we there yet? Encoder-decoder neural networks as cognitive models of English past tense inflection

Abstract

AbstractThe cognitive mechanisms needed to account for the English past tense have long been a subject of debate in linguistics and cognitive science. Neural network models were proposed early on, but were shown to have clear flaws. Recently, however, Kirov and Cotterell (2018) showed that modern encoder-decoder (ED) models overcome many of these flaws. They also presented evidence that ED models demonstrate humanlike performance in a nonce-word task. Here, we look more closely at the behaviour of their model in this task. We find that (1) the model exhibits instability across multiple simulations in terms of its correlation with human data, and (2) even when results are aggregated across simulations (treating each simulation as an individual human participant), the fit to the human data is not strong—worse than an older rule-based model. These findings hold up through several alternative training regimes and evaluation measures. Although other neural architectures might do better, we conclude that there is still insufficient evidence to claim that neural nets are a good cognitive model for this task.

❓ The Questioner

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Interdisciplinary and Machine Learning

🧭 Keyword Pioneer — past tense inflection

🐣 Hot Topic Early Bird — cognitive modeling

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Speech & Audio

Authors

Maria Corkery , Yevgen Matusevych , Sharon Goldwater

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Optimization & Theory > Theory Deep Learning > Architectures > Neural Networks Interdisciplinary > Linguistics > Computational Linguistics Interdisciplinary > Cognitive Science > Cognitive Modeling Deep Learning > Learning Types > Representation Learning

Keywords

cognitive modeling rule-based model cognitive model past tense inflection encoder-decoder neural network english linguistics nonce-word task human-like performance

Download PDF

Related papers

What do phone embeddings learn about Phonology? 2019

Unsupervised Morphological Segmentation for Low-Resource Polysynthetic Languages 2019

Understanding Undesirable Word Embedding Associations 2019

Inferential Machine Comprehension: Answering Questions by Recursively Deducing the Evidence Chain from Text 2019

Domain Adaptation of Neural Machine Translation by Lexicon Induction 2019