2020
CONLL
CoNLL 2020
Cloze Distillation: Improving Neural Language Models with Human Next-Word Prediction
Abstract
AbstractContemporary autoregressive language models (LMs) trained purely on corpus data have been shown to capture numerous features of human incremental processing. However, past work has also suggested dissociations between corpus probabilities and human next-word predictions. Here we evaluate several state-of-the-art language models for their match to human next-word predictions and to reading time behavior from eye movements. We then propose a novel method for distilling the linguistic information implicit in human linguistic predictions into pre-trained LMs: Cloze Distillation. We apply this method to a baseline neural LM and show potential improvement in reading time prediction and generalization to held-out human cloze data.
🌉
Interdisciplinary Bridge
— Artificial Intelligence and Deep Learning and Machine Learning
🧭
Keyword Pioneer
— autoregressive language model
🐝
Cross-Pollinator
— Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Speech & Audio
Authors
Topics
Artificial Intelligence > Core AI > Human-AI Interaction
Machine Learning > Application Areas > Knowledge Distillation
Deep Learning > Models > Generative Models
Natural Language Processing > Resources & Methods > Language Modeling
Machine Learning > Learning Types > Knowledge Distillation
Deep Learning > Techniques > Knowledge Distillation