Comparing Character-level Neural Language Models Using a Lexical Decision Task

Gaël Le Godais; Tal Linzen; Emmanuel Dupoux

2017 EACL EACL 2017

Comparing Character-level Neural Language Models Using a Lexical Decision Task

Abstract

AbstractWhat is the information captured by neural network models of language? We address this question in the case of character-level recurrent neural language models. These models do not have explicit word representations; do they acquire implicit ones? We assess the lexical capacity of a network using the lexical decision task common in psycholinguistics: the system is required to decide whether or not a string of characters forms a word. We explore how accuracy on this task is affected by the architecture of the network, focusing on cell type (LSTM vs. SRN), depth and width. We also compare these architectural properties to a simple count of the parameters of the network. The overall number of parameters in the network turns out to be the most important predictor of accuracy; in particular, there is little evidence that deeper networks are beneficial for this task.

🌉 Interdisciplinary Bridge — Deep Learning and Interdisciplinary and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — character-level language model

🐣 Hot Topic Early Bird — network architecture

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Gaël Le Godais , Tal Linzen , Emmanuel Dupoux

Topics

Machine Learning > Optimization & Theory > Statistical Learning Deep Learning > Architectures > Neural Networks Natural Language Processing > Generation > Language Modeling Interdisciplinary > Linguistics > Computational Linguistics

Keywords

neural network architecture network architecture recurrent neural network lexical decision character-level language model lexical decision task parameter count

Download PDF

Related papers

Cross-Lingual Dependency Parsing with Late Decoding for Truly Low-Resource Languages 2017

Learning and Knowledge Transfer with Memory Networks for Machine Comprehension 2017

Is this a Child, a Girl or a Car? Exploring the Contribution of Distributional Similarity to Learning Referential Word Meanings 2017

Building Web-Interfaces for Vector Semantic Models with the WebVectors Toolkit 2017

Assessing Convincingness of Arguments in Online Debates with Limited Number of Features 2017