Using Morphological Knowledge in Open-Vocabulary Neural Language Models

Austin Matthews; Graham Neubig; Chris Dyer

2018 NAACL NAACL 2018

Using Morphological Knowledge in Open-Vocabulary Neural Language Models

Abstract

AbstractLanguages with productive morphology pose problems for language models that generate words from a fixed vocabulary. Although character-based models allow any possible word type to be generated, they are linguistically naïve: they must discover that words exist and are delimited by spaces—basic linguistic facts that are built in to the structure of word-based models. We introduce an open-vocabulary language model that incorporates more sophisticated linguistic knowledge by predicting words using a mixture of three generative processes: (1) by generating words as a sequence of characters, (2) by directly generating full word forms, and (3) by generating words as a sequence of morphemes that are combined using a hand-written morphological analyzer. Experiments on Finnish, Turkish, and Russian show that our model outperforms character sequence models and other strong baselines on intrinsic and extrinsic measures. Furthermore, we show that our model learns to exploit morphological knowledge encoded in the analyzer, and, as a byproduct, it can perform effective unsupervised morphological disambiguation.

🌉 Interdisciplinary Bridge — Deep Learning and Interdisciplinary and Natural Language Processing

🧭 Keyword Pioneer — morphological knowledge

🐝 Cross-Pollinator — Artificial Intelligence, Deep Learning, Interdisciplinary, Machine Learning, Natural Language Processing, Speech & Audio

Authors

Austin Matthews , Graham Neubig , Chris Dyer

Topics

Deep Learning > Architectures > Neural Networks Natural Language Processing > Generation > Language Modeling Interdisciplinary > Linguistics > Morphology

Keywords

open-vocabulary language model morphological knowledge character sequence morpheme sequence unsupervised disambiguation multilingual modeling

Download PDF

Related papers

A Melody-Conditioned Lyrics Language Model 2018

Before Name-Calling: Dynamics and Triggers of Ad Hominem Fallacies in Web Argumentation 2018

Automated Essay Scoring in the Presence of Biased Ratings 2018

Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input 2018

QuickEdit: Editing Text & Translations by Crossing Words Out 2018