Reusing Weights in Subword-Aware Neural Language Models

Zhenisbek Assylbekov; Rustem Takhanov

2018 NAACL NAACL 2018

Reusing Weights in Subword-Aware Neural Language Models

Abstract

AbstractWe propose several ways of reusing subword embeddings and other weights in subword-aware neural language models. The proposed techniques do not benefit a competitive character-aware model, but some of them improve the performance of syllable- and morpheme-aware models while showing significant reductions in model sizes. We discover a simple hands-on principle: in a multi-layer input embedding model, layers should be tied consecutively bottom-up if reused at output. Our best morpheme-aware model with properly reused weights beats the competitive word-level model by a large margin across multiple languages and has 20%-87% fewer parameters.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

🧭 Keyword Pioneer — morpheme-aware model

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Zhenisbek Assylbekov , Rustem Takhanov

Topics

Deep Learning > Architectures > Neural Networks Deep Learning > Techniques > Model Architecture Natural Language Processing > Generation > Language Modeling Machine Learning > Application Areas > Model Compression

Keywords

language modeling weight sharing parameter reduction neural language model subword embedding weight reuse morpheme-aware model

Download PDF

Related papers

A Melody-Conditioned Lyrics Language Model 2018

Before Name-Calling: Dynamics and Triggers of Ad Hominem Fallacies in Web Argumentation 2018

Automated Essay Scoring in the Presence of Biased Ratings 2018

Neural Automated Essay Scoring and Coherence Modeling for Adversarially Crafted Input 2018

QuickEdit: Editing Text & Translations by Crossing Words Out 2018