High-risk learning: acquiring new word vectors from tiny data

Aurélie Herbelot; Marco Baroni

2017 EMNLP EMNLP 2017

High-risk learning: acquiring new word vectors from tiny data

Abstract

AbstractDistributional semantics models are known to struggle with small data. It is generally accepted that in order to learn ‘a good vector’ for a word, a model must have sufficient examples of its usage. This contradicts the fact that humans can guess the meaning of a word from a few occurrences only. In this paper, we show that a neural language model such as Word2Vec only necessitates minor modifications to its standard architecture to learn new terms from tiny data, using background knowledge from a previously learnt semantic space. We test our model on word definitions and on a nonce task involving 2-6 sentences’ worth of context, showing a large increase in performance over state-of-the-art models on the definitional task.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Natural Language Processing

📈 Trend Setter — Language Models

🧭 Keyword Pioneer — tiny datum

🐣 Hot Topic Early Bird — few-shot learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Aurélie Herbelot , Marco Baroni

Topics

Deep Learning > Architectures > Neural Networks Natural Language Processing > Resources & Methods > Text Representation Machine Learning > Learning Paradigms > Few-Shot Learning Machine Learning > Learning Types > Representation Learning Natural Language Processing > Resources & Methods > Language Modeling Deep Learning > Models > Language Models

Keywords

representation learning few-shot learning semantic space distributional semantics background knowledge neural language model word vector tiny datum nonce task

Download PDF

Related papers

Reinforced Video Captioning with Entailment Rewards 2017

Cross-lingual Character-Level Neural Morphological Tagging 2017

Inter-Weighted Alignment Network for Sentence Pair Modeling 2017

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings 2017

An Empirical Analysis of Edit Importance between Document Versions 2017