Supervised Learning of Universal Sentence Representations from Natural Language Inference Data

Alexis CONNEAU; Douwe Kiela; Holger Schwenk; Loic Barrault; Antoine Bordes

EMNLP 2017

Abstract

Many modern NLP systems rely on word embeddings, previously trained in an unsupervised manner on large corpora, as base features. Efforts to obtain embeddings for larger chunks of text, such as sentences, have however not been so successful. Several attempts at learning unsupervised representations of sentences have not reached satisfactory enough performance to be widely adopted. In this paper, we show how universal sentence representations trained using the supervised data of the Stanford Natural Language Inference datasets can consistently outperform unsupervised methods like SkipThought vectors on a wide range of transfer tasks. Much like how computer vision uses ImageNet to obtain features, which can then be transferred to other tasks, our work tends to indicate the suitability of natural language inference for transfer learning to other NLP tasks. Our encoder is publicly available.

Topics

Machine Learning > Core Methods > Representation Learning

Natural Language Processing > Resources & Methods > Natural Language Inference

Natural Language Processing > Resources & Methods > Text Representation

Natural Language Processing > Resources & Methods > Transfer Learning

Deep Learning > Learning Types > Representation Learning

Artificial Intelligence > Core AI > Natural Language Processing

Keywords

transfer learning, natural language inference, supervised learning, sentence embedding, sentence encoder, universal sentence representation, skip-thought vector

Related papers

Reinforced Video Captioning with Entailment Rewards (2017)

Cross-lingual Character-Level Neural Morphological Tagging (2017)

Inter-Weighted Alignment Network for Sentence Pair Modeling (2017)

Investigating Different Syntactic Context Types and Context Representations for Learning Word Embeddings (2017)

An Empirical Analysis of Edit Importance between Document Versions (2017)