Entailment Semantics Can Be Extracted from an Ideal Language Model

William Merrill; Alex Warstadt; Tal Linzen

CoNLL 2022

Abstract

Language models are often trained on text alone, without additional grounding. There is debate as to how much of natural language semantics can be inferred from such a procedure. We prove that entailment judgments between sentences can be extracted from an ideal language model that has perfectly learned its target distribution, assuming the training sentences are generated by Gricean agents, i.e., agents who follow fundamental principles of communication from the linguistic theory of pragmatics. We also show entailment judgments can be decoded from the predictions of a language model trained on such Gricean data. Our results reveal a pathway for understanding the semantic information encoded in unlabeled linguistic data and a potential framework for extracting semantics from language models.

Topics

Natural Language Processing > Resources & Methods > Lexical Semantics
Natural Language Processing > Resources & Methods > Natural Language Inference

Keywords

natural language inference, pragmatic reasoning, language model, semantic extraction, entailment judgment
