Entities as Experts: Sparse Memory Access with Entity Supervision

Thibault Fevry; Livio Baldini Soares; Nicholas FitzGerald; Eunsol Choi; Tom Kwiatkowski

2020 EMNLP EMNLP 2020

Entities as Experts: Sparse Memory Access with Entity Supervision

Abstract

AbstractWe focus on the problem of capturing declarative knowledge about entities in the learned parameters of a language model. We introduce a new model—Entities as Experts (EaE)—that can access distinct memories of the entities mentioned in a piece of text. Unlike previous efforts to integrate entity knowledge into sequence models, EaE’s entity representations are learned directly from text. We show that EaE’s learned representations capture sufficient knowledge to answer TriviaQA questions such as “Which Dr. Who villain has been played by Roger Delgado, Anthony Ainley, Eric Roberts?”, outperforming an encoder-generator Transformer model with 10x the parameters on this task. According to the Lama knowledge probes, EaE contains more factual knowledge than a similar sized Bert, as well as previous approaches that integrate external sources of entity knowledge. Because EaE associates parameters with specific entities, it only needs to access a fraction of its parameters at inference time, and we show that the correct identification and representation of entities is essential to EaE’s performance.

🌉 Interdisciplinary Bridge — Deep Learning and Knowledge & Reasoning and Machine Learning and Natural Language Processing

📈 Trend Setter — Retrieval-Augmented Generation

🧭 Keyword Pioneer — entity supervision

🐣 Hot Topic Early Bird — factual knowledge

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Thibault Fevry , Livio Baldini Soares , Nicholas FitzGerald , Eunsol Choi , Tom Kwiatkowski

Topics

Machine Learning > Core Methods > Embedding Learning Natural Language Processing > Applications > Question Answering Natural Language Processing > Resources & Methods > Large Language Models Knowledge & Reasoning > Representation > Knowledge Graphs Knowledge & Reasoning > Representation > Knowledge Representation Natural Language Processing > Resources & Methods > Retrieval-Augmented Generation Deep Learning > Learning Types > Representation Learning Deep Learning > Models > Language Models

Keywords

factual knowledge knowledge probing language model knowledge retrieval entity memory entity representation entity supervision entity knowledge memory access sparse memory access

Download PDF

Related papers

Fast semantic parsing with well-typedness guarantees 2020

Detecting Objectifying Language in Online Professor Reviews 2020

Analogous Process Structure Induction for Sub-event Sequence Prediction 2020

Aspect Sentiment Classification with Aspect-Specific Opinion Spans 2020

Robust and Interpretable Grounding of Spatial References with Relation Networks 2020