When is a bishop not like a rook? When it’s like a rabbi! Multi-prototype BERT embeddings for estimating semantic relationships

Gabriella Chronis; Katrin Erk

2020 CONLL CoNLL 2020

When is a bishop not like a rook? When it’s like a rabbi! Multi-prototype BERT embeddings for estimating semantic relationships

Abstract

AbstractThis paper investigates contextual language models, which produce token representations, as a resource for lexical semantics at the word or type level. We construct multi-prototype word embeddings from bert-base-uncased (Devlin et al., 2018). These embeddings retain contextual knowledge that is critical for some type-level tasks, while being less cumbersome and less subject to outlier effects than exemplar models. Similarity and relatedness estimation, both type-level tasks, benefit from this contextual knowledge, indicating the context-sensitivity of these processes. BERT’s token level knowledge also allows the testing of a type-level hypothesis about lexical abstractness, demonstrating the relationship between token-level phenomena and type-level concreteness ratings. Our findings provide important insight into the interpretability of BERT: layer 7 approximates semantic similarity, while the final layer (11) approximates relatedness.

❓ The Questioner

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing

📈 Trend Setter — Lexical Semantics

🐝 Cross-Pollinator — Artificial Intelligence, Computer Vision, Data Science & Analytics, Deep Learning, Interdisciplinary, Machine Learning, Mathematics & Optimization, Natural Language Processing, Robotics, Speech & Audio

🐣 Hot Topic Early Bird — semantic relatedness

Authors

Gabriella Chronis , Katrin Erk

Topics

Artificial Intelligence > Core AI > Multimodal Learning Machine Learning > Core Methods > Embedding Learning Natural Language Processing > Resources & Methods > Large Language Models Natural Language Processing > Resources & Methods > Lexical Semantics

Keywords

lexical semantics semantic similarity word embedding contextual embedding multi-prototype embedding semantic relatedness contextual language model

Download PDF

Related papers

Recurrent babbling: evaluating the acquisition of grammar from limited input data 2020

Finding The Right One and Resolving it 2020

Enriching Word Embeddings with Temporal and Spatial Information 2020

Interpreting Attention Models with Human Visual Attention in Machine Reading Comprehension 2020

Bridging Information-Seeking Human Gaze and Machine Reading Comprehension 2020