The Locality and Symmetry of Positional Encodings

Lihu Chen; Gael Varoquaux; Fabian Suchanek

EMNLP 2023

Abstract

Positional Encodings (PEs) are used to inject word-order information into transformer-based language models. While they can significantly enhance the quality of sentence representations, their specific contribution to language models is not fully understood, especially given recent findings that various positional encodings are insensitive to word order. In this work, we conduct a systematic study of positional encodings in Bidirectional Masked Language Models (BERT-style), which complements existing work in three aspects: (1) We uncover the core function of PEs by identifying two common properties, Locality and Symmetry; (2) We show that the two properties are closely correlated with the performances of downstream tasks; (3) We quantify the weakness of current PEs by introducing two new probing tasks, on which current PEs perform poorly. We believe that these results are the basis for developing better PEs for transformer-based language models.

Topics

Machine Learning > Core Methods > Representation Learning
Machine Learning > Learning Types > Representation Learning
Machine Learning > Optimization & Theory > Representation Learning
Deep Learning > Architectures > Transformers
Deep Learning > Models > Transformers
Deep Learning > Learning Types > Representation Learning
Deep Learning > Techniques > Representation Learning
Natural Language Processing > Resources & Methods > Text Representation
Natural Language Processing > Resources & Methods > Language Modeling
Artificial Intelligence > Core AI > Language

Keywords

representation learning, word order, language model, positional encoding, transformer language model, sentence representation, probing task, bidirectional masked language model
