NLMs: Augmenting Negation in Language Models

Rituraj Singh; Rahul Kumar; Vivek Sridhar

2023 EMNLP EMNLP 2023

NLMs: Augmenting Negation in Language Models

Abstract

AbstractNegation is the fundamental component in a natural language that reverses the semantic meaning of a sentence. It plays an extremely important role across a wide range of applications, yet they are underrepresented in pre-trained language models (LMs), resulting often in wrong inferences. In this work, we try to improve the underlying understanding of the negation in the pre-trained LMs. To augment negation understanding, we propose a language model objective with a weighted cross-entropy loss and elastic weight consolidation regularization. We reduce the mean top 1 error rate for BERT-base to 1.1%, BERT-large to 0.78%, RoBERTA-base to 3.74%, RoBERTA-large to 0.01% on the negated LAMA dataset. It minimizes the BERT error rate by a margin of 8% and also outperform the existing negation models. We also provide empirical evidences that negated augmented models outperform the classical models on original as well as negation benchmarks on natural language inference tasks.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Rituraj Singh , Rahul Kumar , Vivek Sridhar

Topics

Artificial Intelligence > Core AI > Interpretability Machine Learning > Learning Types > Continual Learning Machine Learning > Learning Types > Semi-Supervised Learning Natural Language Processing > Resources & Methods > Large Language Models Natural Language Processing > Resources & Methods > Language Modeling Deep Learning > Learning Types > Self-Supervised Learning Natural Language Processing > Understanding > Natural Language Inference Deep Learning > Learning Types > Representation Learning Artificial Intelligence > Core AI > Natural Language Processing

Keywords

representation learning natural language inference language model pre-trained language model language model fine-tuning elastic weight consolidation negation handling weight consolidation negation understanding weighted cross-entropy

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023