Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models

Luiza Pozzobon; Beyza Ermis; Patrick Lewis; Sara Hooker

2023 EMNLP EMNLP 2023

Goodtriever: Adaptive Toxicity Mitigation with Retrieval-augmented Models

Abstract

AbstractConsiderable effort has been dedicated to mitigating toxicity, but existing methods often require drastic modifications to model parameters or the use of computationally intensive auxiliary models. Furthermore, previous approaches have often neglected the crucial factor of language’s evolving nature over time. In this work, we present a comprehensive perspective on toxicity mitigation that takes into account its changing nature. We introduce Goodtriever, a flexible methodology that matches the current state-of-the-art toxicity mitigation while achieving 43% relative latency reduction during inference and being more computationally efficient. By incorporating a retrieval-based approach at decoding time, Goodtriever enables toxicity-controlled text generation. Our research advocates for an increased focus on adaptable mitigation techniques, which better reflect the data drift models face when deployed in the wild.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Deep Learning and Machine Learning and Natural Language Processing

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Luiza Pozzobon , Beyza Ermis , Patrick Lewis , Sara Hooker

Topics

Artificial Intelligence > Core AI > Responsible AI Machine Learning > Application Areas > Fairness Machine Learning > Application Areas > Knowledge Distillation Natural Language Processing > Generation > Text Generation Deep Learning > Learning Types > Retrieval-Augmented Generation Artificial Intelligence > Core AI > Safety

Keywords

text generation computational efficiency language model retrieval-augmented generation latency reduction adaptive mitigation controlled generation retrieval-augmented model toxicity mitigation decoding time adaptive model

Download PDF

Related papers

Exploring Linguistic Probes for Morphological Generalization 2023

NameGuess: Column Name Expansion for Tabular Data 2023

Vision-Enhanced Semantic Entity Recognition in Document Images via Visually-Asymmetric Consistency Learning 2023

Improving Conversational Recommendation Systems via Bias Analysis and Language-Model-Enhanced Data Augmentation 2023

On the Calibration of Large Language Models and Alignment 2023