WaterMax: breaking the LLM watermark detectability-robustness-quality trade-off

Eva Giboulot; Teddy Furon

NeurIPS 2024

Abstract

Watermarking is a technical means to dissuade malicious use of Large Language Models (LLMs). This paper proposes a novel watermarking scheme, called WaterMax, that achieves high detectability while preserving the quality of the text generated by the original LLM. Its design leaves the LLM untouched: the weights, logits, and temperature are not modified. WaterMax balances robustness against computational complexity, whereas the watermarking techniques in the literature inherently impose a trade-off between quality and robustness. Its performance is both theoretically proven and experimentally validated. It outperforms all state-of-the-art techniques under the most complete benchmark suite.

Topics

Artificial Intelligence > Core AI > Foundation Models; Machine Learning > Application Areas > Privacy; Natural Language Processing > Generation > Text Generation; Artificial Intelligence > Core AI > Privacy; Deep Learning > Models > Large Language Models

Keywords

text generation; text watermark; text watermarking; watermark detection; text quality; large language model; watermark robustness
