SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation

Abe Hou; Jingyu Zhang; Tianxing He; Yichen Wang; Yung-Sung Chuang; Hongwei Wang; Lingfeng Shen; Benjamin Van Durme; Daniel Khashabi; Yulia Tsvetkov

NAACL 2024

Abstract

Existing watermarked generation algorithms employ token-level designs and are therefore vulnerable to paraphrase attacks. To address this issue, we introduce watermarking on the semantic representation of sentences. We propose SemStamp, a robust sentence-level semantic watermarking algorithm that uses locality-sensitive hashing (LSH) to partition the semantic space of sentences. The algorithm encodes and LSH-hashes a candidate sentence generated by a language model, and conducts rejection sampling until the sampled sentence falls in a watermarked partition of the semantic embedding space. To test the paraphrastic robustness of watermarking algorithms, we propose a “bigram paraphrase” attack that produces paraphrases with small bigram overlap with the original sentence. This attack is shown to be effective against existing token-level watermark algorithms, while causing only minor degradation to SemStamp. Experimental results show that our novel semantic watermark algorithm is not only more robust than the previous state-of-the-art method across various paraphrasers and domains, but also better at preserving generation quality.
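The generation loop described in the abstract (embed a candidate sentence, LSH-hash it, and resample until it lands in a watermarked partition) can be illustrated with a minimal sketch. This is not the authors' implementation. It assumes random-hyperplane LSH, a seeded random choice of which partitions count as watermarked, and toy stand-ins for the language model and sentence encoder. All function names, the `valid_ratio` parameter, and the `max_tries` cap are hypothetical.

```python
import itertools
import zlib

import numpy as np


def make_hyperplanes(n_bits, dim, seed=0):
    """Draw n_bits random hyperplane normals (assumed LSH family: sign of dot product)."""
    return np.random.default_rng(seed).standard_normal((n_bits, dim))


def lsh_signature(embedding, hyperplanes):
    """Pack one sign bit per hyperplane into an integer partition index."""
    bits = (hyperplanes @ embedding) > 0
    signature = 0
    for bit in bits:
        signature = (signature << 1) | int(bit)
    return signature


def choose_valid_partitions(n_bits, valid_ratio, seed):
    """Assumption: watermarked partitions are a seeded random subset of all 2**n_bits regions."""
    n_regions = 2 ** n_bits
    n_valid = max(1, int(valid_ratio * n_regions))
    rng = np.random.default_rng(seed)
    return set(rng.choice(n_regions, size=n_valid, replace=False).tolist())


def sample_watermarked(generate, embed, hyperplanes, valid, max_tries=100):
    """Rejection sampling: regenerate until the sentence hashes into a valid partition."""
    candidate, signature = None, None
    for _ in range(max_tries):
        candidate = generate()
        signature = lsh_signature(embed(candidate), hyperplanes)
        if signature in valid:
            return candidate, signature, True
    # Give up after max_tries and report that the last candidate was not accepted.
    return candidate, signature, False


def toy_embed(sentence, dim=16):
    """Toy stand-in for a sentence encoder: a deterministic pseudo-random vector per string."""
    seed = zlib.crc32(sentence.encode("utf-8"))
    return np.random.default_rng(seed).standard_normal(dim)


def toy_generator():
    """Toy stand-in for a language model: yields a new candidate string on each call."""
    counter = itertools.count()
    return lambda: f"candidate sentence {next(counter)}"


if __name__ == "__main__":
    planes = make_hyperplanes(n_bits=3, dim=16, seed=0)
    valid = choose_valid_partitions(n_bits=3, valid_ratio=0.5, seed=1)
    sentence, signature, accepted = sample_watermarked(
        toy_generator(), toy_embed, planes, valid
    )
    print(sentence, signature, accepted)
```

In this sketch, a detector holding the same hyperplanes and the same seed could recompute each sentence's partition index and count how many fall in the valid set. A paraphrase that keeps a sentence's embedding on the same side of every hyperplane leaves its signature unchanged, which is the intuition behind the robustness claim.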

Topics

Artificial Intelligence > Core AI > AI Safety
Machine Learning > Application Areas > Privacy

Keywords

text generation, locality-sensitive hashing, sentence embedding, paraphrase attack, semantic watermark
