StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements

Jillian Fisher; Skyler Hallinan; Ximing Lu; Mitchell L Gordon; Zaïd Harchaoui; Yejin Choi

EMNLP 2024

Abstract

Authorship obfuscation, rewriting a text to intentionally obscure the identity of the author, is important yet challenging. Current methods using large language models (LLMs) lack interpretability and controllability, often ignoring author-specific stylistic features, resulting in less robust performance overall. To address this, we develop StyleRemix, an adaptive and interpretable obfuscation method that perturbs specific, fine-grained style elements of the original input text. StyleRemix uses pre-trained Low Rank Adaptation (LoRA) modules to rewrite inputs along various stylistic axes (e.g., formality, length) while maintaining low computational costs. StyleRemix outperforms state-of-the-art baselines and much larger LLMs on an array of domains on both automatic and human evaluation. Additionally, we release AuthorMix, a large set of 30K high-quality, long-form texts from a diverse set of 14 authors and 4 domains, and DiSC, a parallel corpus of 1,500 texts spanning seven style axes in 16 unique directions.
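As a rough illustration of the LoRA idea the abstract mentions, the sketch below adds scaled low-rank updates to a frozen base weight matrix, one update per style axis. It is not the authors' implementation: the function names (`matmul`, `merge_lora`), the axis names, the toy matrices, and in particular the choice to combine adapters by a weighted sum are all assumptions made for this example.

```python
from typing import Dict, List, Tuple

Matrix = List[List[float]]


def matmul(a: Matrix, b: Matrix) -> Matrix:
    """Plain-Python matrix product, enough for a toy example."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]


def merge_lora(
    base: Matrix,
    adapters: Dict[str, Tuple[Matrix, Matrix]],
    weights: Dict[str, float],
) -> Matrix:
    """Return base + sum_k weights[k] * (B_k @ A_k).

    Each adapter is a pair (B, A) of low-rank factors. Combining several
    adapters by a weighted sum is an assumption of this sketch.
    """
    merged = [row[:] for row in base]  # leave the frozen base untouched
    for name, (b_factor, a_factor) in adapters.items():
        scale = weights.get(name, 0.0)  # axes without a weight contribute nothing
        delta = matmul(b_factor, a_factor)
        for i, row in enumerate(delta):
            for j, value in enumerate(row):
                merged[i][j] += scale * value
    return merged


# Hypothetical rank-1 adapters for two style axes on a 2x2 base matrix.
base = [[1.0, 0.0], [0.0, 1.0]]
adapters = {
    "formality": ([[1.0], [0.0]], [[0.0, 2.0]]),
    "length": ([[0.0], [1.0]], [[1.0, 0.0]]),
}
weights = {"formality": 0.5, "length": -1.0}

merged = merge_lora(base, adapters, weights)
print(merged)  # [[1.0, 1.0], [-1.0, 1.0]]
```

Because each adapter stores only the two small factors rather than a full weight matrix, keeping one adapter per style axis stays cheap, which is consistent with the low computational cost the abstract claims.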

Topics

Artificial Intelligence > Core AI > Interpretability
Artificial Intelligence > Core AI > Large Language Models
Artificial Intelligence > Core AI > Natural Language Processing
Natural Language Processing > Generation > Text Generation
Machine Learning > Application Areas > Knowledge Distillation
Machine Learning > Learning Types > Knowledge Distillation
Deep Learning > Techniques > Knowledge Distillation
Deep Learning > Learning Types > Transfer Learning

Keywords

authorship obfuscation, style transfer, text generation, knowledge distillation, low-rank adaptation, parameter-efficient fine-tuning, large language model
