Latent Space Interpretation for Stylistic Analysis and Explainable Authorship Attribution

Milad Alshomary; Narutatsu Ri; Marianna Apidianaki; Ajay Patel; Smaranda Muresan; Kathleen McKeown

COLING 2025

Abstract

Recent state-of-the-art authorship attribution methods learn authorship representations of text in a latent, uninterpretable space, which hinders their usability in real-world applications. We propose a novel approach for interpreting learned embeddings by identifying representative points in the latent space and leveraging large language models to generate informative natural language descriptions of the writing style associated with each point. We evaluate the alignment between our interpretable and latent spaces and demonstrate superior prediction agreement over baseline methods. Additionally, we conduct a human evaluation to assess the quality of these style descriptions and validate their utility in explaining the latent space. Finally, we show that human performance on the challenging authorship attribution task improves by +20% on average when aided with explanations from our method.
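As a rough illustration of the idea of "representative points", the sketch below clusters toy document embeddings and re-expresses each document by its similarity to the resulting centroids. The abstract does not say how representative points are selected, so the k-means-style clustering with cosine similarity is an assumption made here. The function names and the style descriptions are hypothetical placeholders; in the paper the descriptions are generated by a large language model.

```python
import math
import random


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def representative_points(embeddings, k, iters=20, seed=0):
    """Assumed selection step: k-means-style clustering, centroids as representatives."""
    rng = random.Random(seed)
    centroids = [list(v) for v in rng.sample(embeddings, k)]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for v in embeddings:
            clusters[nearest_point(v, centroids)].append(v)
        # Keep the old centroid if a cluster ends up empty.
        centroids = [mean(c) if c else centroids[i] for i, c in enumerate(clusters)]
    return centroids


def nearest_point(embedding, centroids):
    """Index of the representative point most similar to the embedding."""
    return max(range(len(centroids)), key=lambda i: cosine(embedding, centroids[i]))


def interpretable_vector(embedding, centroids):
    """One coordinate per representative point: similarity to that point."""
    return [cosine(embedding, c) for c in centroids]


# Toy 2-D "authorship embeddings" forming two loose groups (made-up data).
docs = [[1.0, 0.1], [0.9, 0.2], [1.0, 0.0], [0.1, 1.0], [0.0, 0.9], [0.2, 1.0]]
points = representative_points(docs, k=2)

# Hypothetical placeholder descriptions, one per representative point.
descriptions = [f"style description for point {i}" for i in range(len(points))]

for doc in docs:
    idx = nearest_point(doc, points)
    print(doc, "->", descriptions[idx], interpretable_vector(doc, points))
```

Each coordinate of the interpretable vector is tied to one described point, so a prediction made in that space can be read off in terms of the attached style descriptions.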

Topics

Artificial Intelligence > Core AI > Interpretability
Machine Learning > Core Methods > Representation Learning

Keywords

authorship attribution; latent space; writing style; interpretable embedding; large language model
