Deep Reinforcement Learning-based Text Anonymization against Private-Attribute Inference

Ahmadreza Mosallanezhad; Ghazaleh Beigi; Huan Liu

2019 EMNLP EMNLP 2019

Deep Reinforcement Learning-based Text Anonymization against Private-Attribute Inference

Abstract

AbstractUser-generated textual data is rich in content and has been used in many user behavioral modeling tasks. However, it could also leak user private-attribute information that they may not want to disclose such as age and location. User’s privacy concerns mandate data publishers to protect privacy. One effective way is to anonymize the textual data. In this paper, we study the problem of textual data anonymization and propose a novel Reinforcement Learning-based Text Anonymizor, RLTA, which addresses the problem of private-attribute leakage while preserving the utility of textual data. Our approach first extracts a latent representation of the original text w.r.t. a given task, then leverages deep reinforcement learning to automatically learn an optimal strategy for manipulating text representations w.r.t. the received privacy and utility feedback. Experiments show the effectiveness of this approach in terms of preserving both privacy and utility.

🌉 Interdisciplinary Bridge — Machine Learning and Natural Language Processing and Reinforcement Learning and Security & Privacy

🧭 Keyword Pioneer — private attribute inference

🐣 Hot Topic Early Bird — privacy protection

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Ahmadreza Mosallanezhad , Ghazaleh Beigi , Huan Liu

Topics

Machine Learning > Application Areas > Privacy Reinforcement Learning > Methods > Deep RL Machine Learning > Learning Types > Reinforcement Learning Security & Privacy > Privacy Natural Language Processing > Applications > Text Processing

Keywords

deep reinforcement learning reinforcement learning privacy protection utility preservation text anonymization private-attribute inference private attribute inference

Download PDF

Related papers

Read, Attend and Comment: A Deep Architecture for Automatic News Comment Generation 2019

Chains-of-Reasoning at TextGraphs 2019 Shared Task: Reasoning over Chains of Facts for Explainable Multi-hop Inference 2019

A Boundary-aware Neural Model for Nested Named Entity Recognition 2019

Iterative Dual Domain Adaptation for Neural Machine Translation 2019

A Multi-Pairwise Extension of Procrustes Analysis for Multilingual Word Translation 2019