Privacy-Preserving Classification of Personal Text Messages with Secure Multi-Party Computation

Devin Reich; Ariel Todoki; Rafael Dowsley; Martine De Cock; anderson nascimento

2019 NIPS NeurIPS 2019

Privacy-Preserving Classification of Personal Text Messages with Secure Multi-Party Computation

Abstract

Classification of personal text messages has many useful applications in surveillance, e-commerce, and mental health care, to name a few. Giving applications access to personal texts can easily lead to (un)intentional privacy violations. We propose the first privacy-preserving solution for text classification that is provably secure. Our method, which is based on Secure Multiparty Computation (SMC), encompasses both feature extraction from texts, and subsequent classification with logistic regression and tree ensembles. We prove that when using our secure text classification method, the application does not learn anything about the text, and the author of the text does not learn anything about the text classification model used by the application beyond what is given by the classification result itself. We perform end-to-end experiments with an application for detecting hate speech against women and immigrants, demonstrating excellent runtime results without loss of accuracy.

🌉 Interdisciplinary Bridge — Artificial Intelligence and Machine Learning and Natural Language Processing and Security & Privacy

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy, Speech & Audio

Authors

Devin Reich , Ariel Todoki , Rafael Dowsley , Martine De Cock , anderson nascimento

Topics

Machine Learning > Application Areas > Privacy Natural Language Processing > Applications > Text Classification Security & Privacy > Privacy Artificial Intelligence > Core AI > Privacy Machine Learning > Learning Types > Classification

Keywords

text classification logistic regression secure multi-party computation hate speech detection privacy-preserving classification

Download PDF

Related papers

Two Generator Game: Learning to Sample via Linear Goodness-of-Fit Test 2019

Metalearned Neural Memory 2019

Model Similarity Mitigates Test Set Overuse 2019

Continual Unsupervised Representation Learning 2019

Reinforcement Learning with Convex Constraints 2019