Private Learning with Public Features

Walid Krichene; Nicolas E Mayoraz; Steffen Rendle; Shuang Song; Abhradeep Thakurta; Li Zhang

2024 AISTATS AISTATS 2024

Private Learning with Public Features

Abstract

We study a class of private learning problems in which the data is a join of private and public features. This is often the case in private personalization tasks such as recommendation or ad prediction, in which features related to individuals are sensitive, while features related to items (the movies or songs to be recommended, or the ads to be shown to users) are publicly available and do not require protection. A natural question is whether private algorithms can achieve higher utility in the presence of public features. We give a positive answer for multi-encoder models where one of the encoders operates on public features. We develop new algorithms that take advantage of this separation by only protecting certain sufficient statistics (instead of adding noise to the gradient). This method has a guaranteed utility improvement for linear regression, and importantly, achieves the state of the art on two standard private recommendation benchmarks, demonstrating the importance of methods that adapt to the private-public feature separation.

🧭 Keyword Pioneer — public feature

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Security & Privacy

🌉 Interdisciplinary Bridge — Data Science & Analytics and Machine Learning

Authors

Walid Krichene , Nicolas E Mayoraz , Steffen Rendle , Shuang Song , Abhradeep Thakurta , Li Zhang

Topics

Machine Learning > Core Methods > Regression Machine Learning > Learning Types > Semi-Supervised Learning Machine Learning > Application Areas > Privacy Data Science & Analytics > Applications > Recommender Systems

Keywords

differential privacy linear regression recommendation system private learning recommender system public feature sufficient statistic multi-encoder model feature separation

Download PDF

Related papers

Causal Bandits with General Causal Models and Interventions 2024

Boundary-Aware Uncertainty for Feature Attribution Explainers 2024

Better Representations via Adversarial Training in Pre-Training: A Theoretical Perspective 2024

A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning 2024

Pure Exploration in Bandits with Linear Constraints 2024