Fast and Memory Efficient Differentially Private-SGD via JL Projections

Zhiqi Bu; Sivakanth Gopi; Janardhan Kulkarni; Yin Tat Lee; Hanwen Shen; Uthaipon Tantipongpipat

2021 NIPS NeurIPS 2021

Fast and Memory Efficient Differentially Private-SGD via JL Projections

Abstract

Differentially Private-SGD (DP-SGD) of Abadi et al. and its variations are the only known algorithms for private training of large scale neural networks. This algorithm requires computation of per-sample gradients norms which is extremely slow and memory intensive in practice. In this paper, we present a new framework to design differentially private optimizers called DP-SGD-JL and DP-Adam-JL. Our approach uses Johnson–Lindenstrauss (JL) projections to quickly approximate the per-sample gradient norms without exactly computing them, thus making the training time and memory requirements of our optimizers closer to that of their non-DP versions. Unlike previous attempts to make DP-SGD faster which work only on a subset of network architectures or use compiler techniques, we propose an algorithmic solution which works for any network in a black-box manner which is the main contribution of this paper. To illustrate this, on IMDb dataset, we train a Recurrent Neural Network (RNN) to achieve good privacy-vs-accuracy tradeoff, while being significantly faster than DP-SGD and with a similar memory footprint as non-private SGD.

🌉 Interdisciplinary Bridge — Deep Learning and Machine Learning and Security & Privacy

🧭 Keyword Pioneer — private training

🐣 Hot Topic Early Bird — privacy-preserving learning

🐝 Cross-Pollinator — Artificial Intelligence, Computer Science, Computer Vision, Data Science & Analytics, Deep Learning, Healthcare & Medicine, Interdisciplinary, Knowledge & Reasoning, Machine Learning, Mathematics & Optimization, Natural Language Processing, Reinforcement Learning, Robotics, Security & Privacy, Speech & Audio

Authors

Zhiqi Bu , Sivakanth Gopi , Janardhan Kulkarni , Yin Tat Lee , Hanwen Shen , Uthaipon Tantipongpipat

Topics

Machine Learning > Optimization & Theory > Optimization Machine Learning > Application Areas > Privacy Security & Privacy > Differential Privacy Machine Learning > Learning Types > Privacy Deep Learning > Optimization & Theory > Stochastic Methods

Keywords

differential privacy stochastic gradient descent privacy-preserving learning gradient norm private training differentially private sgd johnson-lindenstrauss projection per-sample gradient jl projection

Download PDF

Related papers

Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data 2021

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation 2021

Test-Time Personalization with a Transformer for Human Pose Estimation 2021

NTopo: Mesh-free Topology Optimization using Implicit Neural Representations 2021

Scalable Intervention Target Estimation in Linear Models 2021