Distributed Learning without Distress: Privacy-Preserving Empirical Risk Minimization

Distributed Learning without Distress:
Privacy-Preserving Empirical Risk Minimization

Bargav Jayaraman, Lingxiao Wang, David Evans and Quanquan Gu.
32^nd Conference on Neural Information Processing Systems (NeurIPS)
Montreal, Canada
December 2018

Abstract

Distributed learning allows a group of independent data owners to collaboratively learn a model over their data sets without exposing their private data. We present a distributed learning approach that combines differential privacy with secure multi-party computation. We explore two popular methods of differential privacy, output perturbation and gradient perturbation, and advance the state-of-the-art for both methods in the distributed learning setting. In our output perturbation method, the parties combine local models within a secure computation and then add therequired differential privacy noise before revealing the model. In our gradient perturbation method, the data owners collaboratively train a global model via aniterative learning algorithm. At each iteration, the parties aggregate their local gradients within a secure computation, adding sufficient noise to ensure privacy before the gradient updates are revealed. For both methods, we show that the noise can be reduced in the multi-party setting by adding the noise inside the securecomputation after aggregation, asymptotically improving upon the best previous results. Experiments on real world data sets demonstrate that our methods providesubstantial utility gains for typical privacy requirements.

Code

https://github.com/bargavj/distributedMachineLearning

Paper

[PDF], 12 pages [NeurIPS Proceedings Page]