NoPeek: Information leakage reduction to share activations in distributed deep learning
Author(s)
Vepakomma, Praneeth; Singh, Abhishek; Gupta, Otkrist; Raskar, Ramesh
DownloadSubmitted version (13.07Mb)
Open Access Policy
Open Access Policy
Creative Commons Attribution-Noncommercial-Share Alike
Terms of use
Metadata
Show full item recordAbstract
For distributed machine learning with sensitive data, we demonstrate how minimizing distance correlation between raw data and intermediary representations reduces leakage of sensitive raw data patterns across client communications while maintaining model accuracy. Leakage (measured using distance correlation between input and intermediate representations) is the risk associated with the invertibility of raw data from intermediary representations. This can prevent client entities that hold sensitive data from using distributed deep learning services. We demonstrate that our method is resilient to such reconstruction attacks and is based on reduction of distance correlation between raw data and learned representations during training and inference with image datasets. We prevent such reconstruction of raw data while maintaining information required to sustain good classification accuracies.
Date issued
2020Department
Program in Media Arts and Sciences (Massachusetts Institute of Technology); Massachusetts Institute of Technology. Media LaboratoryJournal
IEEE International Conference on Data Mining Workshops, ICDMW
Publisher
IEEE
Citation
Vepakomma, P, Singh, A, Gupta, O and Raskar, R. "NoPeek: Information leakage reduction to share activations in distributed deep learning." IEEE International Conference on Data Mining Workshops, ICDMW, 2020-November.
Version: Original manuscript