Privacy amplification by subsampling: tight analyses via couplings and divergences

Authors:

Marco Gaboardi

NIPS'18: Proceedings of the 32nd International Conference on Neural Information Processing Systems

Pages 6280 - 6290

Published: 03 December 2018

Abstract

Differential privacy comes equipped with multiple analytical tools for the design of private data analyses. One important tool is the so-called "privacy amplification by subsampling" principle, which ensures that a differentially private mechanism run on a random subsample of a population provides higher privacy guarantees than when run on the entire population. Several instances of this principle have been studied for different random subsampling methods, each with an ad-hoc analysis. In this paper we present a general method that recovers and improves prior analyses, yields lower bounds and derives new instances of privacy amplification by subsampling. Our method leverages a characterization of differential privacy as a divergence which emerged in the program verification community. Furthermore, it introduces new tools, including advanced joint convexity and privacy profiles, which might be of independent interest.
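
To make the principle concrete, here is a minimal sketch of one well-known instance: Poisson subsampling, where each record is kept independently with probability `gamma` and neighbouring datasets differ by adding or removing one record. Under those assumptions, an (ε, δ)-differentially private mechanism run on the subsample satisfies (ε', δ')-differential privacy with ε' = log(1 + γ(e^ε − 1)) and δ' = γδ. The function and parameter names are hypothetical and chosen for illustration; this is not code from the paper.

```python
import math


def amplify_by_poisson_subsampling(eps, delta, gamma):
    """Return (eps', delta') for an (eps, delta)-DP mechanism run on a
    Poisson subsample with sampling rate gamma.

    Assumes the add/remove-one neighbouring relation. Names are
    illustrative, not taken from the paper.
    """
    if not 0.0 <= gamma <= 1.0:
        raise ValueError("gamma must lie in [0, 1]")
    if eps < 0.0 or not 0.0 <= delta <= 1.0:
        raise ValueError("need eps >= 0 and delta in [0, 1]")
    # eps' = log(1 + gamma * (exp(eps) - 1)); log1p/expm1 keep the
    # computation accurate when eps or gamma is small.
    eps_amplified = math.log1p(gamma * math.expm1(eps))
    delta_amplified = gamma * delta
    return eps_amplified, delta_amplified


if __name__ == "__main__":
    # Example: a (1.0, 1e-5)-DP mechanism applied to a 1% subsample.
    print(amplify_by_poisson_subsampling(1.0, 1e-5, 0.01))
```

With `gamma = 1` the subsample is the whole population and the guarantee is unchanged; as `gamma` shrinks, ε' shrinks roughly in proportion to `gamma` when ε is small.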

Index Terms: Computing methodologies; Theory of computation

Publisher

Curran Associates Inc.

Red Hook, NY, United States
