Privacy-Preserving Learning of Random Forests Without Revealing the Trees

Lukas-Malte Bammert¹²,
Stefan Kramer¹²,
Mattia Cerrato¹² &
…
Ernst Althaus¹²

Part of the book series: Lecture Notes in Computer Science (LNAI, volume 14276)

Included in the following conference series:

International Conference on Discovery Science

Abstract

The paper presents a method for the privacy-preserving learning of random forests from private data of three parties, where not even the decision trees, i.e., neither the tree structures nor their parameters (the annotations of attributes and attribute values), are disclosed to any of the parties. To make this practical for realistically sized data, a custom protocol is needed for the private comparison of two numbers, such that the numbers themselves are only available in shares and are not known to either party. Experiments with five datasets indicate that the overall protocol matches classical random forests in accuracy and can handle datasets of realistic size.
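
To illustrate what it means for numbers to be "only available in shares", the following is a minimal Python sketch of additive secret sharing among three parties. It is not the paper's comparison protocol, which is not described on this page. The ring size `MODULUS`, all function names, and the final comparison step are assumptions made for illustration. In particular, `insecure_less_than` opens the difference of the two values in order to read off its sign, which a genuinely private comparison must avoid; it is shown only to make clear which step a custom protocol has to replace.

```python
import secrets

MODULUS = 2 ** 64  # ring size; an assumption for this sketch


def share(value, n_parties=3):
    """Split value into n additive shares that sum to value mod MODULUS."""
    shares = [secrets.randbelow(MODULUS) for _ in range(n_parties - 1)]
    shares.append((value - sum(shares)) % MODULUS)
    return shares


def reconstruct(shares):
    """Recombine all shares; any proper subset looks uniformly random."""
    return sum(shares) % MODULUS


def sub_shares(a_shares, b_shares):
    """Each party subtracts its own two shares locally, without communication."""
    return [(a - b) % MODULUS for a, b in zip(a_shares, b_shares)]


def to_signed(value):
    """Interpret a ring element as a signed integer (two's complement style)."""
    return value - MODULUS if value >= MODULUS // 2 else value


def insecure_less_than(a_shares, b_shares):
    """Demo only: opens the difference, which a private protocol must not do."""
    difference = to_signed(reconstruct(sub_shares(a_shares, b_shares)))
    return difference < 0


x_shares = share(17)  # no single share reveals 17
y_shares = share(42)
x_is_smaller = insecure_less_than(x_shares, y_shares)
```

Subtraction works share by share because additive sharing is linear. Comparison is not linear, which is why it needs a dedicated protocol whose output is a shared comparison bit rather than an opened difference.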


Notes

1. Notice that we follow the original definition of random forests by Breiman (2001).
2. http://archive.ics.uci.edu/ml.
3. https://websockets.readthedocs.io/en/stable/.

References

Akavia, A., Leibovich, M., Resheff, Y.S., Ron, R., Shahar, M., Vald, M.: Privacy-preserving decision trees training and prediction. Cryptology ePrint Archive, Paper 2021/768 (2021). https://eprint.iacr.org/2021/768
Althaus, E., Dousti, M.S., Kramer, S., Rassau, N.J.P.: Fast private parameter learning and evaluation for sum-product networks. CoRR abs/2104.07353 (2021). https://arxiv.org/abs/2104.07353
Araki, T., Furukawa, J., Lindell, Y., Nof, A., Ohara, K.: High-throughput semi-honest secure three-party computation with an honest majority. In: Proceedings of the 2016 ACM SIGSAC Conference on Computer and Communications Security, pp. 805–817. Association for Computing Machinery, New York, NY, USA (2016). https://doi.org/10.1145/2976749.2978331
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Canetti, R.: Security and composition of multiparty cryptographic protocols. J. Cryptol. 13(1), 143–202 (2000). https://doi.org/10.1007/s001459910006
Du, W., Zhan, Z.: Building decision tree classifier on private data. In: Proceedings of the IEEE International Conference on Privacy, Security and Data Mining, vol. 14, pp. 1–8. Australian Computer Society Inc. (2002)
Emekci, F., Sahin, O., Agrawal, D., El Abbadi, A.: Privacy preserving decision tree learning over multiple parties. Data Knowl. Eng. 63(2), 348–361 (2007). https://www.sciencedirect.com/science/article/pii/S0169023X07000365
Giacomelli, I., Jha, S., Joye, M., Page, C.D., Yoon, K.: Privacy-preserving ridge regression over distributed data from LHE. Cryptology ePrint Archive, Report 2017/979 (2017). https://eprint.iacr.org/2017/979
Goldreich, O., Micali, S., Wigderson, A.: How to play any mental game. In: Proceedings of the Nineteenth Annual ACM Symposium on Theory of Computing, pp. 218–229. Association for Computing Machinery, New York, NY, USA (1987). https://doi.org/10.1145/28395.28420
Goldreich, O.: Foundations of Cryptography - Basic Applications, vol. 2. Cambridge University Press, Cambridge (2004)
de Hoogh, S., Schoenmakers, B., Chen, P., op den Akker, H.: Practical secure decision tree learning in a teletreatment application. In: Christin, N., Safavi-Naini, R. (eds.) FC 2014. LNCS, vol. 8437, pp. 179–194. Springer, Heidelberg (2014). https://doi.org/10.1007/978-3-662-45472-5_12
Lindell, Y., Pinkas, B.: Privacy preserving data mining. In: Bellare, M. (ed.) CRYPTO 2000. LNCS, vol. 1880, pp. 36–54. Springer, Heidelberg (2000). https://doi.org/10.1007/3-540-44598-6_3
Mohassel, P., Rindal, P.: ABY3: a mixed protocol framework for machine learning. In: Proceedings of the 2018 ACM SIGSAC Conference on Computer and Communications Security, pp. 35–52, October 2018
Mohassel, P., Zhang, Y.: SecureML: a system for scalable privacy-preserving machine learning. In: 2017 IEEE Symposium on Security and Privacy (SP), pp. 19–38 (2017). https://doi.org/10.1109/SP.2017.12
Nikolaenko, V., Weinsberg, U., Ioannidis, S., Joye, M., Boneh, D., Taft, N.: Privacy-preserving ridge regression on hundreds of millions of records. In: 2013 IEEE Symposium on Security and Privacy, pp. 334–348 (2013). https://doi.org/10.1109/SP.2013.30
Riazi, M.S., Weinert, C., Tkachenko, O., Songhori, E.M., Schneider, T., Koushanfar, F.: Chameleon: a hybrid secure computation framework for machine learning applications. CoRR abs/1801.03239 (2018). http://arxiv.org/abs/1801.03239
Samet, S., Miri, A.: Privacy preserving ID3 using Gini index over horizontally partitioned data. In: 2008 IEEE/ACS International Conference on Computer Systems and Applications, pp. 645–651 (2008). https://doi.org/10.1109/AICCSA.2008.4493598
Scikit-learn: random forest classifier. https://scikit-learn.org/stable/modules/generated/sklearn.ensemble.RandomForestClassifier.html
Vaidya, J., Clifton, C., Kantarcioglu, M., Patterson, A.S.: Privacy-preserving decision trees over vertically partitioned data. ACM Trans. Knowl. Discov. Data 2(3) (2008). https://doi.org/10.1145/1409620.1409624
Wang, K., Xu, Y., She, R., Yu, P.S.: Classification spanning private databases. In: Proceedings of the 21st National Conference on Artificial Intelligence, vol. 1, pp. 293–298. AAAI'06, AAAI Press (2006)
Yao, A.C.: Protocols for secure computations. In: Proceedings of the 23rd Annual Symposium on Foundations of Computer Science, pp. 160–164. IEEE Computer Society, USA (1982)

Acknowledgements

This work was partly funded by the Carl-Zeiss-Stiftung as part of the CZS Durchbrueche project under grant number [P2021-02-014].

Author information

Authors and Affiliations

Institut für Informatik, Johannes Gutenberg Universität Mainz, Saarstraße 21, 55112, Mainz, Germany
Lukas-Malte Bammert, Stefan Kramer, Mattia Cerrato & Ernst Althaus


Corresponding author

Correspondence to Stefan Kramer.

Editor information

Editors and Affiliations

Waikato University, Hamilton, New Zealand
Albert Bifet
Aeronautics Institute of Technology, São José dos Campos, Brazil
Ana Carolina Lorena
University of Porto, Porto, Portugal
Rita P. Ribeiro
University of Porto, Porto, Portugal
João Gama
University of Coimbra, Coimbra, Portugal
Pedro H. Abreu


About this paper

Cite this paper

Bammert, LM., Kramer, S., Cerrato, M., Althaus, E. (2023). Privacy-Preserving Learning of Random Forests Without Revealing the Trees. In: Bifet, A., Lorena, A.C., Ribeiro, R.P., Gama, J., Abreu, P.H. (eds) Discovery Science. DS 2023. Lecture Notes in Computer Science(), vol 14276. Springer, Cham. https://doi.org/10.1007/978-3-031-45275-8_25

DOI: https://doi.org/10.1007/978-3-031-45275-8_25
Published: 08 October 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-45274-1
Online ISBN: 978-3-031-45275-8
eBook Packages: Computer Science, Computer Science (R0)
