Abstract
Privacy-preservation in distributed databases is an important area of research in recent years. In a typical scenario, multiple parties may wish to collaborate to extract interesting global information such as class labels without revealing their respective data to each other. This may be particularly useful in applications such as car selling units, medical research etc. In the proposed work, we aim to develop a global classification model based on the Naïve Bayes classification scheme. The Naïve Bayes classification has been used because of its simplicity and high efficiency. For privacy-preservation of the data, the concept of trusted third party with two offsets has been used. The data is first anonymized at local party end and then the aggregation and global classification is done at the trusted third party. The proposed algorithms address various types of fragmentation schemes such as horizontal, vertical and arbitrary distribution.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Vaidya, J., Kantarcıoğlu, M., Clifton, C.: Privacy-Preserving Naïve Bayes Cassification. International Journal on Very Large Data Bases 17(4), 879–898 (2008)
Leew, C.K., Choi, U.J., Liew, C.J.: A data distortion by probability distribution. ACM TODS, 395–411 (1985)
Warner, S.L.: Randomized Response: A survey technique for eliminating evasive answer bias. Journal of American Statistical Association, 63–69 (1965)
Agarwal, R., Srikanth, R.: Privacy–preserving data mining. In: Proceedings of the ACM SIGMOD conference (2005)
Agarwal, D., Agarwal, C.C.: On the design and Quantification of Privacy–Preserving Data Mining Algorithms. In: ACM PODS Conference, pp. 1224–1236 (2002)
Zhang, P., Tong, Y., Tang, D.: Privacy–Preserving Naïve Bayes Classifier. In: Li, X., Wang, S., Dong, Z.Y. (eds.) ADMA 2005. LNCS (LNAI), vol. 3584, pp. 744–752. Springer, Heidelberg (2005)
Zhu, Y., Liu, L.: Optimal Randomization for Privacy–Preserving Data Mining. In: KDD ACM KDD Conference (2004)
Gambs, S., Kegl, B., Aimeur, E.: Privacy –Preserving Boosting, Journal (to appear)
Poovammal, E., Poonavaikko: An Improved Method for Privacy Preserving Data Mining. In: IEEE IACC Conference, Patiala, India, pp. 1453–1458 (2009)
Yao, A.C.: Protocol for secure sum computations. In: Proc. IEEE Foundations of Computer Science, pp. 160–164 (1982)
Bohanec, M., Zupan, B.: UCI Machine Learning Repository (1997), http://archive.ics.uci.edu/ml/datasets/Car+Evaluation
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2010 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Keshavamurthy, B.N., Sharma, M., Toshniwal, D. (2010). Privacy-Preserving Naïve Bayes Classification Using Trusted Third Party and Offset Computation over Distributed Databases. In: Das, V.V., Vijaykumar, R. (eds) Information and Communication Technologies. ICT 2010. Communications in Computer and Information Science, vol 101. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-15766-0_89
Download citation
DOI: https://doi.org/10.1007/978-3-642-15766-0_89
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-15765-3
Online ISBN: 978-3-642-15766-0
eBook Packages: Computer ScienceComputer Science (R0)