Priority-Based k-Anonymity Accomplished by Weighted Generalisation Structures

Konrad Stark,
Johann Eder &
Kurt Zatloukal

Part of the book series: Lecture Notes in Computer Science (LNISA, volume 4081)

Included in the following conference series:

International Conference on Data Warehousing and Knowledge Discovery

Abstract

Biobanks are gaining in importance by storing large collections of patients' clinical data (e.g. disease history, laboratory parameters, diagnosis, lifestyle) together with biological materials such as tissue samples, blood or other body fluids. When these patient-specific data are released for medical studies, privacy protection has to be guaranteed for ethical and legal reasons. k-anonymity may be used to ensure privacy by generalising and suppressing attributes in order to release sufficient data twins to mask patients' identities. However, data transformation techniques like generalisation may produce anonymised data that are unusable for medical studies because some attributes become too coarse-grained. We propose a priority-driven anonymisation technique that allows the degree of acceptable information loss to be specified for each attribute separately. We use generalisation and suppression of attributes together with a weighting scheme for quantifying generalisation steps. Our approach handles both numerical and categorical attributes and provides a data anonymisation based on priorities and weights. The anonymisation algorithm described in this paper has been implemented and tested on a carcinoma data set. We discuss some general privacy-protecting methods for medical data and show some medically relevant use cases that benefit from our anonymisation technique.
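
To make the idea of priority-weighted generalisation concrete, the following is a minimal sketch and not the algorithm from the paper. It assumes two quasi-identifiers (a numerical age and a categorical diagnosis), three generalisation levels per attribute (0 = original value, 1 = one generalisation step, 2 = suppressed), and a simple cost defined as the sum of weight times level. The records, the diagnosis hierarchy, the cost function and all function names are hypothetical and chosen only for illustration.

```python
from collections import Counter
from itertools import product

# Hypothetical records (age, diagnosis); not taken from the paper's data set.
records = [
    (34, "colon carcinoma"),
    (34, "rectal carcinoma"),
    (52, "colon carcinoma"),
    (52, "rectal carcinoma"),
]

# Hypothetical one-step hierarchy for the categorical attribute.
DIAGNOSIS_PARENT = {
    "colon carcinoma": "colorectal carcinoma",
    "rectal carcinoma": "colorectal carcinoma",
}


def generalise_age(age, level):
    """Level 0: exact age, level 1: 10-year band, level 2: suppressed."""
    if level == 0:
        return str(age)
    if level == 1:
        low = age // 10 * 10
        return f"{low}-{low + 9}"
    return "*"


def generalise_diagnosis(diagnosis, level):
    """Level 0: exact diagnosis, level 1: parent category, level 2: suppressed."""
    if level == 0:
        return diagnosis
    if level == 1:
        return DIAGNOSIS_PARENT[diagnosis]
    return "*"


def is_k_anonymous(rows, k):
    """Each combination of quasi-identifier values must occur at least k times."""
    return all(count >= k for count in Counter(rows).values())


def weighted_loss(levels, weights):
    """Assumed cost: generalisation level scaled by the attribute's weight."""
    return sum(w * l for w, l in zip(weights, levels))


def best_generalisation(rows, k, weights, max_level=2):
    """Try every level combination and keep the cheapest k-anonymous one."""
    best = None
    for levels in product(range(max_level + 1), repeat=2):
        age_level, diag_level = levels
        transformed = [
            (generalise_age(a, age_level), generalise_diagnosis(d, diag_level))
            for a, d in rows
        ]
        if is_k_anonymous(transformed, k):
            cost = weighted_loss(levels, weights)
            if best is None or cost < best[0]:
                best = (cost, levels, transformed)
    return best


# Diagnosis has high priority (weight 3), so age is sacrificed instead.
cost_a, levels_a, rows_a = best_generalisation(records, k=2, weights=(1, 3))

# Age has high priority (weight 3), so diagnosis is generalised instead.
cost_b, levels_b, rows_b = best_generalisation(records, k=2, weights=(3, 1))
```

In this sketch a higher weight makes generalising an attribute more expensive, so the search prefers to coarsen low-priority attributes first. The exhaustive search over level combinations is only practical for a handful of attributes and is used here purely for clarity.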

Author information

Authors and Affiliations

Institute of Pathology, Medical University Graz, Auenbruggerplatz 25, A-8036, Graz
Konrad Stark & Kurt Zatloukal
Department of Knowledge and Business Engineering, University of Vienna, Rathausstraße 19/9, A-1010, Wien
Johann Eder

Editor information

Editors and Affiliations

Institute of Software Technology and Interactive Systems, Vienna University of Technology, Favoritenstr. 9-11/188, A-1040, Wien, Austria
A Min Tjoa
Department of Software and Computing Systems, University of Alicante, Spain
Juan Trujillo

Cite this paper

Stark, K., Eder, J., Zatloukal, K. (2006). Priority-Based k-Anonymity Accomplished by Weighted Generalisation Structures. In: Tjoa, A.M., Trujillo, J. (eds) Data Warehousing and Knowledge Discovery. DaWaK 2006. Lecture Notes in Computer Science, vol 4081. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11823728_38

DOI: https://doi.org/10.1007/11823728_38
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-37736-8
Online ISBN: 978-3-540-37737-5
eBook Packages: Computer Science, Computer Science (R0)