Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1102199.1102203acmconferencesArticle/Chapter ViewAbstractPublication PagesccsConference Proceedingsconference-collections
Article

Protecting privacy in tabular healthcare data: explicit uncertainty for disclosure control

Published: 07 November 2005 Publication History

Abstract

Summary medical data provides important statistical information for public health, but risks revealing confidential patient information. This risk is particularly difficult to assess when many different tables are released, independently protected against disclosure by various techniques. In this paper, we present a new technique for disclosure control in tabular data which uses explicit uncertainty to prevent small numbers of records from being identified disclosively. In contrast to other techniques, bounds on the cell perturbations are also made public. This technique can be applied effectively to large datasets in their entirety, automatically, and the transformed data can then be used to create the derivative tables, or hosted on a public web site. It is even safe for population-based data. Furthermore, we show that this transformation is computationally efficient while ensuring k-anonymity, and demonstrate the suitability of the transformed data for further statistical analysis.

References

[1]
Department of Health and Human Services. 45CFR164 Subpart E --- Privacy of Individually Identifiable Health Information. Code of Federal Regulations. U.S. Government Printing Office via GPO Access, Oct. 2004. {Online}. Available: http://www.gpoaccess.gov/cfr/ {1 Jun 2005}.
[2]
O. Duke-Williams and P. Rees. Can census offices publish statistics for more than one small area geography? An analysis of the differencing problem in statistical disclosure. International Journal of Geographical Information Science, 12(6):579--605, 1998.
[3]
R. Garfinkel, R. Gopal, and P. Goes. Privacy protection of binary confidential data against deterministic, stochastic, and insider threat. Management Science, 48(6):749--764, 2002.
[4]
HES. Hospital Episode Statistics from the NHS Health and Social Care Information Centre, 2005. {Online}. Available: http://www.hesonline.nhs.uk/ {1 Jun 2005}.
[5]
A. Hundepool. Summary of the CASC project, Feb 2005. European Project IST-2000-25069 CASC, Deliverable 9-D1, http://neon.vb.cbs.nl/casc.
[6]
R. J. A. Little and D. B. Rubin. Statistical Analysis with Missing Data, Second Edition. Wiley, New York, 2002.
[7]
P. Samarati. Protecting respondents' identities in microdata release. IEEE Transactions on Knowledge and Data Engineering, 13(6):1010--1027, 2001.
[8]
L. Sweene. k-anonymity: A model for protecting privacy. International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, 10(5):557--570, 2002.
[9]
G. van Rossum. Python library reference, July 2001. {Online}. Available: http://www.python.org/doc/ref/.
[10]
L. Willenborg and T. de Waal, editors. Elements of Statistical Disclosure Control, volume 155 of Lecture Notes in Statistics. Springer Verlag, 2001.

Cited By

View all

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
WPES '05: Proceedings of the 2005 ACM workshop on Privacy in the electronic society
November 2005
116 pages
ISBN:1595932283
DOI:10.1145/1102199
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 07 November 2005

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. disclosure control
  2. healthcare data
  3. local suppression
  4. microdata

Qualifiers

  • Article

Conference

CCS05
Sponsor:

Acceptance Rates

Overall Acceptance Rate 106 of 355 submissions, 30%

Upcoming Conference

CCS '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)11
  • Downloads (Last 6 weeks)0
Reflects downloads up to 21 Nov 2024

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media