Protecting privacy in tabular healthcare data: Explicit uncertainty for disclosure control

Brian Shand*, Jem Rashbass

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Summary medical data provides important statistical information for public health, but risks revealing confidential patient information. This risk is particularly difficult to assess when many different tables are released, independently protected against disclosure by various techniques. In this paper, we present a new technique for disclosure control in tabular data which uses explicit uncertainty to prevent small numbers of records from being identified disclosively. In contrast to other techniques, bounds on the cell perturbations are also made public. This technique can be applied effectively to large datasets in their entirety, automatically, and the transformed data can then be used to create the derivative tables, or hosted on a public web site. It is even safe for population-based data. Furthermore, we show that this transformation is computationally efficient while ensuring k-anonymity, and demonstrate the suitability of the transformed data for further statistical analysis.

Original languageEnglish
Title of host publicationWPES'05
Subtitle of host publicationProceedings of the 2005 ACM Workshop on Privacy in the Electronic Society
Pages20-26
Number of pages7
Publication statusPublished - 2005
Externally publishedYes
EventWPES'05: 2005 ACM Workshop on Privacy in the Electronic Society - Alexandria, VA, United States
Duration: 7 Nov 20057 Nov 2005

Publication series

NameWPES'05: Proceedings of the 2005 ACM Workshop on Privacy in the Electronic Society

Conference

ConferenceWPES'05: 2005 ACM Workshop on Privacy in the Electronic Society
Country/TerritoryUnited States
CityAlexandria, VA
Period7/11/057/11/05

Keywords

  • Disclosure control
  • Healthcare data
  • Local suppression
  • Microdata

Fingerprint

Dive into the research topics of 'Protecting privacy in tabular healthcare data: Explicit uncertainty for disclosure control'. Together they form a unique fingerprint.

Cite this