TY - GEN
T1 - Protecting privacy in tabular healthcare data
T2 - WPES'05: 2005 ACM Workshop on Privacy in the Electronic Society
AU - Shand, Brian
AU - Rashbass, Jem
PY - 2005
Y1 - 2005
N2 - Summary medical data provides important statistical information for public health, but risks revealing confidential patient information. This risk is particularly difficult to assess when many different tables are released, independently protected against disclosure by various techniques. In this paper, we present a new technique for disclosure control in tabular data which uses explicit uncertainty to prevent small numbers of records from being identified disclosively. In contrast to other techniques, bounds on the cell perturbations are also made public. This technique can be applied effectively to large datasets in their entirety, automatically, and the transformed data can then be used to create the derivative tables, or hosted on a public web site. It is even safe for population-based data. Furthermore, we show that this transformation is computationally efficient while ensuring k-anonymity, and demonstrate the suitability of the transformed data for further statistical analysis.
AB - Summary medical data provides important statistical information for public health, but risks revealing confidential patient information. This risk is particularly difficult to assess when many different tables are released, independently protected against disclosure by various techniques. In this paper, we present a new technique for disclosure control in tabular data which uses explicit uncertainty to prevent small numbers of records from being identified disclosively. In contrast to other techniques, bounds on the cell perturbations are also made public. This technique can be applied effectively to large datasets in their entirety, automatically, and the transformed data can then be used to create the derivative tables, or hosted on a public web site. It is even safe for population-based data. Furthermore, we show that this transformation is computationally efficient while ensuring k-anonymity, and demonstrate the suitability of the transformed data for further statistical analysis.
KW - Disclosure control
KW - Healthcare data
KW - Local suppression
KW - Microdata
UR - http://www.scopus.com/inward/record.url?scp=33749033093&partnerID=8YFLogxK
M3 - Conference contribution
AN - SCOPUS:33749033093
SN - 1595932283
SN - 9781595932280
T3 - WPES'05: Proceedings of the 2005 ACM Workshop on Privacy in the Electronic Society
SP - 20
EP - 26
BT - WPES'05
Y2 - 7 November 2005 through 7 November 2005
ER -