Cyber Center Publications

A framework for efficient data anonymization under privacy and accuracy constraints

Gabriel Ghinita
Panagiotis Karras
Panos Kalnis
Nikos Mamoulis

Abstract

Recent research studied the problem of publishing microdata without revealing sensitive information, leading to the privacy-preserving paradigms of k-anonymity and l-diversity. k-anonymity protects against the identification of an individual's record. l-diversity, in addition, safeguards against the association of an individual with specific sensitive information. However, existing approaches suffer from at least one of the following drawbacks: (i) l-diversification is solved by techniques developed for the simpler k-anonymization problem, causing unnecessary information loss. (ii) The anonymization process is inefficient in terms of computational and I/O cost. (iii) Previous research focused exclusively on the privacy-constrained problem and ignored the equally important accuracy-constrained (or dual) anonymization problem.

In this article, we propose a framework for efficient anonymization of microdata that addresses these deficiencies. First, we focus on one-dimensional (i.e., single-attribute) quasi-identifiers, and study the properties of optimal solutions under the k-anonymity and l-diversity models for the privacy-constrained (i.e., direct) and the accuracy-constrained (i.e., dual) anonymization problems. Guided by these properties, we develop efficient heuristics to solve the one-dimensional problems in linear time. Finally, we generalize our solutions to multidimensional quasi-identifiers using space-mapping techniques. Extensive experimental evaluation shows that our techniques clearly outperform the existing approaches in terms of execution time and information loss.

Keywords

anonymity, design experimentation, general privacy, security

Date of this Version

6-2009

Comments

ACM Transactions on Database Systems Volume 34 Issue 2, June 2009

Link to Full Text

COinS

Cyber Center Publications

A framework for efficient data anonymization under privacy and accuracy constraints

Abstract

Keywords

Date of this Version

Comments

Search

Links

Links for Authors

Browse

Cyber Center Publications

A framework for efficient data anonymization under privacy and accuracy constraints

Authors

Abstract

Keywords

Date of this Version

Comments

Share

Search

Links

Links for Authors

Browse