Fast data anonymization with low information loss 论文

2007引用 277

Privacy-Preserving Technologies in DataCryptography and Data SecurityInternet Traffic Analysis and Secure E-voting

Cryptography and Data Security Privacy-Preserving Technologies in Data Internet Traffic Analysis and Secure E-voting

作者

摘要

Recent research studied the problem of publishing microdata without revealing sensitive information, leading to the privacy preserving paradigms of k-anonymity and ℓ-diversity. k-anonymity protects against the identification of an individual’s record. ℓ-diversity, in addition, safeguards against the association of an individual with specific sensitive information. However, existing approaches suffer from at least one of the following drawbacks: (i) The information loss metrics are counter-intuitive and fail to capture data inaccuracies inflicted for the sake of privacy. (ii) ℓ-diversity is solved by techniques developed for the simpler k-anonymity problem, which introduces unnecessary inaccuracies. (iii) The anonymization process is inefficient in terms of computation and I/O cost. In this paper we propose a framework for efficient privacy preservation that addresses these deficiencies. First, we focus on one-dimensional (i.e., single attribute) quasiidentifiers, and study the properties of optimal solutions for k-anonymity and ℓ-diversity, based on meaningful information loss metrics. Guided by these properties, we develop efficient heuristics to solve the one-dimensional problems in linear time. Finally, we generalize our solutions to multi-dimensional quasi-identifiers using space-mapping techniques. Extensive experimental evaluation shows that our techniques clearly outperform the state-of-the-art, in terms of execution time and information loss. 1.

作者查看全部 (4)

Nikos Mamoulis

Panos Kalnis

Panagiotis Karras

Gabriel Ghinita

Fast data anonymization with low information loss 论文

摘要

作者查看全部 (4)

相关技术

相关事件

相关文章