Using unknowns to prevent discovery of association rules 论文

2001ACM SIGMOD Record引用 339
Data Mining Algorithms and ApplicationsImbalanced Data Classification TechniquesPrivacy-Preserving Technologies in Data

摘要

Data mining technology has given us new capabilities to identify correlations in large data sets. This introduces risks when the data is to be made public, but the correlations are private. We introduce a method for selectively removing individual values from a database to prevent the discovery of a set of rules, while preserving the data for other applications. The efficacy and complexity of this method are discussed. We also present an experiment showing an example of this methodology.