Discrimination-aware data mining 论文

2008引用 677

Imbalanced Data Classification Techniques

作者

摘要

In the context of civil rights law, discrimination refers to unfair or unequal treatment of people based on membership to a category or a minority, without regard to individual merit. Rules extracted from databases by data mining techniques, such as classification or association rules, when used for decision tasks such as benefit or credit approval, can be discriminatory in the above sense. In this paper, the notion of discriminatory classification rules is introduced and studied. Providing a guarantee of non-discrimination is shown to be a non trivial task. A naive approach, like taking away all discriminatory attributes, is shown to be not enough when other background knowledge is available. Our approach leads to a precise formulation of the redlining problem along with a formal result relating discriminatory rules with apparently safe ones by means of background knowledge. An empirical assessment of the results on the German credit dataset is also provided.

作者查看全部 (3)

Franco Turini

Salvatore Ruggieri

Dino Pedreshi

Discrimination-aware data mining 论文

摘要

作者查看全部 (3)

相关技术查看全部 (1)

相关事件

相关文章