Nonparametric genetic clustering: comparison of validity indices 论文
2001IEEE Transactions on Systems Man and Cybernetics Part C (Applications and Reviews)引用 267
Advanced Clustering Algorithms ResearchGene expression and cancer classificationData Mining Algorithms and Applications
摘要
A variable-string-length genetic algorithm (GA) is used for developing a novel nonparametric clustering technique when the number of clusters is not fixed a-priori. Chromosomes in the same population may now have different lengths since they encode different number of clusters. The crossover operator is redefined to tackle the concept of variable string length. A cluster validity index is used as a measure of the fitness of a chromosome. The performance of several cluster validity indices, namely the Davies-Bouldin (1979) index, Dunn's (1973) index, two of its generalized versions and a recently developed index, in appropriately partitioning a data set, are compared.