Clustering Categorical Data Using Silhouette Coefficient as a Relocating Measure 论文

2007引用 243
Advanced Clustering Algorithms ResearchData Management and AlgorithmsData Mining Algorithms and Applications

摘要

Cluster analysis is an unsupervised learning method that constitutes a cornerstone of an intelligent data analysis process. Clustering categorical data is an important research area data mining. In this paper we propose a novel algorithm to cluster categorical data. Based on the minimum dissimilarity value objects are grouped into cluster. In the merging process, the objects are relocated using silhouette coefficient. Experimental results show that the proposed method is efficient.