A Simple Linear Time (1+ ∊) -Approximation Algorithm for k-Means Clustering in Any Dimensions 论文

2004引用 248
Data Management and AlgorithmsAdvanced Clustering Algorithms ResearchAlgorithms and Data Compression

摘要

We present the first linear time (1 + /spl epsiv/)-approximation algorithm for the k-means problem for fixed k and /spl epsiv/. Our algorithm runs in O(nd) time, which is linear in the size of the input. Another feature of our algorithm is its simplicity - the only technique involved is random sampling.