Streaming-data algorithms for high-quality clustering (paper)

2003 · Cited by 579
Advanced Clustering Algorithms Research · Data Stream Mining Techniques · Advanced Database Systems and Queries

Abstract

Streaming data analysis has recently attracted attention in numerous applications including telephone records, Web documents and click streams. For such analysis, single-pass algorithms that consume a small amount of memory are critical. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthetic and real data streams.
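
To make the single-pass, small-memory constraint concrete, here is a minimal sketch of one generic way to cluster a stream: buffer a fixed-size chunk, reduce it to at most k weighted centers, keep only those centers, and re-cluster the retained centers whenever they exceed a budget. This is not necessarily the algorithm the paper describes. The clustering subroutine is plain weighted k-means (Lloyd iterations) chosen for brevity, and the names `weighted_kmeans`, `cluster_stream`, `chunk_size`, and `max_retained` are hypothetical.

```python
import random
from typing import Iterable, List, Sequence, Tuple

Point = Tuple[float, ...]
Weighted = Tuple[Point, float]  # (center, weight)


def _sq_dist(a: Sequence[float], b: Sequence[float]) -> float:
    return sum((x - y) ** 2 for x, y in zip(a, b))


def _nearest(p: Point, centers: List[Point]) -> int:
    return min(range(len(centers)), key=lambda i: _sq_dist(p, centers[i]))


def weighted_kmeans(items: List[Weighted], k: int, iters: int = 20,
                    seed: int = 0) -> List[Weighted]:
    """Lloyd iterations on weighted points; returns at most k weighted centers.

    Assumption: k-means stands in for whatever clustering subroutine
    a real streaming algorithm would use.
    """
    if len(items) <= k:
        return list(items)
    rng = random.Random(seed)
    centers = [p for p, _ in rng.sample(items, k)]
    for _ in range(iters):
        groups: List[List[Weighted]] = [[] for _ in centers]
        for p, w in items:
            groups[_nearest(p, centers)].append((p, w))
        updated = []
        for c, g in zip(centers, groups):
            total = sum(w for _, w in g)
            if total == 0:
                updated.append(c)  # empty cluster: leave its center in place
            else:
                updated.append(tuple(
                    sum(p[d] * w for p, w in g) / total for d in range(len(c))
                ))
        centers = updated
    # Each center's weight is the total weight of the items assigned to it.
    weights = [0.0] * len(centers)
    for p, w in items:
        weights[_nearest(p, centers)] += w
    return [(c, w) for c, w in zip(centers, weights) if w > 0]


def cluster_stream(stream: Iterable[Point], k: int, chunk_size: int = 200,
                   max_retained: int = 100) -> List[Weighted]:
    """Single pass over `stream`, holding roughly chunk_size + max_retained items."""
    retained: List[Weighted] = []
    chunk: List[Weighted] = []
    for point in stream:
        chunk.append((tuple(point), 1.0))
        if len(chunk) == chunk_size:
            # Summarize the chunk as weighted centers, then discard it.
            retained.extend(weighted_kmeans(chunk, k))
            chunk = []
            if len(retained) > max_retained:
                retained = weighted_kmeans(retained, k)
    # Fold in any partial final chunk and reduce to the final k centers.
    return weighted_kmeans(retained + chunk, k)
```

Each input point is read once and discarded as soon as its chunk has been summarized, so memory depends on `chunk_size` and `max_retained` rather than on the length of the stream. The weights record how many original points each retained center represents, so re-clustering the centers does not treat a center that summarizes many points the same as one that summarizes few.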