Streaming-data algorithms for high-quality clustering 论文
2003引用 579
Advanced Clustering Algorithms ResearchData Stream Mining TechniquesAdvanced Database Systems and Queries
摘要
Streaming data analysis has recently attracted attention in numerous applications including telephone records, Web documents and click streams. For such analysis, single-pass algorithms that consume a small amount of memory are critical. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthetic and real data streams.