Efficiently mining long patterns from databases (paper)

1998, ACM SIGMOD Record. Cited by: 396
Data Mining Algorithms and Applications; Rough Sets and Fuzzy Logic; Imbalanced Data Classification Techniques

Abstract

We present a pattern-mining algorithm that scales roughly linearly in the number of maximal patterns embedded in a database irrespective of the length of the longest pattern. In comparison, previous algorithms based on Apriori scale exponentially with longest pattern length. Experiments on real data show that when the patterns are long, our algorithm is more efficient by an order of magnitude or more.
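To make the scaling claim concrete, here is a small illustrative sketch. It is not the paper's algorithm: it enumerates frequent itemsets by brute force on a made-up toy database, purely to show why counting every frequent pattern grows exponentially with pattern length while the number of maximal patterns can stay small. A frequent pattern of length l has 2^l - 1 non-empty subsets, and every one of them is also frequent. The function names and the toy data are assumptions introduced for this example.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_support):
    """Brute-force enumeration of all frequent itemsets (illustration only)."""
    items = sorted(set().union(*transactions))
    frequent = []
    for k in range(1, len(items) + 1):
        found_any = False
        for candidate in combinations(items, k):
            cset = frozenset(candidate)
            # Support = number of transactions containing the candidate.
            support = sum(1 for t in transactions if cset <= t)
            if support >= min_support:
                frequent.append(cset)
                found_any = True
        if not found_any:
            # Downward closure: if no k-itemset is frequent,
            # no larger itemset can be frequent either.
            break
    return frequent


def maximal_itemsets(frequent):
    """Keep only frequent itemsets with no frequent proper superset."""
    return [s for s in frequent if not any(s < other for other in frequent)]


def toy_database(pattern_length, copies=3):
    """Hypothetical database: one long pattern repeated `copies` times."""
    pattern = frozenset(range(pattern_length))
    return [pattern] * copies


if __name__ == "__main__":
    for length in (3, 6, 9):
        db = toy_database(length)
        freq = frequent_itemsets(db, min_support=2)
        maxi = maximal_itemsets(freq)
        # Frequent itemsets number 2**length - 1; there is one maximal itemset.
        print(length, len(freq), len(maxi))
```

In this toy setting the count of frequent itemsets goes 7, 63, 511 as the pattern length goes 3, 6, 9, while the count of maximal itemsets stays at 1. Any method that must visit every frequent itemset pays the exponential cost; a method whose work tracks the maximal patterns does not.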