Efficiently mining frequent trees in a forest: algorithms and applications (paper)

2005, IEEE Transactions on Knowledge and Data Engineering. Citations: 310

Data Mining Algorithms and Applications; Algorithms and Data Compression; Genomics and Phylogenetic Studies

Biotechnology

Abstract

Mining frequent trees is very useful in domains like bioinformatics, Web mining, mining semistructured data, etc. We formulate the problem of mining (embedded) subtrees in a forest of rooted, labeled, and ordered trees. We present TREEMINER, a novel algorithm to discover all frequent subtrees in a forest, using a new data structure called scope-list. We contrast TREEMINER with a pattern-matching tree mining algorithm (PATTERNMATCHER), and we also compare it with TREEMINERD, which counts only distinct occurrences of a pattern. We conduct detailed experiments to test the performance and scalability of these methods. We also use tree mining to analyze RNA structure and phylogenetics data sets from the bioinformatics domain.
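The abstract names the scope-list but does not define it. As a rough illustration only, the sketch below assumes that a node's scope is the interval of preorder positions covered by its subtree, so that ancestor/descendant and left-of tests between nodes of a rooted, ordered tree reduce to interval comparisons. The nested `(label, children)` encoding and the helper names `scopes`, `is_ancestor`, and `is_left_of` are hypothetical and are not taken from the paper.

```python
from typing import List, Tuple

Scope = Tuple[int, int]
# Assumed encoding for illustration: a tree is a (label, children) pair.
Tree = Tuple[str, list]


def scopes(tree: Tree) -> List[Tuple[str, Scope]]:
    """Return (label, (lo, hi)) for every node, in preorder.

    lo is the node's own preorder position; hi is the preorder
    position of the last node in its subtree (assumed definition).
    """
    out: list = []

    def visit(node: Tree) -> None:
        label, children = node
        lo = len(out)          # preorder position of this node
        out.append(None)       # reserve the slot before visiting children
        for child in children:
            visit(child)
        out[lo] = (label, (lo, len(out) - 1))

    visit(tree)
    return out


def is_ancestor(a: Scope, b: Scope) -> bool:
    """True if the node with scope a is a proper ancestor of the node with scope b."""
    return a[0] < b[0] and b[1] <= a[1]


def is_left_of(a: Scope, b: Scope) -> bool:
    """True if the subtree with scope a ends before the subtree with scope b starts."""
    return a[1] < b[0]


if __name__ == "__main__":
    # A has children B and D; B has a single child C.
    demo = ("A", [("B", [("C", [])]), ("D", [])])
    for label, scope in scopes(demo):
        print(label, scope)
```

Under this assumption, an embedded (ancestor/descendant) relationship such as A over C can be checked without walking the tree, which is the kind of test a scope-based representation makes cheap.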

Authors

Mohammed J. Zaki
