A few logs suffice to build (almost) all trees (I) 论文

1999Random Structures and Algorithms引用 226
Genomics and Phylogenetic StudiesAlgorithms and Data CompressionGenome Rearrangement Algorithms

摘要

A phylogenetic tree, also called an “evolutionary tree,” is a leaf-labeled tree which represents the evolutionary history for a set of species, and the construction of such trees is a fundamental problem in biology. Here we address the issue of how many sequence sites are required in order to recover the tree with high probability when the sites evolve under standard Markov-style i.i.d. mutation models. We provide analytic upper and lower bounds for the required sequence length, by developing a new polynomial time algorithm. In particular, we show when the mutation probabilities are bounded the required sequence length can grow surprisingly slowly (a power of log n) in the number n of sequences, for almost all trees. ©1999 John Wiley & Sons, Inc. Random Struct. Alg., 14, 153–184, 1999