Finding composite regulatory patterns in DNA sequences 论文

2002Bioinformatics引用 335顶会
Genomics and Chromatin DynamicsGenomics and Phylogenetic StudiesAlgorithms and Data Compression

摘要

Abstract Pattern discovery in unaligned DNA sequences is a fundamental problem in computational biology with important applications in finding regulatory signals. Current approaches to pattern discovery focus on monad patterns that correspond to relatively short contiguous strings. However, many of the actual regulatory signals are composite patterns that are groups of monad patterns that occur near each other. A difficulty in discovering composite patterns is that one or both of the component monad patterns in the group may be ‘too weak’. Since the traditional monad-based motif finding algorithms usually output one (or a few) high scoring patterns, they often fail to find composite regulatory signals consisting of weak monad parts. In this paper, we present a MITRA (MIsmatch TRee Algorithm) approach for discovering composite signals. We demonstrate that MITRA performs well for both monad and composite patterns by presenting experiments over biological and synthetic data. Availability: MITRA is available at http://www.cs.columbia.edu/compbio/mitra/ Contact: eeskin@cs.columbia.edu Keywords: regulatory motif finding; pattern finding; dyad motifs.

相关事件

暂无数据

相关文章

暂无数据