Stochastic context-free grammers for tRNA modeling 论文

1994Nucleic Acids Research引用 390顶会
RNA and protein synthesis mechanismsRNA modifications and cancerAlgorithms and Data Compression

详细信息

发表期刊/会议
Nucleic Acids Research
发表日期
1994-01-01
发表年份
1994

关键词

RNA and protein synthesis mechanismsRNA modifications and cancerAlgorithms and Data Compression

摘要

Stochastic context-free grammars (SCFGs) are applied to the problems of folding, aligning and modeling families of tRNA sequences. SCFGs capture the sequences' common primary and secondary structure and generalize the hidden Markov models (HMMs) used in related work on protein and DNA. Results show that after having been trained on as few as 20 tRNA sequences from only two tRNA subfamilies (mitochondrial and cytoplasmic), the model can discern general tRNA from similar-length RNA sequences of other kinds, can find secondary structure of new tRNA sequences, and can produce multiple alignments of large sets of tRNA sequences. Our results suggest potential improvements in the alignments of the D- and T-domains in some mitochondrial tRNAs that cannot be fit into the canonical secondary structure.