A study of accessible motifs and RNA folding complexity

Ydo Wexler, Chaya Zilberstein, Michal Ziv-Ukelson

Research output: Contribution to journalArticlepeer-review

56 Scopus citations

Abstract

mRNA molecules are folded in the cells and therefore many of their substrings may actually be inaccessible to protein and microRNA binding. The need to apply an accessibility criterion to the task of genome-wide mRNA motif discovery raises the challenge of overcoming the core O (n3) factor imposed by the time complexity of the currently best known algorithms for RNA secondary structure prediction. We speed up the dynamic programming algorithms that are standard for RNA folding prediction. Our new approach significantly reduces the computations without sacrificing the optimality of the results, yielding an expected time complexity of O(n2ψ(n)), where ψ(n) is shown to be constant on average under standard polymer folding models. A benchmark analysis confirms that in practice the runtime ratio between the previous approach and the new algorithm indeed grows linearly with increasing sequence size. The fast new RNA folding algorithm is utilized for genome-wide discovery of accessible cis-regulatory motifs in data sets of ribosomal densities and decay rates of S. cerevisiae genes and to the mining of exposed binding sites of tissue-specific microRNAs in A. thaliana.

Original languageEnglish
Pages (from-to)856-872
Number of pages17
JournalJournal of Computational Biology
Volume14
Issue number6
DOIs
StatePublished - 1 Jul 2007
Externally publishedYes

Keywords

  • Average case complexity
  • Binding sites
  • Dynamic programming
  • Optimal solution
  • RNA folding

ASJC Scopus subject areas

  • Modeling and Simulation
  • Molecular Biology
  • Genetics
  • Computational Mathematics
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'A study of accessible motifs and RNA folding complexity'. Together they form a unique fingerprint.

Cite this