TY - GEN
T1 - Sparse RNA folding
T2 - 20th Annual Symposium on Combinatorial Pattern Matching, CPM 2009
AU - Backofen, Rolf
AU - Tsur, Dekel
AU - Zakov, Shay
AU - Ziv-Ukelson, Michal
PY - 2009/11/9
Y1 - 2009/11/9
N2 - The classical algorithm for RNA single strand folding requires O(n Z) time and O(n 2) space, where n denotes the length of the input sequence and Z is a sparsity parameter that satisfies n ≤ Z ≤ n 2. We show how to reduce the space complexity of this algorithm. The space reduction is based on the observation that some solutions for subproblems are not examined after a certain stage of the algorithm, and may be discarded from memory. This yields an O(nZ) time and O(Z) space algorithm, that outputs both the cardinality of the optimal folding as well as a corresponding secondary structure. The space-efficient approach also extends to the related RNA simultaneous alignment with folding problem, and can be applied to reduce the space complexity of the fastest algorithm for this problem from O(n 2 m 2+ Z̃) down to Z̃ where n and m denote the lengths of the input sequences to be aligned, and is a sparsity parameter that satisfies n m ≤Z̃ ≤ n 2 m 2. In addition, we also show how to speed up the base-pairing maximization variant of RNA single strand folding. The speed up is achieved by combining two independent existing techniques, which restrict the number of expressions that need to be examined in bottleneck computations of these algorithms. This yields an O(LZ) time and O(Z) space algorithm, where L denotes the maximum cardinality of a folding of the input sequence. Additional online supporting material may be found at: http://www.cs.bgu.ac.il/zakovs/ RNAfold/CPM09-supporting-material.pdf
AB - The classical algorithm for RNA single strand folding requires O(n Z) time and O(n 2) space, where n denotes the length of the input sequence and Z is a sparsity parameter that satisfies n ≤ Z ≤ n 2. We show how to reduce the space complexity of this algorithm. The space reduction is based on the observation that some solutions for subproblems are not examined after a certain stage of the algorithm, and may be discarded from memory. This yields an O(nZ) time and O(Z) space algorithm, that outputs both the cardinality of the optimal folding as well as a corresponding secondary structure. The space-efficient approach also extends to the related RNA simultaneous alignment with folding problem, and can be applied to reduce the space complexity of the fastest algorithm for this problem from O(n 2 m 2+ Z̃) down to Z̃ where n and m denote the lengths of the input sequences to be aligned, and is a sparsity parameter that satisfies n m ≤Z̃ ≤ n 2 m 2. In addition, we also show how to speed up the base-pairing maximization variant of RNA single strand folding. The speed up is achieved by combining two independent existing techniques, which restrict the number of expressions that need to be examined in bottleneck computations of these algorithms. This yields an O(LZ) time and O(Z) space algorithm, where L denotes the maximum cardinality of a folding of the input sequence. Additional online supporting material may be found at: http://www.cs.bgu.ac.il/zakovs/ RNAfold/CPM09-supporting-material.pdf
UR - http://www.scopus.com/inward/record.url?scp=70350639137&partnerID=8YFLogxK
U2 - 10.1007/978-3-642-02441-2_22
DO - 10.1007/978-3-642-02441-2_22
M3 - Conference contribution
AN - SCOPUS:70350639137
SN - 3642024408
SN - 9783642024405
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 249
EP - 262
BT - Combinatorial Pattern Matching - 20th Annual Symposium, CPM 2009, Proceedings
Y2 - 22 June 2009 through 24 June 2009
ER -