TY - GEN

T1 - Fast RNA structure alignment for crossing input structures

AU - Backofen, Rolf

AU - Landau, Gad M.

AU - Möhl, Mathias

AU - Tsur, Dekel

AU - Weimann, Oren

PY - 2009/11/9

Y1 - 2009/11/9

N2 - The complexity of pairwise RNA structure alignment depends on the structural restrictions assumed for both the input structures and the computed consensus structure. For arbitrarily crossing input and consensus structures, the problem is NP-hard. For non-crossing consensus structures, Jiang et al's algorithm [1] computes the alignment in O(n 2 m 2) time where n and m denote the lengths of the two input sequences. If also the input structures are non-crossing, the problem corresponds to tree editing which can be solved in O(m2n(1 + log n/m))time [2]. We present a new algorithm that solves the problem for d-crossing structures in O(d m 2 nlogn) time, where d is a parameter that is one for non-crossing structures, bounded by n for crossing structures, and much smaller than n on most practical examples. Crossing input structures allow for applications where the input is not a fixed structure but is given as base-pair probability matrices.

AB - The complexity of pairwise RNA structure alignment depends on the structural restrictions assumed for both the input structures and the computed consensus structure. For arbitrarily crossing input and consensus structures, the problem is NP-hard. For non-crossing consensus structures, Jiang et al's algorithm [1] computes the alignment in O(n 2 m 2) time where n and m denote the lengths of the two input sequences. If also the input structures are non-crossing, the problem corresponds to tree editing which can be solved in O(m2n(1 + log n/m))time [2]. We present a new algorithm that solves the problem for d-crossing structures in O(d m 2 nlogn) time, where d is a parameter that is one for non-crossing structures, bounded by n for crossing structures, and much smaller than n on most practical examples. Crossing input structures allow for applications where the input is not a fixed structure but is given as base-pair probability matrices.

KW - RNA

KW - Sequence structure alignment

KW - Simultaneous alignment and folding

UR - http://www.scopus.com/inward/record.url?scp=70350629169&partnerID=8YFLogxK

U2 - 10.1007/978-3-642-02441-2_21

DO - 10.1007/978-3-642-02441-2_21

M3 - Conference contribution

AN - SCOPUS:70350629169

SN - 3642024408

SN - 9783642024405

T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)

SP - 236

EP - 248

BT - Combinatorial Pattern Matching - 20th Annual Symposium, CPM 2009, Proceedings

T2 - 20th Annual Symposium on Combinatorial Pattern Matching, CPM 2009

Y2 - 22 June 2009 through 24 June 2009

ER -