A linear inside-outside algorithm for correcting sequencing errors in structured RNAs

Vladimir Reinharz, Yann Ponty, Jérôme Waldispühl

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

1 Scopus citation

Abstract

Analysis of the sequence-structure relationship in RNA molecules is essential to evolutionary studies but also to concrete applications such as error-correction methodologies in sequencing technologies. The prohibitive sizes of the mutational and conformational landscapes, combined with the volume of data to process, require efficient algorithms to compute sequence-structure properties. More specifically, here we aim to calculate which mutations most increase the likelihood of a sequence with respect to a given structure and RNA family. In this paper, we introduce RNApyro, an efficient inside-outside algorithm, linear in time and space, that computes exact mutational probabilities under secondary structure and evolutionary constraints given as a multiple sequence alignment with a consensus structure. We develop a scoring scheme combining classical stacking base pair energies with novel isostericity scales, and apply our techniques to correct point-wise errors in 5S rRNA sequences. Our results suggest that RNApyro is a promising algorithm to complement existing tools in the NGS error-correction pipeline.

Original language: English
Title of host publication: Research in Computational Molecular Biology - 17th Annual International Conference, RECOMB 2013, Proceedings
Pages: 199-211
Number of pages: 13
DOIs
State: Published - 3 Apr 2013
Externally published: Yes
Event: 17th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2013 - Beijing, China
Duration: 7 Apr 2013 to 10 Apr 2013

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 7821 LNBI
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 17th Annual International Conference on Research in Computational Molecular Biology, RECOMB 2013
Country/Territory: China
City: Beijing
Period: 7/04/13 to 10/04/13

Keywords

  • RNA
  • mutations
  • secondary structure

ASJC Scopus subject areas

  • Theoretical Computer Science
  • General Computer Science
