Using structural and evolutionary information to detect and correct pyrosequencing errors in noncoding RNAs

Vladimir Reinharz, Yann Ponty, Jérôme Waldispühl

Research output: Contribution to journalArticlepeer-review

1 Scopus citations

Abstract

The analysis of the sequence-structure relationship in RNA molecules is not only essential for evolutionary studies but also for concrete applications such as error-correction in next generation sequencing (NGS) technologies. The prohibitive sizes of the mutational and conformational landscapes, combined with the volume of data to process, require efficient algorithms to compute sequence-structure properties. In this article, we address the correction of NGS errors by calculating which mutations most increase the likelihood of a sequence to a given structure and RNA family. We introduce RNApyro, an efficient, linear time and space inside-outside algorithm that computes exact mutational probabilities under secondary structure and evolutionary constraints given as a multiple sequence alignment with a consensus structure. We develop a scoring scheme combining classical stacking base-pair energies to novel isostericity scores and apply our techniques to correct pointwise errors in 5s and 16s rRNA sequences. Our results suggest that RNApyro is a promising algorithm to complement existing tools in the NGS error-correction pipeline.

Original languageEnglish
Pages (from-to)905-919
Number of pages15
JournalJournal of Computational Biology
Volume20
Issue number11
DOIs
StatePublished - 1 Nov 2013
Externally publishedYes

Keywords

  • RNA
  • mutations
  • secondary structure

ASJC Scopus subject areas

  • Modeling and Simulation
  • Molecular Biology
  • Genetics
  • Computational Mathematics
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'Using structural and evolutionary information to detect and correct pyrosequencing errors in noncoding RNAs'. Together they form a unique fingerprint.

Cite this