TY - JOUR
T1 - Automated identification of RNA conformational motifs
T2 - Theory and application to the HM LSU 23S rRNA
AU - Hershkovitz, Eli
AU - Tannenbaum, Emmanuel
AU - Howerton, Shelley B.
AU - Sheth, Ajay
AU - Tannenbaum, Allen
AU - Williams, Loren Dean
N1 - Funding Information:
This work was partially supported by grants from NSF, AFOSR, ARO, MURI and NIH. The authors thank Drs Nick Hud, Jane Richardson and Laura Murray for helpful discussions.
PY - 2003/11/1
Y1 - 2003/11/1
N2 - We develop novel methods for recognizing and cataloging conformational states of RNA, and for discovering statistical rules governing those states. We focus on the conformation of the large ribosomal subunit from Haloarcula marismortui. The two approaches described here involve torsion matching and binning. Torsion matching is a pattern-recognition code which finds structural repetitions. Binning is a classification technique based on distributional models of the data. In comparing the results of the two methods we have tested the hypothesis that the conformation of a very large complex RNA molecule can be described accurately by a limited number of discrete conformational states. We identify and eliminate extraneous and redundant information without losing accuracy. We conclude, as expected, that four of the torsion angles contain the overwhelming bulk of the structural information. That information is not significantly compromised by binning the continuous torsional information into a limited number of discrete values. The correspondence between torsion matching and binning is 99% (per residue). Binning, however, does have several advantages. In particular, we demonstrate that the conformation of a large complex RNA molecule can be represented by a small alphabet. In addition, the binning method lends itself to a natural graphical representation using trees.
AB - We develop novel methods for recognizing and cataloging conformational states of RNA, and for discovering statistical rules governing those states. We focus on the conformation of the large ribosomal subunit from Haloarcula marismortui. The two approaches described here involve torsion matching and binning. Torsion matching is a pattern-recognition code which finds structural repetitions. Binning is a classification technique based on distributional models of the data. In comparing the results of the two methods we have tested the hypothesis that the conformation of a very large complex RNA molecule can be described accurately by a limited number of discrete conformational states. We identify and eliminate extraneous and redundant information without losing accuracy. We conclude, as expected, that four of the torsion angles contain the overwhelming bulk of the structural information. That information is not significantly compromised by binning the continuous torsional information into a limited number of discrete values. The correspondence between torsion matching and binning is 99% (per residue). Binning, however, does have several advantages. In particular, we demonstrate that the conformation of a large complex RNA molecule can be represented by a small alphabet. In addition, the binning method lends itself to a natural graphical representation using trees.
UR - http://www.scopus.com/inward/record.url?scp=0344011107&partnerID=8YFLogxK
U2 - 10.1093/nar/gkg835
DO - 10.1093/nar/gkg835
M3 - Article
C2 - 14576313
AN - SCOPUS:0344011107
SN - 0305-1048
VL - 31
SP - 6249
EP - 6257
JO - Nucleic Acids Research
JF - Nucleic Acids Research
IS - 21
ER -