Protein fold recognition using sequence-derived predictions

Daniel Fischer, David Eisenberg

Research output: Contribution to journalArticlepeer-review

330 Scopus citations

Abstract

In protein fold recognition, one assigns a probe amino acid sequence of unknown structure to one of a library of target 3D structures. Correct assignment depends on effective scoring of the probe sequence for its compatibility with each of the target structures. Here we show that, in addition to the amino acid sequence of the probe, sequence-derived properties of the probe sequence (such as the predicted secondary structure) are useful in fold assignment. The additional measure of compatibility between probe and target is the level of agreement between the predicted secondary structure of the probe and the known secondary structure of the target fold. That is, we recommend a sequence-structure compatibility function that combines previously developed compatibility functions (such as the 3D-1D scores of Bowie et al. [1991] or sequence-sequence replacement tables) with the predicted secondary structure of the probe sequence. The effect on fold assignment of adding predicted secondary structure is evaluated here by using a benchmark set of proteins (Fischer et al., 1996a). The 3D structures of the probe sequences of the benchmark are actually known, but are ignored by our method. The results show that the inclusion of the predicted secondary structure improves fold assignment by about 25%. The results also show that, if the true secondary structure of the probe were known, correct fold assignment would increase by an additional 8-32%. We conclude that incorporating sequence-derived predictions significantly improves assignment of sequences to known 3D folds. Finally, we apply the new method to assign folds to sequences in the SWISSPROT database; six fold assignments are given that are not detectable by standard sequence-sequence comparison methods; for two of these, the fold is known from X-ray crystallography and the fold assignment is correct.

Original languageEnglish
Pages (from-to)947-955
Number of pages9
JournalProtein Science
Volume5
Issue number5
DOIs
StatePublished - 1 Jan 1996
Externally publishedYes

Keywords

  • fold-recognition performance assessment benchmark
  • protein fold recognition
  • secondary structure prediction

ASJC Scopus subject areas

  • Biochemistry
  • Molecular Biology

Fingerprint

Dive into the research topics of 'Protein fold recognition using sequence-derived predictions'. Together they form a unique fingerprint.

Cite this