Abstract
With the advent of genome sequencing projects, the amino acid sequences of thousands of proteins are determined every year. Each of these protein sequences must be identified with its function and its 3-dimensional structure for us to gain a full understanding of the molecular biology of organisms. To meet this challenge, new methods are being developed for fold recognition, the computational assignment of newly determined amino acid sequences to 3-dimensional protein structures. These methods start with a library of known 3-dimensional target protein structures. The new probe sequence is then aligned to each target protein structure in the library and the compatibility of the sequence for that structure is score. If a target structure is found to have a significantly high compatibility score, it is assumed that the probe sequence folds in much the same way as the target structure. The fundamental assumptions of this approach are that many different sequences fold in similar ways and there is a relatively high probability that a new sequence possesses a previously observed fold. We review various approaches to fold recognition dud break down the process into its main steps: creation of a library of target folds; representation of the folds; alignment of the probe sequence to a target fold using a sequence-to- structure compatibility scoring function; and assessment of significance of compatibility. We emphasize that even though this new field of fold recognition has made rapid progress, technical problems remain to be solved in most of the steps. Standard benchmarks may help identify the problem steps and find solutions to the problems.
Original language | English |
---|---|
Pages (from-to) | 126-136 |
Number of pages | 11 |
Journal | FASEB Journal |
Volume | 10 |
Issue number | 1 |
DOIs | |
State | Published - 1 Jan 1996 |
Externally published | Yes |
Keywords
- fold recognition
- inverted protein folding
ASJC Scopus subject areas
- Biotechnology
- Biochemistry
- Molecular Biology
- Genetics