Fiser Bioinformatics 2003
Fiser Bioinformatics 2003
ABSTRACT
Summary: ModLoop is a web server for automated modeling
of loops in protein structures. The input is the atomic coordinates of the protein structure in the Protein Data Bank format,
and the specification of the starting and ending residues of one
or more segments to be modeled, containing no more than
20 residues in total. The output is the coordinates of the nonhydrogen atoms in the modeled segments. A user provides
the input to the server via a simple web interface, and receives
the output by e-mail. The server relies on the loop modeling
routine in MODELLER that predicts the loop conformations by
satisfaction of spatial restraints, without relying on a database
of known protein structures. For a rapid response, ModLoop
runs on a cluster of Linux PC computers.
Availability: The server is freely accessible to academic users
at http://salilab.org/modloop
Contact: andras@fiserlab.org
2500
(Fiser et al., 2002; Al Lazikani et al., 2001) in studying interactions between the protein and its ligands. Loop modeling
can also be useful in refining low- and medium-resolution
structures determined by X-ray crystallography and NMR
spectroscopy.
Prediction of loop conformations by optimization of an
objective function was implemented in MODELLER (Fiser
et al., 2000). The method optimizes the positions of all
non-hydrogen atoms of a loop in a fixed environment. The
optimization relies on a protocol consisting of the conjugate gradient minimization and molecular dynamics simulation
with simulated annealing. The pseudo-energy function contains terms from a molecular mechanics force field as well as
restraints based on statistical distributions derived from known
protein structures. Bonds, angles, some dihedral angles and
improper dihedral angles are restrained by the corresponding terms in the CHARMM-22 potential function (MacKerell
et al., 1998). The mainchain and sidechain dihedral angles
as well as non-bonded atom pairs are restrained by statistical
potentials extracted from many known protein structures (Sali
and Blundell, 1993; Sali and Overington, 1994; Fiser et al.,
2000; Melo and Feytmans, 1997).
Evaluation of the method relied on 40 randomly selected
loops of known structure at each length from 1 to 14 residues.
The accuracy was determined by building loops both in the
native and distorted loop environments because only approximate loop environments are available in real comparative
modeling applications. The errors in loop predictions increase
with loop length and environment distortion (Fig. 1).
The modeling protocol generates a number of independently optimized conformations, starting with random initial
conformations. The final loop prediction is the optimized conformation that has the lowest pseudo-energy score. Currently,
the number of independent optimizations is limited to 300.
Because each individual optimization typically takes a few
minutes of CPU time on a Pentium processor, the ModLoop
web server runs on a cluster of Pentium nodes.
REFERENCES
Al Lazikani,B., Jung,J., Xiang,Z. and Honig,B. (2001) Protein
structure prediction. Curr. Opin. Chem. Biol., 5, 5156.
2501