0% found this document useful (0 votes)

49 views2 pages

Eyrich Bioinformatics 2001

This document describes EVA, a web server that automatically and continuously evaluates the performance of protein structure prediction methods. EVA downloads new protein structures from PDB each day and sends the sequences to prediction servers. It collects the predictions, evaluates the performance, and publishes weekly summaries on the web. So far EVA has evaluated over 3000 protein chains across four prediction categories (comparative modeling, threading, secondary structure, contact prediction). The goals are to provide large-scale, standardized evaluations that help both developers and users assess prediction methods.

Uploaded by

Naura Corporation

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

49 views2 pages

Eyrich Bioinformatics 2001

Uploaded by

Naura Corporation

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

Vol. 17 no.

12 2001
BIOINFORMATICS APPLICATIONS NOTE Pages 1242–1243

EVA: continuous automatic evaluation of protein

structure prediction servers
Volker A. Eyrich 1, Marc A. Martı́-Renom 2, Dariusz Przybylski 3,
Mallur S. Madhusudhan 2, András Fiser 2, Florencio Pazos 4,
Alfonso Valencia 4, Andrej Sali 2 and Burkhard Rost 3,∗
1 Columbia University, Department of Chemistry, 3000 Broadway MC 3136, New York,
NY 10027, USA, 2 The Rockefeller University, Laboratory of Molecular Biophysics,
Pels Family Center for Biochemistry and Structural Biology, 1230 York Avenue, New
York, NY 10021-6399, USA, 3 CUBIC Columbia University, Department of
Biochemistry and Molecular Biophysics, 650 West 168th Street, New York, NY 10032,
USA and 4 Protein Design Group, CNB-CSIC, Cantoblanco, Madrid 28049, Spain

Received on February 20, 2001; revised on May 28, 2001; accepted on July 4, 2001

ABSTRACT How well do experts predict protein structure? The

Summary: Evaluation of protein structure prediction CASP experiments attempt to address the problem of over-
methods is difficult and time-consuming. Here, we de- estimated performance (Zemla et al., 2001). Although
scribe EVA, a web server for assessing protein structure CASP resolves the bias resulting from using known pro-
prediction methods, in an automated, continuous and tein structures as targets, it has limitations. (1) The meth-
large-scale fashion. Currently, EVA evaluates the perfor- ods are ranked by human assessors who have to evaluate
mance of a variety of prediction methods available through thousands of predictions in 1–2 months (∼ 10 000 from
the internet. Every week, the sequences of the latest 160 groups for CASP4; Zemla et al., 2001). (2) Some as-
experimentally determined protein structures are sent pects of the assessments are not statistically significant be-
to prediction servers, results are collected, performance cause they are based on few proteins. (3) The assessments
is evaluated, and a summary is published on the web. cover only proteins determined in a period of about four
EVA has so far collected data for more than 3000 protein months every two years. (4) Users cannot always repro-
chains. These results may provide valuable insight to both duce CASP predictions, because programs or the required
developers and users of prediction methods. human expertise are not available. Effectively, CASP aims
Availability: http://cubic.bioc.columbia.edu/eva. at assessing how well experts can predict structure.
Contact: eva@cubic.bioc.columbia.edu
How well do computers predict protein structure?
EVALUATING PREDICTIONS IS CRUCIAL CAFASP has recently extended CASP by testing auto-
Correctly evaluating structure prediction is difficult. matic prediction servers on the CASP proteins (Fischer
Developers of prediction methods in bioinformatics et al., 1999). Although CAFASP aimed at evaluating
may significantly over-estimate their performance be- programs rather than experts, it is still limited to a
cause of the following reasons. First, it is difficult and small number of test proteins (Zemla et al., 2001). This
time-consuming to correctly separate data sets used for limitation prompted us to create EVA, a large-scale
developing and testing. Second, estimates of performance and continuously running web server that automat-
of the different methods are often based on different data ically assesses protein structure prediction servers
sets. This problem frequently originates from the rapid (http://cubic.bioc.columbia.edu/eva/doc/flow.html). The
growth of the sequence and structure databases. Third, aims of EVA are: (1) Evaluate continuously and auto-
single numbers are usually not sufficient to describe the matically blind predictions by all co-operating prediction
performance of a method. The lack of clarity is particu- servers. (2) Update the results on the web every week.
larly unfortunate at a time when an increasing number of (3) Enable developers, non-expert users, and reviewers
tools are made easily available through the internet and to determine the performance of the tested prediction
many of the users are not experts in the field of protein programs. (4) Compare prediction methods based on
structure prediction. identical and sufficiently large data sets. Similar aims are
also pursued by the LiveBench project (Rychlewski and
∗ To whom correspondence should be addressed. Fischer, 2000). Although EVA continues to grow, most

1242
c Oxford University Press 2001
EVA: evaluation of prediction servers

of these objectives have already been realised. We will into 3665 chains, 3130 (85%) of which were similar in
extend EVA in three additional ways: (i) test more servers, sequence to known structures and 535 (15%) of which
(ii) refine the evaluation of threading servers, and (iii) add were new (less than 30% sequence identity over more than
alternative structure alignment methods for evaluation. 100 residues). In comparative modelling, EVA evaluated
EVA is already downloading target sequences from PDB over 6600 models with common subsets for 303 chains.
prior to the release of their structures. In secondary structure prediction, EVA based its analysis
on over 30 000 individual predictions; 127 chains were
CURRENT IMPLEMENTATION OF EVA common to all methods, 348 to four methods. For both
Results in four prediction categories. Currently, EVA
of these categories, EVA evaluated most of the existing
evaluates four different categories of structure prediction
servers in the field on the largest protein sets ever. Details
servers (see EVA home page for URLs and list of servers):
about the evaluation are available on the EVA web site;
comparative modelling (3), threading (6), secondary
details about the predictions will be published elsewhere.
structure prediction (9), and inter-residue contact predic-
tion (4). Brief explanations about the methods are on the Additional resources: PSI-BLAST alignments and se-
EVA web site. quence unique subset of PDB. EVA also maintains a
Results are updated every week. Every day, EVA down- number of additional data resources. One resource is
loads the newest protein structures from PDB (Berman et a continuously updated list giving the largest subset of
al., 2000). The structures are added to a mySQL database, sequence-unique proteins in PDB (no protein in the set
sequences are extracted for every protein chain, and sent shares more than 33 identical residues over 100 residues
to each server by META-PP (Eyrich and Rost, 2000). aligned). This set now contains 2435 chains. Another
Predictions are collected and sent for evaluation to the resource contains over 5000 PSI-BLAST alignments for
EVA-satellites (comparative modelling: Rockefeller Uni- proteins added to PDB during the existence of EVA.
versity, contacts: CNB Madrid, and all other: Columbia
University). Depending on the category, the assessments ACKNOWLEDGEMENTS
are made available within hours to days. The central EVA We are particularly grateful to Phil Bourne (UCSD)
site at CUBIC downloads all HTML pages produced by and Kevin Karplus (UCSC) for their support. We
the satellites, and builds up the ‘latest week’ results that would also like to thank Arne Elofsson (Stockholm),
are then mirrored at the satellites (for a flowchart of EVA, Torsten Schwede, Nicolas Guex, and Manual Peitsch (all
see http://cubic.bioc.columbia.edu/eva/doc/flow.html). three form Glaxo, Geneva) for helpful discussions, and
Comparing: Identical data sets, major questions first! Nigel Brown (MRC, London) for his program MView.
EVA compares methods based only on identical data sets. Last not least, we are grateful to the developers who
This approach is essential for reliably ranking methods. permitted us to test their prediction servers. We apologise
However, it reduces the number of available proteins since to all whose servers we evaluated that we had to remove
not all predictions are available for all servers. Another their citations from this paper; they can be found at: http:
important feature of EVA is that it displays the results //cubic.bioc.columbia.edu/eva/doc/explain methods.html.
hierarchically, so that users get the ‘big picture’ first,
followed by information at increasingly higher levels of REFERENCES
detail upon request. Berman,H.M., Westbrook,J., Feng,Z., Gillliland,G., Bhat,T.N.,
Weissig,H., Shindyalov,I.N. and Bourne,P.E. (2000) The protein
Methods are not ranked based on too few test proteins! data bank. Nucleic Acids Res., 28, 235–242.
Since prediction accuracy varies between proteins, pub- Eyrich,V. and Rost,B. (2000) The META-PredictProtein
lished estimates for performance are averages over many server. WWW document (http://cubic.bioc.columbia.edu/
proteins, with some standard deviation. We use this stan- predictprotein/submit meta.html) CUBIC, Columbia University,
dard deviation to estimate the error of the average accuracy Department of Biochemistry & Molecular Biophysics.
as a function of the test set size. This is justified, since dif- Fischer,D., Barret,C., Bryson,K., Elofsson,A., Godzik,A., Jones,D.,
ferent prediction methods typically have similar standard Karplus,K.J., Kelley,L.A., MacCallum,R.M., Pawowski,K.,
deviations. For example, when a method correctly predicts Rost,B., Rychlewski,L. and Sternberg,M. (1999) CAFASP-1:
75% of the residues in a set of 16 proteins with a standard critical assessment of fully automated structure prediction
deviation of 10%, a difference relative to another method methods. Proteins, 3 (Suppl.), 209–217.
< 2.5% (Q = 10/sqrt (16)) is not significant. Thus, we Rychlewski,L. and Fischer,D. (2000) LiveBench: continuous bench-
marking of prediction servers. WWW document (http://BioInfo.
cannot distinguish between 75% and 73% accuracy.
PL/LiveBench/) http://BioInfo.PL/LiveBench/, IIMCB Warsaw.
Resource with over 40 000 predictions. 2996 new Zemla,A., Venclovas,C. and Fidelis,K. (2001) Protein structure pre-
protein structures have been added to PDB since EVA diction center. http://PredictionCenter.llnl.gov/, Lawrence Liver-
started in June 2000. The 2996 proteins were dissected more National Laboratory.

1243

Introduction To Pharmacology: Prof. Johnny S. Bacud JR., RPH, Mspharm Cand
No ratings yet
Introduction To Pharmacology: Prof. Johnny S. Bacud JR., RPH, Mspharm Cand
81 pages
Mogilski (2016) Staying Friends With An Ex - Sex and Dark Personality Traits Predict Motivations For Post-Relationship Friendship
No ratings yet
Mogilski (2016) Staying Friends With An Ex - Sex and Dark Personality Traits Predict Motivations For Post-Relationship Friendship
7 pages
UTS - Lec 6 - Unpacking The Self & Physical Self - Panganiban
No ratings yet
UTS - Lec 6 - Unpacking The Self & Physical Self - Panganiban
14 pages
HypnoDate Attract and Seduce Women With Hypnosis Mantesh PDF
No ratings yet
HypnoDate Attract and Seduce Women With Hypnosis Mantesh PDF
1 page
Protein Structure Prediction Thesis
100% (3)
Protein Structure Prediction Thesis
8 pages
TR 20211112 许锦波基于深度学习的蛋白质结构预测
No ratings yet
TR 20211112 许锦波基于深度学习的蛋白质结构预测
47 pages
Alkaloids From Amphibian Skin - A Tabulation of Over Eight-Hundred Compounds
No ratings yet
Alkaloids From Amphibian Skin - A Tabulation of Over Eight-Hundred Compounds
20 pages
Autoloop: A Novel Autoregressive Deep Learning Method For Protein Loop Prediction With High Accuracy
No ratings yet
Autoloop: A Novel Autoregressive Deep Learning Method For Protein Loop Prediction With High Accuracy
34 pages
FTN - 8 @AakashNeetards
No ratings yet
FTN - 8 @AakashNeetards
31 pages
DeepECA - An End-To-End Learning Framework For Protein Contact Prediction From A Multiple Sequence Alignment
No ratings yet
DeepECA - An End-To-End Learning Framework For Protein Contact Prediction From A Multiple Sequence Alignment
17 pages
CASP14 Progress Paper - Submitted
No ratings yet
CASP14 Progress Paper - Submitted
29 pages
Improved Protein Structure Prediction by Deep Learning Irrespective of Co-Evolution Information
No ratings yet
Improved Protein Structure Prediction by Deep Learning Irrespective of Co-Evolution Information
23 pages
Health Assessment in Nursing
No ratings yet
Health Assessment in Nursing
48 pages
알파폴드1논문
No ratings yet
알파폴드1논문
27 pages
Exploring The Alexander Technique: Its Central Hypothesis and Teaching Modalities
No ratings yet
Exploring The Alexander Technique: Its Central Hypothesis and Teaching Modalities
20 pages
Power Systems: Uspex A
No ratings yet
Power Systems: Uspex A
26 pages
Asking Jinn Is Shirk PDF
No ratings yet
Asking Jinn Is Shirk PDF
30 pages
Eramian ProteinSci 2008
No ratings yet
Eramian ProteinSci 2008
14 pages
From PDB To AlphaFold2 and Beyond
No ratings yet
From PDB To AlphaFold2 and Beyond
13 pages
Alpha Fold
No ratings yet
Alpha Fold
16 pages
2019 - Evaluating Protein Transfer Learning With TAPE
No ratings yet
2019 - Evaluating Protein Transfer Learning With TAPE
20 pages
A Prot Protein Structure Modeling Using MSA Transformer
No ratings yet
A Prot Protein Structure Modeling Using MSA Transformer
11 pages
Accurate Protein Structure Prediction by Embeddings and Deep Learning Representations
No ratings yet
Accurate Protein Structure Prediction by Embeddings and Deep Learning Representations
9 pages
Kitaabu Swiyaam
No ratings yet
Kitaabu Swiyaam
18 pages
AlphaFold - Laterst - s41586 021 03819 2
No ratings yet
AlphaFold - Laterst - s41586 021 03819 2
12 pages
Worksheet 4.4
No ratings yet
Worksheet 4.4
4 pages
Base Paper
No ratings yet
Base Paper
14 pages
Zhang Et Al 2024 Protein Language Models Learn Evolutionary Statistics of Interacting Sequence Motifs
No ratings yet
Zhang Et Al 2024 Protein Language Models Learn Evolutionary Statistics of Interacting Sequence Motifs
9 pages
Basic Immunology Introduction:: Hypersensitivity Reactions
No ratings yet
Basic Immunology Introduction:: Hypersensitivity Reactions
10 pages
Personal Relationship
No ratings yet
Personal Relationship
35 pages
s41586 021 03819 2 - Reference
No ratings yet
s41586 021 03819 2 - Reference
16 pages
Ab Initio
No ratings yet
Ab Initio
11 pages
Paper Protein Structure Prediction
No ratings yet
Paper Protein Structure Prediction
8 pages
MSP Rocks and Minerals Lessons
No ratings yet
MSP Rocks and Minerals Lessons
6 pages
Prediction of Inter-Residue Multiple Distances
No ratings yet
Prediction of Inter-Residue Multiple Distances
9 pages
Highly Accurate Protein Structure Prediction With Alphafold: Article
No ratings yet
Highly Accurate Protein Structure Prediction With Alphafold: Article
12 pages
Orange Book 6
No ratings yet
Orange Book 6
13 pages
Ethnobotanical Knowledge of Philippine Lowland Farmers and Its Application in Agroforestry
No ratings yet
Ethnobotanical Knowledge of Philippine Lowland Farmers and Its Application in Agroforestry
22 pages
Anaesthesia For Bleeding Tonsil
No ratings yet
Anaesthesia For Bleeding Tonsil
15 pages
Example Multipage
No ratings yet
Example Multipage
7 pages
Uniqueness in Translating Arabic Hagiography of Shaikh Abd Al-Qādir Al-Jailānī: The Case of
No ratings yet
Uniqueness in Translating Arabic Hagiography of Shaikh Abd Al-Qādir Al-Jailānī: The Case of
8 pages
17 Cordata
No ratings yet
17 Cordata
23 pages
Proteins - 1999 - Simons - Ab Initio Protein Structure Prediction of CASP III Targets Using ROSETTA
No ratings yet
Proteins - 1999 - Simons - Ab Initio Protein Structure Prediction of CASP III Targets Using ROSETTA
6 pages
2014 - A Word of Caution About Biological Inference - Revisiting Cysteine Covalent State Predictions
No ratings yet
2014 - A Word of Caution About Biological Inference - Revisiting Cysteine Covalent State Predictions
5 pages
Front - Matter Pauline Doran Book PDF
No ratings yet
Front - Matter Pauline Doran Book PDF
8 pages
TS EAMCET 3-Months
No ratings yet
TS EAMCET 3-Months
5 pages
Gates
No ratings yet
Gates
2 pages
Borodovsky Bioinformatics 2001 PDF
No ratings yet
Borodovsky Bioinformatics 2001 PDF
3 pages
Improved Protein Structure Prediction Using Potentials From Deep Learning
No ratings yet
Improved Protein Structure Prediction Using Potentials From Deep Learning
22 pages
Classification of Protein Sequences For Cancer Diagnosis Using Artificial Neural Network IJERTV12IS120095
No ratings yet
Classification of Protein Sequences For Cancer Diagnosis Using Artificial Neural Network IJERTV12IS120095
5 pages
Bauer 2013 Amphidromy
No ratings yet
Bauer 2013 Amphidromy
19 pages
General Pathology - Topical Past Papers-1
100% (1)
General Pathology - Topical Past Papers-1
21 pages
Skyfire AC Datasheet
No ratings yet
Skyfire AC Datasheet
2 pages
Mepolizumab For Treatment of Adolescents and Adults With Eosinophilic Oesophagitis: A Multicentre, Randomised, Double-Blind, Placebo-Controlled Clinical Trial
No ratings yet
Mepolizumab For Treatment of Adolescents and Adults With Eosinophilic Oesophagitis: A Multicentre, Randomised, Double-Blind, Placebo-Controlled Clinical Trial
10 pages
Research Output
No ratings yet
Research Output
21 pages
Bio Articles3
No ratings yet
Bio Articles3
14 pages
Why Is Alphafold Important
No ratings yet
Why Is Alphafold Important
2 pages
Rapid in Silico Directed Evolution by A Protein Languagemodel With EVOLVEpro
No ratings yet
Rapid in Silico Directed Evolution by A Protein Languagemodel With EVOLVEpro
21 pages
Highly Accurate Protein Structure Prediction With AlphaFold Nature
No ratings yet
Highly Accurate Protein Structure Prediction With AlphaFold Nature
1 page
Cell Cycle and Cancer Review
No ratings yet
Cell Cycle and Cancer Review
5 pages
Protein STR
No ratings yet
Protein STR
63 pages
Deep Learning in Protein Structural
No ratings yet
Deep Learning in Protein Structural
23 pages
791 Ak Lecture6
No ratings yet
791 Ak Lecture6
50 pages
CESI Price List Dec81
No ratings yet
CESI Price List Dec81
1 page
Flip Benchmark Fitness NeurIPS - 2021
No ratings yet
Flip Benchmark Fitness NeurIPS - 2021
14 pages
TWINKL Knowledge Organiser
No ratings yet
TWINKL Knowledge Organiser
2 pages
Unit 5, Novel Drug Delivery Systems, B Pharmacy 7th Sem, Carewell Pharma
No ratings yet
Unit 5, Novel Drug Delivery Systems, B Pharmacy 7th Sem, Carewell Pharma
30 pages
s41586 021 03828 1 - Reference
No ratings yet
s41586 021 03828 1 - Reference
23 pages
Xóa 2
No ratings yet
Xóa 2
15 pages
Daily Test I (Report Text)
No ratings yet
Daily Test I (Report Text)
10 pages
SESNet
No ratings yet
SESNet
19 pages
Baker PRO 99 CASP3 Ab Initio PDF
No ratings yet
Baker PRO 99 CASP3 Ab Initio PDF
6 pages
Sali Structure 2002
No ratings yet
Sali Structure 2002
2 pages
Protein Contact Prediction From Amino Acid Co-Evolution Using Convolutional Networks For Graph-Valued Images
No ratings yet
Protein Contact Prediction From Amino Acid Co-Evolution Using Convolutional Networks For Graph-Valued Images
9 pages
Science - Secondary - AQA - Combined Science - Foundation - 10-05-2025
No ratings yet
Science - Secondary - AQA - Combined Science - Foundation - 10-05-2025
129 pages
John Moult, Krzysztof Fidelis, CASP
No ratings yet
John Moult, Krzysztof Fidelis, CASP
4 pages
Fiser Bioinformatics 2003
No ratings yet
Fiser Bioinformatics 2003
2 pages
1 s2.0 S0010482522004784 Main
No ratings yet
1 s2.0 S0010482522004784 Main
27 pages
Adult in Take
No ratings yet
Adult in Take
9 pages
Ynaaaa
No ratings yet
Ynaaaa
5 pages
Reiki Level I Certification
No ratings yet
Reiki Level I Certification
1 page
Protein Function
No ratings yet
Protein Function
12 pages
Bioinformatics: 3D-Jury: A Simple Approach To Improve Protein Structure Predictions
No ratings yet
Bioinformatics: 3D-Jury: A Simple Approach To Improve Protein Structure Predictions
4 pages
Espript/Endscript: Extracting and Rendering Sequence and 3D Information From Atomic Structures of Proteins
No ratings yet
Espript/Endscript: Extracting and Rendering Sequence and 3D Information From Atomic Structures of Proteins
4 pages
Protein Sructure Prediction Using Phyre - Kelly & Sternberg 2009
No ratings yet
Protein Sructure Prediction Using Phyre - Kelly & Sternberg 2009
9 pages
GKL 789
No ratings yet
GKL 789
10 pages
Fiser Bioinformatics 2003
No ratings yet
Fiser Bioinformatics 2003
2 pages
Phyre2 (English Version)
No ratings yet
Phyre2 (English Version)
3 pages
SSRN 4541252
No ratings yet
SSRN 4541252
25 pages
Acc Catalog - Corp - 0075 r2 Us
No ratings yet
Acc Catalog - Corp - 0075 r2 Us
48 pages
Experiment-7 (HOMOLOGY MODELING)
No ratings yet
Experiment-7 (HOMOLOGY MODELING)
12 pages
Protein Desin With Deep Learning
No ratings yet
Protein Desin With Deep Learning
9 pages
Gene Pridiction and Orf
No ratings yet
Gene Pridiction and Orf
34 pages
2010 Bioinformatics 26 687-688
No ratings yet
2010 Bioinformatics 26 687-688
2 pages
1.2 Plants Organ System
No ratings yet
1.2 Plants Organ System
32 pages
Towards best practice in the Archetype Development Process
From Everand
Towards best practice in the Archetype Development Process
Alberto Moreno Conde
No ratings yet
Answer (A, B, C or D) For Each Question. Write Your Answers in The Corresponding Numbered Boxes
No ratings yet
Answer (A, B, C or D) For Each Question. Write Your Answers in The Corresponding Numbered Boxes
9 pages
Icon 8 BEC Outdoor Hotspot 1
No ratings yet
Icon 8 BEC Outdoor Hotspot 1
1 page
Introduction to Bioinformatics Using Action Labs
From Everand
Introduction to Bioinformatics Using Action Labs
Jean-Louis Lassez
5/5 (1)
Bioinformatics: Merging Biology and Technology
From Everand
Bioinformatics: Merging Biology and Technology
Mani Devar
No ratings yet
Comprehensive Guide to BLAST: Definitive Reference for Developers and Engineers
From Everand
Comprehensive Guide to BLAST: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
Ceph Architecture and Administration: Definitive Reference for Developers and Engineers
From Everand
Ceph Architecture and Administration: Definitive Reference for Developers and Engineers
Richard Johnson
No ratings yet
ChaosBlade in Practice: The Complete Guide for Developers and Engineers
From Everand
ChaosBlade in Practice: The Complete Guide for Developers and Engineers
William Smith
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

Eyrich Bioinformatics 2001

Uploaded by

Eyrich Bioinformatics 2001

Uploaded by

Vol. 17 no.

EVA: continuous automatic evaluation of protein

ABSTRACT How well do experts predict protein structure? The

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.