Volume 29, Number 5—May 2023
Online Report
US National Institutes of Health Prioritization of SARS-CoV-2 Variants
Abstract
Since late 2020, SARS-CoV-2 variants have regularly emerged with competitive and phenotypic differences from previously circulating strains, sometimes with the potential to escape from immunity produced by prior exposure and infection. The Early Detection group is one of the constituent groups of the US National Institutes of Health National Institute of Allergy and Infectious Diseases SARS-CoV-2 Assessment of Viral Evolution program. The group uses bioinformatic methods to monitor the emergence, spread, and potential phenotypic properties of emerging and circulating strains to identify the most relevant variants for experimental groups within the program to phenotypically characterize. Since April 2021, the group has prioritized variants monthly. Prioritization successes include rapidly identifying most major variants of SARS-CoV-2 and providing experimental groups within the National Institutes of Health program easy access to regularly updated information on the recent evolution and epidemiology of SARS-CoV-2 that can be used to guide phenotypic investigations.
As part of the US National Institutes of Health (NIH) National Institute of Allergy and Infectious Diseases (NIAID) SARS-CoV-2 Assessment of Viral Evolution (SAVE) effort to combat the SARS-CoV-2 pandemic, the NIH NIAID SAVE Early Detection Group regularly prioritizes SARS-CoV-2 variants. The main goal of prioritization is to identify relevant lineages for phenotypic testing by experimental groups within the NIH SAVE network by integrating up-to-date epidemiologic, structural, and genetic information. Practically, this process involves regular contact with the experimental groups in weekly meetings and continued consideration of the types of experimental work the groups conduct, which includes characterization for replicative fitness, pathogenicity, and antigenic escape.
The SAVE Early Detection Group does not attempt to recapitulate the efforts of numerous public health organizations worldwide to designate variants of concern and variants of interest (1–3). Instead, the group aims to horizon scan for variants and substitutions: it focuses on early signals of lineages and substitutions that pose a public health threat on the basis of epidemiology and estimated phenotype, and it prepares for emergence of such lineages by providing information about relevant aspects of SARS-CoV-2 biology to enable faster assessment of future lineages.
The group publishes monthly lineage prioritizations for use by experimental collaborators. Identifying and prioritizing relevant lineages is difficult, and the group draws on the expertise of 9 constituent subgroups. The subgroups take different approaches to prioritizing variants, the results from which are pooled to form publicly available consensus rankings. Since April 2021, the group has produced 21 prioritizations.
Generating Consensus Rankings
Participating groups suggest new lineages, which may be novel lineages or sublineages containing specific additional substitutions on top of a designated Pango (4) lineage. For each lineage, the suggesting group provides the lineage name, substitutions, GISAID/GenBank accession numbers, number of sequences, epidemiologic information, and the reason for the suggestion (Table 1).
Ranking of Variants
Each group ranks newly suggested lineages and lineages from the previous prioritization (to incorporate new epidemiologic information). Four types of rank can be assigned (Table 2).
Generation of Consensus Ranking
Ranks are first transformed to numeric ranks (Table 3). Lineages are then ordered by mean rank to produce a consensus, which is split into prioritization categories. Generally, category 1 includes lineages that are either increasing or at high frequency or contain substitutions with evidence of a relevant phenotypic effect. Category 2 contains lineages that are potentially interesting because of epidemiology or sequence, and categories 3 and 4 (when present) are typically a record of other circulating minor lineages. Furthermore, lineages being studied by experimental groups are moved to the ongoing category, and lineages with substantial phenotypic data already available are moved into the well-studied category. After discussion within the Early Detection Group, the consensus ranking is made publicly available (6).
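The consensus step can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the group's actual procedure: the rank labels and numeric values stand in for Tables 2 and 3, the category cutoffs are hypothetical (the text indicates that categories are settled by discussion), and the example rankings are invented.

```python
from statistics import mean

# Hypothetical stand-in for Table 3: rank label -> numeric rank (lower = higher priority).
RANK_TO_NUMERIC = {"high": 1, "medium": 2, "low": 3, "not of interest": 4}

def consensus_ranking(rankings):
    """Order lineages by mean numeric rank across subgroups.

    rankings: dict of subgroup name -> dict of lineage -> rank label.
    A lineage that a subgroup did not rank is skipped for that subgroup.
    Returns a list of (lineage, mean_rank), lowest mean rank first.
    """
    numeric = {}
    for ranks in rankings.values():
        for lineage, label in ranks.items():
            numeric.setdefault(lineage, []).append(RANK_TO_NUMERIC[label])
    means = {lineage: mean(values) for lineage, values in numeric.items()}
    # Sort by mean rank; break ties by name so the output is deterministic.
    return sorted(means.items(), key=lambda item: (item[1], item[0]))

def assign_category(mean_rank, cutoffs=(1.5, 2.5, 3.5)):
    """Map a mean rank to category 1-4 using hypothetical cutoffs."""
    for category, cutoff in enumerate(cutoffs, start=1):
        if mean_rank <= cutoff:
            return category
    return len(cutoffs) + 1

# Invented example rankings from three hypothetical subgroups.
example = {
    "team_a": {"XBB": "high", "BQ.1.1": "medium", "DS.1": "low"},
    "team_b": {"XBB": "high", "BQ.1.1": "high", "DS.1": "high"},
    "team_c": {"XBB": "medium", "BQ.1.1": "medium"},
}
ordered = consensus_ranking(example)
categories = {lineage: assign_category(m) for lineage, m in ordered}
```

Averaging over only the subgroups that ranked a lineage is one possible design choice; how unranked lineages are handled in practice is not specified in the text.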
Emergency Updates
Lineages that come to the group’s attention between prioritizations can be added outside the normal monthly timeline. They are added at a rank determined directly by discussion among the subgroups.
Individual Ranking Methods
Ranking methods are presented in general terms because ranking is often performed by hand rather than algorithmically. Manual ranking is performed because of the need to weigh numerous factors, including which types of variation are most interesting to characterize at any particular time.
The 9 Teams
The Los Alamos National Laboratory (LANL) team, led by author B.K., developed tools for early detection of spike variants (https://cov.lanl.gov) (7), based on emergent mutational patterns in sequences regularly updated from GISAID (https://www.gisaid.org). Because spike protein variants are often found in multiple Pango lineages (sometimes because of recombination [8]), and because Pango lineages sometimes include very diverse forms of spike protein, variant dynamics and regional frequency calculations are based on spike sequence rather than Pango lineage. Variant dynamics and global spread are tracked at multiple geographic levels, and variants are deemed to be of interest if relative sampling frequency is substantially increasing in multiple locations or if they are more highly mutated relative to past variants and increasing in >1 geographic location. The relative importance of mutational patterns is weighed by substantial transitions in variant frequencies over time at different geographic levels, structural considerations, levels of convergence, and literature-based assessments of relevance to neutralizing antibody sensitivity and infectivity. The LANL team maintains a folder in the download section on GISAID that contains the information used for suggesting new lineages and provides full-length genome and spike alignments for representative forms of circulating variants.
The University of California Riverside School of Medicine team, led by author A.G., uses relative growth in the prevalence of specific substitutions and deletions/insertions, which are mapped onto Pango lineages or used to define new ones, identifying the fastest growing variants and mutation combinations within these lineages. Although the main focus has been on spike mutations, nonspike mutations are also tracked. These criteria are automated and available from a regularly updated website (https://coronavirus3d.org). For the final variant and subvariant ranking, additional criteria are included: simultaneous growth in >2 distinct geographic locations, mutations in different regions of the spike (N terminal domain [NTD], receptor-binding domain [RBD], furin cleavage region, S2 spike domain), their potential effect on protein structure (by modeling), and the reemergence of individual mutations in novel combinations.
The Bacterial and Viral Bioinformatics Resource Center team, led by author R.S. at the J. Craig Venter Institute, uses a custom heuristic algorithm that combines sequence prevalence metrics with functional impact predictions, focusing on sequence features of concern with the spike protein. To identify concerning upward trends and their global spread for each residue, variant and lineage sequence prevalence and fold growth are calculated month to month in all countries. Substitutions are given a functional impact score based on whether the substitution has been demonstrated to cause a substantial decrease in polyclonal or monoclonal antibody binding, an increase in angiotensin-converting enzyme 2 (ACE2) binding, or if the position is located within the NTD supersite or the furin cleavage site (9). The sequence prevalence and functional impact scores are combined to generate an Emergence Score for the ranking of emerging lineages. Although the main focus has been on spike mutations, nonspike mutations are also tracked. A detailed description of the method has been published (10).
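As a rough illustration of how sequence prevalence and functional impact might be combined, consider the Python sketch below. It is not the published Emergence Score (see reference 10 for that method): the way the two components are combined, the function names, the flagged positions, and the counts are all assumptions made for the example.

```python
def prevalence(lineage_count, total_count):
    """Fraction of sequences in one country and month assigned to the lineage."""
    return lineage_count / total_count if total_count else 0.0

def fold_growth(previous, current, floor=1e-6):
    """Month-to-month fold change in prevalence; the floor avoids division by zero."""
    return current / max(previous, floor)

def position_of(substitution):
    """Extract the residue position from a label such as 'N501Y'."""
    return int(substitution[1:-1])

def emergence_score(counts_by_country, substitutions, flagged_positions):
    """Hypothetical combination: mean fold growth scaled by (1 + impact count).

    counts_by_country: country -> ((lineage_prev, total_prev), (lineage_cur, total_cur))
    flagged_positions: spike positions treated as functionally important
                       (placeholder values in this example).
    """
    growths = []
    for (prev_n, prev_total), (cur_n, cur_total) in counts_by_country.values():
        growths.append(fold_growth(prevalence(prev_n, prev_total),
                                   prevalence(cur_n, cur_total)))
    mean_growth = sum(growths) / len(growths) if growths else 0.0
    impact = sum(1 for s in substitutions if position_of(s) in flagged_positions)
    return mean_growth * (1 + impact)

# Invented counts for two countries and a placeholder set of flagged positions.
counts = {"A": ((10, 1000), (30, 1000)), "B": ((5, 500), (10, 500))}
score = emergence_score(counts, ["N501Y", "A1020S"], flagged_positions={484, 501})
```

In this toy case the prevalence triples in one country and doubles in the other, and one of the two substitutions falls at a flagged position, so the score is 2.5 × 2 = 5.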
The Cambridge University team, led by author S.T., follows sequence prevalence increases over time and geographic spread as well as prevalence increases in defined pre-immunized cohorts. This team prioritizes mutations displaying convergent evolution, focusing on those likely to cause immune escape, by looking at experimentally determined antibody escape (with particular focus on polyclonal serum) and by considering structural reasoning. This information is jointly considered when determining the importance of each mutation. The emphasis is primarily on substitutions in RBD and the mechanism around ACE2 binding and secondarily on NTD and proximity to the furin cleavage site. These substitutions are given higher priority if they are clearly transmitting faster and if they are in a different Barnes class from substitutions previously seen in the same lineages but are given lower priority if they have already been characterized.
The Broad Institute team, led by author J.L., believes, like the University of California-Riverside School of Medicine team, that the accelerated growth of a lineage relative to its peers, across multiple geographic regions, is the single most important marker of lineages of potential concern. The team has developed 2 related approaches for identifying such lineages. The first approach fits a binomial logistic regression to the proportion of each lineage over time in every state. The team then systematically compares the growth rate of every lineage in each state, relative to all other lineages, and identifies lineages that are consistently increasing in multiple states (e.g., as in Earnest et al. [11]). A second related approach fits multinomial logistic regression models across geographic regions (12). This approach is a generalization of the first approach that estimates the relative growth rate of each lineage compared with every other, allowing for more complex and realistic lineage dynamics such as nonmonotonic modeled trajectories. The results of these 2 approaches form the basis for an initial prioritization list that is then discussed internally and brought to NIH SAVE discussions.
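The first approach can be illustrated with a small, self-contained Python sketch. It fits logit(p_t) = a + b·t to lineage counts by Newton-Raphson, where b is the growth rate of the lineage relative to everything else. This is a generic binomial logistic regression written for illustration, not the team's code; the function names and the synthetic counts are assumptions.

```python
import math

def fit_logistic_growth(times, lineage_counts, total_counts, iterations=25):
    """Fit logit(p_t) = a + b * t by Newton-Raphson on the binomial likelihood.

    Returns (a, b); b is the per-unit-time growth rate on the logit scale.
    """
    a, b = 0.0, 0.0
    for _ in range(iterations):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for t, k, n in zip(times, lineage_counts, total_counts):
            p = 1.0 / (1.0 + math.exp(-(a + b * t)))
            w = n * p * (1.0 - p)          # binomial variance weight
            g0 += k - n * p                # gradient with respect to a
            g1 += (k - n * p) * t          # gradient with respect to b
            h00 += w
            h01 += w * t
            h11 += w * t * t
        det = h00 * h11 - h01 * h01
        if det == 0:
            break
        # Solve the 2x2 Newton system H * delta = gradient.
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b

def states_with_growth(slope_by_state, min_slope=0.0):
    """Return the states in which the fitted slope exceeds min_slope."""
    return sorted(s for s, slope in slope_by_state.items() if slope > min_slope)

# Synthetic weekly counts generated from a known curve (a = -3, b = 0.5).
weeks = list(range(10))
totals = [1000] * 10
counts = [round(1000 / (1.0 + math.exp(-(-3.0 + 0.5 * t)))) for t in weeks]
a_hat, b_hat = fit_logistic_growth(weeks, counts, totals)
growing = states_with_growth({"MA": 0.4, "CT": -0.1, "NH": 0.2})
```

Fitting each state separately and then counting the states with a positive slope mirrors the idea of looking for lineages that are consistently increasing in multiple states; the multinomial generalization described above is not shown here.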
The Walter Reed Army Institute of Research team, led by author M.R., has a variant scoring scheme based on increased prevalence and potential effects of mutations in spike. This scoring is primarily performed in a lineage-independent manner; hence, the initial focus is on convergent evolution rather than on lineage tracking, although changing mutation frequencies within lineages are also tracked. Weight scores are given for various characteristics such as fold increase over time, geographic spread, variant growth, and potential effects on antibody recognition. Relevant antibody contact sites in the NTD and RBD are identified by analyzing spike–antibody complex structures deposited in the Protein Data Bank (http://www.wwpdb.org). The identification of contact sites is performed to upweight substitutions at relevant antibody binding sites with recent changes in frequency. Several tools enable tracking of variants of concern and mutations of interest at global, regional, and country levels on a weekly basis by using data from the previous 3 months. An open-build using data from GenBank is available (13).
The Icahn School of Medicine at Mt. Sinai team, led by author H.v.B., has a similar approach to the Walter Reed Army Institute of Research team, ranking variants based on an aggregate score for sequence prevalence increase and genetic changes of concern, but the criteria differ slightly for different genomic regions. A higher weight is given to mutations that are associated with antibody escape or changes in ACE2 affinity (with a focus on NTD and RBD), are associated with higher transmissibility, show evidence of convergent evolution, lie near the furin cleavage site, or fall in enzyme active sites. Moreover, data from surveillance cohorts in the New York, New York, metropolitan area are used to assess lineages associated with breakthrough infections after vaccination. To minimize false positives and increase confidence of early detection, historical data are used to estimate weighting factors and to add or remove criteria.
The Ben Gurion University of the Negev, The National Institute for Biotechnology, in the Negev, Israel, team, led by author T.H., takes an approach based solely on the prediction of potential antibody escape caused by mutations. The team analyzes a large set of solved 3-dimensional antibody structures for spike, curated from the Protein Data Bank. Contact positions for each antibody are extracted on the basis of solved structures. To assess the effects of each single point amino acid mutation on escape from antibody responses, the predicted changes in binding energies (ΔΔG) for each antibody are computed for each specific mutation within its contact footprint, by using FoldX (14). Using the ΔΔG scores, team members compute an antibody escape score for each mutation. Variants are then scored and ranked on the basis of the predicted cumulative effect of their mutations on antibody evasion.
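The scoring pipeline described above can be sketched in Python as follows. The text does not give the formula that turns ΔΔG values into an escape score, so the aggregation used here (summing positive ΔΔG values over antibodies, then summing over a variant's mutations) is an assumption, as are the function names, the placeholder mutation and antibody labels, and the numbers. The ΔΔG values are taken as precomputed inputs; FoldX itself is not called.

```python
def mutation_escape_score(ddg_by_antibody):
    """Sum of positive predicted ddG values over antibodies contacting the site.

    Assumes positive ddG means weakened antibody binding; negative values
    (predicted stabilization) contribute nothing in this sketch.
    """
    return sum(max(ddg, 0.0) for ddg in ddg_by_antibody.values())

def variant_escape_score(mutations, ddg_table):
    """Cumulative score for a variant: sum of its mutations' escape scores.

    Mutations absent from ddg_table (outside every antibody footprint) score 0.
    """
    return sum(mutation_escape_score(ddg_table.get(m, {})) for m in mutations)

def rank_variants(variants, ddg_table):
    """Return (variant, score) pairs sorted from highest to lowest score."""
    scored = [(name, variant_escape_score(muts, ddg_table))
              for name, muts in variants.items()]
    return sorted(scored, key=lambda item: (-item[1], item[0]))

# Placeholder ddG values (kcal/mol) for hypothetical mutations and antibodies.
ddg_table = {
    "mut_A": {"ab1": 2.0, "ab2": 0.5, "ab3": -0.3},
    "mut_B": {"ab2": 1.2},
}
variants = {
    "variant_1": ["mut_A", "mut_B"],
    "variant_2": ["mut_B"],
    "variant_3": ["mut_C"],
}
ranked = rank_variants(variants, ddg_table)
```

A simple sum treats mutations as independent; any interaction between mutations within the same antibody footprint would need a different aggregation.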
The University of Missouri team, led by author M.J., believes that because a minority of prolonged infections give rise to variants containing numerous convergent mutations that arise independently in different locations, these convergent variants forecast those likely to arise in future circulating viruses. These lineages were initially discovered through wastewater sequencing, in which highly divergent SARS-CoV-2 lineages (cryptic lineages) were sporadically identified (15,16). A few similarly advanced lineages have now also been found from long-term COVID-19 patients. The team maintains a database of evolutionarily advanced cryptic lineages and evolutionarily advanced patient lineages and has identified numerous discrete mutations repeatedly appearing in these advanced lineages. Indeed, all prominent amino acid changes in the RBD of the dominant Omicron sublineages were observed repeatedly in evolutionarily advanced lineages before appearing in Omicron. The team systematically evaluates new lineages by comparing combinations of changes in new circulating lineages with those that have appeared frequently in advanced lineages and reports their findings during NIH SAVE discussions.
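One simple way to express this comparison is to count how often each change recurs across the advanced lineages and then check which changes in a new circulating lineage are among the recurrent ones. The Python sketch below does this under stated assumptions: the function names, the recurrence threshold, and the placeholder substitution labels are all hypothetical, and the team's database and evaluation criteria are not described in enough detail to reproduce.

```python
from collections import Counter

def recurrence_counts(advanced_lineages):
    """Count, for each substitution, the number of advanced lineages carrying it."""
    counts = Counter()
    for substitutions in advanced_lineages:
        counts.update(set(substitutions))
    return counts

def recurrent_changes(new_lineage, advanced_lineages, min_lineages=2):
    """Substitutions in new_lineage seen in at least min_lineages advanced lineages."""
    counts = recurrence_counts(advanced_lineages)
    return sorted(s for s in set(new_lineage) if counts[s] >= min_lineages)

# Placeholder data: three hypothetical advanced lineages and one new lineage.
advanced = [
    {"sub_A", "sub_B", "sub_C"},
    {"sub_A", "sub_D"},
    {"sub_B", "sub_A"},
]
new_lineage = {"sub_A", "sub_B", "sub_E"}
shared = recurrent_changes(new_lineage, advanced)
```

Here sub_A appears in three advanced lineages and sub_B in two, so both are flagged, whereas sub_E has not been seen before and is not.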
We display the February 2023 prioritization in priority order and split into functional categories (6) (Figure 1; Appendix 1). Functional groups are based on the region of spike harboring mutations in each lineage (RBD, NTD, or other). Lineages are placed in the many substitutions or recombinants section if they contain many substitutions relative to other circulating variants or were produced by recombination. The split enables experimental groups to quickly identify and compare the lineages most relevant to their focus. The split also alleviates a difficulty of using the consensus approach to prioritization. Different groups use different methods to rank lineages; some focus primarily on epidemiologic data; others focus on whether a lineage contains mutations that are likely to have particular phenotypic effects or that show substantial convergent evolution. Furthermore, groups focus on different regions of the spike protein; because mutations in different regions are likely to have different phenotypic effects, prioritizing between them is difficult. Splitting the prioritization into structural regions makes it easier to compare lineages that have been nominated for similar reasons.
We also compared the rankings provided by each subgroup for the February 2023 prioritization (Figure 2). We found generally good agreement between subgroups, although with some notable differences. For example, the DS.1 lineage is ranked higher by the Ben Gurion University of the Negev team than by other subgroups. This finding is consistent with their ranking method, which focuses on antibody escape, given that DS.1 contains 5 RBD substitutions on top of the BA.2.75 spike sequence. By contrast, it is ranked lower by groups focusing on epidemiology, because of its low observed count.
We also compared the 4 most recent prioritizations, showing the movement of lineages between priority categories (Appendix 2). Category 1 lineages, which typically show consistent growth or contain phenotypically relevant substitutions, are retained in the following month’s prioritization 98% of the time, compared with 67% for category 2, 38% for category 3, and 28% for category 4 (Appendix 3 Figure 2). Indeed, on only 2 occasions have lineages in categories 3 or 4 later entered the ongoing or well-studied categories: AY.1 and AY.2 did so during July–November 2021. Lineages rarely move from a lower to a higher prioritization category (Appendix 3 Figure 2), suggesting that high priority lineages are typically judged to need immediate characterization. Similarly, lineages that the group has been aware of without considering them highly important are unlikely to become substantially more important over time, suggesting that newer lineages should perhaps be considered for characterization.
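Retention percentages of this kind can be computed from successive prioritization lists. The Python sketch below is illustrative only: it assumes that "retained" means the lineage appears anywhere in the following month's list, and the function name and toy data are invented.

```python
from collections import defaultdict

def retention_by_category(prioritizations):
    """Fraction of lineages in each category that reappear the following month.

    prioritizations: list of dicts (lineage -> category) in chronological order.
    The final prioritization has no following month and is used only as a target.
    """
    seen = defaultdict(int)
    kept = defaultdict(int)
    for current, following in zip(prioritizations, prioritizations[1:]):
        for lineage, category in current.items():
            seen[category] += 1
            if lineage in following:
                kept[category] += 1
    return {category: kept[category] / seen[category] for category in seen}

# Toy data: three consecutive monthly prioritizations.
months = [
    {"lin_A": 1, "lin_B": 2, "lin_C": 3},
    {"lin_A": 1, "lin_B": 1},
    {"lin_A": 1},
]
retention = retention_by_category(months)
```

In the toy data, category 1 lineages are retained in 2 of 3 month-to-month transitions, the single category 2 lineage is retained, and the single category 3 lineage is dropped.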
Comparing the prioritization categories to epidemiologic data (Figure 3; Appendix 3 Figure 3) shows that lineages are generally identified before or shortly after they reach 0.1% of global circulation. However, the prioritizations do not attempt to predict or recapitulate the epidemiology of lineages but attempt rather to identify relevant lineages to study, weighing the probability that a variant circulates widely, the risk it would pose should it do so, and our ability to estimate its phenotype based on existing data. Lineages can therefore circulate at moderately high frequency without being highly prioritized because their constituent substitutions are less likely to have relevant phenotypic effects (e.g., BA.5 + A1020S) or because the lineage’s high frequency seems to be caused by founder effects.
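The timing comparison can be expressed as the gap between the date a lineage was first prioritized and the date its global frequency first reached 0.1%. The Python sketch below assumes weekly frequency estimates keyed by date; the function names and all values are invented for illustration.

```python
from datetime import date

def first_crossing(weekly_frequency, threshold=0.001):
    """Earliest date at which frequency is at or above threshold, else None."""
    for day in sorted(weekly_frequency):
        if weekly_frequency[day] >= threshold:
            return day
    return None

def lead_time_days(first_prioritized, weekly_frequency, threshold=0.001):
    """Days between first prioritization and threshold crossing.

    Positive: prioritized before the lineage reached the threshold.
    Negative: prioritized after. None: the threshold was never reached.
    """
    crossing = first_crossing(weekly_frequency, threshold)
    if crossing is None:
        return None
    return (crossing - first_prioritized).days

# Invented weekly global frequencies for a hypothetical lineage.
frequency = {
    date(2022, 10, 3): 0.0002,
    date(2022, 10, 10): 0.0008,
    date(2022, 10, 17): 0.0015,
}
lead = lead_time_days(date(2022, 10, 7), frequency)
```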
Producing prioritizations has been successful in 2 respects. First, and of primary concern, the prioritizations have provided experimental groups within NIH SAVE easy access to regularly updated information on the recent evolution and epidemiology of SARS-CoV-2, used to guide phenotypic investigations. Beyond simply giving access to raw data, which are now accessible through numerous online epidemiologic tools, the prioritizations are synthesized with close consideration of the interests of the experimental groups and weigh numerous factors in assessing the relevance of circulating lineages.
Second, most major variants of SARS-CoV-2 have been rapidly identified by the SAVE Early Detection Group and include Delta and Lambda (in the first prioritization in April 2021), Mu (in July 2021), and Omicron as an emergency update (between the November and December 2021 prioritizations). Indeed, the usefulness of the rankings has increased over time; Omicron lineages BA.2 and BA.5 have diversified into a wide range of sublineages, which have been epidemiologically successful (showing growth in numerous locations) and often show considerable immune escape, making them key lineages to track for vaccine effectiveness purposes (17). Those lineages have been rapidly identified and tracked in the prioritizations, including BF.7 (June 2022), BA.2.75 (July 2022), BQ.1.1 (October 2022), XBB (October 2022), and BN.1 (November 2022).
The prioritizations have been most useful to experimental collaborators in this period of high diversity, helping to identify the adaptive and threatening variation that is most pressing for study. By contrast, the variation available for identification and prioritization in earlier periods of the pandemic, such as during the circulation of Delta, was less epidemiologically relevant. This period of rapid diversification has also made clear the value of the consensus approach; with each subgroup routinely monitoring the evolution of the virus from its own perspective, a well-rounded assessment of new variants can be rapidly reached.
This hands-on approach does come with limitations. It is labor intensive, requiring continued active participation from each research group; it is not an automated system because we do not believe that there is an automated solution for monitoring the ever-changing challenges presented by evolution of SARS-CoV-2. This manual approach also means that the prioritization typically only works at a monthly resolution, although lineages are sometimes added to the prioritization in between monthly updates if deemed necessary.
Of note, the 3 major lineage replacements (Alpha, Delta, Omicron) each emerged while another lineage dominated global circulation. During the period of Alpha and Delta circulation, focus was placed on subvariants of those lineages containing a small number of additional substitutions. Although this focus was in part to enable experimental work to enhance scientific understanding of SARS-CoV-2 phenotypic variation, we suspected that the next dominant strain might be produced by refinement of the current dominant strain, through addition of further fitness-enhancing substitutions. Our suspicion was based on 2 reasons: first, fitness-enhancing substitutions are likely to appear first in the dominant strain because of its greater circulation; and second, the rapid expansion of Alpha and then Delta demonstrated fitness advantages over the previously circulating strain. Substitutions arising in other strains would need to first overcome this fitness deficit and would be outcompeted by a sublineage of the dominant strain containing the same substitution, disregarding potential epistatic effects.
In reality, emergence of Delta and Omicron has shown that the next dominant strain is not always produced by incremental refinement of the current dominant strain. Whether SARS-CoV-2 will continue to evolve in this fashion is unclear, particularly given the success of numerous more incremental subvariants of Omicron BA.2 and BA.5, which more closely follow the pattern we were preparing for during circulation of Alpha and Delta. Regardless of which of those patterns governs SARS-CoV-2 evolution in the long term, continued surveillance is needed both to identify variation within the currently dominant lineage and to detect highly divergent sequences. Such sequences may be particularly likely to originate from countries in which a smaller proportion of cases are sequenced and which may therefore harbor substantial unsampled variation. For both Delta and Omicron, the emergent lineage was first identified in a country that performed minimal sequencing (Omicron in Botswana) or minimal sequencing relative to circulation (Delta in India).
The prioritizations made by the 9 laboratories that form the NIH Early Detection team have been a valuable resource for helping experimental groups keep up to date with the rapid evolution of SARS-CoV-2. The NIH Early Detection team will continue to refine the methods used by individual subgroups to prioritize variants and the way in which this information is presented to experimental collaborators.
Mr. Turner is a PhD candidate at the Center for Pathogen Evolution at the Department of Zoology, University of Cambridge. His primary research interest is the prediction of antigenic evolution in influenza and SARS-CoV-2.
Acknowledgments
We gratefully acknowledge all data contributors (i.e., the authors and their originating laboratories responsible for obtaining the specimens, and their submitting laboratories) for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which this research is based. We acknowledge Fritz Obermeyer and Martin Jankowiak for helpful discussions and advice.
E.L-G., B.M., D.J.S., and S.T. are supported by the NIH NIAID Centers of Excellence for Influenza Research and Response (CEIRR) contract 75N93021C00014 as part of the SAVE program; B.M. is funded in part by the German Ministry of research under project codes DZIF, MolTrax, and PREPARED; M.J. is funded in part with federal funds from the NIH National Institute of Drug Abuse under contract 1U01DA053893-01; D.O’C. is funded by Centers for Disease Control and Prevention (CDC) contract 75D30121C11060, CDC contract 75D30122C15355, and State of Wisconsin Department of Health Services project 435100-A22-ELCProjE-01; W.M.F., B.K., J.T., and H.Y. are funded by the Laboratory Directed Research and Development program of LANL under project 20220660ER, and through the SAVE program NIH AAI22018-001; A.M.N., R.H.S., M.S., Z.S.W., and Y.Z. are supported with federal funds from the NIH NIAID Bacterial and Viral Bioinformatics Resource Center contract HHS75N93019C00076; E.G. was supported in part by the Division of Intramural Research of the NIH NIAID; B.L.D. and M.R. are supported by a cooperative agreement between The Henry M. Jackson Foundation for the Advancement of Military Medicine and the US Department of the Army (W81XWH-18-2-0040); J.E.L. is supported by CDC BAA 75D30120C09605 and U19AI110818; A.A., A.G., L.J., and M.S. are funded by HHSN272201700060C; H.v.B., A.S.G.-R., and Z.K. are supported by the NIH NIAID CEIRR contract 75N93021C00014 and NIAID contract HHSN272201400008C; L.C.-L. and T.H. are funded in part by the NIH NIAID CEIRR under contract 75N9302100016, St Jude Children’s Research Hospital; M.S. is supported by NIH NIAID Contract 75N93019C00076.
The views expressed are those of the authors and should not be construed to represent the positions of the US Army, the Department of Defense, or the Department of Health and Human Services.
References
- World Health Organization. Tracking SARS-CoV-2 variants [cited 2022 Jan 19]. https://www.who.int/en/activities/tracking-SARS-CoV-2-variants
- European Centre for Disease Prevention and Control. SARS-CoV-2 variants of concern as of 06 January 2022 [cited 2021 Jan 6]. https://www.ecdc.europa.eu/en/covid-19/variants-concern
- Public Health England. SARS-CoV-2 variants of concern and variants under investigation in England: Technical briefing 1 [cited 2022 Oct 26]. https://assets.publishing.service.gov.uk/government/uploads/system/uploads/attachment_data/file/959438/Technical_Briefing_VOC_SH_NJL2_SH2.pdf
- Rambaut A, Holmes EC, O’Toole Á, Hill V, McCrone JT, Ruis C, et al. A dynamic nomenclature proposal for SARS-CoV-2 lineages to assist genomic epidemiology. Nat Microbiol. 2020;5:1403–7.
- Shu Y, McCauley J. GISAID: Global initiative on sharing all influenza data - from vision to reality. Euro Surveill. 2017;22:30494.
- National Institutes of Health. NIH SAVE Early Detection Prioritization Summary [cited 2023 Feb 10]. https://docs.google.com/spreadsheets/d/167uJP9LfJN07410sWaMSKU1Se-4XX687j8IgVX4MV_w/edit#gid=1166031460
- Korber B, Fischer WM, Gnanakaran S, Yoon H, Theiler J, Abfalterer W, et al.; Sheffield COVID-19 Genomics Group. Tracking changes in SARS-CoV-2 spike: evidence that D614G increases infectivity of the COVID-19 virus. Cell. 2020;182:812–827.e19.
- Fischer W, Giorgi EE, Chakraborty S, Nguyen K, Bhattacharya T, Theiler J, et al.; Network for Genomic Surveillance in South Africa (NGS-SA). HIV-1 and SARS-CoV-2: Patterns in the evolution of two pandemic pathogens. Cell Host Microbe. 2021;29:1093–110.
- Bacterial and Viral Bioinformatics Resource Center. SARS-CoV-2 real-time tracking and early warning system for variants and lineages of concern (VoCs/LoCs) [cited 2023 Feb 10]. https://www.bv-brc.org/view/VariantLineage/#view_tab=overview
- Wallace ZS, Davis J, Niewiadomska AM, Olson RD, Shukla M, Stevens R, et al. Early detection of emerging SARS-CoV-2 variants of interest for experimental evaluation. Front Bioinform. 2022;2:1020189.
- Earnest R, Uddin R, Matluk N, Renzette N, Turbett SE, Siddle KJ, et al.; New England Variant Investigation Team. Comparative transmissibility of SARS-CoV-2 variants Delta and Alpha in New England, USA. Cell Rep Med. 2022;3:100583.
- Obermeyer F, Jankowiak M, Barkas N, Schaffner SF, Pyle JD, Yurkovetskiy L, et al. Analysis of 6.4 million SARS-CoV-2 genomes identifies mutations associated with fitness. Science. 2022;376:1327–32.
- US Military HIV Research Program. SARS-CoV-2 sequencing tracking [cited 2022 Oct 26]. https://www.hivresearch.org/SARS-CoV-2-sequence-tracking
- Schymkowitz J, Borg J, Stricher F, Nys R, Rousseau F, Serrano L. The FoldX web server: an online force field. Nucleic Acids Res. 2005 Jul 1;33(Web Server issue):W382–8.
- Gregory DA, Trujillo M, Rushford C, Flury A, Kannoly S, San KM, et al. Genetic diversity and evolutionary convergence of cryptic SARS-CoV-2 lineages detected via wastewater sequencing. PLoS Pathog. 2022;18:e1010636.
- Smyth DS, Trujillo M, Gregory DA, Cheung K, Gao A, Graham M, et al. Tracking cryptic SARS-CoV-2 lineages detected in NYC wastewater. Nat Commun. 2022;13:635.
- Cao Y, Jian F, Wang J, Yu Y, Song W, Yisimayi A, et al. Imprinted SARS-CoV-2 humoral immunity induces convergent Omicron RBD evolution. Nature. 2023;614:521–9.
Original Publication Date: April 13, 2023
Address for correspondence:
Derek Smith, Centre for Pathogen Evolution, Department of Zoology, University of Cambridge, Downing Street, Cambridge CB2 3EJ, UK