Key Points
-
Millions of new genetic variants have been discovered through sequencing studies and deposited in human genomic databases. Many of these, particularly rare variants, have been annotated as deleterious.
-
Recent research has examined whether different human populations have a varying burden of deleterious alleles — a concept referred to as mutation load.
-
Several recent studies have suggested that there is no significant difference among populations in the estimated number of deleterious alleles per individual. However, these analyses are sensitive to annotation prediction algorithms and summary statistics, leading to different, sometimes contradictory, interpretations.
-
These calculations also involved a number of simplifying assumptions, including additive allelic effects, no epistasis and simple distributions of selection coefficients across deleterious variants and across populations.
-
Following classical models of mutation load, we consider genotype frequencies in order to highlight how mutation load can change under different models of dominance.
-
Additionally, genetic drift has shifted the allele frequency spectrum of deleterious variants such that the genomes of individuals in Out-of-Africa populations carry more common deleterious variants.
-
The frequency distribution of deleterious variants has implications for characterizing the genetic architecture of diseases across populations.
Abstract
Next-generation sequencing technology has facilitated the discovery of millions of genetic variants in human genomes. A sizeable fraction of these variants are predicted to be deleterious. Here, we review the pattern of deleterious alleles as ascertained in genome sequencing data sets and ask whether human populations differ in their predicted burden of deleterious alleles — a phenomenon known as mutation load. We discuss three demographic models that are predicted to affect mutation load and relate these models to the evidence (or the lack thereof) for variation in the efficacy of purifying selection in diverse human genomes. We also emphasize why accurate estimation of mutation load depends on assumptions regarding the distribution of dominance and selection coefficients — quantities that remain poorly characterized for current genomic data sets.
This is a preview of subscription content, access via your institution
Access options
Subscribe to this journal
Receive 12 print issues and online access
$209.00 per year
only $17.42 per issue
Buy this article
- Purchase on SpringerLink
- Instant access to full article PDF
Prices may be subject to local taxes which are calculated during checkout
Similar content being viewed by others
References
Ohta, T. & Gillespie, J. Development of neutral and nearly neutral theories. Theor. Popul. Biol. 49, 128–142 (1996). This paper reviews the development of the neutral and nearly neutral theories by key contributors to the field of population genetics.
Kimura, M., Maruyama, T. & Crow, J. F. The mutation load in small populations. Genetics 48, 1303–1312 (1963). This is a foundational paper on the effect of drift on mutation load in finite populations, demonstrating that mildly deleterious alleles can contribute more to load than strongly deleterious alleles.
King, J. L. & Jukes, T. H. Non-Darwinian evolution. Science 164, 788–798 (1969).
Marth, G. T., Czabarka, E., Murvai, J. & Sherry, S. T. The allele frequency spectrum in genome-wide human variation data reveals signals of differential demographic history in three large world populations. Genetics 166, 351–372 (2004).
Laval, G., Patin, E., Barreiro, L. B. & Quintana-Murci, L. Formulating a historical and demographic model of recent human evolution based on resequencing data from noncoding regions. PLoS ONE 5, e10284 (2010).
Gronau, I., Hubisz, M. J., Gulko, B., Danko, C. G. & Siepel, A. Bayesian inference of ancient human demography from individual genome sequences. Nature Genet. 43, 1031–1034 (2011).
Veeramah, K. R. et al. An early divergence of KhoeSan ancestors from those of other modern humans is supported by an ABC-based analysis of autosomal resequencing data. Mol. Bio Evol. 29, 617–630 (2012).
Kimura, M. Evolutionary rate at the molecular level. Nature 217, 624–626 (1968).
Kimura, M. The Neutral Theory of Molecular Evolution (Cambridge Univ. Press, 1985).
Ohta, T. Slightly deleterious mutant substitutions in evolution. Nature 246, 96–98 (1973).
Crow, J. F. Genetic loads and the cost of natural selection. Math. Top. Popul. Genet. 1, 128–177 (1970).
Agrawal, A. F. & Whitlock, M. C. Mutation load: the fitness of individuals in populations where deleterious alleles are abundant. Annu. Rev. Ecol. Evol. Syst. 43, 115–135 (2012).
Crow, J. F. 2. The concept of genetic load: a reply. Am. J. Hum. Genet. 15, 310–315 (1963).
Charlesworth, D. & Willis, J. H. Fundamental concepts in genetics: the genetics of inbreeding depression. Nature Rev. Genet. 10, 783–796 (2009). This is a broad review of inbreeding depression and heterosis, fitness phenomena that are caused by the presence of deleterious recessive mutations in populations.
Li, J. Z. et al. Worldwide human relationships inferred from genome-wide patterns of variation. Science 319, 1100–1104 (2008).
Henn, B. M., Cavalli-Sforza, L. L. & Feldman, M. W. The great human expansion. Proc. Natl Acad. Sci. USA 109, 17758–17764 (2012).
Agrawal, A. F. & Whitlock, M. C. Inferences about the distribution of dominance drawn from yeast gene knockout data. Genetics 187, 553–566 (2011). The distribution of dominance coefficients is directly measured from yeast knockout experiments, showing that large-effect mutations tend to be more recessive than weak-effect mutations.
Mukai, T., Chigusa, S. I., Mettler, L. E. & Crow, J. F. Mutation rate and dominance of genes affecting viability in Drosophila melanogaster. Genetics 72, 335–355 (1972).
Houle, D., Hughes, K. A., Assimacopoulos, S. & Charlesworth, B. The effects of spontaneous mutation on quantitative traits. II. Dominance of mutations with effects on life-history traits. Genet. Res. 70, 27–34 (1997).
Manna, F., Martin, G. & Lenormand, T. Fitness landscapes: an alternative theory for the dominance of mutation. Genetics 189, 923–937 (2011).
Morton, N. E., Crow, J. F. & Muller, H. J. An estimate of the mutational damage in man from data on consanguienous marriages. Proc. Natl Acad. Sci. USA 42, 855–863 (1956). This is among the earliest work to empirically measure the mutation load in humans by considering the reduction in fitness due to recessive mutations in consanguineous unions.
Reich, D. E. & Lander, E. S. On the allelic spectrum of human disease. Trends Genet. 17, 502–510 (2001).
Bittles, A. H. & Black, M. L. Consanguinity, human evolution, and complex diseases. Proc. Natl Acad. Sci. USA 107, 1779–1786 (2010).
Szpiech, Z. A. et al. Long runs of homozygosity are enriched for deleterious variation. Am. J. Hum. Genet. 93, 90–102 (2013).
McQuillan, R. et al. Evidence of inbreeding depression on human height. PLoS Genet. 8, e1002655 (2012).
Tabor, H. K. et al. Pathogenic variants for Mendelian and complex traits in exomes of 6,517 European and African Americans: implications for the return of incidental results. Am. J. Hum. Genet. 95, 183–193 (2014). Based on analysis of the exome sequences of >6,500 individuals, this study shows that nearly 45% of individuals carry a known variant associated with severe Mendelian diseases.
Xue, Y. et al. Deleterious- and disease-allele prevalence in healthy individuals: insights from current predictions, mutation databases, and population-scale resequencing. Am. J. Hum. Genet. 91, 1022–1032 (2012).
Li, Y. et al. Resequencing of 200 human exomes identifies an excess of low-frequency non-synonymous coding variants. Nature Genet. 42, 969–972 (2010).
Erickson, R. P. & Mitchison, N. A. The low frequency of recessive disease: insights from ENU mutagenesis, severity of disease phenotype, GWAS associations, and demography: an analytical review. J. Appl. Genet. 55, 319–327 (2014).
De la Cruz, O. & Raska, P. Population structure at different minor allele frequency levels. BMC Proc. 8, S55 (2014).
Henn, B. M., Gravel, S., Moreno-Estrada, A., Acevedo-Acevedo, S. & Bustamante, C. D. Fine-scale population structure and the era of next-generation sequencing. Hum. Mol. Genet. 19, R221–R226 (2010).
Mathieson, I. & McVean, G. Demography and the age of rare variants. PLoS Genet. 10, e1004528 (2014).
Deshpande, O., Batzoglou, S., Feldman, M. W. & Luca Cavalli-Sforza, L. A serial founder effect model for human settlement out of Africa. Proc. Biol. Sci. 276, 291–300 (2009).
1000 Genomes Project Consortium et al. An integrated map of genetic variation from 1,092 human genomes. Nature 490, 56–65 (2013).
DeGiorgio, M., Jakobsson, M. & Rosenberg, N. A. Explaining worldwide patterns of human genetic variation using a coalescent-based serial founder model of migration outward from Africa. Proc. Natl Acad. Sci. USA 106, 16057–16062 (2009).
Nelson, M. R. et al. An abundance of rare functional variants in 202 drug target genes sequenced in 14,002 people. Science 337, 100–104 (2012).
Tennessen, J. A. et al. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science 337, 64–69 (2012).
Goode, D. L. et al. Evolutionary constraint facilitates interpretation of genetic variation in resequenced human genomes. Genome Res. 20, 301–310 (2010).
MacArthur, D. G. et al. A systematic survey of loss-of-function variants in human protein-coding genes. Science 335, 823–828 (2012).
Pritchard, J. K. Are rare variants responsible for susceptibility to complex diseases? Am. J. Hum. Genet. 69, 124–137 (2001).
Agarwala, V., Flannick, J., Sunyaev, S., GoT2D Consortium & Altshuler, D. Evaluating empirical bounds on complex disease genetic architecture. Nature Genet. 45, 1418–1427 (2013).
Gibson, G. Rare and common variants: twenty arguments. Nature Rev. Genet. 13, 135–145 (2012).
Maher, M. C., Uricchio, L. H., Torgerson, D. G. & Hernandez, R. D. Population genetics of rare variants and complex diseases. Hum. Hered. 74, 118–128 (2012).
Klopfstein, S. The fate of mutations surfing on the wave of a range expansion. Mol. Bio. Evol. 23, 482–490 (2005).
Marth, G. T. et al. The functional spectrum of low-frequency coding variation. Genome Biol. 12, R84 (2011).
Keinan, A. & Clark, A. G. Recent explosive human population growth has resulted in an excess of rare genetic variants. Science 336, 740–743 (2012).
Boyko, A. R. et al. Assessing the evolutionary impact of amino acid mutations in the human genome. PLoS Genet. 4, e1000083 (2008). This paper estimates selection coefficients for alleles with different predicted deleterious effects in humans and includes a discussion of methods to infer the DFE via site frequency spectra.
Lohmueller, K. E. et al. Proportionally more deleterious genetic variation in European than in African populations. Nature 451, 994–997 (2008). This is a formative paper considering the proportion of deleterious mutations in European-Americans compared to African-Americans based on analysis of an early genome sequencing data set. The higher proportion of deleterious variants in European-Americans was ascribed to increased genetic drift during the Out-of-Africa bottleneck.
Simons, Y. B., Turchin, M. C., Pritchard, J. K. & Sella, G. The deleterious mutation load is insensitive to recent population history. Nature Genet. 46, 220–224 (2014). This paper challenges the earlier studies (for example, reference 48) by demonstrating, via simulation, that the average number of deleterious mutations per individual under an additive model should be the same across populations for different human demographic histories.
Casals, F. et al. Whole-exome sequencing reveals a rapid change in the frequency of rare functional variants in a founding population of humans. PLoS Genet. 9, e1003815 (2013).
Do, R. et al. No evidence that selection has been less effective at removing deleterious mutations in Europeans than in Africans. Nature Genet. 47, 126–131 (2015).
Fu, W. et al. Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants. Nature 493, 216–220 (2013).
Fu, W., Gittelman, R. M., Bamshad, M. J. & Akey, J. M. Characteristics of neutral and deleterious protein-coding variation among individuals and populations. Am. J. Hum. Genet. 95, 421–436 (2014). This paper shows that European-American individuals carry slightly more deleterious derived alleles in their genome sequences, on average, than African-Americans under a conservation-based framework to predict variant function; this is consistent with Out-of-Africa bottleneck simulations.
Gravel, S. When is selection effective? bioRXiv http://dx.doi.org/10.1101/010934 (2014).
Lim, E. T. et al. Distribution and medical impact of loss-of-function variants in the Finnish founder population. PLoS Genet. 10, e1004494 (2014).
Sajantila, A. et al. Paternal and maternal DNA lineages reveal a bottleneck in the founding of the Finnish population. Proc. Natl Acad. Sci. USA 93, 12035–12039 (1996).
Gravel, S. et al. Demographic history and rare allele sharing among human populations. Proc. Natl Acad. Sci. USA 108, 11983–11988 (2011).
Li, H. & Durbin, R. Inference of human population history from individual whole-genome sequences. Nature 475, 493–496 (2011).
Meyer, M. et al. A high-coverage genome sequence from an archaic Denisovan individual. Science 338, 222–226 (2012).
Gutenkunst, R. N., Hernandez, R. D., Williamson, S. H. & Bustamante, C. D. Inferring the joint demographic history of multiple populations from multidimensional SNP frequency data. PLoS Genet. 5, e1000695 (2009).
Henn, B. M. et al. Hunter-gatherer genomic diversity suggests a southern African origin for modern humans. Proc. Natl Acad. Sci. 108, 5154–5162 (2011).
Lohmueller, K. E. The impact of population demography and selection on the genetic architecture of complex traits. PLoS Genet. 10, e1004379 (2014). Reprising his earlier work in reference 48, this paper focuses on patterns of deleterious variants over time, given different demographic scenarios of expansion, bottleneck and combinations of demographic events.
Ramachandran, S. et al. Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. Proc. Natl Acad. Sci. USA 102, 15942–15947 (2005).
Sousa, V., Peischl, S. & Excoffier, L. Impact of range expansions on current human genomic diversity. Curr. Opin. Genet. Dev. 29, 22–30 (2014).
Moreau, C. et al. Deep human genealogies reveal a selective advantage to be on an expanding wave front. Science 334, 1148–1150 (2011).
Peischl, S., Dupanloup, I., Kirkpatrick, M. & Excoffier, L. On the accumulation of deleterious mutations during range expansions. Mol. Ecol. 22, 5972–5982 (2013). This is a complex simulation study showing that range expansion can result in expansion load from deleterious mutations that rise to a high frequency on a geographical wave front.
Flaxman, S. M. Surfing downhill: when should population range expansion be characterized by reductions in fitness? Mol. Ecol. 22, 5963–5965 (2013).
Coventry, A. et al. Deep resequencing reveals excess rare recent variants consistent with explosive population growth. Nature Commun. 1, 131–136 (2010).
Gignoux, C. R., Henn, B. M. & Mountain, J. L. Rapid, global demographic expansions after the origins of agriculture. Proc. Natl Acad. Sci. USA 108, 6044–6049 (2011).
Zheng, H.-X., Yan, S., Qin, Z.-D. & Jin, L. MtDNA analysis of global populations support that major population expansions began before Neolithic time. Sci. Rep. 2, 745 (2012).
Forster, P. Ice ages and the mitochondrial DNA chronology of human dispersals: a review. Phil. Trans. R. Soc. Lond. B. 359, 255–264 (2004).
Gazave, E., Chang, D., Clark, A. G. & Keinan, A. Population growth inflates the per-individual number of deleterious mutations and reduces their mean effect. Genetics 195, 969–978 (2013).
Kamberov, Y. G. et al. Modeling recent human evolution in mice by expression of a selected EDAR variant. Cell 152, 691–702 (2013).
Hernandez, R. D. et al. Classic selective sweeps were rare in recent human evolution. Science 331, 920–924 (2011).
Moschovis, P. P. et al. Childhood anemia at high altitude: risk factors for poor outcomes in severe pneumonia. Pediatrics 132, e1156–e1162 (2013).
Whitlock, M. C. & Bourguet, D. Factors affecting the genetic load in Drosophila: synergistic epistasis and correlations among fitness components. Evolution 54, 1654–1660 (2000).
Fry, J. D. On the rate and linearity of viability declines in Drosophila mutation-accumulation experiments: genomic mutation rates and synergistic epistasis revisited. Genetics 166, 797–806 (2004).
Zuk, O., Hechter, E., Sunyaev, S. R. & Lander, E. S. The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl Acad. Sci. USA 109, 1193–1198 (2012).
Arbiza, L. et al. Genome-wide inference of natural selection on human transcription factor binding sites. Nature Genet. 45, 723–729 (2013).
Lohmueller, K. E. The distribution of deleterious genetic variation in human populations. Curr. Opin. Genet. Dev. 29, 139–146 (2014).
Acknowledgements
The authors thank K. Lohmueller, S. Peischl and L. Excoffier for discussion regarding these topics.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Related links
FURTHER INFORMATION
Supplementary information
Supplementary information S1 (box)
Variant Annotation Algorithms (PDF 171 kb)
Supplementary information S2 (figure)
Demographic history based on the site frequency spectrum and sharing of rare alleles. (PDF 249 kb)
Supplementary information S3 (figure)
Allele sharing versus allele frequency among European populations. (PDF 238 kb)
Glossary
- Genetic load
-
Reduction in population fitness compared to a theoretical 'perfectly adapted' genotype. This reduction can be caused by the constant influx of new, deleterious mutations (the mutation load), but the genetic load also encompasses reduction in fitness caused by other phenomena, such as inbreeding and changing environment.
- Neutral theory
-
A theory stating that the variation observed within and between species is largely determined by neutral mutation and genetic drift, and not by natural selection. Neutral theory became the basis for many additional population genetic models.
- Substitution load
-
The difference between optimal fitness and mean fitness in a population undergoing a selective sweep. A locus undergoing a selective sweep will result in some individuals with lower fitness. If multiple loci are under adaptation, the number of individuals who will not reproduce in a generation becomes too large to realistically maintain a stable population. This substitution load puts a limit on the rate of adaptation and is sometimes referred to as Haldane's cost of selection.
- Nearly neutral mutations
-
Variants of relatively weak selective effects that can be accounted for by an extension of neutral theory. Mutations that are slightly deleterious or slightly beneficial will behave as neutral depending on the relationship between population size and the selection coefficient. Because they can reach high frequency, nearly neutral variants can have a large impact on the genetic load, and their evolution is sensitive to fluctuations in population size.
- Inbreeding load
-
Reduction in fitness caused by an excess of recessive homozygotes following inbreeding within a population. The inbreeding load measures the difference between the average fitness of individuals in a population and the fitness of a hypothetical randomly mating population with the same allele frequencies.
- Mutation–selection balance
-
An equilibrium model in which the frequency of an allele is determined by recurrent mutation and the selection coefficient against the allele.
- Consanguineous union
-
In clinical genetics, a union between two individuals who are related as second cousins or closer, with the inbreeding coefficient (F) equal to or higher than 0.0156.
- Distribution of fitness effects
-
(DFE). The distribution of selection coefficients associated with newly arising mutations in a population.
- Out-of-Africa
-
A model by which a small group of modern humans exited Africa approximately 50,000 years ago and dispersed into the Eurasian continents. This movement was accompanied by a severe, possibly tenfold, population bottleneck.
- Serial founder effects
-
Serial expansion of a small ancestral population into a new geographical range; each new deme is created by sampling a small number of individuals to colonize the next location, resulting in a reduced effective population size.
Rights and permissions
About this article
Cite this article
Henn, B., Botigué, L., Bustamante, C. et al. Estimating the mutation load in human genomes. Nat Rev Genet 16, 333–343 (2015). https://doi.org/10.1038/nrg3931
Published:
Issue Date:
DOI: https://doi.org/10.1038/nrg3931
This article is cited by
-
Whole genome sequencing and the application of a SNP panel reveal primary evolutionary lineages and genomic variation in the lion (Panthera leo)
BMC Genomics (2022)
-
Developmental disruption and restoration of brain synaptome architecture in the murine Pax6 neurodevelopmental disease model
Nature Communications (2022)
-
Healthcare services relaxing natural selection may contribute to increase of dementia incidence
Scientific Reports (2022)
-
Genetic structure and demographic history of Allium mongolicum based on SSR markers
Plant Systematics and Evolution (2022)
-
Large-scale genomic analysis reveals the genetic cost of chicken domestication
BMC Biology (2021)