An official website of the United States government
The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
- Publications
- Account settings
- My Bibliography
- Collections
- Citation manager
Save citation to file
Email citation, add to collections.
- Create a new collection
- Add to an existing collection
Add to My Bibliography
Your saved search, create a file for external citation management software, your rss feed.
- Search in PubMed
- Search in NLM Catalog
- Add to Search
Revisiting the malaria hypothesis: accounting for polygenicity and pleiotropy
Affiliations.
- 1 Department of Pediatrics, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA. Electronic address: [email protected].
- 2 Department of Biology, Tufts University, Medford, MA 02155, USA.
- 3 Department of Biology, Stanford University, Stanford, CA 94305, USA.
- 4 Department of Pediatrics, Stanford University School of Medicine, Stanford, CA 94305, USA; Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA 94305, USA. Electronic address: [email protected].
- PMID: 35065882
- PMCID: PMC8916997
- DOI: 10.1016/j.pt.2021.12.007
The malaria hypothesis predicts local, balancing selection of deleterious alleles that confer strong protection from malaria. Three protective variants, recently discovered in red cell genes, are indeed more common in African than European populations. Still, up to 89% of the heritability of severe malaria is attributed to many genome-wide loci with individually small effects. Recent analyses of hundreds of genome-wide association studies (GWAS) in humans suggest that most functional, polygenic variation is pleiotropic for multiple traits. Interestingly, GWAS alleles and red cell traits associated with small reductions in malaria risk are not enriched in African populations. We propose that other selective and neutral forces, in addition to malaria prevalence, explain the global distribution of most genetic variation impacting malaria risk.
Keywords: evolution; malaria; pleiotropy; polygenic; red blood cells.
Copyright © 2021 Elsevier Ltd. All rights reserved.
PubMed Disclaimer
Conflict of interest statement
Declaration of interests The authors declare no competing interests.
Figure 1, Key Figure.. The main polygenic…
Figure 1, Key Figure.. The main polygenic component of malaria resistance is not enriched in…
Figure I. Most common human alleles have…
Figure I. Most common human alleles have a broad geographic distribution.
Data are summarized across…
Similar articles
- A global overview of pleiotropy and genetic architecture in complex traits. Watanabe K, Stringer S, Frei O, Umićević Mirkov M, de Leeuw C, Polderman TJC, van der Sluis S, Andreassen OA, Neale BM, Posthuma D. Watanabe K, et al. Nat Genet. 2019 Sep;51(9):1339-1348. doi: 10.1038/s41588-019-0481-0. Epub 2019 Aug 19. Nat Genet. 2019. PMID: 31427789
- Estimation of regional polygenicity from GWAS provides insights into the genetic architecture of complex traits. Johnson R, Burch KS, Hou K, Paciuc M, Pasaniuc B, Sankararaman S. Johnson R, et al. PLoS Comput Biol. 2021 Oct 21;17(10):e1009483. doi: 10.1371/journal.pcbi.1009483. eCollection 2021 Oct. PLoS Comput Biol. 2021. PMID: 34673766 Free PMC article.
- Genome-wide large-scale multi-trait analysis characterizes global patterns of pleiotropy and unique trait-specific variants. Qi G, Chhetri SB, Ray D, Dutta D, Battle A, Bhattacharjee S, Chatterjee N. Qi G, et al. Nat Commun. 2024 Aug 14;15(1):6985. doi: 10.1038/s41467-024-51075-5. Nat Commun. 2024. PMID: 39143063 Free PMC article.
- Evolutionary perspectives on polygenic selection, missing heritability, and GWAS. Uricchio LH. Uricchio LH. Hum Genet. 2020 Jan;139(1):5-21. doi: 10.1007/s00439-019-02040-6. Epub 2019 Jun 14. Hum Genet. 2020. PMID: 31201529 Free PMC article. Review.
- Discovery and implications of polygenicity of common diseases. Visscher PM, Yengo L, Cox NJ, Wray NR. Visscher PM, et al. Science. 2021 Sep 24;373(6562):1468-1473. doi: 10.1126/science.abi8206. Epub 2021 Sep 23. Science. 2021. PMID: 34554790 Free PMC article. Review.
- Local genetic adaptation to habitat in wild chimpanzees. Ostridge HJ, Fontsere C, Lizano E, Soto DC, Schmidt JM, Saxena V, Alvarez-Estape M, Barratt CD, Gratton P, Bocksberger G, Lester JD, Dieguez P, Agbor A, Angedakin S, Assumang AK, Bailey E, Barubiyo D, Bessone M, Brazzola G, Chancellor R, Cohen H, Coupland C, Danquah E, Deschner T, Dotras L, Dupain J, Egbe VE, Granjon AC, Head J, Hedwig D, Hermans V, Hernandez-Aguilar RA, Jeffery KJ, Jones S, Junker J, Kadam P, Kaiser M, Kalan AK, Kambere M, Kienast I, Kujirakwinja D, Langergraber KE, Lapuente J, Larson B, Laudisoit A, Lee KC, Llana M, Maretti G, Martín R, Meier A, Morgan D, Neil E, Nicholl S, Nixon S, Normand E, Orbell C, Ormsby LJ, Orume R, Pacheco L, Preece J, Regnaut S, Robbins MM, Rundus A, Sanz C, Sciaky L, Sommer V, Stewart FA, Tagg N, Tédonzong LR, van Schijndel J, Vendras E, Wessling EG, Willie J, Wittig RM, Yuh YG, Yurkiw K, Vigilant L, Piel A, Boesch C, Kühl HS, Dennis MY, Marques-Bonet T, Arandjelovic M, Andrés AM. Ostridge HJ, et al. bioRxiv [Preprint]. 2024 Jul 9:2024.07.09.601734. doi: 10.1101/2024.07.09.601734. bioRxiv. 2024. PMID: 39026872 Free PMC article. Preprint.
- Plasmodium knowlesi ( Pk ) Malaria: A Review & Proposal of Therapeutically Rational Exchange (T-REX) of Pk -Resistant Red Blood Cells. Jajosky RP, Wu SC, Jajosky PG, Stowell SR. Jajosky RP, et al. Trop Med Infect Dis. 2023 Oct 20;8(10):478. doi: 10.3390/tropicalmed8100478. Trop Med Infect Dis. 2023. PMID: 37888606 Free PMC article. Review.
- Neither the African-Centric S47 Nor P72 Variant of TP53 Is Associated With Reduced Risk of Febrile Malaria in a Malian Cohort Study. Bhardwaj J, Upadhye A, Gaskin EL, Doumbo S, Kayentao K, Ongoiba A, Traore B, Crompton PD, Tran TM. Bhardwaj J, et al. J Infect Dis. 2023 Jul 14;228(2):202-211. doi: 10.1093/infdis/jiad066. J Infect Dis. 2023. PMID: 36961831 Free PMC article.
- Does fetal hemoglobin inhibit the malarial parasite Plasmodium falciparum? Archer NM, Pasvol G, Wilson I, Duraisingh MT. Archer NM, et al. Am J Hematol. 2022 Sep;97(9):E325-E327. doi: 10.1002/ajh.26661. Epub 2022 Jul 30. Am J Hematol. 2022. PMID: 35834276 Free PMC article. No abstract available.
- A novel ENU-induced Cpox mutation causes microcytic hypochromic anemia in mice. Miyasaka Y, Okuda K, Miura I, Motegi H, Wakana S, Ohno T. Miyasaka Y, et al. Exp Anim. 2022 Nov 10;71(4):433-441. doi: 10.1538/expanim.22-0032. Epub 2022 May 9. Exp Anim. 2022. PMID: 35527013 Free PMC article.
- Shriner D and Rotimi CN (2018) Whole-Genome-Sequence-Based Haplotypes Reveal Single Origin of the Sickle Allele during the Holocene Wet Phase. Am. J. Hum. Genet 102, 547–556 - PMC - PubMed
- World Health Organization (2020) World malaria report 2020: 20 years of global progress and challenges, World Health Organization
- Kariuki SN and Williams TN (2020) Human genetics and malaria resistance. Hum. Genet 139, 801–811 - PMC - PubMed
- Goheen MM et al. (2016) Anemia Offers Stronger Protection Than Sickle Cell Trait Against the Erythrocytic Stage of Falciparum Malaria and This Protection Is Reversed by Iron Supplementation. EBioMedicine 14, 123–130 - PMC - PubMed
- Kanias T et al. (2017) Ethnicity, sex, and age are determinants of red blood cell storage and stress hemolysis: Results of the REDS-III RBC-Omics study. Blood Adv. 1, 1132–1141 - PMC - PubMed
Publication types
- Search in MeSH
Related information
Grants and funding.
- DP2 HL137186/HL/NHLBI NIH HHS/United States
- R35 GM118165/GM/NIGMS NIH HHS/United States
LinkOut - more resources
Full text sources.
- ClinicalKey
- Elsevier Science
- Europe PubMed Central
- PubMed Central
- MedlinePlus Health Information
- Citation Manager
NCBI Literature Resources
MeSH PMC Bookshelf Disclaimer
The PubMed wordmark and PubMed logo are registered trademarks of the U.S. Department of Health and Human Services (HHS). Unauthorized use of these marks is strictly prohibited.
Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.
- View all journals
- Explore content
- About the journal
- Publish with us
- Sign up for alerts
- Published: 23 March 2011
Population genetics of malaria resistance in humans
- P W Hedrick 1
Heredity volume 107 , pages 283–304 ( 2011 ) Cite this article
17k Accesses
189 Citations
12 Altmetric
Metrics details
- Haematological diseases
- Population genetics
A Corrigendum to this article was published on 16 November 2011
The high mortality and widespread impact of malaria have resulted in this disease being the strongest evolutionary selective force in recent human history, and genes that confer resistance to malaria provide some of the best-known case studies of strong positive selection in modern humans. I begin by reviewing JBS Haldane's initial contribution to the potential of malaria genetic resistance in humans. Further, I discuss the population genetics aspects of many of the variants, including globin, G6PD deficiency, Duffy, ovalocytosis, ABO and human leukocyte antigen variants. Many of the variants conferring resistance to malaria are ‘loss-of-function’ mutants and appear to be recent polymorphisms from the last 5000–10 000 years or less. I discuss estimation of selection coefficients from case–control data and make predictions about the change for S , C and G6PD-deficiency variants. In addition, I consider the predicted joint changes when the two β-globin alleles S and C are both variable in the same population and when there is a variation for α-thalassemia and S , two unlinked, but epistatic variants. As more becomes known about genes conferring genetic resistance to malaria in humans, population genetics approaches can contribute both to investigating past selection and predicting the consequences in future generations for these variants.
Similar content being viewed by others
Malaria protection due to sickle haemoglobin depends on parasite genotype
The effect on the equilibrium sickle cell allele frequency of the probable protection conferred by malaria and sickle cell gene against other infectious diseases
On the feasibility of malaria hypothesis
Introduction.
Today, the knowledge that specific genetic variants can contribute to disease resistance in humans is well known ( Dronamraju, 2004 ; Kaslow et al., 2008 ). Haldane (1949a) is widely credited with first proposing that genetic resistance to a disease was potentially an important evolutionary force in humans. Before discussing malaria resistance further, it is worthwhile to consider the initial historical contribution and insights of JBS Haldane, one of the founders of population genetics, to the population genetics of malaria resistance in humans.
Acknowledgment to the general review entitled ‘Disease and Evolution’ by Haldane (1949a) is confusing because there is nothing discussed about human resistance to disease in the body of this manuscript from a meeting in 1948 in Milan. Only in the subsequent discussion from G Montalenti at the end of the article (in Italian), and then from Haldane's response to Montalenti's comment, is this hypothesis put forward. As a result, it is useful to present below a translation of these comments from the Italian and Haldane's brief response.
MONTALENTI. Stresses the importance of the views expressed by Professor Haldane. Recalls the case of microcitemia or thalassemia studied by Silvestroni, Bianco and Montalenti. Recalls a gene, lethal in the homozygous state (Cooley's disease) occurs, in the heterozygous state, with such frequency in some populations (more than 10%) that one must admit that it represents in this condition an advantage for individuals that carry it. Because of some research, far from complete, it appears that the gene is more frequent in malarial areas. Professor Haldane has suggested in verbal communication that individuals with microcitemico, who among other characteristics have increased globular resistance, may be more resistant to malaria. However, this is an interesting case of heterosis, which relates to what was described by Professor Haldane.
HALDANE. 1: I agree with Dr Montalenti's project. Another possibility is that (by analogy with the advantage possessed by vermillion Drosophila on media deficient in tryptophan) microcythemic heterozygotes may be at an advantage on diets deficient in iron or other substances, thus leading to anaemia.
In addition, at the VIII International Congress of Genetics in Stockholm in 1948, actually previous to the Milan meeting, Haldane (1949b) outlined his hypothesis in more detail. This was in response to the idea put forth by Neel and Valentine (1947) that different human populations may have different mutation rates.
Finally, Neel and Valentine (1947) have calculated a mutation rate for the gene which when homozygous gives rise to the lethal condition of thalassemia major (Cooley's anemia), while the heterozygote has a mild microcytic anemia with decreased fragility of the erythrocytes. The gene is found in about 4.1% of the people of Italian origin in Rochester, NY. If the heterozygote had normal viability, equilibrium would be secured by a mutation rate of 4 × 10 −4 . On the other hand, if the heterozygotes had an increased fitness of only 2.1% there would be equilibrium without any mutation at all.
Neel and Valentine believe that the heterozygote is less fit than normal, and think that the mutation rate is above 4 × 10 −4 rather than below it. I believe that the possibility that the heterozygote is fitter than normal must be seriously considered. Such increased fitness is found in the case of several lethal and sublethal genes in Drosophila and Zea . A possible mechanism is as follows. The corpuscles of the anemic heterozygotes are smaller than normal and more resistant to hypotonic solutions. It is at least conceivable that they are also more resistant to attacks by the sporozoa that cause malaria, a disease prevalent in Italy, Sicily and Greece, where the gene is frequent. Similarly, the gene, which causes an anemia similar to that of iron deficiency, might be harmless or even useful to persons on an iron-deficient diet, although causing a relative anemia when the diet is more generous. Numerous other similar hypotheses could be framed. On many of them, the gene would lower fitness in America, but might, at least in the heterozygous condition, have the opposite effect in Greece or Sicily. Until more is known of the physiology of this gene in various environments, I doubt if we can accept the hypothesis that it arises very frequently by mutation in a small section of the human species.
From examining these passages, it seems appropriate to cite Haldane (1949b) as the first source of the idea that genetic variants in humans may confer resistance to malaria (see also the discussion in Crow, 2004 ; Weatherall, 2004 ).
At present, it is difficult to imagine that Haldane in 1948 was the first to articulate the potential importance of genetic variants in humans for resistance to malaria (actually, Beet, 1946 found that there were lower rates of malaria in sickle-cell heterozygotes than in normal individuals). The first generally recognized evidence for genetic resistance to malaria was published only a few years later by Allison (1954 , 2004 ) showing that the densities of the malarial parasite, Plasmodium falciparum , were significantly lower in sickle-cell hemoglobin heterozygotes AS than in normal AA individuals. Allison (1955) then discussed the evolutionary implications of his findings in the influential 1955 Cold Springs Harbor Symposium on Quantitative Biology, probably the initiation of the large effort, both medically and evolutionarily, to understand genetic resistance to malaria in humans (see also Allison, 1964 ). Overall, the original ‘malaria hypothesis’ of Haldane that diseases such as thalassemia are polymorphisms with an advantage to heterozygotes because of the balance of the benefits of resistance to malaria and detrimental effects due to the disease has been proven correct; however, the actual mechanisms of protection for different genetic variants have often proven to be complicated to understand.
My purpose here is to discuss some of the recent evolutionary genetic developments in human genetic resistance to malaria. Fortunately, in recent years there have been excellent reviews of the mechanisms of malaria resistance, as well as surveys of the many genes that may putatively confer malaria resistance ( Kwiatkowski, 2005 ; Verra et al., 2009 ; López et al., 2010 ). Therefore, I will not discuss in detail the molecular basis of the mechanisms of resistance or enumerate all the potential malaria resistance genes, but will focus on the genetic variants in which there is strong evidence for resistance, and data useful for population genetic approaches and analysis. Understanding the geographic patterns of these variants is fundamentally influenced by resistance selection, but would also include consideration of gene flow, mutation, genetic drift and the interaction of all these factors. I will mention these other evolutionary factors as appropriate, but will focus here on the effect of selection on these advantageous variants. I will also discuss the population genetics of two examples in which there are two malarial resistance variants in the same population, the two alleles S and C at the β-globin locus and then alleles at two different genes, the S allele at the β-globin locus and the α + thalassemia variant at the α-globin locus.
General background
Surprising to many today, malaria was endemic in southern parts of Europe and North America less than a century ago and was not eliminated from many locations around the Mediterranean Sea until after World War II in the late 1940s. Today malaria is still found across most of sub-Saharan Africa and in extensive regions in Asia and South America. To illustrate the human impact of malaria, in 2008 there were 243 million clinical cases and an estimated 863 000 deaths from malaria ( WHO, 2009 ), many of whom were children, making it one of the leading causes of death worldwide. Most of this mortality is from ‘severe malaria,’ which is due to P. falciparum , begins about 6–14 days after infection from mosquitoes, has the biggest impact on children and pregnant women, progresses rapidly and can cause death within hours or days (here, I will generally be considering the impact of severe malaria). The past mortality rate from malaria was even higher, particularly for populations without previous exposure to malaria. For example, on the west coasts of Africa in the early 1800s, mortality rates in Europeans often exceeded 50% per year ( Curtin, 1989 ). After quinine was introduced in the later 1800s, the mortality rate reduced to about 25% of that before its introduction, indicating that the great majority of the mortality was due to malaria.
It is generally assumed that death from malaria increased between 10 000 and 5000 years ago with the development of agriculture and settlements. These changes are thought to have increased both the density of the malaria mosquito vector and human population density, thereby greatly facilitating the transmission of malaria ( Carter and Mendis, 2002 ). Support for this hypothesis comes from some molecular analysis of P. falciparum strains that suggests that the African malaria population expanded around 10 000 years ago and spread to other areas ( Joy et al., 2003 ). In addition, chromosomal data from the mosquito vector Anopheles gambiae support contemporaneous speciation ( Coluzzi et al., 2002 see also; Lawniczak et al., 2010 and Neafsey et al., 2010 ). Recent analysis suggests that P. falciparum in humans is the result of cross-species transmission from gorillas ( Liu et al., 2010 ). Although it is not possible to date this transmission precisely at this point, it appears recent and may be coincidental with the advent of strong selection for malaria resistance in humans.
Although I will generally not discuss the mosquito vector of malaria here, Carter and Mendis (2002) summarized some relevant information about vector–host preference. In most areas where endemic malaria is prevalent, Anopheles mosquitoes are generally highly zoophilic, rather than anthropophilic, that is, they greatly prefer animals to humans for blood meals. However, in sub-Saharan Africa, these mosquito vectors are highly anthropophilic. Livingstone (1958) suggested that this was an adaptation to preference for humans in the new agrarian environments that had both high densities of humans and provided numerous small water pools for mosquito development. On the other hand, in Asia and the Middle East, there was an abundance of domesticated animal species during the rise of agriculture, which served as a blood source for mosquitoes; there were generally no such domesticated animal sources in Africa.
P. falciparum is the malarial species that is the most important cause of mortality in humans but, in addition, nearly 3 billion people, mainly in Asia and South America, are at risk from another malaria species, P. vivax , which causes chronic disease and deadly complications ( Guerra et al., 2010 ). Three more species, P. ovale , P. malariae and P. knowlesi ( Cox-Singh et al., 2008 ) can also cause malaria in humans. Infections with malaria may reach very high levels, for example, Mueller et al. (2009) found that 73% of a Papua New Guinea population were infected. When the species of malaria infecting individuals were identified, 54.9, 35.7, 13.4 and 4.8% of the individuals were infected by P. falciparum , P. vivax , P.malariae and P. ovale , respectively, and a significant excess of mixed infections over expectations was detected.
As a result of the high mortality and widespread impact of malaria, it is thought to be the strongest evolutionary selective force in recent human history ( Kwiatkowski, 2005 ). In fact, genes that confer resistance to malaria provide some of the best-known case studies of strong positive selection in modern humans. For example, maintenance of the sickle-cell hemoglobin variant is the classic example of heterozygote advantage, G6PD deficiency is an example of very strong selection at an X-linked locus, variants at β-globin ( S , C and E ) and those causing G6PD deficiency ( A -, Med and Mahidol ) illustrate selective advantage from a single-nucleotide change, HLA-B53 provides an example of a selective advantage of a gene conversion product and α-thalassemia provides an example of selection on duplicated loci. Before I examine resistance from some specific genetic variants, let me summarize several important general findings related to genetic resistance to malaria in humans.
(1) There is extensive overlap of the historical geographical distribution of malaria and human genetic variants that confer resistance to malaria. For example, P. falciparum is found across Africa and Asia, as are the resistance variants of hemoglobin and G6PD. Further, in regions of high endemic malaria, such as sub-Saharan tropical Africa and lowland Melanesia, there are generally more than one resistant variant present. Resistant variants are generally not present in regions where there is no history of malaria, except in recent immigrants ( O'Shaughnessy et al., 1990 ). For example, there does not appear to be any of the common malaria-resistant variants in native people of the new world ( Livingstone, 1985 ), apparently because their Asian ancestors were not exposed to malaria and malaria was only brought to the new world by early Spanish explorers in the 1600s and 1700s. In addition, there are convincing examples of microgeographic variation in β-thalassemia with higher frequencies at lower altitudes in Sardinia where malaria was historically endemic compared with higher altitudes ( Siniscalco et al., 1961 ) and variation of α-thalassemia ( Flint et al., 1986 ) and β-thalassemia ( Hill et al., 1988 ) in Melanesia correlated with altitude, which is highly correlated with malaria endemicity.
(2) The presence of strong selective pressure for resistance to malaria has resulted in the high frequency of some detrimental genetic diseases, such as sickle-cell anemia, thalassemia, G6PD deficiency and ovalocytosis. In fact, Weatherall (2010) has suggested that there is now a global health burden because more than 300 000 children are born each year with a severe inherited hemoglobin disorder that is in high frequency because it confers, or has conferred, malarial resistance. In fact, this strong selective pressure explains why disorders of the red blood cell are by far the most common genetic diseases in humans ( Weatherall, 2008 ). Further, it is possible that some of our other immunological, inflammatory and other chronic diseases found today may be due to the pleiotropic effects of the malaria resistance variants in high frequency because of the evolutionary pressure that malaria has exerted on humans ( Kwiatkowski, 2005 ).
(3) Many of the variants conferring resistance to malaria are ‘loss-of-function’ mutants and result in changed, generally reduced, expression or an altered product. In some cases, these mutants have a large pleiotropic cost associated with them and result in disease, as discussed above. In other cases, the change in expression or in gene product does not appear to have a major cost, as in the Duffy ‘null’ variants, the O in the ABO system and allele C for the β-globin gene.
(4) Variants at some loci conferring resistance to malaria are nucleotide- or codon-specific whereas others are quite general. For example, the two known independent Duffy ‘null’ variants in African and Papua New Guinea are at the exact same nucleotide position, 33 nucleotides upstream from the start codon apparently on different genetic backgrounds. Moreover, two of the three structural polymorphic β-globin variants that confer resistance to malaria, S and C , are at the same codon. The only other structural variant that appears to provide resistance to malaria, E , may provide protection because it alters β-globin expression and functions similar to a thalassemia variant. On the other hand, there are multiple variants that alter expression levels and provide protection within each of the categories known as α-thalassemia, β-thalassemia and G6PD deficiency.
(5) It appears that many of the genetic variants that provide resistance to severe malaria that have been dated are recent, not ancient, polymorphisms and that they have arisen in the last 5000–10 000 years or less (see Table 2). That is, these variants appear to date to a recent time when severe malaria became a strong selective force in humans. This means that these variants did not provide resistance to malaria in other species, that is, they are not trans-specific polymorphisms, and that they have been generated by new mutations in our recent human ancestors. Either there does not appear to be homologous resistance variants in other primates ( Verrelli et al., 2006 ; MacFie et al., 2009 ) or the malaria resistance variants found in primates are not identical to those in humans ( Tung et al., 2009 ).
(6) There are many different unlinked genes that appear to confer resistance to malaria in humans. Because the selection pressure from malaria through high mortality and widespread exposure is great, genes that provide resistance to malaria would be selected even though there may be a cost to resistance from genetic disease. In addition, there appears to be many different types of genes that can confer resistance. For example, Kwiatkowski and Luoni (2008) list potential resistance genes in the categories of hemoglobin genes, erythrocyte surface molecules, other erythrocyte-related proteins, adhesion molecules, mediators of innate immunity and acquired immunity genes. Recent genome-wide studies show the potential to identify many other genes that have a smaller effect on malarial resistance. Further, although much of the variation in malaria resistance appears because of genetic effects, only a small proportion of this variation appears because of well-known genetic factors, such as sickle-cell and α-thalassemia ( Mackinnon et al., 2005 ).
(7) Some of the genes that confer resistance to malaria are among the most variable genes in the human genome. The human leukocyte antigen (HLA), G6PD, globin and ABO genes are all among the most variable genes in the human genome and all appear to have variants that confer resistance to malaria. This high variation may be because genomic surveys have shown that disease resistance genes in general have high standing variation and that some of this variability provides resistance to malaria. It may also be because these regions have a high enough mutation rate by which they can generate enough variants quickly enough to provide variation for positive selection for malaria resistance.
(8) Different populations appear to have different levels of resistance to malaria even when they both live in the same, or similar, area where malaria is prevalent. These differences could result from non-genetic factors such as culture, nutritional status or patterns of disease transmission or incidence, but in some cases they appear to be from genetic differences such as different frequencies of resistance variants (for example, high frequency of Duffy ‘null’ in Africa confers resistance to P. vivax , see below; high frequency of α-thalassemia in Nepalese populations, Modiano et al., 1991 ), epistasis with other host resistance variants or differences in unknown genetic resistance factors ( Dolo et al., 2005 ).
Resistance mutants
Many different genes are involved with genetic resistance to malaria ( Kwiatkowski, 2005 ; Verra et al., 2009 ) and these genetic factors may vary over populations because of the presence or absence of specific resistant genetic variants, different species or strains of malaria and different effects of specific variants in given host genotypes. Other factors such as population structure and chance, as well as the experimental design or the phenotype(s) of malaria diagnosed in a given study, may impact the conclusions. Efforts to use genomics approaches and genome-wide association analysis have been initiated ( Ayodo et al., 2007 ; Timmann et al., 2007 ; The Malaria Genomic Epidemiology Network, 2008 ; Jallow et al., 2009 ; Eid et al., 2010 ) and some new regions of potential malaria resistance have been identified.
In an estimate of the extent of malaria resistance explained by genetic factors in Kenya using pedigree analysis, 25% was attributed to additively functioning host genes and 29% to unidentified household effects ( Mackinnon et al., 2005 ). Only 2.1% of the variation was attributed to sickle-cell variation and, using an unrelated approach, only about 2% was attributed to α-thalassemia. In other words, a large part of the variation in malaria resistance was attributed to additive genetic effects, but only a small proportion could be attributed to the most well-known genetic factors in the population.
Below I will concentrate on genes for which there is strong evidence for resistance and data that are useful for population genetic approaches and analysis. In general, I will discuss resistance variants that lower mortality from severe malaria, although resistance variants may reduce mortality or morbidity when malaria is not severe or have other impacts, such as reduction in the rate of malaria infection. Some other important genes not discussed here are those that are concerned with the innate immune response (as distinct from the innate resistance) and cytoadherence ( Kwiatkowski and Luoni, 2008 ). Some recent references to genes that are not discussed here are for the tumor necrosis factor ( TNF ) gene ( Clark et al., 2009 ), the toll-like receptor ( TLR ) genes ( Ferwerda et al., 2007 ) and the genes that cause CD36 deficiency ( Fry et al., 2009 ).
Hemoglobin mutants
Hemoglobinopathies, inherited disorders of hemoglobin structure and function, consist of structural hemoglobin variants and the thalassemias, inherited defects that reduce production in the synthesis of the α- and β-globins of the human adult hemoglobin ( Weatherall and Clegg, 2001 ; Weatherall, 2008 ). Two α and two β subunits compose the tetrameric protein backbone of adult hemoglobin, and the α and β subunits are made up of 141 and 146 amino acids, respectively, each with three exons. The genes that code for α- and β-globins, HBA (there are actually two genes, HBA1 and HBA2 , with identical amino-acid sequences) and HBB , are unlinked and are located on chromosomes 11 and 16, respectively. Hundreds of structural variants of adult hemoglobin have been documented ( Giardine et al., 2007 ), but only three, all at the HBB locus, S , C and E , have reached substantial frequencies in any populations. Two of these three, S and C , are different mutants (with different phenotypes) at the same amino-acid position 6 and, although E is a structural variant at amino-acid position 26, it also affects regulation in a manner similar to the thalassemias. Surprisingly then, the apparently advantageous structural variants are only at two codons in gene HBB , suggesting that mutants at other sites at the HBB locus, and all sites at the HBA loci, are either neutral or detrimental in malarial environments (or have not undergone mutation in malarial environments). Finally, every region in the world where malaria was prevalent has hemoglobinopathies, but no two areas have the same hemoglobinopathy or combination of hemoglobinopathies ( Flint et al., 1998 ). I will discuss some of these patterns below, but here I suggest that they appear to be caused primarily by differential mutation originating in various regions where malaria is prevalent and potentially by subsequent selective interaction between the variants.
Sickle cell, S or β S or HbS
The sickle-cell allele (for simplicity I will symbolize it by S ) was one of the first human genetic variants that was associated with a specific molecular-caused disease ( Pauling et al., 1949 ; Ingram, 2004 ). The sickle-cell allele is widely known as a variant that causes red blood cells to be deformed into a sickle shape when deoxygenated in AS heterozygotes, in which A indicates the non-mutant form of the β-globin gene, and also provides resistance to malaria in AS heterozygotes. In SS homozygotes, S causes the severe disease sickle-cell anemia. It is generally assumed that individuals with genotype SS had very low or zero fitness when, or if, there was no modern medical care. The greatest documented impact of S is that it protects against death or severe malarial disease in heterozygotes, and it appears that S has less effect on the rate of infection ( Weatherall, 2008 ). Multiple mechanisms of malaria protection in AS individuals have been proposed and supported, including that the growth of malarial parasites is suppressed in sickle cells.
The S allele differs from the normal A allele at one nucleotide at codon position 6 in the β-globin molecule, which translates into a change from the amino acid glutamic acid to valine (β6 Glu → Val and GAG → GTG; Table 1 ). The frequency of allele S is up to 0.2 in some parts of sub-Saharan Africa, Greece and India. There appears to be five independent origins, or mutations, of the S allele from the A allele, based on their existence in nearly non-overlapping geographical regions, four in Africa, Bantu (Central Africa Republic, CAR), Benin, Cameroon and Senegal, and another one in both India and Saudi Arabia, and the extensive differences in linked variants resulting in association with different classical haplotypes. Mediterranean S alleles appear to descend from the African S alleles, many of them from the Benin haplotype ( Flint et al., 1998 ). Recently, Ralph and Coop (2010) have shown that such parallel adaptation of multiple S alleles is not inconsistent with their theoretical spatial model and general estimates of mutation rate, gene flow, selection and population size, a finding consistent with that of Karasov et al. (2010) who suggest that mutation does not limit adaptation in many situations. On the other hand, it has been proposed that a plausible explanation for much of the haplotype diversity around S alleles is that it reflects past recombination and gene conversion ( Flint et al., 1998 ).
A general approach to determine the risk of individuals with a given genotype getting a disease, relative to that in the rest of the population, is to calculate the odds ratio ( OR ) as
where f c and f d are the frequencies of the genotype in control and diseased groups, respectively. Another measure of the protectiveness of a genotype from a disease is the relative risk ( RR ), which is
OR and RR are closest to each other when both f c and f d are in low frequency (for an introduction to OR and RR , see Agresti, 2007 ).
In a large study of HLA variation and malaria in Gambia, Hill et al. (1991) also examined the effect of sickle-cell variation on the presence of severe malaria as a standard to measure the impact of HLA variation. In 619 children with severe malaria only seven (0.012) were AS , carriers of the S allele. On the other hand, in 510 other children, who were outpatients and termed ‘mild controls’, many more, that is, 66 (0.129) were AS . Further in this study, OR =0.082 and RR =0.093, indicating very great protection (often calculated as 1— OR or 1— RR ) from severe malaria for AS genotypes. Ackerman et al. (2005) compared this ‘case–control’ method with a family-based method in more recent samples from Gambia and found similar OR values of 0.10 and 0.11.
To understand the dynamics of genetic change from adaptation for resistance to malaria, it is useful to estimate the selection coefficients for the different genotypes, values that can be estimated from OR data; but first, let me show how OR can be used in a population genetics context. Let me define the probability that malaria is present, given the AS heterozygote, relative to the probability of malaria when another genotype is present, as w . Using the traditional population genetics equation for viability selection (without reproduction; Hedrick 2011a , p. 129), the frequency of the genotype AS in the disease group is expected to be
This expression can be solved for w and is identical to OR
The selection coefficient ( s ) for genotype AS can be estimated as
where m is the rate of non-genotype-specific mortality due to malaria (generally the mortality given that an individual has severe malaria) and 1— OR is the protection from malaria conferred by the genotype (after Hill, 1991 ; Hedrick, 2004 ). (Note that both Hill (1991) and Hedrick (2004) used the symbol RR in this equation, but in fact used the definition of OR given in equation (1a), reflecting the similarity of these measures for rare genotypes and the less specific use of these terms in earlier studies). For example, for the AS genotype if OR =0.082 as above and m =0.1 (see below), then s =0.092.
Hill (1991) suggested a value of 0.07 for m whereas from studies by both WHO (1998) and Carter and Mendis (2002) , a value of at least m =0.1 may be appropriate. However, recognizing that the appropriate value of m here should reflect the mortality in the evolutionary period over the many centuries when malaria was endemic and that it should account for the cumulative mortality of individuals with malaria before reproduction (and selection should include the impact on reproductive and mating success), an even higher value may be appropriate. For example, Rowe et al. (2006) found that in rural West Africans in whom malaria is hyperendemic and there is inadequate medical care, ∼ 20% of the children die of malaria by the age of 5. As a result, I will use m =0.1 here as a general value for illustration, with the caveat that m , and the consequent selection coefficient, may actually have been significantly higher in the past.
Because this selection gives an advantage to the AS genotype, the relative fitness of the AS genotype is the highest (1) and the relative fitness of genotype AA is 1— s or 0.908; therefore, the relative fitness levels of genotypes AA , AS and SS become 0.908, 1 and 0. From these values, the expected equilibrium frequency of allele S ( Hedrick, 2011a , p. 136) is 0.092/(0.092+1.0)=0.084, not unlike that observed in many sub-Saharan populations.
The age of two of the putative S alleles has been estimated by examining the extent of linkage disequilibrium surrounding the HBB locus. Currat et al. (2002) examined the S Senegal haplotype in a sample from eastern Senegal, in which there has not been recent admixture, and found that all haplotypes were identical. Using a selective advantage of 0.152 for AS genotypes over AA , they estimated that the age of S Senegal alleles was between 45 and 70 generations (1125 and 1750 years using 25 years per generation; see Table 2 ). Currat et al. (2002) suggested that the S mutation may have been at low frequency in the population and that this age estimate reflects the time since selection for malaria resistance increased this Senegal haplotype. Modiano et al. (2008) estimated the age of the Benin S haplotype by estimating the number of generations taken to decay to the observed linkage disequilibrium from the maximum expected when the S mutation occurred. They estimated that the age of the mutation was very recent and between 10 and 28 generations (250–700 years).
What is the expected future frequency of S alleles, because malaria has been eradicated in many areas and because modern medical care has increased the survival of individuals with sickle-cell anemia? First, heterozygotes AS do not appear to have symptoms of anemia or other effects (costs) of carrying the S allele in non-malarial environments ( Sheng et al., 2010 ), although some negative effects for AS individuals during strenuous exercise have been suggested ( Connes et al., 2008 ; Tsaras et al., 2009 ). As a result, the National Collegiate Athletic Association in the USA has instituted mandatory testing for sickle-cell carrier status for student athletes in its Division 1 sports ( Bonham et al., 2010 ). Second, with modern medical care, individuals with SS can live up to adulthood and reproduce with a much less reduction in fitness than without modern medical care. For example, Platt et al. (1994) found that the median age at death for individuals with sickle-cell anemia in the United States was 42 and 48 years for males and females, respectively (see similar results in Jamaica in Wierenga et al., 2001 ). As a result, in the absence of malaria and with modern medical care, the relative fitness levels of genotypes AA , AS and SS can generally be indicated by 1, 1 and 1— s . With these fitness levels, we can calculate how long it would take for selection (against SS homozygotes) to reduce frequency of the S allele from q 0 in generation 0 to q t in generation t using the following equation
( Hedrick, 2011a , p. 126).
In African-Americans, the frequency of S is about 0.05 and the proportion of individuals with sickle-cell anemia is about 1 in 400. How long would it take for the frequency of the S allele to be halved to 0.025 (this decreases the expected proportion of individuals with sickle-cell anemia to 1 in 1600)? With q 0 =0.05 and q t =0.025, the term in brackets in equation (4) is 20.7. Therefore, the number of generations to halve the frequency of allele S is about 20.7/ s . If we assume that s is still quite large, say 0.5, then it will take 41.4 generations, or 1035 years, to halve the frequency, quite a long time (of course if s is smaller, the decline would be even slower). The basis for this very slow rate of decrease is that most S alleles are in heterozygotes. Assuming that the frequency of alleles A and S are p and q , the proportion of S alleles in heterozygotes is pq/ ( pq + q 2 )= p. In other words, 95% of the S alleles in this example are in heterozygotes in which selection cannot act against them. Consistent with this prediction, Hanchard et al. (2007) found no significant difference between haplotype frequencies from Jamaica, where there has not been malaria since 1963 (about two generations), and the ancestral African haplotypes. This is in contrast to when selection favored S because of protection from malaria and selection acted favorably on the proportion p of the S alleles that were in heterozygotes and consequently resulted in a fast increase in allele frequency.
Gouagna et al. (2010) showed that there is higher transmission from the human host to the Anopheles mosquito vector of P. falciparum for S - and C -carrying individuals. This is an example of how host genetics can influence the efficiency of malaria transmission, but here the same host genotypes that confer protection against malaria appear to increase its transmission. However, individuals with these protective genotypes also use anti-malarial drugs less frequently because they may have more chronic, asymptomatic infections and therefore may become higher transmitters of infection ( Gouagna et al., 2010 ). Overall, it is not clear as to how these effects influence disease incidence and mortality in genotypically different individuals.
C or β C or HbC
Allison (1956) examined the effects and frequency of the hemoglobin C variant as well as sickle cell in his early studies in Africa. C is much more localized in central West Africa and reaches an allele frequency up to 50% in the Ivory Coast. C provides protection against P . falciparum malaria in both heterozygotes AC and homozygotes CC , although the estimated level of protection in heterozygotes appears to vary in different studies ( Modiano et al., 2008 see discussion below). The protective effect of C may result from increased immune clearance of infected erythrocytes. From examination of haplotypes and from its geographic distribution, there appears to be a single origin of C in Africa ( Flint et al., 1998 ). The C variant has also been found in Thailand on a different, non-African haplotype, indicating a second, independent origin ( Sanchaisuriya et al., 2001 ). Because in routine hemoglobin typing, C is often mistaken for variant E , which is common in parts of Thailand, the frequency of C in Thailand is not known precisely.
The change in the β-globin molecule for C is at the same amino-acid position as for S , but results in a change from glutamic acid to lysine rather than to valine (β6 Glu → Lys or GAG → AAG). There does not appear to be any cost from anemia or other traits to either the heterozygote AC or the homozygote CC , although CC homozygotes have symptoms comparable with a mild form of thalassemia. Individuals with the genotype SC have anemia, but it is milder than sickle-cell anemia (the much lower fitness of SC than AS indicates that C is different from the wild-type A allele). As a result, it appears that C could eventually go to fixation in malarial environments and could even replace S ( Modiano et al., 2001 ; Hedrick, 2004 ). However, Modiano et al. (2008) suggested that C is not as widespread as S because it is recessive in its advantage for malaria resistance (see below for a discussion).
Wood et al. (2005) used a coalescent-based approach to jointly estimate the time since the C allele began to increase in frequency because of strong selection and the selection coefficient utilizing data on the present allele frequency (0.15), effective population size and recombination rate. They estimated that the age of the allele was between 75 and 150 generations (1875 and 3750 years) and that the selection coefficient s was between 0.04 and 0.09 (assuming that the relative fitness levels of genotypes AA, AC and CC were 1, 1+ s , and 1+2 s , respectively). Modiano et al. (2008) calculated the age of the C haplotype in Burkina Faso by estimating the number of generations taken for linkage disequilibrium to decay to the observed level from the maximum expected when the C mutation occurred. They estimated that the age of the mutation was between 38 and 120 generations (950–3000 years), at the lower range of the estimate of Wood et al. (2005) .
E or β E or HbE
Hemoglobin E is the most common structural hemoglobin variant in Southeast Asia and has reached an allele frequency of up to 70% in some areas of northern Thailand and Cambodia. In general, neither hemoglobin S nor C variants are present in Southeast Asia and hemoglobin E is generally absent from populations in which S and C are present. This appears to be the result of historical differences resulting from mutational origin, but little is known about the interaction of E with S and E with C ( Fucharoen, 2001 ). It results from the substitution of lysine for glutamic acid in codon 26 (β26 Glu → Lys and GAG → AAG). Besides being a structural variant, the E variant also causes the production of an abnormal mRNA with less β-globin being synthesized. As a result, there is a mild thalassemia phenotype in EE homozygotes, but it does not appear to have an effect in AE heterozygotes (however, there is a much lower fitness of compound heterozygotes with both the E and the α- and β-thalassemia haplotypes; Fucharoen, 2001 ). AE heterozygotes appear to have protection from invasion into erythrocytes by P. falciparum malaria ( Chotivanich et al., 2002 ). Where E is in high frequency, some other red cell disorders, such as α-thalassemia, are also in high frequency.
Although extensive sequence analysis has not been carried out, from examination of the haplotypic backgrounds for E alleles in northeast Thailand, more than one mutational origin ( Fucharoen et al., 2002 ) is suggested. However, the E allele found in China is on the same haplotype as that found in Thailand ( Ohashi et al., 2005 ), suggesting that it does not have a different origin. From forward simulations, which used the amount of linkage disequilibrium close to the variant and the observed allele frequency, the age of the E allele was estimated to be about 100.3 generations ( Ohashi et al., 2004 ), or 2508 years, and the estimated selection coefficient between genotypes AA and AE was 0.079.
α – thalassemia
As I mentioned above, there are two linked genes with identical amino-acid sequences, HBA1 and HBA2 , on chromosome 16 that code for α-globin. There are two primary types of α-thalassemia: α + thalassemia, in which one pair of the genes is deleted or inactivated by a point mutation, and α 0 -thalassemia, in which both pairs of genes are deleted or inactivated. The deleted forms appear to be generated at a relatively high rate by unequal crossing over and Lam and Jeffreys (2006) estimated the rate of deletion generation in sperm from two men as 0.42 × 10 −4 . The haplotypes or gametes for the deleted forms with 2, 1 and 0 genes are indicated by αα, −α and −−, and the homozygous forms of α + thalassemia and α 0 thalassemia are indicated by −α/−α and −−/−−. The α + thalassemia is the most common monogenic disease in the world; however, as a homozygote, it results in an extremely mild anemia, −−/αα heterozygotes also have a mild anemia, similar to homozygous α + anemia, that is, the phenotypic effect appears to be the same whether the deletions are on the same chromosome −−/αα or on different chromosomes −α/−α. For heterozygotes with three α genes, or −α/αα, there is no anemia. When individuals are homozygous for deletions of both genes, −−/−−, or α 0 thalassemia, it is lethal and they are stillborn.
The frequency of α + thalassemia is generally >10% in regions where malaria is prevalent and in some populations, such as in Nepal, parts of India and Papua New Guinea, it is over 80% ( Flint et al., 1998 ). In fact, α + thalassemia appears similar to a transient polymorphism found in Nepal and Melanesia and it might eventually reach fixation given that selection continues to favor the −α haplotype. However, in sub-Saharan African populations, α + thalassemia frequencies do not exceed 50% despite intense malaria selection and Williams et al. (2005a) suggested that this might occur because of negative epistasis with the S allele (see discussion below).
The four different deletions that commonly cause α + thalassemia (out of about 80 in total) have different somewhat non-overlapping distributions. For example, deletion −α 3.7 is found nearly worldwide, but is most common in Africa, deletion −α 4.2 has a high frequency in Southeast Asia and −α 3.7I is found most commonly in African, Indian and Mediterranean populations ( Flint et al., 1998 ). In addition, there are many different local forms of these single-gene deletions, as well as of the deletions that eliminate both genes. Although the age of these variants has not been estimated, their localized presence suggests that they are recent, have arisen in different malarial areas and are increasing because of their selective advantage.
The distribution of both α and β thalassemia variants correspond closely to the regions that have historically had high rates of malaria ( Weatherall and Clegg, 2001 ) and the local distribution of these variants also corresponds to endemic malaria ( Siniscalco et al., 1961 ; Flint et al., 1986 ; Hill et al., 1988 ). In addition, several studies have shown protection from severe malaria for individuals with α + thalassemia, compared with individuals without thalassemia ( Allen et al., 1997 ; Mockenhaupt et al., 2004 ; Williams et al., 2005b ). In a large case–control study in Kenya ( Williams et al., 2005b ), the OR values, standardized to the normal αα/αα genotype, for α + heterozygotes and homozygotes in Kenya are 0.623 and 0.522 for severe malaria, respectively ( Hedrick, 2011b ). In the other case–control study in Papua New Guinea ( Allen et al. 1997 ), the standardized OR values of α + thalassemia heterozygotes and homozygotes are 0.781 and 0.362, respectively, for severe malaria. Overall, it appears that many haplotypes that reduce expression of α-globin provide a selective advantage in resistance to severe malaria. Such a general advantage to an entire class of variants that reduces expression appears to be also true for β-globin variants and for G6PD-deficiency variants that reduce enzyme expression (see below).
Using the OR values mentioned above and equation (3), and assuming that m =0.1, the estimated relative fitness levels of individuals with different genotypes with different numbers of functional α gene copies are given in Table 3 . Here, the fitness of −−/−− is 0, as discussed above and the fitness of −−/−α with one gene is w 1 and the fitness of −−/αα and −α/−α individuals are the same.
From these values, I intuitively expected the −α gamete to increase in frequency to fixation, except for unequal recombination generating other gametes, and for there to be strong selection against −− gametes. This is true for the estimated Kenyan relative fitness values mentioned above for all values of w 1 ( Hedrick, 2011b ). However, for the Papua New Guinea fitness values −α goes to fixation only when genotype −−/−α has a fitness of 0.79 or above. When w 1 is 0.78 or less, haplotype −α does not increase in frequency and haplotypes −− and αα go to a heterozygote advantage polymorphic equilibrium, not substantially different from that observed for some Asian populations ( Hedrick, 2011b ).
β–thalassemia
Remember that the ‘malaria hypothesis’ of Haldane (1949b) was proposed to explain the very high level of β-thalassemia in some Mediterranean populations. There is ordinarily only one copy of the HBB gene and β + and β 0 thalassemia indicate the reduction and loss, respectively, of the production of functional protein. Individuals with β-thalassemia major are often homozygous for loss of production of β-chain mutants (although they may be compound heterozygotes for two mutants), have profound anemia and, if they are not treated with blood transfusions, die in their first year. Heterozygotes typically have mild anemia (β-thalassemia minor), but symptoms can vary greatly in severity from having severe anemia to being a symptomless carrier. In general, β-thalassemia is more of a public health problem because of this higher morbidity than α-thalassemia.
Most of the β-thalassemias are the result of single-nucleotide substitutions or small insertions or deletions. Around 200 are known worldwide with many different local forms; therefore, the fact that they are the result of different mutations appears clear. Also, because of the local distributions, it appears that the mutants are recent in age. The frequency of carriers of β-thalassemia variants is from 5 to 20% in some areas, although not as high as the frequency of α-thalassemia variants ( Weatherall and Clegg, 2001 ).
Worldwide distribution of β-thalassemia generally coincides with historic distribution of malaria. Malaria is not present today in most areas with higher frequencies of β-thalassemia, such as around the Mediterranean; therefore, case–control studies are not possible. However, Willcox et al. (1983) carried out a case–control study in an endemic malaria area of Liberia and found evidence for malaria protection in β-thalassemia heterozygotes.
β-thalassemia variants exist in areas where there are structural β-globin variants and as a result, compound heterozygotes (heterozygotes with two different mutants) occur. For example, in areas of Africa and around the Mediterranean, there are S β-thalassemia heterozygotes. The severity of disease in these individuals varies depending on the nature of β-thalassemia allele, but at its most extreme can resemble sickle-cell anemia. In Southeast Asia, where the structural variant E is common, the compound heterozygote E β-thalassemia is also common ( O'Donnell et al., 2009 ) and causes about 1/3 of the cases of severe thalassemia in Sri Lanka.
Enzymopathies
G6pd deficiency.
Glucose-6-phosphate dehydrogenase, G6PD, is an important enzyme that catalyzes the first reaction in the pentose-phosphate pathway, part of glycolysis ( Cappellini and Fiorelli, 2008 ). In the erythrocyte, G6PD is the sole source of enzyme activity that protects from the buildup of super-radicals and to withstand oxidative stress. G6PD deficiency is the most common enzymopathy in humans, affecting around 400 million people worldwide with a global prevalence of 4.9%, although there is substantial variation among populations ( Nkhoma et al., 2009 ). Although G6PD-deficient individuals are generally asymptomatic throughout their life, factors that cause oxidative stress, such as particular drugs (including several antimalarials), foods or infections, can cause hemolytic anemia and neonatal jaundice. For example, digestion of fava beans can cause severe hemolytic anemia in individuals with the Med G6PD deficiency, a condition traditionally known as favism. Low mortality and low morbidity are associated with G6PD deficiency and it does not seem to generally affect life expectancy.
The G6PD gene has 13 exons, is 18 kb in length, is on the X chromosome close to the genes for hemophilia and color blindness and codes for a molecule that is 515 amino acids long. The active enzyme is composed of two or four identical 515 amino-acid subunits. The G6PD gene is one of the most polymorphic loci in the human genome and about 140 different molecular variants have been identified ( Cappellini and Fiorelli, 2008 ). Most of these have non-synonymous single-nucleotide replacements, a number of which are polymorphic. These variants range from ones that cause severe deficiency and chronic anemia to some that actually increase G6PD activity. G6PD deficiency can be caused by a reduction in the number of enzyme molecules or a structural change causing enzyme instability. Because G6PD is on the X chromosome, males with a single deficient G6PD allele are affected. For females to show deficiency, they generally need to be homozygous for the deficient variant, although some heterozygous females can exhibit G6PD deficiency and even develop hemolytic anemia.
The ancestral normal activity G6PD allele B is present worldwide, but the variants causing G6PD deficiency are generally more localized to specific geographic areas. Similar to the sickle-cell allele, the worldwide historic distribution of G6PD-deficient mutations reflects the historic distribution of malaria endemicity. For example, much of Africa has three main polymorphic alleles, the ancestral B allele with a frequency of about 0.6–0.8, the A allele, which differs from B by one amino acid (resulting from a difference at one nucleotide) and has a frequency of about 0.15–0.4, and the A - allele, which differs from A by one amino acid (also resulting from a difference at one nucleotide) and has a frequency ranging from 0.0 to 0.25 ( Table 4 ).
The A allele has 85% of normal G6PD activity and has not been demonstrated to provide resistance to malaria in contemporary populations. On the other hand, the A - allele has only 12% of normal activity and is thought to provide resistance to malaria (the A - allele is always found on the A background, so this combination may be necessary for malaria protection). The Mediterranean or Med allele is the second most common allele found in countries surrounding the Mediterranean Sea, India and Indonesia. It differs from allele B by one amino acid (resulting from a one-nucleotide change) at another site from that of A , has only 3% normal activity and is in frequencies ranging from 0.02 to 0.2. The Mahidol allele also differs from B by one amino acid (also from a one-nucleotide change) at another site, has 5–32% the activity of the normal enzyme and reaches a frequency of 0.24 in some populations in Southeast Asia.
Overall, it appears in general that a reduction in G6PD enzyme activity, due to these variants and others, results in an increase in resistance to severe malaria. This is not unlike the general effect from number of variants at the α-globin and β-globin genes that result both in thalassemia from lowered gene expression and consequently an increase in resistance to severe malaria. The exact mechanism of increased resistance to malaria from G6PD deficiency is not known, but appears to be related to the inability of the malaria parasite to survive and reproduce in stressed G6PD-deficient cells.
To understand the effect of X-linkage on G6PD frequencies, let me assume that the frequency of a G6PD deficiency allele, say A - is q (and the frequency of the non-deficiency allele is 1− q ). Because males are hemizygous for genes on the X-chromosome, the frequency of males with G6PD deficiency is equal to the frequency of the A - allele, or q . Assuming Hardy–Weinberg proportions, the three female genotypes, BB , BA - and A - A -, have frequencies of (1− q ) 2 , 2 q (1− q ) and q 2 , respectively. All female A-A - individuals have G6PD deficiency, but at a rate that is only q as much as males. However, some percentage of females heterozygous for G6PD-deficient alleles also show deficient G6PD activity, making this reduction somewhat less. On the other hand, if DNA analysis is used to detect the deficiency allele in heterozygotes, regardless of whether they show G6PD deficiency, then the proportion of females with the deficiency allele, either as heterozygotes or homozygotes, is 2 q (1− q )+ q 2 =2 q − q 2 , a value that is nearly twice as high as the frequency in males.
The results of studies examining the risk of malaria for various G6PD-deficient genotypes are not consistent. For example, Ruwende et al. (1995) found in Gambia and Kenya that the reduction in risk (1— OR ) of severe malaria in male hemizygotes ( A -) was 58% and that the reduction in risk for heterozygous females ( BA -) was 46%. On the other hand, Guindo et al. (2007) found that in two populations in Mali, the reduction in risk of severe malaria in male hemizygotes ( A -) was also 58%, but in female heterozygotes ( BA -) there was no reduction in risk. Further, when Guindo et al. (2007) examined only those individuals who were wild-type AA at the β-globin locus, the effect in males was increased ( OR =0.28) whereas there was still no effect in females. Guindo et al. (2007) suggested that the difference in their results and that of Ruwende et al. (1995) might be based in the control groups used. In addition, Johnson et al. (2009) found no protective effect for either male hemizygotes ( A -) or female heterozygotes ( BA -) in a Uganda population, but did find a protective effect for females that were found to be G6PD deficient. This perplexing finding appears to be based on the incomplete correlation of genotype and phenotype for G6PD deficiency in female heterozygotes due to variable inactivation of the two X chromosomes ( Mason et al., 2007 ).
If we use the OR values from Ruwende et al. (1995) and the approach mentioned above to estimate selection coefficients in females ( s f ) and in males ( s m ), then they are 0.046 and 0.058, respectively. The three female genotypes, BB, BA and A-A- have fitnesses of 1– s f , 1 and 1, respectively, assuming that the heterozygote and the mutant homozygote have the same fitnesses (see Saunders et al., 2002 ). The two male genotypes, B and A -, have fitnesses 1 –s m and 1, respectively. Therefore, if we assume that the initial frequency of A - is 0.01, and using traditional equations for change in allele frequency for X-linked genes ( Hedrick, 2011a , p. 154), A - increases quickly and reaches a frequency near 1 by 200 generations (solid line in Figure 1 ). Interestingly, the examples considered in all three figures here (G6PD deficiency, HBC and α + thalassemia) include situations in which a malaria-resistant variant could go to fixation. If there is a symmetrical heterozygote advantage in females because the A - A - homozygotes have a lowered fitness due to G6PD deficiency, and no differential selection in males, then the frequency approaches its equilibrium value of 0.5 in about 300 generations (broken line in Figure 1 . When the A - A - females and the A - males have a selective disadvantage, but there still is a heterozygote advantage in females, then the frequency of A - slowly approaches its equilibrium value of 0.126 (dotted line in Figure 1 ), a value not too different from that observed in many populations.
The increase in frequency of allele A - at the G6PD locus when it begins at a frequency of 0.01 and fitness of females BB , BA - and A - A - and males B and A - are for females 1— s f , 1, and 1 and for males 1— s m and 1 (solid line), for females 1— s f , 1, and 1— s f and for males 1 and 1 (broken line), and for females 1— s f , 1, and 1—2 s f and for males 1 and 1 - s m (dotted line), where s f =0.046 and s m =0.058 (estimated from the data of Ruwende et al., 1995 ).
The age of G6PD mutants have been estimated using several different approaches, some of which assume selection (assuming s f = s m = s ) and others do not ( Table 2 ). Tishkoff et al. (2001) jointly estimated the age and selection for both A - and Med alleles using forward simulations and found that the estimated age of A - was 6357 years (254 generations) and the selection coefficient was 0.044, and that the estimated age of Med was 3330 years (132 generations) and the analogous selection coefficient was 0.034. From joint estimations using Bayesian methods, Slatkin (2008) estimated that the age of A - was only 1000 years (40 generations) and the selection coefficient was 0.25 and Louicharoen et al. (2009) estimated that the age of allele Mahidol was 1575 years (63 generations) and the selection coefficient was 0.23. The difference in these estimates may be because they were based on different data, with that of Tishkoff et al. (2001) based on microsatellite data and that of Slatkin (2008) based on sequence data. The selection coefficients and consequent times in Figure 1 are somewhat more consistent with that estimated by Tishkoff et al. (2001) than the estimates of Slatkin (2008) and Louicharoen et al. (2009) .
Carter and Mendis (2002) suggested that the original selective force for G6PD deficiency could have been either for resistance to P. falciparum or P. vivax . As evidence for the importance of P. vivax , they cite the high frequency of G6PD deficiency in a population in northern Holland, an area where P. vivax was prevalent for centuries and P. falciparum is thought to have never existed. Recently, evidence has suggested that G6PD deficiency caused by the Mahidol mutation in Southeast Asians provides parasite resistance to infection by P. vivax , but not to that by P. falciparum . Further, G6PD deficiency Med in Pakistan has been shown to provide protection from P. vivax malaria ( Leslie et al., 2010 ).
Red blood cell surface loci
Two red blood cell loci, Duffy and ABO, that appear to provide protection from malaria, are similar in one sense that they both have two common functional alleles and a defective, loss-of-function allele that consequently provides protection against malaria. On the other hand, the Duffy variant protects against P. vivax , whereas the O phenotype of the ABO system appears to provide protection against P. falciparum . The gene for band 3 ovalocytosis, the major transmembrane protein of red blood cells, also has a deletion, loss-of-function, mutant. This mutation provides protection from cerebral malaria, is lethal in homozygotes and appears to provide protection from both P. vivax and P. falciparum .
The Duffy blood group antigen was first observed in 1950 and has two common antigens A and B produced by alleles FY*A and FY*B (for a review of the Duffy gene and malaria, see Zimmerman, 2004 ). The Duffy gene is symbolized either by FY or DARC (Duffy antigen receptor for chemokines). The FY*B allele is in highest frequency in Europe whereas the FY*A allele is nearly fixed in some Asian populations. Surveys showed that many African-Americans and Africans did not possess either the A or B antigens and had another allele, a Duffy-negative allele that did not produce antigens on erythrocytes and is known generally as a Duffy ‘null’ allele ( FY*A and FY*B are also called Duffy positive). This allele, FY * B ES , now termed erythrocyte silent (ES), is near fixation in sub-Saharan Africa (it is not a true null allele because it is expressed on cell types besides erythrocytes). From analysis of the sequences of these alleles, it appears that FY*A is ancestral with amino acid Gly (TGG) at codon 42. Allele FY*B differs from FY*A with Asp (TAG) at position 42. The ES allele in Africa differs from FY*B with a T to C SNP mutation, 33 nucleotides upstream from the transcription starting point (−33). This mutation disrupts a binding site for transcription factor GATA-1 on the gene promoter and blocks expression of the gene in erythrocytes.
Several experiments showed that Duffy ES homozygotes ( FY * B ES /FY * B ES ) were resistant to infection by two malarial species, P. knowlesi and P. vivax , but were susceptible to infection by P. falciparum ( Miller et al., 1976 ). From these and other studies, it was shown that the Duffy antigen was the obligatory binding receptor used by P. knowlesi and P. vivax to enter red blood cells and that in homozygous ES individuals, no antigen was present on red blood cells and these malaria species could not infect them. In fact, it has been proposed ( Hamblin and Di Rienzo, 2000 ) that the near absence of P. vivax in Africa, which is thought to have originated in Asia, occurs because there were no suitable hosts in Africa even though the African environment appears otherwise suitable for P. vivax . That is, nearly all Africans were FY * B ES /FY * B ES , because of an earlier selective sweep due to some previous selective force when P. vivax moved west to Africa from Asia. On the other hand, selection for resistance to P. vivax in Africans could have increased the frequency of the FY * B ES allele and subsequently eliminated P. vivax from Africa ( Carter and Mendis, 2002 ).
There does not appear to be any cost (or disease) associated with the presence of FY * B ES homozygotes or heterozygotes. If there were a cost, then because there is no P. vivax in Africa and therefore no selection for the FY * B ES allele, it would be expected that the frequency of FY * B ES would decline. Similarly, the FY * B ES allele frequency in African-Americans would not be expected to decline, even in the absence of P. vivax malaria, unless there was a pleiotropic cost to maintaining the allele.
Analysis of the population genetic aspects of the Duffy alleles in Africa is consistent with strong past selection. For example, the F ST value for the FY * B ES allele is one of the highest observed for any alleles in humans ( Hamblin and Di Rienzo, 2000 ; Hamblin et al., 2002 ). Further, the level of variation around the FY * B ES allele in Africa is significantly lower than that observed in an Italian sample for the Duffy region. Interestingly, the FY * B ES allele in Africa is found on two different haplotypes in three of the five African populations examined by Hamblin and DiRienzo (2000), suggesting either two mutational origins of the FY * B ES in Africa or recombination/gene conversion between these haplotypes.
On the basis of phylogenetic relationships of the FY sequences, Hamblin and Di Rienzo (2000) estimated the age of the FY * B ES allele around 33 000 years ago. As P. vivax may have been a significant cause of morbidity before human populations increased in density because of the development of agriculture around 5000 to 10 000 years ago, they suggested that this earlier origin may be consistent with the suggested earlier impact of P. vivax . Using a microsatellite locus linked to FY , Seixas et al. (2002) estimated that the origin of the FY * B ES allele was more recent, either 7750 or 12 250 years ago, depending upon the haplotype they used (this assumes a generation length of 25 years).
Surprisingly, a Duffy ES allele found in low frequency (0.022) in Papua New Guinea, differed from FY*A in exactly the same way at the −33 nucleotide in the promoter region as does the FY * B ES allele found in Africa ( Zimmerman et al., 1999 ) and is now called FY * A ES . Although this allele appeared to be on a FY*A background from analysis of restriction fragment patterns, more detailed sequence analysis could definitively exclude African ancestry for this allele. Zimmerman et al. (1999) also documented that there is half as much expression in these heterozygotes FY*A / FY * A ES as that in normal homozygotes FY*A / FY * A . Further, these heterozygotes were protected from P. vivax infection more than normal homozygotes whereas they were not protected from P. falciparum infection ( Kasehagen et al., 2007 ). The low frequency of FY * A ES appears to be the result of its recent origin, which is reflected in its association with specific alleles at two microsatellite loci about 3 cM from FY ( Zimmerman et al., 1999 ). Recently, Ménard et al. (2010) also showed that there is about half as much expression in a sample from Madagascar for FY*A / FY * B ES heterozygotes as that for for normal homozygotes ( FY*A / FY * A ) or heterozygotes ( FY*A / FY * B ). In other words, it does not appear that either the expression or the protection from ES alleles is recessive, but is intermediate between ES homozygotes and genotypes with two normal alleles.
Recently, there have been reports of P. vivax in FY * B ES individuals from Brazil, Kenya and Madagascar. Perhaps the most definitive study is that of Ménard et al. (2010) in Madagascar, a nation where endemic P. vivax is prevalent and a diverse ethnic population with both Duffy-negative ( FY * B ES /FY * B ES ) and Duffy-positive (having at least one allele that is FY * A or FY * B ) individuals. In the sample examined of 476 asymptomatic Duffy-negative individuals from eight locations, P. vivax was found in 8.8%. However, they still found a substantial reduction in clinical P. vivax malaria among Duffy-negative individuals compared with Duffy-positive individuals. It appears that this population with both Duffy-negative and -positive individuals provided an environment for strains of P. vivax to evolve the ability to invade Duffy-negative individuals, giving an example of evolution for utilization of an alternate receptor besides the Duffy antigen for erythrocyte invasion.
SLC4A1 or band 3 ovalocytosis
Gene SLC4A1 (also known as erythrocyte Band 3 protein gene) codes for the major transmembrane protein of red blood cells and is an anion exchanger in the erythrocyte membrane. One mutation in this gene, found in Papua New Guinea and Malaysia, is due to a 27-nucleotide (9-amino-acid) deletion in a region that is highly conserved throughout species ( Jarolim et al., 1991 ). This deletion results in an abnormal protein structure and function and causes ovalocytosis, an abnormal shape of red blood cells (it is known as Southeast Asian ovalocytosis, or SAO), and mild hemolytic anemia in the heterozygote. The homozygote appears to be lethal because it has never been observed even in progeny from heterozygous matings ( Genton et al., 1995 ).
In populations from lowland Melanesia with high malaria rates, the frequency of this deletion can be as high as 15–20%, but it is not present in the malaria-free highlands. Neither S nor E hemoglobin variant is present in these populations and only some thalassemias and G6PD deficiency are known to be present to reduce the effects of malaria. Because most individuals in many of these areas are infected with malaria, and often multiple malaria species ( Mueller et al., 2009 ), other unidentified resistance variants appear to be present. There is substantial evidence that deletion heterozygotes are highly protected against malaria. For example, the frequency of the deletion was 0.146 in a sample of healthy Papua New Guinea children whereas it was not present in any of the children with cerebral malaria ( Genton et al., 1995 see also; Allen et al., 1999 ). Carriers also appear to be at lower risk of infection by both P. vivax and P. falciparum ( Cattani et al., 1987 ; however, see Kimura et al., 2002 ).
The lethality of homozygotes and the high resistance of heterozygotes to malaria suggest that this deletion variant is somewhat similar to the sickle-cell allele in that the benefit to heterozygotes is high enough to counter the large fitness cost in homozygotes. However, it appears to be a relatively recent mutation because of its limited distribution and a detailed examination of variation at closely linked sites may provide insight as to its age.
For many decades, the ABO blood group system has been suggested to be associated with diseases and infections, including malaria. The gene consists of seven exons, is more than 18 kb in length and genomic analysis has found over 70 alleles at this locus, suggesting that it is one of the polymorphic genes in humans ( Calafell et al., 2008 ). The ABO glycosyltransferase performs the final step in the production of the ABO molecule by adding sugars to the precursor H antigen. This enzyme either adds the sugars N-acetylgalactosamine or galactose to form the A and B antigens or does not add a sugar, resulting in a functionless H (O) antigen. The three main antigenic classes (A, B and O) are all comprised of numerous alleles determined by both coding and non-coding sequences ( Calafell et al., 2008 ). The silent O alleles share a one-nucleotide deletion in codon 87 (nucleotide 261) of exon 6 that results in a frameshift mutation and premature termination of the polypeptide ( Yamamoto et al., 1990 ). The O alleles are the most common of the three allelic classes (about 0.6 worldwide) and have frequencies between 0.3 and 0.7 in most populations, the A alleles generally have frequencies between 0.2 and 0.3 and the B alleles have frequencies between 0.1 and 0.2. Native American or Australian aboriginal populations have O allele frequencies of nearly 1.0 ( A and B do not appear to have not been present historically in these populations).
Cserti and Dzik (2007 ) suggested that there is substantial evidence supporting the importance of allele O providing protection against malaria. Their conclusion is based primarily on the consistency between the worldwide distribution of ABO variants and historic presence of malaria, with the O allele more common in areas with historic malaria and the association of O genotypes with higher resistance to malaria and A and B genotypes with lower resistance in a number of studies ( Uneke, 2007 ). In a large recent study, Fry et al. (2008) showed in three African populations a strong association of O individuals with resistance to severe malaria and found that this effect was recessive, that is, AO and BO individuals were as likely to be susceptible as AA and BB individuals and that AB genotypes were the most susceptible. In addition, it appears that the O allele can protect against severe malaria by a mechanism of reduced rosetting (spontaneous binding of infected erythrocytes to uninfected erythrocytes) ( Rowe et al., 2007 ).
Fry et al. (2008) calculated F ST values for the ABO genomic region and found that, even with its polymorphism within populations, it was an outlier for low F ST over populations. They suggested that this similar variation over different populations is the result of similar long-standing balancing selection over populations, perhaps caused by infectious pathogens, including P. falciparum . However, Calafell et al. (2008) found a more complicated pattern of differentiation over the ABO genomic region. Although the major O alleles share a one-nucleotide deletion, they differ in a number of nucleotide substitutions in both exons and introns. Unlike the other recent malaria-resistant alleles discussed here, the O human alleles, although different from the O chimpanzee allele ( Kermarrec et al., 1999 ), are much older. Calafell et al. (2008) estimates that the most common O alleles, O01 and O02 , are the result of separate mutations and are ∼ 1.15 and 2.5 million years old, respectively. In other words, assuming that the O allele provides protection from malaria, this protection may be present because of selection originally favoring O alleles for some other reason.
Immune genes
The genes I have discussed above, such as the hemoglobin mutants and G6PD deficiency, provide innate resistance to malaria. In addition, although human immunity to malaria is complex ( Langhorne et al., 2008 ), variants in the HLA complex have been shown to provide adaptive immunity to malaria. In general, innate resistance is more important in early childhood survival from malaria whereas adaptive immune response is more important in older children and adults. It appears that malaria protection from particular HLA variants is local and not universal ( Hill et al., 1994 ; Ghosh, 2008 ). Various factors, such as variation over different populations in malaria strains (and species), in red blood cell polymorphisms, and in HLA polymorphism, may contribute to this heterogeneous response to malaria ( Ghosh, 2008 ).
HLA-Bw53 and DQw5
HLA genes are in the human MHC (major histocompatibility complex) and are the most variable genes in the human genome. They have been shown to be, and have been, under balancing selection, using a number of different approaches ( Garrigan and Hedrick, 2003 ). It is widely accepted that the mechanism of balancing selection at HLA, and for MHC genes in other vertebrates, which retains genetic variants at these genes over species (trans-species polymorphism), is related to the protection that they provide against infectious disease ( Hedrick and Kim, 2000 ). HLA (and MHC) genes can be divided into class I and class II genes in which class I genes recognize intracellular antigens, as from viruses or intracellular stages of parasites, and class II genes recognize extracellular antigens, as from bacteria and parasites. One of the first conclusive demonstrations that HLA variation provided resistance to contemporary infectious disease was the large study showing protection from both specific classes I and II HLA types to malaria in Gambia ( Hill et al., 1991 ).
As there are so many variant forms at many HLA genes (class I genes HLA-A and HLA-B have over 250 and 500 alleles, respectively; Garrigan and Hedrick, 2003 ), particular associations may have occurred by chance and after correction for multiple comparisons, statistical significance for particular associations may be lost. As a result, Hill et al. (1991) used a two-stage strategy in which they first examined all 45 possible class I HLA-malaria associations in approximately half their samples serologically, and then in stage two in an independent sample examined explicitly the strongest association found in the first group (Bw53), using PCR. As shown in Table 5 , the proportions of young children heterozygous for Bw53 were nearly identical in the two independent samples for both the severe malaria and the mild control categories (the mild control group consists of children attending the same hospitals or clinics as those with severe malaria matched for age and area of residence). The calculated OR values were nearly identical in the two samples and indicated that Bw53 confers substantial protection from severe malaria in this population. In addition, a class II haplotype (DQw5), consisting of both DQ and DR genes, also showed a strong protective effect in this population ( Table 5 ).
The frequency of individuals with Bw53 (either homozygotes or heterozygotes) is highest in sub-Saharan Africans, reaching 40% in Nigeria and 25% in Gambia, whereas it is very rare (<0–1%) in Asians and Caucasians. Therefore, not only does there appear to be a strong contemporary protective effect, but also its geographical distribution is not unlike that for the sickle-cell allele and indicates past selection favoring Bw53 in malarial environments. Similarly, haplotype DQw5 showed a novel arrangement of genes and it appears that this haplotype is not found, or is very rare, in Caucasians. Because of the high variation in the HLA region, it is not possible to age Bw53 in the way discussed above, but we can estimate the amount of selection and determine the number of generations it would take the genotype to increase in frequency to the present level.
For example, for the Bw53 genotype if OR =0.59 (the average of the values in Table 5 ) and m =0.1 ( WHO, 1998 ), then s =0.041 from equation (3). Using this amount of selection (and standardizing such that the relative fitness of genotype with the highest fitness is 1) and assuming that the Bw53 allele is dominant, the relative fitnesses of genotypes Bw53/Bw53, Bw53/B(not w53) and B(not w53)/B(not w53) are 1, 1 and 0.959, respectively, where B(not w53) indicates all other alleles at gene HLA-B . If we assume that the Bw53 was at an initial frequency of 0.005, as it is now in parts of Europe ( Hill, 1991 ), then using a traditional population genetics selection equation ( Hedrick, 2011a , p. 118), it would take ∼ 86 generations or 2150 years for the frequency of individuals with Bw53 to reach 0.25 ( Hill, 1991 ). This time is substantially less than the time that was expected for strong selection to have occurred for malaria resistance in Africa.
It appears that Bw53 was generated by a gene conversion event in which only seven adjacent amino acids of the most common allele in Gambia, Bw35, were converted to another common motif ( Allsopp et al., 1991 ). Because of the presence of potential donor sequences and the recipient sequence (Bw35) in Gambia, it appears that Bw53 arose locally. As the Bw35 allele does not appear to confer protection from malaria, it appears that this small converted region in its new background provides protection. Further examination of DQw5 by Davenport et al. (1995) showed that the higher resistance of this haplotype appeared to be the result of a one-amino-acid difference at the DRB1 gene and proposed that the resistance from the DQw5 haplotype occurs because this haplotype recognizes an epitope from the malaria parasite.
Multiple resistance variants in the same population
As we find out more information about genes conferring resistance to malaria, we can apply population genetics approaches to predict the eventual outcome of selection when there are two or more variants undergoing selection in a given population. Potentially, different resistance variants may be different alleles at a given locus in which selection may function along with the normal ancestral allele to increase, decrease or stabilize genetic change. Different resistance variants at different loci may be statistically associated (in linkage disequilibrium), resulting in change that is not independent over loci. Moreover, resistance variants at different loci may have fitness values that are not independent over loci (epistasis) and influence genetic change.
There are a number of known genes that could contribute independently (without epistasis) to malaria resistance in areas with high rate of infection; for example, G6PD deficiency and S in sub-Saharan Africa, and Duffy, α-thalassemia and ovalocytosis in Papua New Guinea. All of these genes are unlinked and are present on different chromosomes; therefore, no linkage disequilibrium between the resistance variants at different loci is expected, although there may be strong epistasis between them. However, as more resistance genes become known, some may be linked, so that both linkage disequilibrium and epistasis may become important factors influencing genetic change and overall resistance.
Here, I give two examples, one in which the genetic variants are different alleles at the same gene ( S and C ) and the other in which the genetic variants are different alleles at different genes, but there is epistasis between them ( S and α-thalassemia). Remember that the α-globin and β-globin molecules each contribute two subunits to adult hemoglobin molecules, suggesting that this essential connection could provide potential for gene–gene interaction, or epistasis, as far as fitness. The geographical distributions of α and β thalassemia overlap and in individuals with both, the more severe forms of β + thalassemia may be changed to a milder condition. Unbalanced production of α and β chains is a major cause of thalassemia, but if both are low, then the severity of anemia can be less. As a result, α-thalassemia can modify the phenotype of β-thalassemia such that it is not as extreme, a potential example of positive epistasis ( Penman et al., 2009 ). In addition, the sickle-cell anemia can be ameliorated by an increase in the production of fetal hemoglobin ( Flint et al., 1998 ), suggesting positive interaction between these genes.
Before I discuss the specific examples, let me provide a general perspective of resistance when several genes are involved. When several different resistance genes are present in the same population, assuming that the resistance variants function independently, the overall expected mean proportion of the population that is resistant to malaria R can be expressed as
where f ij and R ij are the frequency and resistance, respectively, of genotype j at the i th locus (notice that different loci may have different levels of dominance in this general equation). For example, let me assume that there are two resistant genotypes at locus 1 with frequencies f 11 =0.1 and f 12 =0.5 and resistances R 11 =0.5 and R 12 =0.3, and one resistant genotype at locus 2 with f 21 =0.3 and R 21 =0.5. Therefore, the mean proportion of resistance in the population is
or 32% of the population would be expected to be resistant to malaria.
The polymorphic structural variants at the β-globin locus, S , C and E , confer resistance to malaria. Variant C is more localized in sub-Saharan Africa than S , but does co-occur with S in many populations. For example, Livingstone (1967) complied survey data on 72 populations from West Africa (about 33 000 individuals) and found a negative correlation between the frequencies of S and C when both genes were present. In general, there were more populations with frequencies of C between 0.075 and 0.15 and S <0.05 and with frequencies of S between 0.075 and 0.15 and C <0.05 than expected. Although some early estimates of fitness suggested that there might be a stable equilibrium for the three alleles A , S and C , a very large data set from Burkina Faso from Modiano et al. (2001) suggested that the C allele would eventually become fixed in regions of malaria (see also Hedrick, 2004 ; Modiano et al., 2008 ). Below I will discuss this example and estimate relative fitness levels of the six genotypes when there is simultaneous segregation for S and C in the same population and then use these values to predict change in allele frequencies in this population.
Assume that genotype i provides protection from malaria because the frequency of genotype i in the diseased group is less than that in the control group, or that f id < f ic , as I did above. Further, if s i is the selection coefficient for genotype i , it can be estimated as in equation (3). Because selection for multiple genotypes at the locus is considered here (and relative fitness needs to scaled symmetrically around 1), w i , the relative fitness of genotype i , is
For example, if f id =0.2 and f ic =0.4 and m =0.5, then s i =0.3125 and w i =1.4545.
Now assume that f id > f ic , that is, genotype i provides relative susceptibility to malaria. In this case, the selection coefficient can be defined as
Because selection gives a disadvantage to the genotype, the relative fitness of genotype i is
For example, if f id =0.8 and f ic =0.6 so that OR =2.667. Further, if m =0.5, then s i =0.3125 and w i = 0.6875. Note that 1/0.6875=1.4545, or that the fitness levels in these examples are symmetrical around 1.
In the study by Modiano et al. (2001) , there were statistically significant effects for the hemoglobin genotypes AA , AC , AS and CC ( AC , AS and CC showed relative resistance and AA showed relative susceptibility). Table 6 lists the frequencies of these four genotypes in the control (healthy subjects) and disease (malaria patients) groups and the resulting OR values. The sample sizes for genotypes SC and SS were very small and the fitness levels used here were based on deviations from Hardy–Weinberg proportions ( Hedrick, 2004 ). Assuming m =0.1, Table 6 lists the relative fitness for these genotypes and also the fitness levels relative to the genotype with highest fitness, CC . Using these fitness levels, there are no stable or unstable three-allele equilibria, unlike earlier data ( Hedrick, 2004 ).
With this fitness level, if S is introduced by mutation or gene flow into a monomorphic A population, it will quickly go to a stable two-allele equilibrium with the equilibrium frequency of S equal to 0.12. If C is introduced by mutation or gene flow into a monomorphic A population, it will increase in frequency to fixation in slightly over 100 generations ( Figure 2 )). When the S and A alleles are at their stable equilibrium, C can enter the population and eventually increase to fixation after about 200 generations. When both the C allele and the S allele are introduced simultaneously at low frequencies (0.01 here), the S allele will be eliminated (in about 100 generations in the example in Figure 2 ).
The increase in frequency of allele C when it begins at a frequency of 0.01 (short, broken line), when S also begins at a frequency of 0.01 (solid lines), and when S begins at its equilibrium frequency of 0.0895 (long, broken line) (after Hedrick, 2004 ). The change in frequency of S is also given for the last two situations.
These outcomes occur primarily because genotype CC has the highest estimated relative fitness of any genotype. It may also explain particular situations as, for example, why the Dogon people of Mali have a much lower frequency of S than most other West Africans and have a high frequency of C ( Agarwal et al., 2000 ). Of public health significance, the mean fitness of the population will be 11.2% higher when the population is fixed for C than when it is polymorphic for A and S because of higher average resistance to malaria and the absence of sickle-cell anemia.
Modiano et al. (2008) suggested that C is not as widespread as S because it is recessive in its advantage for malaria resistance. To estimate dominance for the favorable C allele, we can standardize the fitness levels so that genotypes AA , AC and CC have fitness levels of 1, 1+ hs and 1+ s , respectively, where h is the level of dominance for the favorable C allele ( Table 7 ). Given the estimated fitness levels and assuming that the fitness level of AA , AC and CC are w AA , w AC and w CC , respectively, then
Using the data in Table 7 , the estimates are of s =0.166 and of h =0.518. In other words, it appears that the advantageous effect of the C allele is very close to additivity ( h =0.5) and is not recessive in its advantageous effect. Interestingly, if the C allele is more recessive, that is, the fitness of AC is lower, then the C allele cannot invade a population in which the S allele is at equilibrium.
S and α-thalassemia
Two hemoglobinopathies, sickle cell and α + -thalassemia, occur in high frequency in sub-Saharan Africa. Williams et al. (2005a) examined these disorders in Kenya in a population that was segregating for both variants. In this population, they found that the protection from malaria given by each variant was lost when the two disorders were inherited together in the same individual (see also May et al., 2007 ). They suggested that this negative epistasis could be the reason why α + -thalassemia has not been fixed in any population in sub-Saharan Africa. This situation has been examined theoretically by Williams et al. (2005a) and Penman et al. (2009) , using an epidemiological approach. Here, I will use a traditional population genetics approach and show that the results are generally consistent with their findings, using an epidemiological model. An important caveat is that although most general properties discussed here are enlightening and significant, some conclusions can depend on rather small differences in estimated values ( Williams et al., 2005a ; Penman et al., 2009 ; Hedrick, 2011b ).
As background for discussion of the fitness array for these two loci, let us examine what the expectation is when there are independent effects on viability for the two loci (no epistasis on a multiplicative scale Hedrick, 2011a , p.554). Table 8 lists on the left margin, the relative fitness values for the β-globin genotypes AA , AS and SS of 1– s , 1 and 0, respectively, and on the top, the relative fitness values for α-globin genotypes αα/αα, −α/αα and −α/−α of 1, 1+ ht and 1+ t , respectively. The product of these marginal values yields the expected fitness levels in the table.
The expected relative viabilities for two-locus genotypes without epistasis can be calculated in the following manner. First, the difference in the relative values for genotypes AS αα/αα and AA αα/αα yields an estimate of s . The fitness of genotype AA αα/αα divided by the fitness of genotype AA −α/−α yields an estimate of 1+ t . Therefore, for the example in Table 9b that I will discuss below, s =0.448 and t =0.275, resulting in a value of 1.275 for the expected fitness without epistasis of genotype AS −α/−α. The estimated value for this genotype of 0.554 is substantially lower than this expectation and illustrates negative epistasis.
To calculate the relative viabilities of the different two-locus genotypes (the approach used here was suggested by B. Penman), genotype-specific annual mortality rates were used based on the relative rates of hospital admissions in children with severe malaria ( Table 9a ; from legend of Figure 2 in Williams et al. (2005a) ). To each of these mortality rates, an annual non-genotype specific mortality rate was added (in the example here, a rate of 0.034 was added) to yield a total annual mortality rate for each genotype. The reciprocal of these annual mortality rates provides an estimate of the relative life expectancies. When these values are standardized by the largest relative life expectancy (for these data the genotype AS αα/αα has the maximum value of 29.41), the relative viability values in Table 9b are obtained.
When there is a wild-type genotype at both loci, the estimated relative survival is only 0.552. Notice that both the presence of S in AS and the presence of −α in both genotypes −α/αα and −α/−α increases the estimated relative survival, given normal genotypes at the other locus. However, when there is an AS genotype and homozygosity for −α, the relative survival is reduced to only 0.554, essentially the same relative survival for the wild-type genotype.
There are several things to note about this array of relative viability values. First, when only the normal αα haplotype at the α-globin gene is present (first column), there is a stable polymorphism with S having a frequency of 0.309. Second, when only the normal A allele is present at the β-globin locus (first row), the −α/−α genotype has the highest fitness, predicting that this variant haplotype would be fixed (unless there is a negative effect of anemia). Overall then, the normal AA αα/αα genotype has a low relative viability in this malarial environment.
We can intuit some two-locus behavior by examining these viability values further. For example, if the population is polymorphic for A and S , it appears that −α will be able to increase from a low frequency in the population (enter) because the relative viability of AA −α/αα is greater than that of AA αα/αα (the most frequent genotype at this point; first row of Table 9b ) and AS −α/αα is only slightly less viable than AS αα/αα (second row). On the other hand, if the population is fixed for −α−α, it does not appear that S can enter the population because AA −α/−α > AS −α/−α (last column of Table 9b ).
However, the genetic behavior of this two-locus fitness array has some further interesting properties that are observed if the two-locus gamete frequency equations with selection ( Hedrick, 2011a , p. 559) are iterated. First, if the population is polymorphic at the equilibrium for A and S and −α haplotype enters the population, the system will increase to a two-locus stable equilibrium with −α having a frequency of 0.272 and S having a frequency of 0.256, with no linkage disequilibrium. Second, if the −α is polymorphic, there is an unstable equilibrium such that −α will, depending upon its initial frequency, either continue to increase or go to the two-locus equilibrium discussed above, if S is introduced into the population. Figure 3 presents this situation in which the initial frequency of S is 0.01 and the initial frequency of −α is either 0.55 or 0.65. For 0.55, the frequency of −α initially increases and then approaches the two-locus equilibrium at 0.272. On the other hand, when the initial frequency is 0.65, it continues to increase toward unity whereas S initially increases but then decreases to 0. In other words, the presence of S maintains the frequency of −α lower when the fitness of AS −α/−α is low (high negative epistasis). If fitness of AS −α/−α is somewhat higher (less negative epistasis), then −α can be fixed for all initial frequencies.
The change in the frequency of S where the initial frequency is 0.01 and the change in the frequency of −α where the initial frequency is either 0.55 or 0.65 using the relative fitness levels in Table 9b .
Conclusions
Genetic resistance to malaria in humans provides a rich source of data and examples of many aspects of population genetics, particularly cases of strong selection. For example, maintenance of the sickle-cell hemoglobin variant is the classic example of heterozygote advantage, G6PD deficiency is an example of very strong selection at an X-linked locus, variants at the β-globin gene ( S , C and E ) and causing G6PD deficiency ( A -, Med and Mahidol ) illustrate selective advantage from a single-nucleotide change, HLA-B53 provides an example of a selective advantage of a gene conversion product and α-thalassemia provides an example of selection on duplicated loci. In addition, understanding the geographic patterns of these variants includes consideration of selection as well as gene flow, mutation, genetic drift and the interaction of these factors. For example, to illustrate the importance of gene flow, most Mediterranean S alleles appear to descend from the African Benin haplotype ( Flint et al., 1998 ) and the Southeast Asian deletion of both α- globin genes, −− SEA , is widespread in a number of Asian populations ( Chui and Waye 1998 ; Bernini 2001 ). Estimating the age during which these variants have been strongly selected also includes incorporating information about recombination in the genomic region surrounding the selected variants.
An aspect of population genetics that will probably become even more important is the joint impact of multiple malaria resistance variants. Here, I discuss future changes in populations segregating for the two β-globin variants S and C and predict that, given the continued presence of malaria, selection will result in the elimination of S and fixation of C . In populations segregating for α-thalassemia and S , variants at two different loci, because of negative epistasis, either the α-thalassemia haplotype is predicted to be fixed and the S allele eliminated or a stable equilibrium of both variants is predicted depending upon the starting frequencies. Although these results are initially somewhat counterintuitive, one needs to realize that they are dependent upon specific estimated selection coefficients with large confidence intervals and which may vary over time (in the past or in the future) and may vary in different populations.
Obviously, Haldane (1949b) was correct to point out the potential importance of genetic variants in humans for resistance to malaria. Genes conferring resistance to malaria provide some of the best-known case studies of strong positive selection in modern humans and the sickle-cell hemoglobin variant remains the classic example of heterozygote advantage. Overall, the original ‘malaria hypothesis’ of Haldane that diseases such as thalassemia are polymorphisms with an advantage to heterozygotes in malarial environments has been proven correct; however, much is still to be learned about actual mechanisms of protection, other genes that confer resistance and the population genetics of this variation.
Ackerman H, Usen S, Jallow M, Sisay-Joof F, Pinder M, Kwiatkowski DP (2005). A comparison of case-control and family-based association methods: the example of sickle-cell and malaria. Ann Hum Genet 69 : 559–565.
Google Scholar
Agarwal A, Guindo A, Cissoko Y, Taylor JG, Coulibaly D et al . (2000). Hemoglobin C associated with protection from severe malaria in the Dogon of Mali, a West African population with a low prevalence of hemoglobin S. Blood 96 : 2358–2363.
Agresti A (2007). An Introduction to Categorical Data Analysis , 2nd ed. John Wiley & Sons: New York.
Allen SJ, O'Donnell A, Alexander ND, Alpers MP, Peto TE et al . (1997). Alpha+thalassemia protects children against disease caused by other infections as well as malaria. Proc Nat Acad Sci USA 94 : 14736–14741.
Allen SJ, O'Donnell A, Alexander NDE, Mgone CS, Peto TEA et al . (1999). Prevention of cerebral malaria in children in Papua New Guinea by Southeast Asian ovalocytosis band 3. Am J Trop Med Hyg 60 : 1056–1060.
Allison AC (1954). Protection afforded by the sickle-cell trait against subtertian malaria infection. Br Med J 1 : 290–294.
Allison AC (1955). Aspects of polymorphism in man. Cold Spring Harbor Symp Quant Biol 20 : 239–251.
Allison AC (1956). The sickle-cell and haemoglobin C genes in some African populations. Ann Hum Genet 21 : 67–89.
Allison AC (1964). Polymorphism and natural selection in human populations. Aspects of polymorphism in man. Cold Spring Harbor Symp Quant Biol 29 : 137–149.
Allison AC (2004). Two lessons from the interface of genetics and medicine. Genetics 166 : 1591–1599.
Allsopp CEM, Hill AVS, Kwiatkowski D, Hughes A, Bunce M et al . (1991). Sequence analysis of HLA-Bw53, a common west African allele, suggests and origin by gene conversion of HLA-B53. Hum Immunol 30 : 105–109.
Ayodo G, Price AL, Keinon A, Ajwang A, Otieno MF et al . (2007). Combining evidence of natural selection with association analysis increases power to detect malaria-resistance variants. Am J Hum Genet 81 : 234–242.
Beet EA (1946). Sickle cell disease in the Balovale district in Northern Rhodesia. East Afr Med J 23 : 75–86.
Bernini LK (2001). Geographic distribution of α thalassemia. In: Steinberg MH, Forget BG, Higgs DR, Nagel RL (eds). Disorders of Hemoglobin: Genetics, Pathophysiology and Clinical Management . Cambridge University Press: Cambridge. pp 878–894.
Bonham VL, Dover GJ, Brody LC (2010). Screening student athletes for sickle cell trait-a social and clinical experiment. New Eng J Med 363 : 997–999.
Calafell F, Roubinet F, Ramirez-Soriano A, Saitou N Bertranpetit J, Blancher A (2008). Evolutionary dynamics of the human ABO gene. Hum Genet 124 : 123–135.
Cappellini MD, Fiorelli G (2008). Glucose-6-phosphate dehydrogenase deficiency. Lancet 371 : 64–74.
Carter R, Mendis KN (2002). Evolutionary and historical aspects of the burden of malaria. Clin Microbiol Rev 15 : 564–594.
Cattani JA, Gibon FD, Alpers MP, Crane GG (1987). Hereditary ovalocytosis and reduced susceptibility to malaria in Papua New Guinea. Trans Roy Soc Trop Med Hyg 81 : 705–709.
Chotivanich K, Udomsangpetch R, Pattanapanyasat K, Chierakul W, Simpson J et al . (2002). Hemoglobin E: a balanced polymorphism protective against high parasitemias and thus severe P . falciparum malaria Blood 100 : 1172–1176.
Chui DHK, Waye JS (1998). Hydrops felatis caused by α-thalassemia: an emerging health care problem. Blood 91 : 2213–2222.
Clark TG, Diakite M, Auburn S, Campino S, Fry AE et al . (2009). Tumor necrosis factor and lymphotoxin-α polymorphisms and severe malaria in African populations. J Infect Dis 199 : 569–575.
Coluzzi M, Sabatini A, Torre A, Di Deco MA, Petrarca V. (2002). A polytene chromosome analysis of the Anopheles gambiae species complex. Science 298 : 1415–1418.
Connes P, Reid H, Hardy-Dessources M-D, Morrison E, Hue O (2008). Physiological responses of sickle cell trait carriers during exercise. Sports Med 38 : 931–946.
Cox-Singh J, Davis TME, Lee K-S, Shamsul SSG, Matusop A (2008). Plasmodium knowlesi malaria in human is widely distributed and potentially life threatening. Clin Inf Dis 46 : 165–171.
Crow JF (2004). J B. S. Haldane's ideas in biology with special reference to disease and evolution. In: Dronamraju KR (ed). Infectious Disease and Host-Pathogen Evolution . Cambridge University Press: Cambridge. pp 11–17.
Cserti CM, Dzik WH (2007). The ABO blood group system and Plasmodium falciparum malaria. Blood 110 : 2250–2258.
Currat M, Trabucher G, Rees D, Perrin P, Harding RM et al . (2002). Molecular analysis of the β-globin gene cluster in the Niokholo Mandenka population reveals a recent origin of the β S Senegal mutation. Am J Hum Genet 70 : 207–223.
Curtin PD (1989). Death by Immigration: Europe's Encounter with the Tropics in the Nineteenth Century . Cambridge Univ Press: New York.
Davenport MP, Quinn CL, Chicz RM, Green BN, Willis AC et al . (1995). Naturally processed peptides from two disease-resistance associated HLA-DR13 alleles show related sequence motifs and the effects of the dimorphism at position 86 of the HLA DRβ chain. Proc Nat Acad Sci USA 92 : 6567–6571.
Dolo A, Modiano D, Maiga B, Daou M, Dolo G et al . (2005). Difference in susceptibility to malaria between two sympatric ethnic groups in Mali. Am J Trop Med Hyg 72 : 243–248.
Dronamraju KR (ed) (2004). Infectious Disease and Host-Pathogen Evolution . Cambridge University Press: Cambridge.
Eid NA, Hussein AA, Elzein AM, Mohamed HS, Rockett K et al . (2010). Candidate malaria susceptibility/protective SNPs in hospital and population-based studies: the effect of sub-structuring. Malaria J 9 : 119.
Ferwerda B, McCall MBB, Alonso S, Giamarellos-Bourboulis EJ, Mouktaroudi M et al . (2007). TLR4 polymorphisms, infectious diseases, and evolutionary pressure during migration of modern humans. Proc Nat Acad Sci USA 42 : 16645–16650.
Flint J, Harding RM, Boyce AJ, Clegg JB (1998). The population genetics of the haemoglobinopathies. Bailliere Clin Haematol 11 : 1–51.
Flint J, Hill AV, Bowden DK, Oppenheimer SJ, Sill PR et al . (1986). High frequencies of alpha-thalassemia are the result of natural selection by malaria. Nature 321 : 744–750.
Fry AE, Ghansa A, Small KS, Palma A, Auburn S et al . (2009). Positive selection of a CD36 nonsense variant in sub-Saharan Africa, but no association with severe malaria phenotypes. Hum Molec Genet 18 : 2683–2693.
Fry AE, Griffiths MJ, Auburn S, Diakite M, Forton JT et al . (2008). Common variation in the ABO glycosyltransferase is associated with susceptibility to severe Plasmodium falciparum malaria. Hum Molec Genet 17 : 567–576.
Fucharoen S (2001). Hemoglobin E disorders. In: Steinberg MH, Forget BG, Higgs DR, Nagel RL (eds). Disorders of Hemoglobin: Genetics, Pathophysiology and Clinical Management . Cambridge: Cambridge Univ. Press:Cambridge. pp. 1139–1154.
Fucharoen G, Fucharoen S, Sanchaisuriya K, Sae-ung N, Suyasunanond U et al . (2002). Frequency distribution and haplotypic heterogeneity of β E -globin gene among eight minority groups of northeast Thailand. Hum Hered 53 : 18–22.
Garrigan D, Hedrick PW (2003). Detecting adaptive molecular evolution, lessons from the MHC. Evolution 57 : 1707–1722.
Genton B, Al-Yaman F, Mgone CS, Alexander N, Paniu MM, Aplers MP (1995). Ovalocytosis and cerebral malaria. Nature 378 : 564–565.
Ghosh K (2008). Evolution and selection of human leukocyte antigen alleles by Plasmodium falciparum infection. Hum Immunol 69 : 856–860.
Giardine B, van Baal S, Kaimakis P, Riemer C, Miller W et al . (2007). HbVar database of human hemoglobin variants and thalassemia mutations: 2007 update. Hum Mut 28 : 206.
Gouagna LC, Bancone G, Yao F, Yameogo B, Dabiré KR et al . (2010). Genetic variation in human HBB is associated with Plasmodium falciparum transmission. Nat Genet 42 : 328–331.
Guerra CA, Howes RE, Patil AP, Gething PW, Van Boeckel TP et al . (2010). The international limits and population at risk of Plasmodium vivax transmission in 2009. PLoS Neglected Trop Dis 4 : e774.
Guindo A, Fairhurst RM, Doumbo OK, Wellems TE Diallo DA (2007). X-linked G6PD deficiency protects hemizygous males but not heterozygous females against severe malaria. PLoS Med 4 : e66.
Haldane JBS (1949a). Disease and evolution. Ric Sci Suppl 19 : 68–76.
Haldane JBS (1949b). The rate of mutation of human genes. Hereditas 35 (S1): 267–273.
Hamblin MT, Di Rienzo A (2000). Detection of the signature of natural selection in humans: evidence from the Duffy blood group locus. Am J Hum Genet 66 : 1669–1679.
Hamblin MT, Thompson EE, Di Rienzo A (2002). Complex signatures of natural selection at the Duffy blood group locus. Am J Hum Genet 70 : 369–383.
Hanchard N, Elzein A, Trafford C, Rockett K, Pinder M. et al . (2007). Classical sickle beta-globin haplotypes exhibit a high degree of long-range haplotype similarity in African and Afro-Caribbean populations. BMC Genet 8 : 52.
Hedrick PW (2004). Estimation of relative fitnesses from relative risk data and the predicted future of haemoglobin alleles S and C . J Evol Biol 17 : 221–224.
Hedrick PW (2011a). Genetics of Populations . 4th ed. Jones and Bartlett: Boston.
Hedrick PW (2011b). Selection and mutation for α thalassemia in non-malarial and malarial environments. Ann Hum Genet (in press).
Hedrick PW, Kim T (2000). Genetics of complex polymorphisms: parasites and maintenance of MHC variation. In: Singh R. Krimbas C (eds). Evolutionary Genetics: From Molecules to Morphology . Cambridge University Press: Cambridge. pp 204–234.
Hill AVS (1991). HLA associations with malaria in Africa: some implications for MHC evolution. In: Klein J, Klein D (eds). Molecular Evolution of the Major Histocompatibility Complex . Springer-Verlag, Berlin. pp 403–434.
Hill AVS, Allsopp CEM, Kwiatkowski D, Anstey NM, Twumasi P et al . (1991). Common west African HLA antigens are associated with protection from severe malaria. Nature 352 : 595–600.
Hill AVS, Bowden DK, O'Shaughnessy DF, Weatherall DJ, Clegg JB (1988). β thalassemia in Melanesia: association with malaria and characterization of a common variant (IVS1 nt 5 G-C). Blood 72 : 9–14.
Hill AVS, Yates SNR, Allsopp CEM, Gupta A, Gilbert SC et al . (1994). Human leukocyte antigens and natural selection by malaria. Phil Trans Roy Soc Lond B 346 : 379–385.
Ingram VM (2004). Sickle-cell anemia hemoglobin: the molecular biology of the first ‘molecular disease’- the crucial importance of serendipity. Genetics 167 : 1–7.
Jallow M, Teo YY, Small KS, Rockett KA, Deloukas P et al . (2009). Genome-wide and fine-resolution association analysis of malaria in West Africa. Nat Genet 41 : 657–665.
Jarolim P, Palek J, Amato D, Hassan K, Sapak P, Nurse GT et al . (1991). Deletion in erythrocyte band 3 gene in malaria–resistant Southeast Asian ovalocytosis. Proc Nat Acad Sci USA 88 : 11022–11026.
Johnson MK, Clark TD, Njama-Meya D, Rosenthal PJ, Parik (2009). Impact of the method of G6PD deficiency assessment on genetic association studies of malaria susceptibility. PLoS ONE 4 : 9 (e7246).
Joy DA, Feng X, Mu J, Furuya T, Chotivanich K et al . (2003). Early origin and recent expansion of Plasmodium falciparum . Science 300 : 318–321.
Karasov T, Messer PW, Petrov DA (2010). Evidence that adaptation in Drosophila is not limited by mutation at single sites. PLoS Genet 6 (6): e1000924.
Kasehagen LJ, Mueller I Kiniboro B, Bockarie MJ, Reeder JC et al . (2007). Reduced Plasmodium vivax erythrocyte infection in PNG Duffy-negative heterozygotes. PLoS ONE 2 (3): e336.
Kaslow RA, McNicholl JM, Hill AVS (2008). Genetic Susceptibility to Infectious Diseases . Oxford University Press: Oxford.
Kermarrec N, Roubinet F, Apoil PA, Blancher A (1999). Comparison of allele O sequences of the human and non-human primate ABO system. Immunogenetics 49 : 517–526.
Kimura M, Soemantri A, Ishida T (2002). Malaria species and Southeast Asian ovalocytosis defined by a 27-bp deletion in the erythrocyte band 3 gene. Southeast Asian J Trop Med Pub Health 33 : 4–6.
Kwiatkowski DP (2005). How malaria has affected the human genome and what human genetics can teach us about malaria. Am J Hum Genet 77 : 171–192.
Kwiatkowski DP, Luoni G (2008). Malaria. In: Kaslow RA, McNicholl JM, Hill AVS (eds). Genetic Susceptibility to infections Diseases . Oxford University Press. pp 372–386.
Lam K-WG, Jeffreys AJ (2006). Processes of copy-number change in human DNA: the dynamics of α-globin gene deletion. Proc Nat Acad Sci USA 103 : 8921–8927.
Langhorne J, Ndungu FM, Sponaas A-M, Marsh K (2008). Immunity to malaria: more questions than answers. Nat Immunol 9 : 723–732.
Lawniczak MKN, Emrich SJ, Holloway AK, Regier AP, Olson M et al . (2010). Widespread divergence between incipient Anophoeles gambiae species revealed by whole genome sequence. Science 330 : 512–514.
Leslie T, Briceño, Mayan I, Mohammed N, Klinkenberg E et al . (2010). The impact of phenotypic and genotypic G6PD deficiency on risk of Plasmodium vivax infection: A case-control study amongst Afghan refugees in Pakistan. PLoS Med 7 : e1000283.
Liu W, Li V, Rudicell RS, Robertson JD, Keele BF et al . (2010). Origin of the human malaria parasite Plasmodium falciparum in gorillas. Nature 467 : 420–425.
Livingstone FB (1958). Anthropological implications of sickle cell gene distribution in west Africa. Am Anthrop 60 : 533–562.
Livingstone FB (1967). Abnormal Hemoglobins in Human Populations . Aldine: Chicago.
Livingstone FB (1985). Frequencies of Haemoglobin Variants . Oxford University Press: Oxford.
López C, Saravia C, Gomez A, Hoebeke J, Patarroyo MA (2010). Mechanisms of genetically-based resistance to malaria. Gene 467 : 1–12.
Louicharoen C, Patin E, Paul R, Nuchprayoon I, Witoonpanich B et al . (2009). Positively selected G6PD-Mahidol mutation reduces Plasmodium vivax density in southeast Asian. Science 326 : 1546–1549.
MacFie TS, Nerrienet E, Bontrop RE, Mundy NI (2009). The action of falciparum malaria on the human and chimpanzee genomes compared: absence of evidence for a genomic signature of malaria at HBB and G6PD in three subspecies of chimpanzee. Infec Genet Evol 9 : 1248–1252.
Mackinnon MJ, Mwangi TW, Snow RW, Marsh K, Williams TN (2005). Heritability of malaria in Africa. PLoS Med 2 : 1253–1259. (e340).
Mason PJ, Bautista JM, Gilsanz F (2007). G6PD deficiency: the genotype-phenotype association. Blood Rev 21 : 267–283.
May J, Evans JA, Timmann C, Ehmen C, Busch W et al . (2007). Hemoglobin variants and disease manifestations in severe falciparum malaria. J Am Med Assoc 297 : 2220–2226.
Ménard D, Barnakas C, Bouchier C, Henry-Halldin C, Gray LR et al . (2010). Plasmodium vivax clinical malaria is commonly observed in Duffy-negative Malagasy people. Proc Nat Acad Sci USA 107 : 5967–5971.
Miller LH, Mason SJ, Clyde DF, McGinnis MH (1976). The resistance factor to Plasmodium vivax in blacks. The Duffy-blood-group genotype FyFy. New Engl J Med 295 : 302–304.
Mockenhaupt FP, Ehrhardt S, Gellert S, Otchwemah RN, Dietz E et al . (2004). Alpha (+)- thalassemia protects African children from severe malaria. Blood 104 : 2003–2006.
Modiano G, Morpurgo G, Terrenato L, Novelletto A, Di Rienzo A et al . (1991). Protection against malaria morbidity: near-fixation of the α-thalassemia gene in a Nepalese population. Am J Hum Genet 48 : 390–397.
Modiano D, Luoni G, Sirima BS, Simpore J, Verra F et al . (2001). Haemoglobin C protects against clinical Plasmodium falciparum malaria. Nature 414 : 305–308.
Modiano D, Bancone G, Ciminelli BM, Pompei F, Blot I (2008). Haemoglobin S and haemoglobin C: ‘quick but costly’ versus ‘slow but gratis’ genetic adaptations to Plasmodium falciparum malaria. Hum Molec Genet 17 : 789–799.
Mueller I, Widmer S, Michel D, Maraga S, McNamara DT et al . (2009). High sensitivity detection of Plasmodium species reveals positive correlations between infections of different species, shifts in age distribution and reduced local variation in Papua New Guinea. Malaria J 8 : 41.
Neafsey DE, Lawniczak MKN, Park DJ, Redmond SN, Coulibaly MB et al . (2010). SNP genotyping defines complex gene-flow boundaries among African malaria vector mosquitoes. Science 330 : 514–517.
Neel JV, Valentine WN (1947). Further studies on the genetics of thalassemia. Genetics 32 : 38–63.
Nkhoma ET, Poole C, Vannappagari V, Hall SA, Beutler E (2009). The global prevalence of glucose-6-phosphate dehydrogenase deficiency: A systematic review and meta-analysis. Blood Cells Molec Dis 42 : 267–278.
O'Donnell A, Premawardhena A, Arambepola M, Samaranayake R, Allen SJ et al . (2009). Interaction of malaria with a common form of severe thalassemia in an Asian population. Proc Nat Acad Sci USA 106 : 18716–18721.
Ohashi J, Naka I, Patarapotikul J, Hananantachai H, Brittenham G. et al . (2004). Extended linkage disequilibrium surrounding the hemoglobin E variant due to malarial selection. Am J Hum Genet 74 : 1198–1208.
Ohashi J, Naka I, Patarapotikul J, Hananantachai H, Brittenham G. et al . (2005). Strong linkage disequilibrium of a HbE variant with the (AT)9(T)5 repeat in the BP1 binding site upstream of the β-globin gene in the Thai population. J Hum Genet 50 : 7–11.
O'Shaughnessy DF, Hill AVS, Bowden DK, Weatherall DJ, Clegg JB (1990). Globin genes in Micronesia: origins and affinities of Pacific Island peoples. Am J Hum Genet 46 : 144–155.
Pauling L, Itano HA, Singer SJ, Wells IC (1949). Sickle cell anemia a molecular disease. Science 110 : 543–548.
Penman BS, Pybus OG, Weatherall DJ, Gupta S (2009). Epistatic interactions between genetic disorders of hemoglobin can explain why the sickle-cell gene is uncommon in the Mediterranean. Proc Nat Acad Sci USA 106 : 21242–21246.
Platt OS, Brambilla DJ, Rosse WF, Milner PF, Castro O et al . (1994). Mortality in sickle cell disease. New Eng J Med 330 : 1639–1644.
Ralph P, Coop G (2010). Parallel adaptation: one or many waves of advance of an advantageous allele? Genetics 186 : 647–668.
Rowe AK, Rowe SY, Snow RW, Korenromp EL, Schellenberg JR et al . (2006). The burden of malaria mortality among African children in the year 2000. Int J Epidemiol 35 : 691–704.
Rowe JA, Handel IG, Thera MA, Deans A-M, Lyke KE et al . (2007). Blood group O protects against severe Plasmodium falciparum malaria through the mechanism of reduced rosetting. Proc Nat Acad Sci USA 104 : 17471–17476.
Ruwende C, Khoo SC, Snow RW, Yates SNR, Kwiatkowski D et al . (1995). Natural selection of hemi- and heterozygotes for G6PD deficiency in Africa by resistance to severe malaria. Nature 376 : 246–249.
Sabeti PC, Reich DE, Higgins JM, Levine HZ, Richter DJ et al . (2002). Detecting recent positive selection in the human genome from haplotype structure. Nature 419 : 832–837.
Sanchaisuriya K, Fucharoen G, Sae-ung N, Siriratmanawong N, Surapot S, Fucharoen S. (2001). Molecular characterization of hemoglobin C in Thailand. Am J Hematol 67 : 189–193.
Saunders MA, Hammer MF, Nachman MW (2002). Nucleotide variability at G6pd and the signature of malarial selection in humans. Genetics 162 : 1849–1861.
Seixas S, Ferrand N, Rocha J (2002). Microsatellite variation and evolution of the human Duffy blood group polymorphism. Molec Biol Evol 19 : 1802–1806.
Sheng A-M, Chertow DS, Morens DM, Taubenberger JK (2010). Fatal 1918 pneumonia case complicated by erythrocyte sickling. Emerg Infect Dis 16 : 2000–2001.
Siniscalco M, Bernini L, Latte B, Motulsky AG (1961). Favism and thalassemia in Sardinia and their relationship to malaria. Nature 190 : 1179–1180.
Slatkin M (2008). A Bayesian method for jointly estimating allele age and selection intensity. Genet Res 90 : 129–137.
The Malaria Genomic Epidemiology Network (2008). A global network for investigation the genomic epidemiology of malaria. Nature 456 : 732–737.
Timmann C, Evan JA, König IR, Kleensang A Rüschendorf F et al . (2007). Genome-wide linkage analysis of malaria infection intensity and mild disease. PLoS Genet 3 : e48.
Tishkoff SA, Varkonyl R, Cahinhinan N, Abbes S, Argyropoulos G et al . (2001). Haplotype diversity and linkage disequilibrium at human G6PD : recent origin of alleles that confer malarial resistance. Science 293 : 455–45. 462.
Tishkoff SA, Verrelli BC (2004). G6PD deficiency and malarial resistance in humans: insight from evolutionary genetic analysis. In: Dronamraju KR (ed). Infectious Disease and Host-Pathogen Evolution . Cambridge University Press: Cambridge. pp 113–140.
Tsaras G, Owusu-Ansah A, Boateng FO, Amoateng-Adjepong Y (2009). Complications associated with sickle cell trait: a brief narrative review. Amer J Med 122 : 507–512.
Tung J, Primau A, Bouley AJ, Severson TF, Alberts SC, Wray GA (2009). Evolution of a malaria resistance gene in wild primates. Nature 460 : 388–391.
Uneke CJ (2007). Plasmodium falciparum malaria and ABO blood group: is there any relationship. Parasitol Res 100 : 769–765.
Verra F, Mangang VD, Modiano D (2009). Genetics of susceptibility to Plasmodium flaciparum: from classical malaria resistance genes towards genome-wide association studies. Parasite Immunol 31 : 234–253.
Verrelli BC, McDonald JH, Argyropoulos G, Destro-Bisol G, Froment A et al . (2002). Evidence for balancing selection from nucleotide sequence analyses of human G6PD . Am J Hum Genet 71 : 1112–1128.
Verrelli BC, Tishkoff SA, Stone AC, Touchman JW (2006). Contrasting histories of G6PD molecular evolution and malarial resistance in humans and chimpanzees. Mol Biol Evol 23 : 1592–1601.
Weatherall DJ (2004). J. B. S. Haldane and the malaria hypothesis. In: Dronamraju KR (ed). Infectious Disease and Host-Pathogen Evolution . Cambridge University Press. pp 18–36.
Weatherall DJ (2008). Genetic variation and susceptibility to infection: the red cell and malaria. Br J Haematol 141 : 276–286.
Weatherall DJ (2010). The inherited diseases of hemoglobin are an emerging global health burden. Blood 115 : 4331–4336.
Weatherall DJ, Clegg JB (2001). The Thalassemia Syndromes , 4th ed. Blackwell Scientific, Oxford.
WHO (1998). Demographic data for health situation assessment and projections . WHO HST/HSP: Geneva.
WHO (2009). World Malaria Report . World Health Organization, Geneva.
Wierenga KJJ, Hambleton IR, Lewis NA (2001). Survival estimates for patients with homozygous sickle-cell disease in Jamaica: a clinic-based population study. Lancet 357 : 680–683.
Williams TN, Mwangi TW, Wambua S, Peto TEA, Weatherall DJ. (2005a). Negative epistasis between the malaria-protective effects of α + -thalassemia and the sickle cell trait. Nat Genet 37 : 1253–1262.
Williams TN, Wambua S, Uyoga S, Macharia A, Mwacharo JK et al . (2005b). Both heterozygous and homozygous α + thalassemias protect against severe and fatal Plasmodium falciparum malaria on the coast of Kenya. Blood 106 : 368–371.
Willcox M, Björkman A, Brohult J (1983). Falciparum malaria and β-thalassemia trait in northern Liberia. Ann Trop Med Parasit 77 : 335–347.
Wood ET, Stover DA, Slatkin M, Nachman MW, Hammer MF 2005)). The β-globin recombinational hotspot reduces the effects of strong selection around HbC, a recently arisen mutation providing resistance to malaria. Am J Hum Genet 77 : 637–642.
Yamamoto F, Clausen H, White T, Marker J, Hakomori S (1990). Molecular genetic basis of the histo-blood group ABO system. Nature 345 : 229–233.
Zimmerman PA (2004). The enigma of Plasmodium vivax malaria and erythrocyte Duffy negativity. In: Dronamraju KR (ed.). Infectious Disease and Host-pathogen Evolution . Cambridge University Press: Cambridge. pp 141–172.
Zimmerman PA, Woolley I, Masinde GL, Miller SM, McNamara D et al . (1999). Emergence of FY* null in a Plasmodium vivax -endemic region of Papua New Guinea. Proc Nat Acad Sci USA 96 : 13973–13977.
Download references
Acknowledgements
I appreciate the comments on various aspects of the manuscript by B Penman, M Slatkin, A Stone, J Taylor, G Thomson, B Verrelli, P Zimmerman and three anonymous reviewers, and the assistance of M Saraniti on the Italian translation.
Author information
Authors and affiliations.
School of Life Sciences, Arizona State University, Tempe, AZ, USA
P W Hedrick
You can also search for this author in PubMed Google Scholar
Corresponding author
Correspondence to P W Hedrick .
Ethics declarations
Competing interests.
The author declares no conflict of interest.
Rights and permissions
Reprints and permissions
About this article
Cite this article.
Hedrick, P. Population genetics of malaria resistance in humans. Heredity 107 , 283–304 (2011). https://doi.org/10.1038/hdy.2011.16
Download citation
Received : 29 November 2010
Revised : 07 February 2011
Accepted : 11 February 2011
Published : 23 March 2011
Issue Date : October 2011
DOI : https://doi.org/10.1038/hdy.2011.16
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
- age of allele
- sickle cell
- thalassemia
This article is cited by
Genetic variants of tlr4, including the novel variant, rs5030719, and related genes are associated with susceptibility to clinical malaria in african children.
- Guicheng Zhang
Malaria Journal (2023)
Therapeutics through glycobiology: an approach for targeted elimination of malaria
- Mallya Divya
- Sowmya R. Prabhu
- Abdul Vahab Saadi
Biologia (2023)
Erythrocyte miRNA-92a-3p interactions with PfEMP1 as determinants of clinical malaria
- Akshay Pramod Ware
- Kapaettu Satyamoorthy
Functional & Integrative Genomics (2023)
Duffy blood system and G6PD genetic variants in vivax malaria patients from Manaus, Amazonas, Brazil
- Natália S. Ferreira
- Jéssica Lorena S. Mathias
- José P. Moura Neto
Malaria Journal (2022)
Genetics of cerebral malaria: pathogenesis, biomarkers and emerging therapeutic interventions
- Lydia Nkuah Nortey
- Alberta Serwah Anning
- George Ghartey-Kwansah
Cell & Bioscience (2022)
Quick links
- Explore articles by subject
- Guide to authors
- Editorial policies
- Open access
- Published: 22 October 2012
Resistance to malaria in humans: the impact of strong, recent selection
- Philip W Hedrick 1
Malaria Journal volume 11 , Article number: 349 ( 2012 ) Cite this article
27k Accesses
36 Citations
4 Altmetric
Metrics details
Malaria is one of the leading causes of death worldwide and has been suggested as the most potent type of selection in humans in recent millennia. As a result, genes involved in malaria resistance are excellent examples of recent, strong selection. In 1949, Haldane initially suggested that infectious disease could be a strong selective force in human populations. Evidence for the strong selective effect of malaria resistance includes the high frequency of a number of detrimental genetic diseases caused by the pleiotropic effects of these malaria resistance variants, many of which are “loss of function” mutants. Evidence that this selection is recent comes from the genetic dating of the age of a number of these malaria resistant alleles to less than 5,000 years before the present, generally much more recent than other human genetic variants. An approach to estimate selection coefficients from contemporary case–control data is presented. In the situations described here, selection is much greater than 1%, significantly higher than generally observed for other human genetic variation. With these selection coefficients, predictions are generated about the joint change of alleles S and C at the β -globin locus, and for α -thalassaemia haplotypes and S , variants that are unlinked but exhibit epistasis. Population genetics can be used to determine the amount and pattern of selection in the past and predict selection in the future for other malaria resistance variants as they are discovered.
Malaria is one of the leading causes of death worldwide and has been suggested as the most potent type of selection in humans in recent millennia [ 1 ]. As a result, genes involved in malaria resistance are excellent examples of recent, strong selection. Perhaps best known is the sickle cell haemoglobin variant, which is often used as an example of heterozygote advantage. In addition, G6PD deficiency illustrates strong selection at an X-linked locus, β -globin variants S , C , and E and G6PD deficiency variants A -, Med , and Mahidol show how selective differences can be the result of a single-nucleotide change. Further, HLA-B53 illustrates how gene conversion can result in an adaptive allele, and α -thalassaemia shows how selection can operate on loci that have different copy numbers.
Haldane, known as one of the three founders of population genetics, is often recognized with first suggesting that disease could be an important evolutionary force in humans. Although his general review [ 2 ] is often cited for this concept, this hypothesis was presented in more detail in 1948 [ 3 ] where he suggested that β -thalassaemia heterozygotes had an increased fitness in the presence of malaria. Therefore, citation of [ 3 ] seems more correct for the hypothesis that malaria resistance in humans might be genetically determined and evolutionarily significant [ 4 – 6 ].
The first generally recognized evidence for genetic resistance to malaria in humans was in 1954 [ 7 ] for sickle-cell haemoglobin heterozygotes AS . Overall, the “malaria hypothesis” of Haldane that some human diseases such as thalassaemia are polymorphisms and provide heterozygote advantage because of the trade-offs between the advantages of resistance to malaria and negative effects due to the disease, is now widely accepted but the exact means of disease resistance have often been difficult to elucidate.
As documentation for the contemporary influence of malaria, in 2010 there were 216 million clinical cases and an estimated 863,000 deaths from malaria [ 8 ], making it one of the leading causes of death worldwide. Although these levels have declined globally 25% since 2000, another analysis estimated that the annual mortality may be much higher than this level at 1.24 million [ 9 ].
The impact of malaria is thought to have increased between 10,000 and 5,000 years ago when there were the beginnings of agriculture and consequently more human settlements. During this period, the numbers of both the human population and the mosquito vector increased, resulting in higher spread of malaria [ 10 ]. Recent molecular studies suggest that malaria in humans from Plasmodium falciparum may have originated from gorillas [ 11 , 12 ]. Using these data, an initial timeline for the origin of P. falciparum as a human pathogen suggests that it may be more recent than previously thought [ 13 ].
The past geographic extent of malaria and the distribution of malaria resistance variants broadly correspond [ 14 – 16 ]. For example, P. falciparum is found across Africa and Asia, as are the variants of haemoglobin and G6PD that provide malaria resistance. In regions of high endemic malaria, such as sub-Saharan tropical Africa and lowland Melanesia, there are often several variants. In contrast, variants do not exist in areas without past malaria, with the exception of ancestry from new immigrants [ 17 ]. To illustrate, the common malaria resistant alleles are not present in New World natives [ 18 ], presumably because their ancestors were unexposed to malaria and malaria only came to the Americas during the transatlantic slave trade between the 16 th and 19 th centuries [ 19 ]. Further, microgeographic variation of β -thalassaemia in Sardinia and the past presence of malaria are concordant [ 20 ] and α -thalassaemia and β -thalassaemia variation in Melanesia is associated with malaria presence [ 21 , 22 ].
Some resistance alleles for malaria are distinctive and others are very general. To illustrate, geographically separated Duffy alleles in Papua New Guinea and Africa result from changes at the same exact genomic location, 33 nucleotides upstream from the start codon. Similarly, the β -globin malaria resistance alleles S and C occur from different changes at the same codon. In contrast, there are many changes that modify levels of expression and provide malaria resistance for G6PD deficiency, α -thalassaemia, and β -thalassaemia.
It is significant that malaria resistance genes are often extremely variable, for example, the malaria resistance genes ABO, G6PD, HLA , α -globin, and β -globin, are some of the most variable human genes. Such variation might be because disease resistance genes have high amounts of standing variation or because these genes have high mutation rates and produce new adaptive alleles very quickly.
There are several current first-rate reviews of the mechanisms of malaria resistance and evaluations of the malaria resistance genes [ 1 , 23 – 26 ] and in an extensive recent review, many aspects of the population genetics of malaria resistance were examined [ 6 ]. Therefore, the focus here will be on the data indicating the very strong and recent selection for malaria resistance in humans. In addition, it will be shown how the expected change when there is strong selection for malaria resistance variants can be predicted with population genetics models.
Strength of selection
Pleiotropic disease effects.
Strong selection for malaria resistance has increased, due to pleiotropic effects, the frequency of genetic diseases, such as G6PD deficiency, the thalassaemias, and ovalocytosis. For example, it has been suggested that this has caused a worldwide problem and has resulted in 300,000 newborns each year with haemoglobin maladies [ 27 ]. These pleiotropic effects explain why such disorders are the most common genetic diseases [ 28 ]. It is also possible that some other diseases may be in high frequency because of the pleiotropic effects of malaria resistance variants [ 1 ].
Around 400 million people (worldwide prevalence of 4.9%) are affected with G6PD deficiency, the most common enzymopathy in humans [ 29 ]. Even though people with G6PD deficiency are often asymptomatic, various factors that result in oxidative stress can cause disease episodes. To illustrate, fava beans can cause severe haemolytic anaemia (favism) for individuals with the Med G6PD deficiency.
Perhaps related to the high frequency of diseases related to genes conferring resistance to malaria is that many of resistance variants, such as G6PD deficiency, the thalassaemias, sickle-cell anaemia, and ovalocytosis, are “loss of function” mutants and lead to reduced expression or altered gene product. Sometimes these variants result in disease because of their pleiotropic effects. Or, in other situations, there is no cost to the change in expression or gene product, as appears to be the case for the Duffy “null” variants and for β -globin gene allele C .
Age of selective variants
Many malaria resistance genetic variants appear to be recent polymorphisms and have been generated in the last 5,000 to 10,000 years or less (see Table 1 and below). These variants appear to date to the time when severe malaria first became an important human evolutionary factor. From this, one can surmise that they were not malaria resistance alleles in ancestral species, i.e., they are not trans-specific polymorphisms, and that they most likely were the result of standing variation or generated by new mutations. Homologous resistance alleles are not generally found in other primates [ 30 – 32 ] or if they are, the variants are different from human variants [ 33 ].
The age of S alleles has been estimated from the amount of association (linkage disequilibrium) in sequence near the HBB locus. To illustrate, all samples of S from eastern Senegal had the same haplotype [ 34 ]. Assuming that selection for AS was 0.152 higher than for AA, the estimated age of this S allele was 1,440 years (see Table 1 ) [ 34 ]. Assuming that this allele was in low frequency (standing variation), it is likely that this age indicates the number of years since selection started to increase this allele.
Overall, there might have been five different mutations originating the S allele from the A allele. This conclusion is based on the presence of these variants in almost non-overlapping geographic regions, one found in both India and Saudi Arabia and four others in different parts of Africa, and the large differences in DNA sequence for genetic regions linked to S , resulting in different haplotypes in different geographic regions. The Mediterranean S alleles appear related to the African S haplotypes, particularly the Benin haplotype [ 14 ]. Recent parallel adaptation of different S alleles is consistent with an encompassing population genetics model [ 35 ].
In addition, Table 1 gives the estimated or assumed selection coefficient for different variants. For example, a selection coefficient of 0.152 for S indicates that the sickle-cell heterozygote AS has a 15.2% advantage over the normal homozygote AA . As a comparison, a selective advantage of more than a few per cent is unusual for other human genes.
A joint estimate of the age of the C allele and the selection coefficient was approximated with a coalescent approach [ 36 ]. The allele age was estimated to be 2,810 years and a selection coefficient of 0.06. More than one mutational origin for the E allele in Thailand is indicated [ 37 ]. On the other hand, the Chinese E allele has the same haplotype as in Thailand [ 38 ], implying the same origin. The age of the E allele was estimated to be about 2,510 years [ 39 ] and selection of 0.079.
Different ways have been used to estimate the age of G6PD mutants (Table 1 ). Using simulations, estimation of the age and selection for the A - allele was 6,360 years with selection of 0.044 and for the Med allele was 3,330 years and selection of 0.034 [ 40 ]. Using Bayesian methods, the estimated age of A - was only 1,000 years and selection was 0.25 [ 41 ] and the age of the Mahidol allele was 1,580 years with selection of 0.23 [ 42 ]. These estimates may differ because those in [ 40 ] used microsatellite loci and those in [ 41 ] used sequence data.
The age of the HLA-Bw53 variant can be estimated using the approach below to estimate s (with OR = 0.59, m = 0.1, so that s = 0.041) and iteration of population genetic equations. Therefore, it would take approximately 2,150 years for Bw53 to reach 0.25 [ 6 ]. This time is substantially less than the time that strong selection in Africa for malaria resistance is thought to have occurred.
There are several other malaria resistant variants that appear to be recent but for which there has been no formal estimation of their age. For example, out of about 80 deletions that result in α + thalassaemia, four common ones have different geographic distributions; for example, deletion – α 3.7 is found nearly worldwide but is most common in Africa, deletion – α 4.2 has a high frequency in southeast Asia, and – α 3.7I is found most commonly in African, Indian and Mediterranean populations [ 14 ]. In addition, there are a number of mutants that have very limited geographic distributions. The age of these mutants has not been estimated, but their narrow distributions indicate recent generation.
Most of the β -thalassaemias are the result of single nucleotide substitutions or small insertions or deletions. Around 200 β -thalassaemias variants are known and they also have narrow geographic distributions suggesting that these changes indicate independent mutations and have been recently generated. β -thalassaemia variants in carriers are in frequencies from 5 to 20%, somewhat lower in frequency than α - thalassaemia variants [ 43 ].
A mutation in gene SLC4A1 (erythrocyte Band 3 protein gene) found in Papua New Guinea and Malaysia results from a 27-nucleotide (9-amino acid) deletion. This genetic region is conserved throughout species [ 44 ] and the mutant causes abnormal ovalocytosis, an atypical red blood cell shape, and anaemia in heterozygotes. Even in progeny from matings between heterozygotes, homozygotes have not observed, indicating they are lethal [ 45 ]. The lethality of homozygotes and the high resistance of heterozygotes to malaria suggest that this variant has properties not very different from the S allele because the advantage to heterozygotes in resistance offsets the cost in fitness in homozygotes. Because of its limited distribution, it is thought to be a recent mutation.
There are several variants that provide resistance to malaria that appear to be older. For example, there are two antigens A and B produced by alleles FY*A and FY*B at the Duffy blood group locus [ 46 , 47 ]. Many African-Americans and Africans do not have either allele and have another allele, known generally as a Duffy null allele. This allele, FY * B ES , is in very high frequency in most of Africa and provides resistance to malaria species Plasmodium vivax , but not to P. falciparum [ 48 ].
The estimated age of the FY * B ES allele, based on sequence analysis, is around 33,000 years [ 49 ], while the estimated age of this allele using microsatellite data is around 10,000 years ago [ 50 ]. The earlier estimated origin may be more consistent with the early impact of P. vivax on human mortality. Surprisingly, a Duffy allele in low frequency (0.022) in Papua New Guinea, also differs from FY*A in the same exact sequence as does the FY * B ES allele found in Africa [ 51 ]. The low frequency of this allele is probably because of its recent origin, a situation seen in the pattern of alleles at closely linked loci [ 51 ].
Examples of strong selection and change in allele frequency
As illustration, here two examples are given of fast expected genetic changes over time due to strong selection at genes conferring resistance to malaria. Both examples are ones in which there are two resistant variants in one population, one in which the genetic variants are different alleles at the same gene ( S and C ) and the other where the genetic variants are different alleles at different genes but there is epistasis between these variants ( S and α -thalassaemia). Before these examples are described, an approach will be briefly introduced to estimate the extent of selection at these variants using contemporary case–control data (see [ 6 ] for details).
Estimation of selection
A general approach to determine the risk of individuals with a given genotype getting a disease, relative to that in the rest of the population, is to calculate the odds ratio ( OR ) as OR = f d (1 – f c )]/ f c (1 - f d )] where the frequencies of the genotype in control and diseased groups are f c and f d , respectively. For example, the selection coefficient for the AS heterozygote can be estimated as s = m (1 – OR ) where m is the rate of non-genotype specific mortality from malaria (generally the mortality given that an individual has severe malaria) [ 52 , 53 ].
To illustrate how OR and s can be calculated, data from a large study of malaria in The Gambia can be used to determine the effect of sickle-cell variation on the presence of severe malaria [ 54 ]. In 619 children with severe malaria, only seven (0.012) were heterozygous AS . On the other hand, in 510 other children, who were outpatients and termed “mild controls”, many more 66 (0.129) were AS . Therefore, OR = 0.082, indicating very great protection from severe malaria for AS genotypes. A value of 0.07 for m was suggested [ 52 ] while, a value of at least m = 0.1 may be appropriate [ 10 , 55 ]. For the AS genotype, if OR = 0.082 and m = 0.1, then s = 0.092.
Increase of allele C in the presence of allele S
The polymorphic structural variants at the β -globin locus, S and C , both are malaria resistance alleles. Variant S is more widespread than C, which is in sub-Saharan Africa, but they both occur in a number of populations. Although some early studies suggested a stable polymorphism of these variants, a Burkina Faso sample [ 56 ] indicated the C allele would become fixed over time [ 53 , 57 ].
In the Burkina Faso sample, the genotypes AC , AS , and CC showed relative resistance to malaria and AA showed relative susceptibility [ 56 ]. The frequencies of these genotypes are given in Table 2 in both control and disease groups along with OR values. Fitnesses for genotypes SC and SS were based on deviations from Hardy-Weinberg proportions [ 53 ]. If m = 0.1, the relative fitness are given at the bottom in Table 2 .
With these fitnesses, when S is introduced to a population with only the A allele, it will go to a stable equilibrium with the frequency of S = 0.12. When C is introduced to a population with only the A allele, it will increase in frequency to fixation in a little more than 2,500 years (Figure 1 ). When the S and A alleles are at their stable equilibrium, then C can enter the population and eventually increase to fixation after about 5,000 years (Figure 1 ).
The increase in frequency of allele C when it begins at a frequency of 0.01 (solid lines) both when S is absent and when S begins at its equilibrium frequency of 0.12 [ [ 53 ] ]. The change in frequency of S is also given for the last situation (broken line).
These results occur because genotype CC has the highest relative fitness of the genotypes. In addition, it might clarify situations, such as in the Dogon people of Mali who have a high frequency of C and a low S frequency [ 58 ]. In addition, and of public health consequence, the mean fitness of the population is about 11% higher when fixed for C than when it is polymorphic for A and S because of the absence of sickle-cell anaemia and higher resistance to malaria.
Effect of epistasis between alleles S and α + - thalassaemia
Sickle cell anaemia and α + -thalassaemia both occur in some sub-Saharan African populations. The adult haemoglobin molecule is composed for two subunits of each of the α -globin and β -globin molecules. As a result, it is not unexpected that there could be gene-gene interaction, or epistasis, at these loci for fitness. In a Kenyan population, these disorders were both present [ 59 ] and when the two disorders were inherited together in the same individual, the protection from malaria given individually by each allele was lost (see also [ 60 ]). This interaction, or negative epistasis, could be part of the explanation why α + -thalassaemia does not appear fixed in sub-Saharan populations. A traditional population genetics approach is used here to examine this situation, for an epidemiological approach, see [ 59 , 61 , 62 ]
The relative viabilities (fitnesses) of the different two-locus genotypes (Table 3 ) were based on genotype-specific annual mortality rates (see details in [ 6 , 59 ]). With a wild-type genotype at both genes, the estimated relative fitness is only 0.55. The presence of S in AS and the presence of − α in both genotypes − α / αα and − α /− α increases the estimated relative fitness, given normal genotypes at the other locus. However, the relative survival is reduced to only 0.55 when there is an AS genotype and homozygosity for − α , the same as the relative fitness for the wild-type genotype.
This array of relative fitness values illustrates several things. First, there is a stable polymorphism with S equal to 0.309 when only the normal αα haplotype at the a -globin gene is present (first column). Second, variant – α haplotype would be fixed when only the normal A allele is present at the β -globin locus (first row) because the – α/ − α genotype has the highest fitness. In addition, some two-locus behaviour can be intuited by looking at these viability values further. For example, if the population is fixed for – α/ – α it does not appear that S can enter the population because AA – α /− α > AS − α /− α (Table 3 ).
Using iterations of two-locus gamete frequency equations with selection [ 63 ], more details of the genetic behaviour of gametes with this two-locus fitness array are observed. For example, if – α is polymorphic, there is an unstable equilibrium so that – α will either increase or go to a two-locus equilibrium, depending upon its initial frequency, if S is introduced into the population. Figure 2 presents this situation where the initial frequency of S is 0.01 and the initial frequency of – α is either 0.55 or 0.65. The frequency of – α initially increases for 0.55 and then approaches the two-locus equilibrium at 0.272. On the other hand, when the initial frequency is 0.65, it increases to unity while S initially increases but then decreases to 0 (for further discussion see [ 6 ]). Notice that although the amount of epistasis between the genes is high, the time for these changes to occur is quite long.
The change in the frequency of – a where the initial frequency is either 0.55 or 0.65 using the relative fitnesses in Table 3 and the initial frequency of S is 0.01.
Conclusions
Obviously, Haldane [ 3 ] was correct to point out that malaria resistance in humans was a significant evolutionary factor. Malaria resistance genes comprise some of the most widely accepted examples of strong positive selection in humans. Overall, the original “malaria hypothesis” of Haldane that diseases like thalassaemia are polymorphisms because of resistance to malaria, has been proven correct. However, much is still to be learned about actual mechanisms of protection, other genes that confer resistance, and the population genetics of this variation.
Malaria resistance offers cases of a number of aspects of population genetics, particularly situations of recent, strong selection. Overall, many of these resistance variants appear less than 5,000 years old, much more recent than for most human variants. They also have selection coefficients significantly larger than 1%, much stronger than for most human variants. Population genetics will probably be even more important in understanding the joint impact of multiple malaria resistance variants in the future. Two situations are examined here, and it is shown that in populations segregating for S and C , selection is expected to eliminate S and fix C . Also, depending upon the starting frequencies, in populations segregating for α -thalassaemia and S , either the α -thalassaemia haplotype will become fixed and the S allele eliminated or a stable equilibrium of both variants will occur because of negative epistasis.
Kwiatkowski DP: How malaria has affected the human genome and what human genetics can teach us about malaria. Amer J Hum Genet. 2005, 77: 171-192. 10.1086/432519.
Article PubMed Central CAS PubMed Google Scholar
Haldane JBS: Disease and evolution. La Ricerca Scientifica. 1949, 19: 68-76.
Google Scholar
Haldane JBS: The rate of mutation of human genes. Hereditas. 1949, 35 (S1): 267-273.
Article Google Scholar
Crow JF: J. B. S. Haldane’s ideas in biology with special reference to disease and evolution. Infectious Disease and Host-Pathogen Evolution. Edited by: Dronamraju KR. 2004, Cambridge: Cambridge University Press, 11-17.
Chapter Google Scholar
Weatherall DJ: J. B. S. Haldane and the malaria hypothesis. Infectious Disease and Host-Pathogen Evolution. Edited by: Dronamraju KR. 2004, Cambridge: Cambridge University Press, 18-36.
Hedrick PW: Population genetics of malaria resistance in humans. Heredity. 2011, 107: 283-304. 10.1038/hdy.2011.16.
Allison AC: Protection afforded by the sickle-cell trait against subtertian malaria infection. Brit Med J. 1954, 1: 290-294. 10.1136/bmj.1.4857.290.
WHO: World Malaria Report. 2011, Geneva: World Health Organization
Murray CJL, Rosenfeld LC, Lim SS, Andrews KG, Foreman KJ, Haring D, Fullman N, Lozano NM, Lozano R, Lopez AD: Global malaria mortality between 1980 and 2010: a systematic analysis. Lancet. 2012, 379: 413-431. 10.1016/S0140-6736(12)60034-8.
Article PubMed Google Scholar
Carter R, Mendis KN: Evolutionary and historical aspects of the burden of malaria. Clin Micro Rev. 2002, 15: 564-594. 10.1128/CMR.15.4.564-594.2002.
Article PubMed Central PubMed Google Scholar
Liu W, Li V, Rudicell RS, Robertson JD, Keele BF, Ndjango J-B N, Sanz CM, Morgan DB, Locatelli S, Gonder MK, Kranzusch PJ, Walsh PD, Delaporte E, Mpoudi-Ngole E, Georgiev AV, Muller MN, Shaw GM, Peeters M, Sharp PM, Rayner JC, Hahn BH: Origin of the human malaria parasite Plasmodium falciparum in gorillas. Nature. 2010, 467: 420-425. 10.1038/nature09442.
Rayner JC, Liu W, Peeters M, Sharp PM, Hahn BH: A plethora of Plasmodium species in wild apes: a source of human infection?. Trends Parasit. 2011, 27: 222-229. 10.1016/j.pt.2011.01.006.
Baron JM, Higgins JM, Dzik WH: A revised timeline for the origin of Plasmodium falciparum as a human pathogen. J Molec Evol. 2011, 73: 297-304. 10.1007/s00239-011-9476-x.
Article CAS PubMed Google Scholar
Flint J, Harding RM, Boyce AJ, Clegg JB: The population genetics of the haemoglobinopathies. Bailliere’s Clin Haemat. 1998, 11: 1-51. 10.1016/S0950-3536(98)80069-3.
Article CAS Google Scholar
Piel FB, Patil AP, Howes RE, Nyangiri OA, Gething PW, Williams TN, Weatherall DJ, Hay SI: Global distribution of the sickle cell gene and geographical confirmation of the malaria hypothesis. Nat Commun. 2010, 1: 104-10.1038/ncomms1104.
Howes RE, Patil AP, Piel FB, Nyangiri OA, Kabaria CW, Gething PW, Zimmerman PA, Bernades C, Beall CM, Gebremedhin A, Ménard D, Williams TN, Weatherall DJ, Hay SI: The global distribution of the Duffy blood group. Nat Commun. 2011, 2: 226-
O’Shaughnessy DF, Hill AVS, Bowden DK, Weatherall DJ, Clegg JB: Globin genes in Micronesia: origins and affinities of Pacific Island peoples. Amer J Hum Genet. 1990, 46: 144-155.
PubMed Central PubMed Google Scholar
Livingstone FB: Frequencies of Haemoglobin Variants. 1985, Oxford: Oxford University Press
Valcindag E, Elguero E, Arnathau C, Durand P, Aklana J, Anderson TJ, Aubouy A, Balloux F, Besnard P, Bogreau H, Canevale P, D’Alessandro U, Fontenille D, Gamboa D, Jombart T, Le Mire J, Leroy E, Maestre A, Mayxay M, Menard , Musset L, Newton PN, Nkoghe D, Noya O, Ollomo B, Rogier , Veron V, Wide A, Zakeri S, Carme B, Legrand E, Chevillon C, Ayala FJ, Renaud F, Prugnolle F: Multiple independent introductions of Plasmodium falciparum in South America. Proc Nat Acad Sci USA. 2012, 109: 511-516. 10.1073/pnas.1119058109.
Siniscaloco M, Bernini L, Latte B, Motulsky AG: Favism and thalassemia in Sardinia and their relationship to malaria. Nature. 1961, 190: 1179-1180. 10.1038/1901179a0.
Flint J, Hill AV, Bowden DK, Oppenheimer SJ, Sill PR, Serjeantson SW, Bana-Koiri J, Bhatia K, Alpers MP, Boyce AJ, Weatherall DJ, Clegg JB: High frequencies of alpha-thalassemia are the result of natural selection by malaria. Nature. 1986, 321: 744-750. 10.1038/321744a0.
Hill AVS, Bowden DK, O’Shaughnessy DF, Weatherall DJ, Clegg JB: b-thalassemia in Melanesia: association with malaria and characterization of a common variant (IVS1 nt 5 G-C). Blood. 1988, 72: 9-14.
CAS PubMed Google Scholar
Kwiatkowski DP, Luoni G: Malaria. Genetic Susceptibility to Infectious Diseases. Edited by: Kaslow RA, McNicholl JM, Hill AVS. 2008, Oxford: Oxford University Press, 372-386.
Verra F, Mangang VD, Modiano D: Genetics of susceptibility to Plasmodium falciparum: from classical malaria resistance genes towards genome-wide association studies. Parasite Immun. 2009, 31: 234-253. 10.1111/j.1365-3024.2009.01106.x.
López C, Saravia C, Gomez A, Hoebeke J, Patarroyo MA: Mechanisms of genetically-based resistance to malaria. Gene. 2011, 467: 1-12.
Driss A, Hibbert JM, Wilson NO, Iqbal SA, Adamkiewicz TV, Stiles JK: Genetic polymorphisms linked to susceptibility to malaria. Malar J. 2011, 10: 271-10.1186/1475-2875-10-271.
Weatherall DJ: The inherited diseases of hemoglobin are an emerging global health burden. Blood. 2010, 115: 4331-4336. 10.1182/blood-2010-01-251348.
Weatherall DJ: Genetic variation and susceptibility to infection: the red cell and malaria. Brit J Haemat. 2008, 141: 276-286. 10.1111/j.1365-2141.2008.07085.x.
Nkhoma ET, Poole C, Vannappagari V, Hall SA, Beutler E: The global prevalence of glucose-6-phosphate dehydrogenase deficiency: A systematic review and meta-analysis. Blood Cells Mol Dis. 2009, 42: 267-278. 10.1016/j.bcmd.2008.12.005.
Verrelli BC, Tishkoff SA, Stone AC, Touchman JW: Contrasting histories of G6PD molecular evolution and malarial resistance in humans and chimpanzees. Molec Biol Evol. 2006, 23: 1592-1601. 10.1093/molbev/msl024.
MacFie TS, Nerrienet E, Bontrop RE, Mundy NI: The action of falciparum malaria on the human and chimpanzee genomes compared: Absence of evidence for a genomic signature of malaria at HBB and G6PD in three subspecies of chimpanzee. Infect Genet Evol. 2009, 9: 1248-1252. 10.1016/j.meegid.2009.06.025.
Demogines A, Truong KA, Sawyer SL: Species-specific features of DARC, the primate receptor for Plasmodium vivax and Plasmodium knowlesi. Molec Biol Evol. 2011, 29: 445-449.
Tung J, Primau A, Bouley AJ, Severson TF, Alberts SC, Wray GA: Evolution of a malaria resistance gene in wild primates. Nature. 2009, 460: 388-391.
Currat M, Trabucher G, Rees D, Perrin P, Harding RM, Clegg JB, Langaney A, Excoffier L: Molecular analysis of the β-globin gene cluster in the Niokholo Mandenka population reveals a recent origin of the βS Senegal mutation. Amer J Hum Genet. 2002, 70: 207-223. 10.1086/338304.
Ralph P, Coop G: Parallel adaptation: one or many waves of advance of an advantageous allele?. Genetics. 2010, 186: 647-668. 10.1534/genetics.110.119594.
Wood ET, Stover DA, Slatkin M, Nachman MW, Hammer MF: The β-globin recombinational hotspot reduces the effects of strong selection around HbC, a recently arisen mutation providing resistance to malaria. Amer J Hum Genet. 2005, 77: 637-642. 10.1086/491748.
Ohashi J, Naka I, Patarapotikul J, Hananantachai H, Brittenham , Looareesuwan S, Clark AG, Tokunaga K: Strong linkage disequilibrium of a HbE variant with the (AT)9(T)5 repeat in the BP1 binding site upstream of the β-globin gene in the Thai population. J Hum Genet. 2005, 50: 7-11. 10.1007/s10038-004-0210-z.
Fucharoen G, Fucharoen S, Sanchaisuriya K, Sae-ung N, Suyasunanond U, Sriwilai , Chinorak P: Frequency distribution and haplotypic heterogeneity of βE-globin gene among eight minority groups of northeast Thailand. Hum Hered. 2002, 53: 18-22. 10.1159/000048600.
Ohashi J, Naka I, Patarapotikul J, Hananantachai H, Brittenham G, Looareesuwan S, Clark AG, Tokunaga K: Extended linkage disequilibrium surrounding the hemoglobin E variant due to malarial selection. Amer J Hum Genet. 2004, 74: 1198-1208. 10.1086/421330.
Tishkoff SA, Varkonyl R, Cahinhinan N, Abbes S, Argyropoulos G, Destro-Bisol G, Drousiotou A, Dangerfield B, Lefranc G, Loiselet J, Piro A, Stoneking M, Tagarelli A, Tagarelli G, Touma EH, Williams SM, Clark AG: Haplotype diversity and linkage disequilibrium at human G6PD: recent origin of alleles that confer malarial resistance. Science. 2001, 293: 455-462. 10.1126/science.1061573.
Slatkin M: A Bayesian method for jointly estimating allele age and selection intensity. Genet Res. 2008, 90: 129-137.
Louicharoen C, Patin E, Paul R, Nuchprayoon I, Witoonpanich B, Peerapittayamongkol C, Casademont I, Sura T, Laird NM, Singhasivanon P, Quintana-Murci L, Sakuntabhai A: Positively selected G6PD-Mahidol mutation reduces Plasmodium vivax density in southeast Asian. Science. 2009, 326: 1546-1549. 10.1126/science.1178849.
Weatherall DJ, Clegg JB: The Thalassemia Syndromes. 2001, Oxford: Blackwell Scientific, 4
Book Google Scholar
Jarolim P, Palek J, Amato D, Hassan K, Sapak P, Nurse GT, Rubin HL, Zhai S, Sahr KE, Liu S-C: Deletion in erythrocyte band 3 gene in malaria–resistant Southeast Asian ovalocytosis. Proc Nat Acad Sci USA. 1991, 88: 11022-11026. 10.1073/pnas.88.24.11022.
Genton B, Al-Yaman F, Mgone CS, Alexander N, Paniu MM, Alpers MP: Ovalocytosis and cerebral malaria. Nature. 1995, 378: 564-565. 10.1038/378564a0.
Zimmerman PA: The enigma of Plasmodium vivax malaria and erythrocyte Duffy negativity. Infectious Disease and Host-Pathogen Evolution. Edited by: Dronamraju KR. 2004, Cambridge: Cambridge University Press, 141-172.
Mercereau-Puijalon O, Ménard D: Plasmodium vivax and the Duffy antigen: A paradigm revisited. Transfus Clin Biol. 2010, 17: 176-183. 10.1016/j.tracli.2010.06.005.
Miller LH, Mason SJ, Clyde DF, McGinnis MH: The resistance factor to Plasmodium vivax in blacks. The Duffy-blood-group genotype FyFy. New England J Med. 1976, 295: 302-304. 10.1056/NEJM197608052950602.
Hamblin MT, Di Rienzo A: Detection of the signature of natural selection in humans: evidence from the Duffy blood group locus. Amer J Hum Genet. 2000, 66: 1669-1679. 10.1086/302879.
Seixas S, Ferrand N, Rocha J: Microsatellite variation and evolution of the human Duffy blood group polymorphism. Molec Biol Evol. 2002, 19: 1802-1806. 10.1093/oxfordjournals.molbev.a004003.
Zimmerman PA, Woolley I, Masinde GL, Miller SM, McNamara D, Hazlett F, Mgone CS, Alpers MP, Genton B, Boatin BA, Kazure JW: Emergence of FY*null in a Plasmodium vivax-endemic region of Papua New Guinea. Proc Nat Acad Sci USA. 2009, 96: 13973-13977.
Hill AVS: HLA associations with malaria in Africa: some implications for MHC evolution. Molecular Evolution of the Major Histocompatibility Complex. Edited by: Klein J, Klein D. 1991, Berlin: Springer, 403-434.
Hedrick PW: Estimation of relative fitnesses from relative risk data and the predicted future of haemoglobin alleles S and C. J Evol Biol. 2004, 17: 221-224.
Hill AVS, Allsopp CEM, Kwiatkowski D, Anstey NM, Twumasi P, Rowe PA, Bennett S, Brewster D, McMichael AJ, Greenwood BM: Common west African HLA antigens are associated with protection from severe malaria. Nature. 1991, 352: 595-600. 10.1038/352595a0.
WHO: Demographic data for health situation assessment and projections. 1998, Geneva: WHO HST/HSP
Modiano D, Luoni G, Sirima BS, Simpore J, Verra F, Konate A, Rashtrelli , Oliveri A, Calissano C, Paganotti GM, D’Urbano L, Sanou I, Sawadogo A, Modiano G, Coluzzi M: Haemoglobin C protects against clinical Plasmodium falciparum malaria. Nature. 2001, 414: 305-308. 10.1038/35104556.
Modiano D, Bancone G, Ciminelli BM, Pompei F, Blot I: Haemoglobin S and haemoglobin C: ‘quick but costly’ versus ‘slow but gratis’ genetic adaptations to Plasmodium falciparum malaria. Hum Molec Genet. 2008, 17: 789-799.
Agarwal A, Guindo A, Cissoko Y, Taylor TG, Coulibaly D, Kone A, Kayentao K, Djimde A, Plowe CV, Documbo O, Wellems TE, Diallo D: Hemoglobin C associated with protection from severe malaria in the Dogon of Mali, a West African population with a low prevalence of hemoglobin S. Blood. 2000, 96: 2358-2363.
Williams TN, Mwangi TW, Wambua S, Peto TEA, Weatherall DJ: Negative epistasis between the malaria-protective effects of α+-thalassemia and the sickle cell trait. Nat Genet. 2005, 37: 1253-1262. 10.1038/ng1660.
May J, Evans JA, Timmann C, Ehmen C, Busch W, Thye T, Agbenyega T, Horstmann RD: Hemoglobin variants and disease manifestations in severe falciparum malaria. J Amer Med Assoc. 2007, 297: 2220-2226. 10.1001/jama.297.20.2220.
Penman BS, Pybus OG, Weatherall DJ, Gupta S: Epistatic interactions between genetic disorders of hemoglobin can explain why the sickle-cell gene is uncommon in the Mediterranean. Proc Nat Acad Sci USA. 2009, 106: 21242-21246. 10.1073/pnas.0910840106.
Penman BS, Habib S, Kanchan K, Gupta S: Negative epistasis between a+ thalassaemia and sickle cell trait can explain interpopulation variation in south Asia. Evolution. 2011, 65: 3625-3632. 10.1111/j.1558-5646.2011.01408.x.
Hedrick PW: Genetics of Populations. 2011, Boston: Jones and Bartlett, 4
Download references
Acknowledgements
This research was partially supported by the Ullman Distinguished Professorship.
Author information
Authors and affiliations.
School of Life Sciences, Arizona State University, Tempe, AZ, 85287, USA
Philip W Hedrick
You can also search for this author in PubMed Google Scholar
Corresponding author
Correspondence to Philip W Hedrick .
Additional information
Competing interests.
The author declares there are no competing interests.
Authors’ contribution
PWH conceived and wrote this manuscript. The author read and approved the final manuscript.
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
Authors’ original file for figure 1
Authors’ original file for figure 2, rights and permissions.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
Reprints and permissions
About this article
Cite this article.
Hedrick, P.W. Resistance to malaria in humans: the impact of strong, recent selection. Malar J 11 , 349 (2012). https://doi.org/10.1186/1475-2875-11-349
Download citation
Received : 25 September 2012
Accepted : 18 October 2012
Published : 22 October 2012
DOI : https://doi.org/10.1186/1475-2875-11-349
Share this article
Anyone you share the following link with will be able to read this content:
Sorry, a shareable link is not currently available for this article.
Provided by the Springer Nature SharedIt content-sharing initiative
- Age of allele
- Sickle cell
- Thalassaemia
Malaria Journal
ISSN: 1475-2875
- Submission enquiries: [email protected]
An official website of the United States government
The .gov means it’s official. Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.
The site is secure. The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.
- Publications
- Account settings
The PMC website is updating on October 15, 2024. Learn More or Try it out now .
- Advanced Search
- Journal List
- Med Sci (Basel)
An Overview of Malaria Transmission Mechanisms, Control, and Modeling
Merveille koissi savi.
1 Center for Development Research ZEF, University of Bonn, Genscherallee 3, 53113 Bonn, Germany; [email protected] or ude.dravrah.hpsh@ivaskm
2 T.H. Chan Harvard School of Public Health, Boston, MA 02115, USA
Associated Data
All data are generated from the content of publications mentioned in this manuscript and they are equally available from the corresponding authors on reasonable request.
In sub-Saharan Africa, malaria is a leading cause of mortality and morbidity. As a result of the interplay between many factors, the control of this disease can be challenging. However, few studies have demonstrated malaria’s complexity, control, and modeling although this perspective could lead to effective policy recommendations. This paper aims to be a didactic material providing the reader with an overview of malaria. More importantly, using a system approach lens, we intend to highlight the debated topics and the multifaceted thematic aspects of malaria transmission mechanisms, while showing the control approaches used as well as the model supporting the dynamics of malaria. As there is a large amount of information on each subject, we have attempted to provide a basic understanding of malaria that needs to be further developed. Nevertheless, this study illustrates the importance of using a multidisciplinary approach to designing next-generation malaria control policies.
1. Introduction
Approximately 2000 children die daily from malaria, with 90% of the victims located in sub-Saharan Africa (SSA) [ 1 , 2 ]. Transmission and control of malaria are mediated by complex interactions and feedback loops among humans, mosquitoes, parasites, their environments, healthcare systems, and policy implementation at a given period of time [ 3 ]. Therefore, malaria transmission can be described as complex, nonlinear, and dynamic. The endemism of malaria in West Africa, for example, has huge economic impacts [ 2 , 4 ]. In Ghana, the cost of treating a single episode of malaria can reach 34% of a household’s yearly income [ 5 ]. As a result of infected people being unable to produce wealth, and as wealth is low, the risk of exposure to malaria increases. There is a negative relationship between national economic growth and malaria control expenditures [ 2 , 6 ]. As a result, malaria can be considered both as a cause of poverty and as a consequence of it [ 7 ].
Furthermore, past interventions have been ineffective as they were unable to account for the nonlinear nature of malaria infection. Particularly, the unexpected consequences of interventions such as mosquito resistance to chemical compounds and parasite resistance to drugs led to their abandonment. This was the case with the eradication control program that was abandoned in 1969 and the Global Malaria Control Strategy in 1992 [ 8 ]. Scientists have widely acknowledged the complexity of malaria, but a depiction of this complexity is weakly documented, and further limits the scope of malaria control.
This short review embracing several topics or thematic areas provides the reader with a piece of basic knowledge of malaria transmission, control, and modeling while illustrating its complexity. Therefore, the manuscript intends to be a didactical material, providing a brief overview of the disease, reducing information asymmetry, while arousing the curiosity of readers who can refer to more specific literature.
2. Materials and Methods
We conducted an unsystematic narrative review of online literature to gain a deeper insight into malaria transmission mechanism, control strategy, and modeling approaches. Thus, we adopted the following approach ( Figure 1 ).
Synthesis of the method. In gray, the final objective of the review is shown, in green, the investigated databases are indicated, in yellow, the exclusion process is indicated, and in red, the clustering of the selected papers is shown.
2.1. Inclusion and Exclusion Criteria for Paper Eligibility
Our search for peer-reviewed papers on malaria in the online scientific database was restricted to the period from 1950 to 2020. The rationale for this time window lies in the fact that the first malaria model was developed in the 1950s and since then, most of the recommendations for malaria control are supported by quantitative analysis. Besides, we excluded all duplicated letters, narrative reviews, case reports, and commentary. Given the tremendous research done on malaria, we limited the scope of the study to only peer-reviewed published papers in either French or English. This was to avoid evidence that may be challenging to replicate and biased translation. Within the topic of interest, we exclude all case reports which happen in non-endemic areas.
2.2. Search Strategy
We searched three databases PUBMED, Google Scholar, and MEDLINE for relevant literature on malaria from 1950 to 2020.
2.3. Screening of Studies and Evidence Extraction
Our screening process focuses on the title of the study published in peer-reviewed journals. Our next step was to review the papers’ abstracts and remove the studies that did not fit under the umbrella of the keywords.
2.4. Analysis and Gaps in the Analysis
To get a comprehensive understanding of malaria, we categorized the list of 77 eligible peer-reviewed papers into topics of interest, such as mechanisms, prevention and treatment, mathematical modeling, and perspectives on modeling. The topical organization of the collected papers allowed us to get a broad perspective of the state of the art.
We found 2,150,000 studies after the search. A total of 150,000 studies were obtained after removing duplicates. We included only 300 studies after screening both titles and abstracts. Furthermore, we removed 50 case reports, 146 studies in malaria non-endemic areas, and 16 reports including 10 thesis and 6 NGO reports.
Using the remaining 77 eligible studies, we provided an overview of each topic and complemented our narrative with WHO, CDC reports, and 10 complimentary readings. There were 23 studies supporting malaria transmission, 19 on prevention and treatment, 23 on mathematical modeling, and 12 on future research on malaria modeling ( Figure 2 ).
Overview of the thematic distribution of the selected papers.
3.1. Malaria Transmission Mechanism
Malaria infection is caused by interactions between humans, mosquitoes, and the pathogen ( Figure 3 ). Female mosquitoes change during the egg-laying period when they experience a deficiency in protein and plant sucrose necessary for egg maturation. Consequently, they blood-feed on humans and can infect them after effective biting where infectious protozoans (of the genus Plasmodium ) are injected into the human bloodstream [ 9 , 10 ].
Complex interactions between humans, mosquitos, and parasites explaining the persistence of malaria. Plasmodium parasites, mosquitoes, and humans constitute three subsystems that interact with one another. The green cycle represents the human sub-system (SS). In addition to the micro-environment, which includes housing conditions, network connectivity, socioeconomic conditions, and the environment within the host (i.e., the individual’s genetic makeup), this SS is characterized by the social environment (social interaction, culture, and behavior regarding malaria risk-taking). The red indicates the SS of mosquitoes defined by the growth conditions favored by humans using or not either ITNs or IRS, and their adaptation to their living environment, including the harsh environmental conditions of cities, such as dirty water and weather conditions. In purple, Plasmodium parasite SS is characterized by interactions with the environment inside the host that may drive resistance to malaria drugs.
Female mosquitoes most found in West Africa belong to the Anopheles Giles genus [ 11 ]. Anopheles gambiae ( sensu stricto ) prefers to use human blood (anthropophilic) to complete its gonotrophic cycle, bites primarily indoors (endophagic), and rests indoors (endophilic). As far as its feeding habits are concerned, An . gambiae is the major malaria vector in West Africa [ 12 ]. In addition, there are other vectors, such as An. arabiensis , An. funestus and An. melas able to carry P. falciparum . Similar to An. gambiae , An. funestus is anthropophilic, endophagic, and endophilic [ 4 , 13 ]. On the other hand, An. arabiensis is zoophilic (prefers blood-feeding animals), exophilic (bites outdoors), and exophagic (rests outdoors) [ 11 ]. An. melas is also anthropophagic, exophagic, and exophilic [ 12 ]. Therefore, P. falciparum adapts well to Anopheles species, increasing malarial vulnerability in the host. The wide range of vector conditions, feeding habits, and geographical heterogeneity in malaria transmission make malaria modeling even more challenging [ 14 ].
A mosquito’s ability to adapt to environmental conditions such as humidity and temperature, is influenced by its sensitivity to those conditions. This adaptation can take the form of a change in the life cycle length, or the biting efficiency [ 15 , 16 , 17 , 18 , 19 ]. As a long-term study carried out in Kenya (1999–2010) has shown, vectors’ feeding habits and effectiveness can change over time [ 20 ].
The transmission intensity may differ significantly across host populations because vectors’ attraction to the host varies [ 21 ]. This variation is elicited by the difference in human bodies’ microflora and their reaction to mosquito bites [ 22 ]. For instance, an infected host is more attractive to other mosquitoes than a non-infected one [ 23 ]. Likewise, mosquitoes prefer the scent of children to that of the adults [ 21 , 24 ].
In SSA, infectious protozoans are, Plasmodium vivax, P. ovale, P. malariae, P. knowlesi, and P. falciparum where P. falciparum accounts for 95% of malaria cases [ 25 , 26 ]. P. falciparum ’s complete life cycle consists of an incubation period of 10 days in the vector and a period of 7 to 20 days in the human host [ 27 ]. When P. falciparum protozoans are injected into a host’s bloodstream via the vector saliva, the protozoan completes its life cycle. After entering the host bloodstream, the sporozoites (asexual protozoans) move into the liver, where they penetrate the hepatocytes (liver cells). In the host hepatocytes, the parasite divides asexually several times, a process known as schizogony [ 28 ]. Eventually, the hepatocytes break, releasing merozoites which infest red blood cells in the host bloodstream. The parasites can diverge into gametocytes after repeated divisions of the red blood cells [ 29 ]. Malaria can cause symptoms such as fever, chills, and sweat before and during the differentiation of P. falciparum into sexual stages. Even so, some hosts may not exhibit clinical symptoms and are considered asymptomatic [ 30 ]. Whenever mosquitoes bite infected people (symptomatic and asymptomatic), they primarily suck out the gametocytes [ 29 ]. Detecting asymptomatic individuals in endemic areas can be challenging, and mass testing may not always be cost-effective in detecting such individuals [ 31 ].
3.2. Malaria Prevention and Treatment
Due to recursive interactions between humans, mosquitoes, and parasites that cause malaria transmission, existing control methods might not be effective in the sense that they target either mosquitoes or parasites. Mosquitoes have been controlled mainly using the obliteration of larvae and adult stages while reducing the entomological inoculation rate. These controls include physical, biological, or chemical changes to the vector’s environment. As part of the physical modification, breeding sites are removed, and the sources of larvae are managed using drainage and weeding [ 32 ]. However, this method requires a constant working force needed for weeding and drain cleaning which led SSA governments to abandon it although it is among the most effective [ 33 ]. Chemical modification of larval environments includes the application of larvicides and insecticides. The downside of this method is that it is also costly, and mosquitoes often become resistant to insecticides [ 34 ]. Natural enemies of mosquito larvae, such as larvivorous fish and bacteria (e.g., Bacillus thuringiensis ), are used in the biological method. Nevertheless, this process is hindered by the temporary persistence of natural enemies [ 32 ].
The Global Malaria Action Plan (GMAP) and the World Health Organization (WHO) recommended the use of indoor residual spray (IRS) and insecticide-treated bed nets (ITNs) for controlling mosquito adult populations in the 2010s across Africa [ 35 ]. The main chemical component of ITNs and IRS is synthetic pyrethroid, a lethal compound, that repels mosquitoes, remains in the environment, and is harmless to mammals. From 2005 to 2010, malaria cases in Ghana declined by 41% due to the successful use of ITNs and IRS [ 36 ]. The residual effects of synthetic pyrethroids in the environment resulting from IRS, ITNs, and agricultural pest management force mosquitoes to develop coping mechanisms [ 37 ]. These include indoor and outdoor spray avoidance mechanisms, earlier biting, and insecticide resistance [ 38 ].
The control of malaria parasites is achieved through three lines of defense: antimalarial drugs, seasonal prevention, and vaccination. The first line of the curative measures adopted by GMAP is the Artemisinin-based Combination Therapies (ACT). ACTs are highly potent against P. falciparum with a higher clearance rate and symptoms resorption than other curative therapies [ 39 ]. Nevertheless, P. falciparum has progressively developed ACT resistance [ 40 ]. Malaria parasite resistance in SSA is exacerbated by using counterfeit malaria drugs [ 41 , 42 ]. Another preventive measure is intermittent prevention, which involves administering single-dose malaria therapy to pregnant women [ 43 ]. The treatment includes the combination of primaquine, sulfadoxine-pyrimethamine, amodiaquine, methylene blue, and dihydroartemisinin-piperaquine which have been proven to be able to prevent the transmission of P. falciparum [ 44 ]. A seasonal vaccine “RTS, S/AS01” is proven effective for children under 5 years old for a short period with an efficacy rate of 82% [ 45 , 46 ]. The authors demonstrated that RTS, S/AS01 induces an immune response to the proteins of infected sporozoites. Vaccine components protect against clinical malaria. The effectiveness of its protective action can wane over time and with advancing age. Particularly, children between the ages of 6 to 12 weeks benefit from the vaccine more than children between the ages of 5 to 17 months.
A less explored line of malaria treatment consists of phytotherapy. For example, a study on phytochemicals identified more than 20 local plants and herbs in central Africa that can heal and prevent malaria transmission with almost zero risks of P. falciparum developing resistance against these natural compounds [ 47 , 48 , 49 ]. According to the authors, the synergistic interplay between a wide range of chemical substances in the plants enables treating malaria holistically while preventing the development of resistance [ 48 , 50 ].
3.3. Mathematical Modeling of Malaria
Mathematical models developed to gain insight into disease transmission have influenced past and present interventions to prevent or treat diseases. Mathematical modeling of malaria transmission began in 1911 with the Susceptible-Infectious-Removed model (SIR), which compartmentalized the population of hosts and vectors into three groups [ 51 , 52 ]. The compartments were denoted susceptible (S), referring to the population likely to become infected, infectious (I), composed of the fraction of the infected population, and removed/ recovered (R), accounting for the fraction of the population that either died or healed from the disease. Several assumptions were assumed in the SIR model, including a closed and finite population, a homogeneous mosquito bite rate, and a well-mixed population. Although the Ross model (SIR) was unable to adjust to the new incidence data due to its limited predictability, SIR provides insights into the intricate relationship between infected hosts and mosquito densities. Because Ross’s model was theoretical and simulation-based, it failed to document proof for campaigns for mosquito eradication.
Subsequently, George Macdonald complemented the SIR model [ 51 ] by feeding the model with real data and embedding an additional compartment for the latency period between the mosquito bite and the onset of symptoms denoted Exposed (E). Additionally, Macdonald’s study argued for eradicating mosquitoes through a massive campaign based on the theory of superinfection [ 53 ]. Consequently, between 1955 and 1969, WHO implemented widespread and rigorous mosquito eradication campaigns as part of the Global Malaria Eradication Program (GMEP). In these campaigns, the pesticide Dichlorodiphenyltrichloroethane (DDT) was successfully used to control malaria in major European and American countries [ 54 ]. The GMEP, however, failed to eradicate malaria worldwide due to mosquito resistance [ 55 ]. Apart from mosquito insecticide resistance, the absence of basic healthcare services, high malaria transmission intensity, and other socio-ecological factors hindered the success of eradication campaigns in the SSA [ 56 ].
The mathematical epidemiology of malaria has evolved steadily in recent decades from “toy models” (which were not realistic but captured the key features of the disease) to “high-level models” (which are precise and sacrifice generality) [ 57 ]. As such, complex models display common features such as the interaction between many components [ 58 ]. For example, an earlier partial differential model split the infected population into age-dependent infections in the SIR framework [ 59 ].
An important metric that summarizes the transmission of a disease is the reproductive number (R 0 ). R 0 represents the expected number of infected human hosts after effective mosquito bites in a fully susceptible population [ 60 , 61 , 62 ]. Thus, R 0 provides a measurement for the intensity of transmission and contributes to the definition of disease-endemic areas when it is greater than one (R 0 > 1) [ 63 ]. Heterogeneity was therefore incorporated in populations for the computation of R 0 . These include, for example, the age structure of the host [ 64 ], migration of the vectors and hosts [ 65 ], host beliefs and practices [ 66 ], and host income classes [ 67 , 68 ]. R 0 also varies with the degree of complexity introduced into the compartmentalization of the population [ 66 ]. The more the modeling framework embodies the heterogeneity in the host population, the closer to real-life transmission dynamics the value obtained for R 0 becomes [ 63 , 68 , 69 , 70 ]. The pioneering models that provided insight into malaria dynamics and its controls were unrealistic for various reasons, and maximum control has hardly been achieved in [ 56 ]. Most specifically, a large R 0 indicates a higher density of mosquitoes that reduces the effectiveness of ITN in a large and likely heterogeneous host population [ 63 ].
Many mathematical models were developed combining the compartments S, E, I, and R (e.g., SIR, SIS, SI, SIRS, SEIR, SEIRS, SEI) and included the heterogeneity of the population (meta-population modeling approaches) [ 69 ]. Most of these models do not capture the dynamics of subpopulations because they tend to assume large and homogeneous populations that do not take into account the existence and specific characteristics of subpopulations [ 71 , 72 ]. Another issue is that they fail to account for host behavior, which influences the disease’s dynamics and control [ 73 , 74 ]. Besides, compartmental models are either knowledge or data-driven (e.g., Ross model, Macdonald models). The models cannot address the complex interactions between determinants that guide the disease dynamics in both cases. For example, socioeconomic determinants interfere with the parameters generally considered in the calculation of R 0 such as mortality, mobility, and birth rate [ 66 ]. Thus, models that capture, realistically, the transmission process of malaria are still missing.
3.4. Malaria Modeling: Present and Future
The ability to track epidemics became more accurate with the advancement in computing power and the availability of data. The deterministic framework uses in this context provides insight into disease dynamics, but usually fails to take uncertainties into account. Stochastic models, on the other hand, incorporate randomness that may occur during the epidemic and, most importantly, accounts for a large range of uncertainty that occurs during epidemics. For example, a stochastic model was used to predict the complex transmission pattern concerning environmental changes [ 75 ]. Like deterministic models, the stochastic models help to disclose interesting features of the transmission such as the impact of the network structure, and the properties of an outbreak most specifically get insights into the rate of the surge of vector-borne diseases in a new region in a context of climate change [ 76 , 77 ]. Furthermore, both stochastic and deterministic models are population-based and often fail to consider the feature of individuals which alternatively the individual-based models (IBMs) do [ 78 ]. IBMs can account for heterogeneity across individual agents and spatial gradients [ 79 , 80 ]. However, parametrizing them remains a daunting task, especially due to the lack and accuracy of data [ 81 ]. Beyond meta-population modeling (Ross-based models), within-host modeling opened new possibilities to address malaria infectiousness while examining the interplay between hosts’ immune systems and the dynamics of the parasite. However, modeling the superinfection of malaria while tracking each parasite strain and incorporating the diverse genetic makeup of the parasites remains extremely challenging, especially in terms of the model parameterization [ 82 , 83 ]. Most recently, the introduction of a new type of data such as human mobility collected from data providers (Facebook, Twitter, Google, etc.) refined the field of epidemiological surveillance. These data have been used to pinpoint malaria hotspots and predict their spread [ 84 , 85 , 86 ]. The use of fine-grained mobility data opens a new prospect of modeling and documenting evidence for policy recommendations; however, privacy concerns need to be addressed [ 87 ].
4. Discussion
Malaria remains a major health challenge in Sub-Saharan Africa, despite extensive research. Several factors contribute to malaria transmission, including socio-ecological factors and changes in the human host and vector behavior, which may be channeled by parasite genetic changes. Considering malaria’s complex nature, modeling it presents numerous challenges. As an example, it can be difficult to accurately parameterize a model with a compartment of asymptomatic individuals as there is a lack of reliable historical data. However, evidence suggests that malaria symptoms could progress rapidly from none to life-threatening while leading to a lifetime burden of cognitive impairments for some patients. It is thus necessary to develop a strategy that allows for the production at a low cost of reliable primary data on asymptomatic people with the capability of identifying infectious Plasmodium [ 88 ]. Additionally, the review highlighted that for each of the thematic areas discussed, there is a web of determinants that can be studied, using different perspectives including the use of a multidisciplinary lens to examine clinically relevant malaria issues.
A historical approach was used to illustrate the evolution of the malaria model from simplistic to more elaborate models. Moreover, this review outlines the challenges of modeling, where, for instance, introducing mobile data to a mathematical framework may pose a problem of re-identification. The number and scale of parameters involved in a complex modeling approach make model parameterization challenging.
Finally, this review demonstrates that malaria can efficiently be controlled by combining several strategies. Among them are constant awareness and education campaigns in communities at risk and among individuals whose resistance to non-pharmaceutical interventions (such as ITNs) can hinder malaria control efforts.
Given the wide range of thematic areas discussed, we acknowledge that we have only scratched the surface of the various cross-cutting topics. Consequently, we did not elaborate on the host immune response to malaria nor on technological advances in diagnosing and treating malaria since they are widely covered elsewhere [ 89 , 90 , 91 , 92 , 93 , 94 ]. Additionally, this paper faces two challenges, namely (1) a self-selection bias that might lead to a more prominent emphasis on some cross-cutting topics, and (2) not considering recently published papers [ 95 ]. Nonetheless, this review provides readers with an overview of malaria from a global perspective.
Acknowledgments
The author wants to thank Christian Borgemeister, Daniel Callo-Concha, Lauren Childs, Deguenon Jean Marcel, Henri Tonnang, and Joshua Ntajal for their edits, comments, and suggestions. In addition, we would like to extend our gratitude to the Center for Development Research (ZEF), Germany, for providing us with the logistics required for this review.
Abbreviations
ACT | Artemisinin-based Combination Therapies |
DDT | Dichlorodiphenyltrichloroethane |
GMAP | Global Malaria Action Plan |
GMEP | Global Malaria Eradication Program |
IBM | Individual-based models |
IRS | Indoor residual spraying |
ITN | Insecticide-treated bed nets |
R | reproductive number |
SSA | Sub-Saharan Africa |
SIR | Susceptible Infectious Removed model |
WHO | World Health Organization |
Funding Statement
This research received no external funding.
Institutional Review Board Statement
This study was approved by the Center for Development Research (ZEF) ethics committees, University of Bonn.
Informed Consent Statement
Not applicable.
Data Availability Statement
Conflicts of interest.
The author declares no competing interests.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
IMAGES
VIDEO
COMMENTS
In 1954, Allison proposed that hemoglobin S (HbS) gene causes protection against fatal malaria. This would explain the high HbS gene frequency observed in certain regions hyperendemic...
Malaria is caused in humans by five species of single-celled eukaryotic Plasmodium parasites (mainly Plasmodium falciparum and Plasmodium vivax) that are transmitted by the bite of Anopheles...
The malaria hypothesis predicts local, balancing selection of deleterious alleles that confer strong protection from malaria. Three protective variants, recently discovered in red cell genes, are indeed more common in African than European populations.
The malaria hypothesis predicts local, balancing selection of deleterious alleles that confer strong protection from malaria. Three protective variants, recently discovered in red cell genes, are indeed more common in African than European populations.
In 1954, Allison proposed that hemoglobin S (HbS) gene causes protection against fatal malaria. This would explain the high HbS gene frequency observed in certain regions hyperendemic for malaria, so-called “malaria hypothesis”.
The high mortality and widespread impact of malaria have resulted in this disease being the strongest evolutionary selective force in recent human history, and genes that confer resistance to ...
Introduction. Malaria remains a disease of global health importance with 3.3 billion people in 97 countries at risk, leading to an estimated 200 million cases and around 600,000 deaths (WHO, 2015).
Malaria resistance genes comprise some of the most widely accepted examples of strong positive selection in humans. Overall, the original “malaria hypothesis” of Haldane that diseases like thalassaemia are polymorphisms because of resistance to malaria, has been proven correct.
The malaria hypothesis predicts local, balancing selection of deleterious alleles that confer strong protection from malaria. Three protective variants, recently discovered in red cell genes, are indeed more common in African than European populations.
Malaria remains a major health challenge in Sub-Saharan Africa, despite extensive research. Several factors contribute to malaria transmission, including socio-ecological factors and changes in the human host and vector behavior, which may be channeled by parasite genetic changes.