How homologous recombination generates a mutable genome

Hurles, Matthew

doi:10.1186/1479-7364-2-3-179

Review
Published: 01 September 2005

How homologous recombination generates a mutable genome

Matthew Hurles¹

Human Genomics volume 2, Article number: 179 (2005) Cite this article

2212 Accesses
11 Citations
Metrics details

Abstract

Recombination and mutation have traditionally been regarded as independent evolutionary processes: the latter generates variation, which the former reshuffles. Recent studies, however, have suggested that allelic recombination influences the underlying mutation rate, as high mutation rates are inferred in regions of high recombination. Furthermore, recombination between duplicated sequences introduces structural variation into the human genome and facilitates the formation of clustered gene families. Comparisons of wholegenome sequences reveal the expansion of gene family clusters to be an important mode of genome evolution. The negative aspect of this genomic dynamism is the contribution of these rearrangements to genetic diseases.

Introduction

Homologous recombination (HR) is one of the fundamental mechanisms of DNA processing which, in various guises, is found in all phyla of life [1, 2]. HR is capable of playing several distinct roles within an individual organism. In sexually reproducing species, meiotic HR is a carefully regulated process that occurs at a defined stage of differentiation in specific cell types. By contrast, in the same species, HR also operates as a major mechanism of DNA repair in all cell types at all times. HR has been clearly co-opted for different functions throughout its deep evolutionary history. Similarly, HR has been exploited as a laboratory tool for, among other applications, genetic engineering in model organisms [3]. The role of HR in DNA repair [4], somatic mutation [5] and chromosomal engineering has been reviewed elsewhere; this paper will focus on meiotic HR and the recent studies demonstrating its impact on the mutability of mammalian genomes.

Evolutionary geneticists have traditionally regarded mutation and recombination (along with selection and genetic drift) as relatively independent 'forces of evolution': while the former generates variation, the latter reshuffles existing variation into novel combinations. These ideas were formulated before DNA was identified as the molecule of inheritance [6–8], however, and well before any understanding two of the molecular mechanisms of mutation was gained. Recent comparative analyses of whole-genome sequences [9–12] give a deeper appreciation of the distinct mutational mechanisms operating to shape genomes over evolutionary timescales. The mutability of any genome can be considered to be the summation of the effects of the distinct mutational mechanisms that operate in that genome. These impacts can be quantified in terms of the rate of each mutational mechanism, the number of bases involved in the resultant mutation and the number of susceptible sites within the genome. Figure 1 displays the major mutational mechanisms operating in mammalian genomes and demonstrates that both in their rate and the size of the resultant genomic alteration vary widely. The distinction between recombination and mutation described above becomes blurred by the involvement of HR in a number of these mutational processes. There is ongoing discussion about the relationship between allelic HR and single nucleotide polymorphism (SNP; see Hellmann et al. [13] and Nachman [14], and section below, for example). Furthermore, HR between non-allelic (duplicated) sequences has been demonstrated to be an important mode of both pathogenic mutation [15] and genome evolution [16]. These duplicated sequences can exist in tandem arrays or in dispersed repeats [17]. HR is the predominant mutational mechanism operating in polymorphic tandem repeat arrays with repeat units longer than five base pairs (bp) [18], including minisatellites and ribosomal DNA arrays, whereas replication slippage operates on arrays of shorter repeat units. These two influences of HR on shaping genomic variation will be considered in turn, but first one should consider the mechanism of HR and the distribution of allelic HR throughout the human genome.

Mechanism and genomic distribution of HR

The specifics of the multifarious protein-DNA interactions that underpin HR are beyond the scope of this paper (but have been reviewed by West [1]). It is sufficient to note that HR is initiated by a DNA double-strand break (DSB). This break is subsequently processed and then invades a homologous acceptor sequence [29]. After further processing, an intermediate is formed that can be resolved in one of two ways: a crossover results in the reciprocal splicing of the donor sequence to the acceptor sequence, whereas a gene conversion results in the non-reciprocal transfer of a short tract of sequence between the sequences (Figure 2). A crossover results in a change of phase of flanking markers on either side of the crossover, whereas a gene conversion is only observable if it encompasses a variant site between the homologous sequences. The ratio of these two outcomes is poorly characterised, although recent empirical [30] and statistical analyses [21] seem to indicate that gene conversion is the more frequent outcome.

The distribution of allelic HR throughout the human genome is likely to be extremely heterogeneous on the fine (kilobase [kb]) scale, although on the coarse scale (tens of megabases), the broad pattern is one of 1.6-fold more recombination events in females than in males, depressed recombination near the centromere and increased recombination in subtelomeric regions [33]. On the fine scale, both empirical studies of recombination in sperm [34] and statistical analysis of patterns of variation in populations [35, 36] indicate the widespread existence of hotspots of recombinatorial activity that can be orders of magnitude more active within an interval of 500 bp to 1 kb than in flanking 'cold' sequences. Population genetics theory suggests that these HR hotspots are likely to be short lived because the dynamics of the HR process are such that recombinogenic variants are doomed to be preferentially gene converted out of existence [37, 38]. This prediction appears to have been bolstered by the recent observation that recombination hotspots are not conserved over the short evolutionary distance that separates humans and chimpanzees [39, 40]. An absence of shared sequence motifs between known recombination hotspots [35] suggests that epigenetic mechanisms might be involved in the inheritance of recombinatorial activity at these locations.

The number of recombination events per meiosis seems to vary significantly both between gametes and between healthy individuals [41]. Intriguingly, mothers with higher rates of recombination tend to have greater reproductive success [42], which would suggest that selection on the dynamics of allelic HR is ongoing.

Allelic HR and sequence diversity

There is, on average, higher sequence diversity in regions of higher recombination rate [14]. It has also been demonstrated that there is a similar correlation between recombination and divergence in comparisons between human and mouse [43] and human and chimpanzee [13]. This correlation need not be explained by a causative relationship between the two (ie HR being mutagenic), although it has been suggested that errors in the repair of DSBs that initiate HR could increase the mutation rate. The hypothesis that recombination is itself mutagenic contrasts with the observation that recombination hotspots are short-lived evolutionary phenomena. Although, genomic location (ie proximity to telomeres) is more evolutionarily stable and plays a role in patterning large-scale recombination activity. Patterns of fine-scale recombination rate in humans, however, may be a poor predictor of the recombinatorial landscape in which sequences have evolved on both human and mouse lineages since they last shared a common ancestor.

Selection has also been invoked to explain the relationship between recombination and diversity. The rationale is that positive (hitchhiking) or negative (background) selection at linked loci is expected to reduce diversity and that, by breaking down linkage, recombination can release neighbouring markers from this selection-induced reduction in diversity [44, 45]. It has been argued, however, that the correlation between recombination and divergence suggests that a neutral, rather than a selective, explanation is more likely. In addition, the relationship between recombination and diversity shows no correlation with gene density [43], which further argues against selection.

It is possible that regions of higher recombination rate are likely to have an elevated mutation rate, not because recombination is itself mutagenic, but because the same genomic features elevate both. The best candidate appears to be GC content, which is positively correlated with recombination rate [46]. Indeed, HR is thought to be GC biased -- that is, gene conversion will preferentially repair a mismatched base pair to a G-C rather than an A-T [47, 48]. As a consequence, a high recombination rate might be expected to lead to a high GC content over time. Correspondingly, the approximately tenfold greater mutability of the CpG dinucleotide [19] over other dinucleotides suggests that mutation should also be elevated in regions of high GC content. Once GC content is taken into account, much of the association between recombination and mutation at larger scales disappears, although a fine-scale correlation of diversity and recombination rate persists [13].

Non-allelic HR between duplicated sequences

Much of the recent interest in non-allelic HR has been caused by the observation from the Human Genome Project that almost half of the human genome is duplicated elsewhere in the genome [49–51]. Approximately 5-6 per cent of the human genome can be found in > 1 kb blocks of > 90 per cent sequence similarity to other locations in the genome (known as segmental duplications) [50, 51]. Furthermore, about 42 per cent of the genome is accounted for by families of dispersed repetitive elements -- short interspersed nuclear elements (SINEs), long interspersed nuclear elements (LINEs and human endogenous retroviruses (HERVs)-- [9]. While these two classes of duplicated sequence are typically considered separately in evolutionary analyses, it is clear that non-allelic HR can occur within both sets of duplicated sequence [52, 53].

Multiple mechanisms account for the origins of these duplicated sequences. Families of dispersed repeats seem to populate the genome in bursts of infectious activity. Non-allelic HR between these small dispersed repeats can in turn lead to much larger segmental duplications [54]. It has been suggested that the rapid populating of an ancestral primate genome with Alu elements facilitated a recent burst of segmental duplication [54] approximately 40 million years ago. It also appears, however, that physical fragility of the DNA sequence additionally plays a role in generating segmental duplications [55].

Non-allelic HR, similarly to allelic HR, can result in crossovers (sometimes known as unequal crossovers) and gene conversion events and is characterised by the existence of hotspots of activity [56]. The duplicated substrates for non-allelic HR can be on the same or non-homologous chromosomes, although it appears that intrachromosomal interactions are much more frequent [57]. Crossovers promoted by non-allelic HR generate rearrangements; the precise structural change depends on the orientation of the duplicated sequences: repeats in direct orientation promote deletions and duplications, whereas repeats in inverted orientation sponsor inversions [15]. Copy number changes sponsored by non-allelic HR need not simply involve deletion or duplication of single copy sequence but may also include dramatic variation in the copy number of tandemly duplicated arrays; for example, an individual X chromosome may carry between one and nine copies of the X-linked opsin genes [58]. The prevalence of these tandem arrays in the human genome has yet to be systematically characterised; however, genes known to exist in polymorphic arrays include those for amylase, alpha-defensins, beta-defensins, opsins, CYP2D6, TSPY, globins, rDNA and histones.

Concerted evolution describes the observation that duplicated sequences appear to be more closely related within species than they are to their orthologues in related species [59]. Concerted evolution can arise from both gene conversion and multiple rounds of unequal crossovers (Figure 3). Gene conversions transfer sequence between duplicated substrates, which can lead to the homogenisation of a family of repeats. Concerted evolutionary processes can thwart attempts to date duplication events that equate sequence similarity between two duplicated sequences to the time since duplication [60, 61]. By homogenising duplicated sequences, gene conversion causes such analyses to underestimate the age of the duplication event; many duplications are older than they might first appear. While concerted evolution was initially characterised in tandemly arrayed gene families (eg ribosomal DNA, globins, opsins), it has more recently been observed in interspersed duplications [62].

As with allelic HR, it has been suggested influence that non-allelic HR sequence diversity and divergence in duplicated sequences. There have been conflicting reports of the direction of this influence. Analysis of the long, almost identical inverted repeats on Yq has indicated a significantly lower sequence divergence between humans and chimpanzees within the repeats than in flanking single copy sequence [63]. This difference was attributed to high levels of gene conversion operating between the inverted repeats repressing sequence divergence between orthologous sequences. By contrast, analyses of the duplicated HERV sequences that promote the AZFa deletion (also on Yq) have revealed elevated sequence divergence within known non-allelic HR hotspots [64] and increased sequence diversity flanking the hotspot [65]. Simulations of the gene conversion process suggest that elevated sequence diversity and orthologous divergence is to be expected when duplicated sequences are themselves slightly differentiated [64]. It remains to be seen whether these observations can be generalised to the entire genome, but the enrichment of apparent SNPs (in the dbSNP database of sequence variation) within segmental duplications [66] and the observation of the gene conversion process operating on other chromosomes is highly suggestive [67].

Evolutionary benefits of having a duplicated genome

Gene duplication and divergence has long been prophesised to be the major mechanism by which novel gene functions arise [68]. Once a gene has been duplicated, selective constraints are relaxed and there are several mechanisms by which the duplicates can diverge in function while fulfilling the role of the ancestral gene (reviewed by Hurles [69]). The widespread existence of gene families pays testament to the importance of gene duplication in evolution. Comparative whole-genome sequence analysis now gives a complete picture of how genomes adapt to novel environments. Comparisons between human and rodent and mammalian and avian genomes [10–12] have implied the importance of lineage-specific expansions of particular clustered gene families, although greater effort is required, on a locus by locus basis, to discount the role of neutral processes in the origins of these structures. The clustering of these genes implicates non-allelic HR, both in their origins and in their patterns of sequence evolution. These lineage-specific expansions are often of gene families involved in sensory perception, toxin metabolism, immune response and reproduction [70]. These gene functions are also observed in single copy genes that show evidence of recent positive selection [71], suggesting that these functions are among the most important for rapid adaptation to novel environments. Interestingly, many pathogens also exhibit clusters of genes involved in antigenic variation; it appears that non-allelic HR is an important mutational mechanism operating in the ongoing arms race between pathogens and immune systems. The relatively high mutation rate of non-allelic HR, compared with that of sequence evolution, is probably an important factor underlying this observation.

Non-allelic HR and disease

As with all mutational processes, non-allelic HR generates variation that is subject to natural selection. While this variation can confer evolutionary benefits, as described above, it can also cause disease. Both unequal crossovers and gene conversions between duplicated sequences have pathogenic potential. A growing number of genetic diseases (Table 1) have been recognised to be caused by deletions and duplications of dosage-sensitive genes, inversions disrupting genic structures and gene conversions ablating normal gene function [15]. Disease-causing rearrangements have been identified within tandemly duplicated gene arrays, as well as between interspersed duplicates. While non-allelic HR is not the sole cause of structural variation in the human genome, HR between tandemly duplicated arrays, between segmental duplications and between dispersed repeats has been demonstrated to be a major cause of these mutations.

Table 1 Examples of diseases caused by non-allelic HR.

Full size table

Perhaps the most common outcome of gene duplication is that one of the copies acquires mutations that may render it non-functional. Not only is the potential for evolving a novel function lost, but this pseudogene now contains a reservoir of mutations that can be gene converted into the remaining functional gene [72–74]. The prevalence of pseudogenes in the human genome suggests that many genes will have associated pseudogenes [75].

While much attention has focused on the role of non-allelic HR in genetic disorders with Mendelian inheritance patterns, little effort has been devoted to investigating its role in the genetics of complex diseases. This is despite the longstanding existence of examples of the role of rearrangements in more complex phenotypes such as drug response [76] and resistance to infectious diseases such as malaria [77]. More recently, elevated copy number of a segmental duplication containing the gene CCL3L1 has been shown to protect against HIV/AIDS [78]. In addition, a chromosomal inversion with a convoluted evolutionary history has been demonstrated to confer a selective advantage in recent generations of the Icelandic population [79]. The lack of studies investigating structural polymorphism and complex traits has perhaps been due to an underestimation of the degree of structural variation in the human genome. Recent studies demonstrate that there is much more large-scale copy number variation than was previously thought to exist and also point towards methods that can redress this under-ascertainment in a systematic fashion [80–82].

Conclusions

The mutagenic potential of non-allelic HR was identified early in the history of molecular genetics, yet, due to the difficulty of experimentally interrogating duplicated sequences, a fuller appreciation of its evolutionary and pathogenic roles has had to await the publication of wholegenome sequences. Clearly, there are both costs and benefits to having a highly duplicated, and therefore mutable, genome.

Despite recent advances, non-allelic HR remains perhaps the most poorly characterised mutation process in the human genome. While the human genome reference sequence provides a reasonable understanding of where non-allelic HR is likely to occur, little is known about the rates of these processes and how they vary between individuals and over evolutionary time-scales. It is worth noting that variation in the frequencies of chromosomal rearrangements along different evolutionary lineages need not reflect the degree of duplication in an ancestral genome, but might result from specific demographic histories (eg population bottlenecks) that have transiently favoured the fixation of chromosomal rearrangements.

Given the role that non-allelic HR appears to have played in the rapid adaptation to novel environments observed within mammalian genome comparisons, it will be of great interest to investigate the genomic changes that must have accompanied the adaptation of different groups of humans to the wide range of environments that our species presently occupies. The recent work on the CCL3L1-containing segmental duplication described above illustrates how a reservoir of structural variation allows a rapid response to the new selective environment posed by a novel human pathogen [78].

References

West SC: Molecular views of recombination proteins and their control. Nat Rev Mol Cell Biol. 2003, 4: 435-445. 10.1038/nrm1127.
Article CAS PubMed Google Scholar
Bhattacharyya MK, Norris DE, Kumar N: Molecular players of homologous recombination in protozoan parasites: Implications for generating antigenic variation. Infect Genet Evol. 2004, 4: 91-98. 10.1016/j.meegid.2004.01.008.
Article CAS PubMed Google Scholar
Thomas KR, Capecchi MR: Introduction of homologous DNA sequences into mammalian cells induces mutations in the cognate gene. Nature. 1986, 324: 34-38. 10.1038/324034a0.
Article CAS PubMed Google Scholar
Dudas A, Chovanec M: DNA double-strand break repair by homologous recombination. Mutat Res. 2004, 566: 131-167. 10.1016/j.mrrev.2003.07.001.
Article CAS PubMed Google Scholar
Bishop AJ, Schiestl RH: Role of homologous recombination in carcinogenesis. Exp Mol Pathol. 2003, 74: 94-105. 10.1016/S0014-4800(03)00010-8.
Article CAS PubMed Google Scholar
Wright S: Evolution in Mendelian populations. Genetics. 1931, 16: 97-159.
PubMed Central CAS PubMed Google Scholar
Haldane JBS: The Causes of Evolution. 1932, Longmans and Green, London, UK
Google Scholar
Fisher RA: The Genetical Theory of Natural Selection. 1930, Clarendon Press, Oxford, UK
Chapter Google Scholar
IHGSC: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.
Article Google Scholar
ICGSC: Sequence and comparative analysis of the chicken genome provide unique perspectives on vertebrate evolution. Nature. 2004, 432: 695-716. 10.1038/nature03154.
Article Google Scholar
MGSC: Initial sequencing and comparative analysis of the mouse genome. Nature. 2002, 420: 520-562. 10.1038/nature01262.
Article Google Scholar
RGSPC: Genome sequence of the Brown Norway rat yields insights into mammalian evolution. Nature. 2004, 428: 493-521.
Article Google Scholar
Hellmann I, Ebersberger I, Ptak SE, et al: A neutral explanation for the correlation of diversity with recombination rates in humans. Am J Hum Genet. 2003, 72: 1527-1535. 10.1086/375657.
Article PubMed Central CAS PubMed Google Scholar
Nachman MW: Single nucleotide polymorphisms and recombination rate in humans. Trends Genet. 2001, 17: 481-485. 10.1016/S0168-9525(01)02409-X.
Article CAS PubMed Google Scholar
Stankiewicz P, Lupski JR: Genome architecture, rearrangements and genomic disorders. Trends Genet. 2002, 18: 74-82. 10.1016/S0168-9525(02)02592-1.
Article CAS PubMed Google Scholar
Eichler EE, Sankoff D: Structural dynamics of eukaryotic chromosome evolution. Science. 2003, 301: 793-797. 10.1126/science.1086132.
Article CAS PubMed Google Scholar
Achaz G, Netter P, Coissac E: Study of intrachromosomal duplications among the eukaryote genomes. Mol Biol Evol. 2001, 18: 2280-2288. 10.1093/oxfordjournals.molbev.a003774.
Article CAS PubMed Google Scholar
Jeffreys AJ, Barber R, Bois P, et al: Human minisatellites, repeat DNA instability and meiotic recombination. Electrophoresis. 1999, 20: 1665-1675. 10.1002/(SICI)1522-2683(19990101)20:8<1665::AID-ELPS1665>3.0.CO;2-L.
Article CAS PubMed Google Scholar
Nachman MW, Crowell SL: Estimate of the mutation rate per nucleotide in humans. Genetics. 2000, 156: 297-304.
PubMed Central CAS PubMed Google Scholar
Xu X, Peng M, Fang Z: The direction of microsatellite mutations is dependent upon allele length. Nat Genet. 2000, 24: 396-399. 10.1038/74238.
Article CAS PubMed Google Scholar
Kayser M, Roewer L, Hedman M, et al: Characteristics and frequency of germline mutations at microsatellite loci from the human Y chromosome, as revealed by direct observation in father/son pairs. Am J Hum Genet. 2000, 66: 1580-1588. 10.1086/302905.
Article PubMed Central CAS PubMed Google Scholar
Leeflang EP, Tavare S, Marjoram P, et al: Analysis of germline mutation spectra at the Huntington's disease locus supports a mitotic mutation mechanism. Hum Mol Genet. 1999, 8: 173-183. 10.1093/hmg/8.2.173.
Article CAS PubMed Google Scholar
Coleman J, Baird DM, Royle NJ: The plasticity of human telomeres demonstrated by a hypervariable telomere repeat array that is located on some copies of 16p and 16q. Hum Mol Genet. 1999, 8: 1637-1646. 10.1093/hmg/8.9.1637.
Article CAS PubMed Google Scholar
Nei M: Molecular Evolutionary Genetics. 1987, Columbia University Press, New York, NY
Google Scholar
Strachan T, Webb D, Dover GA: Transition stages of molecular drive in multiple-copy DNA families in Drosophila. Embo J. 1985, 4: 1701-1708.
PubMed Central CAS PubMed Google Scholar
Deininger PL, Batzer MA: Alu repeats and human disease. Mol Genet Metab. 1999, 67: 183-193. 10.1006/mgme.1999.2864.
Article CAS PubMed Google Scholar
Shaffer LG, Lupski JR: Molecular mechanisms for constitutional chromosomal rearrangements in humans. Annu Rev Genet. 2000, 34: 297-329. 10.1146/annurev.genet.34.1.297.
Article CAS PubMed Google Scholar
Kurahashi H, Emanuel BS: Unexpectedly high rate of de novo constitutional t (11;22) translocations in sperm from normal males. Nat Genet. 2001, 29: 139-140. 10.1038/ng1001-139.
Article CAS PubMed Google Scholar
Szostak JW, Orr-Weaver TL, Rothstein RJ, Stahl FW: The double-strand-break repair model for recombination. Cell. 1983, 33: 25-35. 10.1016/0092-8674(83)90331-8.
Article CAS PubMed Google Scholar
Jeffreys AJ, May CA: Intense and highly localized gene conversion activity in human meiotic crossover hot spots. Nat Genet. 2004, 36: 151-156. 10.1038/ng1287.
Article CAS PubMed Google Scholar
Frisse L, Hudson RR, Bartoszewicz A, et al: Gene conversion and different population histories may explain the contrast between polymorphism and linkage disequilibrium levels. Am J Hum Genet. 2001, 69: 831-843. 10.1086/323612.
Article PubMed Central CAS PubMed Google Scholar
Heyer WD, Ehmsen KT, Solinger JA: Holliday junctions in the eukaryotic nucleus: Resolution in sight?. Trends Biochem Sci. 2003, 28: 548-557. 10.1016/j.tibs.2003.08.011.
Article CAS PubMed Google Scholar
Kong A, Gudbjartsson DF, Sainz J, et al: A high-resolution recombination map of the human genome. Nat Genet. 2002, 31: 241-247.
CAS PubMed Google Scholar
Jeffreys AJ, Kauppi L, Neumann R: Intensely punctate meiotic recombination in the class II region of the major histocompatibility complex. Nat Genet. 2001, 29: 217-222. 10.1038/ng1001-217.
Article CAS PubMed Google Scholar
McVean GA, Myers SR, Hunt S, et al: The fine-scale structure of recombination rate variation in the human genome. Science. 2004, 304: 581-584. 10.1126/science.1092500.
Article CAS PubMed Google Scholar
Crawford DC, Bhangale T, Li N, et al: Evidence for substantial fine-scale variation in recombination rates across the human genome. Nat Genet. 2004, 36: 700-706. 10.1038/ng1376.
Article CAS PubMed Google Scholar
Boulton A, Myers RS, Redfield RJ: The hotspot conversion paradox and the evolution of meiotic recombination. Proc Natl Acad Sci USA. 1997, 94: 8058-8063. 10.1073/pnas.94.15.8058.
Article PubMed Central CAS PubMed Google Scholar
Jeffreys AJ, Neumann R: Reciprocal crossover asymmetry and meiotic drive in a human recombination hot spot. Nat Genet. 2002, 31: 267-271. 10.1038/ng910.
Article CAS PubMed Google Scholar
Winckler W, Myers SR, Richter DJ, et al: Comparison of fine-scale recombination rates in humans and chimpanzees. Science. 2005, 308: 107-111. 10.1126/science.1105322.
Article CAS PubMed Google Scholar
Ptak SE, Hinds DA, Koehler K, et al: Fine-scale recombination patterns differ between chimpanzees and humans. Nat Genet. 2005, 37: 429-434. 10.1038/ng1529.
Article CAS PubMed Google Scholar
Lynn A, Koehler KE, Judis L, et al: Covariation of synaptonemal complex length and mammalian meiotic exchange rates. Science. 2002, 296: 2222-2225. 10.1126/science.1071220.
Article CAS PubMed Google Scholar
Kong A, Barnard J, Gudbjartsson DF, et al: Recombination rate and reproductive success in humans. Nat Genet. 2004, 36: 1203-1206. 10.1038/ng1445.
Article CAS PubMed Google Scholar
Lercher MJ, Hurst LD: Human SNP variability and mutation rate are higher in regions of high recombination. Trends Genet. 2002, 18: 337-340. 10.1016/S0168-9525(02)02669-0.
Article CAS PubMed Google Scholar
Kaplan NL, Hudson RR, Langley CH: The "hitchhiking effect" revisited. Genetics. 1989, 123: 887-899.
PubMed Central CAS PubMed Google Scholar
Charlesworth B, Morgan MT, Charlesworth D: The effect of deleterious mutations on neutral molecular variation. Genetics. 1993, 134: 1289-1303.
PubMed Central CAS PubMed Google Scholar
Fullerton SM, Bernardo Carvalho A, Clark AG: Local rates of recombination are positively correlated with GC content in the human genome. Mol Biol Evol. 2001, 18: 1139-1142. 10.1093/oxfordjournals.molbev.a003886.
Article CAS PubMed Google Scholar
Marais G: Biased gene conversion: Implications for genome and sex evolution. Trends Genet. 2003, 19: 330-338. 10.1016/S0168-9525(03)00116-1.
Article CAS PubMed Google Scholar
Eyre-Walker A: Recombination and mammalian genome evolution. Proc R Soc Lond B Biol Sci. 1993, 252: 237-243. 10.1098/rspb.1993.0071.
Article CAS Google Scholar
IHGSC: Finishing the euchromatic sequence of the human genome. Nature. 2004, 431: 931-945. 10.1038/nature03001.
Article Google Scholar
She X, Jiang Z, Clark RA, et al: Shotgun sequence assembly and recent segmental duplications within the human genome. Nature. 2004, 431: 927-930. 10.1038/nature03062.
Article CAS PubMed Google Scholar
Bailey JA, Gu Z, Clark RA, et al: Recent segmental duplications in the human genome. Science. 2002, 297: 1003-1007. 10.1126/science.1072047.
Article CAS PubMed Google Scholar
Roy AM, Carroll ML, Nguyen SV, et al: Potential gene conversion and source genes for recently integrated Alu elements. Genome Res. 2000, 10: 1485-1495. 10.1101/gr.152300.
Article CAS PubMed Google Scholar
Fredman D, White SJ, Potter S, et al: Complex SNP-related sequence variation in segmental genome duplications. Nat Genet. 2004, 36: 861-866. 10.1038/ng1401.
Article CAS PubMed Google Scholar
Bailey JA, Liu G, Eichler EE: An Alu transposition model for the origin and expansion of human segmental duplications. Am J Hum Genet. 2003, 73: 823-834. 10.1086/378594.
Article PubMed Central CAS PubMed Google Scholar
Zhou Y, Mishra B: Quantifying the mechanisms for segmental duplications in mammalian genomes by statistical analysis and modeling. Proc Natl Acad Sci USA. 2005, 102: 4051-4056. 10.1073/pnas.0407957102.
Article PubMed Central CAS PubMed Google Scholar
Lupski JR: Hotspots of homologous recombination in the human genome: Not all homologous sequences are equal. Genome Biol. 2004, 5: 242-10.1186/gb-2004-5-10-242.
Article PubMed Central PubMed Google Scholar
Murti JR, Bumbulis M, Schimenti JC: Gene conversion between unlinked sequences in the germline of mice. Genetics. 1994, 137: 837-843.
PubMed Central CAS PubMed Google Scholar
Neitz M, Neitz J: Numbers and ratios of visual pigment genes for normal red-green color vision. Science. 1995, 267: 1013-1016. 10.1126/science.7863325.
Article CAS PubMed Google Scholar
Zimmer EA, Martin SL, Beverley SM, et al: Rapid duplication and loss of genes coding for the alpha chains of hemoglobin. Proc Natl Acad Sci USA. 1980, 77: 2158-2162. 10.1073/pnas.77.4.2158.
Article PubMed Central CAS PubMed Google Scholar
Stankiewicz P, Shaw CJ, Withers M, et al: Serial segmental duplications during primate evolution result in complex human genome architecture. Genome Res. 2004, 14: 2209-2220. 10.1101/gr.2746604.
Article PubMed Central CAS PubMed Google Scholar
Hurles ME: Gene conversion homogenizes the CMT1A paralogous repeats. BMC Genomics. 2001, 2: 11-10.1186/1471-2164-2-11.
Article PubMed Central CAS PubMed Google Scholar
Mefford HC, Linardopoulou E, Coil D, et al: Comparative sequencing of a multicopy subtelomeric region containing olfactory receptor genes reveals multiple interactions between non-homologous chromosomes. Hum Mol Genet. 2001, 10: 2363-2372. 10.1093/hmg/10.21.2363.
Article CAS PubMed Google Scholar
Rozen S, Skaletsky H, Marszalek JD, et al: Abundant gene conversion between arms of palindromes in human and ape Y chromosomes. Nature. 2003, 423: 873-876. 10.1038/nature01723.
Article CAS PubMed Google Scholar
Hurles ME, Willey D, Matthews L, Hussain SS: Origins of chromosomal rearrangement hotspots in the human genome: Evidence from the AZFa deletion hotspots. Genome Biol. 2004, 5: R55-10.1186/gb-2004-5-8-r55.
Article PubMed Central PubMed Google Scholar
Bosch E, Hurles ME, Navarro A, Jobling MA: Dynamics of a human inter-paralog gene conversion hotspot. Genome Res. 2004, 14: 835-844. 10.1101/gr.2177404.
Article PubMed Central CAS PubMed Google Scholar
Estivill X, Cheung J, Pujana MA, et al: Chromosomal regions containing high-density and ambiguously mapped putative single nucleotide polymorphisms (SNPs) correlate with segmental duplications in the human genome. Hum Mol Genet. 2002, 11: 1987-1995. 10.1093/hmg/11.17.1987.
Article CAS PubMed Google Scholar
Bagnall RD, Ayres KL, Green PM, Giannelli F: Gene conversion and evolution of Xq28 duplicons involved in recurring inversions causing severe hemophilia A. Genome Res. 2005, 15: 214-223. 10.1101/gr.2946205.
Article PubMed Central CAS PubMed Google Scholar
Ohno S: Evolution by Gene Duplication. 1970, George Allen & Unwin, London, UK
Chapter Google Scholar
Hurles M: Gene duplication: The genomic trade in spare parts. PLoS Biol. 2004, 2: E206-10.1371/journal.pbio.0020206.
Article PubMed Central PubMed Google Scholar
Emes RD, Goodstadt L, Winter EE, Ponting CP: Comparison of the genomes of human and mouse lays the foundation of genome zoology. Hum Mol Genet. 2003, 12: 701-709. 10.1093/hmg/ddg078.
Article CAS PubMed Google Scholar
Clark AG, Glanowski S, Nielsen R, et al: Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science. 2003, 302: 1960-1963. 10.1126/science.1088821.
Article CAS PubMed Google Scholar
Teich N, Nemoda Z, Kohler H, et al: Gene conversion between functional trypsinogen genes PRSS1 and PRSS2 associated with chronic pancreatitis in a six-year-old girl. Hum Mutat. 2005, 25: 343-347. 10.1002/humu.20148.
Article PubMed Central CAS PubMed Google Scholar
Tayebi N, Stubblefield BK, Park JK, et al: Reciprocal and nonreciprocal recombination at the glucocerebrosidase gene region: Implications for complexity in Gaucher disease. Am J Hum Genet. 2003, 72: 519-534. 10.1086/367850.
Article PubMed Central CAS PubMed Google Scholar
Collier S, Tassabehji M, Sinnott P, Strachan T: A de novo pathological point mutation at the 21-hydroxylase locus: Implications for gene conversion in the human genome. Nat Genet. 1993, 3: 260-265. 10.1038/ng0393-260.
Article CAS PubMed Google Scholar
Torrents D, Suyama M, Zdobnov E, Bork P: A genome-wide survey of human pseudogenes. Genome Res. 2003, 13: 2559-2567. 10.1101/gr.1455503.
Article PubMed Central CAS PubMed Google Scholar
Ingelman-Sundberg M: Genetic polymorphisms of cytochrome P450 2D6 (CYP2D6): Clinical consequences, evolutionary aspects and functional diversity. Pharmacogenomics J. 2005, 5: 6-13. 10.1038/sj.tpj.6500285.
Article CAS PubMed Google Scholar
Flint J, Hil AV, Bowden DK, et al: High frequencies of alphathalassaemia are the result of natural selection by malaria. Nature. 1986, 321: 744-750. 10.1038/321744a0.
Article CAS PubMed Google Scholar
Gonzalez E, Kulkarni H, Bolivar H, et al: The influence of CCL3L1 gene-containing segmental duplications on HIV-1/AIDS susceptibility. Science. 2005, 307: 1434-1440. 10.1126/science.1101160.
Article CAS PubMed Google Scholar
Stefansson H, Helgason A, Thorleifsson G, et al: A common inversion under selection in Europeans. Nat Genet. 2005, 37: 129-137. 10.1038/ng1508.
Article CAS PubMed Google Scholar
Tuzun E, Sharp AJ, Bailey JA, et al: Fine-scale structural variation of the human genome. Nat Genet. 2005, 37: 727-732. 10.1038/ng1562.
Article CAS PubMed Google Scholar
Sebat J, Lakshmi B, Troge J, et al: Large-scale copy number polymorphism in the human genome. Science. 2004, 305: 525-528. 10.1126/science.1098918.
Article CAS PubMed Google Scholar
Iafrate AJ, Feuk L, Rivera MN, et al: Detection of large-scale variation in the human genome. Nat Genet. 2004, 36: 949-951. 10.1038/ng1416.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was funded by the Wellcome Trust. The author is grateful to Jim Lupski for his comments on an earlier version of the manuscript.

Author information

Authors and Affiliations

Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, CB10 1SA, UK
Matthew Hurles

Authors

Matthew Hurles
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Matthew Hurles.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Hurles, M. How homologous recombination generates a mutable genome. Hum Genomics 2, 179 (2005). https://doi.org/10.1186/1479-7364-2-3-179

Download citation

Received: 07 July 2005
Accepted: 07 July 2005
Published: 01 September 2005
DOI: https://doi.org/10.1186/1479-7364-2-3-179

How homologous recombination generates a mutable genome

Abstract

Introduction

Mechanism and genomic distribution of HR

Allelic HR and sequence diversity

Non-allelic HR between duplicated sequences

Evolutionary benefits of having a duplicated genome

Non-allelic HR and disease

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Keywords

Human Genomics

Contact us

How homologous recombination generates a mutable genome

Abstract

Introduction

Mechanism and genomic distribution of HR

Allelic HR and sequence diversity

Non-allelic HR between duplicated sequences

Evolutionary benefits of having a duplicated genome

Non-allelic HR and disease

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Human Genomics

Contact us