Robustness of the inference of human population structure: A comparison of X-chromosomal and autosomal microsatellites

Ramachandran, Sohini; Rosenberg, Noah A.; Zhivotovsky, Lev A.; Feldman, Marcus W.

doi:10.1186/1479-7364-1-2-87

Primary research
Published: 01 January 2004

Robustness of the inference of human population structure: A comparison of X-chromosomal and autosomal microsatellites

Sohini Ramachandran¹,
Noah A. Rosenberg²,
Lev A. Zhivotovsky³ &
…
Marcus W. Feldman¹

Human Genomics volume 1, Article number: 87 (2004) Cite this article

1647 Accesses
40 Citations
3 Altmetric
Metrics details

Abstract

In this paper, data on 20 X-chromosomal microsatellite polymorphisms from the HGDP-CEPH cell line panel are used to infer human population structure. Inferences from these data are compared to those obtained from autosomal microsatellites. Some of the major features of the structure seen with 377 autosomal markers are generally visible with the X-linked markers, although the latter provide less resolution. Differences between the X-chromosomal and autosomal results can be explained without requiring major differences in demographic parameters between males and females. The dependence of the partitioning on the number of individuals sampled from each region and on the number of markers used is discussed.

Introduction

Differences in patterns of human genetic variation across genetic systems -- such as autosomes, the X and Y chromosomes and the mitochondrial genome -- can be attributed to two main sources: (1) differences between males and females in demographic parameters such as population size and migration rate; and (2) differences across systems in the mechanism of inheritance. Past studies have reported on differences between evolution in males and females by comparing autosomal data with the non-recombining portion of the Y chromosome (NRY) and to mitochondrial DNA (mtDNA);[1–3] however, the X chromosome has generally not been utilised in these studies.

Unlike the NRY and mtDNA, the X chromosome undergoes recombination and contains numerous independent markers. Additionally, selection, if present in the uni-parental systems, will affect every locus; on the X chromosome, however, it affects only those loci that are closely linked to selected sites. Consequently, the differences in variation between autosomes and the X chromosome may be more directly ascribed to male/female demographic differences than those between autosomes and the uniparental systems.

Here the HGDP-CEPH Human Genome Diversity Cell Line Panel[4] is used to test whether individual multilocus genotypes defined by X-linked markers produce different inferences about population structure from those obtained using autosomal genotypes. Results from X-chromosomal analysis of molecular variance (AMOVA) and cluster analysis (as implemented in structure[5]) are compared with those found using the same techniques on 377 autosomal markers [6]. These comparisons are used to study the extent to which the differences between their findings and the results reported by Rosenberg et al. [6] stem from differences in the mechanism of inheritance and from the smaller amount of information available on the X chromosome. This analysis leads to conclusions about the robustness of population structure inference with respect to number of microsatellite markers and the number of individuals sampled per region.

Methods and results

Data

The 1,056 individuals (677 males, 379 females) analysed by Rosenberg et al. [6] and Zhivotovsky et al.,[7] who derive from 52 populations in seven regional groups, were typed for X-linked markers. The X-chromosomal data were compared to autosomal data from these same individuals [6, 7].

The loci studied on the X chromosome consist of 20 polymorphic microsatellites -- 4 di-, 2 tri- and 14 tetranucleotide repeats -- from Marshfield Screening Set #10, with 5.2 per cent missing data. Three of the markers were pseudoautosomal (tetranucleotide DXYS218, tetranucleotide DXS9900 and dinucleotide DXYS154), so that the males were not hemizygous but homo- or heterozygous at these loci.

In both the autosomal data and the X-chromosomal data, markers are sufficiently widely spaced that, within individual populations, linkage disequilibrium as estimated by homo-zygosity-based statistics[8] is generally not observed (results not shown). Thus, these loci can be treated as independent markers.

Genetic diversity

Heterozygosity in the seven regions, computed using the unbiased estimator,[9] ranged as ollows: 0.57 (America), 0.64 (Oceania), 0.67 (East Asia), 0.71 (Central/South Asia), 0.72 (Europe), 0.74 (Middle East) and 0.78 (Africa). Of the 237 total alleles in the data, 34 were confined to a single population; 29 of these 'private' alleles appeared only once in the sample. Of the 208 alleles found more than once in the sample, 7.2 per cent were exclusive to one of the seven geographical regions listed above.

AMOVA for the X chromosome

It has generally been observed that the within-population component of genetic variation, W = 1 -- F_st, is the largest component of human genetic diversity [1, 6, 10, 11]. Using Genetic Data Analysis (GDA)[12] and assuming Hardy-Weinberg proportions within populations, the variance of allelic indicator variables were partitioned in the same manner as was done for autosomal loci from the same individuals [6]. For the 17 non-pseudoautosomal X-chromosomal markers, the within-group variance component accounted for 87-93 per cent of variation among individuals (Table 1). Note that these values are generally smaller than the corresponding autosomal values in Table 1 of Rosenberg et al. [6] (Table 2).

Table 1 Analysis of molecular variance (AMOVA) for 17 (non-pseudoautosomal) X-chromosomal markers.

Full size table

Table 2 Results for two-sided Wilcoxon tests with r = 0.875 and r = 0.5.

Full size table

This observation may be explained by a faster rate of genetic drift for X-chromosomal markers, by comparison to that for autosomal markers. Because populations contain fewer copies of X chromosomes than of any given autosomes, drift may proceed more rapidly for X-chromosomal markers, leading to greater X-chromosomal differentiation across populations and larger among-population and among-region variance components.

This argument can be investigated using Slatkin's[13, 14] formulation of F_stin a set of d populations, each with constant 'effective population size' of N individuals. Consider a marker for which t₀ and t₁ are the mean coalescence times for two alleles from the same population and from different populations, respectively, and for which the mean coalescence time for two alleles chosen from any two populations is t = t₀/d + (d-1)t₁ /d. Assuming mutation rates are small, Slatkin[13] obtained:

F_{s t} = (t - t_{0}) / t

(1)

Suppose that the d populations diverged simultaneously at time Q in the past, from an ancestral population also with effective population size N, where Q is measured in the same units as t₀, t₁ and t. Noting that t₁ = t₀ + Q and substituting u for (d - 1)Q/d, (1) gives:

F_{s t} = u / (t_{0} + u)

(2)

The value of t₀, in units of generations or years, is proportional to the effective population size, and therefore differs across marker systems. Let F_autand F_Xdenote autosomal and X-chromosomal values of F_st, and let T_autand T_Xdenote autosomal and X-chromosomal values of t₀. Following a similar calculation to that of Pérez-Lezaun et al.,[15] to determine the relationship between F_autand F_X, one can equate expressions for u obtained from autosomal and X-chromosomal versions of (2):

F_{a u t} = \frac{T_{X} F_{X}}{T_{a u t} - F_{X} (T_{a u t} - T_{X})}

(3)

F_{X} = \frac{T_{a u t} F_{a u t}}{T_{X} + F_{a u t} (T_{a u t} - T_{X})}

(4)

The within-population components of genetic variation, W_aut= 1 - F_autand W_X= 1- F_Xfor autosomal and X-chromosomal loci, respectively, satisfy:

W_{a u t} = \frac{T_{a u t} W_{X}}{T_{X} + W_{X} (T_{a u t} - T_{X})}

(5)

W_{X} = \frac{T_{X} W_{a u t}}{T_{a u t} - W_{a u t} (T_{a u t} - T_{X})}

(6)

Writing N = N_f+ N_m, where N_fand N_mare effective population sizes of females and males, respectively, r = N_f/N is the female fraction of the effective population size. Using the expressions for autosomal and X-chromosomal effective population sizes, N_aut= 4r(1 - r)N and N_x= 9r(1 - r)N/[2(2 - r)],[16–18] together with the fact that T_aut/N_aut= T_x/N_x, (3-6) can be simplified.

Restricting attention to (6), leads to:

W_{X} = \frac{9 W_{a u t}}{8 (2 - r) - W_{a u t} (7 - 8 r)}

(7)

In terms of the relative rate of drift in females compared with males, denoted as z = [1/(2N_f)]/[1/(2N_m)] = N_m/N_f, (7) gives:

W_{X} = \frac{9 (z + 1) W_{a u t}}{8 (2 z + 1) - W_{a u t} (7 z - 1)}

(8)

Note that there are two special cases of interest (Table 2). At r = 1/2 (z = 1), drift proceeds at the same rate in males and females, so that W_X= 3W_aut/(4 - W_aut). At r = 7/8 = 0.875 (z = 1/7 ≈ 0.143), the slow speed of drift in females compared with males reduces the drift rate of X chromosomes exactly enough to counteract the increase in X-chromosomal drift rate that results from their smaller number in the population. In other words, W_X= W_aut. The fact that the hypothesis W_X= W_aut(Table 2) at P = 0.05 for 11 of the 13 groupings of data in Table 2 can be rejected means that the hypothesis z = 1/7 can also be rejected.

For each of the 13 datasets, the values of r, the female fraction of effective population size, were varied from 0 to 1. At each choice of r, the transformation in (7) was applied and P-values for the two-sided Wilcoxon test between the list of 377 transformed autosomal within-population variance components and the within-population variance components observed at the 17 non-pseudoautosomal X-linked markers were obtained (Figure 1a-c).

At r = 0.5, when drift proceeds at the same rate in males and females, significant P-values (P < 0.05) were found for Africa, Eurasia (treated both as one region and three regions), Europe, Central/South Asia and East Asia. Therefore, for the remaining seven of the 13 samples, the differences in autosomal and X-chromosomal F_stvalues can be explained by assuming that N_m= N_fand by using the smaller effective population size of X chromosomes alone. Because Rosenberg et al. [19] found that repeat size affected divergence, Wilcoxon tests were also performed between transformations of the 274 autosomal tetranucleotide repeats and the 14 X-linked tetranucleotides and similar results were obtained, whether or not the two pseudoautosomal tetranucleotides were included in analysis (not shown).

The values of r on the interval [0,1] that produced the largest P-value (see Figure 1a-c) are also reported in Table 2. America was the only sample where the value of r corresponding to the maximal P-value was greater than 0.5 (r = 0.66 resulted in P = 1.00 for this case).

Of special interest is the fact that five of the six samples with significant P-values (P < 0.05) at r = 0.5 recorded significant P-values as r varied over the whole range [0,1]. For example, as r varies from 0 to 1 in Figure 1b, the P-value resulting from the autosomal transformation for Eurasia (treated as one region) ranges from 4.95 × 10^-4 (r = 0) to 1.97 × 10^-10 (r = 1): The single exception to this pattern was Africa, where the P-value decreased monotonically as r increased and P < 0.05 for r ≥ 0.06.

For these six groupings of the dataset (Africa, Eurasia both as one and as three regions, Europe, Central/South Asia and East Asia), the divergence model with constant effective population size is likely to provide a poorer approximation, as it does not account for population growth or migration[7] (Figure 1d-f).

Multidimensional scaling analysis

Geographic groups of populations are revealed by multidimensional scaling of pairwise F_stvalues (Figure 2): sub-Saharan Africa, (western) Eurasia (which includes Europe, the Middle East, and Central/South Asia), East Asia, Oceania and America. Three populations from Eurasia (Uygur, Hazara, Brahui) and three populations from East Asia (Dai, Cambodian, Han from North China) overlap in the plot. The American populations show much greater within-region genetic differentiation than other continental groups, with the Mayan population (labelled as 4 in Figure 2) deviating somewhat from the rest of American samples. These results agree with the analysis of the same populations using auto-somal microsatellite markers [7].

X-chromosomal population structure

The structure[5] program identifies subgroupings with distinctive allele frequencies and places individuals into K clusters, where K is defined beforehand by the user and can be varied across independent runs of the program. An individual's membership of a particular cluster is presented as a number between 0 and 1, with membership coefficients summing to 1 across all K clusters.

As is true of autosomal allele frequencies, X-chromosomal allele frequencies are strongly correlated across regions (Table 3). Thus, as was done for the autosomal genotypes from the same individuals,[6] the correlated allele frequencies model implemented in structure[5] was used with runs of the same number of iterations as those used to analyse the autosomal data.

Table 3 Correlation coefficients of allele frequencies. Below the diagonal: correlations for 237 X-chromosomal alleles. Above the diagonal: correlations for 4682 autosomal alleles [6].

Full size table

America and Africa were the two essentially discrete regions generated at K = 2 for the X-chromosomal dataset (Figure 3). To compare results with Rosenberg et al.,[6]K was increased from 2 to 6 incrementally. At K = 3, Eurasian populations were somewhat identified and the Mozabites were observed to have substantial membership with Africans, as may be expected from their location in Algeria. At K = 4, the X-chromosomal data show noticeably different structure from the autosomal data (see Figure 1 of Rosenberg et al. [6]), as East Asia does not separate as a genetic cluster with good resolution. The next distinct cluster appears at K = 6, where the Oceanic, American and African regions are observed; Eurasia and East Asia separate less obviously, but still appear differentiated from each other.

The X chromosome polymorphisms produced similar clustering to the autosomes, but with less resolution. This raises the question of how the resolution of clusters depends on the number of markers available to study. Figure 4 shows that when the same amounts of data are used, the autosomal and X-chromosomal loci are largely in agreement. Clustering from 20 markers on either autosome 5 or autosome 11 (Figure 4) revealed results very similar to those found with the X-chromosomal dataset. (These particular autosomes were chosen because exactly 20 microsatellites had been typed on them.) For these chromosomes, at K = 6, only American, African and (in the case of chromosome 5) Oceanic populations appear distinctly. Furthermore, a sample of 20 markers spread across all of the autosomes yielded similar results, with the Kalash appearing as a distinct group, but with the Oceanic cluster absent. The Kalash -- also seen distinctly in Figure 4 from the markers on chromosome 11--formed the sixth cluster in Rosenberg et al. [6] and was the only major cluster in that study that did not match a major geographical region.

Robustness

The 377 autosomal markers in the HGDP-CEPH Human Genome Diversity Cell Line Panel data[4, 6, 7] comprise the largest multilocus dataset presently available for studying globally distributed populations. Of interest in studies of population structure is the number of loci needed for clustering. Also considered here is the required number of sampled individuals [6, 20].

Rosenberg et al. [6] found that inference of membership coefficients is most successful with at least 150 markers, and this is corroborated in Figure 5. It is also seen in Figure 5 that the addition of more individuals to a subset of the entire autosomal dataset (which contains 377 markers and 1,056 individuals) did not improve population structure inference as much as did the addition of loci. Data from individuals are used to estimate allele frequencies, which can be done fairly accurately with a small number of individuals; however, as structure uses distinctive genotypic combinations for the construction of clusters, and multilocus combinations are more likely to be distinctive to particular groups than are singlelocus types,[21] additional loci can contribute more information to cluster analysis than can the addition of more individuals to the sample (Figure 6).

Oceania appears in Figure 6 as a distinct cluster with only ten loci and between 35 and 100 individuals per region. Because the Oceanic populations together contain 39 individuals, increasing the number of individuals beyond 35 per region meant that every Melanesian and Papuan was included in the subset run of structure. Thus, the distinctive allele frequencies of these populations identify this particular genetic cluster, despite the use of only ten loci.

Discussion

The same techniques as Rosenberg et al. [6] and Zhivotovsky et al. [7] were used to analyse genetic structure as inferred from 20 microsatellite markers on the X chromosome. Multidimensional scaling (Figure 2) did not reveal major departures from the patterns exhibited by the autosomal data. As was also observed on the autosomes, both America and Oceania are the regions exhibiting the lowest heterozygosity (0.57 and 0.64, respectively) on the X chromosome.

Seielstad et al. [3] used a migration model to attribute differences in F_stacross genetic systems to a difference in male and female migration rates. By contrast, a divergence model was used here and it was found that the differences observed in F_stvalues can, in many cases, be explained by the smaller effective population size of X chromosomes compared with autosomes. This is similar to what was observed by Jorde et al.,[1] who reported higher G_stvalues in Y-chromosome restriction-site polymorphisms and mtDNA compared with autosomal systems, and found that this difference was expected because of the lower effective population size of the uniparentally-inherited portions of the genome. In those regions here where the smaller number of X chromosomes does not provide a sufficient explanation (Africa, Eurasia, Europe, Central/South Asia and East Asia), the assumptions of the divergence model--especially that of constant population size--may be responsible for the disagreement.

Upon closer examination of these differences in observed F_st, the data here provide some support for the idea that genetic drift occurs faster in females than in males, or, equivalently, that the female effective population size is smaller than that of males. Many factors could potentially explain this observation; a larger correlation in females between reproductive success in parents and offspring or a smaller generation time in females[22] may increase the rate of drift in females compared with that in males.

The use of X-chromosomal data revealed clustering similar to that obtained using autosomal data, but with less resolution (Figures 3-5). In America, Africa and Oceania, inferred clusters corresponded closely with predefined populations using both the autosomal and X-chromosomal loci, but the pattern of admixture observed by Rosenberg et al. [6] is not exactly the same as that revealed by the X chromosome, due to reduced resolution of clusters.

Note that the Oceanic (Melanesian and Papuan) populations in Figure 3 appear most similar to the African populations for 2 ≤ K ≤ 4, and then appear as their own genetic cluster at K = 6. This contrasts with the analysis of Wilson et al.,[23] whose analysis of 23 X-linked microsatellites using structure showed the Oceanic population combining with the Chinese population at K = 3. A possible explanation for the results here may be a migration from Africa to Oceania separate from the primary migration out of Africa to other regions [24].

While choosing representative individuals from various populations is an important factor in the success of studies concerned with inference of population structure, the robustness of structure is much more dependent on the number of microsatellite markers used (Figures 5 and 6). In common with Rosenberg et al.,[6] it is observed here that ancestry inference is most successful with at least 150 loci (Figure 5). Bamshad et al. [20] reported that correct assignment to the continent of origin with a mean accuracy of at least 90 per cent required a minimum of 60 loci and reached 99-100 per cent accuracy when more than 100 loci were used.

In contrast to this study, Bamshad et al. [20] considered a sample correctly assigned if the cluster with the greatest membership coefficient for an individual was the same as the predefined assignment. The criterion here compares the membership coefficients across all K clusters calculated when using structure on a subset of the data, with assignment made based on the full dataset. Thus, it is a measure of how well the results with smaller amounts of data match those with larger datasets, rather than a measure of 'correct assignment'. The difference in these criteria is likely to account for the smaller amount of genetic data regarded as sufficient by Bamshad et al. [20] The similarity coefficient C may be more sensitive to differences in membership coefficients between two runs and can be viewed as a conservative measure of similarity for the runs: visual similarity between graphs of estimated membership coefficients (Figure 6) can be achieved even with fairly small values of C (Figure 5). In Figure 6, for example, the plot using 100 loci and a maximum of 200 individuals per region is quite similar to the plot of the full data, while the similarity coefficient[6] between the structure runs of that particular subset and the entire dataset is 0.379. C does not make use of the 'correct' predefined structure, and, thus, unlike the criterion used by Bamshad et al. [20], is unaffected by errors among the predefined labels.

While most studies to date have lacked the power to make strong inferences about population structure (due to the very recent availability of datasets with individuals assayed for large numbers of loci), future studies should choose an appropriate number both of individuals per region and of loci for these analyses. Note, however, that the sampling scheme may affect the estimated structure. For example, finer distinctions among populations of interest become visible when individuals who are more distantly related to those populations are omitted from analysis [6].

Although differences between the population structure based on the autosomes and X-linked loci may be expected due to differences in male and female demography, the differences between the results here and those of Rosenberg et al. [6] were largely due to the smaller number of X chromosomes in a population compared with autosomes, and to the smaller amount of data available from the X chromosome. From these results, it might be inferred that sex-biased demographic processes have not had a great influence on human population structure.

References

Jorde LB, Watkins WS, Bamshad MJ, et al: 'The distribution of human genetic diversity: A comparison of mitochondrial, autosomal and Y-chromosome data'. Am J Hum Genet. 2000, 66: 979-988. 10.1086/302825.
Article PubMed Central CAS PubMed Google Scholar
Oota H, Settheetham-Ishida W, Tiwaweck D, et al: 'Human mtDNA and Y-chromosome variation is correlated with matrilocal versus patrilocal residence'. Nature Genet. 2001, 29: 20-21. 10.1038/ng711.
Article CAS PubMed Google Scholar
Seielstad MT, Minch E, Cavalli-Sforza LL: 'Genetic evidence for a higher female migration rate in humans'. Nature Genet. 1998, 20: 278-280. 10.1038/3088.
Article CAS PubMed Google Scholar
Cann HM, de Toma C, Cazes L, et al: 'A human genome diversity cell line panel'. Science. 2002, 296: 261-262.
Article CAS PubMed Google Scholar
Pritchard JK, Stephens M, Donnelly P: 'Inference of population structure using multilocus genotype data'. Genetics. 2000, 155: 945-959.
PubMed Central CAS PubMed Google Scholar
Rosenberg NA, Pritchard JK, Weber JL, et al: 'Genetic structure of human populations'. Science. 2002, 298: 2381-2385. 10.1126/science.1078311.
Article CAS PubMed Google Scholar
Zhivotovsky LA, Rosenberg NA, Feldman MW: 'Features of evolution and expansion of modern humans, inferred from genome-wide microsatellite markers'. Am J Hum Genet. 2003, 72: 1171-1186. 10.1086/375120.
Article PubMed Central CAS PubMed Google Scholar
Sabatti C, Risch N: 'Homozygosity and linkage disequilibrium'. Genetics. 2002, 160: 1707-1719.
PubMed Central PubMed Google Scholar
Weir B: Genetic Data Analysis II. 1996, Sinauer Press, Sunderland, MA
Google Scholar
Barbujani G, Magagni A, Minch E, et al: 'An apportionment of human DNA diversity'. Proc Natl Acad Sci USA. 1997, 94: 4516-4519. 10.1073/pnas.94.9.4516.
Article PubMed Central CAS PubMed Google Scholar
Lewontin RC: 'The apportionment of human diversity'. Evol Biol. 1972, 6: 381-398.
Article Google Scholar
Lewis PO, Zaykin DV: 'Genetic Data Analysis: Computer program for the analysis of allelic data'. 2001, [http://lewis.eeb.uconn.edu/lewishome/software.html]
Google Scholar
Slatkin M: 'Inbreeding coefficients and coalescence times'. Genet Res. 1991, 58: 167-175. 10.1017/S0016672300029827.
Article CAS PubMed Google Scholar
Slatkin M: 'A measure of population subdivision based on microsatellite allele frequencies'. Genetics. 1995, 139: 457-462.
PubMed Central CAS PubMed Google Scholar
Pérez-Lezaun A, Calafell F, Seielstad M, et al: 'Population genetics of Y-chromosome short tandem repeats in humans'. J Mol Evol. 1997, 45: 265-270. 10.1007/PL00006229.
Article PubMed Google Scholar
Ewens WJ: Population Genetics. 1969, Methuen & Co., London, UK
Book Google Scholar
Hartl DL, Clark AG: Principles of Population Genetics. 1997, Sinauer Press, Sunderland, MA
Google Scholar
Nordborg M, Krone S: 'Separation of time scales and convergence to the coalescent in structured populations'. Modern Developments in Theoretical Population Genetics. Edited by: Slatkin M, Veuille M. 2002, Oxford University Press, Oxford, UK
Google Scholar
Rosenberg NA, Pritchard JK, Weber JL, et al: 'Response to comment on 'Genetic structure of human populations''. Science. 2003, 300: 1877-
Article CAS Google Scholar
Bamshad MJ, Wooding S, Watkins WS, et al: 'Human population genetic structure and inference of group membership'. Am J Hum Genet. 2003, 72: 578-589. 10.1086/368061.
Article PubMed Central CAS PubMed Google Scholar
Edwards AWF: 'Human genetic diversity: Lewontin's fallacy'. Bioessays. 2003, 25: 798-801. 10.1002/bies.10315.
Article CAS PubMed Google Scholar
Helgason A, Hrafnkelsson B, Gulcher JR, et al: 'A popula-tionwide coalescent analysis of Icelandic matrilineal and patrilineal genealogies: Evidence for a faster evolutionary rate of mtDNA lineages than Y chromosomes'. Am J Hum Genet. 2003, 72: 1370-1388. 10.1086/375453.
Article PubMed Central CAS PubMed Google Scholar
Wilson JF, Weale ME, Smith AC, et al: 'Population genetic structure of variable drug response'. Nature Genet. 2001, 29: 265-269. 10.1038/ng761.
Article CAS PubMed Google Scholar
Disotell TR: 'Human evolution: the southern route to Asia'. Curr Biol. 1999, 9: R925-R928. 10.1016/S0960-9822(00)80106-2.
Article CAS PubMed Google Scholar
Rosenberg NA: 'Distruct: A program for the graphical display of population structure'. Mol Ecol Notes. 2004, [http://www.blackwell-synergy.com/links/doi/10.1046/j.1471-8286.2003.00566.x/full]
Google Scholar

Download references

Acknowledgements

This research was supported in part by NIH Grants GM28428 and GM28016. Sohini Ramachandran is also supported by a NDSEG fellowship. Noah A. Rosenberg is supported by an NSF Postdoctoral Fellowship in Biological Informatics.

Author information

Authors and Affiliations

Department of Biological Sciences, Stanford University, Stanford, CA, 94305-5020, USA
Sohini Ramachandran & Marcus W. Feldman
Program in Molecular and Computational Biology, University of Southern California, 1042 W. 36th Place DRB 289, Los Angeles, CA, 90089, USA
Noah A. Rosenberg
Vavilov Institute of General Genetics, Russian Academy of Sciences, 3 Gubkin Street, Moscow, 117809, Russia
Lev A. Zhivotovsky

Authors

Sohini Ramachandran
View author publications
You can also search for this author in PubMed Google Scholar
Noah A. Rosenberg
View author publications
You can also search for this author in PubMed Google Scholar
Lev A. Zhivotovsky
View author publications
You can also search for this author in PubMed Google Scholar
Marcus W. Feldman
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

About this article

Cite this article

Ramachandran, S., Rosenberg, N.A., Zhivotovsky, L.A. et al. Robustness of the inference of human population structure: A comparison of X-chromosomal and autosomal microsatellites. Hum Genomics 1, 87 (2004). https://doi.org/10.1186/1479-7364-1-2-87

Download citation

Received: 24 October 2003
Accepted: 24 October 2003
Published: 01 January 2004
DOI: https://doi.org/10.1186/1479-7364-1-2-87

Robustness of the inference of human population structure: A comparison of X-chromosomal and autosomal microsatellites

Abstract

Introduction

Methods and results