Home About us Contact | |||
Allele Frequency Differences (allele + frequency_difference)
Selected AbstractsLarge Allele Frequency Differences between Human Continental Groups are more Likely to have Occurred by Drift During range Expansions than by SelectionANNALS OF HUMAN GENETICS, Issue 1 2009T. Hofer Summary Several studies have found strikingly different allele frequencies between continents. This has been mainly interpreted as being due to local adaptation. However, demographic factors can generate similar patterns. Namely, allelic surfing during a population range expansion may increase the frequency of alleles in newly colonised areas. In this study, we examined 772 STRs, 210 diallelic indels, and 2834 SNPs typed in 53 human populations worldwide under the HGDP-CEPH Diversity Panel to determine to which extent allele frequency differs among four regions (Africa, Eurasia, East Asia, and America). We find that large allele frequency differences between continents are surprisingly common, and that Africa and America show the largest number of loci with extreme frequency differences. Moreover, more STR alleles have increased rather than decreased in frequency outside Africa, as expected under allelic surfing. Finally, there is no relationship between the extent of allele frequency differences and proximity to genes, as would be expected under selection. We therefore conclude that most of the observed large allele frequency differences between continents result from demography rather than from positive selection. [source] Statistical power when testing for genetic differentiationMOLECULAR ECOLOGY, Issue 10 2001N. Ryman Abstract A variety of statistical procedures are commonly employed when testing for genetic differentiation. In a typical situation two or more samples of individuals have been genotyped at several gene loci by molecular or biochemical means, and in a first step a statistical test for allele frequency homogeneity is performed at each locus separately, using, e.g. the contingency chi-square test, Fisher's exact test, or some modification thereof. In a second step the results from the separate tests are combined for evaluation of the joint null hypothesis that there is no allele frequency difference at any locus, corresponding to the important case where the samples would be regarded as drawn from the same statistical and, hence, biological population. Presently, there are two conceptually different strategies in use for testing the joint null hypothesis of no difference at any locus. One approach is based on the summation of chi-square statistics over loci. Another method is employed by investigators applying the Bonferroni technique (adjusting the P -value required for rejection to account for the elevated alpha errors when performing multiple tests simultaneously) to test if the heterogeneity observed at any particular locus can be regarded significant when considered separately. Under this approach the joint null hypothesis is rejected if one or more of the component single locus tests is considered significant under the Bonferroni criterion. We used computer simulations to evaluate the statistical power and realized alpha errors of these strategies when evaluating the joint hypothesis after scoring multiple loci. We find that the ,extended' Bonferroni approach generally is associated with low statistical power and should not be applied in the current setting. Further, and contrary to what might be expected, we find that ,exact' tests typically behave poorly when combined in existing procedures for joint hypothesis testing. Thus, while exact tests are generally to be preferred over approximate ones when testing each particular locus, approximate tests such as the traditional chi-square seem preferable when addressing the joint hypothesis. [source] The pattern of linkage disequilibrium in German Holstein cattleANIMAL GENETICS, Issue 4 2010S. Qanbari Summary This study presents a second generation of linkage disequilibrium (LD) map statistics for the whole genome of the Holstein,Friesian population, which has a four times higher resolution compared with that of the maps available so far. We used DNA samples of 810 German Holstein,Friesian cattle genotyped by the Illumina Bovine SNP50K BeadChip to analyse LD structure. A panel of 40 854 (75.6%) markers was included in the final analysis. The pairwise r2 statistic of SNPs up to 5 Mb apart across the genome was estimated. A mean value of r2 = 0.30 ± 0.32 was observed in pairwise distances of <25 kb and it dropped to 0.20 ± 0.24 at 50,75 kb, which is nearly the average inter-marker space in this study. The proportion of SNPs in useful LD (r2 , 0.25) was 26% for the distance of 50 and 75 kb between SNPs. We found a lower level of LD for SNP pairs at the distance ,100 kb than previously thought. Analysis revealed 712 haplo-blocks spanning 4.7% of the genome and containing 8.0% of all SNPs. Mean and median block length were estimated as 164 ± 117 kb and 144 kb respectively. Allele frequencies of the SNPs have a considerable and systematic impact on the estimate of r2. It is shown that minimizing the allele frequency difference between SNPs reduces the influence of frequency on r2 estimates. Analysis of past effective population size based on the direct estimates of recombination rates from SNP data showed a decline in effective population size to Ne = 103 up to ,4 generations ago. Systematic effects of marker density and effective population size on observed LD and haplotype structure are discussed. [source] Optimal Two-Stage Design for Case-Control Association Analysis Incorporating Genotyping ErrorsANNALS OF HUMAN GENETICS, Issue 3 2008Y. Zuo Summary Two-stage design is a cost effective approach for identifying disease genes in genetic studies and it has received much attention recently. In general, there are two types of two-stage designs that differ on the methods and samples used to measure allele frequencies in the first stage: (1) Individual genotyping is used in the first stage; (2) DNA pooling is used in the first stage. In this paper, we focus on the latter. Zuo et al. (2006) investigated statistical power of such a design, among other things, but the cost of the study was not taken into account. The purpose of this paper is to study the optimal design under the given overall cost. We investigate how to allocate the resources to the two stages. Note that in addition to the measurement errors associated with DNA pooling, genotyping errors are also unavoidable with individual genotyping. Therefore, we discuss the optimal design combining genotyping errors associated with individual genotyping. The joint statistical distributions of test statistics in the first and second stages are derived. For a fixed cost, our results show that the optimal design requires no additional samples in the second stage but only that the samples in the first stage be re-used. When the second stage uses an entirely independent sample, however, the optimal design under a given cost depends on the population allele frequency and allele frequency difference between the case and control groups. For the current genotyping costs, we can roughly allocate 1/3 to 1/2 of the total sample size to the first stage for screening. [source] THE HISTORICAL BIOGEOGRAPHY OF TWO CARIBBEAN BUTTERFLIES (LEPIDOPTERA: HELICONIIDAE) AS INFERRED FROM GENETIC VARIATION AT MULTIPLE LOCIEVOLUTION, Issue 3 2002Neil Davies Abstract Mitochondrial DNA and allozyme variation was examined in populations of two Neotropical butterflies, Heliconius charithonia and Dryas iulia. On the mainland, both species showed evidence of considerable gene flow over huge distances. The island populations, however, revealed significant genetic divergence across some, but not all, ocean passages. Despite the phylogenetic relatedness and broadly similar ecologies of these two butterflies, their intraspecific biogeography clearly differed. Phylogenetic analyses of mitochondrial DNA sequences revealed that populations of D. iulia north of St. Vincent are monophyletic and were probably derived from South America. By contrast, the Jamaican subspecies of H. charithonia rendered West Indian H. charithonia polyphyletic with respect to the mainland populations; thus, H. charithonia seems to have colonized the Greater Antilles on at least two separate occasions from Central America. Colonization velocity does not correlate with subsequent levels of gene flow in either species. Even where range expansion seems to have been instantaneous on a geological timescale, significant allele frequency differences at allozyme loci demonstrate that gene flow is severely curtailed across narrow ocean passages. Stochastic extinction, rapid (re)colonization, but low gene flow probably explain why, in the same species, some islands support genetically distinct and nonexpanding populations, while nearby a single lineage is distributed across several islands. Despite the differences, some common biogeographic patterns were evident between these butterflies and other West Indian taxa; such congruence suggests that intraspecific evolution in the West Indies has been somewhat constrained by earth history events, such as changes in sea level. [source] Nonreplication in Genetic Studies of Complex Diseases,Lessons Learned From Studies of Osteoporosis and Tentative Remedies,JOURNAL OF BONE AND MINERAL RESEARCH, Issue 3 2005Hui Shen Abstract Inconsistent results have accumulated in genetic studies of complex diseases/traits over the past decade. Using osteoporosis as an example, we address major potential factors for the nonreplication results and propose some potential remedies. Over the past decade, numerous linkage and association studies have been performed to search for genes predisposing to complex human diseases. However, relatively little success has been achieved, and inconsistent results have accumulated. We argue that those nonreplication results are not unexpected, given the complicated nature of complex diseases and a number of confounding factors. In this article, based on our experience in genetic studies of osteoporosis, we discuss major potential factors for the inconsistent results and propose some potential remedies. We believe that one of the main reasons for this lack of reproducibility is overinterpretation of nominally significant results from studies with insufficient statistical power. We indicate that the power of a study is not only influenced by the sample size, but also by genetic heterogeneity, the extent and degree of linkage disequilibrium (LD) between the markers tested and the causal variants, and the allele frequency differences between them. We also discuss the effects of other confounding factors, including population stratification, phenotype difference, genotype and phenotype quality control, multiple testing, and genuine biological differences. In addition, we note that with low statistical power, even a "replicated" finding is still likely to be a false positive. We believe that with rigorous control of study design and interpretation of different outcomes, inconsistency will be largely reduced, and the chances of successfully revealing genetic components of complex diseases will be greatly improved. [source] Population structure of the peridomestic mosquito Ochlerotatus notoscriptus in AustraliaMEDICAL AND VETERINARY ENTOMOLOGY, Issue 2 2004D. H. Foley Abstract.,Ochlerotatus notoscriptus (Skuse) (Diptera: Culicidae) is the predominant peridomestic mosquito in Australia where it is the primary vector of dog heartworm, Dirofilaria immitis (Leidy), and a potentially important vector of arboviruses (Barmah Forest, Ross River) with geographical variation of vector competence. Although widespread, Oc. notoscriptus has low dispersal ability, so it may have isolated subpopulations. The identification of gene flow barriers may assist in understanding arbovirus epidemiology and disease risk, and for developing control strategies for this species. We investigated the population structure of Oc. notoscriptus from 17 sites around Australia, using up to 31 putative allozyme loci, 11 of which were polymorphic. We investigated the effect of larval environment and adult morphology on genetic variation. At least five subpopulations were found, four in New South Wales (NSW) and one unique to Darwin. Perth samples appear to be a product of recent colonization from the Australian east coast. For NSW sites, a Mantel test revealed an isolation by distance effect and spatial autocorrelation analysis revealed an area of effective gene flow of 67 km, which is high given the limited dispersal ability of this species. No consistent difference was observed between ,urban' and ,sylvan' habitats, which suggests frequent movement between these sites. However, a finer-scaled habitat study at Darwin revealed small but significant allele frequency differences, including for Gpi. No fixed allozyme differences were detected for sex, size, integument colour or the colour of species-diagnostic pale scales on the scutum. The domestic habit of Oc. notoscriptus and assisted dispersal have helped to homogenize this species geographically but population structure is still detectable on several levels associated with geographical variation of vector competence. [source] Evolutionary and statistical properties of three genetic distancesMOLECULAR ECOLOGY, Issue 8 2002Steven T. Kalinowski Abstract Many genetic distances have been developed to summarize allele frequency differences between populations. I review the evolutionary and statistical properties of three popular genetic distances: DS, DA, and ,, using computer simulation of two simple evolutionary histories: an isolation model of population divergence and an equilibrium migration model. The effect of effective population size, mutation rate, and mutation mechanism upon the parametric value between pairs of populations in these models explored, and the unique properties of each distance are described. The effect of these evolutionary parameters on study design is also investigated and similar results are found for each genetic distance in each model of evolution: large sample sizes are warranted when populations are relatively genetically similar; and loci with more alleles produce better estimates of genetic distance. [source] Large Allele Frequency Differences between Human Continental Groups are more Likely to have Occurred by Drift During range Expansions than by SelectionANNALS OF HUMAN GENETICS, Issue 1 2009T. Hofer Summary Several studies have found strikingly different allele frequencies between continents. This has been mainly interpreted as being due to local adaptation. However, demographic factors can generate similar patterns. Namely, allelic surfing during a population range expansion may increase the frequency of alleles in newly colonised areas. In this study, we examined 772 STRs, 210 diallelic indels, and 2834 SNPs typed in 53 human populations worldwide under the HGDP-CEPH Diversity Panel to determine to which extent allele frequency differs among four regions (Africa, Eurasia, East Asia, and America). We find that large allele frequency differences between continents are surprisingly common, and that Africa and America show the largest number of loci with extreme frequency differences. Moreover, more STR alleles have increased rather than decreased in frequency outside Africa, as expected under allelic surfing. Finally, there is no relationship between the extent of allele frequency differences and proximity to genes, as would be expected under selection. We therefore conclude that most of the observed large allele frequency differences between continents result from demography rather than from positive selection. [source] |