Home About us Contact | |||
Correct Assignment (correct + assignment)
Selected AbstractsPopulation structure of Atlantic salmon (Salmo salar L.): a range-wide perspective from microsatellite DNA variationMOLECULAR ECOLOGY, Issue 4 2001T. L. King Abstract Atlantic salmon (n = 1682) from 27 anadromous river populations and two nonanadromous strains ranging from south-central Maine, USA to northern Spain were genotyped at 12 microsatellite DNA loci. This suite of moderate to highly polymorphic loci revealed 266 alleles (5,37/locus) range-wide. Statistically significant allelic and genotypic heterogeneity was observed across loci between all but one pairwise comparison. Significant isolation by distance was found within and between North American and European populations, indicating reduced gene flow at all geographical scales examined. North American Atlantic salmon populations had fewer alleles, fewer unique alleles (though at a higher frequency) and a shallower phylogenetic structure than European Atlantic salmon populations. We believe these characteristics result from the differing glacial histories of the two continents, as the North American range of Atlantic salmon was glaciated more recently and more uniformly than the European range. Genotypic assignment tests based on maximum-likelihood provided 100% correct classification to continent of origin and averaged nearly 83% correct classification to province of origin across continents. This multilocus method, which may be enhanced with additional polymorphic loci, provides fishery managers the highest degree of correct assignment to management unit of any technique currently available. [source] Fishing new proteins in the twilight zone of genomes: The test case of outer membrane proteins in Escherichia coli K12, Escherichia coli O157:H7, and other Gram-negative bacteriaPROTEIN SCIENCE, Issue 6 2003Rita Casadio Abstract We address the problem of clustering the whole protein content of genomes into three different categories,globular, all-,, and all-, membrane proteins,with the aim of fishing new membrane proteins in the pool of nonannotated proteins (twilight zone). The focus is then mainly on outer membrane proteins. This is performed by using an integrated suite of programs (Hunter) specifically developed for predicting the occurrence of signal peptides in proteins of Gram-negative bacteria and the topography of all-, and all-, membrane proteins. Hunter is tested on the well and partially annotated proteins (2160 and 760, respectively) of Escherichia coli K 12 scoring as high as 95.6% in the correct assignment of each chain to the category. Of the remaining 1253 nonannotated sequences, 1099 are predicted globular, 136 are all-,, and 18 are all-, membrane proteins. In Escherichia coli 0157:H7 we filtered 1901 nonannotated proteins. Our analysis classifies 1564 globular chains, 327 inner membrane proteins, and 10 outer membrane proteins. With Hunter, new membrane proteins are added to the list of putative membrane proteins of Gram-negative bacteria. The content of outer membrane proteins per genome (nine are analyzed) ranges from 1.5% to 2.4%, and it is one order of magnitude lower than that of inner membrane proteins. The finding is particularly relevant when it is considered that this is the first large-scale analysis based on validated tools that can predict the content of outer membrane proteins in a genome and can allow cross-comparison of the same protein type between different species. [source] LiveBench-1: Continuous benchmarking of protein structure prediction serversPROTEIN SCIENCE, Issue 2 2001Janusz M. Bujnicki Abstract We present a novel, continuous approach aimed at the large-scale assessment of the performance of available fold-recognition servers. Six popular servers were investigated: PDB-Blast, FFAS, T98-lib, GenTHREADER, 3D-PSSM, and INBGU. The assessment was conducted using as prediction targets a large number of selected protein structures released from October 1999 to April 2000. A target was selected if its sequence showed no significant similarity to any of the proteins previously available in the structural database. Overall, the servers were able to produce structurally similar models for one-half of the targets, but significantly accurate sequence-structure alignments were produced for only one-third of the targets. We further classified the targets into two sets: easy and hard. We found that all servers were able to find the correct answer for the vast majority of the easy targets if a structurally similar fold was present in the server's fold libraries. However, among the hard targets,where standard methods such as PSI-BLAST fail,the most sensitive fold-recognition servers were able to produce similar models for only 40% of the cases, half of which had a significantly accurate sequence-structure alignment. Among the hard targets, the presence of updated libraries appeared to be less critical for the ranking. An "ideally combined consensus" prediction, where the results of all servers are considered, would increase the percentage of correct assignments by 50%. Each server had a number of cases with a correct assignment, where the assignments of all the other servers were wrong. This emphasizes the benefits of considering more than one server in difficult prediction tasks. The LiveBench program (http://BioInfo.PL/LiveBench) is being continued, and all interested developers are cordially invited to join. [source] Assessing SNP markers for assigning individuals to cattle populationsANIMAL GENETICS, Issue 1 2009R. Negrini Summary The effectiveness of single nucleotide polymorphisms (SNPs) for the assignment of cattle to their source breeds was investigated by analysing a panel of 90 SNPs assayed on 24 European breeds. Breed assignment was performed by comparing the Bayesian and frequentist methods implemented in the structure 2.2 and geneclass 2 software programs. The use of SNPs for the reallocation of known individuals to their breeds of origin and the assignment of unknown individuals was tested. In the reallocation tests, the methods implemented in structure 2.2 performed better than those in geneclass 2, with 96% vs. 85% correct assignments respectively. In contrast, the methods implemented in geneclass 2 showed a greater correct assignment rate in allocating animals treated as unknowns to a reference dataset (62% vs. 51% and 80% vs. 65% in field tests 1 and 2 respectively). These results demonstrate that SNPs are suitable for the assignment of individuals to reference breeds. The results also indicate that structure 2.2 and geneclass 2 can be complementary tools to assess breed integrity and assignment. Our findings also stress the importance of a high-quality reference dataset in allocation studies. [source] |