-
A second generation genetic map for rainbow trout (Oncorhynchus mykiss)
Background:
Genetic maps characterizing the inheritance patterns of traits and markers have been developed for a wide range of species and used to study questions in biomedicine, agriculture, ecology and evolutionary biology. The status of rainbow trout genetic maps has progressed significantly over the last decade due to interest in this species in aquaculture and sport fisheries, and as a model research organism for studies related to carcinogenesis, toxicology, comparative immunology, disease ecology, physiology and nutrition. We constructed a second generation genetic map for rainbow trout using microsatellite markers to facilitate the identification of quantitative trait loci for traits affecting aquaculture production efficiency and the extraction of comparative information from the genome sequences of model fish species.
Results:
A genetic map ordering 1124 microsatellite loci spanning a sex-averaged distance of 2927.10 cM (Kosambi) and having 2.6 cM resolution was constructed by genotyping 10 parents and 150 offspring from the National Center for Cool and Cold Water Aquaculture (NCCCWA) reference family mapping panel. Microsatellite markers, representing pairs of loci resulting from an evolutionarily recent whole genome duplication event, identified 180 duplicated regions within the rainbow trout genome. Microsatellites associated with genes through expressed sequence tags or bacterial artificial chromosomes produced comparative assignments with tetraodon, zebrafish, fugu, and medaka resulting in assignments of homology for 199 loci.
Conclusions:
The second generation NCCCWA genetic map provides an increased microsatellite marker density and quantifies differences in recombination rate between the sexes in outbred populations. It has the potential to integrate with cytogenetic and other physical maps, identifying paralogous regions of the rainbow trout genome arising from the evolutionarily recent genome duplication event, and anchoring a comparative map with the zebrafish, medaka, tetraodon, and fugu genomes. This resource will facilitate the identification of genes affecting traits of interest through fine mapping and positional cloning of candidate genes.
-
Towards systems genetic analyses in barley: Integration of phenotypic, expression and genotype data into GeneNetwork
Background:
A typical genetical genomics experiment results in four separate data sets; genotype, gene expression, higher-order phenotypic data and metadata that describe the protocols, processing and the array platform. Used in concert, these data sets provide the opportunity to perform genetic analysis at a systems level. The predictive power of these experiments is largely determined by the gene expression dataset where tens of millions of data points can be generated using currently available mRNA profiling technologies. Such large, multidimensional data sets often have value beyond that extracted during their initial analysis and interpretation, particularly if conducted on widely distributed reference genetic materials. Besides quality and scale, access to the data is of primary importance as accessibility potentially allows the extraction of considerable added value from the same primary dataset by the wider research community. Although the number of genetical genomics experiments in different plant species is rapidly increasing, none to date been presented in a form that allows quick and efficient on-line testing for possible associations between genes, loci and traits of interest by an entire research community.
Description
Using a reference population of 150 recombinant doubled haploid barley lines we generated novel phenotypic, mRNA abundance and SNP-based genotyping data sets, added them to a considerable volume of 'legacy' trait data provided by several of the authors and entered them into the GeneNetwork (www.genenetwork.org). GeneNetwork is unified on-line analytical environment that enables the user to test genetic hypotheses about how component traits, such as mRNA abundance in this case, may interact to condition more complex biological phenotypes (higher-order traits). Here we describe these barley data sets and demonstrate some of the functionalities GeneNetwork provides as an easily accessible and integrated analytical environment for exploring these complex datasets.
Conclusions:
By integrating barley genotypic, phenotypic and mRNA abundance data sets directly within GeneNetwork's analytical environment we provide simple desktop community access. In this environment, a combination of correlation analysis and linkage mapping provides the potential to identify and substantiate gene targets for saturation mapping and positional cloning. By integrating datasets from an as yet unsequenced crop plant (barley) in a database that has been designed for mouse, we support the feasibility of 'sustainable programming' practice for biological data sets.
-
Heterozygosity increases microsatellite mutation rate, linking it to demographic history
Background:
Biochemical experiments in yeast suggest a possible mechanism that would cause heterozygous sites to mutate faster than equivalent homozygous sites. If such a process operates, it could undermine a key assumption at the core of population genetic theory, namely that mutation rate and population size are indpendent, because population expansion would increase heterozygosity that in turn would increase mutation rate. Here we test this hypothesis using both direct counting of microsatellite mutations in human pedigrees and an analysis of the relationship between microsatellite length and patterns of demographically-induced variation in heterozygosity.
Results:
We find that microsatellite alleles of any given length are more likely to mutate when their homologue is unusually different in length. Furthermore, microsatellite lengths in human populations do not vary randomly, but instead exhibit highly predictable trends with both distance from Africa, a surrogate measure of genome-wide heterozygosity, and modern population size. This predictability remains even after statistically controlling for non-independence due to shared ancestry among populations.
Conclusions:
Our results reveal patterns that are unexpected under classical population genetic theory, where no mechanism exists capable of linking allele length to extrinsic variables such as geography or population size. However, the predictability of microsatellite length is consistent with heterozygote instability and suggest that this has an important impact on microsatellite evolution. Whether similar processes impact on single nucleotide polymorphisms remains unclear.
-
Application of two machine learning algorithms to genetic association studies in the presence of covariates
Background:
Population-based investigations aimed at uncovering genotype-trait associations often involve high-dimensional genetic polymorphism data as well as information on multiple environmental and clinical parameters. Machine learning (ML) algorithms offer a straightforward analytic approach for selecting subsets of these inputs that are most predictive of a pre-defined trait. The performance of these algorithms, however, in the presence of covariates is not well characterized.
Results:
We investigate two approaches: Random Forests (RFs) and Multivariate Adaptive Regression Splines (MARS). Through multiple simulation studies, the performance under several underlying models is evaluated. An application to a cohort of HIV-1 infected individuals receiving anti-retroviral therapies is also provided.
Conclusions:
Consistent with more traditional regression modeling theory, our findings highlight the importance of considering the nature of underlying gene-covariate-trait relationships before applying ML algorithms, particularly when there is potential confounding or effect mediation.
-
Polymorphisms of the IGF1R Gene and Their Genetic Effects on Chicken Early Growth and Carcass Traits
Background:
The insulin-like growth factor I receptor (IGF1R) has an important effect on growth, carcass, and meat quality traits in many chicken species. However, few studies on associations of the IGF1R gene with growth and carcass traits were reported in chickens. The objectives of the present study were to study the associations of the IGF1R gene with chicken early growth and carcass traits using a neutral test, variation scan of the gene, genetic diversity observation, linkage disequilibrium analyses and association analyses.
Result
The tree of the amino acid sequence for 15 species showed that the IGF1R gene was conservative in the whole evolution of the mammalian animals and chickens. In the length of 10,818 bp, 70 single nucleotide polymorphisms were identified in the IGF1R gene. The allelic and genotypic frequency distribution, genetic diversity and linkage disequilibrium of 18 single nucleotide polymorphisms (SNP) in the Xinghua and White Recessive Rock chickens showed that the 6 of them were possibly associated with growth traits. Association analyses showed that the A17299834G SNP was significantly associated with chicken carcass body weight, eviscerated weight with giblets, eviscerated weight, body weight at day 28, 35, 56, leg length at day 56, and gained body weight during 0-4 weeks. The haplotypes of the A17307750G and A17307494G were associated with early growth traits. The haplotypes of the A17299834G and C17293932T were significantly associated with most of the early growth traits and carcass traits.
Conclusion:
There were rich polymorphisms in the chicken IGF1R gene. Several SNPs associated with chicken early growth traits and carcass traits were identified in the IGF1R gene by genetic diversity, linkage disequilibrium, and association analyses in the present study.
-
A Strategy to Setup Codominant Microsatellite Analysis for High-Resolution-Melting-Curve-Analysis (HRM)
Background:
High resolution melting curve analysis (HRM) is a technique that measures exactly the decreasing fluorescence when slowly melting DNA intercalated with fluorescence dye immediately following PCR in a one-step, closed-tube method. The shape of the melting curve depends on the GC content, length and sequence of the amplicon. Hence it is a powerful, fast and cheap method to detect SNPs and other mutations.
Results:
Here we present a strategy to set up microsatellite analysis for HRM including the correct assignment of heterozygous samples by comparative analysis and artificial mixtures of samples. The approach is demonstrated on two SSR loci of different complexity in the genus Origanum. Following this strategy all alleles of our sample sets could be classified correctly.
Conclusions:
HRM can be used in microsatellite analysis and other codominant marker systems implementing a protocol of comparative melting curve assignment with artificial mixtures of samples to overcome difficulties in correctly assigning heterozygous samples. The method is faster, more sensitive and cheaper than standard protocols for microsatellite analysis.
-
Fully Bayesian tests of neutrality using genealogical summary statistics
Background:
Many data summary statistics have been developed to detect departures from neutral expectations of evolutionary models. However questions about the neutrality of the evolution of genetic loci within natural populations remain difficult to assess. One critical cause of this difficulty is that most methods for testing neutrality make simplifying assumptions simultaneously about the mutational model and the population size model. Consequentially, rejecting the null hypothesis of neutrality under these methods could result from violations of either or both assumptions, making interpretation troublesome.
Results:
Here we harness posterior predictive simulation to exploit summary statistics of both the data and model parameters to test the goodness-of-fit of standard models of evolution. We apply the method to test the selective neutrality of molecular evolution in non-recombining gene genealogies and we demonstrate the utility of our method on four real data sets, identifying significant departures of neutrality in human influenza A virus, even after controlling for variation in population size.
Conclusions:
Importantly, by employing a full model-based Bayesian analysis, our method separates the effects of demography from the effects of selection. The method also allows multiple summary statistics to be used in concert, thus potentially increasing sensitivity. Furthermore, our method remains useful in situations where analytical expectations and variances of summary statistics are not available. This aspect has great potential for the analysis of temporally spaced data, an expanding area previously ignored for limited availability of theory and methods.
-
Canine fibroblast growth factor receptor 3 sequence is conserved across dogs of divergent skeletal size
Background:
Fibroblast growth factor receptor 3 (FGFR3) is expressed in the growth plate of endochondral bones and serves as a negative regulator of linear bone elongation. Activating mutations severely limit bone growth, resulting in dwarfism, while inactivating mutations significantly enhance bone elongation and overall skeletal size. Domesticated dogs exhibit the greatest skeletal size diversity of any species and, given the regulatory role of FGFR3 on growth plate proliferation, we asked whether sequence differences in FGFR3 could account for some of the size differences.
Methods:
All exons, the promoter region, and 60 bp of the 3' flanking region of the canine FGFR3 gene were sequenced for nine different dog breeds representing a spectrum of skeletal size. The resultant sequences were compared to the reference Boxer genome sequence.
Results:
There was no variation in sequence for any FGFR3 exons, promoter region, or 3' flanking sequence across all breeds evaluated.
Conclusion:
The results suggest that, regardless of domestication selection pressure to develop breeds having extreme differences in skeletal size, the FGFR3 gene is conserved. This implies a critical role for this gene in normal skeletal integrity and indicates that other genes account for size variability in dogs.
-
Calculating expected DNA remnants from ancient founding events in human population genetics
Background:
Recent advancements in sequencing and computational technologies have led to rapid generation and analysis of high quality genetic data. Such genetic data have achieved wide acceptance in studies of historic human population origins and admixture. However, in studies relating to small, recent admixture events, genetic factors such as historic population sizes, genetic drift, and mutation, can have pronounced effects on data reliability and utility. To address these issues we conducted genetic simulations targeting influential genetic parameters in admixed populations.
Results:
We performed a series of simulations, adjusting variable values to assess the affect of these genetic parameters on current human population studies and what these studies infer about past population structure. Final mean allele frequencies varied from 0.0005 to over 0.50, depending on the parameters.
Conclusions:
The results of the simulations illustrate that, while genetic data may be sensitive and powerful in large genetic studies, caution must be used when applying genetic information to small, recent admixture events. For some parameter sets, genetic data will not be adequate to detect historic admixture. In such cases, studies should consider anthropologic, archeological, and linguistic data where possible.
-
Homoeologous gene silencing in tissue cultured wheat callus
Background:
In contrast to diploids, most polyploid plant species, which include the hexaploid bread wheat, possess an additional layer of epigenetic complexity. Several studies have demonstrated that polyploids are affected by homoeologous gene silencing, a process in which sub-genomic genomic copies are selectively transcriptionally inactivated. This form of silencing can be tissue specific and may be linked to developmental or stress responses.
Results:
Evidence was sought as to whether the frequency of homoeologous silencing in in vitro cultured wheat callus differ from that in differentiated organs, given that disorganized cells are associated with a globally lower level of DNA methylation. Using a reverse transcription PCR (RT-PCR) single strand conformation polymorphism (SSCP) platform to detect the pattern of expression of 20 homoeologous sets of single-copy genes known to be affected by this form of silencing in the root and/or leaf, we observed no silencing in any of the wheat callus tissue tested.
Conclusion:
Our results suggest that much of the homoeologous silencing observed in differentiated tissues is probably under epigenetic control, rather than being linked to genomic instability arising from allopolyploidization. This study reinforces the notion of plasticity in the wheat epi-genome.
-
A highly polymorphic insertion in the Y-chromosome amelogenin gene can be used for evolutionary biology, population genetics and sexing in Cetacea and Artiodactyla
Background:
The early radiation of the Cetartiodactyla is complex, and unambiguous molecular characters are needed to clarify the positions of hippotamuses, camels and pigs relative to the remaining taxa (Cetacea and Ruminantia). There is also a need for informative genealogic markers for Y-chromosome population genetics as well as a sexing method applicable to all species from this group. We therefore studied the sequence variation of a partial sequence of the evolutionary conserved amelogenin gene to assess its potential use in each of these fields.Results and discussionWe report a large interstitial insertion in the Y amelogenin locus in most of the Cetartiodactyla lineages (cetaceans and ruminants). This sex-linked size polymorphism is the result of a 460–465 bp inserted element in intron 4 of the amelogenin gene of Ruminants and Cetaceans. Therefore, this polymorphism can easily be used in a sexing assay for these species.When taking into account this shared character in addition to nucleotide sequence, gene genealogy follows sex-chromosome divergence in Cetartiodactyla whereas it is more congruent with zoological history when ignoring these characters. This could be related to a loss of homology between chromosomal copies given the old age of the insertion.The 1 kbp Amel-Y amplified fragment is also characterized by high nucleotide diversity (64 polymorphic sites spanning over 1 kbp in seven haplotypes) which is greater than for other Y-chromosome sequence markers studied so far but less than the mitochondrial control region.
Conclusion:
The gender-dependent polymorphism we have identified is relevant not only for phylogenic inference within the Cetartiodactyla but also for Y-chromosome based population genetics and gender determination in cetaceans and ruminants. One single protocol can therefore be used for studies in population and evolutionary genetics, reproductive biotechnologies, and forensic science.
|