Candidate gene-based association mapping of growth and wood quality traits in Eucalyptus globulusLabill
© Thavamanikumar et al; licensee BioMed Central Ltd. 2011
Published: 13 September 2011
The identification of polymorphisms that underlie complex phenotypic traits presents new and exciting challenges to molecular genetics. Association mapping, which uses linkage disequilibrium (LD) to map trait variation with nucleotide polymorphisms, has proved suited for this purpose in outcrossed tree species. Association mapping can be undertaken either at a candidate gene level or at the whole-genome level however; until sufficiently dense marker assays are developed the candidate gene approach will remain the most effective way of dissecting complex traits in tree species. To date, this approach has led to the identification of several quantitative trait nucleotides (QTN) that associate with a variety of breeding traits including, early wood specific gravity, percentage latewood , microfibril angle , cellulose and carbon isotope discrimination .
Our research aims to identify single nucleotide polymorphism (SNP) markers using candidate gene-based association mapping that can predict growth and wood quality in Eucalyptus globulus.
Materials and methods
A Eucalyptus globulus provenance-progeny trial, planted in 1989 near Latrobe in north-central Tasmania, Australia, by Gunns Ltd, was used as the association discovery population. An eight-year-old Southern Tree Breeding Association (STBA) breeding trial growing near Frankland, Western Australia, is being used for validating marker-trait associations. Twenty functional candidate genes for wood and fiber formation, were selected for this study. SNPs were discovered by direct sequencing of PCR products from 11 to 28 trees.
The iPLEX Gold assay (Sequenom Inc.)was used to genotype 98 selected polymorphisms in up to 385 individuals from the discovery population. This subset includes individuals from eight races of E. globulus. Linkage disequilibrium between SNPs was estimated using GEVALT and Hardy-Weinberg equilibrium was estimated using FSTAT ver 126.96.36.199 respectively.
To account for genetic structure, 18 SSR markers were genotyped in all the discovery samples. Ancestry (Q) co-efficients were estimated using the model-based clustering method as implemented in STRUCTURE . A matrix of pairwise kinship coefficients (K) was calculated as described in Ritland , using the software SpaGeDi . Marker-trait associations were tested using a mixed linear model (MLM) , which accounted for both population structure and familial relatedness using TASSEL version 2.0.1 .
At a 5% significance level, only 4 SNPs deviated from Hardy Weinberg expectations in more than one race. With all races pooled, LD between SNPs was very low with only 1.8% of the pairwise comparisons having r2 values greater than 0.33. Only where r2 is greater than 0.33 is there sufficiently strong LD to be useful for association mapping . Only one pair of SNPs, EgCSA3_4186 and EgMYB2_1380 between genes was found to be in LD (r2 > 0.5). When LD was computed within races 1.3 to 2.7% of the pairwise comparison had r2 values greater than 0.33.
Of the 98 polymorphisms tested against 12 traits, 33 associated significantly (P < 0.05) with one or more traits giving a total of 62 associations. Individual polymorphisms explained between 0.9 and 3.8% of the phenotypic variation observed. Marker-trait associations found in the discovery population are currently being validated by testing their consistency using the validation population.
Candidate gene based association mapping studies are a useful means of dissecting complex quantitative traits in species like E. globulus with low LD and high nucleotide diversity. This study, like other tree association mapping studies, shows that the percentage of phenotypic variation explained by a single polymorphism will often be small. This is not unexpected because of the complex nature of most wood quality traits. The small proportion of the phenotypic variation explained so far using association mapping should not detract from its future use in tree breeding, since genotyping costs are expected to fall while throughput increases, thereby facilitating larger scale association mapping efforts in the near future.
- Gonzalez-Martinez SC, Wheeler NC, Ersoz E, Nelson CD, Neale DB: Association Genetics in Pinus taeda L. I. Wood Property Traits. Genetics. 2007, 175: 399-409.PubMed CentralView ArticlePubMedGoogle Scholar
- Thumma BR, Nolan MR, Evans R, Moran GF: Polymorphisms in cinnamoyl CoA reductase (CCR) are associated with variation in microfibril angle in Eucalyptus spp. Genetics. 2005, 171: 1257-1265. 10.1534/genetics.105.042028.PubMed CentralView ArticlePubMedGoogle Scholar
- Thumma BR, Matheson BA, Zhang D, Meeske C, Meder R, Downes GM, Southerton SG: Identification of a Cis-Acting Regulatory Polymorphism in a Eucalypt COBRA-Like Gene Affecting Cellulose Content. Genetics. 2009, 183: 1153-1164. 10.1534/genetics.109.106591.PubMed CentralView ArticlePubMedGoogle Scholar
- Gonzalez-Martinez SC, Huber D, Ersoz E, Davis JM, Neale DB: Association genetics in Pinus taeda L. II. Carbon isotope discrimination. Heredity. 2008, 101: 19-26.View ArticlePubMedGoogle Scholar
- Davidovich O, Kimmel G, Shamir R: GEVALT: An integrated software tool for genotype analysis. Bmc Bioinformatics. 2007, 8:Google Scholar
- Goudet J: FSTAT, a program to estimate and test gene diversities and fixation indices (version 188.8.131.52). 2002, Available from http://www.2unilch/popgen/softwares/fstathtmGoogle Scholar
- Pritchard JK, Stephens M, Donnelly P: Inference of Population Structure Using Multilocus Genotype Data. Genetics. 2000, 155: 945-959.PubMed CentralPubMedGoogle Scholar
- Ritland K: Estimators for pairwise relatedness and individual inbreeding coefficients. Genetical Research. 1996, 67: 175-185. 10.1017/S0016672300033620.View ArticleGoogle Scholar
- Hardy OJ, Vekemans X: SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Molecular Ecology Notes. 2002, 2: 618-620. 10.1046/j.1471-8286.2002.00305.x.View ArticleGoogle Scholar
- Yu JM, Pressoir G, Briggs WH, Bi IV, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, et al: A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nature Genetics. 2006, 38: 203-208. 10.1038/ng1702.View ArticlePubMedGoogle Scholar
- Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ram-doss Y, Buckler ES: TASSEL: Software for Association Mapping of Complex Traits in Diverse Samples. Bioinformatics. 2007, 23: 2633-2635. 10.1093/bioinformatics/btm308.View ArticlePubMedGoogle Scholar
- Ardlie KG, Kruglyak L, Seielstad M: Patterns of linkage disequilibrium in the human genome. Nat Rev Genet. 2002, 3: 299-309. 10.1038/nrg777.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.