Five QTL hotspots for yield in short rotation coppice bioenergy poplar: The Poplar Biomass Loci
BMC Plant Biology volume 9, Article number: 23 (2009)
Concern over land use for non-food bioenergy crops requires breeding programmes that focus on producing biomass on the minimum amount of land that is economically-viable. To achieve this, the maximum potential yield per hectare is a key target for improvement. For long lived tree species, such as poplar, this requires an understanding of the traits that contribute to biomass production and their genetic control. An important aspect of this for long lived plants is an understanding of genetic interactions at different developmental stages, i.e. how genes or genetic regions impact on yield over time.
QTL mapping identified regions of genetic control for biomass yield. We mapped consistent QTL across multiple coppice cycles and identified five robust QTL hotspots on linkage groups III, IV, X, XIV and XIX, calling these 'Poplar Biomass Loci' (PBL 1–5). In total 20% of the variation in final harvest biomass yield was explained by mapped QTL. We also investigated the genetic correlations between yield related traits to identify 'early diagnostic' indicators of yield showing that early biomass was a reasonable predictor of coppice yield and that leaf size, cell number and stem and sylleptic branch number were also valuable traits.
These findings provide insight into the genetic control of biomass production and correlation to 'early diagnostic' traits determining yield in poplar SRC for bioenergy. QTL hotspots serve as useful targets for directed breeding for improved biomass productivity that may also be relevant across additional poplar hybrids.
There is currently a new wave of interest in the use of biomass as a renewable fuel source, both for heat and electricity production as well as for liquid transport fuels such as bioethanol, from biochemical fermentation or bio-oil from thermo-chemical conversion. This is particularly true for second generation lignocellulosic crops that are unlikely to compete with food crops on agricultural land. Irrespective of end use, yields of current second generation crops remain a limiting factor to commercial establishment, since they are largely unimproved, with current commercial yields falling far short of both theoretical and experimental yield maxima . Fast-growing tree species such as poplar (Populus) and willow (Salix) that can be grown as short-rotation coppice (SRC) represent one of the most appealing sources of renewable biomass feedstock  and have significant yield potential . SRC crops are easy to establish, provide a fuel source that is multi-functional, as well as offering secondary benefits such as low nutrient input, good energy balance, bioremediation abilities, and increased biodiversity [4, 5]. However, to date, breeding efforts and scientific studies have concentrated on single-stem growth of poplars and there is a need to identify traits and genomic loci as targets for the development of improved SRC biomass-yielding genotypes.
Woody biomass yield is a highly complex trait as it represents the integrated and combined result of many other complex traits, each themselves under polygenic control. In order to inform breeding for biomass improvement, it is therefore important to understand the traits that contribute to biomass production and to locate the loci involved in the control of those trait components before then moving on to identify the specific desirable allelic variants. We have previously performed a multivariate analysis of phenotypic traits and modelled their contribution to biomass production in the population used here .
In a long-lived species such as poplar, it is also essential to understand how biomass production changes with maturity (or in the case of SRC, the individual stools and the entire stand) with understanding being required at the genetic and physiological/morphological level. A number of studies have reported QTL in the population used for this study at a single time point, for single stem plants usually during early phases of growth [7, 8]. However, several studies report different QTL at different time points or plant age [9–11]. Interpretation of such results can be ambiguous as to whether these are true differential effects with time, or statistical issues resulting from factors such as low sample size or replication. There is currently no available QTL information on traits related to coppice growth.
In the present study, QTL mapping and genetic correlations were used to examine interactions and temporal relationships between biomass-associated traits and to identify key loci controlling those traits in SRC.
There was nearly a 30 fold variation in biomass yield for the final biomiass (CC2-4) harvest with genotypic mean values ranging from 0.58 Kg to 16.3 Kg. The degree of variation for the CC1-1 harvest was far greater with nearly a 100 fold range in yield and the rank order of genotypes differed between the two. Biomass at both CC1-1 and CC2-4 was skewed towards lower values. The largest variation in any trait for the CC2-4 data was seen for total basal diameter where there was >150 fold difference between the genotypic mean min and max values. Height showed the least variation with a minimum value of 0.98 m and a maximum of 6.99 m. These trends were similar for the CC1-1 data. The number of coppice stems varied from 1 to 24 and there was considerable variance in the consistency of the diameter of each stem, with some genotypes having a clearly identified leader and others having many stems of more uniform size.
Physiological trait correlations
We have previously reported heritability values and a multivariate analysis of trait contributions to biomass yield for coppice cycle 1 (CC1. See Table 1 and . See  for details of Leaf Plastochron Index.). The data in  are represented here as SS (Single Stem equivalent to days after planting, DAP) and CC1 (Coppice Cycle 1 equivalent to days after coppice, DAC). Here we present the results of QTL analysis for the data presented in  with the addition of a final biomass harvest following an extra coppice cycle of 4 years (CC2). An overview of the coppice cycle is shown in Figure 1.
Strong phenotypic correlations were found between biomass, height and diameter traits, particularly within the same year of measurement (Figure 2). Biomass accumulation in the first and subsequent years remained a reasonable predictor of biomass yield through to CC2-4 (Biomass 1), suggesting that early screening for elite genotypes would be a reliable indicator of sustained productivity. Weaker yet interesting correlations were also found. For example, cell number has been seen to be a more important determinant of biomass, height and leaf area than is cell area . As well as individual leaf area showing significant correlations to biomass traits, leaf number was also important (both having positive correlations). The number of stems was consistently an important determinant of biomass accumulation within this experimental design.
Genetic correlations (Table 2, Additional file 1) showed a similar pattern with strongest correlations being between biomass, height and stem diameter within years. However, traits scored in the first and second year showed low genetic correlations to those of the final harvest measurements (e.g. r = 0.187 +/- 0.05 between biomass in CC1-1 and biomass in CC2-4). Genetic correlations also showed leaf area to be moderately correlated to height1, diameter and sylleptic branch number, stem number, height3 and stem number 2 (Additional file 1). Two-way ANOVA (data not shown) showed highly significant time, and genotype × time interactions for height, diameter, number of stems and biomass (p < 0.001 in all cases).
In total 207 QTL were mapped (Table 3, Additional file 2) with an average of 5.6 QTL mapped per phenotypic trait and 9.4 QTL per linkage group (LG). The number of QTL mapped per LG varied greatly (standard deviation of 6.1) with apparent QTL 'hot spots' on LGs III and X (22 and 21 QTL respectively) with other LGs having very few mapped QTL. There were additional clusters of QTL on LGs IV, VIIIa, IX, XIV, and XIX (Figure 3) and LG II, XIII and XVIII narrowly missed having hotspots declared using the sliding window criteria. QTL mapped explained a mean 4% of phenotypic trait variance (% Vp) with a maximum of 9.8% for any single QTL (height4; Additional file 2) and a maximum 41.5 total % Vp for a single trait (Plastochron index; Table 4, Additional file 2). Mapped QTL explained a mean 3% Vp in biomass yield across the two biomass harvests.
Biomass, height, stem volume and diameter
A number of QTL for direct biomass related traits (i.e. height and diameter) were mapped consistently in multiple years of growth with consistent % Vp and direction of maternal and paternal effects (Table 3). In some cases, QTL for diameter and height both co-located with QTL for biomass and in other cases only co-location of biomass to height or diameter was observed, allowing inference to be drawn about the underlying architecture of these QTL.
Consistent QTL for diameter traits (basal diameter, basal area, and diameter of leader) were located on LGs I, III, X, XIII, XIV, XV, and XIX. There was a general tendency towards the paternal parent (P. deltoides) contributing a positive effect and the maternal parent (P. trichocarpa) a negative one. Those QTL on LGs X and XIV co-located to QTL for biomass yield. In the case of LG X, the diameter QTL co-located to QTL for height (Figure 4). In contrast, the biomass QTL on LG XIV was influenced only by diameter. Other QTL appeared to be specific to certain years. For example QTL specific to CC2-1 were mapped on LGs V, IX, and XVII and to CC2-4 on XVIII.
Height QTL were consistently mapped to LGs II, III, IV, and X. The QTL on LGs II and X explained relatively high % Vp compared to other QTL within this study (9.8 and 8.7% CC2-4) and in both cases, explained increasing % Vp over time. However, % Vp was reduced when early height measurements were used as covariates for mapping QTL of later height measurements within the same coppice cycle.
Stem number and sylleptic branches
For the three measures of stem number, mapped QTL explained between 23 and 38% Vp. A significant and positive correlation between stem number and biomass yield as well as height and diameter was found. Only one QTL on LG XIV was mapped consistently for these traits across all years. For SS measurements (the only year in which sylleptic branches were examined) there were a number of cases where QTL for sylleptics co-located to QTL for stem number, suggesting that these two traits may be under common genetic control. Co-locating QTL can be seen on LGs III, V, VIII, XIII, and XIV. For CC2-4 data, mapped QTL explained a total of 38% Vp for number of stems with two QTL explaining > 9% Vp (LG III and XVIII).
Leaf area and cell traits
QTL explaining between 18 and 21% Vp in leaf area were mapped with an individual maximum % Vp of 7.5 for a QTL on LG VI. No co-locations between QTL for cell area and leaf area were found but co-locating QTL for leaf area and cell number were found on LGs IX, XVII, and XIX. In total QTL explaining 21% Vp for cell number were mapped. In contrast, very few QTL for cell area were mapped, explaining a total 8.9% Vp. No co-location of QTL for leaf extension rate, cell number and/or leaf area was seen. Co-locations between leaf area related and biomass related QTL were found on LGs II, III, IV, VII, IX, XIX. There was a co-locating QTL for cell number and specific leaf area on LG II.
QTL explaining 30.8% Vp for spring bud flush were mapped. The % Vp values reported here for bud flush are considerably smaller than those reported by  but as  point out, the values in  are likely over-estimates (due to the Beavis effect, see  for a mathematical explanation) and the values reported here are more reliable due to the higher number of genotypes used for our QTL mapping. However, our values are still likely over-estimates .
A number of co-locations were found between leaf number, leaf production and Plastochron Index (PI), however these are to be expected as the traits are highly related measures. In such a case, co-location can be considered to indicate that the QTL are reliable [16–20]. Co-location between biomass yield and PI or leaf number was found on LGs VII, XIX, and X. Co-location between bud burst and biomass was found on LG VIIIa.
Candidate loci for biomass yield in poplar
We identified clusters of co-locating QTL on LGs II, VIIIa, X, XIV, and XIX that we have termed Poplar Biomass Loci (PBL; Figure 3, 4). All but the cluster on LG II and VIIIa were also identified as hotspots for co-location using the sliding window over-representation approach. In order to be classified as a PBL we used the arbitrary set of criteria that QTL clusters had to have at least one QTL explaining > 5% Vp, contain a QTL for biomass yield and have consistent maternal and paternal effects. Of these, PBL-3 (Figure 4) on LG X is perhaps the most interesting and we therefore examined this PBL in greater detail. PBL-3 contains co-locating QTL for height and diameter in multiple years in addition to QTL for biomass (CC1-2), stem volume, sylleptic branch number, and specific leaf area with the majority of QTL explaining > 5% Vp. Although other loci could also be termed PBL, we feel that these are currently the most important and well defined.
Genes within QTL regions
The release of the poplar genome sequence  and the use of sequence based markers (i.e. SSRs) for map construction allowed us to identify gene models between flanking SSR markers on the genetic map for Family 331 (Figure 4). On average, there were ~600 gene models between flanking SSR markers (data not shown).
Here we report results of QTL mapping in a partially inbred F2 population of out-breeding poplar, Family 331, grown as SRC. In general we mapped QTL explaining a relatively large % Vp for numerous traits associated with biomass yield (Figures 3 and 4, Table 2 and Additional file 2 see  for an in-depth multivariate analysis of SS and CC1 data), typically with few QTL explaining the largest percentage of trait variation (Additional file 1). Previous studies conducted in the USA on this, and a related back-cross population, grown as both single stem [7, 8, 22–25] and SRC  and at three sites across Europe  reported similar results. This is the first study in Populus to report the genetic control of traits important to growth as short rotation coppice. Biomass at both CC1-1 and CC2-4 was skewed towards lower values, a result also reported in . This is likely the outcome of inbreeding depression and the resultant high genetic load associated with an inbred F2 population derived from out-breeding species . The number of coppice shoots varied from 1 to 24 and there was considerable variance in the consistency of the diameter of each shoot, with some genotypes having a clearly identified leader and others having many shoots of more uniform size. This trait is likely to be tightly related to the strength of apical dominance. Stem number may also be affected by environmental factors such as planting density. In this study, the trees were grown 1 m apart, towards the lower planting density for commercial field trials, so the amount of variation displayed for this trait may differ for more densely planted trials. Sylleptic branching is also a function of the strength of apical dominance and a significant correlation was found between sylleptic branch number and stem number 1 although no co-locating QTL for the two traits were observed. The relatively low genetic correlations of stem traits between CC1-1 and CC2-4 and phenotypic biomass traits between years suggest differential genetic control of these traits with tree age, although correlations between years for genotypes at the phenotypic extremes were more consistent. The decreasing genetic correlations between height and biomass between years are in agreement with the theory that biomass produced early on is largely related to plant height, but as the tree ages, increased biomass appears to be due to increased girth .
A number of QTL were mapped consistently across different years, in some cases being present for all datasets recorded. From a breeding perspective these may represent the most important targets for directed breeding, as QTL mapped consistently across years and multiple environments represent those that are least likely to be affected by GxE interactions. In the current experiment, the population was grown for two coppice cycles and therefore QTL mapped for the CC2-4 data and common to other years are good targets for genotype improvement in this genetic cross.
QTL co-location: Five key Poplar Biomass Loci
At multiple positions we identified clusters of QTL, often for correlated and allometrically linked traits such as height or diameter and biomass and leaf area. In particular, we found clusters of QTL on LGs II, III, IVa, VIIIa, X, XIV, XVIII, and XIX, with the clusters on II and X being particularly interesting. In the case of LG X we were previously able to show that this is equivalent to LG J from previous mapping work presented in papers by Bradshaw et al. , where QTL for biomass were also identified. In all work on this population in both the USA , in this study, and in  LG X has been universally mapped in relation to biomass yield suggesting that this is a highly robust QTL with consistent effect across environments and growth practices. LG XIX is the same as LG O in the work by Bradshaw and here again, QTL for similar traits (stem number) were identified.
From both the biological and breeding perspective it is of interest to examine positions with co-locating QTL while simultaneously considering phenotypic correlation values (or VIP scores in respect to ) and genetic correlations. We identified many genomic loci with co-locating QTL. In some cases these appear to be loci affecting the control of many traits and in others, they are specific to a particular trait and its most allometrically related or closely correlated traits (e.g. height and biomass).
LG II (PBL-1) is an example of co-locating QTL that appear largely specific to height. QTL for height were mapped in all cases where height was recorded for CC data. This cluster includes QTL for CC2-4 biomass, stem extension rate as well as a QTL for leaf extension rate, which is close enough to suggest co-location. There are various different interpretations of what the underlying causative mechanism of this QTL may be: As the locus appears to affect both stem and leaf extension rate, it is possible that the rate of cell division or expansion (or both) is rate-limiting; however, an alternative hypothesis is that the increased extension rate of leaves results in more rapid development from sink to source. Leaf area (particularly on the terminal shoot) is more tightly correlated to height than diameter [22, 30] and so more rapidly maturing leaves would lead to a more rapid increase in height extension.
LG VIIIa contains co-locating QTL for sylleptic number and the number of coppice stems in addition to bud flush and height (height4). Speculating as to a causative mechanistic link between all of these traits is difficult and trait and genetic correlations do not suggest a link between them. It is therefore possible that this cluster of QTL represents more than one gene but that either our mapping resolution is insufficient to distinguish the two adequately or they are in linkage and are being co-inherited as a single locus.
LG X (PBL-3; Figure 4) contains multiple co-locating QTL for both height and diameter, with many explaining high % Vp within the context of this experiment. It is possible that this represents the location of a gene affecting the activity of the cambial meristem region and we are currently examining evidence from literature sources concerning gene expression, mutational/over-expression studies and genes of known biological function in xylem formation and cambial activity to identify likely candidate genes. LG XIV (PBL-4; Figure 3) represents a QTL cluster more specific to diameter. The presence of QTL for stem number in CC1-1 and CC2-4 suggest that the causative mechanism for this QTL may well be an increase in diameter along with increased stem number. This relationship is not unidirectional as there was clear segregation in the F2 for both traits with differences in the genotypic rank order for both traits.
LG XIX (PBL-5; Figure 3) contains an interesting cluster of co-locating QTL for basal area, stem number, leaf area and cell number and we are particularly interested in the observed correlations between cell number on the abaxial leaf epidermal surface with both leaf area and biomass traits, a result also found for various willow genotypes . The chromosomal region between the flanking SSR markers currently contains only 76 gene models, and none of those with informative annotations can easily be ascribed a role in any of these traits.
Although each trait within a QTL hotspot might only contribute a small positive effect on biomass yield, the co-location of multiple traits indicates a common genetic control mechanism (i.e. pleiotropy) suggesting that selection for the beneficial allele at that locus will result in a cumulative increase in biomass due to the integrative effects of the individually small, positive contributions of the various traits. Where such hot spots contain QTL for traits that are not tightly allometrically linked, it is likely that they represent trans acting QTL (most likely transcription factors) where the effect of alterations in regulation or structural characteristics would be expected to have smaller-scale effects but potentially on many traits. In contrast, cis acting QTL are more likely to have large-scale effects but on a single trait or a far more limited set of highly related traits. Pleiotropic loci may, however, result from tight linkage between genes within the same chromosomal region. Examination of modes of action may help draw inferences but further dissection of such loci is required. In some cases it may be implausible or at least highly unlikely that two allometrically related traits are influenced by the same gene. Careful consideration of such possibilities is especially important if results from inter-specific crosses are to be used to direct breeding in other related species; the same allometric relationship, or link between allometry and genetic control, may not exist in alternative genetic backgrounds that have been exposed to different selection pressures and certain QTL may result from unique epistatic interactions created within the inter-specific cross. However it is interesting to note that in another poplar F1 cross grown at two contrasting sites, LG X and XIX were also identified as linkage groups where QTL for biomass related traits were apparent .
Identifying genes underlying QTL
A major challenge in bridging the gap between QTL and the underlying, causative DNA polymorphism is the lack of resolution associated with QTL mapping, especially in forest tree species where multi-generation inbred populations cannot be developed. It is for this reason that it has recently been proposed that QTL mapping be used as a pre-screening method to direct subsequent fine mapping in a natural population (i.e. association mapping), where historic recombination is utilised to offer far greater mapping resolution – in the case of poplar down to the individual gene level . Even such an integrated approach is not simple: in the current study we found a mean of just under 600 genes within our QTL hotspots. As linkage disequilibrium breaks down very rapidly in natural populations of poplar  this would require developing SNP markers for all of those genes. Even then, the assumption is that linkage exists to the causative polymorphism within the coding region of the gene, which may not be the case where the causative polymorphism lies within the upstream or downstream regions of a gene. Certain factors may improve this situation. Street and co-workers  proposed that candidate genes can be selected by identifying genes with differential expression between genotypes at the extremes of a phenotypic trait distribution. Here, the assumption is that these genotypes are fixed for the alleles contributing positive and negative effects on the phenotype, and additionally that gene expression plays an important role in determining phenotype. Alternatively the list of genes within a QTL hotspot (or individual QTL CI) can be examined and a 'short list' determined based on available annotation information. Although we examined the functional annotation of genes in identified QTL hotspots, with many hundreds of genes exisiting in each, this is not a viable exercise. This is especially true considering the complexity of, and number of contributing traits to, biomass production. We are therefore undertaking work to examine differences in gene expression between the population extremes for biomass yield.
We have identified QTL mapped consistently across multiple coppice cycles in poplar grown as SRC and have defined the five most robust QTL clusters as Poplar Biomass Loci 1–5. In total, 20% of the variation in final harvest biomass yield was explained by mapped QTL. These findings both inform our understanding of the complex and integrative process of biomass yield production as well as providing a short list of the most suitable genomic loci that should be considered in targeted breeding programs using this genetic cross.
The inbred F2 population was created from a cross between a female P. trichocarpa (clone 93–968 from western Washington, USA) and a male P. deltoides (clone ILL-129 from central Illinois, USA). Two siblings, 53–242 (female) and 53–246 (male), from the resulting F1 family (Family 53; ) were crossed in 1988 to form an F2 family of 90 genotypes and again in 1990 to obtain an additional 320 genotypes (Family 331; [26, 29]. This pedigree was imported into the UK in 1999.
A replicated field trial (n = 3 planted in a randomised block design of spacing 1 × 1 m) was conducted in the UK at the Forestry Commission field site, Headley, U.K. (51°07' N, 0°50'] W). The trial was established from 25 cm un-rooted hardwood cuttings of 93–968, ILL-129, the two F1 parents and 300 F2 genotypes.
Cuttings were derived from a stool bed at the University of Washington, Seattle, USA. Planting details have been described previously . Cuttings were planted during spring 2000.
The Single Stem (SS) plants were cut back to initiate the first Coppice Cycle (CC1) on 11th January 2001. CC1 was harvested after one year of growth (CC1-1: year 1 of CC1) in winter 2001–2 to initiate a second coppice cycle (CC2). CC2 was harvested in winter 2005–6 (CC2-4: year 4 of CC2). Only two of the three replicate blocks were measured for CC2 and so all final harvest measurements and QTL mapping are based on n = 2 reps. The date of measurement and replication for all traits is indicated in Table 1.
Details of all traits measured prior to 2006 can be found in . In 2006, end of coppice cycle biomass, total basal diameter, the number of coppice stems and the height of the leader (the largest coppice stem) were recorded as detailed for previous growing seasons in .
Micro-environmental effects were minimised using Papadakis spatial correction , based on a 7 × 3 grid on individual data implemented as a set of custom-written functions (pers. comm. Bastien C, INRA Oreans, France) in R . ANOVA were carried out for all traits in R using the 'aov' function with the following model:
where μ is the general mean, Bi is the effect of block, considered as fixed, and Gj is the effect of genotype j, considered as random.
Phenotypic correlations between traits were tested for using Spearman's Rank correlation and hierarchical clustering was then performed on the trait correlation matrix after removal of insignificant correlations and traits with no significant correlations to any other traits using an R script.
Genetic correlations between traits (rg) were calculated from the variance-covariance matrices obtained from the multivariate ANOVA as rg = Cov G (x, y)/√[σ2 G (x) σ2 G (y)], where Cov G (x, y) is genetic covariance between traits x and y, estimated by equating the mean co-products with their expected values according to the Henderson III procedure .
In addition, traits that were measured at different time points were analysed for genotype by age interaction by carrying out a two way ANOVA in R using the 'aov' function with the following model:
where μ is the general mean, Ai is the effect of plant age is considered as fixed, and Gj is the effect of genotype j considered as random.
All genotypes used for QTL mapping were full-sib progeny (referred to here as the F2 generation) of Family 331. QTL were mapped using the freely available web-based program QTLExpress . The out-breeding module of the program was used. Permutation testing implemented in QTLExpress was used to establish the critical F value for declaring a QTL present (1000 permutation, see ). QTL confidence intervals (CIs) were calculated using a two F drop-off (the cM distance taken for the peak F value to drop by two). The genetic linkage map used was produced by Tuskan et al. (pers. comm.) and consisted of 91 SSR markers genotyped on 350 of the full-sib progeny and 92 fully informative Amplified Fragment Length Polymorphisms (AFLPs) genotyped on 165 genotypes of the progeny. The resulting genetic map consists of 22 Linkage Groups. Where more than one LG has been assigned to a chromosome, they are numbered with the LG number and a letter, with letter order indicating the order of LGs along the chromosome. SSR primer sequences  were located on the genome sequence to align the genetic and physical maps and to provide correct orientation of linkage groups (i.e. 3' to 5'). The location information of SSR markers was used to generate gene lists of all genes between flanking SSR markers of a subset of QTL.
QTL figures were produced using a custom-written R package developed by ourselves and available on request. This package implements a permutation test and sliding window approach to identify regions of the genetic map over-represented with co-locating QTL . For each permutation, QTL are randomly shuffled across the genome and a sliding window of 5 cM is then used to count the number of QTL in each window region. The window was advanced in 1 cM steps across the entire genetic map and the maximum number of QTL in a window region was recorded per permutation. The permutation maximum count results were then sorted and used to determine the critical value at a α0.05 significance level (the 950th value for 1000 permutations). The sliding window was then applied to the original QTL data to identify regions with more than the critical number of co-locating QTL. The critical number for our data was five (1000 permutations). Identified hotspots should be viewed with caution where traits have been measured repeatedly or where derived traits are calculated (such as stem volume) as these can artificially inflate the chances of co-location occurring.
Linking the physical sequence to the genetic map
In order to extract lists of genes within QTL regions, the amplified products of primer sequences of SSR markers used for QTL mapping were located in the genome sequence using a local BLAT server. Primers returning more than one potential amplification product were excluded and any primers amplifying products on scaffolds (un-anchored sections of the genome sequence that cannot currently be assigned to LGs) were excluded. For the purposes of extracting genes underlying QTL regions we produced R functions that first subset the genetic map to only those SSR markers that were located on the genome sequence. QTL regions were then defined by taking the flanking SSR markers from the location of the QTL and subsequently extracting a list of all genes between the genomic coordinates of the SSR markers. This approach typically led to extension of the QTL region beyond that of the QTL mapping confidence interval but occasionally led to a smaller region. We considered other approaches such as converting between cM and bp but such approaches are complicated by variable recombination frequencies both between and within linkage groups (data not shown).
Figure 2 was produced using Cytoscape . Spearman's Rank correlation values were used as edge weights and trait names as nodes. The data matrix for use in Cytoscape was created using a custom R script and the igraph R package .
Quantitative Trait Loci
Coppice Cycle 1
Coppice Cycle 2
Coppice Cycle 2 Year 4
Poplar Biomass Loci
- % Vp:
percentage variance explained
Simple Sequence Repeat
Short Rotation Coppice
Coppice Cycle 1 Year 1
Water Use Efficiency
Single Nucleotide Polymorphism
Nonhebel S: Energy yields in intensive and extensive biomass production systems. Biomass and Bioenergy. 2002, 22: 159-167. 10.1016/S0961-9534(01)00071-X.
Tuskan GA: Short-rotation woody crop supply systems in the United States: What do we know and what do we need to know?. Biomass & Bioenergy. 1998, 14: 307-315. 10.1016/S0961-9534(97)10065-4.
Aylott M, Casella E, Tubby I, Street NR, Smith P, Taylor G: Yield and spatial supply of bioenergy poplar and willow short-rotation coppice in the UK. New Phytologist. 2008, 178: 358-370. 10.1111/j.1469-8137.2008.02396.x.
Rowe R, Street N, Taylor G: Identifying potential environmental impacts of large-scale deployment of dedicated bioenergy crops in the UK. Renewable and Sustainable Energy Reviews. 2008.
Sims R, Hastings A, Schlamadinger B, Taylor G, Smith P: Energy crops: current status and future prospects. Global Change Biology. 2006, 12: 2054-2076. 10.1111/j.1365-2486.2006.01163.x.
Rae AM, Robinson KM, Street NR, Taylor G: Morphological and physiological traits in uencing biomass productivity in short-rotation coppice poplar. Canadian Journal of Forest Research-Revue Canadienne De Recherche Forestiere. 2004, 34: 1488-1498. 10.1139/x04-033.
Bradshaw HD, Stettler RF: Molecular-Genetics of Growth and Development in Populus .4. Mapping Qtls with Large Effects on Growth, Form, and Phenology Traits in a Forest Tree. Genetics. 1995, 139: 963-973.
Wu R, Bradshaw HD, Stettler RF: Molecular genetics of growth and development in Populus (Salicaceae) .5. Mapping quantitative trait loci affecting leaf variation. American Journal of Botany. 1997, 84: 143-153. 10.2307/2446076.
Plomion C, Durel , O'Malley DM: Genetic dissection of height in maritime pine seedlings raised under accelerated growth conditions. Theoretical and Applied Genetics. 1996, 93: 849-858. 10.1007/BF00224085.
Verhaegen D, Plomion C, Gion JM, Poitel M, Costa P, Kremer A: Quantitative trait dissection analysis in Eucalyptus using RAPD markers: 1. Detection of QTL in interspecific hybrid progeny, stability of QTL expression across different ages. Theoretical and Applied Genetics. 1997, 95: 597-608. 10.1007/s001220050601.
Emebiri LC, Devey ME, Matheson AC, Slee MU: Age-related changes in the expression of QTLs for growth in radiata pine seedlings. Theoretical and Applied Genetics. 1998, 97: 1053-1061. 10.1007/s001220050991.
Erickson RO, Michelini FJ: The Plastochron Index. American Journal of Botany. 1957, 44: 297-305. 10.2307/2438380.
Frewen BE, Chen THH, Howe GT, Davis J, Rohde A, Boerjan W, Bradshaw HD: Quantitative trait loci and candidate gene mapping of bud set and bud flush in Populus. Genetics. 2000, 154: 837-845.
Xu S: Theoretical Basis of the Beavis Effect. Genetics. 2003, 165: 2259-2268.
Utz H, Melchinger A, Schon C: Bias and Sampling Error of the Estimated Proportion of Genotypic Variance Explained by Quantitative Trait Loci Determined From Experimental Data in Maize Using Cross Validation and Validation With Independent Samples. Genetics. 2000, 154: 1836-1849.
Korol AB, Ronin YI, Kirzhner VM: Interval mapping of quantitative trait loci employing correlated trait complexes. Genetics. 1995, 140: 1137-1147.
Korol AB, Ronin YI, Tadmor Y, Bar-Zur A, Kirzchner VM, Nevo E: Estimating variance effect of QTL: an important prospect to increase the resolution power of interval maping. Genetic Research. 1996, 67: 187-194.
Korol AB, Ronin YI, Kirzhner VM: Linkage between quantitative trait loci and marker loci-resolution power of 3 statistical approaches in single marker analysis. Biometrics. 1996, 52: 426-441. 10.2307/2532884.
Korol AB, Ronin YI, Nevo E, Hayes PM: Multi-interval mapping of correlated trait complexes. Heredity. 1998, 80: 273-284. 10.1046/j.1365-2540.1998.00253.x.
Korol AB, Ronin YI, Itskovich MA, Peng J, Nevo E: Enhanced efficiency of quantitative trait loci mapping analysis based on multivariate complexes of quantitative traits. Genetics. 2001, 157: 1789-1803.
Tuskan GA, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, Putnam N, Ralph S, Rombauts S, Salamov A, Schein J, Sterck L, Aerts A, Bhalerao RR, Bhalerao RP, Blaudez D, Boerjan W, Brun A, Brunner A, Busov V, Campbell M, Carlson J, Chalot M, Chapman J, Chen GL, Cooper D, Coutinho PM, Couturier J, Covert S, Cronk Q, Cunningham R, Davis J, Degroeve S, Dejardin A, Depamphilis C, Detter J, Dirks B, Dubchak I, Duplessis S, Ehlting J, Ellis B, Gendler K, Goodstein D, Gribskov M, Grimwood J, Groover A, Gunter L, Hamberger B, Heinze B, Helariutta Y, Henrissat B, Holligan D, Holt R, Huang W, Islam-Faridi N, Jones S, Jones-Rhoades M, Jorgensen R, Joshi C, Kangasjarvi J, Karlsson J, Kelleher C, Kirkpatrick R, Kirst M, Kohler A, Kalluri U, Larimer F, Leebens-Mack J, Leple JC, Locascio P, Lou Y, Lucas S, Martin F, Montanini B, Napoli C, Nelson DR, Nelson C, Nieminen K, Nilsson O, Pereda V, Peter G, Philippe R, Pilate G, Poliakov A, Razumovskaya J, Richardson P, Rinaldi C, Ritland K, Rouze P, Ryaboy D, Schmutz J, Schrader J, Segerman B, Shin H, Siddiqui A, Sterky F, Terry A, Tsai CJ, Uberbacher E, Unneberg P, Vahala J, Wall K, Wessler S, Yang G, Yin T, Douglas C, Marra M, Sandberg G, Peer Van de Y, Rokhsar D: The Genome of Black Cottonwood, Populus trichocarpa (Torr. & Gray). Science. 2006, 313: 1596-1604. 10.1126/science.1128691.
Wu R, Stettler R: Quantitative genetics of growth and development in Populus. III. Phenotypic plasticity of crown structure and function. Heredity. 1998, 81: 299-310. 10.1046/j.1365-2540.1998.00397.x.
Wu RL: Genetic mapping of QTLs affecting tree growth and architecture in Populus: implication for ideotype breeding. Theoretical and Applied Genetics. 1998, 96: 447-457. 10.1007/s001220050761.
Wu RL, Ma CX, Zhu J, Casella G: Mapping epigenetic quantitative trait loci (QTL) altering a developmental trajectory. Genome. 2002, 45: 28-33. 10.1139/g01-118.
Wullschleger S, Yin T, Difazio S, Tschaplinski T, Gunter L, Davis M, Tuskan G: Phenotypic variation in growth and biomass distribution for two advanced-generation pedigrees of hybrid poplar. Canadian Journal of Forest Research. 2005, 35: 1779-1789. 10.1139/x05-101.
Wu R, Bradshaw HD, Stettler RF: Developmental quantitative genetics of growth in Populus. Theoretical and Applied Genetics. 1998, 97: 1110-1119. 10.1007/s001220050998.
Rae A, Pinel M, Bastien C, Sabatti M, Street N, Tucker J, Dixon C, Marron N, Dillen S, Taylor G: QTL for yield in bioenergy Populus: identifying GxE interactions from growth at three contrasting sites. Tree Genetics & Genomes. 2008, 4: 1614-2950.
Wu R, Stettler RF: Quantitative genetics of growth and development in Populus. I. A three-generation comparison of tree architecture during the first 2 years of growth. Theoretical and Applied Genetics. 1994, 89: 1046-1054.
Bradshaw HD, Villar M, Watson BD, Otto KG, Stewart S, Stettler RF: Molecular-Genetics of Growth and Development in Populus .3. A Genetic-Linkage Map of a Hybrid Poplar Composed of Rflp, Sts, and Rapd Markers. Theoretical and Applied Genetics. 1994, 89: 167-178.
Wu R, Stettler RF: The genetic resolution of juvenile canopy structure and function in a three-generation pedigree of Populus. Trees – Structure and Function. 1996, 11: 99-108.
Robinson KM, Karp A, Taylor G: Defining leaf traits linked to yield in short-rotation coppice Salix. Biomass and Bioenergy. 2004, 26: 417-431. 10.1016/j.biombioe.2003.08.012.
Dillen SY, Storme V, Marron N, Bastien C, Neyrinck S, Steenackers M, Ceulemans R, Boerjan W: Genomic regions involved in productivity of two interspecific poplar families in Europe. 1. Stem height, circumference and volume. Tree Genetics & Genomes. 2008.
Ingvarsson PK, Garcia V, Luquez V, Hall D, Jansson S: Nucleotide polymorphism and phenotypic associations within and around the phytochrome B2 locus in European aspen (Populus tremula, Salicaceae). Genetics. 2008, 178: 2217-2226. 10.1534/genetics.107.082354.
Street N, Skogstrom O, Sjodin A, Tucker J, Acosta M, Nilsson P, Jansson S, Taylor G: The genetics and genomics of the drought response in Populus. Plant Journal. 2006, 48: 321-341. 10.1111/j.1365-313X.2006.02864.x.
Bradshaw HD, Stettler RF: Molecular-Genetics of Growth and Development in Populus .1. Triploidy in Hybrid Poplars. Theoretical and Applied Genetics. 1993, 86: 301-307. 10.1007/BF00222092.
Papadakis JS: Advances in the analysis of field experiments. Proceedings of the Academy of Athens. 1984, 59: 326-342.
Gentleman RC, Carey VJ, Bates DM, Bolstad B, Dettling M, Dudoit S, Ellis B, Gautier L, Ge Y, Gentry J, Hornik K, Hothorn T, Huber W, Iacus S, Irizarry R, Leisch F, Li C, Maechler M, Rossini AJ, Sawitzki G, Smith C, Smyth G, Tierney L, Yang JYH, Zhang J: Bioconductor: open software development for computational biology and bioinformatics. Genome Biology. 2004, 5: R80-10.1186/gb-2004-5-10-r80.
Henderson C: Estimation of variance and covariance components. Biometrics. 1953, 9: 226-252. 10.2307/3001853.
Seaton G, Haley CS, Knott SA, Kearsey M, Visscher PM: QTL Express: mapping quantitative trait loci in of simple and complex pedigrees. Bioinformatics. 2002, 18: 339-340. 10.1093/bioinformatics/18.2.339.
Churchill GA, Doerge RW: Empirical Threshold Values for Quantitative Triat Mapping. Genetics. 1994, 138: 963-971.
IPGC SSR Resource. [http://www.ornl.gov/sci/ipgc/ssrresource.htm].
Kliebenstein D, West M, van Leeuwen H, Loudet O, Doerge RW, St Clair DA: Identification of QTLs controlling gene expression networks defined a priori. BMC Bioinformatics. 2006, 7: 308-10.1186/1471-2105-7-308.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, Amin N, Schwikowski B, Ideker T: Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Research. 2003, 13: 2498-2504. 10.1101/gr.1239303.
This research was supported by grants to G. Taylor from the United Kingdom Department for Environment, Food and Rural Affairs (NF0410, NF0424), a Biotechnology and Biological Sciences Research Council Co-operative Awards in Science and Engineering (BBSRC CASE) studentship to K.M. Robinson (99/B2/P/05446), a National Environment Research Council studentship to N.R. Street (NER/S/A/2001/06361), and by the European Commission through the Directorate General Research within the Fifth Framework for Research Quality of Life and Management of the Living Resources Programme, contract QLK5-CT-2002-00953 (POPYOMICS), coordinated by the University of Southampton. We thank Gerald Tuskan and colleagues for use of the Family 331 genetic map used for QTL mapping.
AMR performed and interpreted the QTL mapping, calculated the genetic correlations and helped draft the manuscript. NRS interpreted QTL mapping results, performed the correlation analysis, identified genes within QTL regions and drafted the manuscript. KMR designed the field trial, collected the field data and contributed to the interpretation of phenotypic trait data. NH collected the field data for the final biomass harvest. GT supervised the project.
Anne M Rae, Nathaniel Robert Street contributed equally to this work.
About this article
Cite this article
Rae, A.M., Street, N.R., Robinson, K.M. et al. Five QTL hotspots for yield in short rotation coppice bioenergy poplar: The Poplar Biomass Loci. BMC Plant Biol 9, 23 (2009). https://doi.org/10.1186/1471-2229-9-23
- Biomass Yield
- Short Rotation Coppice
- Stem Number
- Sylleptic Branch
- Linkage Group VIIIa