Seedling development traits in Brassica napus examined by gene expression analysis and association mapping

Körber, Niklas; Bus, Anja; Li, Jinquan; Higgins, Janet; Bancroft, Ian; Higgins, Erin Eileen; Papworth Parkin, Isobel Alison; Salazar-Colqui, Bertha; Snowdon, Rod John; Stich, Benjamin

doi:10.1186/s12870-015-0496-3

Research Article
Open access
Published: 09 June 2015

Seedling development traits in Brassica napus examined by gene expression analysis and association mapping

Niklas Körber^1,2,
Anja Bus^1,2,
Jinquan Li¹,
Janet Higgins³,
Ian Bancroft^4,5,
Erin Eileen Higgins⁶,
Isobel Alison Papworth Parkin⁶,
Bertha Salazar-Colqui⁷,
Rod John Snowdon⁷ &
…
Benjamin Stich¹

BMC Plant Biology volume 15, Article number: 136 (2015) Cite this article

3677 Accesses
25 Citations
1 Altmetric
Metrics details

Abstract

Background

An optimal seedling development of Brassica napus plants leads to a higher yield stability even under suboptimal growing conditions and has therefore a high importance for plant breeders. The objectives of our study were to (i) examine the expression levels of candidate genes in seedling leaves of B. napus and correlate these with seedling development as well as (ii) detect genome regions associated with gene expression levels and seedling development traits in B. napus by genome-wide association mapping.

Results

The expression levels of the 15 candidate genes examined in the 509 B. napus inbreds showed an averaged standard deviation of 5.6 across all inbreds and ranged from 3.2 to 8.8. The gene expression differences between the 509 B. napus inbreds were more than adequate for the correlation with phenotypic variation of seedling development. The average of the absolute value correlations of the correlation coefficients of 0.11 were observed with a range from 0.00 to 0.39. The candidate genes GER1, AILP1, PECT, and FBP were strongly correlated with the seedling development traits. In a genome-wide association study, we detected a total of 63 associations between single nucleotide polymorphisms (SNPs) and the seedling development traits and 31 SNP-gene associations for the candidate genes with a P-value < 0.0001. For the projected leaf area traits we identified five different association hot spots on the chromosomes A2, A7, C3, C6, and C7.

Conclusion

A total of 99.4% of the adjacent SNPs on the A genome and 93.0% of the adjacent SNPs on the C genome had a distance smaller than the average range of linkage disequilibrium. Therefore, this genome-wide association study is expected to result on average in 14.7% of the possible power. Compared to previous studies in B. napus, the SNP marker density of our study is expected to provide a higher power to detect SNP-trait/-gene associations in the B. napus diversity set. The large number of associations detected for the examined 14 seedling development traits indicated that these are genetically complex inherited. The results of our analyses suggested that the studied genes ribulose 1,5-bisphosphate carboxylase/oxygenase small subunit (RBC) on the chromosomes A4 and C4 and fructose-1,6-bisphosphatase precursor (FBP) on the chromosomes A9 and C8 are cis-regulated.

Background

Well-developed seedlings lead to a higher yield stability even under suboptimal growing conditions like reduced nutrient input or drought stress [1]. Therefore, variation during early developmental stages of Brassica napus plants is important for selection decisions of plant breeders. Up to now, however, the genetics of seedling development of B. napus had been poorly understood.

In comparison to linkage mapping, association mapping studies could achieve a higher mapping resolution due to the fact that in a diversity set linkage disequilibrium (LD) decays faster than in segregating populations used for linkage mapping [2]. Furthermore, association mapping studies benefit from the broader array of genetic diversity represented compared to linkage mapping studies [3,4]. Hasan et al. [5] identified in an association mapping study in B. napus simple sequence repeat (SSR) markers which were physically linked to candidate genes for glucosinolate biosynthesis in Arabidopsis thaliana, to be associated with variation of the seed glucosinolate content in B. napus. For traits, for which less preinformation is available, a high number of markers would be necessary to detect phenotype-marker associations on a genome-wide level. The number of SSR markers available in the B. napus genome is expected to be too low for this purpose [6]. Furthermore, the genotyping of such a high number of markers is very expensive. To overcome this problem, Honsdorf et al. [7] tested the association between 684 genome-wide distributed amplified fragment-length polymorphism (AFLP) markers and 14 traits in a set of 84 canola quality winter rapeseed cultivars. They identified between one and 22 putative quantitative trait loci (QTL) which explained between 15 and 53% of the phenotypic variance for ten of the 14 traits. The results of LD analyses suggested, however, that more than 2,000 evenly distributed markers will be required for detecting marker-phenotype associations with a reasonable power in rapeseed [2]. However, it is difficult to obtain a higher number of markers with the AFLP technique in rapeseed [7]. Furthermore, due to the fact that the sequence information of AFLPs can not be easily inferred, their use in marker-assisted selection programs is difficult. Hence, single nucleotide polymorphisms (SNPs) would be the most suitable marker type to cover a complex genome like that of B. napus in the required density for genome-wide association studies (GWAS). Therefore, a custom SNP array was used in this study to genotype the entire diversity set.

Differential expression of genes during seedling development stage has the potential to be an important reason for phenotypic variation [8,9]. In our study, genes were selected based on a co-expression network analysis. The gene expression of these genes as well as candidate genes from the literature was examined in the entire diversity set and correlated with the phenotypic observations.

The objectives of our study were to (i) examine the expression levels of candidate genes in seedling leaves of B.napus and correlate these with seedling development as well as (ii) identify genome regions associated with different gene expression levels and seedling development traits in B. napus.

Methods

Plant material and assessment of seedling development traits

A set of 509 rapeseed inbred lines (doi:10.1007/s00122-012-1912-9), assembled to maximize genotypic variation, was used in this study [2,10]. In short, according to available information from genebanks, plant breeders, and our own observations, the accessions were assigned to eight different germplasm types, namely winter oilseed rape (OSR) (183), winter fodder (22), swede (73), semi-winter OSR (7), spring OSR (204), spring fodder (4), vegetable (10), and so far unspecified rapeseed genotypes (6).

The multiplication of the genotypes was done in a way such that maternal environmental effects were minimized. The genotypes were grown in six replicates, for 30 days in an α-lattice design with 24 blocks of 24 pots in a greenhouse experiment. As described in detail earlier [10], a large number of seedling development traits were assessed to cover a wide range of aspects as well as developmental stages during seedling growth which could be measured with high throughput methods (Table 1).

Table 1 Seedling development traits assessed in the rapeseed diversity set, where h ² is the repeatability and R ² the proportion of the phenotypic variance explained by population structure

Full size table

Plant material for weighted gene co-expression network analysis

The doubled haploid (DH) winter oilseed rape mapping population ExV8-DH which segregates for multiple seed quality, developmental and performance traits was the basis for the weighted gene co-expression network analysis (WGCNA). Pooled seedling developmental traits from 250 lines of the ExV8-DH population, described previously by Basunanda et al. [11], were measured in replicated greenhouse trials in 2007, and field trials at four locations from 2005-2007 were used to select two groups of 47 ExV8-DH lines with the highest and lowest respective mean performance for developmental and yield-related traits.

Digital gene expression analysis

For digital gene expression analysis, the 94 pre-selected DH lines, the two parents Express 617 and V8, and their F₁ (Express 617 x V8), were germinated in Jacobsen vessels under controlled conditions in a climate chamber at 20°C for 16 h (day) and 15°C for 8 h (night) with 55% relative humidity. Two experimental replications were performed. At two time points (eight and twelve days after sowing) 100 seedlings from each line were harvested for ribonucleic acid (RNA) extraction within one hour to prevent circadian clock effects during transcriptome analysis. All samples were immediately shock-frozen in liquid nitrogen and stored at -80°C until RNA extraction. Extraction of messenger RNA (mRNA) and digital gene expression sequencing (DGE-seq) was conducted on all as described by Obermeier et al. [12]. WGCNA was performed to identify gene networks correlated to developmental and yield-related traits. Within trait-correlated network modules, hub genes showing the highest interconnectivity to other genes in the module were selected as potential regulatory candidates for reverse transcription quantitative polymerase chain reaction (RT-qPCR) in thediversity set.

RNA extraction, cDNA synthesis, and RT-qPCR

A total of 100 ng of the leaf apex of the second leaf of each of the 509 genotypes of each of the six replicates was collected after 30 days of growing in the greenhouse trial as explained in detail by Körber et al. [10]. After harvest, the sample was directly frozen in liquid nitrogen. The leaf samples were ground to a fine powder in liquid nitrogen. Total RNA was isolated from the fine powder using Trizol reagent following the manufacturer’s protocol (Invitrogen, Karlsruhe, Germany). The total RNA was treated with RNase-free DNase I (Fermentas) (final volume 100 μl) to remove genomic deoxyribonucleic acid (DNA) contamination. RNA concentration was determined using the NanoDrop ND-1000 spectrophotometer (Thermo Fisher Scientific Inc., Waltham, MA, USA). All samples were diluted to an RNA concentration of 100 ng/ μl and the samples from the six replicates of each inbred were pooled to equal amounts in order to reduce error variance. First-strand complementary DNA (cDNA) was synthesized from 15 μl of total RNA using Maxima First Strand cDNA Synthesis Kit for RT-qPCR (Invitrogen, Karlsruhe, Germany) following the manufacturer’s recommendations. The resulting cDNA was diluted to 25 ng/ μl. Gene-specific primers (10 pmol/ μl) for 15 candidate genes as well as the control gene Actin (Table 2) were used for the RT-qPCRs performed on the cDNA samples. Amplifications were performed using 5 μl of cDNA, 7 μl of DyNAmo ColorFlash SYBR Green (Biozym), and 1.5 μl of each primer. To minimize pipetting inaccuracy, the pipetting of the cDNA was done using the pipetting robot Biomek FX (Biomek). The following amplification conditions were used for the RT-qPCR on a LightCycler480 (Roche): Preincubation with 95°C for 3 min and amplification with 45 (APL = 55) cycles of 95°C (10 sec), and 60°C (1 min). At the end of each run, a dissociation analysis was performed to confirm the specificity of the reaction. In each 384-well plate used for RT-qPCR reaction, non-template controls and cDNA of the two trial standards were included. The RT-qPCR products of each of the 15 genes (eight from WGCNA (see below) and seven from literature) for five inbreds of the diversity set were Sanger sequenced at the Max Planck Genome Center Cologne to confirm the specific amplifications.

Table 2 Details of 15 genes and the housekeeping gene Actin which were studied with qRT-PCR in seedling leaves harvested from the greenhouse trial in the 509 B. napus inbreds

Full size table

Genotyping of SNP markers

For the GWAS, the 509 B. napus inbred lines were assayed at Agriculture and Agri-Food Canada using a customized Brassica napus 6K Illumina Infinium SNP array (http://aafc-aac.usask.ca/ASSYST/). This array was designed from next generation sequence (NGS) data from Illumina short read (100 bp paired-end) genomic sequence data from seven B. napus cultivars and three B. rapa cultivars, from 3’ captured cDNA Roche 454 sequence data from seven B. napus cultivars and four B. oleracea cultivars as well as Illumina short read (80 bp single-end) RNA-Seq data from 42 B. napus cultivars [13]. It contained 5,506 successful bead types representing the same number of potential SNPs. Samples were prepared and assayed as per the Infinium HD Assay Ultra Protocol (Infinium HD Ultra User Guide 11328087_RevB, Illumina, Inc. San Diego, CA). The Brassica 6K BeadChips were imaged using an Illumina HiScan system, and the SNP alleles were called using the Genotyping Module v1.9.4, within the GenomeStudio software suite v2011.1 (Illumina, Inc. San Diego, CA). SNP data were available for 505 inbreds of the diversity set and only SNPs with a percentage of missing data < 30% across all genotypes and a minor allele frequency > 0.05 as well as genotypes with a percentage of missing data < 20% across all SNPs were used for the following statistical analysis. From these 3,910 SNPs, 3,828 could be assigned to a physical map position derived from the reference information of B. rapa [14] and B. oleracea [15].

Statistical analyses

Weighted gene co-expression network analysis

WGCNA was performed using the WGCNA R package as described by Langfelder and Horvath [16]. Normalized tagcounts (per ten million reads) were obtained for 154,790 probes (86,908 probes mapping to B. rapa and 67,882 probes to B. oleracea reference unigene sequences) using Illumina sequencing of 3’EST digital gene expression tags. Probes were kept if they had a normalized tagcount of at least five in six or more samples. Replicate probes for each unigene were averaged and the 91,048 unigenes present in both datasets were used for the WGCNA consensus analysis. A total of 108 modules were obtained using the automatic network construction function “blockwiseConsensusModules” with the following settings; power = 5, minModuleSize = 50, deepSplit = 2, maxBlockSize = 35000, reassignThreshold = 0, mergeCutHeight = 0.25, minKMEtoJoin = 1, minKMEtoStay = 0. Using the WGCNA function “chooseTopHubInEachModule”, the top hub unigenes were identified from 15 modules which were highly conserved between the two datasets and eight of these top hub unigenes could be amplified as functional candidate genes by RT-qPCR in the 509 rapeseed inbred lines.

The network of unigenes with an edge weight of ≥ 0.1 was visualized in Cytoscape [17] and the function of the modules position was determined using Gene Ontology Singular Enrichment Analysis (p < 0.001) [18].

Normalization and differences of gene expression data

The C _p-value for which the fluorescence rose above the background fluorescence was calculated for each inbred-gene combination using the LightCycler 480 Software (Roche; version 1.5). The C _p-value, which was designated in the following as gene expression level of the different genes, was normalized to the percentage of the expression level of the housekeeping gene Actin for the corresponding inbred.

Associations among inbreds and genes were revealed by a heatmap analysis and grouped with the complete linkage clustering method.

Genome positions of the candidate genes

A basic local alignment search tool (BLAST) search [19] was performed between the reference sequences of the candidate genes and the reference sequences of B. rapa (v1.2) [14] and B. oleracea (v1) [15]. All positions were used which had a BLAST identity ≥ 85%.

Calculation of adjusted entry means

The adjusted entry mean M of each genotype-trait/-gene combination, which was the basis for all further analyses, were calculated for the seedling development traits and the gene expression data using different mixed-models. For the former, these were calculated as described in detail by Körber et al. [10]. The calculations for the gene expression data were based on the following model:

$$y_{ij} = {\mu + g_{i} + t_{j} + e_{ij},} $$

where y _ij was the observation of the ith genotype of the jth technical replication, μ an intercept term, g _i the genotypic effect of the ith genotype, t _j the effect of the jth technical replicate, and e _ij the residual. For calculating the adjusted entry means, g _i was regarded as fixed and all other effects as random.

Principal component analysis and the assessment of linkage disequilibrium

The 509 rapeseed inbreds of our study were assigned to three clusters (MCLUST) using a principal component analysis (PCA) of 89 SSR markers as described by Buset al. [2].

In order to determine the physical map distance in which LD decays in our B. napus diversity set, r ² (the square of the correlation of the allele frequencies between all pairs of linked SNP loci) was calculated, where linked loci were defined as loci located on the same chromosome, and plotted against the physical distance in megabase pairs. The overall decay of LD was evaluated by nonlinear regression of r ² according to Hill and Weir [20]. The percentage of linked loci in significant LD was determined with the significance threshold of the 95% quantile of the r ² value among unlinked loci pairs, where unlinked loci were defined as loci located on different chromosomes. Pairwise modified Roger’s distance (MRD) estimates between all inbreds and the MCLUST groups 1-3 were calculated according to Wright [21].

Genome-wide association analyses

The genome-wide association analyses of the seedling development traits and the gene expression data were performed as an single marker analysis using the PK method [22]:

$$M_{lm} = \mu + a_{m} + \sum_{u=1}^{z} P_{lu} v_{u} + g_{l}^{*} + e_{lm}, $$

where M _lm was the adjusted entry mean of the lth inbred carrying allele m, a _m the effect of the mth allele, v _u the effect of the uth column of the population structure matrix P, $g_{l}^{*}$ the residual genetic effect of the lth entry, and e _lm the residual. The first and second principal component calculated based on the 89 SSR markers [2] were used as P matrix. The variance of the random effect $g^{*} = \left \{g^{*}_{1},...,g^{*}_{509}\right \}$ was assumed to be Var($g^{*}) = 2K\sigma _{g^{*}}^{2}$, where $\sigma _{g^{*}}^{2}$ was the residual genetic variance. The kinship coefficient K _ij between inbreds i and j were calculated based on the above mentioned SSR markers according to:

$$ K_{ij} = \frac{S_{ij-1}}{1 + T} + 1, $$

where S _ij was the proportion of marker loci with shared variants between inbreds i and j and T the average probability that a variant from one parent of inbred i and a variant from one parent of inbred j are alike in state, given that they are not identical by descent [23]. The optimum T value was calculated according to Stich et al. [22] for each trait. To perform the above outlined association analysis, the R package EMMA [24] was used. We chose the significance threshold of P-value =0.0001 and the threshold after Bonferroni correction (P-value =0.05). The association analysis was performed for all inbreds and for each of the three MCLUST groups. For the separate association analyses of the three MCLUST groups, only the kinship matrix K but no P matrix was considered. SNPs which are associated for multiple traits are defined as hot spots for these traits.

If not stated differently, all analyses were performed with the statistical software R [25].

Results

Linkage disequilibrium and allele frequency

The nonlinear regression trend line of the LD measure r ² vs. the physical distance intersected the Q₉₅ of r ² among unlinked loci pairs (0.145) at 676,992 bp (Figure 1). The allele frequencies of the 3,828 SNPs of all 509 inbreds ranged from 0.05 to 0.95.

Gene expression data

The expression levels of the 15 candidate genes examined in the 509 B. napus inbreds showed an averaged standard deviation (SD) of 5.6 across all inbreds and ranged from 3.2 to 8.8. The average MRD (±standard error) of the MCLUST groups 1 to 3 vs. the other two MCLUST groups were 0.32 (±0.01), 0.34 (±0.01), and 0.28 (±0.01), respectively.

The consensus WGCNA for the two datasets allocated 83,262 unigenes into 108 modules, where 7,776 unigenes were unassigned. Each module comprised between 53 and 10,285 unigenes. The candidate genes were selected as the top hub genes from 15 modules which were highly conserved between the two datasets, and for eight of them amplification via qRT-PCR was successful (Figure 2). Seven further candidate genes were selected from main metabolic pathways.

Across the examined 15 candidate genes, the gene APL was expressed on average lowest relative to Actin, whereas the gene RBC was expressed highest (Figure 3). The genes APL, UBP15, PECT, GRF1, and SPS were assigned to a cluster of genes which had a lower expression compared to Actin, whereas all the other genes clustered to a group of highly expressed genes. Furthermore, based on the expression levels of the 15 genes, the 509 inbreds were clustered in five different subgroups comprising different germplasm types.

The expression levels of the analysed genes differed between the eight germplasm subsets and the three MCLUST groups. Across all 509 inbreds, the expression levels of the genes FBP, SPS, RBC, PK, UBP15, PECT, APL, AILP1, GER1, NOI, GRF1, and GF14 were significantly higher (P-value = 0.05) in the mainly modern winter OSR and spring OSR germplasm types compared to the remaining subsets. In contrast, the expression levels of the genes CEL16 and MyAP showed the opposite trend (Figure 4a-c and Additional file 6: Figure S1a-c - 14a-c). The genes SPS, UBP15, PECT, AILP1, MyAP, GRF1, VPS2, and GF14 were significantly (α = 0.05) higher expressed in the inbreds of the MCLUST group 1 than in the inbreds of the MCLUST groups 2 and 3. On the other hand, the genes FBP, RBC, APL, GER1, and NOI were significantly higher expressed in the inbreds of the MCLUST group 2 and the genes CEL16 and MyAP for the inbreds of the MCLUST group 3 (Figure 4a-c and Additional file 6: Figure S1a-c - 14a-c).

The absolute value of the correlation coefficient between the expression of the 15 candidate genes with the 20 seedling development traits for all 509 inbreds was on average 0.11 with a range from 0.00 to 0.39 (MCLUST 1-3: 0.09, 0.13, and 0.10). The candidate genes GER1 and FBP were mostly negatively correlated with the seedling development traits with a correlation coefficient down to -0.39. In contrast, the candidate genes AILP1 and PECT were mostly positively correlated with the seedling development traits with a correlation coefficient up to 0.26 (Additional file 8: Figure S35–38).

Genome-wide association mapping

In the GWAS with 3,910 SNPs for all 509 B. napus inbreds, we observed a total of 63 SNP-trait associations with a P-value < 0.0001 for 14 of the 20 seedling development traits. A total of 20.6% of these SNP-trait associations were detected for the A genome and more than half of them were located on the chromosomes A10 and A3. In contrast, 76.2% of the associations were detected for the C genome and most of them were located on the chromosomes C7 and C2. In addition, two SNP-trait associations could not be mapped to the genome of B. napus (Table 3). The 63 associations explained individually from 3.0 to 4.9% of the phenotypic variance. Furthermore, between one and 21 SNP-trait associations were associated with the same trait and these explained in simultaneous fits between 3.3 and 20.3% of the phenotypic variance (Table 3).

Table 3 Single nucleotide polymorphism (SNP)-trait/-gene associations with P < 0.0001 across all inbreds

Full size table

For the association analysis of the gene expression levels, we observed across all 509 B. napus inbreds 31 SNP-gene associations for 13 of the 15 examined genes with a P-value < 0.0001. A total of 35.5% of these SNP-gene associations were located on the A genome, whereas no clustering across the chromosomes was observed. In contrast, 64.5% were identified for the C genome and 40% of them were located on the chromosomes C2 and C8 (Figure 5 and Table 3). We identified between one and six SNPs to be associated with the gene expression variation of the individual genes. The identified SNPs explained individually from 3.0 to 13.5% of the phenotypic variance. Furthermore, between two and seven SNP-gene associations were associated with the same gene and these explained in simultaneous fits between 3.6 to 13.7% of the phenotypic variance (Table 3).

Across all 509 inbreds, the SNP-FBP association of the gene expression levels was identical with the SNP-ASR association of the seedling development traits on chromosome C7. The SNP-gene associations of MyAP, PK, and SPS and the SNP-trait association of SPD on chromosome C2 as well as the SNP-gene association of PK and the SNP-trait association of H2O on chromosome A3 were also identical for all 509 inbreds. Furthermore, for the MCLUST group 2 the SNP-gene association of PECT on chromosome C6 corresponded with the associations of the projected leaf area hot spot of the seedling development traits.

Correspondence of associations across subgroups

In the P-value profile from the genome-wide association mapping, several SNP-RBC associations with a P-value < 0.0001 were detected on chromosome A4 for all 509 inbreds and the inbreds of the MCLUST groups 1-2 (Figure 4d-f) as well as on chromosome C4 for all 509 inbreds and the inbreds of the MCLUST group 2 (Figure 4d and f). Furthermore, mentionable SNP-RBC associations with a P-value < 0.0001 were observed on chromosome C6 for the inbreds of the MCLUST group 1 and on chromosome C2 for the inbreds of the MCLUST group 3 (Figure 4e and g).

The SNP-RBC associations detected on chromosome A4 and C4 for all 509 B. napus inbreds and the inbreds of the MCLUST group 2 were in accordance with their physical map position. Furthermore, for the inbreds of the MCLUST group 1 the SNP-RBC association on chromosome A4 was also in accordance with its physical map position, but not on chromosome C4, where the distance in between was ~1.3 Mb. In addition, the SNP-FBP associations were in accordance with their mapped genome positions on chromosome A9 and C8 for all inbreds (Figures 5, 6 and 7).

Discussion

Linkage disequilibrium and SNP density

The nonlinear trend line of LD measure r ² decayed below the significance threshold, the 95% quantile of the r ² value among unlinked loci pairs, within a distance of 677 kb (Figure 1). Bus et al. [2] estimated based on 89 SSR markers that the pairwise LD decayed within a genetic map distance of approximately 1 cM. This corresponds to about 500 kb [26] and is in good accordance to the value observed in our study. The LD observed by Ecke et al. [27] decayed within 2 cM less fast. The reason for this observation could be that the population studied by Ecke et al. [27] was less diverse than the B. napus diversity set examined in the current study.

In our study, 1,755 SNPs mapped to the A genome, whereas 2,073 SNPs mapped to the C genome. Furthermore, 99.4% of the adjacent SNPs on the A genome and 93.0% of the adjacent SNPs on the C genome had a distance smaller than the average range of LD (677 kb). Therefore, this GWAS is expected to result on average in 14.7% of the possible power (Figure 1). Compared to previous studies in B. napus, the SNP marker density of our study is expected to provide a higher power to detect SNP-trait/-gene associations in the B. napus diversity set.

Genome-wide association mapping of seedling development traits

Seedling development traits are important targets for breeding because an optimal seedling development leads to a higher yield stability even under suboptimal growing conditions [1]. Up to now, however, little is known about the genetic mechanisms as well as the natural variation of seedling development in B. napus. Thus, we used an association mapping approach to elucidate the genetics of seedling development in B. napus.

We observed a total of 63 associations between SNPs and 14 of the 20 seedling development traits with a P-value < 0.0001 (Figure 5 and Additional file 7: Figure S15d-g - 34d-g). Furthermore, for the 14 seedling development traits we found between one and 21 SNP-trait associations for a single trait which explained in a simultaneous fit, on average, 8.5% of the phenotypic variance with a range from 3.3 to 20.3% (Table 3). The large number of associations for these 14 seedling development traits suggests that these are genetically complex inherited. In contrast to the seedling development traits examined in our study, Honsdorf et al. [7] carried out an association analysis of phenological, morphological, and quality traits in 84 canola quality winter rapeseed (Brassica napus) and identified 86 putative QTLs for ten of 14 traits which explained, on average, 36.2% of the phenotypic variance. These differences in the explained phenotypic variance could be due to the fact that Honsdorf et al. [7] analysed agronomic and seed quality traits instead of seedling development traits and that a lower number of genotypes were examined compared to our study. The latter leads to an overestimation of marker effects. This overestimation, however, decreases with a higher number of genotypes in a GWAS [28]. Thus, in a GWAS with 509 inbreds this overestimation is expected to be of minor importance.

For the seedling development traits projected leaf area LA08, LA10, LA12, LA14, and LA16, we identified five different hot spots (defined as associated SNPs for multiple traits) on the chromosomes A2, A7, C3, C6, and C7 for all inbreds and/or the MCLUST groups 1 to 3 (Figures 5, 6, 7 and 8). They explained in a simultaneous fit between 4.3 and 39.9% of the phenotypic variance (Table 3 and Additional file 1: Table S1, Additional file 2: Table S2, Additional file 3: Table S3). Basunanda et al. [11] found in sets of B. napus backcrossed test hybrids a QTL for leaf area of 28 days old seedlings in the middle of chromosome A5 at 53.5 cM which explained 3.0% of the phenotypic variance. Furthermore, Edwards and Weinig [29] measured the leaf area of one young, fully expanded leaf at bolting of 150 B. rapa recombinant inbred lines (RILs) across simulated seasonal settings and detected at cool temperature and short photoperiod conditions a QTL in the middle of chromosome A6 at 58.63 cM which explained 7.8% of the phenotypic variance. These discrepancies in the different studies can be explained by dissimilarities in the power to detect QTLs as well as genotype x environment and QTL x environment interactions by examining different genetic material. All three factors have the potential to lead to different QTLs in different studies [30].

The leaf area association hot spots for MCLUST group 2 and 3 on chromosome C6 were separated by ∼ 750 kb. As the average range of LD in the examined diversity set was with 677 kb close to the separation on the physical map of ∼ 750 kb, no differentiation between linkage or pleiotropy was possible in our study.

We identified association hot spots for leaf area on the bottom of chromosome A7 (MCLUST 2) and on the top of chromosome C6 (MCLUST 2 and 3) (Figures 6, 7, 8 and Additional file 7: Figure S15d-g - 19d-g). Furthermore, BLAST searches revealed that the two candidate genes GER1 and GF14 are located up- and downstream of these two hot spots, respectively. Thus, these two genome regions with its flanking candidate genes might be homologous regions. In contrast, the candidate gene sequence of MyAP was mapped by a BLAST search on the top of chromosome A6 and on the bottom of chromosome C6 (Figures 5, 6, 7 and 8). Lydiate et al. [31] and Parkin et al. [32] identified with 399 restriction fragment length polymorphism (RFLP) markers homologous genome regions between the top of chromosome A6 and the top of chromosome C6 in reverse direction as well as between the bottom of chromosome A7 and the bottom of chromosome C6. However, our result implies that these homologous genome regions might be interchanged and that the genome region on the top of chromosome A6 is homologous to the genome region on the bottom of chromosome C6 and that the genome region on the bottom of chromosome A7 is homologous to the genome region on the top of chromosome C6. This finding is in good accordance to the results of Parkin et al. [33] who analysed genome duplications within the B. napus genome with 455 RFLP markers and reported translocated regions which are inverted between A6 and C7 and between A6 and C6.

The MRD between the MCLUST groups 1 to 3 versus the other two MCLUST groups were, on average, 0.32, 0.34, and 0.28, respectively. Furthermore, the phenotypic variation of the examined seedling development traits which was explained by population structure, was on average 30.9% (Table 1). For some traits, the correspondence of the detected associations was low between the three subgroups. The reason for this observation can be a different genetic architecture in the three subgroups. Furthermore, different allele frequencies at the corresponding SNPs in the three different groups and the resulting differences in power to detect the associations can be the reason. It is impossible to decide based on association mapping results on one of the two reasons. This would require further analyses examining a set of bi-parental populations. Nevertheless, in this study we present in addition to the results of the subgroups also the results across all 509 inbreds to benefit from the higher power to detect SNP-trait/-gene associations.

Variation of gene expression in seedling leaves

In the framework of our study, it was not possible to perform a genome-wide gene expression study with the available budget. Therefore, we selected seven candidate genes from main metabolic pathways to examine their correlation with seedling development traits. Furthermore, the top hub genes from a WGCNA were studied because of their potential role as high level regulators (Figure 2).

We observed a high expression of RBC in the seedling leaves of all 509 B. napus inbreds (Figure 3). This could be explained by the fact that RuBisCO (RBC) is the most abundant protein in plants [34]. The low expression levels of APL which is the predominant large subunit isoform in leaves [35] (Figure 3) were due to the fact that APL (E.C. 2.7.7.27) is involved in starch synthesis [36], and starch in exporting leaves represents only a transient store [37].

The expression levels of the 15 candidate genes examined in the 509 B. napus inbreds showed an averaged standard deviation (SD) of 5.6 across all inbreds and ranged from 3.2 to 8.8. For 16 A. thaliana samples, the SDs for more than 24,000 genes mostly varied between 0.5 and 5 [38]. The considerably higher SD in our study compared to that of Hruz et al. [38] might be due to the fact that our examined candidate genes were selected based on the expected different expression levels in B. napus seedlings. Our observation suggested that the measured gene expression differences between the 509 B. napus inbreds were more than adequate for the correlation with phenotypic variation of seedling development.

The candidate genes GER1, AILP1, PECT, and FBP had the highest correlations with the seedling development traits. GER1 might play a role in plant defense, AILP1 is involved in the response to the stimulus of auxin and aluminum ion, PECT is part of the phospholipid synthesis, and FBP is an enzyme that converts fructose-1,6-bisphosphate to fructose 6-phosphate in gluconeogenesis and the Calvin cycle and many other metabolic pathways. Thus, these genes have an essential effect on the development of rapeseed seedlings and could have great potential for breeding rapeseed varieties with improved seedling development. Therefore, not only markers associated with seedling development traits could be used for marker-assisted selection in B. napus to improve seedling development but also the expression of genes correlated with seedling development.

Genome-wide associations mapping of gene expression correlated with seedling development

We mapped the gene expression levels of the 15 candidate genes in our diversity set to identify genome regions contributing to their regulation. These regions could comprise genes or specific regulators of genes influencing seedling development and might be useful for marker-assisted selection in B. napus to improve the seedling development.

We found across all 509 B. napus inbreds 31 SNP-gene associations for 13 of the 15 candidate genes with a P-value < 0.0001. These SNPs associated with the expression of the candidate genes explained in a simultaneous fit on average 6.9% of the phenotypic variance for a single gene (Table 3). This is in accordance with the findings of the seedling development traits which explained in a simultaneous fit, on average, 8.5% of the phenotypic variance for a single trait (Table 3). From this it follows that the expression levels of the candidate genes and the seedling development traits have a similar genetic complexity.

The SNP-gene association of PECT on chromosome C6 for the MCLUST group 2 is identical with the association of the projected leaf area hot spot of the seedling development traits. Mizoi et al. [39] observed that pect1-4/pect1-6 F1 Arabidopsis mutants displayed severe dwarfism. PECT is involved as the rate-limiting step in the Kennedy pathway (phospholipid synthesis) [40] and plays a major role in the structure and function of membranes [41]. Thus, the SNP-PECT association on chromosome C6 may caused the differences in leaf area growth of the examined B. napus seedlings.

The genome positions of the SNP-gene associations of RBC on the chromosomes A4 and C4 and FBP on the chromosomes A9 and C8 were in accordance with their genome position (Figures 5, 6, 7 and 8, red colored SNP-gene associations). According to Chen et al. [42] these associations were defined as cis-regulated, because the SNP-gene associations were within 677 kb upstream or downstream of this gene position mapped by a BLAST search. In the neighborhood of the SNP-gene association of the candidate gene RBC on chromosome A4 at 6,466,112 bp the B. napus genes Bra028181, Bra028174, and Bra028175 were located. Their best BLASTX hits to A. thaliana are the genes AT5G38430 and AT5G38420 which are encoding the RuBisCO small subunit 2B or 1B of A. thaliana, respectively. Furthermore, the SNP-gene association of FBP on chromosome A9 at 27,053,307 bp is nearby the B. napus gene Bra007041 of which the best hit by BLASTX to A. thaliana is the gene AT3G54050 encoding for fructose 1,6-bisphosphate phosphatase. All the other SNP-gene associations were outside this range and therefore defined as trans-regulated (Figures 5, 6, 7 and 8). Thus, these trans-regulatory SNP-gene associations most likely encode transcriptional regulators which requires further research.

Conclusions

In this paper we conducted the largest genome-wide association study on seedling development traits in Brassica napus using a diversity set comprising 509 inbreds. A total of 99.4% of the adjacent SNPs on the A genome and 93.0% of the adjacent SNPs on the C genome had a distance smaller than the average range of LD. Therefore, this genome-wide association study is expected to result on average in 14.7% of the possible power. Compared to previous studies in B. napus, the SNP marker density of our study is expected to provide a higher power to detect SNP-trait/-gene associations in the B. napus diversity set. The large number of associations detected for the examined 14 seedling development traits indicated that these are genetically complex inherited. Based on a weighted gene co-expression network analysis in a segregating population, regulatory genes were selected to analyse their gene expression in seedling leaves in the diversity set. The candidate genes GER1, AILP1, PECT, and FBP were strongly correlated with the seedling development traits. Thus, these genes might be interesting targets for breeding and have potential for breeding rapeseed varieties with improved seedling development. For the projected leaf area traits, we identified five different association hot spots on the chromosomes A2, A7, C3, C6, and C7. Further research is required to identify the causative polymorphisms in these association hot spots.

Availability of supporting data

The data sets supporting the results of this article are included within the article and its additional files (Additional file 4: Table S4, Additional file 5: Table S5).

Abbreviations

QTL:: Quantitative trait loci
LD:: Linkage disequilibrium
SSR:: Simple sequence repeat
AFLP:: Amplified fragment-length polymorphism
SNP:: Single nucleotide polymorphism
GWAS:: Genome-wide association study
OSR:: Oilseed rape
DH:: Doubled haploid
WGCNA:: Weighted gene co-expression network analysis
RNA:: Ribonucleic acid
mRNA:: Messenger RNA
DGE-seq:: Digital gene expression sequencing
RT-qPCR:: Reverse transcription quantitative polymerase chain reaction
DNA:: Deoxyribonucleic acid
cDNA:: Complementary DNA
BLAST:: Basic local alignment search tool
PCA:: Principal component analysis
MRD:: Modified Roger’s distance
RIL:: Recombinant inbred line
RFLP:: Restriction fragment length polymorphism
SD:: Standard deviation

References

Blum A. Crop responses to drought and the interpretation of adaptation. Plant Growth Regul. 1996; 20:135–48. doi:10.1007/BF00024010.
Article CAS Google Scholar
Bus A, Körber N, Snowdon RJ, Stich B. Patterns of molecular variation in a species-wide germplasm set of Brassica napus. Theor Appl Genet. 2011; 123:1413–23. doi:10.1007/s00122-011-1676-7.
Article PubMed Google Scholar
Flint-Garcia SA, Thuillet AC, Yu J, Pressoir G, Romero SM, Mitchell SE, et al.Maize association population: a high-resolution platform for quantitative trait locus dissection. Plant J. 2005; 44(6):1054–64. doi:10.1111/.1365-313X.j2005.02591.x.
Article CAS PubMed Google Scholar
Li H, Peng Z, Yang X, Wang W, Fu J, Wang J, et al.Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat Genet. 2013; 45(1):43–50. doi:10.1038/ng.2484.
Article CAS PubMed Google Scholar
Hasan M, Friedt W, Pons-Kühnemann J, Freitag NM, Link K, Snowdon RJ. Association of gene-linked SSR markers to seed glucosinolate content in oilseed rape (Brassica napus ssp. napus). Theor Appl Genet. 2008; 116(8):1035–49. doi:10.1007/s00122-008-0733-3.
Article CAS PubMed Google Scholar
Wang F, Wang X, Chen X, Xiao Y, Li H, Zhang S, et al.Abundance, marker development and genetic mapping of microsatellites from unigenes in Brassica napus. Mol Breeding. 2011; 30(2):731–44. doi:10.1007/s11032-011-9658-7.
Article Google Scholar
Honsdorf N, Becker HC, Ecke W. Association mapping for phenological, morphological, and quality traits in canola quality winter rapeseed (Brassica napus L). Genome. 2010; 53(11):899–907. doi:10.1139/G10-049.
Article CAS PubMed Google Scholar
Chen ZJ. Genetic and epigenetic mechanisms for gene expression and phenotypic variation in plant polyploids. Annu Rev Plant Biol. 2007; 58:377–406. doi:10.1146/annurev.arplant.58.032806.103835.
Article CAS PubMed Central PubMed Google Scholar
Jia Q, Zhang XQ, Westcott S, Broughton S, Cakir M, Yang J, et al., Expression level of a gibberellin 20-oxidase gene is associated with multiple agronomic and quality traits in barley. Theor Applied Genet. 2011; 122(8):1451–60. doi:10.1007/s00122-011-1544-5.
Article CAS Google Scholar
Körber N, Wittkop B, Bus A, Friedt W, Snowdon RJ, Stich B. Seedling development in a Brassica napus diversity set and its relationship to agronomic performance. Theor Appl Genet. 2012; 125(6):1275–87. doi:10.1007/s00122-012-1912-9.
Article PubMed Google Scholar
Basunanda P, Radoev M, Ecke W, Friedt W, Becker HC, Snowdon RJ. Comparative mapping of quantitative trait loci involved in heterosis for seedling and yield traits in oilseed rape (Brassica napus L.)Theor Appl Genet. 2010; 120(2):271–81. doi:10.1007/s00122-009-1133-z.
Article CAS PubMed Central PubMed Google Scholar
Obermeier C, Salazar-Colqui BM, Spamer V, Snowdon RJ. In: (Batley J, editor.) Multiplexed digital gene expression analysis for genetical genomics in large plant populations. New York: Springer; 2015, pp. 119–140. doi:10.1007/978-1-4939-j1966-6_9.
Harper AL, Trick M, Higgins J, Fraser F, Clissold L, Wells R, et al.Associative transcriptomics of traits in the polyploid crop species Brassica napus. Nat Biotechnol. 2012; 30(8):798–802. doi:10.1038/nbt.2302.
Article CAS PubMed Google Scholar
Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, et al.The genome of the mesopolyploid crop species Brassica rapa. Nature Genet. 2011; 43(10):1035–9. doi:10.1038/ng.919.
Article CAS PubMed Google Scholar
Parkin IA, Koh C, Tang H, Robinson SJ, Kagale S, Clarke WE, et al. Transcriptome and methylome profiling reveals relics of genome dominance in the mesopolyploid Brassica oleracea. Genome Biol. 2014; 15(6):R77. doi:10.1186/gb-2014-15-6-r77.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008; 9:559. doi:10.1186/1471-j2105-9-559.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al.Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003; 13(11):2498–504. doi:10.1101/gr.1239303.
Article CAS PubMed Central PubMed Google Scholar
Du Z, Zhou X, Ling Y, Zhang Z, Su Z. agriGO: a GO analysis toolkit for the agricultural community. Nucl Acids Res. 2010; 38:64–70. doi:10.1093/jnar/gkq310.
Article Google Scholar
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ. Basic local alignment search tool. J Mol Biol. 1990; 215:403–10. doi:10.1016/S0022-j2836(05)80360-2.
Article CAS PubMed Google Scholar
Hill WG, Weir BS. Variances and covariances of squared linkage disequilibria in finite populations. Theor Popul Biol. 1988; 33(1):54–78. doi:10.1016/0040-5809(88)90004-4.
Article CAS PubMed Google Scholar
Wright S. Variability within and among natural populations. In: Evolution and the Genetics of Populations, Vol IV. Chicago: The University of Chicago Press: 1978. p. 91.
Google Scholar
Stich B, Möhring J, Piepho HP, Heckenberger M, Buckler ES, Melchinger AE. Comparison of mixed-model approaches for association mapping. Genetics. 2008; 178(3):1745–54. doi:10.1534/genetics.107.079707.
Article PubMed Central PubMed Google Scholar
Bernardo R. Estimation of coefficient of coancestry using molecular markers in maize. Theor Appl Genet. 1993; 85:1055–62. doi:10.1007/jBF00215047.
Article CAS PubMed Google Scholar
Kang HM, Zaitlen Na, Wade CM, Kirby A, Heckerman D, Daly MJ, et al.Efficient control of population structure in model organism association mapping. Genetics. 2008; 178:1709–23. doi:10.1534/genetics.107.080101.
Article PubMed Central PubMed Google Scholar
R Development Core Team. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2011. vol. 1.
Google Scholar
Arumuganathan K, Earle ED. Nuclear DNA content of some important plant species. Plant Mol Biol Rep. 1991; 9(4):415–5. doi:10.1007/jBF02672016.
Google Scholar
Ecke W, Clemens R, Honsdorf N, Becker HC. Extent and structure of linkage disequilibrium in canola quality winter rapeseed (Brassica napus L.)Theor Appl Genet. 2010; 120(5):921–31. doi:10.1007/s00122-009-1221-0.
Article CAS PubMed Central PubMed Google Scholar
Melchinger AE, Utz HF, Schön CC. QTL analyses of complex traits with cross validation, bootstrapping and other biometric methods. Euphytica. 2004; 137(1):1–11. doi:10.1023/B:EUPH.0000040498.48379.68.
Article CAS Google Scholar
Edwards CE, Weinig C. The quantitative-genetic and QTL architecture of trait integration and modularity in Brassica rapa across simulated seasonal settings. Heredity. 2011; 106(4):661–77. doi:10.1038/hdy.2010.103.
Article CAS PubMed Central PubMed Google Scholar
Melchinger AE, Utz HF, Schön CC. Quantitative trait locus (QTL) mapping using different testers and independent population samples in maize reveals low power of QTL detection and large bias in estimates of QTL effects. Genetics. 1998; 149(1):383–403.
CAS PubMed Central PubMed Google Scholar
Lydiate D, Dale P, Lagercrantz U, Parkin I, Howell P. Selecting the optimum genetic background for transgenic varieties, with examples from Brassica In: Cassells A, Jones P, editors. The Methodology of Plant Genetic Manipulation: Criteria for Decision Making. Developments in Plant Breeding. Netherlands: Springer: 1995. p. 351–8. doi:10.1007/978-94-j011-0357-2_43.
Google Scholar
Parkin IAP, Sharpe AG, J KD, Lydiate DJ. Identification of the A and C genomes of amphidiploid Brassica napus (oilseed rape). Genome. 1995; 38(6):1122–31. doi:10.1139/g95-149.
Article CAS PubMed Google Scholar
Parkin IAP, Sharpe AG, Lydiate DJ. doi:10.1139/G03-006. Genome. 2003; 46:291–303.
Article CAS PubMed Google Scholar
Ellis RJ. The most abundant protein in the world,. Trends Biochem Sci. 1979; 4(11):241–4. doi:10.1016/0968-0004(79)90212-3.
Article CAS Google Scholar
Crevillén P, Ventriglia T, Pinto F, Orea A, Mérida A, Romero JM. Differential pattern of expression and sugar regulation of Arabidopsis thaliana ADP-glucose pyrophosphorylase-encoding genes. J Biol Chem. 2005; 280(9):8143–149. doi:10.1074/jbc.M411713200.
Article PubMed Google Scholar
Espada J. Enzymic synthesis of adenosine diphosphate glucose from glucose 1-phosphate and adenosine triphosphate. J Biol Chem. 1962; 237(12):3577–581.
CAS Google Scholar
Geiger DR, Servaites JC. Diurnal regulation of photosynthetic carbon metabolism in C₃ plants. Annu Rev Plant Physiol Plant Mol Biol. 1994; 45:235–56. doi:10.1146/annurev.pp.45.060194.001315.
Article Google Scholar
Hruz T, Wyss M, Docquier M, Pfaffl MW, Masanetz S, Borghi L, et al.RefGenes: identification of reliable and condition specific reference genes for RT-qPCR data normalization. BMC Genomics. 2011; 12(1):156. doi:10.1186/1471-2164-12-156.
Mizoi J, Nakamura M, Nishida I. Defects in CTP:PHOSPHORYLETHANOLAMINE CYTIDYLYLTRANSFERASE affect embryonic and postembryonic development in Arabidopsis. Plant Cell. 2006; 18(12):3370–385. doi:10.1105/tpc.106.040840.
Article CAS PubMed Central PubMed Google Scholar
Sundler R, Akesson B. Regulation of phospholipid biosynthesis in isolated rat hepatocytes. Effect of different substrates. J Biol Chem. 1975; 250:3359–367.
CAS PubMed Google Scholar
Gibellini F, Smith TK. The Kennedy pathway-de novo synthesis of phosphatidylethanolamine and phosphatidylcholine. IUBMB Life. 2010; 62(6):414–28. doi:10.1002/iub.337.
Article CAS PubMed Google Scholar
Chen W, Zhang Y, Liu X, Chen B, Tu J, Tingdong F. Detection of QTL for six yield-related traits in oilseed rape (Brassica napus) using DH and immortalized F₂ populations. Theor Appl Genet. 2007; 115(6):849–58. doi:10.1007/s00122-007-0613-2.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

The authors thank Wolfgang Ecke (University of Göttingen, Germany), the Leibniz Institute of Plant Genetics and Crop Plant Research, Gatersleben (Germany), Nordic Gene Bank, Alnarp (Sweden), the Centre for Genetic Resources (Netherlands), and Warwick Horticulture Research International Genetic Resources Unit (UK) for providing the seeds of the examined germplasm. This research was funded by the Deutsche Forschungsgemeinschaft and the Max Planck Society. It was performed in the framework of the ERA-NET PG project “ASSYST”. We are deeply grateful to Andrea Lossow, Nele Kaul, Frank Eikelmann, and Andreas Lautscham for excellent technical assistance. Finally, the authors thank the associate editor and two anonymous reviewers for their valuable suggestions.

Author information

Authors and Affiliations

Max Planck Institute for Plant Breeding Research, Carl-von-Linné-Weg 10, Köln, 50829, Germany
Niklas Körber, Anja Bus, Jinquan Li & Benjamin Stich
Institute of Crop Science and Resource Conservation, Plant Breeding and Biotechnology, University of Bonn, Katzenburgweg 5, Bonn, 53115, Germany
Niklas Körber & Anja Bus
The Genome Analysis Centre, Norwich Research Park, Norwich, NR4 7UH, UK
Janet Higgins
John Innes Centre, Norwich Research Park, NR4 7UH, Norwich, UK
Ian Bancroft
Department of Biology, Wentworth Way, University of York, Heslington, York, YO41 5DD, UK
Ian Bancroft
Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK S7N OX2, Canada
Erin Eileen Higgins & Isobel Alison Papworth Parkin
Department of Plant Breeding, Research Centre for BioSystems, Land Use and Nutrition, Justus Liebig University, Heinrich-Buff-Ring 26-32, Giessen, 35392, Germany
Bertha Salazar-Colqui & Rod John Snowdon

Authors

Niklas Körber
View author publications
You can also search for this author in PubMed Google Scholar
Anja Bus
View author publications
You can also search for this author in PubMed Google Scholar
Jinquan Li
View author publications
You can also search for this author in PubMed Google Scholar
Janet Higgins
View author publications
You can also search for this author in PubMed Google Scholar
Ian Bancroft
View author publications
You can also search for this author in PubMed Google Scholar
Erin Eileen Higgins
View author publications
You can also search for this author in PubMed Google Scholar
Isobel Alison Papworth Parkin
View author publications
You can also search for this author in PubMed Google Scholar
Bertha Salazar-Colqui
View author publications
You can also search for this author in PubMed Google Scholar
Rod John Snowdon
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin Stich
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Benjamin Stich.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

BS designed and supervised the project. NK contributed to qRT-PCR analyses and carried out most of the experiments. JH and IB performed the WGCNA. EEH and IAAP provided the genotypic information for the SNP markers. BSC and RJS supported the digital gene expression analysis and the field trials of the ExV8-DH population. NK, AB and JL performed the statistical and bioinformatic analyses. NK and BS wrote the manuscript. All authors read and approved the final manuscript.

Additional files

Additional file 1

Table S1. SNP-associations MCLUST 1. SNP associations of MCLUST group 1. Single nucleotide polymorphism (SNP)-trait/-gene associations with P < 0.0001 across the inbreds of MCLUST group 1.

Additional file 2

Table S2. SNP-associations MCLUST 2. SNP associations of MCLUST group 2. Single nucleotide polymorphism (SNP)-trait/-gene associations with P < 0.0001 across the inbreds of MCLUST group 2.

Additional file 3

Table S3. SNP-associations MCLUST 3. SNP associations of MCLUST group 3. Single nucleotide polymorphism (SNP)-trait/-gene associations with P < 0.0001 across the inbreds of MCLUST group 3.

Additional file 4

Table S4. Phenotypic data. Mean values of the phenotypic data of seedling development and gene expression of the 509 inbreds of the ASSYST diversity set of B. napus.

Additional file 5

Table S5. SNP data. Single nucleotide polymorphism (SNP) data of the 509 inbreds of the ASSYST diversity set of B. napus.

Additional file 6

Figure S1-S14. Distribution of the expression levels of the candidate genes and their P-value profile from genome-wide association mapping (GWAS).

Additional file 7

Figure S15-S34. Distribution of the 20 seedling development traits and their P-value profile from genome-wide association mapping (GWAS).

Additional file 8

Figure S35-S38. Correlations of the seedling development traits and the gene expression levels across all 509 inbreds as well as the inbreds of the MCLUST groups 1 to 3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver (https://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Körber, N., Bus, A., Li, J. et al. Seedling development traits in Brassica napus examined by gene expression analysis and association mapping. BMC Plant Biol 15, 136 (2015). https://doi.org/10.1186/s12870-015-0496-3

Download citation

Received: 20 October 2014
Accepted: 20 April 2015
Published: 09 June 2015
DOI: https://doi.org/10.1186/s12870-015-0496-3

Seedling development traits in Brassica napus examined by gene expression analysis and association mapping

Abstract

Background

Results

Conclusion

Background

Methods

Plant material and assessment of seedling development traits

Plant material for weighted gene co-expression network analysis

Digital gene expression analysis

RNA extraction, cDNA synthesis, and RT-qPCR

Genotyping of SNP markers

Statistical analyses

Weighted gene co-expression network analysis

Normalization and differences of gene expression data

Genome positions of the candidate genes

Calculation of adjusted entry means

Principal component analysis and the assessment of linkage disequilibrium

Genome-wide association analyses

Results

Linkage disequilibrium and allele frequency

Gene expression data

Genome-wide association mapping

Correspondence of associations across subgroups

Discussion

Linkage disequilibrium and SNP density

Genome-wide association mapping of seedling development traits

Variation of gene expression in seedling leaves

Genome-wide associations mapping of gene expression correlated with seedling development

Conclusions

Availability of supporting data

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Additional information

Competing interests

Authors’ contributions

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Plant Biology

Contact us