Skip to main content

Association mapping for yield traits in Paeonia rockii based on SSR markers within transcription factors of comparative transcriptome

Abstract

Background

Allelic variation underlying the quantitative traits in plants is caused by the extremely complex regulation process. Tree peony originated in China is a peculiar ornamental, medicinal and oil woody plant. Paeonia rockii, one of tree peony species, is a precious emerging woody oil crop. However, in this valuable plant, the study of functional loci associated with yield traits has rarely been identified. Therefore, to explore the genetic architecture of 24 yield quantitative traits, the association mapping was first reported in 420 unrelated cultivated P. rockii individuals based on the next-generation sequencing (NGS) and single-molecule long-read sequencing (SMLRS).

Results

The developed 58 pairs of polymorphic expressed sequence tag-simple sequence repeat (EST-SSR) markers from 959 candidate transcription factors (TFs) associated with yield were used for genotyping the 420 P. rockii accessions. We observed a high level of genetic diversity (polymorphic information content, PIC = 0.514) and low linkage disequilibrium (LD) between EST-SSRs. Moreover, four subpopulations in the association population were revealed by STRUCTURE analyses. Further, single-marker association analysis identified 141 significant associations, involving 17 quantitative traits and 41 EST-SSRs. These loci were mainly from AP2, TCP, MYB, HSF, bHLH, GATA, and B3 gene families and showed a small proportion of the phenotypic variance (3.79 to 37.45%).

Conclusions

Our results summarize a valuable collection of functional loci associated with yield traits in P. rockii, and provide a precious resource that reveals allelic variation underlying quantitative traits in Paeonia and other woody oil crops.

Background

Woody oil crops are economically essential crops globally because of their high oil content in their seeds and/or fruits, strong resistance, and stable yield. Now they have become the critical source for human edible oil, lubricants, biodiesel, cosmetics, paint and other industries [1, 2]. P. rockii (S. G. Haw et L. A. Lauener) T. Hong et J. J. Li belongs to the Paeonia section Moutan DC., Paeoniaceae, which is originated from northwest China and has become one of the most representative species of tree peonies. The varieties originated from the species have been cultivated widely in China as the second-largest cultivar group of Chinese tree peonies, including about 300 cultivars [3, 4]. In addition to ornamental and medicinal cultivation, P. rockii also presents an advantage agronomic trait for its seed yield and unsaturated fatty acid contents in seeds in recent years [5]. Therefore, P. rockii has been rapidly extended as an emerging woody oil plant. However, how to improve the breeding efficiency and seed yield by using the molecular breeding methods has become a top priority. The breeding system of tree peonies is predominantly outcrossing and the cultivated cultivars are of hybrid origin [3]. Conventional breeding methods for tree peony germplasm mainly include cross and selective breeding, which takes at least 10 years to develop a stable new cultivar. Therefore, the long juvenile phase and complex genetic background make it very difficult to improve the yield using traditional breeding and reverse genetics methods. The better strategies of genetic architecture in the yield traits of P. rockii will be required to shorten the breeding cycle and improve the production value effectively.

Currently, association mapping has been widely used in the identification and genetic testing of important quantitative trait locus (QTL) in various species [6,7,8,9,10,11,12,13,14,15,16,17]. And some progress has been made in studies on crop yield. In popular Upland cotton (Gossypium hirsutum L.), 172 cultivars in China and 331 polymorphic SSRs were used for association mapping of yield-related traits. Totally, 93 significantly associated loci for seven yield traits were identified across more than one environment [18]. To reveal the genetic variations of yield and yield components traits in upland cotton, 403 accessions and 560 genome-wide SSRs were used for the association mapping based on a mixed linear model. A total of 43 marker loci were detected according to the best linear unbiased prediction and in at least three of the six environments (− lgP > 1.30, P < 0.05) [19]. In Hordeum vulgare, a total of 379 cultivars were used for genome-wide association mapping to identify the alleles controlling yield-related traits, and 13 putative genes regulating grain traits in European cultivated barley were obtained [20]. In wheat, association mapping was performed with a mixed linear model to identify the KASP marker for yield-related traits. The results showed that Hap-5A-1/2 of TaSnRK2.9-5A was significantly associated with high thousand kernel weight, while Hap-5A-4 with high grains per spike [21]. These provide a useful reference to identify the molecular markers which are closely associated with yield traits in woody oil crops including tree peonies.

Among dozens of markers, although SNP markers are usually used in crop association mapping, SNP sites are generally biallelic polymorphisms, which are significantly lower than SSR markers, and the development and detection costs are higher. Therefore, SSR markers have become increasingly popular in association analysis because of their codominant inheritance, extensive genome coverage, chromosome-specific location, relative abundance and high throughput genotyping [22]. In particular, the SSR loci in the coding regions, as the expressed TFs, are species conserved and can directly influence the gene transcription or translation [23,24,25]. Currently, EST-SSRs have used for the evaluation of genetic diversity and population structure, variety identification and map construction of different species [26,27,28,29,30,31,32,33]. And EST-SSRs have been considered to be the ideal marker type for the detection of woody plants with high heterozygosity. In tree peonies, reports on the development and application of SSRs are also increasing extensively [34,35,36,37,38,39,40]. However, the coverage of molecular markers, especially EST-SSRs, is still limited, which limits the extensive application of functional markers in genetic diversity, population structure, LD and association analysis.

TFs can modulate gene expression and play a essential critical role in the regulation of yield. For example, IPA1 (Ideal Plant Architecture 1) can promote both yield and disease resistance by sustaining a balance between growth and immunity in rice; The CCT domain-containing gene family has significant impacts on heading date, regional adaptation and grain yield [41, 42]. In wheat, TaNAC2-5A overexpressing transgenic wheat lines show higher grain yield and higher nitrogen accumulation in aerial parts; A wheat CCAAT Box-binding TF can increase the wheat yield with less fertilizer input [43, 44]. In Maize, SBP-box TFs unbranched2 and unbranched3 can affect yield traits by regulating the rate of lateral primordia initiation; Nuclear factor Y (NF-Y) B subunits confer drought tolerance and lead to improved corn yields on water-limited acres [45, 46]. In Brassica napus, overexpression of the brassinosteroid biosynthetic gene DWF4 simultaneously increases seed yield and stress tolerance [47]. In tree peonies, it is the first attempt to combine the SSR loci within TFs associated with yield to conduct the association mapping in P. rockii.

Taken together, the studies on the allelic variation associated with yield traits in P. rockii by combining association mapping and markers within TFs are rarely reported. Here, we sampled the cultivated P. rockii population for association mapping. After that, the genetic diversity and population structure were analyzed based on the developed EST-SSRs by comparative transcriptome. Moreover, LD and single-marker association mapping were conducted to explore the allelic effects on the natural variation of complex yield traits in the constructed population. Our results will lay a foundation to identify the linkage loci of yield traits in P. rockii and will be of great significance for the genetic improvement of yield traits of woody oil crops.

Results

Distribution and statistics of phenotypic traits in the association population

In this study, multiple methods were used to conduct descriptive statistics on phenotypic traits and determine the traits affecting the yield of P. rockii. The one-way ANOVA analysis by R package showed a wide range of phenotypic variation among all the 24 quantitative traits measured. The variation coefficient ranged from 12.03 to 106.63% (mean 38.60%). The average variation coefficient of branch and leaf, flower and fruit traits was 22.57, 37.21 and 49.75%, respectively (Additional file 1: Table S1–1). Correlation analysis between different traits showed 202 significant correlations (P < 0.05), among which 182 showed a highly significant correlation (P < 0.01) (Additional file 1: Table S1–2). The principal component analysis showed that the 24 quantitative traits were divided into six principal components, and the cumulative variance reached up to 73.93% (Additional file 1: Table S1–3). Then, systematic cluster analysis was carried out based on the six principal components. The results showed that the association population was divided into eight categories and 24 quantitative traits were divided into five categories where the square Euclidean distance was respectively 12 and 18 (Additional file 1: Table S1–4, Additional file 2: Figure S1).

To sum up, when individual seed fresh weight was considered as the yield index, we found that each trait was related to yield. Simultaneously, individual fruit number, individual fruit fresh weight, and individual seed number could be considered as the primary traits affecting the yield of P. rockii. A total of 15 traits including flower diameter, petal length, petal width, plant height, crown breadth, compound leaf length, compound leaf width, maximum basal branch angle, single fruit length, single fruit width, single fruit pericarp thickness, multiple fruit fresh weight, multiple fruit seed number, multiple fruit seed fresh weight, and multiple fruit pod fresh weight could be considered as the secondary traits. The remaining traits could be deemed to beviewed as the auxiliary reference factors.

Genetic diversity

All the polymorphic 58 SSRs within TFs were used to evaluate the genetic diversity across 420 genotypes of cultivated P. rockii. Then, a total of 483 alleles were detected. The alleles per locus (NA) ranged from 3 to 16, with an average of 8. The effective number of alleles (NE) ranged from 1.02 to 7.32, with an average of 2.83. The PIC values ranged from 0.021 to 0.849 (mean 0.514) and Shannon’s information index (I) ranged from 0.07 to 2.18 (mean 1.13). The mean values of observed heterozygosity (Ho) and expected heterozygosity (HE) were 0.529 and 0.561, respectively. Since Ho was higher than HE at 26 EST-SSR loci, the Wright’s inbreeding coefficient (FIS) ranged from − 0.828 to 0.867 with a mean value of 0.048 (Table 1). In conclusion, all the EST-SSRs showed a high level of polymorphism.

Table 1 Diversity information parameter at 58 EST-SSRs in the association population of P. rockii

Population structure

STRUCTURE analyses were used to determine the population structure. The result indicated that the Ln P(D) reached a mode at K = 4 before decreasing, and the highest delta K was detected when K = 4 (Fig. 1a, b). The species genetic structure is dicussed with the results up to K = 6. Bayesian methods implemented in the STRUCTURE revealed extensive admixed ancestry for each sampled of P. rockii. Two major well-separated genetic clusters (I-III, IV) were identified at K = 2–6, among which four major well-separated genetic clusters (I, II, III, and IV) were observed at K = 4–6 (Fig. 1c). Therefore, the 420 accessions were divided into four subpopulations.

Fig. 1
figure 1

Estimation of genetic structure of 420 accessions for P. rockii population using 58 EST-SSRs based on the STRUCTURE. a. Log probability data [LnP (D)] for each K value (10 replicates); b. ΔK estimates of the posterior probability distribution of the data for a given K; c. Estimated population structure and clustering of the 420 P. rockii individuals with K = 2 to 6. Individuals are shown by thin vertical lines, which are divided into four major well-separated genetic clusters (I, II, III and IV) standing for the estimated membership probabilities of each individual

LD level of association population

The LD of this association population was evaluated using 58 polymorphic EST-SSR markers. In a total of 1653 marker pairs (r2 ranging from 0.001 to 0.504), 64.13, 58.08 and 47.91% of EST-SSR loci demonstrated significant LD at P < 0.05, P < 0.01 and P < 0.001, respectively. Based on r2 estimates, only 0.67% (r2 ≥ 0.05) and 0.18% (r2 ≥ 0.1) of the loci pairs showed significant LD. Therefore, the overall level of LD between EST-SSR loci was low, and most of them were in linkage equilibrium (r2 < 0.1; P < 0.001), such as PS10 (PB.32979.2), PS12 (PB.59960.1) and PS17 (PB.53756.4) in the MYB gene family. Then, of the 793 assessed loci pairs (P < 0.001), 479 showed r2 levels > 0.005 (60.40%). Among them, several loci showed significant LD values, such as PS50 within PSTCP11(PB.61740.1) (Fig. 2).

Fig. 2
figure 2

Pairwise LD (r2) between EST-SSRs. X and Y axis represent the 58 EST-SSRs. The different colors correspond to the thresholds of r2 and P. r2 < 0.1 and P < 0.001 represent linkage equilibrium, r2 > 0.1 and P < 0.001 represent LD

Association mapping of yield quantitative-related traits in P. rockii

Based on the genotype data, the Q matrix, the K matrix, and yield quantitative traits data, the MLM model was used to analyze the marker-trait associations. For floral traits associated with yield, we performed 232 (58 EST-SSRs × four traits) marker-trait association tests. Out of these, ten associations (4.31%) including 7 EST-SSR loci were significant at the P < 0.01 level. However, correction for multiple testing using the FDR method reduced the number to 8 (3.45%) at a significance threshold of Q < 0.05, involving three traits with 7 EST-SSRs. These loci explained 4.93 to 25.32% of the phenotypic variation, with an average of 11.49%. The number of associations varied across the traits was 2 (petal length) to 3 (flower diameter and petal number). One EST-SSR (PS31) was associated with at least one trait. For 3 out of the eight associations, the gene effect showed overdominance (|d/a| > 1.25). The remaining 5 associations showed additive (|d/a| < 0.50, 2) or partial to full dominance (0.50 < |d/a| < 1.25, 3) (Additional file 3: Table S2–1).

For branch and leaf traits associated with yield, we performed 464 (58 EST-SSRs × eight traits) marker-trait association tests. Among them, 102 associations (21.98%) including 39 EST-SSR loci were significant at the threshold of P < 0.01. However, correction for multiple testing using the FDR method reduced the number to 87 (18.75%) at a significance threshold of Q < 0.05, involving seven traits with 36 EST-SSRs. These loci explained 3.79 to 37.45% (mean 11.06%) of the phenotypic variation. The number of significant associations varied across the traits was 2 (crown breadth and maximum basal branch angle) to 32 (parietal lobule area), as shown in Additional file 3. Thirty-one EST-SSRs were significantly associated with at least one trait. For example, PS25 was associated with plant height, parietal lobule area, parietal lobule length, and compound leaf width. The modes of gene effect for 42 associations were overdominance, 22 were additive, and the remaining 46 were partial to full dominance (Additional file 3: Table S2–2).

For fruit traits associated with yield, we performed 696 (58 EST-SSRs × twelve traits) marker-trait association tests. Out of these, 64 associations (9.20%) including 30 EST-SSRs were significant at the threshold of P < 0.01. However, correction for multiple testing using the FDR method reduced the number to 46 (6.61%) at a significance threshold of Q < 0.05, involving seven traits with 26 EST-SSRs. These loci explained 4.45 to 30.29% (mean 11.47%) of the phenotypic variation. The number of significant associations varied across the traits was 3 (single fruit pericarp thickness, multiple fruit seed number) to 12 (single fruit number), as shown in Additional file 3. Eleven EST-SSRs showed significant associations with at least one trait. For example, PS43 was associated with single fruit number and multiple fruit pod fresh weigh. The modes of gene effect for 21 associations were overdominance, 15 were additive, and the remaining 10 were partial to full dominance (Additional file 3: Table S2–3).

TFs associated with yield traits in P. rockii

Based on the results of association mapping, we observed that the floral traits were significantly associated with SSR loci within TFs from 5 gene families (AP2, NAC, GATA, HSF and GRAS), branch and leaf traits were significantly associated with 16 gene families (AP2, MYB, TCP, bHLH, HSF, GATA, B3, WRKY, etc.). Fruit traits were significantly associated with 15 gene families (AP2, HSF, MYB, WRKY, GATA and so on) (Additional file 3: Table S2). The gene families associated with all the three types of traits were AP2, GATA, HSF, and NAC, among which TFs from AP2 (29) gene family showed the highest association frequency. Moreover, all three types of traits exhibited the associations between identical traits and EST-SSRs from multiple gene families. For example, multiple fruit fresh weight was significantly associated with PS2 from the MADS-box gene family and PS30 from the AP2 gene family. Additionally, pleiotropism was generally shown in all three types of traits. For example, PS53 from the TCP gene family was significantly associated with multiple fruit seed number and multiple fruit seed fresh weight. PS12 from the MYB gene family was significantly associated with multiple fruit fresh weight, multiple fruit seed fresh weight, and multiple fruit pod fresh weight (Additional file 3: Table S2).

SSR loci within TFs associated with fruit traits

As fruit traits were significantly associated with yield, we further dissected the results of association mapping based on the EST-SSRs associated with fruit traits. Thus, five traits with small contribution rate (single fruit number, petal number, parietal lobule area, parietal lobule length and parietal lobule width) were excluded. Further, the results showed that 49 associations were significant at the threshold of P < 0.01 and Q < 0.05, which explained 4.45 to 37.45% (mean 11.09%) of the phenotypic variance. The number of significant associations varied across EST-SSRs was 1 to 7, and 11 out of the 19 EST-SSRs exhibited significant associations with at least one trait. The modes of gene effect for 21 associations were overdominance, 15 were additive, and the remaining 13 were partial to full dominance (Table 2). PS66 (HSF), PS2 (MADS-box), PS53, PS57 (TCP), PS7 (RING), PS145 (WD), PS122, PS131 (WRKY), PS94 (YABBY), PS85 (NAC) and PS43 (bHLH) were the EST-SSRs only associated with fruit traits, so these loci could be considered as the important references for MAS breeding in P. rockii.

Table 2 Summary of significant EST-SSR marker-trait pairs from the association test results in the population of P. rockii

Discussion

Genetic diversity of the population

A vital prerequisite for dissecting population evolution and association analysis is to make clear the genetic diversity of the mapping population and the polymorphism of markers used [48]. Here, we evaluated the genetic diversity of the association population with 58 EST-SSRs developed in P. rockii. The results indicated that the average number of allele was 8, which was higher than that of Ficus carica [27], Pistacia atlantica [28], Cucumis melo [29] and cultivated P. rockii [39], but lower than that of wild P. rockii [4]. Here, we speculate that the differences should be due to the fact that the association population in this research is mainly composed of cultivated germplasm resources and there is a high level of heterozygosity in P. rockii. Simultaneously, the number and type of EST-SSRs and the characteristics of the test population are different.

Additionally, the population FIS [49, 50] ranged from − 0.828 to 0.867 with an average of 0.048, of which 26 EST-SSRs were negative, indicating that there was significant heterozygous redundancy in the mapping population. We speculate that this is related to the hybrid origin and self-incompatibility of cultivated germplasm in P. rockii [3], which is consistent with the reported results in Populus tomentosa [51] and P. rockii [39], etc. Moreover, the positive selection of mutation loci and heterosis in the process of evolution is also critical reasons for heterozygous redundancy of P. rockii. Therefore, our results prove that highly heterozygous redundancy is an important biological characteristic of P. rockii, and further support the view that the cultivated resources in P. rockii are complex hybrid origin [3].

Comparing to other outcrossing woody plants, we detected a high level of genetic diversity in P. rockii (PIC = 0.514) [52, 53], which was higher than that of P. tomentosa [51] and F. carica [27], but lower than that in P. atlantica [28] and Prunus avium [54]. This is different from the reported results of genic-SSRs with low polymorphism level, which may be due to the hybrid origin of cultivated P. rockii or the fact that we sample the accessions with significant genotype differences in this study.

Population structure

The analysis of crop population structure can not only reflect the gene exchange and affinity among individuals, but also be the premise of association mapping, which is conducive to improving the mapping efficiency and avoiding the emergence of false positives [55]. In this research, the association population was classified into four subpopulations based on STRUCTURE analyses, which is different from the previous report of three subpopulations in P. rockii [39]. We speculate that it is due to the differences in the EST-SSRs and population size. Moreover, the genetic information of the cultivated germplasm in P. rockii is mainly divided into four subgroups, which is consistent with the source of cultivated germplasm in this study, as shown in Fig. 3. We speculate that the four subpopulations may correspond to four gene pools and similarly reflect their geographic origins. It is also an essential reason for the rich genetic diversity.

Fig. 3
figure 3

The source and distribution of population materials of P. rockii. A collection of more than 200, 000 cultivated germplasm resources of P. rockii is mainly obtained from Lanzhou, Baiyin, Linxia and Dingxi city, Gansu province in Northwest China and cultivated in open fields, using standard agronomic practices, in Beijing Guose Peony Garden of Yanqing District at Beijing, China. Figure 3 was created in ArcGIS 10.0 http://www.esrichina.com.cn

The LD level of woody plants

The analysis of LD level in population is not only the genetic underpinnings for association mapping, but also provides a reference for the selection of appropriate association analysis strategies. Generally speaking, the LD level is low in cross-pollination plants, such as P. tomentosa [6, 7, 56], Eucalyptus [57], Gossypium hirsutum [58] and P. rockii [39]. The tree peony is an outcrossing species, thus we observed a low level of LD between EST-SSRs in the association population. We speculate that a significant amount of human intervention such as cross, controlled pollination, and germplasm exchange are also essential reasons of low LD levels in P. rockii. Moreover, the LD level of genic-SSR loci is not representative of the whole genome or the gene interval region level, which may also lead to the low LD levels [59]. Additionally, the lack of genomic information and the unknown genetic distance of EST-SSR loci in P. rockii limit the analysis for the decay of LD between markers, which remains to be studied further.

Association mapping QTL for yield quantitative traits

SSR markers based on candidate genes have higher genetic effects in regulating the expression and function of genes related to quantitative traits [60]. In this study, 41 EST-SSRs and 141 associations related to yield traits were identified in the association population of P. rockii. Among them, SSR loci from MADS-box, AP2, MYB, and other gene families showed high genetic effects. Simultaneously, there were not only significant associations between one trait and SSRs in multiple gene families, but also pleiotropism or co-localized associations, which was consistent with the reported results [6, 21, 39]. We speculate that these pleiotropic associations are useful for discovering the important genomic regions and valuable in trait improvement by using MAS.

In this research, we detected 176 combinations associated with three different types of traits. Still, this number was reduced to 141 after correction for multiple testing, which further improved the accuracy of the association results. Moreover, studies have shown that differences in population size and structure can cause variations in the results of association mapping. A typical association population should be a combination of multiple independent and unrelated individuals from the same region [61]. Therefore, to reduce the false positive association and provide a high precision estimation of allelic variation, we still need to verify the association results in combination with the validation populations in different regions or molecular biology experiments [62, 63].

Association mapping of complex quantitative traits in plant can detect many significantly associated markers, but explain a small portion of phenotypic variance [64]. In this study, the average interpretation rate of EST-SSRs on flower (10.81%), branch and leaf (10.40%), and fruit traits (6.53%) was low, which were consistent with the reported results [59, 61]. Besides, the analysis of genetic regulation relationship of quantitative traits in plants is helpful in utilizing the significantly association combinations for breeding. For example, the additive effect loci in the associations can account for significant genetic variation by cumulative genetic effects, while superdominant effect loci indicate that heterozygotes may superior to homozygotes. In this study, single fruit length was significantly associated with 2 EST-SSRs (PS131, PS94) with additive effects, which could add up to explain 14.63% of phenotypic variation without considering the intermarker effect. PS66 was significantly associated with the multiple fruit seed fresh weight and showed a superdominant effect (|d/a| = 4.877), indicating that individuals with PS66 heterozygous loci in the association population might generate heavier seeds. So the combination of superdominant genetic loci might show greater heterosis. All in all, the combinations of genotypes with the same positive or negative effects can be used for the early selection of target traits.

TFs from AP2 gene family associated with yield

TFs with an APETELA2 (AP2) domain play significant roles in plant growth, development, and stress responses. Studies have also shown that TFs from the AP2 gene family are associated with crop yield improvement [65,66,67,68]. In this research, TFs from the AP2 gene family were not only significantly associated with the three types of traits, but also showed the highest association frequency. Therefore, we speculate that TFs from the AP2 gene family are important factors in regulating yield quantitative-related traits of P. rockii. Moreover, TFs with an AP2 domain in Arabidopsis thaliana and Oryza sativa can control seed weight and seed yield [65, 66]. In this research, PS19 (PB.50898.1) from AP2 gene family was significantly associated with seed number (multiple fruit seed number, |d/a| = 1.517) and seed weight (multiple fruit seed fresh weight, |d/a| = 1.444) in P. rockii, and the modes of gene effect were all overdominance (|d/a| > 1.25). Therefore, we speculate that PS19 is likely to be an essential marker for regulating the seed yield in P. rockii. In addition, TFs from the AP2 gene family are also significantly associated with other yield traits such as fruit size and fruit weight, which remains to be further verified in future studies.

MYB-like TFs associated with yield

The MYB family of proteins comprise a large family of plant transcription factors, participating in the regulation of plant growth and development [69,70,71,72,73,74]. TFs with an MYB domain play an essential role in the regulation of crop yield [75,76,77]. In this research, in addition to the AP2 gene family, TFs from the MYB gene family also showed more associations in the mapping population. Three EST-SSRs (PS10, PS12 and PS17) from MYB gene family were significantly associated with fruit (4 associations) and branch and leaf traits (11 associations). Of these, PS12 (PB.59960.1) was significantly associated with fruit weight and seed weight-related traits, as shown in Additional file 3: Table S2. We speculate that PS12 is a key SSR loci for regulating yield of P. rockii. Also, a study has shown that the novel MYB-like TF OsMPH1 can regulate plant height and improve grain yield in rice [76]. We also observed that PS10 (PB.32979.2) and PS17 (PB.53756.4) from the MYB gene family were mainly associated with branch and leaf traits. Among them, only PS17 was significantly associated with plant height in the mapping population. So we speculate that PS17 is an important SSR loci regulating plant height associated with yield in P. rockii.

Conclusions

For the association mapping in P. rockii, we first constructed an association population consisting of 420 individuals. Then, the developed 58 polymorphic SSR loci within TFs associated with yield were selected for association mapping. Moreover, the genetic diversity and population structure were evaluated with the polymorphic loci, which proved that the population showed a high level of polymorphism and four subpopulations. Further, the results of association analysis based on single-marker showed that the 17 yield quantitative traits were regulated by 41 EST-SSR loci from 16 gene families, and 141 significant association combinations were identified. All these results furnish practical information to explore the effective functional loci associated with yield traits in tree peonies, it is also of great significance for the selection of yield-related traits and the cultivation of high-yield cultivars for oil tree peony.

Methods

Plant materials

More than 200, 000 seedling resources of cultivated P. rockii were mainly identified and collected from Gansu Province in northwest China by Beijing Guose Peony Technologies Co., Ltd. All these materials represented diverse genetic resources related to yield quantitative traits and were cultivated with general agronomic practices in the open field of Beijing Guose Peony Garden in Beijing, China (40°45′N, 115°97′E) (Fig. 3). Then based on the principle of covering the existing phenotypic variation as much as possible, we sampled a set of 420 individuals from the collection. All the evaluated samples were approximately 15 years old and covered three major flower types and eight color schemes. In total, 24 quantitative traits with stable performance were observed, demonstrating an effective assemblage of phenotypic traits.

Phenotypic data

The 420 individuals from the association population were scored based on 24 candidate quantitative traits with at least three replicates per genotype. In total, four flower traits (flower diameter, petal length, petal width, and petal number) were measured at the full bloom using one either digital caliper (YB5001B, Kraftwelle Industrial Co. Ltd., China) or measuring tape. A total of 8 branch and leaf traits (plant height, crown breadth, parietal lobule area, parietal lobule length, parietal lobule width, compound leaf length, compound leaf width, and maximum basal branch angle) were measured. Among them, plant height, crown breadth, compound leaf length, and compound leaf width were measured using a measuring tape, the maximum basal branch angle was measured with Protractor Edge, and the other traits were measured with CI-203 laser leaf area meter (CID, USA). In addition, a total of 12 fruit traits (single fruit number, single fruit length, single fruit width, single fruit pericarp thickness, multiple fruit fresh weight, multiple fruit seed number, multiple fruit seed fresh weight, multiple fruit pod fresh weight, individual fruit number, individual fruit fresh weight, individual seed number, and individual seed fresh weight) were measured. The electronic balance was used to measure weight. The other traits were measured with the digital caliper and measuring tape (Additional file 4: Table S3).

DNA extraction and EST-SSR markers genotyping

For each individual in association population, the total genomic DNA was extracted from silica gel-dried fresh and young leaves using the EASYspin Plus Complex Plant DNA kit (Aidlab Biotechnologies Co., Ltd. Beijing, China) according to the manufacturer’s instructions with minor modifications. Five microliter of each total DNA was assessed by 1% Tris-acetic acid-EDTA (TAE) agarose gel electrophoresis, and 1 μL was assessed using NanoDrop ND2000 [78] (A260/A280 > 1.8, 28S/18S > 1.0, DNA concentration ≥ 200 ng/μL). Then all the DNA concentrations were adjusted to 50 ng/μL for polymerase chain reaction (PCR).

We have conducted an RNA-seq experiment of flower buds of P. rockii ‘Jingshunfen’ and P. rockii ‘Fenmiantaosai’ based on NGS and SMLRS technologies [79]. The RNA-seq data have been submitted to the NCBI Sequence Read Archive (SRR9915032, https://www.ncbi.nlm.nih.gov/sra/?term=SRR9915032, and SRR10872586, https://www.ncbi.nlm.nih.gov/sra/?term=SRR10872586). Based on the sequencing data, the candidate 959 TFs from 21 gene families associated with yield based on previous reports were screened. Then a total of 166 EST-SSRs containing six nucleotide repeat types were identified, with an average of one SSR per 5.78 unigenes. Among them, 58 polymorphic EST-SSR markers have been identified and proven to be effective in Paeonia (Unpublished observationsFootnote 1). These 58 EST-SSRs were selected to genotype the 420 accessions (Additional file 5: Table S4).

Each PCR reaction was performed in a total reaction mixture volume of 10 μL containing 5 μL 2 × Power Taq PCR MasterMix (BioTeke, Beijing, China), 0.5 μL (10 pmol) each primer, 3.0 μL dd H2O and 1 μL (50 ng) template DNA. The amplification program was as follows: 5 min at 95 °C, 35 cycles of 30 s at 95 °C, 30 s at the appropriate annealing temperature, 1 min at 72 °C, and 10 min at 72 °C, 4 °C hold. The products were separated by capillary electrophoresis using an ABI3730XL capillary sequencer along with an internal size standard (Applied Biosystems, Carlsbad, CA, USA) after confirmation of PCR amplification by electrophoresis on a 1% agarose gel. The polymorphic EST-SSR loci were read with GeneMarker v1.80 software using LIZ 600 size standards (SoftGenetics, State College, Pennsylvania, USA). Subsequently, Micro-Checker v2.2.3 (http://www.microchecker.hull.ac.uk/) was applied to identify and to correct the genotyping errors [80].

Data analysis

Descriptive statistics were performed for 24 quantitative traits, including coefficient of variation (CV/ % = standard deviation / mean × 100%), one-way ANOVA (single factor completely randomized trial design), correlation analysis, principal component analysis (PCA), and cluster analysis. All the statistics were carried out by using IBM SPSS Statistics 20.0 software and R language package.

The developed 58 polymorphic EST-SSRs were used to analyze the genetic diversity of association population. The summary statistics of NA, NE, Ho, HE, PIC, I and FIS were calculated by GenAlEx v6.501 [81] and POPGENE v1.32 [82]. The developed 58 polymorphic EST-SSRs were also used to analyze and evaluate the population structure. The Bayesian method in the software package, STRUCTURE 2.3.4 (http://pritch.bsd.uchicago.edu/structure.html) [83], was used to infer the number of subpopulations (K) through an admixture model. For each value of K (K = 1–19), ten independent runs were performed with a burn-in period of 100,000 followed by 200,000 Markov Chain Monte Carlo (MCMC) replications. Then the results were submitted to the Structure Harvester (http://taylor0.biology.ucla.edu/struct_harvest/) [84]. As a result, LnP(D) and ΔK were used to detect the optimum K value [85]. Then the CLUMPP v1.1.275 was used to analyze the results from replicate analyses for optimal alignments of replicate clusters [86]. The output from CLUMPP was displayed by the cluster visualization program DISTRUCT [87]. Structural analysis was used to determine the optimum population structure and correct false positives for association mapping.

LD was measured as the squared correlation of allele frequencies r2. The r2 values between pairs of EST-SSRs (minor allele frequencies > 1%) were calculated with 105 permutations using the TASSEL v2.0.1 software (http://www.maizegenetics.net/). The pairs of loci were considered to show a significant LD when P < 0.001. In the association mapping, the TASSEL v2.0.1 software package was used for marker-trait analysis with 104 permutations by using the mixed linear model (MLM) [88]. The Q matrix estimating the membership coefficients for each accession was derived from the STRUCTURE runs. The relative kinship matrix (K) was determined by using SPAGeDi software v1.2 [89]. False discovery rate (FDR) analyses were conducted using QVALUE in R [90].

The gene effect was assessed by using the ratio of dominance (d) to additive (a) effects, which was estimated from least-square means for each genotypic class. The values of 0.50 < |d/a| < 1.25, |d/a| ≤ 0.5 and |d/a| > 1.25 were defined as partial or complete dominance, additive effects and under- or overdominance, respectively. The algorithms of dominance and additive effects were d = GBb - 0.5 (GBB + Gbb) and 2a = |GBB - Gbb|, respectively (Gij: The mean value of the phenotype corresponding to the ijth genotype; BB, bb: Different homozygous genotypes; Bb: Heterozygous genotype) [91].

Availability of data and materials

All the datasets supporting the conclusions of this article are within the paper and its Additional files. The RNA-seq data that support the findings of this study have been deposited to the NCBI Sequence Read Archive (SRR9915032, https://www.ncbi.nlm.nih.gov/sra/?term=SRR9915032 and SRR10872586, https://www.ncbi.nlm.nih.gov/sra/?term=SRR10872586). All the materials that support these findings do not contain wild resources, and all of them are cultivated germplasm resources of P. rockii. Beijing Guose Peony Technologies Co., Ltd. is in full compliance with institutional, national or international guidelines and has obtained appropriate permissions and business licenses.

Notes

  1. Liu N, Cheng FY, Guo X, Zhong Y. Development and application of microsatellite markers within transcription factors in tree peony (Paeonia rockii) based on next-generation and single-molecule long-read RNA-seq. Journal of Integrative Agriculture. 2020.

Abbreviations

P. rockii :

Paeonia rockii

NGS:

Next-generation sequencing

SMLRS:

Single-molecule long-read sequencing

EST-SSR:

Expressed sequence tag-simple sequence repeat

TF:

Transcription factor

PIC:

Polymorphic information content

LD:

Linkage disequilibrium

MAS:

Marker-assisted selection

QTL:

Quantitative trait locus

GWAS:

Genome-wide association studies

P. tomentosa :

Populus tomentosa

F. carica :

Ficus carica

P. atlantica :

Pistacia atlantica

PCR:

Polymerase chain reaction

References

  1. Wu CT, Liu R, Li Y, Zeng RZ. Computational identification of microRNA in five woody oil tree crops and their miRNA target sequences. J Oil Palm Res. 2018;30(1):47–60.

    CAS  Google Scholar 

  2. Wang J, Lin W, Yin Z, Wang L, Dong S, An JY, Lin ZX, Yu HY, Shi LL, Lin SZ, Chen SL. Comprehensive evaluation of fuel properties and complex regulation of intracellular transporters for high oil production in developing seeds of Prunus sibirica for woody biodiesel. Biotechnol Biofuels. 2019;12(1):6.

    Article  PubMed  PubMed Central  Google Scholar 

  3. Cheng FY, Li JJ, Chen DZ, Zhang ZS. Chinese Paeonia Rockii. Beijing: Chinese Forestry Publishing House; 2005.

    Google Scholar 

  4. Yuan JH, Cheng FY, Zhou SL. Genetic structure of the tree peony (Paeonia rockii) and the Qinling Mountains as a geographic barrier driving the fragmentation of a large population. PLoS One. 2012;7(4):e34955.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Li SS, Yuan RY, Chen LG, Wang LS, Hao XH, Wang LJ, Zheng XC, Du H. Systematic qualitative and quantitative assessment of fatty acids in the seeds of 60 tree peony (Paeonia section Moutan DC.) cultivars by GC–MS. Food Chem. 2015a;173:133–40.

    Article  CAS  PubMed  Google Scholar 

  6. Du QZ, Pan W, Xu BH, Li BL, Zhang DQ. Polymorphic simple sequence repeat (SSR) loci within cellulose synthase (PtoCesA) genes are associated with growth and wood properties in Populus tomentosa. New Phytol. 2013a;197(3):763–76.

    Article  CAS  PubMed  Google Scholar 

  7. Du QZ, Pan W, Tian JX, Li BL, Zhang DQ. The UDP-glucuronate decarboxylase gene family in Populus: structure, expression, and association genetics. PLoS One. 2013b;8(4):e60880.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Jiang YM, Jiang QY, Hao CY, Hou J, Wang LF, Zhang HN, Zhang SN, Chen XH, Zhang XY. A yield-associated gene TaCWI, in wheat: its function, selection and evolution in global breeding revealed by haplotype analysis. Theor Appl Genet. 2015;128(1):131–43.

    Article  CAS  PubMed  Google Scholar 

  9. Pace J, Gardner C, Romay C, Ganapathysubramanian B, Lübberstedt T. Genome-wide association analysis of seedling root development in maize (Zea mays L.). BMC Genomics. 2015;16(1):47.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  10. Li PR, Zhang F, Chen SM, Jiang JF, Wang HB, Su JS, Fang WM, Guan ZY, Chen FD. Genetic diversity, population structure and association analysis in cut chrysanthemum (Chrysanthemum morifolium Ramat.). Mol Gen Genomics. 2016;291(3):1117–25.

    Article  CAS  Google Scholar 

  11. Wei LJ, Jian HJ, Lu K, Filardo F, Yin NW, Liu LZ, Qu CM, Li W, Du H, Li JN. Genome-wide association analysis and differential expression analysis of resistance to Sclerotinia stem rot in Brassica napus. Plant Biotechnol J. 2016;14(6):1368–80.

    Article  CAS  PubMed  Google Scholar 

  12. Lu K, Peng L, Zhang C, Lu JH, Yang B, Xiao ZC, Liang Y, Xu XF, Qu CM, Zhang K, et al. Genome-wide association and transcriptome analyses reveal candidate genes underlying yield-determining traits in Brassica napus. Front Plant Sci. 2017;8:206.

    PubMed  PubMed Central  Google Scholar 

  13. Würschum T, Leiser WL, Langer SM, Tucker MR, Longin CFH. Phenotypic and genetic analysis of spike and kernel characteristics in wheat reveals long-term genetic trends of grain yield components. Theor Appl Genet. 2018;131(10):2071–84.

    Article  PubMed  CAS  Google Scholar 

  14. Zhao XW, Luo LX, Cao YH, Liu YJ, Li YH, Wu WM, Lan YZ, Jiang YW, Gao SB, Zhang ZM, et al. Genome-wide association analysis and QTL mapping reveal the genetic control of cadmium accumulation in maize leaf. BMC Genomics. 2018;19(1):91.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  15. Chong XR, Su JS, Wang F, Wang HB, Song AP, Guan ZY, Fang WM, Jiang JF, Chen SM, Chen FD, Zhang F. Identification of favorable SNP alleles and candidate genes responsible for inflorescence-related traits via GWAS in chrysanthemum. Plant Mol Biol. 2019;99(4–5):407–20.

    Article  CAS  PubMed  Google Scholar 

  16. He YJ, Hu DX, You JC, Wu DM, Cui YX, Dong HL, Li JN, Qian W. Genome-wide association study and protein network analysis for understanding candidate genes involved in root development at the rapeseed seedling stage. Plant Physiol Bioch. 2019;137:42–52.

    Article  CAS  Google Scholar 

  17. Mazaheri M, Heckwolf M, Vaillancourt B, Gage JL, Burdo B, Heckwolf S, Barry K, Lipzen A, Ribeiro CB, Kono TJY, et al. Genome-wide association analysis of stalk biomass and anatomical traits in maize. BMC Plant Biol. 2019;19(1):45.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Li CQ, Dong N, Fu YZ, Sun RR, Wang QL. Marker detection and elite allele mining for yield traits in upland cotton (Gossypium hirsutum L.) by association mapping. J Agr Sci. 2017;155(4):613–28.

    Article  CAS  Google Scholar 

  19. Dong CG, Wang J, Chen QC, Yu Y, Li BC. Detection of favorable alleles for yield and yield components by association mapping in upland cotton. Genes Genom. 2018;40(7):1–10.

    Article  Google Scholar 

  20. Xu X, Sharma R, Tondelli A, Russell J, Comadran J, Schnaithmann F, Pillen K, Kilian B, Cattivelli L, Thomas WTB, Flavell AJ. Genome-wide association analysis of grain yield-associated traits in a pan-European barley cultivar collection. Plant Genome. 2018a;11:170073.

    Article  CAS  Google Scholar 

  21. Rehman SU, Wang JY, Chang XP, Zhang XY, Mao XG, Jing RL. A wheat protein kinase gene TaSnRK2.9-5A associated with yield contributing traits. Theor Appl Genet. 2019;132(4):907–19.

    Article  CAS  Google Scholar 

  22. Kalia RK, Rai MK, Kalia S, Singh R, Dhawan AK. Microsatellite markers: an overview of the recent progress in plants. Euphytica. 2011;177(3):309–34.

    Article  CAS  Google Scholar 

  23. Singh N, Choudhury DR, Singh AK, Kumar S, Srinivasan K, Tyagi RK, Singh NK, Singh NR. Comparison of SSR and SNP markers in estimation of genetic diversity and population structure of Indian rice varieties. PLoS One. 2013;8(12):e84136.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  24. Gonzaga ZJ, Aslam K, Septiningsih EM, Collard BCY. Evaluation of SSR and SNP markers for molecular breeding in rice. Plant Breed Biotech. 2015;3:139–52.

    Article  Google Scholar 

  25. Parthiban S, Govindaraj P, Senthilkumar S. Comparison of relative efficiency of genomic SSR and EST-SSR markers in estimating genetic diversity in sugarcane. 3 Biotech. 2018;8(3):144.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Boureima S, Zakaria K, Pauline BK, Mariam K, Ernest TR, Nerbéwendé S, Romaric NK, Hamed OM, Boukaré K. Issa et a, Mahamadou S. evaluation of genetic diversity of African eggplant [Solanum aethiopicum (L.) sub sp Kumba] using EST-SSR molecular markers. Int J Curr Microbiol App Sci. 2018;7(2):2470–9.

    Article  Google Scholar 

  27. Boudchicha RH, Hormaza JI, Benbouza H. Diversity analysis and genetic relationships among local Algerian fig cultivars (Ficus carica l.) using SSR markers. S Afr J Bot. 2018;116:207–15.

    Article  CAS  Google Scholar 

  28. El Zerey-Belaskri A, Ribeiro T, Alcaraz ML, Zerey WE, Castro S, Loureiro J, Benhassaini H, Hormaza JI. Molecular characterization of Pistacia atlantica Desf. Subsp. atlantica (Anacardiaceae) in Algeria: genome size determination, chromosome count and genetic diversity analysis using SSR markers. Sci Hortic. 2018;227:278–87.

    Article  CAS  Google Scholar 

  29. Wang YL, Gao LY, Yang SY, Xu YB, Zhu HY, Yang LM, Li Q, Hu JB, Sun SR, Ma CS. Molecular diversity and population structure of oriental thin-skinned melons, Cucumis melo subsp. agrestis, revealed by a set of core SSR markers. Sci Hortic. 2018;229:59–64.

    Article  CAS  Google Scholar 

  30. Xiang CG, Duan Y, Li HB, Ma W, Huang SW, Sui XL, Zhang ZH, Wang CL. A high-density EST-SSR-based genetic map and QTL analysis of dwarf trait in Cucurbita pepo L. Int J Mol Sci. 2018;19(10):3140.

    Article  PubMed Central  CAS  Google Scholar 

  31. Xu XY, Zhou CP, Zhang Y, Zhang WQ, Gan XH, Zhang HX, Gao Y, Gan SM. A novel set of 223 EST-SSR markers in Casuarina L. ex Adans.: polymorphisms, cross-species transferability, and utility for commercial clone genotyping. Tree Genet Genomes. 2018b;14(2):30.

    Article  Google Scholar 

  32. Ukoskit K, Posudsavang G, Pongsiripat N, Chatwachirawong P, Klomasa-ard P, Poomipant P, Tragoonrung S. Detection and validation of EST-SSR markers associated with sugar-related traits in sugarcane using linkage and association mapping. Genomics. 2019;111(1):1–9.

  33. Torokeldiev N, Ziehe M, Gailing O, Finkeldey R. Genetic diversity and structure of natural Juglans regia L. populations in the southern Kyrgyz Republic revealed by nuclear SSR and EST-SSR markers. Tree Genet Genomes. 2019;15(1):5.

    Article  Google Scholar 

  34. Gai SP, Zhang YX, Mu P, Liu CY, Liu S, Dong L, Zheng GS. Transcriptome analysis of tree peony during chilling requirement fulfillment: assembling, annotation and markers discovering. Gene. 2012;497(2):256–62.

    Article  CAS  PubMed  Google Scholar 

  35. Zhang JJ, Shu QY, Liu ZA, Ren HX, Wang LS, Keyser ED. Two EST-derived marker systems for cultivar identification in tree peony. Plant Cell Rep. 2012;31(2):299–310.

    Article  CAS  PubMed  Google Scholar 

  36. Wang DX, Ma H, Zhang YL, Duan AA, Li WJ, Li ZH. Paeonia (Paeoniaceae) expressed sequence tag-derived microsatellite markers transferred to Paeonia delavayi. Genet Mol Res. 2013;12(2):1278–82.

    Article  CAS  PubMed  Google Scholar 

  37. Cai CF, Cheng FY, Wu J, Zhong Y, Liu GX. The first high-density genetic map construction in tree peony (Paeonia Sect. Moutan) using genotyping by specific-locus amplified fragment sequencing. PloS One. 2015;10(5):e0128584.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  38. Wu J, Cai CF, Cheng FY, Cui HL, Zhou H. Characterisation and development of EST-SSR markers in tree peony using transcriptome sequences. Mol Breeding. 2014;34(4):1853–66.

    Article  CAS  Google Scholar 

  39. Wu J, Cheng FY, Cai CF, Zhong Y, Jie X. Association mapping for floral traits in cultivated Paeonia rockii based on SSR markers. Mol Gen Genomics. 2017;292(1):187–200.

    Article  CAS  Google Scholar 

  40. Peng LP, Cai CF, Zhong Y, Xu XX, Xian HL, Cheng FY, Mao JF. Genetic analyses reveal independent domestication origins of the emerging oil crop Paeonia ostii, a tree peony with a long-term cultivation history. Sci Rep. 2017;7(1):5340.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  41. Wang J, Zhou L, Shi H, Chern M, Yu H, Yi H, He M, Yin JJ, Zhu XB, Li Y, et al. A single transcription factor promotes both yield and immunity in rice. Science. 2018;361(6406):1026–8.

    Article  CAS  PubMed  Google Scholar 

  42. Zhang J, Yong H, Xu L, He Q, Fan XW, Xing YZ. The CCT domain-containing gene family has large impacts on heading date, regional adaptation, and grain yield in rice. J Integr Agr. 2017;16(12):2686–97.

    Article  CAS  Google Scholar 

  43. He X, Qu BY, Li WJ, Zhao XQ, Teng W, Ma WY, Ren YZ, Li B, Li ZS, Tong YP. The nitrate-inducible NAC transcription factor TaNAC2-5A controls nitrate response and increases wheat yield. Plant Physiol. 2015;169(3):1991–2005.

    CAS  PubMed  PubMed Central  Google Scholar 

  44. Qu B, He X, Wang J, Zhao YY, Teng W, Shao A, Zhao XQ, Ma WY, Wang JY, Li B, Li ZS, Tong YP. A wheat CCAAT box-binding transcription factor increases the grain yield of wheat with less fertilizer input. Plant Physiol. 2015;167(2):411–23.

    Article  CAS  PubMed  Google Scholar 

  45. Chuck GS, Brown PJ, Meeley R, Hake S. Maize SBP-box transcription factors unbranched2 and unbranched3 affect yield traits by regulating the rate of lateral primordia initiation. P Natl Acad Sci. 2014;111(52):18775–80.

    Article  CAS  Google Scholar 

  46. Nelson DE, Repetti PP, Adams TR, Creelman RA, Wu JR, Warner DC, Anstrom DC, Bensen RJ, Castiglioni PP, Donnarummo MG, et al. Plant nuclear factor Y (NF-Y) B subunits confer drought tolerance and lead to improved corn yields on water-limited acres. P Natl Acad Sci. 2007;104(42):16450–5.

    Article  CAS  Google Scholar 

  47. Sahni S, Prasad BD, Liu Q, Grbic B, Sharpe A, Singh SP, Krishna P. Overexpression of the brassinosteroid biosynthetic gene DWF4 in Brassica napus simultaneously increases seed yield and stress tolerance. Sci Rep. 2016;6:28298.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Ingvarsson PK. Multilocus patterns of nucleotide polymorphism and the demographic history of Populus tremula. Genetics. 2008;180(1):329–40.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Wright S. Coefficients of inbreeding and relationship. Am Nat. 1922;56(645):330–8.

    Article  Google Scholar 

  50. Slate J, David P, Dodds KG, Veenvliet BA, Glass BC, Broad TE, McEwan JC. Understanding the relationship between the inbreeding coefficient and multilocus heterozygosity: theoretical expectations and empirical data. Heredity. 2004;93(3):255.

    Article  CAS  PubMed  Google Scholar 

  51. Du QZ, Wang BW, Wei ZZ, Zhang DQ, Li BL. Genetic diversity and population structure of Chinese white poplar (Populus tomentosa) revealed by SSR markers. J Hered. 2012b;103(6):853–62.

    Article  PubMed  Google Scholar 

  52. Botstein D, White RL, Skolnick M, Davis RW. Construction of a genetic linkage map in man using restriction fragment length polymorphisms. Am J Hum Genet. 1980;32(3):314.

    CAS  PubMed  PubMed Central  Google Scholar 

  53. Yadav HK, Ranjan A, Asif MH, Mantri S, Sawant SV, Tuli R. EST-derived SSR Markers in Jatropha curcas L.: development, characterization, polymorphism, and transferability across the species/genera. Tree Genet Genomes. 2011;7(1):207–19.

    Article  Google Scholar 

  54. Ganopoulos IV, Kazantzis K, Chatzicharisis I, Karayiannis I, Tsaftaris AS. Genetic diversity, structure and fruit trait associations in Greek sweet cherry cultivars using microsatellite based (SSR/ISSR) and morpho-physiological markers. Euphytica. 2011;181(2):237–51.

    Article  Google Scholar 

  55. King RA, Harris SL, Karp A, Barker JHA. Characterisation and inheritance of nuclear microsatellite loci for use in population studies of the allotetraploid Salix alba–Salix fragilis complex. Tree Genet Genomes. 2010;6(2):247–58.

    Article  Google Scholar 

  56. Chhetri HB, Macaya-Sanz D, Kainer D, Biswal AK, Evans LM, Chen JG, Collins C, Hunt K, Mohanty SS, Rosenstiel T, et al. Multi-trait genome-wide association analysis of Populus trichocarpa identifies key polymorphisms controlling morphological and physiological traits. New Phytol. 2019;223(1):293–309.

    Article  CAS  PubMed  Google Scholar 

  57. Külheim C, Yeoh SH, Maintz J, Foley W, Moran GF. Comparative SNP diversity among four Eucalyptus species for genes from secondary metabolite biosynthetic pathways. BMC Genomics. 2009;10(1):452.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  58. Cai CP, Ye WX, Zhang TZ, Guo WZ. Association analysis of fiber quality traits and exploration of elite alleles in upland cotton cultivars/accessions (Gossypium hirsutum L.). J Integr Plant Biol. 2014;56(1):51–62.

    Article  CAS  PubMed  Google Scholar 

  59. Wegrzyn JL, Eckert AJ, Choi M, Lee JM, Stanton BJ, Sykes R, Davis MF, Tsai CJ, Neale DB. Association genetics of traits controlling lignin and cellulose biosynthesis in black cottonwood (Populus trichocarpa, Salicaceae) secondary xylem. New Phytol. 2010;188(2):515–32.

    Article  CAS  PubMed  Google Scholar 

  60. Ching ADA, Caldwell KS, Jung M, Dolan M, Smith OSH, Tingey S, Morgante M, Rafalski AJ. SNP frequency, haplotype structure and linkage disequilibrium in elite maize inbred lines. BMC Genet. 2002;3(1):19.

    Article  PubMed  PubMed Central  Google Scholar 

  61. Porth I, Klapšte J, Skyba O, Hannemann J, McKown AD, Guy RD, DiFazio SP, Muchero W, Ranjan P, Tuskan GA, et al. Genome-wide association mapping for wood characteristics in Populus identifies an array of candidate single nucleotide polymorphisms. New Phytol. 2013;200(3):710–26.

    Article  CAS  PubMed  Google Scholar 

  62. Long AD, Langley CH. The power of association studies to detect the contribution of candidate genetic loci to variation in complex traits. Genome Res. 1999;9(8):720–31.

    CAS  PubMed  PubMed Central  Google Scholar 

  63. Abdurakhmonov IY, Abdukarimov A. Application of association mapping to understanding the genetic diversity of plant germplasm resources. Int J Plant Genomics. 2008;2008:1–18.

    Article  CAS  Google Scholar 

  64. Sun XY, Du ZM, Ren J, Amombo E, Hu T, Fu JM. Association of SSR markers with functional traits from heat stress in diverse tall fescue accessions. BMC Plant Biol. 2015;15(1):116.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  65. Jofuku KD, Omidyar PK, Gee Z, Okamuro JK. Control of seed mass and seed yield by the floral homeotic gene APETALA2. P Natl Acad Sci. 2005;102(8):3117–22.

    Article  CAS  Google Scholar 

  66. Oh SJ, Kim YS, Kwon CW, Park HK, Jeong JS, Kim JK. Overexpression of the transcription factor AP37 in rice improves grain yield under drought conditions. Plant Physiol. 2009;150(3):1368–79.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  67. Xu ZS, Chen M, Li LC, Ma YZ. Functions and application of the AP2/ERF transcription factor family in crop improvement F. J Integr Plant Biol. 2011;53(7):570–85.

    Article  CAS  PubMed  Google Scholar 

  68. Li B, Li QR, Mao XG, Li A, Wang JY, Chang XP, Hao CY, Zhang XY, Jing RL. Two novel AP2/EREBP transcription factor genes TaPARG have pleiotropic functions on plant architecture and yield-related traits in common wheat. Front Plant Sci. 2016;7:1191.

    PubMed  PubMed Central  Google Scholar 

  69. Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010;15(10):573–81.

    Article  CAS  PubMed  Google Scholar 

  70. Du H, Feng BR, Yang SS, Huang YB, Tang YX. The R2R3-MYB transcription factor gene family in maize. PLoS One. 2012;7(6):e37463.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Wang N, Xu HF, Jiang SH, Zhang ZY, Lu NL, Qiu HR, Qu CZ, Wang YC, Wu SJ, Chen XS. MYB12 and MYB22 play essential roles in proanthocyanidin and flavonol synthesis in red-fleshed apple (Malus sieversii f. niedzwetzkyana). Plant J. 2017;90(2):276–92.

    Article  CAS  PubMed  Google Scholar 

  72. Wu PP, Peng MS, Li ZG, Yuan N, Hu Q, Foster CF, Saski C, Wu GH, Sun DF, Luo H. DRMY1, a Myb-like protein regulates cell expansion and seed production in Arabidopsis thaliana. Plant Cell Physiol. 2018;60(2):285–302.

    Article  CAS  Google Scholar 

  73. Cho JS, Jeon HW, Kim MH, Vo TK, Kim J, Park EJ, Choi YL, Lee H, Han KH, Ko JH. Wood forming tissue-specific bicistronic expression of PdGA20ox1 and PtrMYB221 improves both the quality and quantity of woody biomass production in a hybrid poplar. Plant Biotechnol J. 2018;17(6):1048–57.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  74. Sun WJ, Ma ZT, Chen H, Liu MY. MYB Gene Family in Potato (Solanum tuberosum L.): Genome-Wide Identification of Hormone-Responsive Reveals Their Potential Functions in Growth and Development. Int J Mol Sci. 2019;20(19):4847.

    Article  CAS  PubMed Central  Google Scholar 

  75. Wang JC, Wu FQ, Zhu SS, Xu Y, Cheng ZJ, Wang JL, Li CN, Sheng P, Zhang H, Cai MH, et al. Overexpression of Os MYB 1R1–VP 64 fusion protein increases grain yield in rice by delaying flowering time. FEBS Lett. 2016;590(19):3385–96.

    Article  CAS  PubMed  Google Scholar 

  76. Zhang YX, Yu CS, Lin JZ, Liu J, Liu B, Wang J, Huang AB, Li HY, Zhao T. OsMPH1 regulates plant height and improves grain yield in rice. PLoS One. 2017;12(7):e0180825.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  77. Ren DY, Cui YJ, Hu HT, Xu QK, Rao YC, Yu XQ, Zhang Y, Wang YX, Peng YL, Zeng DL, et al. AH2 encodes a MYB domain protein that determines hull fate and affects grain yield and quality in rice. Plant J. 2019. https://doi.org/10.1111/tpj.14481.

  78. Liu H, Sun M, Du DL, Pan HT, Cheng TR, Wang J, Zhang QX. Whole-transcriptome analysis of differentially expressed genes in the vegetative buds, floral buds and buds of Chrysanthemum morifolium. PLoS One. 2015;10(5):e0128009.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  79. Liu N, Cheng FY, Zhong Y, Guo X. Comparative transcriptome and coexpression network analysis of carpel quantitative variation in Paeonia rockii. BMC Genomics. 2019;20(1):1–18.

    Article  Google Scholar 

  80. Van Oosterhout C, Hutchinson WF, Wills DPM, Shipley P. MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Mol Ecol Notes. 2004;4(3):535–8.

    Article  CAS  Google Scholar 

  81. Peakall ROD, Smouse PE. GENALEX 6: genetic analysis in excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006;6(1):288–95.

    Article  Google Scholar 

  82. Yeh FC, Yang RC, Boyle TB, Ye ZH, Mao JX. POPGENE Version 1.32: The User-Friendly Shareware for Population Genetic Analysis. Molecular Biology and Biotechnology Centre. Canada: University of Alberta; 1999.

    Google Scholar 

  83. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59.

    CAS  PubMed  PubMed Central  Google Scholar 

  84. Earl DA, von Holdt BM. STRUCTURE HARVESTER: a website and program for visualizing STRUCTURE output and implementing the Evanno method. Conserv Genet Resour. 2012;4(2):359–61.

    Article  Google Scholar 

  85. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14(8):2611–20.

    Article  CAS  PubMed  Google Scholar 

  86. Jakobsson M, Rosenberg NA. CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure. Bioinform. 2007;23:1801–6.

    Article  CAS  Google Scholar 

  87. Rosenberg NA. DISTRUCT: a program for the graphical display of population structure. Mol Ecol Notes. 2004;4:137–8.

    Article  Google Scholar 

  88. Yu JM, Pressoir G, Briggs WH, Bi IV, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, Kresovich S, Buckler ES. A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006;38(2):203.

    Article  CAS  PubMed  Google Scholar 

  89. Hardy OJ, Vekemans X. SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes. 2002;2(4):618–20.

    Article  CAS  Google Scholar 

  90. Storey JD, Tibshirani R. Statistical significance for genomewide experiments. Proc Natl Acad Sci U S A. 2003;100:9440–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  91. Eckert AJ, Bower AD, Wegrzyn JL, Pande B, Jermstad KD, Krutovsky KV, Clair JBS, Neale DB. Association genetics of coastal Douglas fir (Pseudotsuga menziesii var. menziesii, Pinaceae). I. Cold-hardiness related traits. Genetics. 2009;182(4):1289–302.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgments

We would like to thank Xinyun Cheng (Beijing Guose Peony Technologies Co., Ltd) for his efforts in collecting and maintaining living plant materials of P. rockii for this study. We also thank Chaoying He (Institute of Botany, The Chinese Academy of Sciences) for his suggestions on the study.

Funding

This research was supported by the National Natural Science Foundation of China (31972446), and Special Project to Build World-class Disciplines of Beijing Forestry University (2019XKIS0324). The funding bodies had no roles in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations

Authors

Contributions

CF was involved in the identification and collection of the plant materials of P. rockii used in this study. CF and LN designed the research. LN collected samples and data, performed the experiments, carried out computational analysis and wrote the manuscript. All authors revised and approved the final manuscript.

Corresponding author

Correspondence to Fangyun Cheng.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1 Table S1.

The descriptive statistics, correlation, principal component and systematic cluster analysis of 24 quantitative traits in the association population of P. rockii.

Additional file 2 Figure S1.

The clustering pedigree diagrams of 420 accessions and 24 quantitative traits in the association population of P. rockii.

Additional file 3 Table S2.

Summary of significant EST-SSR marker-trait pairs from the association test results of flower, branch and leaf, and fruit traits in the association population of P. rockii.

Additional file 4 Table S3.

The 24 investigation traits and measurement standard in the association population of P. rockii.

Additional file 5 Table S4.

Information of 58 polymorphic EST-SSRs for association mapping.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, N., Cheng, F. Association mapping for yield traits in Paeonia rockii based on SSR markers within transcription factors of comparative transcriptome. BMC Plant Biol 20, 245 (2020). https://doi.org/10.1186/s12870-020-02449-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-020-02449-6

Keywords