Skip to main content

High-yield hybrid breeding of Camellia oleifolia based on ISSR molecular markers

Abstract

Background

C. Oleifera is among the world’s largest four woody plants known for their edible oil production, yet the contribution rate of improved varieties is less than 20%. The species traditional breeding is lengthy cycle (20–30 years), occupation of land resources, high labor cost, and low accuracy and efficiency, which can be enhanced by molecular marker-assisted selection. However, the lack of high-quality molecular markers hinders the species genetic analysis and molecular breeding.

Results

Through quantitative traits characterization, genetic diversity assessment, and association studies, we generated a selection population with wide genetic diversity, and identified five excellent high-yield parental combinations associated with four reliable high-yield ISSR markers. Early selection criteria were determined based on kernel fresh weight and cultivated 1-year seedling height, aided by the identification of these 4 ISSR markers. Specific assignment of selected individuals as paternal and maternal parents was made to capitalize on their unique attributes.

Conclusions

Our results indicated that molecular markers-assisted breeding can effectively shorten, enhance selection accuracy and efficiency and facilitate the development of a new breeding system for C. oleifera.

Peer Review reports

Introduction

Woody plants traditional breeding is a long-term endeavor characterized by lengthy cycles and low efficiency [1]. The genetic control of complex quantitative traits (e.g., growth and fruit yield) along with their interaction with environmental factors, further complicates traditional breeding, resulting is slower incremental gain over generations [2, 3]. The development of molecular marker-assisted selection (MAS) is expected to enhance breeding effectiveness through shortening breeding cycles [3, 4]. Through linkage disequilibrium, association analysis directly identifies specific marker sites closely associated with phenotypic variation, thereby linking the diversity of target traits with genes’ polymorphisms [5]. Association analysis, characterized by its efficiency, wide application and high precision, is among the main methods to analyzing genotypes of plant quantitative traits, and is widely used in important tree species such as Prunus persica, Populus, Pterocarya stenoptera, Vitis berlandieri [5,6,7,8].

Camellia oleifolia is one of the four major woody edible oil species in the world [9]. By 2021, in China alone, camellia planting area has reached 4.592 × 106 hm2, yielding 3.94 million tons seed production annually, resulting in an annual oil output of 889,000 tons with industry output exceeding 27 billion US dollars [10]. Despite these figures, China still relies heavily on importing edible oil to meet local demand. Camellia oil plays an important role in augmenting the edible oil supply and aiding rural revitalization efforts [10, 11]. C. oleifolia conventional breeding has made great progress, but the contribution rate of improved varieties is less than 20% [12]. Thus, there is a pressing need to accelerate C. oleifolia traditional breeding for high yield. Most yield phenotypic traits can only be assessed at the flowering and bearing stage of hybrid plants, which can lead to long breeding cycle, occupation of land resources, high labor cost, and low accuracy and efficiency selection, which can be solved by MAS [3, 13, 14]. Molecular markers are neutral and exist in different tissues and development stages, irrespective of environmental conditions [15]. So, the development molecular markers applications are particularly important in MAS breeding of C. oleifolia.

ISSR (Inter Simper Sequence Repeat) is based on SSR, involving the addition of nucleotides at the sequence end, amplification of the repeated DNA sequence, and acquisition of a dominant amplified band. Polymorphism is determined based on the specificity of the amplified fragment size [15, 16]. Compared with other molecular markers, ISSR have the characteristics of generating large amount of information, high polymorphism, efficient and stable results, easy operation, fast and lower cost [17, 18]. ISSR have been widely used in assessing plant genetic diversity [19, 20], genetic relationship [16], variety identification [21] and association analysis [18, 22]. C. oleifolia is a highly heterozygous polyploid plant with complex genetic basis, wide distribution, and large regional differences [23]. The lack of high-quality molecular markers hinders the genetic analysis and molecular breeding of C. oleifolia important traits, and brings great difficulties and challenges to the species MAS breeding [14, 24, 25]. Fruit yield is one of the most important quantitative traits of C. oleifolia as it involves interaction of multiple genes [23, 26]. There are no reports on the development of molecular markers closely linked with yield traits for C. oleifolia [14, 27,28,29,30].

Here, we constructed 570 C. oleifolia F2 generation population of 19 hybrid combinations each with 30 plants originated from 5 parental individuals as selection materials (see Fig. 1, for complete pedigree development) over 50 years. We analyzed the variation of 28 traits at the seedling and full fruit stages, assessed the developed ISSR markers genetic diversity of the 19 hybrid combinations, and identified high-yield combinations. Through association analysis, combining with correlation analysis of yield per plant and other traits, excellent high-yield and tightly interlocked ISSR molecular markers were selected. Through the identification of excellent high-yield markers in the F2 generation early selection method was determined, and a molecular-assisted breeding system was designed to achieve the improvement needed of C. oleifolia improved breeding method.

Fig. 1
figure 1

The schematic of C. oleifera crossbreeding. F: the failure of pollen hybridization

Results

Variation and correlation of phenotypic traits in the 19 F2 families at the seedling stage

The studied eight phenotypic traits (HS, DS, HS/DS, Tr, Gs, Pn, Ci, and WUE) of the 19 F2 families produced CV range of 10.63  37.95%, notably all greater than 10% (Table S1). The CV values of seedling morphological indices (HS, DS, and HS/DS) ranged between 10.63 and 17.85%, with HS/DS having the largest value. Additionally, the CV values of photosynthetic physiological indices ranged between 26.99 and 37.95%, with WUE showing the largest value. These high CV values indicate that the F2 generation harbored abundant seedling phenotypic traits variation. The skewness and kurtosis of seedling traits’ absolute values were less than 3, indicating that these traits produced continuous normal distribution.

The correlation between morphological (HS, DS, and HS/DS) and photosynthetic (Tr, Gs, Pn, Ci, and WUE) traits of the 19 F2 families indicated the presence of significant positive correlation between HS and Pn and Ci, as well as significant negative correlation with WUE (Table 1). Additionally, DS correlated positively with Tr and Gs. Similarly, HS/DS correlated positively with Pn and Ci (Table 1). These results indicated that there was an intrinsic relationship between these two sets of traits.

Table 1 Correlation analysis between F2 seedling morphological and photosynthetic traits

Variation and correlation of phenotypic traits in the 19 F2 families at the full fruit stage

The studied 20 traits produced CV values ranging from 9.65 to 46.23%, among which plant morphological traits, including HT, DT and CA, showed CV values ranging between 10.66 and 18.01%, with CA having the largest value (Table S2). Yield traits produced CV values ranging from 9.65 to 46.23%, with FSI showing the smallest value (9.65%), while the remaining traits (NSF, FPW, FSW, DWF, DWP, DWS, DWK, and YPP) had CV values greater than 30% (Table S2). These high CV values, once again, indicate that the 19 F2 families captured abundant variation at full fruit stage, specifically for yield traits. Furthermore, the absolute values of skewness and kurtosis of these traits were less than 3, indicating the presence of continuous normal distribution (Table S2). Selecting those families with 30% higher than the average yield, resulted in identifying 5 F2 families of excellent high-yield with C17 > C16 > C14 > C8 > C2 (Fig. 2a).

Fig. 2
figure 2

The 19 F2 families yield per plant (YPP) and its correlation with seedling and full fruit stage traits. (a) Average YPP of the 19 F2 families. (Dotted lines representing yields equal to more than 30% of the average). (b) Correlation between yield traits at full fruit stage. (FD, fruit diameter; FSI, fruit shape index; PT, pericarp thickness; NSF, number of seeds per fruit; FFW, single fresh fruit weight; FPW, fresh pericarp weight; FSW, fresh seed weight; FWK, fresh weight of kernel; DWF, dry weight of fruit; DWP, dry weight of pericarp; DWS, dry weight of seed; DWK, dry weight of kernel; WCF, water content of fruit; KMC, dry weight of seed; FSR, fresh seed rate). (c) Correlation between YPP and growth traits at seedling and full fruit stage. (CA, crown area of the tree; DT, ground diameter of the tree; HT, height of the tree; WUE, water use efficiency; Ci, intercellular CO2 concentration; Pn, photosynthetic rate; Gs, stomatal conductance; Tr, transpiration rate; HS/DS, height/diameter ratio; DS, seedling ground diameter; HS, seedling height). * and ** P < 0.05 and P < 0.01 level, respectively

Correlation analysis of yield traits at the full fruit stage showed that YPP was significant and positively correlated with FD and NSF, and highly significant and positively correlated with FFW, FSW, FWK, DWF, DWP, DWS, and DWK (Fig. 2b). FFW was positively correlated with DWK, DWS, DWP, DWF, FWK, FSW, FPW, PT, and FD, indicating that yield traits interacted with each other and were coordinated to form the yield per plant. Further analysis of the correlation between YPP and growth traits at the seedling and full fruit stages, showed that YPP was highly significant and positively correlated with HS, significant and positively correlated with HS/DS and Ci, and negatively correlated with WUE. However, there was no significant correlation between YPP and growth traits at the full fruit stage (Fig. 2c).

ISSR genetic diversity and genetic cluster analysis

The 12 ISSR primers produced a total of 106 loci, of which 104 were polymorphic. The number of amplified loci per primer ranged between 5 and 14, with average amplification loci of 8.83 and average polymorphic loci of 8.67 (Table 2). PPB ranged from 85.71 to 100%, with an average of 98.21%. The diversity levels varied among the studied primers across the 19 F2 families with average H and I of 0.3246 and 0.4920, respectively, indicating that these primers reflected the genetic polymorphism among the 19 F2 families.

Table 2 Genetic diversity parameters of ISSR markers in the 19 C. camellia F2 families

The genetic similarity among the 19 F2 families ranged from 0.25 to 0.87, with an average of 0.67 (Fig, 3a). Using genetic distance of 0.67 as a threshold, the 19 F2 families were divided into three groups (P1, P2 and P3) with different compositions (P1: C1 - C4; P2: C5, - C8; and P3:C9 - C19) (Fig. 3b).

Fig. 3
figure 3

Genetic similarity (a) and clustering (b) of 19 F2 families

Genetic structure analysis of 19 F2 families

Population genetic structure analysis indicated that when K = 3, ∆K showed the maximum peak value (Fig. 4a), thus, the 19 F2 families can be divided into 3 subgroups similar to that obtained from the clustering analysis above (Fig. 4b).

Fig. 4
figure 4

Genetic structure of 19 F2 C. oleifera families: (a) Variation trend of K and ∆K, (b) Population genetic structure at K = 3

Association between ISSR markers and characters at seedling traits

Association between ISSR markers and the 8 seedling phenotypic traits yield five (DS, HS, WUE, HS/DS, and Tr) produced highly significant associations with 7 loci (Table 3). These seven out of the 106 loci showed association with seedling phenotypic traits (6.60%) (Table 2). The percent variance (R2) of these associations ranged from 27.63 to 59.54% (Table 3). Among the seven significant associated loci, 820-7 was associated with both HS and HS/DS, and their R2 exceeded 30% (Table 3).

Table 3 Association/correlation between ISSR markers and seedling phenotypic traits

Association between ISSR markers and characters at the full fruiting stage

Association between ISSR markers and the 20 phenotypic traits at the full fruit stage indicted that a total of 13 loci were significantly associated with 16 phenotypic traits (FH, FD, FSI, PT, NSF, FFW, FPW, FSW, FWK, DWF, DWP, DWS, DWK, WCF, FSR and YPP) (Table 4). Thirteen out of the 106 loci showed association with phenotypic traits at the full fruit stage (12.26%) (Table 2). The percent variance R2 at the full fruit stage ranged from 12.37 to 58.42%. With the exception of PT (R2 = 12.37%), the R2 values of the remaining traits were greater than 20% (Table 4).

Table 4 Association/correlation between ISSR markers and phenotypic traits at the full fruit stage

Associated loci in the F2 populations

According to the association analyses results (above), 19 ISSR loci showed significant association with both the seedling and full fruit stages traits, and these significant loci were identified in the F2 generation (Table S3), additionally the polymorphism ratio of significantly associated loci per F2 family (PSL) ranged from 10.53 to 47.37% (Table 5). Nine significantly associated loci (835 − 12, 858-2, 858-3, 827-5, 820-7, 815-5, 815-6, 845-9 and 845 − 11) had higher polymorphism ratio of partial female (PRF) than partial male (PRM) (Table S3), indicating that they followed partial maternal inheritance. While polymorphism ratio of partial male (PRM) of two loci (827-3 and 815-6) were higher than PRF, indicating that they followed partial paternal inheritance (Table S3). The PRF of C2, C10, C13, C14 and C15 were higher than PRM, indicating that these 5 F2 families followed partial maternal inheritance. The PRM of C5, C11, C16 and C18 was higher than PRF, indicating that the four F2 families followed partial paternal inheritance (Table 5).

Table 5 Deviation analysis of significantly associated loci in the 19 hybrid F2 generation

There were significantly associated loci were detected in five F2 high-yield families (C2, C8, C14, C16 and C17) (Table 6). Each of these families showed significant association with specific loci (Family C2 with 835-3, 844-9, 858-2, 873-4, and 815; family C8 with 835-3, 858-3, 858 − 10, 845-7 and 845 − 11; family C14 with 858-3, 827-3, 820-7, 815 − 11 and 845 − 11; family C16 had 835 − 12, 827-3 and 815 − 11 and family C17 with 815 − 11 and 815 − 12 (Table 6).

Table 6 Significant ISSR loci associated in the 5 F2 high-yield C. oleifolia families

Furthermore, these five parents showed polymorphism ratio of significantly associated loci per family (PSL) ranging from 5.26 to 31.58%. These parents showed significant association with various loci (Parent 27 − 5 with 835-3, 858-2, 815-6 and 845-9; parent 41 − 4 with835-12 and 845 − 11; parent 46 − 2 with 835-3, 835 − 12, 858 − 10 and 815-6; parent 51 − 2 with 858-3, 827-3, 827-5, 820-7, 815-5 and 845 − 11 and parent 56 − 6 with 858-3) (Table 7).

Table 7 Significant ISSR loci associated in the 5 C. oleifolia parents

Discussion

Rich genetic diversity laid the foundation for association analysis

For natural variation, association analysis can detect genetic variation extensively with high resolution. Plant phenotypic variation is the embodiment of genetic material diversity [31, 32]. There was a large amount of variation between plant growth and yield traits of C. oleifolia F2 generation at both the seedling and full fruit stages, with coefficient of variation ranging between 9.65 and 46.73%, indicating high degree of dispersion, and confirming that the F2 generation had excellent genetic basis for improvement [33, 34]. Among the yield traits, the CV of NSF was the largest (42.23%), while FSI was the smallest (9.65%), observations similar to those previously reported studies [35]. This may be related to the breeding goal of high yield, which promotes strong separation of yield traits in F2 generation by crossing F1 siblings [36]. The CV of plant morphology indices were the smallest, followed by photosynthetic physiological traits at the seedling stage, and the largest were for yield traits at the full fruit stage, indicating that C. oleifolia F2 generation population was suitable for the implemented association analysis of yield trait and the selection of high-yield family [34, 36].

Genetic diversity analysis also relies on molecular markers, as DNA markers are not affected by environmental conditions and plant development stage (i.e., neutral) [15, 37]. The PPB of the ISSR primers in the studied F2 hybrid combinations ranged between 85.71 and 100% with an average of 98.21%, higher values than those reported between germplasm resources (92.56%) and clones (68.60-87.96%) [38, 39]. The average H and I were 0.3246 and 0.492, respectively, indicating that the C. oleifolia F2 generation had rich genetic diversity and broad genetic basis [40]. Additionally, the ISSR molecular marker had good polymorphic expression, that can fully distinguish among the F2 hybrid combinations on the molecular level for association analysis [40, 41]. Therefore, the rich genetic diversity present in the C. oleifolia F2 hybrid combinations, provided powerful information for the genetic population structure analysis, and can greatly benefit the association analysis efficiency.

High yield molecular markers detected by association analysis

Association analysis is based on the presence of linkage disequilibrium between marker loci and causal genes underpinning target traits. It is necessary to understand population’s linkage disequilibrium before embarking on association analysis [42, 43]. Our genetic structure was completely consistent with the results of cluster analysis in C. oleifolia F2 generation population, indicating that the polymorphic loci detected by ISSR were in linkage disequilibrium with functional loci [44], and would not lead to the generation of pseudo-association [45]. The genetic variation of C. oleifolia F2 hybrid combinations have enhanced the degree of polymorphism and in turn increased the resolution in linkage disequilibrium detection [46, 47]. The number of the test population and marker is also one of the factors affecting linkage disequilibrium [48]. Significant associated locus with test value of 0.0001 could be detected, indicating that the F2 population composed of 597 individuals and 12 ISSR markers has enough sensitivity to detect the observed associations in the studied F2 population [49].

It is generally believed that continuous or intermittent variables of biological phenomena in the natural state, conform to the normal distribution [50]. The skewness and kurtosis of all the studied phenotypic traits in the F2 population were less than 3, indicating that they followed continuous normal distributions and quantitative genetic characteristics, and can be used for association analysis [51, 52]. However, the kurtosis of FH, FFW and YPP were higher than 2, which may be due to the fact that they are important economic traits related to the high-yield breeding process of C. oleifolia [53], and were subjected to selection pressure resulting in the observed skewed distribution [54]. The 19 ISSR loci were significantly associated with the 21 phenotypic traits, and the number of associated loci accounted for 17.92% of the total studied loci. This is slightly higher than that obtained from RNA-Seq technology in a GWAS study that accounted for 16.56% of the total SNPs used [27, 28], indicating that ISSR, as a dominant gene marker, can efficiently detect associated loci [55]. Therefore, it was feasible and efficient to conduct association analysis of C. oleifolia F2 population based on ISSR molecular markers.

Obtaining excellent molecular markers tightly linked to target traits is a prerequisite for developing marker-assisted breeding [56]. Previous studies believed that the accuracy of genome-wide selection can only be ensured if the percent contribution (R2) is greater than 20% [57, 58]. The R2 of C. oleifolia phenotypic traits were greater than 20%, except for PT, indicating the association analyses of C. oleifolia F2 generation were accurate. Among the loci with multiple effects, 820-7 at seedling stage and 835 − 12 at full fruit stage, had R2 greater than 50% and the largest number of significant associated traits. The 858-3 locus was highly significant association with seed fresh weight (FSW), dry weight of seed (DWS) and fresh seed rate (FSR), all with R2 greater than 20%, which determine the seed yield of C. oleifolia. These three loci were present in excellent high-yield families (858-3 in C14 and C8; 835 − 12 in C16; 820-7 in C14). Additionally, 815 − 11 locus was present in the top three excellent high-yield families (C14, C16 and C17). Thus, 815 − 11, 820-7, 858-3 and 835 − 12 loci represent excellent high-yield associated ISSR markers.

Molecular marker assisted C. Oleifolia breeding

It is generally believed that the selected molecular markers associated with the target traits can be used for their detection at any stage of plant growth and development [56, 59]. The correlation between traits at different developmental stages is helpful to the selection and utilization in practical breeding, improve breeding efficiency, and providing a reference for early selection in forest trees [4, 60]. C. oleifolia yield traits interact and coordinate with each other, to ultimately affecting the yield per plant (YPP) [10, 61]. Here, we observed 10 yield traits that were positively and significantly correlated, among which the most significant correlation was fresh weight of kernel (FWK). Oil yield of C. oleifolia is determined by YPP and oil content of fresh fruit [62], and previous studies suggested that FWK are positively and significantly correlated with oil content of fresh fruit [61]. Therefore, FWK was an important trait for early identification of high-yield C. oleifolia. In breeding practice, the heavier the fresh kernel of C. oleifolia, the higher the yield and the greater the rate of improvement. The tightness between growth traits and ISSR markers at the seedling stage was higher than that of yield traits at full fruit stage. It may be that yield phenotypic traits of adult tree were more susceptible to environmental influences than growth phenotypic traits of seedling [63]. So, the yield associated loci obtained at seedling stage were more accurate than those at full fruit stage. The YPP was positively and significantly correlated with height (HS), indicating that the taller the seedling, the more likely the cultivated C. oleifolia plants will produce high yield, similar observations were reported in other studies [25]. In summary, early selection of high-yield superior plant of C. oleifolia can be achieved, by measuring fresh weight of kernel and seedling height, which can be captured with the presence of these associated marker loci (815 − 11, 820-7, 835 − 12 and 858-3).

C. oleifolia traditional breeding often takes ten years or even decades to breed a good variety [24, 61]. The primary problem of C. oleifolia low breeding efficiency, can be enhanced through selecting suitable parents and capitalize on their specific combining ability [53, 61]. In the studied F2 generation, there were excellent high-yield combinations (e.g., C2, C8, C14, C16 and C17), and all produced more than twice the national high yield breeding standard [64]. Among these selections, C2 and C8 had the most distant relation, and C2 expressed partial maternal inheritance, thus it is selected as mother and C8 as father which is expected to produce higher yield C. oleifolia variety [65]. Additionally, while C16 and C17 had close distant relation with the lowest polymorphism rate, if they are selected with C16 as a male parent (as it possesses partial paternal inheritance and the 835 − 12 locus) and C17 as female parent (as it possesses the 815 − 11 locus), this combination can accelerate the introduction of high-yield through the association of high-yield causal genes with 815 − 11 and 835 − 12 loci, and ultimately reducing the breeding workload [66]. Furthermore, the 815 − 11 locus is only present in the top three excellent high-yield F2 families (C14 and C16 with 51 − 2 as a male parent, and C17 with 51 − 2 as a female parent), not in any of the parents of excellent high-yield families, which may be a recessive gene, and produced by hybridization and recombinant with 51 − 2 as parent [67]. Additionally, 51 − 2 had the highest polymorphism ratio of significantly associated loci (PSL) in the F1 generation and the excellent high-yield associated locus of 858-3 and 820-7. The C14 had the highest PSL, and partial maternal inheritance. The backcross (C14 as a female parent and 51 − 2 as male parent) can improve the efficiency of introducing high-yield associated loci of 815 − 11, 858-3 and 820-7, especially 815 − 11. So, to increase the frequency of high-yield gene, specific parental combinations were identified based on their partial maternal or maternal inheritance which can improve the breeding selection accuracy of C. oleifolia [68, 69].

Conclusions

High coefficient of variation, extent of genetic variation, of 28 traits seedling and full fruit stages of C. oleifolia 19 F2 generation hybrids was observed. Five hybrid combinations (C2, C8, C14, C16 and C17) showed great propensity for excellent high-yield. A total of 106 alleles were obtained from 12 ISSR primers, and average polymorphic ratio of 98.21%. Population structure and cluster analysis showed that F2 generation population could be divided into 3 subgroups. Through association analysis, a total of 19 loci were associated with 21 traits at seedling and full fruit stages with R2 ranging from 12.37 to 59.54%. The number of associated loci accounted for 17.92%, and four loci (815 − 11, 835 − 12, 820-7 and 858-3) were selected as excellent high-yield ISSR markers. Fresh weight of kernel and seedling height can be used as high-yield selection indexes. So, early selection of high-yield hybrid superior variety can be realized by screening for greater fresh kernel weight and taller 1-year old seedlings, and identifying these 4 excellent high-yield ISSR markers. Specific parental combinations were identified based on their partial maternal or maternal inheritance and ISSR markers associated with high-yield. The use of molecular assisted breeding can improve the high-yield selection accuracy and efficiency of C. oleifolia.

Materials and methods

Materials

The research material for this study originated from: (1) in 1973, 7 C. oleifolia phenotypically superior individuals (parents: Min 43, Min 49, Min 50, Min 53, Min 54, Min 56, and Min 62) were selected throughout the Fujian Province, and were used to establish a clonal experimental forest at the Tongkou forest farm in Fuzhou City, Fujian province, China; (2) in 1985, these 7 parents produced 5 F1 generation full-sib families through hybridization and were planted at the Tongkou forest farm; (3) in 2007, after testing these F1 families over 5 consecutive years, 5 phenotypically superior individuals (27 − 5, 41 − 4, 46 − 2, 51 − 2, and 56 − 2), were selected to produce complete 5 × 5 diallel mating design that resulted in 19 F2 families (C1, C2, C3, …, C19). Among them, 56 − 2 as the mother and 41 − 4 as the father had incompatibility; (4) in 2008, seed from these 19 F2 diallel crosses (C1, C2, C3, …, C19) were used to produce seedlings for further testing; (5) in 2009, these seedlings were planted in a randomized complete block design with three 19 × 10 row plots in Tongkou forest farm with 30 plants per diallel crosse, and a total of 570 plants; and (6) in 2020, the F2 families’ offspring were in their full fruit stage, then fruits were harvested, and tree yield and growth and fruit traits were measured over 3 consecutive years. See Table 8; Fig. 1 for the genetic material identification and their parental origins.

Table 8 The original 7 parents 5 F2 parents, and their 19 diallel crosses in F2 generation

Crossing method

In late November 2006, flowers at the full-bloom stage were selected at the top of branches. Pollen was collected form male parent plants. For female parent plants, flowers were emasculated with tweezers, then isolated with paper bags (15 × 15 cm). In 2–3 days after emasculation, a special mucus was secreted from the stigma, which can be pollenated. 8 days after pollination, followed by petal colour fading and drying of stigma, demarking the time for pollination bags removal. In early November 2007, the pollinated fruit are ripped and harvested. Each combination (C1, C2, C3, …, C19) of crosses yielded 30 fruits.

Seedlings’ phenotyping

After collecting the fruits of the original 7 parents, 5 F2 parents, and their 19 diallel crosses F2 generation, they were naturally cracked, seeds were collected, and stored in sand over winter. In March 2008, seeds were cultivated for seedlings production.

In March 2010, 30 seedlings were randomly selected for each diallel crosse (570 seedlings from the 19 F2 diallel crosses) were measured for height (HS), ground diameter (DS), and height/diameter ratio (HS/DS). From each F2 cross, 5 seedlings were selected, and the third leaf of the shoot was selected from the four cardinal directions to determine their photosynthetic index. Weather permitting, transpiration rate (Tr), stomatal conductance (Gs), photosynthetic rate (Pn), intercellular CO2 concentration (Ci), and water use efficiency (WUE) were measured using the GFS-3000 portable photosynthesiser (WALZ, Germany).

Full fruit stage phenotyping

In early November 2020, trees at the full fruit stage were selected, and the yield per plant (YPP), height of the tree (HT), ground diameter of the tree (DT), and crown area of the tree (CA) were measured after fruit harvesting. Fifteen fruits were randomly selected from each tree, and fruit height (FH) and diameter (FD) were measured with vernier calipers (accurate to 0.01 mm), and fruit shape index (FSI) was calculated. Then, after measuring single fresh fruit weight (FFW) of 15 fruits with electronic balance (accurate to 0.01 g), pericarp was immediately removed and number of seeds per fruit (NSF) was counted, fresh seed weight (FSW) and fresh pericarp weight (FPW) were determined, and fresh seed rate (FSR) was calculated. Then the seed coat was removed and fresh weight of kernel (FWK) was measured. Pericarp thickness (PT) was measured with vernier caliper. Then the pericarp, seed coat, and seed kernel were placed in a drying oven at 105˚C for green treatment, and then baked to constant weight at 60˚C. The dry weight of pericarp (DWP), seed coat and kernel (DWK) were determined, and dry weight of fruit (DWF), dry weight of seed (DWS), water content of fruit (WCF), and kernel moisture content (KMC) were calculated as follows:

$$FSI=\frac{FH}{FD}$$
(1)
$$FSR=\frac{FSW}{FFW}$$
(2)
$${DWS}={Dry}\,\,{weight}\, {of}\, {seed}\, {coat} +DWK$$
(3)
$$DWF=DWS+DWP$$
(4)
$$WCF=\frac{DWF}{FFW}$$
(5)
$$KMC=\frac{DWK}{FWK}$$
(6)

Genomic DNA extraction and ISSR primer screening

In March 2010, 1–2 young leaves free of pests and diseases were collected from each seedling. After mixing the F2 diallel crosses, F2 parents and original parents, the leaves were stored at -70℃ until further use. DNA was extracted by the CTAB method and stored at -20˚C. DNA concentration and quality of the 19 diallel crosses and the 5 parents were detected. Absorbed 1ul of completely dissolved DNA stock solution and 1ul of 6×Loading buffer, mixed the two solutions evenly. Then they were sampled in1.5% TAE agar gel with 15,000 bp DNA maker as the control. The gel electrophoresis was conducted in 1×TAE buffer (voltage 150 V, current 120 mA) for 30 min, and placed in ethidium bromide (EB) for 10 min. DNA with clear bands and no obvious degradation, and diluting to 50 times (30 ng/µL), was best for ISSR-PCR amplification. A total of 12 ISSR primers with high polymorphism, good stability and clarity were selected (Table S4).

ISSR-PCR reaction system and DNA amplification

The ISSR reaction system was established as 20 µl (Table S5) using the Master-Cycler 05 gradient PCR instrument (Eppendorf, Germany). PCR hot cap temperature to 105˚C, pre-denaturation was 5 min at 94˚C, denaturation was 45s at 94˚C, annealing temperature was 54˚C, and elongation was 1.5 min at 72 C with 39 cycles, extended at 72˚C for 7 min, and stored at 4˚C. Added 1/6 volume of 6×Loading Buffer into the ISSR-PCR amplification product. Loaded the sample on 1.5% agarose gel, use Maker as the control, with 1×TAE electrophoresis buffer solution at 150 V constant voltage and 120 mA current for 1 h. When the Loading Buffer indicator reached the bottom of the gel, electrophoresis was stopped, and the gel was stained in EB (final concentration is 0.5ul/ml) for 10–15 min and imaged under ultraviolet light of the gel imaging system. The 31 genetic groups (the original 7 parents, 5 F2 parents, and their 19 diallel crosses F2 generation) were amplified by screened ISSR primers (Fig.S1).

Genetic parameter estimation and cluster analysis

Max and min values, range, mean, standard deviation (SD), skewness, kurtosis, coefficient of variation (CV), and correlations of C. oleifolia 28 phenotypic characters at the seedling and full fruit stages were analyzed using SPSS26.0 software [70]. When CV < 10%, characters showed low variation, 10 < CV < 20% moderate variation, 20 < CV < 30% high variation, and CV > 30% strong variation [33]. The amplified products were interpreted by visual inspection, and an ISSR molecular label 0/1 matrix was established for each band with or without a value of 1 and 0, respectively. Number of amplification loci (NAL), number of polymorphic loci (NPL), percent of polymorphic band (PPB), and genetics similarity (Gs) were calculated.

$$PPB=\frac{NPL}{NAL}\times 100\%$$
(7)
$${Gs}\,=\,{N}_{ij}/({N}_{i}\,+\,{N}_{j})$$
(8)

where, Nij representing the same number of bands between two F2 families, and Ni and Nj representing the number of bands in each of the two F2 families [71].

POPGENE 1.32 software was used to calculate Nei’s genetic diversity index (H) and Shannon’s Information index (I) [72]. UPGMA method of NTsys2.10e software was used to calculate genetic distance for cluster analysis, and SAHN in clustering program was used to obtain cluster graph [73].

$$H=1-\sum {P}_{i}^{2}$$
(9)

where, Pi is the frequency of an amplified product, which was used to reflect the abundance and uniformity of alleles among F2 families.

$$I=\sum {P}_{i}\,\text{ln}\,{P}_{i}$$
(10)

where, Pi is the frequency of occurrence of a certain genotype in a population, which mainly reflects the richness of phenotypic diversity [40].

Population genetic structure analysis

The software Structure 2.3.4 was used to analyze the population genetic structure of the tested materials, and the predicted population number K was set to 2  6 [74, 75]. Length of burn-in period of Markov Chain Monte Carlo (MCMC) was set to 100,000. Each K value was repeated 10 times to calculate the corresponding Q value for each combination. The optimal number of subgroups is determined by calculating the optimal K value. ΔK was calculated according to lnP(D), and the K value at the maximum. ΔK value was taken as the best subgroup number, and the probability (Q value) of genotype belonging to subgroup K of all the tested materials was calculated synchronically. The online program Structure harvester (http://taylor0.biology.ucla.edu/struct/) was used [75].

Association analysis of phenotypic traits and markers

Association analysis and genetic similarity results were analyzed using SPSS26.0 software [70]. The general linear model (GLM) of TASSEL-5.0 software was used to correlate phenotypic data in different periods with the Q value of each F2 family as the covariate, and the loci associated with marker variation and phenotypic traits were detected under P < 0.01. The rate of explanation on the phenotype variance of related markers (i.e., % variance) (R2) were calculated following Ping et al. [76].

Deviation analysis of significantly association markers

In the 19 F2 families and their 5 parents, ISSR bands with very significant association with phenotypic traits were marked as “√”, and those without any significant association or missing or fuzzy stripe types were marked as “-”. In the 19 F2 families, if the family has the same bands as the mother were labeled “♀” and the same bands as the father were labeled “♂”. The polymorphism ratio of significantly associated loci per F2 family (PSL), polymorphism ratio of partial female (PRF), and polymorphism ratio of partial male (PRM) were calculated as follows:

$$PSL\%=\frac{NA}{NAT}\times 100$$
(11)

where, NA is the number of significantly associated locus within the F2 family and NAT showing the number of significantly associated locus in the F2 population.

$$PRF=\frac{SLM}{NSL}$$
(12)
$$PRM=\frac{SLF}{NSL}$$
(13)

where, SLM is the number of significantly associated loci within the F2 family that were the same as the mother, NSL is the number of significantly associated loci within the F2 family, SLF is the number of significantly associated loci in the F2 family that are the same as the father [77, 78].

Data availability

The original contributions presented in the study are included in this article. Further inquiries can be directed to the corresponding author.

Abbreviations

HS :

Seedling height

DS :

Seedling ground diameter

HS/DS :

Height/diameter ratio

Tr:

Transpiration rate

Gs:

Stomatal conductance

Pn:

Photosynthetic rate

Ci:

Intercellular CO2 concentration

WUE:

Water use efficiency

SD:

Standard deviation

CV:

Coefficient of variation

YPP:

Yield per plant

FD:

Fruit diameter

FH:

Fruit height

FSI:

Fruit shape index

PT:

Pericarp thickness

NSF:

Number of seeds per fruit

FFW:

Single fresh fruit weight

FPW:

Fresh pericarp weight

FSW:

Fresh seed weight

FWK:

Fresh weight of kernel

DWF:

Dry weight of fruit

DWP:

Dry weight of pericarp

DWS:

Dry weight of seed

DWK:

Dry weight of kernel

WCF:

Water content of fruit

KMC:

Kernel moisture content

FSR:

Fresh seed rate

CA:

Crown area of the tree

DT :

Ground diameter of the tree

HT :

Height of the tree

NAL:

Number of amplified loci

NPL:

Number of polymorphic loci

PPB:

Percentage of polymorphic bands

H:

Nei’s genetic diversity index

I:

Shannon’s Information index

SLM:

Number of significant association locus in the hybrid combination that is the same as the mother

SLF:

Number of significant association locus in the hybrid combination that is the same as the father

NSL:

Number of significantly associated loci in the F2 families

PSL:

Polymorphism ratio of significantly associated locus per F2 family

PRF:

Polymorphism ratio of partial female

PRM:

Polymorphism ratio of partial male

SAL:

Significantly associated loci

PSL:

Polymorphism ratio of significant association locus per F2 family

References

  1. Giudice G, Moffa L, Varotto S, Cardone MF, Bergamini C, De Lorenzis G, et al. Novel and emerging biotechnological crop protection approaches. Plant Biotechnol J. 2021;19(8):1495–510. https://doi.org/10.1111/pbi.13605.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Xu Y, Li P, Zou C, Lu Y, Xie C, Zhang X, et al. Enhancing genetic gain in the era of molecular breeding. J Exp Bot. 2017;68(11):2641–66. https://doi.org/10.1093/jxb/erx135.

    Article  CAS  PubMed  Google Scholar 

  3. Cooper M, Messina CD. Breeding crops for drought-affected environments and improved climate resilience. Plant Cell. 2023;35(1):162–86. https://doi.org/10.1093/plcell/koac321.

    Article  PubMed  Google Scholar 

  4. Zhang MY, Xue C, Hu H, Li J, Xue Y, Wang R, et al. Genome-wide association studies provide insights into the genetic determination of fruit traits of pear. Nat Commun. 2021;12(1):1144. https://doi.org/10.1038/s41467-021-21378-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Cao K, Wang L, Zhu G, Fang W, Chen C, Luo J. Genetic diversity, linkage disequilibrium, and association mapping analyses of peach (Prunus persica) landraces in China. Tree Genet Genomes. 2012;8:975–90. https://doi.org/10.1007/s11295-012-0477-8.

    Article  Google Scholar 

  6. Du Q, Gong C, Wang Q, Zhou D, Yang H, Pan W, et al. Genetic architecture of growth traits in Populus revealed by integrated quantitative trait locus (QTL) analysis and association studies. New Phytol. 2016;209(3):1067–82. https://doi.org/10.1111/nph.13695.

    Article  CAS  PubMed  Google Scholar 

  7. Li LF, Cushman SA, He YX, Ma XF, Ge XJ, Li JX, et al. Landscape genomics reveals genetic evidence of local adaptation in a widespread tree, the Chinese wingnut (Pterocarya stenoptera). J Syst Evol. 2022;60(2):386–97. https://doi.org/10.1111/jse.12699.

    Article  Google Scholar 

  8. Blois L, de Miguel M, Bert PF, Ollat N, Rubio B, Voss-Fels KP, et al. Dissecting the genetic architecture of root-related traits in a grafted wild Vitis berlandieri population for grapevine rootstock breeding. Theor Appl Genet. 2023;136(11):223. https://doi.org/10.1007/s00122-023-04472-1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Su W, Zhou Z, Zeng J, Cao R, Zhang Y, Hu D, et al. Genome-wide identification of the WRKY gene family in Camellia Oleifera and expression analysis under phosphorus deficiency. Front Plant Sci. 2023;14:1082496. https://doi.org/10.3389/fpls.2023.1082496.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Chen Y. Scientific and technological progress and future core technologies of oil tea Camellia in China. J Cent South Univ Forestry Technol. 2023;43(7):1–22. https://doi.org/10.14067/j.cnki.1673-923x.2023.07.001.

    Article  Google Scholar 

  11. Wang Y, Ling X, Ma C, Liu C, Zhang W, Huang J, et al. Can China get out of soy dilemma? A yield gap analysis of soybean in China. Agron Sustain Dev. 2023;43(4):47. https://doi.org/10.1007/s13593-023-00897-6.

    Article  Google Scholar 

  12. Xing T. Study on the germplasm investigation, evaluation and population genetics of wild oil-tea Camellia. Sichuan Agricultural Univ. 2022. https://doi.org/10.27345/d.cnki.gsnyu.2022.001014.

    Article  Google Scholar 

  13. Zhang Y, Yang Z, Ma H, Huang L, Ding F, Du Y, et al. Pyramiding of Fusarium head blight resistance quantitative trait loci, Fhb1, Fhb4, and Fhb5, in modern Chinese wheat cultivars. Front Plant Sci. 2021;12:694023. https://doi.org/10.3389/fpls.2021.694023.

    Article  PubMed  PubMed Central  Google Scholar 

  14. Zhu Y, Liang D, Song Z, Tan Y, Guo X, Wang D. Genetic diversity analysis and core germplasm collection construction of Camellia Oleifera based on fruit phenotype and SSR data. Genes. 2022;13(12):2351. https://doi.org/10.3390/genes13122351.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  15. Kumar M, Chaudhary V, Sharma R, Sirohi U, Singh J. Advances in biochemical and molecular marker techniques and their applications in genetic studies of orchid: a review. Int J Chem Stud. 2018;6(6):806–22. https://www.researchgate.net/publication/329026574.

    CAS  Google Scholar 

  16. Ma S, Khayatnezhad M, Minaeifar AA. Genetic diversity and relationships among Hypericum L. species by ISSR markers: a high value medicinal plant from Northern of Iran. Caryologia. 2021;74(1):97–107. http://digital.casalini.it/5016858.

    Article  Google Scholar 

  17. Nasim N, Sandeep IS, Sahoo A, Das S, Panda MK, Acharya L et al. Population genetic structure and diversity analysis in economically important Pandanus odorifer (Forssk.) Kuntze accessions employing ISSR and SSR markers. Ind Crop Prod. 2020;143:111894. https://doi.org/10.1016/j.indcrop.2019.111894.

  18. Kamali M, Samsampour D, Bagheri A, Mehrafarin A, Homaei A. Association analysis and evaluation of genetic diversity of Teucrium Stocksianum Boiss. Populations using ISSR markers. Genet Resour Crop Ev. 2023;70(3):691–709. https://doi.org/10.1007/s10722-022-01529-w.

    Article  CAS  Google Scholar 

  19. Wang HF, Zong XX, Guan JP, Yang T, Sun XL, Ma Y, et al. Genetic diversity and relationship of global faba bean (Vicia faba L.) germplasm revealed by ISSR markers. Theor Appl Genet. 2012;124:789–97. https://doi.org/10.1007/s00122-011-1750-1.

    Article  PubMed  Google Scholar 

  20. Mir MA, Mansoor S, Sugapriya M, Alyemeni MN, Wijaya L, Ahmad P. Deciphering genetic diversity analysis of saffron (Crocus sativus L.) using RAPD and ISSR markers. Saudi J Biol Sci. 2021;28(2):1308–17. https://doi.org/10.1016/j.sjbs.2020.11.063.

    Article  CAS  PubMed  Google Scholar 

  21. Jedrzejczyk I, Rewers M. Identification and genetic diversity analysis of edible and medicinal alva species using flow cytometry and ISSR molecular markers. Agronomy. 2020;10(5):650. https://doi.org/10.3390/agronomy10050650.

    Article  CAS  Google Scholar 

  22. Sun C, Jia L, Xi B, Liu J, Wang L, Weng X. Genetic diversity and association analyses of fruit traits with microsatellite ISSRs in Sapindus. J Res. 2018;30:193–203. https://doi.org/10.1007/s11676-017-0580-7.

    Article  CAS  Google Scholar 

  23. Han C, Zhou T, Li Y, Ye T, Hu X, Zhao J, et al. Regeneration of different ploidy callus from anther culture of Camellia Oleifera. Mol Breed. 2022;20(12):4057–66. https://doi.org/10.13271/j.mpb.020.004057.

    Article  Google Scholar 

  24. Feng JL, Jiang Y, Yang ZJ, Chen SP, El-Kassaby YA, Chen H. Marker-assisted selection in hybrid population. Silvae Genet. 2020;69(1):63–72. https://doi.org/10.2478/sg-2020-0009.

    Article  Google Scholar 

  25. Yang L, Gao C, Xie J, Qiu J, Deng Q, Zhou Y, et al. Fruit economic characteristics and yields of 40 superior Camellia Oleifera Abel plants in the low-hot valley area of Guizhou Province, China. Sci Rep. 2022;12(1):7068. https://doi.org/10.1038/s41598-022-10620-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Chen Y, Wang B, Chen J, Wang X, Wang R, Peng S, et al. Identification of Rubisco rbcL and rbcS in Camellia Oleifera and their potential as molecular markers for selection of high tea oil cultivars. Front Plant Sci. 2015;6:189. https://doi.org/10.3389/fpls.2015.00189.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Lin P, Yin H, Yan C, Yao X, Wang K. Association genetics identifies single nucleotide polymorphisms related to kernel oil content and quality in Camellia Oleifera. J Agr Food Chem. 2019;67(9):2547–62. https://doi.org/10.1021/acs.jafc.8b03399.

    Article  CAS  Google Scholar 

  28. Peng S. Genome-wide association studies of important economic traits in Camellia Oleifera Abel. BeiJing Forestry University; 2021. https://doi.org/10.26949/d.cnki.gblyu.2021.000463.

  29. He Z, Liu C, Wang X, Wang R, Chen Y, Tian Y. Assessment of genetic diversity in Camellia Oleifera Abel. Accessions using morphological traits and simple sequence repeat (SSR) markers. Breed sci. 2020;70(5):586–93. https://doi.org/10.1270/jsbbs.20066.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Zhao Z, Yan W, Xiao M, Xiao T, Lei F. Molecular identification of pathogens causing root rot of Camellia Oleifera in tropical. Mol Breed. 2020;18(19):6433–40. https://doi.org/10.13271/j.mpb.018.006433.

    Article  CAS  Google Scholar 

  31. Huang X, Han B. Natural variations and genome-wide association studies in crop plants. Annu Rev Plant Biol. 2014;65:531–51. https://doi.org/10.1016/j.jia.2023.05.002.

    Article  CAS  PubMed  Google Scholar 

  32. Zhang Y, Cao YF, Huo HL, Xu JY, Tian LM, Dong XG, et al. An assessment of the genetic diversity of pear (Pyrus L.) germplasm resources based on the fruit phenotypic traits. J Integr Agr. 2022;21(8):2275–90. https://doi.org/10.1016/S2095-3119(21)63885-6.

    Article  CAS  Google Scholar 

  33. Gaillard JM, Festa-Bianchet M, Yoccoz NG, Loison A, Toigo C. Temporal variation in fitness components and population dynamics of large herbivores. Annul Rev Ecol Syst. 2000;31(1):367–93. https://doi.org/10.1146/annurev.ecolsys.31.1.367.

    Article  Google Scholar 

  34. Li Z, Wang W, Zhang H, Liu J, Shi B, Dai W, et al. Diversity in fruit morphology and nutritional composition of Juglans mandshurica Maxim in northeast China. Front Plant Sci. 2022;13:820457. https://doi.org/10.3389/fpls.2022.820457.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Chen Y, Xu Y, Zhang Z, Ma L, Wang X, Chen L, et al. Analysis of fruit main quantitative traits and selection of breeding index in Camellia Oleifera. J Cent South Univ Forestry Technol. 2021;41(3):1–9. https://doi.org/10.14067/j.cnki.1673-923x. 2021.03.001.

    Article  CAS  Google Scholar 

  36. Song L, Liu J, Cao B, Liu B, Zhang X, Chen Z, et al. Reducing brassinosteroid signalling enhances grain yield in semi-dwarf wheat. Nature. 2023;617:1–7. https://doi.org/10.1038/s41586-023-06023-6.

    Article  CAS  Google Scholar 

  37. De Kort H, Prunier JG, Ducatez S, Honnay O, Baguette M, Stevens VM, et al. Life history, climate and biogeography interactively affect worldwide genetic diversity of plant and animal populations. Nat Commun. 2021;12(1):516. https://doi.org/10.1038/s41467-021-20958-2.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Wang B, Chen Y, Tan X, Peng S, Shi M. Genetic diversity elite clones of Camellia Oleifera by ISSR. J Northeast Forestry Univ. 2008;36(6):19–2336. https://doi.org/10.3969/j.issn.1000-2006.2013.01.010.

    Article  Google Scholar 

  39. Yu X, Yu F, Liu J, Chen J. Identification and genetic diversity analysis of Camellia oleifera varieties using ISSR marker. J Nanjing Forestry Univ (Natural Sci Edition). 2013;37(1):61–6. https://doi.org/10.3969/j.issn.1000-2006. 2013.01.010.

    Article  CAS  Google Scholar 

  40. Tamang R, Munda S, Darnei RL, Begum T, Lal M. Genetic diversity evaluation of core collection gene bank using simple sequence repeat marker of Acorus calamus L.: an important aromatic species. Ind Crop Prod. 2023;204:117292. https://doi.org/10.1016/j.indcrop.2023.117292.

    Article  CAS  Google Scholar 

  41. Al-daej MI, Rezk AA, El-Malky MM, Shalaby TA, Ismail M. Comparative genetic diversity assessment and marker-trait association using two DNA marker systems in rice (Oryza sativa L). Agronomy. 2023;13(2):329. https://doi.org/10.3390/agronomy13020329.

    Article  CAS  Google Scholar 

  42. Nida H, Girma G, Mekonen M, Tirfessa A, Seyoum A, Bejiga T, et al. Genome-wide association analysis reveals seed protein loci as determinants of variations in grain mold resistance in sorghum. Theor Appl Genet. 2021;134:1167–84. https://doi.org/10.1007/s00122-020-03762-2.

    Article  CAS  PubMed  Google Scholar 

  43. Demirjian C, Vailleau F, Berthomé R, Roux F. Genome-wide association studies in plant pathosystems: success or failure? Trends Plant Sci. 2023;28(4):471–85. https://doi.org/10.1016/j.tplants.2022.11.006.

    Article  CAS  PubMed  Google Scholar 

  44. Ghomi K, Rabiei B, Sabouri H, Gholamalipour Alamdari E. Association analysis, genetic diversity and population structure of barley (Hordeum vulgare L.) under heat stress conditions using SSR and ISSR markers linked to primary and secondary metabolites. Mol Biol Rep. 2021;48:6673–94. https://doi.org/10.1007/s11033-021-06652-y.

    Article  CAS  PubMed  Google Scholar 

  45. Luan MB, Liu CC, Wang XF, Xu Y, Sun ZM, Chen JH. SSR markers associated with fiber yield traits in ramie (Boehmeria nivea L. Gaudich). Ind Crop Prod. 2017;107:439–45. https://doi.org/10.1016/j.indcrop.2017.05.065.

    Article  CAS  Google Scholar 

  46. Hasan N, Choudhary S, Naaz N, Sharma N, Laskar RA. Recent advancements in molecular marker-assisted selection and applications in plant breeding programmes. J Genetic Eng Biotechnol. 2021;19(1):1–26. https://doi.org/10.1186/s43141-021-00231-1.

    Article  CAS  Google Scholar 

  47. Flanagan BA, Krueger-Hadfield SA, Murren CJ, Nice CC, Strand AE, Sotka EE. Founder effects shape linkage disequilibrium and genomic diversity of a partially clonal invader. Mol Ecol. 2021;30(9):1962–78. https://doi.org/10.1111/mec.15854.

    Article  PubMed  Google Scholar 

  48. Zhou Y, Zhang Z, Bao Z, Li H, Lyu Y, Zan Y, et al. Graph pangenome captures missing heritability and empowers tomato breeding. Nature. 2022;606(7914):527–34. https://doi.org/10.1038/s41586-022-04808-9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  49. Wang Q, Dhindsa RS, Carss K, Harper AR, Nag A, Tachmazidou I, et al. Rare variant contribution to human disease in 281,104 UK Biobank exomes. Nature. 2021;597:527–32. https://doi.org/10.1038/s41586-021-03855-y.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Teległów A, Mardyła M, Myszka M, Pałka T, Maciejczyk M, Bujas P, et al. Effect of intermittent hypoxic training on selected biochemical indicators, blood rheological properties, and metabolic activity of erythrocytes in rowers. Biology. 2022;11(10):1513. https://doi.org/10.3390/biology11101513.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Newell KM, Hancock PA. Forgotten moments: a note on skewness and kurtosis as influential factors in inferences extrapolated from response distributions. J Motor Behav. 1984;16(3):320–35. https://peterhancock.ucf.edu/wp-content/uploads/sites/12/2012/03/Newell_Hancock.

    Article  CAS  Google Scholar 

  52. Mbatchou J, Barnard L, Backman J, Marcketta A, Kosmicki JA, Ziyatdinov A, et al. Computationally efficient whole-genome regression for quantitative and binary traits. Nat Genet. 2021;53(7):1097–103. https://doi.org/10.1038/s41588-021-00870-7.

    Article  CAS  PubMed  Google Scholar 

  53. Chen L, Kong X, Wang R, Ma S, Meng Y, Lu Q, et al. Heterosis of plant gross weight and heterotic group classification of inbred lines in Chinese cabbage (Brassica rapa L. ssp. pekinensis). Sci Hortic. 2021;280:109938. https://doi.org/10.1016/j.scienta.2021.109938.

    Article  Google Scholar 

  54. Marrot P, Charmantier A, Blondel J, Garant D. Current spring warming as a driver of selection on reproductive timing in a wild passerine. J Anim Ecol. 2018;87(3):754–64. https://doi.org/10.1111/1365-2656.12794.

    Article  PubMed  PubMed Central  Google Scholar 

  55. Tantasawat PA, Poolsawat O, Kativat C, Arsakit K, Papan P, Chueakhunthod W, et al. Inheritance and identification of ISSR-RGA markers associated with powdery mildew resistance in mungbean for marker-assisted breeding. Chil J Agr Res. 2022;82(1):3–9. https://doi.org/10.4067/S0718-58392022000100003.

    Article  Google Scholar 

  56. De Mori G, Cipriani G. Marker-assisted selection in breeding for fruit trait improvement: a review. Int J Mol Sci. 2023;24(10):8984. https://doi.org/10.3390/ijms24108984.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Meuwissen THE, Hayes BJ, Goddard ME. Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001;157(4):1819–29. https://doi.org/10.1093/genetics/157.4.1819.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  58. Mastrangelo S, Tolone M, Sardina MT, Sottile G, Sutera AM, Di Gerlando R, et al. Genome-wide scan for runs of homozygosity identifies potential candidate genes associated with local adaptation in Valle Del Belice sheep. Genet Sel Evol. 2017;49(1):1–10. https://doi.org/10.1186/s12711-017-0360-z.

    Article  CAS  Google Scholar 

  59. Yang Y, Amo A, Wei D, Chai Y, Zheng J, Qiao P, et al. Large-scale integration of meta-QTL and genome-wide association study discovers the genomic regions and candidate genes for yield and yield-related traits in bread wheat. Theor Appl Genet. 2021;134(9):3083–109. https://doi.org/10.1007/s00122-021-03881-4.

    Article  CAS  PubMed  Google Scholar 

  60. Langridge P, Reynolds M. Breeding for drought and heat tolerance in wheat. Theor Appl Genet. 2021;134:1753–69. https://doi.org/10.1007/s00122-021-03795-1.

    Article  PubMed  Google Scholar 

  61. Chen Y, Zheng J, Yang Z, Xu C, Liao P, Pu S, et al. Role of soil nutrient elements transport on Camellia oleifera yield under different soil types. BMC Plant Biol. 2023. https://doi.org/10.21203/rs.3.rs-2706706/v1.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Chang Y, Hu S, Xu J, Gong H, Guo X, Song Q, et al. Identification of reference genes provides insights into the determinants of self-incompatibility in Camellia Oleifera. Sci Hortic. 2023;321:112301. https://doi.org/10.1016/j.scienta.2023.112301.

    Article  CAS  Google Scholar 

  63. Chadalavada K, Kumari BR, Kumar TS. Sorghum mitigates climate variability and change on crop yield and quality. Planta. 2021;253(5):113. https://doi.org/10.1007/s00425-021-03631-2.

    Article  CAS  PubMed  Google Scholar 

  64. Yao X, Wang K, Ren H, Li P, Long W, Cao Y, et al. Code of practice on breeding technique of oil-tea Camellia. State Forestry and Grassland Administration of China; 2020. https://www.doc88.com/p-90929084087424.

  65. Würschum T, Zhu X, Zhao Y, Jiang Y, Reif JC, Maurer HP. Maximization through optimization? On the relationship between hybrid performance and parental genetic distance. Theor Appl Genet. 2023;136(9):186. https://doi.org/10.1007/s00122-023-04436-5.

    Article  PubMed  PubMed Central  Google Scholar 

  66. Tao A, Huang L, Wu G, Afshar RK, Qi J, Xu J, et al. High-density genetic map construction and QTLs identification for plant height in white jute (Corchorus capsularis L.) using specific locus amplified fragment (SLAF) sequencing. BMC Genomics. 2017;18(1):1–12. https://doi.org/10.1186/s12864-017-3712-8.

    Article  CAS  Google Scholar 

  67. Tichkule S, Jex AR, Van Oosterhout C, Sannella AR, Krumkamp R, Aldrich C, et al. Comparative genomics revealed adaptive admixture in Cryptosporidium hominis in Africa. Microb Genomics. 2021;7(1):mgen000493. https://doi.org/10.1099/mgen.0.000493.

    Article  CAS  Google Scholar 

  68. Tang Y, Qiu X, Hu C, Li J, Wu L, Wang W, et al. Breeding of a new variety of peanut with high-oleic-acid content and high-yield by marker-assisted backcrossing. Mol Breed. 2022;42(7):42. https://doi.org/10.1007/s11032-022-01313-9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Esch L, Ngai QY, Barclay JE, McNelly R, Hayta S, Smedley MA, et al. Increasing amyloplast size in wheat endosperm through mutation of PARC6 affects starch granule morphology. New Phytol. 2023;240(1):224–41. https://doi.org/10.1111/nph.19118.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  70. Huang R, Wang JY, Yao MZ, Ma CL, Chen L. Quantitative trait loci mapping for free amino acid content using an albino population and SNP markers provides insight into the genetic improvement of tea plants. Hortic Res. 2022;9:uhab029. https://doi.org/10.1093/hr/uhab029.

    Article  PubMed  PubMed Central  Google Scholar 

  71. Li C, Fan R, Ma C, Zhang Z, Li Z, Zhu L, et al. Reciprocal translocations hidden by phenotype and genotype within the same wheat cultivar. Crop Sci. 2023;63(5):2727–39. https://doi.org/10.1002/csc2.21041.

    Article  CAS  Google Scholar 

  72. Yeh FC, Yang RC, Boyle TJ, Ye ZH, Mao JX. Popgene ver. 1.32, the user-friendly shareware for population genetic analysis. Molecular Biology and Biotechnology Centre, University of Alberta, Edmonton, Canada; 1999. http://www.ualberta.ca/fyeh.

  73. Rohlf FJ. NTSYSpc: numerical taxonomy and multivariate analysis system. Vesion:2.11x. Exeter software, Setauket, New York; 2000. https://ntsyspc.software.informer.com.

  74. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155(2):945–59. PMC1461096/pdf/ 10835412.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Abdelraheem A, Elassbli H, Zhu Y, Kuraparthy V, Hinze L, Stelly D, et al. A genome-wide association study uncovers consistent quantitative trait loci for resistance to Verticillium wilt and Fusarium wilt race 4 in the US Upland cotton. Theor Appl Genet. 2020;133:563–77. https://doi.org/10.1007/s00122-019-03487-x.

    Article  CAS  PubMed  Google Scholar 

  76. Ping X, Hao L, Hai-yuan L, Ge Z, Sheng-jie D, Xiao-yu C, et al. Genome-wide and candidate gene association studies identifies BnPAP17 conferring utilization of organophosphorus in oilseed rape. J Integr Agr. 2023. https://doi.org/10.1016/j.jia.2023.05.002.

    Article  Google Scholar 

  77. Qiu S, Yong L, Wilson A, Croft DP, Graham C, Charlesworth D. Partial sex linkage and linkage disequilibrium on the guppy sex chromosome. Mol Ecol. 2022;31(21):5524–37. https://doi.org/10.1111/mec.16674.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  78. Errigo A, Bitti A, Galistu F, Salis R, Pes GM, Dore MP. Relationship between glucose-6-phosphate dehydrogenase deficiency, X-chromosome inactivation and inflammatory markers. Antioxidants. 2023;12(2):334. https://doi.org/10.3390/antiox12020334.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references

Acknowledgements

We are grateful to Tong Kou Forest Farm in Minhou County, Fujian Province, China for providing a field experiment site.

Funding

This research was funded by the Local Science and Technology Fund Project Guided by the Central Government (2021L3053), Agricultural University Industry Education Cooperation Project (2023N5003) and Agricultural Guiding (key) Project (2023N0002).

Author information

Authors and Affiliations

Authors

Contributions

JJZ: Methodology, Data analysis, Investigation, Writing-original draft. HQS: Methodology, Data analysis. SSP: Methodology, Data analysis. HC: Resources. YAE: Conceptualization, Writing -review and editing. ZJY: Methodology, Data analysis, Investigation, Resources. JLF: Conceptualization, Project administration, Supervision, Writing-original draft.

Corresponding authors

Correspondence to Yousry A. El-Kassaby, Zhijian Yang or Jinling Feng.

Ethics declarations

Ethical approval and consent to participate

This article does not contain any studies with human or animal subjects. All methods were carried out in compliance with local and national regulations. On behalf of all co-authors, the corresponding author states that the work described is original, previously unpublished research. All the authors listed have approved the manuscript.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zheng, J., Su, H., Pu, S. et al. High-yield hybrid breeding of Camellia oleifolia based on ISSR molecular markers. BMC Plant Biol 24, 517 (2024). https://doi.org/10.1186/s12870-024-05218-x

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-024-05218-x

Keywords