- Research article
- Open Access
QTL mapping and genome-wide association study reveal two novel loci associated with green flesh color in cucumber
BMC Plant Biologyvolume 19, Article number: 243 (2019)
Green flesh color, resulting from the accumulation of chlorophyll, is one of the most important commercial traits for the fruits. The genetic network regulating green flesh formation has been studied in tomato, melon and watermelon. However, little is known about the inheritance and molecular basis of green flesh in cucumber. This study sought to determine the main genomic regions associated with green flesh. Three F2 and two BC1 populations derived from the 9110Gt (cultivated cucumber, green flesh color) and PI183967 (wild cucumber, white flesh color) were used for the green flesh genetic analysis. Two F2 populations of them were further employed to do the map construction and quantitative trait loci (QTL) study. Also, a core cucumber germplasms population was used to do the GWAS analysis.
We identified three indexes, flesh color (FC), flesh extract color (FEC) and flesh chlorophyll content (FCC) in three environments. Genetic analysis indicated that green flesh color in 9110Gt is controlled by a major-effect QTL. We developed two genetic maps with 192 and 174 microsatellite markers respectively. Two novel inversions in Chr1 were identified between cultivated and wild cucumbers. The major-effect QTL, qgf5.1, was identified using FC, FEC and FCC index in all different environments used. In addition, the same qgf5.1, together with qgf3.1, was identified via GWAS. Further investigation of two candidate regions using pairwise LD correlations, combined with genetic diversity of qgf5.1 in natural populations, it was found that Csa5G021320 is the candidate gene of qgf5.1. Geographical distribution revealed that green flesh color formation could be due to the high latitude, which has longer day time to produce the photosynthesis and chlorophyll synthesis during cucumber domestication and evolution.
We first reported the cucumber green flesh color is a quantitative trait. We detected two novel loci qgf5.1 and qgf3.1, which regulate the green flesh formation in cucumber. The QTL mapping and GWAS approaches identified several candidate genes for further validation using functional genomics or forward genetics approaches. Findings from the present study provide a new insight into the genetic control of green flesh in cucumber.
Fruit flesh color, an important feature for consumers’ choice, is a key trait of breeding . Chlorophyll and carotenoid are the main pigments contribute to the flesh color formation . Different composition and concentration of chlorophyll and carotenoid contribute to the red, orange, yellow, green, light green and white flesh color in Cucurbit fruits [3,4,5]. Chlorophyll and its natural or commercial derivatives have demonstrated to have antioxidant, and antimutagenic activity, and have function in modulating xenobiotic metabolizing enzymes, and induction of apoptotic events in cancer cell lines in vitro and in vivo experiments . In addition, they have the ability to induce mammalian phase 2 proteins which protect cells against deleterious effect of oxidants and electrophiles . Protective effects of chlorophyll and their watersoluble salts (chlorophyllin) against consequence of carcinogen exposure like aflatoxin were also confirmed in animals. Carotenoids including β–carotene, α-carotene, lutein, etc. are the important precursors of vitamin A, which is necessary for human health especially the eye health [8, 9].
Cucurbit fruits have rich variations in flesh color, especially in melon and water melon. In melon (Cucumis melo L.), the flesh color exhibited white, yellow and orange because of the accumulation of chlorophyll and carotenoid . In the previous study, several genes/QTLs for flesh color in melon were reported. Hughes  and Imam et al.  first identified the green flesh (gf) gene and white flesh (wf) gene, respectively. Moreover, Clayberg  indicated that green and white flesh are recessive to orange, and also developed a genetic model for the inheritance among white, green and orange flesh. However, several follow-up studies didn’t confirm the above genetic model [13,14,15]. To understand the inheritance and gene control of flesh color more accurately, especially for the orange flesh color, the beta-carotene content combined with the flesh color were used for the analysis. Cuevas et al.  detected eight QTLs associated with the beta-carotene content used a set of 81 recombinant inbred lines (RIL) derived from ‘USDA 846–1’ and ‘Top Mark’. To detect more stable QTLs, Cuevas et al.  constructed a novel genetic map used a set of 116 F3 families derived from the ‘Q 3–2-2’ and ‘Top Mark’, and detected three QTLs related with beta-carotene content. Comparative analysis showed that three QTLs (β-carE.6.1, β-carM.8.1 and β-carM.9.1) could be detected repeatedly in two different populations, which revealed these three QTLs more critical for the orange flesh formation in melon. In addition, Tzuri et al.  also identified a gene, Cmor, which was found to co-segregate with flesh color in melon. However, the molecular base and genetic control of flesh color formation in melon is still not clear.
In watermelon (Citrullus lanatus), different composition and concentration of carotenoids contribute to the red, orange, canary yellow, salon yellow and white flesh color . Tadmor et al.  and Bang et al.  analysed the pigment components of different color flesh, and indicated that red flesh color results from lycopene, orange from prolycopene and rarely from β-carotene, canary yellow from xanthophylls and β-carotene, salmon yellow from pro-lycopene, which suggested the flesh color of watermelon is a complex trait. Briefly, white flesh is epistatic to canary yellow, canary yellow is epistatic to coral red, canary yellow is dominant to red and orange [3, 20]. For the gene/QTL study, Hashizume et al.  detected two flesh color QTLs in a biparental F2 population derived from H-7 (red flesh) and SA-1 (white flesh). Liu et al. [22, 23] identified only one QTL on chromosome 4 used a F2 population derived from LSW-177 (red flesh) and COS (yellow flesh). Branham et al.  combined visual color phenotyping with genotyping-by-sequencing of an F2:3 population derived from NY0016 (orange flesh) and EMB (canary yellow flesh) and detected a major locus on Chr1, which was associated with β-carotene content. Recently, Zhang et al.  reported that chromoplast development plays a crucial role in controlling carotenoid content in watermelon flesh, and detected a key gene ClPHT4;2 which was up-regulated during flesh color formation in watermelon. Thus, it is likely that several different genes and biochemical pathways affect the pigment accumulation during watermelon flesh color formation.
In cucumber, Qi  first described the semi-wild Xishuangbanna cucumber (Cucumis sativus L. var. xishuangbannanesis Qi et Yuan) has orange flesh color, which is due to the accumulation of high level of β-carotene in mature fruits . Inheritance analysis indicated that two recessive genes control orange mesocarp, while a single recessive gene controlled orange endocarp . Then, Bo et al.  first mapped the ore (orange endocarp) gene on Chr3. Another colored cucumber is PI200815 originate from Myanmar, which has the yellow flesh color in the mature fruits . Kooistra  analyzed the inheritance of yellow fruit flesh in cucumber and suggested that flesh color (including orange, yellow, dingy white, and intense white) was determined by two genes. Lu et al.  reported that yellow flesh was controlled by a recessive gene (yf) and finished the initial mapping. However, the above orange and yellow flesh color only appeared in the mature stage, which is difficult to be used for cucumber production. In the modern cucumber breeding, green flesh is already one of the most important quality traits. While there are very few reports regarding the green flesh in cucumber.
In the present study, three indexes (flesh color, flesh extract color and flesh chlorophyll content) were used to identify the green flesh. Three F2, one BC1P1 and one BC1P2 populations were used to analyze the inheritance of flesh color, and the green flesh locus was mapped by using two F2 populations in two environments. In addition, we detected two locus qgf3.1 and qgf5.1 using GWAS. The qgf5.1 is consistent with the major QTL detected in two F2 populations. To further investigate the domestication history of green flesh, we did the world map distribution analysis with 115 core cucumber germplasms (CG), which indicated that the higher latitude region has more green flesh cucumbers. This study thus provides important insights into the green flesh color formation in cucumber.
Phenotypic variation of FC, FEC and FCC in F2 and BC1 populations
Phenotypic data of FC, FEC and FCC were collected from the two parents, their F1, three F2, one BC1P1 and one BC1P2 populations in five experiments over two years (Fig. 1, Additional file 1: Table S1). The FC and FEC were categorized into five color groupings in above populations (Additional file 1: Figures S1 and S2) using the grade scale (Fig. 2a-b). The frequency distributions of FC, FEC and FCC among the populations from different experiments are illustrated in Fig. 2c-e and Additional file 1: Figure S3. All observed distributions of FC and FEC in three F2 populations showed a clear bimodal distribution, which suggested that the flesh color was controlled by a major QTL. The average of FC, FEC and FCC in BJ2017F experiment was found to be higher than that of the SY2016W and SY2017W experiments (Table 1, Fig. 2c-e). The reason could be that longer day time in Beijing promotes chlorophyll synthesis during the fruit development than Sanya (See discussion).
The FC, FEC and FCC correlations among multiple environments were examined for each population and the Spearman’s rank correlation coefficients (rs) are presented in Table 2. Strong positive correlations were observed among FC, FEC and FCC in different environments in each population, which suggested the green flesh phenotype is caused by the higher chlorophyll content in cucumber fruit. Interestingly, for the SY2016W and SY2017W experiments, the average of FC, FEC and FCC in SY2016W is lower than that in SY2017W, which probably because of the small amount of chlorophyll degradation during long-distance transportation from Sanya to Beijing (about 7 days). In SY2016W experiment, all the phenotypic data were collected in Beijing. While in SY2017W experiment, we collected all the data in Sanya using the fresh cucumber flesh. Despite this, the QTL detected with the SY2016W data was very consistent with that identified with other data sets (see below).
To summarize, despite the different environments and scoring scales in the five phenotyping experiments, data collected from these trials were highly correlated, consistent, and of good quality, which provided a solid foundation for subsequent QTL analysis.
Linkage map construction
We screened the cucumber SSR primer pairs between PI183967 and 9110Gt and identified 201 polymorphic ones for genetic map construction. The resulting genetic map is illustrated in Additional file 1: Figures S4 and S5, and the main statistics of the map are presented in Table S2. Detailed information (marker names, map location, 9930 draft genome assembly locations and primer sequences) of two maps was provided in Additional file 1: Table S3 and S4.
We developed two high-density maps using two F2 populations, respectively. The 234 F2 map included 192 markers that spanned 922.3 cM with an average marker interval of 4.95 cM. The 125 F2 map comprised 174 markers that spanned 901.1 cM with an average marker interval of 5.67 cM.
According to the 9930 genome (V2.0) anchored by these markers, these two maps seemed to physically cover the majority of the cucumber genome. Therefore, these two genetic maps were suitable for the QTL mapping analysis.
Chromosome rearrangements between cultivated and wild cucumbers
To investigate possible chromosome structural rearrangements between the cultivated (C. s. var. sativus) and wild (C. s. var. hardwickii) cucumbers, we aligned the two wild cucumber originated maps developed herein with the 9930 genome (V2.0). Several typically shared markers in two maps presented in Additional file 1: Figures S3 and S4 were used for the chromosome by chromosome alignment. Putative structural rearrangements represented by SSR markers between cultivated cucumber and wild cucumber are illustrated in Fig. 3.
In Chr1, 4, 5 and 7, there were two, two, three and one blocks, respectively, in which the orders of molecular markers were inconsistent with the physical location suggesting possible inversions between the cultivated and wild cucumbers. It is known that two, three, and one inversion differentiated the Chr4, 5, and 7 of cultivated and wild cucumbers . The locations of the rearrangements (on Chr4, 5 and 7) identified from the present study were largely consistent with those found between the cultivated and wild cucumbers. Interestingly, we found a novel putative inversion in Chr1 (Fig. 3). The above putative inversions in wild cucumber could be the reason to promote cucumber evolution in order to adapt different environments (See below).
QTL mapping of FC, FEC and FCC
We conducted QTL analysis using the CIM approach with FC, FEC and FCC phenotypic data for each experiment. Initial whole genome QTL mapping was conducted with a window size of 25 cM because of a few large gaps (> 10 cM) in the genetic maps (Additional file 1: Figures S4, S5). The LOD threshold to declare significance of QTL for each trait was determined with 1000 permutation tests (P = 0.05). All green flesh related QTLs detected in the present study were illustrated in Fig. 4a, c. The QTL information for FC, FEC and FCC are shown in Table 3. Figure 4b, d showed the major QTL detected in two populations. We total detected six QTLs by using FC, FEC and FCC three indexes.
We examined the relationships of QTL detected with three different phenotypic indexes (FC, FEC and FCC) in SY2016W and BJ2017F experiments. The results are presented in Table 3 and Fig. 4. All three datasets detected QTL in Chr5. The 1.5-LOD intervals and peak locations of QTL for FC, FEC and FCC were overlapped, suggesting that they probably belonged to the same QTL although the LOD support value and the effects were somewhat different (Table 3). In SY2016W experiment, the QTL with largest effect, qgf5.1 (R2 = 22–38%), was identified using FC, FEC and FCC three different indexes, with a highly consistent peak on the genetic map at 5.7 cM (Fig. 4b and Table 3). In BJ2017F experiment, the QTL with largest effect, qgf5.2 (R2 = 49–64%), was also identified using FC, FEC and FCC three different indexes, with a highly consistent peak on the genetic map at 22.5 cM (Fig. 4d and Table 3).
In brief, two major-effect QTLs qgf5.1 and qgf5.2 were detected by using three indexes in two experiments, which indicated that qgf5.1 and qgf5.2 two loci mainly control the green flesh formation in cucumber. While the qgf5.1 and qgf5.2 QTLs are located on the different locations on Chr5, we still believe they are the same QTL because these two QTLs located in the inversion region on Chr5 (Fig. 3). That also suggested the green flesh could be a domestication trait during cucumber evolution.
Evolution and geographical distribution analysis of green flesh in a natural population
The flesh color of all 115 cucumber CG lines was categorized into five groups based on flesh color (Fig. 5b). To understand the flesh color distribution in the world easier, we defined group 1, 2 and 3 as white flesh (chlorophyll content < 11 μg/L), group 4 and 5 as green flesh (chlorophyll content > 11 μg/L). The approximate geographical coordinates (altitude and latitude) of the origin of each line were used to plot the 115 lines on a world map (Fig. 5a). We found almost all the European cucumber has the green flesh. For the North China cucumber, around half presented green flesh. However, in the South China and India type cucumber, few accessions with the green flesh. To further ascertain the domestication path of green flesh in cucumber, we defined three latitudes on the distribution map: wild cucumber latitude (WCL), North China cucumber latitude (NCCL), and European cucumber latitude (ECL) (Fig. 5a). Interestingly, the higher latitude, the more cucumbers presented green flesh, which suggesting the green flesh could be affected by the environment during domestication and evolution. The reasonable explanation should be the illumination time in higher latitude area is longer than that in lower latitude. Four critical illumination times over one year were showed in Fig. 5c. We found the illumination time at ECL is significantly longer than that in WCL from May to October. Longer illumination time will promote the chlorophyll synthesis in cucumber fruit, which could be the reason of green flesh formation. We also observed that SCCL has higher genetic diversity for cucumber fruit flesh color, which may attribute to its extensive history of domestication or diversity selection in cucumber breeding.
Candidate gene identification of the green flesh using GWAS
We detected a major QTL in Chr5 using two F2 populations, while it is difficult to do the fine mapping work because the QTL is located on the inversion region. To verify the locus on Chr5 and detect more novel locus related to the green flesh, 115 CG lines were used to do the GWAS analysis. Using the general linear model, two locus, qgf3.1 and qgf5.1, were detected that consistently exceeded a significant threshold (−log10P ≥ 7.0) (Fig. 6a). For the qgf3.1, a candidate region was estimated to extend from 11.975 Mb to 12.051 Mb (~ 76 kb) on Chr3 using pairwise LD correlations (r2 ≥ 0.6) (Fig. 6b). According to the Cucumber Genome Browser (http://cucurbitgenomics.org/organism/2), nine annotated genes are located in the qgf3.1 candidate region (Fig. 6c). For the qgf5.1, we observed that the SNP_736045 showed the strongest association with green flesh (Fig. 6d). We extended 100 Kb near the SNP_736045 for pairwise LD correlation analysis. A small candidate region was estimated to extend from 718.225 Kb to 738,975 Kb (~ 20 kb) on Chr5, which included one annotated gene (Fig. 6e). To verify the candidate gene of qgf5.1, we did an alignment among 251 natural cucumber lines. The results showed that four unique SNPs that were associated with green flesh (Fig. 6f). Among the four SNPs, two were located on the fifth exon and the other two were located on the sixth exon. The first and fourth SNP resulted in an amino acid substitution from G (gly) to A (ala) and H (his) to Y (tyr), respectively (Fig. 6f). Interestingly, the SNP variation of green flesh line is not consistent with the phenotype completely, which indicated that the green flesh in natural cucumber could be controlled by multiple genes (such as the loci qgf3.1).
Inheritance and QTL/gene of the cucumber flesh color
In the present work, two parents (P1: PI183967; P2: 9110Gt), their F1, three F2, one BC1P1 and one BC1P2 populations in five experiments over two years were used to study the inheritance of FC, FEC and FCC in cucumber. The FC, FEC and FCC phenotypic data had a significant positive correlation for the same population in different experiments (Table 2). The frequency distributions of FC, FEC and FCC among the three F2 populations were bimodal rather than normal (Fig. 2), especially for the FC and FEC, suggesting that green flesh color is controlled by a major QTL. A number of other cucumber flesh color trait have been also identified, such as orange endocarp, orange mesocarp and yellow flesh. Cuevas et al.  reported the single gene model for the orange endocarp using F2 populations. Bo et al.  also confirmed this result using 124 recombinant inbred lines in three different environments.
Regarding the inheritance of orange mesocarp in cucumber, Navazio  presented single gene model, while Cuevas et al.  showed two gene model. The difference probably results from the population sizes [Navazio  = 46 F2 progenies versus Cuevas et al.  = 111 F2 and 51 BC1P2 progenies] and/or the growing environment. For the yellow flesh, Lu et al.  showed that a single recessive gene controlled the yellow flesh using a large F2 population. The above genetic analysis indicated that cucumber flesh color is mostly a quality trait, which suggests that the specific genes mutant result in the flesh color formation during cucumber evolution and domestication.
Chromosome differentiation of cultivated and wild cucumbers
Based on the taxonomic studies, cucumber can be divided into four different botanical variants: the cultivated cucumber, the wild cucumber [33, 34], the Sikkim cucumber , and the Xishuangbanna cucumber [26, 36]. Among of them, the wild cucumber has a large difference with other cucumbers on the morphological variation. Thus, the cucumber can be also divided into two extremities: the wild cucumber and the cultivated cucumber [37, 38].
Several previous studies reported the chromosome rearrangements between the wild cucumbers and cultivated cucumbers [31, 39, 40]. In order to get more information about the chromosome rearrangements between the wild and cultivated cucumber, we did a global analysis in this study. We detected two, two, three and one chromosome rearrangements in Chr1, 4, 5 and 7, respectively (Fig. 3). Yang et al.  identified six inversions in Chr4, 5, and 7 between wild and cultivated cucumber. Compared with the previous study, we found a novel putative inversion in Chr1. In plants, the inversions play an important role in the domestication and evolution [41,42,43]. Moreover, the previous studies also identified the same six inversions in the cultivated cucumber [31, 40, 44, 45], which suggested that the six inversions are common to wild and cultivated cucumbers. However, the two novel inversions detected in the present study between cultivated and wild cucumbers could be unique for the wild cucumber and further research is needed.
Green flesh, chlorophyll metabolism and photosynthesis
In the present study, FC, FEC and FCC three indexes were used to define the green flesh color. QTL mapping results showed that all these three traits shared the same locus (Fig. 4), which suggesting that cucumber green flesh formation could because the chlorophyll accumulation during fruit development. Moreover, we found the cucumber with higher chlorophyll content often origin from the higher latitude area (Fig. 5a). The reasonable explanation is that the higher latitude area has longer illumination time (Fig. 5c), which can maintain longer photosynthesis and improve the chlorophyll synthesis. The chlorophyll content, as a critical feature of unripe fruits, affects the nutritional components and flavor of ripe fruits. Moreover, the link between chlorophyll content and photosynthesis in fruit tissues has been illuminated by a variety of studies [46, 47].
Tomato (Solanum lycopersicum) is a typical fruit with obvious chlorophyll metabolism at maturation and has been used as a model for chlorophyll metabolism studies. The regular tomato shows red flesh color at the ripening stage, while the green-flesh (gf) mutant of tomato still inhabited green flesh at the ripening stage because of the lack of chlorophyll degradation in gf mutant . Moreover, Akhtar et al.  found that the leaves of the gf mutant also showed stay-green phenotype, which means the GF gene plays the same role both in leaves and fruits. In the gf mutant fruits, lots of chlorophyll still remain in the ripe fruit, suggesting that chlorophyll degradation is defective in the mutant material. And it might due to the amount of the chloroplast thylakoid grana existed in the plastids of ripe fruits . Roca et al.  discussed the carotenoid biosynthesis in the gf mutant and showed that the carotenogenesis can be slowed in mutant lines. In addition, several genes were reported to affect the photosynthesis and chloroplast development for the chlorophyll metabolism in tomato. LeHY5 is positive for the fruit pigment accumulation while LeCOP1LIKE gene is a negative regulator . Tomato high pigment-2dg mutant showed a highly significant increase in chloroplast size compared with the regular tomato . Rohrmann et al.  identified AP2-EREBP, AUX/IAA, C2C2 etc. transcription factors regulated the chlorophyll level. Then, Waters et al.  confirmed that the GLK genes can influence chloroplast development. Recently, APRR2-Like genes were reported to increase plastid number, area, and enhance the chlorophyll levels in immature tomato fruits .
However, very few genes controlling fruits chlorophyll content were reported in cucumber. The candidate region/gene in the present study provides important clues for future fine mapping and cloning of these green flesh color loci. Nine and one candidate genes were predicted for the qgf3.1 and qgf5.1 loci by GWAS, respectively. Based on the predicted function, the candidate genes play roles in carbohydrate metabolic process (Csa3G176330), ribosome biogenesis (Csa3G176340), histone peptidyl-prolyl isomerization (Csa3G176850), glycolytic process (Csa3G176870), chloroplast envelope (Csa3G177380) and chloroplast stroma process (Csa5G021320), etc. Thus, the Csa3G177380 and Csa5G021320 are the most likely candidate genes for qgf3.1 and qgf5.1 loci. The SNPs developed in this work are also useful for marker-assisted selection in breeding for green flesh cucumber. While, due to the F2 population employed in the present study can’t be used to collect phenotypic data in multiple years/environments. To verify the green flesh loci target region and get more accurate and stable loci, we are developing the recombinant inbred lines (RILs). The next plan will verify the candidate genes using RILs and more natural accessions, and will also clone the genes for functional analysis.
We reported the cucumber green flesh color is a quantitative trait. Two novel loci qgf5.1 and qgf3.1, which regulate the green flesh formation in cucumber, were identified using QTL mapping and GWAS approaches. We also identified several candidate genes for further validation using functional genomics or forward genetics approaches. In addition, two novel chromosome rearrangements were detected in Chr1 between cultivated and wild cucumber.
Plant materials and mapping populations
Two inbred lines, 9110Gt (P1) and PI183967 (P2) were used as the parental lines to develop three F2 (234, 125 and 140 individuals, respectively), one BC1P1 (78 individuals) and one BC1P2 (77 individuals) populations for genetic analysis in the present study. 9110Gt was derived from the cross between a European greenhouse hybrid and a Northern Chinese line with dominant European glasshouse cucumber genetic background, which has green flesh color and high chlorophyll content (Fig. 1). PI183967 is a typical wild cucumber, which has white flesh and low chlorophyll content (Fig. 1). Among the above populations, 234 and 125 F2 individuals were used for genetic map construction and QTL mapping. For association analysis, 115 cucumber core germplasm (CG) lines were identified . All the material seeds used in the present study were provided by the Cucumber Research Group of the Institute of Vegetable and Flowers Chinese Academy of Agricultural Science.
Phenotypic data collection and analysis
For the genetic analysis study, phenotypic data of flesh color were collected in three environments over 2 years (2016, 2017) at two locations, which were designated as SY2016W, BJ2017F and SY2017W, respectively. SY2016W and SY2017W were conducted at the Sanya Research Station (109°60′ N, 18°29′ E), Hainan, China in 2016 and 2017 winter, respectively. BJ2017F was performed at the Nankou Research Station (116°10′ N, 40°22′ E), Beijing, China in 2017 fall. The two parental lines and their F1 were included in all screening tests.
For the GWAS study, phenotypic data of flesh color were collected in two environments, which were designated as CG2017S and CG2017F. CG2017S and CG2017F were conducted at the Nankou Research Station (116°10′ N, 40°22′ E), Beijing, China in 2017 spring and fall.
To identify the phenotype more accurately, we used three indexes: flesh color (FC), flesh extract color (FEC) and flesh chlorophyll content (FCC). For the FC identification, about one cubic centimeter flesh block was extracted from each fruit (Additional file 1: Figures S1 and S2). All the blocks were categorized into five color groupings (group 1 to group 5) based on the visual measurement. For the FEC identification, 2 g flesh sample of each fruit was put into 50-ml tubes with 40 ml extraction solution (95% alcohol) and kept in dark for 24 h. Then, the extract color was also categorized into five color groupings (group 1 to group 5) based on the visual measurement. The method of chlorophyll content measurement followed Tang et al. . All three indexes were used for the QTL mapping study.
In each experiment, the FC/FEC/FCC was collected from three commercial mature cucumber fruits in the same individual. Statistical analysis of phenotypic data was performed using SAS v9.3 (SAS Institute Inc., Cary, NC, USA). Pearson’s correlation coefficients among different traits for each population were estimated with the PROC CORR function based on the grand mean of each experiment.
Linkage map development
Cucumber SSR markers described in Ren et al. , Cavagnaro et al. , and Yang et al.  were used to screen for polymorphisms in crosses between 9110Gt and PI183967. Polymorphic markers were used to genotype the two F2 populations (234 and 125 individuals). All markers were tested against the expected segregation ratio of 1:2:1 or 3:1 using Chi-squared tests (χ2, P < 0.05). Linkage analysis was carried out with JoinMap 4.0. Genetic map was developed with the regression mapping method and Kosambi mapping function.
DNA extraction, PCR amplification of molecular markers, and gel electrophoreses were performed as described by Li et al. .
QTL analysis was performed with the R/qtl software package (http://www.rqtl.org/) with composite interval mapping (CIM) method [61, 62]. Genome-wide LOD threshold values (P < 0.05) for declaring the presence of QTLs were determined using 1000 permutations. The refined significant QTLs were assessed for the percentage of phenotypic variations (R2) explained. The support intervals for these QTLs were calculated using a 1.5-LOD drop interval. QTL naming conventions followed Bo et al. . For example, qgf5.1 designated the first QTL for green flesh on cucumber Chr5.
Geographical distribution analysis and day length calculation
The geographical information of 115 CG lines was described in the previous study [57, 63]. DIVA-GIS software was used to construct the geographical map followed by Bo et al. . The day length was calculated by subtracting sunrise time from sunset time. The sunset/sunrise time data among Beijing China, Sanya China, India and Netherland was downloaded from the website http://richuriluo.qhdi.com.
Genome-wide association analyses of flesh chlorophyll content
A general linear model (GLM) was used for association tests, with an estimated relatedness matrix as covariate. A total of 3,877,848 SNPs were used for this analysis . GWAS was conducted, and the genome-wide lowest P value was recorded. The 5% lowest tail was taken from the 200 recorded minimal P values as the threshold for genome-wide significance. The Manhattan map for GWAS was generated by using the R package qqman .
Availability of data and materials
All data generated or analysed during this study are included in this published article and its supplementary information files.
Flesh chlorophyll content
Flesh extract color
Quantitative trait loci
Single nucleotide polymorphism
Simple sequence repeat
Adami M, Franceschi PD, Brandi F, Liverani A, Giovannini D, Rosati C, et al. Identifying a carotenoid cleavage dioxygenase (ccd4) gene controlling yellow/white fruit flesh color of peach. Plant Mol Biol Rep. 2013;31(5):1166–75.
Li L, Yuan H. Chromoplast biogenesis and carotenoid accumulation. Arch Biochem Biophys. 2013;539(2):102–9.
Henderson WR, Scott GH, Wehner TC. Interaction of flesh color genes in watermelon. J Hered. 1998;89(1):50–3.
Burger Y, Paris H, Cohen R, Katzir N, Tadmor Y, Lewinsohn E, et al. Genetic diversity of Cucumis melo. Hortic Rev. Am Soc Hortic Sci. 2009;36(1):165–98.
Zhang L, Zhang ZK, Zheng TT, Wei WL, Zhu YM, Gao YS, et al. Characterization of carotenoid accumulation and carotenogenic gene expression during fruit development in yellow and white loquat fruit. Horticultural Plant J. 2016;2(1):9–15.
Ferruzzia M, Blakeslee J. Digestion absorption and cancer preventative activity of dietary chlorophyll derivatives. Nutr Res. 2007;27(1):1–12.
Gore RD, Palaskar SJ, Bartake AR. Wheatgrass: green blood can help to fight Cancer. J Clin Diagn Res. 2017;11(6):ZC40–2.
DellaPenna D, Pogson BJ. Vitamin synthesis in plants: tocopherols and carotenoids. Annu Rev Plant Biol. 2006;57:711–38.
Tzuri G, Zhou XJ, Chayut N, Yuan H, Portnoy V, Meir A, et al. A ‘golden’ SNP in CmOr governs the fruit flesh color of melon (Cucumis melo). Plant J. 2015;82(2):267–79.
Hughes M. The inheritance of two characters of Cucumis melo and their interrelationship. Proc Am Soc Hortic Sci. 1948;52:399–402.
Imam MKL, Abo-Bakr MA, Hanna HY. Inheritance of some economic characters in crosses between sweet melon and snake cucumber. I. Inheritance of qualitative characters. Assiut J Ag Sco. 1972;3:363–80.
Clayberg C. Interaction and linkage test of flesh color genes in Cucumis melo L. Rep Cucurbit Genet Coop. 1992;15:53.
Perin C, Hagen LS, de Conto V, Katzir N, Danin-Poleg Y, Portnoy V, Baudracco-Arnas S, Chadoeuf J, Dogimont C, Pitrat M. A reference map of Cucumis melo based on two recombinant inbred line populations. Theor Appl Genet. 2002;104:1017–34.
Monforte AJ, Oliver M, Gonzalo MJ, Alvarez JM, Dolcet-Sanjuan R, Arus P. IdentiWcation of quantitative trait loci involved in fruit quality traits in melon (Cucumis melo L.). Theor Appl Genet. 2004;108:750–8.
Fukino N, Ohara T, Monforte A, Sugiyama M, Sakata Y, Kunihisa M, Matsumoto S. Identification of QTLs for resistance to powdery mildew and SSR markers diagnostic for powdery mildew resistance genes in melon (Cucumis melo L.). Theor Appl Genet. 2008;118:165–75.
Cuevas HE, Staub JE, Simon PW, Zalapa JE, McCreight JD. Mapping of genetic loci that regulated quantity ofβ-carotene in fruit of U.S. Western shipping melon (Cucumis melo L.). Theor Appl Genet. 2008;117:1345–59.
Cuevas HE, Staub JE, Simon PW, Zalapa JE. A consensus linkage map identifies genomic regions controlling fruit maturity and beta-carotene-associated flesh color in melon (Cucumis melo L.). Theor Appl Genet. 2009;119:741–56.
Tadmor Y, King S, Levi A, Davis A, Meir A, Wasserman B, Hirschberg J, Lewinsohn E. Comparative fruit colouration in watermelon and tomato. Food Res Int. 2005;38:837–41.
Bang H, Davis AR, Kim S, Leskovar DI, King SR. Flesh color inheritance and gene interactions among canary yellow, pale yellow, and red watermelon. J Am Soc Hortic Sci. 2010;135(4):362–8.
Wehner TC. Gene list for watermelon. Cucurbit Genet Coop Rep. 2007;30:96–120.
Hashizume T, Shimamoto I, Hirai M. Construction of a linkage map and QTL analysis of horticultural traits for watermelon [Citrullus lanatus (THUNB.) MATSUM & NAKAI] using RAPD, RFLP and ISSR markers. Theor Appl Genet. 2003;106:779–85.
Liu S, Gao P, Wang X, Davis AR, Baloch AM, Luan F. Mapping of quantitative trait loci for lycopene content and fruit traits in citrullus lanatus. Euphytica. 2015;202(3):411–26.
Liu S, Gao P, Zhu Q, Luan F, Davis AR, Wang X. Development of cleaved amplified polymorphic sequence markers and a CAPS-based genetic linkage map in watermelon (Citrullus lanatus [Thunb.] Matsum. And Nakai) constructed using whole-genome resequencing data. Breed Sci. 2016;66:244–59.
Branham S, Vexler L, Meir A, Tzuri G, Frieman Z, Levi A, Wechter WP, Tadmor Y, Gur A. Genetic mapping of a major codominant QTL associated with β-carotene accumulation in watermelon. Mol Breeding. 2017;37(12):146.
Zhang J, Guo SG, Ren Y, Zhang HY, Gong GY, Zhou M, et al. High-level expression of a novel chromoplast phosphate transporter ClPHT4;2 is required for flesh color development in watermelon. New Phytol. 2016;213(3):1208–21.
Qi CZ. A new type of cucumber, Cucumis sativus L. var. xishuangbannanesis Qi et Yuan. Acta Hortic Sin. 1983;10(4):259–63.
Bo KL, Song H, Shen J, Qian CT, Staub JE, Simon PW, et al. Inheritance and mapping of the ore gene controlling the quantity of β-carotene in cucumber (Cucumis sativus L.) endocarp. Mol Breed. 2012;30(1):335–44.
Cuevas HE, Song H, Staub JE, Simon PW. Inheritance of beta-carotene-associated flesh color in cucumber (Cucumis sativus L.) fruit. Euphytica. 2010;171(3):301–11.
Lu HW, Miao H, Tian GL, Wehner TC, Gu XF, Zhang SP. Molecular mapping and candidate gene analysis for yellow fruit flesh in cucumber. Mol Breeding. 2015;35(2):64.
Kooistra E. Inheritance of fruit flesh and skin colours in powdery mildew resistant cucumbers (Cucumis sativus L.). Euphytica. 1971;20(4):521–3.
Yang LM, Koo DH, Li Y, Zhang X, Luan F, Havey MJ, et al. Chromosome rearrangements during domestication of cucumber as revealed from high-density genetic mapping and draft genome assembly. Plant J. 2012;71(6):895–906.
Navazio JP. Utilization of high carotene cucumber germplasm for genetic improvement of nutritional quality. Ph.D. thesis: University of Wisconsin-Madison; 1994.
Royle JF. Illustrations of the botany of the Himalayan Mountains. London: Wm. H. Alland and Co. p. 1835.
Duthie JF. Flora of the upper Gangetic plain, and of the adjacent Siwalik and sub-Himalayan tracts. Superintendent of government printing publication, Calcutta; 1903.
Hooker JD. Cucumis sativus var. sikkimensis cultivated in the Himalaya Mountains. Curtis’ Bot Mag 102: tab. 6206. https://species.wikimedia.org/wiki/Cucumis_sativus_var._sikkimensis, 1876.
Bo KL, Ma Z, Chen JF, Weng Y. Molecular mapping reveals structural rearrangements and quantitative trait loci underlying traits with local adaptation in semi-wild Xishuangbana cucumber (Cucumis sativus L. var. xishuangbannanesis Qi et Yuan). Theor Appl Genet. 2015;128(1):25–39.
Kirkbride JH. Biosystematic monograph of the genus Cucumis (Cucurbitaceae). Parkway Publishers, Boone. 1993;pp 84–88.
de Wilde WJJ, Duyfjes BEE. Cucumis sativus L. forma hardwickii (Royle) W.J. de Wilde and Duyfjes and feral forma sativus. Thai For Bull (Bot). 2010;38:98–107.
Ren Y, Zhang Z, Liu J, Staub JE, Han Y, Cheng Z, et al. An integrated genetic and cytogenetic map of the cucumber genome. PLoS One. 2009;4:e5795.
Miao H, Gu XF, Zhang SP, Zhang ZH, Huang SW, Wang Y, et al. Mapping QTLs for fruit-associated traits in Cucumis sativus L. Sci Agric Sin. 2011;44:5031–40.
Hoffmann AA, Rieseberg LH. Revisiting the impact of inversions in evolution, from population genetic markers to drivers of adaptive shifts and speciation? Annu Rev Ecol Evol Syst. 2008;39:21–42.
Kirkpatrick M. How and why chromosome inversions evolve. PLoS Biol. 2010;8:e1000501.
Lowry DB, Willis JH. A widespread chromosomal inversion polymorphism contributes to a major life-history transition, local adaptation, and reproductive isolation. PLoS Biol. 2010;8(9):e1000500.
Weng Y, Johnson S, Staub JE, Huang SW. An extended intervarietal microsatellite linkage map of cucumber, Cucumis sativus L. HortSci. 2010;45(6):882–6.
Zhang WW, Pan JS, He HL, Zhang C, Li Z, Zhao JL, et al. Construction of a high density integrated genetic map for cucumber (Cucumis sativus L.). Theor Appl Genet. 2012;124(2):249–59.
Nadakuduti SS, Holdsworth WL, Klein CL, Barry CS. KNOX genes influence a gradient of fruit chloroplast development through regulation of GOLDEN2-LIKE expression in tomato. Plant J. 2014;78(6):1022–33.
Powell ALT, Nguyen CV, Hill T, Cheng KL, Figueroa-Balderas R, Aktas H, et al. uniform ripening encodes a Golden 2-like transcription factor regulating tomato fruit chloroplast development. Science. 2012;336(6089):1711–5.
Kerr EA. Green flesh, gf. Rpt Tomato Genet Coop. 1956;6:17.
Akhtar MS, Goldschmidt EE, John I, Rodoni S, Matile P, Grierson D. Altered patterns of senescence and ripening in gf, a stay-green mutant of tomato (Lycopersicon esculentum mill.). J Exp Bot. 1999;50(336):1115–22.
Cheung AY, McNellis T, Piekos B. Maintenance of chloroplast components during chromoplast differentiation in the tomato mutant green flesh. Plant Physiol. 1993;101(4):1223–9.
Roca M, Hornero-Mendez D, Gandul-Rojas B, Minguez-Mosquera MI. Stay-green phenotype slows the carotenogenic process in Capsicum annuum (L.) fruits. J Agric Food Chem. 2006;54(23):8782–7.
Liu Y, Roof S, Ye Z, Barry C, van Tuinen A, Vrebalov J, et al. Manipulation of light signal transduction as a means of modifying fruit nutritional quality in tomato. P Natl Acad Sci USA. 2004;101(26):9897–902.
Kolotilin I, Koltai H, Tadmor Y, Bar-Or C, Reuveni M, Meir A, et al. Transcriptional profiling of high pigment-2dg tomato mutant links early fruit plastid biogenesis with its overproduction of phytonutrients. Plant Physiol. 2007;145(2):389–401.
Rohrmann J, Tohge T, Alba R, Osorio S, Caldana C, McQuinn R, et al. Combined transcription factor profiling, microarray analysis and metabolite profiling reveals the transcriptional control of metabolic shifts occurring during tomato fruit development. Plant J. 2011;68(6):999–1013.
Waters MT, Moylan EC, Langdale JA. GLK transcription factors regulate chloroplast development in a cell-autonomous manner. Plant J. 2008;56(3):432–44.
Pan Y, Bradley G, Pyke K, Ball G, Lu C, Fray R, et al. Network inference analysis identifies an aprr2-like gene linked to pigment accumulation in tomato and pepper fruits. Plant Physiol. 2014;161:1476–85.
Qi JJ, Liu X, Shen D, Miao H, Xie BY, Li XX, et al. A genomic variation map provides insights into the genetic basis of cucumber domestication and diversity. Nat Genet. 2013;45(12):1510–5.
Tang YL, Huang JF, Wang RC. Change law of hyperspectral data in related with chlorophyll and carotenoid in rice at different developmental stages. Rice Sci. 2004;11:274–82.
Cavagnaro PF, Senalik DA, Yang LM, Simon PW, Harkins TT, Kodira CD, et al. Genome-wide characterization of simple sequence repeats in cucumber (Cucumis sativus L.). BMC Genomics. 2010;11(1):569.
Li YH, Yang LM, Pathak M, Li DW, He XM, Weng Y. Fine genetic mapping of cp: a recessive gene for compact (dwarf) plant architecture in cucumber, Cucumis sativus L. Theor Appl Genet. 2011;123(6):973–83.
Broman KW, Wu H, Sen S, Churchill GA. R/QTL: QTL mapping in experimental crosses. Bioinformatics. 2003;19(7):889–90.
Weng Y, Colle M, Wang Y, Yang L, Rubinstein M, Sherman A, et al. QTL mapping in multiple populations and development stages reveals dynamic QTL for fruit size in cucumbers of different market classes. Theor Appl Genet. 2015;128(9):1747–63.
Bo KL, Wang H, Pan YP, Behera TK, Pandey S, Wen CL, et al. SHORT HYPOCOTYL1 encodes a SMARCA3-like chromatin remodeling factor regulating elongation. Plant Physiol. 2016;172:1273–92.
Bo KL, Miao H, Wang M, Xie XX, Song ZC, Xie Q, et al. Novel loci fsd6.1 and Csgl3 regulate ultra-high fruit spine density in cucumber. Theor Appl Genet. 2019;132(1):27–40.
Shang Y, Ma YS, Zhou Y, Zhang HM, Duan LX, Chen HM, et al. Biosynthesis, regulation, and domestication of bitterness in cucumber. Science. 2014;346(6213):1084–8.
Turner SD. Qqman: an R package for visualizing GWAS results using QQ and Manhattan plots. bioRxiv. 2014;(5):005165. https://doi.org/10.1101/00516.
The authors thank the Key Laboratory of Biology and Genetic Improvement of Horticultural Crops, Ministry of Agriculture, China.
This work was supported by the National Key Research and Development Program of China [2018YFD0100702], the Earmarked Fund for Modern Agro-industry Technology Research System [CARS-25] and Central Public-interest Scientific Institution Basal Research Fund (Y2017PT52). The funding bodies had no role in the design of the study or collection, analysis, and interpretation of data and neither in writing the manuscript. Publication costs are defrayed by CARS.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Flesh color of five groups in the F2 population. Figure S2. Flesh color of five groups in the backcross population. a Flesh color distribution in BC1P1 population (P1 is the green flesh parent 9110Gt). b Flesh color distribution in BC1P2 population (P2 is the white flesh parent PI183967). Figure S3. Violin and box plots depicting phenotypic distribution of flesh color, flesh extract color and flesh chlorophyll content in SY2017W experiment. a Violin and box plots depicting flesh color. b Violin and box plots depicting flesh extract color. c Violin and box plots depicting flesh chlorophyll content. Figure S4. Linkage map constructed using a 234 F2 population in SY2016W experiment. Figure S5. Linkage map constructed using a 125 F2 population in BJ2017F experiment. Table S1. Summary of populations used for phenotypic data collection, QTL mapping and GWAS analysis. Table S2. Statistics of two linkage maps. Table S3. Information of markers mapped with 234 9110Gt×PI183967 F2 population. Table S4. Information of markers mapped with 125 9110Gt×PI183967 F2 population. Table S5. Information of the CG lines used in this study. (PDF 663 kb)