Skip to main content

Genetic association analysis in sugarcane (Saccharum spp.) for sucrose accumulation in humid environments in Colombia

Abstract

Background

Sucrose accumulation in sugarcane is affected by several environmental and genetic factors, with plant moisture being of critical importance for its role in the synthesis and transport of sugars within the cane stalks, affecting the sucrose concentration. In general, rainfall and high soil humidity during the ripening stage promote plant growth, increasing the fresh weight and decreasing the sucrose yield in the humid region of Colombia. Therefore, this study aimed to identify markers associated with sucrose accumulation or production in the humid environment of Colombia through a genome-wide association study (GWAS).

Results

Sucrose concentration measurements were taken in 220 genotypes from the Cenicaña’s diverse panel at 10 (early maturity) and 13 (normal maturity) months after planting. For early maturity data was collected during plant cane and first ratoon, while at normal maturity it was during plant cane, first, and second ratoon. A total of 137,890 SNPs were selected after sequencing the 220 genotypes through GBS, RADSeq, and whole-genome sequencing. After GWAS analysis, a total of 77 markers were significantly associated with sucrose concentration at both ages, but only 39 were close to candidate genes previously reported for sucrose accumulation and/or production. Among the candidate genes, 18 were highlighted because they were involved in sucrose hydrolysis (SUS6, CIN3, CINV1, CINV2), sugar transport (i.e., MST1, MST2, PLT5, SUT4, ERD6 like), phosphorylation processes (TPS genes), glycolysis (PFP-ALPHA, HXK3, PHI1), and transcription factors (ERF12, ERF112). Similarly, 64 genes were associated with glycosyltransferases, glycosidases, and hormones.

Conclusions

These results provide new insights into the molecular mechanisms involved in sucrose accumulation in sugarcane and contribute with important genomic resources for future research in the humid environments of Colombia. Similarly, the markers identified will be validated for their potential application within Cenicaña’s breeding program to assist the development of breeding populations.

Peer Review reports

Background

Sugarcane (Saccharum spp.) is a major agronomic and industrial crop used to produce sugar, ethanol, and electricity worldwide and represents an important component of the economy of tropical and subtropical countries [1, 2]. Colombia ranks 14th among the world’s largest sugar producers and 12th among the largest exporters, accounting for 1.1% of the world’s sugar trade [3]. Given the economic importance of the crop, breeding programs around the world have directed their efforts in producing genotypes with high biomass (i.e., tons of cane per hectare or TCH), high sucrose content, and resistance to the most limiting diseases of the crop [4,5,6]. In general, this process involves making decisions based on the phenotypic information collected during the early stages of selection (three stages in most breeding programs) and multi environmental trials (METs), which provide information on genotype adaptability to the environment or to the target population of environments [4,5,6,7].

In Colombia, the sugarcane breeding process is carried out by the Colombian sugarcane research center, Cenicaña, which has been releasing varieties specifically adapted to the agroecological conditions of the Cauca River Valley since 1990. Recently, Cenicaña´s breeding program seeks to make its process more efficient by incorporating molecular markers. These markers have been used mainly with a genome wide association study (GWAS) and with a genomic prediction strategy trying to assist the selection of the best genotypes for complex traits like sucrose content, water stress, biomass production, among several others. In specific, GWAS have allowed an accurate and rapid identification of genes associated with traits of interest within sugarcane breeding programs around the world [5, 8]. For instance, using AFLP, DArT, and SSR technologies, six markers were found to be associated with the resistance to yellow leaf virus (SCYLV) in sugarcane [9] and six with the sugarcane brown rust resistance [10]. In Argentina, a total of 43 DArT markers were significantly associated with biomass (kg Plot− 1) and 38 with sugar content in a panel of 88 sugarcane clones [11]. These markers have also been used as part of a genomic selection strategy by incorporating them within the model training phase at the Estación Experimental Agroindustrial Obispo Colombres (EEAOC) Argentina [12]. Similarly, for stalk diameter, leaf width, leaf length, stalk number, internode length, brix, total weight, dry weight, and water content, a total of 217 SNP-type markers were significantly associated in a panel of 308 sugarcane clones in the USA [13]. They also found that from the 217 SNP-type markers, ten were involved in sugar metabolism, specifically with synthases, hydrolases, and transferases [13]. Finally, the marker G1 has been used as a molecular marker associated with the resistance to the sugarcane orange rust [14]. This marker had the ability to predict 65.8% of resistant phenotypes in the original mapping population [14] and 71.43% in a collection of resistant Brazilian cultivars [15] and therefore, it has been used within the breeding schemes in Brazil.

Sucrose content plays an important role in the genetic improvement of crops. This trait depends on various genetic factors, such as enzymes (i.e., sucrose synthase, sucrose phosphate synthase, and invertase), sucrose transporters, environmental factors, and biological processes (i.e., phosphorylation, hydrolysis, and regulatory mechanisms) [16]. For the accumulation of sucrose, the main enzymes reported are sucrose phosphate synthase (SPS), invertase, and sucrose synthase (SUS) [16]. SPS is involved in sucrose synthesis and accumulation, controlling the flux of carbon into sucrose and the movement of photosynthates from source to sink tissues [17,18,19]. The invertase enzymes generate a concentration gradient from source to sink by irreversibly hydrolyzing sucrose and producing an equimolar mixture of glucose and fructose necessary for cell elongation [17,18,19,20]. Sucrose synthase (SUS) is a glycosyltransferase enzyme involved in the reversible conversion of sucrose into fructose and UDP-glucose, both of which are required for respiration, starch biosynthesis, and fiber development [21, 22].

Sucrose accumulation in sugarcane is influenced by the crop cycle and variety planted [6]. For instance, it has been reported a difference between the plant cane and the ratoons, with the accumulation in the ratoons commonly being higher than the accumulation in the plant cane. However, even when the plant cane tends to have a lower sucrose (%Cane) than the ratoons, the difference in the varieties tend to remain stable. For instance, under the conditions of China the varieties with a high, intermediate, and low sucrose content have kept his rank throughout the plant cane, first, and second ratoon [6]. Similar behavior has been observed in Colombia where the curve of sucrose (% Cane) across plant cane and first ratoon have the same tendency between different varieties regardless of the crop cycle [23]. Additionally, when analyzing the commercial database for the 4 most planted varieties in the Cauca River Valley, Colombia, the sugar yield was stable across the crop cycles with 11.00 ± 0.62, 11.05 ± 0.61, and 11.04 ± 0.57 (%Cane) for plant cane, first, and second ratoon, respectively, suggesting that the differences between genotypes tend to be stable throughout the crop cycles even when the accumulation of sucrose was slightly higher in the first ratoon. This behavior could be attributed to the climatic conditions in Colombia. The agro-industrial sector of sugarcane in Colombia is located mainly in the Cauca River Valley, a region with a well-defined bimodal pattern with two rainy and two dry seasons [24], in which the harvest is carried out throughout the year without zafra conditions [25]. To improve the agronomic practices for the crop, in 2001, Cenicaña classified the fields of the region within 51 agroecological zones that integrate soil texture, climate, and water resources [26]. These agroecological zones have allowed the classification of the region within the semidry, humid, and foothill environments, which help to improve the biomass and sugar content for the varieties planted, reducing the difference between the crop cycles for biomass and sucrose content.

Some studies have concluded that the accumulation of sucrose in sugarcane depends on different environmental factors, with soil moisture being the most important during the ripening process [27, 28]. During this process, a delay in plant growth is generated when a decrease in the soil and plant moisture is observed [6, 8, 28], as well as a decline in stalk fresh weight due to dehydration [29]. The sucrose content, expressed in percent cane (%Cane), is calculated based on fresh weight, which translates to a high variability linked to the soil moisture or daytime stomatal activity [30, 31]. Under a dry season, the decrease in internal humidity reduces sugar consumption for growth, and therefore, an increase in sucrose synthesis due to the conversion of reducing sugars into sucrose is produced [5, 28]. In contrast, during the rainy season or under high soil moisture, there is high vegetative growth and low accumulation of sucrose [32]. Brazil have reported an exponential relationship between the rainfall observed 120 days before harvesting and the total recoverable sugars, suggesting that when the precipitation is above 100 mm, a gradual decline in the total recoverable sugars is observed, mainly in late varieties [32]. In Colombia, a similar pattern was found with the southern zone of the Cauca River Valley, classified as a humid environment [25, 33]. Given that increasing the sucrose content in areas with high humidity can be a challenge due to the impact of environmental conditions on the plant’s physiology and sugar accumulation, this study aimed to identify molecular markers and candidate genes associated with sucrose accumulation and/or production in a diverse population in the humid environments of the Cauca River Valley, Colombia. A schematic representation of the analysis and results is presented in Fig. 1.

Fig. 1
figure 1

A graphical abstract of the methodology, analysis, and results presented in this study. The analysis of phenotypic and genotype data (consensus panel) is presented following a linear mixed model. The results of the association analysis (77 associated markers) are subsequently shown. Finally, the 82 genes identified as candidate genes are presented, whose function is involved in the accumulation and/or production of sucrose

Results

Sequencing data

A total of 51.27, 458.74, and 7,012.48 GB data was obtained after sequencing the 220 genotypes from the diverse panel with GBS, RADSeq, and WGS, respectively. The GBS data (with an average depth of 105X) had 2.97 ± 1.49 million reads per sample on average, with a read length of 71.50 ± 1.40 base pairs. For the paired-end RADSeq (with an average depth of 27.0X) and WGS (with an average depth of 39X), there were 12.11 ± 2.69 and 108.00 ± 133.40 million reads per sample on average, respectively. Similarly, for RADSeq and WGS the read length was higher with 87.50 ± 3.10 bp and 138.00 ± 3.40 bp, respectively. After quality control, the reads from each sequencing technology were mapped to CC 01-1940 sugarcane reference genome [34] with an average mapping percentage of 71.30, 30.50, and 85.10 for GBS, RADSeq and WGS, respectively. Subsequently, the aligned data from each technology were merged within a consensus SNP panel, resulting in a total of 137,889 high-quality SNPs used for further analysis (Fig. 1).

Phenotypic analysis

Based on Bonferroni-adjusted p-values, 3 and 24 data points were identified as outliers for early (10 months after planting) and normal maturity (13 months after planting), respectively (data not shown). Outliers are defined as data points that fall outside of the majority of the data for a particular subject and can mask the real distribution of the data [35, 36]. Because of this, outliers are commonly identified and removed from the analysis to reduce the impact on the estimation process of the traits of interest [35]. The accumulation of sucrose (%Cane) showed a continuous normal distribution with a mean of 12.64% at early maturity and 14.37% at normal maturity. For normal maturity, the best-fitting model includes all random effects, while for early maturity, a model excluding the genotype for crop cycle interaction was selected (Table 1). Similarly, for normal maturity, the model allows for heterogeneous residual variance across crop cycles (\(V\left({\epsilon }_{ijkl}\right)={\sigma }_{e\left(i\right)}^{2}\)), while for early maturity, the residual variance was homogeneous (Table 1). Broad sense heritability was 0.90 and 0.83 for early and normal maturity, respectively. The higher heritability values observed in this study are an indication of the high data quality and the good experimental design implemented, both of which help in reducing residual (\({\sigma }_{e}^{2})\) variance. For early maturity, the genotypic effect contributed a significant proportion of the total variance with a value of 2.58, while for normal maturity, the residual variance had a larger effect with 2.94 (Table 2).

Table 1 Bayesian information criteria (BIC) values for each of the 16 possible models. All models contained the crop cycle as a fixed effect and genotype as a random effect, combined with the other random effects (denoted with an X)
Table 2 Variance components for the accumulation of sucrose (%Cane) at early (10 months after planting) and normal (13 months after planting) maturity

Population structure

The 220 genotypes were classified into four subpopulations (Fig. 2). There were six genotypes (S18, S46, S78, S80, S170, and S171) with a posterior probability indicating that they belong to two or more subpopulations. For these cases, the genotypes were assigned to the subpopulation to which they had the highest membership probability. The first and second subpopulations were mainly composed of hybrids, while the third grouped the genotypes from S. spontaneum, S. officinarum, S. sinense, S. barberi, and some interspecific genotypes (Fig. 2). The fourth subpopulation had only the genotypes S1, S132, and S3, all from the genus Erianthus spp. (Fig. 2).

Fig. 2
figure 2

Population structure analysis of 220 sugarcane genotypes based on 137,889 SNPs. The purple and magenta branches correspond to subpopulations 1 and 2, respectively. The cyan branches refer to subpopulation 3, and the yellow branches refer to subpopulation 4. There are two main branches, the first for subpopulations 1 and 2 (purple and magenta), and the second includes subpopulations 3 and 4 (cyan and yellow). The image was created by the authors using the information deposited in data availability

Association analysis and candidate genes

For early and normal maturity, there were 237 and 183 markers associated with sucrose (%Cane), respectively, with the general model having the highest number of markers for both maturities (Table 3). After removing the most significant markers and reanalyzing the general model, a total of 192 and 103 markers were identified as false positives for early (Fig. 3) and normal maturity (Fig. 4), respectively. There were 4 markers identified as false positives for the 2-dom-ref model at early maturity, while 11 markers were false positives for the 1-dom-ref model at normal maturity (Table 3). For the other genetic models, no false positives were identified (Table 3). After this filtering process, a total of 109 markers, 41 for early maturity and 69 for normal maturity, were retained for further analysis (Table 3, Additional file 2). The general model allowed the identification of the highest number of markers for both early (Fig. 5) and normal (Fig. 6) maturity. When analyzing the associated markers per genetic model, a total of 6 (1_33380771, 10_15141188, 4_10932960, contig_39315_31775, contig_50499_5072, and contig_65540_9617) and 8 (1_63454860, 3_1957031, 4_49815442, 4_55115204, 8_13879677, contig_39799_65, contig_40813_18644, and contig_50499_5072) were found associated between two or more genetic models for early and normal maturity, respectively. For these cases, the model in which the marker has the highest R2 was selected. Therefore, there were 77 markers, 16 for early, 45 for normal, and 16 shared between both maturities (Table 4). Finally, the highest percentage of phenotypic variation explained by each of the 77 markers (R2) was 25.39% (Table 4).

Table 3 Total number of associated markers (unfiltered) and markers identified as true associations (True positives) for the accumulation of sucrose (%Cane) at early (10 months after planting) and normal (13 months after planting) maturity
Fig. 3
figure 3

Quantile-quantile (QQ) plots for sucrose accumulation (%Cane) for the general model with sequential removal of markers based on score (-log10 p-value) for early maturity. The black lines represent the theoretical expected values, and the gray shaded regions represent the 95% confidence interval. The green points represent the model with all SNP markers

Fig. 4
figure 4

Quantile-quantile (QQ) plots for sucrose accumulation (%Cane) for the general model with sequential removal of markers based on score (-log10 p-value) for normal maturity. The black lines represent the theoretical expected values, and the gray shaded regions represent the 95% confidence interval. The green points represent the model with all SNP markers

Fig. 5
figure 5

Manhattan plot for the general model, showing the significance of each SNPs (logarithmic scale) in the accumulation of sucrose at early maturity (10 months after planting) along the monoploid genome of the variety CC 01-1940 (chromosomes numbered 1 to 10, CS indicating contigs and scaffolds). The green line indicates the genome-wide threshold of p = 1 × 10-5. The score is presented on the Y axis (-log10 p-value)

Fig. 6
figure 6

Manhattan plot for the general model, showing the significance of each SNPs (logarithmic scale) in the accumulation of sucrose at normal maturity (13 months after planting) along the monoploid genome of the variety CC 01-1940 (chromosomes numbered 1 to 10, CS indicating contigs and scaffolds). The green line indicates the genome-wide threshold of p = 1 × 10-5. The score is presented on the Y axis (-log10 p-value)

Table 4 Marker effect and pseudo R2 per genetic model for the 77 SNPs significantly associated with sucrose accumulation at early (10 map) and normal (13 map) maturities

Candidate genes

A total of 4757 genes were found within the LD region of the 77 markers, with 1695 having a known function in plants. From the 1695 candidate genes, 82 were associated with sucrose accumulation and/or sucrose production and were within the LD region of 39 of the 77 associated markers (Table 5). For the remaining 38 markers, there were no candidate genes with an annotated function related to sucrose accumulation and/or production, and therefore they were not considered for further analysis. From the 82 candidate genes, eight were involved in sucrose transport, seven were involved in starch and sucrose metabolism in Sorghum sp [37,38,39]. , and 67 were glycosyltransferases, glucosidases, hormones, and transcription factors (Table 5).

Table 5 Candidate genes in the LD region window (i.e., 500 Kb upstream and downstream) from 39 of the 77 markers associated with the accumulation and/or production of sucrose at early (10 map), normal (13 map), or shared between both maturities

Discussion

Phenotypic analysis

Sucrose (%Cane) accumulation at early maturity was found to be 1.73% lower than the accumulation at normal maturity. These results were similar to those reported in Ethiopia [40], Egypt [41], and Brazil [42], where increased age at harvest increases sucrose content. Similar results were observed in a breeding population of 100 varieties which represent four improved generations spanning a 20 year node in China [6]. The differences in sucrose content between both ages are a consequence of physiological changes in the plant during its rapid vegetative growth phase (between 4 and 9 months after planting), where the photoassimilates are mainly used for cell elongation [17, 43]. On the other hand, during the maturation phase (between 9 and 13 months after planting), there is a decrease in the concentration of reducing sugars, invertases, and sucrose synthase (SuSy) and an increase in sucrose phosphate synthase (SPS) [41, 44,45,46]. Invertase activity is elevated in the upper internodes during the phase of rapid vegetative growth, but it starts to decrease when the maturation phase begins [20]. This balance between the enzymes of synthesis (i.e., SPS) and hydrolysis of sucrose (i.e., SuSy, reducing sugars, and invertases) leads to an increase in the accumulation of sugars in the sink organs because of a reduction in the demand for growth in the meristematic tissues [28, 47].

The variation in sucrose accumulation (%Cane) is assumed to result from genotypic effects and the interaction between the genotype and crop cycle [48]. For normal maturity, there was a heteroskedastic residual variance, with plant cane having the highest residual (\({\sigma }_{e\left(PC\right)}^{2}=2.94\)) and the first ratoon having the lowest (\({\sigma }_{e\left(FR\right)}^{2}=0.68\)) (Table 2). Similar results were reported in South Africa and Louisiana, where residual variance was the largest source of phenotypic variation for sucrose during plant cane [49,50,51]. Higher residuals in plant cane could be explained by poor, less-established root systems in comparison to the ratoons, where the plants have strong and well-established root systems, facilitating the establishment, germination, development, and growth of the plant [52].

Population structure

The 220 genotypes were classified into four subpopulations (Fig. 2). The first and second subpopulations were composed of varieties mainly bred in Colombia. This varieties represent the 96.4% of the Colombian varieties planted in commercial fields. When analyzing subpopulation 1, the genotypes MZC 74–275, CP 57–603, and V 71 − 51 constitutes the most common parents, while for subpopulation 2, the common parents were POJ 2878, NA 56–79 and Co 775. The varieties MZC 74–275 and Co 775 are half-sibs with the variety POJ 2878, one of the common parents for subpopulation 2. Similarly, the variety POJ 2364 is within the pedigree of all common ancestors for both subpopulations, revealing the genetic resemblance between the two clusters. The genotypes POJ 2878 and POJ 2364 are widely used around the world because of their high productivity potential and adaptability to different environments [53]. The third subpopulation had the S. spontaneum genotype grouped with S. officinarum, S. sinense, S. barberi, and some interspecific genotypes. This grouping is consistent with the evolutionary relationships of the genus Saccharum spp., where the S. spontaneum has contributed with nearly 39% of the genome for the species S. sinense and S. barberi, and between 15 and 27.5% of the genome of modern cultivars [53,54,55,56,57,58]. The fourth subpopulation had only the genotypes S1, S132, and S3, all of them from the genus Erianthus spp. (Fig. 2). The genus Erianthus is one of the most closely related genera to Saccharum spp. and has been used mainly to increase biomass, vigor, ratooning ability, and tolerance to drought and waterlogging stresses [59]. Similarly, Erianthus spp. along with Saccharum spp., Miscanthus spp., Miscanthidium spp., Pseudosorghum spp., Narenga spp., and the trans-Himalayan species make up the Saccharum complex, which may be involved in the origins of cultivated sugarcane [59,60,61]. Finally, the 220 genotypes were divided into two main branches, one grouping the modern genotypes (light blue and magenta in (Fig. 2) and the other the wild species and interspecific crosses (Fig. 2), suggesting a narrow genetic background for the cultivars held at Cenicaña’s germplasm bank.

Association analysis and candidate genes

Sucrose accumulation in sugarcane is a complex process that includes sucrose synthesis in the source tissues, transport of sugars from source to sink, energy generation, and sugar storage [62, 63]. In this study, a total of 82 candidate genes involved in sucrose accumulation and/or production were found within the LD region of 39 of the 77 associated markers. For the remaining 38 SNPs no candidate genes associated with sucrose production and/or accumulation were identified and for that were not considered for this analysis. From the 82 candidate genes, four key enzymes were identified in the process of sucrose synthesis: sucrose synthase 6 (SUS6), near the marker 4_55767476, insoluble beta-fructofuranosidase 2 C isoenzyme 3 (CIN3), near the marker 1_17142404, alkaline/neutral invertase (CINV2) and Cytosolic invertase 1 and (CINV1) near the marker 5_9031549 (Table 5). The markers 4_55767476 and 1_17142404 were significantly associated at normal maturity, with negative effects (Table 4), while the marker 5_9031549 was significantly associated at early maturity with positive effects (Table 4). Sucrose synthase has implications for cell metabolism, the production of metabolites, and the production of cell wall precursors (UDP-glucose) [64]. SUS6 catalyzes the reversible conversion of sucrose to UDP-glucose and fructose for various metabolic pathways [65], while the enzyme CIN3 cleaves the terminal nonreducing beta-fructofuranoside residues [66]. CINV2 can regulate sugar-mediated root development by controlling sucrose catabolism in root cells, while CINV1 participates in osmotic stress-induced inhibition of lateral root growth by controlling the concentration of hexose in the cells [64, 67]. These enzymes play a central role in the sucrose accumulation process, since sucrose is hydrolyzed by sucrose synthase or invertase in sink tissues and later used for cell growth, development, or sugar storage in the plant [21]. The negative effect for these markers suggests that the presence of these two enzymes affects sucrose accumulation by hydrolyzing this disaccharide into glucose and fructose, which are then absorbed in the sink tissues for consumption in the process of cell growth across the plasma membrane [68].

The second process of importance in sucrose accumulation is the transport of sugars from source to sink. Within this process, two plant gene families were found: sucrose transporters (SUTs) and monosaccharide transporters (MSTs) [69]. For early maturity, MST2, the plastidic glucose transporter At1g05030, and SUT4 were found within the LD region of the markers 1_33380771, 2_29103764, and 4_55115204, respectively (Table 5). At normal maturity, the candidate genes monosaccharide-sensing protein 2 MSSP2, polyol transporter 5 PLT5, and sugar-phosphate/phosphate translocator At1g53660 (GTP2) were found near the markers 4_15026970, 10_26811308 and 4_28389065, respectively (Table 5). On the other hand, the markers 1_61256982 and 9_33845426, shared in both maturations, close to the candidate genes MST and ERD6-like respectively. The genes MST1, MST2, GTP2, At1g05030, and PLT5, are all directly involved in the transport of monosaccharides, required compounds for various processes of plant growth, development [68], and osmotic adjustments (e.g., monosaccharide homeostasis) [70]. The ERD6-like gene may be strongly regulated in response to some developmental and environmental signals, including senescence, pathogen attack, Carbon/Nitrogen starvation, and diurnal changes in transient sugar storage in the vacuole [71]. SUT4 is involved in the transport of disaccharides, especially sucrose, the main photoassimilate transported from the source organ to the sink through the phloem [72, 73]. In this study, the markers 4_55115204, 4_15026970, and 10_26811308, within the LD region of the sugar transporters (Table 5), showed a positive effect (Table 4), that is, the presence of these markers appeared to contribute to an increase in sucrose content in the plant. On the other hand, the markers close to the candidate genes MST2 (1_33380771) and At1g05030 (2_29103764) had a negative effect (Table 4). This is to be expected given that at 10 months (early maturity), the plant is in transition from the rapid growth phase to the maturation phase, using the hexoses present in the plant mainly for vegetative growth and not for accumulation [74].

The candidate genes Alpha-2 C-alpha-trehalose-phosphate synthase 6 (TPS6), Alpha-2 C-alpha-trehalose-phosphate synthase 7 (TPS7), Alpha-2 C-alpha-trehalose-phosphate synthase 11 (TPS11) were found near the markers 2_29103764, 3_43581242, and 4_49815442, respectively (Table 5). These genes are involved in various phosphorylation processes, such as trehalose synthesis and glycolysis, which can regulate sucrose accumulation by acting on the expression of genes that code for carbohydrate metabolism and other metabolic enzymes [75]. For example, the candidate gene TPS, found at both maturities, is involved in the synthesis of trehalose-6-phosphate (T6P), which links growth and development to carbon status by exerting a negative feedback regulation on sucrose levels [76, 77]. However, the pyrophosphate-fructose 6-phosphate 1-phosphotransferase alpha (PFP-ALPHA), Glucose-6-phosphate isomerase 2 C cytosolic (PHI1) and hexokinase 3 (HXK3) genes, near markers 2_52421397, 1_12538127 and 8_1387967, respectively (Table 5), are involved in glycolysis, a required process for energy production [78]. PFP-ALFA, present in early maturity, participates in the breakdown of carbohydrates by catalyzing the reversible interconversion between fructose-6-phosphate and fructose-1,6-bisphosphate in a glycolysis intermediate [79]. PHI1 catalyzes the reversible isomerization of glucose-6-phosphate to fructose-6-phosphate, the second reaction step of glycolysis [80], while HXK3, a transferase found at normal maturity, is involved in glucose phosphorylation to produce glucose-6-phosphate, important for glycolysis and the pentose phosphate pathway [81]. The positive effect presented in the marker 8_13879677 near HXK3 (Table 4) suggests that the presence of this marker increases sucrose by storing glucose in the form of glucose-6-phosphate.

Transcription factors are proteins that bind to DNA to control genes in processes such as pentose-phosphate, glycolysis, and the metabolism of sugars and hormones [82]. The present study identifies two candidate genes: the ethylene-responsive transcription factor ERF12 and the ethylene-responsive transcription factor ERF112 (Table 5). Both genes were found during the maturation phase (i.e., normal maturity), suggesting that they can generate an increase in the enzymes sucrose synthase (SUS), invertase (INV), and sucrose phosphate synthase (SPS) [83]. It has been reported that an increase in the ethylene activity would be associated with the production of lignin and fiber, important compounds for source‒sink regulation and sucrose accumulation [83]. Finally, 16 markers were found associated with the accumulation of sucrose at early and normal maturity. These markers were close to genes that may be linked to the fundamental metabolic processes necessary for the production or accumulation of sucrose in any environment and at any developmental age. In particular, the genes Man9GlcNAc2 alpha-1-2C3-glucosyltransferase (At5g38460), endoglucanase 9 (GLU1), bifunctional fucokinase/fucose pyrophosphorylase (FKGP), ERF112, TPS6, MST1, probable plastidic glucose transporter 1, ERD6-like 4, and SUT4 could be considered housekeeping genes because they encode enzymes necessary for basic cellular metabolism on an ongoing basis [84]. For example, At5g38460 adds the first glucose residue to the lipid-linked oligosaccharide precursor for N-linked glycosylation, which is necessary for glycosylation and protein folding and their subsequent exit from the endoplasmic reticulum [85]. GLU1 affects internode elongation and cell wall components [86], while FKGP is involved in the metabolic reactivation of fucose by salvage paths into NDP-sugars and by converting fucose into GDP-fucose to be substrates for the biosynthesis of wall polysaccharides [87]. The SUT4, ERD6-like 4, and MST1 genes are sugar transporters and are generally considered synergistic genes because when sucrose reaches sink tissues, it is hydrolyzed into glucose and fructose (hexoses), which can be used for growth or storage through a sugar-coupled transporter (STP) [72]. The genes found in this study show that sucrose accumulation involves multiple metabolic pathways, such as trehalose and sucrose starch metabolism, and/or biological processes, such as glycolysis, as well as different sugar transporters, which act synergistically for plant development and for the accumulation of sucrose.

Conclusions

In this study, the trait of sucrose concentration was dissected through a GWAS analysis under 12 genetic models (i.e., general, additive, and the dominant models from 1 to 5 dominant alleles) in a diverse sugarcane population of 220 genotypes. From the analysis, 16, 45, and 16 markers were found to be significantly associated with sucrose concentration at early maturity, normal maturity, and shared between both maturities, respectively. After candidate genes analysis, there were 82 genes within the LD region of only 39 markers with an annotated function involved with sucrose accumulation and/or production. For the remaining 38 markers, there was no annotated gene associated with the trait. Among the 82 candidate genes, 18 were highlighted because they were involved in sucrose hydrolysis (SUS6 and CIN3), sugar transport (i.e., MST1, MST2, PLT5, SUT4), phosphorylation processes (TPS genes), glycolysis (PFP-ALPHA and HXK3), and transcription factors (ERF12, and ERF112). These 39 markers will be helpful to further select favorable genetic resources for the sugarcane breeding process in Colombia. The highlight of this study is the genetic dissection of sucrose, a quantitative trait, in a decaploid organism and the identification of several molecular markers related to the accumulation or production of sucrose at different maturity phases. Finally, these results provide new insights into the molecular mechanisms involved in sucrose accumulation in sugarcane and contribute important genomic resources for future research on sucrose accumulation and/or production in humid environments in Colombia.

Methods

Experimental site

The experiment was planted in the humid environment of the Cauca River valley, Colombia, in fields of La Cabaña sugarcane mill located at 3° 10’ 58.44” N and 76° 21’ 7.599” E. The location had an agroecological zone 6H4, which is characterized by the presence of soils with high humidity (with excesses between 400 and 600 mm/year), with a predominance of clayey soils, fine textures, and poorly aerated conditions [26]. The experiment was conducted during the plant cane, first, and second ratoon. The field has a tropical climate with a total rainfall of 1669 mm, 1677.70 mm, and 1507.50 mm of accumulated precipitation for the plant cane, first, and second ratoon, respectively (Table 6). All harvests were done during the second rainy season of the Cauca River valley in the third week of September 2018 for the plant cane, October 2019 for the first ratoon, and October 2020 for the second ratoon. The average temperature ranged between 23.25 °C for the plant cane and 23.79 °C for the second ratoon, with a relative humidity above 80% for the 3 harvests (Table 6).

Table 6 Climate data observed during the plant cane, first, and second ratoon of the experiment planted in the humid environment of the Cauca River valley in fields of La Cabaña sugarcane mill

Plant material

Cenicaña’s diverse panel of 220 genotypes from its sugarcane breeding program was selected for this study [88]. This diverse panel contained 98 genotypes that represents the genetic diversity of Cenicaña’s germplasm bank [89], 31 genotypes representatives from the wild species Saccharum officinarum, Saccharum barberi, Saccharum sinense, Saccharum spontaneum, and Erianthus spp., 58 genotypes of relevance for the breeding program at Cenicaña differential response to the most crop-limiting pests and diseases in Colombia, 33 genotypes belonging to genetic introductions from other breeding programs around the world, commercial varieties in Colombia, and early selection stage genotypes from the breeding program at Cenicaña [88] (Additional file 1). In total, there were 189 modern genotypes, of which 60% were genotypes bred under Cenicaña’s breeding program (Additional file 1).

Experimental design and data collection

The 220 genotypes were planted under an alpha-lattice design with 3 replications. This design belongs to an incomplete block design (IBD), which is used primarily in the reduction of the experimental error by splitting the total field variability in small incomplete blocks [90,91,92,93], minimizing the unknown variation within each replication [90, 91, 94, 95]. This design has been widely used in different crops such as rice [92], barley [91], wheat [91], and bread wheat [96], achieving great control of the experimental error. To increase the presicion in the control of the random errors, the commercial checks S29, S64, and S177 were replicated 8, 7, and 8 times, respectively, within each replicate block, resulting in 240 experimental units per replicate. In this study, each replication contained 12 blocks with 20 experimental units. The experimental unit was a plot of five rows, each 5 m long, with 1.65 m between rows. Agronomic practices were applied following commercial practices implemented by the sugarcane mill. The sampling unit consisted of two rows, the third and fourth row of each plot, to avoid border effects.

Data was collected on a plot basis for the accumulation of sucrose (%Cane) at 10 and 13 months after planting (map), considered from here on as early and normal maturity, respectively. For early maturity, measurements were taken from the plant cane and first ratoon, while for normal maturity, measurements were taken from the plant cane, first, and second ratoon. Sucrose (%Cane) was measured at early maturity using the CeniAD method [97], while the direct analysis (DAC) method [28] was employed at normal maturity. The CeniAD method consists of the evaluation of sucrose in three internodes, one from the apical, one from the middle, and one from the basal part of 11 mature stalks randomly selected from each genotype. On the other hand, the DAC method evaluates the sucrose content in 11 mature stalks by shredding the complete stalk (node and internode) from each genotype. For both methods, samples were shredded, and the juice was extracted with a hydraulic press. The quantification of sucrose was performed in the extracted juice with a near infrared (NIR) spectroscopy methodology.

Data analysis

Data was analyzed by a combine analysis across crop cycle (i.e., plant cane, first ratoon, and second ratoon) [98]. Within this type of models, some factors get nested within the experimental unit, which becomes in block factors innate to the units of the experiment [98]. The statistical model used was as following:

$${Y}_{ijkl}= {C}_{i}+{R}_{j\left(i\right)}+{B}_{k\left(ij\right)}+{G}_{l }+{I}_{il}+{\epsilon }_{ijkl}$$

where \({Y}_{ijkl}\) corresponds to the sucrose content at early or normal maturity of the genotype \(l\) planted in the incomplete block k nested in replication j of crop cycle i. Similarly, Ci corresponds to the effect of crop cycle \(i\), Rj(i) to the replication \(j\) nested within crop cycle \(i\), \({B}_{k\left(ij\right)}\) to the incomplete block \(k\) nested in replication \(j\) and crop cycle \(i\), \({G}_{l}\) to genotype \(l\), \({I}_{ij}\) to the interaction between genotype \(l\) and crop cycle i, and \({\epsilon }_{ijkl}\) to the random residual [98]. All effects were assumed to be random except for the crop cycle (\({C}_{i}\)). For this experiment, the genotypes comprise a random and representative selection for the sugarcane breeding pool used in Cenicaña (Cenicaña´s sugarcane diverse panel [88]) and for that they were assumed to be random effects. Outliers were detected by estimating the probability of obtaining a larger absolute value for each residual using a t-distribution [99, 100]. Subsequently, each p-value was adjusted with a Bonferroni correction at a 2% level of significance [101, 102]. After removing outliers, 16 models were evaluated by making all possible combinations between the random effects and by testing for homogeneity (\(V\left({\epsilon }_{ijkl}\right)={\sigma }_{e}^{2}\)) or heterogeneity (\(V\left({\epsilon }_{ijkl}\right)={\sigma }_{e\left(i\right)}^{2}\)) of the residual variance. To identify the best fitting model, the Bayesian Information Criterion (BIC) [103] was used. The Bayesian information criterium (BIC) evaluates models in terms of their posterior probabilities, penalizing the models based on the number of parameters it includes [104,105,106], with higher penalty for the models that includes a higher number of parameters [107]. Therefore, the lower the BIC value, the better the model balances the goodness of fit with parsimony (i.e., simplicity) [104,105,106]. With the selected model, the best linear unbiased predictors (BLUP) for the genotypes were obtained for further analysis [108]. Broad-sense heritability was calculated following Cullis heritability for unbalanced data, which takes into account the mean variance of the difference between two BLUPs and the genotypic variance [109]. All analyses were carried out using the Proc Mixed procedure of SAS (SAS Institute, Cary, NC).

DNA extraction and sequencing

Genomic DNA was extracted from each of the 220 genotypes by following the phenol‒chloroform protocol [110]. The DNA concentration was determined with a Thermo Fisher ® Nanodrop 2000 spectrophotometer, while the integrity was verified using a 0.8% agarose gel. The genomic DNA was sequenced using Genotype-By-Sequencing (GBS) [111], Restriction-site Associated DNA Sequencing (RADSeq) [112], and Whole Genome Sequencing (WGS) strategies. DNA was digested with the restriction enzyme Pst I for the single-end GBS technique [111] and Eco RI for the paired-end RADseq technique [112]. DNA libraries, for both techniques, were constructed following sequencing service providers and the sequence process was carried out using an Illumina HiSeq 2000 system (Illumina, San Diego, California, USA). For WGS, DNA libraries were constructed following Novogene sequence provider (Novogene Bioinformatics Technology Co. Ltd). Briefly, the genomic DNA was randomly sheared into short fragments (~ 150 bp). Each fragment was end-repaired, A-tailed, and ligated with Illumina adapters. The fragments with adapters were PCR amplified, size selected, purified, and sequenced using a System Illumina NovaSeq platform (HWI-ST1276).

Raw reads quality was assessed for each strategy using cutadapt [113] and FastQC [114] by removing reads with a Phred score lower than 30. Cleaned data from each strategy were mapped to the CC 01-1940 monoploid reference genome [115] using bowtie 2.2.5 [116]. Genotyping and variant detection were performed using the “MultisampleVariantsDetector” of NGSEP 4.0.2 [117]. SNP calling was performed through the “VCFFilter” module of NGSEP v 4.0.2 [117], assuming a ploidy of 10, a minimum allelic frequency (i.e., MAF) of 1%, a calling rate of 75%, a minimum sequencing depth of 30X (at least 30 reads per position in the genome), a minimum genotyping quality of 30 on the Phred scale, a distance between markers of 1 bp, and keeping only biallelic markers. Finally, SNP markers called from GBS, RADSeq, and WGS were merged using SAMtools v.1.10 [118], leaving a total of 137,889 SNPs for each one of the 220 genotypes.

Population structure, kinship, and GWAS analysis

The genetic structure for the 220-genotype population was estimated with a discriminant analysis of principal components (DAPC) with the adegenet package [119] implemented in R [120]. The optimal number of clusters or subpopulations was identified using the Bayesian Information Criterium [103]. The kinship matrix was calculated according to the VanRaden method [121], implemented in the statistical package GWASpoly 2.10 [122]. Genome-wide association analyses were performed with the GWASpoly 2.10 package [122]. Within this package, 12 genetic models were evaluated: general, additive, dominant simplex (1-dom-ref and 1-dom-alt), dominant duplex (2-dom-ref and 2-dom-alt), dominant triplex (3-dom-ref and 3-dom-alt), dominant tetraplex (4-dom-ref and 4-dom-alt), and dominant pentaplex (5-dom-ref and 5-dom-alt) [99]. Briefly, the general model allows each marker allele dosage to be arbitrary and statistically equivalent, while the additive model assumes that the effect of the marker is proportional to the dosage of the alternative allele [122]. For the dominant models, the presence of at least one, two, three, four, or five copies of the alternative (alt) or reference (ref) alleles generates the same effect as the homozygous genotype for this allele [122]. SNPs significantly associated with sucrose (%Cane) were identified using the false discovery rate (FDR) test with a significance level of 5% [123].

The presence of false positives in each model was visually determined through quantile‒quantile (QQ) plots, which assume a uniform distribution of p-values under the null hypothesis of no association [124,125,126]. For each genetic model, the most significant markers were progressively removed before reanalyzing the genetic association (GWAS). Then, a new QQ-plot was generated and deviations from the null hypothesis were analyzed. This process was repeated until there were only markers that did not deviate significantly from the null hypothesis. At this point, the remaining markers (i.e., those that fail to reject the null hypothesis) were identified as false positives and removed from the analysis [127]. Similarly, the proportion of phenotypic variance explained by each significant SNP (R2) was estimated using the Cox & Snell pseudo-R2 [128]. This R2 is based on the relationship between the likelihood of the null model (the intercept-only model) and the full model [129]. Finally, the linkage disequilibrium (LD) region was estimated at an average of 500 kb for all chromosomes (data not shown), and therefore, candidate genes were searched for in the region spanning 500 kb upstream and downstream of the marker. Once the genes were identified, they were filtered by selecting genes with known functions in plants. The function of each gene was manually evaluated, and only those involved in the production or accumulation of sucrose were selected.

Data availability

The datasets generated and/or analyzed during the current study are available in the repository https://github.com/fsilvaag/GWAS-Sugarcane-Sucrose.git.

References

  1. Heller-Uszynska K, Uszynski G, Huttner E, Evers M, Carlig J, Caig V, Aitken K, Jackson P, Piperidis G, Cox M. Diversity arrays technology effectively reveals DNA polymorphism in a large and complex genome of sugarcane. Mol Breeding. 2011;28:37–55.

    Article  CAS  Google Scholar 

  2. OECD/FAO. OECD-FAO Agricultural Outlook 2019–2028. Paris/Food and Agriculture Organization of the United Nations. Rome: OECD Publishing; 2019.

    Google Scholar 

  3. Asociación de Cultivadores de Caña de Azúcar de Colombia A. Informe Anual 2021–2022. Un dulce sabor que se transforma. Cali: Asocaña; 2022. p. 77.

    Google Scholar 

  4. Cassalett Davila C, Rangel Jimenez H. Mejoramiento Genético. In: El cultivo de la cana en la zona azucarera de Colombia 1995.

  5. Meena MR, Kumar R, Chinnaswamy A, Karuppaiyan R, Kulshreshtha N, Ram B. Current breeding and genomic approaches to enhance the cane and sugar productivity under abiotic stress conditions. 3 Biotech. 2020;10:1–18.

    Article  Google Scholar 

  6. Zhao Y, Liu J, Huang H, Zan F, Zhao P, Zhao J, Deng J, Wu C. Genetic improvement of sugarcane (Saccharum spp.) contributed to high sucrose content in China based on an analysis of newly developed varieties. Agriculture. 2022;12(11):1789.

    Article  CAS  Google Scholar 

  7. Barreto FZ, Rosa JRBF, Balsalobre TWA, Pastina MM, Silva RR, Hoffmann HP, de Souza AP, Garcia AAF, Carneiro MS. A genome-wide association study identified loci for yield component traits in sugarcane (Saccharum spp). PLoS ONE. 2019;14(7):e0219843.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Meena MR, Appunu C, Arun Kumar R, Manimekalai R, Vasantha S, Krishnappa G, Kumar R, Pandey S, Hemaprabha G. Recent advances in sugarcane genomics, physiology, and phenomics for superior agronomic traits. Front Genet. 2022;13:854936.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Débibakas S, Rocher S, Garsmeur O, Toubi L, Roques D, D’Hont A, Hoarau J-Y, Daugrois J-H. Prospecting sugarcane resistance to sugarcane yellow leaf virus by genome-wide association. Theor Appl Genet. 2014;127(8):1719–32.

    Article  PubMed  PubMed Central  Google Scholar 

  10. Gouy M, Rousselle Y, Chane AT, Anglade A, Royaert S, Nibouche S, Costet L. Genome wide association mapping of agro-morphological and disease resistance traits in sugarcane. Euphytica. 2015;202(2):269–84.

    Article  Google Scholar 

  11. Racedo J, Gutiérrez L, Perera MF, Ostengo S, Pardo EM, Cuenya MI, Welin B, Castagnaro AP. Genome-wide association mapping of quantitative traits in a breeding population of sugarcane. BMC Plant Biol. 2016;16(1):142.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Ostengo S, Serino G, Perera MF, Racedo J, Mamaní González SY, Yáñez Cornejo F, Cuenya MI. Sugarcane breeding, germplasm development and supporting genetic research in Argentina. Sugar Tech. 2022;24(1):166–80.

    Article  Google Scholar 

  13. Yang X, Luo Z, Todd J, Sood S, Wang J. Genome-wide association study of multiple yield traits in a diversity panel of polyploid sugarcane (Saccharum spp). Plant Genome. 2020;13(1):e20006.

    Article  CAS  PubMed  Google Scholar 

  14. Yang X, Islam MS, Sood S, Maya S, Hanson EA, Comstock J, Wang J. Identifying quantitative trait loci (QTLs) and developing diagnostic markers linked to orange rust resistance in sugarcane (Saccharum spp). Front Plant Sci. 2018;9:350.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Fier Í, Balsalobre TWA, Chapola RG, Hoffmann HP, Carneiro MS. Field resistance and molecular detection of the orange rust resistance gene linked to G1 marker in Brazilian cultivars of sugarcane. Summa Phytopathologica. 2020;46(2):92–7.

    Article  Google Scholar 

  16. Winter H, Huber SC. Regulation of sucrose metabolism in higher plants: localization and regulation of activity of key enzymes. CRC Crit Rev Plant Sci. 2000;19(1):31–67.

    Article  CAS  Google Scholar 

  17. Lingle SE, Smith RC. Sucrose metabolism related to growth and ripening in sugarcane internodes. Crop Sci. 1991;31(1):172–7.

    Article  CAS  Google Scholar 

  18. Verma AK, Upadhyay S, Verma PC, Solomon S, Singh SB. Functional analysis of sucrose phosphate synthase (SPS) and sucrose synthase (SS) in sugarcane (Saccharum) cultivars. Plant Biol. 2011;13(2):325–32.

    Article  CAS  PubMed  Google Scholar 

  19. Zhu YJ, Komor E, Moore PH. Sucrose accumulation in the sugarcane stem is regulated by the difference between the activities of soluble acid invertase and sucrose phosphate synthase. Plant Physiol. 1997;115(2):609–16.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Verma I, Roopendra K, Sharma A, Chandra A, Kamal A. Expression analysis of genes associated with sucrose accumulation and its effect on source–sink relationship in high sucrose accumulating early maturing sugarcane variety. Physiol Mol Biology Plants. 2019;25(1):207–20.

    Article  CAS  Google Scholar 

  21. Stein O, Granot D. An overview of sucrose synthases in plants. Front Plant Sci. 2019;10:95.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Thirugnanasambandam PP, Mason PJ, Hoang NV, Furtado A, Botha FC, Henry RJ. Analysis of the diversity and tissue specificity of sucrose synthase genes in the long read transcriptome of sugarcane. BMC Plant Biol. 2019;19(1):1–14.

    Article  Google Scholar 

  23. Quevedo YMA, Cepeda AMQ, López MÁ. Fisiología aplicada a la producción de la caña de azúcar en Colombia. Agroindustria De La caña de azúcar en Colombia Documentos De Divulgacion técnica. Edited by Cenicaña: Cenicaña; 2023.

    Google Scholar 

  24. Cortés E, Barrios C. Nuevo calendario de temporadas secas y lluviosas en El valle del río Cauca. Carta Trimestral Cenicaña. 2010;32(3–4):4–5.

    Google Scholar 

  25. Chica Ramirez HA, Gómez Gil LF, Bravo Bastidas JJ, Carbonell González JA, Peña Quiñones AJ. Site-specific intra-annual rainfall patterns: a tool for agricultural planning in the Colombian sugarcane production zone. Theoret Appl Climatol. 2021;146(1–2):543–54.

    Article  Google Scholar 

  26. Carbonell González J. Zonificación agroecológica para el cultivo de la caña de azúcar en el valle del río Cauca (cuarta aproximación): principios metodológicos y aplicaciones. In.: cenicaña (Colombia); 2011.

  27. Donaldson R, Redshaw K, Rhodes R, Antwerpen, Rv. Season effects on productivity of some commercial South African sugarcane cultivars, II: Trash production. In: Proceedings of the Annual Congress-South African Sugar Technologists’ Association: 2008. South African Sugar Technologists’ Association: 528–538.

  28. Larrahondo JE, Villegas F. Control y características de maduración. El Cultivo de la Caña de Azúcar en la Zona Azucarera de Colombia CENICAÑA Colombia 1995:297–313.

  29. Robertson M, Donaldson R. Changes in the components of cane and sucrose yield in response to drying-off of sugarcane before harvest. Field Crops Res. 1998;55(3):201–8.

    Article  Google Scholar 

  30. Li Liu D, Helyar K. Simulation of seasonal stalk water content and fresh weight yield of sugarcane. Field Crops Res. 2003;82(1):59–73.

    Article  Google Scholar 

  31. Gilbert RA, Shine J Jr, Miller JD, Rice RW, Rainbolt C. The effect of genotype, environment and time of harvest on sugarcane yields in Florida, USA. Field Crops Res. 2006;95(2–3):156–70.

    Article  Google Scholar 

  32. Cardozo NP, Sentelhas PC. Climatic effects on sugarcane ripening under the influence of cultivars and crop age. Scientia Agricola. 2013;70:449–56.

    Article  Google Scholar 

  33. Muñoz CAU, Trujillo FJV. Effects of meteorological variables on sugarcane ripening in the Cauca river valley, Colombia. Pesquisa Agropecuária Trop 2020, 50.

  34. Trujillo-Montenegro JH, Rodriguez Cubillos MJ, Loaiza CD, Quintero M, Espitia-Navarro HF, Salazar Villareal FA, Viveros Valens CA, Gonzalez Barrios AF, De Vega J, Duitama J, et al. Unraveling the genome of a high Yielding Colombian sugarcane hybrid. Front Plant Sci. 2021;12:694859.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Romão X, Vasanelli E. Identification and Processing of outliers. Non-destructive in situ Strength Assessment of concrete: practical application of the RILEM TC 249-ISC recommendations. Springer; 2021. pp. 161–80.

  36. Boukerche A, Zheng L, Alfandi O. Outlier detection: methods, models, and classification. ACM Comput Surv (CSUR). 2020;53(3):1–37.

    Article  Google Scholar 

  37. Kanehisa M. Toward understanding the origin and evolution of cellular organisms. Protein Sci. 2019;28(11):1947–51.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  38. Kanehisa M, Furumichi M, Sato Y, Ishiguro-Watanabe M, Tanabe M. KEGG: integrating viruses and cellular organisms. Nucleic Acids Res. 2021;49(D1):D545–51.

    Article  CAS  PubMed  Google Scholar 

  39. Kanehisa M, Goto S. KEGG: kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28(1):27–30.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  40. Urgesa GD, Keyata EO. Effect of harvesting ages on yield and yield components of sugar cane varieties cultivated at finchaa sugar factory, oromia, Ethiopia. Int J Food Sci. 2021;2021:1–6.

    Article  Google Scholar 

  41. Ahmed A, Awadalla A. Effect of Harvesting Age on Yield, Yield Components and Quality of some promising sugarcane varieties. J Plant Prod. 2016;7(12):1501–7.

    Google Scholar 

  42. Marin FR, Rattalino Edreira JI, Andrade JF, Grassini P. Sugarcane yield and yield components as affected by harvest time. Sugar Tech. 2021;23:819–26.

    Article  Google Scholar 

  43. Sanghera GS, Malhotra P, Singh H, Bhatt R. Climate change impact in sugarcane agriculture and mitigation strategies. Harnessing Plant Biotechnol Physiol Stimulate Agricultural Growth. 2019;1(1):99–115.

    Google Scholar 

  44. Hagos H, Mengistu L, Mequanint Y. Determining optimum harvest age of sugarcane varieties on the newly establishing sugar project in the tropical areas of Tendaho, Ethiopia. Adv crop Sci Technol. 2014;2(5):156–9.

    Google Scholar 

  45. Robertson M, Muchow R, Wood A, Campbell J. Accumulation of reducing sugars by sugarcane: effects of crop age, nitrogen supply and cultivar. Field Crops Res. 1996;49(1):39–50.

    Article  Google Scholar 

  46. Venkataramana S, Naidu KM, Singh S. Invertases and growth factors dependent sucrose accumulation in sugarcane. Plant Sci. 1991;74(1):65–72.

    Article  CAS  Google Scholar 

  47. Alexander AG. Sugarcane physiology, a comprehensive study of the Saccharum source-to-sink system. Elsevier Scientific Publishing Co.; 1973.

  48. Meena M, Karuppiayan R, Ram B, Kumar R, Kulshreshtha N. Genotypes x environment interactions and stability analysis of sugarcane clones (Saccharum spp.) by AMMI model in sub-tropical regions of India. Indian J Genet Plant Breed. 2017;77(04):540–6.

    Article  Google Scholar 

  49. Sengwayo S, Zhou M, Labuschagne M. Trends and magnitudes of genotype× environment interaction variance components for yield, quality and agronomic traits among coastal short cycle sugarcane breeding populations. South Afr J Plant Soil. 2018;35(1):41–50.

    Article  Google Scholar 

  50. Milligan S, Gravois KA, Bischoff KP, Martin F. Crop effects on broad-sense heritabilities and genetic variances of sugarcane yield components. Crop Sci. 1990;30(2):344–9.

    Article  Google Scholar 

  51. Zhou M, Joshi SV, Maritz T. Trends and implications of genotype by environment interaction in South African sugarcane breeding. J Crop Improv. 2012;26(2):163–76.

    Article  Google Scholar 

  52. Xu F, Wang Z, Lu G, Zeng R, Que Y. Sugarcane ratooning ability: research status, shortcomings, and prospects. Biology. 2021;10(10):1052.

    Article  PubMed  PubMed Central  Google Scholar 

  53. Orozco H, Quemé JL, Ovalle W, Rosales F. Mejoramiento genético de la caña de azúcar; 2015.

  54. Ali A, Pan Y-B, Wang Q-N, Wang J-D, Chen J-L, Gao S-J. Genetic diversity and population structure analysis of Saccharum and Erianthus genera using microsatellite (SSR) markers. Sci Rep. 2019;9(1):1–10.

    Google Scholar 

  55. Cheavegatti-Gianotto A, De Abreu HMC, Arruda P, Bespalhok Filho JC, Burnquist WL, Creste S, di Ciero L, Ferro JA, de Oliveira Figueira AV, de Sousa Filgueiras T. Sugarcane (Saccharum X officinarum): a reference study for the regulation of genetically modified cultivars in Brazil. Trop Plant Biology. 2011;4:62–89.

    Article  CAS  Google Scholar 

  56. Xiong H, Chen Y, Gao S-J, Pan Y-B, Shi A. Population structure and Genetic Diversity Analysis in sugarcane (Saccharum spp. hybrids) and six related Saccharum Species. Agronomy. 2022;12(2):412.

    Article  CAS  Google Scholar 

  57. Paterson AH, Moore PH, Tew TL. The gene pool of Saccharum species and their improvement. Genomics Saccharinae 2013:43–71.

  58. Piperidis G, Piperidis N, D’Hont A. Molecular cytogenetic investigation of chromosome composition and transmission in sugarcane. Mol Genet Genomics. 2010;284(1):65–73.

    Article  CAS  PubMed  Google Scholar 

  59. Wu J, Huang Y, Lin Y, Fu C, Liu S, Deng Z, Li Q, Huang Z, Chen R, Zhang M. Unexpected inheritance pattern of Erianthus arundinaceus chromosomes in the intergeneric progeny between Saccharum spp. and Erianthus Arundinaceus. PLoS ONE. 2014;9(10):e110390.

    Article  PubMed  PubMed Central  Google Scholar 

  60. Evans DL, Joshi SV. On the validity of the saccharum complex and the saccharinae subtribe: a re-assesment. BioRxiv 2020:2020.2007. 2029.226753.

  61. Mukherjee SK. Revision of the genus Saccharum Linn. Bull Bot Soc Bengal. 1954;8:143–8.

    Google Scholar 

  62. McCormick A, Watt D, Cramer M. Supply and demand: sink regulation of sugar accumulation in sugarcane. J Exp Bot. 2009;60(2):357–64.

    Article  CAS  PubMed  Google Scholar 

  63. Wang L, Cui N, Zhang K-Y, Fan H-Y, Li T-L. Research advance of sucrose phosphate synthase (SPS) in higher plant. Int J Agric Biology 2013, 15(6).

  64. Barratt DP, Derbyshire P, Findlay K, Pike M, Wellner N, Lunn J, Feil R, Simpson C, Maule AJ, Smith AM. Normal growth of Arabidopsis requires cytosolic invertase but not sucrose synthase. Proceedings of the National Academy of Sciences 2009, 106(31):13124–13129.

  65. Paysan-Lafosse T, Blum M, Chuguransky S, Grego T, Pinto BL, Salazar Gustavo A, Bileschi Maxwell L, Bork P, Bridge A, Colwell L, et al. InterPro in 2022. Nucleic Acids Res. 2022;51(D1):D418–27.

    Article  PubMed Central  Google Scholar 

  66. Lallemand F, Martin-Magniette ML, Gilard F, Gakière B, Launay‐Avon A, Delannoy É, Selosse MA. In situ transcriptomic and metabolomic study of the loss of photosynthesis in the leaves of mixotrophic plants exploiting fungi. Plant J. 2019;98(5):826–41.

    Article  CAS  PubMed  Google Scholar 

  67. Wang Y-J, Zhen X-H, Zhou Y-J, Wang Y-L, Hou J-Y, Wang X, Li R-M, Liu J, Hu X-W, Geng M-T. MeNINV1: an alkaline/neutral invertase gene of manihot esculenta, enhanced sucrose catabolism and promoted plant vegetative growth in transgenic arabidopsis. Plants. 2022;11(7):946.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  68. Geiger D. Plant glucose transporter structure and function. Pflügers Archiv-European J Physiol. 2020;472(9):1111–28.

    Article  CAS  Google Scholar 

  69. Lalonde S, Frommer WB. SUT sucrose and MST monosaccharide transporter inventory of the Selaginella genome. Front Plant Sci. 2012;3:24.

    Article  PubMed  PubMed Central  Google Scholar 

  70. Wormit A, Trentmann O, Feifer I, Lohr C, Tjaden J, Meyer S, Schmidt U, Martinoia E, Neuhaus HE. Molecular identification and physiological characterization of a novel monosaccharide transporter from Arabidopsis involved in vacuolar sugar transport. Plant Cell. 2006;18(12):3476–90.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  71. Büttner M. The monosaccharide transporter (-like) gene family in Arabidopsis. FEBS Lett. 2007;581(12):2318–24.

    Article  PubMed  Google Scholar 

  72. Lemoine R, La Camera S, Atanassova R, Deedaldeechamp F, Allario T, Pourtau N, Bonnemain JL, Laloi M, Coutos-Theevenot P, Maurousset L et al. Source-to-sink transport of sugar and regulation by environmental factors. Front Plant Sci 2013, 4.

  73. Wang M, Li AM, Liao F, Qin CX, Chen ZL, Zhou L, Li YR, Li XF, Lakshmanan P, Huang DL. Control of sucrose accumulation in sugarcane (Saccharum spp. hybrids) involves miRNA-mediated regulation of genes and transcription factors associated with sugar metabolism. GCB Bioenergy. 2022;14(2):173–91.

    Article  Google Scholar 

  74. Aluko OO, Li C, Wang Q, Liu H. Sucrose utilization for improved crop yields: a review article. Int J Mol Sci. 2021;22(9):4704.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  75. Papini-Terzi FS, Rocha FR, Vêncio RZ, Felix JM, Branco DS, Waclawovsky AJ, Del Bem LE, Lembke CG, Costa MD, Nishiyama MY. Sugarcane genes associated with sucrose content. BMC Genomics. 2009;10(1):1–21.

    Article  Google Scholar 

  76. Nunes C, Schluepmann H, Delatte TL, Wingler A, Silva AB, Fevereiro PS, Jansen M, Fiorani F, Wiese-Klinkenberg A, Paul MJ. Regulation of growth by the trehalose pathway: relationship to temperature and sucrose. Plant Signal Behav. 2013;8(12):e26626.

    Article  PubMed  PubMed Central  Google Scholar 

  77. Lunn JE, Delorge I, Figueroa CM, Van Dijck P, Stitt M. Trehalose metabolism in plants. Plant J. 2014;79(4):544–67.

    Article  CAS  PubMed  Google Scholar 

  78. Fernie AR, Carrari F, Sweetlove LJ. Respiratory metabolism: glycolysis, the TCA cycle and mitochondrial electron transport. Curr Opin Plant Biol. 2004;7(3):254–61.

    Article  CAS  PubMed  Google Scholar 

  79. Duan E, Wang Y, Liu L, Zhu J, Zhong M, Zhang H, Li S, Ding B, Zhang X, Guo X. Pyrophosphate: fructose-6-phosphate 1-phosphotransferase (PFP) regulates carbon metabolism during grain filling in rice. Plant Cell Rep. 2016;35:1321–31.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  80. Liu H-C, Chen H-C, Huang T-H, Lue W-L, Chen J, Suen D-F. Cytosolic phosphoglucose isomerase is essential for microsporogenesis and embryogenesis in Arabidopsis. Plant Physiol. 2023;191(1):177–98.

    Article  CAS  PubMed  Google Scholar 

  81. Jang J-C, Leon P, Zhou L, Sheen J. Hexokinase as a sugar sensor in higher plants. Plant Cell. 1997;9(1):5–19.

    CAS  PubMed  PubMed Central  Google Scholar 

  82. Franco-Zorrilla JM, López-Vidriero I, Carrasco JL, Godoy M, Vera P, Solano R. DNA-binding specificities of plant transcription factors and their potential to define target genes. Proceedings of the National Academy of Sciences 2014, 111(6):2367–2372.

  83. Chen Z, Qin C, Wang M, Liao F, Liao Q, Liu X, Li Y, Lakshmanan P, Long M, Huang D. Ethylene-mediated improvement in sucrose accumulation in ripening sugarcane involves increased sink strength. BMC Plant Biol. 2019;19(1):1–17.

    Article  Google Scholar 

  84. Silver N, Best S, Jiang J, Thein SL. Selection of housekeeping genes for gene expression studies in human reticulocytes using real-time PCR. BMC Mol Biol. 2006;7(1):1–9.

    Article  Google Scholar 

  85. Guerin ME. Estudios moleculares sobre la UDP-GLC: Glicoproteína glucosiltransferasa: relación estructura-función. Universidad De Buenos Aires. Facultad de Ciencias Exactas y Naturales; 2002.

  86. Zhou H-L, He S-J, Cao Y-R, Chen T, Du B-X, Chu C-C, Zhang J-S, Chen S-Y. OsGLU1, a putative membrane-bound endo-1, 4-ß-d-glucanase from rice, affects plant internode elongation. Plant Mol Biol. 2006;60:137–51.

    Article  CAS  PubMed  Google Scholar 

  87. Barnes WJ, Anderson CT. Release, recycle, rebuild: cell-wall remodeling, autodegradation, and sugar salvage for new wall biosynthesis during plant development. Mol Plant. 2018;11(1):31–46.

    Article  CAS  PubMed  Google Scholar 

  88. Jaimes H, Londoño A, Saavedra-Diaz C, Trujillo‐Montenegro JH, López‐Gerena J, Riascos JJ, Aguilar FS. Sequencing vs. amplification for the estimation of allele dosages in sugarcane (Saccharum spp). Appl Plant Sci 2024:e11574.

  89. Salazar V, Victoria K, Viveros C, Rangel J. Caracterización y agrupamiento con fines de mejoramiento genético, variedades de caña evaluadas en zona seca-semiseca. Congreso Atalac-Tecnicaña. In: VIII Congreso de la Asociación de Técnicos Azucareros de Latinoamérica y el Caribe y IX Congreso de la Asociación Colombiana de Técnicos de la caña de Azúcar 14 Septiembre 2012; Santiago de Cali. Tecnicaña: 2012: 15–24.

  90. Patterson HD, Williams ER. New Class of Resolvable Incomplete Block designs. Biometrika. 1976;63(1):83–92.

    Article  Google Scholar 

  91. Patterson H, Hunter E. The efficiency of incomplete block designs in National List and recommended list cereal variety trials. J Agricultural Sci. 1983;101(2):427–33.

    Article  Google Scholar 

  92. Kashif M, Khan M, Arif M, Anwer M, Ijaz M. Efficiency of alpha Lattice Design in Rice Field trials in Pakistan. J Sci Res 2011, 3(1).

  93. Piepho HP, Buchse A, Truberg B. On the use of multiple lattice designs and alpha-designs in plant breeding trials. Plant Breeding. 2006;125(5):523–8.

    Article  Google Scholar 

  94. Patterson HD, Thompson R. Recovery of Inter-block Information when Block sizes are unequal. Biometrika. 1971;58(3):545–54.

    Article  Google Scholar 

  95. Ismail M, Noureldin N, Saudy H, Mohamed M, Fares W. Using of Alpha Lattice Design for Increasing Precision of Faba Bean yield trials. J Environ Sci. 2018;44(2):81–97.

    Article  Google Scholar 

  96. Abd El-Shafi MA. Efficiency of classical complete and incomplete block designs in yield trial on bread wheat genotypes. Res J Agric Biol Sci. 2014;10(1):17–23.

    Google Scholar 

  97. Amaya A, Larrahondo J, OLIVER M. A new proposal for clonal evaluation based on direct analysis and NIR spectroscopy. In: Proc Int Soc Sugar Cane Technol: 2001. Citeseer: 524–526.

  98. Piepho HP, Mohring J, Schulz-Streeck T, Ogutu JO. A stage-wise approach for the analysis of multi-environment trials. Biom J. 2012;54(6):844–60.

    Article  PubMed  Google Scholar 

  99. Student. The probable error of a Mean. Biometrika. 1908;6(1):1–25.

    Article  Google Scholar 

  100. Jolicoeur P, Jolicoeur P. The distribution of Student’St. Introduction Biometry 1999:36–7.

  101. Bonferroni C. Teoria Statistica delle classi e calcolo delle probabilita. Pubblicazioni Del R Istituto Superiore Di Scienze Economiche E Commericiali Di Firenze. 1936;8:3–62.

    Google Scholar 

  102. Bernal-Vasquez A-M, Utz H-F, Piepho H-P. Outlier detection methods for generalized lattices: a case study on the transition from ANOVA to REML. Theor Appl Genet. 2016;129:787–804.

    Article  PubMed  Google Scholar 

  103. Schwarz G. Estimating dimension of a model. Ann Stat. 1978;6(2):461–4.

    Article  Google Scholar 

  104. Aho K, Derryberry D, Peterson T. Model selection for ecologists: the worldviews of AIC and BIC. Ecology. 2014;95(3):631–6.

    Article  PubMed  Google Scholar 

  105. Chakrabarti A, Ghosh JK. AIC, BIC and recent advances in model selection. Philos Stat 2011:583–605.

  106. Höge M, Wöhling T, Nowak W. A primer for model selection: the decisive role of model complexity. Water Resour Res. 2018;54(3):1688–715.

    Article  Google Scholar 

  107. Heo J, Lee JY, Kim W. Bayesian information criterion accounting for the number of covariance parameters in mixed effects models. Commun Stat Appl Methods. 2020;27(3):301–11.

    Google Scholar 

  108. Henderson CR. Best Linear unbiased estimation and prediction under a selection model. Biometrics. 1975;31(2):423–47.

    Article  CAS  PubMed  Google Scholar 

  109. Cullis BR, Smith AB, Coombes NE. On the design of early generation variety trials with correlated data. J Agricultural Biol Environ Stat. 2006;11(4):381–93.

    Article  Google Scholar 

  110. Dellaporta SL, Wood J, Hicks JB. A plant DNA minipreparation: version II. Plant Mol Biology Report. 1983;1(4):19–21.

    Article  CAS  Google Scholar 

  111. Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE. 2011;6(5):e19379.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  112. Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE. 2008;3(10):e3376.

    Article  PubMed  PubMed Central  Google Scholar 

  113. Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17(1):10–2.

    Article  Google Scholar 

  114. Andrews S. FastQC: a quality control tool for high throughput sequence data. In.: Babraham Bioinformatics, Babraham Institute, Cambridge, United Kingdom; 2010.

  115. Trujillo-Montenegro JH, Rodríguez Cubillos MJ, Loaiza CD, Quintero M, Espitia-Navarro HF, Salazar Villareal FA, Viveros Valens CA, González Barrios AF, De Vega J, Duitama J et al. Unraveling the genome of a high yielding Colombian sugarcane hybrid. Front Plant Sci 2021, 12.

  116. Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  117. Tello D, Gonzalez-Garcia LN, Gomez J, Zuluaga-Monares JC, Garcia R, Angel R, Mahecha D, Duarte E, Leon MR, Reyes F, et al. NGSEP 4: efficient and accurate identification of Orthogroups and whole-genome alignment. bioRxiv. 2022;2022(2001):2027–478091.

    Google Scholar 

  118. Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.

    Article  PubMed  PubMed Central  Google Scholar 

  119. Jombart T. Adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24(11):1403–5.

    Article  CAS  PubMed  Google Scholar 

  120. R Core Team. R: a Language and Environment for Statistical Computing. R Foundation for Statistical Computing. In.; 2019.

  121. VanRaden PM. Efficient methods to compute genomic predictions. J Dairy Sci. 2008;91(11):4414–23.

    Article  CAS  PubMed  Google Scholar 

  122. Rosyara UR, De Jong WS, Douches DS, Endelman JB. Software for genome-wide association studies in autopolyploids and its application to potato. Plant Genome. 2016;9(2):1–10.

    Article  CAS  Google Scholar 

  123. Benjamini Y, Hochberg Y. Controlling the false Discovery Rate - a practical and powerful Approach to multiple testing. J Royal Stat Soc Ser B-Methodological. 1995;57(1):289–300.

    Article  Google Scholar 

  124. Ferrão LFV, Benevenuto J, Oliveira IB, Cellon C, Olmstead J, Kirst M, Resende MF Jr, Munoz P. Insights into the genetic basis of blueberry fruit-related traits using diploid and polyploid models in a GWAS context. Front Ecol Evol. 2018;6:107.

    Article  Google Scholar 

  125. Storey JD. The positive false Discovery rate: a bayesian interpretation and the q-value. In; 2002.

  126. Storey JD, Taylor JE, Siegmund D. Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. J Royal Stat Soc Ser B-Statistical Methodol. 2004;66:187–205.

    Article  Google Scholar 

  127. Stančáková A, Paananen J, Soininen P, Kangas AJ, Bonnycastle LL, Morken MA, Collins FS, Jackson AU, Boehnke ML, Kuusisto J, et al. Effects of 34 risk loci for type 2 diabetes or hyperglycemia on lipoprotein subclasses and their composition in 6,580 nondiabetic Finnish men. Diabetes. 2011;60(5):1608–16.

    Article  PubMed  PubMed Central  Google Scholar 

  128. Cox DR, Snell EJ. Analysis of binary data. Volume 32. CRC; 1989.

  129. Smith TJ, McKenna CM. A comparison of logistic regression pseudo R2 indices. Multiple Linear Regres Viewpoints. 2013;39(2):17–26.

    Google Scholar 

Download references

Acknowledgements

The authors would like to thank the sugarcane mills and the growers who have supported Cenicaña for more than 46 years. Additionally, the authors give especial thanks to La Cabaña, Manuelita, and Mayaguez sugarcane mills for helping in the field experimentation.

Funding

This study was sponsored within the Colombian Scientific Ecosystem by The World Bank, COLCIENCIAS, ICETEX, the Colombian Ministry of Education, and the Colombian Ministry of Industry and Tourism, who funded this research under Grant ID: FP44842-217-2018 and OMICAS Award ID: 792-61187.

Author information

Authors and Affiliations

Authors

Contributions

CSD performed the analysis of phenotypic and genotypic data and wrote the manuscript. JHTM analyzed the genotypic data and helped in the revision of the manuscript. HAJ helped in the revision of the manuscript and in the analysis of genotypic data. AL helped in the revision of the manuscript and laboratory analysis. FASV responsible for the field experiments and phenotypic data collection. LOL responsible for the field experiments and phenotypic data collection. CAVV responsible for the field experiments and phenotypic data collection. JLG helped in the revision of the manuscript. JJR conceptualized the idea for the study and revised the manuscript. YMQ helped in the revision of the manuscript providing a wider discussion. FSA supervised the study, guided the analysis of phenotypic and genotypic data, and wrote the manuscript. All the authors have read and approved the final manuscript.

Corresponding author

Correspondence to Fernando S. Aguilar.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors have no relevant financial or nonfinancial interests to disclose.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Saavedra-Díaz, C., Trujillo-Montenegro, J.H., Jaimes, H.A. et al. Genetic association analysis in sugarcane (Saccharum spp.) for sucrose accumulation in humid environments in Colombia. BMC Plant Biol 24, 570 (2024). https://doi.org/10.1186/s12870-024-05233-y

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-024-05233-y

Keywords