Discovery of genes involved in anthocyanin biosynthesis from the rind and pith of three sugarcane varieties using integrated metabolic profiling and RNA-seq analysis
BMC Plant Biology volume 21, Article number: 214 (2021)
Sugarcane (Saccharum officinarum) is one of the most valuable feedstocks for sugar production. In addition to the production of industrial raw materials such as alcohol, papermaking, the fiber of livestock feed, respectively, sugarcane can produce bioactive compounds such as anthocyanins. Elucidation of the anthocyanin biosynthesis pathway is critical for the molecular breeding of sugarcane varieties with favorable traits. We aimed to identify candidate genes involved in anthocyanin biosynthesis by transcriptomic and metabolomic analyses.
Three varieties of sugarcane displaying different colors were used in this study: FN15 (greed rind), ROC22 (red rind), and Badila (purple rind). Sample materials were subjected to metabolomic analysis using UPLC-Q-TOF/MS and RNA-seq analysis. The metabolomic profiling results showed Cyanidin, Cyanidin (6’-malonylglucoside), Cyanidin O-glucoside, and Peonidin O-glucoside were the main components responsible for the rind color. Then, through RNA-seq analysis, we identified a total of 3137, 3302, 3014 differentially expressed genes (DEGs) between the rind and pith tissues for the corresponding varieties Badila rind, ROC22, and FN15. We then compared the expression levels of genes among the rind tissues from the three varieties. We identified 2901, 2821, and 3071 DEGs between Badila rind vs. ROC22 rind, Badila rind vs. FN15 rind, ROC22 rind vs. FN15 rind, respectively. We identified two enriched pathways, including phenylpropanoid biosynthesis and flavonoid biosynthesis. Sequencing similarity search identified a total of 50 unigenes belonging to 15 enzyme families as putative genes involved in anthocyanin biosynthesis in sugarcane rind. Seven of them were identified as candidate genes related to anthocyanin biosynthesis in the rind of sugarcane through co-localization analysis with the anthocyanin content in sugarcane. In total, 25 unigenes were selected and subjected to RT-qPCR analysis, and qRT-PCR results were consistent with those obtained with the RNA-Seq experiments.
We proposed a pathway for anthocyanin biosynthesis in sugarcane rind. This is the first report on the biosynthesis of anthocyanin in sugarcane using the combined transcriptomic and metabolomic methods. The results obtained from this study will lay the foundation for breeding purple pith sugarcane varieties with high anthocyanin contents.
Sugarcane (Saccharum officinarum) is one of the most valuable feedstocks for sugar production . Sugar extracted from sugarcane represents 70 % of global sugar production. The processed by-products of sugarcane can be used as industrial raw materials such as alcohol, papermaking, fiber, and livestock feed, respectively . Sugarcane is a proven biofuel feedstock and accounts for about 40 % of the biofuel production worldwide . Besides, sugarcane can provide large numbers of bioactive compounds for human health. The anthocyanins extract from sugarcane peel (Saccharum Officinarum) shows that 51.2 % inhibition of the HT29 cell line at a concentration of 0.625 µg/ml reduces the risk of colon cancer . Duarte-Almeida et al. found that the predominant phenolics in sugarcane culms were phenylpropanoids related to antioxidant activity . However, the bioactive compounds in sugarcane, such as phenolic compounds, have not been utilized and developed adequately.
The flavonoids are an important kind of phenolic compounds containing anthocyanins, flavones, and proanthocyanidins widely existing in the plants’ leaves and fruits. Anthocyanins are naturally occurring polyphenols responsible for the colors in most flowers and fruits of plants. Dietary consumption of anthocyanins has been shown to reduce the risk of cardio- and cerebrovascular diseases, atherosclerosis, cancer, diabetes, and failing vision. Such a beneficial effect could be related to the potent antioxidant activity of anthocyanin compounds . Cyanidin, peonidin, malvidin, pelargonidin, petunidin, and delphinidin are six common natural anthocyanins. Under normal circumstances, anthocyanins are mainly accumulated in plant organs, which give plants colorful colors and contribute to their ornamental and economic values .
Also, anthocyanins play an indispensable role in protecting the growth and development of plants. When plants are exposed to cold stress, CFBs transcription factors are activated, which affects the expression of anthocyanin synthetic genes, resulting in the increase of the anthocyanin contents and cold resistance in plants . The anthocyanin content in sugarcane leaves increased significantly under cold stress, thereby compensating for the lack of antioxidants in a low-temperature environment . Flavonoids’ production in plants changes in response to light intensity . When plants are exposed to intense sunlight, anthocyanins are produced in large quantities to protect plant chloroplasts from oxidation .
Furthermore, anthocyanins have specific effects on the body’s antioxidant and anticancer aspects . For example, anthocyanins can lower blood lipids and cholesterol . Simultaneously, anthocyanins also have specific functions in treating glaucoma and protecting vision . For the bioactive effects of anthocyanins described, increasing its contents in various plants has been one of the most popular research topics.
Sugarcane is an excellent breeding material and is grown on a large scale in China. Anthocyanins have high economic value, and anthocyanin-rich sugarcane can be obtained quickly by cultivating purple-hearted sugarcane. The naturally occurring anthocyanins and flavonoids have been found in Saccharum species such as S. officinarum, S. robustum, S barberi, and their inter-varietal, inter-generic, and interspecific crosses. For example, Mabry et al. used spectroscopic and chemical evidence to build two structures of flavonoids in S. officinarum . Li et al. systematically isolated flavonoids and anthocyanins from Chinese sugarcane (S. sinensis Roxb) . Li et al. determined the flavonoid content of different tissue parts of S. sinensis Roxb . Zhao et al. find that the content and variation of anthocyanins in different cultivars of sugarcane were significant, and 13 anthocyanins and their glycosyl derivatives were identified .
The accumulation of 3-deoxyanthocyanidin in sugarcane has been identified to strengthen the resistance to the red rot pathogen Colletotrichum falcatum . Ganesh et al. used HPLC analysis to reveal the mechanism of nine 3-deoxyanthocyanidin compounds against Colletotrichum falcatum resistance differentially from different sugarcane cultivars . Ganesh et al. revealed the mechanism of differential expression of key genes in the anthocyanin metabolic pathway by studying the antifungal properties of 3-deoxyanthocyanidin . In addition, the researchers have shown that 3-Deoxyanthocyanidin flavonoids increased the resistance to the attack of maize aphids in Sorghum bicolor . In summary, increasing the anthocyanidin contents might increase the resistance to pathogen infection and insect attack.
Several naturally red-fleshed Saccharum clones have been described . Chandran et al. reported nine germplasm clones resource of red-fleshed of Saccharum robustum (28 NG 219, NG 77 − 73, NG 77 − 75, NG 77 − 76, NG 77–78, NG 77–84, NG 77–88, NG 77–90 and NG 77–132). Their breeding value and applications of this sugarcane in terms of yield and morphological traits were discussed in detail . These red-fleshed canes are of great breeding value. The presence of red-fleshed S. robustum clones led us to hypothesize that breeding for red-flashed S. officinarum is also possible.
The whole genome of wild-type sugarcane (S. spontaneum L.) has been sequenced and assembled by Jisen Zhang et al. . However, very few studies have been reported on the biosynthesis of anthocyanins in sugarcane to date. This study aims to reveal anthocyanin-biosynthesis genes in sugarcanes by comparing gene expression levels sugarcane varieties with different rind colors. The results obtained from this study will provide a theoretical basis for the cultivation of purple-hearted sugarcane and provide a basis for subsequent gene cloning, gene function validation, molecular marker screening, and genetic improvement of sugarcane varieties.
Materials and methods
We used three sugarcane (S. officinarum) varieties: FN15, ROC22, and Badila with a green rind, red rind, and purple rind, respectively (Fig. 1). We collected the three varieties collected from Sugarcane Germplasm Resource Nursery of the National Sugarcane Engineering Research Center of Fujian Agricultural and Forestry University (26.0886 N, 119.2435 E), China. We cleaned the surface of the sugarcane rind repeatedly with DEPC water. The rind and inner pith of sugarcane samples were separated with a sharp knife, cut into small pieces, packed with tin foil paper, frozen in liquid nitrogen, stored at − 80 °C until use.
Profiling of anthocyanins using UPLC-Q-TOF/MS
We prepared anthocyanins as described previously . The process including the following steps: we weighed 200 mg sample, then added 1ml 1 % methanolic acetate solution, mixed them by shaking, let the solution stand at low temperature and dark overnight, collected the extract, extracted three times with 1ml 1 % methanolic acetate solution, transferred to a flask, made up to a final volume of 50 ml with a 1 % methanolic acetate solution. In summary, there are three varieties. Samples were taken from the rind and pith tissues of three individual plants of each variety. We then pooled the samples from the biological replicates for further analyses. There are three biological replicates for each sample. The standard compounds were cyanidin, Malvidin, Pelargonidin, Peonidin.
We performed LC-HRMS analyses on a Waters UPLC I-class system equipped with a binary solvent delivery manager and a sample manager, coupled with a Waters VION IMS Q-TOF Mass Spectrometer equipped with an electrospray interface (Waters Corporation, Milford, USA). We used the column Acquity BEH C18 column (100 mm × 2.1 mm i.d., 1.7 μm; Waters, Milford, USA). The separation was achieved using the following gradient: 5–20 % B over 0–2 min, 20–60 % B over 2–8 min, 60–100 % B over 8–12 min. The composition was held at 100 % B for 2 min, then at 100 to 5 % B for 14–14.5 min, and at 5 % B for 14.5–15.5 min at a flow rate of 0.40 mL/min. Here A is aqueous formic acid (0.1 % (v/v) formic acid) and B is acetonitrile (0.1 % (v/v) formic acid). The injection volume was 3.00 µL, and the column temperature was set at 45.0 °C. ESI ion source was used to ensure the data collected in a negative ion mode.
RNA isolation and sequencing
We extracted total RNAs using the mirVana miRNA Isolation Kit (Cat. AM1561, Invitrogen, Thermo Fisher Scientific Inc., USA) following the manufacturer’s protocol. The RNA purity was assessed and quantified using a NanoDrop 2000 spectrophotometer (Thermo Fisher Scientific, Waltham, MA, USA). Then, RNA integrity was appraised using the Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA). The transcriptome sequencing and analysis were conducted by OE Biotech Co., Ltd. (Shanghai, China). Briefly, the libraries were sequenced on an Illumina HiSeq X Ten platform, and 150 bp paired-end reads were generated. Raw data of FASTQ format were corrected with Trimmomatic to remove the adaptor sequences and filter out the low-quality reads . The ploy-N and low-quality reads were filtered out of the raw data with the parameters “LEADING = 3”, “TRAILING = 3”, and “MINLEN = 50”.
De novo assembly and function annotation
The clean reads were de novo assembled into transcripts by Trinity software (version: 2.4) according to the paired-end splicing method . We selected the longest transcript of each unigene for subsequent analysis. The unigenes were compared with known sequences in the NR (ftp://ftp.ncbi.nlm.nih.gov/blast/db/FASTA/nr.gz ); SWISS-PROT (http://www.uniprot.org/ ), and KOG (ftp://ftp.ncbi.nih.gov/pub/COG/KOG/kyva) databases using BLAST with an E-value cutoff of 1e-5. The transcripts were categorized by mapping their sequences against those in the Kyoto Encyclopedia of Genes and Genomes (KEGG: http://www.genome.jp/kegg/) database . The KO numbers and KEGG reference metabolic pathways were inferred based on those of their best hit sequences. Similarly, the unigenes were mapped to proteins from SwissProt. Their Gene Ontology (GO, http://www.geneontology.org/) classifications were inferred from those of their best hits.
Gene expression quantification and differential gene expression analysis
The FPKM  (fragments Per kb per Million reads) of each unigene was calculated using software bowtie2  and eXpress . Differentially expressed genes (DEGs) were identified using the DESeq  with a model based on the negative binomial distribution. The results of all statistical tests were corrected by multiple tests using the Benjamini and Hochberg false discovery rate. Genes were determined to be significantly differentially expressed with |Log2FoldChange| ≥1 and the adjusted P-value of < 0.05 according to the default settings in DESEq. We conducted a hierarchical cluster analysis of DEGs with TBtools .
We performed Gene Ontology (GO) and KEGG enrichment analysis on differentially expressed genes to describe their functions. GO classification was performed by mapping our proteins to those in Swissprot using BLASTN. The related GO terms were then extracted from the annotations for the hit proteins in SwissProt. KEGG enrichment analysis was conducted using the KOBAS database (http://kobas.cbi.pku.edu.cn/kobas3). After that, we counted the number of differential genes included in each GO entry and KEGG pathway. Then we calculated the significance of the enrichment of differential genes in each GO entry and KEGG using the hypergeometric distribution test method. The resulting p-value was subjected to correction to calculate the False Discovery Rate (FDR) or adjusted p-value. An adjusted p-value < 0.05 was used as the cutoff for significant enrichment.
Validation of RNA-seq experiments
We conducted a Reverse transcription-quantitative real-time PCR (RT-qPCR) analysis with the RNA samples used for the RNA-seq experiments. Each experiment had three technical replicates. In total, 1 µl total RNA was processed with the PrimeScript™ RT reagent Kit with gDNA Eraser. The reactions included two steps. Firstly, genomic DNA removal reaction included 1 µL RNA, 1 µL gDNA Eraser, 2 µL 5×gDNA Eraser Buffer, 16 µL RNase Free water. Reverse transcription reaction included 1 µL PrimeScript RT Enzyme Mix, 1 µL RT Primer Mix, 4 µL 5×PrimeScript Buffer 2, 4 µL RNase Free water. The Gene-specific primers were designed by the IDT (https://sg.idtdna.com/Primerquest/Home/Index) . All the primers are shown in Table S1. We chose the NADPH gene as the endogenous control. Reverse transcription-quantitative real-time PCR reaction contained 10 µL 2 × Master Mix, 2 µL cDNA, 0.5 µL 10 μm PCR gene-specific forward primers, 0.5 µL 10 μm PCR gene-specific reverse primers, 7 µL RNase Free water. The cycling conditions were 95 ºC for 30 s with 40 cycles. To establish the melting curve of PCR products, after the amplification reaction is over, press (95 ºC, 10 s; 60 ºC, 60 s; 95 ºC, 15 s); and slowly heat from 60 ºC to 99 ºC. The target gene and internal reference of each sample were subjected to real-time PCR reaction, and three replicate wells were tested for each sample, and the data were analyzed by the 2−△△Ct method .
2.8 Analysis of structural genes related to the biosynthesis of anthocyanins in the rind of sugarcane
The structure genes associated with the biosynthesis of anthocyanins in the rind of sugarcane were identified as described below. First, we downloaded the gene sequences from the KEGG pathway with id ko00941 (flavonoid biosynthesis) and ko00942 (anthocyanin biosynthesis) to construct a local reference database. Then, the unigene sequences obtained from the RNA-seq experiments from this study were searched against the local reference database using the BLASTX algorithm (v2.7.1+) , with an e-value cutoff of 1e-5. Thirdly, the selected unigenes were subjected to the open reading frame (ORF) identification using the TransDecoder program (v5.5.0) with default parameters. Lastly, the ORFs were used to search the CD-search database at https://www.ncbi.nlm.nih.gov/Structure/bwrpsb/bwrpsb.cgi with an e-value cutoff of 1e-5. We identified the full-length protein sequences based on the result of the CD-search. The Pearson correlation coefficients between the expression profiles of genes related to anthocyanin-biosynthesis and the content of the derivative of cyanidin in sugarcane were calculated using TBtools software (Version 1.05).
We performed multiple sequence alignment of full-length sequences using the MUSCLE . We visualized the multiple sequence alignment results using GeneDoc software . Phylogenetic analysis of full-length genes was conducted with the Maximum Likelihood Estimate method implemented in MEGA7 software . We calculated the bootstrap score based on 1000 replications. We ran the weighted correlation network analysis (WGCNA) to determine the relationships between phenotypes and differential genes with the R package of WGCNA .
Anthocyanins involved in sugarcane
To compare the flavonoids and anthocyanin compounds among the three sugarcane varieties, we obtained the rind and pith sugarcane varieties’ total ion chromatograms from ultra-high-performance liquid chromatography-quadrupole time-of-flight mass spectrometry (UPLC-Q-TOF/MS). We identified the chemical components of sugarcane through retention time, exact relative molecular mass, cleavage fragments of MS/MS, and previously reported data. Figure 2 shows the spectra of cyanidin O-glucoside (A), peonidin O-glucoside (B), cyanidin (6’-malonylglucoside) (C) in sugarcane. Cyanidin O-glucoside showed a formula of C21H21O11 and retention time of 6.65 with m/z 449, which produced one fragment located at m/z 287. Transition 449 > 287 represented the loss of glucose (m/z 162) (Fig. 2 a). Peonidin O-glucoside showed a formula of C22H23O11 and retention time of 7.53 with m/z 463. The m/z 301 fragments were formed, which corresponds to peonidin C16H13O6 (m/z 301.07051) (Fig. 2b). Cyanidin (6’-malonylglucoside) showed a formula of C24H23O14 and a retention time of 7.99 with m/z 535.10814, which produced one fragment m/z 448 and 287 (Fig. 2 c). Transition 535 > 448 represented the loss of malonyl (m/z 87), and transition 535 > 287 produced cyanidin (m/z 287) due to the loss of both glucose and malonyl.
As shown in Table 1, a total of 7 anthocyanins were identified and quantified in the rind and pith of sugarcane. Most anthocyanins in the rind and pith of sugarcane are cyanidin, which included cyanidin, cyanidin (6’-malonylglucoside), and cyanidin O-glucoside. The total anthocyanidin contents in rind samples were higher than pith samples. The content of cyanidin, cyanidin (6’-malonylglucoside), cyanidin O-glucoside and peonidin O-glucoside peonidin_C6 in rind is higher than the content in pith. In the rind of Badila with purple color, the contents of peonidin, cyanidin (6’-malonylglucoside), cyanidin O-glucoside, peonidin O-glucoside are significantly higher than the contents in the rind of ROC22 and FN15. As a result, the derivative of cyanidin is the most abundant ingredient in the rind of sugarcane, which is the main factor affecting the rind color of sugarcane.
RNA-seq analysis of the rind and pith tissues of three sugarcane varieties
We constructed six cDNA libraries from three sugarcane varieties with different rind colors to explore the molecular mechanism of anthocyanins biosynthesis and accumulation in the rind and pith tissues of sugarcane. The six samples were named Badila_rind (purple), Badila_pith; ROC22_rind (red), ROC22_pith; FN15_rind (green), FN15_pith. After removing the adapters, low-quality sequences, and reads shorter than 35 bp, we obtained 55.5, 57.1, 55.2, 59.9, 5 6.1, and 57.2 million clean reads for the six samples. These clean data were assembled using the Trinity, and we obtained 73,916 unigenes longer than 300 bp. The average length of those unigenes is 646 bp, and the length of N50 is 1398 bp. The length of distribution for the unigenes is shown in Fig. S1.
To understand these genes’ putative functions, we compared a total of 73,916 unigenes to five public databases using BLASTN and BLASTX. The databases included the following: NR, GO, SWISSPROT, KEGG, and KOG. Finally, 43,546, 21,210, 8,297, 27,966 and 23,821 unigenes were annotated by NR, KOG, KOG, KEGG, SWISSPROT and GO databases. There were 6,114 unigenes annotated by all the databases and 43,827 unigenes annotated only by one database (Fig. S2).
The 23,821 unigenes mapped with GO terms were divided into 64 function groups belonging to the three main GO classifications: biological process, cellular component, and molecular function (Fig. S3). In contrast, 16,133 unique sequences were assigned to KEGG pathways (Fig. S4).The top 10 most mapped pathways were “Transport and Catabolism”(1,013 sequences), “Cell growth and Death”(1,022 sequences), “Signal Transduction”(3,005 sequences), “Translation”(1,027 sequences), “Carbohydrate metabolism”(1,748 sequences), “Amino acid metabolism”(1,057 sequences), “Lipid metabolism”(994 sequences), “Folding, Sorting and Degradation”(944 sequences), “Replication and Repair”(776 sequences), “Energy Metabolism”(714 sequences) (Fig. S3).
Real‐time PCR validation of the expression levels of anthocyanin‐related genes
To validate the RNA-seq data, we selected 25 unigenes for RT-qPCR analysis. Those genes belong to the anthocyanin and flavonoid biosynthetic pathways, and their sequences are presented in supplementary file1. Three genes (CL22745Contig1, CL1Contig881, CL19401Contig1) belong to the CHS family, two genes (CL1Contig5298, CL19316Contig1) belong to the CHI family, one gene (CL6788Contig1) belong to the LDOX family, three genes (CL15263Contig1, CL186Contig2, CL1Contig6521) belong to F3’5’H family, two genes (CL3124Contig1, CL3124Contig2) belong to the F3’H family, three genes (CL1Contig2216, CL576Contig1, CL576Contig2) belong to the FLS family, five genes ( CL6042Contig1, CL23185Contig1,CL28592Contig1, CL28006Contig1, comp35647_c0_seq1_2) belong to the LDOX family, four genes (comp62628_c0_seq1_1, comp72111_c0_seq1_1, comp74241_c0_seq1_2, comp131906_c0_seq1_1) belong to the UFGT family, one gene (CL19110Contig1) belong to the BZ2 family, one gene (comp74919_c0_seq1_2) belong to the MYB family (Table 2). We used the gene expression level in the rind minus that in the pith derived from the same variety of sugarcane as the relative expression level of genes. As shown in Figs. 3 and 64 % of qRT-PCR results were consistent with those obtained with the RNA-seq experiments. These data suggested that the expression patterns deduced from the FPKM values in our transcriptome analyses were reliable and can be used in downstream gene expression analyses.
Differentially expressed genes in the rind and pith of sugarcane
To identify the Differentially Expressed Genes (DEGs) in the rind and pith of sugarcane, we first analyzed the DEGs between the rind and the pith tissues for each variety. There were 2,559 up and 703 down-regulated unigenes between Badila_rind vs. Badila_pith. In contrast, there were 2,138 up and 999 down-regulated unigenes between ROC22_rind vs. ROC22_pith. Lastly, there were 1,687 up and 1,732 down-regulated unigenes between FN15_rind vs. FN15_pith. A total of 1872 DEGs between rind and pith of sugarcane have expression profiles that correlate with the anthocyanin’s contents according to the Pearson correlation coefficient is a threshold value of 0.9 (Table S2).
Then, we identified DEGs in the rind tissues of the three varieties. There were 1,760 up-regulated transcripts and 1,061 down-regulated transcripts between Badila (purple rind) and FN15 (green rind); 1,922 up-regulated transcripts and 1,149 down-regulated transcripts between ROC22 (red rind) and Badila (purple rind); 1,668 up-regulated transcripts and 1,233 down-regulated transcripts between FN15 (green rind) and ROC22 (red rind) (Fig. 4). A total of 1746 DEGs between rinds of sugarcane have expression profiles that correlate with the anthocyanin’s contents according to the Pearson correlation coefficient is a threshold value of 0.9 (Table S3). Among those related genes, ScLDOX (CL6788Contig1), ScF3H (comp30564_c0_seq2_1), ScGT1_7 (comp43983_c0_seq1_2) are related to anthocyanin biosynthesis.
Next, we identified the DGEs between the rind and pith tissues of the three varieties. The comparison results of these DGEs are shown in the Venn diagram (Fig. S5). As shown, 637 DGEs were shared among all three sugarcane varieties. And there are 200, 250, and 426 DEGs shared between Badila and ROC22, Badila and FN15, ROC22, and FN15, respectively (Fig. S5).
Lastly, we compared gene expression levels in the rind tissues of the three varieties. Using the expression level in the rind of FN15 sugarcane as a control, we determine DGEs between Badila and FN15, ROC22, and FN15. The results are shown in Fig. S6. There are 2821 DGEs for Badila/FN15. In contrast, there are 2901 DGEs for ROC22/FN15. Lastly, there are 574 DGEs shared between the two sets.
Enrichment analysis of DEGs
The DEGs identified from the six pairs of comparisons were further subjected to KEGG pathway enrichment analyses to screen genes associated with anthocyanin biosynthesis in the rind and pith tissues. The top 20 enriched pathways included the following: steroid biosynthesis, steroid hormone biosynthesis, phenylpropanoid biosynthesis, flavonoid biosynthesis, tryptophan metabolism, fatty acid elongation, linoleic acid metabolism, indole alkaloid biosynthesis, glyoxylate, and dicarboxylate metabolism, cyanoamino acid metabolism, sesquiterpenoid, and triterpenoid biosynthesis, biosynthesis of unsaturated fatty acids, stilbenoid, diarylheptanoid, and gingerol biosynthesis, MAPK signaling pathway, retinol metabolism, carbon fixation pathways in prokaryotes, ErbB signaling pathway, gap junction, cutin, suberine, and wax biosynthesis, ubiquinone and another terpenoid-quinone biosynthesis (Fig. 5). In particular, phenylpropanoid biosynthesis is the most enriching pathway. Anthocyanin biosynthesis and flavonoid biosynthesis were all enriched in all the above comparisons. The result showed the DEGs were enriched in many metabolic processes that included flavonoid and anthocyanin biosynthesis pathways.
Identification of candidate genes related to anthocyanin biosynthesis in the rind
We identified the putative genes related to anthocyanin biosynthesis based on the sequence similarity to those genes in the KEGG pathways for flavonoid biosynthesis (ko00941) and anthocyanin biosynthesis (ko00942). As shown in Tables 2, we identified a total of 51 genes. These included the following: three CHS (chalcone synthase), two CHI (chalcone–flavanone isomerase), one F3H (Flavanone 3-hydroxylase), two F3’H (flavanone 3’-hydroxylase), three F3’5’H (flavonoid-3’,5’-hydroxylase), one LDOX (leucoanthocyanidin dioxygenase), one MYB (myeloblastosis), one BZ2 (Bronze 2), seven ANR (anthocyanidin reductase), three FLS (flavonol synthase), eight BZ1 (anthocyanidin 3-O-glucosyltransferase), eleven GT1(anthocyanidin 5,3-O-glucosyltransferase), three 5MaT (malonyl-CoA: anthocyanidin 5-O-glucoside-6’’-O-malonyltransferase), one 3MaT (malonyl-coenzyme A: anthocyanin 3-O-glucoside-6’’-O-malonyltransferase) and two MF (O-methyltransferase). To validate the full-length coding sequences, we conducted multiple sequence alignment and phylogenetic analysis for these genes: CHS (Fig. S7), CHI (Fig. S8), F3H (Fig. S9), LDOX (Fig. S10), BZ2 (Fig. S11), MYB (Fig. S12), ANR (Fig. S13), BZ1 (Fig. S14), GT1 (Fig. S15), 5MaT (Fig. S16), 3MaT (Fig. S17). As shown in the multiple sequence alignment, these genes are highly conserved among sugarcane and other plants.
To study the co-expression patterns of these putative genes related to anthocyanin biosynthesis, we performed hierarchical clustering of these 51 genes’ expression profiles and the content of derivative of cyanidin using the Euclidean distance as the metric and Ward’s method. As shown in Fig. 6, two main clusters were readily discernable, which were named C1 and C2. The clusters C1 containing seven unigenes showed the highest expression levels in the rind, which had the highest levels of cyanidin derivatives. The seven unigenes were ScCHS1, ScF3H, ScLDOX, ScMYB, ScBZ2, ScBZ1_2, ScBZ1_4. All of them except ScBZ1_4 appeared to have the full length of the coding sequences. These genes are likely to play important roles in anthocyanins biosynthesis, and their exact functions will be the subject of future investigation.
To further investigate the relationship between DEGs and the abundance of anthocyanin compounds, we performed WGCNA analysis. As shown in Fig. 7, all the rind samples were clustered together, and all the pith samples were clustered together. In the purple rind, the expression levels of the members of the gene module were upregulated. The gene modules have the highest correlation with the abundance of those anthocyanin compounds, including cyanidin, pelargonidin, peonidin, cyanidin (6’-malonylglucoside), cyanidin O-glucoside, and peonidin O-glucoside. Interestingly, the abundance of malvidin was highly correlated with the gene expression profiles in the green rind sugarcane.
Anthocyanin identification in the rind and pith of sugarcane
Anthocyanins are secondary metabolites distributed widely in plants such as vegetables, fruits, and medical plants. Several efforts have been made to increase anthocyanin content in particular plant tissues. For example, the anthocyanin contents have been increased in the purple heart cabbage. Anthocyanin glycosylation modifications affect the stability of anthocyanins in cells. The first step after anthocyanin biosynthesis in both purple potato and Arabidopsis is glycosylation to form anthocyanin-3-O-glucoside. However, the downstream glycosylation modifications become entirely different. In Arabidopsis, a xylose group is transferred to the C2 position of anthocyanin-3-O-glucoside catalyzed by glycosyltransferase At3GGT (UGT79B1), whereas in purple potato, a glucose molecule is presumably transferred to the C2 position of anthocyanin-3-O-glucoside catalyzed by glycosyltransferase (Ib3GGT) to form anthocyanin-3-O-sophoroside .
Our long-term goal is to breed purple-hearted sugarcane with high anthocyanin content. Here, we conducted a combined metabolomic and transcriptomic analysis to identify genes involved in anthocyanins’ biosynthesis and regulation. A total of 7 anthocyanins were identified in the rind and pith of sugarcane by UPLC-Q-TOF/MS (Table 1). We found that the derivative of cyanidin is the determinant of the rind color of sugarcane.
The results from this study might be related to other anthocyanins’ effects. Firstly, Increased expression of genes involved in the phenylalanine pathway leads to increased levels of polyphenols . Polyphenolic compounds such as anthocyanins may be subject to browning by the action of polyphenol oxidase during sugar extraction [41, 42]. However, previous reports have found that sucrose concentrations of around 20 % were protective against anthocyanin browning. . Besides, the browning of anthocyanin-rich fruit juice was affected by pH, and MA (monomeric anthocyanin content). And exogenous application of ascorbic acid had a preventive effect on phloem discoloration. [42, 44]. In conclusion, the enrichment of anthocyanins during the extraction of sugar cane can cause browning. Still, at the same time, effective measures can be taken to reduce the degree of browning significantly.
Three issues need attention. Firstly, there are six natural anthocyanins. However, we detected four of them and their glycosyl derivatives, as we did not detect two natural anthocyanins, delphindin and petunidin. There are two possible reasons: (1) the sample preparation method is not optimal for the extraction of the substance, and the substance is not in the extraction solution; (2) the substance was extracted successfully, but the contents were so low that they might be below the detection limitations of the instrument.
Secondly, it is generally considered that the content of malvidin correlates with the tissue color. However, the content of malvidin is higher in the pith (less colorful) than rind (more colorful), especially for FN15 and ROC22 (Table 1) on the contrary to the general thoughts. This may be due to the complex color-forming mechanism of plants. For example, co-pigmentation of anthocyanins in plant tissues under different anthocyanin combinations or PH values could show different results . The exact reason is currently unclear and needs further investigation.
Thirdly, the mass spectrometry analysis did not deduce the anthocyanin’s accurate structure because there are many isomers of anthocyanins with different glucoside forms lacking standard compounds. Therefore, we can only make a preliminary qualitative and quantitative analysis of anthocyanins’ composition and content by analyzing anthocyanins’ mass spectrum information.
Candidate genes involved in anthocyanins biosynthesis of sugarcane
Through correlation analysis with anthocyanin content in sugarcane, we found that the expression profiles of 7 genes correlate well with those of the anthocyanin abundance. The seven genes were ScCHS1, ScF3H, ScLDOX, ScMYB, ScBZ2, ScBZ1_2, ScBZ1_4. Overexpression of the CHS gene in rice causes anthocyanin accumulation . A transcriptional activation complex composed of R2R3-MYB, basic-helix-loop-helix (bHLH), and WD40 proteins (named MBW complex) has been shown to control the expression of anthocyanin structural genes. F3H mutant cells cannot synthesize anthocyanins and remain white . The Arabidopsis TDS4 gene encodes leucoanthocyanidin dioxygenase (LDOX). It is essential for proanthocyanidin synthesis and vacuole development . The glutathione S-transferase encoded by Bronze2 (BZ2) performs the last genetically defined step in maize anthocyanin biosynthesis, being required for pigment sequestration into vacuoles . Based on the above results, we speculate that the different expression patterns of anthocyanin biosynthesis and related regulatory genes contribute to sugarcane color. This information sheds light on the evolution of anthocyanin glycosylation among other plants, which will provide new ideas for producing specific anthocyanin compounds through genetic engineering.
Putative pathway of biosynthesis of anthocyanin in the sugarcane rind
The anthocyanin biosynthesis pathway is conserved in higher plants (Naing and Kim, 2018). We identified several enzyme-coding structural genes involved in anthocyanins biosynthesis . They include the following: phenylalanine-ammonia lyase (PAL), 4-coumaryl: CoA ligase (4CL), chalcone synthase (CHS), chalcone isomerase (CHI), flavonoid-3′-hydroxylase (F3′H), flavonoid-3′,5′-hydroxylase (F3′5′H), flavanone 3-hydroxylase (F3H), dihydroflavonol 4-reductase (DFR), anthocyanidin synthase (ANS), and UDP-glucose: flavonoid 3-O-glucosyltransferase (UFGT) . So far, the biosynthesis mechanism of anthocyanin in the rind of sugarcane is not clear. In this study, we discovered that cyanidin’s derivative is the main factor affecting sugarcane’s rind color. By analyzing the transcriptome data of three different sugarcane varieties with different rind colors, we hypothesized the putative pathway of biosynthesis of anthocyanin in the rind of sugarcane (Fig. 8). The putative pathway contains 15 protein families, including CHS, CHI, F3H, F3’H, F3`5’H, ANR, BZ2, MYB, GT1, FLS, BZ1, ANS, 5MaT, 3MaT, and MF. In this study, we did not identify the DRX gene in the rind and pith of sugarcane, which may be due to the gene’s tissue-specific expression . These results will further improve understanding of this anthocyanin biosynthesis pathway in the rind of sugarcane.
This study investigated anthocyanin and flavonoid biosynthesis in sugarcane rind and pith using the combined transcriptomic and metabolomic methods. Through UPLC analysis of anthocyanin compounds in sugarcane, we found that cyanidin derivatives were the main factor for the color difference of sugarcane rind. Secondly, we conducted the comparative transcriptome analysis to identify the DEGs between the rind and pith of sugarcane varieties. Thirdly, we identified 51 putative genes related to anthocyanin and flavonoid biosynthesis based on the sequence similarity. Fourthly, seven genes were identified as candidate genes related to anthocyanin and flavonoid biosynthesis through the correlation analysis with cyanidin derivatives content. Finally, we proposed a hypothetical molecular model to explain anthocyanin’s biosynthesis and its glycoside derivatives in sugarcane. These results lay the foundation for improving anthocyanin production in sugarcane through genetic engineering and molecular breeding. This research provides valuable resources for the study of sugarcane anthocyanins and provides a molecular basis for improving sugarcane anthocyanin genetic breeding.
Availability of data and materials
The raw reads generated for this study have been deposited in BioProject with the accession number: PRJNA573557 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA666228).
Ultra-high-performance liquid chromatography-quadrupole time-of-flight mass spectrometry
Differentially expressed genes; RT-qPCR: Real-time quantitative reverse transcription-polymerase chain reaction
Open reading frame
Weighted correlation network analysis
Chandran K, Nisha M, Gireesan P. Characterization of Progenies from Polycrosses of S. robustum Clones f. sanguineum. Sugar Tech. 2020;22(3):379–88.
Waclawovsky AJ, Sato PM, Lembke CG, Moore PH, Souza GM. Sugarcane for bioenergy production: an assessment of yield and regulation of sucrose content. Plant Biotechnol J. 2010;8(3):263–76.
Lam E, Shine Jr J, Da Silva J, Lawton M, Bonos S, Calvino M, Carrer H, SILVA-FILHO MC, Glynn N, Helsel Z: Improving sugarcane for biofuel: engineering for an even better feedstock. Gcb Bioenergy 2009, 1(3):251–255.
Pallavi R, Elakkiya S, Tennety SSR, Devi PS. Anthocyanin analysis and its anticancer property from sugarcane (Saccharum officinarum L) peel. IJRPC. 2012;2(2):338–45.
Duarte-Almeida JM, Salatino A, Genovese MI, Lajolo FM. Phenolic composition and antioxidant activity of culms and sugarcane (Saccharum officinarum L.) products. Food Chem. 2011;125(2):660–4.
Giordano L, Coletta W, Rapisarda P, Donati MB, Rotilio D: Development and validation of an LC-MS/MS analysis for simultaneous determination of delphinidin-3-glucoside, cyanidin-3-glucoside and cyanidin-3-(6-malonylglucoside) in human plasma and urine after blood orange juice administration. J Sep Sci 2007, 30(18):3127–3136.
Zhao D, Tao J. Recent advances on the development and regulation of flower color in ornamental plants. Front Plant Sci. 2015;6:261.
Zhou L, He Y, Li J, Liu Y, Chen H. CBFs Function in Anthocyanin Biosynthesis by Interacting with MYB113 in Eggplant (Solanum melongena L.). Plant Cell Physiol. 2020;61(2):416–26.
Zhu J-J, Li Y-R, Liao J-X. Involvement of anthocyanins in the resistance to chilling-induced oxidative stress in Saccharum officinarum L. leaves. Plant Physiol Biochem. 2013;73:427–33.
Pan J, Chen H, Guo B, Liu C. Understanding the molecular mechanisms underlying the effects of light intensity on flavonoid production by RNA-seq analysis in Epimedium pseudowushanense BL Guo. PloS One. 2017;12(8):e0182348.
Hughes N, Neufeld H, Burkey K. Functional role of anthocyanins in high-light winter leaves of the evergreen herb Galax urceolata. New Phytol. 2005;168(3):575–87.
Wei J, Wu H, Zhang H, Li F, Chen S, Hou B, Shi Y, Zhao L, Duan H. Anthocyanins inhibit high glucose-induced renal tubular cell apoptosis caused by oxidative stress in db/db mice. Int J Mol Med. 2018;41(3):1608–18.
Farrell N, Norris G, Lee SG, Chun OK, Blesso CN. Anthocyanin-rich black elderberry extract improves markers of HDL function and reduces aortic cholesterol in hyperlipidemic mice. Food Funct. 2015;6(4):1278–87.
Shim SH, Kim JM, Choi CY, Kim CY, Park KH. Ginkgo biloba extract and bilberry anthocyanins improve visual function in patients with normal tension glaucoma. J Med Food. 2012;15(9):818–23.
Mabry TJ, Liu Y-L, Pearce J, Dellamonica G, Chopin J, Markham KR, Paton NH, Smith P. New flavonoids from sugarcane (Saccharum). J Nat Prod. 1984;47(1):127–30.
Li X, Ma Z, Yao S. Bioactivity-guided systematic extraction and purification supported by multitechniques for sugarcane flavonoids and anthocyanins. Food Bioproducts Process. 2015;94:547–54.
Li X, Yao S, Tu B, Li X, Jia C, Song H. Determination and comparison of flavonoids and anthocyanins in Chinese sugarcane tips, stems, roots and leaves. J Sep Sci. 2010;33(9):1216–23.
Zhao Z, Yan H, Zheng R, Khan MS, Fu X, Tao Z, Zhang Z. Anthocyanins characterization and antioxidant activities of sugarcane (Saccharum officinarum L.) rind extracts. Ind Crops Prod. 2018;113:38–45.
Viswanathan R, Mohanraj D, Padmanaban P, Alexander KC. Synthesis of phytoalexins in sugarcane in response to infection by Colletotrichum falcatum Went. Acta Phytopathologica Entomol Hung. 1996;31(3):229–37.
Ganesh Kumar V, Viswanathan R, Malathi P, Nandakumar M, Ramesh Sundar A. Differential Induction of 3-deoxyanthocyanidin Phytoalexins in Relation to Colletotrichum falcatum Resistance in Sugarcane. Sugar Tech. 2015;17(3):314–21.
Nandakumar M, Malathi P, Sundar AR, Viswanathan R: Host-pathogen interaction in sugarcane and red rot pathogen: exploring expression of phytoalexin biosynthesis pathway genes. Indian Phytopathology 2021.
Kariyat RR, Gaffoor I, Sattar S, Dixon CW, Frock N, Moen J, De Moraes CM, Mescher MC, Thompson GA, Chopra S. Sorghum 3-Deoxyanthocyanidin Flavonoids Confer Resistance against Corn Leaf Aphid. J Chem Ecol. 2019;45(5):502–14.
Amalraj VA, Balasundaram N: On the taxonomy of the members of ‘Saccharum complex’. Genetic Resources and Crop Evolution 2006, 53(1):35–41.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Grabherr MG, Haas BJ, Yassour M, Levin JZ. others: Trinity: reconstructing a full-length transcriptome without a genome from RNA-Seq data. Nat Biotechnol. 2013;29:644.
Roberts A, Trapnell C, Donaghey J, Rinn JL, Pachter L. Improving RNA-Seq expression estimates by correcting for fragment bias. Genome Biol. 2011;12(3):1–14.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357.
Roberts A, Pachter L. Streaming fragment assignment for real-time analysis of sequencing experiments. Nat Methods. 2013;10(1):71–3.
Anders S, Huber W. Differential expression of RNA-Seq data at the gene level–the DESeq package. Eur Mol Biol Lab. 2012;10:f1000research.
Chen C, Chen H, Zhang Y, Thomas HR, Xia R: TBtools: An Integrative Toolkit Developed for Interactive Analyses of Big Biological Data. Molecular Plant 2020, 13(8).
Owczarzy R, Tataurov AV, Wu Y, Manthey JA, McQuisten KA, Almabrazi HG, Pedersen KF, Lin Y, Garretson J, McEntaggart NOJNar: IDT SciTools: a suite for analysis and design of nucleic acid oligomers. 2008, 36(suppl_2):W163-W169.
Schmittgen TD, Livak K. Analyzing real-time PCR data by the comparative C T method. Nat Protoc. 2008;3(6):1101.
Altschul SF. Basic local alignment search tool (BLAST). J Mol Biol. 1990;215(3):403–10.
Edgar RC. MUSCLE: multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 2004;32(5):1792–7.
Nicholas KB. GeneDoc: analysis and visualization of genetic variation. Embnew News. 1997;4:14.
Kumar S, Stecher G, Tamura K: MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Molecular biology evolution 2016, 33(7):1870–1874.
Langfelder P, Horvath S. WGCNA: an R package for weighted correlation network analysis. BMC Bioinformatics. 2008;9(1):559.
De Villena FA, Fritz VA, Cohen JD, Hutchison WD. Changes in gluconasturtiin concentration in Chinese cabbage with increasing cabbage looper density. HortScience. 2007;42(6):1337–40.
Wang H, Wang C, Fan W, Yang J, Appelhagen I, Wu Y, Zhang P. A novel glycosyltransferase catalyses the transfer of glucose to glucosylated anthocyanins in purple sweet potato. J Exp Botany. 2018;69(22):5444–59.
Kader F, Rovel B, Girardin M, Metche M. Mechanism of browning in fresh highbush blueberry fruit (Vaccinium corymbosum L). Role of blueberry polyphenol oxidase, chlorogenic acid and anthocyanins. J Sci Food Agriculture. 1997;74(1):31–4.
Jiang Y, Duan X, Joyce D, Zhang Z, Li J. Advances in understanding of enzymatic browning in harvested litchi fruit. Food Chem. 2004;88(3):443–6.
Jiang Y. Role of anthocyanins, polyphenol oxidase and phenols in lychee pericarp browning. J Sci Food Agric. 2000;80(3):305–10.
Nikkhah E, Khayamy M, Heidari R, Jamee R. Effect of sugar treatment on stability of anthocyanin pigments in berries. J Biol Sci. 2007;7(8):1412–7.
Dorris MR, Voss DM, Bollom MA, Krawiec-Thayer MP, Bolling BW. Browning Index of Anthocyanin‐Rich Fruit Juice Depends on pH and Anthocyanin Loss More Than the Gain of Soluble Polymeric Pigments. J Food Sci. 2018;83(4):911–21.
Asen S, Stewart RN, Norris KH. Co-pigmentation of anthocyanins in plant tissues and its effect on color. Phytochemistry. 1972;11(3):1139–44.
Reddy AR, Scheffler B, Madhuri G, Srivastava MN, Kumar A, Sathyanarayanan PV, Nair S, Mohan MJPMB. Chalcone synthase in rice (Oryza sativa L.): detection of the CHS protein in seedlings and molecular mapping of the chs locus. Plant Mol Biol. 1996;32(4):735.
Zhou H, Lin-Wang K, Wang H, Gu C, Dare AP, Espley RV, He H, Allan AC, Han Y. Molecular genetics of blood-fleshed peach reveals activation of anthocyanin biosynthesis by NAC transcription factors. Plant J. 2015;82(1):105–21.
Klimek-Chodacka M, Oleszkiewicz T, Baranski R. Visual Assay for Gene Editing Using a CRISPR/Cas9 System in Carrot Cells. Methods Mol Biol. 2019;1917:203–15.
Abrahams S, Lee E, Walker AR, Tanner GJ, Larkin PJ, Ashton AR. The Arabidopsis TDS4 gene encodes leucoanthocyanidin dioxygenase (LDOX) and is essential for proanthocyanidin synthesis and vacuole development. Plant J. 2003;35(5):624–36.
Marrs KA, Alfenito MR, Lloyd AM, Walbot V. A glutathione S-transferase involved in vacuolar transfer encoded by the maize gene Bronze-2. Nature. 1995;375(6530):397–400.
Katsuhisa Y, Miwa O, Yoichiro F, Yozo O, Masayuki F, Chihong S, Yoichi N, Kazuki S, Teruo S, Toshinobu S: Studies on Vacuolar Membrane Microdomains Isolated from Arabidopsis Suspension-Cultured Cells: Local Distribution of Vacuolar Membrane Proteins. Plant & Cell Physiology (10):1571–1584.
Lo C. Molecular Dissection of the Pathogen-Inducible 3-Deoxyanthocyanidin Biosynthesis Pathway in Sorghum. Plant Cell Physiol. 2010;51(7):1173–85.
This project was supported by funds from Innovative Drugs of China [2019ZX09735-002], and National Science & Technology Fundamental Resources Investigation Program of China [2018FY100705], National Natural Science Foundation of China . Chinese Academy of Medical Sciences,Innovation Funds for Medical Sciences (CIFMS) [2017-I2M-1-013]. Open Fund of the National Sugarcane Engineering and Technology Research Center [KJG16005R]. Science and Technology Innovation Special Fund of Fujian Agriculture and Forestry University [KFA17263A], [KF2015080], [ KF2015118].
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Yang Ni and Haimei Chen are co-first authors
About this article
Cite this article
Ni, Y., Chen, H., Liu, D. et al. Discovery of genes involved in anthocyanin biosynthesis from the rind and pith of three sugarcane varieties using integrated metabolic profiling and RNA-seq analysis. BMC Plant Biol 21, 214 (2021). https://doi.org/10.1186/s12870-021-02986-8