Effects of vitro sucrose on quality components of tea plants (Camellia sinensis) based on transcriptomic and metabolic analysis

Background Tea plants [Camellia sinensis (L.) O. Kuntze] produce one of the three most popular non-alcoholic beverages in the world. Polyphenols and volatiles are the main functional ingredients determining tea’s quality and flavor; however, the biotic and abiotic factors affecting tea polyphenol biosynthesis are unclear. This paper focuses on the molecular mechanisms by which sucrose affects polyphenol biosynthesis and volatile composition in tea plants.
Results Metabolic analysis showed that the total content of anthocyanins, catechins, and proanthocyanidins (PAs) increased with sucrose, and they accumulated most significantly after 14 days of treatment. Transcriptomic analysis revealed 8384 and 5571 differentially expressed genes in 2-day and 14-day sucrose-treated tea plants, respectively, compared with control-treated plants. Most of the structural genes and transcription factors (TFs) involved in polyphenol biosynthesis were significantly up-regulated after 2d. Among these transcripts, the predicted genes encoding glutathione S-transferase (GST), ATP-binding cassette transporters (ABC transporters), and multidrug and toxic compound extrusion transporters (MATE transporters) appeared to be up-regulated. Correspondingly, ultra-performance liquid chromatography-triple quadrupole mass spectrometry (UPLC-QQQ-MS/MS) analysis revealed that the content of non-galloylated catechins and oligomeric PAs decreased significantly in the upper stem and increased significantly in the lower stem, especially catechin (C), epicatechin (EC), and their oligomeric PAs. This result suggests that the related flavonoids were transported downward in the stem by transporters. GC/MS data implied that four types of volatile compounds, namely terpene derivatives, aromatic derivatives, lipid derivatives, and others, accumulated differently after in vitro sucrose treatment.
Conclusions Our data demonstrated that sucrose regulates polyphenol biosynthesis in Camellia sinensis by altering the expression of transcription factor genes and pathway genes. Additionally, sucrose promotes the transport of polyphenols and changes the aroma composition in tea plants.
Electronic supplementary material The online version of this article (10.1186/s12870-018-1335-0) contains supplementary material, which is available to authorized users.


Background
The tea plant [Camellia sinensis (L.) O. Kuntze] is one of the most important economic crops cultivated in China, Japan, India, and other countries. Its leaves are used to make tea, one of the three most widely consumed non-alcoholic beverages in the world because it contains abundant polyphenols, theanine, caffeine, and other secondary metabolites [1]. Among them, tea polyphenols are a collective term for phenolic acids and flavonoids, including flavanols (catechins), anthocyanins, proanthocyanidins (PAs, also named condensed tannins), and other special derivatives. Polyphenols account for 18-36% of the dry weight of tender leaves and are responsible for tea's flavor [2][3][4]. Some studies have suggested that polyphenols play crucial roles in plant stress resistance; for example, they protect the tea plant against pathogens and insects [5,6]. Additionally, polyphenols are the main functional ingredient in tea for preventing cancer, cardiovascular diseases, and obesity [7].
Studies have indicated that polyphenol biosynthesis in plants is influenced by chemical and physical factors, such as nutrients, hormones, and environmental conditions [8][9][10][11][12][13]. Among them, sucrose acts not only as a carbon source for energy storage and sugar transportation, but also as a signal involved in metabolic processes such as anthocyanin synthesis in plants [14,15]. Since the late twentieth century, the effects of sucrose on flavonoid and anthocyanin biosynthesis in grapes and radishes have been studied [16][17][18]. In Arabidopsis thaliana, sucrose induces anthocyanin biosynthesis through the up-regulation of structural genes and positive transcription factors involved in the flavonoid biosynthesis pathway and potentially also through the concurrent down-regulation of the negative transcription factor MYB-LIKE 2 (MYBL2) [19][20][21]. Previous studies also reported that sucrose could act as a signaling molecule, first activating PRODUCTION OF ANTHOCYANIN PIGMENT 1 (PAP1) expression through a sucrose-specific signaling pathway and then triggering the expression of structural genes involved in anthocyanin and flavonoid biosynthesis [14,19,22,23]. The sucrose-specific signaling pathway may be activated by different sugars, such as the disaccharides sucrose and maltose and their breakdown products (glucose and fructose); however, sucrose is the most effective inducer of anthocyanin biosynthesis in Arabidopsis [23]. Liu et al. reported that sucrose induction increases the content of non-galloylated catechins and up-regulates the expression of putative genes involved in their biosynthetic pathway in both tea callus and seedlings [24]. Additionally, Wang et al. reported that sucrose up-regulates the expression of Camellia sinensis FLAVONOID 3′5′-HYDROXYLASE (CsF3′5′H), an important branch-point gene involved in catechin biosynthesis [25].
In this study, test-tube tea plantlets were used to test the effects of sucrose on polyphenol biosynthesis after 2, 7, 14, and 28d of treatment. The results indicated that sucrose can increase the expression of structural genes involved in the biosynthesis of anthocyanins, catechins, and procyanidins. The sucrose-specific induction mechanism in the tea plant is still unclear; one important reason is the lack of information supported by accurate genome annotations.
Next-Generation Sequencing (NGS) based on the Illumina HiSeq 2000 platform provides a fast, cost-effective, and reliable approach to acquire abundant transcripts, especially for non-model organisms without reference genomic sequences [26]. In tea plants, NGS technology has been used to analyze putative genes associated with tea quality and stress response [27][28][29]. Here, NGS was used to investigate the molecular mechanism by which sucrose affects polyphenol biosynthesis in tea plants and to provide a comprehensive analysis of the network of biochemical and cellular processes responding to sucrose.
In addition, we determined whether in vitro sucrose treatment affects the production of volatiles, the second group of compounds, alongside polyphenols, that affect tea taste and flavor.

Effects of sucrose on polyphenol accumulation
Similar sized test-tube tea plantlets were cultured on Murashige and Skoog standard medium (MS, Control) and MS supplemented with 90 mM sucrose (MS + 90 mM sucrose, Suc) for 28d (Fig. 1a). The stem of the plantlets grown on Suc for 9-14d began to turn red (Fig. 1b), while no red pigmentation was observed in the stem of the plantlets grown on MS or MS supplemented with 90 mM mannitol (data not shown). The anthocyanin levels were significantly different only in the lower part of the stem and were 7-fold higher than that in the control (Fig. 1c). Furthermore, the accumulation of total catechins and PAs in various organs of tea plants is affected by sucrose (Fig. 1d). The effects of sucrose treatment on polyphenol accumulation were observed after 7 and 14 days of treatment (Fig. 1d). However, the effects of sucrose on total catechins and PAs accumulation were not observed at 2d treatment (data not shown).
Polyphenols, including phenolic acids, catechin monomers, oligomeric PAs, and flavonols, in different tissues of tea plantlets after 14d of treatment were quantified using UPLC-QQQ-MS/MS (Table 1). Three types of phenolic acids were measured: quinic acid, gallic acid derivatives (β-glucogallin, galloyl acid, and galloylquinic acid), and hydroxycinnamic acid derivatives (caffeoylquinic acid and p-coumaroylquinic acid). The effect of sucrose differed among compounds. For example, sucrose increased the content of galloylquinic acid, a special phenolic acid in the tea plant, in most parts of the plants except the bud. However, the content of β-glucogallin, the precursor of galloylated catechins, significantly decreased by 84% in buds and by 71% in upper stems [30]. Monomers of flavanols (catechins) can be classified into non-galloylated and galloylated catechins and mainly exist in buds and upper stems. More non-galloylated catechins accumulated in buds and lower stems after sucrose treatment; however, their content in upper stems decreased significantly. Catechin (C) and epicatechin (EC) decreased by 69% in upper stems. The galloylated catechin content in buds and lower stems was not affected by sucrose, and its content in the 3rd leaf and upper stem decreased by 19%. Seven types of oligomeric PAs accumulated in the bud and 3rd leaf. Their content in lower stems increased 3-fold. However, their content in upper stems significantly decreased after sucrose treatment; for example, B2 (an oligomer of C or EC) decreased by 81%. The content of flavonols in the tea plant was also affected by sucrose. Among them, the flavonol with di-hydroxyl groups on the B-ring was significantly affected by sucrose: its amount decreased by almost 40% in the third leaf and upper stems and by 14% in buds, whereas its content increased by 1-fold in the lower stem.

Effects of sucrose on volatile compounds
Four types of volatile compounds were measured using GC/MS: terpene derivatives, aromatic derivatives, lipid derivatives, and other compounds. The effect of sucrose on their accumulation differed (see Additional file 1: Table S1). For example, the content of α-farnesene, which belongs to the sesquiterpenoids, increased 5.77-fold; the expression of one transcript (Unigene46443), predicted to be the key biosynthetic gene encoding farnesene synthase, was significantly up-regulated 3-fold after 2 and 14 days of sucrose treatment (see Additional file 2: Table S2). Here, 33 terpene derivatives were detected and classified into monoterpenoids, sesquiterpenoids, and diterpenoids; these compounds are biosynthesized via the methylerythritol phosphate (MEP) and mevalonate (MVA) pathways (see Additional file 3: Figure S1). The expression of HMGR (CL12062.Contig1), DXS (Unigene57617), and DXR (Unigene46601), key genes of the terpenoid backbone pathway, was up-regulated by sucrose. The expression of one transcript (CL1850.Contig3, encoding linalool synthase) was not significantly affected by sucrose, and the content of linalool and geraniol in tea leaves decreased by only 4%.
Fig. 1 Effects of sucrose on polyphenol accumulation in test-tube tea plantlets. a. Test-tube tea plantlets; b. Red pigments accumulated in stems of plantlets after feeding sucrose; c. Anthocyanin levels are significantly different in the lower part of the stem; d. Accumulation of total catechins and PAs in various organs after 7, 14 and 28 d sucrose treatment. Note: * indicates significance at P < 0.05. The data represent the mean value of three biological replicates.
Note: ND indicates that the polyphenol was not detected; the data represent the mean value of three biological replicates. Digit indicates the ratio of Suc/Control.
Additionally, the expression of 1 transcript (Unigene9305 encoding (E)-nerolidol synthase) was up-regulated by sucrose after 2d; however, its expression was down-regulated by sucrose after 14d; and the content of the (E)-nerolidol only decreased by 5%.

Effects of sucrose on the expression of key structural genes related to polyphenol biosynthesis using qRT-PCR
For further analysis of the effects of sucrose on polyphenol biosynthesis at the transcriptional level, quantitative real-time PCR (qRT-PCR) was used to test the expression of 11 key structural genes involved in the polyphenol biosynthetic pathway (Fig. 2). Their expression significantly increased 3-fold after 2d of treatment. After 7d, the expression of Chalcone synthase (CHS), Flavanone 3-hydroxylase (F3H), Flavonoid 3′-hydroxylase (F3′H), Leucoanthocyanidin reductase (LAR), and Anthocyanidin reductase (ANR) increased 1-fold. After 14d, the effect of sucrose on the above genes was less noticeable.

Sequencing, de novo assembly, and functional annotation
To obtain the overall transcriptional levels of genes in tea plants treated with sucrose for 2 and 14d, four normalized cDNA libraries (2d: 2nd D Control and Suc; 14d: 14th D Control and Suc) were constructed for transcriptome sequencing. On the Illumina HiSeq 2000 platform, a total of 21,381,193,620 nucleotide (nt) bases were generated from all libraries, and about 237.6 million clean reads (94.94% of the raw reads) were obtained for de novo assembly (see Additional file 4: Table S3). Finally, a total of 118,843 transcripts were obtained with an average length of 1212 nt and an N50 of 1999 nt (see Additional file 5: Table S4).
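The N50 reported above is the length such that transcripts of that length or longer contain at least half of all assembled bases. A minimal Python sketch of that calculation follows; the function name `n50` and the toy lengths are illustrative assumptions, not data from this study.

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (in nt)."""
    if not lengths:
        raise ValueError("at least one length is required")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        # The first length at which the running sum covers half of all bases.
        if running >= half_total:
            return length


# Toy example with hypothetical transcript lengths.
toy_lengths = [100, 200, 300, 400, 500]
print(n50(toy_lengths))
```

An N50 well above the mean length, as in the assembly described here (1999 nt versus 1212 nt), indicates that most assembled bases sit in the longer transcripts.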
To predict the functions of the assembled transcripts, a total of 82,459 transcripts (69.38% of all assembled Unigenes) were annotated using the NR (Non-redundant protein database), NT (Non-redundant nucleotide database), Swiss-Prot (Annotated protein sequence database), KEGG (Kyoto encyclopedia of genes and genomes), COG (Clusters of orthologous groups of protein), and GO (Gene ontology) databases based on two levels of sequence similarity, sequence-based and domain-based alignments, with an e-value < 1e-5 (see Additional file 6: Table S5).

Analysis of DEGs responding to sucrose
Using the fragments per kb per million reads (FPKM) method, the DEGs between two samples were identified with a significance threshold of |log2 Ratio (FPKM Control-vs-Suc)| ≥ 1 and a false discovery rate (FDR) of ≤0.001, based on a P-value threshold of ≤1e-5. A total of 8384 DEGs were detected in 2nd D Control-vs-Suc. Among them, 6187 DEGs (73.80% of the total DEGs) were up-regulated. A total of 5571 DEGs were detected in 14th D Control-vs-Suc, and only 2146 DEGs (38.52% of the total DEGs) were up-regulated (see Fig. 3).
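The two-part DEG criterion above (absolute log2 ratio of at least 1, FDR of at most 0.001) can be sketched in Python as follows. The function name, the Suc/Control orientation of the ratio, and the small pseudocount used to avoid division by zero are assumptions made for illustration; the source does not describe how zero FPKM values were handled.

```python
import math


def classify_deg(fpkm_control, fpkm_suc, fdr,
                 min_abs_log2=1.0, max_fdr=0.001, pseudocount=0.01):
    """Label one transcript 'up', 'down', or 'ns' (Suc relative to Control)."""
    # Assumption: a pseudocount keeps the ratio defined for unexpressed transcripts.
    log2_ratio = math.log2((fpkm_suc + pseudocount) / (fpkm_control + pseudocount))
    if fdr > max_fdr or abs(log2_ratio) < min_abs_log2:
        return "ns"  # not significant under the stated thresholds
    return "up" if log2_ratio > 0 else "down"


# Hypothetical FPKM values and FDRs, not taken from the study.
print(classify_deg(10.0, 40.0, 1e-6))   # about 4-fold higher in Suc
print(classify_deg(40.0, 10.0, 1e-6))   # about 4-fold lower in Suc
print(classify_deg(10.0, 40.0, 0.01))   # fails the FDR threshold
```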

GO function and KEGG pathways analysis of DEGs responding to sucrose
To better understand the biological functions of DEGs responding to sucrose, GO and KEGG analyses were performed for comparisons of 2nd D Control-vs-Suc and 14th D Control-vs-Suc. GO functional enrichment analysis indicated that 49 and 48 GO terms were classified into three ontologies which changed significantly between   Figure S2).
A total of 3553 DEGs (7.46% of all the transcripts aligned to the KEGG database) were annotated, and 29 KEGG pathways were significantly enriched in the 2nd D Control-vs-Suc comparison based on a Q-value of ≤0.05. Among them, the most enriched pathway was "flavonoid biosynthesis" (Table 2). In the 14th D Control-vs-Suc comparison, 2009 DEGs (4.22% of all the transcripts aligned to the KEGG database) were annotated, and 20 KEGG pathways were significantly enriched at the same threshold. The most enriched pathway was "plant-pathogen interaction" (Table 3). A total of 17 KEGG-enriched pathways were common to the 2nd D and 14th D Control-vs-Suc comparisons. Of the 12 KEGG pathways specific to the 2nd D Control-vs-Suc comparison, one was anthocyanin biosynthesis (Fig. 4).
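The source states only that pathways were called enriched at a Q-value of ≤0.05 and does not name the underlying test. One common choice for pathway enrichment is the hypergeometric test, sketched below in Python as an assumption rather than as the authors' method; the function and argument names are hypothetical, and the resulting P-values would still need multiple-testing correction to give Q-values.

```python
from math import comb


def hypergeom_enrichment_p(total_annotated, pathway_size, deg_count, deg_in_pathway):
    """P(X >= deg_in_pathway) when deg_count transcripts are drawn without
    replacement from total_annotated transcripts, pathway_size of which
    belong to the pathway of interest."""
    upper = min(pathway_size, deg_count)
    numerator = sum(
        comb(pathway_size, i) * comb(total_annotated - pathway_size, deg_count - i)
        for i in range(deg_in_pathway, upper + 1)
    )
    return numerator / comb(total_annotated, deg_count)


# Toy numbers (hypothetical): 10 annotated transcripts, 4 in the pathway,
# 3 DEGs, 2 of which fall in the pathway.
print(hypergeom_enrichment_p(10, 4, 3, 2))
```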

Effects of sucrose on polyphenol biosynthesis based on transcriptome sequencing
Based on the ratio of FPKM Control-vs-Suc, most of the transcripts involved in the phenylpropanoid and flavonoid pathways were up-regulated 2-fold or more after 2d of treatment. Additionally, the expression of transcripts annotated as Phenylalanine ammonia-lyase (PAL), Dihydroflavonol 4-reductase (DFR), LAR, and Anthocyanidin synthase (ANS) was notably up-regulated. After 14 days of treatment, the expression of only PALB increased 1-fold, whereas the others were not affected by sucrose (Fig. 5). These results indicate that tea polyphenol biosynthesis is comprehensively affected by sucrose.

Effects of sucrose on the expression of transcription factors involved in polyphenol biosynthesis based on transcriptome sequencing
Polyphenol biosynthesis in plants is regulated by transcription factors (TFs) including R2R3-MYB, bHLH, and WD40 [31,32]. In this study, 37 DEGs were predicted to be MYB members and were classified into three types: R1 (4 DEGs), R2R3 (29 DEGs), and R1R2R3 (4 DEGs). Most DEGs (23/37) were up-regulated after sucrose treatment for 2 days, and only five DEGs were up-regulated after sucrose treatment for 14 days (Table 4). Additionally, in a phylogenetic tree, the 29 R2R3-MYBs and 126 Arabidopsis R2R3-MYBs were classified into 13 subgroups (see Additional file 8: Figure S3). Phylogenetic analysis indicated that 33 bHLHs were dispersed into 15 subfamilies (see Additional file 9: Figure S4), and 21 of them were up-regulated after sucrose treatment for 2d (Table 5). The R2R3-MYB, bHLH, and WD40 TFs could act as regulators of polyphenol biosynthesis individually or jointly. The R2R3-MYBs in Subgroup (Sg) 4 and Sg7 were predicted to be negative and positive regulators, respectively, controlling the production of flavonols by regulating the up-stream genes of the polyphenol biosynthetic pathway [33,34].
However, the R2R3-MYBs in Sg5 and Sg6 require both bHLH (subfamilies 2, 5, and 24) and WD40 to form a ternary MYB-bHLH-WD40 (MBW) complex that positively regulates down-stream genes of the polyphenol biosynthetic pathway [31,35,36]. Here, 7 DEGs were classified into the above-mentioned 4 subgroups of R2R3-MYBs. After 2d of sucrose treatment, the expression of 3 DEGs (Unigene12085, Unigene41846, and CL8695.Contig1) in Sg6 and Sg5 was significantly up-regulated 6-fold, and the expression of CL13057.Contig2 in Sg4 was significantly down-regulated (Fig. 6a). Additionally, 2 DEGs (Unigene21617 and Unigene5385) in Subfamily 5 of the bHLHs were up-regulated by sucrose (Fig. 6b). Based on the same method, only one WD40 transcript (Unigene25483) was predicted to be involved in the MBW complex, and its expression was not affected by sucrose (Fig. 6c).

Effects of sucrose on the expression of genes involved in polyphenol transport
In plants, transporters (ABCs and MATEs) and GSTs are involved in polyphenol transport. Examples from many species include the Arabidopsis TT19 and TT12 genes (AtTT19; AtTT12), the grape GST and ABCC1 genes (VvGST19; VvABCC1), the maize MRP3 gene (ZmMRP3), and the Medicago truncatula MATE (MtMATE) [37][38][39][40][41][42]. In the present study, 22, 15, and 21 DEGs were predicted to encode GSTs, ABC transporters, and MATE transporters, respectively. Phylogenetic analysis showed three transcripts closely corresponding to the above three transporter types (Fig. 7). Among them, the expression of the ABC (CL11884.Contig7) and MATE (Unigene47970) transcripts decreased significantly after 2d of sucrose treatment and increased after 14d (Additional file 10: Table S6). In contrast, the GST transcript (Unigene24131) responded to sucrose in the opposite manner (Additional file 10: Table S6). These results indicate that different transporters and GSTs could be involved in transporting polyphenols in tea plants.

Using qRT-PCR for transcriptome sequencing validation
To validate the results of transcriptome sequencing, 30 DEGs were randomly selected for analysis by qRT-PCR. The expression of 83.33% of the tested transcripts, including 11 genes involved in polyphenol biosynthesis, was consistent with the results from transcriptome sequencing. Detailed information regarding the selected DEGs and the 11 genes is presented in Additional file 11: Figure S5.

Discussion
The mechanisms of sucrose effects on tea polyphenol biosynthesis
In the past decades, the exploration of tea polyphenol biosynthesis and its influencing factors has become a hotspot for research in plant secondary metabolism [30,43]. Due to self-incompatibility, rich genetic diversity, and the large genome of tea plants, little genomic information is available and the molecular mechanisms of tea polyphenol biosynthesis are still unclear [44,45]. Our previous research demonstrated that tea polyphenols share biosynthetic pathways with other plants, such as the shikimic acid, phenylpropanoid, and flavonoid pathways [2]. Their biosynthesis is also affected by sucrose, light, and other factors [24,46]. Studies have demonstrated sucrose-specific transcriptional regulation of polyphenol biosynthesis in plants. For example, Boss et al. reported that the expression of DFR, which is involved in anthocyanin and PA biosynthesis in grape, was induced by sucrose treatment, and they speculated that the accumulation of the two metabolites in grape berry skin could be attributed to sugar accumulation during grape berry development [47]. Microarray data revealed that anthocyanin biosynthesis in Arabidopsis is stimulated by sucrose, which acts as a signal to activate PAP1, a TF that activates the expression of structural genes involved in the anthocyanin biosynthetic pathway, such as PAL, Cinnamate 4-hydroxylase (C4H), 4-coumaroyl-CoA ligase (4CL), and others [19,23]. However, the structural gene F3′5′H and the transcription factor PAP2 are not affected by sucrose [19]. In tea plants, Wang et al. found that the expression of CsF3′5′H increased 15-fold after sucrose feeding [25]. Liu et al. reported that sucrose induced the accumulation of catechins and up-regulated the expression of putative genes involved in their biosynthetic pathway [24].
In this study, the total content of catechins and PAs significantly increased after 7d of sucrose induction, and the accumulation of anthocyanin increased 7-fold in the stems of tea plantlets after 14d of sucrose treatment. After only 2d of treatment, the expression of structural genes involved in their biosynthesis was up-regulated, based on qRT-PCR and transcriptome sequencing. After 14d, the effects of sucrose were not detected. In Arabidopsis, the correct expression of BANYULS (BAN), a key gene of PA biosynthesis, requires TT2 (AtMYB123, an R2R3-MYB TF encoded by the TRANSPARENT TESTA2 gene) and TT8 (AtbHLH42, a bHLH TF encoded by the TRANSPARENT TESTA8 gene) together with TTG1 (AtTTG1, a WD-repeat protein encoded by the TRANSPARENT TESTA GLABRA1 gene) [48][49][50]. TT2 cannot be replaced by any other AtMYB [51]. Additionally, the genes of Sg4, 5, 6, and 7 R2R3-MYB and of Subfamily 2, 5, and 24 bHLH are all involved in flavonoid biosynthesis [35,52]. Based on amino acid sequence alignment, 7 R2R3-MYBs and 4 bHLHs are predicted to participate in flavonoid biosynthesis in tea plants [53]. In the present study, seven DEGs were classified into the aforementioned four subgroups of the R2R3-MYBs and four DEGs into bHLH subfamilies 5 and 2. Among them, the expression of 3 transcripts (Unigene12085, Unigene41846, and CL8695.Contig1) in R2R3-MYB Sg6 and Sg5 was up-regulated 6-fold; this finding is consistent with studies indicating that sucrose can induce the expression of PAP1/MYB75, which is essential for sucrose-induced anthocyanin biosynthesis [19,23,48,54].
Note: "a" indicates significant up-regulation; "-" indicates no difference; "b" indicates significant down-regulation. "Unknown" and "other" indicate that the Unigene is not grouped.
In addition, Unigene5385 corresponded to TT8 and its expression was significantly increased by sucrose treatment for 2d, indicating that it might be involved with others in regulating the accumulation of anthocyanins and PAs [55,56]. Notably, only one transcript (Unigene25483) corresponds closely to AtTTG1, consistent with the results reported in C. sinensis [53]. However, it was not affected by sucrose, possibly because WD40 proteins have no catalytic activity and act as docking platforms for MYB and bHLH proteins in regulating flavonoid biosynthesis [48,51,53,57].
As described above, it is inferred the accumulation of tea polyphenol might be directly due to high expression of their structural genes which could be synergistically regulated by TFs.

The mechanisms of sucrose effects on tea polyphenol transport
Based on UPLC-QQQ-MS/MS analysis, the non-galloylated catechins and oligomeric PAs were significantly induced by sucrose in the bud, 3rd leaf, and lower stems after 14d of treatment; however, their content in upper stems decreased significantly, especially C, EC, and their oligomeric PAs. This suggests that flavonoids were transported within the tea plants. Extensive research shows that GST, ABC, and MATE transporters could be involved in flavonoid transport and that there are at least three mechanisms: GST-linked transport, vesicle trafficking (VT), and MATE transporters [38,39,42,[58][59][60][61]. In the present study, only three transcripts annotated as GST, ABC, and MATE were predicted to be involved in flavonoid transport, and their expression was affected differently by sucrose. From these results, we infer that a variety of proteins act together to transport polyphenols in tea plants. However, the molecular mechanisms remain unclear.

Impact of sucrose on the volatile
It is known that the flavor of tea is basically determined by taste (non-volatile compounds) and aroma (volatile compounds) [62]. Tea polyphenols are crucial for tea taste, and the terpene derivatives, including monoterpenoids and sesquiterpenoids, are important aroma ingredients due to their delectable fruit fragrance and low detection threshold [63]; for example, linalool and geraniol have fruity and sweet floral scents [62]. Previous research identified linalool, geraniol, nerolidol, ionone, and jasmone as odour-active in many types of green teas [64,65]. In the present study, (Z)-jasmone and β-ionone content increased by 2.63 and 0.57-fold, respectively; however, linalool, geraniol, and nerolidol were not significantly affected by sucrose. Because the biosynthetic pathways of volatile compounds are complicated, the molecular mechanisms by which sucrose affects volatile compounds need further study.

Conclusions
In this paper, test-tube tea plantlets were used to investigate the effects of sucrose on polyphenol biosynthesis. Metabolomic and transcriptomic analyses indicated that sucrose up-regulates the biosynthesis of anthocyanins, catechins, and PAs. Sucrose controls the expression of structural and regulatory genes. Additionally, sucrose promotes the transport of polyphenols in Camellia sinensis through the predicted GST, ABC, and MATE transporters involved in polyphenol transport. In summary, these results and analyses present valuable resources for better understanding the biosynthesis molecular mechanisms underlying the main

Plant materials and cultivation conditions
The test-tube tea plantlets [Camellia sinensis (L.) O. Kuntze var. cultivar Nongkangzao] were initially grown in vitro on classical solid MS medium and then transferred to solid MS supplemented with 90 mM sucrose for sucrose feeding studies with 10 h of light (42 μmol m⁻² s⁻¹) at 24 ± 1°C. Correspondingly, similar-sized test-tube tea plantlets were transferred to classical solid MS medium as the control under the same conditions. As an osmotic control, tea plantlets were incubated on MS supplemented with 90 mM mannitol. For metabolic analysis of polyphenols, samples of different organs (the buds, third leaves, and the upper and lower stems) were collected from the tea plantlets after 2, 7, 14, and 28d of cultivation. Meanwhile, leaf samples were also collected from the tea plantlets after 2, 7, 14, and 28d of cultivation for analysis of polyphenol biosynthesis at the transcriptional level. All the collected samples were immediately frozen in liquid nitrogen and stored at −80°C until use. In this study, approximately 10 independent tea plants were collected for one biological replicate, and three biological replicates were used for analysis.

Extraction and quantitative analysis of the polyphenol
Extraction and quantitative analysis of the polyphenol was performed with UPLC-QQQ-MS/MS as suggested by Jiang et al. [2]. The total catechins were extracted and quantitatively analyzed using 1% vanillin-HCl (w/v) according to the methods described by Wang et al. [66].
Spectrophotometry analysis of anthocyanins was carried out as described by Pang et al. and the molar absorbance of cyanidin-3-O-glucoside was used for calculating the total anthocyanin concentration [67].
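Converting an absorbance reading to a molar concentration via a molar absorbance rests on the Beer-Lambert law, A = ε·c·l. A minimal Python sketch follows. The function name, the example absorbance, and the extinction coefficient passed in the example (26,900 L mol⁻¹ cm⁻¹, a value commonly cited for cyanidin-3-O-glucoside) are illustrative assumptions and are not values reported in this study.

```python
def molar_concentration(absorbance, molar_absorptivity,
                        path_length_cm=1.0, dilution_factor=1.0):
    """Beer-Lambert law: c = A * DF / (epsilon * l), in mol/L."""
    if molar_absorptivity <= 0 or path_length_cm <= 0:
        raise ValueError("molar absorptivity and path length must be positive")
    return absorbance * dilution_factor / (molar_absorptivity * path_length_cm)


# Hypothetical reading; epsilon is an assumed literature value, not from this paper.
concentration = molar_concentration(absorbance=0.538, molar_absorptivity=26900.0)
print(f"{concentration:.2e} mol/L")
```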
The total PAs were extracted and quantitatively analyzed by spectrophotometry using the methods reported by Jiang et al., and their concentration was calculated from a standard curve of procyanidin B2 [2].

Extraction and analysis of the volatile compounds
Volatile compounds from leaf samples of tea plantlets cultivated for 14 d were extracted with a headspace-solid phase microextraction (HS-SPME) fiber and analyzed by gas chromatography (Agilent 7697A) coupled with mass spectrometry (Agilent 7890A) (GC/MS). In brief, 0.3 g of leaf sample was cut up and placed in a 20 mL headspace bottle, and 4 mL of boiling double-distilled water containing 0.8 g of dissolved KCl was added. After incubation for 1.5 min, the volatile compounds were collected using a 50/30 μm DVB/CAR/PDMS SPME fiber (Supelco, PA, USA) for 50 min at 70°C and then desorbed into the GC injection port at 250°C for 5 min. Subsequently, the volatile compounds were resolved on a DB-5 capillary column (30 m × 0.25 mm × 0.25 μm, Agilent) for GC/MS analysis according to Han et al. [64].
Note: The phylogenetic tree was constructed based on amino acid sequences using MEGA5 according to the neighbor-joining method. All protein sequences used in this figure were provided in Additional file 13: Txt S1.

RNA extraction and qRT-PCR analysis
Total RNA was extracted as described by Zhao et al. [53]. The RNA concentration, quality, and integrity were measured by spectrophotometry (Agilent 2100) and gel electrophoresis. Single-stranded complementary deoxyribonucleic acid (cDNA) was synthesized using PrimeScript™ (Takara, Dalian, Code: DRR037A) for qRT-PCR analysis. All the primer sequences were designed using Primer Premier 6.0, and the selected Unigene IDs are detailed in Additional file 12: Table S7. The qRT-PCR assays were performed using a CFX96™ optical reaction module (Bio-Rad, USA), and the detection system was the same as previously described by Zhao et al. [53]. The relative expression values were normalized against the housekeeping gene glyceraldehyde-3-phosphate dehydrogenase (GAPDH) and calculated from the mean value of three biological and three technical replicates by the 2^(−ΔΔCT) method [68].
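The 2^(−ΔΔCT) calculation normalizes the target gene's CT to the reference gene (GAPDH here) within each sample and then compares the treated sample with the control. A minimal Python sketch follows; the function name and the CT values are hypothetical, and the sketch assumes roughly equal amplification efficiency for target and reference, as the method itself does.

```python
def relative_expression(ct_target_treated, ct_ref_treated,
                        ct_target_control, ct_ref_control):
    """Fold change of the target gene in treated vs control by 2^(-ddCT)."""
    delta_treated = ct_target_treated - ct_ref_treated   # dCT in the treated sample
    delta_control = ct_target_control - ct_ref_control   # dCT in the control sample
    delta_delta = delta_treated - delta_control          # ddCT
    return 2 ** (-delta_delta)


# Hypothetical mean CT values: the target crosses threshold 2 cycles earlier
# in the treated sample once both are normalized to the reference gene.
fold = relative_expression(22.0, 18.0, 24.0, 18.0)
print(fold)
```

In practice the CT values passed in would be means over the technical replicates of each biological replicate.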

Library construction, RNA-seq and de novo assembly
Library construction and de novo assembly were performed by the Beijing Genomics Institute (BGI; Shenzhen, China). Briefly, the mRNA isolated from the total RNA was fragmented into smaller pieces to create templates for synthesizing the first-strand cDNA. Using the first-strand cDNA as templates, the double-stranded cDNA was produced with random primers (Takara, Japan). Subsequently, these cDNA fragments were processed by end repair using DNA polymerase and polynucleotide kinase and by ligation of adapters to produce approximately 200 bp fragments. Finally, these fragments were purified using the QIAquick Gel Extraction Kit (Qiagen) and enriched by PCR to construct the cDNA libraries.
In this study, four cDNA libraries (2d: 2nd D Control and Suc; 14d: 14th D Control and Suc) were examined using the Agilent 2100 Bioanalyzer and sequenced using the Illumina HiSeq™ 2000. Clean reads were obtained from the raw reads by removing low-quality reads, reads containing adaptors, and reads in which unknown nucleotides exceeded 5%. The clean reads of each library were assembled separately with the short-read assembly program Trinity, and Unigenes were the sequences remaining after redundancy and short contigs were removed.
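Two of the read-cleaning rules above (discard reads containing adaptor sequence, discard reads with more than 5% unknown nucleotides) can be sketched in Python as follows. The function name and the adaptor string are hypothetical, and the low-quality filter is omitted because the source does not state its quality cutoff.

```python
def is_clean(read, adapter=None, max_n_fraction=0.05):
    """Return True if a read passes the adaptor and unknown-base filters."""
    if not read:
        return False
    sequence = read.upper()
    # Discard reads that contain the (hypothetical) adaptor sequence.
    if adapter and adapter.upper() in sequence:
        return False
    # Discard reads whose fraction of unknown bases (N) exceeds the limit.
    n_fraction = sequence.count("N") / len(sequence)
    return n_fraction <= max_n_fraction


# Toy reads, not real sequencing data.
reads = ["ACGTACGTACGTACGTACGN", "ACGTNACGTNACGTNACGTN", "ACGTAGATCGGAAGACGT"]
kept = [r for r in reads if is_clean(r, adapter="AGATCGGAAG")]
print(kept)
```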

Bioinformatics analysis of the assembled Unigenes
Using BLASTx (E-value < 1e-5) against the NR, NT, GO, Swiss-Prot, COG, and KEGG databases, the assembled Unigenes were annotated for functional analysis, and their expression levels were calculated as fragments per kb per million reads (FPKM). Differentially expressed genes (DEGs) were identified with a significance threshold of |log2 Ratio of FPKM (Control-vs-Suc)| ≥ 1 and FDR ≤ 0.001, based on a P-value threshold of ≤1e-5. Based on FDR ≤ 0.05, KEGG pathway analysis was performed to ascertain the main biochemical and signal transduction pathways of the DEGs.
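FPKM scales the fragment count of a Unigene by its length in kb and by the sequencing depth in millions of mapped fragments, i.e. FPKM = 10^9 · C / (N · L). A minimal Python sketch of this standard definition follows; the function and argument names and the example counts are illustrative assumptions.

```python
def fpkm(fragments_on_gene, total_mapped_fragments, gene_length_nt):
    """FPKM = 1e9 * C / (N * L).

    C: fragments mapped to the Unigene
    N: total mapped fragments in the library
    L: Unigene length in nt
    """
    if total_mapped_fragments <= 0 or gene_length_nt <= 0:
        raise ValueError("library size and gene length must be positive")
    return 1e9 * fragments_on_gene / (total_mapped_fragments * gene_length_nt)


# Hypothetical counts: 500 fragments on a 1000-nt Unigene in a library of
# 20 million mapped fragments.
print(fpkm(500, 20_000_000, 1000))
```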

Phylogenetic analysis of transcription factors and transport proteins involved in polyphenols
The phylogenetic trees for transcription factors and transport proteins were constructed as described by Zhao et al. [53]. Briefly, MEGA 5.0 software was used for the phylogenetic analysis, and the neighbor-joining method was applied to amino acid sequences. The bootstrap method with 1000 replicates was used to evaluate the tree nodes. Evolutionary distances were computed using the p-distance method. All the sequences used for the alignment were retrieved from The Arabidopsis Information Resource (TAIR, Carnegie Institution for Science Department of Plant Biology, USA), the UniProt Database (UniProt, Switzerland), and the National Center for Biotechnology Information (NCBI, USA).
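The p-distance between two aligned sequences is the proportion of compared sites at which they differ. A minimal Python sketch follows; the function name and toy sequences are hypothetical, and skipping any site with a gap in either sequence (pairwise deletion) is an assumption, since the source does not state how gaps were treated.

```python
def p_distance(seq_a, seq_b, gap="-"):
    """Proportion of differing sites between two aligned sequences."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    differing = 0
    for a, b in zip(seq_a, seq_b):
        if a == gap or b == gap:
            continue  # assumption: sites with a gap are skipped
        compared += 1
        if a != b:
            differing += 1
    if compared == 0:
        raise ValueError("no comparable sites")
    return differing / compared


# Toy aligned amino acid sequences (hypothetical).
print(p_distance("MKTAY", "MRTAF"))
```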