- Open Access
Transcriptome and metabolome reveal the accumulation of secondary metabolites in different varieties of Cinnamomum longepaniculatum
BMC Plant Biology volume 22, Article number: 243 (2022)
Cinnamomum longepaniculatum (Gamble) N. Chao ex H. W. Li, whose leaves produce essential oils, is a traditional Chinese medicine and economically important tree species. In our study, two C. longepaniculatum varieties that have significantly different essential oil contents and leaf phenotypes were selected as the materials to investigate secondary metabolism.
The essential oil content and leaf phenotypes were different between the two varieties. When the results of both transcriptome and metabolomic analyses were combined, it was found that the differences were related to phenylalanine metabolic pathways, particularly the metabolism of flavonoids and terpenoids. The transcriptome results based on KEGG pathway enrichment analysis showed that pathways involving phenylpropanoids, tryptophan biosynthesis and terpenoids significantly differed between the two varieties; 11 DEGs (2 upregulated and 9 downregulated) were associated with the biosynthesis of other secondary metabolites, and 12 DEGs (2 upregulated and 10 downregulated) were related to the metabolism of terpenoids and polyketides. Through further analysis of the leaves, we detected 196 metabolites in C. longepaniculatum. The abundance of 49 (26 downregulated and 23 upregulated) metabolites differed between the two varieties, which is likely related to the differences in the accumulation of these metabolites. We identified 12 flavonoids, 8 terpenoids and 8 alkaloids and identified 4 kinds of PMFs from the leaves of C. longepaniculatum.
The combined results of transcriptome and metabolomic analyses revealed a strong correlation between metabolite contents and gene expression. We speculate that light leads to differences in the secondary metabolism and phenotypes of leaves of different varieties of C. longepaniculatum. This research provides data for secondary metabolite studies and lays a solid foundation for breeding ideal C. longepaniculatum plants.
Plants can synthesize a large diversity of secondary metabolites, which are notable for their beneficial biological activities that help plants colonize diverse and challenging environments. Secondary metabolites include major groups such as flavonoids, terpenoids, and alkaloids. Flavonoids are low-molecular-weight secondary metabolites that have antioxidant, antiproliferative [1, 2], inflammatory response-regulating  and antilithiatic  effects. Many studies have shown that flavonoids are synthesized in plants via the phenylpropanoid and acetate–malonate metabolic pathways . Many flavonoid biosynthesis-related genes have been studied in depth, including those encoding phenylalanine ammonia lyase (PAL), cinnamate 4-hydroxylase (C4H), 4-coumaroyl CoA ligase (4CL) and MYB transcription factors . In addition, flavonoids can negatively regulate auxin transport . Polymethoxyflavones (PMFs) constitute a group of flavonoids that exert beneficial biological activities for humans, such as anticancer , antilipogenic, antitumor , regulate gut microbiome , and anti-inflammatory activities . Since essential oils have similar functions, we hypothesize that there are PMFs in Cinnamomum longepaniculatum, but there are few reports about which flavonoids are present in the leaves of C. longepaniculatum.
Terpenoids (isoprenoids) are a class of important chemicals produced by plants [12, 13]. Many terpenoids have a long history of being used as flavour-enhancing compounds, pharmaceuticals, insecticides, and industrial compounds . Terpenoids in plants can be divided into monoterpenes (C10), sesquiterpenes (C15), diterpenes (C20), triterpenes (C30), tetraterpenes (C40) and polyterpenes . Plants use two independent pathways to produce terpenoids: the cytosolic mevalonic acid (MVA) pathway, and the plastidial methylerythritol phosphate (MEP) pathway [16,17,18]. During the last few decades, there has been increasing interest in the terpenes of Cinnamomum plants, mainly focusing on analysis of the main compounds of essential oil extracted from leaves [12, 14]. Many studies have attempted to evaluate the antibacterial activity of terpenoids that constitute essential oils [19, 20]. In recent years, many studies have focused on analysing the transcriptomes related to specific terpenoids in leaf metabolic pathways and their regulatory mechanisms  and the differential accumulation of terpenoids in different Cinnamomum chemotypes . However, combined metabolome and transcriptome profiling of terpenoids in different varieties of C. longepaniculatum has not been reported.
Cinnamomum longepaniculatum (Gamble) N. Chao ex H. W. Li  is an evergreen tree species . C. longepaniculatum is extensively cultivated in southwestern China, especially in the Yibin region (Sichuan, China). C. longepaniculatum leaves and twigs are harvested for essential oil extraction. Research on essential oils has mainly focused on their composition , especially their antibacterial activities  and endophytic fungi . Terpenoids (monoterpenes and sesquiterpenes) produced during plant secondary metabolism are the major components of essential oils . However, we know little about other secondary metabolites present in the leaves of C. longepaniculatum.
In this study, two different varieties of C. longepaniculatum, namely, CLH and CLL, were selected as the materials to study the secondary metabolite differences. Previous research has shown that the production of essential oils significantly differs among varieties, so we used transcriptomic and metabolomic data from functional leaves from different varieties to investigate their secondary metabolism. This research not only provides data for C. longepaniculatum secondary metabolite studies but also lays a solid foundation for breeding ideal C. longepaniculatum plant types.
Leaf essential oil content and phenotypic changes of different varieties
We sampled fresh leaves from different varieties for three years and extracted essential oils in the same way. Under the same climatic conditions, the essential oil contents of the different varieties of C. longepaniculatum were different. The results showed that the essential oil content of CLH was significantly higher than that of CLL (Table 1). Many differences in the leaves of the two varieties were detected. The leaf area of CLH was greater than that of CLL, but the difference was not significant (Fig. 1A). We compared cross-sections of leaves from the different varieties by paraffin sectioning. The cross-section of the midrib revealed a biconvex shape, and the average cross-section area for CLH was 82.94 μm2, while that for CLL was 72.82 μm2. We also measured the leaf thickness, and the average for CLH (226.57 μm) was significantly greater than that for CLL (189.10 μm). Many ground parenchyma (gp) cells were observed on the veins of the adaxial sides of the leaves. The vascular system was composed of two vascular bundles, which were arranged in the open arc shape, and the middle was very large. Secretory idioblasts (Si) were present between the midrib and lateral vein of the CLH leaves, but these were not present in the CLL leaves (Fig. 1B, C). The mesophyll comprised one layer of the epidermis. Small collateral vascular bundles were bordered by the endoderm and penetrate through the spongy parenchyma (Fig. 1D, E).
Analysis of transcriptomic data
To explore the molecular mechanisms underlying secondary metabolism, we compared transcriptome between the different varieties. A robust data set was generated after data processing: 48.63 million and 46.68 million high-quality reads were obtained from CLH and CLL, respectively; moreover, 7.34 and 7.05 Gb were obtained, and the GC contents were 45.59 and 45.74%, respectively. The sequencing statistics of the 6 RNA generated libraries are listed in Table 2. BUSCO analysis showed that the completeness of C. longepaniculatum leaf transcriptome was 95.2%, indicating the excellent continuity of the assembly (Table 3). To compare the gene expression in CLH with that in CLL, Venn diagram analysis showed that 34,503 genes were present in both comparison groups (Fig. S1).
After comparing CLH and CLL, we identified 1486 differentially expressed genes (DEGs) (P value < 0.05, fold-change (FC)>2), all of which may be related to the differences in secondary metabolism. Among these DEGs, 1357 were upregulated, and 129 were downregulated; the corresponding FCs and P value are listed in Table S1. Through matching with the Gene Ontology (GO), SwissProt, Clusters of Orthologous Groups of proteins (COG) and Pfam database information, 1208 DEGs (%) were identified (Fig. 2). According to functional classification, 1007 DEGs with known functions were further partitioned into 5 initial categories, and 645 DEGs were found to be involved in metabolism (Table S2). Then, we performed Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analysis of the second KEGG pathway category, which revealed the following pathways: transport and catabolism (36); signal transduction (18); membrane transport (2); folding, sorting and degradation (38); replication and repair (16); transcription (11); translation (235); amino acid metabolism (126); biosynthesis of other secondary metabolites (13); carbohydrate metabolism (230); energy metabolism (122); glycan biosynthesis and metabolism (9); lipid metabolism (48); metabolism of cofactors and vitamins (41); metabolism of other amino acids (27); metabolism of terpenoids and polyketides (13); nucleotide metabolism (16); and environmental adaptation (6) (Fig. 2). The KEGG analysis showed that the phenylpropanoid, tryptophan biosynthesis and terpenoid pathways were significantly different between the two varieties (Table S3). We were more interested in the pathways related to secondary metabolism: biosynthesis of other secondary metabolites (2 upregulated and 9 downregulated DEGs) and metabolism of terpenoids and polyketides (2 upregulated and 10 downregulated DEGs). In addition, there were 3 KEGG pathways that were related to light: photosynthesis (map00195, 35 DEGs), photosynthesis-antenna proteins (map00196, 18 DEGs), and oxidative phosphorylation (map00190, 29 DEGs) (Table S2).
Metabolome profiling of different varieties
Metabolites were extracted from leaf samples (three replicates) for each experimental group and were analysed by GC–MS. Partial least squares discrimination analysis (PLS-DA) resulted in the division of the total variation into two major components (Component 1 and Component 2), which contributed 41.3 and 23.8% of the variation, respectively (Fig. 3A). In total, we detected 196 metabolites from C. longepaniculatum (Table S4): 147 were matched with KEGG database information, and 92 were characterized in the library. The analysis of the function of the metabolites revealed nine main aspects: amino acid metabolism (34), biosynthesis of other secondary metabolites (24), carbohydrate metabolism (24), energy metabolism (8), lipid metabolism (9), metabolism of cofactors and vitamins (17), metabolism of other amino acids (11), metabolism of terpenoids and polyketides (6), and nucleotide metabolism (8) (Fig. S2).
To further determine the differences in metabolites between CLH and CLL, under the thresholds of a variable importance in projection (VIP)_pred_orthogonal projections to latent structures-discriminant analysis (OPLS-DA) > 1.0, a FC ≥ 1 or ≤ 1, and a P value < 0.05, 49 (26 decreased and 23 increased) proteins were found to be differentially accumulated. The screening results were visualized by volcano plots (Fig. 3B). The forty-nine differentially accumulated metabolites (DAMs) were mapped to KEGG pathways (Fig. 3C). The DAMs were mainly enriched in flavonoid biosynthesis (4); phenylpropanoid biosynthesis (3); phenylalanine, tyrosine and tryptophan biosynthesis (3); phenylalanine metabolism (3); and stilbenoid, diarylheptanoid and gingerol biosynthesis (2). Some DAMs related to oxidative phosphorylation and carbon fixation in the photosynthetic organism pathway were also found (Fig. 3C). We identified 12 flavonoids, 8 terpenoids and 8 alkaloids in the leaves of C. longepaniculatum (Table 4). Through searching against the Human Metabolome Database (HMDB) (https://hmdb.ca/), we found that 22 DAMs had matches (Fig. 3D).
Changes in bioactive substances in the leaves of C. longepaniculatum
Through metabolomic analysis (Table 4), we identified 5 types of flavonoids in C. longepaniculatum leaves: flavones, flavanones, flavan-3-ols, dihydroflavonols, flavonols, and PMFs. Among them, PMFs were identified for the first time in C. longepaniculatum leaves, of which there were 4 kinds of compounds present: 3,4′,5,6,7-pentamethoxyflavone, 5-hydroxy-2′,4′,7,8-tetramethoxyflavone, 3-hydroxy-3′,4′-dimethoxyflavone, and 3,7-dihydroxyflavone. We identified 7 types of terpenoids: terpineol, limonene, beta-myrcene, phytol, farnesal, squalene, beta-ionone, and beta-sitosterol. Alkaloids including pipecolic acid, 2,3-butanediol, D-allose, nicotinic acid, hypoxanthine, 6-hydroxynicotinic acid, melatonin, and thebaine were also identified for the first time in C. longepaniculatum leaves.
Correlation analysis between the transcriptomic and Metabolomic data
Correlation analysis between metabolites and transcripts of different varieties was performed to gain insight into the regulatory network of secondary metabolism of C. longepaniculatum. We conducted a joint KEGG pathway enrichment analysis, and identical pathways were enriched in the transcriptome and metabolome. A histogram was construct to visualize the enrichment degree of pathways with both DEGs and DAMs (Fig. 4A). The results showed that three KEGG pathways were related to phenylalanine metabolism: phenylalanine metabolism (map00360); phenylalanine, tyrosine and tryptophan biosynthesis (map00400); and phenylpropanoid biosynthesis (map00940). According to the results, 7 DEGs involved in the amino acid pathway (3) and the secondary metabolism (4) pathway were related to phenylalanine metabolism. After analysis of the secondary metabolites, 9 metabolites were found to be associated with phenylpropanoid biosynthesis.
We used two-way orthogonal partial least squares (O2PLS) to evaluate the intrinsic correlation between the transcriptomic and metabolomic data, calculated the score of each sample and obtained a joint score. Then, the load value of each gene and metabolite was calculated to obtain a load map (Fig. 4B). A joint score plot was constructed to visualize the relationship between the two data matrices, and metabolites/genes with high loading values were considered necessary for the similarity of the two data sets. Finally, the absolute values of the load value of the top 15 DAMs/DEGs were selected to construct a histogram (Fig. 4C). The correlation analysis showed that transcripts and metabolites were related to phenylalanine metabolic pathways.
Confirmation of DEGs via qRT-PCR
To validate the reliability and stability of the RNA sequencing (RNA-seq) data of the DEGs, 8 DEGs were selected for qRT-PCR validation. Among them, four were upregulated in CLH (Fig. 5). The results suggested that the transcriptomic data were reliable.
Between the two varieties, 49 (26 downregulated and 23 upregulated) DAMs, 12 flavonoids, 8 terpenoids and 8 alkaloids were identified. Combined transcriptome and metabolome analyses revealed a strong correlation between metabolite content and gene expression (Fig. 6). We identified 12 flavonoids in C. longepaniculatum leaves (Table 4). Flavonoids are low-molecular-weight secondary metabolites produced by plants ; are present in all plant parts; and show a variety of functions, including antioxidant activity [29, 30], UV protection , pathogen protection and signalling [32, 33]. Constituting a large group of plant secondary metabolites, flavonoids are derived from the phenylpropanoid pathway . Flavonoids, which include, flavones, flavonols, flavan-3-ols, flavanones, isoflavanones, isoflavans, and pterocarpans, have diverse structures. Previous studies suggest that apigenin , chrysin , neohesperidin , epicatechin , catechin , and kaempferol [40, 41] are bioactive substances that have antioxidant, antibacterial, antifungal, and antitumor activities and anti-inflammatory properties. Therefore, we believe that the flavonoids in C. longepaniculatum leaves are related to the plant stress response. It may be that endophytic fungi  cause the accumulation of flavonoids in the leaves of C. longepaniculatum, but we believe that the accumulation of flavonoids is caused by light. There is very important evidence in that 82 DEGs were mapped to 3 KEGG pathways that were related to light. We also found that the KEGG oxidative phosphorylation pathway was enriched in DAMs. Therefore, we speculate that different varieties of C. longepaniculatum have different responses to light, which results in differences in flavonoid contents. Nonetheless, this hypothesis needs further experimental confirmation.
PMFs constitute a unique class of flavonoids that are considered the biologically active constituents in citrus fruits . We identified 4 kinds of PMFs from the leaves of C. longepaniculatum; this is the first report on the isolation of PMFs from C. longepaniculatum. PMFs can be used for anti-inflammatory, antiatherogenic, antiangiogenic and anticancer activities [43, 44], and different PMF structures lead to differences in biological activity . By comparison, the PMFs identified in the present study are different from those in citrus, and the biological activities of PMFs from C. longepaniculatum need to be studied. Some studies have pointed out that citrus PMFs can affect the modulation of gut microbiota , and others have pointed out that microbial community composition and diversity can affect secondary metabolite content in leaves of Cinnamomum camphora . We speculate that there is a complex relationship between PMFs and endophytic fungal communities of C. longepaniculatum.
A total of 7 classes and 8 kinds of terpenoids were identified in our research (Table 4). These components are largely consistent with the findings of previous studies [13, 46, 47]. Owing to their pleasant fragrance, unique biological activities and favourable physicochemical properties, terpineol [48, 49], eucalyptol , limonene , and squalene  have strong antioxidant properties and are widely applied in foods, pharmaceuticals, cosmetics, biomaterials, and biofuels [53, 54]. Among the terpenoids we identified, the contents in CLL were higher than those in CLH, except for squalene and beta-ionone; squalene synthesis occurs downstream of terpenoid synthesis. This difference between varieties may be because more substances may be converted to other terpenes, such as terpineol, in CLL and because less squalene is formed downstream. We analysed 196 terpenoids from fresh leaves of C. longepaniculatum via GC–MS. However, other researchers  used water extracts to identify 23 compounds that were detected from C. camphora leaves. Therefore, different extraction methods could affect the identification of substances.
In recent years, much research has involved the use of transcriptomic data to investigate the key regulatory genes related to terpenoid biosynthesis in C. camphora [21, 35]. Many terpenoid synthases have been isolated and characterized from various plant species . In our research, our transcriptome analysis revealed that 23 DEGs were related to secondary metabolites, of which 12 DEGs were related to the metabolism of terpenoids and polyketides. The genes encoding S-(+)-linalool synthase (TRINITY_DN10559_c0_g1) and geranylgeranyl diphosphate synthase (TRINITY_DN70426_c0_g1) were differentially expressed in the different varieties (Fig. 6). The DEGs were all related to these two pathways, indicating that the essential oils in C. longepaniculatum metabolize are produced through both the MVA and the MEP pathways, which is consistent with the results of Chandran’s study . Interestingly, a large number of DEGs were related to photosynthesis and oxidative phosphorylation. Like for flavonoids, we speculate that differences in light responses between the different varieties led to differences in terpenoid metabolism. C. longepaniculatum leaves synthesize a large number of secondary substances, such as flavonoids and terpenoids, to cope with light stress.
In summary, the different varieties of C. longepaniculatum differ in essential oil content and phenotype. By performing transcriptome and metabolome analyses, we investigated the accumulation of secondary metabolites in different varieties of C. longepaniculatum. Of the 34,503 total genes discovered, we identified 1486 DEGs. KEGG analysis showed that phenylpropanoid, tryptophan biosynthesis and terpenoid pathways were significantly altered between the two varieties. In total, we detected 196 metabolites from C. longepaniculatum. Between the two varieties, 49 (26 downregulated and 23 upregulated) DAMs, and we identified 12 flavonoids, 8 terpenoids and 8 alkaloids. The finding of a high correlation between metabolite content and gene expression combined transcriptome and metabolome analyses. We speculate that light may cause differences in secondary metabolites and phenotypes in the leaves of C. longepaniculatum.
The two varieties of C. longepaniculatum (Gamble) N. Chao ex H. W. Li were identified by Professor Ruizhang Feng (Yibin University, Yibin, PR China). Voucher specimens (CLH, 20171026YBU016; CLL, 20171026YBU014) were deposited in the herbarium of the Faculty of Agriculture, Forestry and Food Engineering, Yibin University, Yibin, PR China. The leaves of different C. longepaniculatum varieties (CLH, CLL) were used as experimental materials. The two varieties were propagated from one mother tree via cuttings. C. longepaniculatum was planted on Hongyan Mountain (27°50′ N, 105° 20′ E) in Yibin, Sichuan Province, PR China. No specific permits were required to collect the plant materials because Hongyan Mountain is a germplasm resource nursery of Yibin University. The leaves used for the experiments were collected in November 2019, November 2020 and November 2021. For each variety, 10 healthy plants that have been growing for 20 years were randomly selected. One hundred functional leaves were selected from the sun-facing and shaded sides of the trees. All the leaves of the same variety were mixed and then divided into three groups as 3 biological replicates. Some of the fresh materials were cut into small pieces, immediately frozen in liquid nitrogen and stored at − 80 °C until further transcriptome and metabolome analysis. The others materials were air dried at room temperature and then subjected to distillation to obtain essential oils.
Essential oil isolation
The leaves used for essential oil extraction were collected in November 2019, November 2020 and November 2021. After air drying, 100 g of leaves was put into a steam distillation instrument for 1.5 h to isolate essential oils. The isolation of essential oils from each sample was repeated three times. The essential oil yield was estimated on a dry weight basis (w/w).
Cytological observations of leaves
Leaves of different varieties were harvested in November 2020, kept in formaldehyde-acetic acid-ethanol (FAA) (70% ethanol:acetic acid:38% methanol = 90:5:5), and then embedded in paraffin after dehydration through a series of ethanol concentrations according to the manufacturer’s protocol. Cross-sections of 2 to 5 mm thickness were cut with a microtome (Leica RM2235). The cross-sections were double dyed with safranine and fast green and covered with a slide cover. The sections of the leaves were observed under a microscope (Motic BA200). The leaf cross-sectional area and leaf thickness were subsequently determined with ImageJ (image processing and analysis in Java).
RNA extraction, library construction and sequencing of fresh leaves
The leaves of different varieties collected in November 2020 were used for RNA-seq. Total RNA was obtained using an RNA purification reagent for plant tissue (Invitrogen, Carlsbad, CA, USA). Three biological replicates were sampled per variety. Six RNA-seq transcriptome libraries were prepared using an Illumina TruSeq™ RNA Sample Preparation Kit (San Diego, CA). Poly(A) mRNA was purified from the total RNA using oligo-dT-attached magnetic beads and then fragmented with fragmentation buffer. Taking these short fragments as templates, double-stranded cDNA was synthesized using a SuperScript Double-Stranded cDNA Synthesis Kit (Invitrogen, CA) with random hexamer primers (Illumina). Then, the synthesized cDNA was subjected to end repair, phosphorylation and polyadenylation according to Illumina’s library construction protocol. Libraries were size selected for cDNA target fragments of 200–300 bp on 2% low-range ultra-agarose followed by PCR amplification using Phusion DNA polymerase (New England Biolabs, Boston, MA), with 15 PCR cycles. After quantification by a TBS-380 instrument, 6 RNA-seq libraries were sequenced in a single lane on an Illumina HiSeq Xten/NovaSeq 6000 sequencer (Illumina, San Diego, CA) for 2 × 150 bp paired-end reads.
De novo assembly and annotation
The raw paired-end reads were trimmed and subjected to quality control by SeqPrep (https://github.com/jstjohn/SeqPrep) and Sickle (https://github.com/najoshi/sickle), with the default parameters. Then, clean data from the samples (C. longepaniculatum) were utilized to perform de novo assembly with Trinity (http://trinityrnaseq.sourceforge.net/) . All the assembled transcripts were searched via BLASTX against the sequence information within the NCBI protein nonredundant (NR), COG, and KEGG databases to identify the proteins whose sequences were most similar to those of the given transcripts to retrieve their functional annotations, and a typical cut-off E-value of less than 1.0 × 10− 5 was set. The BLAST2GO (http://www.blast2go.com/b2ghome)  program was used to obtain GO annotations of unique assembled transcripts for describing biological processes, molecular functions and cellular components. Metabolic pathway analysis was performed using the KEGG database (http://www.genome.jp/kegg/) .
Differential expression analysis and functional enrichment
To identify DEGs between the two varieties, the expression level of each transcript was calculated according to the transcripts per million reads (TPM) method. Afterwards, RSEM (http://deweylab.biostat.wisc.edu/rsem/)  was used to quantify gene abundance. Specifically, differential expression analysis was performed using DESeq2 /EdgeR  with a Q value ≤0.05. Genes with |log2(FC)| > 1 and Q value <= 0.05 (DESeq2 or EdgeR)/Q value <= 0.001 (DEGseq) were considered to be significantly differentially expressed. In addition, functional enrichment analysis via the GO and KEGG databases was performed to identify which DEGs were significantly enriched in GO terms and metabolic pathways with a Bonferroni-corrected P value of ≤0.05 compared with the whole-transcriptome background. GO functional enrichment and KEGG pathway analysis were carried out by GOATOOLS (https://github.com/tanghaibao/Goatools) and KOBAS (http://kobas.cbi.pku.edu.cn/home.do), respectively [62, 63].
Extraction of metabolites from fresh leaves and GC–MS analysis
The leaves of different varieties collected in November 2020 were used for metabolite extractions. Three biological replicates were sampled per variety. Fifty milligrams of leaf tissue was extracted in 500 μL of 75% methanol, 2% L-2 chlorophenylalanine and 200 μL of chloroform were added, and the mixture was homogenized at 50 Hz for 3 min at − 10 °C. The mixture was then subjected to ultrasonic treatment at 40 kHz for 10 min at 5 °C after vortexing for 30 s, the process of which was repeated 3 times. After being allowed to settle at − 20 °C for 30 min, the sample was centrifuged at 12000 rcf at 4 °C for 20 min. The supernatant was then vacuum dried. A total of 80 μL of methoxyamine hydrochloride (15 mg/mL in pyridine) was added to the sample, which was then shaken for 2 min and incubated at 37 °C for 90 min. Finally, 80 μL of bis-(trimethylsilyl) trifluoroacetamide (BSTFA) with 1% trimethylchlorosilane (TMCS) and 20 μL of n-hexane were added, after which the sample was stored at 70 °C for 60 min after shaking for 2 min. The samples were subsequently incubated at room temperature for 30 min and then analysed by GC–MS. An extract of each sample was prepared as a quality control (QC) sample. The QC samples were compiled and tested in the same manner as the analytic samples. To assess the repeatability of the analysis, the QC samples were injected at regular intervals (every 3 samples).
GC–MS analysis was conducted on an Agilent 8890B gas chromatograph coupled to an Agilent 5977B mass selective detector (inert electron impact ionization (EI) source and ionization voltage of 70 eV) (Agilent, USA). The gas chromatograph was equipped with an HP-5MS (30 m × 0.25 mm × 0.25 μm) capillary column, and the gas chromatograph column temperature was programmed to hold at 60 °C, increase to 310 °C at a rate of 8 °C/m and hold at the final temperature for 6 min. The gas chromatograph conditions included an inlet temperature of 310 °C, helium carrier gas at a constant flow rate of 1 mL/min, a resting oven temperature of 60 °C and a GC–MS transfer line temperature of 300 °C. After a sample injection of 1 μL, the sample was introduced in splitless mode with an inlet temperature of 260 °C. The ion source temperature was 230 °C, and the quadrupole temperature was 270 °C. Data acquisition was conducted in full-scan mode with a range of m/z 50–500.
Metabolome: data processing and statistics
The GC–MS data were analysed with MassHunter Workstation Quantitative Analysis (version 10.0.707.0). An internal standard was used for data QC (reproducibility). Samples with metabolite features and a relative standard deviation (RSD) of QC > 30% were discarded. A multivariate statistical analysis was performed using ropls (version 1.6.2, http://bioconductor.org/packages/release/bioc/html/ropls.html). All of the metabolite variables were scaled to unit variances prior to conducting principal component analysis (PCA). VIPs were calculated via a OPLS-DA model, and P values were estimated with paired Student’s t test via single-dimensional statistical analysis. The metabolites that differentially accumulated between the two groups were mapped to their biochemical pathways through metabolic enrichment and pathway analysis based on a database search (KEGG; http://www.genome.jp/kegg/). Scipy.stats (a Python package) (https://docs.scipy.org/doc/scipy/) was exploited to identify a statistically significantly enriched pathway using Fisher’s exact test.
Combined transcriptome and metabolome analyses
Pearson correlation coefficients were calculated for metabolome and transcriptomic data integration. Based on the Pearson correlation coefficient, correlations between genes and metabolites in samples can be measured, and the range of the correlation coefficient is (− 1, + 1). A correlation coefficient less than 0 indicates a negative correlation; when it is greater than 0, it indicates a positive correlation; and when it is equal to 0, it indicates no correlation. The threshold of the correlation coefficient in this study was ±0.8, and the correlation was significant (P < 0.05). The DEGs and DAMs related to essential oil biosynthesis were used to construct coexpression networks, and the candidate target genes were visualized by Cytoscape (version 3.7.2, USA).
Validation of RNA-Seq data by qRT-PCR
Total RNA was isolated from approximately 100 mg of fresh leaves of the different varieties of C. longepaniculatum, which were collected in November 2020, using an RNAprep Pure Plant Plus Kit (TIANGEN, Beijing, China). One microgram of total RNA was subjected to reverse transcription using a SYBR Green PCR Master Mix (TaKaRa) Kit with gDNA Eraser (Perfect for Real-Time). Real-time PCR was carried out by using SYBR Premix Ex Taq II (TaKaRa) on an ABI StepOne™ Plus Real-Time PCR System (Roche, Switzerland). All the primers used for qRT–PCR are listed in Table S5. Relative gene expression was calculated using the 2-ΔΔCt method, and owing to its stable expression via PCR amplification, the ACT (KM086738.1) gene was used as a reference control gene .
The control and treatment groups were analysed for statistically significant differences using one-way ANOVA followed by Duncan’s multiple comparisons test. All the calculations were performed using SPSS software (version 21; IBM, Armonk, NY, USA), and all the results are presented as the mean ± SD of 3 independent biological replications. The treatment means were separated by Duncan’s multiple range test at a P value less than 0.01. We then used the min–max normalization method of TBtools  to analyse the transcriptomic and metabolomic data representing the expression values used in our heatmap.
Availability of data and materials
The datasets generated and analyzed during the current study are available in the Biological Research Project Data (BioProject), National Center for Biotechnology Information (NCBI) repository, accession: PRJNA804339.
Differential expression genes
Differentially accumulated metabolites
Kyoto encyclopedia of genes and genomes
Quantitative real-time PCR
Wen K, Fang X, Yang J, Yao Y, Nandakumar K, Salem M, et al. Recent research on flavonoids and their biomedical applications. Curr Med Chem. 2021;28(5):1042–66.
Peluso I, Miglio C, Morabito G, Ioannone F, Serafini M. Flavonoids and immune function in human: a systematic review. Crit Rev Food Sci Nutr. 2015;55(3):383–95.
Yi Y. Regulatory roles of flavonoids on Inflammasome activation during inflammatory responses. Mol Nutr Food Res. 2018;62(13):1800147.
Zeng X, Xi Y, Jiang W. Protective roles of flavonoids and flavonoid-rich plant extracts against urolithiasis: a review. Crit Rev Food Sci Nutr. 2019;59(13):2125–35.
Buer C, Imin N, Djordjevic M. Flavonoids: new roles for old molecules. J Integr Plant Biol. 2010;52(1):98–111.
Yu L, Huang D, Gu J, Pan D, Tan Y, Huang R, et al. Identification of Isoflavonoid biosynthesis-related R2R3-MYB transcription factors in Callerya speciosa (champ. Ex Benth.) Schot using transcriptome-based gene Coexpression analysis. Int J Genomics. 2021;2021:9939403.
Brown D, Rashotte A, Murphy A, Normanly J, Tague B, Peer W, et al. Flavonoids act as negative regulators of auxin transport in vivo in Arabidopsis. Plant Physiol. 2001;126(2):524–35.
Wang Y, Chen Y, Zhang H, Chen J, Cao J, Chen Q, et al. Polymethoxyflavones from citrus inhibited gastric cancer cell proliferation through inducing apoptosis by upregulating RARβ, both in vitro and in vivo. Food Chem Toxicol. 2020;146:111811.
Lai C, Wu C, Ho C, Pan M. Disease chemopreventive effects and molecular mechanisms of hydroxylated polymethoxyflavones. Biofactors. 2015;41(5):301–13.
Zeng S, Li S, Xiao P, Cai Y, Chu C, Chen B, et al. Citrus polymethoxyflavones attenuate metabolic syndrome by regulating gut microbiome and amino acid metabolism. Sci Adv. 2020;6(1):eaax6208.
Uckoo R, Jayaprakasha G, Vikram A, Patil B. Polymethoxyflavones isolated from the Peel of Miaray mandarin (Citrus miaray) have biofilm inhibitory activity in Vibrio harveyi. J Agric Food Chem. 2015;63(32):7180–9.
Guo S, Geng Z, Zhang W, Liang J, Wang C, Deng Z, et al. The chemical composition of essential oils from Cinnamomum camphora and their insecticidal activity against the stored product pests. Int J Mol Sci. 2016;17(11):1836.
Chen J, Tang C, Zhou Y, Zhang R, Ye S, Zhao Z, et al. Anti-inflammatory property of the essential oil from Cinnamomum camphora (Linn.) Presl leaves and the evaluation of its underlying mechanism by using metabolomics analysis. Molecules. 2020;25(20):4796.
Chen W, Vermaak I, Viljoen A. Camphor—a fumigant during the black death and a coveted fragrant wood in ancient Egypt and Babylon—a review. Molecules. 2013;18(5):5434–54.
Breitmaier E. Terpenes: Importance, general structure, and biosynthesis. In Terpenes—Flavors, Fragrances, Pharmaca, Pheromones. Weinheim: WILEY-VCH; 2006.
Hemmerlin A, Harwood J, Bach T. A raison d’etre for two distinct pathways in the early steps of plant isoprenoid biosynthesis? Prog Lipid Res. 2012;51(2):95–148.
Hemmerlin A. Post-translational events and modifications regulating plant enzymes involved in isoprenoid precursor biosynthesis. Plant Sci. 2013;203-204:41–54.
Ma Y, Yuan L, Wu B, Li X, Chen S, Lu S. Genomewide identification and characterization of novel genes involved in terpenoid biosynthesis in salvia miltiorrhiza. J Exp Bot. 2012;63(7):2809–23.
Chen S, Zheng T, Ye C, Huannixi W, Yakefu Z, Meng Y, et al. Algicidal properties of extracts from Cinnamomum camphora fresh leaves and their main compounds. Ecotoxicol Environ Saf. 2018;163:594–603.
Huang L, Yang L, Zou Y, Luo S, Wang X, Liang Y, et al. Antibacterial activity and mechanism of three isomeric terpineols of Cinnamomum longepaniculatum leaf oil. Folia Microbiol (Praha). 2021;66(1):59–67.
Hou J, Zhang J, Zhang B, Jin X, Zhang H, Jin Z. Transcriptional analysis of metabolic pathways and regulatory mechanisms of essential oil biosynthesis in the leaves of Cinnamomum camphora (L.) Presl. Front Genet. 2020;11:598714.
Chen C, Zheng Y, Zhong Y, Wu Y, Li Z, Xu L, et al. Transcriptome analysis and identification of genes related to terpenoid biosynthesis in Cinnamomum camphora. BMC Genomics. 2018;19(1):550.
Gang Z, Liu B, Rohwer J, Ferguson D, Yang Y. Leaf epidermal micromorphology defining the clades in Cinnamomum (Lauraceae). PhytoKeys. 2021;182:125–48.
Liu Z, Mo K, Fei S, Zu Y, Yang L. Efficient approach for the extraction of proanthocyanidins from Cinnamomum longepaniculatum leaves using ultrasonic irradiation and an evaluation of their inhibition activity on digestive enzymes and antioxidant activity in vitro. J Sep Sci. 2017;40(15):3100–13.
Li L, Li Z, Yin Z, Wei Q, Jia R, Zhou L, et al. Antibacterial activity of leaf essential oil and its constituents from Cinnamomum longepaniculatum. Int J Clin Exp Med. 2014;7:1721–7.
Tao C, Wei Q, Yin Z, Zhou L, Jia R, Xu J, et al. Antifungal activity of essential oil from Cinnamomum longepaniculatum leaves against three dermatophytes in vitro. Afr J Pharm Pharmaco. 2013;7:1148–52.
Zhou W, Wei Q, Feng R, Liu Y, Liang H, Li J, et al. Diversity and spatial distribution of endophytic fungi in Cinnamomum longepaniculatum of Yibin, China. Arch Microbiol. 2021;203(6):3361–72.
Weston L, Mathesius U. Flavonoids: their structure, biosynthesis and role in the rhizosphere, including allelopathy. J Chem Ecol. 2013;39(2):283–97.
Croft K. Dietary polyphenols: antioxidants or not? Arch Biochem Biophys. 2016;595:120–4.
Agati G, Azzarello E, Pollastri S, Tattini M. Flavonoids as antioxidants in plants: location and functional significance. Plant Sci. 2012;196:67–76.
Bais A, Lucas R, Bornman J, Williamson C, Sulzberger B, Austin A, et al. Environmental effects of ozone depletion, UV radiation and interactions with climate change: UNEP environmental effects assessment panel, update 2017. Photochem Photobiol Sci. 2018;17(2):127–79.
Treutter D. Significance of flavonoids in plant resistance and enhancement of their biosynthesis. Plant Biol. 2005;7(6):581–91.
Zhang J, Subramanian S, Stacey G, Yu O. Flavones and flavonols play distinct critical roles during nodulation of Medicago truncatula by Sinorhizobium meliloti. Plant J. 2009;57(1):171–83.
Falcone Ferreyra M, Rius S, Casati P. Flavonoids: biosynthesis, biological functions, and biotechnological applications. Front Plant Sci. 2012;3:222.
Elarabi N, Abdelhadi A, Sief-Eldein A, Ismail I, Abdallah N. Overexpression of chalcone isomerase a gene in Astragalus trigonus for stimulating apigenin. Sci Rep. 2021;11(1):24176.
Mani R, Natesan V. Chrysin: sources, beneficial pharmacological activities, and molecular mechanism of action. Phytochemistry. 2018;145:187–96.
Zhang J, Fu X, Yang L, Wen H, Zhang L, Liu F, et al. Neohesperidin inhibits cardiac remodeling induced by Ang II in vivo and in vitro. Biomed Pharmacother. 2020;129:110364.
Bernatova I. Biological activities of (−)-epicatechin and (−)-epicatechin-containing foods: focus on cardiovascular and neuropsychological health. Biotechnol Adv. 2018;36(3):666–81.
Bansal S, Vyas S, Bhattacharya S, Sharma M. Catechin prodrugs and analogs: a new array of chemical entities with improved pharmacological and pharmacokinetic properties. Nat Prod Rep. 2013;30(11):1438–54.
Berger A, Latimer S, Stutts L, Soubeyrand E, Block A, Basset G. Kaempferol as a precursor for ubiquinone (coenzyme Q) biosynthesis: an atypical node between specialized metabolism and primary metabolism. Curr Opin Plant Biol. 2022;66:102165.
Lee S, Seol H, Eom S, Lee L, Kim C, Park J, et al. Hydroxy Pentacyclic triterpene acid, Kaempferol, inhibits the human 5-Hydroxytryptamine type 3A receptor activity. Int J Mol Sci. 2022;23(1):544.
Tripoli E, Guardia M, Giammanco S, Majo D, Giammanco M. Citrus flavonoids: molecular structure, biological activity and nutritional properties. Food Chem. 2007;104:466–79.
Kurowska E, Manthey J, Casaschi A, Theriault A. Modulation of HepG2 cell net apolipoprotein B secretion by the Citrus Polymethoxyflavone, tangeretin. Lipids. 2004;39(2):143–51.
Hanahan D, Weinberg R. Hallmarks of cancer: the next generation. Cell. 2011;144(5):646–74.
Wang Y, Qian J, Cao J, Wang D, Liu C, Yang R, et al. Antioxidant capacity, anticancer ability and flavonoids composition of 35 citrus (Citrus reticulata Blanco) varieties. Molecules. 2017;22(7):1114.
Zhang G, Huang Q, Bi X, Liu Y, Yuan Z. Analysis of endophytic bacterial community diversity and metabolic correlation in Cinnamomum camphora. Arch Microbiol. 2020;202(1):181–9.
Yu H, Ren X, Yang F, Xie Y, Guo Y, Cheng Y, et al. Antimicrobial and anti-dust mite efficacy of Cinnamomum camphora chvar. Borneol essential oil using pilot-plant neutral cellulase-assisted steam distillation. Lett Appl Microbiol. 2022;74(2):258–67.
Jiang H, Wang J, Song L, Cao X, Yao X, Tang F, et al. GC*GC-TOFMS analysis of essential oils composition from leaves, twigs and seeds of Cinnamomum camphora L. Presl and their insecticidal and repellent activities. Molecules. 2016;21(4):423.
Hu W, Gao H, Jiang X, Yang H. Analysis on constituents and contents in leaf essential oil from three chemical types of Cinnamum camphora. Front Environ Sci Engin. 2012;6(2):288–93.
Chen F, Ro D, Petri J, Gershenzon J, Bohlmann J, Pichersky E, et al. Characterization of a root-specific Arabidopsis terpene synthase responsible for the formation of the volatile monoterpene 1,8-cineole. Plant Physiol. 2004;135(4):1956–66.
Zhang L, Fan G, He J. Advances on limonene biotransformation and related enzymes. Sci Technol Food Ind. 2019;40:317–25.
Serrano S, Mendo S, Caetano T. Haloarchaea have a high genomic diversity for the biosynthesis of carotenoids of biotechnological interest. Res Microbiol. 2021;20:103919.
Ren Y, Liu S, Jin G, Yang X, Zhou Y. Microbial production of limonene and its derivatives: achievements and perspectives. Biotechnol Adv. 2020;44:107628.
Thomsett M, Moore J, Buchard A, Stockman R, Howdle S. New renewably-sourced polyesters from limonene-derived monomers. Green Chem. 2019;21:149–56.
Chandran S, Kealey J, Reeves C. Microbial production of isoprenoids. Process Biochem. 2011;46(9):1703–10.
Grabherr M, Haas B, Yassour M, Levin J, Thompson D, Amit I, et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat Biotechnol. 2011;29:644–52.
Conesa A, Gotz S, Garcia-Gomez J, Terol J, Talon M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21:3674–6.
Kanehisa M, Goto S. KEGG: Kyoto encyclopedia of genes and genomes. Nucleic Acids Res. 2000;28:27–30.
Li B, Dewey C. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12:323.
Love M, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550.
Robinson M, McCarthy D, Smyth G. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Xie C, Mao X, Huang J, Ding Y, Wu J, Dong S. KOBAS 2.0: a web server for annotation and identification of enriched pathways and diseases. Nucleic Acids Res. 2011;39:W316–22.
Chen C, Chen H, Zhang Y, Thomas H, Frank M, He Y, et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):9.
This research was financially supported by the Ph.D. Fund Project of Yibin University (2019QD08) and the Open Fund of Key Lab of Aromatic Plant Resources Exploitation and Utilization in Sichuan Higher Education (2019XLZ007). Sichuan Provincial Department of Science and Technology (No.2019YFSY0001).
Ethics approval and consent to participate
Experimental research and field studies on plants (either cultivated or wild), including the collection of plant material, must comply with relevant institutional, national, and international guidelines and legislation. The collecting of these plant materials complies with the IUCN Policy Statement on Research Involving Species at Risk of Extinction and is allowed by the Convention on the Trade in Endangered Species of Wild Fauna and Flora.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The Venn diagram analysis of genes expressed in different varieties.
The metabolites of different varieties analyzed by the KEGG pathways.
The fold change and P-value of DEGs.
The DEGs functional classification.
The significantly altered KEGG pathways.
The total metabolites from the leaf of C. longepaniculatum.
The primers used for qRT-PCR.
About this article
Cite this article
Zhao, X., Yan, Y., Zhou, Wh. et al. Transcriptome and metabolome reveal the accumulation of secondary metabolites in different varieties of Cinnamomum longepaniculatum. BMC Plant Biol 22, 243 (2022). https://doi.org/10.1186/s12870-022-03637-2
- Cinnamomum longepaniculatum
- Secondary metabolism