- Research article
- Open Access
Transcriptome analysis reveals crucial genes involved in the biosynthesis of nervonic acid in woody Malania oleifera oilseeds
© The Author(s). 2018
- Received: 22 March 2018
- Accepted: 3 October 2018
- Published: 19 October 2018
Malania oleifera Chun et Lee (Olacaceae), an evergreen broad-leaved woody tree native to southwest China, is an important oilseed tree. Its seed oil has a high level of nervonic acid (cis-tetracos-15-enoic acid, over 60%), which is essential for human health. M. oleifera seed oil is a promising source of nervonic acid, but little is known about the physiological and molecular mechanisms underlying its biosynthesis.
In this study, we recorded oil accumulation at four stages of seed development. Using a high-throughput RNA-sequencing technique, we obtained 55,843 unigenes, of which 29,176 unigenes were functionally annotated. By comparison, 22,833 unigenes had a two-fold or greater expression at the fast oil accumulation stage than at the initial stage. Of these, 198 unigenes were identified as being functionally involved in diverse lipid metabolism processes (including de novo fatty acid synthesis, carbon chain elongation and modification, and triacylglycerol assembly). Key genes (encoding KCS, KCR, HCD and ECR), putatively responsible for nervonic acid biosynthesis, were isolated and their expression profiles during seed development were confirmed by quantitative real-time PCR analysis. Also, we isolated regulatory factors (such as WRI1, ABI3 and FUS3) that are putatively involved in the regulation of oil biosynthesis and seed development.
Our results provide novel data on the physiological and molecular mechanisms of nervonic acid biosynthesis and oil accumulation in M. oleifera seeds, and will also serve as a starting point for biotechnological genetic engineering for the production of nervonic acid resources.
- Nervonic acids
- Malania oleifera
- Gene expression
- Oil biosynthesis
Exploration and utilization of non-timber biological resources from woody trees has long been an important area of forestry research. Oilseeds derived from woody trees have great potential to meet the increasing demand for vegetable oils for food or industrial usage. In particular, oilseed trees that produce unusual fatty acids (FAs) provide critical woody non-timber sources of unique FAs. Malania oleifera Chun et Lee (2n = 26) , a monotypic species belonging to the family Olacaceae, is an evergreen broad-leaved woody tree native to southwest China , mainly distributed in limited regions of Guangxi and Yunnan province [3, 4]. For many years, the seeds of M. oleifera have been used for making edible oils and consumed by local people. Seed oil of M. oleifera is distinctive for its high level of 15c-tetracosenoic acid (C24:1Δ15), a kind of Very Long-Chain Monounsaturated Fatty Acid (VLCMFA), namely nervonic acids (over 60% of total fatty acids).
Nervonic acids were first discovered in the sphingolipid of sea animals (such as sharks). They are chiefly found in nervous and brain tissues, comprising the white matter of animal brains and myelinated nerve fibers [5, 6]. Altered nervonic acid levels in human blood or tissues can cause a variety of diseases. They are implicated in a number of neurological disorders and in some mental illnesses, including schizophrenia, psychosis and attention deficit disorder. Nervonic acid oils have become important targets for pharmaceutical and nutraceutical applications in the prevention and treatment of neurological disorders and associated diseases, including multiple sclerosis, adrenoleukodystrophy, Zeellweger syndrome and Alzhemier’s disease [6–13]. Notably, there has been some evidence that nervonic acid inhibits the human immunodeficiency virus-1 (HIV-1) reverse transcriptase in a dose-dependent manner . Thus, nervonic acid is a strong candidate for further evaluation as a bioactive lipid supplement for the promotion of human health. However, the availability of nervonic acid is currently limited because sea animal sources are insufficient to meet the growing market demand for nervonic acid.
There is an urgent need for a sustainable source of nervonic acids derived from plant oils. Recently, several plant seeds including Lunaria annua (honesty), Borago officinalis (borage), Cannabis sativa (hemp), Acer truncatum (purple blow maple), Tropaeolum speciosum (flame flower), Cardamine graeca (bittercress) and Malania oleifera (garlic-like fruit) were found to contain nervonic acid within storage lipids in the form of triacylglycerol (TAG) [15–18]. Although the nervonic acid content in Acer truncatum and Lunaria annua seeds are low (5% and 20% respectively) these two plants have been considered to be potentially important resources for developing nervonic acid products [16, 18]. The high market demand for nervonic acid incentivizes the development of a refined, nervonic acid-enriched plant oil. M. oleifera is a good candidate for the discovery and development of nervonic acid resources because of its nervonic acid-enriched seed oils, but the physiological and molecular mechanisms underlying the biosynthesis of nervonic acid-enriched oils in M. oleifera seeds remain unknown.
Two main pathways are involved in oil accumulation in plant seeds: fatty acid (FA) de novo synthesis (including FAs carbon chain elongation and desaturation) and TAG assembly. FA biosynthesis mainly takes place within plastids and is initiated by the irreversible carboxylation of acetyl-CoA to form malonyl-CoA by acetyl-CoA carboxylase (ACCase). The malonyl group is transferred to ACP (Acyl-carrier protein). Next, fatty acid synthase (FAS) catalyzes the conversion of acetyl-CoA and malnoyl-ACP to 16:0 and 18:0 acyl-ACP. FAS is a protein complex consisting of several individual enzymes, including a set of β-ketoacyl-ACP synthases (KASs) that are enzymes for FA biosynthesis [19, 20]. In addition, 18:0-ACP can be desaturated to 18:1-ACP by stearoyl-ACP desaturase (SAD), which determines the level of unsaturated FAs (UFAs) in the plant cell. After that, these fatty acyl-ACP chains are converted into acyl-CoAs and transferred to endoplasmic reticulum (ER) for further elongation, desaturation and modification, which generates a variety of FAs like very-long-chain fatty acids (VLCFAs) and polyunsaturated FAs (PUFAs). Based on the initial carbon chain backbone of C16:0, biosynthesis of nervonic acids (C24:1Δ15) is considered to comprise the sequential addition of two carbons by four successive enzymatic reactions gathered by a enzymatic complex [21–23]. The first step is catalysis by membrane-bound 3-ketoacyl-CoA synthase (KCS or FA elongase, FAE), which is a key gene for FA elongation in ER [24–28]. The resulting 3-ketoacyl-CoA is then reduced by a 3-ketoacyl-CoA reductase (KCR) generating a 3-hydroxy-acyl-CoA [29, 30]. The third step is dehydration by the reaction of 3-hydroxacyl-CoA dehydratase (HCD, also known as PASTICCINO 2, or PAS2) to a trans-2,3-enoyl-CoA , which is finally reduced by the trans-2,3-enoyl-CoA reductase (ECR) to yield a two-carbon elongated acyl-CoA . Nervonic acid was synthesized by these four enzymatic reactions after three cycles, using mono-unsaturated fatty acids (MUFAs) 18:1 as the substrate. Finally, the TAG assembly consumes acyl-CoAs using substrate glycerol 3-phosphate with four consecutive enzymes that sequentially transfer acyl-CoAs to sn-1, − 2, − 3 positions in glycerol 3-phosphate in the ER, including Glycerol-3-phosphate acyltransferase (GPAT), Lysophosphlipid acyltransferase (LPAT), Phosphatidic acid phosphatase (PAP) and Diacylglycerol acyltransferase (DGAT) [33, 34].
In this study, we de novo assembled and characterized the transcriptome of M. oleifera seeds at two developmental stages. A number of unigenes involved in the processes of FA biosynthesis (in particular, carbon chain elongation) and TAG assembly were identified. To our knowledge, this study is the first report on characterizing the transcriptome data in woody oilseeds which produce rich nervonic acid oils. These transcripts identified in M. oleifera seeds provide valuable resources for discovering novel genes responsible for the biosynthesis of nervonic acid oils in plants.
Plant materials and determination of oil accumulation
Transcriptome sequencing for developing seeds
For two stages (the initial and fast oil accumulation stages) of developing seeds, three independent samples collected from different fruits were pooled equally to generate three biological replicates. Total RNA was isolated using RNAprep pure Plant Kit (TIANGEN, DP432), following the manufacturer’s protocols. For each sample, High-quality RNA was enriched by Oligo (dT) beads. The enriched mRNA was fragmented into short fragments and reverse transcripted into cDNA with random primers. The cDNA fragments were purified with QiaQuick PCR extraction kit, end-repaired and ligated to Illumina sequencing adapters. The ligation products were selected according to their size by agarose gel electrophoresis, and initially amplified by PCR. The PCR production was constructed into a cDNA library and sequenced on Illumina HiSeq4000™ system in BGI-Shenzhen.
After sequencing, the raw reads were preprocessed to filter out clipped adapter sequences, low-quality reads (Q value ≤20 or containing ambiguous nucleotides) and contaminated sequences. The clean reads were subjected to de novo assembly using the Trinity, a short reads assembling program . Based on the overlap of assembled contigs, the fragments were merged or extended into much longer transcripts to form a set of non-redundant unigenes. To further obtain these unigenes’ function, we employed BLASTX (e-value < 0.00001) to search the public databases with the following order: NCBI non-redundant (Nr), Swiss-Prot, KEGG, and COG/KOG. Meanwhile, GO annotation of unigenes was performed by Blast2GO software . GO classification and enrichment analysis of unigenes was performed using WEGO software .
The expression level of unigenes was counted and normalized by RPKM (Reads Per Kb per Million reads) . The formula of RKPM is as follows: RPKM = (1000000*C)/(N*L/1000), where C represents the number of reads uniquely mapping to Unigene; L represents the length (base number) of the Unigene; and N represents the number of total reads uniquely mapped to all Unigenes. Unigenes with significantly different expression were determined by FDR ≤ 0.001 (false discovery rate that was used to rectify the p-value for multiple testing) and fold-change ≥2 in two samples.
Validation of full-length cDNA and expression level
Based on the presence of 5′ and 3′ untranslated sequences, the full-length cDNAs of unigenes potentially involved in nervonic acid biosynthesis were isolated. Subsequently, they were further confirmed by RT-PCR and sequencing. The expression profiles of unigenes were carried out in different tissues. The young leaf, tender stems, and seeds from two developmental stages were subjected to quantitative real-time PCR (qRT-PCR). Total RNAs were isolated (mentioned above) and reverse transcripted using PrimerScrip™ RT reagent Kit with gDNA Erases (Takara, China). qRT-PCR was performed on the CFX96 machine (Bio-Rad, USA) according to the following program: precycling steps of 95 °C for 2 min then 40 cycles of 95 °C for 30 s, 56 °C for 30 s, and 72 °C for 30 s. The UBE (ubiquitin-conjugating enzyme) gene, Unigene0016233 of M. oleifera, was used as an internal reference to normalize the relative expression level of all genes. All primers used in this study were listed in Table S1 (see Additional file 1).
Transcriptome sequencing and de novo assembly
To investigate nervonic acid accumulation during seed development in M. oleifera, we collected seeds at four stages of development (named as S1-S4), as determined by their seed size, and analyzed the fatty acid species for each sample. As seeds developed, the nervonic content increased gradually, from 0.88% (S1) to 63.79% (S4) of the total fatty acids (Additional file 2). Thin layer chromatography analysis showed that while fruits are young (S1) there is an initial stage of oil accumulation. During the expansion growth period (S2) there is a rapid oil accumulation (Fig. 1a). Therefore, we selected these two stages (the initial and fast oil accumulation stages) of seed development for transcriptome sequencing.
Two cDNA libraries were constructed from two stages of developing seeds (Fig. 1a) and yielded a total of 22.8 gigabases (Gb) nucleotides by Illumina high-throughput sequencing. After strict reads filtering, we obtained about 73 and 82 million 150-bp paired-end reads from S1 and S2, respectively. The Trinity package was employed to assemble all high-quality reads to generate a reference transcriptome. As a result, we obtained 55,843 non-redundant unigenes with an average sequence length of 857 bp and an N50 of 1,599 bp (Fig. 1b). The average GC content of M. oleifera unigenes was 42.59%. The size distribution showed that 15,304 unigenes (27.41%) were longer than 1 kb (Fig. 1c).
Functional annotation of non-redundant unigenes of Malania oleifera
Functional classification of Malania oleifera unigenes
Gene Ontology (GO) term system was employed to classify the functions of predicted unigenes. There were 20,423 unigenes that could be annotated to one or more terms under three GO categories including cellular component, biological process, and molecular function (Fig. 2c). In the molecular function division, binding (45.39%) and catalytic activity (44.48%) represented the dominant GO terms. In the cellular component division, GO terms related to cell parts (23.84%) and cell (23.84%) were in joint first place, followed by organelle (19.03%). For the biological processes, the terms related to metabolic processes (24.11%) and cellular processes 11,815 (22.8%) were dominant, followed by single-organism processes 8,352 (16.12%), signaling 957 (1.85%), reproduction 914 (1.76%) and reproductive processes 903 (1.74%). Besides, all unigenes were also subjected to search against the Clusters of Orthologous Groups database (COG). Overall, 17,332 unigenes were clustered into 25 function classes (Fig. 2d). Among these classes, the general function (R, 6,932 hits) represented the largest group (23.34%), followed by posttranslational modification, protein turnover and chaperones (O, 3,395 hits, 11.43%), signal transduction mechanisms (T, 2802 hits, 9.43%). In addition, a small fraction of unigenes were classified into energy production and conversion (C, 1129 hits, 3.80%), lipid transport and metabolism (I, 928 hits, 3.12%) and secondary metabolites biosynthesis, transport and catabolism (Q, 709 hits, 2.39%) (Fig. 2d).
To explore the main pathways in M. oleifera seeds, all unigenes were also used to search against the KEGG classification system. We found that these unigenes were classified into 130 KEGG pathways (Additional file 4). Among five main categories, the largest group was pathways related to metabolism (6,217 hits, 60%), followed by genetic information processing (2,787 hits, 27%), cellular processes (538 hits, 5%), environmental information processing (458 hits, 5%), and organismal system (286 hits, 3%). Among these 130 pathways, the maps with the highest unigene representation (467) were ribosome pathway (ko03010), followed by carbon metabolism (408), biosynthesis of amino acids (353) and protein processing in endoplasmic reticulum (321).
Identification of transcription factors in M. oleifera seed
Differentially expressed genes at the two developmental stages
To further understand the biological functions of these DEGs, they were subjected to enrichment analysis of GO terms. In the biological process category, a large number of up-regulation, as well as down-regulation DEGs were enriched in the cellular process, single-organism process and metabolic process. In the cellular component category, most unigenes were classified into cell, cell part and organelle. For the molecular function category; binding and catalytic activity represented the main GO categories (Additional file 5). KEGG analysis showed that 22 pathways were significantly enriched (Fig. 4c). The most represented pathway was carbon metabolism (87, 9.27%), followed by plant hormone signal transduction (76, 8.09%). There were many pathways closely related to seed oil biosynthesis, such as fatty acid metabolism (43, 4.58%), fatty acid biosynthesis (24, 2.56%), unsaturated fatty acid synthesis (24, 2.56%), and glycerolipid metabolism (20, 2.13%), which provide clues for the identification of novel genes involved in TAG synthesis.
Additionally, we found that 312 unigenes encoding TFs were differentially expressed during seed development of M. oleifera (Additional file 6). The expression level of 153 TFs were significantly up-regulated, while there were 159 TFs exhibiting obvious down-regulation at stage S2, as compared with stage S1. Interestingly, several TFs critical for seed development and oil accumulation in Arabidopsis were identified which were highly expressed at the fast oil accumulation stage (S2), including WRI1 (Unigene0031370), ABI3 (Unigene0002005), FUS3 (Unigene0003221), ABI5 (Unigene0025890), and AGL62 (Unigene0034852).
The unigenes involved into the pathway of triacylglycerol accumulation in M. oleifera seeds
Based on KEGG pathway classification and annotation, we identified 198 unigenes involved in fatty acids (FAs) metabolism processes, including FAs de novo biosynthesis in plastid (51 unigenes), elongation (35 uingenes), modification (40 unigenes) and triacylglycerol (TAG) assembly (72 unigenes) in endoplasmic reticulum (see Additional file 7). The expression levels of these unigenes at stages S1 and S2 were summarized Additional file 7. Among these were 66 unigenes with up-regulated expression and 20 unigenes with down-regulated expression at stage S2 as compared with stage S1 (Additional file 7).
Our main objective was to identify key genes involved in the biosynthesis of long-chain FAs, especially nervonic acids in endoplasmic reticulum. Here, we fully identified 24 unigenes potentially involved in FAs elongation, including 13 unigenes encoding KCS, two encoding KCR, five encoding HCD and five encoding ECR (Fig. 5a). Further, the full-length cDNA transcript sequences were confirmed by RT-PCR and sequencing for 12 unigenes, including six KCS genes (Unigene0005341, Unigene0014503, Unigene0015624, Unigene0025108, Unigene0025737 and Unigene0037518), three HCD genes (Unigene0025829, Unigene0028372, Unigene0041596), one KCR genes (Unigene0021016) and two ECR genes (Unigene0034229, Unigene0028222). The full-length cDNA transcript sequences of the 12 unigenes were shown in the Table S1 (see Additional file 1). Of 13 KCS unigenes, three unigenes (Unigene0009507, Unigene0025108 and Unigene0025737) had a reduced expression level and three unigenes (Unigene0014503, Unigene0015624 and Unigene0037518) up-regulated their expression level during seed development in M. oleifera. The KCS unigene (Unigene0037518) increased its transcript level at least 48-fold (from 18.1 in S1 to 874.2 in S2) at the fast oil accumulation stage. Of two KCR genes, one (Unigene0021016) reduced its expression level, and another (Unigene0034719) significantly increased its expression level (about 3.7-fold) at stage S2. For HCD unigenes, Unigene0025830 was expressed at a low level, though its expression increased at stage S2. Two unigenes (Unigene0025829 and Unigene0041596) exhibited high expression level during seed development and was significantly elevated (about 6.9-fold and 2.7-fold, respectively) at stage S2. Only one unigene (Unigene0028222), encoding a ECR enzyme, exhibited a high expression level and about 6-fold transcript increase at the fast oil accumulation stage (Fig. 5b and Additional file 7). Nervonic acids (24:1-CoA) were synthesized via a sequence of four reactions; catalyzing by KCS, KCR, HCD and ECR after three cycles, using the 18:1-CoA as primary substrate (see Fig. 5a). We also found that six unigenes can encode an omega-6 fatty acid desaturase (FAD2) which catalyzes 18C:1 to form 18C:2, and two unigenes encode an omega-3 fatty acid desaturase (FAD3) which further catalyzes 18C:2 to generate 18C:3 (Additional file 7). Four unigenes encoding chloroplast oleate desaturase (FAD6) were identified in M. oleifera seeds (Additional file 7).
These FAs (including de novo synthesized, modified or elongated FAs) were subsequently assembled into glycerol-3-phosphate (G-3-P) to form TAG in the ERs which was finally stored in the oil of plant seeds. In this pathway, we found that there were four unigenes encoding GPAT, eight encoding LPAT, three encoding PAP and four unigenes encoding DGAT (including two DGAT1 and two DGAT2); these enzymes perform a critical function in the formation of TAG in the ERs (Fig. 5a). Among 19 unigenes in this pathway, 11 unigenes exhibited a high transcript level in M. oleifera seed, and four unigenes substantially up-regulated their expression level including two GPAT genes (Unigene0023532 and Unigene0036813), and two DGAT2 (Unigene0036969 and Unigene0036970) at stage S2 as compared with stage S1 (Fig. 5b and Additional file 7).
Validation of gene expression using quantitative real-time PCR
The pathway of fatty acid biosynthesis (including FAs carbon chain elongation and desaturation) is thought to have been conserved in plants. However, the physiological and molecular mechanisms underlying the biosynthesis of unusual FAs, such as nervonic acids in M. oleifera and conjugated fatty acids in Vernicia fordii, largely remains uncertain. Based on a search for orthologous genes responsible for nervonic acid biosynthesis, we found putative genes that had previously been identified from Cardamine graeca  and Lunaria annua  and had their functions analyzed in yeast or Brassica oilseeds. Most potential key genes involved in nervonic acid biosynthesis in the FA biosynthesis pathway have yet to be investigated. This study is an important investigation of candidate genes involved in nervonic acid biosynthesis at the transcriptomic level in plant seeds, providing valuable data to improve our understanding of the potential physiological and molecular mechanisms for biosynthesis and accumulation of rich nervonic acid oils in developing M. oleifera seeds.
Here, based on high-throughput transcriptome sequencing data we de novo assembled the transcripts in developing seeds of M. oleifera, resulting in 55,843 unigenes, that were comparable to transcriptome data from other woody oil-seed plants such as Vernicia fordii , Jatropha curcas , and Camellia oleifera . Approximately 52.25% of unigenes were annotated and classified into various GO terms or KEGG pathways. The protein homology searches revealed that the transcripts from M. oleifera seeds had the highest similarity to transcripts from V. vinifera, suggesting that M. oleifera is phylogenetically closest to V. vinifera. That some transcript sequences (47.75%) had no hits in the public databases might be due to having a shorter sequence length, incomplete protein domain or species-specific sequences in M. oleifera. Alternatively, these non-annotated transcripts could be non-coding RNAs such as the precursor of small RNAs or long non-coding RNAs.
One of main objectives of this study was to identify potential unigenes involved in nervonic acid biosynthesis or oil accumulation in developing M. oleifera seeds. Nervonic acid was synthesized in ER by using oleic acid (18C:1Δ9) as the substrate, then catalyzed by FAE complex composed of KCS, KCR, HCD and ECR [14, 17, 20]. The oleic acid content (18C:1) was high (over 32%) and relatively stable during seed development (from stages S1 to S3) until seed maturity (stage S4) as shown in Additional file 2. This quantity of oleic acid appears to be sufficient to act as a substrate for nervonic acid biosynthesis. Throughout seed development there were high levels of expression of all five SAD genes that are responsible for producing 18C:1 from the substrate 18C:0 (see Fig. 5b). This may account for the relatively high content of 18C:1 that was maintained at the transcription level in developing seeds of M. oleifera. At the fast oil accumulation stage (S2), the proportion of nervonic acid increased rapidly (from 0.88 to 29.39%), suggesting that FAs elongation began during oil accumulation at stage S2. There has been increasing evidence that KCS is the rate-limiting enzyme in fatty acid elongation and that its expression level is an importance determinant of the final VLCFAs content [27, 28, 44, 45]. The heterologous expression of KCS cloned from Cardamine graeca or Lunaria annua can produce or increase nervonic acid content in transgenic cruciferous plants [15, 18]. In current study, we identified 13 putative KCS genes in M. oleifera seeds, but have yet to determine the functions of these genes. In particular, the KCS unigene (Unigene0037518) exhibited strongly seed-specific expression with an at least 48-fold increase at the transcript level at the fast oil accumulation stage, which strongly indicates that this gene could drive nervonic acid biosynthesis in M. oleifera seeds. Further functional characterization of Unigene0037518 and its substrate specificity assay are required to determine whether it is species-specific; encoding the rate-limiting enzyme for catalyzing nervonic acids in M. oleifera seeds.
Also, several unigenes encoding KCR, HCD and ECR enzymes, which are part of fatty acid carbon-chain elongation in the pathway of FA biosynthesis, were identified. They probably contribute to the accumulation of nervonic acid oils in M. oleifera seeds. Thus, these targeted genes could be important sources for genetic and metabolic engineering for obtaining nervonic acid production by heterologous transgenic technique.
The rich nervonic acids incorporated in TAG molecules are usually dependent not only on the efficient synthesis of nervonic acids, but on the efficient assembly system for the selective or specific incorporation of nervonic acids into TAG. Generally, the DGAT genes are thought to play a critical role in catalyzing the final step of triacylglycerol (TAG) biosynthesis in developing oleaginous seeds . Different types of DGATs, such as DGAT1 and DGAT2, usually exhibit structural and functional divergence , thus DGAT1 has been thought to be responsible for regulating or controlling oil content, whereas DGAT2 was thought to be responsible for selectively or specifically incorporating specific fatty acids into TAG in plants [35, 46, 48]. Here, two homologous DGAT2 genes identified in M. oleifera exhibited a high level of expression in developing seeds; strongly implying that these two genes could play a critical role in selectively or specifically incorporating nervonic acids into TAG in M. oleifera seeds. If so, these two DGAT2s could be combined with the targeted nervonic acid biosynthesis genes identified in this study by genetic engineering to enhance the nervonic acids content in TAGs. In sum, we identified several candidate genes involved in the nervonic acids biosynthesis and TAG assembly, but the molecular basis of high-efficient biosynthesis of nervonic acids in M. oleifera seeds remains to be elucidated. Such study is required to determine whether these candidate genes are unique to M. oleifera, as well as, deduce whether nervonic acids biosynthesis in M. oleifera seeds is closely correlated to the strong seed-specific expression of these identified genes or to the sequence variation when compared with homologous genes from other species, which may alter the enzyme activity or substrate specificity.
We identified a large number of TFs, which were highly expressed at the fast oil accumulation stage of developing seeds. Most of these TFs are functionally uncharacterized or unknown. However, we detected some transcriptional regulators such as WRI1, ABI3 and FUS3, which are expressed in a seed-specific manner and documented to be functionally involved in regulation of lipid biosynthesis in seed development of Arabidopsis and other plants . For example, loss-of-function in mutant of AtWRI1 substantially reduced the seed oil content compared to the wild-type in Arabidopsis . Overexpression of WRI1 significantly enhanced the seed oil content in transgenic plants . Increasing evidence has showed that WRI1 is a master regulator in controlling the gene expression of lipid genes in the pathway of fatty acid biosynthesis [52, 53]. Studies have also revealed that both ABI3 and FUS3 are involved in direct or indirect regulation of the fatty acid biosynthesis and TAG accumulation in other plants [54, 55]. Interestingly, the VLCFAs content in the abi3 and fus3 mutant seeds was significantly decreased, which is associated with reduced activity of FAE1 (a key fatty acid carbon-chain elongase that regulates production of VLCFAs) . Probably, the identified WRI1, ABI3 and FUS3 are involved in the regulation of the biosynthesis processes of rich nervonic acids oils in M. oleifera seeds.
The current study comprehensively reported transcriptome data from nervonic acid oil producing M. oleifera seeds, and identified genes that are potentially critical for driving the processes of nervonic acid biosynthesis and TAG assembly. These results contribute to our understanding of the potential physiological and molecular mechanisms of biosynthesis and accumulation of rich nervonic acid oils in developing M. oleifera seeds. The study has also produced targeted gene resources that can be used for genetic and metabolic engineering for future biotechnological approaches to nervonic acid production.
The authors thank Daishun Jia for his assistance in collecting plant materials. Also, we extend our thanks to Dr. Chao Sun and Dr. Zexi Chen for their helps with collecting plant materials.
This study was financially supported to collect samples and conduct experiments by the National Natural Science Foundation of China (31700285 and 31571709) and Yunnan Applied Basic Research Projects (2018FB037).
Availability of data and materials
The clean reads were deposited in Sequence Read Archive (SRA) under SRP158484.
TY, QY and FC collected samples. DL, WX and AL designed experiments. TY and QY performed the experiments. WX analyzed data. WX and AL wrote the manuscript. All authors read and approved the final version of the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
- Yang LH, Ding KY, Lu SG. The karyotype of Malania oleifera. Acta Bot Yunnanica. 2003;25(4):428–30.Google Scholar
- Li SG. Malania, a new genus of oil-yielding plant. Bull Bot Lab North-East Forest Inst. 1980;1:67–72.Google Scholar
- Qiu XH, Lin YR. Flora of China. Beijing, China: Science Press; 1988. vol. 24Google Scholar
- Fu LG. Red data book of Chinese plant-the rare and endangered plants. Beijing, China: Science Press; 1992. vol. 1Google Scholar
- Poulos A. Very long chain fatty acids in higher animals-a review. Lipids. 1995;30(1):1–14.View ArticlePubMed CentralGoogle Scholar
- Merrill AH, Schmelz EM, Wang E, Dillehay DL, Rice LG, Meridith F, et al. Importance of sphingolipids and inhibitors of sphingolipid metabolism as components of animal diets. J Nutr. 1997;127:830S–3S.View ArticlePubMed CentralGoogle Scholar
- Farquharson J, Jamieson EC, Abbasi KA, Parrick WJ, Logan RW, Cockburn F. Effect of diet on the fatty acid composition of the major phospholipids of infant cerebral cortex. Arch Dis Child. 1995;72(3):198–203.View ArticlePubMed CentralGoogle Scholar
- Assies J, Lieverse R, Vreken P, Wanders RJ, Dingemans PM, Linszen DH. Significantly reduced docosahexaenoic and docosapentaenoic acid concentrations in erythrocyte membranes from schizophrenic patients compared with a carefully matched control group. Biol Psychiatry. 2001;49(6):510–22.View ArticlePubMed CentralGoogle Scholar
- Evans DA, Bennett DA, Wilson RS, Bienias JL, Morris MC, Scherr PA, et al. Incidence Alzheimer disease in a biracial urban community: relation to apolipoprotein E allele status. Arch Neurol. 2003;60(2):185–9.View ArticlePubMed CentralGoogle Scholar
- Chen JR, Hsu SF, Hsu CD, Hwang LH, Yang SC. Dietary patterns and blood fatty acid composition in children with attention-deficit hyperactivity disorder in Taiwan. J Nutr Biochem. 2004;15(8):467–72.View ArticlePubMed CentralGoogle Scholar
- Pamplona R, Dalfó E, Ayala V, Bellmunt MJ, Prat J, Ferrer I, et al. Proteins in human brain cortex are modified by oxidation, glycoxidation, and lipoxidation: effects of Alzheimer disease and identification of lipoxidation targets. J Biol Chem. 2005;280(22):21522–30.View ArticlePubMed CentralGoogle Scholar
- Tanaka K, Shimizu T, Ohtsuka Y, Yamashiro Y, Oshida K. Early dietary treatments with Lorenzo’s oil and docosahexaenoic acid for neurological development in a case with Zellweger syndrome. Brain Dev. 2007;29(9):586–9.View ArticlePubMed CentralGoogle Scholar
- Amminger GP, Schäfer MR, Klier CM, Slavik JM, Holzer I, Holub M, et al. Decreased nervonic acid levels in erythrocyte membranes predict psychosis in help-seeking ultra-high-risk individuals. Mol Psychiatry. 2012;17(12):1150–2.View ArticlePubMed CentralGoogle Scholar
- Kasai N, Mizushina Y, Sugawara F, Sakaguchi K. Three-dimensional structural model analysis of the binding site of an inhibitor, nervonic acid, of both DNA polymerase beta and HIV-1 reverse transcriptase. J Biochem. 2002;132(5):819–28.View ArticlePubMed CentralGoogle Scholar
- Guo Y, Mietkiewska E, Francis T, Katavic V, Brost JM, Giblin M, et al. Increase in nervonic acid content in transformed yeast and transgenic plants by introduction of a Lunaria annua L. 3-ketoacyl-CoA synthase (KCS) gene. Plant Mol Biol. 2009;69(5):565–75.View ArticlePubMed CentralGoogle Scholar
- Wang XY, Wang SQ. A new resource of nervonic acid: purpleblow maple oil. China Oils Fats. 2005;9:021.Google Scholar
- Bettger WJ, McCorquodale ML, Blackadar CB. The effect of a Tropaeolum speciosum oil supplement on the nervonic acid content of sphingomyelin in rat tissues. J Nutr Biochem. 2001;12(8):492–6.View ArticlePubMed CentralGoogle Scholar
- Taylor DC, Francis T, Guo Y, Brost JM, Katavic V, Mietkiewska E, et al. Molecular cloning and characterization of a KCS gene from Cardamine graeca and its heterologous expression in Bracssica oilseeds to engineer high nervonic acid oils for potential medical and industrial ues. Plant Biotechnol J. 2009;7(9):925–38.View ArticlePubMed CentralGoogle Scholar
- Ohlrogge J, Browse G. Lipid biosynthesis. Plant Cell. 1995;7:957–70.View ArticlePubMed CentralGoogle Scholar
- Ohlrogge JB, Jaworski JG. Regulation of fatty acid synthesis. Annu Rev Plant Physiol Plant Mol Biol. 1997;48:109–36.View ArticlePubMed CentralGoogle Scholar
- Bach L, Faure JD. Role of very-long-chain fatty acids in plant development, when chain length does matter. C R Biol. 2010;333(4):361–70.View ArticlePubMed CentralGoogle Scholar
- Kunst L, Samuels L. Plant cuticles shine: advances in wax biosynthesis and export. Curr Opin Plant Biol. 2009;12(6):721–7.View ArticlePubMed CentralGoogle Scholar
- Kunst L, Taylor DC, Underhill EW. Fatty acid elongation in developing seeds of Arabidopsis thaliana. Plant Physiol Biochem. 1992;30:425–34.Google Scholar
- Franke R, Höfer R, Briesen I, Emsermann M, Efremova N, Yephremov A, et al. The DAISY gene from Arabidopsis encodes a fatty acid elongase condensing enzyme involved in the biosynthesis of aliphatic suberin in roots and the chalaza-micropyle region of seeds. Plant J. 2009;57(1):80–95.View ArticlePubMed CentralGoogle Scholar
- Todd J, Post-Beittenmiller D, Jaworski JG. KCS1 encodes a fatty acid elongase 3-ketoacyl-CoA synthase affecting wax biosynthesis in Arabidopsis thaliana. Plant J. 1999;17(2):119–30.View ArticlePubMed CentralGoogle Scholar
- Millar AA, Kunst L. Very-long-chain fatty acid biosynthesis is controlled through the expression and specificity of the condensing enzyme. Plant J. 1997;12(1):121–31.View ArticlePubMed CentralGoogle Scholar
- Lassner MW, Lardizabal K, Metz JG. A jojoba beta-Ketoacyl-CoA synthase cDNA complements the canola fatty acid elongation mutation in transgenic plants. Plant Cell. 1996;8(2):281–92.PubMedPubMed CentralGoogle Scholar
- James DW Jr, Lim E, Keller J, Plooy I, Ralston E, Dooner HK. Directed tagging of the Arabidopsis FATTY ACID ELONGATION1 (FAE1) gene with the maize transposon activator. Plant Cell. 1995;7(3):309–19.View ArticlePubMed CentralGoogle Scholar
- Gan L, Wang X, Cheng Z, Liu L, Wang J, Zhang Z, et al. Wax crystal-sparse leaf 3 encoding a β-ketoacyl-CoA reductase is involved in cuticular wax biosynthesis in rice. Plant Cell Rep. 2016;35(8):1687–98.View ArticlePubMed CentralGoogle Scholar
- Beaudoin F, Wu X, Li F, Haslam RP, Markham JE, Zheng H, et al. Functional characterization of the Arabidopsis β-ketoacyl-coenzyme a reductase candidates of the fatty acid elongase. Plant Physiol. 2009;150(3):1174–91.View ArticlePubMed CentralGoogle Scholar
- Bach L, Michaelson LV, Haslam R, Bellec Y, Gissot L, Marion J, et al. The very-long-chain hydroxy fatty acyl-CoA dehydratase PASTICCINO2 is essential and limiting for plant development. Proc Natl Acad Sci U S A. 2008;105(38):14727–31.View ArticlePubMed CentralGoogle Scholar
- Zheng H, Rowland O, Kunst L. Disruptions of the Arabidopsis Enoyl-CoA reductase gene reveal an essential role for very-long-chain fatty acid synthesis in cell expansion during plant morphogenesis. Plant Cell. 2005;17(5):1467–81.View ArticlePubMed CentralGoogle Scholar
- Xu R, Wang R, Liu A. Expression profiles of genes involved in fatty acid and triacylglycerol synthesis in developing seeds of Jatropha (Jatropha curcas L.). Biomass Bioenergy. 2011;35:1683–92.View ArticleGoogle Scholar
- Baud S, Lepiniec L. Physiological and developmental regulation of seed oil production. Prog Lipid Res. 2010;49(3):235–49.View ArticlePubMed CentralGoogle Scholar
- Xu R, Yang T, Wang R, Liu A. Characterization of DGAT1 and DGAT2 from Jatropha curcas and their functions in storage lipid biosynthesis. Funct Plant Biol. 2014;41:321–9.View ArticleGoogle Scholar
- Yang T, Xu R, Chen J, Liu A. β-ketoacyl-acyl carrier protein synthase I (KASI) plays crucial roles in the plant growth and fatty acids synthesis in tobacco. Int J Mol Sci. 2016;17(8):1287.View ArticleGoogle Scholar
- Grabherr MG, Haas BJ, Yassour M, Levin JZ, Thompson DA, Amit I, et al. Full-length transcriptome assembly from RNA-seq data without a reference genome. Nat Biotechnol. 2011;29(7):644–52.View ArticlePubMed CentralGoogle Scholar
- Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005;21(18):3674–6.View ArticleGoogle Scholar
- Ye J, Fang L, Zheng H, Zhang Y, Chen J, Zhang Z, et al. WEGO: a web tool for plotting GO annotations. Nucleic Acids Res. 2006;34:W293–7.View ArticlePubMed CentralGoogle Scholar
- Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008;5(7):621–8.View ArticleGoogle Scholar
- Galli V, Guzman F, Messias RS, Körbes AP, Silva SD, Margis-Pinheiro M, et al. Transcriptome of tung tree mature seeds with an emphasis on lipid metabolism genes. Tree Genet Genomes. 2014;10:1353–67.View ArticleGoogle Scholar
- Costa GG, Cardoso KC, Del Bem LE, Lima AC, Cunha MA, de Campos-Leite L, et al. Transcriptome analysis of the oil-rich seed of the bioenergy crop Jatropha curcas L. BMC Genomics. 2010;11:462.View ArticlePubMed CentralGoogle Scholar
- Xia EH, Jiang JJ, Huang H, Zhang LP, Zhang HB, Gao LZ. Transcriptome analysis of the oil-rich tea plant, Camellia oleifera, reveals candidate genes related to lipid metabolism. PLoS One. 2014;9(8):e104150.View ArticlePubMed CentralGoogle Scholar
- Huai D, Zhang Y, Zhang C, Cahoon EB, Zhou Y. Combinatorial effects of fatty acid elongase enzymes on nervonic acid production in Camelina sativa. PLoS One. 2015;10(6):e0131755.View ArticlePubMed CentralGoogle Scholar
- Mietkiewska E, Brost JM, Giblin EM, Barton DL, Taylor DC. Cloning and functional characterization of the fatty acid elongase 1 (FAE1) gene from high erucic Crambe abyssinica cv. Prophet Plant Biotechnol J. 2007;5(5):636–45.View ArticlePubMed CentralGoogle Scholar
- Shockey JM, Gidda SK, Chapital DC, Kuan JC, Dhanoa PK, Bland JM, et al. Tung tree DGAT1 and DGAT2 have nonredundant functions in triacylglycerol biosynthesis and are localized to different subdomains of the endoplasmicreticulum. Plant Cell. 2006;18(9):2294–313.View ArticlePubMed CentralGoogle Scholar
- Turchetto-Zolet AC, Maraschin FS, de Morais GL, Cagliari A, Andrade CM, Margis-Pinheiro M, et al. Evolutionary view of acyl-CoA diacylglycerol acyltransferase (DGAT), a key enzyme in neutral lipid biosynthesis. BMC Evol Biol. 2011;11:263.View ArticlePubMed CentralGoogle Scholar
- Burgal J, Shockey J, Lu C, Dyer J, Larson T, Graham I, et al. Metabolic engineering of hydroxy fatty acid production in plants: RcDGAT2 drives dramatic increases in ricinoleate levels in seed oil. Plant Biotechnol J. 2008;6(8):819–31.View ArticlePubMed CentralGoogle Scholar
- Le BH, Cheng C, Bui AQ, Wagmaister JA, Henry KF, Pelletier J, et al. Global analysis of gene activity during Arabidopsis seed development and identification of seed-specific transcription factors. Proc Natl Acad Sci U S A. 2010;107(18):8063–70.View ArticlePubMed CentralGoogle Scholar
- Focks N, Benning C. Wrinkled1: a novel, low-seed-oil mutant of Arabidopsis with a deficiency in the seed-specific regulation of carbohydrate metabolism. Plant Physiol. 1998;118(1):91–101.View ArticlePubMed CentralGoogle Scholar
- Kong Q, Ma W. WRINKLED1 transcription factor: how much do we know about its regulatory mechanism? Plant Sci. 2018;272:153–6.View ArticlePubMed CentralGoogle Scholar
- Baud S, Mendoza MS, To A, Harscoët E, Lepiniec L, Dubreucq B. WRINKLED1 specifies the regulatory action of LEAFY COTYLEDON2 towards fatty acid metabolism during seed maturation in Arabidopsis. Plant J. 2007;50(5):825–38.View ArticlePubMed CentralGoogle Scholar
- Maeo K, Tokuda T, Ayame A, Mitsui N, Kawai T, Tsukagoshi H, et al. An AP2-type transcription factor, WRINKLED1, of Arabidopsis thaliana binds to the AW-box sequence conserved among proximal upstream regions of genes involved in fatty acid synthesis. Plant J. 2009;60(3):476–87.View ArticlePubMed CentralGoogle Scholar
- Stone SL, Braybrook SA, Paula SL, Kwong LW, Meuser J, Pelletier J, et al. Arabidopsis LEAFY COTYLEDON2 induces maturation traits and auxin activity: implications for somatic embryogenesis. Proc Natl Acad Sci U S A. 2008;105(8):3151–6.View ArticlePubMed CentralGoogle Scholar
- Chiu RS, Nahal H, Provart NJ, Gazzarrini S. The role of the Arabidopsis FUSCA3 transcription factor during inhibition of seed germination at high temperature. BMC Plant Biol. 2012;12:15.View ArticlePubMed CentralGoogle Scholar
- Roscoe TT, Guilleminot J, Bessoule JJ, Berger F, Devic M. Complementation of seed maturation phenotypes by ectopic expression of ABSCISIC ACID INSENSITIVE3, FUSCA3 and LEAFY COTYLEDON2 in Arabidopsis. Plant Cell Physiol. 2015;56(6):1215–28.View ArticlePubMed CentralGoogle Scholar