Mining and identification of polyunsaturated fatty acid synthesis genes active during camelina seed development using 454 pyrosequencing

Wang, Fawei; Chen, Huan; Li, Xiaowei; Wang, Nan; Wang, Tianyi; Yang, Jing; Guan, Lili; Yao, Na; Du, Linna; Wang, Yanfang; Liu, Xiuming; Chen, Xifeng; Wang, Zhenmin; Dong, Yuanyuan; Li, Haiyan

doi:10.1186/s12870-015-0513-6

Research article
Open access
Published: 18 June 2015

Mining and identification of polyunsaturated fatty acid synthesis genes active during camelina seed development using 454 pyrosequencing

Fawei Wang¹,
Huan Chen²,
Xiaowei Li¹,
Nan Wang¹,
Tianyi Wang²,
Jing Yang¹,
Lili Guan¹,
Na Yao¹,
Linna Du¹,
Yanfang Wang¹,
Xiuming Liu¹,
Xifeng Chen³,
Zhenmin Wang³,
Yuanyuan Dong¹ &
…
Haiyan Li^1,2

BMC Plant Biology volume 15, Article number: 147 (2015) Cite this article

3301 Accesses
10 Citations
2 Altmetric
Metrics details

Abstract

Background

Camelina (Camelina sativa L.) is well known for its high unsaturated fatty acid content and great resistance to environmental stress. However, little is known about the molecular mechanisms of unsaturated fatty acid biosynthesis in this annual oilseed crop. To gain greater insight into this mechanism, the transcriptome profiles of seeds at different developmental stages were analyzed by 454 pyrosequencing.

Results

Sequencing of two normalized 454 libraries produced 831,632 clean reads. A total of 32,759 unigenes with an average length of 642 bp were obtained by de novo assembly, and 12,476 up-regulated and 12,390 down-regulated unigenes were identified in the 20 DAF (days after flowering) library compared with the 10 DAF library. Functional annotations showed that 220 genes annotated as fatty acid biosynthesis genes were up-regulated in 20 DAF sample. Among them, 47 candidate unigenes were characterized as responsible for polyunsaturated fatty acid synthesis. To verify unigene expression levels calculated from the transcriptome analysis results, quantitative real-time PCR was performed on 11 randomly selected genes from the 220 up-regulated genes; 10 showed consistency between qRT-PCR and 454 pyrosequencing results.

Conclusions

Investigation of gene expression levels revealed 32,759 genes involved in seed development, many of which showed significant changes in the 20 DAF sample compared with the 10 DAF sample. Our 454 pyrosequencing data for the camelina transcriptome provide an insight into the molecular mechanisms and regulatory pathways of polyunsaturated fatty acid biosynthesis in camelina. The genes characterized in our research will provide candidate genes for the genetic modification of crops.

Background

Polyunsaturated fatty acids (PUFAs) are fatty acids that contain more than one double bond in their backbone. They include many important compounds such as essential fatty acids (omega-3 and omega-6 fatty acids) that human beings and animals cannot synthesize and need to acquire through food. Fish oil and vegetable oil supplements are the main sources of PUFAs. Vegetable oils, such as soybean oil, contain about 7 % alpha-linolenic acid (ALA) (omega-3 fatty acid) and 52 % linoleic acid (LA) (omega-6 fatty acid) [1]. The optimal dietary fatty acid profile includes a low intake of both saturated and omega-6 fatty acids and a moderate intake of omega-3 fatty acids [2]. However, the majority of vegetable oils contains excessive amounts of omega-6 fatty acids but are deficient in omega-3 fatty acids, except for camelina oil and linseed oil. Modulation of omega-3/omega-6 polyunsaturated fatty acid ratios has important implications for human health.

Camelina sativa is a flowering plant in the family Brassicaceae and is usually known as camelina. This plant is cultivated as an oilseed crop mainly in Europe and North America. The dominant fatty acids of camelina oil are omega-3 fatty acid (31.1 %) and omega-6 fatty acid (25.9 %) [3]. Importantly, camelina oil also contains high levels of gamma-tocopherol (vitamin E), which protects against lipid oxidation [4]. The fatty acid composition of camelina oil is especially suitable for human health. However, the mechanisms of polyunsaturated fatty acid synthesis in C. sativa are still unknown. In recent years, researchers have paid more and more attention to camelina. Hutcheon et al. [5] characterized two genes of the fatty acid biosynthesis pathway, fatty acid desaturase (FAD) 2 and fatty acid elongase (FAE) 1, which revealed that C. sativa be considered an allohexaploid. The allohexaploid nature of the C. sativa genome brings more complexity in the biosynthesis of PUFAs. Moreover, the functions of three CsFAD2 were further studied soon after [6]. Furthermore, the genome of C. sativa has been sequenced and annotated [7]. C. sativa could also be used as a recipient to overexpress PUFA synthesis genes and produce more PUFAs, such as omega-3 or omega-6 fatty acids [8-10]. In previous studies, the transcriptome analysis of C. sativa had carried out by 454 sequencing, Illumina GAIIX sequencing and paired-end sequencing [11-13]. However, the mechanism of PUFA biosynthesis in C. sativa remains unclear and difficult to predict.

To comprehensively understand the molecular processes underlying the seed development of C. sativa, we characterized the transcriptome of seeds at different developmental stages. We generated 831,632 clean reads and obtained 32,759 unigenes from seed samples. We then matched the unigenes to 187 pathways and identified 47 PUFA biosynthesis related genes. We verified the expression levels of 11 randomly selected genes from 220 up-regulated genes, 10 of which showed the same results in both qRT-PCR and sequencing. To our knowledge, this is the first genome-wide study of transcript profiles in C. sativa seeds at different developmental stages. The assembled, annotated unigenes and gene expression profiles will facilitate the identification of genes involved in PUFA biosynthesis and be a useful reference for other C. sativa developmental studies.

Results

Lipid accumulation at different stages during seed development

To characterize the polyunsaturated fatty acid (PUFA) synthesis genes in camelina, we quantified the lipid contents in camelina seeds harvested from 10 to 40 days after flowering (DAF). After testing, we found that the lipid content was very low in seeds at 10 DAF. The lipid contents increased dramatically during 10 to 25 DAF, reached a maximum level at 25 DAF, and then remained steady until 40 DAF (Fig. 1). According to this result, 10 DAF and 20 DAF seed samples were used for transcriptome sequencing analysis to explore PUFA synthesis genes.

Sequencing output and assembly

Total RNA was extracted from the seeds of C. sativa. The quality of RNA and cDNA were examined by electrophoresis and Agilent2100, which were shown in Additional file 1: Fiugre S2. The cDNA libraries form 10 DAF and 20 DAF were subjected to 454 pyrosequencing. After sequencing, a total of 529,324 and 318,804 high-quality transcriptomic raw sequence reads were obtained from the 10 DAF and 20 DAF samples, respectively (Table 1). To obtain clean reads, contaminating sequences, low quality reads, short reads, highly repetitive sequences and vector sequences were filtered out. Finally, 521,507 and 310,125 clean reads were obtained from 10 DAF and 20 DAF with average lengths of 630 bp and 654 bp. Furthermore, 25,398 and 23,678 unigenes were assembled based on the clean reads of these two samples. The size distribution of these unigenes is shown in Fig. 2. The longest unigene was 7,043 bp. Most of the unigenes (80.72 %) were distributed in the 200–1,000 bp region, while unigenes of 1,001–2,000 bp length accounted for 9.5 % of the total. Of these genes, 9,081 were unique to 10 DAF and 7,361 were unique to 20 DAF (Fig. 3). The differences in unique genes were of interest because of their potential importance at each stage.

Table 1 Overview of sequencing, assembly and data statistics

Full size table

Transcriptional profile analysis of unigenes during seed development

Differentially transcribed sequences were analyzed in the 10 DAF and 20 DAF samples to characterize the PUFA synthesis genes. Of the 32,759 total genes, 12,476 up-regulated genes (log2 ratio (20 DAF/10 DAF) ≥ 1) and 12,390 down-regulated genes (log2 ratio (10 DAF/20 DAF) ≥ 1) were predicted to be significantly differentially expressed genes (DEGs) in the 20 DAF sample compared with 10 DAF (Fig. 4A). The transcriptional levels of 15.61 % of unigenes increased more than 2-fold in 20 DAF and 9.64 % of genes increased more than 2-fold in 10 DAF (Fig. 4B). The differences in the expression of shared genes were of interest to discover PUFA synthesis genes active throughout seed development. Next, the unigenes were analyzed using the COG and KEGG pathway databases for functional annotation.

Functional annotation and classification

To identify which pathways they belonged to, the unigenes were annotated using the COG, KEGG and other databases. The number of matched proteins in different databases was summarized in the Additional file 2: Table S4. Twenty-five functional categories were identified by COG classification (Fig. 5). General function proteins represented the largest category, comprising about 16.46 % of all unigenes. The next largest category was the “posttranslational modification, protein turnover, chaperones” group (14.323 %). “Lipid transport and metabolism”, which we focused on, comprised about 3.503 %. Furthermore, gene annotation based on the DEGs was carried out. There were more up-regulated genes (log2 ratio (20 DAF/10 DAF) ≥ 1) than down-regulated genes (log2 ratio (10 DAF/20 DAF) ≥ 1) in all categories, except “cytoskeleton” (Fig. 6).

In the KEGG pathway annotation, 187 pathways were matched as shown in Additional file 3: Table S1. KEGG pathway network analysis showed that there are 11 and 69 up-regulated unigenes in the “fatty acid biosynthesis” pathway in 10 DAF (10 DAF vs 20 DAF) and 20 DAF (20 DAF vs 10 DAF) samples, respectively. Many genes encoding enzymes were found in this pathway, such as acetyl-CoA carboxylase (6.4.1.2, 6.3.4.14), enoyl-acyl carrier protein reductase (FabK), 3-ketoacyl-acyl carrier protein reductase (FabG) and acyl-acyl carrier protein desaturase (1.14.192) (Fig. 7). FabF, which catalyzes the condensation reaction of fatty acid synthesis by the addition of two carbons to an acyl acceptor, was down-regulated in this pathway. In addition, 51 and 98 up-regulated genes were found in 10 DAF (10 DAF vs 20 DAF) and 20 DAF (20 DAF vs 10 DAF) in the “biosynthesis of unsaturated fatty acids” pathway (Additional file 3: Table S1). However, the only one gene encoding acyl-CoA thioesterase (3.1.2.2) was matched to 22 reactions (Additional file 4: Fig. S1).

DEGs related to PUFA biosynthesis

After gene functional annotation, we searched for fatty acid synthesis genes among the unigenes. We found 220 up-regulated fatty acid biosynthesis genes in the 20 DAF sample (Additional file 5: Table S2). In this group, 47 PUFA synthesis related genes were discovered (Table 2). Most of them were annotated as omega 6 fatty acid desaturase (10 genes), delta-9 acyl-lipid desaturase (8 genes) and long chain acyl-CoA synthetase (7 genes). Omega 6 fatty acid desaturase and delta-9 acyl-lipid desaturase are desaturases that remove two hydrogen atoms from a fatty acid, creating a carbon/carbon double bond. They play an important role in PUFA synthesis. Long chain acyl-CoA synthetase can activate long chain and very long chain fatty acids to form acyl-CoAs. All of these genes are worthy of further investigation in future studies of PUFA synthesis.

Table 2 DEGs involved in the PUFA synthesis pathway

Full size table

Validation of DEGs by quantitative real-time PCR

To confirm the expression data from 454 pyrosequencing, quantitative real-time PCR (qRT-PCR) was performed to analyze the expression of candidate genes. Eleven up-regulated fatty acid biosynthesis related genes in 20 DAF were selected for this verification, and 18S rRNA was used as an internal control. Only unigene3525 was not consistent with the sequencing results. The other 10 unigenes showed largely consistent results between qRT-PCR and 454 pyrosequencing (Fig. 8).

Discussion

Oils extracted from plants have been widely used since ancient times in many countries. In addition, vegetable oils contain enhanced levels of health-promoting natural compounds and are associated with human health. However, researchers have found that a high intake of saturated and omega-6 fatty acids can increase the risk of cardiovascular disease (CVD) and cancer, in particular breast cancer, in recent years [2, 14]. At the same time, omega-3 PUFAs were shown to have chemopreventive properties against various cancers and their complications, including colon and breast cancer [15, 16]. These results suggest that a well-balanced omega-3/omega-6 fatty acid ratio will be beneficial for people’s health. Therefore, it is essential to increase the content of omega-3 fatty acids and reduce the omega-6 fatty acid contents in vegetable oils. Fish, such as salmon, herring, mackerel, anchovies and sardines, are a significant source of omega-3 long-chain PUFAs in the human diet [17]. With ocean exploitation increasing, reducing the amount of fish oil obtained from aquaculture is critical for sustainability and economic reasons [18]. A replacement for fish oil needs to be discovered urgently.

Much work has been done to engineer a sustainable land-based source of omega-3 long-chain PUFAs. Recently, the achievement of a high omega-3/omega-6 ratio through genetic and plant engineering was reported. The results indicated that both Arabidopsis and camelina transgenic plants contained fish oil-like levels of DHA [9, 19]. Therefore, mining and characterization of PUFA biosynthesis genes are essential to improve the FA contents in plants by genetic engineering. In this study, our objective was to characterize the PUFA biosynthesis pathway genes active during seed development using 454 pyrosequencing. The expression levels of FA biosynthesis genes are induced before the early events of seed development [20, 21]. Our results showed that lipid content increased significantly from 10 to 25 DAF. Thus, 10 and 20 DAF samples were selected for expression profiling of camelina seeds. These results are in agreement with data published by Lee et al. [22] and Luo et al. [23].

By transcriptome sequence analysis, we obtained 831,632 clean reads, from which 32,759 predicted genes were subjected to BLAST annotation. The genome of C. sativa was sequenced recently and a total of 89,418 protein-coding genes were annotated [7]. This result confirmed the quality of our sequencing of camelina seeds. To investigate the PUFA biosynthesis pathway, we searched for fatty acid synthesis-associated genes across our sequencing results and found 220 up-regulated fatty acid biosynthesis genes in 20 DAF sample. Among them, several genes were characterized as key enzymes in FA biosynthesis (Fig. 7). 3-Ketoacyl-acyl-carrier-protein reductase (FabG) was reported to be an essential enzyme for type II fatty acid biosynthesis and catalyzes an NADPH-dependent reduction of 3-ketoacyl-ACP to the (R)-3-hydroxyacyl isomer [24, 25]. Another key enzyme, enoyl-acyl-carrier-protein reductase (FabI), found in the FA biosynthesis pathway plays a determinant role in establishing the rate of FASII [26-28]. These results indicate that the genes shown in Fig. 7 would play an important role in FA biosynthesis. Further studies are needed to determine the functions of these genes.

In a previous study, oleic acid (OA), LA and ALA were used as substrates for conversion to the beneficial omega-3 long chain polyunsaturated fatty acid (LC-PUFA) EPA and DHA [9]. The content of unsaturated fatty acids in camelina is higher than in most other plants. In this study, we found 47 up-regulated PUFA biosynthesis-related genes in camelina seeds (Table 2). Twenty-one FAD genes were found and 13 of them were up-regulated and 6 were down-regulated (Additional file 6: Table S3). Ten up-regulated omega-6 FAD genes were found during seed development (Table 2). All of them were annotated as FAD2, which encodes an endoplasmic reticulum (ER) membrane-bound desaturase catalyzing conversion of OA to LA. Similarly, the expression levels of most FAD2 genes were consistent with the results of Hutcheon et al. [5]. FAD2 was characterized to have a key role in the PUFA biosynthesis pathway in higher plant [29, 30]. LA account for about 93 % omega-6 fatty acid (24.2 % vs 25.9 %) in camelina seeds [3], it will be mainly catalyzed by the omega-6 fatty acid desaturases. On the other hand, ALA makes up about 30 % of the total fatty acid in camelina seeds [3]. Three FAD3 (unigene24351, 4386 and 23778) and three FAD7 (unigene13235, 17479 and 8495) were found in camelina transcriptome (Additional file 6: Table S3). However, only one FAD3 (unigene24351) was up-regulated during seed development. The expression level of unigene4386 and unigene13235 were induced slightly in 20 DAF sample. Unigene23778, unigene17479 and unigene8495 did not express in the 20 DAF sample, but they specifically expressed in 10 DAF sample. These results are consistently observed in the genome-wide analysis of FAD3 in Gossypium hirsutum. The transcript level of GhiFAD3-1 could be detected only in the early stage of G. hirsutum seed development [31]. In developing cotton fibers, the expression of GhiFAD3-1 was down-regulated in both wild and domesticated G. hirsutum varieties [31]. These results suggest that ALA could be synthesized in the early stage of camelina and cotton developing seeds.

Other genes involved in PUFA biosynthesis were also found in this study, such as phosphatidylcholine diacylglycerol cholinephosphotransferase (PDAT) and acyl-CoA:diacylglycerol acyltransferase (DGAT). Triacylglycerol (TAG) can be formed via an acyl-CoA-dependent or acyl-CoA-independent process which catalyzed by PDAT and DGAT. The transcripts of 6 PDAT and 3 DGAT genes were found during camelina seed development stage (Table 2). All of them were up-regulated in 20 DAF sample. In previous study, overexpression of Linum usitatissimum PDAT and DGAT gene were characterized to produce more ALA in yeast strain H1246 [32, 33]. Moreover, overexpression of LuPDAT in Arabidopsis seed resulted in an enhanced level of PUFAs [32]. These results indicated that both PDAT and DGAT might have critical role in the TAG and PUFA biosynthesis in camelina seeds. Additionally, long chain acyl-CoA synthetases (ACSL) are key enzymes responsible for the conversion of acyl-AMP to acyl-CoA during fatty acid biosynthesis [34]. Here, we characterized 22 ACSL genes and 9 of them were up-regulated during seed development (Table 2). Therefore, the identified changes in gene expression in C. sativa may facilitate PUFA biosynthesis and the identification of related genes. This study will provide a resource for further studies on individual genes associated with fatty acid biosynthesis.

Conclusions

According to the pyrosequencing, 831,632 clean reads were obtained and 32,759 unigenes were predicted. All unigenes were analyzed with gene annotations from COG, KEGG, NR, NT and SwissProt databases. Among them, 220 up-regulated genes were identified as FA synthesis related genes (Additional files 5: Table S2), 47 of them are involved in PUFA biosynthesis (Table 2). Fifty-nine unigenes encoding FAD2, FAD3, PDAT, DGAT and ACSL genes were found in the camelina transcriptome, most of them were up-regulated in the 20 DAF seeds. This transcriptome results provide a novel insight into the biosynthesis of polyunsaturated fatty acids. This research might represent a powerful tool to understand the molecular mechanisms of seed development and the result might be helpful for further gene expression, functional genomic studies and camelina molecular breeding.

Materials and Methods

Plant culture and collection

During 2011, eight rows (200 m row length and 50 cm spacing) of camelina were planted in the test plots of Jilin Agricultural University in Jilin Province, China at a uniform depth. The plants were subjected to irrigated and non-irrigated conditions until harvest. Irrigation was applied weekly to supplement recorded rainfall using above-ground drip irrigation as described by Campbell and Bauser [35]. The developmental processes of camelina seeds from flowering to seed maturity were observed from July to August 2011. Seeds were harvested at 10 DAF (immature stage), and then every 5 days until 40 DAF (mature stage). After removing the seed coat, the seeds were immediately frozen in liquid nitrogen for oil extraction and RNA isolation.

Measurement of oil content

To extract the oil (or lipids), seeds harvested at 10, 15, 20, 25, 30, 35 and 40 DAF were oven-dried at 85 °C overnight. The dry samples were ground to a fine powder by a disintegrator, and the powder was transferred into glass tubes for oil extraction. Oil was extracted using ligarine to determine total lipids (TL) gravimetrically with the SER148 3/6 extraction apparatus (VELP Scientifica, Italy). Experiments were carried out using triplicate samples for each stage and mean values were determined. Errors are shown as standard deviations. Statistical significance analyses were performed using t-test by SPSS (version 13.0, P < 0.05).

Total RNA extraction and cDNA synthesis

Total RNA was extracted from these materials using TRIzol Reagent (Invitrogen, USA) following the manufacturer’s protocol. The quality of total RNA was determined using a NanoDrop Spectrometer (ND-1000 Spectrophotometer, Peqlab). The mRNAs were isolated from total RNAs using the PolyATtract mRNA Isolation Systems kit (Promega) and condensed using the RNeasy RNA cleaning kit (Qiagen, Germany); their concentration and purity were determined using the Agilent 2100 Bioanalyzer (RNA Nano Chip, Agilent). The mRNAs were fragmented and retrieved using an RNA Fragment reagent kit (Illumina) and RNeasy RNA cleaning kit (Qiagen). Then, random primers and M-MLV were used to synthesize the first chain, and DNA Polymerase I and RNase H were used to synthesize the second chain. Finally, the cDNAs were retrieved using the RNeasy RNA cleaning kit (Qiagen, Germany), and their quality was checked using the Agilent 2100 Bioanalyzer. All procedures were performed according to the manufacturers’ instructions.

454 sequencing and assembly

The raw 454 sequences in SFF files were base called using the python script sff_extract.py developed by COMAV (http://bioinf.comav.upv.es). All of the raw sequences were then processed to remove low quality and adaptor sequences using the programs tagdust [36], LUCY [37] and SeqClean [38] with default parameters. The resulting sequences were then screened against the NCBI UniVec database (http://www.ncbi.nlm.nih.gov/VecScreen/UniVec.html, version 20101122) to remove possible vector sequence contamination. Sequences shorter than 50 bp were discarded. The clean read sequences were assembled using MIRA3 [39] (minimum 30 bases overlap with 80 % identity) and CAP3 (overlap percent identity 90) [40]. The resulting contigs and singletons that were more than 100 nt long were retained as unigenes and annotated in the following steps.

Comparison analysis and functional annotation

To compare the differential expression of genes, we first recorded all reads of a unigene as the expression abundance. Then, expression data normalization was carried out using Reads Per Million reads (RPM) and Reads Per Kilo bases per Million reads (RPKM). The significance of differential gene expression was determined using the False Discovery Rate (FDR) and log2 ratio (T/C). Genes were deemed to be significantly differentially expressed with the threshold of “log2 ratio ≥ 1” and “FDR < 0.001” in sequence counts across the two samples.

Homolog searches against public sequence databases were performed to annotate the functions of the unigenes using BLAST with an E-value cutoff of 1e-6. The annotation of the record with highest similarity in the database was assigned as the functional annotation of the query unigene entry. The databases used for functional annotation included Nr (http://www.ncbi.nlm.nih.gov; version 20101011), Nt (http://www.ncbi.nlm.nih.gov, version 20101011) and SwissProt (http://www.ebi.ac.uk/uniprot, version 20090819). Additional functional classification was conducted using the COG (http://www.ncbi.nlm.nih.gov/COG/) and KEGG pathway (http://www.genome.jp/kegg) databases. ORF analysis was performed by ORF finder (http://www.ncbi.nlm.nih.gov/gorf/gorf.html).

Quantitative real-time PCR (qRT-PCR) analysis

Total RNA was extracted from seeds using TRIzol Reagent (Invitrogen) according to the manufacturer’s protocol. cDNA was synthesized from 2 μg of total RNA using the PrimeScript RT reagent Kit (Takara). Each reaction was performed in a 20 μL volume containing 10 μL SYBR Green Mastermix (Takara), 2 μL 50-fold diluted cDNA template and 1 μM each of the sense and anti-sense primers. qRT-PCR was performed on a Stratagene Mx3000P thermocycler (Agilent) with the following program: 95 °C for 15 s, followed by 40 cycles of 95 °C for 15 s and annealing at 60 °C for 30 s. Triplicates of each reaction were performed using actin as an internal reference. The gene-specific primers used for candidate genes are described in Additional file 7: Table S5.

Availability of supporting data

The sequences used in this study have been submitted to the Sequence Read Archive at NCBI (Accession number: SRX866238).

Abbreviations

ALA:: Alpha linolenic acid
Ascl:: Long chain acyl-CoA synthetase
COG:: Cluster of orthologous groups of proteins
CVD:: Cardiovascular disease
DAF:: Days after flowering
DGAT:: Acyl-CoA:diacylglycerol acyltransferase
DEG:: Differentially expressed genes
ER:: Endoplasmic reticulum
FA:: Fatty acid
FabF:: 3-oxoacyl-acyl-carrier-protein synthase
FabG:: 3-ketoacy-acyl-carrier-protein reductase
FabI/FabK:: Enoyl-acyl-carrier-protein reductase
FAD:: Fatty acid desaturase
FAE:: Fatty acid elongase
FDR:: False disvovery rate
KEGG:: Kyoto encyclopedia of genes and genomes
LA:: Linoleic acid
LC-PUFA:: Long chain polyunsaturated fatty acid
LPCAT:: Lysophosphatidylcholine acyltransferase
NADPH:: Nicotinamide adenine dinucleotide phosphate
OA:: Oleic acid
PDAT:: Phospholipid:diacylglycerol acyltransferase
PDCT:: Phosphatidylcholine diacylglycerol cholinephosphotransferase
PUFA:: Polyunsaturated fatty acid
qRT-PCR:: Quantitative real time polymerase chain reaction
RPKM:: Reads per kilo bases per million reads
RPM:: Reads per million reads
SDA:: Stearidonic acid
TL:: Total lipids

References

Deckelbaum RJ, Torrejon C. The omega-3 fatty acid nutritional landscape: health benefits and sources. J Nutrition. 2012;142(3):587S–91.
Article CAS Google Scholar
Lorgeril D, Patricia S. New insights into the health effects of dietary saturated and omega-6 and omega-3 polyunsaturated fatty acids. BMC Med. 2012;10:50.
Article PubMed Central PubMed Google Scholar
Hixson SM, Parrish CC, Anderson DM. Changes in tissue lipid and fatty acid composition of farmed rainbow trout in response to dietary camelina oil as a replacement fo fish oil. Lipids. 2014;49(1):97–111.
Article CAS PubMed Google Scholar
Eidhin DN, Burke J, O’Beirne D. Oxidative stability of ω3-rich camelina oil and camelina oil-based spread compared with plant and fish oils and sunflower spread. J Food Sci. 2003;68(1):345–53.
Article CAS Google Scholar
Hutcheon C, Ditt RF, Beilstein M, Comai L, Schroeder J, Gold stein E, et al. Polyploid genome of Camelina sativa revealed by isolation of fatty acid synthesis genes. BMC Plant Biol. 2010;10:233.
Article PubMed Central PubMed Google Scholar
Kang JL, Snapp AR, Lu CF. Identification of three genes encoding microsomal oleate desaturases (FAD2) from the oilseed crop Camelina sativa. Plant Physiol Biochem. 2011;49(2):223–9.
Article CAS PubMed Google Scholar
Kagale S, Koh C, Nixon J, Bollina V, Clarke WE, Tuteja R, et al. The emerging biofuel crop Camelina sativa retains a highly undifferentiated hexaploid genome structure. Nat Commun. 2014;23:3706.
Google Scholar
Sayanova O, Ruiz-Lopez N, Haslam RP, Napier JA. The role of deta6-desaturase acyl-carrier specificity in the efficient synthesis of long-chain polyunsaturated fatty acids in transgenic plants. Plant Biotech J. 2012;10(2):195–206.
Article CAS Google Scholar
Petrie JR, Shrestha P, Belide S, Kennedy Y, Lester G, Liu Q, et al. Metabolic engineering Camelina sative with fish oil-like levels of DHA. PLoS One. 2014;9(1):e85061.
Article PubMed Central PubMed Google Scholar
Mansour MP, Shrestha P, Belide S, Petrie JR, Nichols PD, Singh SP. Characterization of oilseed lipids from “DHA-producing Camelina sativa”: A new transformed land plant containing long-chain omega-3 oils. Nutrients. 2014;6(2):776–89.
Article CAS PubMed Central PubMed Google Scholar
Nguyen HT, Silva JE, Podicheti R, Macrander J, Yang W, Nazarenus TJ, et al. Camelina seed transcriptome: a tool for meal and oil improvement and translational research. Plant Biotech J. 2013;11:759–69.
Article CAS Google Scholar
Mudalkar S, Golla R, Ghatty S, Reddy AR. De novo transcriptome analysis of an imminent biofuel crop, Camelina sativa L. using Illumina GAIIX sequencing platform and identification of SSR markers. Plant Mol Biol. 2014;84(1–2):159–71.
Article CAS PubMed Google Scholar
Liang C, Liu X, Yiu SM, Lim BL. De novo assembly and characterization of Camelina sativa transcriptome by paired-end sequencing. BMC Genomics. 2013;14:146.
Article CAS PubMed Central PubMed Google Scholar
Siri-Tarno PW, Sun Q, Hu FB, Krauss RM. Meta-analysis of prospective cohort studies evaluating the association of saturated fat with cardiovascular disease. Am J Clin Nutr. 2010;91(3):535–46.
Article Google Scholar
Cockbain AJ, Toogood GJ, Hull MA. Oemga-3 polyunsaturated fatty acids for the treatment and prevention of colorectal cancer. Gut. 2012;61(1):135–49.
Article CAS PubMed Google Scholar
Patterson RE, Flatt SW, Newman VA, Natarajan L, Rock CL, Thomson CA, et al. Marine fatty acid intake is associated with breast cancer prognosis. J Nutr. 2011;141(2):201–6.
Article CAS PubMed Central PubMed Google Scholar
Hixson SM, Parrish CC, Anderson DM. Effect of replacement of fish oil with camelina (Camelina sativa) oil on growth, lipid class, and fatty acid composition of farmed juvenile Atlantic cod (Gadus morhua). Fish Physiol Biochem. 2013;39(6):1441–56.
Article CAS PubMed Google Scholar
Turchini G, Torstensen B, Wing-Keong N. Fish oil replacement in finfish nutrition. Rev Aquac. 2009;1(1):10–57.
Article Google Scholar
Petrie JR, Shrestha P, Zhou XR, Mansour MP, Liu Q, Belide S, et al. Metabolic engineering plant seeds with fish oil-like levels of DHA. PLoS One. 2012;7(11):e49165.
Article CAS PubMed Central PubMed Google Scholar
Chen H, Wang FW, Dong YY, Nan W, Sun YP, Li XY, et al. Sequence mining and transcript profiling to explore differentially expressed genes associated with lipid biosynthesis during soybean seed development. BMC Plant Biol. 2012;12:122.
Article CAS PubMed Central PubMed Google Scholar
Teoh KT, Requesens DV, Devaiah SP, Johnson D, Huang XZ, Howard JA, et al. Transcriptome analysis of embryo maturation in maize. BMC Plant Biol. 2013;13:19.
Article CAS PubMed Central PubMed Google Scholar
Lee JM, Williams M, Tingey S, Rafalski A. DNA array profiling of gene expression changes during maize embryo development. Funct Integr Genomics. 2002;2(1):13–7.
Article CAS PubMed Google Scholar
Luo M, Liu J, Lee RD, Guo BZ. Characterization of gene expression profiles in developing kernels of maize (Zea mays) inbred Tex6. Plant Breed. 2008;127(6):569–78.
Article CAS Google Scholar
Lai CY, Cronan JE. Isolation and characterization of β-ketoacyl-acyl carrier protein reductase (fabG) mutants of Escherichia coli and Salmonella enterica serovar Typhimurium. J Bacteriol. 2004;186:1869–78.
Article CAS PubMed Central PubMed Google Scholar
Tomura CT, Taguchi K, Gan Z, Kuwabara K, Tanaka T, Takase K, et al. Expression of 3-ketoacyl-acyl carrier protein reductase (fabG) genes enhances production of polyhydroxyalkanoate copolymer from glycose in recombinant Escherichia coli JM109. Appl Environ Microbiol. 2005;71(8):4297–306.
Article Google Scholar
Heath RJ, Rock CO. Enoyl-acly carrier protein reductase (fabI) plays a determinant role in completing cycles of fatty acid elongation in Escherichia coli. J Biol Chem. 1995;270(44):26538–42.
Article CAS PubMed Google Scholar
Heath RJ, Rock CO. Regulation of fatty acid elongation and initiation by acyl-acyl carrier protein in Escherichia coli. J Biol Chem. 1996;271(4):1833–6.
Article CAS PubMed Google Scholar
Yao JW, Abdelrahman YM, Robertson RM, Cox JV, Belland RJ, White SW, et al. Type II fatty acid synthesis is essential for the replication of Chlamydia trachomatis. J Biol Chem. 2014;289(32):22365–76.
Article CAS PubMed Google Scholar
Yadav NS, Wierzbicki A, Aegerter M, Caster CS, Perez-Grau L, Kinney AJ, et al. Cloning of higher plant ω-3 fatty acid desaturases. Plant Physiol. 1993;103:467–76.
Article CAS PubMed Central PubMed Google Scholar
Chen JH, Zhu LH, Salentijn EM, Huang BQ, Gruber J, Dechesne AC, et al. Functional analysis of the omega-6 fatty acid desaturase (CaFAD2) gene family of the oil seed crop Crambe abyssinica. BMC Plant Biol. 2013;13:146.
Article Google Scholar
Yurchenko OP, Park S, Ilut DC, Inmon JJ, Millhollon JC, Liechty Z, et al. Genome-wide analysis of the omega-3 fatty acid desaturase gene family in Gossypium. BMC Plant Biol. 2014;14:312.
Article PubMed Central PubMed Google Scholar
Pan X, Siloto RM, Wickramarathna AD, Mietkiewska E, Weselake RJ. Identification of a pair of phospholipid:diacylglycerol acyltransferases from developing flax (Linum usitatissimum L.) seed catalyzing the selective production of trilinolenin. J Biol Chem. 2013;288(33):24173–88.
Article CAS PubMed Central PubMed Google Scholar
Siloto RM, Truksa M, He X, McKeon T, Weselake RJ. Simple methods to detect triacylglycerol biosynthesis in a yeast-based recombinant system. Lipids. 2009;44:963–73.
Article CAS PubMed Google Scholar
Lopes-Marques M, Cunha I, Reis-Henriques MA, Santos MM, Castro LF. Diversity and history of the long-chain acyl-CoA synthetase (Acsl) gene family in vertebrates. BMC Evol Biol. 2013;13:271.
Article PubMed Central PubMed Google Scholar
Campbell BT, Bauper PJ. Genetic variation for yield and fiber quality response to supplemental irrigation within the Pee Dee Upland cotton germplasm collection. Crop Sci. 2007;47:589–97.
Article Google Scholar
Lassmann T, Hayashizaki Y, Daub CO. TagDust—a program to eliminate artifacts from next generation sequencing data. Bioinformatics. 2009;25(21):2839–40.
Article CAS PubMed Central PubMed Google Scholar
Chen YA, Lin CC, Wang CD, Wu HB, Hwang PI. An optimized procedure greatly improves EST vector contamination removal. BMC Genomics. 2007;8:416.
Article CAS PubMed Central PubMed Google Scholar
Chevreux B, Wetter T, Suhai S (1999) Genome sequence assembly using trace signals and additional sequence information. Computer science and biology: Proceedings of the German conference on bioinformatics pp: 45–56.
Conesa A, Gotz S. Blast2GO: A comprehensive suite for functional analysis in plant genomics. Int J Plant Genomics. 2008;2008:619832.
Article PubMed Central PubMed Google Scholar
Salomonis N, Hanspers K, Zambon AC, Vranizan K, Lawlor SC, Dahlquist KD, et al. GenMAPP 2: new features and resources for pathway analysis. BMC Bioinformatics. 2007;8:217.
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgements

This research was supported by the National “863” program (2011AA100606), the Special Program for Research of Transgenic Plants (2014ZX08010-002), the Development and Reform Commission of Jilin Province in China (JF2012C002-4), the National Natural Science Foundation of China (31271746, 31201144, 31101091, 31401403), and the Excellent Innovation Team Project of Jilin Province, China (20111815).

Author information

Authors and Affiliations

Ministry of Education Engineering Research Center of Bioreactor and Pharmaceutical Development, Jilin Agricultural University, Changchun, Jilin, 130118, China
Fawei Wang, Xiaowei Li, Nan Wang, Jing Yang, Lili Guan, Na Yao, Linna Du, Yanfang Wang, Xiuming Liu, Yuanyuan Dong & Haiyan Li
College of life Sciences, Jilin Agricultural University, Changchun, Jilin, 130118, China
Huan Chen, Tianyi Wang & Haiyan Li
Jilin Technology Innovation Center for Soybean Region, Jilin Agricultural University, Changchun, Jilin, 130118, China
Xifeng Chen & Zhenmin Wang

Authors

Fawei Wang
View author publications
You can also search for this author in PubMed Google Scholar
Huan Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaowei Li
View author publications
You can also search for this author in PubMed Google Scholar
Nan Wang
View author publications
You can also search for this author in PubMed Google Scholar
Tianyi Wang
View author publications
You can also search for this author in PubMed Google Scholar
Jing Yang
View author publications
You can also search for this author in PubMed Google Scholar
Lili Guan
View author publications
You can also search for this author in PubMed Google Scholar
Na Yao
View author publications
You can also search for this author in PubMed Google Scholar
Linna Du
View author publications
You can also search for this author in PubMed Google Scholar
Yanfang Wang
View author publications
You can also search for this author in PubMed Google Scholar
Xiuming Liu
View author publications
You can also search for this author in PubMed Google Scholar
Xifeng Chen
View author publications
You can also search for this author in PubMed Google Scholar
Zhenmin Wang
View author publications
You can also search for this author in PubMed Google Scholar
Yuanyuan Dong
View author publications
You can also search for this author in PubMed Google Scholar
Haiyan Li
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Yuanyuan Dong or Haiyan Li.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

Conceived and designed the experiments: FW, YD, HL. Performed the experiments: FW, HC, XL, JY, LG, NY, LD, YW, XL, XC. Analyzed the data: FW, YD, NW, ZW. Read and approved the final manuscript: FW, TW, HL. All authors read and approved the final manuscript.

Additional files

Additional file 1: Fig. S2.

The quality analysis of mRNA and cDNA from C. sativa seeds. The mRNA and cDNA were examined by electrophoresis and shown in (A) and (B). The qualities of mRNA for the construction of cDNA library were further analyzed by Agilent2100 (C-F).

Additional file 2: Table S4.

The number of matched proteins in different database.

Additional file 3: Table S1.

KEGG pathway annotation.

Additional file 4: Fig. S1.

Unsaturated fatty acid biosynthetic pathway in camelina. Red rectangles indicate up-regulated genes in 20 DAF sample. 3.1.2.2/TesB: Acyl-CoA thioesterase (Unigene8524).

Additional file 5: Table S2.

Up-regulated fatty acid biosynthesis genes in the 20 DAF sample.

Additional file 6: Table S3.

Fatty acid desaturase genes involved in the PUFA synthesis pathway.

Additional file 7: Table S5.

Gene-specific primers used in qRT-PCR.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made.

The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.

To view a copy of this licence, visit https://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver (https://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Wang, F., Chen, H., Li, X. et al. Mining and identification of polyunsaturated fatty acid synthesis genes active during camelina seed development using 454 pyrosequencing. BMC Plant Biol 15, 147 (2015). https://doi.org/10.1186/s12870-015-0513-6

Download citation

Received: 10 December 2014
Accepted: 28 April 2015
Published: 18 June 2015
DOI: https://doi.org/10.1186/s12870-015-0513-6

Mining and identification of polyunsaturated fatty acid synthesis genes active during camelina seed development using 454 pyrosequencing

Abstract

Background

Results

Conclusions

Background

Results

Lipid accumulation at different stages during seed development

Sequencing output and assembly

Transcriptional profile analysis of unigenes during seed development

Functional annotation and classification

DEGs related to PUFA biosynthesis

Validation of DEGs by quantitative real-time PCR

Discussion

Conclusions

Materials and Methods

Plant culture and collection

Measurement of oil content

Total RNA extraction and cDNA synthesis

454 sequencing and assembly

Comparison analysis and functional annotation

Quantitative real-time PCR (qRT-PCR) analysis

Availability of supporting data

Abbreviations

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding authors

Additional information

Competing interests

Authors’ contributions

Additional files

Additional file 1: Fig. S2.

Additional file 2: Table S4.

Additional file 3: Table S1.

Additional file 4: Fig. S1.

Additional file 5: Table S2.

Additional file 6: Table S3.

Additional file 7: Table S5.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Plant Biology

Contact us