Transcriptome architecture reveals genetic networks of bolting regulation in spinach
BMC Plant Biology volume 21, Article number: 179 (2021)
Bolting refers to the early flowering stem production on agricultural and horticultural crops before harvesting. Indeed, bolting is an event induced by the coordinated effects of various environmental factors and endogenous genetic components, which cause a large reduction in the quality and productivity of vegetable crops like spinach. However, little is known about the signaling pathways and molecular functions involved in bolting mechanisms in spinach. The genetic information regarding the transition from vegetative growth to the reproductive stage in spinach would represent an advantage to regulate bolting time and improvement of resistant cultivars to minimize performance loss.
To investigate the key genes and their genetic networks controlling spinach bolting, we performed RNA-seq analysis on early bolting accession Kashan and late-bolting accession Viroflay at both vegetative and reproductive stages and found a significant number of differentially expressed genes (DEGs) ranging from 195 to 1230 in different comparisons. These genes were mainly associated with the signaling pathways of vernalization, photoperiod/circadian clock, gibberellin, autonomous, and aging pathways. Gene ontology analysis uncovered terms associated with carbohydrate metabolism, and detailed analysis of expression patterns for genes of Fructose-1, 6-bisphosphate aldolase, TREHALOSE-6-PHOSPHATE SYNTHASE 1, FLOWERING PROMOTING FACTOR 1, EARLY FLOWERING, GIGANTEA, and MADS-box proteins revealed their potential roles in the initiating or delaying of bolting.
This study is the first report on identifying bolting and flowering-related genes based on transcriptome sequencing in spinach, which provides insight into bolting control and can be useful for molecular breeding programs and further study in the regulation of the genetic mechanisms related to bolting in other vegetable crops.
Spinach (Spinacia oleracea L.) from the Amaranthaceae family is an annual, dioecious, and cold-tolerant leafy vegetable plant with diverse nutrients and health-promoting compounds such as fiber, vitamins, iron, and antioxidant activities [1, 2]. This plant is cultivated worldwide and is becoming one of the most important economic vegetable crops with an estimated annual production of ~ 26 million tonnes . Indeed, spinach’s economic value is significantly affected by the yield performance, which is greatly affected by environmental factors, including light, temperature, water, humidity, and nutrition [4, 5]. Bolting or premature flowering, a critical event due to the coordinated effects of various environmental factors and endogenous genetic components [6, 7], is known as one of the most important factors which can cause a large reduction in quality, productivity, and serious economic loss in spinach crop, particularly in the spring [8,9,10]. According to the time of bolting, spinach cultivars are divided into three categories: “early-bolting” cultivars, the floral stem appears earlier than 60 d after planting, “intermediate-bolting” cultivars, bolting time is between 60 and 70 d, and “late-bolting” cultivars, bolting occurs after the 70 d. Late-bolting spinach varieties can cut multiple times during the growing season and increases the overall yield because they are not sensitive to photoperiod . Hence, the use of late-bolting cultivars and regulation of bolting time are the most successful ways to limit the effect of bolting on spinach productivity. Moreover, research on the signaling pathways and molecular functions of flowering-related genes can enable researchers to regulate bolting.
In recent years, many studies have revealed flowering-related signaling pathways and regulatory networks in model plants [11,12,13,14,15], but few studies have focused on spinach. Studies on Arabidopsis , radish , carrot , and lettuce  have discovered substantial information regarding the influence of photoperiod, aging, vernalization, endogenous hormones (especially gibberellins), and signal cascades on bolting. These signals regulate the most important flowering integrators, including SUPPRESSOR OF OVEREXPRESSION OF CO1 (SOC1), FLOWERING LOCUS T (FT), and LEAFY (LFY) that determine the eventual flowering time. The key gene in the photoperiod pathway is CONSTANS (CO), a transcription factor that positively activates FT [19, 20]. In the vernalization pathway, FLOWERING LOCUS C (FLC) is known as a key gene that suppresses flowering through inhibiting SOC1, FT, and LFY genes [21, 22]. SQUAMOSA PROMOTER BINDING-LIKE PROTEIN 1 (SPL1), SPL2, SPL3, SPL9, SPL13, and SPL15 are the most relevant candidate genes related to the aging pathway . Based on both physiological and genetic studies, a decrease in gibberellin content affects flower formation by restricting internode elongation. LFY is one of the main target genes in the gibberellins pathway .
Most of the information has been uncovered over the past few years by using Next-generation sequencing (NGS) technologies. Nowadays, RNA-Seq combined with Digital gene expression (DGE) profiling is a powerful strategy for the global discovery of functional genes and expression analysis under certain conditions in many plant species. For example, RNA-seq has been applied for the detection of genes linked with the flowering process in Arabidopsis thaliana , Raphanus sativus , Eichhornia paniculata , and Lagerstroemia indica , and Lactuca sativa .
Previous studies have reported different genes, pathways, mechanisms, and networks have complex roles in flowering induction that are varied in different plant species under different circumstances [15,16,17,18]. Although the spinach reference genome has been sequenced  by using NGS technologies and bioinformatics approaches, and transcriptome studies are currently in progress, little is known about the genes and genetic networks involved in flowering and bolting mechanisms from vegetative growth to the reproductive stage in spinach. Thus, we performed transcriptome and qPCR analyses to reveal key genes associated with spinach bolting. In this regard, two leafy spinach accessions with different bolting resistance potential, including “Kashan” (early bolting accession), and “Viroflay” (late-bolting accession), were chosen as materials for high-throughput RNA sequencing (RNA-seq) and analyze global transcripts level changes in two vegetative to reproductive stages. Our findings will contribute to identify genes and molecular mechanisms regulating bolting, which could help us to better understand the bolting mechanisms in spinach and can be useful for molecular breeding programs and further study in the regulation of the genetic mechanisms related to bolting in other vegetable crops.
Sequencing data analysis
A total of 559.064 million raw reads were generated from 12 cDNA libraries constructed with leaf samples of two accessions Kashan and Viroflay at vegetative and reproductive stages. The average output of each sample was 46.588 million. Of these reads, an average of 43.269 million clean reads with a Q30 ratio of more than 92.86% was retained from each library after removing low-quality reads and adaptor sequences (Table S1). After the quality check, approximately a range of 97.80–98.30% high-quality sequencing reads of each sample were individually mapped to spinach genome assembly using STAR (Fig. 1, Table S2).
Differential gene expression profiling
To discover the genetic differences between early and late-bolting accessions, DEGs were identified from pairwise comparisons of both stages and accessions (Fig. 2). Developmental stages comparison in each accession revealed a total of 462 and 1230 genes as significant DEGs in Kashan and Viroflay (Fig. 2b), respectively, with a strong DEG signal for upregulated genes in the reproductive stage. The accessions’ comparison in each stage exhibited that the number of DEGs at the reproductive stage (333 genes) was significantly more than that at the vegetative stage (195 genes) (Fig. 2a). Interestingly, at the vegetative stage, more upregulated genes were associated with accession Kashan, whereas at the reproductive stage, more upregulated genes were found in Viroflay. Furthermore, there were 216 DEGs commonly shared between accessions under stages comparison, while 48 DEGs were found to be common between stages in the accessions comparison (Fig. 2a). To obtain a more comprehensive view, the distribution of unique and common DEGs identified in each pairwise comparisons of stages and accessions was shown in Fig. 2c. Additionally, to characterize the subgenome and chromosome distribution of DEGs, the location data and log2 fold change of DEGs in each comparison were identified and plotted on 6 chromosomes of spinach (Fig. S1-S2). It was found that chromosome numbers 1 and 2 have the highest number of DEGs in vegetative stages comparison, similar to stages comparison in accessions Kashan and Viroflay, while DEGs related to reproductive stages comparison showed a more balanced dispersion across all the 6 chromosomes. A file containing the list of DEGs (DataS1) is provided in the supplementary material.
Gene classification of predicted-DEGs
To address the product properties and functional classification of the DEGs, we adopted Gene Ontology (GO) terms for all DEGs identified in pairwise comparisons of stages and accessions using Gene Classification tools located in SpinachBase .
With respect to the stages comparison in accession Kashan, of the 462 DEGs, 303, 325, and 200 were successfully annotated with GO assignments in the three main categories, including biological process (BP, Fig. 3), molecular function (MF, Fig. S3), and cellular component (CC, Fig. S4), respectively. The counterpart numbers of DEGs derived from the Viroflay were 753, 812, and 482 genes, respectively. With respect to the accessions comparison in the vegetative stage, 137, 139, 80 genes were involved in different GO terms of BP, MF, and CC categories, respectively. According to the details of the GO analysis of 333 DEGs that were for the reproductive stage, BP was the dominant category with 275 genes, followed by MF with 220 and CC with 125 genes.
In order to obtain a deeper and better understanding, further analysis included only the GO terms associated with biological processes. Enrichment results revealed that similar functional subcategories, including ‘metabolic processes’, ‘cellular processes’, and ‘biosynthetic processes’ were dominant under the biological processes in all comparisons. Besides that, a more number of DEGs were classified into reproduction (GO:0000003) and flower development (GO:0009908) subclasses in stages comparison of each accession versus accessions comparison in each stage, which is related to the developmental activities taking place during flowering. Additionally, several DEGs were annotated with GO terms related to embryo development (GO:0009790), post-embryonic development (GO: 0009791), pollination (GO: 0009856). These processes are well known to be involved in the transition from the vegetative stage to the reproductive stage.
Moreover, to summarize and characterize DEGs putatively implicated in bolting regulatory networks, the most popular genes were identified and classified in various flowering pathways, including vernalization, photoperiod/circadian clock, GA, autonomous, and aging pathways. In the comparison of vegetative and reproductive stages of each accession, the searching results indicated that 18 and 20 functional genes such as CONSTANS-LIKE 1 (COL1), EARLY FLOWERING 3 (ELF3), FLOWERING LOCUS T (FT), AGAMOUS-LIKE (AGL), and GIGANTEA (GI) were identified and implicated in photoperiod pathway of accessions Kashan and Viroflay, respectively. Besides, the pathways of GA and age contained 7 and 2 DEGs in Kashan and 3 and 1 DEGs in accession Viroflay. More importantly, the pathways of vernalization contained 8 DEGs only in the accession Viroflay.
Transcription factors implicated in spinach bolting
Due to the fact that transcription factor families are associated with the floral development process, we performed a detailed analysis of differential transcription factors to provide further insights into the complex molecular mechanism underlying the bolting.
Among all DEGs identified in the comparison of vegetative and reproductive stages of each accession, a total of 32 and 83 differentially expressed TFs were found in accessions Kashan and Viroflay, respectively. Of these, 13 TFs were commonly expressed in both accessions (Fig. 4a). In the accession Kashan, C2C2-GATA and MADS were the TF families with the most members (5 genes; Fig. 5), MYB was the second-largest TF family with 4 gene members, followed by the bHLH (3 genes). The genes encoding AP2/ERF (10 genes) accounted for the largest proportion of TFs in the accession Viroflay, followed by genes encoding MYB (7 genes), C2C2, and NAC (5 genes). In both accessions, most TFs were upregulated in the reproductive stage compared with the vegetative stage. With respect to the accessions comparison in each stage, 7 and 28 TFs were differentially expressed in vegetative and reproductive stages, respectively (Fig. 4b). The TF families represented by the largest numbers of differentially expressed members were the AP2/ERF (5 genes), WRKY (4 genes), and C2H2 (3 genes) in the reproductive stage (Fig. 6). The majority of TFs were significantly upregulated in the accession Viroflay. The distribution of unique and common TFs identified in each pairwise comparisons of stages and accessions was shown in Fig. 4c.
Gene set enrichment of predicted-DEGs
The gene set enrichment analysis of the DEGs showed that the circadian rhythm and monoterpenoid biosynthesis were common significantly enriched terms in both accessions (Fig. 7). In circadian rhythm, several core genes such as CYCLING DOF FACTOR 1 (CDF1), FT, TWIN SISTER OF FT (TSF), LATE ELONGATED HYPOCOTYL (LHY), F-Box 1 (FKF1), and TRANSPARENT TESTA 4 (TT4), were identified in both accessions. This term also included two flowering-related genes including GI and ELF3 in the accession Kashan (Fig. 7a), while CONSTITUTIVE PHOTOMORPHOGENIC 1 (COP1), PENTATRICOPEPTIDE REPEAT-CONTAINING PROTEIN 5 (PPR5), and CRYPTOCHROME CIRCADIAN REGULATOR 2 (CRY2) genes were unique in the accession Viroflay (Fig. 7b). Based on the functional analysis results, we detected a few terms such as the MAPK signaling pathway and the flavonoid biosynthesis unique to the accession Viroflay and DNA replication and galactose metabolism unique to the accession Kashan, which suggests the transition from vegetative to reproductive stages may be affected by genes involved in these pathways.
Validation of differential gene expression using qRT-PCR
To validate the results of differential expression analysisresults, the relative expression of eight flowering-related genes was examined by qRT-PCR (Fig. 8). Overall, all the selected genes exhibited the same patterns that were consistent with the RNA-seq data, validating positive correlations between the qPCR and RNA-Seq results (Fig. S5). GI, FT, ELF3, and ABA/WDS genes were significantly upregulated in the reproductive stage of both accessions against the vegetative stage, while Agamous-like protein and Zinc finger were downregulated in the reproductive stage. Expression of WRKY 40 was downregulated in the reproductive stage of accession Kashan whereas indicated an opposite expression pattern (as upregulated in reproductive stage) in accession Viroflay. Gibberellin-regulated protein significantly upregulated during the transition from the vegetative to reproductive stage only in accession Viroflay.
In order to identify SNPs associated with the bolting and reveal the effects of SNPs on the functionality of the corresponding genes, we used RNA-seq data of two spinach accessions to identify SNPs and variable genomic regions. In terms of numbers, a total of 168,849 SNPs were generated from 12 libraries using the GATK pipeline. Of these, 59,022 SNPs remained after filtering using cut-off values. Further, these high-quality SNPs were screened for both kinds of homozygous and heterozygous variants, which remained the 3397 SNPs (Table 1). After exploring, a large number of homozygous reference SNPs were identified from Viroflay (1829 SNPs), which was significantly higher than Kashan (550 SNPs), indicating Viroflay is more closely related to genome reference than the accession Kashan. In contrast, the large rate of heterozygous SNPs in Kashan (2727 SNPs) indicates a higher genetic variability in this accession compared to Viroflay.
In further investigation, the SNPs were functionally annotated for obtaining a comprehensive view of the genes associated with the SNPs, giving a total of 3362 annotated SNPs using SnpEff. Of these, 2152 SNPs were distributed across all six chromosomes (Fig. 9). According to the impact results of the SNPs on the functionality of the genes, the vast majority of the variants were categorized into modifier impact (79.12%), followed by low (11.08%), moderate (9.36%), and high (0.42%). It is remarkable the large proportion of variants with modifier effect were observed as downstream (25.84%), upstream (17.1%), intergenic (14.61%), and intron (10.3%) gene variants (Fig. S6), which indicates the presence of these variants in unannotated exons and/or non-coding regions. The low effect SNPs mostly occurred as UTR (11.77%), and synonymous (8.3%) variants, whereas the moderate impact SNPs were observed as missense (11.03%) variants. Moreover, the high impact SNPs were mostly identified as splice variants (0.46%).
Additionally, the SNPs have also been classified according to GO terms for obtaining a comprehensive view of the genes associated with the SNPs. The functional annotation of the filtered SNPs revealed the association of SNPs with genes involved in numerous vital biological processes. Among the biological process annotated, a significant number of SNPs were linked to genes involved in the flowering process, including photosynthesis (49 genes), carbohydrate metabolic process (200 genes), reproduction (77 genes), flower development (55 genes), pollination (11 genes), embryo development (30 genes), and post-embryonic development (70 genes).
The timing of bolting is essential for productivity success in vegetative crops such as spinach. Our past investigation to screen accessions affirmed that bolting time in Kashan (accession with early-bolting phenotype) varies from that of Viroflay with a high potential in the production (accession with late-bolting phenotype) . By studying plants’ genome and transcriptome, researchers have provided much essential information on key genes and mechanisms associated with bolting and flowering regulation. Hence, to determine molecular networks and bolting-related genes in spinach, we investigated the most prominent gene expression changes in leaf tissues of two accessions at different developmental stages using RNA-seq technology. Therefore, a significant number of bolting and flowering related genes which differentially expressed between stages and accessions were recongized by transcriptome analysis., to gain deeper insight into the molecular events that regulate bolting in spinach, DEGs were utilized for GO, KEGG, TFs analyses.
GO terms associated with biological processes for DEGs included ‘metabolic processes’, ‘cellular processes’, and ‘biosynthetic processes’ as the most highly represented terms. Among enrichment terms determined in all comparisons, ‘reproduction’, ‘carbohydrate metabolic process’, ‘generation of precursor metabolites and energy’, and ‘flower development’ were the most related terms, which have roles in the vegetative to reproductive transition. Previous evidence demonstrates the role of carbohydrates in the vegetative to reproductive transition as energy reserves for inflorescence development and signaling molecules . Interestingly, carbohydrates can promote bolting through the up-regulation of flowering promoters . The stringent analyses of genes expression pattern encoding enzymes involved in carbohydrate metabolism indicated a clear differential pattern between stages and accessions. Our integrative results showed that a significant number of carbohydrate-related DEGs (50 genes) were up-regulated in the vegetative stage of Viroflay compared to accession Kashan (16 genes). Among these genes, Fructose-1, 6-bisphosphate aldolase (FBA) that is involved in glycolysis, gluconeogenesis, and the Calvin cycle with significant roles in regulating growth and development processes [32, 33] was found to be up-regulated in accession Viroflay at the vegetative stage. It has been concluded that increase of activity of FBA enhances photosynthetic capacity, growth, and plant productivity or biomass [34,35,36], which is consistent with the high-yield of accession Viroflay as reported in the previous studies  and a higher number of DEGs associated with “photosynthesis” term in Viroflay under present study. Among genes involved in carbohydrate metabolism, TREHALOSE-6-PHOSPHATE SYNTHASE 1 (TPS1), which catalyzes the formation of trehalose-6-phosphate (T6P), has been proposed to function as a proxy for carbohydrate status and a signal that coordinates the induction of flowering in plants [37,38,39]. In this way, Alpha, alpha-trehalose-phosphate synthase (TPS) gene with having important roles in sucrose and starch synthesis [37, 38], and TPS1 were highly up-regulated in both vegetative and reproductive stages of accession Kashan, while in Viroflay showed an increase in expression during the transition from vegetative to reproductive growth. Similarly, there was no obvious difference in sucrose synthase activity during growth development in Kashan (a high expression in both stages), while its expression level increased during the transition from vegetative to reproductive in accession Viroflay. These results demonstrate the potential role of these genes in the flowering delay of accession Viroflay and their target for genome manipulation and genetic improvement in spinach to obtain late-bolting varieties. The GO annotation results of DEGs retrieved FLOWERING PROMOTING FACTOR 1 (FPF1), Transcription factor MADS-box, ELF, and GI as the most important genes in flower development subcategory, which highly expressed in accession Kashan. Previous studies have indicated that FPF1 gene is involved in the promotion of flowering , thereby initiating bolting. We found that FPF1 was highly expressed in the vegetative stage of Kashan while it didn’t show any expression change during growth in accession Viroflay, indicating that probably FPF1 has a stronger effect on the early bolting of accession Kashan. On the other hand, the transcription factor MADS-box showed a sharp increase in expression during the transition from vegetative to the reproductive stage in Kashan, whereas it was considered a non-significant gene in accession Viroflay. In previous reports, MADS-box transcription factors have been characterized in various species such as A. tequilana , radish , carrot , and lettuce  as a major group of regulators controlling floral transition and therefore exhibited differential expression patterns at different developmental stages and in different tissues. GI is a unique plant protein involved in multiple biological functions, including photoperiodic flowering, control over the circadian rhythm, and light and hormone signaling [41,42,43]. Overexpression of GI in wheat  and barley  modified flowering time, leading to early flowering phenotype. According to genetic data, COP1 and ELF3 control flowering by regulating GI stability, so that ELF3 allows COP1 to interact with GI, resulting in GI degradation [43, 46]. In our study, ELF3 and GI were up-regulated in the reproductive stage while COP1 was up-regulated in the vegetative stage of both accessions. Although all three genes showed a similar expression trend during the transition from the vegetative stage to the reproductive stage in both accessions, the accessions comparison in each stage exhibited their different expression level. The results indicated a lower expression level for ELF3 and GI genes in the vegetative stage of accession Viroflay against Kashan. In contrast, COP1 showed a higher expression in a similar stage and accession, suggesting low expression of COP1 in spinach causes early flowering, possibly by increased accumulation of transcripts of floral inducers like GI. Additionally, FLAVIN-BINDING, KELCH REPEAT, F BOX protein 1 (FKF1), a clock-controlled gene was found to be up-regulated in both accession reproductive stages . FKF1 has been reported to promote flowering by making a complex with GI that up-regulates CO and FT [41, 48]. Indeed, expression levels of these key integrator genes precisely adjust the expression of floral specific genes and ultimately determine the exact FT so that the low expression of FKF1 in the vegetative stage of accession Viroflay may cause late flowering in this accession.
Besides the core flowering-related genes, TFs have been reported to be an essential regulators group to control bolting and flowering by affecting the expression of flowering pathways genes and integrating endogenous/exogenous signals [49,50,51]. Since TFs families could also provide insights into comparative transcriptional status between genotypes or cultivars, we performed a detailed analysis of differentially expressed transcription factors. In previous investigations, many specifically expressed TFs such as the C2C2-GATA, AP2/ERF, MADS-box, MYB, bHLH, and NAC have distinguished during the flowering process [15, 16, 18, 51,52,53]. Researchers have demonstrated that GATAs have an essential role in regulating the flowering time so that two GATA factors GNC and CGA1 in A. thaliana directly repress SOC1 expression and thereby repress flowering . In our study, the GATA TFs were overrepresented as up-regulated genes during the vegetative phase of the accession Viroflay but were not identified as DEGs. AP2/ERF superfamily members have a potential impact to positively/negatively regulate various processes such as control of metabolism, growth, and development, as well as flowering regulation through photoperiod pathway . However, information about how these AP2/ERFs regulate flowering time is limited. Thus, only a few examples of AP2/ERFs in other species have been discussed concerning flowering or bolting. For example, the over-expression of seven AP2 genes in Glycine max leads to early flowering , or the expression of miR172, which targets the transcripts of AP2 TFs, inhibits flowering . The transcriptomic profiles obtained in this study indicated seven AP2/ERF TFs were downregulated in vegetative tissue of accession Viroflay, whereas 17 AP2/ERF TFs were upregulated in the same tissue of accession Kashan, but their mRNA levels did not show significant changes against the reproductive stage. From the results presented in this study and previous investigation, we concluded that the different expressions of AP2 TFs between stages and accessions should be responsible for controlling the flowering time. Among TF families, MADS-box (MIKC-type) is introduced to be the most important flowering-related TFs in plants with higher expression in bolting sensitive lines, cultivars, or species [15, 56,57,58]. Interestingly, AGAMOUS has known a subfamily of MADS-box genes that regulate the transcription of two important flowering-time regulators, FLC, and FT . In our study, AGAMOUS-like MADS-box protein recorded higher expressions in the vegetative stage of accession Kashan, supporting the bolting sensitive phenotype in this accession. According to overall expression patterns of TFs in this study and previous researches, we concluded GATA, MADS-box, and MYB TFs in spinach produce similar transcriptomic profiling with those vegetative species that bolting and flowering greatly limit production.
Finally, at the latest consideration, we aimed to investigate the SNPs related to bolting in early and late flowering accessions of spinach using transcriptome data. The revelation of SNPs from the transcriptome data yielded a sum of 168,849 SNPs, which upon intense filtering diminished to 59,022 polymorphic SNPs over the accessions. Furthermore, these high-quality SNPs were screened for both kinds of homozygous and heterozygous variants, which resulted in higher homozygous SNPs in Viroflay against Kashan. Subsequently, from the filtered 3397 SNPs, 2152 were distributed across the length of the six chromosomes, which are useful for the generation of marker-assisted backcross marker probes. However, the number of SNPs was varied among the chromosomes; with the higher number in chromosome 4 followed by chromosome 1 and the lowest in chromosome 5. Additionally, in the current investigation, the SNPs have practically been annotated for obtaining a comprehensive view of the genes associated with the SNPs. The functional annotation results indicated the association of SNPs with genes involved in numerous biological processes. Among the biological processes annotated, the majority of SNPs were linked to genes involved in the cellular process, biosynthetic process, and metabolic process, similar to the functional annotations of SNPs reported in radish, onions, and capsicum [20, 52, 60, 61]. Interestingly, a significant number of SNPs were also linked to genes involved in the flowering process, including photosynthesis, carbohydrate metabolic process, reproduction, and flower development. More investigation indicated that the identified SNPs were mostly associated with the four major transcription factors such as Zinc finger, AP2/ERF, bHLH, and WRKY. In detail, SNPs in the genes involved in the flowering pathways were identified. Among these genes, INDUCER OF CBF EXPRESSION 1 (ICE1), HISTONE-LYSINE N-METHYLTRANSFERASE, FKF1, LUX ARRHYTHMO (LUX), PHYTOCLOCK 1 (PCL1) associated with the vernalization pathway and circadian clock were identified as key genes involved in the flowering-time control.
In the present study, we investigated transcription changes of two spinach accessions in the transition from vegetative growth to the reproductive stage by using RNA-seq. We identified a set of DEGs associated with the vernalization, photoperiod/circadian clock, gibberellin, autonomous, and aging pathways. These results demonstrated the potential role of some of these specific genes in the flowering delay of accession Viroflay and their sufficient target for genome manipulation and genetic improvement in spinach to obtain late-bolting varieties.
According to our previous study , two accessions including Kashan and Viroflay were selected and used as early and late-bolting spinach samples, respectively. In the previous study , the vegetative characteristics of 44 spinach accessions were evaluated based on descriptors investigated by Bioversity International Plant Genetic Resources Institute. According to the results of this research, two accessions Viroflay and Kashan were placed in the group of late and early flowering spinach, respectively. Indeed, maximum variation for the trait of “days to flowering” was found between accessions Viroflay (87 days) and Kashan (43 days). To make a stable condition and eliminate influential environmental factors, seeds of each accession were sown in plastic pots (15 cm diameter, 25 cm high) with sterilized soil and grown in a growth chamber under spring growth conditions for 3 months at Isfahan University of Technology, Isfahan, Iran, in March 2018. In this condition, growth period was also calculated as days to flowering, resulted in 83 and 43 days for accessions Viroflay and Kashan, respectively. To obtain these samples the permissions were not necessary. The formal identification of the plant material was undertaken by the herbarium of Agricultural and Natural Resources College, University of Tehran, and no voucher specimens were collected and deposited in the collection (it is not necessary as we don’t describe a novel species). We also stated that the field studies were in compliance with local legislation of Iran in the experimental greenhouse and growth chamber of Isfahan University of Technology, Isfahan, and no specific licences were required.
RNA isolation, library construction, and sequencing
Total RNAs were extracted from leaf samples of two accessions Kashan and Viroflay at vegetative (four-leaf stage) and reproductive (when 50% of the plants produced flower-stalks) stages in three biological replicates using DENAzist column RNA isolation kit. Indeed, stages were selected so that the majority of differentiation takes place between stages (Days to flowering). Each sample was a pool gathered from at least ten plants in order to decrease the variance caused by interindividual differences in gene expression. The RNAs were quantified on an agarose gel, and the quality was determined based on absorbance ratios (260/280 nm and 260/230 nm) using the NanoDrop spectrophotometer (NanoDrop Technologies). Furthermore, subsequent quality control by using a QC Bioanalyzer, cDNA library preparation, and sequencing was performed at the Personalbio (Shanghai, China), according to the manufacturer’s recommendation. The sequencing was done on an Illumina platform with 150 bp paired-end readers. The reads obtained from sequencing were deposited in the NCBI Sequence Read Archive (https://www.ncbi.nlm.nih.gov/) under accession number PRJNA630139.
Read mapping, expression level calculation, and mining of DEGs
Clean paired-end reads from each of the samples were individually mapped versus the spinach genome assembly version 1  using STAR v2.7.1  software, and then transcripts assembled by the StringTie v2.0.6  with default parameters (without the ‘-e’ option). The assembled transcripts were merged using StringTie’s merge function to create a unique set of transcripts for stages and accessions. The mapped reads were assembled again using Stringtie software with the merged transcripts as a direction, and the ‘-e’ and ‘-B’ options were used to restrict novel transcript prediction and generate an input file for DEGs identification. For differential expression analysis, gene read-count data matrices were produced with python script prepDE.py for stages and accessions. Finally, DEGs were identified through the IDEAMEX website , using the EdgeR , DESeq2 , and NOISeq . The threshold to judge the significance of gene expression differences was “FDR ≤ 0.05, the absolute value of logFC> = 2 and CPM = 1”.
Function investigation of DEGs
Genes classification and transcription factors identification analysis were conducted on all DEGs using gene functional classification tools and the latest genomic reference information of S. oleracea in SpinachBase (http://spinachbase.org) . For gene set enrichment analysis, all DEGs were mapped to the protein sequences source of Arabidopsis (Araport11_genes.201606.pep.fasta) using the BLAST search. This is because of the well-maintained and annotated Arabidopsis genome. Finally, the ClueGO  plug-in v3.7.2 of Cytoscape software  was used to identify significant pathways and to visualize genes in functionally grouped networks.
Quantitative real-time polymerase chain reaction
To validate the expression pattern of bolting and flowering-related genes, quantitative real-time polymerase chain reaction (qRT-PCR) was applied to quantitatively measure the eight candidate genes expression in leaf tissues of two accessions at different developmental stages. In this way, gene-specific primers (Table 2) with melting temperature (Tm) 60 °C were designed using the Primer3 software (http://frodo.wi.mit.edu/primer3/). qRT-PCR reactions were performed in three technical replicates using an ABI system (ABI ViiA 7 Real-time PCR) in a 20 μL final volume, containing 10 μL SYBR Green Master Mix (BioFACT, Korea), 2 μL of diluted cDNA, and 1 μL of each primer (10 μM) in conjunction with adding PCR-grade water. The qPCR experiment was carried out based on a thermal program of 5 min at 95 °C, 40 cycles of 10 s at 95 °C, 20 s at the specific annealing temperature for each primer, 20 s at 72 °C, and finally a melting curve program. The statistical analysis of gene expression was carried out using the 2 − ∆∆Ct method  by using Actin and GAPDH as internal reference (housekeeping) genes.
SNPs detection from RNA-seq data
SNPs were called from transcriptome data by a Joint genotyping method  using the Genome Analysis Toolkit (GATK 4.1) in accordance with the Best Practices workflow for variant calling on RNA-seq data. We first used the two-pass mapping strategy (−twopassMode Basic) with STAR aligner  to generate coordinate-sorted BAM files and then read groups were added to BAM files with the Picard tool AddOrReplaceReadGroups (https://broadinstitute.github.io/picard/). Next, duplicated reads were marked with Picard tools, so that GATK tools could automatically ignore them. Subsequently, SAMtools v1.10  was applied to merge BAM files belong to each stage and the recommended Split’N’Trim and indel realignments steps were also performed on each merged BAM file. In the next step of this approach, the potential variants were called using HaplotypeCaller algorithm with the –ERC GVCF mode, leading to the production of gVCF files. Then, variants were called through a Joint Genotyping analysis from all gVCF files. As recommended by the GATK Best Practices, variant filtering was performed at this step with the following options: --filter-expression “QD < 2.0” --filter-name “SNPQDFilter” --filter-expression “FS > 30.0” --filter-name “SNPFSFilter” --filter-expression “MQ < 40.0” --filter-name “SNPMQFilter” --filter-expression “MQRankSum < -12.5” --filter-name “SNPMQRSQFilter” --filter-expression “ReadPosRankSum < -8.0” --filter-name “SNPRPRSFilter” --filter-expression “HaplotypeScore > 13.0” --filter-name “SNPHSFilter” --filter-expression “DP>100 || DP<5”. Next, this VCF file was fed into VCFtools v0.1.16  to remove indels, and finally, SnpSift v4.3  was used for pulling out specific genotype combinations.
Availability of data and materials
All RNA-Seq data were deposited in the NCBI SRA database under the project PRJNA630139 (https://www.ncbi.nlm.nih.gov/bioproject/PRJNA630139).
Differentially expressed genes
Quantitative real-time polymerase chain reaction
Suppressor of Overexpression of CO1
Flowering locus T
Flowering locus c
Squamosa promoter binding-like protein 1
Trehalose-6-phosphate synthase 1
Inducer of cbf expression 1
Flowering promoting factor 1
Flavin-binding, kelch repeat, F box protein 1
Early flowering 3
Van Treuren R, Coquin P, Lohwasser U. Genetic resources collections of leafy vegetables (lettuce, spinach, chicory, artichoke, asparagus, lamb’s lettuce, rhubarb and rocket salad): composition and gaps. Genet Resour Crop Evol. 2012;59(6):981–97. https://doi.org/10.1007/s10722-011-9738-x.
Koh E, Charoenprasert S, Mitchell AE. Effect of organic and conventional cropping systems on ascorbic acid, vitamin C, flavonoids, nitrate, and oxalate in 27 varieties of spinach (Spinacia oleracea L.). J Agric Food Chem. 2012;60(12):3144–50. https://doi.org/10.1021/jf300051f.
Food F: Agriculture Organization of the United Nations Statistical Databases. 2016. Diponível em: http//faostat fao org/site/339 default aspx Acesso em 2018, 25.
van Treuren R, Hoekstra R, van Hintum TJ. Inventory and prioritization for the conservation of crop wild relatives in the Netherlands under climate change. Biol Conserv. 2017;216:123–39. https://doi.org/10.1016/j.biocon.2017.10.003.
van Treuren R, de Groot L, Hisoriev H, Khassanov F, Farzaliyev V, Melyan G, et al. Acquisition and regeneration of Spinacia turkestanica Iljin and S. tetrandra Steven ex M. Bieb. To improve a spinach gene bank collection. Genet Resour Crop Evol. 2020;67(3):549–59. https://doi.org/10.1007/s10722-019-00792-8.
Cho LH, Yoon J, An G. The control of flowering time by environmental factors. Plant J. 2017;90(4):708–19. https://doi.org/10.1111/tpj.13461.
Chen C, Huang W, Hou K, Wu W. Bolting, an important process in plant development, two types in plants. J Plant Biol. 2019;62(3):161–9. https://doi.org/10.1007/s12374-018-0408-9.
Arif M, Jatoi SA, Rafique T, Ghafoor A. Genetic divergence in indigenous spinach genetic resources for agronomic performance and implication of multivariate analyses for future selection criteria. J Sci Technol Dev. 2013;32(1):7–15.
Chitwood J, Shi A, Mou B, Evans M, Clark J, Motes D, et al. Population structure and association analysis of bolting, plant height, and leaf erectness in spinach. HortScience. 2016;51(5):481–6. https://doi.org/10.21273/HORTSCI.51.5.481.
Ma J, Shi A, Mou B, Evans M, Clark JR, Motes D, et al. Association mapping of leaf traits in spinach (Spinacia oleracea L.). Plant Breed. 2016;135(3):399–404. https://doi.org/10.1111/pbr.12369.
Piñeiro M, Gómez-Mena C, Schaffer R, Martínez-Zapater JM, Coupland G. EARLY BOLTING IN SHORT DAYS is related to chromatin remodeling factors and regulates flowering in Arabidopsis by repressing FT. Plant Cell. 2003;15(7):1552–62. https://doi.org/10.1105/tpc.012153.
Lee Y-S, An G. Regulation of flowering time in rice. J Plant Biol. 2015;58(6):353–60. https://doi.org/10.1007/s12374-015-0425-x.
Alter P, Bircheneder S, Zhou L-Z, Schlüter U, Gahrtz M, Sonnewald U, et al. Flowering time-regulated genes in maize include the transcription factor ZmMADS1. Plant Physiol. 2016;172(1):389–404. https://doi.org/10.1104/pp.16.00285.
Leijten W, Koes R, Roobeek I, Frugis G. Translating flowering time from Arabidopsis thaliana to Brassicaceae and Asteraceae crop species. Plants. 2018;7(4):111. https://doi.org/10.3390/plants7040111.
de Dios EA, Delaye L, Simpson J. Transcriptome analysis of bolting in A. tequilana reveals roles for florigen, MADS, fructans and gibberellins. BMC Genomics. 2019;20(1):473.
Nie S, Li C, Xu L, Wang Y, Huang D, Muleke EM, et al. De novo transcriptome analysis in radish (Raphanus sativus L.) and identification of critical genes involved in bolting and flowering. BMC Genomics. 2016;17(1):389.
Ou CG, Mao JH, Liu LJ, Li CJ, Ren HF, Zhao ZW, et al. Characterising genes associated with flowering time in carrot (Daucus carota L.) using transcriptome analysis. Plant Biol. 2017;19(2):286–97. https://doi.org/10.1111/plb.12519.
Han Y, Chen Z, Lv S, Ning K, Ji X, Liu X, et al. MADS-box genes and gibberellins regulate bolting in Lettuce (Lactuca sativa L.). Front Plant Sci. 2016;7:1889.
Ito S, Song YH, Josephson-Day AR, Miller RJ, Breton G, Olmstead RG, et al. FLOWERING BHLH transcriptional activators control expression of the photoperiodic flowering regulator CONSTANS in Arabidopsis. Proc Natl Acad Sci. 2012;109(9):3582–7. https://doi.org/10.1073/pnas.1118876109.
Wang Y, Liu W, Xu L, Wang Y, Chen Y, Luo X, et al. Development of SNP markers based on transcriptome sequences and their application in germplasm identification in radish (Raphanus sativus L.). Mol Breed. 2017;37(3):26.
Searle I, He Y, Turck F, Vincent C, Fornara F, Kröber S, et al. The transcription factor FLC confers a flowering response to vernalization by repressing meristem competence and systemic signaling in Arabidopsis. Genes Dev. 2006;20(7):898–912. https://doi.org/10.1101/gad.373506.
Deng W, Ying H, Helliwell CA, Taylor JM, Peacock WJ, Dennis ES. FLOWERING LOCUS C (FLC) regulates development pathways throughout the life cycle of Arabidopsis. Proc Natl Acad Sci. 2011;108(16):6680–5. https://doi.org/10.1073/pnas.1103175108.
Preston JC, Hileman L. Functional evolution in the plant SQUAMOSA-PROMOTER BINDING PROTEIN-LIKE (SPL) gene family. Front Plant Sci. 2013;4:80.
Winter CM, Yamaguchi N, Wu MF, Wagner D. Transcriptional programs regulated by both LEAFY and APETALA1 at the time of flower formation. Physiol Plant. 2015;155(1):55–73. https://doi.org/10.1111/ppl.12357.
Zhang Z, Wang P, Li Y, Ma L, Li L, Yang R, et al. Global transcriptome analysis and identification of the flowering regulatory genes expressed in leaves of Lagerstroemia indica. DNA Cell Biol. 2014;33(10):680–8. https://doi.org/10.1089/dna.2014.2469.
Ness RW, Siol M, Barrett SC. De novo sequence assembly and characterization of the floral transcriptome in cross-and self-fertilizing plants. BMC Genomics. 2011;12(1):298. https://doi.org/10.1186/1471-2164-12-298.
Zhang Z, Wang P, Li Y, Ma L, Li L, Yang R, et al. Global Transcriptome analysis and identification of the flowering regulatory genes expressed in leaves of Lagerstroemia indica. DNA Cell Biol. 2014;33(10):680–8. https://doi.org/10.1089/dna.2014.2469.
Xu C, Jiao C, Sun H, Cai X, Wang X, Ge C, et al. Draft genome of spinach and transcriptome diversity of 120 Spinacia accessions. Nat Commun. 2017;8(1):1–10.
Collins K, Zhao K, Jiao C, Xu C, Cai X, Wang X, et al. SpinachBase: a central portal for spinach genomics. Database. 2019;2019. https://doi.org/10.1093/database/baz072.
Abolghasemi R, Haghighi M, Etemadi N, Soorni A, Jafari P. Screening of some native and foreign accessions of spinach for spring culture in Isfahan. Iran Agric Res. 2019;38(1):87–99.
Yu D, Hu Y, Wang H, Pan J, Li Y, Lou D. The DELLA-CONSTANS transcription factor cascade integrates gibberelic acid and photoperiod signaling to regulate flowering. Plant Physiol. 2016.
Berg IA, Kockelkorn D, Ramos-Vera WH, Say RF, Zarzycki J, Hügler M, et al. Autotrophic carbon fixation in archaea. Nat Rev Microbiol. 2010;8(6):447–60. https://doi.org/10.1038/nrmicro2365.
Lv G-Y, Guo X-G, Xie L-P, Xie C-G, Zhang X-H, Yang Y, et al. Molecular characterization, gene evolution, and expression analysis of the fructose-1, 6-bisphosphate aldolase (FBA) gene family in wheat (Triticum aestivum L.). Front Plant Sci. 2017;8:1030.
Furbank RT, Taylor WC. Regulation of photosynthesis in C3 and C4 plants: a molecular approach. Plant Cell. 1995;7(7):797–807. https://doi.org/10.2307/3870037.
Raines CA. The Calvin cycle revisited. Photosynth Res. 2003;75(1):1–10. https://doi.org/10.1023/A:1022421515027.
Uematsu K, Suzuki N, Iwamae T, Inui M, Yukawa H. Increased fructose 1, 6-bisphosphate aldolase in plastids enhances growth and photosynthesis of tobacco plants. J Exp Bot. 2012;63(8):3001–9. https://doi.org/10.1093/jxb/ers004.
van Dijken AJ, Schluepmann H, Smeekens SC. Arabidopsis trehalose-6-phosphate synthase 1 is essential for normal vegetative growth and transition to flowering. Plant Physiol. 2004;135(2):969–77. https://doi.org/10.1104/pp.104.039743.
Wahl V, Ponnu J, Schlereth A, Arrivault S, Langenecker T, Franke A, et al. Regulation of flowering by trehalose-6-phosphate signaling in Arabidopsis thaliana. Science. 2013;339(6120):704–7. https://doi.org/10.1126/science.1230406.
Dai Y, Zhang S, Sun X, Li G, Yuan L, Li F, et al. Comparative Transcriptome analysis of gene expression and regulatory characteristics associated with different Vernalization periods in Brassica rapa. Genes. 2020;11(4):392. https://doi.org/10.3390/genes11040392.
Wang X, Fan S, Song M, Pang C, Wei H, Yu J, et al. Upland cotton gene GhFPF1 confers promotion of flowering time and shade-avoidance responses in Arabidopsis thaliana. PLoS One. 2014;9(3):e91869. https://doi.org/10.1371/journal.pone.0091869.
Sawa M, Nusinow DA, Kay SA, Imaizumi T. FKF1 and GIGANTEA complex formation is required for day-length measurement in Arabidopsis. Science. 2007;318(5848):261–5. https://doi.org/10.1126/science.1146994.
Dalchau N, Baek SJ, Briggs HM, Robertson FC, Dodd AN, Gardner MJ, et al. The circadian oscillator gene GIGANTEA mediates a long-term response of the Arabidopsis thaliana circadian clock to sucrose. Proc Natl Acad Sci. 2011;108(12):5104–9. https://doi.org/10.1073/pnas.1015452108.
Brandoli C, Petri C, Egea-Cortines M, Weiss J. Gigantea: uncovering new functions in flower development. Genes. 2020;11(10):1142. https://doi.org/10.3390/genes11101142.
Zhao XY, Liu MS, Li JR, Guan CM, Zhang XS. The wheat TaGI1, involved in photoperiodic flowering, encodesan Arabidopsis GI ortholog. Plant Mol Biol. 2005;58(1):53–64. https://doi.org/10.1007/s11103-005-4162-2.
Zakhrabekova S, Gough SP, Braumann I, Müller AH, Lundqvist J, Ahmann K, et al. Induced mutations in circadian clock regulator mat-a facilitated short-season adaptation and range extension in cultivated barley. Proc Natl Acad Sci. 2012;109(11):4326–31. https://doi.org/10.1073/pnas.1113009109.
Yu J-W, Rubio V, Lee N-Y, Bai S, Lee S-Y, Kim S-S, et al. COP1 and ELF3 control circadian function and photoperiodic flowering by regulating GI stability. Mol Cell. 2008;32(5):617–30. https://doi.org/10.1016/j.molcel.2008.09.026.
Nelson DC, Lasswell J, Rogg LE, Cohen MA, Bartel B. FKF1, a clock-controlled gene that regulates the transition to flowering in Arabidopsis. Cell. 2000;101(3):331–40. https://doi.org/10.1016/S0092-8674(00)80842-9.
Imaizumi T, Tran HG, Swartz TE, Briggs WR, Kay SA. FKF1 is essential for photoperiodic-specific light signalling in Arabidopsis. Nature. 2003;426(6964):302–6. https://doi.org/10.1038/nature02090.
Kim S-G, Kim S-Y, Park C-M. A membrane-associated NAC transcription factor regulates salt-responsive flowering via FLOWERING LOCUS T in Arabidopsis. Planta. 2007;226(3):647–54. https://doi.org/10.1007/s00425-007-0513-3.
Reeves PH, Ellis CM, Ploense SE, Wu M-F, Yadav V, Tholl D, et al. A regulatory network for coordinated flower maturation. PLoS Genet. 2012;8(2):e1002506. https://doi.org/10.1371/journal.pgen.1002506.
Ning Y-Q, Ma Z-Y, Huang H-W, Mo H, Zhao T-T, Li L, et al. Two novel NAC transcription factors regulate gene expression and flowering time by associating with the histone demethylase JMJ14. Nucleic Acids Res. 2015;43(3):1469–84. https://doi.org/10.1093/nar/gku1382.
Kim J, Kim D-S, Park S, Lee H-E, Ahn Y-K, Kim JH, et al. Development of a high-throughput SNP marker set by transcriptome sequencing to accelerate genetic background selection in Brassica rapa. Hortic Environ Biotechnol. 2016;57(3):280–90. https://doi.org/10.1007/s13580-016-1036-2.
Richter R, Bastakis E, Schwechheimer C. Cross-repressive interactions between SOC1 and the GATAs GNC and GNL/CGA1 in the control of greening, cold tolerance, and flowering time in Arabidopsis. Plant Physiol. 2013;162(4):1992–2004. https://doi.org/10.1104/pp.113.219238.
Zhu L, Liu D, Li Y, Li N. Functional phosphoproteomic analysis reveals that a serine-62-phosphorylated isoform of ethylene response factor110 is involved in Arabidopsis bolting. Plant Physiol. 2013;161(2):904–17. https://doi.org/10.1104/pp.112.204487.
Jiang W, Zhang X, Song X, Yang J, Pang Y. Genome-wide identification and characterization of APETALA2/ethylene-responsive element Binding factor superfamily genes in soybean seed development. Front Plant Sci. 2020;11:1348.
Zhu Q-H, Helliwell CA. Regulation of flowering time and floral patterning by miR172. J Exp Bot. 2011;62(2):487–95. https://doi.org/10.1093/jxb/erq295.
Ning K, Han Y, Chen Z, Luo C, Wang S, Zhang W, et al. Genome-wide analysis of MADS-box family genes during flower development in lettuce. Plant Cell Environ. 2019;42(6):1868–81. https://doi.org/10.1111/pce.13523.
Schilling S, Pan S, Kennedy A, Melzer R. MADS-box genes and crop domestication: the jack of all traits. UK: Oxford University Press; 2018.
Yoo SK, Wu X, Lee JS, Ahn JH. AGAMOUS-LIKE 6 is a floral promoter that negatively regulates the FLC/MAF clade genes and positively regulates FT in Arabidopsis. Plant J. 2011;65(1):62–76. https://doi.org/10.1111/j.1365-313X.2010.04402.x.
Manivannan A, Kim J-H, Yang E-Y, Ahn Y-K, Lee E-S, Choi S, et al. Next-generation sequencing approaches in genome-wide discovery of single nucleotide polymorphism markers associated with pungency and disease resistance in pepper. Biomed Res Int. 2018;2018:1–7. https://doi.org/10.1155/2018/5646213.
Kim J, Manivannan A, Kim D-S, Lee E-S, Lee H-E. Transcriptome sequencing assisted discovery and computational analysis of novel SNPs associated with flowering in Raphanus sativus in-bred lines for marker-assisted backcross breeding. Horticulture Res. 2019;6(1):1–12.
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21. https://doi.org/10.1093/bioinformatics/bts635.
Pertea M, Pertea GM, Antonescu CM, Chang T-C, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290–5. https://doi.org/10.1038/nbt.3122.
Jiménez-Jacinto V, Sanchez-Flores A, Vega-Alvarado L. Integrative differential expression analysis for multiple experiments (IDEAMEX): a web server tool for integrated rna-seq data analysis. Front Genet. 2019;10:279. https://doi.org/10.3389/fgene.2019.00279.
Anders S, Huber W. Differential expression analysis for sequence count data. Nat Precedings. 2010:1–1.
Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15(12):550. https://doi.org/10.1186/s13059-014-0550-8.
Tarazona S, García-Alcalde F, Dopazo J, Ferrer A, Conesa A. Differential expression in RNA-seq: a matter of depth. Genome Res. 2011;21(12):2213–23. https://doi.org/10.1101/gr.124321.111.
Bindea G, Mlecnik B, Hackl H, Charoentong P, Tosolini M, Kirilovsky A, et al. ClueGO: a Cytoscape plug-in to decipher functionally grouped gene ontology and pathway annotation networks. Bioinformatics. 2009;25(8):1091–3. https://doi.org/10.1093/bioinformatics/btp101.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13(11):2498–504. https://doi.org/10.1101/gr.1239303.
Rao X, Huang X, Zhou Z, Lin X. An improvement of the 2ˆ (−delta delta CT) method for quantitative real-time polymerase chain reaction data analysis. Biostatistics Bioinformatics Biomathematics. 2013;3(3):71.
Brouard J-S, Schenkel F, Marete A, Bissonnette N. The GATK joint genotyping workflow is appropriate for calling variants in RNA-seq experiments. J Anim Sci Biotechno. 2019;10(1):44. https://doi.org/10.1186/s40104-019-0359-0.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9. https://doi.org/10.1093/bioinformatics/btp352.
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27(15):2156–8. https://doi.org/10.1093/bioinformatics/btr330.
Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6(2):80–92. https://doi.org/10.4161/fly.19695.
The authors gratefully acknowledge the Sheikh Bahaei National High Performance Computing Center (SBNHPCC) for providing computing facilities and time. SBNHPCC is supported by scientific and technological department of presidential office and Isfahan University of Technology (IUT).
This research was funded by the Iran National Science Foundation (INSF, No. 97014528) for the financial support of part of this work.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Summary of transcriptome sequencing results from 12 spinach samples. Table S2 Summary of STAR alignment rates from 12 spinach samples. KV, KR, VV, and VR represent Kashan-vegetative, Kashan-reproductive, Viroflay-vegetative, and Viroflay-reproductive, respectively. Fig. S1. Distribution of DEGs. (A) The discovered DEGs in the vegetative stages comparison (B) The discovered DEGs in the reproductive stages comparison. The X-axis represents the location of DEGs on chromosomes. Y-axis is the log2 fold change for each. Fig. S2. Distribution of DEGs across six chromosomes. (A) The discovered DEGs from the stages comparison of accession Kashan (B) The discovered DEGs from the stages comparison of accession Viroflay. The X-axis represents the location of DEGs on chromosomes. Y-axis is the log2 fold change for each. Fig. S3. Gene classification of identified DEGs. The histogram shows the classification of DEGs under the molecular function category for all pairwise comparisons. Fig. S4. Gene classification of identified DEGs. The histogram shows the classification of DEGs under the cellular component category for all pairwise comparisons. Fig. S5. Gene expression correlation between RT-qPCR and RNA-seq data (Log2 values of the fold change). The Pearson correlation coefficients and linear regression line are indicated. Fig. S6. The summary statistics of variant effects by the type and region.
About this article
Cite this article
Abolghasemi, R., Haghighi, M., Etemadi, N. et al. Transcriptome architecture reveals genetic networks of bolting regulation in spinach. BMC Plant Biol 21, 179 (2021). https://doi.org/10.1186/s12870-021-02956-0