- Research article
- Open Access
Transcriptomic profiling of wheat near-isogenic lines reveals candidate genes on chromosome 3A for pre-harvest sprouting resistance
BMC Plant Biology volume 21, Article number: 53 (2021)
Pre-harvest sprouting (PHS) in wheat can cause severe damage to both grain yield and quality. Resistance to PHS is a quantitative trait controlled by many genes located across all 21 wheat chromosomes. The study targeted a large-effect quantitative trait locus (QTL) QPhs.ccsu-3A.1 for PHS resistance using several sets previously developed near-isogenic lines (NILs). Two pairs of NILs with highly significant phenotypic differences between the isolines were examined by RNA sequencing for their transcriptomic profiles on developing seeds at 15, 25 and 35 days after pollination (DAP) to identify candidate genes underlying the QTL and elucidate gene effects on PHS resistance. At each DAP, differentially expressed genes (DEGs) between the isolines were investigated.
Gene ontology and KEGG pathway enrichment analyses of key DEGs suggested that six candidate genes underlie QPhs.ccsu-3A.1 responsible for PHS resistance in wheat. Candidate gene expression was further validated by quantitative RT-PCR. Within the targeted QTL interval, 16 genetic variants including five single nucleotide polymorphisms (SNPs) and 11 indels showed consistent polymorphism between resistant and susceptible isolines.
The targeted QTL is confirmed to harbor core genes related to hormone signaling pathways that can be exploited as a key genomic region for marker-assisted selection. The candidate genes and SNP/indel markers detected in this study are valuable resources for understanding the mechanism of PHS resistance and for marker-assisted breeding of the trait in wheat.
Wheat (Triticum aestivum L.) is a major cereal crop worldwide. Pre-harvest sprouting (PHS) can severely affect to yield and its nutritional and processing qualities, resulting in more than US$ 1 billion of annual losses worldwide [1, 2]. Therefore, PHS resistance is an important trait for genetic studies and breeding in wheat [3, 4].
Seed dormancy and germination, the two major processes concerning PHS, are regulated by numerous environmental and molecular factors; of which, endogenous hormone balance, especially between abscisic acid (ABA) and gibberellic acid (GA), plays a crucial role [5, 6]. In cereal grains, ABA is involved in dormancy development and inhibition of hydrolase synthesis in mature seeds , whereas GA promotes the metabolism of seed reserves and induces hydrolase synthesis for seed germination . Apart from phytohormone transduction genes, many transcription factors (TFs) are involved in PHS regulation, such as those of the B3 domain, AP2 domain, and bZIP factor classes encoded by ABA-insensitive (ABI) genes ABI3, ABI4, and ABI5, respectively [9, 10], TFIIS Transcription Elongation Factor II encoded by Reduced Dormancy 2 (RDO2) , and phytochrome interacting factors (PIFs) .
Resistance to PHS is controlled by quantitative trait loci (QTL) [13,14,15] that are located on all 21 chromosomes in bread wheat; of which, QTL on chromosome groups 3 and 4 consistently explain large phenotypic variation [16,17,18,19]. Several candidate genes have been identified for a major 4AL locus responsible for PHS resistance, including two seed dormancy genes PM19-A1 and A2 by transcriptomic analyses , and a causal seed dormancy gene MKK3 located next to PM19 by a comparative genomics method . Wang et al.  identified five candidate genes for a major 4BL QTL using genotyping and phenotyping characterization of multiple pairs of near isogenic lines (NILs). For group 3 chromosomes, a major locus on 3AS, explaining 23–38% of the phenotypic variation, was identified using a cross-derived RIL population with red-grained parents . Later, Liu et al.  cloned a gene (TaPHS1) from the 3AS QTL Qphs.pseru-3AS. Other known genes on group 3 chromosomes include viviparous (Vp-1) or ABI3  on the long arms of the chromosomes, which act as a regulator of late embryo development in wheat. Kulwal et al.  reported a major PHS resistance QTL on 3AL from RILs of SPR8198 (PHS resistant) / HD2329 (PHS susceptible). QPhs.ccsu-3A.1 explained up to 78.03% of the phenotypic variation across six tested environments, and was located at a genetic distance of ~ 183 cM from the centromere within the marker interval of Xwmc153 and Xgwm155 . This major QTL on chromosome arm 3AL has not been cloned and characterized (Fig. 1).
In our previous study, we developed several sets of resistant and susceptible NILs targeting the major QTL QPhs.ccsu-3A.1 . Near isogenic lines (NILs) are pairs of lines that have the same genetic background except for the targeted locus, and NILs with contrasting trait performance can turn quantitative traits into Mendelian factors, which makes them ideal genetic resources for identifying candidate genes and closely linked markers underlying a targeting QTL [22, 27]. RNA sequencing (RNA-seq) is a powerful approach for detecting differentially expressed genes (DEGs) and novel expressed genes, and is widely used in transcriptomic studies [28,29,30]. The expression trends of all genes from the transcriptomic analysis will be valuable data for in-depth studies of gene function and their interaction networks in complex biological processes . Transcriptomic profiling of contrasting genotypes can reveal associated signaling pathways for molecular responses resulting in biochemical and morphological changes under stresses [29, 30]. Furthermore, RNA-seq on NILs can accurately detect DEGs and QTL-linked single nucleotide polymorphisms (SNPs) within a QTL region, therefore it has been used to identify candidate genes and markers in many crops [20, 32].
In this study, we used two pairs of NILs with highly significant differences in PHS performance between the isolines to investigate their transcriptomic profiles on developing seeds at 15, 25 and 35 days after pollination (DAP). The parental lines Chara and DM5637B*8 used to develop the NILs were white-coloured cultivars, which eliminated the possibility of correlations between PHS resistance and red-grain genes. The study aimed to: 1) analyze DEGs between the NILs at different seed development stages to provide an insight into PHS resistance, 2) validate the candidate genes through expression analysis at different seed developmental stages, and 3) detect SNPs and indels that can distinguish the resistant and susceptible isolines within the QTL interval for use in marker-assisted breeding of PHS resistance in wheat.
Transcriptome assembly quality and mapping statistics
A total of 304 Gb high-quality 150-bp paired-end sequencing reads were generated from the 36 samples after quality control, with an average of 56 million clean reads for each library. Nearly 98 and 96% of the clean reads had a quality score of Q20 and Q30, respectively. Approximately 70% of the sequenced reads were mapped to the wheat reference genome, including 55% with a unique match. The total number of transcripts detected in each library ranged from 72,485 to 96,979, accounting for nearly 60% of all wheat genes. Pearson’s correlation coefficients among the three biological replicates for each combination ranged from 0.84 to 0.99, indicating the consistency of the three replicates.
Differential gene expression related to PHS resistance
Differential gene expressions in the contrasting isolines are summarized in Table 1 and Fig. 2. At 15 DPA, a total of 1195 DEGs between the resistant (‘R’) and susceptible (‘S’) isolines were commonly detected in the two NIL pairs. Similar numbers (1298) of DEGs between the isolines were detected at 35 DPA. However, fewer DEGs (776) were detected at 25 DPA in both of the NIL pairs. To identify the genes underlying the QTL QPhs.ccsu-3A.1, particular attention was given to the common DEGs located on chromosome arm 3AL in both of the NIL pairs. There were 12, 12 and 25 such genes identified at 15, 25, and 35 DPA, respectively. Among them, genes TraesCS3A01G462000 and TraesCS3A01461400 were consistently upregulated (gene expression in ‘R’ isoline was significantly higher than that in ‘S’ isoline, i.e. ‘R>S’), and gene TraesCS3A01G466700 was consistently downregulated (gene expression in ‘R’ isoline was significantly lower than that in ‘S’ isoline, i.e. ‘R<S’) across the three time-points. The three genes were located within the targeted QTL marker interval between Xwmc153 and Xgwm155, were therefore considered major candidate genes underlying QPhs.ccsu-3A.1. Notably, at all three time-points, TraesCS3A01G461400 and TraesCS3A01G466700 showed extremely high upregulation and downregulation, with a mean log2 ratio fold change of 5.70 and 5.19, respectively (Table S1).
Gene expressions significantly different between time-points in each isoline, including DEGs between 25 DPA and 15 DPA (25/15), DEGs between 35 DPA and 25 DPA (35/25), and DEGs between 35 DPA and 15 DPA (35/15), were investigated, especially for those within the 3AL QTL interval. A special focus was put on DEGs that showed noteworthy features, including those associated with hormone transduction and PHS-regulatory TFs (such as bZIP TFs, and B3 or AP2 domain-containing TFs), those associated with the identified SNP and indel variants, and those displaying significant differences between isolines at different time-points. Interestingly, TraesCS3A01G461400 showed down-regulation in the ‘R’ isolines at 35/15, but it had significantly higher expression in the ‘R’ isolines than the ‘S’ isolines of both NIL pairs at 35 DPA. Other genes that shared the same up- or down- regulations in either ‘R’ or ‘S’ isolines at 35/15 included TraesCS3A01G459200 (down-regulated in ‘R’ isolines, and ‘R>S’ at 15 and 25 DPA), TraesCS3A01G470400 (down-regulated in ‘S’ isolines, and ‘R<S’ at 15 and 25 DPA), TraesCS3A01G416200 (up-regulated in ‘R’ isolines, and ‘R>S’ at 35 DPA), and TraesCS3A01G346700 (up-regulated in ‘S’ isolines, and ‘R<S’ at 35 DPA) (Table S1).
Functional annotation of DEGs
Based on GO descriptions, DEGs were functionally categorized into three principal categories: cellular component, molecular function and biological process (Fig. 3).
Cell, cell part, organelle, membrane, and membrane part were the most common terms in the cellular component category. Catalytic activity, binding, and transporter activity were the most abundant terms in the molecular function category at all three time-points. Most of the genes associated with the GO terms in the biological process category were in the subcategories of metabolic process, cellular process, and single organism process. Notably, all three GO categories had similar numbers of upregulated and downregulated genes in each of these categories at 15 and 25 DPA. However, at 35 DPA, both NIL pairs had considerably more upregulated genes than downregulated genes.
Pathway enrichment analysis was performed to investigate related biological pathways that differed between isolines (Fig. 4, Table 2). DEGs across time-points in both NIL pairs were assigned to different pathways belonging to five major categories - cellular processes, environmental information processing, genetic information processing, metabolism and organismal systems. Among them, metabolism was the most enriched pathway in the DEGs, with more downregulated genes than upregulated genes to varying degrees across the three time-points.
Transcription factors (TFs) play a vital role as molecular switches controlling the expression of certain genes and in turn regulating plant growth and development under certain environmental conditions. Extensive database searches of all the DEGs at all the time-points in all the isolines predicted 6050 differentially expressed TFs which were grouped into 59 families (Fig. 5). The MYB and MYB-related TFs had the most genes (742 and 586 genes respectively), followed by NAC (425) and bHLH (410). However, none of the four extensively expressed TFs showed consistent DEG patterns in the NIL pairs at different time-points.
DEGs between the isolines that were common in both NIL pairs at each DAP were scrutinized; those with known functions related to PHS regulation pathways, such as plant hormone signal transduction and MAPK signaling were considered potential candidate genes. Based on this, three other genes TraesCS3A01G459200, TraesCS3A01G245000 and TraesCS3A01G225100 located on chromosome arm 3AL were identified as candidate genes (Table 2).
SNP and indel markers polymorphic between the ‘R’ and ‘S’ isolines
The SNPs and indels showing consistent distinguishable genotypes between the isolines in both NIL pairs were detected. Five SNPs and 11 indels were located within or very close to the targeted 3AL QTL interval. Among them, six variants (three SNPs and three indels) occurred within their associated genes, with five falling in the gene exons and one in the untranslated region (UTR). Although other variants did not overlap any genes annotated in the reference genome, they showed short distances to their closest genes, with marker-gene distances ranging from 49 to 73,788 bp (Table 3).
Twelve genes were associated with the SNPs and indels (Table 3). Of these, eight genes showed different expressions between either isolines or time-points. Apart from TraesCS3A01G449300 functioning as an auxin response factor, no other gene was related to the hormone signaling pathway. For TraesCS3A01G449300, no expression difference was detected between ‘R’ and ‘S’ isolines in either of the NIL pairs (Table S1).
qRT-PCR validation of candidate genes
To confirm the results of the RNA-seq, the six candidate genes were selected for qRT-PCR assays. Relative expressions of TraesCS3A01G461400, TraesCS3A01G462000 and TraesCS3A01G466700 differed significantly between the ‘R’ and ‘S’ isolines at all time-points, while that of TraesCS3A01G245000 differed significantly at 25 DPA and 35 DPA, and TraesCS3A01G225100 differed significantly at 15 DPA. Notably, at all time-points, the relative expression level of TraesCS3A01G461400 differed about two-fold between the isolines. All six genes showed consistent expression patterns with those obtained from the RNA-seq analysis (Table 4). This is a strong indication of the reliability of the RNA-seq conducted in this study.
Candidate genes underlying the major 3AL QTL responsible for PHS resistance
Six candidate genes underlying QPhs.ccsu-3A.1 responsible for PHS resistance were identified in this study, based on their RNA-seq DEG profiles and qRT-PCR validations. Among them, TraesCS3A01G461400 with a forkhead TF function is the most prominent candidate with highly significant differences in expression between ‘R’ and ‘S’ isolines in both its RNA-seq DEG profile and qRT-PCR expression analysis. Forkhead TFs are a family containing a DNA-binding domain known as the forkhead box (FOX). FOX is evolutionarily conserved in eukaryotic organisms and a crucial regulator of embryonic development, which is affected by hormone signaling [33, 34]. In contrast to the highly conserved FOX domain, forkhead TF proteins are highly divergent in other parts of their sequences . In humans, forkhead TFs modulate signaling pathways , and can be a direct target of hormonal medications such as progestin to inhibit epithelial cell growth . In insects, forkhead TFs regulate hormone-mediated signaling, affecting carbohydrate, amino acid and fatty acid metabolism, and the phosphatidylinositol 3-kinase/protein kinase B signaling pathway . In plants, forkhead-associated domains mediate interactions with receptor-like kinases, which in turn regulate signaling pathways involved in growth and pathogen responses; two well-studied genes, KAPP (encodes a kinase-associated protein phosphatase that functions in the internalization of somatic embryogenesis receptor kinase 1) and ABA1 (encodes a zeaxanthin epoxidase that functions in the ABA biosynthesis pathway), are among the 15 identified Arabidopsis genes containing forkhead-associated domains [39, 40]. Furthermore, TraesCS3A01G461400 is involved in purine metabolism pathway which can play a role in activation of ABA metabolism [41, 42].
TraesCS3A01G462000 encodes a B3 domain-containing TF; its family controls embryo development and seed maturation by modulating ABA and GA metabolism . B3 TFs are considered specific to photosynthetic eukaryotes . Vp-1 is the first plant gene identified in maize that encodes a B3 type TF, which is a key component of the ABA signaling pathway during seed maturation in other cereals. Vp-1 and its orthologous genes are associated with the activation of genes encoding seed storage proteins, late embryogenesis abundant enzymes, and anthocyanin biosynthesis enzymes, as well as the repression of post-germination genes for reserve mobilization, e.g. α-amylases and protease .
TraesCS3A01G461400 and TraesCS3A01G462000 are both TF genes. TFs play important roles in plant growth, development, and responses to environmental stress . The interaction of sequence-specific TFs with target sites near their regulated genes is a central mechanism of gene expression regulation by which organisms develop and interact with their environment . Both of the TF genes identified in this study are related to hormone signaling pathways, and showed significantly higher expression in ‘R’ than ‘S’ isolines at all time-points, suggesting that the more active regulation of gene transcriptions in the ‘R’ isolines might contribute to its PHS resistance phenotype.
Interestingly, two other candidate genes TraesCS3A01G459200 and TraesCS3A01G245000 function as receptor-like kinases (RLKs). RLKs are surface localized, transmembrane receptors that regulate a variety of signaling pathways [48, 49]; some have interactions with forkhead-associated domains, such as KAPP [40, 50]. Gene TraesCS3A01G459200 is involved in many metabolism pathways including fatty acid biosynthesis, starch and sucrose metabolism, phenylpropanoid biosynthesis, and biosynthesis of secondary metabolites (Table 2), in which phenylpropanoid metabolism has been reported to relate to primary seed dormancy in Arabidopsis . Gene TraesCS3A01G245000 was directly involved in the plant hormone signal transduction pathway by KEGG enrichment analysis. Leucine-rich repeat (LRR) RLK, encoded by TraesCS3A01G459200, plays an important role in ABA signal transduction in Arabidopsis; it is upregulated by ABA and its loss of function results in ABA insensitivity in seed germination . In this study, LRR RLK had significantly higher expression in ‘R’ than ‘S’ isolines at 15 and 25 DPA, but there was no difference at 35 DPA between the isolines as the gene was downregulated in ‘R’ isolines at 35/15 (Table S1), indicating a higher ABA content in ‘R’ isolines at the early stages.
TraesCS3A01G466700 encodes hydroxyethylthiazole kinase which participates in thiamine biosynthesis pathway . Taking part in glycometabolism, thiamine has a fundamental role in energy metabolism and serves as an energy reserve for seed germination . Golda et al.  found that both cereal and legume seeds lost a significant part of their thiamine reserves during germination. Neumann et al.  reported that seeds treated with thiamine significantly increased their germination rate in legume Phasenius vulgaris. The gene showed significantly lower expression in ‘R’ than ‘S’ isolines at all time-points in this study, implying that a lower thiamine reserve exists in ‘R’ isolines, which may not favor germination.
TraesCS3A01G225100 functions in the S-type anion channel activity which is required for ABA-induced gene expression . KEGG enrichment assigned the gene to plant MAPK signaling pathway. The MAPK module directly responds to ABA, or interacts with MKK3; for example, MAP3K16 is the negative regulator of ABA response (ABR1), and MAP3K17/18-MKK3-MPK1/2/7/14 responds to ABA, senescence and dormancy in Arabidopsis [57, 58]. MKK3 contains an NTF2 domain and its primary gene structure is highly conserved during evolution . Torada et al.  reported that MKK3 was the causal gene underlying the major 4AL QTL responsible for seed dormancy in wheat. In this study, TraesCS3A01G225100showed significant higher expression in ‘R’ than ‘S’ isolines at 15 DPA only, indicating that the gene mainly plays a regulatory role in response to ABA at an early stage of seed development.
SNP and indel markers distinguishable between the ‘R’ and ‘S’ isolines
Ten of the 16 SNP or indel variants between the ‘R’ and ‘S’ isolines did not overlap any annotated genes in RefV1.0. However, as these variants were identified based on the RNA-seq profiles, they should have been within transcribed genes of the tested cultivars/lines. This response may be due to: 1) large structural variations between the genomes of the reference cultivar Chinese Spring (CS) and the tested cultivars/lines; or 2) CS does not contain certain genes that exist in other cultivars, for example, Ppd-B1 and Vrn-A1 alleles were not present in CS .
Gene TraesCS3A01G449300, associated with one of the indel markers, functions as an auxin response factor (ARF). Auxin recruits ARFs to control seed dormancy in Arabidopsis through stimulation of ABA signaling by inducing ARF-mediated ABI3 activation . Auxin is involved in the transition from seed dormancy to germination, which promotes seed dormancy and inhibits seed germination . Although its gene expression did not differ between the contrasting isolines, its location within the targeted major QTL suggestes it is involved in the regulation pathway by interacting with other signal transduction genes sitting in that particular genomic region.
Within the QTL interval, there is another noteworthy genomic region from 622,277,558 to 622,901,141 bp, where a block of consecutive genes exist that are related to basic leucine zipper (bZIP) TFs and ABI5s. BZIP TFs are activated by ABA-mediated signalosome and bind to specific cis-acting sequences called abscisic-acid-responsive elements (ABREs) or GC-rich coupling elements, thereby influencing the expression of their target downstream genes . ABIs are involved in ABA signaling, some of which are TFs; ABI5 is one of the six classes of such TFs that have been identified, and is essential for ABA- or seed-specific gene expression . Transcriptional repressions of ABI5 are associated with reduced seed sensitivity to ABA which results in the switch from dormancy to germination in wheat seed . No consistent expression difference between the contrasting isolines was found in these genes at the three time-points investigated in this study (Table S1). However, as the genes function in the core ABA signaling and appear in a cluster within the targeted QTL interval, it implies the possibility that multiple genes and gene-interactions could underlie the QTL responsible for PHS resistance.
The targeted QTL QPhs.ccsu-3A.1, explaining up to 78.03% of the phenotypic variation , could be exploited as a key locus for marker-assisted selection. Many genes in the QTL region, including the identified candidate genes, SNP/indel marker associated genes and physically clustered genes, are involved in hormone perception and signal transduction, which further demonstrates the significance of the locus in the regulation and control of PHS resistance. For future study, allele characterizations can be conducted in other genotypes known to have the QTL, and transgenic approach can be utilized for functional test of the candidate genes.
Transcriptomic profiling of NILs targeting a major 3AL QTL QPhs.ccsu-3A.1 responsible for PHS resistance revealed six candidate genes related to hormone signaling and energy metabolism. Sixteen SNP or indel markers within the QTL interval showed consistent distinguishable alleles between the ‘R’ and ‘S’ isolines contrasting in PHS performance. The targeted QTL was confirmed as a key genomic region for seed dormancy and PHS resistance as it contained many core genes involved in the ABA signaling pathway, some of which showed significant differences in expression between the contrasting isolines. The identified candidate genes and SNP/indel markers in this study are valuable for understanding the mechanism of PHS resistance and for marker-assisted breeding of the trait in wheat.
Plant material and tissue sampling
In a previous study, we generated a set of NILs using the heterogeneous inbred family method targeting the 3AL QTL . Formal identification of the plant materials have been done through genotyping and phenotyping  by all the authors who are experts in wheat. Two pairs of NILs (each pair was derived from the single seed descent of an F2 individual) with significantly contrasting PHS performance between the isolines were used for RNA-seq in this study (Fig. 2). Both NIL pairs were developed from the population of ‘Chara/DM5637B*8’ and named as ‘NIL pair 1R & 1S’ and ‘NIL pair 2R & 2S’ in this study, matching NIL_PHSR3AL_3R & 3S and NIL_PHSR3AL_6R & 6S in the previous study, respectively . ‘R’ indicates isolines carrying the resistant allele, and ‘S’ is for those with the susceptible allele. The seeds of parent ‘Chara’ were obtained from Australian Grains Genebank, Horsham, Victoria, Australia with a deposition number of AUS30031, and the seeds of parent ‘DM5637B*8’ were obtained from InterGrain Pty Ltd., Australia. The seeds of the NILs used in this study are kept at the University of Western Australia Wheat Seed Collection with deposit numbers of UWANILTa-N11 (NIL_PHSR3AL_3R), UWANILTa-N12 (NIL_PHSR3AL_3S), UWANILTa-N17 (NIL_PHSR3AL_6R) and UWANILTa-N18 (NIL_PHSR3AL_6S).
The NILs were grown with three biological replicates for each isoline in the glasshouse of The University of Western Australia in Perth, Western Australia. The plant growth condition and phenotyping methods were the same as described in Wang et al. . The flowering date was recorded for each spike. Five kernels at 15 DPA and 25 DPA and three kernels at 35 DPA from each isoline in each replicate were randomly collected, frozen immediately in liquid nitrogen and stored at − 80 °C for RNA extraction.
RNA extraction, library construction and Illumina sequencing
Total RNA was extracted from 36 samples (4 genotypes × 3 time-points × 3 replicates) using RNeasy Plus Plant Mini Kit (Qiagen) with the treatment of DNase, following the manufacturer’s instructions. The yield and purity of the extracted RNA were assessed by NanoDrop 2000 (Thermo Fisher Scientific Inc., Australia), and the integrity was checked by 1% (w/v) denatured gel electrophoresis and Agilent 2100 Bioanalyzer (Agilent Technologies Inc., USA). The qualified and quantified RNA samples were sequenced at the Beijing Genomics Institute (BGI), China. The BGI protocol for cDNA synthesis, 150 bp paired-end sequencing and raw data filtering were the same as described in Mia et al. . Clean data were generated as FastQ files, and Q20, Q30 and GC contents were calculated. Downstream analyses were performed on these clean data, which are available at the National Centre for Biotechnology Information (NCBI) website with the SRA accession number of PRJNA554312 (https://www.ncbi.nlm.nih.gov/sra/PRJNA554312).
Sequence data analysis and DEG identification
High-quality reads were mapped to the bread wheat reference genome sequence, international wheat genome sequence consortium (IWGSC) RefSeq V1.0 (https://wheat-urgi.versailles.inra.fr/) , using HISAT2 v2.0.4 . Aligning of the reads to the reference sequence was done by Bowtie2 . Gene expression level were calculated using RSEM v1.2.12  with default parameters. DEGs were identified with DEGseq according to Wang et al.  with the parameters as described in Mia et al. . Up- and down-regulations of DEGs between the isolines were based on the comparison of ‘R’ isoline to ‘S’ isoline, i.e., if a gene expression in ‘R’ isoline was higher or lower than that in ‘S’ isoline, it was considered upregulated or downregulated, respectively.
Functional annotations, gene ontology and pathway analyses
Gene ontology (GO) and functional enrichment of the selected DEGs were performed using a hypergeometric test (phyper); those with a false discovery rate (FDR) ≤ 0.01 were considered as significantly enriched. KEGG annotation was the same as described in Mia et al. . To identify TF encoding genes from the DEGs, Getorf tool  was used to find the open reading frame (ORF) of each DEG. The ORFs were then aligned to TF domains from PlnTFDB using hmmsearch  to identify TF encoding genes from the selected DEGs.
Discovery of SNP and indel markers
To find the SNP and indel variants, all clean reads of the transcripts were mapped to the reference genome sequence of IWGSC RefV1.0 (https://wheat-urgi.versailles.inra.fr/) using Bowtie2 . The SAM tools package was used for calling SNP and indel variants. Variants on the 3A chromosome, especially those within the marker interval of Xwmc153 and Xgwm155, were detected.
Validation of DEGs using quantitative RT-PCR (qRT-PCR) analysis
The candidate genes identified in this study were selected to run qRT-PCR to validate the RNA-seq results. The cDNAs were synthesized using SensiFast cDNA Synthesis Kit (Bioline Australia) with the manufacturer’s protocol. The qRT-PCR was performed on an ABI 7500 Fast system using SensiFAST SYBR kit (Bioline Australia). Gene-specific primers were designed using Primer Premier 5.0 software, and the wheat actin gene was used as an endogenous control for normalization between samples. Three biological replicates were used for each isoline of the two pairs of NILs at the three time-points. For qRT-PCR, cDNA from all biological samples was run in triplicate . Amplification was conducted in a 20 μl reaction mix containing 10 μl of 2 × SensiFAST SYBR Lo-ROX mix, 0.8 μl of 10 μM each forward and reverse primer and 100 ng cDNA, with the following cycling protocol: 1 cycle of 95 °C for 2 min, 40 cycles of 95 °C for 5 s and 60 °C for 30 s. Relative gene expression was calculated using the comparative Ct method . A two sample t-test was used to compare the expression differences between the means of ‘R’ and ‘S’ isolines at different DPAs.
Availability of data and materials
The datasets generated and/or analysed during the current study are available in the National Center for Biotechnology Information (NCBI) website with the SRA accession number of PRJNA554312 (https://www.ncbi.nlm.nih.gov/sra/PRJNA554312). The plant materials (seeds) are kept at the University of Western Australia Wheat Seed Collection.
Differentially expressed genes
False discovery rate
Open reading frame
Phytochrome interacting factor
Quantitative trait locus/loci
Single nucleotide polymorphism
Black MBJ, Halmer P. Preharvest sprouting – economic importance. In: Black M, editor. The encyclopaedia of seeds science, technology and uses. Oxfordshire: CABI Publishing; 2006. p. 528.
Biddulph TB, Plummer JA, Setter TL, Mares DJ. Influence of high temperature and terminal moisture stress on dormancy in wheat (Triticum aestivum L.). Field Crops Res. 2007;103(2):139–53.
Singh A, Knox R, Clarke J, Clarke F, Singh A, Depauw R, Cuthbert R. Genetics of pre-harvest sprouting resistance in a cross of Canadian adapted durum wheat genotypes. Mol Breed. 2014;33:919–29.
Chao S, Elias E, Benscher D, Ishikawa G, Huang Y, Saito M, Nakamura T, Xu S, Faris J, Sorrells M. Genetic mapping of major-effect seed dormancy quantitative trait loci on chromosome 2B using recombinant substitution lines in tetraploid wheat. Crop Sci. 2015;55(1):1–14.
Footitt S, Douterelo-Soler I, Clay H, Finch-Savage WE. Dormancy cycling in Arabidopsis seeds is controlled by seasonally distinct hormone-signaling pathways. Proc Natl Acad Sci. 2011;108(50):20236–41.
Nonogaki H, Barrero JM, Li C. Editorial. Seed dormancy, germination, and pre-harvest sprouting. Front Plant Sci. 2018;9:1783.
Li C, Ni P, Francki M, Hunter A, Zhang Y, Schibeci D, Li H, Tarr A, Wang J, Cakir M, Yu J, Bellgard M, Lance R, Appels R. Genes controlling seed dormancy and pre-harvest sprouting in a rice-wheat-barley comparison. Funct Integr Genomics. 2004;4(2):84–93.
Li Z, Gao Y, Zhang Y, Lin C, Gong D, Guan Y, Hu J. Reactive oxygen species and gibberellin acid mutual induction to regulate tobacco seed germination. Front Plant Sci. 2018;9:1279.
Söderman EM, Brocard IM, Lynch TJ, Finkelstein RR. Regulation and function of the Arabidopsis ABA-insensitive4 gene in seed and abscisic acid response signaling networks. Plant Physiol. 2000;124(4):1752–65.
Duarte KE, de Souza WR, Santiago TR, Sampaio BL, Ribeiro AP, Cotta MG, da Cunha BADB, Marraccini PRR, Kobayashi AK, Molinari HBC. Identification and characterization of core abscisic acid (ABA) signaling components and their gene expression profile in response to abiotic stresses in Setaria viridis. Sci Rep. 2019;9(1):4028.
Liu Y, Geyer R, van Zanten M, Carles A, Li Y, Hörold A, van Nocker S, Soppe WJJ. Identification of the Arabidopsis REDUCED DORMANCY 2 gene uncovers a role for the polymerase associated factor 1 complex in seed dormancy. PLoS One. 2011;6(7):e22241.
Penfield S, Josse E-M, Halliday KJ. A role for an alternative splice variant of PIF6 in the control of Arabidopsis primary seed dormancy. Plant Mol Biol. 2010;73(1):89–95.
Ali A, Cao J, Jiang H, Chang C, Zhang H-P, Sheikh SW, Shah L, Ma C. Unraveling molecular and genetic studies of wheat (Triticum aestivum L.) resistance against factors causing pre-harvest sprouting. Agronomy. 2019;9(3):117.
Kumar S, Knox R, Clarke F, Pozniak C, DePauw R, Cuthbert R, Fox S. Maximizing the identification of QTL for pre-harvest sprouting resistance using seed dormancy measures in a white-grained hexaploid wheat population. Euphytica. 2015;205(1):287–309.
Rodríguez M, Barrero J, Corbineau F, Gubler F, Benech-Arnold R. Dormancy in cereals (not too much, not so little): about the mechanisms behind this trait. Seed Sci Res. 2015;25(02):99–119.
Anderson J, Sorrells M, Tanksley S. RFLP analysis of genomic regions associated with resistance to pre-harvest sprouting in wheat. Crop Sci. 1993;33:453–9.
Mares D, Mrva K. Mapping quantitative trait loci associated with variation in dormancy in Australian wheat. Aust J Agric Res. 2001;52:1257–65.
Chen C, Cai S, Bai G. A major QTL controlling seed dormancy and pre-harvest sprouting resistance on chromosome 4A in a Chinese wheat landrace. Mol Breed. 2008;21(3):351–8.
Somyong S, Ishikawa G, Munkvold J, Tanaka J, Benscher D, Cho Y, Sorrells M. Fine mapping of a preharvest sprouting QTL interval on chromosome 2B in white wheat. Theor Appl Genet. 2014;127(8):1843–55.
Barrero JM, Cavanagh C, Verbyla KL, Tibbits JFG, Verbyla AP, Huang BE, Rosewarne GM, Stephen S, Wang P, Whan A, Rigault P, Hayden MJ, Gubler F. Transcriptomic analysis of wheat near-isogenic lines identifies PM19-A1 and A2 as candidates for a major dormancy QTL. Genome Biol. 2015;16(1):93.
Torada A, Koike M, Ogawa T, Takenouchi Y, Tadamura K, Wu J, Matsumoto T, Kawaura K, Ogihara Y. A Causal gene for seed dormancy on wheat chromosome 4A encodes a MAP kinase kinase. Curr Biol. 2016;26(6):782–7.
Wang X, Liu H, Liu G, Mia MS, Siddique KHM, Yan G. Phenotypic and genotypic characterization of near-isogenic lines targeting a major 4BL QTL responsible for pre-harvest sprouting in wheat. BMC Plant Biol. 2019;19(1):348.
Osa M, Kato K, Mori M, Shindo C, Torada A, Miura H. Mapping QTLs for seed dormancy and the Vp1 homologue on chromosome 3A in wheat. Theor Appl Genet. 2003;106(8):1491–6.
Liu S, Sehgal S, Li J, Lin M, Trick H, Yu J, Gill B, Bai G. Cloning and characterization of a critical regulator for preharvest sprouting in wheat. Genetics. 2013;195(1):263–73.
Kocheshkova AA, Kroupin PY, Bazhenov MS, Karlov GI, Pochtovyy AA, Upelniek VP, Belov VI, Divashuk MG. Pre-harvest sprouting resistance and haplotype variation of ThVp-1 gene in the collection of wheat-wheatgrass hybrids. PLoS One. 2017;12(11):e0188049.
Kulwal P, Kumar N, Gaur A, Khurana P, Khurana J, Tyagi A, Balyan H, Gupta P. Mapping of a major QTL for pre-harvest sprouting tolerance on chromosome 3A in bread wheat. Theor Appl Genet. 2005;111:1052–9.
Wang X, Liu H, Mia MS, Siddique KHM, Yan G. Development of near-isogenic lines targeting a major QTL on 3AL for pre-harvest sprouting resistance in bread wheat. Crop Pasture Sci. 2018;69(9):864–72.
Leach LJ, Belfield EJ, Jiang C, Brown C, Mithani A, Harberd NP. Patterns of homoeologous gene expression shown by RNA sequencing in hexaploid bread wheat. BMC Genomics. 2014;15:276.
Iquebal MA, Sharma P, Jasrotia RS, Jaiswal S, Kaur A, Saroha M, Angadi UB, Sheoran S, Singh R, Singh GP, Rai A, Tiwari R, Kumar D. RNAseq analysis reveals drought-responsive molecular pathways with candidate genes and putative molecular markers in root tissue of wheat. Sci Rep. 2019;9(1):13917.
Mia MS, Liu H, Wang X, Zhang C, Yan G. Root transcriptome profiling of contrasting wheat genotypes provides an insight to their adaptive strategies to water deficit. Sci Rep. 2020;10(1):4854.
Jia M, Guan J, Zhai Z, Geng S, Zhang X, Mao L, Li A. Wheat functional genomics in the era of next generation sequencing: an update. Crop J. 2018;6(1):7–14.
Gao S, Zheng Z, Powell J, Habib A, Stiller J, Zhou M, Liu C. Validation and delineation of a locus conferring Fusarium crown rot resistance on 1HL in barley by analysing transcriptomes from multiple pairs of near isogenic lines. BMC Genomics. 2019;20(1):650.
Hannenhalli S, Kaestner KH. The evolution of Fox genes and their role in development and disease. Nat Rev Genet. 2009;10(4):233–40.
Golson ML, Kaestner KH. Fox transcription factors: from development to disease. Development. 2016;143(24):4558.
Schmitt-Ney M. The FOXO’s advantages of being a family: considerations on function and evolution. In: Cells, vol. 9; 2020.
Pallauf K, Duckstein N, Hasler M, Klotz L-O, Rimbach G. Flavonoids as putative inducers of the transcription factors Nrf2, FoxO, and PPAR. Oxid Med Cell Longev. 2017;2017:4397340.
Kyo S, Sakaguchi J, Kiyono T, Shimizu Y, Maida Y, Mizumoto Y, Mori N, Nakamura M, Takakura M, Miyake K, Sakamoto M, Inoue M. Forkhead transcription factor FOXO1 is a direct target of progestin to inhibit endometrial epithelial cell growth. Clin Cancer Res. 2011;17(3):525–37.
Yin Z-J, Dong X-L, Kang K, Chen H, Dai X-Y, Wu G-A, Zheng L, Yu Y, Zhai Y-F. FoxO transcription factor regulate hormone mediated signaling on nymphal diapause. Front Physiol. 2018;9:1654.
Shah K, Russinova E, Gadella TW Jr, Willemse J, De Vries SC. The Arabidopsis kinase-associated protein phosphatase controls internalization of the somatic embryogenesis receptor kinase 1. Genes Dev. 2002;16(13):1707–20.
Morris ER, Chevalier D, Walker JC. DAWDLE, a forkhead-associated domain gene, regulates multiple aspects of plant development. Plant Physiol. 2006;141(3):932.
Watanabe S, Matsumoto M, Hakomori Y, Takagi H, Shimada H, Sakamoto A. The purine metabolite allantoin enhances abiotic stress tolerance through synergistic activation of abscisic acid metabolism. Plant Cell Environ. 2014;37(4):1022–36.
Takagi H, Ishiga Y, Watanabe S, Konishi T, Egusa M, Akiyoshi N, Matsuura T, Mori IC, Hirayama T, Kaminaka H, et al. Allantoin, a stress-related purine metabolite, can activate jasmonate signaling in a MYC2-regulated and abscisic acid-dependent manner. J Exp Bot. 2016;67(8):2519–32.
Liu X, Hou X. Antagonistic regulation of ABA and GA in metabolism and signaling pathways. Front Plant Sci. 2018;9:251.
Yamasaki K, Kigawa T, Seki M, Shinozaki K, Yokoyama S. DNA-binding domains of plant-specific transcription factors: structure, function, and evolution. Trends Plant Sci. 2013;18(5):267–76.
Carbonero P, Iglesias-Fernández R, Vicente-Carbajosa J. The AFL subfamily of B3 transcription factors: evolution and function in angiosperm seeds. J Exp Bot. 2017;68(4):871–80.
Wu J, Zhang Z, Zhang Q, Liu Y, Zhu B, Cao J, Li Z, Han L, Jia J, Zhao G, Sun X. Generation of wheat transcription factor FOX rice lines and systematic screening for salt and osmotic stress tolerance. PLoS One. 2015;10(7):e0132314.
Nakagawa S, Gisselbrecht SS, Rogers JM, Hartl DL, Bulyk ML. DNA-binding specificity changes in the evolution of forkhead transcription factors. Proc Natl Acad Sci. 2013;110(30):12349.
Goff KE, Ramonell KM. The role and regulation of receptor-like kinases in plant defense. Gene Regul Syst Bio. 2007;1:167–75.
Greeff C, Roux M, Mundy J, Petersen M. Receptor-like kinase complexes in plant innate immunity. Front Plant Sci. 2012;3:209.
MacGregor DR, Kendall SL, Florance H, Fedi F, Moore K, Paszkiewicz K, Smirnoff N, Penfield S. Seed production temperature regulation of primary dormancy occurs through control of seed coat phenylpropanoid metabolism. New Phytol. 2015;205(2):642–52.
Stone J, Collinge M, Smith R, Horn M, Walker J. Interaction of a protein phosphatase with an Arabidopsis serine-threonine receptor kinase. Science. 1994;266(5186):793–5.
Osakabe Y, Maruyama K, Seki M, Satou M, Shinozaki K, Yamaguchi-Shinozaki K. Leucine-rich repeat receptor-like kinase1 is a key membrane-bound regulator of abscisic acid early signaling in Arabidopsis. Plant Cell. 2005;17(4):1105–19.
Tani Y, Kimura K, Mihara H. Purification and properties of 4-methyl-5-hydroxyethylthiazole kinase from Escherichia coli. Biosci Biotech Bioch. 2016;80(3):514–7.
Gołda A, Szyniarowski P, Ostrowska K, Kozik A, Rapała-Kozik M. Thiamine binding and metabolism in germinating seeds of selected cereals and legumes. Plant Physiol Biochem. 2004;42(3):187–95.
Neumann G, Azaizeh HA, Marschner H. Thiamine (vitamin B1) seed treatment enhances germination and seedling growth of bean (Phaseolus vulgaris L.) exposed to soaking injury. Zeitschrift für Pflanzenernährung und Bodenkunde. 1996;159(5):491–8.
Finkelstein RR, Gampala SSL, Rock CD. Abscisic acid signaling in seeds and seedlings. Plant Cell. 2002;14(suppl 1):S15–45.
Boudsocq M, Danquah A, de Zélicourt A, Hirt H, Colcombet J. Plant MAPK cascades: Just rapid signaling modules? Plant Signal Behav. 2015;10(9):e1062197.
Choi SW, Lee SB, Na YJ, Jeung SG, Kim SY. Arabidopsis MAP3K16 and other salt-inducible MAP3Ks regulate ABA response redundantly. Mol Cells. 2017;40(3):230–42.
Colcombet J, Sözen C, Hirt H. Convergence of multiple MAP3Ks on MKK3 identifies a set of novel stress MAPK modules. Front Plant Sci. 2016;7:1941.
Díaz A, Zikhali M, Turner AS, Isaac P, Laurie DA. Copy number variation affecting the photoperiod-B1 and vernalization-A1 genes is associated with altered flowering time in wheat (Triticum aestivum). PLoS One. 2012;7(3):e33234.
Liu X, Zhang H, Zhao Y, Feng Z, Li Q, Yang H-Q, Luan S, Li J, He Z-H. Auxin controls seed dormancy through stimulation of abscisic acid signaling by inducing ARF-mediated ABI3 activation in Arabidopsis. Proc Natl Acad Sci. 2013;110(38):15485–90.
Wu M, Wu J, Gan Y. The new insight of auxin functions: transition from seed dormancy to germination and floral opening in plants. Plant Growth Regul. 2020;91(2):169–74.
Banerjee A, Roychoudhury A. Abscisic-acid-dependent basic leucine zipper (bZIP) transcription factors in plant abiotic stress. Protoplasma. 2017;254(1):3–16.
Skubacz A, Daszkowska-Golec A, Szarejko I. The role and regulation of ABI5 (ABA-Insensitive 5) in plant development, abiotic stress responses and phytohormone crosstalk. Front Plant Sci. 2016;7:1884.
Liu A, Gao F, Kanno Y, Jordan MC, Kamiya Y, Seo M, Ayele BT. Regulation of wheat seed dormancy by after-ripening is mediated by specific transcriptional switches that induce changes in seed hormone metabolism and signaling. PLoS One. 2013;8(2):e56570.
Appels R, Eversole K, Feuillet C, Keller B, Rogers J, Stein N, Stein N, Choulet F, Distelfeld A, Poland J, et al. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science. 2018;361(6403):661.
Kim D, Langmead B, Salzberg SL. HISAT: a fast spliced aligner with low memory requirements. Nat Methods. 2015;12:357.
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9(4):357–9.
Li B, Dewey CN. RSEM: accurate transcript quantification from RNA-Seq data with or without a reference genome. BMC Bioinformatics. 2011;12(1):323.
Wang L, Feng Z, Wang X, Wang X, Zhang X. DEGseq: an R package for identifying differentially expressed genes from RNA-seq data. Bioinformatics. 2009;26(1):136–8.
Rice P, Longden I, Bleasby A. EMBOSS: the European molecular biology open software suite. Trends Genet. 2000;16(6):276–7.
Mistry J, Finn RD, Eddy SR, Bateman A, Punta M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 2013;41(12):e121.
Liu H, Kishimoto S, Yamamizo C, Fukuta N, Ohmiya A. Carotenoid accumulations and carotenogenic gene expressions in the petals of Eustoma grandiflorum. Plant Breed. 2013;132(4):417–22.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2− ΔΔCT method. Methods. 2001;25(4):402–8.
The authors would like to thank Dr. Md Sultan Mia for his help on uploading the RNA-seq data onto the NCBI SRA database.
The research was funded by the Global Innovation Linkage program (GIL53853) from the Australian Department of Industry, Innovation and Science. The funding bodies were not involved in the design of the study and collection, analysis, and interpretation of data, or in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests for this research.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Wang, X., Liu, H., Siddique, K.H.M. et al. Transcriptomic profiling of wheat near-isogenic lines reveals candidate genes on chromosome 3A for pre-harvest sprouting resistance. BMC Plant Biol 21, 53 (2021). https://doi.org/10.1186/s12870-021-02824-x
- RNA sequencing
- Pre-harvest sprouting
- Marker-assisted selection
- Near-isogenic lines