- Research article
- Open Access
Genome-wide identification and expression analysis of YTH domain-containing RNA-binding protein family in common wheat
BMC Plant Biology volume 20, Article number: 351 (2020)
N6-Methyladenosine (m6A) is the most widespread RNA modification that plays roles in the regulation of genes and genome stability. YT521-B homology (YTH) domain-containing RNA-binding proteins are important RNA binding proteins that affect the fate of m6A-containing RNA by binding m6A. Little is known about the YTH genes in common wheat (Triticum aestivum L.), one of the most important crops for humans.
A total of 39 TaYTH genes were identified in common wheat, which are comprised of 13 homologous triads, and could be mapped in 18 out of the 21 chromosomes. A phylogenetic analysis revealed that the TaYTHs could be divided into two groups: YTHDF (TaDF) and YTHDC (TaDC). The TaYTHs in the same group share similar motif distributions and domain organizations, which indicates functional similarity between the closely related TaYTHs. The TaDF proteins share only one domain, which is the YTH domain. In contrast, the TaDCs possess three C3H1-type zinc finger repeats at their N-termini in addition to their central YTH domain. In TaDFs, the predicated aromatic cage pocket that binds the methylysine residue of m6A is composed of tryptophan, tryptophan, and tryptophan (WWW). In contrast, the aromatic cage pocket in the TaDCs is composed of tryptophan, tryptophan, and tyrosine (WWY). In addition to the general aspartic acid or asparagine residue used to form a hydrogen bond with N1 of m6A, histidine might be utilized in some TaDFb proteins. An analysis of the expression using both online RNA-Seq data and quantitative real-time PCR verification revealed that the TaDFa and TaDFb genes are highly expressed in various tissues/organs compared with that of TaDFcs and TaDCs. In addition, the expression of the TaYTH genes is changed in response to various abiotic stresses.
In this study, we identified 39 TaYTH genes from common wheat. The phylogenetic structure, chromosome distribution, and patterns of expression of these genes and their protein structures were analyzed. Our results provide a foundation for the functional analysis of TaYTHs in the future.
N6-Methyladenosine (m6A) is the most comment posttranscriptional modification in eukaryotic RNAs, which is crucial for gene regulation and the maintenance of genome stability . m6A is a reversible and dynamic modification that is catalyzed by a methyltransferase complex (Writers), such as METTL3, METTL14, WTAP, and RBM15/RBM15B, and removed by demethylases (Erasers), including FTO and ALKBH5. The modification of m6A plays important roles in diverse physiological processes. For example, MTA, MTB, FIP37, and VIR, the homologs of methyltransferase complex subunits in plants, function in the regulation of shoot stem cell fates, root growth, gravitropic responses, and embryo development [2,3,4,5]. m6A plays its roles at the molecular level by affecting the formation of secondary RNA structure, during which m6A modification controls the accessibility of binding sites for RNA binding proteins (Readers) . The recognition of m6A by the readers is important for the metabolism, processing, and folding of RNA, and protein translation. Many m6A reader proteins have been identified, and most contain a YT521-B homologous (YTH) domain with an aromatic cage that specifically recognizes the GG(m6A) C sequence . Five YTH domain-containing RNA-binding proteins in humans were identified, and they can be classified into three categories based on their sequences: YTHDC1 (YTH domain-containing protein 1), YTHDC2 (YTH domain-containing protein 2), and YTHDF (YTH domain-containing family protein) family . YTHDF proteins (DF1, DF2, and DF3) are localized in the cytoplasm and contain a C-terminal YTH domain and a large, low-complexity region. It has been shown that YTHDFs bind all the m6A sites in mRNA. YTHDC1 is nuclear-enriched and contains a YTH domain at its central part and is composed of other multiple functional domains. In contrast, YTHDC2 is a nucleocytoplasmic protein that contains a C-terminal YTH domain and a DEAD-box RNA helicase domain. YTHDC1 binds to certain m6A sites in mRNAs and to noncoding RNAs, whereas YTHDC2 primarily binds to noncoding RNAs . Many YTH genes have been identified in various plant species, including Arabidopsis thaliana, Oryza sativa, Malus domestica, Citrus sinensis and Cucumis sativus [10,11,12,13]. Thirteen ECT genes of A. thaliana, homologs of YTHDFs, were identified, some of which function in controlling the timing of the first true leaf formation, proper leaf morphology, and trichome branching [14,15,16,17]. ECT2 can interact with m6A-containing RNAs in vivo, indicating that YTH domain is an m6A reading domain in Arabidopsis . AtCPSF30 is a homolog of human HYTDC, which is a cleavage and polyadenylation specificity factor that functions in oxidative and nitrate signaling [18, 19]. In A. thaliana, M. hupehensis and C. sativus, YTH genes are involved in plant responses to pathogens, plant hormones (salicylic acid and abscisic acid), and abiotic stresses (water, temperature, and salinity) [10,11,12,13, 20,21,22]. These studies revealed that YTH domain-containing proteins play important roles in plant development and biotic and abiotic stress responses. Wheat is one of the major cereal crops in the world. To our knowledge, the YTH protein family in common wheat (Triticum aestivum L.) has not yet been analyzed in detail. In this study, we identified 39 genes that encode YTH domain-containing proteins in common wheat and analyzed their phylogenetic relationship and expression in different tissues/organs and in response to various stresses.
Identification and phylogenetic analysis of the YTH proteins in common wheat
To identify the TaYTH genes in common wheat, we used the hidden Markov model profiles of the YTH domain (PF04146) as queries to search the protein databases (IWGSC RefSeq v1.1) of common wheat (https://www.ebi.ac.uk/Tools/hmmer/search/hmmsearch) that was downloaded as a local protein database of bread wheat. Additionally, the conserved YTH domain was also used to BLASTp the YTH proteins encoded by the wheat genome on the website http://plants.ensembl.org/Multi/Tools/Blast . Additionally, we performed BLASTP analysis against the local wheat protein database (E-value < 1 × 10− 5) by using the YTH domain-containing protein sequences of Arabidopsis thaliana and Oryza sativa as queries. After the redundant sequences were removed, a total of 39 putative YTH proteins were identified and used for further analysis (Additional file 1). Their encoding genes are 13 sets of homeoalleles with 13 YTH genes in each sub-genome. Additionally, 13, 17, 12, 15, 12, 13 YTH genes in A. thaliana, Glycine max, O. sativa, Zea mays, Brachypodium distachyon, and Hordeum vulgare, respectively, were identified using the same strategy in this study and other study . An unrooted phylogenetic tree of 39 TaYTH genes was constructed based on the YTH domains of the 39 TaYTHs in concert with 82 YTH proteins from A. thaliana, O. sativa, G. max, Z. mays, B. distachyon, and H. vulgare (Additional file 2) to analyze the evolutionary relationships of the TaYTH proteins (Fig. 1a). The phylogenetic analysis revealed that the TaYTH domain proteins were clustered into two groups: YTHDF (TaDF) and YTHDC (TaDC). The YTHDF family contains three subgroups, YTHDFa, YTHDFb, and YTHDFc, which include 12, 9, and 15 members, respectively. The YTHDC family can be divided into two additional subgroups, YTHDCa and YTHDCb. No YTHDCb proteins were identified in common wheat, which is consistent with the previous hypothesis that there are no group-b YTHDCs in monocotyledons . All of the 39 YTH proteins that had been identified were re-named based on their phylogenetic relationship consistent with the YTH proteins that have been identified in other plant species . The subgenome (A, B or D) on which the genes are located is included in the name, and the number before the subgenome indicates the serial number of chromosome. Chromosomal location information of the TaYTH genes was extracted from the GFF3 reference file of the wheat genome, and a chromosomal map was constructed using KnetMiner, an online analytical tool (https://knetminer.rothamsted.ac.uk/Triticum_aestivum/). The map constructed revealed that the TaYTHs are located on all chromosomes with the exception of chromosomes 6A, 6B, and 6D. There are three TaYTH genes on chromosomes 5A, 5B, 5D, 7A, 7B, and 7D, and two on chromosomes 1A, 1B, 1D, 3A, 3B, 3D, 4A, 4B, and 4D. In contrast, only one TaYTH gene is located on each 2A, 2B, and 2D chromosome (Fig. 1b).
Gene structure and synteny of the TaYTH genes
The exon-intron structure provides insight on the evolution of gene families and additional evidence to support phylogenetic grouping. We then analyzed the exon-intron structure of the TaYTH genes based on their evolutionary classification. All the TaYTH genes contain introns, and most of the genes contained from five to eight introns (Fig. 2). In contrast, TaDF3b-7B contains the largest number of introns (11), including three long ones (> 4 Kb) that are not found in the YTH genes of other plant species [11, 12] (Fig. 2). The patterns of distribution of the TaYTH exon/introns were similar among the members within each group in the phylogenetic tree. The proportions of intron phases 1, 2, and 0 are 2.55, 23.3, and 74.2%, respectively, of all the TaYTH genes (Fig. 2). There are similar patterns of the intron phases of TaYTH genes with those of the YTH genes in A. thaliana, O. sativa and other species , i.e., the largest proportion of introns was in phase 0, with a few in phase 1 (Fig. 2).
Genomic replication events often result in the expansion of the gene family during the evolution of angiosperms . Three pairs of genes, TaDF3a-5A and TaDF4a-5A, TaDF3a-5B and TaDF4a-5B, and TaDF3a-5D and TaDF4a-5D, are localized in the region within a 200 kb range in chromosomes 5A, 5B, and 5D, respectively (Fig. 1b, Table 1). In addition, these genes share similar exon-intron structure (Fig. 2). These suggest that these YTH genes may be produced by tandem gene replication. The synteny diagram of the YTH genes between wheat and other plant species (A. thaliana; O. sativa; Z. mays; Medicago truncatula and B. distachyon) revealed that the YTH genes of wheat contained the greatest number of homologous gene pairs with those of B. distachyon, followed by rice and maize (Fig. 3). No YTH homologous gene pairs were identified between wheat and Arabidopsis. Only two homologous gene pairs were identified between wheat and M. truncatula. In contrast, 18, 21, 21, and 24 YTH gene pairs were identified between barley and wheat, rice and wheat, maize and wheat, B. distachyon and wheat, respectively, matching the three expected homologous counterparts in wheat (Fig. 3, Additional file 3). Our results reveals there is a similar copy number of YTH genes in different species and similar numbers of YTH gene pairs among different species and wheat, which suggest that the YTH gene family is conservative and withstand strong selection pressure during plant evolution.
Protein features of TaYTHs
The lengths of TaYTHs ranged from 588 to 767 amino acid residues. The molecular weight (MW) and isoelectric point (pI) of the TaYTH proteins varied from 63.9 to 82.1 kDa and from 5.40 to 8.54, respectively (Table 1). The subcellular localization of TaYTH proteins was predicted by the online tool PSORT (https://www.psort.org/). TaDF1b-7D, TaDF2c-3A, TaDF2c-3B, TaDF2c-3D, and TaDF4c-1B were predicted to be localized in the cytoplasm, while other TaYTHs were localized in nucleus (Table 1).
Sequence alignments revealed that the TaYTH proteins only share 30.85% identity, and the highly conserved regions are located in the YTH domains with 69.57% identity (Additional file 4), which is similar to the YTH domain-containing proteins of other species . In most of the TaDF proteins, the YTH domain is the only recognizable module at their C-terminus (Fig. 4a) that is similar with those of other species . In contrast, in addition to the YTH domain at their middle parts, all three TaDC proteins harbor two additional domains, an N-terminal YTH1 superfamily domain and a C-terminal PRK12678 superfamily domain. The YTH1 superfamily domain is composed of three repeated C3H1-type zinc fingers  (Fig. 4b). All these TaDC proteins were classified as type-a YTHDC (Fig. 1). The type-a YTHDCs in other plants also possess the YTH1 superfamily domain at their N-terminus, but their C-terminus harbors different types of domains (Additional files 5 and 6). In contrast, the type-b YTHDCs in dicotyledonous plants do not contain additional domains besides the YTH domain, which is like that of the YTHDC1s in animals (Additional files 5 and 6). To further analyze the structural diversity in the TaYTH proteins, the conserved motifs in TaYTHs were identified using the online tool MEME (http://meme-suite.org/tools/meme) (Additional file 7). All the TaYTH proteins contained motifs 2, 4, and 6. In the TaDFs, the YTH domain is composed of three motifs (2, 4, and 1) and followed by motif 3 at their C-termini. In contrast, the YTH domain of TaDCs consists of three motifs (2, 4, and 6) and is followed by an additional motif 6 at their C-termini (Fig. 5c). Upstream of the YTH domain, motifs 5, 6, and 8 were shared by subgroup TaDFa; in contrast, they are comprised of motif 5, 6, 7 and 8 in subgroup TaDFb and motif 6 and/or 8 in subgroup TaDFc (Fig. 4c).
An aromatic cage pocket in the YTH domain of YTHDF and YTHDC is utilized to bind the methylysine residue of m6A [6, 8]. This positively-charged pocket is formed by the side chains of tryptophan (W411), tryptophan (W465), and tryptophan (W470) in human YTHDF1; and tryptophan (W377), tryptophan (W428), and leucine (L439) in human YTHDC1. In this study, we found that the aromatic cage in the TaDFs is composed of tryptophan, tryptophan, and tryptophan (WWW), but the cage in TaDCs is composed of tryptophan, tryptophan, and tyrosine (WWY) (Fig. 5a and b). In addition to the methylation-dependent interactions, m6A also forms base-specific hydrogen bonds with the YTH proteins . A comparison of the binding affinity between YTHDF1 and YTHDC1 with m6A revealed that the asparagine (N367) in YTHDC1 forms a hydrogen bond with N1 of m6A that is stronger than the corresponding aspartic acid (D401) residue with N1 of m6A in YTHDF1. In this study, we found that all the TaDFa, six TaDFb and three TaDFc proteins used aspartic acid to form a hydrogen bond with N1 of m6A; however, aspartic acid has been replaced by asparagine in six TaDFb, three TaDFc, and three TaDC proteins (Fig. 5a and b). Additionally, histidine is used in the corresponding position in three TaDFb proteins (Fig. 5a and b), and this mode was not found in other YTH proteins identified in plant species . The N-termini of the TaDF proteins possess low-complexity regions containing Y/P/Q-rich regions (Fig. 5c, Additional file 8); however, the Y/P/Q-rich regions in TaDCs were located between the zinc finger repeat (YTH1 superfamily domain) and the YTH domain (Fig. 5c).
Expression analysis of the TaYTH genes in different tissues/organs and under abiotic stresses
To analyze the patterns of TaYTH expression, we downloaded the RNA-Seq data of wheat from the Wheat Expression Browser website (http://www.wheat-expression.com/) . The expression data of 19 tissues/organs (grain, radicle, embryo proper, coleoptile, endosperm, seedling, root, root apical meristem, stem, shoot, shoot apical meristem, leaf, flag leave, spike, spikelet, glume, lemma, microspore, stigma, and ovary) at different development stages and abiotic/biotic stress conditions were included in our analysis. The results revealed that all the TaDFa and TaDFb genes were highly expressed in the 19 tissues/organs with the exception of TaDF3bs (Fig. 6a). In contrast, the levels of expression of TaDFcs and TaDCas were lower in various tissues/organs with the exception of TaDF3cs and TaDF5cs (Fig. 6a). Most of the TaYTH genes exhibited obvious changes of expression in response to abiotic stresses, such as phosphorus starvation, cold stress, and heat stress (Fig. 6b). In addition, the expression of TaDF3b-7B and TaDF4c-1A are low in all the tissues/organs and stress conditions analyzed (Fig. 6a and b). TaDFa and TaDFb genes showed highly expressed under biotic stresses with the exception of TaDF3b-7B (Additional file 9). In contrast, TaDFc and TaDCa are low expressed except that TaDF5c is high expression under biotic stresses (Additional file 9).
To confirm the tissue-expression of the TaYTH genes in common wheat, we performed qRT-PCR analyses in five different tissues/organs of winter wheat cultivar Jimai22, including roots, stems, leaves at vegetative development, and spikes and flowers at reproductive development. A total of 13 genes (TaDF1a-4D, TaDF2a-5B, TaDF3a-5D, TaDF4a-5B, TaDF1b-7A, TaDF2b-1D, TaDF3b-7A, TaDF1c-2B, TaDF2c-3D, TaDF3c-4A, TaDF4c-1D, TaDF5c-3A, and TaDC1a-7A) and one of each set of homeoalleles were included in our analysis. TaDF1a-4D exhibited the highest level of expression in five tissues among the 13 TaYTH genes (Fig. 6c), which is consistent with the results based on the RNA-Seq data (Fig. 6a). Our results revealed that 11 out of 13 TaYTHs were expressed at relatively higher levels in leaves with the exception of TaDF5c-3A, and TaDC1a-7A (Fig. 6c). In addition, the levels of expression of TaDF3a-5D, TaDF1b-7A, TaDF5c-3A and TaDC1a-7A in the stems and TaDF3a-5D, TaDF4a-5B, TaDF1b-7A, TaDF2c-3D, and TaDF3c-4A in the flowers were at relatively higher levels. In contrast, only TaDF3b-7A and TaDF3c-4A exhibited a higher level of expression in the roots and spikes, respectively (Fig. 6c).
In our study, a total of 39 putative YTH domain-containing protein family genes were identified in common wheat. Yue et al. identified 41 m6A reader proteins in wheat , and 38 of them contained the YTH domain. We identified an additional wheat gene, TaDC1a-7D (TraesCS7D02G450100.1), which encodes a protein containing YTH domain at its middle part, and was designated member of the wheat YTHDC group (Table 1). In the study by Yue et al. (2019) , TaCPSF30–2 (TraesCS6A01G138900.1) and TaCPSF30–5 (TraesCS6B01G167200.1) are designated YTHDC proteins. However, no YTH domain was identified in these two proteins. TaCPSF30–2 and TaCPSF30–5 were localized in chromosomes 6A and 6B . However, in our results, no YTH gene localized in any form of chromosomes 6 (A, B, and D) was identified. These discrepancies might have resulted from the different annotation of version 1 and 1.1 of the wheat genome (IWGSC RefSeq Annotations v1.0 and v1.1). A chromosomal distribution map revealed that the TaYTH genes are dispersed across all the chromosomes of all three wheat sub-genomes with the exception of chromosome 6. A similar pattern of dispersion was also found in the YTH genes in A. thaliana O. sativa and M. domestica, but it is different from those in C. sativus, in which the CsYTH genes are located on three out of seven of the C. sativus chromosomes .
It has been reported that a conserved mechanism was utilized by the YTH domain proteins YTHDF and YTHDC to recognize m6A, an aromatic cage that recognizes the methyl moiety of m6A [6, 8]. The YTHDF family of proteins has a WWW cage, while the YTHDCs have a WWL-type cage in humans. In this study, we found that the aromatic cage in TaDFs is comprised of WWW, but the cage in TaDCs is comprised of WWY. This rule is employed by the YTHDC proteins from other plant species. These analyses indicated that the plant YTHDCs are different from the animal YTHDCs in m6A binding. In animals, two types of YTHDCs were identified, YTHDC1 and YTHDC2. In YTHDC1, the YTH domain is located in the central part of protein, while the YTH domain is located at the C-terminus of YTHDC2. In plant species, two types of YTHDC were also identified. However, the YTH domains in plant YTHDCs were all localized in the central part of proteins, which is similar to that of the animal YTHDC1s. This suggests that the YTHDCs in plants could share similar molecular mechanisms with those of the animal YTHDC1s. In humans, YTHDC1 mediates pre-mRNA splicing by interacting with splicing factor . Pre-mRNA splicing is involved in the regulation of plant development, sexual reproduction, and abiotic stress responses [27,28,29,30,31,32,33,34,35]. Thus, the functions of plant YTHDCs in pre-mRNA splicing merit further study. Additionally, the N-termini of plant YTHDFs do not possess a zinc finger domain like those found in the plant YTHDCs. This is consistent with the suggestion that, aside from their conserved YTH domain, YTHDCs are unrelated to YTHDFs based on their amino acid sequences . Animal YTHDF proteins harbor YTH domain at their C-termini and a low-complexity region at their N-termini. In this low-complexity region, several P/Q/N-rich patches were found in animal YTHDF proteins. P/Q/N-rich patches also were identified in TaYTHs in this study. A P/Q/N-rich region is suggested to function in the regulation of the stability of m6A-containing mRNAs by localizing the YTH domain-containing proteins to the RNA decay sites [36,37,38]. m6A forms base-specific hydrogen bonds with YTH proteins that enhance these interactions . In human YTHDF1 and YTHDC1, asparagine or aspartic acid is used to form the hydrogen bond with N1 of m6A, although asparagine interacts more strongly with N1 of m6A than does aspartic acid. However, in some wheat YTH proteins, histidine might be utilized for the interaction with N1 of m6A. Asparagine and aspartic acid are neutral amino acids, while histidine is basic. Thus, the binding of m6A with YTH proteins in wheat might have substantial differences with those in other plant and animal species [6, 15]. Human YTHDC2 is a nucleocytoplasmic protein that contains a DEAD-box RNA helicase domain in addition to a C-terminal YTH domain. The DEAD-box RNA helicase domain-containing proteins function in unwinding double stranded RNA . In fact, the YTHDC2 proteins play roles in unwinding 5′-UTR and increasing the translational efficiency of the target gene . However, no DEAD-box RNA helicase domain-containing YTH proteins have been identified in the plant kingdom. Thus, whether YTH-involved unwinding is required for m6A recognition by the readers remains unknown.
Wheat is an important crop around the world, which provides more than 20% of the protein and caloric intake of humans. The yield of wheat is closely related with the development and architecture of spikes. Many signaling pathways have been identified that are involved in the regulation of spike development and architecture in wheat , during which the regulation of gene expression and hormone signaling play critical roles. In our study, a set of YTH genes were expressed at higher levels during the development of spikes and flowers of wheat. Thus, these highly expressed YTH genes in the wheat spikes might function in the development and architecture of spikes. The selection and creation of wheat germplasm with tolerance to multiple abiotic and biotic stresses is a key challenge for wheat breeding [42,43,44,45,46,47,48,49,50,51,52,53,54,55,56]. Our results and those of previous studies reveal that the expression of YTH gene is involved in the responses to various stresses and in plant development [10, 13, 15, 17, 21]. Therefore, the function of wheat YTH genes in the response to abiotic stresses merits further study, and these YTH genes might be valuable for wheat genetic improvement.
In this study, 39 TaYTH genes were identified in common wheat (Triticum aestivum L.). A phylogenetic analysis revealed that the TaYTHs could be divided into two groups, YTHDF (TaDF) and YTHDC (TaDC). The chromosomal location and exon-intron structure of the genes, and the motif distribution and domain organization of the proteins were analyzed. In addition, the aromatic cage pockets of TaYTH proteins were predicted. The aromatic cage pocket of TaDFs is composed of tryptophan, tryptophan, and tryptophan (WWW). In contrast, this pocket is composed of tryptophan, tryptophan, and tyrosine (WWY) in TaDC proteins. In addition to the general aspartic acid or asparagine residues used to form a hydrogen bond with N1 of m6A, some TaDFb proteins might utilize histidine in this role. The results of the analysis of expression revealed that the TaDFa and TaDFb genes were highly expressed in various tissues/organs. In addition, the expression of TaYTH genes is changed in response to various abiotic stresses. Our results provide the foundation for the functional analysis of TaYTHs in the future.
Identification of the YTH genes in wheat
The YTH protein family in wheat was identified as described by Yan et al. . The conserved sequence of the YTH domain was used to make a Hidden Markov model and used in BLAST homology searches. The reference genome sequence and annotations (GFF files) of wheat were downloaded from the wheat genome Database (IWGSC RefSeq v2.0) at Ensemble Plants (http://plants.ensembl.org/Triticum_aestivum/). The conserved YTH domain was also used to BLASTp the YTH proteins encoded by the wheat genome on the website http://plants.ensembl.org/Multi/Tools/Blast . Additionally, BLASTP analysis against the local wheat protein database (E-value < 1 × 10− 5) was performed by using the Arabidopsis and rice YTH domain-containing protein sequences as queries (Additional file 2).
Phylogenetic analysis, gene structure, and chromosomal locations
ClustalX2.1 was used for multiple sequence alignments with default parameters. A maximum likelihood phylogenetic tree was constructed using MEGA 6.0 (https://www.megasoftware.net/) with 1000 bootstrap replicates. The phylogenetic tree was visualized by FigTree (http://tree.bio.ed.ac.uk/software/). TBtools software  and the online analytical tool KnetMiner (https://knetminer.rothamsted.ac.uk/Triticum_aestivum/) were used to map the locations of HTY genes in chromosomes and construct the map of exon-intron structure.
Amino acid sequence analysis
Online tool PSORT (https://www.psort.org/) was used to predict the subcellular localization of the TaYTH proteins. The molecular masses and isoelectric points of the TaYTH proteins were predicted using the web tool ExPASy (https://web.expasy.org/compute_pi/). The domains of TaYTH proteins were analyzed using the web tools CDD (https://www.ncbi.nlm.nih.gov/) and ExPASy (https://prosite.expasy.org/). Finally, the domain graphs were visualized using TBtools . The motif compositions of TaYTHs were identified by MEME (http://meme-suite.org/tools/meme) and exhibited using TBtools software. Multiple protein sequence alignments were performed using DNAMAN8. Sequence logos of the conserved motifs of proteins were created using Weblogo (http://weblogo.berkeley.edu/logo.cgi).
Digital gene expression pattern analysis
The transcript per million values of expression of TaYTH genes in 19 tissues/organs (grain, radicle, embryo proper, coleoptile, endosperm, seedling, root, root apical meristem, stem, shoot, shoot apical meristem, leaf, flag leave, spike, spikelet, glume, lemma, microspore, stigma and ovary) and in response to various stresses were downloaded from the Wheat Expression Browser (www.wheat-expression.com) . Heat maps that showed the relative expression levels were constructed using TBtools.
Plant materials and growth conditions
The common wheat used in this study is Jimai22, a winter wheat cultivar that is widely planted in North China. The seeds of Jimai22 were provide by Dr. Yu Xiu Dong (Shandong Agricultural University, China). The wheat seedlings were grown in a greenhouse at 22 °C with a 16 h light/8 h dark cycle at Shandong Agricultural University. The roots, stems, and leaves were sampled from 7-day seedlings. To sample the spikes and flowers, Jimai22 seeds were first vernalized in soil for 30 days at 4 °C. The vernalized seedlings were then transferred into greenhouses. The spikes and flowers were sampled from the plants grown for 30 days and 40 days in a greenhouse, respectively.
Expression of the TaYTH genes analyzed by qRT-PCR
Total RNA was extracted using the TRIzol reagent (Invitrogen, Beijing, China) as described by Zhao et al. . The total RNA was reverse transcribed using a PrimeScript RT Reagent Kit (TakaRa, Dalian, China) according to the manufacturer’s instructions. The first strand cDNAs for qRT-PCR were synthesized using Synthesis SuperMix (TransGen, Beijing, China). The qRT-PCR amplifications were performed as described by Zhao et al. . Three biological replicates and three technical replicates were utilized. The Actin gene was used as an internal control. The sequences of the primers used in this study are listed in Additional file 10.
Availability of data and materials
All data generated or analyzed in this study are included in this published article and its Additional files. The datasets generated and analyzed during the current study are available from the corresponding author on reasonable request.
Yang Y, Hsu PJ, Chen Y-S, Yang Y-G. Dynamic transcriptomic m6A decoration: writers, erasers, readers and functions in RNA metabolism. Cell Res. 2018;28(6):616–24.
Shen L, Liang Z, Gu X, Chen Y, Teo ZWN, Hou X, Cai WM, Dedon PC, Liu L, Yu H. N6-methyladenosine RNA modification regulates shoot stem cell fate in Arabidopsis. Dev Cell. 2016;38(2):186–200.
Bodi Z, Zhong S, Mehra S, Song J, Li H, Graham N, May S, Fray RG. Adenosine methylation in Arabidopsis mRNA is associated with the 3′ end and reduced levels cause developmental defects. Front Plant Sci. 2012;3:48.
Zhong S, Li H, Bodi Z, Button J, Vespa L, Herzog M, Fray RG. MTA is an Arabidopsis messenger RNA adenosine methylase and interacts with a homolog of a sex-specific splicing factor. Plant Cell. 2008;20(5):1278–88.
Růžička K, Zhang M, Campilho A, Bodi Z, Kashif M, Saleh M, Eeckhout D, El-Showk S, Li H, Zhong S. Identification of factors required for m6A mRNA methylation in Arabidopsis reveals a role for the conserved E3 ubiquitin ligase HAKAI. New Phytol. 2017;215(1):157–72.
Patil DP, Pickering BF, Jaffrey SR. Reading m6A in the transcriptome: m6A-binding proteins. Trends Cell Biol. 2018;28(2):113–27.
Xu C, Wang X, Liu K, Roundtree IA, Tempel W, Li Y, Lu Z, He C, Min J. Structural basis for selective binding of m6A RNA by the YTHDC1 YTH domain. Nat Chem Biol. 2014;10(11):927–9.
Liao S, Sun H, Xu C. YTH domain: a family of N(6)-methyladenosine (m(6)a) readers. Genomics Proteomics Bioinformatics. 2018;16(2):99–107.
Meyer KD, Jaffrey SR. Rethinking m6A readers, writers, and erasers. Annu Rev Cell Dev Biol. 2017;33:319–42.
Li D, Zhang H, Hong Y, Huang L, Li X, Zhang Y, Ouyang Z, Song F. Genome-wide identification, biochemical characterization, and expression analyses of the YTH domain-containing RNA-binding protein family in Arabidopsis and rice. Plant Mol Biol Rep. 2014;32(6):1169–86.
Ouyang Z, Duan H, Mi L, Hu W, Chen J, Li X, Zhong B. Genome-wide identification and expression analysis of the YTH domain-containing RNA-binding protein family in Citrus sinensis. J Am Soc Hortic Sci. 2019;144(2):79–91.
Wang N, Yue Z, Liang D, Ma F. Genome-wide identification of members in the YTH domain-containing RNA-binding protein family in apple and expression analysis of their responsiveness to senescence and abiotic stresses. Gene. 2014;538(2):292–305.
Zhou Y, Hu L, Jiang L, Liu S. Genome-wide identification and expression analysis of YTH domain-containing RNA-binding protein family in cucumber (Cucumis sativus). Genes Genom. 2018;40(6):579–89.
Lockhart J. A tale of three studies: uncovering the crucial roles of m6A readers. Plant Cell. 2018;30(5):947.
Scutenaire J, Deragon J-M, Jean V, Benhamed M, Raynaud C, Favory J-J, Merret R, Bousquet-Antonelli C. The YTH domain protein ECT2 is an m6A reader required for normal trichome branching in Arabidopsis. Plant Cell. 2018;30(5):986–1005.
Wei L-H, Song P, Wang Y, Lu Z, Tang Q, Yu Q, Xiao Y, Zhang X, Duan H-C, Jia G. The m6A reader ECT2 controls trichome morphology by affecting mrna stability in Arabidopsis. Plant Cell. 2018;30(5):968–85.
Arribas-Hernández L, Bressendorff S, Hansen MH, Poulsen C, Erdmann S, Brodersen P. An m6A-YTH module controls developmental timing and morphogenesis in Arabidopsis. Plant Cell. 2018;30(5):952–67.
Zhang J, Addepalli B, Yun KY, Hunt AG, Xu R, Rao S, Li QQ, Falcone DL. A polyadenylation factor subunit implicated in regulating oxidative signaling in Arabidopsis thaliana. PLoS One. 2008;3(6):e2410.
Li Z, Wang R, Gao Y, Wang C, Zhao L, Xu N, Chen KE, Qi S, Zhang M, Tsay YF, et al. The Arabidopsis CPSF30-L gene plays an essential role in nitrate signaling and regulates the nitrate transceptor gene NRT1.1. New Phytol. 2017;216(4):1205–22.
Wang N, Guo T, Sun X, Jia X, Wang P, Shao Y, Liang B, Gong X, Ma F. Functions of two Malus hupehensis (Pamp.) Rehd. YTPs (MhYTP1 and MhYTP2) in biotic-and abiotic-stress responses. Plant Sci. 2017;261:18–27.
Wang N, Guo T, Wang P, Sun X, Shao Y, Jia X, Liang B, Gong X, Ma F. MhYTP1 and MhYTP2 from apple confer tolerance to multiple abiotic stresses in Arabidopsis thaliana. Front Plant Sci. 2017;8:1367.
Guo T, Wang N, Xue Y, Guan Q, van Nocker S, Liu C, Ma F. Overexpression of the RNA binding protein MhYTP1 in transgenic apple enhances drought tolerance and WUE by improving ABA level under drought condition. Plant Sci. 2019;280:397–407.
Adams KL, Wendel JF. Polyploidy and genome evolution in plants. Curr Opin Plant Biol. 2005;8(2):135–41.
Ramírez-González R, Borrill P, Lang D, Harrington S, Brinton J, Venturini L, Davey M, Jacobs J, Van Ex F, Pasha A. The transcriptional landscape of polyploid wheat. Science. 2018;361(6403):eaar6089.
Yue H, Nie X, Yan Z, Weining S. N6-methyladenosine regulatory machinery in plants: composition, function and evolution. Plant Biotochnol J. 2019;17(7):1194–208.
Xiao W, Adhikari S, Dahal U, Chen Y-S, Hao Y-J, Sun B-F, Sun H-Y, Li A, Ping X-L, Lai W-Y. Nuclear m6A reader YTHDC1 regulates mRNA splicing. Mol Cell. 2016;61(4):507–19.
Xiong F, Ren JJ, Yu Q, Wang YY, Kong LJ, Otegui MS, Wang XL. AtBUD13 affects pre-mRNA splicing and is essential for embryo development in Arabidopsis. Plant J. 2019;98(4):714–26.
Xiong F, Ren JJ, Yu Q, Wang YY, Lu CC, Kong LJ, Otegui MS, Wang XL. AtU2AF65b functions in abscisic acid mediated flowering via regulating the precursor messenger RNA splicing of ABI5 and FLC in Arabidopsis. New Phytol. 2019;223(1):277–92.
Wang Y-Y, Xiong F, Ren Q-P, Wang X-L. Regulation of flowering transition by alternative splicing: the role of the U2 auxiliary factor. J Exp Bot. 2020;71(3):751–8.
Thatcher SR, Zhou W, Leonard A, Wang B-B, Beatty M, Zastrow-Hayes G, Zhao X, Baumgarten A, Li B. Genome-wide analysis of alternative splicing in Zea mays: landscape and genetic regulation. Plant Cell. 2014;26(9):3472–87.
Qin Z, Wu J, Geng S, Feng N, Chen F, Kong X, Song G, Chen K, Li A, Mao L. Regulation of FT splicing by an endogenous cue in temperate grasses. Nat Commun. 2017;8(1):1–12.
Zhu FY, Chen MX, Ye NH, Shi L, Ma KL, Yang JF, Cao YY, Zhang Y, Yoshida T, Fernie AR. Proteogenomic analysis reveals alternative splicing and translation as part of the abscisic acid response in Arabidopsis seedlings. Plant J. 2017;91(3):518–33.
Chen M-X, Zhu F-Y, Wang F-Z, Ye N-H, Gao B, Chen X, Zhao S-S, Fan T, Cao Y-Y, Liu T-Y. Alternative splicing and translation play important roles in hypoxic germination in rice. J Exp Bot. 2019;70(3):817–33.
Ren RC, Wang LL, Zhang L, Zhao YJ, Wu JW, Wei YM, Zhang XS, Zhao XY. DEK43 is a P-type pentatricopeptide repeat (PPR) protein responsible for the Cis-splicing of nad4 in maize mitochondria. J Integr Plant Biol. 2019. https://doi.org/10.1111/jipb.12843.
Laloum T, Martín G, Duque P. Alternative splicing control of abiotic stress responses. Trends Plant Sci. 2018;23(2):140–50.
Fu Y, Dominissini D, Rechavi G, He C. Gene expression regulation mediated through reversible m6A RNA methylation. Nat Rev Genet. 2014;15(5):293.
Yang Y, Sun B-F, Xiao W, Yang X, Sun H-Y, Zhao Y-L, Yang Y-G. Dynamic m6A modification and its emerging regulatory role in mRNA splicing. Sci Bull. 2015;60(1):21–32.
Wang X, Lu Z, Gomez A, Hon GC, Yue Y, Han D, Fu Y, Parisien M, Dai Q, Jia G. N6-methyladenosine-dependent regulation of messenger RNA stability. Nature. 2014;505(7481):117.
Zhu DZ, Zhao XF, Liu CZ, Ma FF, Wang F, Gao X-Q, Zhang XS. Interaction between RNA helicase ROOT INITIATION DEFECTIVE 1 and GAMETOPHYTIC FACTOR 1 is involved in female gametophyte development in Arabidopsis. J Exp Bot. 2016;67(19):5757–68.
Tanabe A, Tanikawa K, Tsunetomi M, Takai K, Ikeda H, Konno J, Torigoe T, Maeda H, Kutomi G, Okita K. RNA helicase YTHDC2 promotes cancer metastasis via the enhancement of the efficiency by which HIF-1α mRNA is translated. Cancer Lett. 2016;376(1):34–42.
Gao X-Q, Wang N, Wang X-L, Zhang XS. Architecture of wheat inflorescence: insights from rice. Trends Plant Sci. 2019;24(9):802–9.
Gou J-Y, Li K, Wu K, Wang X, Lin H, Cantu D, Uauy C, Dobon-Alonso A, Midorikawa T, Inoue K. Wheat stripe rust resistance protein WKS1 reduces the ability of the thylakoid-associated ascorbate peroxidase to detoxify reactive oxygen species. Plant Cell. 2015;27(6):1755–70.
Ju L, Jing Y, Shi P, Liu J, Chen J, Yan J, Chu J, Chen KM, Sun J. JAZ proteins modulate seed germination through interaction with ABI5 in bread wheat and Arabidopsis. New Phytol. 2019;223(1):246–60.
Sun X, Liu T, Ning T, Liu K, Duan X, Wang X, Wang Q, An Y, Guan X, Tian J-C. Genetic dissection of wheat kernel hardness using conditional QTL mapping of kernel size and protein-related traits. Plant Mol Biol Rep. 2018;36(1):1–12.
Yuan Y, Gao M, Zhang M, Zheng H, Zhou X, Guo Y, Zhao Y, Kong F, Li S. QTL mapping for phosphorus efficiency and morphological traits at seedling and maturity stages in wheat. Front Plant Sci. 2017;8:614.
Zhang M, Gao M, Zheng H, Yuan Y, Zhou X, Guo Y, Zhang G, Zhao Y, Kong F, An Y. QTL mapping for nitrogen use efficiency and agronomic traits at the seedling and maturity stages in wheat. Mol Breeding. 2019;39(5):71.
Wang S, Li QP, Wang J, et al. YR36/WKS1-mediated phosphorylation of PsbO, an extrinsic member of photosystem ii, inhibits photosynthesis and confers stripe rust resistance in wheat. Mol Plant. 2019;12(12):1639–50.
Zhang C, Huang L, Zhang H, et al. An ancestral NB-LRR with duplicated 3'UTRs confers stripe rust resistance in wheat and barley. Nat Commun. 2019;10(1):4023.
Chovancek E, Zivcak M, Botyanszka L, et al. Transient heat waves may affect the photosynthetic capacity of susceptible wheat genotypes due to insufficient photosystem i photoprotection. Plants (Basel). 2019;8(8):282.
Zheng M, Chen J, Shi Y, et al. Manipulation of lignin metabolism by plant densities and its relationship with lodging resistance in wheat. Sci Rep. 2017;7:41805.
Zhang G, Zhang M, Zhao Z, Ren Y, Li Q, Wang W. Wheat TaPUB1 modulates plant drought stress resistance by improving antioxidant capability. Sci Rep. 2017;7(1):7549.
Liu S, Sehgal SK, Lin M, Li J, Trick HN, Gill BS, Bai G. Independent mis-splicing mutations in TaPHS1 causing loss of preharvest sprouting (PHS) resistance during wheat domestication. New Phytol. 2015;208(3):928–35.
Yu G, Hou W, Du X, Wang L, Wu H, Zhao L, Kong L, Wang H. Identification of wheat non-specific lipid transfer proteins involved in chilling tolerance. Plant Cell Rep. 2014;33(10):1757–66.
Tian F, Gong J, Zhang J, Zhang M, Wang G, Li A, Wang W. Enhanced stability of thylakoid membrane proteins and antioxidant competence contribute to drought stress resistance in the tasg1 wheat stay-green mutant. J Exp Bot. 2013;64(6):1509–20.
Biswas DK, Xu H, Li YG, Ma BL, Jiang GM. Modification of photosynthesis and growth responses to elevated CO2 by ozone in two cultivars of winter wheat with different years of release. J Exp Bot. 2013;64(6):1485–96.
Bie X, Wang K, She M, Du L, Zhang S, Li J, Gao X, Lin Z, Ye X. Combinational transformation of three wheat genes encoding fructan biosynthesis enzymes confers increased fructan content and tolerance to abiotic stresses in tobacco. Plant Cell Rep. 2012;31(12):2229–38.
Yan J, Su P, Wei Z, Nevo E, Kong L. Genome-wide identification, classification, evolutionary analysis and gene expression patterns of the protein kinase gene family in wheat and Aegilops tauschii. Plant Mol Biol. 2017;95(3):227–42.
Chen C, Xia R, Chen H, He Y. TBtools, a toolkit for biologists integrating various biological data handling tools with a user-friendly interface. BioRxiv. 2018;289660.
Zhao XY, Hong P, Wu JY, Chen XB, Ye XG, Pan YY, Wang J, Zhang XS. The tae-miR408-mediated control of TaTOC1 genes transcription is required for the regulation of heading time in wheat. Plant Physiol. 2016;170(3):1578–94.
Zhao B, Wu TT, Ma SS, Jiang DJ, Bie XM, Sui N, Zhang XS, Wang F. TaD27-B gene controls the tiller number in hexaploid wheat. Plant Biotechnol J. 2020;18(2):513–25.
We would like to thank the reviewers for their comments and helpful suggestions.
This work is supported by National Natural Science Foundation of China (91935302) focusing on the identification and mechanism of key genes for the yield traits of wheat, and National Transgenic Science and Technology Program (2019ZX08010–003) focusing on the genetic improvement of yield traits in wheat. The funding agencies were not involved in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Sequences of TaYTHs.
Sequences of YTHs used for the construction of phylogenetic tree.
Gene pairs between M. truncatula and wheat, barley and wheat, rice and wheat, maize and wheat, B. distachyon and wheat.
Sequence alignments among the identified TaYTH proteins.
Organization and distribution of the conserved domain in YTHDC1s in animals and YTHDCs in plants.
Sequences of animal YTHDC1s and plant YTHDCs.
Conserved motifs of TaYTHs.
Y/P/Q-rich region in TaYTHs.
Heat map showing the expression of TaYTH genes in response to biotic stress.
Primers used in this study.
About this article
Cite this article
Sun, J., Bie, X.M., Wang, N. et al. Genome-wide identification and expression analysis of YTH domain-containing RNA-binding protein family in common wheat. BMC Plant Biol 20, 351 (2020). https://doi.org/10.1186/s12870-020-02505-1
- YTH domain-containing RNA-binding protein
- Expression profiling
- Abiotic stress