CCCH Zinc finger genes in Barley: genome-wide identification, evolution, expression and haplotype analysis
BMC Plant Biology volume 22, Article number: 117 (2022)
CCCH transcription factors are important zinc finger transcription factors involved in the response to biotic and abiotic stress and physiological and developmental processes. Barley (Hordeum vulgare) is an agriculturally important cereal crop with multiple uses, such as brewing production, animal feed, and human food. The identification and assessment of new functional genes are important for the molecular breeding of barley.
In this study, a total of 53 protein-encoding CCCH genes unevenly dispersed on seven different chromosomes were identified in barley. Phylogenetic analysis categorized the barley CCCH genes (HvC3Hs) into eleven subfamilies according to their distinct features, and this classification was supported by intron–exon structure and conserved motif analysis. Both segmental and tandem duplication contributed to the expansion of CCCH gene family in barley. Genetic variation of HvC3Hs was characterized using publicly available exome-capture sequencing datasets. Clear genetic divergence was observed between wild and landrace barley populations in HvC3H genes. For most HvC3Hs, nucleotide diversity and the number of haplotype polymorphisms decreased during barley domestication. Furthermore, the HvC3H genes displayed distinct expression profiles for different developmental processes and in response to various types of stresses. The HvC3H1, HvC3H2 and HvC3H13 of arginine-rich tandem CCCH zinc finger (RR-TZF) genes were significantly induced by multiple types of abiotic stress and/or phytohormone treatment, which might make them as excellent targets for the molecular breeding of barley.
Overall, our study provides a comprehensive characterization of barley CCCH transcription factors, their diversity, and their biological functions.
Zinc finger transcription factors (TFs) are some of the most abundant TFs in plants and play key roles in regulating transcription and various biological functions . Zinc finger TFs are characterized by the typical zinc finger motif and a compact three-dimensional finger-type structure formed by cysteine and/or histidine residues coordinated with several zinc atoms . Most of the zinc finger families are protein- or DNA-binding proteins; a recently identified group of zinc finger proteins referred to as the CCCH gene family exhibits RNA binding and processing activity through their specific motifs in animals and plants [3, 4]. The CCCH proteins typically contain 1–6 CCCH-type motifs characterized by three cysteine residues and one histidine residue. The consensus sequence of the CCCH motif was defined as C–X4–17–C–X4–6–C–X3–H (C stands for cysteine, H for histidine, and X for any amino acid) based on the number of amino acid spacers between the cysteine and histidine residues [4,5,6]. CCCH proteins are over-represented by a class of proteins that contains a plant-unique tandem CCCH zinc finger (TZF) domain preceded by an arginine (R)-rich region; hereafter named RR-TZF proteins .
Many studies have shown that CCCH-type zinc finger proteins play a role in cell fate specification and developmental processes in plants. For example, AtKHZ1 and AtKHZ2 are required for flowering and senescence in Arabidopsis . AtC3H59/ZFWD3 plays an essential role in seedling development and seed germination and development by interacting with the PPPDE gene family protein Desil . In rice, OsDOS and OsTZF1 act as repressors of leaf senescence [10, 11]. OsGZF1 affects glutelin accumulation during seed development . GmZF351 and GmZF392 in soybean are involved in the accumulation of lipid in seeds [13, 14]. The involvement of CCCHs in hormone signaling adds complexity to the plant growth and development regulatory network. AtTZF4/5/6 act as negative regulators of light and gibberellins (GAs) and act as positive regulators of abscisic acid (ABA)-mediated regulation of seed development, dormancy, and germination . The CCCH-type zinc finger gene OsLIC is involved in the biosynthesis and/or signal transduction of brassinosteroids, which affects the architecture of rice plants . In switchgrass, PvC3H69 is a negative regulator of leaf senescence by repressing ABA synthesis and the ABA signaling pathway .
Several CCCH genes are implicated in the response to biotic and abiotic stressors in plants. For example, OsC3H10, OsC3H47, and OsTZF5 are involved in the regulation of tolerance to drought stress in rice [18,19,20]. Arabidopsis AtZFP1 has been reported to confer salt tolerance by limiting oxidative and osmotic stress and maintaining an ionic balance . The non-tandem CCCH-type gene AtC3H17 in Arabidopsis has pleiotropic effects in the salt stress response via an ABA-dependent signaling pathway . Switchgrass PvC3H72 was the first CCCH family gene identified to be involved in plant chilling and freezing tolerance, possibly through an ABA-mediated pathway . Moreover, DgC3H1 confers cold tolerance in Chrysanthemum plants by regulating the osmotic and reactive oxygen species (ROS) system, as well as the expression of genes associated with the cold stress response . In addition, CCCH genes are involved in other adaptive processes, such as resistance to bacterial blight disease , zinc homeostasis , hydrogen peroxide , and oxidative stress .
Genome-wide identification and characterization of CCCH genes have been carried out in Arabidopsis, rice , maize , poplar , tomato , Medicago truncatula , grape , citrus , switchgrass , Brassica rapa , and recently in the wheat , Brassica napus , and soybean . Barley (Hordeum vulgare) is one of the world’s oldest domesticated crops and ranks fourth among all cereal crops in area and tonnage harvested . However, CCCH genes have not been identified in barley, and their biological functions and evolutionary history remain poorly understood. This study aimed to genomically identify and characterize the barley CCCH genes (HvC3Hs). The phylogenetic relationships, distribution of motifs, intron–exon organization, and gene duplication events were comprehensively analyzed. Genomic variation, genetic diversity, and selection on these genes during barley domestication were also investigated using barley resequencing data (including wild and landrace barley accessions). Finally, we conducted RNA-seq and quantitative real-time polymerase chain reaction (qRT-PCR) analyses to determine the possible function of HvC3Hs. Our preliminary analysis provides new insight into the evolutionary history of CCCH genes and will aid future efforts to functionally characterize and genetically improve barley.
Genome-wide identification and characterization of CCCH proteins in barley
The most updated barley Morex assembly was used for the identification of barley CCCH genes. A total of 53 high-confident HvC3Hs with complete open reading frames were identified, accounting for 1.62% of the total annotated protein-coding genes in barley (Table 1; Supplementary Table S1). Because there is no standard nomenclature for barley CCCH genes, the candidate HvC3Hs were designated as HvC3H1 to HvC3H53 according to their chromosomal number and location. A BLAST search against the barley ESTs indicated that 38 HvC3Hs possessed EST records, which supported the existence of the HvC3Hs. Analysis of the physicochemical properties of HvC3H proteins demonstrated that the amino acid length varied from 127 amino acids (HvC3H35) to 1,456 amino acids (HvC3H6), with an average length of 482.2 amino acids. The pI varied from 5.13 to 10.15, and the MW ranged from 14.407 kDa to 160.373 kDa. All of these CCCH proteins possessed negative GRAVY values (average value: -0.704), indicating the hydropathic nature of HvC3Hs. Subcellular localization prediction showed that most of these proteins were located in the nucleus (45 HvC3Hs; 84.91%), which was consistent with their localization in Arabidopsis, rice, and wheat [4, 35].
CCCH domain structure analysis of HvC3Hs
Significant differences in the domain organization of HvC3Hs were observed. A total of eleven domain organizations of 141 CCCH motifs (C-X5-17-C-X4-6-C-X3-H) were identified, with an average of 2.66 CCCH motifs per protein. CCCH proteins have been shown to have one to six CCCH motifs [4, 16, 39, 40], and the similar pattern was observed in our study (Fig. 1). Notably, HvC3H6 contained eight CCCH motifs, which was a kind of newly identified motif for CCCH-type Zinc-finger protein. (Supplementary Table S2). Although different frequencies of CCCH motifs have been identified among barley CCCH proteins, two conventional CCCH motifs C-X8-C-X5-C-X3-H and C-X7-C-X5-C-X3-H were the two most common, suggesting that C-X7-8-C-X5-C-X3-H might be ancestral to the other CCCH motifs (Supplementary Fig. S1) . Additionally, a total of nine non-conventional CCCH zinc finger motifs, such as 6 C-X7-C-X4-C-X3-H, 4 C-X9-C-X5-C-X3-H, and 4 C-X5-C-X4-C-X3-H, were observed, which were previously identified to be abundant non-conventional CCCH motifs in Arabidopsis and rice. Furthermore, a total of five HvC3H proteins (HvC3H1, -9, -13, -26, and -31) were assigned to the RR-TZF proteins with an arginine-rich (RR) region located in front of C-X7-8-C-X5-C-X3-H- X16/18-C-X5-C-X4-C-X3-H (TZF) motif. Aside from the CCCH zinc finger motifs, some HvC3H proteins also contained several other known functional domains, such as KH, RRM, and RING. Five (HvC3H11, -41, -42, -43, and -50) and eight (HvC3H3, -8, -23, -27, -34, -36, -37, and -38) HvC3H members possessed the KH and RRM domains, respectively. An extra ankyrin (ANK) domain preceded the CCCH zinc finger motif was observed in HvC3H9, which was categorized as the ANK-RR-TZF protein.
Phylogenetic relationships, gene structure, and conserved domain organization of HvC3H genes
To determine the evolutionary relationships among HvC3Hs, a Maximum Likelihood (ML) phylogenetic tree was constructed based on the alignment of the full-length CCCH protein sequences of barley. According to the criteria proposed by Wang and Peng et al. with slight modifications [4, 6], the HvC3Hs were classified into eleven subfamilies (group I to group XI) (bootstrap values > 60%) (Fig. 2A). Twenty-six HvC3Hs formed thirteen sister gene pairs, twelve of which possessed high bootstrap support (> 98%). The number of CCCH proteins varied greatly for different subfamilies; subfamilies I and II rank the largest clusters with seven members, followed by the subfamilies VII (6 HvC3Hs) and XI (5 HvC3Hs). Notably, four RR-TZF genes (HvC3H1, -13, -26, and -31) were classified into subfamily XI, whereas the phylogenetic relationships of the ANK-RR-TZF gene (HvC3H9) remained ambiguous because of its low bootstrap values. We also constructed another phylogenetic tree based on the alignment of 188 CCCH proteins from Arabidopsis (68), rice (67), and barley (53) (Supplementary Fig. S2). The phylogenetic tree revealed an alternating distribution of monocot and eudicot CCCH genes in certain of the subfamilies.
The intron–exon gene structure provides potential insight into the functional diversification during evolution . Unlike other TF family genes, which tend to lack introns, the average intron number of HvC3Hs was 4.08 (ranging from 0 to 13) (Fig. 2B). In general, genes within the same subfamily had a similar structure of introns and exons. For example, genes from subfamily XI tended to be intron-less; subfamilies VII, VIII, and X were nearly identical in their intron/exon lengths and structural organization.
Consistent with the patterns in intron–exon gene structure, HvC3H proteins within the same subfamily tended to have a similar organization of motifs, and the patterns were highly variable among different phylogenetic clades (Fig. 2C). For example, the HvC3Hs in subfamily X possessed one CCCH motifs and one RRM motif, whereas subfamily I tended to have two or more CCCH motifs and one RRM motif. HvC3Hs in subfamily IV contained the RING domain. The variation in gene structure and motif composition among subfamilies suggests prior sub-functionalization or neofunctionalization of these HvC3Hs.
Chromosomal distribution and gene duplication
Chromosome location analysis revealed that the HvC3Hs were unevenly located on the seven barley chromosomes, and chromosome 3H possessed the most abundant CCCH genes (thirteen HvC3Hs) (Supplementary Fig. S3). By contrast, chromosomes 4H had only four CCCH genes. Chromosome 5H and 6H both contained nine CCCH genes, and chromosome 1H, 2H, and 7H both had six.
Gene duplication is considered one of the primary drivers of gene family expansion in plants and plays an important role in the evolution of new gene functions and adaptation [43, 44]. A total of six duplicated gene pairs were identified (Fig. 3). Two gene pairs (HvC3H14/HvC3H15 and HvC3H41/HvC3H42) were tandemly duplicated genes, the rest four gene pairs were designated as segmentally duplicated genes. According to the phylogenetic tree, these duplicated genes were clustered in the same clade. For example, HvC3H14/HvC3H15 were clustered in subfamily VI, HvC3H27/HvC3H34 were assigned to subfamily X, and HvC3H45/HvC3H48 belonged to subfamily IV. The Ka/Ks ratios of the segmentally duplicated genes were all lower than 1, indicating purifying selection (Supplementary Table S3) .
Syntenic relationships with six other representative species, including three monocots (Zea mays, Oryza sativa, and Triticum aestivum) and three dicots (Brassica rapa, Solanum lycopersicum, and Glycine max), were further analyzed to determine the mechanisms underlying the evolution of HvC3Hs (Fig. 4). A total of 65, 27, and 20 orthologous gene pairs between barley and Triticum aestivum, Zea mays, and Oryza sativa were identified, respectively. Sixteen HvC3H genes were orthologous to three copies of CCCH genes in wheat, which might stem from the fact that the heterologous hexaploid wheat contained three distinct ancestral genomes, namely A, B, and D . By contrast, the number of orthologous gene pairs between barley and three dicots (Glycine max, Brassica rapa, and Solanum lycopersicum) was ten, eight, and three, respectively, which was much lower than those between barley and three monocots. This finding is consistent with the observed phylogenetic relationships between barley and these species. HvC3Hs are phylogenetically closer to CCCHs in Triticum aestivum, Zea mays, and Oryza sativa than CCCHs in Glycine max, Brassica rapa, and Solanum lycopersicum. The overall Ka/Ks ratios between barley and the monocots (Triticum aestivum: 0.2729, Oryza sativa: 0.1840, and Zea mays: 0.1912) were significantly larger than that between barley and the dicots (Brassica rapa: 0.0646, Solanum lycopersicum: 0.0434, and Glycine max: 0.0295), suggesting the degenerated syntenic relationships after the separation of monocot and dicot (Supplementary Table S4).
Cis-element analysis of HvC3H genes
Cis-elements play important roles in the transcriptional regulation of genes throughout the life cycle of plants. A total of 52 functional cis-elements were identified and grouped into five categories. A large number of light-responsive elements were detected in the promoter regions of HvC3Hs, which accounted for most of the putative cis-elements (Supplementary Table S5, Supplementary Fig. S4). We also obtained a total of 11 types of hormone-responsive regulatory elements, such as auxin-responsive elements (AuxRR-core, TGA-box, and TGA-element), gibberellin-responsive elements (P-box, GARE-motif, and TATC-box), salicylic acid-responsive elements (TCA-element), and MeJA-responsive elements (CGTCA-motif and TGACG-motif). Several types of biotic and abiotic stress-related regulatory elements were observed in HvC3H promoters. Anaerobic induction elements (74 ARE and 45 GC-motif) were detected in 43 HvC3Hs. A total of 36 low temperature-responsive elements (LTR) and 39 drought-responsive elements (MBS, myeloblastosis binding site) were detected in 25 and 25 HvC3Hs, respectively. Fourteen HvC3H genes possessed wound-responsive elements (WUN-motif). Additionally, eight types of plant organogenesis-related cis-elements were identified, such as the meristem expression-related element CAT-box (15 genes), zein metabolism regulation-related element O2-site (13 genes) and endosperm expression-related element GCN4-motif (seven genes). These findings suggest that HvC3Hs might play an important role in barley plant growth and development, hormone signal transduction, and the response to biotic and abiotic stress.
Genetic variation of CCCH genes
We analyzed the sequence diversity of HvC3H genes at the population level based on exome-captured sequencing datasets. The average read coverage was 72.80% per sample per gene with the great majority (75.16%) that larger than 60% (Supplementary Table S6). The single nucleotide polymorphism (SNP)-calling pipeline generated 388 high-confident SNPs, 172 of which were in HvC3H32, followed by HvC3H21 (42) and HvC3H51 (23) (Supplementary Table S7; Supplementary Table S8). Most HvC3H-related SNPs were located within the intron regions (362); the rest of the SNPs were non-synonymous (13) and synonymous (13) SNPs. There were 314 InDels ranging from 1 to 55 bp in length, and short InDels 1 to 4 bp (76.54%) in length were more common than long InDels (Supplementary Table S9). Similarly, most HvC3H-related InDels were enriched in introns, which might be explained by the fact that the reading frame-independent variants were under weaker negative selection than frame-change variants.
To investigate the relatedness among the landraces and wild barley accessions worldwide, we carried out principal component analysis using HvC3H-related SNPs (Fig. 5A and B; Supplementary Table S10). The first principal component was correlated with the biological differentiation between landrace from wild barley and explained 22.11% of the total genetic variance; the second and third principal components captured 5.31% and 5.02% of the genetic variance, respectively, and revealed geographical differentiation in barley accessions. The phylogenetic tree further revealed genetically divergent clusters associated with the contrast between barley wild accessions versus landraces rather than their geographical origins (Fig. 5C). ADMIXTURE analysis confirmed these findings (Fig. 5D). When K = 2, two groups coinciding with landraces and wild barley were observed. Increasing K to 4 provided additional insights. Within barley landraces, we detected two geographically distributed components from Europe and Africa, whereas the rest of the landraces from Mediterranean areas displayed signs of genetic admixture. Within wild barley accessions, accessions from the Southern and Northern Levant regions formed two distinct groups.
Genetic diversity and haplotypes of HvC3Hs in wild and domesticated barley populations
Population-based nucleotide diversity was calculated to assess the occurrence of prior genetic bottlenecks of HvC3H genes during barley domestication. The total genetic diversity of HvC3H genes decreased by ~ 29.65% from the wild (π = 0.1050) to domesticated (π = 0.0739) barley population (Fig. 6A).
We further constructed the haplotype network for each HvC3H gene using their SNPs. A total of 922 non-redundant haplotypes belonging to 31 HvC3H genes were observed, with an average of 29.74 haplotypes per gene (Supplementary Fig. S5; Supplementary Table S8). Specific haplotypes represented in more than half of wild or landrace populations were defined as dominant haplotypes. Eleven HvC3H genes had no dominant haplotype, whereas 13 HvC3H genes had the same dominant haplotype in both wild and landrace populations. Nevertheless, clear genetic differentiation in haplotypes between wild and domesticated barley accessions was observed (Fig. 6B). HvC3H30 in wild barley mainly had the AAAAGGGGGGTTTTGGCC haplotype, but domesticated barley mainly had the AAAAGGGGAATTTTGGCC haplotype. The dominant haplotype of HvC3H49 in wild barley was AAGTTTTCCCTTGGGGAA, but haplotype AAGTTTTCTTTTGGGGTT was the most common in domesticated barley. Some rare haplotypes were also observed for specific HvC3H genes, such as HvC3H49, HvC3H51, and HvC3H52. The appearance of novel allelic variants greatly increased the degree of haplotype polymorphism of HvC3Hs. The rare haplotypes were mainly observed in the wild barley group, which was consistent with the results of the genetic diversity analysis. These results indicated that these genes experienced a severe genetic bottleneck during barley domestication and that the haplotype diversity decreased in domesticated barley relative to the wild population.
Temporal-spatial and stress-induced expression pattern analysis
Analysis of tissue-/stage-specific expression profiles provided valuable insights into the potential functions of genes in plant species. Distinct expression patterns were observed for the HvC3Hs by using the publicly available RNA-seq data from 16 different tissues (Fig. 7). The expression levels of HvC3Hs in group I were lower than those of genes in the other groups; eight genes were not expressed in most of the tissues/stages. By contrast, a total of thirteen genes in group III were highly expressed in most of the studied tissues/stages. HvC3H25 was predominantly expressed in LOD, CAR15, and EPI, whereas HvC3H3 and HvC3H18 showed high expression in INF2 and LOD, respectively. Genes in cluster II showed a medium expression level. Within this cluster, HvC3H7, -8, -13, -34, -43, and -52 tended to be expressed in INF1 and INF2. These findings indicate that these HvC3Hs might be associated with the development of these tissues in barley.
We analyzed the expression of HvC3Hs in response to different types of environmental stresses. Under cold treatment, four HvC3H genes displayed increased expression patterns (> 2.0-fold change) (Fig. 8A). Among these genes, HvC3H28, HvC3H2, and HvC3H30 exhibited their highest level of expression under cold treatment, showing fold-changes of 8.23, 4.57, and 2.17, respectively. Salt stress induced differential expression patterns of HvC3H genes in the three root regions (Fig. 8B). Compared with the unstressed control, a total of three, four, and two HvC3H genes were highly expressed in the meristematic, elongation, and maturation zones, respectively, especially HvC3H22, which exhibited a 10.96- and 6.67-fold increase in expression in the elongation and meristematic zones relative to the unstressed control. HvC3H2 was up-regulated in all tissues; its expression was increased 2.38-, 3.95-, and 3.21-fold in the meristematic, elongation, and maturation zones, respectively. Under metal ion stress, the expression of HvC3H2, -5, -13, -16, and -28 was significantly up-regulated, and the up-regulation of HvC3H2 was induced by copper and cadmium treatment (Fig. 8C). Under zinc and cadmium stress, HvC3H16 was up-regulated with 2.09- and 7.98-fold change, respectively.
Expression of HvC3Hs under drought, salt, cold, and ABA treatment by qRT-PCR analysis
To further investigate the expression of HvC3H genes in response to multiple treatments, 26 HvC3Hs were randomly subjected to qRT-PCR analysis. Under drought treatment, nine HvC3Hs were up-regulated at all time points (Supplementary Fig. S6), and the expression of six of the nine HvC3Hs (HvC3H3, -8, -10, -18, -37, and -50) peaked at 24 h. The expression of HvC3H3 was approximately 54-fold larger than that of the control at 24 h.
After salt treatment, the expression of HvC3H19 was suppressed compared with the control at all time points; the expression of 21 genes was significantly up-regulated, and the expression of these genes peaked at different times (Supplementary Fig. S7). For example, the expression of HvC3H3 peaked at 3 h and was up-regulated 43-fold, whereas the expression of HvC3H8, -10, and -18 was initially slightly up-regulated and peaked at 24 h.
The expression levels of HvC3Hs after cold treatment were analyzed, and the expression of six genes (HvC3H6, -8, -11, -30, -40, and -43) was inhibited compared with the control; the other HvC3Hs was up-regulated at specific time points (Supplementary Fig. S8). The expression of three HvC3Hs was up-regulated at 1 h (HvC3H36, -47, and -49), 3 h (HvC3H10, -45, and -50), and 6 h (HvC3H3, -25, and -33), suggesting that these HvC3Hs might primarily function in the initial stage in the response to cold injury. The expression of the other HvC3H genes peaked at 12 h or 24 h.
Plant CCCH proteins have been shown to be effective regulators of ABA-mediated stress responses . qRT-PCR analysis showed that ABA treatment had a pronounced effect on the expression patterns of HvC3Hs, and a complex expression profile was observed (Supplementary Fig. S9). For example, the expression of HvC3H3 was significantly up-regulated at 1 h and 3 h but down-regulated thereafter. By contrast, the expression of HvC3H37 was down-regulated before 12 h but significantly up-regulated at 24 h. Except for HvC3H8, -19, -30, -32, and -47, whose expressions were suppressed relative to the control, the maximum expression levels of the other HvC3Hs peaked at different time points.
Identification of CCCH genes in barley
CCCH domain-containing proteins are involved in various processes, including plant growth, development, and adaptation. Barley is the most important temperate crop in modern society . Herein, 53 highly confident CCCH zinc finger genes were identified in the barley genome through a comprehensive search. The number of CCCH proteins in barley was slightly lower than those identified in Arabidopsis (67), rice (68), maize (68), and grape (69); approximately half of those in poplar (91), Brassica rapa (103), and switchgrass (103), suggesting that the species origin and genome size are not directly associated with the number of CCCH genes.
The phylogenetic tree using the CCCH proteins from barley, rice, and Arabidopsis showed that the HvC3H genes displayed closer relationships with their orthologues than their paralogs. For example, CCCH genes in groups VII, XII, XX, and XXI showed a one-to-one-to-one orthologous pattern referred to a barley gene with one unique counterpart in Arabidopsis and rice. In contrast, groups II, IV, IX, XV, and XXVI did not contain any CCCH genes from barley. These rice and/or Arabidopsis-specific clades suggested that a presumed barley-specific loss of CCCH genes may have occurred after the divergence of barley and other species.
Genetic variation and haplotype polymorphism of HvC3Hs during the domestication of Barley
The genetic divergence of the HvC3H genes between wild and domesticated barley populations was characterized using publicly released exome capture datasets . Two genetically divergent populations associated with domestic vs. wild barley rather than barley populations with different geographic origins were observed, which was consistent with the deep phylogenetic split between wild and domesticated barley of HvmTERFs . The greater effect of human-driven selection on the genomic regions of HvC3Hs relative to natural selection was confirmed by the genetic diversity analysis. The nucleotide diversity of HvC3Hs displayed a significant reduction (~ 29.65%) of the genetic bottleneck between domesticated and wild barley, which was higher compared to that estimated for barley adaptive genes (22.5%) , housekeeping genes (20%) , and disease resistance genes (18.2%) . We concluded that HvC3Hs have undergone a genetic bottleneck from wild barley to cultivated barley and might be domestication-related genes.
Domestication is a plant-animal co-evolutionary process driven by the human demands for certain morphological and physical characteristics of crops . This results in a severe genetic bottleneck that reduces allele nucleotide diversity . The haplotype networks indicated that the haplotype composition of the HvC3H family in wild barley was rich compared with that in cultivated barley, indicating that initial human selection was focused on the retention of specific haplotypes by screening out a large number of undesirable haplotypes during domestication.
HvC3Hs might play a role in plant growth, abiotic stress, and phytohormone responses
The expression patterns of HvC3Hs provide insight into their possible functions. For example, HvC3H11, a KH domain-containing CCCH zinc finger gene, tended to be highly expressed in young inflorescences. In plants, the KH domain-containing genes FLK (Flowering Locus KH Domain) and PEP (PEPPER) regulate flowering by negatively and positively modulating the FLC expression, respectively [53, 54]. HvC3H22 was highly expressed in NOD, PAL, and LEM. Its orthologous gene AtC3H14 is the direct target of the MYB domain TF MYB46 and a master switch for cell elongation in Arabidopsis [55, 56].
Several studies have shown that CCCH proteins are involved in stress tolerance in plants . For example, Arabidopsis AtSZF1 and AtSZF2, two closely related CCCH zinc finger genes, negatively regulate the expression of salt-responsive genes and modulate the tolerance to salt stress . OsTZF1 negatively regulates leaf senescence under salt conditions and confers stress tolerance by delaying stress-responsive phenotypes, possibly through post-transcriptional control of the RNA metabolism of the salt stress-responsive genes . In this study, the RR-TZF protein HvC3H13 was homologous with these genes, and its expression was significantly induced under salt stress according to the RNA-seq and qRT-PCR analyses, suggesting that HvC3H13 may have similar functions in the salinity stress response. HvC3H28 displayed the most upregulated pattern at 1, 6, and 12 h of drought stress. Homology analysis revealed that its orthologous gene OsC3H47 is involved in drought tolerance through its elevated sensitivity to ABA . Another CCCH-tandem zinc finger protein OsTZF5, whose homologous gene was the RR-TZF protein HvC3H1, promotes drought avoidance and drought tolerance in rice . Several MBS cis-acting elements associated with drought inducibility within the promoter regions of HvC3H28 and HvC3H1 were predicted, suggesting that these genes might play a potential role in the response to drought stress. In Chrysanthemum, DgC3H1 enhances low-temperature tolerance by regulating the ROS system and the expression of downstream cold-related genes . However, the expression of HvC3H43, which is orthologous to DgC3H1, was not induced by cold stress according to RNA-seq and qRT-PCR analyses. No cis-acting element, such as LTR, was involved in low-temperature responsiveness within the HvC3H43 promoter region. These results indicate that HvC3H43 may have functionally diversified in barley. The expression of HvC3H13 was significantly upregulated at 1, 3, and 6 h compared to the control under cold stress. Its orthologous gene PvC3H72 acts as an added signaling component by regulating the ICE1-CBF-COR regulon and ABA-responsive genes during the switchgrass response to cold stress . Notably, another RR-TZF protein HvC3H2 is involved in several stressors, such as salt, low temperature, copper, and cadmium treatments. HvC3H2 is homologous to AtZFP1, which encodes a CCCH-type zinc finger protein induced by salt stress in Arabidopsis. Overall, several candidate CCCH genes that could be targets for subsequent genetic isolation and functional investigation in barley as well as in other cereal crops.
Despite the importance of CCCH genes in plant growth and development, the response to biotic and abiotic stress, and disease resistance, the precise roles of CCCH gene family members in barley have not yet been elucidated. Here, our genome-wide identification and characterization of HvC3H genes revealed the physical–chemical properties, phylogeny, intron–exon structure, and expansion patterns of these genes. The population structure based on the most recently released exome capture sequencing data revealed a deep phylogenetic split in the HvC3Hs between wild and domesticated barley. The nucleotide and haplotype diversity of most HvC3Hs indicated that these genes have undergone a severe genetic bottleneck during the transition from wild relatives to domesticated barley populations. The results of the expression profiling analysis suggested that HvC3H members might be associated with multiple physiological, metabolic, and developmental processes, especially in response to various types of biotic stresses. Overall, these findings will aid future studies examining the evolutionary history of HvC3Hs as well as functional studies of candidate HvC3H genes for molecular breeding in barley.
Identification of CCCH proteins in barley
The genomic proteins of barley Morex V2 were downloaded from the IPK database (https://doi.org/10.5447/ipk/2019/8). The CCCH protein sequences of Arabidopsis and rice were used as queries to search against the barley proteins with Basic Local Alignment Search Tool (BLAST) software (e-value < 1e-5). The Hidden Markov Model (HMM) of CCCH conserved domain (PF00642) was used as a query to search against the barley proteins by HMMER v3.0 with the e-value < 0.001. The candidate CCCH proteins were further verified by Simple Modular Architecture Research Tool (SMART) (http://smart.embl.de/), National Center for Biotechnology Information—Conserved Domains Database (NCBI-CDD) (https://www.ncbi.nlm.nih.gov/cdd/) and PFAM (http://pfam.xfam.org/) online databases. Putative proteins without CCCH domain were removed. A BLASTN search against barley expressed sequence tags (ESTs) was conducted to detect the existence of CCCH proteins with the following criteria: e-value < 1e-5, identity > 70% and coverage > 70%. The computational physical and chemical properties of CCCH family members, including molecular weight (MW), theoretical isoelectric point (pI), instability index (II), and grand average of hydropathicity (GRAVY) were evaluated using the online tool ExPASy (http://web.expasy.org/protparam/). The subcellular location was predicted using the cello software (http://cello.life.nctu.edu.tw/).
Phylogeny, gene structure and conserved motif analysis
The ClustalX v2.1 software was used to perform multiple alignments using the full-length CCCH protein sequences with default parameters. ML tree was constructed with IQ-TREE v2.1.3, using the best-fitting substitution model (VT + F + R3) selected automatically with bootstrap value of 1000 replications . The intron–exon organization of HvC3H genes was generated by Gene Structure Display Server (GSDS) (http://gsds.cbi.pku.edu.cn/) based on the gene annotation Gene Transfer Format (GTF) file . The conserved domains were identified using the online SMART tools. The upstream 1.5 kb genomic sequences of HvC3H genes were extracted and then submitted to the PlantCARE online database (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) to detect the potential cis-acting regulatory elements in the promoter regions.
Chromosome localization and gene synteny analysis
The chromosomal locations of HvC3H genes were obtained from IPK database (https://doi.org/10.5447/ipk/2019/8), and the chromosome maps were visualized using MapChart v2.32. The MCScanX software  was employed to analyze the synteny relationships of HvC3Hs in rice (Oryza sativa), wheat (Triticum aestivum), maize (Zea mays) soybean (Glycine max), tomato (Solanum lycopersicum), and Brassica rapa. The gene duplication events of HvC3Hs were identified according to the genomic comparison. Tandem duplicated genes were defined based on the following criteria (1) located within the same chromosome; (2) < 1 intervening gene . The syntenic and duplicated gene pairs were visualized by the Circos v0.67 tool. The non-synonymous substitution (Ka) / synonymous substitution (Ks) ratio was calculated to estimate genes evolutionary rate using the PAL2NAL online tools (http://www.bork.embl.de/pal2nal/) . Ka/Ks > 1, = 1 and < 1 represent positive, neutral, and purifying selection, respectively. The divergence time of syntenic and duplicated gene pairs was calculated based on the formula T = (Ks / 2λ) × 10−6 million years ago (MYA) (λ = 6.5 × 10−9) . The BLAST and orthoVeen2 software were employed to analyze the homologous genes between barley and other related species .
Population genetics analysis of HvC3H-related variants
The exome-captured resequencing data of 220 geographically-referenced barley accessions were retrieved from the NCBI SRA database (PRJEB8044/ERP009079) . Raw reads were trimmed using Trimmonmatic v0.36 with default parameters . The high-quality reads were mapped to the reference genome of barley Morex V2 using BWA-MEM v0.7.13r1126. The Bedtools v2.18 was employed to calculate the reads coverage per sample per gene. The single nucleotide polymorphism (SNP) and insertion-deletion (InDel) were identified using the Picard-GATK pipeline . The following criteria was used for SNPs filtration. (1) minor allele frequency (MAF) > 0.05 and < 0.95; (2) a maximum missing rate < 0.1; (3) biallelic alleles. SNP and InDel were annotated using the SnpEff v4.3 according to the barley genome GTF file . To better reveal the evolutionary relationships, barley accessions with SNP missing rate larger than 0.1 were excluded. Totally, the final collection included 95 landraces and 51 wild barley accessions (Supplementary Table S6). Only SNPs that located within the HvC3H genes were extracted for phylogenetic tree, population structure and principal component analysis (PCA). The HvC3H-related SNPs were used to construct ML tree with the IQ-TREE v2.1.3. The phylogenetic tree was visualized by Figtree v1.4.4. The population structure was quantified using ADMIXTURE v1.3.0 with predefined K values ranging from 2 to 4. The PCA was performed using the smartpca sub-package implemented in EIGENSOFT v4.2. The nucleotide diversity (π) was estimated using vcftools v0.1.16. The DNAsp v6.12.01 was employed to calculate the haplotypes for each HvC3H genes. Finally, the media-joining haplotype networks were constructed using the PopART v1.7 .
Expression patterns of HvC3H gene members
In order to explore the expression patterns of HvC3Hs, the publicly available 142 RNA-seq samples were downloaded from the NCBI Sequence Reading Archive (SRA) database, including different developmental stages and tissues and different biotic and abiotic stresses. The accession number and sample information of RNA-seq were listed in Supplementary Table S11. The fragments per kilobase per million (FPKM) value was calculated by the HISAT2 v2.1.0 and StringTie v1.3.5 pipeline. The R package pheatmap was used to visualize the expression profiles using the log2 transformed FPKM value.
Plant material, stress treatment, RNA extraction, and qRT-PCR analysis
Seeds of barley Morex were obtained from the College of Agronomy, Northwest A&F University, and were used as the experimental material. the barley seeds were hydroponically grown in growth chamber under controlled conditions (23 ± 1 °C, 16-h light/8-h dark cycle). The seedlings were processed for stress treatments at the three-leaf stage. For salt, drought, cold, and ABA treatments, the plants were incubated in 150 mM NaCl, 19.2% (w/v) PEG-6000, 4 °C and 100 μM ABA for 0, 1, 3, 6, 12, and 24 h, respectively. Seedlings without any treatment at the same time point were considered as the control. Leaves of all these samples were randomly collected with three biological replications. The collected samples were immediately frozen in liquid nitrogen and stored at 80 °C for RNA extraction.
To further reveal the possible functions of HvC3Hs, a total of 26 HvC3Hs were randomly selected to investigate their expression profile under various stresses by quantitative real-time PCR (qRT-PCR) analysis. HvACTIN (HORVU.MOREX.r2.5HG0378970) was used as the internal reference gene and the detail information of all the primers was listed in Supplementary Table S12. The total RNA was extracted by Plant RNA Kit reagent (Omega BioTek, USA), and cDNA synthesis was performed using 5X All-in-one RT MasterMix (ABM, Canada) according to the manufacturer’s instructions. The TB-Green ®Premix Ex Taq™ II kit (Takara, Dalian, China) was used to conduct qRT-PCR amplification in the Quant Studio™ Real-Time PCR system (Thermo Fisher, USA). The thermal cycling protocol was as follows: 95 °C temperature for 30 s, followed by 40 cycles of 95 °C for 3 s, and 30 s at 60 °C. The relative expression level was calculated by the 2−ΔΔCT method . Three technical replications were applied for each treatment. The T-test was conducted to evaluate the significance of the results using R. One asterisk (*) indicates 0.05 significance level and double asterisk (**) indicates 0.01 significance level, respectively.
Availability of data and materials
Data pertaining to the study have been included in the article or as supplementary material, further inquiries can be directed to the corresponding authors. The gene expression data and exome-capture resequencing data were downloaded from the NCBI database (http://www.ncbi.nlm.nih.gov/geo/) under BioProject accession number PRJEB14349, PRJEB13621, PRJEB18276, PRJNA382490 and PRJEB8044.
Abscisic Acid Responsive Elements
Anaerobic Induction Elements
Basic Local Alignment Search Tool
Ethylene Responsive Element
Expressed Sequence Tag
Fragments Per Kilobase per Million
Grand Average of Hydropathicity
Gene Structure Display Sever
Gene Transfer Format
Hidden Markov Model
Barley CCCH genes
International Barley Sequencing Consortium
Minor Allele Frequency
Myeloblastosis Binding Site
Multiple Expectation Maximization for Motif Elicitation
Million Years Ago
National Center for Biotechnology Information—Conserved Domains Database
Principal Component Analysis
Quantitative Real-time PCR
Reactive Oxygen Species
Arginine-Rich Tandem CCCH Zinc Finger
Simple Modular Architecture Research Tool
Sequence Read Archive
Single Nucleotide Polymorphism
Hall TMT. Multiple modes of RNA recognition by zinc finger proteins. Curr Opin Struct Biol. 2005;15(3):367–73.
Moore M, Ullman C. Recent developments in the engineering of zinc finger proteins. Brief Funct Genomic Proteomic. 2003;1(4):342–55.
Bai C, Tolias PP. Cleavage of RNA hairpins mediated by a developmentally regulated CCCH zinc finger protein. Mol Cell Biol. 1996;16(12):6661–7.
Wang D, Guo YH, Wu CG, Yang GD, Li YY, Zheng CC. Genome-wide analysis of CCCH zinc finger family in Arabidopsis and rice. Bmc Genomics. 2008;9:1–20.
Berg JM, Shi YG. The galvanization of biology: A growing appreciation for the roles of zinc. Science. 1996;271(5252):1081–5.
Peng XJ, Zhao Y, Cao JG, Zhang W, Jiang HY, Li XY, Ma Q, Zhu SW, Cheng BJ. CCCH-Type Zinc Finger Family in Maize: Genome-Wide Identification, Classification and Expression Profiling under Abscisic Acid and Drought Treatments. Plos One. 2012;7(7):e40120.
Bogamuwa SP, Jang JC. Tandem CCCH zinc finger proteins in plant growth, development and stress response. Plant Cell Physiol. 2014;55(8):1367–75.
Yan Z, Jia J, Yan X, Shi H, Han Y. Arabidopsis KHZ1 and KHZ2, two novel non-tandem CCCH zinc-finger and K-homolog domain proteins, have redundant roles in the regulation of flowering and senescence. Plant Mol Biol. 2017;95(6):549–65.
Seok HY, Bae H, Kim T, Mehdi SMM, Nguyen LV, Lee SY, Moon YH. Non-TZF Protein AtC3H59/ZFWD3 Is Involved in Seed Germination, Seedling Development, and Seed Development, Interacting with PPPDE Family Protein Desi1 in Arabidopsis. International Journal of Molecular Sciences. 2021;22(9):4738.
Kong ZS, Li MN, Yang WQ, Xu WY, Xue YB. A novel nuclear-localized CCCH-type zinc finger protein, OsDOS, is involved in delaying leaf senescence in rice. Plant Physiol. 2006;141(4):1376–88.
Jan A, Maruyama K, Todaka D, Kidokoro S, Abo M, Yoshimura E, Shinozaki K, Nakashima K, Yamaguchi-Shinozaki K. OsTZF1, a CCCH-Tandem Zinc Finger Protein, Confers Delayed Senescence and Stress Tolerance in Rice by Regulating Stress-Related Genes. Plant Physiol. 2013;161(3):1202–16.
Chen Y, Sun AJ, Wang M, Zhu Z, Ouwerkerk PBF. Functions of the CCCH type zinc finger protein OsGZF1 in regulation of the seed storage protein GluB-1 from rice. Plant Mol Biol. 2014;84(6):621–34.
Li QT, Lu X, Song QX, Chen HW, Wei W, Tao JJ, Bian XH, Shen M, Ma BA, Zhang WK, et al. Selection for a Zinc-Finger Protein Contributes to Seed Oil Increase during Soybean Domestication. Plant Physiol. 2017;173(4):2208–24.
Lu L, Wei W, Li QT, Bian XH, Lu X, Hu Y, Cheng T, Wang ZY, Jin M, Tao JJ, et al. A transcriptional regulatory module controls lipid accumulation in soybean. New Phytol. 2021;231(2):661–78.
Bogamuwa S, Jang JC. The Arabidopsis tandem CCCH zinc finger proteins AtTZF4, 5 and 6 are involved in light-, abscisic acid- and gibberellic acid- mediated regulation of seed germination. Plant, Cell Environ. 2013;36(8):1507–19.
Wang L, Xu Y, Zhang C, Ma Q, Joo SH, Kim SK, Xu ZH, Chong K. OsLIC, a Novel CCCH-Type Zinc Finger Protein with Transcription Activation, Mediates Rice Architecture via Brassinosteroids Signaling. Plos One. 2008;3(10):e3521.
Xie ZI, Yu GH, Lei SS, Zhang CC, Xu B, Huang BR: CCCH protein-PvCCCH69 acted as a repressor for leaf senescence through suppressing ABA-signaling pathway. Horticulture Research 2021, 8(1).
Seong SY, Shim JS, Bang SW, Kim JK. Overexpression of OsC3H10, a CCCH-Zinc Finger, Improves Drought Tolerance in Rice by Regulating Stress-Related Genes. Plants-Basel. 2020;9(10):1298.
Wang WY, Liu BH, Xu MY, Jamil M, Wang GP. ABA-induced CCCH tandem zinc finger protein OsC3H47 decreases ABA sensitivity and promotes drought tolerance in Oryza sativa. Biochem Biophys Res Commun. 2015;464(1):33–7.
Selvaraj MG, Jan A, Ishizaki T, Valencia M, Dedicova B, Maruyama K, Ogata T, Todaka D, Yamaguchi-Shinozaki K, Nakashima K, et al. Expression of the CCCH-tandem zinc finger protein gene OsTZF5 under a stress-inducible promoter mitigates the effect of drought stress on rice grain yield under field conditions. Plant Biotechnol J. 2020;18(8):1711–21.
Han GL, Wang MJ, Yuan F, Sui N, Song J, Wang BS. The CCCH zinc finger protein gene AtZFP1 improves salt resistance in Arabidopsis thaliana. Plant Mol Biol. 2014;86(3):237–53.
Seok HY, Nguyen LV, Park HY, Tarte VN, Ha J, Lee SY, Moon YH. Arabidopsis non-TZF gene AtC3H17 functions as a positive regulator in salt stress response. Biochem Biophys Res Commun. 2018;498(4):954–9.
Xie ZN, Lin WJ, Yu GH, Cheng Q, Xu B, Huang BR. Improved cold tolerance in switchgrass by a novel CCCH-type zinc finger transcription factor gene, PvC3H72, associated with ICE1-CBF-COR regulon and ABA-responsive genes. Biotechnology for Biofuels. 2019;12(1):1–1.
Bai HR, Lin P, Li X, Liao XQ, Wan LH, Yang XH, Luo YC, Zhang L, Zhang F, Liu SL, et al. DgC3H1, a CCCH zinc finger protein gene, confers cold tolerance in transgenic chrysanthemum. Scientia Horticulturae. 2021;281:109901.
Deng HQ, Liu HB, Li XH, Xiao JH, Wang SP. A CCCH-Type Zinc Finger Nucleic Acid-Binding Protein Quantitatively Confers Resistance against Rice Bacterial Blight Disease. Plant Physiol. 2012;158(2):876–89.
Yang Z, Wu YR, Li Y, Ling HQ, Chu CC. OsMT1a, a type 1 metallothionein, plays the pivotal role in zinc homeostasis and drought tolerance in rice. Plant Mol Biol. 2009;70(1–2):219–29.
Hunt AG, Xu RQ, Addepalli B, Rao S, Forbes KP, Meeks LR, Xing DH, Mo M, Zhao HW, Bandyopadhyay A, et al. Arabidopsis mRNA polyadenylation machinery: comprehensive analysis of protein-protein interactions and gene expression profiling. Bmc Genomics. 2008;9:1–5.
Chai GH, Hu RB, Zhang DY, Qi G, Zuo R, Cao YP, Chen P, Kong YZ, Zhou GK. Comprehensive analysis of CCCH zinc finger family in poplar (Populus trichocarpa). Bmc Genomics. 2012;13:1–22.
Xu RR. Genome-wide analysis and identification of stress-responsive genes of the CCCH zinc finger family in Solanum lycopersicum. Mol Genet Genomics. 2014;289(5):965–79.
Zhang CQ, Zhang HM, Zhao Y, Jiang HY, Zhu SW, Cheng BJ, Xiang Y. Genome-wide analysis of the CCCH zinc finger gene family in Medicago truncatula. Plant Cell Rep. 2013;32(10):1543–55.
Wang XL, Zhong Y, Cheng ZM. Evolution and Expression Analysis of the CCCH Zinc Finger Gene Family in Vitis vinifera. Plant Genome. 2014;7(3):1272.
Liu SR, Khan MRG, Li YP, Zhang JZ, Hu CE. Comprehensive analysis of CCCH-type zinc finger gene family in citrus (Clementine mandarin) by genome-wide characterization. Mol Genet Genomics. 2014;289(5):855–72.
Yuan SX, Xu B, Zhang J, Xie ZN, Cheng Q, Yang ZM, Cai QS, Huang BR. Comprehensive analysis of CCCH-type zinc finger family genes facilitates functional gene discovery and reflects recent allopolyploidization event in tetraploid switchgrass. Bmc Genomics. 2015;16:1–6.
Pi BY, He XH, Ruan Y, Jang JC, Huang Y. Genome-wide analysis and stress-responsive expression of CCCH zinc finger family genes in Brassica rapa. Bmc Plant Biology. 2018;18:1–5.
Cheng XR, Cao JJ, Gao C, Gao W, Yan SN, Yao H, Xu KL, Liu X, Xu DM, Pan X, et al. Identification of the wheat C3H gene family and expression analysis of candidates associated with seed dormancy and germination. Plant Physiol Biochem. 2020;156:524–37.
Pi B, Pan J, Xiao M, Hu X, Zhang L, Chen M, Liu B, Ruan Y, Huang Y. Systematic analysis of CCCH zinc finger family in Brassica napus showed that BnRR-TZFs are involved in stress resistance. BMC Plant Biol. 2021;21(1):555.
Hu X, Zuo JF. The CCCH zinc finger family of soybean (Glycine max L) genome-wide identification, expression, domestication. GWAS and haplotype analysis Bmc Genomics. 2021;22(1):1–9.
Mayer KFX, Waugh R, Langridge P, Close TJ, Wise RP, Graner A, Matsumoto T, Sato K, Schulman A, Muehlbauer GJ, et al. A physical, genetic and functional sequence assembly of the barley genome. Nature. 2012;491(7426):711.
Hudson BP, Martinez-Yamout MA, Dyson HJ, Wright PE. Recognition of the mRNA AU-rich element by the zinc finger domain of TIS11d. Nat Struct Mol Biol. 2004;11(3):257–64.
Kramer S, Kimblin NC, Carrington M. Genome-wide in silico screen for CCCH-type zinc finger proteins of Trypanosoma brucei, Trypanosoma cruzi and Leishmania major. Bmc Genomics. 2010;11:1–3.
Wang D, Guo Y, Wu C, Yang G, Li Y, Zheng C. Genome-wide analysis of CCCH zinc finger family in Arabidopsis and rice. BMC Genomics. 2008;9:44.
Ke YZ, Wu YW, Zhou HJ, Chen P, Wang MM, Liu MM, Li PF, Yang J, Li JN, Du H. Genome-wide survey of the bHLH super gene family in Brassica napus. Bmc Plant Biology. 2020;20(1):1–6.
Moore RC, Purugganan MD. The early stages of duplicate gene evolution. Proc Natl Acad Sci USA. 2003;100(26):15682–7.
Panchy N, Lehti-Shiu M, Shiu SH. Evolution of Gene Duplication in Plants. Plant Physiol. 2016;171(4):2294–316.
Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290(5494):1151–5.
Appels R, Eversole K, Feuillet C, Keller B, Rogers J, Stein N, Pozniak CJ, Choulet F, Distelfeld A, Poland J, et al. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science. 2018;361(6403):661.
Russell J, Mascher M, Dawson IK, Kyriakidis S, Calixto C, Freund F, Bayer M, Milne I, Marshall-Griffiths T, Heinen S, et al. Exome sequencing of geographically diverse barley landraces and wild relatives gives insights into environmental adaptation. Nature Genetics. 2016;48(9):1024.
Li T, Pan W, Yuan Y, Liu Y, Li Y, Wu X, Wang F, Cui L. Identification, Characterization, and Expression Profile Analysis of the mTERF Gene Family and Its Role in the Response to Abiotic Stress in Barley (Hordeum vulgare L). Frontiers in plant science. 2021;12:684619.
Fu YB. Population-based resequencing analysis of wild and cultivated barley revealed weak domestication signal of selection and bottleneck in the Rrs2 scald resistance gene region. Genome. 2012;55(2):93–104.
Ding YL, Shi YT, Yang SH. Molecular Regulation of Plant Responses to Environmental Temperatures. Mol Plant. 2020;13(4):544–64.
Purugganan MD. Evolutionary Insights into the Nature of Plant Domestication. Curr Biol. 2019;29(14):R705–14.
Haas M, Himmelbach A, Mascher M. The contribution of cis- and trans-acting variants to gene regulation in wild and domesticated barley under cold stress and control conditions. J Exp Bot. 2020;71(9):2573–84.
Mockler TC, Yu XH, Shalitin D, Parikh D, Michael TP, Liou J, Huang J, Smith Z, Alonso JM, Ecker JR, et al. Regulation of flowering time in Arabidopsis by K homology domain proteins. Proc Natl Acad Sci USA. 2004;101(34):12759–64.
Ripoll JJ, Rodriguez-Cazorla E, Gonzalez-Reig S, Andujar A, Alonso-Cantabrana H, Perez-Amador MA, Carbonell J, Martinez-Laborda A, Vera A. Antagonistic interactions between Arabidopsis K-homology domain genes uncover PEPPER as a positive regulator of the central floral repressor FLOWERING LOCUS C. Dev Biol. 2009;333(2):251–62.
Kim WC, Kim JY, Ko JH, Kang H, Kim J, Han KH. AtC3H14, a plant-specific tandem CCCH zinc-finger protein, binds to its target mRNAs in a sequence-specific manner and affects cell elongation in Arabidopsis thaliana. Plant J. 2014;80(5):772–84.
Kim WC, Ko JH, Han KH. Identification of a cis-acting regulatory motif recognized by MYB46, a master transcriptional regulator of secondary wall biosynthesis. Plant Mol Biol. 2012;78(4–5):489–501.
Han GL, Qiao ZQ, Li YX, Wang CF, Wang BS. The Roles of CCCH Zinc-Finger Proteins in Plant Abiotic Stress Tolerance. International Journal of Molecular Sciences. 2021;22(15):8327.
Sun JQ, Jiang HL, Xu YX, Li HM, Wu XY, Xie Q, Li CY. The CCCH-type zinc finger proteins AtSZF1 and AtSZF2 regulate salt stress responses in Arabidopsis. Plant Cell Physiol. 2007;48(8):1148–58.
Kumar S, Stecher G, Li M, Knyaz C, Tamura K. MEGA X: Molecular Evolutionary Genetics Analysis across Computing Platforms. Mol Biol Evol. 2018;35(6):1547–9.
Hu B, Jin JP, Guo AY, Zhang H, Luo JC, Gao G. GSDS 2 0 an upgraded gene feature visualization server. Bioinformatics. 2015;31(8):1296–7.
Tang H, Bowers JE, Wang X, Ming R, Alam M, Paterson AH. Synteny and collinearity in plant genomes. Science. 2008;320(5875):486–8.
Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609–12.
Wang Y, Wang QQ, Zhao Y, Han GM, Zhu SW. Systematic analysis of maize class III peroxidase gene family reveals a conserved subfamily involved in abiotic stress response. Gene. 2015;566(1):95–108.
Xu L, Dong ZB, Fang L, Luo YJ, Wei ZY, Guo HL, Zhang GQ, Gu YQ, Coleman-Derr D, Xia QY, et al. OrthoVenn2: a web server for whole-genome comparison and annotation of orthologous clusters across multiple species. Nucleic Acids Res. 2019;47(W1):W52–8.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, Garimella K, Altshuler D, Gabriel S, Daly M, et al. The Genome Analysis Toolkit: A MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303.
Cingolani P, Platts A, Wang LL, Coon M, Nguyen T, Wang L, Land SJ, Lu XY, Ruden DM. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w(1118); iso-2; iso-3. Fly. 2012;6(2):80–92.
Leigh JW, Bryant D. POPART: full-feature software for haplotype network construction. Methods Ecol Evol. 2015;6(9):1110–6.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(T)(-Delta Delta C) method. Methods. 2001;25(4):402–8.
We thank the High-Performance Computing center of Northwest A&F University for providing computational resources in this work.
This research was mainly funded by the National Natural Science Foundation of China (Grant No. 32060458), Jiangxi Natural Science Foundation (Grant No. 20202BAB215002), and partially supported by the Science and Technology Research Project of Jiangxi Provincial Department of Education (Grant No. GJJ200466). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Ethics approval and consent to participate
The barley (Hordeum vulgare) cultivar Morex was grown and collected by College of Agronomy, Northwest A&F University (Yangling, China), and all samples from this cultivar was adopted for all experiment. These plant materials don’t include any wild species at risk of extinction. No specific permits are required for sample collection in this study. We comply with relevant institutional, national, and international guidelines and legislation for plant study.
Consent for publication
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this article was revised: authors would like to switch Affiliation 1 and Affiliation 2 details.
About this article
Cite this article
Ai, Q., Pan, W., Zeng, Y. et al. CCCH Zinc finger genes in Barley: genome-wide identification, evolution, expression and haplotype analysis. BMC Plant Biol 22, 117 (2022). https://doi.org/10.1186/s12870-022-03500-4
- CCCH gene family
- Genetic variation
- Haplotype analysis
- Expression pattern