- Research article
- Open Access
Genome-wide analysis of the AREB/ABF gene lineage in land plants and functional analysis of TaABF3 in Arabidopsis
BMC Plant Biology volume 20, Article number: 558 (2020)
Previous studies have shown that ABFs (abscisic acid-responsive transcription factors) are important ABA-signaling components that participate in abiotic stress response. However, little is known about the function of ABFs in Triticum aestivum. In addition, although various ABFs have been identified in other species, the phylogenetic relationship between ABF transcription factors has not been systemically investigated in land plants.
In this study, we systemically collected ABFs from land plants and analyzed the phylogenetic relationship of these ABF genes. The ABF genes are present in all the land plants we investigated, including moss, lycophyte, monocots, and eudicots. Furthermore, these ABF genes are phylogenetically divided into seven subgroups, differentiations that are supported by variation in the gene structure, protein properties, and motif patterns. We further demonstrated that the expression of ABF genes varies among different tissues and developmental stages, and are induced by one or more environmental stresses. Furthermore, we found that three wheat ABFs (TaABF1, TaABF2, and TaABF3) were significantly induced by drought stress. Compared with wild-type (WT) plants, transgenic Arabidopsis plants overexpressing TaABF3 displayed enhanced drought tolerance.
These results provide important ground work for understanding the phylogenetic relationships between plant ABF genes. Our results also indicate that TaABFs may participate in regulating plant response to abiotic stresses.
Drought is a major environmental stressor that affects plant growth, survival, distribution, and productivity. Plants have evolved complex mechanisms in molecular, cellular, and physiological processes to respond to environmental stresses in order to survive . Stressful conditions induce the production of stress response genes in plants [2, 3]. The phytohormone abscisic acid (ABA) is an important hormone that regulates some critical biological processes in plants, such as stomatal movement, adaptation to drought stress, and seed germination [4,5,6,7]. The endogenous ABA is produced when plants encounter adverse environmental stresses such as prolonged periods of osmotic stress. Several stress-responsive genes were expressed due to these increased ABA levels. Additional research indicates that many stress-responsive genes can also be induced by the exogenous application of ABA [2, 7,8,9].
ABA detects stress in a unique way and acts as an endogenous messenger in plant cells by inducing a double-negative regulatory pathway where ABA is bound to the ABA receptors RCARs/PYR1/PYLs, forming the complex that provides an active site for the PP2Cs. This inhibits the ability of PP2C to act as a negative regulator of the pathway, leading to the induction of SnRK2 as a positive regulator of downstream signalling and subsequent phosphorylation of the target proteins [10, 11]. Thus, in the presence of ABA, the PP2Cs are inactivated to repress SnRK2 phosphatase activity. SnRK2 could then initiate the ABA-responsive regulation pathway and activate the most significant cis-element ABA-responsive element (ABRE) to regulate the expression of many genes under osmotic stress conditions. Subsequently, through the yeast one-hybrid method, a subgroup of bZIP transcription factors was isolated by using ABREs as bait [12, 13]. Genes of this subgroup of bZIP transcription factors primarily participated in osmotic stress response by regulating stress-related genes. In Arabidopsis, nine group-A bZIP proteins were found as homologs of AREB/ABFs, and phylogenetically divided into two subfamilies, the AREB/ABF subfamily (ABF1, ABF2/AREB1, ABF3 and ABF4/AREB2) and the ABI5/AtDPBF subfamily (AtDPBF1/ABI5, AtDPBF2, AtDPBF3/AREB3 and AtDPBF4/EEL) . ABF/AREB family members have four conserved domains, including two located in the C-terminus (which includes a highly conserved bZIP domain and a C4 domain) and three located in the N-terminus (which include C1, C2 and C3 domains) .
To date, all of these AREB/ABF genes in Arabidopsis have been functionally characterized. These four genes (ABF1, ABF2/AREB1, ABF3 and ABF4/AREB2) are primarily expressed in vegetative tissues [12, 13, 16, 17]. In addition, the induced ABF1 expression changes in response to abiotic stress are minimal , while ABF2/AREB1, ABF3 and ABF4/AREB2 are significantly up-regulated under ABA and osmotic stresses [12,13,14, 17,18,19,20]. Ectopic expression of these four genes in Arabidopsis showed that ABF1 is a functional homolog of ABF2/AREB1, ABF3 and ABF4/AREB2; and ABF2/AREB1, ABF3 and ABF4/AREB2 are the core ABA signaling components responding to abiotic stresses [16,17,18, 20, 21]. Moreover, the areb1areb2abf3 triple mutant and areb1areb2abf3abf1 quadruple mutant showed increased drought sensitivity and decreased ABA sensitivity by impairing the expression of ABA and osmotic stress-responsive genes [18, 22]. Additionally, the overexpression of many AREB/ABFs in various species have been shown to confer increased tolerance to osmotic stress [23,24,25,26,27]. Several studies have reported that AREB/ABF transgenic agricultural plants showed substantial increases in drought tolerance with little or no effect on growth [23, 25, 26].
Bread wheat (Triticum aestivum L.) is the most widely cultivated crop on earth, accounting for approximately one-fifth of the total calories consumed by humans . Consequently, wheat yields and production affect the global economy. However, its productivity is frequently hampered by water scarcity, making improved drought tolerance an important goal of many breeding programs. Although several studies have demonstrated the importance of ABFs in response to abiotic stresses, our knowledge of ABFs in wheat is still very limited. In this study, we systemically described the characteristics of plant ABFs, including gene members, phylogenetic relationships, gene structures, protein structural similarities and differences, and gene expression. We performed additional functional analyses of the wheat ABF gene TaABF3 by investigating drought tolerance in transgenic Arabidopsis plants. Our results provide an important framework for understanding the phylogenetic relationship between plant ABF genes and deepens our understanding of the function and mechanism of wheat ABF genes in responses to drought stress.
Identification and analysis of AREB/ABF family in plants
Based on the 34 genomes listed in the Phytozome database, we performed a genome-wide BLAST search using Arabidopsis ABF1, AREB1/ABF2, AREB2/ABF4, and ABF3 amino acid sequences. We found the candidate ABFs in only 29 land plants, including moss, lycophyte, monocots, and eudicots. Among the ABF sequences we identified, some proteins had shorter amino acid residues (fewer than 200 amino acids). These short sequences were eliminated from subsequent analyses. In the end, 190 ABF-like sequences were collected for further analysis. We subjected these 190 protein sequences to SMART and Pfam analyses, and all of them were classified into the protein family containing bZIP domains (Pfam: 00170).
Previous studies have reported that the plant group-A bZIP family proteins can be phylogenetically clustered into two major groups, the AREB/ABF and the ABI5/AtDPBF subfamilies . As such, we constructed a maximum likelihood (ML) tree, using 190 full-length ABF-like gene sequences (Additional file 1: Figure S1). Our results show that these ABF-like sequences are divided into two major clades, designated as group A and B, each having 95 identified sequences. Group A contains all experimentally characterized AREB/ABFs, including Arabidopsis, Thellungiella salsuginea, and rice ABFs [14, 15]. According to previously characterized genes, such as ABI5/AtDPBF1, AtDPBF2, AREB3/AtDPBF3 and EEL/AtDPBF4 , group B was classified as ABI5/AtDPBF subfamilies. Therefore, group A sequences are designated ABF and were included for further analyses (Additional file 2: Table S1). The number of ABFs in each species is shown in Fig. 1. In summary, the moss Physcomitrella patens and the lycophyte Selaginella moellendorffii have two copies of the ABFs. In monocots, all species contain only four copies of ABFs, with the exception of wheat, maize and Panicum virgatum. The ABF copy number differed, from one to seven, in eudicots. This indicates that several duplication incidents took place. The quantity of ABF paralogs in rice, Arabidopsis, and Thellungiella observed by this study are in line with previous research [14, 15].
We further analyzed protein length, molecular mass, and the pI values of 95 ABF proteins (Additional file 2: Table S1). According to our results, the length and molecular mass of ABFs ranged from 254 to 485 amino acid residues, and 27.81 to 52.95 kD, with a mean of 389 amino acid residues and 42.16 kD. Aquilegia coerulea_ABF1 is the longest and largest ABF (485 amino acid residues and 52.95 kD), while Citrus sinensis_ABF3 is the shortest and smallest ABF (254 amino acid residues and 27.81 kD). Zea mays_ABF5 has the lowest pI value, with 5.44, while Citrus sinensis_ABF3 has the highest value, with 10.42. ABFs in clades III, VI and VII have very close pI values, while the pI values of ABFs in clades I, II, IV, and V varied widely. Interestingly, ABFs from clade V displayed a tendency to maintain acidic pI values, with an average of 6.97, while more alkaline pI values (greater than 7) were observed in 83 out of 95 ABFs belonging to other clades (Table 1; Additional file 2: Table S1).
Phylogenetic and structural analysis of plant ABFs
In order to better understand the evolutionary relationship of AREB/ABF members in land plants, we further constructed an ML tree using full-length protein sequences of 95 ABFs. According to support values (85% or greater) of the phylogenetic tree, ABFs can be divided into seven clades (clades I to VII) (Fig. 1; Fig. 2). Inside the phylogenetic tree, ABFs from the moss Physcomitrella patens and the lycophyte Selaginella moellendorffii form two independent clades, assigned as clades I (P.patens_ABFs) to II (S.moellendorffii_ABFs). The monocots can be placed into the next two clades, IV and V. The eudicots can be divided into three clades: III, VI, and VII.
It is worth mentioning that the phylogenetic tree (Fig. 2) aligns with the species tree shown in Phytozome (Fig. 1) with the ABFs from moss (Physcomitrella patens) and lycophyte (Selaginella moellendorffii) forming the two basal lineages of land plants. Monocot and eudicot ABFs are closer on the phylogenetic tree and form two monophyletic clades. To further investigate the accuracy of the ABF phylogenetic tree, we analyzed the exon/intron organization for each individual gene (Additional file 3: Figure S2). Of the 95ABFs, one has one exon; three have two exons; four have three exons; 71 have four exons, 13 have five exons, two have six exons, and two have seven exons. Within each clade, the gene structure of ABFs is relatively conserved, and the adjacent ABFs have a similar exon/intron structure. We then investigated the intron phases of all ABF gene structures. There are three categories of intron phase: phase 0 intron, phase 1 intron, and phase 2 intron. Our analysis indicated that the intron phase patterns (0, 0, 0) and (0, 0, 0, 0) are the predominant patterns across 95 land plant ABFs (Additional file 3: Figure S2). This analysis indicated that we have constructed a phylogenetic tree of the ABF genes in land plants that is highly accurate.
Motif composition and arrangement of plant ABFs
In order to better understand the phylogenetic relationships between plant ABFs, we aligned all of the ABF sequences to better identify the conserved amino acid residues. Based on the alignment, 35 amino acid residues are completely conserved in 88 ABFs (except for eight shorter ABFs). We further identified the conserved motifs in 95 plant ABFs using the SMART program. Finally, we found fiveconserved protein motifs in all ABFs, which are BRLZ domain and the other four low complexity regions (LCR 1–4; Fig. 3a; Additional file 4: Figure S3). ABFs belong to the basic-leucine zipper (bZIP) domain transcription factor family, and we found that the BRLZ domains are highly conserved in all plant sequences (Fig. 4).
However, the ability of SMART to comprehensively identify the motifs present in ABFs is limited, so we used the MEME program to identify conservation and variation in the motif arrangements among ABFs. We identified 20 distinct motifs in ABFs. The occurrences and arrangements of the motifs in ABFs from seven major clades are shown in Fig. 3b and Additional file 5: Figure S4. Among 20 motifs, 8 motifs are shared by all ABFs, which are components of the BRLZ domain (motif 1 and 2) and the other four conserved low complexity regions (motif 3 and 6 for LCR1, motif 5 for LCR2, motif 4 for LCR3, and motif 7 for LCR4). Next, we examined the non-conserved motif composition in land plant ABFs. We then split the ABFs into four regions, based on the location of the LCR motifs and the BRLZ domain (Fig. 3b): Region 1 is the part before the LCR1, Region 2 is the part between LCR1 and LCR2, Region 3 is the part between LCR3 and the BRLZ domains (there were no motifs between LCR2 and LCR3), and Region 4 is the part between the BRLZ domain and LCR4. Of these four, Regions 2 and 4 are highly conserved in plants on land (they are mainly comprised of motifs 15 and 16). Less conserved is Region 1, which is primarily comprised of motif 11 in clades III, VI, and VII. Region 3 is the most divergent region: motif 8 was observed in clades I, III, IV, V, VI, and VII; motifs 9 and 10 were found in clades III, IV, VI, and VII; motifs 12 and 17 were found in clades III, V, VI, and VII; motifs 13 and 14 were found in clades III, VI,and VII; motifs 18 were found in clades III and VII; motif 19 was found in clades IV and VI; and motif 20 was found in clade V (Fig. 3b). Taken together, the conserved and non-conserved motif patterns of plant ABFs that we identified match the pattern of clades in the phylogenetic tree.
Expression analysis of plant ABF genes
To obtain the expression profiles of Arabidopsis ABFs, we extracted the expression data from the Arabidopsis eFP Browser (http://bar.utoronto.ca/efp/cgi-bin/efpWeb. cgi). We found that the expression of Arabidopsis ABF paralogs displayed tissue differentiation. For example, A.thaliana_ABF1 displayed significantly higher expression in roots, and A.thaliana_ABF2 displayed significantly higher expression in seeds, indicating that ABF paralogs have followed the trend of tissue subfunctionalization. We found that ABF paralogs in clade III (A.thaliana_ABF2) have higher expression levels than clade VI paralogs (A.thaliana_ABF1, A.thaliana_ABF3; Additional file 6: Figure S5A). We next investigated the expression profiles of other plant ABF genes. Our results demonstrated that soybean (Glycine max) and common bean (Phaseolus vulgaris) ABF paralogs are expressed more in leaves, roots, and flowers than in other tissues, and that ABF paralogs in clade III (G.max_ABF3, P.vulgaris_ABF2) have higher expression levels than clade VII paralogs (G.max_ABF1, G.max_ABF1, P.vulgaris_ABF1) (Additional file 6: Figure S5B and C). Within monocots, we studied the expression of the four ABF paralogs from rice (Oryza sativa), and found that O.sativa_OsABF2 (clade III) had higher expression levels than O.sativa_TRAB1, O.sativa_OsAREB1, and O.sativa_OsAREB2 (Additional file 6: Figure S5D). The five ABFs in maize (Zea mays) display similar expression patterns among tissues, except for Z.mays_ABF3, which is less expressed among all tissues (Additional file 6: Figure S5E). The expression divergence of plant ABFs indicated the functional differentiation of ABFs.
We also investigated the expression of Arabidopsis ABFs under abiotic stresses using microarray expression data. The results showed that the expression of all Arabidopsis ABFs was induced by ABA, cold temperatures, drought conditions, and high salinity, but the degrees of induction differed. A.thaliana_ABF1 was significantly induced by cold temperatures; A.thaliana_ABF2 was significantly induced by drought conditions; A.thaliana_ABF3 was significantly induced by ABA, drought conditions and salt; and A.thaliana_ABF4 was significantly induced by drought conditions and salt (Additional file 7: Figure S6). We further investigated the expression of other plant ABFs to abiotic stresses (ABA, drought and highly salinity) using quantitative real-time PCR (qRT-PCR). From the heatmap, we found that the expression of most ABFs was induced by ABA, drought conditions, and high salinity. Except for B.rapa_ABF7, G.max_ABF2, and Z.mays_ABF3, all ABFs were significantly induced by drought conditions. ABFs are known for their importance in ABA-mediated abiotic stress responses, meaning a significant induction in the ABF genes might play a crucial role in plant adaptation to environmental stresses (Fig. 5).
Molecular characterization and expression analysis of TaABFs
Phylogenetic analyses suggest that TaABFs might serve a role in regulating abiotic stress response in wheat. We cloned three TaABF genes from the wheat cv. Chinese Spring. Each gene had three homologous components in the A, B, and D genomes of wheat; we named them TaABF1-5A/B/D, TaABF2-7A/B/D, and TaABF3-6A/B/D. Additional phylogenetic analyses indicated that TaABF1 was most closely related to the rice OsAREB2, TaABF2 was most closely related to the rice OsABF2, and TaABF3 was most closely related to the rice OsAREB1 (Fig. 6a). An analysis of the protein sequence revealed that TaABFs displayed 55–98% sequence similarity (Fig. 6b). We then analyzed the subcellular localization of TaABF3, first constructing the expression cassette and fusing TaABF3 with the GFP protein. The fused proteins were then transiently expressed in Arabidopsis protoplasts. We used fluorescence microscopy to analyze and reveal that the TaABF3-GFP fusion proteins were exclusively localized in the nucleus in the transformed cells, while the control GFP was uniformly distributed throughout the cell (Fig. 7a). These results confirmed that TaABF3 is a nuclear-localized protein.
To examine the expression pattern of TaABFs, we first identified the cis-element in its region of promotion, which was ~ 2 kb upstream of the transcription initiation codon, finding a number of cis-acting elements related to stress response in the promoter of TaABFs. This includes LTR (low temperature-responsive element), MYB (MYB recognition site), MYC (MYC recognition site), MBS (MYB binding site involved in drought-inducibility), ABRE (ABA-responsive element), and DRE (Dehydration-responsive element) (Fig. 7b). In order to better understand the role that TaABFs play in response to drought conditions, we executed quantitative real-time PCR (qRT-PCR) on RNA taken from various tissues and conditions of drought. Considering the highly sequence similarity of wheat homeologous genes, the PCR primers were designed to amplify the conserved locus of three TaABF homeologs; for example, the relative expression level of TaABF1 represents the combined expression of all three TaABF homeologs (TaABF1-5A, TaABF1-5B and TaABF1-5D). The results demonstrated that TaABFs were found in higher levels in the leaves of the seedlings (Fig. 7c) and that under drought stress conditions, all TaABFs in wheat leaves were up-regulated (Fig. 7d).
Overexpression of TaABF3 confers drought tolerance in Arabidopsis
To better understand how TaABFs function in plant abiotic stress tolerance, we generated 35S::TaABF3-GFP transgenic Arabidopsis lines. We then selected three independent transgenic lines for 35S::TaABF3-GFP transgenic Arabidopsis that exhibited higher expression levels of TaABF3 in order to further analyze their response to drought stress (Additional file 8: Figure S7). We then compared the drought tolerance of transgenic and vector-transformed (WT) plants. We grew WT and each 35S::TaABF3-GFP transgenic plants for 3 weeks in soil before withholding water for ~14d. After the drought treatment and 6 days of re-watering, ~ 65–75% of the transgenic plants survived, while only ~ 8% of the WT plants survived (Fig. 8a and b).
We next assayed the proline contents, malondialdehyde (MDA) contents, and the soluble sugar contents in 35S::TaABF3-GFP transgenic and WT plants (Fig. 8c-e). Our results showed that in transgenic lines the proline contents and the soluble sugar contents were significantly higher and the MDA contents were significantly lower than in WT under both well-watered and drought conditions. We also detected the expression of several well-known drought-responsive genes in the transgenic lines, including Arabidopsis-homologous LEA14 , RD29A , DREB2A , RAB18 , RD20 , and GolS2 . These results showed that all of these genes were up-regulated in 35S::TaABF3-GFP transgenic lines (Fig. 8f). Collectively, these findings indicate that the overexpression of TaABF3 in Arabidopsis could enhance the drought tolerance of transgenic plants.
Transcription factors (TFs) are a group of regulatory proteins that regulate gene expression by binding to specific cis-acting elements in the promoters of target genes . Despite the fact that many studies have revealed the crucial role of AREB/ABF TFs in response to abiotic stresses [16,17,18], our knowledge of ABFs is still limited. Previous studies have primarily focused on studying the function of ABF/AREB proteins, whereas phylogenetic studies of ABFs are restricted to some model plants, such as Physcomitrella patens, Selaginella moellendorffii, Arabidopsis, and rice . To advance our understanding of the involvement of ABFs in stress response and other biological processes, it is essential to first understand their evolution and diversity. In this study, we collected most land plant group-A bZIP TFs from available genome databases (Fig. 1; Additional file 2: Table S1) and performed phylogenetic analyses with full coding sequences. This allowed us to identify the ABF clade within the bZIP TFs (Fig. 2), the intron/exon structure of genes (Additional file 3: Figure S2), and the characteristic protein domains (Fig. 3; Fig. 4; Additional file 4: Figure S3; Additional file 5: Figure S4). We next extracted expression profiles of selected plants from a public expression database and explored the functional differences of paralog genes during land plant evolution (Fig. 5; Additional file 6: Figure S5; Additional file 7: Figure S6). In addition, we systemically investigated the function of target genes of TaABF3 (Fig. 6; Fig. 7; Fig. 8). The goal of our study was to provide an overall picture of plant ABFs and deepen our knowledge of the function and mechanism of wheat ABF genes when responding to abiotic stresses.
At the protein level, ABFs in land plants share many of the same structural features, all ABFs have four conserved LCR motifs, and one BRLZ domain (Fig. 3a; Additional file 4: Figure S3). However, the differences of ABF proteins are also existed. For example, the protein structure between the LCR3 motif and the BRLZ domain exhibit the highly variable (Fig. 3b). This region requires additional research to further elucidate the differences in structure and function between the ABFs in land plants from various clades.
A robust phylogenetic tree is essential for tracing the evolutionary history of ABF genes. As sequencing techniques have advanced, increasing amounts of plant genomes have been sequenced and released. In this study, we surveyed 34 different plant genomes and collected 95 ABF genes. With the exception of algae, ABF candidates exist in all land plants, including lowland plants (a moss and a lycophyte) and highland plants (monocots and eudicots). It is increasingly apparent that gene families present in embryophytes (land plants) and absent from sequenced chlorophyte genomes have their origins in the kind of algae from which the ancestral land plant evolved. This indicates that the ABF gene family originated during the evolution of the algal to land plants. Previous studies favor the single-origin theory of land plants, originating from charophycean green algae [36, 37]. Moving from an aqueous to a gaseous environment subjects various plants to different physical conditions, which results in particular changes to their structure and physiology. Significant metabolic pathways, involving flavenoids, lignins, plant hormones, and cutins from vascular plants come from existing structures of the primary metabolism in charophycean algae . During this process, various families of genes evolved and helped land plants to adapt to challenging new environmental conditions, which included abiotic stressors. It is possible that ABF genes could be induced by several abiotic stresses, participating in stress response to abiotic factors [16,17,18]. This evolution of ABFs could have played an important role in allowing plants to adapt to conditions on land.
Our analysis of the phylogenetic relationship demonstrated that the ABF gene family underwent two changes that led to the seven distinct subfamilies (Fig. 2). The first instance happened after Selaginella moellendorffii and Physcomitrella patens diverged from a common ancestor, that of seed plants. This occurrence is consistent with the known patterns of divergence in land plants, where Selaginella moellendorffii and Physcomitrella patens are the precursors of the seed plants. Following this event, the family of ABF genes could have been limited to their historical functions. However, our phylogenetic tree shows that the second instance of duplication that resulted in lineages similar to ABF happened in seed plants. Prior research has found that a whole-genome duplication (WGD) event that occurred in an ancestor of extant angiosperms produced exact copies of each gene [38, 39]. Monocots have seen many instances of WGD throughout their history, which are surely responsible for the high instance of ABF genes (Fig. 1) . All of the ABFs in monocots were found in clades IV and V, which is evidence of a duplication event early in the evolutionary history of monocots. In contrast, eudicot ABFs were all found in clades III, VI, and VII. Members of clade III are paralogs of Arabidopsis ABF2, while members of clade VI are paralogs of ABF1, ABF3, and ABF4. Members of clade VII are more similar to members of clade VI but are not found in the paralogs of Arabidopsis. All of this is evidence of the functional similarity of ABFs, while the differences between the clades are indicative of functional differentiation between the clades. Recent research has shown that ABFs are involved in ABA signaling when responding to abiotic stressors [16,17,18,19,20,21,22,23,24,25,26,27], while the functional differences between ABF genes remain scarce. One study found that ABF2, ABF3, and ABF4 play important roles when regulating the mediation of ABA-triggered Chl degradation as well as leaf senescence in Arabidopsis . This demonstrates evolutionary divergence in the functionality of ABFs,but in order to understand the practical differences within the lineage of ABFs, further research is required.
The responsiveness of ABF genes to abiotic stress strongly suggests that they serve roles in adapting to changing environmental conditions. Our qRT-PCR analyses further revealed that all TaABF genes were induced by drought stress (Fig. 7). To investigate the role of TaABF genes in the abiotic stress response, TaABF3 was transformed into Arabidopsis, and its overexpression was confirmed by RT-PCR (Additional file 8: Figure S7). The transgenic plants showed significantly improved drought and salt tolerance compared to WT plants (Fig. 8). Consistently, several stress-responsive genes, including LEA14, RD29A, DREB2A, RAB18,RD20, and GolS2 were found to be significantly up-regulated in TaABF3 transgenic Arabidopsis under drought stress (Fig. 8). This research strongly indicates that TaABF3 increases the tolerance of transgenic Arabidopsis plants to drought conditions. Prior research has shown that an overexpression of TF genes can slow the growth of transgenic plants [42,43,44,45]. We also closely monitored the growth and morphological features of TaABF3 transgenic Arabidopsis plants, finding that transgenic plants exhibited a slight reduction in the size of rosette leaves (Fig. 8).
In summary, our study provides a comprehensive analysis of the plant ABF genes, include phylogenetic relationships, gene structures, protein structures and properties, and expression profiles. Phylogenetic analysis combined with gene structure and motif composition clustered the plant ABFs into seven distinct clades. In addition, expression analyses demonstrate that plant ABFs have extensively induced by abiotic stress. Further functional analysis of TaABF3 trangenic Arabidopsis showed that they could confers drought tolerance in plants. Our results will help elucidate the functions of the AREB/ABF lineage in plants, and providing clues for the identification of candidate genes involved in abiotic stress responses in plants.
Plant materials and stress treatments
After subjecting four plant species (Brassica rapa, Glycine max, Oryza sativa, and Zea mays) to different stress conditions, we assayed the expression of ABFs. We obtained Brassica rapa cv. ZS11, Glycine max cv. Jidou-7, Oryza sativa cv. Nipponbare, and Zea mays cv. B73 from Northwest A&F University, though these strains also could have been acquired from the Chinese Crop Germplasm Resources Information System (http://www.cgris.net/zhongzhidinggou/index.php). Growth conditions and the application of stress conditions proceeded according to the following: we germinated 1‰ (v/v) Topsin-M sterilized seeds at 25 °C for 3 days on wet filter paper. The germinated seeds were then grown hydroponically, with Hoagland nutrient solution, under a 16 h light/8 h dark photoperiod in an artificially controlled climate chamber at 25 °C. The abscisic acid (ABA) was then applied to the cultivated seedlings, followed by high salinity conditions, and finally, drought conditions. The three-week-old seedlings were placed into a 200 mmol/L NaCl solution for the high-salinity treatment, into a 100 μmol/L ABA culture solution for the ABA treatment, and on a clean bench for the drought treatment (where they were dehydrated at 25 °C and relative humidity of 40–60%). The whole seedlings were collected 0, 2, and 10 h after subjecting them to stress conditions. We collected a minimum of five seedlings from each plant species at each time point, while each experiment was performed three times. All samples were subsequently frozen in liquid nitrogen and refrigerated at − 80 °C prior to RNA extraction.
ABF genes identification
For Arabidopsis, ABFs (AREB1/ABF2, AREB2/ABF4, ABF1 and ABF3) were used to conduct a TBLASTN query in the Phytozome databases (http://www.phytozome.net/). We used 34 plant species, including algae (Chlamydomonas reinhardtii, Coccomyxa subellipsoidea C-169, Micromonas pusilla CCMP1545, Ostreococcus lucimarinus, and Volvox carteri), moss (Physcomitrella patens), lycophyte (Selaginella moellendorffii), monocots (Triticum aestivum, Oryza sativa, Panicum virgatum, Sorghum bicolor, Setaria italica, and Zea mays), and eudicots (Aquilegia coerulea, Arabidopsis lyrata, Arabidopsis thaliana, Brassica rapa, Capsella rubella, Carica papaya, Citrus clementina, Citrus sinensis, Cucumis sativus, Eucalyptus grandis, Fragaria vesca, Glycine max, Gossypium raimondii, Manihot esculenta, Mimulus guttatus, Phaseolus vulgaris, Prunus persica, Ricinus communis, Solanum lycopersicum, Thellungiella salsuginea, Theobroma cacao, and Vitis vinifera). The amino acid sequences, cDNA, and genomic DNA associated with each putative ABF or ABF were obtained from the Phytozome database, while we used the Simple Modular Architecture Research Tool (SMART; http://smart.emblheidelberg. de/smart/set_mode.cgi?NORMAL = 1) to identify ABFs with protein structures containing bZIP and other common domains. The Compute pI/Mw tool, from ExPASy (http://web.expasy.org/compute_pi/), was used to generate the theoretical molecular mass and Pi (isoelectric point) values.
Phylogenetic tree construction
The TranslatorX server (http://translatorx.co.uk/)  was used to align the coding sequences (CDS), while we conducted a jModelTest analysis  to identify the model with the best fit. We created a ML (maximum likelihood) tree using the online program RAxML (http://www.trex.uqam.ca/index.php?action=raxml&project=trex) , via the best-fit model with 100 bootstrap samples. FigTree (http://tree.bio.ed.ac.uk/software/ figtree/) was used to visualize the phylogenetic tree.
Analysis of gene structure analysis and conserved motif detection
The online Gene Structure Display Server (GSDS; http://gsds.cbi.pku.edu.cn) was used to assess the distribution of introns and exons and intron phase patterns. The Multiple Expectation Maximization for Motif Elicitation program (MEME; http://meme.nbcr.net/meme/cgi-bin/meme.cgi) was used to obtain the functional motifs of ABF proteins, using these parameters: maximum number of motifs = 20, optimum motif width = 6 to 100 residues, distribution of motifs = any number of repetitions.
ABF gene expression profile
qRT-PCR was used to assess ABF expression patterns under different stress conditions, and TRIZOL reagent (Biotopped) was used to isolate total RNA using at least five seedlings from the three separate experiments. Total RNA was treated with Rnase-free DNAse (Takara) to remove genetic contamination. A Nanodrop1000 (Thermo Scientific product, USA) was used to measure the total RNA levels, while 5 μg of total RNA was run on 0.8% agarose gel from each sample to validate the number and integrity of RNA. The cDNAs were synthesized using recombinant M-MLV reverse transcriptase and total RNA (1 μg) mixed with 1 μg Oligo (dT)23 (Promega). The PCR conditions involved a preliminary denaturation for 10 min at 95 °C, 40 cycles of 15 s at 95 °C, and 40 cycles of 30 s at 60 °C. The internal control was TaActin (TraesCS1A01G274400). We applied the quantification method (2-ΔCt) and approximated the expression variation using three biological replicates . Additional file 9: Table S2 outlines the primers used in this study.
Subcellular localization of TaABF3-GFP fusion proteins
Prof. Zhensheng Kang’s Lab (Northwest A&F University, China) provided the Triticum aestivum cv. Chinese spring, which was used in the functional analysis of TaABF3. The full-length CDS sequence of TaABF3 was amplified using PCR from the wheat cv. Chinese Spring with specific primers for the subcellular localization assay of TaABF3. This was then placed into the binary vector pCAMV35S::GFP, in between BamH I and Xba I, to find the subcellular localization of TaABF3. Sequencing was used to obtain positive clones, and the wheat mesophyll protoplasts were obtained from the constructs using the methods previously described . A confocal microscope (Olympus, FluoViewTM FV300, Japan) was used to assess GFP fluorescence.
Transformation of Arabidopsis and isolation of TaABF3
Prof. Zhensheng Kang’s Lab (Northwest A&F University, China) provided the Arabidopsis ecotype Columbia, which was used to transform TaABF3. We amplified the full-length opening reading frame of TaABF3 from the wheat cv. Chinese Spring, using gene-specific primers which were subsequently cloned with the cauliflower mosaic virus (CaMV) 35S promoter into the pGreen0029-GFP vector. We then introduced the recombinant vector (35S::TaABF3-GFP) into Agrobacterium tumefaciens, and used the floral dip method  to transform it into Arabidopsis (Arabidopsis thaliana; ecotype Columbia). T1 seeds were placed on a MS medium containing 2% sucrose and 50 mg/mL kanamycin to identify the transformants. Phenotypic analyses were performed using homozygous T3 plants.
Drought tolerance assay
We placed germinated, seven-day-old transgenic Arabidopsis plants on an MS medium into pots with a 130 g mix of 2:1 mixture of Jiffy mix and vermiculite to perform the drought tolerance assays. The 32-day-old plants grown under optimal conditions (22 °C, relative humidity of 60%, 16/8 h light/dark photoperiod) were subjected to drought stress conditions by withholding water from the plant for 14 days, after which they were watered and allowed to recover. We then counted how many plants survived after 6 days. A minimum of 48 plants from each line were analyzed against wild-type (WT) plants for each test. Statistical data shown is based on data obtained from the three independent experiments. We used a student’s t-test to analyze the differences between transgenic and WT plants.
Measuring proline, MDA, and soluble sugar levels
We measured the proline contents, MDA levels, and soluble sugar levels of the transgenic and WT plants subjected to 10 days of drought stress, at which most leaves began to wilt, using detection kits (Solarbio) according to the manufacturer’s instructions.
Each experiment was conducted a minimum of three times. Data shown are the mean ± standard deviation (SD) of the three independent replicates. A Student’s t-test was used to perform the statistical analysis, while P < 0.05 was considered statistically significant and P < 0.01 was considered extremely significant.
Availability of data and materials
The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.
Abscisic acid-responsive transcription factors
Basic local alignment search tool
Gene Expression Omnibus
Gene Structure Display Server
low complexity region
low temperature responsive element
MYB binding site involved in drought-inducibility
Multiple Expectation Maximization for Motif Elicitation
MYB recognition site
MYC recognition site
Simple Modular Architecture Research Tool
Nakashima K, Takasaki H, Mizoi J, Shinozaki K, Yamaguchi-Shinozaki K. NAC transcription factors in plant abiotic stress responses. Biochim Biophys Acta. 2012;1819(2):97–103.
Yamaguchi-Shinozaki K, Shinozaki K. Transcriptional regulatory networks in cellular responses and tolerance to dehydration and cold stresses. Annu Rev Plant Biol. 2006;57:781–803.
Zhu JK. Abiotic stress signaling and responses in plants. Cell. 2016;167(2):313–24.
Finkelstein RR, Gampala SS, Rock CD. Abscisic acid signaling in seeds and seedlings. Plant Cell. 2002;14(Suppl):S15–45.
Himmelbach A, Yang Y, Grill E. Relay and control of abscisic acid signaling. Curr Opin Plant Biol. 2003;6(5):470–9.
Munemasa S, Hauser F, Park J, Waadt R, Brandt B, Schroeder JI. Mechanisms of abscisic acid-mediated control of stomatal aperture. Curr Opin Plant Biol. 2015;28:154–62.
Hauser F, Li Z, Waadt R, Schroeder JI. SnapShot: Abscisic acid signaling. Cell. 2017;171(7):1708.
Zhu JK. Salt and drought stress signal transduction in plants. Annu Rev Plant Biol. 2002;53:247–73.
Fujita Y, Fujita M, Shinozaki K, Yamaguchi-Shinozaki K. ABA-mediated transcriptional regulation in response to osmotic stress in plants. J Plant Res. 2011;124(4):509–25.
Ma Y, Szostkiewicz I, Korte A, Moes D, Yang Y, Christmann A, Grill E. Regulators of PP2C phosphatase activity function as abscisic acid sensors. Science. 2009;324(5930):1064–8.
Park SY, Fung P, Nishimura N, Jensen DR, Fujii H, Zhao Y, Lumba S, Santiago J, Rodrigues A, Chow TF, Alfred SE, Bonetta D, Finkelstein R, Provart NJ, Desveaux D, Rodriguez PL, McCourt P, Zhu JK, Schroeder JI, Volkman BF, Cutler SR. Abscisic acid inhibits type 2C protein phosphatases via the PYR/PYL family of START proteins. Science. 2009;324(5930):1068–71.
Choi H, Hong J, Ha J, Kang J, Kim SY. ABFs, a family of ABA responsive element binding factors. J Biol Chem. 2000;275(3):1723–30.
Uno Y, Furihata T, Abe H, Yoshida R, Shinozaki K, Yamaguchi-Shinozaki K. Arabidopsis basic leucine zipper transcription factors involved in an abscisic acid-dependent signal transduction pathway under drought and high-salinity conditions. Proc Natl Acad Sci U S A. 2000;97(21):11632–7.
Fujita Y, Yoshida T, Yamaguchi-Shinozaki K. Pivotal role of the AREB/ABF-SnRK2 pathway in ABRE-mediated transcription in response to osmotic stress in plants. Physiol Plant. 2013;147(1):15–27.
Vysotskii DA, de Vries-van Leeuwen IJ, Souer E, Babakov AV, de Boer AH. ABF transcription factors of Thellungiella salsuginea: structure, expression profiles and interaction with 14-3-3 regulatory proteins. Plant Signal Behav. 2013;8(1):e22672.
Kang JY, Choi HI, Im MY, Kim SY. Arabidopsis basic leucine zipper proteins that mediate stress-responsive abscisic acid signaling. Plant Cell. 2002;14(2):343–57.
Fujita Y, Fujita M, Satoh R, Maruyama K, Parvez MM, Seki M, Hiratsu K, Ohme-Takagi M, Shinozaki K, Yamaguchi-Shinozaki K. AREB1 is a transcription activator of novel ABRE-dependent ABA signaling that enhances drought stress tolerance in Arabidopsis. Plant Cell. 2005;17(12):3470–88.
Yoshida T, Fujita Y, Maruyama K, Mogami J, Todaka D, Shinozaki K, Yamaguchi-Shinozaki K. Four Arabidopsis AREB/ABF transcription factors function predominantly in gene expression downstream of SnRK2 kinases in abscisic acid signalling in response to osmotic stress. Plant Cell Environ. 2015;38(1):35–49.
Fujii H, Chinnusamy V, Rodrigues A, Rubio S, Antoni R, Park SY, Cutler SR, Sheen J, Rodriguez PL, Zhu JK. In vitro reconstitution of an abscisic acid signalling pathway. Nature. 2009;462(7273):660–4.
Furihata T, Maruyama K, Fujita Y, Umezawa T, Yoshida R, Shinozaki K, Yamaguchi-Shinozaki K. Abscisic acid-dependent multisite phosphorylation regulates the activity of a transcription activator AREB1. Proc Natl Acad Sci U S A. 2006;103:1988–93.
Wang Z, Su G, Li M, Ke Q, Kim SY, Li H, Huang J, Xu B, Deng XP, Kwak SS. Overexpressing Arabidopsis ABF3 increases tolerance to multiple abiotic stresses and reduces leaf size in alfalfa. Plant Physiol Biochem. 2016;109:199–208.
Yoshida T, Fujita Y, Sayama H, Kidokoro S, Maruyama K, Mizoi J, Shinozaki K, Yamaguchi-Shinozaki K. AREB1, AREB2, and ABF3 are master transcription factors that cooperatively regulate ABRE-dependent ABA signaling involved in drought stress tolerance and require ABA for full activation. Plant J. 2010;61(4):672–85.
Oh SJ, Song SI, Kim YS, Jang HJ, Kim SY, Kim M, Kim YK, Nahm BH, Kim JK. Arabidopsis CBF3/DREB1A and ABF3 in transgenic rice increased tolerance to abiotic stress without stunting growth. Plant Physiol. 2005;138(1):341–51.
Chinnusamy V, Zhu JK, Sunkar R. Gene regulation during cold stress acclimation in plants. Methods Mol Biol. 2010;639:39–55.
Orellana S, Yañez M, Espinoza A, Verdugo I, González E, Ruiz-Lara S, Casaretto JA. The transcription factor SlAREB1 confers drought, salt stress tolerance and regulates biotic and abiotic stress-related genes in tomato. Plant Cell Environ. 2010;33(12):2191–208.
Li C, Yue J, Wu X, Xu C, Yu J. An ABA-responsive DRE-binding protein gene from Setaria italica, SiARDP, the target gene of SiAREB, plays a critical role under drought stress. J Exp Bot. 2014;65(18):5415–27.
Kerr TC, Abdel-Mageed H, Aleman L, Lee J, Payton P, Cryer D, Allen RD. Ectopic expression of two AREB/ABF orthologs increases drought tolerance in cotton (Gossypium hirsutum). Plant Cell Environ. 2018;41(5):898–907.
IWGSC. Shifting the limits in wheat research and breeding using a fully annotated reference genome. Science. 2018;361(6403). https://pubmed.ncbi.nlm.nih.gov/30115783/.
Hundertmark M, Hincha DK. LEA (late embryogenesis abundant) proteins and their encoding genes in Arabidopsis thaliana. BMC Genomics. 2008;9:118.
Nakashima K, Fujita Y, Katsura K, Maruyama K, Narusaka Y, Seki M, Shinozaki K, Yamaguchi-Shinozaki K. Transcriptional regulation of ABI3- and ABA-responsive genes including RD29B and RD29A in seeds, germinating embryos, and seedlings of Arabidopsis. Plant Mol Biol. 2006;60(1):51–68.
Sakuma Y, Maruyama K, Osakabe Y, Qin F, Seki M, Shinozaki K, Yamaguchi-Shinozaki K. Functional analysis of an Arabidopsis transcription factor, DREB2A, involved in drought-responsive gene expression. Plant Cell. 2006;18(5):1292–309.
Nylander M, Svensson J, Palva ET, Welin BV. Stress-induced accumulation and tissue-specific localization of dehydrins in Arabidopsis thaliana. Plant Mol Biol. 2001;45(3):263–79.
Aubert Y, Vile D, Pervent M, Aldon D, Ranty B, Simonneau T, Vavasseur A, Galaud JP. RD20, a stress-inducible caleosin, participates in stomatal control, transpiration and drought tolerance in Arabidopsis thaliana. Plant Cell Physiol. 2010;51(12):1975–87.
Selvaraj MG, Ishizaki T, Valencia M, Ogawa S, Dedicova B, Ogata T, Yoshiwara K, Maruyama K, Kusano M, Saito K, Takahashi F, Shinozaki K, Nakashima K, Ishitani M. Overexpression of an Arabidopsis thaliana galactinol synthase gene improves drought tolerance in transgenic rice and increased grain yield in the field. Plant Biotechnol J. 2017;15(11):1465–77.
Badis G, Berger MF, Philippakis AA, Talukder S, Gehrke AR, Jaeger SA, Chan ET, Metzler G, Vedenko A, Chen X, Kuznetsov H, Wang CF, Coburn D, Newburger DE, Morris Q, Hughes TR, Bulyk ML. Diversity and complexity in DNA recognition by transcription factors. Science. 2009;324(5935):1720–3.
Kenrick P, Crane PR. The origin and early evolution of plants on land. Nature. 1997;389:33–9.
Morris JL, Puttick MN, Clark JW, Edwards D, Kenrick P, Pressel S, Wellman CH, Yang Z, Schneider H, Donoghue PCJ. The timescale of early land plant evolution. Proc Natl Acad Sci U S A. 2018;115(10):E2274–83.
Jiao YN, Wickett NJ, Ayyampalayam S, Chanderbali AS, Landherr L, Ralph PE, Tomsho LP, Hu Y, Liang HY, Soltis PS, Clifton SW, Schlarbaum SE, Schuster SC, Ma H, Leebens-Mack J, dePamphilis CW. Ancestral polyploidy in seed plants and angiosperms. Nature 2011;473:97–100.
Jiao YN, Paterson AH. Polyploidy-associated genome modifications during land plant evolution. Philos Transact Royal Soc London Series B. 2014;369:20130355.
Jiao YN, Li JP, Tang HB, Paterson AH. Integrated syntenic and phylogenetic analyses reveal and ancient genome duplication in monocots. Plant Cell. 2014;26:2792–802.
Gao S, Gao J, Zhu X, Song Y, Li Z, Ren G, Zhou X, Kuai B. ABF2, ABF3, and ABF4 promote ABA-mediated chlorophyll degradation and leaf senescence by transcriptional activation of chlorophyll catabolic genes and senescence- associated genes in Arabidopsis. Mol Plant. 2016;9(9):1272–85.
Sun Y, Fan XY, Cao DM, Tang W, He K, Zhu JY, He JX, Bai MY, Zhu S, Oh E, Patil S, Kim TW, Ji H, Wong WH, Rhee SY, Wang ZY. Integration of brassinosteroid signal transduction with the transcription network for plant growth regulation in Arabidopsis. Dev Cell. 2010;19(5):765–77.
Zhang W, Ruan J, Ho TH, You Y, Yu T, Quatrano RS. Cis-regulatory element based targeted genefinding: genome-wide identification of abscisic acid- and abiotic stress-responsive genes in Arabidopsis thaliana. Bioinformatics. 2005;21(14):3074–81.
Lu Z, Yu H, Xiong G, Wang J, Jiao Y, Liu G, Jing Y, Meng X, Hu X, Qian Q, Fu X, Wang Y, Li J. Genome-wide binding analysis of the transcription activator ideal plant architecture1 reveals a complex network regulating rice plant architecture. Plant Cell. 2013;25(10):3743–59.
Birkenbihl RP, Kracher B, Somssich IE. Induced genome-wide binding of three Arabidopsis WRKY transcription factors during early MAMP-triggered immunity. Plant Cell. 2017;29(1):20–38.
Abascal F, Zardoya R, Telford MJ. TranslatorX: multiple alignment of nucleotide sequences guided by amino acid translations. Nucleic Acids Res. 2010;38:W7–13.
Posada D. jModelTest: phylogenetic model averaging. Mol Biol Evol. 2008;25(7):1253–6.
Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22(21):2688–90.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCt method. Methods. 2001;25(4):402–8.
Yoo SD, Cho YH, Sheen J. Arabidopsis mesophyll protoplasts: a versatile cell system for transient gene expression analysis. Nat Protoc. 2007;2(7):1565–72.
Clough SJ, Bent AF. Floral dip: a simplified method for agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998;16(6):735–43.
We thank reviewers for checking our manuscript and the editors for editing the paper. We would like to thank the members of the Bioinformatics Center of Northwest A & F University for their useful input.
This research was supported by Talent Fund of Northwest A&F University (grant no. Z111021602). The funders had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Phylogenetic relationship of group-A bZIP TFs from 29 plant species. The land plant group-A bZIP TFs are grouped into two major clades, designated as AREB/ABF and ABI5/AtDPBF subfamilies.
ABF protein properties and their clade-distributions in land plants. Gene identifiers are obtained from Phytozome database.
Schematic diagram of gene structures of 95 plant ABFs. The thin lines represent introns and thick bars represent exons. The numbers above the gene structure indicate intron phases. A scale bar with a unit of base pair (bp) is graphed on the bottom.
Alignment of 95 plant ABF protein sequences. The alignment was generated using ClustalW implemented in Geneious software and represented as thick lines (aligned characters) and thin lines (gaps). Overall alignment identity and a scale bar indicating the numbers of amino acid residues are graphed on the top.
Combined motif diagram of 95 ABF proteins. Thick lines represent the ABF proteins. Different colored boxes represent separate and distinct motifs identified using MEME program. A scale bar indicating the numbers of amino acid residues is shown on the top. Motifs are drawn approximately to scale as boxes.
Gene expression profile of ABF paralogs in plants. Gene expression data was extracted from Arabidopsis thaliana (http://jsp.weigelworld. org/expviz/expviz.jsp), soybean (Glycine max, http://soybase.org/soyseq/), common bean (Phaseolus vulgaris, http://plantgrn.noble.org/PvGEA/SearchVisual.jsp), maize (Zea mays, http://www.plexdb.org/index.php), and rice (Oryza sativa, http://www.plexdb.org/index.php).
Gene expression profile of ABF paralogs in Arabidopsis thaliana under different abiotic stresses. The mean-normalized expression values were obtained from the AtGenExpress microarray database via the web http://jsp.weigelworld.org/ expviz/expviz.jsp.
RT-PCR analysis of TaABF3 transcription levels in the transgenic Arabidopsis lines.
Primers used in this research.
About this article
Cite this article
Li, F., Mei, F., Zhang, Y. et al. Genome-wide analysis of the AREB/ABF gene lineage in land plants and functional analysis of TaABF3 in Arabidopsis. BMC Plant Biol 20, 558 (2020). https://doi.org/10.1186/s12870-020-02783-9
- Land plants
- Phylogenetic relationship
- Expression analysis
- Drought stress