Skip to main content

Genome-wide identification and characterization of bZIP gene family and cloning of candidate genes for anthocyanin biosynthesis in pomegranate (Punica granatum)



The basic leucine zipper (bZIP) transcription factor is one of the most abundant and conserved gene families in eukaryotes. In addition to participating in plant development and growth, bZIP transcription factors play crucial roles in various abiotic stress responses and anthocyanin accumulation. Up to now, analysis of bZIP gene family members in pomegranate (Punica granatum) has not been reported. Three published pomegranate genome sequences provide valuable resources for further gene function analysis.


Using bioinformatics analysis, 65 PgbZIPs were identified and analyzed from the ‘Taishanhong’ pomegranate genome. We divided them into 13 groups (A, B, C, D, E, F, G, H, I, J, K, M, and S) according to the phylogenetic relationship with those of Arabidopsis, each containing a different number of genes. The regularity of exon/intron number and distribution was consistent with the classification of groups in the evolutionary tree. Transcriptome analysis of different tissues showed that members of the PgbZIP gene family were differentially expressed in different developmental stages and tissues of pomegranate. Among them, we selected PgbZIP16 and PgbZIP34 as candidate genes which affect anthocyanin accumulation. The full-length CDS region of PgbZIP16 and PgbZIP34 were cloned from pomegranate petals by homologous cloning technique, encoding 170 and 174 amino acids, which were 510 bp and 522 bp, respectively. Subcellular localization assays suggested that both PgbZIP16 and PgbZIP34 were nucleus-localized. Real-time quantitative PCR (qPCR) was used to explore the expression of PgbZIP16 and PgbZIP34 in the petals of three kinds of ornamental pomegranates at the full flowering stage. The results demonstrated that the expression of PgbZIP16 in red petals was 5.83 times of that in white petals, while PgbZIP34 was 3.9 times. The results of transient expression in tobacco showed that consistent trends were observed in anthocyanin concentration and expression levels of related genes, which both increased and then decreased. Both PgbZIP16 and PgbZIP34 could promote anthocyanin accumulation in tobacco leaves. We obtained transgenic strains overexpressing PgbZIP16, and the histochemical staining for GUS activity showed that overexpressed PgbZIP16 seedlings were expressed in the stem. Transgenic experiments indicated that overexpression of PgbZIP16 significantly upregulated UF3GT, ANS and DFR genes in Arabidopsis and enhanced anthocyanin accumulation.


The whole genome identification, gene structure, phylogeny, gene cloning, subcellular location and functional verification of the pomegranate bZIP gene family provide a theoretical foundation for the functional study of the PgbZIP gene family and candidate genes for anthocyanin biosynthesis.

Peer Review reports


Anthocyanins belonging to flavonoids are water-soluble pigments providing various color for plants, especially in fruits and flowers [1]. Anthocyanin accumulation in plants is usually to resist abiotic stress, such as drought, ultraviolet radiation, hormone and low temperature [2,3,4,5]. Recently, anthocyanins had potential health benefits that they deserved increasing attention. It has presented that daily intake of natural foods rich in anthocyanin has a potent protective effect on the human body. Moreover, it can play a role in preventing cardiovascular disease and obesity [6,7,8,9]. In many plants, it has been proposed that the anthocyanin biosynthesis pathway is mainly regulated by R2R3-MYB, bHLH and WD40 repeats factors to control their downstream structural genes [10]. While one another transcription factor bZIP is found to promote anthocyanin synthesis in combination with MYBs to increase their expression [11].

Transcription factors (TFs) are the most important part of the plant growth and development regulatory network, which activate or inhibit genes expression by combining with specific promoter sequences. bZIP transcription factors regulate many plant processes through the interaction of DNA binding motifs, transcriptional activation motifs, nuclear localization signals and oligomerization sites [2]. Many TFs can be classified into different gene families according to their conserved domains. Presently, at least 64 families of transcription factors have been identified in the plants [12]. As one of the most abundant and conserved gene families in eukaryotes, the basic leucine zipper motif (bZIP) gene family has an integral role in growth development and abiotic stress responses in plants [13]. The conserved bZIP domain has about 40 ~ 80 amino acid residues, which includes two parts, a highly conserved DNA-binding basic composed of 20 amino acids and a relatively diversified leucine zipper region [14]. The basic amino acid region is located at the C-terminal region and through a fixed N-x7-R/K structure for sequence-specific DNA binding. The leucine zipper region located at the N-terminal region, which consists of several heptapeptide repeats or hydrophobic amino acid residues, such as methionine, isoleucine, valine, etc. This domain main function by forming dimers through the leucine zipper domain [15, 16].

The bZIP transcription factor family has been comprehensively identified in several plants, such as 78, 58, 69, 55, 89, 114, 45, 45 in Arabidopsis [17, 18], maize (Zea mays) [19], tomato (Solanum lycopersicum) [20], grape (Vitis vinifera) [21], rice (Oryza sativa) [13], apple (Malus domestica) [22], poplar (Populus simonii) [23], and Chinese jujube (Ziziphus jujuba) [24], respectively. The Arabidopsis bZIP gene family consists of 78 members divided into 13 groups (groups A-K, M and S) [18]. Currently, a large number of bZIP genes have been found to play important roles in the processes of plant growth and development, such as seed maturation and germination [25], flower development [26], vascular development [27] and embryogenesis [28]. For example, the AtbZIP11 affects plant root development by linking low-energy signals to auxin-mediated control of primary root growth [29]. Overexpression of the ZmbZIP4 in maize can also lead to an increase in the number of lateral roots, longer primary roots and improved plant roots [30]. In addition, the bZIP genes also play an important role in plant biotic and abiotic stress [31,32,33]. In wheat, TabZIP15 promotes the combination of ABF/AREB and ABRE (ABA response element) cis-acting elements through the expression of ABF/AREB to induce downstream target gene expression responding to plant salt and drought stress [34]. Similar results were observed for the GsbZIP67 gene in Alfalfa (Medicago sativa), overexpression of GsbZIP67 promoted the growth of plant roots and shoots and changed the physiological indicators of transgenic plants under bicarbonate salt-alkali stress [32].

Interestingly, a large number of studies have shown that some bZIP genes are involved in plant anthocyanin biosynthetic pathway [35]. ELONGATED HYPOCOTYL5 (HY5), one member of the bZIP gene family, was activated in a light-dependent manner to promote pigment accumulation. HY5 could directly bind to G-box or ACE-box of MYB factors, including PRODUCTION OF ANTHOCYANIN PIGMENT1 (PAP1), PRODUCTION OF FLAVONOL GLYCOSIDES (MYB12 and MYB111), and MYB-like Domain (MYBD) to promote their gene expression [36,37,38,39]. Besides MYBs, HY5 co-regulate with PIF3 the expression of anthocyanin biosynthesis structure genes [35]. Moreover, it has proposed that the overexpression bZIP gene MdHY5 in apple callus induce anthocyanin accumulation by upregulating MdMYB10 expression and its downstream genes [40]. Overexpression of CRY1a could increase accumulation of anthocyanin in tomato and SlHY5 silencing could decrease CRY1a-induced anthocyanin accumulation [41]. In addition, HY5 positively regulates the cold responses through activation of anthocyanin biosynthesis genes such as chalcone synthase (CHI) and chalcone isomerase (CHS) [42]. Under the induction of abscisic acid (ABA), MdbZIP44 (HY5) positively regulates the anthocyanin accumulation by enhancing the interaction between MdMYB1 and its downstream target genes [43].

Pomegranate (Punica granatum) belongs to the Lythraceae family. It is one of the important characteristic economic forest species in the world. It has achieved considerable attention due to its high antioxidant activity, rich color in peel and aril (the edible part of pomegranate), nutritious, active pharmaceutical ingredients and anthocyanin [44, 45]. Pomegranate has fruit and flower pomegranate, its attractive appearance, the long forefronts of flowering, and it is gradually being used in landscaping. Comprehensive analysis, pomegranate has strong health functions, high ornamental value and ecological and economic profitability, and has great development prospects [46]. In recent years, most scholars have sequenced and assembled the genomes of different pomegranate varieties, and obtained high-quality genome maps, such as ‘Dabenzi’ [47], ‘Taishanhong’ [44] and ‘Tunisia’ soft-seed pomegranate [48], which provide an important molecular biology basis for pomegranate genetic improvement research.

In this study, we use bioinformatics methods to identify the members of the pomegranate bZIP transcription factor family members, and analyze the physical and chemical properties, conserved domains, evolutionary relationships, cis-acting elements, tissue and organ expression of transcription factors. At the same time, two candidate genes related to anthocyanin synthesis were identified in pomegranate for the first time, and the gene cloning, subcellular location and differential expression of flower with different colors were analyzed. These results provide a reference for studying the expression of pomegranate bZIP gene family during the growth and development of pomegranate and biotic and abiotic stress, and provide a basis for further elucidating the formation mechanism of pomegranate flower color.


Identification and characterization of the bZIP transcription factor family in pomegranate

In this study, we identified 65 gene family members from the whole genome of ‘Taishanhong’ pomegranate. For subsequent analysis, they renamed them according to scaffold (Table 1). The physicochemical properties of bZIP transcription factors were analyzed by ExPASy online-tool. The results showed that the molecular weight of pomegranate bZIP family proteins ranged from 14,803.81 to 380,571.64 Da, the theoretical isoelectric point ranged from 4.74 to 9.91, and the protein lengths ranged from 128 to 1543 aa, with the shortest being 128 aa (PgbZIP11) and the longest being 1543 aa (PgbZIP8). These results provide a theoretical basis for further purification, activity and function studies of PgbZIP protiens. The subcellular location prediction of each member indicated that all members of the bZIP gene family are expressed in the nucleus.

Table 1 The identified PgbZIP genes and their related information

Phylogenetic tree and bZIP conservative domain analysis

The amino acid positions of the conserved structural domains of bZIP were visualized by multiple sequence alignment of the protein sequences of 65 bZIP family members of pomegranate (Fig. 1). The results showed that the core conserved structural domain of the PgbZIPs protein had an average length of 50 aa. The bZIP structural domain consists of a basic region and a leucine zipper. the basic region was located at the the C-terminus and contains a fixed N-X7-R/K motif bound to a specific DNA sequence, while the leucine zipper region was located at the N-terminal end and consists of several repetitive heptapeptide or hydrophobic amino acid residues. The highly conserved leucine residues were sometimes replaced by isoleucine, methionine, valine, etc. Our results are consistent with previous study in Arabidopsis [18].

Fig. 1
figure 1

Visualization of multiple sequence alignment of the pomegranate bZIP family DNA binding domains. The total height of the letter piles at each position indicates the conservation of the sequence at that position (measured in bits). The height of a single letter in the letter piles represents the relative frequency of the corresponding amino acid at that position

To explore the homologous evolutionary relationships and classification of the bZIP family, we constructed a phylogenetic tree using the bZIP members of pomegranate, Eucalyptus megacephalus and Arabidopsis. As shown in Fig. 2, a deep clustering analysis of the entire evolutionary tree was performed with reference to the evolutionary relationship and naming rules of the Arabidopsis bZIP genes. The pomegranate bZIP gene family was divided into 13 groups (A, B, C, D, E, F, G, H, I, J, K, M, and S). These 13 groups differed greatly in size. Two of the groups have only one member, namely group B and group M. The largest group has 16 members (group S). Throughout the evolutionary tree, the bZIP genes of the three species were distributed in almost all of these 13 subgroups, indicating that the bZIP genes showed different divergence in gene function in pomegranate, Eucalyptus megacephalus, and Arabidopsis. Meanwhile, some bZIP genes of pomegranate, Eucalyptus megacephalus and Arabidopsis each clustered together in a small clade, suggesting that a co-speciation event and species-specific duplication events occurred during the bZIP family divergence. Similar to the evolutionary relationships in Arabidopsis, our further analysis revealed that two pairs of homologous genes, PgbZIP16/AtHY5 and PgbZIP34/AtHYH, in group H, were able to influence anthocyanin accumulation.

Fig. 2
figure 2

The unrooted phylogenetic tree of P. granatum, E. grandis and Arabidopsis bZIP proteins. The dendrogram was drew by MEGA7 with the Maximum Likelihood method and JTT + G + F model. Different groups are marked with different colors. The groups were named with letters representing some of their important members (A for ABF/AREB/ABI5, C for CPRF2-like, G for GBF, H for HY5), protein size (B for big, S for small), or alphabetically

Gene structure and protein conserved motifs of PgbZIP genes family

As the composition of introns/exons and types and numbers of introns were typical marks of evolution within certain gene families, we explored the gene structures of PgbZIP genes structures to further understand their evolutionary trajectory (Fig. 3). We analyzed the intron/exon and motif structure of each member. As expected, members of the different groups had different gene structures, conserved domains and numbers of introns/exons, with the number of introns ranging from 0 to 11. For example, there were no introns in the S group, while PgbZIP54 and PgbZIP61 had the largest number of introns with 11 introns.

Fig. 3
figure 3

Phylogenetic tree, conserved motifs and gene structures of the PgbZIP gene family. Gene structures. Exons and Intron are displayed using Black lines and green bars. Protein motifs in the bZIP members. The colorful boxes delineate different motifs. The clustering is performed according to the results of phylogenetic analysis

To investigate the distribution of conserved patterns bZIP proteins, 65 PgbZIP protein sequences were analyzed by MEME (Fig. 3). The number and type of conserved motifs contained in each protein sequence varied. The distribution of different conserved motifs was revealed with different functions of different genes. Group D has the most motifs, with PgbZIP4 containing 16 motifs, and Group S has the least motifs, with PgbZIP15 containing 2 motifs.

Cis-acting elements of pomegranate bZIP gene family

To further explore the potential mechanism of bZIP gene in biotic and abiotic stress, we submitted the 1500 bp upstream sequence of the PgbZIP translation start site to Plant CARE for detection of cis-acting elements. The PgbZIP gene family cis-acting elements were mapped using the online website GSDS2.0 (Fig. 4). Meanwhile, we analyzed and screened 12 cis-acting elements, mainly including ABA-responsive element ABRE, drought-inducible response element MBS, low-temperature response element LTR, defense and stress-responsive element TC-rich repeats, trauma-responsive element WUN-motif, gibberellin-responsive element P-box, anaerobic-inducible cis-regulatory element ARE, meristematic tissue expression-related cis-regulatory element CAT-box, regulatory element MYB of secondary metabolic pathways, and common cis-acting element CAAT-box of promoter and enhancer regions. Pomegranate had 65 PgbZIPs, each with one or more cis-acting elements, suggesting that expression of PgbZIPs may be associated with these abiotic stresses. In total, 65 genes had one or more ABA response elements and 40 PgbZIPs had one or more LTR response elements, indicating that PgbZIPs may be significantly responsive to ABA and low temperature stresses. 18 PgbZIPs had TC-rich repeats response elements and 17 PgbZIPs had WUN-motif response elements. In conclusion, the analysis of cis-acting elements suggested that the PgbZIP genes may respond to different abiotic stresses.

Fig. 4
figure 4

The distribution of cis-acting elements in the promoter region of pomegranate bZIP gene

Tissue-differential gene expression patterns of pomegranate bZIP genes

To explore expression patterns of the pomegranate bZIPs gene family in different tissues, we analyzed the expression of bZIP genes in pomegranate roots, stems, flowers, endocarp, exocarp and leaves using transcriptome analysis data by RNA-Seq. As shown in Fig. 5, there was a clear tissue-organ specificity in the expression pattern of PgbZIP genes. the expression of PgbZIP1, PgbZIP13, PgbZIP18, PgbZIP24, PgbZIP41, PgbZIP43, PgbZIP46 and PgbZIP64 was higher than other members, indicating that this gene may be involved in the transcriptional regulation of various physiological and biochemical processes during pomegranate development. In contrast, the expression of PgbZIP3, PgbZIP29, PgbZIP55 and PgbZIP59 was generally lower. Compared with other tissues, PgbZIP44 and PgbZIP47 were highly expressed in roots, PgbZIP11 in buds and pericarp, and PgbZIP34 and PgbZIP46 in young leaves. PgbZIP16 and PgbZIP34 were somewhat expressed in pericarp, leaves, flowers and fruits, but relatively low in root development.

Fig. 5
figure 5

Heatmap of PgbZIP gene expression in P. granatum. Note: S1: Outer seed coat; S2: Inner seed coat; S3: Pericarp; S4: Flower; S5: Root; S6: Flesh leaf; S7: Bisexual flowers (3.0-5.0 mm); S8: Bisexual flowers (5.1-13.0 mm); S9: Bisexual flowers (13.1-25.0 mm); S10: Functional male flowers (3.0-5.0 mm); S11: Functional male flowers (5.1-13.0 mm); S12: Functional male flowers (13.1-25.0 mm); S13: Inner seed coat of ‘Tunisia’; S14: Inner seed coat of ‘Baiyushizi’; S15: Mix of leaves, flowers, fruit and roots of ‘Black127’; S16: Mix of leaves, flowers, fruit and roots of ‘nana’; S17: Peels of ‘Wonderful’ (cultivars S1-S6 are ‘Dabenzi’, cultivars S7-S13 are ‘Tunisia’)

Cloning and analysis of PgbZIP16 and PgbZIP34

The 510 bp and 522 bp open reading frame (ORF) of the PgbZIP16 and PgbZIP34 genes were amplified from the mixed-sample cDNA (Fig. 6A). The ORF encodes 170 and 174 aa, respectively. The predicted protein molecular weights were 39,591.74 and 41,604.78 Da, and the theoretical isoelectric points were 5.22 and 5.18, respectively. The amino acid sequence analysis of the proteins of these two genes contains a bZIP domain (BRLZ Domain) located at sites 87 ~ 151 and 96 ~ 160, respectively. Evolutionary tree analysis showed that PgbZIP16 and PgbZIP34 genes belong to HY5 and HYH type transcription factors in the bZIP gene family, respectively, and play an important role in the transcriptional regulation of anthocyanin synthesis. Therefore, PgbZIP16 and PgbZIP34 were selected for cloning and functional analysis in this paper.

Fig. 6
figure 6

Cloning and characteristics of PgbZIP16 and PgbZIP34. a. PCR amplification of PgbZIP16 and PgbZIP34. M: DL ladder 2000 DNA Marker. 1: PgbZIP16. 2: PgbZIP34. b. Three different colors of pomegranate flowers. c. Subcellular localization of the 35S::GFP-PgbZIP16 and 35S::GFP-PgbZIP34 fusion protein in tobacco leaves. Free GFP served as a control. A DAPI staining assay was conducted to confirm the nuclear localization. Bars = 50 μm. d. Tissue petal stage-specific expression of PgbZIP16 and PgbZIP34 in pomegranate. Different lowercase letters indicate significant differences at the 0.05 level

Reverse transcription quantitative PCR (RT-qPCR) revealed that PgbZIP16 and PgbZIP34 were highly expressed in ‘Liuhuahong’ (Fig. 6C). Among the three different colors of pomegranate flowers, the expression level PgbZIP16 in ‘Liuhuahong’ was 5.83 times that of ‘Liuhuabai’ and 5 times that of ‘Liuhuafen’. PgbZIP34 had a similar expression trend, and its expression level was 3.9 times that of ‘Liuhuabai’ and 2.3 times that of ‘Liuhuafen’.

To determine the subcellular localization of PgbZIP16 and PgbZIP34, the construct encoding PgbZIP16 and PgbZIP34 fused to green fluorescent protein (GFP) were transformed into tobacco leaves. Intense fluorescence from 35S::GFP-PgbZIP16 and 35S::GFP-PgbZIP34 were detected in the nucleus (Fig. 6D), indicating that PgbZIP16 and PgbZIP34 localize to nucleus. These results suggested that PgbZIP16 and PgbZIP34 might function as a transcription factor in regulating anthocyanin biosynthesis.

Functional studies in tobacco

To investigate the functions of PgbZIP16 and PgbZIP34 genes, we constructed pBI121-PgbZIP16 and pBI121-PgbZIP34 overexpression vectors and transferred them into tobacco leaves by injection method. The results showed (Fig. 7a) consistent trends were observed in anthocyanin accumulation and gene expression levels, both of which increased and then decreased. The anthocyanin in PgbZIP16 transgenic tobacco leaves started to increase significantly on the 3th day, and reached the highest level on the 5th day, which was 0.067 mg·g−1 FW. It was 5.58 times higher than that in non-infested leaves and 2.79 times higher than that in pBI121 null leaves. Gene expression was consistent with the level of anthocyanin content, with the highest at day 5, which was 9.72-fold higher than that of un-infested leaves and 3.74-fold higher than that of pBI121 null leaves (Fig. 7b-c). The anthocyanin in PgbZIP34 transgenic tobacco leaves started to increase significantly on the 3th day, and reached the highest level on the 5th day, which was 0.047 mg·g−1 FW. it was 3.92-fold higher than that of un-infested leaves and 1.96 times more than that of pBI121 nulled leaves. Gene expression was consistent with anthocyanin content levels, with the highest being on day 5, which was 6.56 times higher than that of un-infested leaves and 2.54 times higher than that of pBI121 unloaded leaves (Fig. 7d-e). In summary, PgbZIP16 had an important role in anthocyanin accumulation, but because they are homologous sequences. Therefore, we further explored the genetic regulation of anthocyanin accumulation by PgbZIP16 in Arabidopsis.

Fig. 7
figure 7

Transient expression of PgbZIP16 and PgbZIP34 in tobacco. a. Images of wild-type, pBI121-overexpressing, PgbZIP16-overexpressing, and PgbZIP34-overexpressing tobacco leaves. b. Anthocyanin content in wild-type and PgbZIP16-overexpressing leaves. c. Relative expression level in wild-type and PgbZIP16-overexpressing leaves. d. Anthocyanin content in wild-type and PgbZIP34-overexpressing leaves. e. Relative expression level in wild-type and PgbZIP34-overexpressing leaves. Different letters indicate significant at the P < 0.05 level

Genetic regulation of PgbZIP16 on Arabidopsis anthocyanin accumulation

It has been previously reported that HY5 acted as a transcriptional activator that positively regulated the biosynthesis of anthocyanins in plants [38,39,40]. HY5 regulated the accumulation of anthocyanins through directly binding to the promoters of CHS, CHI, F3H, F3′H, DFR, and ANS [49, 50]. To investigate whether PgbZIP16 plays a role in promoting anthocyanin biosynthesis in Arabidopsis, we constructed a pBI121-PgbZIP16 overexpression vector and transformed Arabidopsis using the flower-dip method. The PgbZIP16 overexpression Arabidopsis (PgbZIP16-6, PgbZIP16-16, PgbZIP16-21) were compared with control Arabidopsis carrying only the empty vector of the 35S:pBI121 vector. It was confirmed that Arabidopsis overexpressing PgbZIP16 expressed PgbZIP16 at significantly higher levels than control Arabidopsis (Fig. 8a). The PgbZIP16 was constructed into the pBI121 vector to drive stable expression of the GUS reporter gene. Histochemical staining for GUS activity showed that Arabidopsis seedlings overexpressing PgbZIP16 were expressed in stem segments (Fig. 8b). Anthocyanin content was measured in control and PgbZIP16 overexpressing Arabidopsis leaves and was found to be significantly higher in the leaves of PgbZIP16 overexpressing Arabidopsis than in the control (Fig. 8c). In transgenic Arabidopsis, there was a concordance between the expression of most structural genes on the anthocyanin biosynthetic pathway (FLS, 4CL, CHI, CHS, F3H, F3’H, DFR, UF3GT, UGT1, UGT2, and ANS) and PgbZIP16 genes, which were significantly up-regulated compared with the control (Fig. 8d). Moreover, PgbZIP16 significantly up-regulated the expression levels of UF3GT, ANS, and DFR genes in transgenic Arabidopsis and enhanced anthocyanin accumulation. Therefore, the PgbZIP16 gene has a genetic regulatory effect on anthocyanin accumulation in Arabidopsis.

Fig. 8
figure 8

Genetic regulation of PgbZIP16 on Arabidopsis anthocyanin accumulation. a. Expression analysis of PgbZIP16 in the wild-type, PgbZIP16 overexpressing Arabidopsis. b. Images of representative seedling of wild-type, PgbZIP16 overexpressing lines. c. Overexpression of PgbZIP16 facilitated anthocyanin accumulation in Arabidopsis. d. Expression levels of anthocyanin-related genes in wild-type, PgbZIP16 overexpressing Arabidopsis. Total RNA extracted from the different genotypic leaves. The different letters indicate statistically significant differences compared with the corresponding wild-type (P < 0.05)


The bZIP transcription factor plays a crucial role in plant growth, development, and abiotic stress responses, such as seed maturation, flower development, stress response, ect [25, 26, 33]. Currently, the bZIP gene family has been studied in the model plant, especially for Arabidopsis and crops, such as AtbZIP11/18 in Arabidopsis [27, 29], GsbZIP67 in soybeans [32], CabZIP25 in pepper [33], TabZIP15 in wheat [34] and MdHY5/MdbZIP44 in apple [40, 43]. Until now, despite the sequencing of the whole pomegranate gene has been completed, little is known about the bZIP gene family in pomegranate [45, 47, 48]. So, we identified and analyzed the expression pattern of bZIP based on phylogenetic analysis to speculate on the evolution of the PgbZIP gene family.

In this study, a total of 65 PgbZIP genes were identified in pomegranate using bioinformatics methods (Table. 1). The bZIP members in pomegranate were similar to those in Arabidopsis [18], apple [22], poplar [23] and jujube [24], which may be caused by ancient polyploid events. To further understand the evolutionary relationships between Arabidopsis and pomegranate, we constructed a phylogenetic tree of bZIP genes following a clustering approach (Fig. 2) [18]. Due to the highly conserved bZIPs sequences, genes with the same function belong to the same group, which provides a reference for studying this gene family. Similar to the Arabidopsis grouping, the pomegranate bZIP genes were divided into 13 groups.

Gene structure and conservative motifs were also important basis for studying gene evolution and gene duplication (Fig. 3). We analyzed in detail the structure of the pomegranate bZIP gene and the number of introns and exons. Compared with other gene families, we found that the pomegranate bZIP gene structure was relatively simple, with the number of introns ranging from 0 to 11. In conclusion, most of the PgbZIP genes have similar number of introns compared to other plant species [21, 51,52,53]. In Arabidopsis, the subfamily-specific and conserved motifs may play important roles in the functional differentiation of AtbZIPs subfamilies. For example, most members of group A participate in ABA biological pathway and regulate plant responses to abiotic stress [54,55,56]. Therefore, the PgbZIP genes in group A could have similar functions. The significant feature of group C members was the extension of the leucine zipper region, which can be up to 9 repeats. In addition, potential target sites for protein modification, such as phosphorylation sites that regulated nuclear translocation and DNA binding, were also preserved [17]. Group D genes could participate in plant defense against pathogens [17]. Group G gene and their homologues were mainly involved in the signal transduction of blue-violet light [57]. Group H contained two genes that could be directly combined with light-induced gene promoters to regulate plant cell elongation, chloroplast synthesis, hormone synthesis and anthocyanin biosynthesis [58]. Group S had the most members, but the number of well-researched genes was less. Members of this group not only plays an important role in the sucrose metabolism pathway, but also could be activated and transcribed under the antibiotic stress [59]. In our study, transcriptome data indicated that PgbZIPs were highly homologous to Arabidopsis, demonstrating similar roles in specific biological processes. It seems that the evolution events in bZIP gene family members have happened before species divergence, which affected their number and function [60, 61].

The expression intensity of pomegranate PgbZIPs in different tissues was further analyzed (Fig. 5). The results showed that the genes were expressed in leaves, roots, flowers, seed coat and envelope, except for PgbZIP3, PgbZIP12, PgbZIP39, PgbZIP55 and PgbZIP59, which were hardly expressed in the tissues. This indicated that the bZIP gene family play an essential role in the growth and development of pomegranate. At the same time, the expression of PgbZIP46 was found to be higher in all tissues than other genes, especially in roots, leaves, and flowers. This may be that this gene is closely related to growth, development and stress in pomegranate. In addition, we speculated that several genes that were barely expressed in tissues may not be involved in the regulation of pomegranate development or stress.

Anthocyanins are water-soluble pigments involved in pathways of plant secondary metabolism. Anthocyanins were mainly found in flowers, leaves, seed coat and fruits of plants in form of glycoside. Anthocyanins not only provided brilliant colors to plants, but also protected plants from ultraviolet radiation and pathogens [62]. The current research on HY5 and HYH of Arabidopsis, tomato and apple is more in-depth. For example, HY5 and HYH in Arabidopsis are phytochrome receptors of the light signal pathway downstream. Not only the expression of EBGs and LBGs was directly activate, HY5 can positively regulate the transcriptional activation of AtPAP1 [26, 50, 63]. In apple, bZIP transcription factor gene MdHY5 could directly promote the expression of MdMYB10 and MdMYB1 genes, and positively regulate anthocyanin accumulation by enhancing the interaction with its downstream target genes [31, 32, 40, 43]. In addition, SlHY5 gene silencing down-regulated the accumulation of anthocyanins in tomato [41]. The phylogenetic analyses demonstrate that PgbZIP16 and PgbZIP34 shared higher homology with AtHY5 and AtHYH. Therefore, we speculated that PgbZIP16 and PgbZIP34 genes in pomegranate also had similar functions to Arabidopsis, tomato and apple.

Pomegranate, as an ancient fruit widely consumed fresh fruit, is an economically important fruit tree crop in China. Preliminary research has been conducted on the coloring mechanism of pomegranate peel, but the mechanism of flower color formation has not been studied in depth. In this study, we performed gene cloning, subcellular localization and functional verification of PgbZIP16 and PgbZIP34 in the flower color formation mechanism of three kinds of ornamental pomegranate. We results suggested that the patterns of expression of both genes in red were significantly higher than those in white and pink, this result was consistent with the results of grape hyacinth and red pear (Fig. 6) [64, 65]. To investigate the functions of PgbZIP16 and PgbZIP34, we constructed pBI121-PgbZIP16 and pBI121-PgbZIP34 overexpression vectors and transformed tobacco leaves. The results showed that there was a consistency between anthocyanin content and gene expression, which both increased and then decreased (Fig. 7b-e). Both PgbZIP16 and PgbZIP34 promoted anthocyanin accumulation in tobacco leaves. Compared with PgbZIP34, PgbZIP16 played a more important role in anthocyanin accumulation. To determine the genetic relationship between PgbZIP16 and structural genes, we used the dipstick method to transfer the constructed overexpression vector into Arabidopsis and obtained PgbZIP16 overexpression strains. Histochemical staining for GUS activity showed that seedlings overexpressing PgbZIP16 were specifically expressed in stem segments (Fig. 8b). Such an expression pattern suggested that PgbZIP16 may be involved in the accumulation of anthocyanins at an early stages of Arabidopsis development. Furthermore, further studies shown that the overexpression of PgbZIP16 significantly promoted the anthocyanin accumulation in the transgenic strain (Fig. 8c). Meanwhile, most genes on the anthocyanin synthesis pathway (FLS, 4CL, CHI, CHS, F3H, F3’H, DFR, UF3GT, UGT1, UGT2, and ANS) and PgbZIP16 gene expression were consistent (Fig. 8d). The expression of PAL and C4H was lower in PgbZIP16 overexpressing plants than in the control, which may result from the fact that PAL and C4H, as structural genes of the mangiferin synthesis pathway, were not directly involved in anthocyanin synthesis [38]. Based on the present experimental study, further investigation of the relationship between PgbZIP16 and other transcription factors and their role in the process of flower color formation is the focus of future work.


In this study, a total of 65 PgbZIP genes were identified in pomegranate using bioinformatics methods, and their bZIP structural domain were determined. We constructed a phylogenetic tree of pomegranate and Arabidopsis, and divided the PgbZIP genes into 13 groups. Due to the high conservation of bZIP genes, proteins with similar functions were clustered into one group, which provided a reliable basis for studying the functions of related genes in gene families in plants. In addition, we identified two candidate genes in the anthocyanin biosynthesis using transcriptome data analysis and performed their gene cloning, subcellular localization, quantitative fluorescence analysis, transient expression and Arabidopsis transformation. Our results indicated that PgbZIP16 and PgbZIP34 had similar regulatory mechanisms in anthocyanin accumulation. It is believed that future studies will elucidate the exact molecular mechanisms by which PgbZIP16 interacts with other transcription factors to promote anthocyanin accumulation.

Materials and methods

Identification and characterization of bZIP gene family members of pomegranate

Sequences with E-value < e−5 were identified by HMMER v3.2.1 software based on the hidden Markov models (HMM) profile of the bZIP gene family domain (PF00170) downloaded from the Pfam database ( We used the hmmsearch ( with bZIP to search the ‘Taishanhong’ pomegranate amino acid sequences, with a threshold of E-value≤1e−5, and manually removed redundancy [66, 67]. At the same time, the bZIP proteins of other species were downloaded from Plant Transcription Factor Database ( as seed files [68]. Sequence similarity searches to genes in the whole genome sequence of pomegranate were conducted using the BLASTp program on a local NCBI database (E-value <1e−10, identity >50%), and removed duplicates.

Combing the comparison results of HMMER and BlastP, using the online software SMART (, CDD ( and pfam ( databases were used to confirm the integrity of the conserved bZIP domains, and the sequences that did not contain bZIP conserved structural domains were removed [69, 70]. Isoelectric point (PI), molecular weight (MV) and instability indices of the identified bZIP protein were obtained using the online software ExPaSy-Protparam ( Subcellular localization of PgbZIPs was predicted by Cell-PLoc-2 ( [71].

Phylogenetic analysis

To explore the phylogenetic relationships of the pomegranate bZIP gene family, all of the Arabidopsis bZIP protein sequences were obtained from TAIR database ( The amino acid sequences of pomegranate and Arabidopsis bZIPs were imported into MEGA 7.0 and multiple sequence comparisons were performed using MUSCLE [18]. Thereafter, we used the maximum likelihood method of MEGA 7.0 to construct phylogenetic trees [72, 73]. The classification of the pomegranate bZIP protein family was referenced from previous studies in Arabidopsis. Finally, the phylogenetic tree was visualized using the EvolView website ( [74].

Gene structure and conservative motif analysis

Information on 65 PgbZIPs was obtained from the genome annotation GFF files, and the gene structure of PgbZIP gene family was analyzed by the online Gene Structure Display Serve (GSDS, We used ClustalW and WebLogo for multiple sequence comparison and visualization analysis, respectively [75, 76]. Finally, the conserved patterns of the bZIP gene family were identified through the online website MEME ( [77].

Analysis of cis-acting elements of pomegranate bZIP gene family

The 1.5 kb promoter sequence upstream of the transcription start site of each PgbZIP gene was extracted from the pomegranate genome sequence and predicted by PlantCARE for cis-regulatory elements ( [78]. The results were manually deleted and saved as a file in bed format, and the file was submitted to the online website GSDS 2.0 for visualization.

Expression analysis of pomegranate bZIP gene family

To study the expression of PgbZIPs in different tissues and organs, we used the NCBI database ( illumina sequencing platform to obtain 7 tissue transcript datas of pomegranate hermaphrodite, functional male flower, leaf, root, endocarp, ectocarp and pericarp. Their accession numbers were ‘Dabenzi’ SRR5279388, SRR5279391, SRR5279394-SRR5279397; ‘Tunisia’ SRR5446592, SRR5446595, SRR5446598, SRR5446601, SRR5446604, SRR5446607 and SRR5678820; ‘Baiyushizi’ SRR5678819, ‘Black127’ SRR1054190, ‘nana’ SRR1055290 and ‘Wonderful’ SRR080723. The RNA-Seq were quality filtered using the fastp software [79]. To quantify annotated transcript abundance, we used Kallisto version 0.44.0 to obtain transcriptome data, and the transformed TPM value log2(TPM + 1) were visualized using the TBtools.

Experiment material

Research conducted at Baima Base for Teaching and Scientific Research of Nanjing Forestry University, the test materials were ‘Liuhuahong’, ‘Liuhuafen’ and ‘Liuhuabai’white pomegranate. No permission is required for sample collection. Pomegranate samples were collected in June 2020, and samples were frozen in liquid nitrogen and stored in a − 80 °C refrigerator for backup.

Gene cloning and subcellular localization

Using BioTeke Plant Total RNA Extraction Kit (spin column type) to extract RNA from three flower colors of pomegranate petals. cDNA was obtained using reverse transcription kit (PrimeScriptTM RT reagent Kit with gDNAEraser, TaKaRa), and stored at −20 °C. Oligo 7.0 software was used to design the primers (Table S1), and the primer sequence was synthesized by Shanghai Bioengineering Co., Ltd.

The PCR reaction system was as follows: 2 × Taq Plus Master Mix: 25 μl; F: 1 μl; R: 1 μl; DNA template: 2 μl; Nuclease-free ddH2O: 21 μl. The PCR reaction procedure was as follows: 95 °C: 3 min; 95 °C: 15 s, 58 °C: 45 s, 72 °C: 1 min, a total of 35 cycles; 72 °C: 5 min; 4 °C: storage. The PCR products were separated by 1% agarose gel electrophoresis, and the gel was cut to recover the target fragments. After recovery, ligation, transformation, and sequencing, the CDS sequences of the pomegranate PgbZIP16 and PgbZIP34 genes were finally obtained. ExPaSy-Translate online tool ( was used to translate it into amino acid sequence.

The correct recombinant plasmid obtained by sequencing was transferred into Agrobacterium GV3101 by freeze-thaw method, and then the tobacco leaves were infected by Agrobacterium tumefaciens-mediated method for 3-4 weeks. The empty vector pBI121 with GFP tag was transformed into Agrobacterium tumefaciens GV3101 in this study. pBI121-GFP was used as a control. After 24 h of dark culture and light culture respectively, the fluorescence signal was observed under a confocal microscope and photographed.

Expression specificity of pomegranate PgbZIP16 and PgbZIP34

Real-time fluorescent quantitative PCR was used to study the expression patterns of PgbZIP16 and PgbZIP34 genes in the petals of three different flower colors of pomegranate. The BioEasy Master Mix Plus (SYBR Green) was used as a fluorescent dye, and the reaction program was as follows: 95 °C: 3 min; followed by 40 cycles of 95 °C for 30 s and 60 °C for 15 s. Specific primers were designed for qPCR (Table S1). The pomegranate actin was used as the internal reference gene, and was performed on 3 biological and 3 technical replicates for each treatment. Comparison of relative gene expression data of flowers was done using the 2-ΔΔCt method [80, 81].

Agrobacterium infiltration

PgbZIP16 and PgbZIP34 recombinant plasmids with GUS tags were transferred into Agrobacterium tumefaciens GV3101, and the activated Agrobacterium tumefaciens was inoculated into 50 mL of LB liquid medium at a ratio of 1:100, and incubated at 28 °C for 16 h with shaking at 210 r/min. Then centrifuged at 4000 rpm/min for 10 min to collect the bacteria and the bacteria were resuspended and used in permeate (10 mmol/L MES + 10 mmol/L MgCl2.6H2O + 100 mmol/L AS, pH 5.6) to resuspend the bacteria. Then the injection solution was prepared proportionally, placed at room temperature for 2-4 h, injected into the abaxial surface of tobacco leaves, sampled daily after infestation, stored in a refrigerator at −80 °C, and tested 7 days later.

Overexpression of PgbZIP16 in Arabidopsis and GUS activity assay

Wild-type Arabidopsis plants were grown in the incubator (Arabidopsis seeds were kept in our laboratory). We constructed the pBI121-PgbZIP16 overexpression vector and transformed it into Agrobacterium GV3101, inoculated it in 50 mL LB liquid medium, incubated at 210 r/min at 28 °C for 48 h, and then collected the bacteria by centrifugation at 4000 rpm/min within 10 min. After re-suspension of bacteria with an osmotic agent (0.05% sliwet77 + 5% sucrose +1/2MS liquid medium). Arabidopsis plants were subsequently transformed according to the flower dip method and incubated in the dark for 48 h. After three infestations, Arabidopsis seeds were collected and screened for PgbZIP16 transgene-positive plants [82].

Seeds of the identified positive plants were planted in 1/2 MS Petri dishes containing 25 μg/L. Positive seedlings with two true leaves were photographed after two weeks, and selected plants were selected and placed in prepared GUS staining solution and incubated overnight at 37 °C. Positive stained plants were decolorized with anhydrous ethanol and photographed with a stereomicroscope after all the green color faded [83].

Statistical analysis

The data are shown as the means ± standard errors (SEs) of 3 or 6 independent biological replicates. Statistical differences between samples were analyzed by LSD and Duncan (D) (p < 0.05). Data analysis and visualization were processed using SPSS 20.0 and Origin 2018.

Availability of data and materials

The whole genome data of pomegranate is downloaded from the NCBI database (, and the accession number is ASM220158v1. The transcriptome data is obtained from NCBI (, and the accession numbers are SRP103147 and SRP100581. The bZIP protein sequences of A. thaliana downloaded from the PlantTFDB database ( Public access to all databases is open. The datasets supporting the conclusions of this article are included within the article (and its additional files).



Basic Leucine Zipper


Multiple Collinearity Scan toolkit


Theoretical Isoelectric Point


Molecular Weight


Maximum Likelihood


Complementary DNA


Quantitative real-time polymerase chain reaction


  1. Bendokas V, Skemiene K, Trumbeckaite S, Stanys V, Passamonti S, Borutaite V, et al. Anthocyanins: from plant pigments to health benefits at mitochondrial level. Crit Rev Food Sci Nutr. 2020;60(19):3352–65.

    Article  CAS  PubMed  Google Scholar 

  2. Landi M, Tattini M, Gould KS. Multiple functional roles of anthocyanins in plant-environment interactions. Environ Exp Bot. 2015;119:4–17.

    Article  CAS  Google Scholar 

  3. Luo Q, Liu R, Zeng L, Wu Y, Jiang Y, Yang Q, et al. Isolation and molecular characterization of NtMYB4a, a putative transcription activation factor involved in anthocyanin synthesis in tobacco. Gene. 2020;760:144990.

    Article  CAS  PubMed  Google Scholar 

  4. Naing A, Kim C. Abiotic stress-induced anthocyanins in plants: their role in tolerance to abiotic stresses. Physiol Plant. 2021;172(3):1711–23.

    Article  CAS  PubMed  Google Scholar 

  5. Kristine M, Rune S, Unni S, Cato B, Trond L, Peter R, et al. Temperature and nitrogen effects on regulators and products of the flavonoid pathway: experimental and kinetic model studies. Plant Cell Environ. 2009;3(32):286–99.

    Google Scholar 

  6. Kelly E, Vyas P, Weber J, Kelly E, Vyas P, Weber JT. Biochemical properties and neuroprotective effects of compounds in various species of berries. Molecules. 2017;23(1):26.

    Article  PubMed Central  Google Scholar 

  7. de Pascual-Teresa S. Molecular mechanisms involved in the cardiovascular and neuroprotective effects of anthocyanins. Arch Biochem Biophys. 2014;559:68–74.

    Article  PubMed  Google Scholar 

  8. Li X, Zhang Y, Yuan Y, Sun Y, Qin Z, Deng H. Protective effects of selenium, vitamin E, and purple carrot anthocyanins on D-Galactose-induced oxidative damage in blood, liver, heart and kidney rats. Biol Trace Elem Res. 2016;173(2):433–42.

    Article  CAS  PubMed  Google Scholar 

  9. Cassidy A. Berry anthocyanin intake and cardiovascular health. Mol Asp Med. 2018;61:76–82.

    Article  CAS  Google Scholar 

  10. Zhang H, Koes R, Shang H, Fu Z, Wang L, Dong X, et al. Identification and functional analysis of three new anthocyanin R2R3-MYB genes in Petunia. Plant Direct. 2019;3(1):e00114.

    Article  PubMed  PubMed Central  Google Scholar 

  11. Gangappa S, Botto J. The multifaceted roles of HY5 in plant growth and development. Mol Plant. 2016;9(10):1353–65.

    Article  CAS  PubMed  Google Scholar 

  12. Perez-Rodriguez P, Riano-Pachon D, Correa L, Rensing S, Kersten B. Mueller-Roeber B PlnTFDB: updated content and new features of the plant transcription factor database. Nucleic Acids Res. 2010;38:822–7.

    Article  Google Scholar 

  13. Nijhawan A, Jain M, Tyagi A, Khurana J. Genomic survey and gene expression analysis of the basic leucine zipper transcription factor family in rice. Plant Physiol. 2008;146(2):333–50.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  14. Talanian R, Mcknight C, Kim P. Sequence-specific DNA-binding by a short peptide dimer. Science. 1990;249(4970):769–71.

    Article  CAS  PubMed  Google Scholar 

  15. Landschulz W, Johnson P, McKnight S. The leucine zipper: a hypothetical structure common to a new class of DNA binding proteins. Science. 1998;240(4860):1759–64.

    Article  Google Scholar 

  16. Ellenberger T, Brandl C, Struhl K, Harrison S. The GCN4 basic region leucine zipper binds DNA as a dimer of uninterrupted a helices: crystal structure of the protein-DNA complex. Cell. 1992;71(7):1223–37.

    Article  CAS  PubMed  Google Scholar 

  17. Jakoby M, Weisshaar B, Droge-Laser W, Vicente-Carbajosa J, Tiedemann J, Kroj T, et al. bZIP transcription factors in Arabidopsis. Trends Plant Sci. 2002;7(3):106–11.

    Article  CAS  PubMed  Google Scholar 

  18. Dröge-Laser W, Snoek B, Snel B, Weiste C. The Arabidopsis bZIP transcription factor family—an update. Curr Opin Plant Biol. 2018;45(A):36–49.

    Article  PubMed  Google Scholar 

  19. Wei K, Chen J, Wang Y, Chen Y, Chen S, Lin Y, et al. Genome-wide analysis of bZIP-encoding genes in maize. DNA Res. 2012;19(6):463–76.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  20. Li D, Fu F, Zhang H, Song F. Genome-wide systematic characterization of the bZIP transcriptional factor family in tomato (Solanum lycopersicum L.). BMC Genomics. 2015;16(771):60–78.

  21. Liu J, Chen N, Chen F, Cai B, Dal Santo S, Tornielli G, Pezzotti M, Cheng Z. Genome-wide analysis and expression profile of the bZIP transcription factor gene family in grapevine (Vitis vinifera). BMC Genomics. 2014;15(281):1–18.

    Google Scholar 

  22. Li Y, Meng D, Li M, Cheng L. Genome-wide identification and expression analysis of the bZIP gene family in apple (Malus domestica). Tree Genet Genomes. 2016;12(82):1–17.

    Google Scholar 

  23. Zhao K, Chen S, Yao W, Cheng Z, Zhou B, Jiang T. Genome-wide analysis and expression profile of the bZIP gene family in poplar. BMC Plant Biol. 2021;21:122.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Zhang Y, Gao W, Li H, Wang Y, Li D, Xue C, et al. Genome-wide analysis of the bZIP gene family in Chinese jujube (Ziziphus jujuba Mill.). BMC Genomics. 2020;21(1):483.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Toh S, McCourt P, Tsuchiya Y. HY5 is involved in strigolactone-dependent seed germination in Arabidopsis. Plant Signal Behav. 2012;7(5):556–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  26. Strathmann A, Kuhlmann M, Heinekamp T, Droge-Laser W. BZI-1 specifically heterodimerises with the tobacco bZIP transcription factors BZI-2, BZI-3/TBZF and BZI-4, and is functionally involved in flower development. Plant J. 2001;28(4):397–408.

    Article  CAS  PubMed  Google Scholar 

  27. Gibalová A, Steinbachová L, Hafidh S, Bláhová V, Gadiou Z, Michailidis C, et al. Characterization of pollen-expressed bZIP protein interactions and the role of ATbZIP18 in the male gametophyte. Plant Reprod. 2017;30(1):1–17.

    Article  PubMed  Google Scholar 

  28. Guan Y, Ren H, Xie H, Ma Z, Chen F. Identification and characterization of bZIP-type transcription factors involved in carrot (Daucus carota L.) somatic embryogenesis. Plant J. 2009;60(2):207–17.

    Article  CAS  PubMed  Google Scholar 

  29. Weiste C, Pedrotti L, Selvanayagam J, Muralidhara P, Fröschel C, Novák O, et al. The Arabidopsis bZIP11 transcription factor links low-energy signalling to auxin-mediated control of primary root growth. PLoS Genet. 2017;13(2):e1006607.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Ma H, Liu C, Li Z, Ran Q, Xie G, Wang B, et al. ZmbZIP4 contributes to stress resistance in maize by regulating ABA synthesis and root development. Plant Physiol. 2018;178(2):753–70.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Yoshida T, Fujita Y, Sayama H, Kidokoro S, Maruyama K, Mizoi J, et al. AREB1, AREB2, and ABF3 are master transcription factors that cooperatively regulate ABRE-dependent ABA signaling involved in drought stress tolerance and require ABA for full activation. Plant J. 2010;61(4):672–85.

    Article  CAS  PubMed  Google Scholar 

  32. Wu S, Zhu P, Jia B, Yang J, Shen Y, Cai X, et al. A Glycine soja group S2 bZIP transcription factor GsbZIP67 conferred bicarbonate alkaline tolerance in Medicago sativa. BMC Plant Biol. 2018;18(1):234.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Gai W, Ma X, Qiao Y, Shi B, Ul Haq S, Li Q, et al. Characterization of the bZIP Transcription Factor Family in Pepper (Capsicum annuum L.): CabZIP25 Positively Modulates the Salt Tolerance. Front Plant Sci. 2020;11:139.

    Article  PubMed  PubMed Central  Google Scholar 

  34. Bi C, Yu Y, Dong C, Yang Y, Zhai Y, Du F, et al. The bZIP transcription factor TabZIP15 improves salt stress tolerance in wheat. Plant Biotechnol J. 2021;19(2):209–11.

    Article  CAS  PubMed  Google Scholar 

  35. Schepens I, Duek P, Fankhauser C. Phytochrome mediated light signalling in Arabidopsis. Curr Opin Plant Biol. 2004;7(5):564–9.

    Article  CAS  PubMed  Google Scholar 

  36. Shin J, Park E, Choi G. PIF3 regulates anthocyanin biosynthesis in an HY5-dependent manner with both factors directly binding anthocyanin biosynthetic gene promoters in Arabidopsis. Plant J Cell Mol Biol. 2007;49(6):981–94.

    Article  CAS  Google Scholar 

  37. Stracke R, Favory J, Gruber H, Bartelniewoehner L, Bartels S, Binkert M, et al. The Arabidopsis bZIP transcription factor HY5 regulates expression of the PFG1/MYB12 gene in response to light and ultraviolet-B radiation. Plant Cell Environ. 2010;33(1):88–103.

    CAS  PubMed  Google Scholar 

  38. Shin D, Choi M, Kim K, Bang G, Cho M, Choi S, et al. HY5 regulates anthocyanin biosynthesis by inducing the transcriptional activation of the MYB75/PAP1 transcription factor in Arabidopsis. FEBS Lett. 2013;587(10):1543–7.

    Article  CAS  PubMed  Google Scholar 

  39. Nguyen N, Jeong C, Kang G, Yoo S, Hong S, Lee H. MYBD employed by HY5 increases anthocyanin accumulation via repression of MYBL2 in Arabidopsis. Plant J Cell Mol Biol. 2015;84(6):1192–205.

    Article  CAS  Google Scholar 

  40. An J, Qu F, Yao J, Wang X, You C, Wang X, et al. The bZIP transcription factor MdHY5 regulates anthocyanin accumulation and nitrate assimilation in apple. Hortic Res. 2017;4:17023.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Liu C, Chi C, Jin L, Zhu J, Yu J, Zhou Y. The bZIP transcription factor HY5 mediates CRY1a-induced anthocyanin biosynthesis in tomato. Plant Cell Environ. 2018;41(8):1762–75.

    Article  CAS  PubMed  Google Scholar 

  42. Catalá R, Medina J, Salinas J. Integration of low temperature and light signaling during cold acclimation response in Arabidopsis. Proc Natl Acad Sci U S A. 2011;108:16475–80.

    Article  PubMed  PubMed Central  Google Scholar 

  43. An J, Yao J, Xu R, You C, Wang X, Hao Y. Apple bZIP transcription factor MdbZIP44 regulates abscisic acid-promoted anthocyanin accumulation. Plant Cell Environ. 2018;41(11):2678–92.

    Article  CAS  PubMed  Google Scholar 

  44. Yuan Z, Fang Y, Zhang T, Fei Z, Han F, Liu C, et al. The pomegranate (Punica granatum L.) genome provides insights into fruit quality and ovule developmental biology. Plant Biotechnol J. 2018;16(7):1363–74.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  45. Ben L, Kim K, Quah C, Kim W, Shahimi M. Anti-inflammatory potential of ellagic acid,gallic acid and punicalagin A&B isolated from Punica granatum. BMC Complement Altern Med. 2017;17(1):47–57.

    Article  Google Scholar 

  46. Chater J, Merhaut D, Jia Z, Mauk P, Preece J. Fruit quality traits of ten California-grown pomegranate cultivars harvested over three months. Sci Hortic. 2018;237(1):11–9.

    Article  Google Scholar 

  47. Qin G, Xu C, Ming R, Tang H, Guyot R, Kramer E, et al. The pomegranate (Punica granatum L.) genome and the genomics of punicalagin biosynthesis. Plant J. 2017;91(6):1108–28.

    Article  CAS  PubMed  Google Scholar 

  48. Luo X, Li H, Wu Z, Yao W, Zhao P, Cao D, et al. The pomegranate (Punica granatum L.) draft genome dissects genetic divergence between soft- and hard-seeded cultivars. Plant Biotechnol J. 2019;18(4):955–68.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Chattopadhyay S, Ang LH, Puente P, Deng XW, Wei N. Arabidopsis bZIP protein HY5 directly interacts with light-responsive promoters in mediating light control of gene expression. Plant Cell. 1998;10(5):673–83.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  50. Jungeun L, Kun H, Viktor S, Horim L, Pablo F, Ying G, et al. Analysis of transcription factor HY5 genomic binding sites revealed its hierarchical role in light regulation of development. Plant Cell. 2007;19(3):731–49.

    Article  Google Scholar 

  51. Fan L, Xu L, Wang Y, Tang M, Liu L. Genome- and transcriptome-wide characterization of bZIP gene family identifies potential members involved in abiotic stress response and anthocyanin biosynthesis in radish (Raphanus sativus L.). Int J Mol Sci. 2019;20(24):6334.

    Article  CAS  PubMed Central  Google Scholar 

  52. Wang X, Chen X, Yang T, Cheng Q, Cheng Z. Genome-wide identification of bZIP family genes involved in drought and heat stresses in strawberry (Fragaria vesca). Int J Genomics. 2017;3981031.

  53. Li H, Li L, ShangGuan G, Jia C, Deng S, Noman M, et al. Genome-wide identification and expression analysis of bZIP gene family in Carthamus tinctorius L. Sci Rep. 2020;10(1):15521.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  54. Choi H, Hong J, Ha J, Kang J, Kim S. ABFs, a family of ABA responsive element binding factors. J Biol Chem. 2000;275(3):1723–30.

    Article  CAS  PubMed  Google Scholar 

  55. Finkelstein R, Lynch T. The Arabidopsis abscisic acid response gene ABI5 encodes a basic leucine zipper transcription factor. Plant Cell. 2000;12(4):599–610.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  56. Lopez-Molina L, Mongrand S, Chua N. A postgermination developmental arrest checkpoint is mediated by abscisic acid and requires the ABI5 transcription factor in Arabidopsis. PNAS. 2001;98(8):4782–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  57. Kircher S, Ledger L, Hayashi H, Weisshaar B, Frohnmeyer H. CPRF4, a novel plant bZIP protein of the CPRF family: comparative analysis of light dependent expression, post-transcriptional regulation, nuclear import and heterodimerisation. Mol General Geneics. 1998;257(6):595–605.

    Article  CAS  Google Scholar 

  58. Chattopadhyay S, Ang L, Puente P, Deng X, Wei N. Arabidopsis bZIP protein HY5 directly interacts with light-responsive promoters in mediating light control of gene expression. Plant Cell. 1998;10:673–84.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Strathmann A, Kuhlmann M, Heinekamp T, Droge-Laser W. BZI-1 specifically heterodimerises with the tobacco bZIP transcription factors BZI-2, BZI-3/TBZF and BZI-4, and is functionally involved in flower development. Plant J. 2001;28:1–15.

    Google Scholar 

  60. Abdullah FS, Mehmood F, Malik HMT, Ahmed I, Heidari P, Poczai P. The GASA Gene family in cacao (Theobroma cacao, Malvaceae): genome wide identification and expression analysis. Agronomy. 2021;11(7):1425.

    Article  CAS  Google Scholar 

  61. Musavizadeh Z, Najafi-Zarrini H, Kazemitabar SK, Hashemi SH, Faraji S, Barcaccia G, et al. Genome-wide analysis of potassium channel genes in rice: expression of the OsAKT and OsKAT genes under salt stress. Genes. 2021;12(5):784.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  62. Leon R, Lightbourn L, Melina L, Amarillas L. Differential gene expression of anthocyanin biosynthetic genes under low temperature and ultraviolet-B radiation in bell pepper (Capsicum annuum). Int J Agric Biol. 2020;23(3):531–8.

    Google Scholar 

  63. Zhang H, He H, Wang X, Wang X, Yang X, Li L, et al. Genome-wide mapping of the HY5-mediated gene networks in Arabidopsis that involve both transcriptional and post-transcriptional regulation. Plant J. 2011;65(3):346–58.

    Article  CAS  PubMed  Google Scholar 

  64. Cao S, Liu Y. Cloning and expression analysis of MaHY5 transcription factor from grape hyacinth. Acta Botan Boreali-Occiden Sin. 2019;39(07):1188–94.

    Google Scholar 

  65. Wang Y, Zhang X, Zhao Y, Yang J, He Y, Li G, et al. Transcription factor PyHY5 binds to the promoters of PyWD40 and PyMYB10 and regulates its expression in red pear 'Yunhongli No. 1′. Plant Physiol Biochem. 2020;154:665–74.

    Article  CAS  PubMed  Google Scholar 

  66. Finn R, Clements J, Eddy S. HMMER web server: interactive sequence similarity searching. Nucleic Acids Res. 2011;39:29–37.

    Article  Google Scholar 

  67. Chen C, Chen H, Zhang Y, Thomas H, Frank M, He Y, et al. TBtools: an integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):1194–202.

    Article  CAS  Google Scholar 

  68. El-Gebali S, Mistry J, Bateman A, Eddy S, Luciani A, Potter S, et al. The Pfam protein families database in 2019. Nucleic Acids Res. 2019;47(D1):d427–32.

    Article  CAS  PubMed  Google Scholar 

  69. Marchler-Bauer A, Bo Y, Han L, He J, Lanczycki C, Lu S, et al. CDD/SPARCLE: functional classification of proteins via subfamily domain architectures. Nucleic Acids Res. 2017;45(D1):D200–3.

    Article  CAS  PubMed  Google Scholar 

  70. Letunic I, Peer B. 20 years of the SMART protein domain annotation resource. Nucleic Acids Res. 2018;46(D1):D493–6.

    Article  CAS  PubMed  Google Scholar 

  71. Shen H, Chou K. Cell-PLoc 2.0: an improved package of web-servers for predicting subcellular localization of proteins in various organisms. Nat Sci. 2010;2(10):1090–103.

    Google Scholar 

  72. Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  73. Lefort V, Longueville J-E, Gascuel O. SMS: Smart model selection in PhyML. Mol Biol Evol. 2017;34(9):2422–4.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  74. Balakrishnan S, Gao S, Lercher M, Hu S, Chen W. Evolview v3: a webserver for visualization, annotation, and management of phylogenetic trees. Nucleic Acids Res. 2019;2019(47):W270–5.

    Google Scholar 

  75. Thompson J, Gibson T, Plewniak F, Jeanmougin F, Higgins D. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucleic Acids Res. 1997;25(24):4876–82.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  76. Crooks G, Hon G, Chandonia J, Brenner S. WebLogo: a sequence logo generator. Genome Res. 2004;14(6):88–90.

    Article  Google Scholar 

  77. Bailey T, Boden M, Buske F, Frith M, Grant C, Clementi L, et al. MEME Suite: tools for motif discovery and searching. Nucleic Acids Res. 2009;37(suppl_2):W202–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  78. Magali L, Patrice D, Gert T, Kathleen M, Yves M, Yves V, et al. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2020;30(1):325–7.

    Google Scholar 

  79. Chen S, Zhou Y, Chen Y, Gu J. fastp: an ultra-fast all-in-one FASTQ preprocessor. Bioinformatics. 2018;34(17):i884–90.

    Article  PubMed  PubMed Central  Google Scholar 

  80. Xu Y, Zhu X, Gong Y, Xu L, Wang Y, Liu L. Evaluation of reference genes for gene expression studies in radish (Raphanus sativusL.) using quantitative real-time PCR. Biochem Biophys Res Commun. 2012;424(3):398–403.

    Article  CAS  PubMed  Google Scholar 

  81. Livak K, Schmittgen T. Analysis of relative gene expression data using realtime quantitative PCR and the 2-ΔΔCT method. Methods. 2001;25(4):402–8.

    Article  CAS  PubMed  Google Scholar 

  82. Clough S, Bent A. Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998;16(6):735–43.

    Article  CAS  PubMed  Google Scholar 

  83. Jefferson RA, Kavanagh TA, Bevan MW. GUS fusions beta-glucuronidase as a sensitive and versatile gene fusion marker in higher plants. EMBO J. 1987;6:3901–7.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


Not applicable.


This work was supported by the Initiative Project for Talents of Nanjing Forestry University (GXL2014070, GXL2018032), the Doctorate Fellowship Foundation of Nanjing Forestry University, and the Priority Academic Program Development of Jiangsu High Education Institutions (PAPD), the National Natural Science Foundation of China (31901341), the Natural Science Foundation of Jiangsu Province (BK20180768). These funding bodies took part in the design of the study and collection, analysis, and interpretation of data, and the writing of the manuscript, as well as in the open access payment.

Author information

Authors and Affiliations



SW and XZ designed research, and also analyzed and interpreted the phylogenetic analysis. SW, XZ and BL performed the expression level examination of the plants. SW was a major contributor in experimentation, analyzing data and writing the manuscript. XZ and SY revised the manuscript. BL and SWconducted data analysis. All authors read and approved the manuscript.

Corresponding author

Correspondence to Zhaohe Yuan.

Ethics declarations

Ethics approval and consent to participate

The use of plant parts in the present study complies with international, national and/or institutional guidelines.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplementary Table 1

. Primers for the gene cloning, subcellular localization and qRT-PCR.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wang, S., Zhang, X., Li, B. et al. Genome-wide identification and characterization of bZIP gene family and cloning of candidate genes for anthocyanin biosynthesis in pomegranate (Punica granatum). BMC Plant Biol 22, 170 (2022).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: