Genome-wide analysis, expression profile of heat shock factor gene family (CaHsfs) and characterisation of CaHsfA2 in pepper (Capsicum annuum L.)
BMC Plant Biology volume 15, Article number: 151 (2015)
Heat shock factors (Hsfs) play crucial roles in plant developmental and defence processes. The production and quality of pepper (Capsicum annuum L.), an economically important vegetable crop, are severely reduced by adverse environmental stress conditions, such as heat, salt and osmotic stress. Although the pepper genome has been fully sequenced, the characterization of the Hsf gene family under abiotic stress conditions remains incomplete.
A total of 25 CaHsf members were identified in the pepper genome by bioinformatics analysis and PCR assays. They were grouped into three classes, CaHsfA, B and C, based on highly conserved Hsf domains, were distributed over 11 of 12 chromosomes, with none found on chromosome 11, and all of them, except CaHsfA5, formed a protein–protein interaction network. According to the RNA-seq data of pepper cultivar CM334, most CaHsf members were expressed in at least one tissue among root, stem, leaf, pericarp and placenta. Quantitative real-time PCR assays showed that all of the CaHsfs responded to heat stress (40 °C for 2 h), except CaHsfC1 in thermotolerant line R9 leaves, and that the expression patterns were different from those in thermosensitive line B6. Many CaHsfs were also regulated by salt and osmotic stresses, as well as exogenous Ca2+, putrescine, abscisic acid and methyl jasmonate. Additionally, CaHsfA2 was located in the nucleus and had transcriptional activity, consistent with the typical features of Hsfs. Time-course expression profiling of CaHsfA2 in response to heat stress revealed differences in its expression level and pattern between the pepper thermosensitive line B6 and thermotolerant line R9.
Twenty-five Hsf genes were identified in the pepper genome and most of them responded to heat, salt, osmotic stress, and exogenous substances, which provided potential clues for further analyses of CaHsfs functions in various kinds of abiotic stresses and of corresponding signal transduction pathways in pepper.
Plants as sessile organisms have formed a variety of defence mechanisms to protect themselves from persistently changing stress factors, such as extreme temperature, salt and drought . Temperature, especially high temperature, can affect crop growth and development, severely reducing the yield and quality [2–4]. Under heat stress (HS), the plant cells rapidly respond to a high temperature by inducing the expression of genes encoding heat shock proteins (Hsps), which are involved in preventing heat-related damage and confer plant thermotolerance [5, 6]. Many Hsps function as molecular chaperones in preventing protein misfolding and aggregation, consequently maintaining protein homeostasis in cells and causing the plant’s acquired thermotolerance [7–9].
Heat shock factors (Hsfs) regulate the expression of Hsps by recognizing heat shock elements (HSEs) within the promoters of Hsps [1, 10]. HSEs are characterised by multiple inverted repeats of the nGAAn sequence, and at least three HSE motifs are required for efficient Hsf oligomer binding in eukaryotic organisms [11, 12]. Under non-stress conditions, Hsfs are maintained in inactive states and form cytoplasmic complexes with Hsp90/Hsp70 chaperone complexes [8, 13]. Under HS conditions, as the result of a cytosolic protein response, Hsfs are released from chaperone complexes and bind to the HSEs of target genes after undergoing phosphorylation, sumoylation, trimerisation and nuclear import [1, 13, 14].
Hsf families share a conserved modular structure. Despite considerable variability in size and sequence, their structures and functions are conserved throughout the eukaryotic kingdom [8, 15]. In plant Hsfs, the highly conserved DNA-binding domain (DBD), which is composed of an antiparallel four-stranded β-sheet (β1, β2, β3 and β4) and three helical bundles (α1, α2 and α3) in the N-terminus is required for the positioning and recognition of HSEs [16–18]. The oligimerisation domain (OD or HR-A/B region), responsible for the transcription factor activity, is connected to the DBD by a flexible linker  and is composed of a heptad pattern of hydrophobic amino acid residues [19–21]. In addition, a cluster of basic amino acid residues, the nuclear localisation signal (NLS), essential for nuclear import, a leucine-rich nuclear export signal (NES) for nuclear export, short peptide motifs (AHA motifs) for activator functions, and a repressor domain (RD), characterised by the tetrapeptide LFGV in the C-terminus, exist in some Hsfs [1, 22–24].
The number of Hsf genes varies greatly among different eukaryotic organisms. Drosophila melanogaster, Caenorhabditis elegans and Saccharomyces cerevisiae have a single Hsf gene, and vertebrate genomes contain four Hsf genes . In contrast to the low numbers of Hsf genes in animals and yeasts, plants possess large Hsf families, with 21 Hsf genes in Arabidopsis (Arabidopsis thaliana), 25 in rice (Oryza sativa), 30 in maize (Zea mays), 52 Hsf genes in soybean (Glycine max), and at least 24 in tomato (Solanum lycopersicum) , indicating that plant Hsfs in various species may have multiple functions in preventing stress damage [1, 26]. Based on the peculiarities of the HR-A/B regions, plant Hsfs are divided into three classes, A, B and C . Class A and C Hsfs contain an extended HR-A/B with 21 and 7 amino acid residues between the HR-A and HR-B region, respectively, whereas class B Hsfs have a compact HR-A/B region lacking an insertion [15, 20]. Additionally, class A Hsfs contain aromatic (W, F, Y), hydrophobic (L, I, V) and acidic (D, E) AHA activation domains that are absent in class B and C Hsfs . Class B Hsfs, except HsfB5, contain the RD in the C-terminus, which is speculated to function as a repressor motif, making HsfB members act as repressors [27–30]. However, Arabidopsis HsfB1 is able to positively regulate the acquired thermotolerance . This apparent contradiction remains to be elucidated in future research.
Many plant Hsf genes from various species have been isolated and comprehensively studied. In Arabidopsis, HsfA1 and HsfA2 can synergistically activate target genes by forming superactivator heterodimers . While negatively regulating the expression levels of heat-inducible Hsfs, HsfB1 and HsfB2b are necessary for acquired thermotolerance . The expression of HsfA9 increases during embryogenesis and seed maturation , whereas HsfA5 is inactive and inhibits HsfA4 activity in tomato . HsfA2 can enhance the tolerance of plants to multiple abiotic stresses, such as HS , salt/osmotic stress , oxidative stress  and anoxia . HsfA2 in tomato contributes to fruit set during HS by activating the protection mechanisms in the anther [26, 36].
Pepper (Capsicum annuum L.), a very important economic crop, is sensitive to HS; however, investigations regarding the molecular mechanisms of heat tolerance have been limited [37, 38]. The Hsf gene family has, so far, been fully characterised only in a few model species, such as Arabidopsis, rice, maize, wheat and Chinese cabbage [8, 20, 39–41]. The genome sequence of pepper has been published recently [42, 43], which enables the characterisation of the pepper Hsf family and their responses to various stresses at the molecular level. In this study, the genome-wide identification of the pepper Hsf family members is performed using bioinformatics and gene expression analyses. A total of 25 Hsf family members from pepper are identified using bioinformatics analysis and PCR tests. The gene structure, conserved domains, chromosomal location, gene duplication and phylogenetic analyses are presented. In addition, we analyze the expression patterns of Hsf genes in different pepper tissues, as well as their responses to various stresses. The results provide a foundation for further functional research on Hsf genes in pepper and will help to reveal the functions of Hsf genes in other plant species.
Identification of the Hsf gene family in pepper
The Hidden Markov Model (HMM) profile of the Hsf DBD domain (Pfam: PF00447) (http://pfam.sanger.ac.uk/) was used as a BLAST query against the pepper genome database PGP (http://peppergenome.snu.ac.kr/), and the Hsf proteins in Arabidopsis, Vitis vinifera and Populus trichocarpa from the PTFD (Plant Transcription Factor Database, http://plntfdb.bio.uni-potsdam.de/v3.0/) were also used as BLAST query against PGP. A total of 26 candidate Hsf genes were originally obtained from pepper cultivar CM334, aligned with the corresponding genes in the cultivar Zunla-1 genome, and the different sequences were re-amplified to correct the corresponding pepper Hsf genes sequences. One candidate gene (Gene ID: CA01g30350) was discarded due to an incomplete DBD domain as identified by Pfam, SMART (http://smart.embl-heidelberg.de/) and Heatster (http://www.cibiv.at/services/hsf/). As a result, 25 Hsf candidate genes, whose classification and naming were based on the rules of Hsf families from Arabidopsis and tomato , were identified in pepper (Table 1). The coding sequence sizes for CaHsfs ranged from 606 bp (CaHsfB5) to 1,518 bp (CaHsfA1b), deduced proteins from 201 to 505 amino acids in length, respectively, and molecular weights from 23.37 kDa to 56.06 kDa, respectively. The predicted isoelectric points of CaHsfs were divergent, ranging from 4.65 to 9.20. Among the 25 pepper Hsf genes (CaHsfs), 17 members belonged to class A (CaHsfAs) and seven members belonged to class B (CaHsfBs), while only one Hsf gene was a class C member (CaHsfC). In CaHsfA, the subclasses of CaHsfA9 (four members, A9a, b, c and d), CaHsfA1 (three members, A1b, d and e), CaHsfA4 (three members, A4a, b and c) and CaHsfA6 (three members, A6a, b and c) were larger than the other subclasses, while subclass CaHsfA7 had no members.
Identification of conserved domains in pepper Hsf proteins
The MEME web server (http://meme-suite.org/tools/meme) was used to analyze motifs in CaHsf proteins (Fig. 1, Table 2). Motif 1 and 3 were found in all 25 pepper Hsf members, while motif 2 was absent in CaHsfA5 and motif 5 was absent in CaHsfC1. Some motifs only existed in certain members, such as motif 4, which was found in most CaHsfA and CaHsfC members, but not in CaHsfB. Generally, the number of motifs in the CaHsfBs was less than those in the CaHsfAs.
To better understand the structural characteristics of the CaHsf family, the conserved domains were predicted using Heatster (Table 3). Six conserved domains, DBD, HR-A/B, NLS, AHA, RD and NES, were identified in three CaHsf classes. As the most conserved domain in the Hsfs, DBD (corresponding to motif 1 and parts of motifs 2 and 3 in Fig. 1) was found in all 25 CaHsf members (Additional file 1: Fig. S1). The DBD domain was composed of three helical bundles (α1, α2 and α3) and four antiparallel β-sheets (β1, β2, β3 and β4), while no α1 and intact β1 were detected in the DBD domain of CaHsfA5, which resulted in its sequence being shorter than those of the other CaHsfs. In addition to DBD, HR-A/B, another core conserved domain, was also presented in all CaHsf proteins, while the other four conserved domains were only found in specific CaHsfs members. For the CaHsfAs, the NLS domain was found in all 17 members. CaHsfA9a had the longest NLS sequence (from the 57th to 248th amino acid), which covered the DBD and HR-A/B domains, while CaHsfA9b, A9c and A9d had the shortest NLSs of two amino acids. Three and four AHA domains, the specific domain that characterizes CaHsfAs, were identified in CaHsfA2 and CaHsfA3, respectively, while none were found in CaHsfA9b. For the CaHsfBs, CaHsfB1 and B5 did not contain the NLS domain, and similar to CaHsfA9a, CaHsfB3a also possessed a long NLS sequence (from the 10th to 218th amino acid) covering the DBD and HR-A/B domains. The tetrapeptide motif LFGV, as the core of the RD, was identified in all CaHsfB members except CaHsfB5, but only CaHsfB4 contained the NES domain. Interestingly, only two domains, DBD and HR-A/B, were identified in CaHsfC1.
Phylogenetic and sequence structure analysis in pepper Hsf proteins
To discover the phylogenetic relationships among the Hsf families, the Hsf conserved amino acid sequences (from the start of the DBD domain to the end of the HR-A/B region) [1, 41] of 25 proteins from pepper, 21 from Arabidopsis, 25 from tomato (S. lycopersicum), 21 from maize (Z. mais) and 25 from rice (O. sativa) were used to generate a phylogenetic tree (Fig. 2). Based on the phylogenetic tree, class HsfA had the maximum number of subclasses among the three classes, and included five smaller clusters of which four (A2 and A6, A1 and A8, A9, A3 and A7) were closer to class HsfC than the fifth cluster of class HsfA (A4 and A5). Two HsfA7 members from Arabidopsis (AT3G51910.1 and AT3G63350.1) were not clustered with the HsfA7 subclass from other plant species, but were closer to the HsfA6 subclass. This was also observed for one member from the maize HsfA7 subclass (ZM2G005815, closer to subclass HsfB2), one member from the maize HsfA8 subclass (ZM2G118485, closer to subclass HsfA4), one member from the rice HsfB4 subclass (Os07g44690.1, closer to subclass HsfA2) and CaHsfA6c (closer to subclass HsfA1). Compared with Arabidopsis, maize and rice, tomato Hsfs were closer to pepper Hsf proteins, which was coincident with the botanical classification.
A phylogenetic tree based on the sequences of conserved domains (from DBD to HR-A/B) in pepper Hsfs was also constructed (Additional file 2: Fig. S2A), which corresponded to the above mentioned motifs distributions (Fig. 1, Table 2) and phylogenetic groups (Fig. 2). The exon/intron structure of all 25 pepper Hsf members was analysed based on their coding sequences and the corresponding genome sequences to obtain further insights into duplication events and evolutionary patterns. CaHsfs shared a highly conserved exon/intron structures, with one intron and zero intron phases (Additional file 2: Fig. S2B). There were 15 CaHsf members with the intron located in the DBD domain, and four members with the intron located between the NLS and AHA domains, while the introns in CaHsfA4a and A4b were located between the AHA1 and AHA2 domains. The length of introns varied from 77 bp (CaHsfB2a) to 3,205 bp (CaHsfA5).
Chromosomal location and Hsf gene duplications in the pepper genome
To determine the chromosomal distribution of the CaHsf genes, the positions were identified based on their physical positions in the pepper genome database PGP. The 25 members mapped to 11 out of the 12 pepper chromosomes, with no genes mapping to chromosome 11 (Additional file 3: Fig. S3). The number of CaHsf genes on each chromosome varied greatly. The largest number of CaHsf genes (5) was located on chromosome 2, four genes were identified on chromosome 3, three genes on chromosome 9, and two genes each on chromosomes 1, 4, 6, 7 and 12. There was only one CaHsf gene each on chromosomes 5, 8 and 10.
The Plant Genome Duplication Database (PGDD, http://chibba.agtec.uga.edu/duplication/) analysis confirmed that two pairs of the pepper Hsfs (CaHsfA4a/A4c and CaHsfB3a/B3b) were segmental duplicated sequences (Additional file 3: Fig. S3, Additional file 4: Table S1), and each of the two pairs were located on different chromosomes (the former on chromosomes 2 and 4, and the latter on chromosomes 5 and 10). The ratios of nonsynonymous to synonymous substitutions (Ka/Ks) for the two duplicated pairs were less than 1.0, which suggested that the pairs had evolved mainly under the influence of purifying selection and that the duplication events occurred 45.9 (CaHsfA4a/A4c) and 71.31 million years ago (CaHsfB3a/B3b) [44, 45].
Protein–protein interaction network among CaHsf members
To provide further biological information on CaHsf members, their protein–protein interaction network of CaHsfs was predicated based on the interolog from the Arabidopsis interactome. Every CaHsf member, except CaHsfA5, generated a complex interaction network (Additional file 5: Fig. S4). The Arabidopsis homolog of CaHsfA5 (AtHsfA5, At4g13980) was not found among the Arabidopsis Hsf interaction partners. Among the CaHsfA members, A2, A3 and A6 (A6a, A6b and A6c) interacted with most other CaHsfs, and the three CaHsfA1 members (CaHsfA1b, A1d and A1e) also interacted with each other. However, CaHsfB1 (9 interaction partners), B3a (6 interaction partners) and B3b (6 interaction partners) owned the simple interaction network compared with class CaHsfA and other class CaHsfB members, and they did not interact with CaHsfA2, A1 and A6 members. In general, the class CaHsfA members had more interaction partners than class CaHsfB and CaHsfC members.
Expression analysis of CaHsf genes at different developmental stages in various organs
To investigate the potential functions of CaHsf genes during pepper development, a heat map of the global transcription patterns of the CaHsf family in CM334 was generated for the pepper genes against RNA-seq data of five tissues (root, stem, leaf, pericarp and placenta) and seven developmental stages of pericarp and placenta (Fig. 3). The expression pattern of each CaHsf gene was significantly different in different tissues and stages. Among class CaHsfA, CaHsfA2, A6a and A9a were constitutively expressed at relatively high levels, while CaHsfA4c, A6b, A9b and A9c were expressed at low levels or undetectable in all tested tissues, and the remained class CaHsfA genes were expressed highly in some specific tissues. For example, the expression levels of CaHsfA4b in root, PL-6DPA (placenta at 6 days post-anthesis) and -16DPA were higher than those in other tissues, but undetectable in PL-B10 (placenta at 10 days post-breaker). Although constitutively expressed, CaHsfA2, A6a and A9a were transcribed with higher levels in reproductive organs [PL-B (placenta at breaker) for CaHsfA2, PC- and PL-MG (pericarp and placenta at mature green) for A6a, and PC-B10 (pericarp 10 days post-breaker) for A9a].
CaHsfB1 was also constitutively expressed in all tested tissues at relatively high abundances, especially in PL (including PL-16DPA, −25DPA, −MG and -B). In comparison, the expression level of CaHsfB2b was at its lowest levels in PC-B (pericarp at breaker) and -B5, and undetectable in other tissues. CaHsfB3a was expressed at a higher level in PL-16DPA than in other organs and other PL developmental stages. In vegetative and reproductive organs, the highest levels of CaHsfC1 were observed in leaf and PL-B5, respectively, and the lowest level was found in PL-B.
Expression analysis of CaHsf genes under HS treatment
To examine the heat response profile for CaHsfs in pepper, we analysed the transcription levels of CaHsf members in the leaves of thermosensitive line B6 and thermotolerant line R9 under HS condition (40 °C for 2 h) [37, 38]. As shown in Fig. 4, in the heat-stressed R9 leaves, 22 CaHsf genes (88 %) were up-regulated (>2-fold) by HS, and two members, CaHsfB3a and B3b, were down-regulated (<0.5-fold), while only CaHsfC1 did not show a marked change. Among the up-regulated members, the expression levels of CaHsfA2, A3, A6c, B1 and B5 were higher than other members under HS, and the greatest increase in expression (>140-fold) was found in CaHsfA3, followed by CaHsfA2 (~20-fold). Compared with other groups, A1 members (CaHsfA1b, A1d and A1e) were not the predominantly expressed CaHsf genes. The transcription levels of the two pairs of duplicated CaHsf genes (CaHsfA4a/A4c and CaHsfB3a/B3b) did not exhibit significant divergences in regulated expression under HS. For the thermosensitive line B6, only 13 genes (52 %) were up-regulated and 10 genes (40 %) maintained a stable expression level under HS conditions, which was more than in R9. CaHsfA1d, A2 and A3 were strongly induced in treated B6 leaves, and CaHsfA2 and A3 showed particularly strong responses to HS in both thermosensitive line B6 and thermotolerant line R9 (Fig. 4).
Expression profiles of CaHsf genes in response to salt and osmotic stress
Although it was well known that Hsfs are involved in plant heat acclimatisation, other adverse factors, like salt and osmotic stresses, also affected plant growth and development, so we wondered whether responses to these stresses involved CaHsfs. Transcription profiles were obtained for CaHsf genes in R9 roots and stems subjected to 300 mM NaCl (salt stress) for 6 h, and 5 % mannitol (osmotic stress) for 6 h, respectively.
Under the salt stress treatment, six members (CaHsfA1b, A3, A9a, A9c, A9d and C1) were up-regulated in roots and stems, while CaHsfA2, A6c and B4 were down-regulated in both tissues (Fig. 5). The expression levels of five genes, CaHsfA1d, A4b, A8, B2b and B3a, were unregulated in roots and stems, while the expression levels of the remaining 11 members only showed obvious changes in either roots or stems. For instance, CaHsfA1e, A4c and B1 were induced by salt stress in roots, but not in stems; however, CaHsfA4a, A5, A6b and B5 exhibited high expression levels only in stems. Interestingly, in the subclass of CaHsfA9, CaHsfA9d was strongly induced (>30-fold) by salt stress in roots, while CaHsfA9a from stems seemed to be more sensitive to salt stress (~40-fold).
Under osmotic stress, nine members (CaHsfA1b, A1d, A4a, A6a, A9a, A9d, B3a, B3b and B5) were up-regulated, while four members (CaHsfA4c, A8, A9b and B4) were unregulated in stems and roots, and the highest expressing CaHsf genes were CaHsfA9d (>160-fold) in root and CaHsfB3b (~46-fold) in stem. However, no gene was down-regulated in both roots and stems. It is noteworthy that the expression of CaHsfA1b, A9a and especially A9d could be induced by both salt and osmotic stresses in both stems and roots.
Expression profiles of CaHsf genes responses to exogenous ABA, MeJA, putrescine (Put) and CaCl2
Phytohormones and plant signalling molecules, such as ABA, MeJA, Put and Ca2+, are involved in various stress signalling pathways [6, 37, 46]. To explore the responses of CaHsf family genes to these signals, we analysed the expression profiles of CaHsf genes in R9 leaves treated with these exogenous substances. As shown in Fig. 5, after a CaCl2 treatment, 13 CaHsf genes were significantly up-regulated, while 12 genes were unregulated. Similarly, 11 CaHsf genes were unregulated by a Put treatment, and no gene was down-regulated. Only five and four of the 25 CaHsf genes were up-regulated by ABA and MeJA treatment, respectively, whereas seven and nine members were down-regulated, respectively.
CaHsfB1 expression could be induced by all four signal substances, while CaHsfA9a and A9b were up-regulated by CaCl2, Put and ABA, but down-regulated by the MeJA treatment. In addition, CaHsfA6a was induced by CaCl2 and Put, but down-regulated by ABA and MeJA. The genes with the highest induced levels by CaCl2, Put and MeJA treatment were CaHsfA9d, CaHsfA9a and CaHsfA1d, respectively; however, the transcriptional levels of CaHsfs after the ABA treatment were not as high as in other three treatments. The highest expression level of CaHsfB1 induced by ABA increased less than 5-fold compared to the control.
CaHsfA2 locates to the cellular nucleus
Because of its dominant role in thermotolerant cells  and significantly up-regulated expression (Fig. 4), we characterized CaHsfA2 in pepper. First, to clarify whether the CaHsfA2 protein localizes to the nucleus, we investigated the cellular localization of CaHsfA2 protein in a transient expression assay by introducing the 35S::CaHsfA2-GFP (pBI221-CaHsfA2-GFP) translational fusion into onion epidermal cells using particle bombardment. The fluorescence of cells transformed with the control 35S::GFP (pBI221-GFP) was distributed throughout the cell, including the nucleus, cytoplasm and cytomembrane. In contrast, the fluorescence of the 35S::CaHsfA2-GFP chimera was associated with the cellular nucleus in onion epidermal cells, suggesting a nuclear localization of CaHsfA2 (Fig. 6).
CaHsfA2 shows transcriptional activity
The transcriptional activity of the CaHsfA2 protein was examined using a yeast expression system. The fusion plasmids pGBKT7-CaHsfA2 and pGBKT7 (control) were transformed into yeast strain AH109, and grown on SD medium lacking tryptophan (SD/Trp-) or lacking tryptophan, histidine and adenine (SD/Trp-Ade-His-). The growth status of transformants was evaluated (Fig. 7). Yeast cells containing either pGBKT7 or pGBKT7-CaHsfA2 could grow well on SD/Trp- plates; however, only cells containing pGBKT7-CaHsfA2 could grow on SD/Trp-Ade-His- plates and turn blue in the presence of 5-bromo-4-chloro-3-indoxyl-α-D-galacto-pyranoside (X-α-gal), which showed that LacZ, the second reporter gene, was activated by CaHsfA2. The above results demonstrated the presence of transcriptional activity in the CaHsfA2 protein.
CaHsfA2 discriminatively responds to HS in two pepper lines differing in thermotolerance
To further determine the response of CaHsfA2 to HS, we analysed the expression of this gene under HS and during recovery at room temperature in thermosensitive line B6 and thermotolerant line R9 (Fig. 8a). During HS, although the CaHsfA2 level was induced after 0.5 h at 40 °C (Fig. 8b, sample b) in B6 and R9, the expression level in R9 leaves was maintained at a higher level (~40-fold) compared with control to the end of the HS treatment (40 °C for 6 h) (Fig. 8b, sample a). After the heat-treated seedlings were moved back to normal temperature conditions for the 48 h recovery treatment, the CaHsfA2 expression level remained at high after a short recovery time (from 2 to 4 h) in R9, while it was down-regulated after a long recovery time (from 24 to 48 h) in both B6 and R9 leaves (Fig. 8b, samples h and i). It is worth noting that the CaHsfA2 expression level in R9 leaves under HS conditions for 4 h (sample d) was lower than after a 1 h HS treatment(sample c), and the level was slightly down-regulated at the end of the HS treatment (sample e) in B6.
More and more evidence suggests that Hsfs play central roles in plant developmental and defence processes [5, 12, 14, 31, 32, 36, 47]. Benefiting from genome availability, the functions of the Hsf family genes have been characterized in many plants, including the model plants Arabidopsis [20, 48], maize  and rice , as well as other plants, including grass , Chinese cabbage  and apple . However, with the limited investigations into the molecular basis of heat tolerance, little is known about the Hsf family in pepper.
In the present study, 25 Hsf genes in the nutritionally and economically important pepper were identified based on the pepper genome (Table 1) [42, 43]. Although the total number of Hsf genes was similar to that of Arabidopsis and tomato , the members of some specific Hsf subclasses in pepper were different from other two species. For example, the number of subclass HsfA1 members in pepper was less, but the number in subclass HsfA6 was more, than in tomato, while no pepper Hsf members were classified into subclass HsfA7, which suggested the possibility of a gene loss event during the evolutionary process . Another interesting observation was the surprising enlargement of the subclass CaHsfA9 with four members in pepper, compared with only one member in tomato. Usually a single HsfA9 gene is found in eudicots, including Arabidopsis and tomato, whereas soybean contained two members and Eucalyptos grandis (Myrtaceae) contained at least 17 closely related HsfA9-encoding genes . The reasons for the expansion of the CaHsfA9 genes remain to be elucidated by further investigations.
The DBD domain of about 100 amino acid residues is highly conserved in yeast, mammals and plants ; however, it is noteworthy that the DBD of CaHsfA5 in pepper, having only 72 amino acid residues, was shorter than the other CaHsfs, lacked the full α1-helix and had a truncated β1-sheet. However, the central helix-turn-helix motif (α2-turn-α3) required for specific interactions with HSEs, and the W amino acid residue in the incomplete β1-sheet required for aromatic-aromatic interactions , were still intact in the DBD domain of CaHsfA5, which might allow the protein to exert basic functions, but the effects of the truncated DBD in CaHsfA5 need to be determined. It was interesting that AHA, the essential domain for activator function in the HsfA class , did not occur in CaHsfA9b and A9d (Table 3). These Hsfs that lacked AHA domains could play their roles using a characteristic pattern of tryptophan residues by providing additive contributions to the activator function or by binding to other HsfAs to form hetero-oligomers [39, 52].
The phylogenetic analysis revealed that pepper Hsf members were more closely related to those from tomato than to those from Arabidopsis, maize and rice (Fig. 2), which was consistent with the fact that both pepper and tomato were members of the Solanaceae family . In the HsfA9 subclass, four CaHsfA9 members (CaHsfA9a, A9b, A9c and A9d) and three tomato members (Sl07g040680, Sl02g072060 and Sl11g008410) clustered in the branch of the HsfA9 group. Only one HsfA9 member from tomato was confirmed, while the other two members, Sl02g072060 and Sl11g008410, were identified to be Hsf-like (Hsfl) genes, Hsfl1 and Hsfl2, respectively . No Hsf members from eudicot species (Arabidopsis, pepper and tomato) clustered in the HsfC2 branch, because gene duplications in the monocots lineage led to subclass HsfC2 being unique in monocot species [1, 41], which was the most marked difference between monocots and eudicots. Conversely, HsfA9, B3 and B5, which emerged presumably after the split of monocots and eudicots , were not found in monocot plants (Fig. 2). The structural analyses showed that all of the CaHsf genes contained only one intron (Additional file 2: Fig. S2B), which was presumed to be a conservative evolutionary pattern, while the diverse length of the inserted introns might influence the functionally divergences of the CaHsf genes.
The CaHsf genes were distributed on 11 out of 12 chromosomes, except chromosome 11. Similarly, chromosomes 1 and 5 in tomato lacked Hsf genes, suggesting that the Hsf genes might distribute widely in the genome of the common ancestor of these members of the Solanaceae family. Gene duplication is a major evolutionary mechanism in genomes that helped plants adapt to various environmental stresses . Two pairs of paralogs (CaHsfA4a/A4c and CaHsfB3a/B3b) were detected in the pepper Hsf family, which was less than the number in rice (9 pairs) , maize (9 pairs) , and apple (12 pairs) . Neither pair was involved in the regional duplications within the same chromosome. The two members of each pair originated from segmental duplications between chromosomes, which occurred at 45.9 (CaHsfA4a and A4c) and 71.31 million years ago (CaHsfB3a and B3b). Because most plants with diploidized polyploids retained numerous duplicated chromosomal blocks within their genomes, segmental duplication occurred more frequently than the other two principal evolutionary patterns (tandem duplication and transposition events) in plants [44, 53]. Based on the vital role of segmental duplications in the family evolution, which occurred frequently in slowly evolving gene families , we proposed that the pepper Hsf gene family might have a slow evolutionary rate, as in maize .
The life activities in plants are attributable to protein–protein interactions. The construction of protein–protein interaction networks would provide necessary information on the mechanisms of life activities and on exploring the biological functions of unknown proteins. Based on the interolog from Arabidopsis, the protein–protein interaction network among the CaHsfs was constructed (Additional file 5: Fig. S4). The number of CaHsfA interaction partners was the greatest among the three CaHsf classes, which might indicate that CaHsfAs have a unique role as the master regulators of thermotolerance, and were essential for plants survival under serious HS conditions [1, 26]. In the complex network, CaHsfA1s (A1b, A1d and A1e) interacted with CaHsfA2 and these proteins might act synergistically to form a super-activator complex, which strongly regulates downstream HS-related genes in Arabidopsis . Despite not participating in the complicated interaction network, the Arabidopsis homolog of CaHsfA5 (AtHsfA5) specifically interacted and suppressed the anti-apoptotic factor AtHsfA4 , which acted as a pro-apoptotic factor. CaHsfA5 might have a similar function in pepper, although such a role needs to be confirmed.
Gene expression patterns are usually closely related to their functions . In this study, the expression profiles of each CaHsf gene in five different tissues were investigated. CaHsfA2, A6a, A9a and B1 were found to be constitutively expressed at relatively high levels in the various tissues and at multiple developmental stages under normal conditions (Fig. 3). A similar situation was found in other plants, like Arabidopsis (A1-type Hsfs) , apple (MdHsfA1a, A1d, B1a and B1b) , wheat (A1 and A8 groups, and some A2 and A6 group members) . The tissue- and stage-specific expression patterns of pepper CaHsf family genes, such as CaHsfA1b in root, CaHsfB3a in PL-16DPA, CaHsfB5 in PC-B10, and CaHsfC1 in leaf and PL-B5, indicated that CaHsfs might be widely involved in the development of various organs and tissues, which is helpful to further understanding the functions of CaHsf genes in pepper developmental biology .
Most of the CaHsf genes were up-regulated under HS conditions. CaHsfA1d, A2, A3, A6c, B1 and B5 were the main members with significantly higher expression levels during HS in B6 and R9 leaves (Fig. 4), which suggested that these CaHsfs were major transcription factors of heat-induced Hsp genes under HS conditions . In tomato, the close relative of pepper, HsfA1a acted as master regulator for triggering the heat response and acquired thermotolerance, and could not be replaced by other Hsfs . Among CaHsfA members in thermotolerant line R9, the expression levels of three members of subclass CaHsfA1 (A1b, A1d and A1e) were not striking, compared with A2, A3 or A6c, and, similarly, no master regulator was found among the four members of HsfA1 from Arabidopsis yet . These results indicated that there were species-specific features in the functions of Hsf members in regulating genes involved in plant HS responses. The number of up-regulated CaHsfA in R9 (17) under HS conditions was greater than in the thermosensitive line B6 (10), and the expression levels of major transcription factors (A2, A3, A6c, B1 and B5) in R9 were higher than their corresponding levels in B6, although the CaHsfA1d expression level in B6 was much higher than in R9. These highly expressed CaHsfAs might co-regulate the downstream HS-related genes, thus enhancing the pepper’s thermotolerance.
Notably, five out of seven members of class CaHsfB, especially B1 and B5, were significantly up-regulated by HS in R9 leaves, which was in accord with the HsfB1 and B2 groups in wheat . Most HsfBs, except HsfB5, contain the tetrapeptide LFGV in the C-terminal domain, which is assumed to function as a repressor motif in the transcriptional machinery . The expression patterns of most CaHsfB members under HS could be explained by a report in tomato. This work indicated that HS-induced HsfB1 could act as a coativator, cooperating with HsfA1a by forming a ternary complex with histone acetyl transferase HAC1 to synergistically activate a reporter gene  or interacted with HsfA1a or A2 to regulate the different stages of HS response . However, the roles of these up-regulated CaHsfB members during HS still needs further investigations. Both CaHsfB3a and B3b were down-regulated in B6 and R9 leaves under HS conditions, which suggested that the paralogs might act as repressors among CaHsf members. The expression level of CaHsfC1 did not present marked changes under HS in the thermotolerant line R9, which was similar to ZmHsf-13 from the maize HsfC class , while the down-regulated expression pattern of CaHsfC1 in the thermosensitive line B6 under HS was similar to those of C1 group members from wheat . The contradictory observation might be attributed to the species- and lines-specific responses to HS in HsfC, but testing this hypothesis requires further research.
In addition to HS, CaHsf genes were also regulated by salt and osmotic stresses (Fig. 5). CaHsfA1b, A9a and A9d could be induced by HS, salt or osmotic stresses, while CaHsfA6c and B4 were inhibited by salt stress but enhanced by HS. The expression of CaHsfB3a and B3b increased under osmotic stress but decreased under HS, which indicated that this pair of paralogs might have a specific regulatory role in strengthening a plant’s adaptability to stresses other than HS . Interestingly, the expression pattern of CaHsfA2 was different under salt stress, HS and even in roots and stems from R9 under osmotic stress; however, the overexpression of AtHsfA2 conferred not only thermotolerance, but also salt and osmotic stress tolerance , which implied that CaHsfs might play different roles in various tissues under different abiotic stresses and that some CaHsfs might participate in shared roles among various stresses .
The signaling substances Ca2+, Put, ABA and MeJA are involved in many signal transduction pathways under various stress conditions, and regulated CaHsfs expression. Different signals regulated different CaHsfs. For instance, both Ca2+ and Put could up-regulate CaHsfA6a, A9a and A9d, while ABA and MeJA down-regulated CaHsfA6a, which indicated that although most CaHsfs were highly conserved, they might play their roles via different signal transduction pathways. Ca2+, as the second messenger, couples extracellular signals with intracellular physiological and biochemical reactions to regulate the process of signal transduction in plant cells under abiotic stress. In Arabidopsis, calmodulin AtCaM3 was required in HS signalling to activate calcium/calmodulin-binding protein kinase (CBK), and the latter could phosphorylate HsfA1a . Thus, it was inferred that the induction of CaHsfA6a, A9a and A9d by Ca2+ might be attributed to CaM3 in pepper through the calcium-signalling pathway. Put can enhance a plant’s tolerance to abiotic stresses by regulating cell pH, balancing reactive oxygen metabolism and stabilizing membrane structures , but there were few reports about the function of Put on the regulation of Hsfs. In this study, Put induced the expression of CaHsfA2, A3, A6a, A9a, A9d and B4, which could explain our previous observation that Put was involved in the HS process . ABA [59–61] and MeJA [62, 63] have been reported to participate in the protection against heat damage. CaHsfA6a was down-regulated by ABA but up-regulated by Ca2+ and Put, which suggested that CaHsfA6a played different roles in the physiological processes mediated by ABA and the Ca2+ or Put pathways. Although the promoters of CaHsfA1b, A4a and A4b contained the CGTCA- or TGACG-motifs (cis-acting elements involved in the MeJA-responsiveness) (data not shown), MeJA could barely influence the expression of these genes, which might result from the joint effects of other cis-elements and complex regulatory mechanisms.
HsfA2, as the HS-induced enhancer of thermotolerance , has been researched in many species [5, 23, 31]; however, only limited characteristics of HsfA2 in pepper have previously been described . In this study, we found that CaHsfA2 possessed the typical features of the Hsf family, including being located in the nucleus by its NLS domain (Fig. 6, Table 3), having transcriptional activity (Fig. 7), and responding to continuous HS (Fig. 8), which confirmed our genome-wide identification of the CaHsf genes. Among the 25 pepper Hsfs, the induced expression of CaHsfA2 in response to HS was only less than CaHsfA3 in R9 or CaHsfA1d in B6 (Fig. 4), which suggested that CaHsfA2 might become a dominant Hsf, as seen with HsfA2 from Arabidopsis and tomato . The lower CaHsfA2 expression level in the thermosensitive line B6 than in the thermotolerant line R9 (Fig. 8) induced under HS might be the reason for the differing thermotolerance levels in the two pepper lines. However, the CaHsfA2 level in R9 after 4 h of HS was lower than after 1 h, and the level was slightly down-regulated at the end of HS treatment in B6. The different expression pattern might be attributed to regulatory genes or the circadian clock. Under long-term HS conditions, when Hsp17-II were maintained at a high level, they directly interacted with HsfA2, forming inactive complexes . Once the molecular chaperones were lacking, HsfA2 would be released from the complexes to act as a transcription factors. In addition, HsfB2b was induced after HS and repressed HsfA2. However, CaHsfA2 might be a rhythmic gene, like Arabidopsis HsfB2b, which is regulated by both HS and the circadian clock , while the regulatory mechanism of CaHsfA2 responding to HS needs to be explored further.
In this study, we identified a total of 25 CaHsfs in the pepper genome. Based on the bioinformatics analysis of highly conserved domains, the CaHsf genes were divided into three classes, class CaHsfA, B and C, and distributed in 11 out of the 12 chromosomes, with none found on Chromosome 11. According to the RNA-seq data from pepper CM334, the CaHsf members were expressed in at least one tissue among root, stem, leaf, pericarp and placenta. Results of quantitative real-time PCR demonstrated that the CaHsfs responded to HS (40 °C for 2 h), except CaHsfC1 in thermotolerant line R9 leaves, while thermosensitive line B6 showed different response patterns. Many CaHsfs were also regulated by salt and osmotic stress, as well as the signal substances Ca2+, Put, ABA and MeJA. Further more, CaHsfA2 had the typical characteristics of Hsfs, including being located to the nucleus, having transcriptional activity and responding to continuous HS. Our research not only added a new member to the plant Hsf family, but also provided information that could be used in further functional analyses of CaHsfs under various abiotic stresses and in elucidating signal transduction pathways in pepper.
Identification and annotation of Hsf family members from Capsicum annuum
The conserved amino acid sequence of DBD (Pfam: PF00047) was used as a BLAST query against the pepper genome database PGP (http://peppergenome.snu.ac.kr/, CM334 and Zunla-1 proteins), and the full-length amino acid sequences of the Hsf proteins in Arabidopsis, Vitis vinifera and Populus trichocarpa in the Plant Transcription Factor Database (PTFD)  were also used as BLAST query against the PGP and National Center for Biotechnology Information (NCBI). All output genes with default (Limit Expect Value 1e-5) were collected and confirmed by the Pfam (http://pfam.xfam.org/search) and SMART (http://smart.embl-heidelberg.de/). The candidate Hsf genes from CM334 and Zunla-1 cultivar were aligned by DNAMAN (Lynnon Biosoft, QC, Canada) for picking out those genes whose sequences were different between the two cultivars. The primers were designed by Primer Premier 5.0 (Premier Biosoft International, CA, USA) (Additional file 6: Table S2) to amplify the different sequences, which were then aligned with the sequences of the same gene from CM334 and Zunla-1 cultivar to confirm the correct sequences. The deduced amino acid sequences were analyzed using Compute pI/MW tool (http://www.expasy.org/tools/pi_tool.html) for computation of the theoretical iso-electric point and protein molecular weight. The classification of candidate Hsf proteins from pepper was performed by Heatster (http://www.cibiv.at/services/hsf/).
Multiple alignments of the N-proximal regions (from the start of the conserved DBD domain to the end of the HR-A/B region) of Hsf proteins from pepper, tomato and Arabidopsis were performed by CLUSTALW and the result of alignment was used for the construction of phylogenetic tree [1, 41] using MEGA 5.10 . The parameters for alignment by CLUSTALW were: gap open penalty: 10; gap extension penalty: 0.2; protein weight matrix: Gonnet; residue-specific gap penalties: on; hydrophilic penalties: on; gap separation distance: 0; end gap separation penalty: on; use negative matrix: on; delay divergent cutoff (%): 30. The neighbor joining phylogenetic trees were constructed with pairwise deletion, 1000 bootstraps and a Poisson model.
Sequence structure analysis and identification of conserved domains in pepper Hsf proteins
Exton/intron organization of the Hsf genes in pepper was illustrated with Gene Structure Display Server program (GSDS, http://gsds.cbi.pku.edu.cn/index.php)  by alignment of the cDNAs with their corresponding genomic DNA sequences. The MEME program (http://meme-suite.org/tools/meme) was used for identification of conserved motifs, with the following parameters: number of repetitions: any; maximum number of motifs: 25; and the optimum motif widths: 6–200 amino acid residues. The conserved domains annotation was performed using Pfam (http://pfam.xfam.org/search), SMART (http://smart.embl-heidelberg.de/) and Heatster online tools.
Chromosomal location and gene duplication
Information about the chromosome locations was based on the PGP and the genes were mapped to the chromosomes using MapDraw  by identifying their physical chromosome position. Identification of pepper Hsf genes duplication was conducted using Plant Genome Duplication Database (PGDD, http://chibba.agtec.uga.edu/duplication/index/locus). Nonsynonymous (Ka) and synonymous (Ks) rates (Ka/Ks) were calculated based on the results of identification of pepper Hsf genes duplication, by which the Ks value was converted into divergence time in millions of years based on a rate of 6.1 × 10−9 substitutions per site per year, and the divergence time (T) was calculated according to the formula: T = Ks/ (2 × 6.1 × 10−9) × 10−6 million years ago [44, 45].
Prediction of protein–protein interaction network
As there were no references for pepper interactome analysis, the interolog from Arabidopsis was used for predicting protein–protein interaction network of CaHsf members. First, Arabidopsis homologous sequences were searched in INPARANOID (http://inparanoid.sbc.su.se/cgi-bin/index.cgi)  based on the sequences of CaHsfs. Then, the edge information file (querynw.sif) of Arabidopsis homologs (AtHsfs) were generated via AraNet (http://www.functionalnet.org/aranet/) , and mapped to CaHsfs to create an edge information file of CaHsf members. Finally, the protein–protein interaction network of CaHsfs was drawn with Cytoscape_v3.2.1 (National Institute of General Medical Sciences, MD, USA).
Plant materials, growth conditions and stress treatments
In this study, two pepper lines, thermotolerant line R9 (introduced from the Asia Vegetable Research and Development Center) and thermosensitive line B6 (selected by the pepper research group in the College of Horticulture, Northwest A&F University, Yangling, China) were used. Pepper seedlings were cultivated under a condition of 26/20 °C day/night and a 16/8 h day/night photoperiod in a growth chamber till the 6–8 true leaves period for various treatments. For the HS treatment, the seedlings of B6 and R9 with 6–8 leaves were directly placed in the 40 °C light incubator (GXZ-380C, Jiangnan Instrument Factory, Ningbo, China). The leaves of treated seedlings were collected after 0 and 2 h of HS treatment [37, 38]. For analysis of CaHsfA2 time-course expression with HS treatment, after being subjected to 40 °C for 0, 0.5, 1, 4, 6 h, and recovered at room temperature (26/20 °C day/night) for 2, 4, 24, 48 h after HS treatment for 6 h, the leaves of B6 and R9 were collected, respectively (Fig. 8a). For other treatments, R9 seedlings were treated with salt stress (300 mM NaCl for 6 h), osmotic stress (5 % mannitol for 6 h), abscisic acid (ABA, 100 μM for 3 h), methyl jasmonate (MeJA , 100 μM for 6 h), CaCl2 (15 mM for 6 h ) and putrescine (Put, 1.5 mM for 6 h). MeJA was dissolved in 10 % ethanol, other substances were dissolved in water, and control seedlings were treated with 10 % ethanol (for MeJA treatment) or water (for NaCl, mannitol, CaCl2, Put and ABA treatment). The roots and stems were collected from seedlings treated with NaCl and mannitol, and the leaves were sampled for other treatments. The samples were frozen in liquid nitrogen and stored at −80 °C for RNA extraction. Each treatment was conducted with three biological replicates, and samples from five seedlings were gathered for each replicate.
RNA extraction and quantitative real-time PCR analysis
Total RNA were isolated according to the instruction of Total RNA kit (BioTeke, Beijing, China), and the cDNA was synthesized according to the manufacturer’s instructions (Takara, Dalian, China) and was diluted to 50 ng/μL with ddH2O. For quantitative real-time PCR (qRT-PCR) assay, primer pairs (Additional file 7: Table S3) for pepper Hsf genes were designed at the C-terminal domain by Primer Premier 5.0, and their specificity was checked by NCBI Primer BLAST (http://www.ncbi.nlm.nih.gov/tools/primer-blast/index.cgi?LINK_LOC=BlastHome). Ubiquitin binding protein gene UBI-3 from pepper was used as the reference gene . The qRT-PCR reactions were carried out on the iQ5.0 Bio-Rad iCycler thermocycler (Bio-Rad, Hercules, CA, USA) using SYBR Green Supermix (Takara, Dalian, China). The 20 μL reaction system contained 10 μL SYBR Green Supermix (2×), 2 μL cDNA template (50 ng/μL), 0.8 μL of each primer (10 μM) and 6.4 μL ddH2O. The qRT-PCR reaction systems were as follows: pre-denaturation at 95 °C for 1 min, followed by 40 cycles of denaturation at 95 °C for 10 s, annealing at 56 °C for 30 s, extension at 72 °C for 30 s. The fluorescent signal was measured at the end of each cycle, and the melting curve analysis was performed with heating the PCR product from 56 °C to 95 °C for verifying the specificity of the primers. Three independent biological replicates were carried out and qPCR of each replicate was performed in triplicate. The relative expression levels of pepper Hsf genes were calculated as -ΔΔCT method .
Pepper Hsf genes tissue-specific expression analysis
Based on the pepper genes against RNA-seq data of each tissue , we chose the data of 17 tissues and developmental stages including root, stem, leaf, pericarp and placenta at 6, 16, 25 days post-anthesis (PC-6DPA, −16DPA and -25DPA, PL-6DPA, −16DPA and -25DPA ), pericarp and placenta at mature green (PC-MG, PL-MG) and at breaker (PC-B, PL-B), pericarp and placenta at 5 and 10 days post-breaker (PC-B5 and -B10, PL-B5 and -B10) from CM334 for the CaHsf genes tissue- and stage-specific expression analysis. The data of the 22 pepper Hsf genes (excluding CaHsfA1e, B3b and B4 whose date are absent) were used for creating a heat map using HemI (The Cuckoo Workgroup, Wuhan, China).
Subcellular localization of CaHsfA2
Based on our previous study , the ORF of CaHsfA2 without the termination codon was prepared by PCR using cDNA from R9 leaves treated with 40 °C for 2 h as the template and a primer pair (forward primer 5'-GCTCTAGATCCATCTTAATTGTATTTAGCGAC-3' and reverse primer 5'-CGGGGTACCAAGGAAACCAAGTTGATCTACAAG-3'). The underlined nucleotides contained the XbaI and KpnI restriction site, respectively. The PCR-amplified CaHsfA2 fragment was fused to the pBI221 expression vector and the CaMV35S::GFP (pBI221) vector without CaHsfA2 was used as a control. For transient expression analysis, 5 μg of CaMV35S::CaHsfA2-GFP plasmid or CaMV35S::GFP plasmid was introduced into the onion (Allium cepa) epidermal cells using a Bio-Rad He/1000 particle delivery system (Bio-Rad, Hercules, CA, USA). Bombarded cells were incubated at 28 °C for 24 h on 1× MS (Murashige and Skoog) agar medium, and then green fluorescent protein (GFP) fluorescence was observed using an A1R confocal-laser scanning microscope (Nikon, Tokyo, Japan).
Analysis of transcriptional activation of CaHsfA2 in yeast cells
Transcriptional activation of CaHsfA2 assay was performed in the yeast strain AH109 with LacZ and His reporter genes. The complete coding sequence of CaHsfA2 was amplified by PCR using primers (forward primer 5'-TCCATCTTAATTGTATTTAGCGAC-3' and reverse primer 5'-AGGGAATGATAGAGTCGTGGG-3'). The PCR products were cloned into pMD19-T vector (Takara, Dalian, China), and the confirmed fragment (SmaI site of pMD19-T vector linked with the 5' untranslated region of CaHsfA2 and PstI site linked with the 3' untranslated region) was cut with restriction enzymes SmaI and PstI from pMD19-T vector, and then linked into SmaI/PstI sites in the pGBKT7 vector to create pGBKT7-CaHsfA2. The constructs of pGBKT7-CaHsfA2 and the negative control pGBKT7 were cloned into AH109 yeast strain, respectively, according to the manufacturer’s protocol (Clontech, Palo Alto, CA, USA). Transformed strains were confirmed by PCR and sequencing and then plated on SD/Trp- or SD/Trp-Ade-His- plates. Transcription activation was evaluated according to the growth status of yeast cells after incubating plates at 30 °C for 3 d with 5-bromo-4-chloro-3-indoxyl-α-D-galacto-pyranoside (X-α-gal).
Availability of supporting data
The phylogenetic data in this publication are available for download at TreeBase with accession number 17530 (http://purl.org/phylo/treebase/phylows/study/TB2:S17530). The RNA-seq data were available from genome of pepper cultivar CM334 (http://www.nature.com/ng/journal/v46/n3/full/ng.2877.html#supplementary-information) .
Heat shock factors
Quantitative real-time PCR
Heat shock proteins
Heat shock elements
Nuclear localisation signal
Nuclear export signal
Hidden Markov Model
Calcium/calmodulin-binding protein kinase
Plant Genome Duplication Database
Green fluorescent protein
Plant Transcription Factor Database
National Center for Biotechnology Information
Scharf KD, Berberich T, Ebersberger I, Nover L. The plant heat stress transcription factor (Hsf) family: structure, function and evolution. Biochim Biophys Acta. 2012;1819(2):104–19.
Wardlaw IF, Wrigley CW. Heat tolerance in temperate cereals: an overview. Aust J Plant Physiol. 1994;21(6):695–703.
Skylas DJ, Cordwell SJ, Hains PG, Larsen MR, Basseal DJ, Walsh BJ, et al. Heat shock of wheat during grain filing: proteins associated with heat-tolerance. J Cereal Sci. 2002;35(2):175–88.
Bar-Tsur A, Rudich J, Bravdo B. High temperature effects on CO2 gas exchange in heat-tolerant and sensitive tomatoes. J Am Soc Hort Sci. 1985;110:582–6.
Charng YY, Liu HC, Liu NY, Chi WT, Wang CN, Chang SH, et al. A heat-inducible transcription factor, HsfA2, is required for extension of acquired thermotolerance in Arabidopsis. Plant Physiol. 2007;143(1):251–62.
Mittler R, Finka A, Goloubinoff P. How do plants feel the heat? Trends Biochem Sci. 2012;37(3):118–25.
Sung DY, Kaplan F, Lee KJ, Guy CL. Acquired tolerance to temperature extremes. Trends Plant Sci. 2003;8(4):179–87.
Lin YX, Jiang HY, Chu ZX, Tang XL, Zhu SW, Cheng BJ. Genome-wide identification, classification and analysis of heat shock transcription factor family in maize. BMC Genomics. 2011;12:76.
Parsell DA, Lindquist S. The function of heat-shock proteins in stress tolerance: degradation and reactivation of damaged proteins. Annu Rev Genet. 1993;27:437–96.
Bienz M, Pelham HRB. Mechanisms of heat-shock gene activation in higher eukaryotes. Adv Genet. 1987;24:31–72.
Xiao H, Perisic O, Lis JT. Cooperative binding of Drosophila heat shock factor to arrays of a conserved 5 bp unit. Cell. 1991;64(3):585–93.
Kumar M, Busch W, Birke H, Kemmerling B, Nürnberger T, Schöffl F, et al. Heat shock factors HsfB1 and HsfB2b are involved in the regulation of Pdf1.2 expression and pathogen resistance in Arabidopsis. Mol Plant. 2009;2:152–65.
Hahn A, Bublak D, Schleiff E, Scharf KD. Crosstalk between Hsp90 and Hsp70 chaperones and heat stress transcription factors in tomato. Plant Cell. 2011;23(2):741–55.
Pérez-Salamó I, Papdi C, Rigó G, Zsigmond L, Vilela B, Lumbreras V, et al. The heat shock factor A4A confers salt tolerance and is regulated by oxidative stress and the mitogen-activated protein kinases MPK3 and MPK6. Plant Physiol. 2014;165(1):319–34.
Baniwal SK, Bharti K, Chan KY, Fauth M, Ganguli A, Kotak S, et al. Heat stress response in plants: a complex game with chaperones and more than twenty heat stress transcription factors. J Biosci. 2004;29(4):471–87.
Harrison CJ, Bohm AA, Nelson HC. Crystal structure of the DNA binding domain of the heat shock transcription factor. Science. 1994;263(5144):224–7.
Vuister GW, Kim SJ, Wu C, Bax A. NMR evidence for similaritie s between the DNA-bindi ng regions of Drosophila melanogaster heat shock factor and the helix– turn– helix and HNF-3/forkhead families of transc ription factors. Biochemistry. 1994;33(1):10–6.
Schultheiss J, Kunert O, Gase U, Scharf KD, Nover L, Rüterjans H. Solution structur e of the DNA-binding domain of the tomato heat-stress transcription factor HSF24. Eur J Biochem. 1996;236(3):911–21.
Peteranderl R, Rabenstein M, Shin YK, Liu CW, Wemmer DE, King DS, et al. Biochemical and biophysical characterization of the trimerization domain from the heat shock transcription factor. Biochemistry. 1999;38(12):3559–69.
Nover L, Bharti K, Döring P, Mishra SK, Ganguli A, Scharf KD. Arabidopsis and the heat stress transcription factor world: how many heat stress transcription factors do we need? Cell Stress Chaperones. 2001;6(3):177–89.
Nover L, Scharf KD, Gagliardi D, Vergne P, Czarnecka-Verner E, Gurley WB. The Hsf world: classification and properties of plant heat stress transcription factors. Cell Stress Chaperones. 1996;1(4):215–23.
Lyck R, Harmening U, Höhfeld I, Treuter E, Scharf KD, Nover L. Intracellular distribution and identification of the nuclear localization signals of two plant heat-stress transcription factors. Planta. 1997;202(1):117–25.
Heerklotz D, Döring P, Bonzelius F, Winkelhaus S, Nover L. The balance of nuclear import and export determines the intracellular distribution and function of tomato heat stress transcription factor HsfA2. Mol Cell Biol. 2001;21(5):1759–68.
Kotak S, Port M, Ganguli A, Bicker F, von Koskull-Döring P. Characterization of C-terminal domains of Arabidopsis heat stress transcription factors (Hsfs) and identification of a new signature combination of plant class A Hsfs with AHA and NES motifs essential for activator function and intracellular localization. Plant J. 2004;39(1):98–112.
Akerfelt M, Morimoto RI, Sistonen L. Heat shock factors: integrators of cell stress, development and lifespan. Nat Rev Mol Cell Biol. 2010;11(8):545–55.
Mishra SK, Tripp J, Winkelhaus S, Tschiersch B, Theres K, Nover L, et al. In the complex family of heat stress transcription factors, HsfA1 has a unique role as master regulator of thermotolerance in tomato. Genes Dev. 2002;16(12):1555–67.
Czarnecka-Verner E, Pan S, Salem T, Gurley WB. Plant class B HSFs inhibit transcription and exhibit affinity for TFIIB and TBP. Plant Mol Biol. 2004;56(1):57–75.
Ikeda M, Ohme-Takagi M. A novel group of transcriptional repressors in Arabidopsis. Plant Cell Physiol. 2009;50(5):970–5.
Ikeda M, Mitsuda N, Ohme-Takagi M. Arabidopsis HsfB1 and HsfB2b act as repressors for the expression of heat-inducible Hsfs but positively regulate the acquired thermotolerance. Plant Physiol. 2011;157(3):1243–54.
Zhu X, Thalor SK, Takahashi Y, Berberich T, Kusano T. An inhibitory effect of the sequence-conserved upstream open-reading frame on the translation of the main open-reading frame of HsfB1 transcripts in Arabidopsis. Plant, Cell Environ. 2012;35(11):2014–30.
Chan-Schaminet KY, Baniwal SK, Bublak D, Nover L, Scharf KD. Specific interaction between tomato HsfA1 and HsfA2 creates hetero-oligomeric superactivator complexes for synergistic activation of heat stress gene expression. J Biol Chem. 2009;284(31):20848–57.
Kotak S, Vierling E, Bäumlein H, von Koskull-Döring P. A novel transcriptional cascade regulating expression of heat stress proteins during seed development of Arabidopsis. Plant Cell. 2007;19(1):182–95.
Ogawa D, Yamaguchi K, Nishiuchi T. High-level overexpression of the Arabidopsis HsfA2 gene confers not only increased themotolerance but also salt/osmotic stress tolerance and enhanced callus growth. J Exp Bot. 2007;58(12):3373–83.
Zhang L, Li Y, Xing D, Gao C. Characterization of mitochondrial dynami cs and subcellul ar localization of ROS reveal that HsfA2 alleviates oxidative damage caused by heat stress in Arabidopsis. J Exp Bot. 2009;60(7):2073–91.
Banti V, Mafessoni F, Loreti E, Alpi A, Perata P. The heat-inducible transcription factor HsfA2 enhances anoxia tolerance in Arabidopsis. Plant Physiol. 2010;152(3):1471–83.
Giorno F, Wolters-Arts M, Grillo S, Scharf KD, Vriezen WH, Mariani C. Developmental and heat stress-regulated expression of HsfA2 and small heat shock proteins in tomato anthers. J Exp Bot. 2010;61(2):453–62.
Guo M, Zhai YF, Lu JP, Chai L, Chai WG, Gong ZH, et al. Characterization of CaHsp70-1, a pepper heat-shock protein gene in response to heat stress and some regulation exogenous substances in Capsicum annuum L. Int J Mol Sci. 2014;15(11):19741–59.
Guo M, Yin YX, Ji JJ, Ma BP, Lu MH, Gong ZH. Cloning and expression analysis of heat-shock transcription factor gene CaHsfA2 from pepper (Capsicum annuum L.). Genet Mol Res. 2014;13(1):1865–75.
Guo J, Wu J, Ji Q, Wang C, Luo L, Yuan Y, et al. Genome-wide analysis of heat shock transcription factor families in rice and Arabidopsis. J Genet Genomics. 2008;35(2):105–18.
Song X, Liu G, Duan W, Liu T, Huang Z, Ren J, et al. Genome-wide identification, classification and expression analysis of the heat shock transcription factor family in Chinese cabbage. Mol Genet Genomics. 2014;289(4):541–51.
Xue GP, Sadat S, Drenth J, McIntyre CL. The heat shock factor family from Triticum aestivum in response to heat and other major abiotic stresses and their role in regulation of heat shock protein genes. J Exp Bot. 2014;65(2):539–57.
Kim S, Park M, Yeom SI, Kim YM, Lee JM, Lee HA, et al. Genome sequence of the hot pepper provides insights into the evolution of pungency in Capsicum species. Nat Genet. 2014;46(3):270–8.
Qin C, Yu C, Shen Y, Fang X, Chen L, Min J, et al. Whole-genome sequencing of cultivated and wild peppers provides insights into Capsicum domestication and specialization. Proc Natl Acad Sci U S A. 2014;111(14):5135–40.
Chen X, Chen Z, Zhao H, Zhao Y, Cheng B, Xiang Y. Genome-wide analysis of soybean HD-Zip gene family and expression profiling under salinity and drought treatments. PLoS One. 2014;9(2), e87156.
Lynch M, Conery JS. The evolutionary fate and consequences of duplicate genes. Science. 2000;290(5494):1151–5.
Fujita M, Fujita Y, Noutoshi Y, Takahashi F, Narusaka Y, Yamaguchi-Shinozaki K, et al. Crosstalk between abiotic and biotic stress responses: a current view from the points of convergence in the stress signaling networks. Curr Opin Plant Biol. 2006;9(4):436–42.
Bechtold U, Albihlal WS, Lawson T, Fryer MJ, Sparrow PA, Richard F, et al. Arabidopsis heat shock transcription factora1b overexpression enhances water productivity, resistance to drought, and infection. J Exp Bot. 2013;64(11):3467–81.
Swindell WR, Huebner M, Weber AP. Transcriptional profiling of Arabidopsis heat shock proteins and transcription factors reveals extensive overlap between heat and non-heat stress response pathways. BMC Genomics. 2007;8:125.
Yang Z, Wang Y, Gao Y, Zhou Y, Zhang E, Hu Y, et al. Adaptive evolution and divergent expression of heat stress transcription factors in grasses. BMC Evol Biol. 2014;14:147.
Giorno F, Guerriero G, Baric S, Mariani C. Heat shock transcriptional factors in Malus domestica: identification, classification and expression analysis. BMC Genomics. 2012;13:639.
Sakurai H, Enoki Y. Novel aspects of heat shock factors: DNA recognition, chromatin modulation and gene expression. FEBS J. 2010;277(20):4140–9.
Bharti K, Schmidt E, Lyck R, Heerklotz D, Bublak D, Scharf KD. Isolation and characterization of HsfA3, a new heat stress transcription factor of Lycopersicon peruvianum. Plant J. 2000;22(4):355–65.
Cannon SB, Mitra A, Baumgarten A, Young ND, May G. The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004;4:10.
Baniwal SK, Chan KY, Scharf KD, Nover L. Role of heat stress transcription factor HsfA5 as specific repressor of HsfA4. J Biol Chem. 2007;282(6):3605–13.
Busch W, Wunderlich M, Schöffl F. Identification of novel heat shock factor dependent genes and biochemical pathways in Arabidopsis thaliana. Plant J. 2005;41(1):1–14.
Liu J, Chen N, Chen F, Cai B, Dal Santo S, Tornielli GB, et al. Genome-wide analysis and expression profile of the bZIP transcription factor gene family in grapevine (Vitis vinifera). BMC Genomics. 2014;15:281.
Bharti K, Von Koskull-Döring P, Bharti S, Kumar P, Tintschl-Körbitzer A, Treuter E, et al. Tomato heat stress transcription factor HsfB1 represents a novel type of general transcription coactivator with a histone-like motif interacting with the plant CREB binding protein ortholog HAC1. Plant Cell. 2004;16(6):1521–35.
Kang GZ, Wang ZX, Sun GC. Physiological mechanism of some exogenous materials on increasing cold-resistance capability of plants. Plant Physiology Comunications. 2002;38:193–7.
Baron KN, Schroeder DF, Stasolla C. Transcriptional response of abscisic acid (ABA) metabolism and transport to cold and heat stress applied at the reproductive stage of development in Arabidopsis thaliana. Plant Sci. 2012;188–189:48–59.
Larkindale J, Huang B. Thermotolerance and antioxidant systems in Agrostis stolonifera: involvement of salicylic acid, abscisic acid, calcium, hydrogen peroxide, and ethylene. J Plant Physiol. 2004;161(4):405–13.
Larkindale J, Knight MR. Protection against heat stress-induced oxidative damage in Arabidopsis involves calcium, abscisic acid, ethylene, and salicylic acid. Plant Physiol. 2002;128(2):682–95.
Clarke SM, Cristescu SM, Miersch O, Harren FJ, Wasternack C, Mur LA. Jasmonates act with salicylic acid to confer basal thermotolerance in Arabidopsis thaliana. New Phytol. 2009;182(1):175–87.
Dang FF, Wang YN, Yu L, Eulgem T, Lai Y, Liu ZQ, et al. CaWRKY40, a WRKY protein of pepper, plays an important role in the regulation of tolerance to heat stress and resistance to Ralstonia solanacearum infection. Plant, Cell Environ. 2013;36(4):757–74.
von Koskull-Döring P, Scharf KD, Nover L. The diversity of plant heat stress transcription factors. Trends Plant Sci. 2007;12(10):452–7.
Kolmos E, Chow BY, Pruneda-Paz JL, Kay SA. HsfB2b-mediated repression of PRR7 directs abiotic stress responses of the circadian clock. Proc Natl Acad Sci U S A. 2014;111(45):16172–7.
Pérez-Rodríguez P, Riaño-Pachón DM, Corrêa LG, Rensing SA, Kersten B, Mueller-Roeber B. PlnTFDB: updated content and new features of the plant transcription factor database. Nucleic Acids Res. 2010;38(Database issue):D822–827.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
Guo AY, Zhu QH, Chen X, Luo JC. GSDS: a gene structure display server. Yi Chuan. 2007;29(8):1023–6.
Liu RH, Meng JL. Mapdraw: a Microsoft excel macro for drawing genetic linkage maps based on given genetic linkage data. HEREDITAS (BEIJING). 2003;25(3):317–21.
Remm M, Storm CE, Sonnhammer EL. Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. J Mol Biol. 2001;314(5):1041–52.
Lee I, Ambaru B, Thakkar P, Marcotte EM, Rhee SY. Rational association of genes with traits using a genome-scale gene network for Arabidopsis thaliana. Nat Biotechnol. 2010;28(2):149–56.
Wan H, Yuan W, Ruan M, Ye Q, Wang R, Li Z, et al. Identification of reference genes for reverse transcription quantitative real-time PCR normalization in pepper (Capsicum annuum L.). Biochem Biophys Res Commun. 2011;416(1–2):24–30.
Schmittgen TD, Livak KJ. Analyzing real-time PCR data by the comparative C(T) method. Nat Protoc. 2008;3(6):1101–8.
This work was supported by the National Natural Science Foundation of China (Grant No. 31272163), Zhejiang Province Key Scientific and Technological Project (Grant No. 2011C02001), the Shaanxi Provincial Science and Technology Coordinating Innovative Engineering Project (Grant No. 2012KTCL02-09), the Shaanxi Agriculture Science and Technology Projects (Grant No. 2014 K01-14-01), the Opening Fund of Key Laboratory for Crop Biotechnology of Xinjiang Uygur Autonomous Region (Grant No. XJYS0302-2014-03).
The authors declare that they have no competing interests.
MG, ZHG and MHL contributed to the experimental design manuscript drafting. MG, JPL and YFZ performed the research. MG and MHL performed bioinformatics analysis. WGC, ZHG and MHL contributed reagents/materials/analysis tools. MG wrote the article. All authors read and approved the final manuscript.
Multiple sequence alignment of the DBD domains of 25 members of the Hsf protein family in pepper. The definition of the Hsf names corresponded to the order of alignment. The multiple alignment result clearly shows the highly conserved DBD domains among pepper Hsf genes. The 3D structure and the secondary elements (α1-β1-β2-α2-T-α3-β3-β4) are shown above the alignment. T: turn of helix-turn-helix motif.
Phylogenetic analysis (A) and exon/intron organizations (B) of pepper Hsf genes. Numbers above or below branches in (A) indicates bootstrap values. Differently colored lines shows genes in each subclass. Numbers 0, 1 and 2 in (B) represent introns in phases 0, 1 and 2, respectively.
Chromosomal mapping of pepper Hsf gene family. Chromosomal mapping was based on the physical position (Mb) in 11 out of 12 pepper chromosomes. The chromosome numbers are indicated at the top of each chromosome. Chromosomal positions of the pepper Hsf genes are indicated by gene names. Blue and gray dotted lines connect the CaHsf genes present duplicate chromosomal segments.
Divergence between CaHsf genes pairs in pepper.
The predicted protein–protein interaction network of CaHsfs.
Primers for amplifying the different sequences between CM334 and Zunla-1 genome among CaHsf members.
Primer sequences used for quantitative real-time PCR analysis.
About this article
Cite this article
Guo, M., Lu, JP., Zhai, YF. et al. Genome-wide analysis, expression profile of heat shock factor gene family (CaHsfs) and characterisation of CaHsfA2 in pepper (Capsicum annuum L.). BMC Plant Biol 15, 151 (2015). https://doi.org/10.1186/s12870-015-0512-7
- Identification of CaHsfs family
- Abiotic stress
- Gene expression