- Research article
- Open Access
Comparative analysis of B-BOX genes and their expression pattern analysis under various treatments in Dendrobium officinale
BMC Plant Biologyvolume 19, Article number: 245 (2019)
Studies have demonstrated that BBX (B-BOX) genes play crucial roles in regulatory networks controlling plant growth, developmental processes and stress response. Nevertheless, comprehensive study of BBX genes in orchids (Orchidaceae) is not well studied. The newly released genome sequences of Dendrobium officinale and Phalaenopsis equestris have allowed a systematic analysis of these important BBX genes in orchids.
Here we identified 19 (DoBBX01–19) and 16 (PeBBX01–16) BBX genes from D. officinale and P. equestris, respectively, and clustered into five clades (I-V) according to phylogenetic analysis. Thirteen orthologous, two DoBBXs paralogous and two PeBBXs paralogous gene pairs were validated. This gene family mainly underwent purifying selection, but five domains experienced positive selection during evolution. Noteworthy, the expression patterns of root, root_tips, stem, leaf, speal, column, lip, and flower_buds revealed that they might contribution to the formation of these tissues. According to the cis-regulatory elements analysis of BBX genes, qRT-PCR experiments were carried out using D. officinale PLBs (protocorm-like bodies) and displayed that these BBX genes were differentially regulated under AgNO3, MeJA (Methyl Jasmonate), ABA (abscisic acid) and SA (salicylic acid) treatments.
Our analysis exposed that DoBBX genes play significant roles in plant growth and development, and response to different environmental stress conditions of D. officinale, which provide aid in the selection of appropriate candidate genes for further functional characterization of BBX genes in plants.
Zinc-finger proteins play important roles in regulate of plant growth development, and biotic and abiotic stress [1, 2]. Among them, the B-BOX zinc finger protein subfamily contains one or more B-BOX domains, which are composed of the conserved Cysteine (C) and Histidine (H), and stabilize its unique tertiary structure by binding with Zn-ions [3, 4]. The B-BOX domain might be involved in the interaction between zinc-finger proteins and other proteins [3, 5]. More than 1500 proteins, which containing B-BOX domain, were found in eukaryotes. Most of the B-BOX domains in animal cells were conjugated with the RING finger domain and coiled-coiled domain to form trivalent structural proteins or RBCC. In RBCC complex-mediated protein ubiquitination degradation pathway, B-BOX domain might be involved in substrate recognition process . The B-BOX domain can be divided into two types, namely [C3(C/H)H2](BOX1(C-X2-X-X6–7-C-X2-X4–8-C-X2–3-C/H-X3–4-H-X5–10-H and [CHC(D/C)C2H2](B-box2(C-X2–4-H-X7–10-C-X1–4-D/C4–7-C-X2-C-X3–6-H-X2–5-H). Both of these two domains contain 7 or 8 conserved cysteines (C) and histidine (H) residues and together with two Zn-atoms to form a RING-like fold [7, 8].
In Arabidopsis thaliana, Khanna et al. (2009) identified 32 proteins containing B-BOX domain in N-terminal, named BBX1–32 . As compared to animals, the B-BOX protein of A. thaliana had at least one B-BOX domains that interact with the Zn atom through the aspartic acid (Asp) residue [7, 9]. Although the B-BOX domain was considered to be involved in protein interactions, the function of B-BOX proteins in plants is not well clear. The B-BOX proteins STH3 (SALT TOLERANCE HOMOLOG03) and BBX22 of A. thaliana could interact with two key regulators of the light signaling pathway, HY5 and COPI, to regulate plant light-dependent developmental processes [7, 9,10,11]. In addition, STH3, BBX22 could be degraded by ubiquitination of COPI in vitro [7, 9]. The eight genes encoding double B-BOX domain proteins in A. thaliana were studied. It was proved that the expression of five BBX genes is regulated by the circadian rhythm . Overexpression of OsBBX25 gene in A. thaliana could enhance salt and drought tolerance of A. thaliana plants, and the expression of KIN1, RD29A and COR15 in transgenic plants is up-regulated under salt stress. OsBBX25 might regulate the expression of stress response-related genes as a cofactor of transcriptional regulation, and then participate in plant response to abiotic stress [12, 13].
Orchids occupy about 10% of flowering plants, almost all over the world . D. officinale and P. equestris belong to epiphytic orchids, of which D. officinale is also a valuable Chinese herbal medicine plant. The B-BOX gene family has been identified in several plants, such as Pyrus bretschneideri, O. sativa, and A. thaliana [4, 12, 15]. In addition, many studies have confirmed that B-BOX genes play key roles in plant growth and development, and responses to abiotic and biotic stresses. The draft of the D. officinale and P. equestris genome sequences were reported recently, respectively [16,17,18]. To further understand the B-BOX gene family in orchids, we respectively identified all 19 and 16 BBX members in Dendrobium officinale and Phalaenopsis equestris, and analyzed their phylogenetic relationships, gene structures, cis-regulatory elements, tissue expression patterns, as well as expression profiles under AgNO3, MeJA (Methyl Jasmonate), ABA (abscisic acid) and SA (salicylic acid) treatments. Our study will facilitate further functional studies of specific genes in the BBX family.
The BBX gene family members in D. officinale and P. equestris
A total of 24 and 22 B-BOX (BBX) coding protein sequences were identified in D. officinale and P. equestris, respectively, with the BLASTP program and HMMER software. However, some sequences not contained B-BOX domain (Fig. 1), five and six sequences from D. officinale and P. equestris, respectively, were excluded in this study. According to their scaffold positions, these genes were named DoBBX01-DoBBX19 and PeBBX01-PeBBX16, respectively. The detailed information (gene name, gene identifiers, scaffold position, molecular weight, and theoretical isoelectric point) of each BBX was presented in Table 1. As shown in Table 1, the BBX genes showed large differences in terms of their theoretical isoelectric points and molecular weights. In D. officinale, the molecular weights ranging from 14,648.51 Da (DoBBX14) to 61,659.92 Da (DoBBX05) with an average molecular weight of 34,200.53 Da, with theoretical isoelectric points from 4.74 (DoBBX15) to 8.4 (DoBBX02). In P. equestris, the smallest and largest molecular weights were respectively 7492.76 Da (PeBBX14) and 49,942.19 Da (PeBBX06). Furthermore, the theoretical isoelectric points of these proteins ranged from 4.35 (PeBBX11) to 8.61 (PeBBX05), with an average of 6.24.
Protein domain(s) and phylogenetic analysis
The domains logos of the CCT domain and B-BOX domains (including B-BOX1 and B-BOX2 domain) of these BBX proteins were generated and were shown in Fig. 1 and Fig. 2. Among the 35 BBXs, we scanned that six genes had two conserved B-BOX domains and a CCT domain. Six and six proteins contained only one B-BOX domain, or one B-BOX plus a CCT domain, respectively, while the most numbers of BBX having two B-BOX domains. Among the three domains, we found that some conserved amino acid residues were similar, but not exactly the same. In the B-BOX domains, there are 5 conserved Cys residues in the Cys-X-X-Cys motifs, of which 4 are absolutely conserved. Additionally, there are several other conserved amino acid residues, such as two His, Asp, Ala, and Asn, as shown in Fig. 1. In the CCT domain, the consensus sequence is R-XXXXX-R-Y-X-E-K-XXX-R-XXX-K-XX-R-Y-XX-R-K-XX-A-XX-R-X-R-X-K-G-R-F-X-K.
To analyze the evolutionary relationships and divergences of the BBX genes, the phylogenic tree, including BBX genes from Populus trichocarpa, A. thaliana, O. sativa, D. officinale and P. equestris, were generated (Fig. 2a). All sequences could be clustered into five clades and named as clade I–V in the phylogenetic tree following the previous articles [4, 15]. The BBX genes in clade I, clade II and clade III contained additional CCT domain, with one B-BOX domain in clade III, as well as two B-BOX domains in clades I and II. The remaining clade IV and clade V contained two and one B-BOX domain(s), respectively, but not have CCT domain. To further determine the evolutionary relationship of the B-BOX domain in the plant genomes (Fig. 3), 54 plant genomes covering angiosperms, gymnosperms, mosses and green algae were analyzed to generate maximum likelihood (ML) tree by using FastTree software . Based on the phylogenetic analysis (Fig. 3b), a hypothesis evolutionary relationship of the B-BOX domain was proposed in this study (Fig. 2c). The early BBX sequences of the plant genomes originally contained only one B-BOX domain, and then the B-BOX domain had a duplication event during the evolution process, which was consistent with the fact that most of the green algae (i.e. Chlamydomonas reinhardtii) only had a single B-BOX domain. Remarkably, the CrBBX1 from an alga contained double B-BOX domain indicated that the first B-BOX duplication event occurred before green plants colonized the land . A deletion event of the B-BOX domain in early BBX sequences belonging to the clade IV rises to a BBX sequence with a single B-BOX domain, which is a characteristic of the clade V. Later, the CCT domain was added at the C-terminus to generate a BBX protein with a double B-BOX and a CCT domain, which were early BBX member and belong to clade II. Among the clade II, some BBX members had a deletion event of the B-Box 2 domain resulted in the BBX proteins containing only one B-BOX and CCT domain (i.e. clade III). A duplication event of the B-Box 1 domain of an early BBX protein belonging to the clade II could have been a BBX precursor of clade I, generating BBX sequences with two B-BOX domains and one CCT domain. This proposed hypothesis was supported by similarities and differences in these domain sequences (Fig. 2c and Fig. 3b). For example, the B-Box 2 domain in clade I had a great difference compared with the sequence of the clade II and IV. These changes in the CCT and B-BOX domains resulted in the origin of different clades, which appeared in the early stages of plant genome evolution with retained the biological functions of the B-BOX domain.
Evolutionary patterns and gene structure analysis
To further understand the evolutionary patterns of BBX genes, we performed the analysis of orthologous and paralogous relationships among D. officinale and P. equestris genome. In the present study, two paralogs (Do-Do) in D. officinale, two paralogs (Pe-Pe) in P. equestris, and thirteen orthologs (Do-Pe) between D. officinale and P. equestris were identified by using the OrthoMCL software (Fig. 4a). Previous studies have shown that orchids share a genome-wide duplication event (the value of Ks is approximately equal to one) . Subsequently, the Ka, Ks, and Ka/Ks of all homologous gene pairs were calculated (Additional file 1: Table S1). The Ks value of PeBBX01-PeBBX07 and DoBBX07-DoBBX18 was 1.0658 and 1.4562, respectively, indicating these gene pairs were derived from genome-wide duplication events by shared D. officinale and P. equestris. The Ks values for the other paralogs, including PeBBX08-PeBBX14 and DoBBX10-DoBBX11, were 0.0433 and 0.0827, respectively, suggesting that they are derived from the ancient duplication events. According to the Ka/Ks value distributions (Fig. 4b and c) the homologous could divide into three classes. Nine homologous gene pairs had Ka/Ks values below 0.3, seven homologous gene pairs had Ka/Ks values between 0.3–1, and the remaining one gene pair (DoBBX05-PeBBX08) had ratios greater than 1. These data indicated that the most of BBX homologous gene pairs had undergone strong purifying selection during evolution. To further insight into the Ka/Ks value of each gene pair, we performed sliding-window analysis among all homologous pairs (Additional file 1: Figure S1). Based on this analysis, the majority of coding regions had Ka/Ks values were far below 1, but one or several distinct peaks (i.e. Ka/Ks value is greater than 1). The domains of the majority of BBXs commonly contained lower Ka/Ks values than the regions outside of them (i.e. peaks), which consistent with functional constraints being dominant in these domains. Combining with the above analysis, this BBX gene family mainly underwent purifying selection during evolution in D. officinale and P. equestris genome.
The evolution of multigene families could be driven by structural diversity . In the present study, we constructed the exon-intron organization maps to survey the structural diversity of BBX genes (Additional file 1: Figure S2 and Figure S3). The 35 members of BBX gene family contained a variable number of exons, ranging from 1 to 7. Furthermore, we found that two BBX genes contained seven exons, two genes contained five exons and six genes contained four exons, seven genes contained three exons and ten genes contained two exons, while remaining eight genes only contained one exon. This phenomenon suggested that the BBX gene family has undergone both exon gain and loss during evolution, which might be able to further explain the functional differences of closely related BBX homologous genes. Subsequent the exon-intron structure of the BBX homologous gene pairs were further analyzed. Out of 17 gene pairs, the number of exons in the fifteen gene pairs had changed (Additional file 1: Figure S3), such as DoBBX02-PeBBX13, DoBBX10-DoBBX11 and PeBBX08-PeBBX14. These divergences might be the result of single intron loss or gain events during evolution.
Cis-acting element analysis
Cis-regulatory elements play critical roles in regulatory networks controlling plant growth and development, including multi stimulus-responsive genes, and determining the tissue-specific or stress-responsive expression profiles of genes were closely associated with cis-elements in their promoter regions. Using the PlantCARE database, we identified three category cis-elements, including plant growth and development, biotic and abiotic stress responses and phytohormone responses in the promoter regions (Fig. 5). In growth and development category, cis-acting elements were found extensively in the promoter regions, including Skn-1-motif and GCN4_motif required for endosperm expression, CAT-box and CCGTCC-box for meristem expression, O2-site for zein metabolism regulation, MRE and Box 4for light responsiveness, and other cis-acting elements. Among these cis-acting elements, 98 Skn-1-motifs were identified and these motifs comprised the largest portion of the first category (Fig. 5). In the phytohormone responsive category, the TGA-element and AuxRR-core for auxin responsive, GARE-motif, P-box and TATC-box for gibberellin-responsive elements, and ERE for ethylene responsive were identified in eighteen BBX genes at most. Notably, the most common motif was the TGACG-motif cis-acting elements associated with MeJA-responsiveness, accounting for 29% of the scanned hormone responsive motifs (Fig. 5). Followed by ABRE cis-acting element, which was related to ABA; it accounted for 23% and appeared 32 times. In the last category, various stresses-related elements, such as ARE (anaerobic induction), Box-W1 (fungal elicitors), HSE (heat stress), TC-rich repeats (stress responses) and GC-motif (anoxia), were observed. Our data suggested that BBX genes might respond to abiotic stresses and had the potential to improve abiotic stress responses.
The organ-specific expression profiling of D. officinale BBX genes
To further understand the dynamic gene expression of BBX gene family members in D. officinale, we performed the overall analysis of gene expression profiles in eight tissues (root, root_tips, stem, leaf, speal, column, lip and flower_buds). Based on the expression pattern in eight tissues, these DoBBX genes exhibited distinct organ-specific expression, and further divided into three groups (Fig. 6a). In the group A, five genes (DoBBX02, − 03, − 07, − 08 and − 12) presented modest overall expression in all eight organs, suggesting that these DoBBX genes may play important roles in the formation of these tissues. Out of 19 genes, eight BBXs were classified into group B due to they were basically not expressed in these eight tissues. In group C, the remaining six genes shared similar lower expression in these tissues. Remarkably, not all homologous gene pairs exhibit similar patterns of expression, such as DoBBX07 had highest transcript abundance in the stem and/or leaf, but the expression of its paralog, DoBBX18, was lowest in leaf and highest in root and/or root_tips. Additionally, several genes that were highly expressed in the flower organs have also been identified, including DoBBX02, DoBBX07, DoBBX08 and DoBBX12 (Fig. 6b and c). These highest expression profiles of DoBBXs suggested that these genes might be indirectly or directly involved in the development and/or formation of reproductive organs. These results were also confirmed in AtBBXs (i.e. Arabidopsis BBX genes).
Regulation of the expression of D. officinale BBX genes by abiotic stresses
A variety of abiotic and biotic stresses could affect a plant’s health and growth, and finally influence the regulation of a series of stress-related genes . Therefore, it is very important to clarify the regulatory pathways and master regulators of stress responses in D. officinale. To better understand the stress responses involving the D. officinale BBX genes, the qRT-PCR experiments were used to analysis their expression under AgNO3, MeJA (Methyl Jasmonate), ABA (abscisic acid) and SA (salicylic acid) treatments.
In the ABA treatment, eight DoBBX genes were up-regulated to different degrees by ABA treatment (Fig. 7). The exception was DoBBX09, which was obviously significantly and rapidly down-regulated at all time points. Among these DoBBXs, we found that the highest expression levels of DoBBX01, DoBBX03, DoBBX05, DoBBX07, DoBBX08, DoBBX10, DoBBX11, DoBBX13, DoBBX16, DoBBX17, DoBBX18 and DoBBX19 occurred 96 h after treatment: DoBBX17 and DoBBX11 were strongly up-regulated (by more than 1700-fold and 100-fold, respectively). The expressions of seven DoBBX genes (DoBBX02, DoBBX04, DoBBX06, DoBBX12, DoBBX14 and DoBBX15) peaked at 48 h; DoBBX06 and DoBBX14 showed the greatest up-regulation by more than10-fold. Additionally, we found that the paralogous gene pairs contained similar expression patterns. For example, both DoBBX07-DoBBX18 and DoBBX10-DoBBX11 presented the same trend after at 48 h, with their highest levels at 97 h under ABA treatment. In the SA treatment, eleven DoBBX genes presented increased expression levels to different degrees (Fig. 8). Four of nineteen DoBBX genes were significantly up-regulated at the last time point (96 h), such as DoBBX17 were up-regulated by more than 20-fold. Under AgNO3 treatment, fifteen DoBBX genes were obviously rapidly and significantly down-regulated at all time points. The remaining four DoBBX genes had highest expression levels at 48 h under AgNO3 treatment (Fig. 9). At MeJA treatment, the expression levels of DoBBX04, DoBBX06, DoBBX10, DoBBX11, DoBBX12, DoBBX13, DoBBX15 and DoBBX17 were strongly up-regulated at 24 h, such as DoBBX16 was up-regulated by more than 10-fold. The DoBBX03 and DoBBX19 were up-regulated at 72 h (by more than 1.5-fold and 10-fold, respectively), while the DoBBX17 was significantly up-regulated at 48 h (by more than 65-fold) under MeJA treatment (Fig. 10). The remaining DoBBX genes were down-regulated throughout the entire experimental period, including DoBBX01, DoBBX02, DoBBX07, DoBBX08, DoBBX09, and DoBBX14.
Most transcription factors are specific to plants and play an important role for plant growth and development [4, 12, 15]. As a class of transcription factors, members of the BBX family encode proteins have B-BOX domain(s), and some of them contain additional CTT domain. In this study, we characterized and identified 19 and 16 B-BOX genes in D. officinale and P. equestris, respectively, which was much lower than that from other plants, i.e., 64 for Malus domestica , 25 for Pyrus bretschneideri , 29 for Solanum lycopersicum , 32 for A. thaliana , 30 for O. sativa . The reason for this difference might be the variable state of paralogous genes in these genomes. For example, only two DoBBXs paralogous and two PeBBXs paralogous gene pairs were identified in this study, but 18 OsBBX paralogous and 12 SlBBX paralogous gene pairs from segmental duplication events were found in O. sativa and S. lycopersicum, respectively [12, 25]. According to the phylogenetic analysis, all BBX genes from A. thaliana, Populus trichocarpa, O. sativa, D. officinale and P. equestris were clustered into five clades, which consistent with ML tree of BBX genes from 54 plant genomes and the previous published articles (Fig. 3b) . The BBX gene members from D. officinale and P. equestris in clade I, II and IV had two B-BOX domains. In contrast to animal BBX with two different types of B-BOXs, the conserved amino acid sequences of the two B-BOX domains in both DoBBX and PeBBX were similar, although they are not identical. As shown in Fig. 2a, the BBX genes from D. officinale, P. equestris and O. sativa were more closely than D. officinale and P. trichocarpa. Among them, we found that the genes from clade I, II, and III, had two B-BOX domains plus a CCT domain, indicating that their might contribute to control photoperiodic regulation of flowering [2, 26]. In the clade VI and clade V subfamily, the members only had one or two B-BOX domain, but lacked CCT domain. The previous publication articles suggested that the B-BOX domain sequence in C-X2-C-X8-C-X7-C-X2-C-X4-H-X8-H in the N-terminal region, and the conservative C (Cysteine) and H (Histidine) residues were involved in BBX protein-protein . In the present study, we found that the BBX members from clades I, II and IV were also contained the conservative C (Cysteine) and H (Histidine) residues. The likely evolution of BBX genes from D. officinale and P. equestris also occurred in other plants , which was supported by structure analyses of theses BBX genes in our study.
Gene duplications, which including whole-genome duplications, tandem duplications, transposition events and segmental duplications, contributed to genome expansion . The paralogous genes from D. officinale and P. equestris cannot be distributed clearly on the chromosomes, because the chromosome assembly for D. officinale and P. equestris genomes has not yet been finished [17, 18]. Therefore, we could not authenticate the types of putative duplication events in the current study. To further understand the patterns of evolution in both D. officinale and P. equestris, the value of Ka and Ks was calculated. Particularly, we calculated the frequency distribution of the Ka and Ks for paralogous genes (Do-Do and Pe-Pe) and orthologous genes (Do-Pe), and estimated the Ks value for all homologous gene pairs. We predicted that two paralogous gene pairs (PeBBX01-PeBBX07 and DoBBX07-DoBBX18) were evolved from the genome-wide duplication events by shared D. officinale and P. equestris , because their values of Ks were 1.0658 and 1.4562, respectively. Generally, Ka/Ks ratio greater than 1 signifies positive selection with accelerated evolution, Ka/Ks ratio equal to 1 represents neutral selection, while less than 1 means stabilizing or negative selection. Remarkably, the Ka/Ks ratios of all homologous gene pairs were less than 1, except for DoBBX05-PeBBX08, implying that these gene pairs have been experiencing a markedly purifying selection during evolution. We also noticed two homologous gene pairs (PeBBX08-PeBBX14 and DoBBX10-DoBBX11) contained the comparatively high Ka/Ks values (> 0.5), suggesting that these gene pairs have undergone rapid evolutionary diversification after duplication events in the course of evolution.
The overall analysis of microarray expression profiles in different tissues will contribute to study the tissue-specific and dynamic expression of BBX genes in D. officinale. Therefore, the gene expression profiles of all 19 BBX genes were exhibited in D. officinale by using published RNA-seq data. Among them, several BBX genes (such as DoBBX02, DoBBX03 and DoBBX08) presented highly expression level in eight tissues, indicating that these genes importance in the processes of D. officinale growth and development. Previous studies have shown that BBX genes play key roles in the regulation of flowering [2, 26, 28, 29], such as A. thaliana BBX1, O. sativa BBX1 and Beta vulgaris COL1. In our study, some cis-acting elements associated to flowering were identified in DoBBX promoter regions, such as Skn-1-motif and GCN4_motif required for endosperm expression, and circadian for circadian control elements. The corresponding DoBBX genes (such as DoBBX03 and DoBBX12) were also highly expressed in floral organs, indicating that these genes might important role in the formation of reproductive organs.
In the plants, many stress-associated genes could produce stress responses, which were regulated and/or mediated and by various signaling pathways . The numbers of BBX gene family have been verified to play positive roles in abiotic stress responses and were regulated by environmental signals [12, 24]. In our study, a variety of frequently occurring cis-acting elements were identified in promoter regions of both DoBBXs and PeBBXs, such as MBS, ARE, LTR, HSF, ERE and ABRE. We also noted that these BBX genes contained at least one abiotic stress cis-elements, suggesting that their might contribute to responding the biotic and abiotic stresses. In order to deep understanding of stress responses mechanism in the D. officinale BBX genes, we performed the qRT-PCR under different treatments, such as AgNO3, MeJA (Methyl Jasmonate), ABA (abscisic acid) and SA (salicylic acid) in PLBs. Then, we observed the DoBBX genes exhibited significantly differential expression patterns under these treatments. Some DoBBX genes were strongly up-regulated by these treatments, indicating that these genes might play crucial roles in response abiotic stress in D. officinale. For example, DoBBX17 was highly expressed (over 1700-fold that of CK levels) under ABA and DoBBX19 was highly expressed under AgNO3 and MeJA treatment. Among these DoBBX genes, some members had a CCT domain, but the remaining members lacked CCT domains. In the current study, we found that all DoBBX genes responded to abiotic stress, regardless of whether they contained CCT domain. These results suggested that gene-encoded proteins having a B-BOX or CCT domain might function in response to stress. In our study, we found that DoBBX genes were sensitive to different abiotic stresses, such as AgNO3, MeJA, ABA and SA stresses. These results provided evidence that plant BBX members could participate in responding to abiotic stress responses.
In our study, a comprehensive analysis of BBX genes was conducted in D. officinale and P. equestris, which including phylogenetic, exon-intron structure, cis-acting element, microarray analysis and qRT-PCR analysis of 19 DoBBX genes under various four stress treatments: AgNO3, MeJA (Methyl Jasmonate), ABA (abscisic acid) and SA (salicylic acid). Our experimental findings highlighted the roles in growth and development stage, and response to abiotic stress.
Identification of BBX genes in D. officinale and P. equestris genome
In our study, two different strategies were used to annotate and identify the genes encoding BBXs in D. officinale and P. equestris genome. In the first strategy, we first download the known sequences of BBXs from the TAIR database. Subsequently, we used these sequences to search the potential BBXs in D. officinale and P. equestris genome database by BLASTP program with the E value cutoff set at 1e-5. In the second strategy, we first download the HMM (hidden Markov model) profile of BBX domain (Pfam00643) from Pfam database . Then this HMM profile was used to identify for all of BBXs in D. officinale and P. equestris genome by HMMER 3.0 software with the E value cutoff set at 1e-3. Finally, all putative BBX genes were further verified the presence of B-BOX domain by submitting them to InterProScan , Pfam  and SMART database , respectively. The ExPASY online tool was used to estimate the molecular weight and isoelectric point (pI) of all BBX genes .
Phylogenetic analysis and sequence characterization
A total of 54 plant genomes were included in our phylogenetic analysis, including green alga (Chlamydomonas reinhardtii), moss (Physcomitrella patens), club moss (Selaginella moellendorffii), a single genome for gymnosperms (Picea abies), the early diverging angiosperm (Amborella trichopoda), 13 monocots, Beta vulgaris (non-rosid non-asterid), and 30 rosids (Additional file 1: Table S3 and Fig. 3a). Multiple sequence alignments of all BBX proteins were carried out with the MUSCLE (https://www.ebi.ac.uk/Tools/msa/muscle/) with default parameters. Subsequently, the Neighbor Joining (NJ) tree was generate by MEGA 5.2 software with bootstrap analysis (1000 replicates) . The Maximum likelihood (ML) tree was generated by FastTree software with JTT model [19, 37]. The GFF3 files of individual BBX genes were obtained from the previously published articles, and then their gene structures were generated with GSDS website (http://gsds.cbi.pku.edu.cn/) . The motif logos of the BBX and CCT domains were generated using online MEME program (http://meme.nbcr.net/meme/cgi-bin/meme.cgi) .
Identification of orthologs and paralogs
Orthologs and paralogs were identified by using OrthoMCL software with the E value cutoff set at 1e-5 . Based on the previous papers [41, 42], we used the DnaSP5.0 software to calculate the Ks (synonymous substitution rate), Ka (non-synonymous substitution rate) and Ka/Ks of homologous gene pairs.
Cis-acting elements analysis of BBX genes in D. officinale and P. equestris
To determine the cis-acting elements, we first obtained the promoter sequences (i.e. the 1500 bp of genomic DNA sequence upstream of the initiation code (ATG)) by TBtools software. Then these promoter sequences were submitted to the PlantCARE website (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) to identify the presence of different cis-acting elements .
RNA-seq expression analysis
To gain insight into the DoBBX gene expression patterns in different tissues of D. officinale, the raw RNA-seq reads from eight different tissues (root, root_tips, stem, leaf, speal, column, lip and flower_buds) were downloaded from the SRA database of NCBI (PRJNA348403). By using the HISAT2 software, the paired clean reads were mapped to the D. officinale reference genome with defaults parameters [44, 45]. Then the StringTie software was used to estimate the differently expressed genes . The R script was used to exhibit the heatmap of DoBBX genes in eight different tissues (root, root_tips, stem, leaf, speal, column, lip and flower_buds).
Plant material and stress treatments
The tissue-cultured seedlings of D. officinale were sterilized and planted on Murashige and Skoog (MS) (Murashige and Skoog) medium in the tissue culture room of Anhui Agricultural University under the condition of 25 °C with a constant photoperiod (16 h light/8 h dark) for about one month. Subsequently, it was transferred on MS medium supplemented with 30 g L− 1 sucrose (Aladdin), 0.1 mg L− 1 NAA (Aladdin) and 1.0 mg L− 1 6-BA (biosharp). Protocorm-like bodies (PLBs) were induced from sterilized seeds and maintained on 1/2 Murashige and Skoog (MS) liquid medium supplemented with 0.1 mg/L α-naphthalene acetic acid (NAA), 0.1 g/L lactalbumin hydrolysate and 30 g/L sucrose (pH of 5.8). By liquid suspension culture with temperature 25 °C under darkness for two months, PLBs were cut into 0.5× 0.5 cm pecies, and 7 g of them were inoculated in a triangular flask containing 40 mL MS medium. In these MS medium, 100 μM MeJA (Methyl Jasmonate: Aladdin), 100 μM SA (Salicylic acid: Aladdin), 100 μM ABA (Abscisic acid: Aladdin), and 100 μM AgNO3 (Aladdin) were added after 0.22 μm microporous filtration, based on the previously published articles . The PLBs were sampled at 24 h, 48 h, 72 h and 96 h after treatment. For each induction treatment, each sample (PLBs) was collected and immediately stored at − 80 °C for RNA isolation. Additionally, untreated PLBs (24 h) was used as the control group.
Quantitative real-time PCR analysis
Total RNA from PLBs was extracted with Plant Total RNA Isolation Kit (Sangon Biotech, China) using 300 mg tissue homogenized in liquid nitrogen according to the manufacturer’s protocol, which was reverse transcribed into the first DNA strand subsequently using a One Step RT-qPCR Kit (BBI Life Science, China). The qRT-PCR was executed using 2X TaqMan Fast qPCR Master Mix (BBI Life Science, China) with CFX96 Touch™ Real-Time PCR detection system (Bio-Rad, USA) based on the manufacturer’s introduction. In the present study, each reaction contained 0.75 μl SYBR Abstart One Step RT-PCR Mix, 10 μl 2.5X SYBR One Step RT-PCR buffer, 2 μl cDNA samples, and 1 μl of each primer (10 μM) in a reaction system of 25 μl. The thermal cycle was as follows: 98 °C for 2 min, 40 cycles of 98 °C for 10 s, 60 °C for 10 s, and 68 °C for 30 s. The tubliun gene was used as an internal control , and the gene-specific primers (Additional file 1: Table S2) of each DoBBX genes were designed using Beacon Designer 7 software. Three biological replicates were carried out for each experiment.
Availability of data and materials
Expression data of D. officinale used in this study were available in NCBI SRA database with accession numbers of PRJNA348403.
Takatsuji H. Zinc-finger transcription factors in plants. Cell Mol Life Sci. 1998;54(6):582–96.
Putterill J, Robson F, Lee K, Simon R, Coupland G. The CONSTANS gene of Arabidopsis promotes flowering and encodes a protein showing similarities to zinc finger transcription factors. Cell. 1995;80(6):847.
Klug A, Schwabe JW. Protein motifs 5. Zinc fingers. Faseb J. 1995;9(8):597–604.
Khanna R, Wu SH. The Arabidopsis B-box zinc finger family. Plant Cell. 2009;21(11):3416.
Torok M, Etkin LD. Two B or not two B? Overview of the rapidly expanding B-box family of proteins; 2001.
Tao H, Simmons BN, Singireddy S, Jakkidi M, Short KM, Cox TC, Massiah MA. Structure of the MID1 tandem B-boxes reveals an interaction reminiscent of intermolecular ring heterodimers. Biochemistry. 2008;47(8):2450–7.
Datta S, Johansson H, Hettiarachchi C, Holm M. STH2 has 2 B there: an insight into the role of B-box containing proteins in Arabidopsis. Plant Signal Behav. 2008;3(8):547–8.
Massiah MA, Matts JA, Short KM, Simmons BN, Singireddy S, Yi Z, Cox TC. Solution structure of the MID1 B-box2 CHC (D/C) C2H2 zinc-binding domain: insights into an evolutionarily conserved RING fold. J Mol Biol. 2007;369(1):1–10.
Datta S, Johansson H, Hettiarachchi C, Irigoyen ML, Desai M, Rubio V, Holm M. LZF1/SALT TOLERANCE HOMOLOG3, an Arabidopsis B-box protein involved in light-dependent development and gene expression, undergoes COP1-mediated ubiquitination. Plant Cell. 2008;20(9):2324.
Chang C-SJ, Maloof JN, Wu S-H. COP1-mediated degradation of BBX22/LZF1 optimizes seedling development in Arabidopsis. Plant Physiol. 2011;156(1):228–39.
Crocco CD, Locascio A, Escudero CM, Alabadí D, Blázquez MA, Botto JF. The transcriptional regulator BBX24 impairs DELLA activity to promote shade avoidance in Arabidopsis thaliana. Nat Commun. 2015;6:6202.
Huang J, Zhao X, Weng X, Wang L, Xie W. The rice B-box zinc finger gene family: genomic identification, characterization, expression profiling and diurnal analysis. PLoS One. 2012;7(10):e48242.
Liu Y, Xing L, Li J, Dai S. Rice B-box zinc finger protein OsBBX25 is involved in the abiotic response. Chin Bull Bot. 2012;47(4):366–78.
Zhang G-Q, Liu K-W, Li Z, Lohaus R, Hsiao Y-Y, Niu S-C, Wang J-Y, Lin Y-C, Xu Q, Chen L-J. The Apostasia genome and the evolution of orchids. Nature. 2017;549(7672):379.
Cao Y, Han Y, Meng D, Li D, Jiao C, Jin Q, Lin Y, Cai Y. B-BOX genes: genome-wide identification, evolution and their contribution to pollen growth in pear (Pyrus bretschneideri Rehd.). BMC Plant Biol. 2017;17(1):156.
Zhang G-Q, Xu Q, Bian C, Tsai W-C, Yeh C-M, Liu K-W, Yoshida K, Zhang L-S, Chang S-B, Chen F. The Dendrobium catenatum Lindl. Genome sequence provides insights into polysaccharide synthase, floral development and adaptive evolution. Sci Rep. 2016;6:19029.
Yan L, Wang X, Liu H, Tian Y, Lian J, Yang R, Hao S, Wang X, Yang S, Li Q. The genome of Dendrobium officinale illuminates the biology of the important traditional Chinese orchid herb. Mol Plant. 2015;8(6):922–34.
Cai J, Liu X, Vanneste K, Proost S, Tsai W-C, Liu K-W, Chen L-J, He Y, Xu Q, Bian C. The genome sequence of the orchid Phalaenopsis equestris. Nat Genet. 2015;47(1):65.
Price MN, Dehal PS, Arkin AP. FastTree: computing large minimum evolution trees with profiles instead of a distance matrix. Mol Biol Evol. 2009;26(7):1641–50.
Crocco CD, Botto JF. BBX proteins in green plants: insights into their evolution, structure, feature and functional diversification. Gene. 2013;531(1):44–52.
Cao YP, Han Y, Jin Q, Lin Y, Cai Y. Comparative genomic analysis of the GRF genes in Chinese pear (Pyrus bretschneideri Rehd), poplar (populous), grape (Vitis vinifera), Arabidopsis and rice (Oryza sativa). Front Plant Sci. 2016;7:1750.
Williamson RJ, Josephs EB, Platts AE, Hazzouri KM, Haudry A, Blanchette M, Wright SI. Evidence for widespread positive and negative selection in coding and conserved noncoding regions of Capsella grandiflora. PLoS Genet. 2014;10(9):e1004622.
Grallath S, Weimar T, Meyer A, Gumy C, Suter-Grotemeyer M, Neuhaus J-M, Rentsch D. The AtProT family. Compatible solute transporters with similar substrate specificity but differential expression patterns. Plant Physiol. 2005;137(1):117–26.
Liu X, Li R, Dai Y, Chen X, Wang X. Genome-wide identification and expression analysis of the B-box gene family in the Apple (Malus domestica Borkh.) genome[J]. Mol Gen Genomics. 2018;293(2): 303–15.
Chu Z, Wang X, Li Y, Yu H, Li J, Lu Y, Li H, Ouyang B. Genomic organization, phylogenetic and expression analysis of the B-BOX gene family in tomato. Front Plant Sci. 2016;7:1552.
Cockram J, Thiel T, Steuernagel B, Stein N, Taudien S, Bailey PC, O'Sullivan DM. Genome dynamics explain the evolution of flowering time CCT domain gene families in the Poaceae. PLoS One. 2012;7(9):e45307.
Moore RC, Purugganan MD. The early stages of duplicate gene evolution. Proc Natl Acad Sci U S A. 2003;100(26):15682–7.
Yano M, Katayose Y, Ashikari M, Yamanouchi U, Monna L, Fuse T, Baba T, Yamamoto K, Umehara Y, Nagamura Y. Hd1, a major photoperiod sensitivity quantitative trait locus in rice, is closely related to the Arabidopsis flowering time gene CONSTANS. Plant Cell. 2000;12(12):2473–83.
Suárez-López P, Wheatley K, Robson F, Onouchi H, Valverde F, Coupland G. CONSTANS mediates between the circadian clock and the control of flowering in Arabidopsis. Nature. 2001;410(6832):1116.
Walther D, Brunnemann R, Selbig J. The regulatory code for transcriptional response diversity and its relation to genome structural properties in a. thaliana. PLoS Genet. 2007;3(2):e11.
Mistry J, Finn RD, Eddy SR, Bateman A, Punta M. Challenges in homology search: HMMER3 and convergent evolution of coiled-coil regions. Nucleic Acids Res. 2013;41(12):e121.
Zdobnov EM, Apweiler R. InterProScan--an integration platform for the signature-recognition methods in InterPro. Bioinformatics. 2001;17(9):847–8.
Finn RD, Bateman A, Clements J, Coggill P, Eberhardt RY, Eddy SR, Heger A, Hetherington K, Holm L, Mistry J. Pfam: the protein families database. Nucleic Acids Res. 2013;42(D1):D222–30.
Letunic I, Doerks T, Bork P. SMART 7: recent updates to the protein domain annotation resource. Nucleic Acids Res. 2012;40(D1):D302–5.
Gasteiger E, Hoogland C, Gattiker A, Duvaud SE, Wilkins MR, Appel RD, Bairoch A. Protein identification and analysis tools on the ExPASy server. Proteomics Protoc Handbook. 2005;112(112):571–607.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: molecular evolutionary genetics analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
Cao Y, Han Y, Meng D, Abdullah M, Yu J, Li D, Jin Q, Lin Y, Cai Y. Expansion and evolutionary patterns of GDSL-type esterases/lipases in Rosaceae genomes. Funct Integr Genomics. 2018;18(6):673–84.
Hu B, Jin J, Guo YA, Zhang H, Luo J, Gao G. GSDS 2.0: an upgraded gene feature visualization server. Bioinformatics. 2014;31(8):1296.
Bailey TL, Johnson J, Grant CE, Noble WS. The MEME suite. Nucleic Acids Res. 2015;43(W1):W39–49.
Li L, Stoeckert CJ, Roos DS. OrthoMCL: identification of ortholog groups for eukaryotic genomes. Genome Res. 2003;13(9):2178–89.
Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25(11):1451–2.
Cao Y, Han Y, Li D, Lin Y, Cai Y. MYB transcription factors in chinese pear (Pyrus bretschneideri Rehd.): genome-wide identification, classification, and expression profiling during fruit development. Front Plant Sci. 2016;7:577.
Lescot M, Déhais P, Thijs G, Marchal K, Moreau Y, Van de Peer Y, Rouzé P, Rombauts S. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002;30(1):325–7.
Pertea M, Kim D, Pertea GM, Leek JT, Salzberg SL. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat Protoc. 2016;11(9):1650.
Cao Y, Han Y, Li D, Lin Y, Cai Y. Systematic analysis of the 4-Coumarate:coenzyme a ligase (4CL) related genes and expression profiling during fruit development in the Chinese pear. Genes. 2016;7(10):89.
Wang Y, Hammes F, Duggelin M, Egli T. Influence of size, shape, and flexibility on bacterial passage through micropore membrane filters. Environ Sci Technol. 2008;42(17):6749–54.
Fan H, Wu Q, Wang X, Wu L, Cai Y, Lin Y. Molecular cloning and expression of 1-deoxy-d-xylulose-5-phosphate synthase and 1-deoxy-d-xylulose-5-phosphate reductoisomerase in Dendrobium officinale. Plant Cell Tissue Organ Cult. 2016;125(2):381–5.
We would like to thank Honghong Fan, Yi Lin and Muhammad Abdullah for his careful reading and helpful comments on this manuscript. We extend our thanks to the reviewers and editors for their careful reading and helpful comments on this manuscript.
This study was supported by The National Natural Science Foundation of China (grant 31640068). The Funding bodies were not involved in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Ethics approval and consent to participate
The experiments did not involve endangered or protected species. No specific permits were required for these locations/activities because the D. officinale used in this study were obtained from the tissue culture room of Anhui Agricultural University.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Figure S1. Sliding window analysis of Ka/Ks for the orthologous (Pe-Do) and paralogous (Pe-Pe, Do-Do) gene pairs. The window size was 150 bp, and the step size was 9 bp. Figure S2. Gene structures analysis of the BBXs in both D. officinale and P. equestris. The exons and introns are indicated by green rectangles and thin lines, respectively. Figure S3. Comparison of the exon-intron structure between homologous genes (Pe-Pe, Do-Do and Pe-Do). The exons and introns are indicated by green rectangles and thin lines, respectively. Table S1. The 19 DoBBX gene primer sequences. Table S2. The Ka, Ks and Ka/Ks values of orthologous (Pe-Do) and paralogous (Pe-Pe, Do-Do) gene pairs. Table S3. Plant genomes used in this analysis. (DOCX 391 kb)