- Research article
- Open Access
Genome-wide comparative analysis of the BAHD superfamily in seven Rosaceae species and expression analysis in pear (Pyrus bretschneideri)
BMC Plant Biology volume 20, Article number: 14 (2020)
The BAHD acyltransferase superfamily exhibits various biological roles in plants, including regulating fruit quality, catalytic synthesizing of terpene, phenolics and esters, and improving stress resistance. However, the copy numbers, expression characteristics and associations with fruit aroma formation of the BAHD genes remain unclear.
In total, 717 BAHD genes were obtained from the genomes of seven Rosaceae, (Pyrus bretschneideri, Malus domestica, Prunus avium, Prunus persica, Fragaria vesca, Pyrus communis and Rubus occidentalis). Based on the detailed phylogenetic analysis and classifications in model plants, we divided the BAHD family genes into seven groups, I-a, I-b, II-a, II-b, III-a, IV and V. An inter-species synteny analysis revealed the ancient origin of BAHD superfamily with 78 syntenic gene pairs were detected among the seven Rosaceae species. Different types of gene duplication events jointly drive the expansion of BAHD superfamily, and purifying selection dominates the evolution of BAHD genes supported by the small Ka/Ks ratios. Based on the correlation analysis between the ester content and expression levels of BAHD genes at different developmental stages, four candidate genes were selected for verification as assessed by qRT-PCR. The result implied that Pbr020016.1, Pbr019034.1, Pbr014028.1 and Pbr029551.1 are important candidate genes involved in aroma formation during pear fruit development.
We have thoroughly identified the BAHD superfamily genes and performed a comprehensive comparative analysis of their phylogenetic relationships, expansion patterns, and expression characteristics in seven Rosaceae species, and we also obtained four candidate genes involved in aroma synthesis in pear fruit. These results provide a theoretical basis for future studies of the specific biological functions of BAHD superfamily members and the improvement of pear fruit quality.
Pear is an important temperate Rosaceae fruit tree worldwide. According to historical records, pears originated between 55 and 65 million years ago, and their cultivation history can be traced back for more than 30,000 years . In 2012, the genome sequence of pear (Pyrus bretschneideri cv. ‘Dangshansuli’) was released, providing a resource for research on genomics and molecular biology in pear. At present, genome sequences of six other Rosaceae fruit species are also available, allowing comparative genomics studies among economically important Rosaceae species. Fruit quality including sugar content, fruit size, fruit aroma, is tightly coupled with consumer choice and commercial value. Fruit aroma is an inherent quality/characteristic, which distinguishes the different fruit species even different cultivars within a species. In apple, the main aromatic components were 1-hexanol . In red-fleshed peach fruit, fruity note latone γ-hexalactone is the major aroma components .In banana, 2-pentyl acetate, 3-methylbutyl acetate were the most important contributors to the aroma . In European pear cultivars, the volatile organic compounds mainly belong to primary esters, alcohols and alkanes . Moreover, it has been pointed out that esters and aldehydes were key volatile compounds shared by 33 cultivars of the Chinese pear Pyrus ussuriensis . It can be conclude that the fruit aroma is a complex mixture of a large number of volatile compounds, but the esters seem to be the most extensive.
Four pathways were reported involved in volatile aroma compounds biosynthesis, including fatty acids pathway, amino acid pathway, terpenoids pathway and carotenoid pathway. Among them, fatty acids are major precursors of aroma volatiles in most fruit species , fatty acid-derived straight-chain alcohols, aldehydes and esters are important aroma compounds responsible for fresh fruit flavors and are consisted by three processes: α-oxidation, β-oxidation and the lipoxygenase pathway . β-oxidation of fatty acids is the primary biosynthetic process providing alcohols and acyl coenzyme A (CoAs) for ester formation . Previous study showed that fruit aromas volatiles are were formatted through β-oxidation in pear and apple . Multiple enzymes have been reported involved in β-oxidation. For example, acyl CoAs can be reduced by acyl CoA reductase to aldehyde, which was continually reduced by alcohol dehydrogenase to alcohol, finally, the alcohols were used to produce esters by alcohol acyltransferase (AAT) . Moreover, in amino acid pathway, these amino acids can also be the precursors of acyl-CoAs, involved in alcohol esterification reactions catalyzed by AATs . Once the basic skeletons were produced through these pathways, the diversity of these volatile compounds can be achieved via acylation, decarboxylation, glycosylation, oxidation/reduction, hydroxylation and methylation, which expand the basic skeletons and modify enzymes .
Acylation is an important process of modification of secondary metabolites in plant growth and development. BAHD acyltransferase family mainly uses coenzyme A thioester as acyl donor and uses alcohols or amines as receptors to catalyze acylation to form all kinds of acylation products, including lignin monomers, anthocyanins, terpenoids, esters and so on [13,14,15]. The BAHD (benzylalcohol O-acetyl transferase, anthocyanin O-hydroxycinnamoyl transferase, N-hydroxycinnamoyl anthranilate benzoyl transferase and deacetylvindoline 4-O-acetyltransferase ) superfamily is composed of enzymes having two common domains (HXXXD and DFGWG) and similar amino acid sequences . The HXXXD motif region is located in the reaction channel center and participates in catalysis. As an indispensable motif for the reaction, the DFGWG motif is located far from the active site . Benzylalcohol O-acetyl transferase identified from the Californian wildflower Clarkia breweri can produce the floral volatile benzylacetate [13, 19], deacetylvindoline 4-O-acetyltransferase identified from Catharanthus roseus is related to the synthesis of the alkaloid vindoline [13, 20], N-hydroxycinnamoyl anthranilate benzoyl transferase identified from Dianthus caryophyllus is responsible for producing a class of phytoalexins known as anthramides , and anthocyanin O-hydroxycinnamoyl transferases identified from Gentiana triflora can catalyze anthocyanin synthesis [22, 23].
In recent years, members of the BAHD acyltransferase family related to ester metabolism have been discovered, such as the AAT genes MpAAT1 in apple , FaAAT2 and SAAT in strawberries [25, 26], they play a role in the final step of ester biosynthesis, catalyzing the production of ester compounds with coenzyme A thioester as the donor and alcohol as the receptor. The potential roles of BAHD acyltransferase family members in fruit ester synthesis need to be further investigated in pear and other fruit species. Hence, systematically identifying BAHD gene family in pear and screening candidate genes that regulate the synthesis of esters are of great significance for artificially regulating the content of pear esters and improving the quality of pears. In this study, we aimed to identify the repertoires of BAHD superfamily members in the genomes of pear and six other Rosaceae fruit species. In order to unravel the evolution and expansion mechanisms of BAHD superfamily, and screen candidate BAHD acyltransferases related to fruit aroma biosynthesis especially ester synthesis, we performed comprehensive analysis on evolutionary and expression patterns using genome and transcriptome resources. Gene structure, conserved motifs, phylogeny, gene duplication, selection pressure and spatiotemporal expression profiles of BAHD genes were analyzed in this study. Furthermore, we verified gene expression patterns by qRT-PCR, and several candidate BAHD genes that are closely associated with the pear volatile ester content were determined. These results provide insights into the evolution, expansion and functional roles of the BAHD superfamily and will aid in further studies of their molecular functions in these fruit species.
Identification of BAHD genes in seven Rosaceae species
The BAHD superfamily’s characteristic domain (Pfam: PF02458) and the BAHD Hidden Markov Model (HMM) configuration file (PF02458) were used to identify the BAHD members. The online site SMART (http://smart.embl-heidelberg.de/) was used to analyze protein sequences of candidate genes and to determine the presence of the BAHD domain. As a result, 773 putative BAHD family candidate genes were obtained in the seven species with E-values <1e− 10. Furthermore, a multiple sequence alignment was conducted to verify the presence of two characteristic conserved domains (HXXXD and DFGWG) in the BAHD family genes . Because they lacked the two domains, three, six, six, six, seven, eight and 20 genes were removed from Prunus persica (peach), P. bretschneideri (Chinese white pear), Pyrus communis (European pear), Rubus occidentalis (black raspberry), Fragaria vesca (strawberry), Malus domestica (apple) and Prunus avium (sweet cherry), respectively. Finally, 56 sequences were removed, 717 BAHD genes were identified and analyzed (Table 1). Detailed information on the features of BAHD genes is available in Additional file 1: Table S1.
Phylogenetic and conserved motif analyses of BAHD genes in Chinese white pear
The amino acid sequences encoded by BAHD genes in Chinese white pear, Arabidopsis and Populus were used to construct a phylogenetic tree with maximum-likelihood (ML) method. The Chinese white pear BAHD family protein genes were classified into five clades (I, II, III-a, IV and V; Fig. 1). Clade I consisted of two subclades (clades I-a and I-b). The Arabidopsis genes belonging to clade I-a are involved in modifying aromatic and aliphatic alcohols in Arabidopsis and Populus [27, 28]; therefore, we speculated that the BAHD genes clustered into clade I-a in Chinese white pear might encode proteins with similar functions. The members of clade I-b had functions related to the biosynthesis of lignin monomeric intermediates [29, 30], including tobacco and Arabidopsis shikimate hydroxycinnamoyltransferases . Clade II consisted of two subclades (clades II-a and II-b). The functions of the Arabidopsis genes belonging to clade II-a are unknown. Clade II-b contained two Arabidopsis genes, AT3G29590.1 (At5MAT) and AT1G03940.1 (At3AT1), which are associated with anthocyanin biosynthesis [15, 32]. Clade III-a contained few members, including four genes from Chinese white pear, three genes from Arabidopsis, and one gene from poplar. Clade V contained nine members, including one well-studied Arabidopsis gene AT4G24510.1 (CER2) involved in regulating the cuticular wax biosynthesis . Clade IV contained many members involved in catalyzing the acetylation of aromatic alcohols and acetylating small- or medium-chain alcohols . Additionally, only three and four members clustered in clades III-a and V. However, ~ 28.1% (32 of 114) BAHD genes were in clade I and 36.0% (41 of 114) were in clade IV, respectively. BAHD genes closely related to pear volatile ester contents, such as AAT, belonged to clades I and IV.
The ML phylogenetic tree of BAHDs for European pear with Arabidopsis and Populus as outgroups was also constructed (Fig. 2). Paralleling the phylogenetic tree built for Chinese white pear, the European pear’s BAHD genes were divided into five clades. Clades V and III-a only contained four members, respectively. However, ~ 36.1% (35 of 97) genes clustered in clade IV, and 26.8% (26 of 97) clustered in clade I. Additionally, five function-known acyltransferases of Arabidopsis [AT3G0480.1 (CHAT), AT2G23510.1 (SDT), AT5G48930.1 (AtHCT), AT3G29590.1 (At5MAT) and AT1G03490.1 (At3AT1)] were classified into two subgroups (I-a and II-b). These results provided putative candidates for the study of gene functions.
Furthermore, in order to verify the accuracy of the classification of the phylogenetic analysis of the ML method, we also constructed the tree using the neighbor-joining (NJ) method (Additional file 2: Figure S1; Additional file 3: Figure S2). The results show that the NJ phylogenetic tree can also be divided into five clades. In European pear, the type and numbers of the subclades was consistent with the ML phylogenetic tree. In the NJ phylogenetic tree of Chinese white pear, Pbr034977.1 was clustered in clade II-b, Pbr029351.1 was clustered in clade V, however, in ML phylogenetic tree of Chinese white pear, these two members were classified in class IV with a bootstrap value of 100, so we speculate that these two members might belong to clade IV. Similarly, three members (AT4G31910.1, Pbr020016.1, Pbr019034.1) in subclade I-b in NJ tree might belong to subclade I-a, since the bootstrap value of the ML tree is higher than that of NJ tree. In addition to the above genes, other genes were divided into the same subclades in both the NJ method and the ML method of Chinese white pear. Moreover, we also found that the bootstrap value of all branches of ML phylogenetic tree was higher than 50, which further indicated the reliability of the classification in ML tree.
We detected 20 conserved motifs in the BAHD proteins of Chinese white pear using the online software MEME (Fig. 3). All the BAHD family members contained motif1 or motif3, and ~ 65.8% (75 of 114) of members contained them both. Based on a gene structural analysis, we determined that motif1 and motif3, with sequences CGGFAIGLSMSHKVADGSSLSTFINSWAE and FYEADFGWGKP, respectively, correspond to domains HXXXD and DFGWG, respectively. Members of subclades I-a, I-b, II-a, II-b, III-a and V do not contain motif17, except for Pbr005916.1, and members of the subclades I-a, I-b, II-a, II-b and III-a, as well as clade V, do not contain motif19, except for Pbr010925.1. Except for Pbr005746.1, Pbr014025.1, Pbr035166.1 and Pbr036245.1, members of clade IV do not contain motif10. Motif14 and motif16 were only detected in the Clade II-b. The type and distribution of the conservative motifs of the same subclades were similar, further supporting the evolutionary tree’s classification (Fig. 3). Information on conservative motifs is shown in Additional file 1: Table S2.
Gene duplication events identified in the pear BAHD superfamily and a BAHD collinearity analysis of seven Rosaceae species
Different patterns of gene replication have jointly promoted the evolution of the BAHD family, including whole-genome duplication (WGD) or segmental duplication, tandem duplication (TD), proximal duplication (PD), transposed duplication (TRD) and dispersed duplication (DSD) [34, 35]. We used DupGen_finder software  to detect duplicated BAHD family gene pairs in seven Rosaceae genomes. All the BAHD gene family members were assigned to WGD, PD, TD, TRD or DSD. The number of WGD duplications in Chinese white pear and apple were 29 and 59, respectively, but there were only three in strawberry and peach, four in black raspberry and sweet cherry, and 23 in European pear. The number of DSDs in Chinese white pear, European pear and sweet cherry were 113, 95 and 142, respectively. Additionally, there were 91 in strawberry and 81 in apple, which are more than in peach (76) and black raspberry (70). Genomic rearrangements and gene loss may lead to the large proportion of DSDs in these species. Moreover, the RNA- and DNA-based TRD event can also produce this result . WGDs and DSDs impacted the evolution of the BAHD superfamily in Chinese white pear, apple and European pear (Fig. 4). In peach and strawberry TDs and DSDs were the main forces, while PDs and DSDs played major roles in the evolution of black raspberry and sweet cherry. In pear, ~ 57.1% (113 of 198) BAHD genes were involved in DSD events, while there were 66.9% (91 of 136) in strawberry, 44.0% (81 of 184) in apple, 60.3% (76 of 126) in peach, 67.3% (70 of 104) in black raspberry, 66.4% (142 of 214) in sweet cherry and 55.2% (95 of 172) in European pear (Additional file 1: Table S3). The results indicated that DSDs were ubiquitous in all the investigated species.
In addition, we identified intra-genomic synteny blocks for each species . As shown in Fig. 5a, the BAHD genes of Chinese white pear are randomly distributed on 17 chromosomes and there is only one gene on chromosome 13. Similarly, the BAHD genes were detected as randomly distributed in the other species. We found 78 syntenic gene pairs among the seven Rosaceae species. Of these, 17, 21, and 29 syntenic pairs were identified in European pear (Fig. 5g), Chinese white pear (Fig. 5a) and apple (Fig. 5b), compared with only three in strawberry (Fig. 5f), peach (Fig. 5d) and black raspberry (Fig. 5e) and two in sweet cherry (Fig. 5c) (Additional file 1: Table S4).
Nonsynonymous (Ka) and synonymous (Ks) substitutions per site, and Ka/Ks analysis for BAHD family genes
The stage of evolution for the WGD is usually estimated using Ks [37,38,39]. In addition to the original WGD [Ks = 1.5–1.8, ~ 140 million years ago (Mya)] (denoted as a γ-paleohexaploidization event) that was shared by core eudicots , a more recent WGD was detected in pear and dated to 30–45 Mya (Ks = 0.15 to 0.3) . As shown in Additional file 1: Table S5, Ks values of WGD-derived gene pairs in Chinese white pear ranged from 0.006 to 3.909, and the ranges of Ks values for gene pairs derived from TD, PD, TRD and DSD were 0.001–4.247,0.07–3.670,0.029–4.381 and 0.005–5.066, respectively. Similar results were found in apple. In Chinese white pear, there are nine WGD-derived genes pairs with Ks values that ranged from 0.15 to 0.30, demonstrating that they may be derived from the current WGD (30–45 Mya) . Some other duplicated gene pairs possessed higher Ks values (1.992–3.909), implying that they probably originated from more ancient duplication events. The Ks values of the WGD-derived gene pairs in black raspberry, European pear, peach and sweet cherry were 1.356–2.965, 0.145–4.288, 1.416–4.357 and 1.469–4.210, respectively. The higher Ks values of WGD-derived gene pairs in peach, black raspberry and sweet cherry suggested that they were duplicated and retained from more ancient WGD events, supporting the absence of more recent WGD events in these species.
Deleterious mutations can be removed by negative selection (purifying selection). Conversely, new favorable mutations can be accumulated by positive selection (Darwinian selection) and spread through the population . To detect the selection pressure acting on BAHD genes, we analyzed the Ka and Ka/Ks values in the seven Rosaceae species (Additional file 1: Table S5). The direction and magnitude of the selection pressure were inferred based on Ka/Ks ratio (Ka/Ks > 1: positive selection; Ka/Ks = 1: neutral evolution; and Ka/Ks < 1: purifying selection) . The Ka/Ks values of all the BAHD gene pairs in strawberry (Fig. 6d), peach (Fig. 6c) and European pear (Fig. 6b) were less than one, indicating that these genes evolved through purifying selection (Fig. 6). Similar results were found in the other four Rosaceae species [the Chinese pear (Fig. 6f), sweet cherry (Fig. 6a), black raspberry (Fig. 6g) and apple (Fig. 6e)], except for a few gene pairs with Ka/Ks values greater than one. The box plots also indicated that the data distributions were concentrated, especially in Chinese white pear, sweet cherry and apple.
Expression pattern of BAHD genes in Chinese white pear
Based on transcriptome data (Additional file 1: Table S6) from different pear tissues, we determined that most genes in Chinese white pear showed higher expression in roots (Fig. 7), and we discovered that 37 of the BAHD genes were expressed in all four stages of fruit development. Pbr014238.1 was only expressed in the four stages of pollen-tube development, while Pbr020016.1, Pbr027303.1, Pbr029551.1, Pbr014028.1 and Pbr006821.1 were highly expressed in the late stage of fruit development (Fruit_S4). Most members of the BAHD superfamily showed no expression during the four stages of pollen-tube development.
Gene expression analyses with qRT-PCR
Based on the transcriptome expression profiles and the ester content analysis, we selected four potential Chinese white pear genes (Pbr020016.1, Pbr019034.1, Pbr014028.1, and Pbr029551.1) that showed strong correlations with total ester content changes during fruit development (Fig. 8a, Additional file 1: Table S7, Additional file 1: Table S8). We used qRT-PCR to examine these candidate genes. The expression patterns of several individual genes were highly correlated with the ester content changes during pear fruit development (Fig. 8). Our results indicated that the expression level of Pbr014028.1 (Fig. 8c) decreased from S1 (45DAF) to S2 (75DAF), increased sharply from S3 (105DAF) to S4 (145DAF), and then reached a peak value. Surprisingly, three indices of Pbr014028.1 (Fig. 8c), the relative expression level, the RNA-seq data and the changes in total ester content at all stages exhibited correlated trends. In addition, Pbr019034.1 (Fig. 8b), Pbr029551.1 (Fig. 8d) and Pbr020016.1 (Fig. 8) showed similar expression patterns. They each had a sharp increase from S3 (105DAF) to S4 (145DAF) and reached their peak value in the last period. The results showed that the expression levels of these genes, the RNA-seq data and the changes in total ester contents at all the stages exhibited correlated trends. Therefore, the four genes (Pbr020016.1, Pbr019034.1, Pbr029551.1 and Pbr014028.1) appear to be important candidate genes for ester synthesis.
To further investigate this result, four BAHD genes (Pbr020016.1, Pbr019034.1, Pbr014028.1 and Pbr029551.1) from Chinese white pear and 11 biochemically characterized AAT genes from other species were used to construct a maximum-likelihood tree. As seen in Fig. 9, we found that these pear BAHD genes shared a high homology with the reported AAT genes. Thus, we speculated that these genes might have a strong correlation with ester synthesis.
The BAHD superfamily has members in various ester synthetic pathways. Here, by detecting their HXXXD and DFGWG motifs using a HMMER search, we identified 717 BAHD genes from seven Rosaceae species. Similar to the results of the classification of BAHD genes in Arabidopsis , BAHD family protein-encoding genes of pear were classified into five clades (I, II, III-a, IV and V). In addition, a conserved motif analysis of these BAHD amino acid sequences showed that the type and distribution of the conservative motifs among the same subfamily were similar, indicating that the classification results are authentic. All the BAHD family members contained motif1 (HXXXD) or motif3 (DFGWG), and ~ 65.8% (75 of 114) of members contained them both. Thus, we hypothesized that the motifs (1 and 3) may be related to catalytic activity. The HXXXD catalytic motif is located in the solvent channel, and it can also deprotonize oxygen or nitrogen atoms on specific receptor substrates [13, 17]. The DFGWG motif is essential to the reaction but located far from the active site [13, 17, 42, 43]. Combining the phylogenetic tree classification with the results of previous studies, we speculated that the BAHD genes that are closely related to the pear volatile ester content, such as AAT, belonged to clades I and IV, and the AAT are involved in the synthesis of esters, which are involved in determining the aromas of flowers and fruits . At present, the functions of many AAT genes in fruit trees have been identified, such as PpAAT1 in peach , FaAAT2 and SAAT in cultivated strawberry [25, 26], MpAAT1 in apple  and CmAAT1,3,4 in melon [46, 47].
Gene duplication patterns can be generally divided into five types, WGD, PD, TD, DSD and TRD [34, 35]. Each pattern of gene duplication contributes differentially to gene family expansion . It is estimated that 90% of the gene increase in the Arabidopsis thaliana pedigree is due to the result of WGDs . WGD, TD and DSD are the main features of eukaryotic genome evolution and mainly drive the development of new functions in the genome and genetic evolutionary systems [50, 51]. Gene families, such as SWEET and Hydroxycinnamoyl transferases, expanded primarily through WGD and DSD [52, 53]. TD was the major force in the expansion of the AP2/ERF and WRKY gene families [54, 55]. Two gene families, Aluminum-activated malate transporters and heat-shock transcription factors were amplified by WGD and DSD [34, 39]. The numbers of BAHD genes identified in seven plants from the Rosaceae family are different because of recent WGD events and ancient polyploid events. In this study, DSD and WGD were determined to be the main forces that expanded the BAHD genes in apple, European pear and Chinese white pear. The numbers of BAHD genes that had undergone WGD and DSD were the largest, while the numbers of genes that underwent other replication modes were relatively small. In addition, because peach, sweet cherry, strawberry and sweet cherry have not undergone recent WGD events, the number of DSDs account for a large proportion of the gene duplications. Accordingly, WGD patterns account for large proportions in apples and pears, perhaps because they have experienced two rounds of WGD . In addition, based on the Ka, Ks and Ka/Ks analysis, we discovered that in peach, black raspberry and sweet cherry the Ks values of WGD-derived gene pairs were greater than 1.3, suggesting that they were duplicated and retained from more ancient WGD events. This further supported the absence of more recent WGD events in these species. We also found that all the BAHD gene pairs in strawberry, peach and European pear had Ka/Ks < 1, indicating that they have experienced strong purifying selection.
The diversity of the biochemical functions of the BAHD genes has been determined in many species. For example, the BAHD acyltransferase gene can catalyze the last step in cocaine biosynthesis , and the protein plays an important role in plant innate immunity . Furthermore, AAT gene functions have been determined in several species. For instance, the AAT gene in apple, MdAAT2, may provide resistance to stress and can change the volatile compound profile . In Mountain papaya (Vasconcellea pubescens), VpAAT1 is involved in ester biosynthesis . Despite the diverse functions of the BAHD gene family members, we focused on their roles in ester synthesis. Volatile esters produced by these AAT genes usually promote the recognition of plants as food because they contribute to the “fruity” aroma of edible fruits, and some esters are also the key flavor or smell component of a particular plant . In this study, our RNA-seq data revealed that 37 BAHD genes were detected at various expression levels in the four stages of pear fruit development. Surprisingly, though lacking motif3 (DFGWG), the Pbr019034.1 and Pbr020016.1 genes were both still expressed at all the stages. Based on transcriptome expression profiles combined with the ester content analysis, four potential genes (Pbr020016.1, Pbr019034.1, Pbr014028.1 and Pbr029551.1) belonging to clades I and IV and that showed strong correlations with ester content changes during fruit ripening, were selected for further qRT-PCR analysis. The assessed expression patterns of the four genes, Pbr020016.1, Pbr029551.1, Pbr019034.1, and Pbr014028.1, were basically consistent with the RNA-seq data.
Here, we identified 717 BAHD genes from seven species, and among them, 114 belonged to Chinese white pear. We divided the BAHD superfamily into five large clades according to the classification results from model plants. BAHD genes expanded in seven Rosaceae species and experienced strong purifying selection except for a few gene pairs with Ka/Ks values greater than one. Finally, after RNA-seq data and qRT-PCR analysis, Pbr020016.1, Pbr029551.1, Pbr019034.1 and Pbr014028.1 were found to be candidate genes related to ester synthesis. These results provide a foundation for further studies of the molecular mechanisms underlying aroma biosynthesis and release.
Source of plants
The ‘Dangshan suli’  fruit were picked at four different developmental stages from the pear germplasm orchard of the Center of Pear Engineering Technology Research located at Jiangpu in Nanjing. The fruit samples were collected on 26th May (45 DAF), 27th June (75 DAF), 28th July (105 DAF) and 6th Sep (145 DAF) in 2017. Previously published RNA-seq data was used to analyze the expression patterns of ‘Dangshan suli’ BAHD [61, 62]. All the samples were ground in liquid nitrogen and stored at − 80 °C. The genome annotation files and genome sequences of pear were collected from the Nanjing Agricultural University pear genome project website (http://peargenome.njau.edu.cn), and the other four Rosaceae species sequences (Fragaria vesca, Pyrus communis, Prunus avium and Rubus occidentalis) were downloaded from the Genome Database for Rosaceae (http://www.rosaceae.org). The sequences of apple (M. domestica) and peach (P. persica) were downloaded from Joint Genome Institude (http://www.jgi.doe.gov/).
Sequence identification and collection
To identify putative BAHD genes from pear and the six other species, peach, apple, strawberry, European pear, sweet cherry and black raspberry, several approaches were employed. Using a BAHD family characteristic domain (Pfam: PF02458) as a query sequence in accordance with the HMM configuration file (PF02458) of BAHD, we searched for candidate genes using the HMM with E-values <1e− 10. We downloaded the HMM profile (PF02458) from the Pfam website (https://pfam.xfam.org/). The online site SMART (http://smart.embl-heidelberg.de/) was used to determine the domain PF02458 of BAHD, and then their HXXXD and DFGWG motifs were inspected. The online website MAFFT (https://mafft.cbrc.jp/alignment/server/) was used for multiple sequence alignments, and the comparison results were download to GeneDoc (http://www.pscedu/biomed/genedoc).
We use the online website MAFFT for multiple sequence alignments, and then upload the comparison results to IQ-TREE (http://www.cibiv.at/software/iqtree) . The ML phylogenetic trees were constructed with bootstraps of 1000 using IQ-TREE. To verify the ML phylogenetic tree, a NJ phylogenetic tree was constructed using MEGA 7.0, with a bootstrap of 1000 .
Conservative motif analysis of BAHD family members
The conservative motifs were analyzed using the online software MEME (http://meme.sdsc.edu/meme4_3_0/intro.html). The maximum value of the motif was set to 20, and the motif length was set between six and 200.
Identification of gene duplication modes and a collinearity analysis
A gene duplication analysis among seven Rosaceae genomes was performed, and the method developed by PGDD (http://chibba.agtec.uga.edu/duplication/) was carried out locally . First, the chromosomal locational information of the BAHD gene family members and related gene pair information were obtained from the genome annotation files of each species, and then the localization and synteny of the BAHD genes were determined using TB-tools software . MCScanX was further used to identify different duplication patterns in the BAHD superfamily .
Calculating Ka and Ks
Ka, Ks and the Ka/Ks ratio were calculated using the calculate_Ka_Ks_pipeline (https://github.com/qiaoxin/Scripts_for_GB/tree/master/calculate_Ka_Ks_pipeline) . In brief, the coding sequences and gene pairs were prepared first. Then, we ran the computing_Ka_Ks_pipe.pl script to automatically perform multiple alignments using MAFFT software and to convert the AXT format, which was used as input for KaKs_Calculator , to calculate Ka and Ks using the GMYN model. Finally, the readable results, including Ka, Ks, Ka/Ks and P-value, were generated.
Expression analysis of BAHD in 15 samples of P. bretschneideri and a qRT-PCR analysis
The RNA-seq data of 15 ‘Dangshansuli’ samples were used to analyze the expression patterns of ‘Dangshansuli’ BAHD, including Fruit_S1, Fruit_S2, Fruit_S3, Fruit_S4 (obtained from the NCBI biproject PRJNA563942), Root (data not shown), Stem, Leaf, Bud, Petal, Sepal, Ovary, Mature pollen, Hydrated pollen, Pollen tube and Stop growth pollen [61, 62]. Then, the gene expression heatmap was drawn using TB-Tools .
Based on the correlation analysis between transcriptome and ester content data of pear fruit during four developmental stages, we selected five possible AAT genes for qRT-PCR. The primers of all the possible genes were designed using Primer Premier 6.0. Total RNA was extracted using a Plant Total RNA Isolation Kit Plus (Fuji, China), and cDNA was synthesized using TransScript One-Step gDNA Removal and cDNA Synthesis SuperMix (TransGen, China). The PCR mixture was as follows: 1 μl of sense and anti-sense primer (10 μM), 5 μl fluorescent dye, 1 μl template and 3 μl sterilized water. qRT-PCR was performed using a LightCycler 480 SYBRGREEN I Master (Roche, USA). We ran the PCR reaction under the following conditions: 10 min at 95 °C, followed by 45 cycles of 95 °C for 3 s, 60 °C for 10 s and 72 °C for 30 s. Then, the 2−△△ct method was used to determine the relative expression with the reference genes being tubulin and WD-repeat protein (Additional file 1: Table S9).
Analysis of volatile aroma compounds
The SPME-GC method was used to determine volatile aromas from Chinese white pear. The concentrations of components (μg/g) = [peak area of component/peak area of internal standard × the concentration of internal standard (g/ml) × 5 μl]/mass of sample (g). The concentration of the internal standard 3-nonanone was 0.82 × 10− 3 g/ml . The SPME fibers used in this study to adsorb volatile organic compounds were coated with a 65 μm thickness of polydimethylsiloxane–divinylbenzene (65 μm PDMS/DVB; Supelco Co, Bellefonte, PA, USA) . The fibers were activated before sampling according to the manufacturer’s instructions. In each extraction, 5 g of pulp were placed into a 20-ml screw-cap headspace brown vial and then add 5 ml of NaCl solution (0.36 g/ml), finally, add 5 ul 3-nonanone solution (0.82 g/l) as internal standard . The mixture was placed in a constant-temperature water bath at 40 °C while the SPME fibers were exposed to the headspace of the sample for 30 min to adsorb the analytes, and then introduced into the heated inlet of the chromatograph for desorption at 250 °C for 5 min in splitless mode. The specific GC-MS conditions are as follows: The organic extract was analysed with a Bruker 320 mass selective detector coupled to a Bruker 450 gas chromatograph, equipped with a 30 m × 0.25 um × 0.25 mm BR-5 MS (5% phenyl-polymethylsiloxane) capillary column. Helium was used as the carrier gas at a flow of 1.0 ml/min. The injector and detector temperature were 250 and 280 °C, respectively. Mass spectra were recorded at 70 eV in electron impact (EI) ionization mode. The temperatures of the quadrupole mass detector and ion source were 150 and 230 °C, respectively. The temperature of the transfer line was 280 °C. Finally, volatile organic compounds were initially identified by comparing the mass spectra of the samples with the data system library (NIST 2013) .
Availability of data and materials
The pear genome datasets used during the current study are available in our pear center website (http://peargenome.njau.edu.cn/), the other four Rosaceae species sequences (Fragaria vesca, Pyrus communis, Prunus avium and Rubus occidentalis) were downloaded from the GDR (http://www.rosaceae.org). The sequence of apple (Malus domestica) and peach (Prunus persica) were collected from JGI (http://www.jgi.doe.gov/). The RNA-seq data were obtained from the NCBI database (https://www.ncbi.nlm.nih.gov/).
Hidden Markov Model
Million years ago
Wu J, Wang ZW, Shi ZB, Zhang S, et al. The genome of the pear (Pyrus bretschneideri Rehd.). Genome Res. 2013;23(2):396–408.
Wang HB, Chen XS, Xin PG, et al. GC-MS analysis of volatile components in several early apple cultivars. J Fruit Sci. 2007;1.
Xin R, Liu X, Wei C, et al. E-nose and GC-MS reveal a difference in the volatile profiles of white-and red-fleshed peach fruit. Sensors. 2018;18(3):765.
Pino JA, Winterhalter P, Castro-Benítez M. Odour-active compounds in baby banana Fruit (Musa acuminata AA Simmonds cv. Bocadillo). Int J Food Prop. 2017;20(sup2):1448–55.
Chen YY, Yin H, Wu X, Shi XJ, Qi KJ, Zhang SL. Comparative analysis of the volatile organic compounds in mature fruits of 12 occidental pear (Pyrus communis L.) cultivars. Sci Hortic. 2018;240:239–48.
Qin G, Tao S, Cao Y, et al. Evaluation of the volatile profile of 33 Pyrus ussuriensis cultivars by HS-SPME with GC–MS. Food Chem. 2012;134(4):2367–82.
Sanz C, Olias JM, Perez AG. Aroma biochemistry of fruits and vegetables. Phytochemistry of fruit and vegetables. New York: Oxford University Press Inc.; 1997. p. 125–55.
Schwab W, Schreier P. Enzymic formation of flavor volatiles from lipids. Lipid Biotechnol CRC Press. 2002:295–320.
Paillard NNM. The flavour of apple, pears and quinces, in food flavours: part C. Flavor Fruits. 1990.
Bartley IM, Stoker PG, Martin ADE, et al. Synthesis of aroma compounds by apples supplied with alcohols and methyl esters of fatty acids. J Sci Food Agric. 1985;36(7):567–74.
El Hadi MAM, Zhang FJ, Wu FF, et al. Advances in fruit aroma volatile research. Molecules. 2013;18(7):8200–29.
Pichersky E, Noel JP, Dudareva N. Biosynthesis of plant volatiles: Nature's diversity and ingenuity. Science. 2006;311(5762):808–11.
D'Auria JC. Acyltransferases in plants: a good time to be BAHD. Curr Opin Plant Biol. 2006;9(3):331–40.
Srivastava S, Sangwan RS. Analysis of Artemisia annua transcriptome for BAHD alcohol acyltransferase genes: identification and diversity of expression in leaf, stem and root. J Plant Biochem Biotechnol. 2012;21(1):S108–18.
D'Auria JC, Reichelt M, Luck K, et al. Identification and characterization of the BAHD acyltransferase malonyl CoA: Anthocyanidin 5-O-glucoside-6 -O-malonyltransferase (At5MAT) in Arabidopsis thaliana. FEBS Lett. 2007;581(5):872–8.
St-Pierre B, Luca VD. Chapter nine evolution of acyltransferase genes: origin and diversification fo the BAHD superfamily of acyltransferases involved in secondary metabolism. Recent Adv Phytochem. 2000;34(00):285–315.
Isabel M, Dylan K. Role of HXXXD-motif/BAHD acyltransferases in the biosynthesis of extracellular lipids. Plant Cell Rep. 2015;34(4):587–601.
El-Sharkawy I, Manriquez D, Flores FB, Regad F, Bouzayen M, Latche A, Pech JC. Functional characterization of a melon alcohol acyl-transferase gene family involved in the biosynthesis of ester volatiles. Identification of the crucial role of a threonine residue for enzyme activity. Plant Mol Biol. 2005;59(2):345–62.
Dudareva N, D'Auria JC, Nam KH, Raguso RA, Pichersky E. Acetyl-CoA : benzylalcohol acetyltransferase - an enzyme involved in floral scent production in Clarkia breweri. Plant J. 1998;14(3):297–304.
St-Pierre B, Laflamme P, Alarco AM, De Luca V. The terminal O-acetyltransferase involved in vindoline biosynthesis defines a new class of proteins responsible for coenzyme A-dependent acyl transfer. Plant J. 1998;14(6):703–13.
Yang Q, Reinhard K, Schiltz E, Matern U. Characterization and heterologous expression of hydroxycinnamoyl/benzoyl-CoA : anthranilate N-hydroxycinnamoyl/benzoyltransferase from elicited cell cultures of carnation, Dianthus caryophyllus L. Plant Mol Biol. 1997;35(6):777–89.
Fujiwara H, Tanaka Y, Yonekura-Sakakibara K, Fukuchi-Mizutani M, et al. cDNA cloning, gene expression and subcellular localization of anthocyanin 5-aromatic acyltransferase from Gentiana triflora. Plant J. 1998;16(4):421–31.
Fujiwara H, Tanaka Y, Fukui Y, Ashikari T, et al. Purification and characterization of anthocyanin 3-aromatic acyltransferase from Perilla frutescens. Plant Sci. 1998;137(1):87–94.
Souleyre EJF, Greenwood DR, Friel EN, et al. An alcohol acyl transferase from apple (cv. Royal Gala), MpAAT1, produces esters involved in apple fruit flavor. FEBS J. 2005;272(12):3132–44.
Cumplido-Laso G, Medina-Puche L, Moyano E, et al. The fruit ripening-related gene FaAAT2 encodes an acyl transferase involved in strawberry aroma biogenesis. J Exp Bot. 2012;63(11):4275–90.
Aharoni A, Keizer LCP, Bouwmeester HJ, et al. Identification of the SAAT gene involved in strawberry flavor biogenesis by use of DNA microarrays. Plant Cell. 2000;12(5):647–61.
D'Auria JC, Pichersky E, Schaub A, Hansel A, et al. Characterization of a BAHD acyltransferase responsible for producing the green leaf volatile (Z)-3-hexen-1-yl acetate in Arabidopsis thaliana. Plant J. 2007;49(2):194–207.
Yu XH, Gou JY, Liu CJ. BAHD superfamily of acyl-CoA dependent acyltransferases in Populus and Arabidopsis: bioinformatics and gene expression. Plant Mol Biol. 2009;70(4):421–42.
Hoffmann L, Maury S, Martz F, Geoffroy P, et al. Purification, cloning, and properties of an acyltransferase controlling shikimate and quinate ester intermediates in phenylpropanoid metabolism. J Biol Chem. 2003;278(1):95–103.
Niggeweg R, Michael AJ, Martin C. Engineering plants with increased levels of the antioxidant chlorogenic acid. Nat Biotechnol. 2004;22(6):746–54.
Grienenberger E, Besseau S, Geoffroy P, Debayle D, et al. A BAHD acyltransferase is expressed in the tapetum of Arabidopsis anthers and is involved in the synthesis of hydroxycinnamoyl spermidines. Plant J. 2009;58(2):246–59.
Luo J, Nishiyama Y, Fuell C, Taguchi G, et al. Convergent evolution in the BAHD family of acyl transferases: identification and characterization of anthocyanin acyl transferases from Arabidopsis thaliana. Plant J. 2007;50(4):678–95.
Legrand G, Delporte M, Khelifi C, Harant A, Vuylsteker C, Morchen M, Hance P, Hilbert JL, Gagneul D. Identification and characterization of five BAHD Acyltransferases involved in Hydroxycinnamoyl Ester metabolism in chicory. Front Plant Sci. 2016;7:741.
Qiao X, Li M, Li L, Yin H, et al. Genome-wide identification and comparative analysis of the heat shock transcription factor family in Chinese white pear (Pyrus bretschneideri) and five other Rosaceae species. BMC Plant Biol. 2015;15(1):12.
Maher C, Stein L, Ware D. Evolution of Arabidopsis microRNA families through duplication events. Genome Res. 2006;16(4):510–9.
Qiao X, Li Q, Yin H, Qi K, Li L, et al. Gene duplication and evolution in recurring polyploidization–diploidization cycles in plants. Genome Biol. 2019;20(1):38.
Lemoine F, Lespinet O, Labedan B. Assessing the evolutionary rate of positional orthologous genes in prokaryotes using synteny data. BMC Evol Biol. 2007;7:237.
Jun J, Mandoiu II, Nelson CE. Identification of mammalian orthologs using local synteny. BMC Genomics. 2009;10.
Xu LL, Qiao X, Zhang MY, Zhang SL. Genome-wide analysis of aluminum-activated malate transporter family genes in six rosaceae species, and expression analysis and functional characterization on malate accumulation in Chinese white pear. Plant Sci. 2018;274:451–65.
Fawcett JA, Maere S, Van de Peer Y. Plants with double genomes might have had a better chance to survive the cretaceous-tertiary extinction event. Proc Natl Acad Sci. 2009;106(14):5737–42.
Starr TK, Jameson SC, Hogquist KA. Positive and negative selection of T cells. Annu Rev Immunol. 2003;21(1):139–76.
Ma XY, Koepke J, Panjikar S, et al. Crystal structure of vinorine synthase, the first representative of the BAHD superfamily. J Biol Chem. 2005;280(14):13576–83.
Morales-Quintana L, Moya-Leon MA, Herrera R. Computational study enlightens the structural role of the alcohol acyltransferase DFGWG motif. J Mol Model. 2015;21(8):216.
Defilippi BG, Kader AA, Dandekar AM. Apple aroma: alcohol acyltransferase, a rate limiting step for ester biosynthesis, is regulated by ethylene. Plant Sci. 2005;168(5):1199–210.
Cao XM, Xie KL, Duan WY, Zhu YQ, et al. Peach Carboxylesterase PpCXE1 is associated with catabolism of volatile esters. J Agric Food Chem. 2019;67(18):5189–96.
Galaz S, Morales-Quintana L, Moya-León MA, et al. Structural analysis of the alcohol acyltransferase protein family from Cucumis melo shows that enzyme activity depends on an essential solvent channel. FEBS J. 2013;280(5):1344–57.
Lucchetta L, Manriquez D, El-Sharkawy I, Flores FB, et al. Biochemical and catalytic properties of three recombinant alcohol acyltransferases of melon. Sulfur-containing ester formation, regulatory role of CoA-SH in activity, and sequence elements conferring substrate preference. J Agric Food Chem. 2007;55(13):5213–20.
Freeling M. Bias in plant gene content following different sorts of duplication: tandem, whole-genome, segmental, or by transposition. Annu Rev Plant Biol. 2009;60:433–53.
Maere S, De Bodt S, Raes J, Casneuf T, et al. Modeling gene and genome duplications in eukaryotes. Proc Natl Acad Sci. 2005;102(15):5454–9.
Friedman R, Hughes AL. Pattern and timing of gene duplication in animal genomes. Genome Res. 2001;11(11):1842–7.
Moore RC, Purugganan MD. The early stages of duplicate gene evolution. Proc Natl Acad Sci. 2003;100(26):15682–7.
Li JM, Qin MF, Qiao X, Cheng YS, et al. A new insight into the evolution and functional divergence of SWEET transporters in Chinese white pear (Pyrus bretschneideri). Plant Cell Physiol. 2017;58(4):839–50.
Ma C, Zhang HP, Li JM, Tao ST, Qiao X, et al. Genome-wide analysis and characterization of molecular evolution of the HCT gene family in pear (Pyrus bretschneideri). Plant Syst Evol. 2017;303(1):71–90.
Du DL, Hao RJ, Cheng TR, Pan HT, et al. Genome-wide analysis of the AP2/ERF gene family in Prunus mume. Plant Mol Biol Report. 2013;31(3):741–50.
Guo CL, Guo RR, Xu XZ, Gao M, Li XQ, et al. Evolution and expression analysis of the grape (Vitis vinifera L.) WRKY gene family. J Exp Bot. 2014;65(6):1513–28.
Schmidt GW, Jirschitzka J, Porta T, Reichelt M, et al. The last step in cocaine biosynthesis is catalyzed by a BAHD Acyltransferase. Plant Physiol. 2015;167(1):89–101.
Zheng ZY, Qualley A, Fan BF, Dudareva N, et al. An important role of a BAHD acyl transferase-like protein in plant innate immunity. Plant J. 2009;57(6):1040–53.
Li DP, Shen J, Wu T, Xu YF, et al. Overexpression of the apple alcohol acyltransferase gene alters the profile of volatile blends in transgenic tobacco leaves. Physiol Plant. 2008;134(3):394–402.
Balbontin C, Gaete-Eastman C, Fuentes L, et al. VpAAT1, a gene encoding an alcohol Acyltransferase, is involved in Ester biosynthesis during ripening of mountain papaya fruit. J Agric Food Chem. 2010;58(8):5114–21.
Gunther CS, Chervin C, Marsh KB, et al. Characterisation of two alcohol acyltransferases from kiwifruit (Actinidia spp.) reveals distinct substrate preferences. Phytochemistry. 2011;72(8):700–10.
Li QH, Qiao X, Yin H, Zhou YH, et al. Unbiased subgenome evolution following a recent whole-genome duplication in pear (Pyrus bretschneideri Rehd.). Hortic Res. 2019;6.
Zhou HS, Qi KJ, Liu X, Yin H, et al. Genome-wide identification and comparative analysis of the cation proton antiporters family in pear and four other Rosaceae species. Mol Gen Genomics. 2016;291(4):1727–42.
Nguyen LT, Schmidt HA, von Haeseler A, Minh BQ. IQ-TREE: a fast and effective stochastic algorithm for estimating maximum-likelihood phylogenies. Mol Biol Evol. 2015;32(1):268–74.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33(7):1870–4.
Lee TH, Tang HB, Wang XY, Paterson AH. PGDD: a database of gene and genome duplication in plants. Nucleic Acids Res. 2013;41(D1):D1152–8.
Chen C, Xia R, Chen H, et al. TBtools, a toolkit for biologists integrating various biological data handling tools with a user-friendly interface. BioRxiv. 2018.
Wang Y, Tang H, DeBarry JD, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49.
Wang DP, Zhang YB, Zhang Z, et al. KaKs_Calculator 2.0: a toolkit incorporating gamma-series methods and sliding window strategies. Genomics, Proteomics Bioinformatics. 2010;8(1):77–80.
The authors would like to thank everyone who contributed to this article.
This study was supported by the National Key Research and Development Program(2018YFD1000200), the National Natural Science Foundation of China (31701890, 31601708), Fundamental Research Funds for the Central Universities (KJQN201818). These fundings provided the financial support to the research projects, but did not involve in project design, data collection, analysis, or preparation of the manuscript.
Ethics approval and consent to participate
All the pear materials used and analyzed for this study were collected from the Center of Pear Engineering Technology Research, Nanjing Agricultural University, which were public and available for non-commercial purpose. This article did not contain any studies with human participants or animals performed by any of the authors.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Gene lists and basic feature of all identified BAHD genes. Table S2. Motif sequences identified by MEME tools in pear BAHDs. Table S3. Information of gene duplication events identified in all identified BAHD genes. Table S4. Collinearity relationship among BAHD genes in the same species. Table S5. Ka, Ks and Ka/Ks value of duplicated gene pairs among BAHD. Table S6. The TPM value of BAHD genes in pear. Table S7. Content of total ester at different developmental stages of Chinese white pear. Table S8. The list of molecules identified as esters. Table S9. Primers for qRT-PCR of candidate genes in Chinese white pear BAHD gene family.
Figure S1. Phylogenetic analysis of BAHD from Arabidopsis, Pyrus bretschneideri and Populus. The software MEGA 7.0 was used to construct the phylogenetic tree. The amino-acid sequences of Arabidopsis and Populus were obtained from phytozome (https://phytozome.jgi.doe.gov/pz/portal.html#).
Figure S2. Phylogenetic analysis of BAHD from Arabidopsis, Pyrus communis and Populus. The software MEGA 7.0 was used to construct the phylogenetic tree. The amino-acid sequences of Arabidopsis and Populus were obtained from phytozome (https://phytozome.jgi.doe.gov/pz/portal.html#).
About this article
Cite this article
Liu, C., Qiao, X., Li, Q. et al. Genome-wide comparative analysis of the BAHD superfamily in seven Rosaceae species and expression analysis in pear (Pyrus bretschneideri). BMC Plant Biol 20, 14 (2020). https://doi.org/10.1186/s12870-019-2230-z
- Volatile esters