Skip to main content
  • Research article
  • Open access
  • Published:

Genome-wide characterization, evolution and expression profiling of UDP-glycosyltransferase family in pomelo (Citrus grandis) fruit



Pomelo is one of the three major species of citrus. The fruit accumulates a variety of abundant secondary metabolites that affect the flavor. UDP-glycosyltransferases (UGTs) are involved in the glycosylation of secondary metabolites.


In the present study, we performed a genome-wide analysis of pomelo UGT family, a total of 145 UGTs was identified based on the conserved plant secondary product glycosyltransferase (PSPG) motif. These UGT genes were clustered into 16 major groups through phylogenetic analysis of these genes with other plant UGTs (A-P). Pomelo UGTs were distributed unevenly among the chromosomes. At least 10 intron insertion events were observed in these UGT genome sequences, and I-5 was identified to be the highest conserved one. The expression profile analysis of pomelo UGT genes in different fruit tissues during development and ripening was carried out by RNA-seq.


We identified 145 UGTs in pomelo fruit through transcriptome data and citrus genome database. Our research provides available information on UGTs studies in pomelo, and provides an important research foundation for screening and identification of functional UGT genes.


Plants produce a variety of secondary metabolites that are involved in important mechanisms at different developmental stages. As an important modification reaction, glycosylation is usually the last step in the biosynthesis of secondary metabolites in plants. In coordination with methylation, hydroxylation, and acylation, glycosylation contributes to the diversity and complexity of secondary metabolites in plants [1]. Glycosylation is the main mechanism for stabilizing and enhancing the solubility of metabolites in plants, thereby regulating plant signaling pathways and intracellular homeostasis [2, 3].

Glycosyltransferase catalyzes glycosylation [4]. In 2020, a total of 110 glycosyltransferase (GT) super families were included in the CAZy database (, of which the GT1 family had the largest number of gene members and is mainly in plants. The GT1 family is commonly referred to as UDP-glycosyltransferase (UGT), because primarily it catalyzes the transfer of UDP-sugars to specific receptors, including plant hormones, secondary metabolites, and xenobiotics [5]. Plant UGT has a conserved sequence of 44 amino acids at the C-terminus, named the plant secondary product glycosyltransferase box (PSPG box), which is responsible for the binding of glycosyl groups [5]. The N-terminus, which varies considerably in sequence, is responsible for the recognition of different receptor molecules [1].

UGTs involved in plant secondary metabolism often show a wide range of substrate specificities. Several UGT genes were functionally characterized to glycosylated terpenoids, phenylpropanoids, and flavonoids in fruits. The phloretin-specific glycosyltransferase UGT88F1 was identified in apple [6]. VvGT5 and VvGT6 from grapevine [7], FaGT6 and FaGT7 from strawberry [8], were identified as being involved in the glycosylation of flavonoids. AdGT4 was identified as glycosylating terpene alcohols in kiwifruit [9], while F3GT1 was involved in the biosynthesis of kiwifruit anthocyanins [10]. In peach fruit, PpUGT85A2 was identified as glycosylating terpenoids and phenylpropanoids [11] and PpUGT78A1 and PpUGT78A2 as being responsible for glycosylation of anthocyanins [12]. In citrus, some UGT members that catalyze flavonoid glycosylation have been reported, including Cm1_2RhaT [13] from pomelo, CsUGT76F1 [14] and Cs1,6RhaT [15, 16] from sweet orange, and three other UGTs were considered to be putative terpenoid glycosyltransferases [17].

To date, complete and nearly complete genome sequencing programs have been available for many plant species, providing the basis for the genome-wide research of the UGT gene family. It has been found that UGT exists in plants as multigene families and has expanded in many plant species. More than 100 UGT members have been identified in Arabidopsis, and even more UGT family members have been found in other plant species. For example, 147, 179, and 180 UGTs have been identified in Zea mays [18], Triticum aestivum [19], and Oryza sativa [5], respectively. In addition, for fruits, genome-wide analysis of UGT gene families has also reported, including in Malus x domestica [5], Vitis vinifera [5], and Prunus persica [20]. However, information on UGT in other fruits is still limited.

Pomelo is one of the most important economic citrus species and the fruit contain special flavors and are rich in flavonoids which are good for human health. In this study, 145 UGT gene members were identified in pomelo based on genome database analysis. Phylogenetic analysis, chromosome localization and gene structure of Extron-intron analysis were carried out. Expression patterns of pomelo UGT genes in various fruit tissues during development and ripening were analyzed by RNA-seq. Genome-wide analysis of UGT gene family provides insights for future screening and functional identification of pomelo UGTs.


Identification of pomelo UGTs

A total of 145 UGTs was identified in pomelo fruit as containing a consensus sequence (PSPG box) at the C-terminus of the protein. These UGT genes encoded predicted proteins ranging from 144 to 680 amino acids (average 459 amino acids) (Table S1). The molecular weight from 16.39 to 76.87 kDa (average Mw = 51.26 kDa) and the isoelectric point (pI) ranged from 4.82 to 9.18 (average pI = 5.81). Subcellular localization of these genes indicated that 88 UGT members (61% of UGTs) were probably in the cytoplasm, and 35 (24%) and 17 (12%) UGTs were most probably in the plasma membrane and chloroplast, respectively. Only one UGT (Cg7g000340) was predicted to be located in the mitochondria, two (Cg6g025740 and Cg8g023190) in the nucleus and two (Cg3g014800 and Cg3g014820) extracellular (Fig. S1, Table S2).

Phylogenetic analysis of pomelo UGTs

In order to explore the evolutionary relationships of plant UGT families, the phylogenetic tree was constructed based on the pomelo and other plant UGT protein sequences, including Arabidopsis, citrus, maize, tomato, grapevine, peach, apple, kiwifruit and strawberry (Fig. 1). All UGT members were divided into 16 phylogenetic groups, including 14 conservative groups (A-N) identified in Arabidopsis [2], and two newly identified groups O and P found in other plants, such as grapevine [5]. Cm1_2RhaT (100% amino acid sequence identity to Cg1g023820) from pomelo and Cs1,6RhaT from sweet orange that were identified as flavonoid 7-O-UGTs [13, 15, 16], were clustered in group A. CsUGT76F1, located in group H, was identified as being involved in the biosynthesis of flavonoid 7-O-glucosides and 7-O-rhamnosides in sweet orange [14]. Arabidopsis UGT73C6 (flavonol-3-O-rhamnoside-7-O-glucosyltransferase) [21] and strawberry FaGT7 (flavonol-3-O-glucosyltransferase) [8] were located in group D. Other UGTs responsible for flavonol-3-O-glycosylation were located in group F, including UGT78D1 from Arabidopsis [22], and VvGT5 and VvGT6 from grapevine [7].

Fig. 1
figure 1

Phylogenetic analysis of UGT family genes from pomelo. The tree was constructed using the neighbor-joining method by aligning the amino acid sequences of pomelo UGTs with several plant functional UGTs from Arabidopsis, citrus, maize, tomato, grapevine, peach, apple, kiwifruit and strawberry. The phylogenetic tree was visualized using the FigTree v1.4.2 program. Each of the 16 groups (A-P) is indicated in a specific color

Three putative terpenoid UGTs were isolated in ‘Valencia’ sweet orange, CsUGT1 and CsUGT3 were clustered in group L, while CsUGT2 was clustered in group D [17]. Several UGT73 (belonging to group D) and UGT71 family members (belonging to group E) were functional in the biosynthesis of anthocyanins and the glycosylation of volatile metabolites, including terpenoids [23, 24]. Some other UGTs in group G also have been functionally characterized as participating in terpenoid glycosylation and affecting fruit flavor, such as kiwifruit AdGT4 [9], grapevine VvGT14 [25], and peach PpUGT85A2 [11].

Distribution of plant UGTs in phylogenetic groups

The evolutionary pattern of the plant UGT gene family was analyzed by comparing the distribution of UGTs in the different phylogenetic groups (Table 1). During the evolution of higher plants, the five phylogenetic groups A, D, E, G, and L appeared to expand more than others, although the number of genes found in these groups varies widely among species. In pomelo, six phylogenetic groups, A, D, E, H, I, and L expanded more than the other groups. There were only 9 pomelo UGTs in group G, more than in Arabidopsis (6 UGTs), but much fewer than in other plants, especially peach and apple, which had up to 34 and 40 UGTs respectively in group G. The number of pomelo UGTs in the group I accounted for 12% of the total pomelo UGTs, much higher than in other plants (Fig. 2a). The proportion of pomelo UGTs in group H (about 12% of the total pomelo UGTs) was much higher (1.5 ~ 12 fold) than in other fruits such as peach (Prunus persica), apple (Malus x domestica) and grapevine (Vitis vinifera) (Fig. 2a).

Table 1 Number of the plant UGTs in the different phylogenetic groups
Fig. 2
figure 2

Expansion of the plant UGTs in several plant species. a The percentage of plant UGTs in the different phylogenetic groups. b Comparison of the fold increase of plant UGTs in each group with reference to Arabidopsis thaliana. Genes from different species are indicated by specific colors

It was worth noting that the number of plant UGTs in group I and group M was significantly increased by comparing the number of plant UGTs in each phylogenetic group with those reported in Arabidopsis (Fig. 2b). In Arabidopsis, there was only one UGT member in group I, while other plants contained 5–17 members, and the number of UGTs in pomelo was the highest (Table 1; Fig. 2b). In group M, the number of UGTs ranged from one in Arabidopsis to 14 in peach, and there was a 7-fold difference between Arabidopsis and pomelo. In addition, the number of pomelo UGTs in some groups was reduced relative to Arabidopsis, including group C, F, H and L.

Chromosomal location of UGT genes in pomelo

To summarize the genomic distribution of pomelo UGT genes, the chromosomal location of UGT genes was investigated based on the genome annotation information retrieved from the pomelo genomic databases (Fig. 3; Table S3). A total of 139 UGT genes were unevenly distributed on the pomelo genome of 9 chromosomes, the remaining 6 UGT genes were localized on the unknown chromosome (chrUn), including CgUng002730 of group K, CgUng021570 of group I, and four UGT genes of group D. In the pomelo genome, chromosome 2 contained the most UGT genes (23 members), followed by 21 members located on chromosome 8, and 20 members on chromosome 6. Only five members were distributed on chromosome 4, which contained the least number of UGT genes.

Fig. 3
figure 3

Chromosome distribution of pomelo UGTs. The pomelo UGTs were distributed among 9 chromosomes (chr1-chr9) and one unknown chromosome (chrUn). The different colored bars represent a chromosome, and chromosome numbers are displayed at the top of each chromosome. Different color letters represent different phylogenetic groups

Since pomelo UGTs could be divided into 16 groups, the localization of these groups on the chromosomes was observed (Fig. 3). The UGT genes of group E with the most members (25 genes) were randomly distributed across eight chromosomes (chromosome 1–7 and 9). For group I, chromosome 3 and 5 each contained six UGT genes, chromosome 6 had three UGT genes, and the remaining two members were located on chromosome 8 and unknown chromosome. Group M contained 7 UGT genes, five of which were located on chromosome 2 and two were located on chromosome 1.

Structural analysis of UGT genes in pomelo

To better explore the relationships among the structure and function of pomelo UGT genes, and further clarify the evolutionary relationships within the UGT gene family, the exon/intron structure was analyzed. Among the 145 pomelo UGT genes, 70 UGTs (48%) had no introns, 63 UGTs (43%) contained one intron, whereas the remaining eight UGTs contained two introns, and three UGTs contained three introns, only one UGT contained eight introns (Table 2; Fig. S2). For UGT groups, group E contained the largest number of genes with losing introns (22 members), followed by 15 in group D and 13 in group A. All members of group B, C and O had no introns. Most of the UGTs in group H and I contained one intron with 14 UGTs out of all 17 members (82%) in each group.

Table 2 Number of pomelo UGT genes in each group according to number of introns

After searching for all of the 75 intron-containing sequences and mapping the introns to the amino acid sequences, 10 independent intron insertion events were observed in the pomelo UGT gene family members (Fig. 4). Based on the positions in the protein sequences, these insertion events were numbered sequentially from I-1 to I-10. Intron 5 (I-5) was indicated to be a highest conserved intron, which was the most widespread intron of UGTs, containing 48 UGT members (64% of the intron containing UGTs), except for group E, L and M. All members of group F, K, J, N and P, and most members of group G, H and I contained the intron 5. A total of eight of the nine UGTs in group G contained intron 5, 13 of 17 in group I, and 12 of 17 in group H. Intron 6 was mainly observed in group L.

Fig. 4
figure 4

Distribution of introns among 75 pomelo UGT genes. The introns were mapped and numbered according to the alignment of the encoding sequences for the UGT genes. The position of introns inserted into each gene is represented by the scale on the top of the map. The gray lines indicate the length of their amino acid sequences. Arrows indicate the position of intron insertion in each gene. Intron phases are represented by different shapes of arrows for 0, 1, and 2, respectively. Phylogenetic relationship on the left indicate the classification of pomelo UGT genes

Most of the total 96 introns identified in the UGT gene structures of pomelo were in phase 1, accounting for 64% (61 introns), followed by 28% (27 introns) in phase 0, and only 8% (8 introns) in phase 2 (Fig. 4). For the highly conserved intron 5, each had one intron in phase 0 and 2, while phase 1 accounted for 96% of all introns. All members of intron 4 and 7 out of 9 UGTs with intron 6 were in phase 0. These findings indicated that most of the high conserved introns were ancient elements and their phases remained stable during evolution.

Expression profiles in different fruit tissues during development and ripening

To detect the expression profiles of 145 pomelo UGT genes, transcript abundances of UGTs in different fruit tissues during development and ripening were analyzed using RNA-seq data (Fig. 5; Fig. 6). The four different tissues of pomelo fruit were flavedo, albedo, segment membrane (SM) and juice sacs (JS) (Fig. 5a, b). A total of 111 UGT genes (accounting for 84.1% of total pomelo UGTs) were expressed in all four fruit tissues. Additionally, 4 UGT genes (accounting for 3%), three UGT genes (2.3%), and one UGT gene (0.8%) were specifically expressed in JS, SM, and flavedo, respectively, but no genes were specifically expressed in albedo (Fig. 5c).

Fig. 5
figure 5

Ripening pomelo fruit and Venn diagram of pomelo UGT genes. a Photo of pomelo fruit during three developmental stages: green stage at 80 days after full blossom (DAB), color breaking stage at 140 DAB, and mature stage at 200 DAB. b Four tissues of pomelo fruit: flavedo (the outer skin), albedo (the inner pith), segment membrane and juice sacs. c Venn diagram of UGT genes in pomelo fruit. The distribution of 145 UGT genes in the four tissues of pomelo fruit

Fig. 6
figure 6

Transcript levels of UGT genes in pomelo fruit during development and ripening. The pomelo UGT genes expression profiles of different fruit tissues were studied during three developmental stages: green stage at 80 days after full blossom (DAB), color breaking stage at 140 DAB, and mature stage at 200 DAB. The heat map was calculated by hierarchical clustering of UGT genes. The transcript abundance of UGT genes is represented by the color scale (representing − 2 to 2). Different letters indicate different phylogenetic groups of pomelo UGT genes

For pomelo fruits at different developmental stages, nearly half (71 members) of the UGT genes showed the highest level of transcript in the flavedo (Fig. 6). Among them, 13 UGTs (52%) belonged to group E, 10 UGTs (56% of members in group D) belonged to group D, 8 UGTs (47%) in group H, 9 UGTs (53%) in group I, and 7 UGTs (58%) in group L. The expression levels of 29 members in pomelo were highest at green stage (80 DAB), followed by 23 at color break stage (140 DAB), and 19 at mature stage (200 DAB). Cg1g023820 in group A had 100% identity with the amino acid sequence of Cm1_2RhaT from pomelo, which was identified to be a flavonoid 7-O-UGT [13, 15, 16], showed the highest transcript levels at color break stage (140 DAB) and mature stage (200 DAB) in flavedo of pomelo fruit. This finding was consistent with previous research. Considering the largest number of pomelo UGT genes in group E, the expression patterns in different fruit tissues during development and ripening were further analyzed (Fig. 6). A total of nine UGT genes showed the highest abundance of transcript at the green stage (80 DAB) of flavedo, two genes at the color break stage of flavedo, and two at the mature stage. Only one UGT gene, Cg2g037190 expressed highest at the color break stage and mature stage of albedo, while the two genes Cg3g022390 and Cg6g012170 had the highest expression level in the segment membrane (SM) of pomelo fruit, and three genes predominantly expressed in the juice sacs (JS).


Plant glycosyltransferases belong to a large and functionally diverse family of enzymes characterized as glycosyltransferase family 1, also named UDP-glycosyltransferase (UGT) [1]. Plant UGTs catalyze a wide range of substrates for glycosylation reactions, including plant hormones, plant secondary metabolites and biological homogeneous/heterologous substances (such as cyanogenic glycosides and herbicides). Plant secondary metabolites are generally divided into three categories: phenols, terpenes, and nitrogen-containing organic compounds. Some of these compounds are chemically modified, and glycosylation is one of the important modification reactions. Several UGT genes have been functionally characterized in plants, such as Arabidopsis thaliana [5], Zea mays [18], Triticum aestivum [26], Brassica rapa [27] and Prunus persica [20]. In citrus, only three UGTs have been functionally identified, Cm1_2RhaT from pomelo, Cs1,6RhaT and CsUGT76F1 from sweet orange associated with flavonoid glycosylation [13,14,15,16], and three putative terpenoid UGTs, CsUGT1, CsUGT2 and CsUGT3 [17]. However, no large-scale analysis was found in citrus.

In this study, a total of 145 UGT genes was identified in pomelo fruit, accounting for about 0.5% of the pomelo gene product, which is lower than 0.6% of peaches [20]. The pomelo UGT genes were clustered into 16 groups, including 14 highly conserved groups (A-N) and two newly discovered O and P groups (Table 1). A new group Q was found only in maize and wheat, suggesting that this group may exist only in monocotyledons and play an important role in the UGT gene evolution of monocotyledons. In contrast, group K had only one UGT member in maize and rice, and no member in wheat, implying that group K may expand members in dicotyledons. Only one member was identified in the pomelo group N, which is the same as in dicotyledons such as Arabidopsis, apple and peach. This finding provides additional evidence for previous speculation that the group N was mainly amplified in monocotyledons [18, 20]. In addition, group E contained the most UGT genes, accounting for 17% of all UGT genes in pomelo. Furthermore, the UGT genes of group E accounted for the highest proportion in the eight species compared in this study, indicating that group E has been expanded in various plant species. In addition, many plant UGT gene members belonging to group E have been functionally identified, including the glycosylation of small molecule volatile compounds, and synthesis of flavonoid glycosides, phlorizin, and anthocyanins [6, 23, 24], which indicates that group E made an important contribution to the glycosylation of plant secondary metabolites.

Introns are an important part of genes, although they do not participate in the encoding of proteins, the intron gain or loss events and the insertion position of introns relative to protein sequences are generally considered as the key clues to understand the evolution or diversification of gene family [28]. Intron mapping of 145 pomelo UGTs revealed that 48% UGT members lacked introns, which is more than the number (43%) in peach [20], while less than the number (58%) in Arabidopsis [5]. A total of ten intron insertion sites was identified in the pomelo UGT genes, the same with those found in peach [20]. Among these introns, intron 5 (I-5) was considered to be the most widely distributed intron, except for groups E, L and M, and the two groups B and C without introns. For the peach 145 UGTs [20], wheat 179 UGTs [26] and maize 147 UGTs [18], intron 5 was also found to be the oldest intron. For the remaining introns, some were predominantly observed in certain phylogenetic groups, such as intron 6 was mainly present in group L, which is similar to the observation in peach [20]. Interestingly, a large quantity of intron 5 were in phase 1, while many of intron 6 were in phase 0, and the number of phase 0 and 1 introns was greater than the number of phase 2 introns. This finding was consistent with previous research, showing that conserved introns were ancient elements and intron phases were generally conservative during evolution, and can remain constant for many years, because any deletion and insertion of small DNA fragments that cause a phase change may lead to changes in gene function, and be eliminated by natural selection [29].

In previous studies, three UGT genes, Cm1_2RhaT from pomelo (Citrus maxima), CsUGT76F1 and Cs1,6RhaT from sweet orange (Citrus sinensis) were functionally characterized to be involved in the biosynthesis of flavonoid 7-O-glucosides [15, 16]. Cm1_2RhaT showed 100% amino acid sequence identity to Cg1g023820 found in our study, CsUGT76F1 had 98.69% amino acid sequence identity with Cg7g000360, Cs1,6RhaT and Cg6g014140 had 81.26% identity. It was worth noting that Cg1g023820, Cg7g000360 and Cg6g014140 were specifically expressed in the flavedo of fruit, which is consistent with the high accumulation of glycosylated flavonoids in flavedo [30, 31]. Additionally, many (~ 50%) of the pomelo UGT genes showed the highest level of transcript in flavedo, indicating that UGTs play an important role in the biosynthesis of glycosylated secondary metabolites.


This study provided useful insights into the evolution, distribution, gene structure, and expression profiling of pomelo UDP-glycosyltransferase. A total of 145 UGTs was identified in pomelo fruit. These genes were distributed unevenly among the 9 chromosomes, and clustered into 16 phylogenetic groups (A-P). Ten intron positions were observed in pomelo UGT genes, which indicated that they played an important role in the evolution and divergence of UGT gene family. The tissue-specific expression in four different fruit tissues during development and ripening was demonstrated by RNA-seq analysis, suggesting essential roles of UGTs in pomelo. This research would facilitate the screening of candidate genes and further characterization of their substrate specificity and biological function.

Materials and methods

Plant materials

Yuhuan (C. grandis (L.) Osbeck ‘Yuhuan’) pomelo fruits were harvested from the KEHAO Fruit Professional Cooperative in Yuhuan County, Zhejiang Province, and the sampling permissions were obtained. Fruit were harvested at three developmental stages, including green stage, 80 days after full blossom (DAB), color breaking stage at 140 DAB, and mature stage at 200 DAB. At each sampling time, the fruit was divided into four different parts: flavedo (the outer skin), albedo (the inner pith), segment membrane and juice sacs. Three biological replicates, each with five fruits were harvested, frozen in liquid nitrogen, and stored at -80o C until further analysis.

Identification of UGT genes

The pomelo UGTs were identified by using ‘UDP-glycosyltransferase’ as a query to screen genes in functional annotation results from transcriptome data. Further confirmation of UGTs was based on the 44-amino acid conserved motif of the plant secondary product glycosyltransferase box (PSPG box). The pomelo UGT gene sequences and the information of UGTs, including chromosome location, exon-intron structure and description were obtained from the citrus genome database ( The length, molecular weight (Mw), and isoelectric point (pI) of each UGT protein were calculated by the online program ExPASy ( Subcellular localization of UGT proteins was predicted using the online analysis tool CELLO v2.5 system ( from Molecular Bioinformatics Center. The chromosome distribution of UGT genes was conducted with the MapChart (v2.32) software.

Phylogenetic analysis

The ClustalX v2.0 program was used for the alignment of the amino acid sequences of UGTs using the neighbor-joining (NJ) method. A phylogenetic tree was constructed using FigTree v1.4.2 program based on the 145 pomelo UGTs with several functional UGTs, including VvGT5 (GenBank XP_002271025), VvGT6 (NC_012017), VvGT7 (XP_002276546), and VvGT14 (XP_002285770) from grapevine (Vitis vinifera); MdPGT1 (XP_008339149) and MdUGT88F1 (EU246349) from apple (Malus x domestica); SlUGT5 (XP_004231207) from tomato (Solanum lycopersicum); F3GT1 (A0A2R6Q8R5) from kiwifruit (Actinidia chinensis); AdGT4 (KF954944) from kiwifruit (Actinidia deliciosa); FaGT1 (AY663784), FaGT2 (AY663785), FaGT6 (DQ289587), FaGT7 (DQ289588), FaUGT71A33 (XP_004303953.1), FaUGT71A34 (XP_004303954.2), FaUGT71A35 (XP_004303955.1), FaUGT71W2 (XP_011468178.1), FaUGT73B23 (XP 004304022.1), FaUGT73B24 (XP 004304022.1), and FaUGT71K3 (XP 004294260.1) from strawberry (Fragaria x ananassa); PpUGT85A2 (XP_007227474.1) from peach (Prunus persica); GRMZM5G834303 (NP_001148991.2) and GRMZM2G075387 (XP_008670630.1) from maize (Zea mays); Cm1,2RhaT (AY048882) from pomelo (Citrus maxima); Cs1,6RhaT (DQ119035), CsUGT76F1 (KDO69246.1), CsUGT1 (GQ221686), CsUGT2 (GQ221687), and CsUGT3 (GQ221688) from sweet orange (Citrus sinensis). And 19 UGT sequences were from Arabidopsis thaliana and obtained from

Intron mapping

The pomelo UGT intron map was constructed by determining the length, splice site, phase, and position of introns. The exon-intron structure of the UGT genes was illustrated with the online Gene Structure Display Server 2.0 program ( using genomic sequences and CDS sequences. The introns were serially numbered according to their position in the amino acid sequence by aligning all pomelo UGTs. Introns were divided into three types based on their phases: phase 0, phase 1, and phase 2. If the intron positioned between two triplet codons, it was defined as phase 0; If the intron positioned after the first and second bases of a codon, it was defined as phase 1 and phase 2 [20, 32].

Gene expression analysis

About 0.3 g of flavedo, 0.3 g of albedo, 0.3 g of segment membrane, and 1 g of juice sacs were used for pomelo fruit total RNA extraction according to Zhang et al. [33]. The quality and concentration of RNA samples were determined by NanoDrop One spectrophotometer at A260/A280 absorbance ratio. RNA integrity was verified by gel electrophoresis. Libraries for high-throughput Illumina strand-specific RNA-Seq were prepared as described previously [34]. The data was calculated by reads per kilobase per million mapped read (RPKM) values as transcript abundance. Three biological replicates were performed for gene expression analysis.

Data analysis

Multi Experiment Viewer (version 4.6.0) was used for heatmap analysis of UGT genes transcript abundance and construction of gene clusters in pomelo. VENNY 2.1 online program ( was used to make Venn diagram.

Availability of data and materials

The raw sequencing reads of transcriptome data in this study are available in the Sequence Read Archive (SRA) database, with the accession number PRJNA663973. All data generated or analysed during this study are included in this published article and its additional files.





Plant secondary product glycosyltransferase


Days after full blossom


Coding sequence




  1. Vogt T, Jones P. Glycosyltransferases in plant natural product synthesis: characterization of a supergene family. Trends Plant Sci. 2000;5:380–6.

    Article  CAS  PubMed  Google Scholar 

  2. Li Y, Baldauf S, Lim EK, Bowles DJ. Phylogenetic analysis of the UDP-glycosyltransferase multigene family of Arabidopsis thaliana. J Biol Chem. 2001;276(6):4338–43.

    Article  CAS  PubMed  Google Scholar 

  3. Ross J, Li Y, Lim EK, Bowles DJ. Higher plant glycosyltransferases. Genome Biol. 2001;2(2):1–6.

    Article  Google Scholar 

  4. Yonekura-Sakakibara K, Hanada K. An evolutionary view of functional diversity in family 1 glycosyltransferases. Plant J. 2011;66(1):182–93.

    Article  CAS  PubMed  Google Scholar 

  5. Caputi L, Malnoy M, Goremykin V, Nikiforova S, Martens S. A genome-wide phylogenetic reconstruction of family 1 UDP-glycosyltransferases revealed the expansion of the family during the adaptation of plants to life on land. Plant J. 2012;69(6):1030–42.

    Article  CAS  PubMed  Google Scholar 

  6. Dare AP, Yauk YK, Tomes S, McGhie TK, Rebstock RS, Cooney JM, et al. Silencing a phloretin-specific glycosyltransferase perturbs both general phenylpropanoid biosynthesis and plant development. Plant J. 2017;91(2):237–50.

    Article  CAS  PubMed  Google Scholar 

  7. Ono E, Homma Y, Horikawa M, Kunikane-Doi S, Imai H, Takahashi S, et al. Functional differentiation of the glycosyltransferases that contribute to the chemical diversity of bioactive flavonol glycosides in grapevines (Vitis vinifera). Plant Cell. 2010;22(8):2856–71.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  8. Griesser M, Vitzthum F, Fink B, Bellido ML, Raasch C, Munoz-Blanco J, et al. Multi-substrate flavonol O-glucosyltransferases from strawberry (Fragaria x ananassa) achene and receptacle. J Exp Bot. 2008;59(10):2611–25.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  9. Yauk YK, Ged C, Wang MY, Matich AJ, Tessarotto L, Cooney JM, et al. Manipulation of flavour and aroma compound sequestration and release using a glycosyltransferase with specificity for terpene alcohols. Plant J. 2014;80(2):317–30.

  10. Montefiori M, Espley RV, Stevenson D, Cooney J, Datson PM, Saiz A, et al. Identification and characterisation of F3GT1 and F3GGT1, two glycosyltransferases responsible for anthocyanin biosynthesis in red-fleshed kiwifruit (Actinidia chinensis). Plant J. 2011;65(1):106–18.

    Article  CAS  PubMed  Google Scholar 

  11. Wu B, Cao X, Liu H, Zhu C, Klee H, Zhang B, et al. UDP-glucosyltransferase PpUGT85A2 controls volatile glycosylation in peach. J Exp Bot. 2019;70(3):925–36.

    Article  CAS  PubMed  Google Scholar 

  12. Cheng J, Wei G, Zhou H, Gu C, Vimolmangkang S, Liao L, et al. Unraveling the mechanism underlying the glycosylation and methylation of anthocyanins in peach. Plant Physiol. 2014;166(2):1044–58.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Frydman A, Weisshaus O, Bar-Peled M, Huhman DV, Sumner LW, Marin FR, et al. Citrus fruit bitter flavors: isolation and functional characterization of the gene Cm1,2RhaT encoding a 1,2 rhamnosyltransferase, a key enzyme in the biosynthesis of the bitter flavonoids of citrus. Plant J. 2004;40(1):88–100.

    Article  CAS  PubMed  Google Scholar 

  14. Liu X, Lin C, Ma X, Tan Y, Wang J, Zeng M. Functional characterization of a flavonoid Glycosyltransferase in sweet Orange (Citrus sinensis). Front Plant Sci. 2018;9:166.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Frydman A, Liberman R, Huhman DV, Carmeli-Weissberg M, Sapir-Mir M, Ophir R, et al. The molecular and enzymatic basis of bitter/non-bitter flavor of citrus fruit: evolution of branch-forming rhamnosyltransferases under domestication. Plant J. 2013;73(1):166–78.

    Article  CAS  PubMed  Google Scholar 

  16. Chen J, Yuan Z, Zhang H, Li W, Shi M, Peng Z, et al. Cit1,2RhaT and two novel CitdGlcTs participate in flavor-related flavonoid metabolism during citrus fruit development. J Exp Bot. 2019;70(10):2759–71.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Fan J, Chen C, Yu Q, Li ZG, Gmitter FG. Characterization of three terpenoid glycosyltransferase genes in 'Valencia' sweet orange (Citrus sinensis L. Osbeck). Genome. 2010;53(10):816–23.

    Article  CAS  PubMed  Google Scholar 

  18. Li Y, Li P, Wang Y, Dong R, Yu H, Hou B. Genome-wide identification and phylogenetic analysis of Family-1 UDP glycosyltransferases in maize (Zea mays). Planta. 2014;239(6):1265–79.

    Article  CAS  PubMed  Google Scholar 

  19. He Y, Ahmad D, Zhang X, Zhang Y, Wu L, Jiang P, et al. Genome-wide analysis of family-1 UDP glycosyltransferases (UGT) and identification of UGT genes for FHB resistance in wheat (Triticum aestivum L.). BMC Plant Biol. 2018;18(1):67.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Wu B, Gao L, Gao J, Xu Y, Liu H, Cao X, et al. Genome-wide identification, expression patterns, and functional analysis of UDP Glycosyltransferase family in peach (Prunus persica L. Batsch). Front Plant Sci. 2017;8:389.

    PubMed  PubMed Central  Google Scholar 

  21. Jones P, Messner B, Nakajima J, Schaffner AR, Saito K. UGT73C6 and UGT78D1, glycosyltransferases involved in flavonol glycoside biosynthesis in Arabidopsis thaliana. J Biol Chem. 2003;278(45):43910–8.

    Article  CAS  PubMed  Google Scholar 

  22. Yin R, Messner B, Faus-Kessler T, Hoffmann T, Schwab W, Hajirezaei MR, et al. Feedback inhibition of the general phenylpropanoid and flavonol biosynthetic pathways upon a compromised flavonol-3-O-glycosylation. J Exp Bot. 2012;63(7):2465–78.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  23. Song C, Hong X, Zhao S, Liu J, Schulenburg K, Huang FC, et al. Glucosylation of 4-Hydroxy-2,5-Dimethyl-3(2H)-Furanone, the key strawberry flavor compound in strawberry fruit. Plant Physiol. 2016;171(1):139–51.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  24. Song C, Zhao S, Hong X, Liu J, Schulenburg K, Schwab W. A UDP-glucosyltransferase functions in both acylphloroglucinol glucoside and anthocyanin biosynthesis in strawberry (Fragaria x ananassa). Plant J. 2016;85(6):730–42.

    Article  CAS  PubMed  Google Scholar 

  25. Bönisch F, Frotscher J, Stanitzek S, Rühl E, Wüst M, Bitz O, et al. Activity-based profiling of a physiologic aglycone library reveals sugar acceptor promiscuity of family 1 UDP-glucosyltransferases from grape. Plant Physiol. 2014;166(1):23–39.

    Article  PubMed  PubMed Central  Google Scholar 

  26. Liu M, Ma Z, Sun W, Huang L, Wu Q, Tang Z, et al. Genome-wide analysis of the NAC transcription factor family in Tartary buckwheat (Fagopyrum tataricum). BMC Genomics. 2019;20(1):113.

    Article  PubMed  PubMed Central  Google Scholar 

  27. Yu J, Hu F, Dossa K, Wang Z, Ke T. Genome-wide analysis of UDP-glycosyltransferase super family in Brassica rapa and Brassica oleracea reveals its evolutionary history and functional characterization. BMC Genomics. 2017;18(1):474.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Rogozin IB, Lyonsweiler J, Koonin EV. Intron sliding in conserved gene families. Trends Genet. 2000;16:430–2.

    Article  CAS  PubMed  Google Scholar 

  29. Roy SW, Gilbert W. Rates of intron loss and gain: implications for early eukaryotic evolution. Proc Natl Acad Sci U S A. 2005;102:5773–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Zhang M, Nan H, Wang Y, Jiang X, Li Z. Comparison of flavonoid compounds in the flavedo and juice of two pummelo cultivars (Citrus grandis L. Osbeck) from different cultivation regions in China. Molecules. 2014;19(11):17314–28.

    Article  PubMed  PubMed Central  Google Scholar 

  31. Wang S, Yang C, Tu H, Zhou J, Liu X, Cheng Y, et al. Characterization and metabolic diversity of flavonoids in citrus species. Sci Rep. 2017;7(1):10549.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Barvkar VT, Pardeshi VC, Kale SM, Kadoo NY, Gupta VS. Phylogenomic analysis of UDP glycosyltransferase 1 multigene family in Linum usitatissimum identified genes with varied expression patterns. BMC Genomics. 2012;13:175.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Zhang B, Chen K, Bowen J, Allan A, Espley R, Karunairetnam S, et al. Differential expression within the LOX gene family in ripening kiwifruit. J Exp Bot. 2006;57(14):3825–36.

    Article  CAS  PubMed  Google Scholar 

  34. Zhang B, Tieman DM, Jiao C, Xu Y, Chen K, Fei Z, et al. Chilling-induced tomato flavor loss is associated with altered volatile synthesis and transient changes in DNA methylation. Proc Natl Acad Sci U S A. 2016;113(44):12580–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Download references


We would like to thank Prof. Ross Ferguson (Plant & Food Research, New Zealand) for critical reading and revisions to the manuscript, and Dr. Shaojia Li for help with modification.


This research was supported by the National Key R & D Program of China (2016YFD0400101), the National Natural Science Foundation of China (31672100), the 111 project (B17039), and the Scientific Research and Development Foundation of Zhejiang A & F University (2019FR046). The funding organizations provided the financial support to the research projects, but had no role in the design of the study, data collection and analysis, and writing of the manuscript.

Author information

Authors and Affiliations



BW and BZ designed the experiments. XL and BW carried out most of the experiments. BW analyzed the data. BW and KX wrote the manuscript. The authors read and approved the final manuscript.

Corresponding authors

Correspondence to Kai Xu or Bo Zhang.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1 Table S1.

Information of the pomelo UGTs identified in this study.

Additional file 2 Table S2.

Predictive information on subcellular localization of pomelo UGTs.

Additional file 3 Table S3.

Chromosome distribution information of pomelo UGTs.

Additional file 4 Fig. S1.

Number of pomelo UGTs predicted by subcellular localization.

Additional file 5 Fig. S2.

Distribution of introns among UGT genes in pomelo.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wu, B., Liu, X., Xu, K. et al. Genome-wide characterization, evolution and expression profiling of UDP-glycosyltransferase family in pomelo (Citrus grandis) fruit. BMC Plant Biol 20, 459 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: