PhUGT78A22, a novel glycosyltransferase in Paeonia ‘He Xie’, can catalyze the transfer of glucose to glucosylated anthocyanins during petal blotch formation

Background Flower color patterns play an important role in the evolution and subsequent diversification of flowers by attracting animal pollinators. This interaction can drive the diversity observed in angiosperms today in many plant families such as Liliaceae, Paeoniaceae, and Orchidaceae, and increased their ornamental values. However, the molecular mechanism underlying the differential distribution of anthocyanins within petals remains unclear in Paeonia. Results In this study, we used an intersectional hybrid between the section Moutan and Paeonia, hereafter named Paeonia ‘He Xie’, which has purple flowers with dark purple blotches. After Ultra-high performance liquid chromatography-diode array detector (UPLC-DAD) analysis of blotched and non-blotched parts of petals, we found the anthocyanin content in the blotched part was always higher than that in the non-blotched part. Four kinds of anthocyanins, namely cyanidin-3-O-glucoside (Cy3G), cyanidin-3,5-O-glucoside (Cy3G5G), peonidin-3-O-glucoside (Pn3G), and peonidin-3,5-O-glucoside (Pn3G5G) were detected in the blotched parts, while only Cy3G5G and Pn3G5G were detected in the non-blotched parts. This suggests that glucosyltransferases may play a vital role in the four kinds of glucosylated anthocyanins in the blotched parts. Moreover, 2433 differentially expressed genes (DEGs) were obtained from transcriptome analysis of blotched and non-blotched parts, and a key UDP-glycosyltransferase named PhUGT78A22 was identified, which could use Cy3G and Pn3G as substrates to produce Cy3G5G and Pn3G5G, respectively, in vitro. Furthermore, silencing of PhUGT78A22 reduced the content of anthocyanidin 3,5-O-diglucoside in P. ‘He Xie’. Conclusions A UDP-glycosyltransferase, PhUGT78A22, was identified in P. ‘He Xie’, and the molecular mechanism underlying differential distribution of anthocyanins within petals was elucidated. This study provides new insights on the biosynthesis of different kinds of anthocyanins within colorful petals, and helps to explain petal blotch formation, which will facilitate the cultivar breeding with respect to increasing ornamental value. Additionally, it provides a reference for understanding the molecular mechanisms responsible for precise regulation of anthocyanin biosynthesis and distribution patterns. Supplementary Information The online version contains supplementary material available at 10.1186/s12870-022-03777-5.

being involved in biotic and abiotic stress responses, ultimately resulting in the angiosperm diversity [1,2]. The extraordinary array of colors displayed by flowers mainly results from four types of pigment: chlorophylls, carotenoids, flavonoids, and betalains [3]. The most diverse palette of pigments are flavonoids, particularly anthocyanins, which are widely known to confer the shiny orange, pink, red, violet, and blue colors [4], and their biosynthetic pathways are among the most extensively studied in plants to date. Early in the anthocyanin biosynthetic pathway (ABP), three molecules of malonyl-CoA and one molecule of 4-coumarpyl CoA are condensed by chalcone synthase (CHS) to produce chalcones, which are converted to dihydroflavonols sequentially by chalcone isomerase (CHI), flavanone 3-hydroxylase (F3H), flavonoid 3′-hydroxylase (F3'H), and flavonoid 3′5'-hydroxylase (F3'5'H). In the late stage of ABP, anthocyanidins are synthesized by two enzymes, namely, dihydroflavonol 4-reductase (DFR) and anthocyanin synthase (ANS), then further glycosylated by UDP flavonoid glucosyltransferase (UFGT) and/or methylated by flavonoid O-methyltransferases (FOMT). The above ABP enzyme genes are mainly controlled by tissue-specific expression, which are conferred by a MBW protein complex consisting of MYB transcription factors-basic helix-loop-helix (bHLH)-WD40 [2,5,6].
Paeonia, the only genus in the family Paeoniaceae, is divided into three sections including: Moutan, Onaepia, and Paeonia [7]. Plants in the genus Paeonia are worldwide known ornamentals which originated from China and have a long history of cultivation and breeding [8]. In section Moutan, two wild species, P. rockii and P. delavayi, produce flowers with petal blotches, which can also be observed in their offspring [9], as the petal blotches are a dominant genetic trait. Petal blotches occur in various colors and sizes at the base of each petal, which offer a unique ornamental value to its cultivars. We previously compared the anthocyanin composition of blotched and non-blotched parts in 35 cultivars of the section Moutan, and found that the most abundant anthocyanins were cyanidin-based glycosides [cyanidin-3-O-glucoside (Cy3G) and cyanidin-3,5-O-glucoside (Cy3G5G)], however, no anthocyanins were detected in white, non-blotched parts [10]. The transcriptomes of petals with purple blotches and white non-blotched parts of the peony cultivar P. suffruticosa 'Jinrong' were compared, suggesting that petal blotch formation may be attributed to higher transcriptional levels of PsCHS, PsF3'H, PsDFR, and PsANS [11]. In addition, transcriptome analysis of variegated petals of P. rockii, P. ostii, and their F 1 hybrids indicated that CHS, DFR, and ANS might be involved in the variegated pigmentation of Paeonia flowers [9]. In our previous study, anthocyanin O-methyltransferase (AOMT) was identified, and found to be responsible for the methylation of cyanidin glycosides into peonidin glycosides and play an important role in purple coloration of Paeonia plants [12]. Despite glycosylated anthocyanins being important for transportation and other important functions, the gene functions of glycosyltransferase have been understudied in Paeonia. Moreover, our recent study determined petal blotch color formation is governed by the transcriptional control of PsCHS by a MBW complex in the cultivar 'Qing Hai Hu Yin Bo' [6]. However, the above studies only focused on colored blotches against white non-blotched parts. Therefore, it is also necessary to study colored petals with colored blotches against a colored (as opposed to white) background to determine how pigment pattern differences are formed within the same petals.
Intersectional hybrids of Paeonia offer highly desirable traits for ornamental use. The first intersectional hybrid was created by Toichi Itoh in 1948, opening a new era for cross breeding, and its offspring was named after Itoh to honor his memory. Afterwards, breeders such as Arthur Percy Saunders, considered the father of intersectional hybrids, bred many famous cultivars including 'First Arrival' , 'Singing in the Rain' , 'Scarlet Heaven' , 'Pastel Splendor' , 'Hillary' , 'Canary Brilliants' , and 'Julia Rose' , among which the famous Itoh hybrids are included. In China, we were lucky to develop an intersectional hybrid named P. 'He Xie' , which resulted from a cross between the section Moutan and Paeonia [13,14]. Its phenotype combined the characteristics of the tree peony and the herbaceous peony, and its purple flowers with dark purple blotches provided a higher ornamental commercial value. In our previous study, we identified a ring domaincontaining protein (PhRING-H2) in P. 'He Xie' petals, that physically interacts with PhCHS and is required for PhCHS ubiquitination and degradation, suggesting that post-translational regulation of flavonoid biosynthesis exists in P. 'He Xie' , and provides a theoretical basis for the manipulation of flavonoid biosynthesis in Paeonia plants [15]. To further explore the dark purple blotch formation in purple petals, we first compared the types of anthocyanins in blotched and non-blotched parts of petals, then separated the blotched and non-blotched parts of the petals for transcriptome analysis and comparison. Moreover, we found that a key UDP-glycosyltransferase (UGT), PhUGT78A22, can catalyze the transfer of glucose to glucosylated anthocyanins precisely in blotched and non-blotched parts, producing differences in anthocyanin content in the petals of P. 'He Xie' . This study provides new insight on petal blotch patterning against a colorful, non-blotched background, which will illuminate novel color breeding strategies to increase ornamental value, meanwhile, and can be used as a reference for understanding the molecular mechanisms underlying precise control of anthocyanin biosynthesis and accumulation.

Results
Different glycosylated anthocyanins were observed in flower petals of P. 'He Xie' during coloration and opening In this study, four main developmental stages of P. 'He Xie' petals were used (Fig. 1A). To investigate pigment accumulation patterns of petals in the blotched and nonblotched parts throughout four developmental stages, the types and concentrations of anthocyanins were measured using ultra-high performance liquid chromatography-diode array detector (UPLC-DAD). We found that along with petal color develops, the total anthocyanin content increased and reached its highest level at stage 4 (4.17 mg/g FW). From stage 2 to stage 4, the anthocyanin content in the blotched parts was always higher than that in the non-blotched parts, which was 0.91 mg/g FW and 0.37 mg/g FW (stage 2), 1.32 mg/g FW and 0.51 mg/g FW (stage 3), and 2.73 mg/g FW and 1.43 mg/g FW (stage 4) in blotched and non-blotched parts, respectively (Fig. 1B). Four types of anthocyanins were detected in the blotched part, namely Cy3G, Cy3G5G, peonidin-3-O-glucoside (Pn3G), and peonidin-3,5-O-glucoside (Pn3G5G). However, only two types were detected in the non-blotched part, namely Cy3G5G and Pn3G5G ( Fig. 1B & C). These results indicate that anthocyanin glycosylation is different in blotched and non-blotched parts of colored flower petals.

Petal transcriptome analysis and unigenes identified relating to anthocyanin biosynthesis in P. 'He Xie'
A total of six samples (three biological replicates for each group) were sequenced and in total, 40.56-45.87 million clean reads were generated with a proper base distribution and mean quality position, and the clean Q30 base rate was higher than 93.44%, indicating good quality sequences for further analysis (Tab. S1). Furthermore, 127,287 unigenes, with an average length of 630 bp after filtering out low quality reads, were obtained and the mapping ratio from the transcriptome was 89.07% using the reference genome of Vitis vinifera.
Differentially expressed genes (DEGs) were determined by comparing blotched vs non-blotched parts of P. 'He Xie' petals, and a total of 2433 unigenes were identified as DEGs, including 1494 up-regulated genes and 939 down-regulated genes. The DEGs were divided into three categories including 'biological process' , 'cellular component' , and 'molecular function' . The most abundant DEGs were annotated as belonging to categories including 'metabolic process' (biological process), 'cell part' (cellular component), and 'catalytic activity' (molecular function) (Fig. S1A). To identify the specific biochemical pathways involved in the pigment accumulation of petals, the DEGs were subjected to Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway for enrichment analysis, which mapped to 285 pathways in the KEGG database. There were seven significantly enriched pathways, including multiple biosynthetic and metabolic pathways, which indicated that there were some differences in biosynthesis and metabolism between blotched and non-blotched parts (Fig. S1B).

Expression patterns and sequence analysis of PhUGTs involved in anthocyanin biosynthesis in P. 'He Xie' petals
To further study the difference of anthocyanin glycosylation between blotched and non-blotched parts of P. 'He Xie' petals, expression patterns of UAGTs whose absolute values of the log 2 fold change were greater than 0.5 (c95107_g1, c128309_g1, c8473_g1, c99617_g1, c95696_ g1 and c95696_g2) were checked using quantitative RT-PCR (RT-qPCR). After blotches appeared at the base of petals, the expression level of c99617_g1 in non-blotched part was significantly higher than that in blotched parts. The expression level of c99617_g1 in non-blotched parts at stage 3 was 40.21, which was 1.18-fold greater than that of the blotched part, while, it was 1.87-fold higher in non-blotched versus blotched parts at stage 4. Moreover, unlike other UAGTs, c99617_g1 was highly expressed during P. 'He Xie' petal coloration. This was consistent with the previous conjecture that anthocyanins with double glucoses were observed in non-blotched parts. The relative expression level of c8473_g1 was lower than 1.00 during all stages, and c95107_g1 and c128309_g1 expression levels were lower than 1.00 during all stages, except for in the non-blotched part during stages 4 (3.05) and 3 (34.07), respectively. The expression levels of c95696_g1 and c95696_g2 in blotched parts were higher than in non-blotched parts at stages 3 and 4, which was different from that of c99617_g1 (Fig. 3A). Considering the anthocyanin types present in non-blotched part and the higher expression levels of c99617_g1, it was finally selected for further functional characterization.
Sequence analysis of c99617_g1 showed that its amino acid sequence contained a plant secondary product glycosyltransferase (PSPG) motif (331-374 aa) near the C-terminal domain (Fig. S2). The last glutamine (Q) residue in the PSPG motif, considered to confer specificity for UDPglucose as the sugar donor, was also observed in c99617_g1  [16]. All of the Arabidopsis thaliana UGTs were phylogenetically clustered into 14 groups (labelled A-N) [17,18]. Phylogenetic analysis showed that c99617_g1 was closest to group F within the A. thaliana UGT superfamily, and its closest protein homologs were AtUGT78D1, AtUGT78D2, AtUGT78D3, and AtUGT78D4P (Fig. 3B). We also obtained the name of c99617_g1 from the UGT Nomenclature Committee (https:// prime. vetmed. wsu. edu/ resou rces/ udp-glucu ronsy ltran sfera se-homep age). Combining the evolutionary relationship analysis and UGT Nomenclature Committee naming results, we named c99617_g1 as PhUGT78A22.
To further identify the subcellular localization of PhUGT78A22, PhUGT78A22-GFP was expressed together with a nuclear marker expressed from the construct Super promoter::NF-YA4-mCherry and detected a strong GFP signal in the nucleus (Fig. 4).
To further analyze the in vitro function of PhUGT78A22, PhUGT78A22 fused with a GST-tag was expressed in Escherichia coli, and the isolated protein was evaluated with enzymatic assays. UDP-glucose and two anthocyanins (Cy3G and Pn3G) were tested as sugar donor and substrates, respectively. PhUGT78A22 exhibited catalytic activity toward Cy3G and Pn3G with UDPglucose as the sugar donor. Analysis of the enzymatic products by UPLC-DAD showed that the recombinant PhUGT78A22 protein catalyzed the conversion of Cy3G into Cy3G5G and Pn3G into Pn3G5G (Fig. 5E & F). The results suggested that PhUGT78A22 catalyzed the transfer of glucose to glucosylated anthocyanins of Cy3G and Pn3G.

Silencing of PhUGT78A22 altered Cy3G5G and Pn3G5G biosynthesis in P. 'He Xie'
To further validate the activity of PhUGT78A22 in vivo, we silenced PhUGT78A22 using Virus-induced gene silencing (VIGS) technology using a Tobacco rattle virus vector in P. 'He Xie' bud scales. The anthocyanin component of bud scales was the same as that of the blotched part in P. 'He Xie' petals. Silencing of PhUGT78A22 changed the bud scale color (Fig. 6A). The expression levels of PhUGT78A22 was significantly reduced in the TRV::PhUGT78A22 silenced line, which dropped to 68.51% relative to the TRV control (Fig. 6B). After color measurement, a * value of PhUGT78A22-silenced line (3.24) was significantly lower than that of the TRV control (11.65), however, b * and L * values had experienced no significant changes (Fig. 6C), which was consistent with the phenotypic results. To further confirm the changes in anthocyanin content after PhUGT78A22 silencing, we detected the types and contents of anthocyanins in the TRV::PhUGT78A22 silenced line. In PhUGT78A22silenced bud scales, Cy3G5G and Pn3G5G were significantly decreased, however, the content of Cy3G was significantly increased compared to the TRV control. Meanwhile, the content of Pn3G was also increased in PhUGT78A22-silenced bud scales, although no statistical difference was observed (Fig. 6D). This suggests that the silencing of PhUGT78A22 will result in the reduction of Cy and Pn bis-glucosidic anthocyanins. Taken together, our findings suggest a model for how UDP-glycosyltransferase PhUGT78A22 is involved in the transformation of glucose to glucosylated anthocyanins during petal blotch formation in P. 'He Xie' (Fig. S5). In blotched parts, the expression levels of PhUGT78A22 are lower, so only a portion of Cy3G and Pn3G can be glycosylated into Cy3G5G and Pn3G5G, respectively. In non-blotched parts, the expression of PhUGT78A22 is higher, and all Cy3G and Pn3G can be glycosylated into Cy3G5G and Pn3G5G, respectively. The differences in the types and concentrations of glycosylated anthocyanins explains the blotch formation and color differences within P. 'He Xie' petals.

Discussion
Floral color blotches have biological significance in attracting insect pollination and driving angiosperm species evolution. In Paeonia, variations in color and size of blotches not only increase their ornamental value, but also serves as the basis for species and cultivar classification. Our previous research was focused on the regulatory mechanism of blotch color formations against white non-blotched backgrounds, but little is known about colored petals with colored blotches against a colored (as opposed to white) background, since the anthocyanin types, which were under precise control due to variation in glycosylation between blotched and non-blotched parts. In this study, we aimed at investigating differentially expressed UGTs genes and understanding the differences in the types of anthocyanins within the same petals. Additionally, we conducted the functional characterization of PhUGT78A22 to explain petal blotch formation, which will benefit to breeding with the respect of increasing ornamental value and enrich the understanding of anthocyanin patterning in angiosperms.
The anthocyanin composition of Paeonia petals was thoroughly investigated. The flowers of 130 tree peony cultivars (55 red, 28 pink, 38 purple, and 9 white flowered cultivars) from the Zhongyuan cultivation group and 37 (15 red, 10 pink, 8 purple, 3 white, and 1 black flowered cultivars) from Daikon Island were collected in Japan for anthocyanin measurement by Wang et al. [20]. Six anthocyanins, namely Cy3G, Cy3G5G, Pn3G, Pn3G5G, pelargonidin 3-O-glucoside (Pg3G), and pelargonidin 3,5-O-glucoside (Pg3G5G) constituted the petal pigments and all of the flowers contained peonidin-based glycosides in these tree peony cultivars. Peonidin is the methylated form of cyanidin that produces pink to red pigments [3,12]. Jia et al. [21] identified five major anthocyanins in 41 herbaceous peony cultivars, namely Cy3G, Cy3G5G, Pn3G, Pn3G5G, and Pg3G5G, which was basically consisted with the results of Wang et al. [20] and Fan et al. [22]. Excluding two cultivars, 'Huang Jin Lun' with yellow flowers and 'Yang Fei Chu Yu' with white flowers, other 39 cultivars had glucosylated anthocyanins (3,5-O-glucoside, 3G5G), among which only 3 cultivars  had glucosylated anthocyanidins (3-O-glucoside, 3G). Moreover, no 3G-type was detected in 11 cultivars with pink flowers belonging to the "Pn, Cy" group, while, only glucosylated anthocyanins at the 3,5-O-position of the backbone (3G5G-type) were obtained [21]. However, in the other "Pn, Cy" group, which included 31 tree peony cultivars from the Xibei (northwest China) cultivation group, excluding 2 cultivars that only contained 3G, all the other cultivars contained the 3G-type and 3G5G-type of Cy and Pn, and the flower colors in this group were pink, purple, white, or black [23]. It can be seen from the above results that the level of glycosylation modification can directly determine the color of Paeonia petals. In this study, we identified an intersectional hybrid with only the 3G5G-type of Cy and Pn in non-blotched parts and both the 3G-type and 3G5G-type of Cy and Pn in blotched The mean values ± SD from three biological replicates (n = 6) are shown. Asterisks indicate statistically significant differences (two-sided Student's t-test; **, P < 0.01). D Silencing of PhUGT78A22 altered anthocyanin accumulation in P. 'He Xie' bud scales. Anthocyanin accumulation was determined using UPLC-DAD analysis in TRV control and PhUGT78A22-silenced bud scales. One biological sample consisted of a mixture of at least 6 bud scales. The mean values ± SD from three biological replicates (n = 3) are shown. Asterisks indicate statistically significant differences (two-sided Student's t-test; **, P < 0.01) part, which provided an ideal model for elucidating the role of glycosylation modification in petal color formation in Paeonia.
Glycosylation is considered to be the last step in the biosynthetic pathway of the secondary metabolite [24]. The glycosylation process catalyzed by glycosyltransferases, are derived from bacteria, plants, animals and viruses and can be divided into 114 families based on amino acid sequence similarities and catalytic mechanisms in the Carbohydrate-Active Enzymes (CAZymes) Database (Glycosyl Transferase family classification, http:// www. cazy. org/ Glyco sylTr ansfe rase-family). Among them, members of the GT family 1 are often referred to as UGTs, which are known to typically transfer a sugar to a diverse array of substrates including hormones, flavonoids, and even pesticides [17,18,24]. UGTs have been identified in several higher plants such as peaches, with 16 groups (A -P) of UGTs being found in Prunus persica L. Batsch, and two UGTs of group F, namely, Prupe.1G091100 and Prupe.1G091000, were involved in anthocyanin biosynthesis in peach flowers [25]. In this study, we identified PhUGT78A22 belonging to Group F, and to be involved in anthocyanin biosynthesis in Paeonia. Some research progress has been made on Group F UGTs, a subfamily with a small number of members, in recent years. In A. thaliana, UGT78D1 was identified to catalyze the transfer of rhamnose from UDP-rhamnose to quercetin and kaempferol [26], and UGT78D2 could catalyze the glucosylation of both cyanidins and flavonols as UDP-glucose: flavonoid 3-O-glucosyltransferase [27]. In V. vinifera, VvGT1 can glycosylate anthocyanidins [28], VvGT5 (UGT78A11) was identified as a UDPglucuronicacid:flavonol-3-O-glucuronosyltransferase and VvGT6 (UGT78A12) as a bifunctional UDP-glucose/UDP-galactose:flavonol-3-O-glucosyltransferase/ galactosyltransferase [29]. In Glycine max (L.) Merr., the UDP-glucose: flavonoid 3-O-glucosyltransferase (UGT78K1) only uses anthocyanidins and flavonols as substrates [30]. Here, we firstly identified a group F UGT, PhUGT78A22 in Paeonia, and determined its function in glycosylating anthocyanins, which is consistent with the above studies. However, whether it can glycosylate other anthocyanidins or flavonols and participate in processes other than blotch formation in petals remains to be determined. UGT78H2 glycosylated quercetin exclusively using UDP-glucuronic acid and UDP-galactose, but not UDP-glucose [31]. In this study, we determined that PhUGT78A22 transferred glucose to glycosylated anthocyanins with UDP-glucose as the sugar donate, which was different from the results of Chen et al. [31]. This may be due to the differences of species resulting in functional diversification of group F UGTs so as to ensure the accurate modification of corresponding substrates.
Bis-glucosidic anthocyanins are believed to be more hydrophilic and more stable [32]. The present study suggested the high expression level of PhUGT78A22 would explain all detected anthocyanins being bis-glucosidic ones, which confirmed that most areas of the petal had more hydrophilic and more stable anthocyanins in P. 'He Xie' . In recent years, increasing studies have shown that UGTs play a vital role in plant resistance to stress. Overexpression of OsUGT90A1 helped to maintain membrane integrity during cold stress, improved freezing survival and tolerance to salt stress, and promoted leaf growth during stress recovery [33]. Ectopic expression of AtUGT76E11 increased flavonoid accumulation and enhanced abiotic stress tolerance to salinity and drought [34]. Ectopic expression of UFGT2, first identified from maize, in A. thaliana led to increased flavonol contents and enhanced oxidative tolerance [35]. After analyzing comparative genomic and transcriptomic data from three Brassica species and A. thaliana, a series of UGTs were identified to be involved in plant resistance to cold, drought, and hypoxia stress [36]. Moreover, UGT74E2 was involved in drought and salt stress resistance via ABA and IAA signaling in rice [37]. Chen et al. revealed that UGT75B1 modulated ABA activity by glycosylation in stressful environment [38]. In the present study, we determined the function of PhUGT78A22 in relation to anthocyanin biosynthesis, which may play a similar role in UV-B irradiation stress resistance [25]. Further research is needed to determine whether PhUGT78A22 can be involved in responses to other abiotic stresses, such as drought. In the future, the function of PhUGT78A22 should be fully characterized for its regulatory mechanistic role in flavonoid glycosylation, thus to manipulate flower color variation to improve stress resistance of Paeonia, and further synthesize valuable active substances in vitro for health promoting products.

Conclusions
The present study used an intersectional hybrid named P. 'He Xie' which has purple flowers with dark purple blotches to explore the mechanism of differential glycosylation of anthocyanin within petals. It was interesting to find that four kinds of anthocyanins (Cy3G, Cy3G5G, Pn3G, and Pn3G5G) in blotched parts, but only Cy3G5G and Pn3G5G in non-blotched part, which suggests glucosyltransferases play a vital role in establishing this difference. Moreover, 2433 DEGs were obtained from transcriptomic analysis of blotched and non-blotched parts, and a key UDP-glycosyltransferase gene named PhUGT78A22 was identified. It had the conserved PSPG box, suggesting a high affinity to glucosylated anthocyanins at the 3-O-postion in the C ring, and used Cy3G and Pn3G as substrates to produce Cy3G5G and Pn3G5G in vitro through molecular docking analysis and enzymatic assays. Furthermore, silencing of PhUGT78A22 reduced the content of anthocyanidin 3,5-O-diglucoside in P. 'He Xie' . This study provides new insights into different types of anthocyanin biosynthesis within same petals, helps explain petal blotch formations and will inspire cultivar breeding with respect of increasing ornamental value. Meanwhile, it provides a reference for understanding the molecular mechanism on precise regulation of anthocyanin biosynthesis and distribution patterns.

Plant materials
In this study, the cultivar P. 'He Xie' was used, which was grown in Beijing Botanical Garden, Institute of Botany, Chinese Academy of Sciences (Lat. 39°59′ N, 116°12′ E, Alt. 70 m). Flower petals were separated into blotched and non-blotched parts and collected at four developmental stages: 1) the bud is unopened and petals are light yellow-green and lack blotches; 2) the bud is unopened, red blotches beginning to appear at the base of petals, and non-blotched parts are transiting from light yellow green to light pink; 3) the bud is partially opened, blotched parts have deepened in color and expanded to about 0.5 cm in diameter, while the non-blotched parts have turned completely pink; 4) the flower is fully open and blotches have expanded to about 1.0 cm in diameter. Three flowers were collected per stage. Since no blotches are present at stage 1, the whole petal was selected as non-blotched part. Beyond stage 1, blotched and non-blotched parts of every developmental stage were separated into distinct samples. Bud scales were removed from P. 'He Xie' flower buds referred to Gu et al. [15].

UPLC-DAD analyses
Tissue samples from the blotched and non-blotched parts of petals at each developmental stage or bud scales were used for anthocyanin analysis as previously described [6] with minor modifications. Specifically, all non-blotched or blotched parts of each flower were collected as a single biological replicate, and the same amount of tissue, by weight, was taken from each biological repetition to quantify anthocyanin concentration. From each sample, 0.2 g of fresh tissue was treated with 1 mL of 0.2% formic acid/methanol (v/v) solution for 20 minutes, ultrasonically homogenized, then incubated in the dark for 2 h. The resulting extract was centrifuged at 12,000 rpm for 5 min and the supernatant was collected. The above steps were repeated until all anthocyanins had been extracted. The supernatant was filtered through a 0.22 μm filter and stored at −20°C. Anthocyanin types and concentrations were determined using an UPLC-DAD (ACQUITY UPLC ® I-Class, Waters, Massachusetts, USA). The analytical column was an ACQUITY UPLC ® HSS T3 1.8 μm column (Waters, Massachusetts, USA). Four standards were purchased from Solarbio (Beijing, China), namely Cy3G, Cy3G5G, Pn3G, and Pn3G5G. Among them, Cy3G was used as a standard for quantifying anthocyanin concentration through linear regression.

Transcriptome sequencing and analyses
To identify key genes involved in anthocyanin biosynthesis in the blotched and non-blotched parts, transcriptomic analysis was performed on blotched and non-blotched tissues samples taken from stages 2 to 4 from P. 'He Xie' petals, using the Illumina sequencing platform. After extracting total RNA, the integrity was evaluated by 1.0% agarose gel electrophoresis, quality checked using a K5500 micro-spectrophotometer (Kaiao, Beijing, China), and the integrity checked again using a 2100 RNA Nano 6000 assay Kit (Agilent Technologies, CA, USA). Library construction and RNA-seq analysis were performed by Annoroad Gene Technology Co. (Beijing, China) using an Illumina platform (CA, USA). Clean reads were obtained by filtering out contaminated and low quality reads, and reads with more than 5% undistinguished bases [39]. Subsequently, De novo transcriptome assembly was performed using Trinity (version 20140717).
The expression levels of unigenes were calculated using the RPKM (Reads Per Kilobase Million Mapped Reads) method. DESeq was used to identify DEGs, and genes with |log 2 Ratio| ≥ 1 and q < 0.05 were assigned as differentially expressed. In addition, DEGs were annotated and classified using the Gene Ontology (GO) and KEGG databases.

RT-qPCR analyses
Total RNA from blotched and non-blotched parts of P. 'He Xie' petals was extracted using the E.Z.N.A. ® Plant RNA Kit (Omega Bio-Tek, GA, USA). cDNA was synthesized using the HiScript II reverse transcriptase kit (Vazyme, Nanjing, China). RT-qPCR reactions were conducted using a StepOne Real-Time PCR System (Applied Biosystems, Carlsbad, USA) in 10 μL reaction mixture containing 2 × M5 HiPer Realtime PCR Super mix (Mei5bio, Beijing, China). Poβ-Tubulin (Poβ-TUB) was used as an internal control. Each experiment was conducted with three biological repeat. Primers used are listed in Table S5.

Phylogenetic analyses
Phylogenetic analyses was performed as described previously [17,18]. All of the A. thaliana UGTs were obtained from the CAZymes Database (http:// www. cazy. org/ Home. html). The phylogenetic tree of c99617_g1 with A. thaliana UGTs was constructed using the Neighbor-Joining method in MEGA (version X) and Evolview (http:// www. evolg enius. info/ evolv iew. html). In total, 123 amino acid sequences were used in the analysis and all positions containing gaps and missing data were eliminated.

Subcellular localization of PhUGT78A22
Subcellular localization was performed using Super1300 vector with Green fluorescent protein (GFP) as the signal protein [40]. The coding region sequence (CDS) of PhUGT78A22 (1371 bp), with stop codons removed and tagged with GFP, was constructed under a Super promoter [41], then heterologously expressed in N. benthamiana leaves through Agrobacterium tumefaciens strain GV3101. The constructs Super promoter::PhUGT78A22-GFP and nuclear marker Super promoter::NF-YA4-mCherry were co-infiltrated using agroinfiltration. GFP and mCherry fluorescence signal were visualized by confocal microscopy (Leica TCS SP5, Germany) 3 days post infiltration. The primer sequences used to make the subcellular localization constructs are listed in Table S5.

Homology modeling and molecular docking
To determine the molecular basis for the specificity of PhUGT78A22, firstly, the homology model structure of PhUGT78A22 was computed using the SWISS-MODEL server homology modelling pipeline (https:// swiss model. expasy. org/), which uses an anthocyanidin 3-O-glucosyltransferase of V. vinifera [Protein Data Bank (PDB) code is 2c1x] as a template. The Molecular Operating Environment (MOE) Dock was used for molecular docking of molecules with PhUGT78A22. UDP-glucose was used as the sugar donor. Cy, Cy3G, Pn, and Pn3G were used as the sugar acceptors. The two-dimensional (2D) structures of UDP-glucose, Cy, Cy3G, Pn, and Pn3G were downloaded from PubChem (https:// pubch em. ncbi. nlm. nih. gov/) and converted to three-dimensional (3D) structures in MOE through energy minimization, as ligands. The best probable binding mode was visualized by PyMOL (www. pymol. org).

In vitro enzymatic assays of recombinant PhUGT78A22
In vitro enzymatic assays were performed as described previously [18]. The coding sequence of PhUGT78A22 was cloned into pGEX4T-2 and expressed in E. coli strain Rosetta. Protein expression was induced by adding isopropylthio-β-D-galactoside (0.3 mM) to the cell culture, which was then incubated at 120 rpm for 24 h at 16°C. The GST-PhUGT78A22 protein was purified using Glutathione Sepharose 4B (GE Healthcare, PA, USA). UDP-glucose (Solarbio, Beijing, China) was used as the sugar donor. Cy3G and Pn3G (Solarbio, China) were used as the sugar acceptors. The reaction mixture for the PhUGT78A22 enzymatic assay consisted of 100 mM Tris-HCl (pH 7.0), 0.1 mM of each substrate, 2 mM of UDP-glucose, and 3-10 μg of PhUGT78A22 protein, in a 50 μL reaction volume. After incubation for 1 h at 37°C, the reaction was terminated by the addition of 0.2% formic acid/methanol solution (v/v), followed by analysis using a UPLC-DAD assay. The primers used for cloning are listed in Table S5.

VIGS
VIGS was performed as previously described [15,42]. A gene-specific fragment of PhUGT78A22 (327 bp in length) from 3′ end was used to construct the vector TRV2::PhUGT78A22. TRV1, TRV2, and TRV2::PhUGT78A22 were transformed into A. tumefaciens strain GV3101, which were then cultured in Luria-Bertani medium (with 50 mg/L rifampicin and 50 mg/L kanamycin) overnight, then harvested by centrifugation at 5000 rpm for 10 min and resuspended in infiltration buffer (10 mM MgCl 2 , 200 mM acetosyringone and 10 mM 2-(N-Morpholino)-ethanesulfonic acid, pH 5.6), and diluted to an A 600 of 1.5. A. tumefaciens cultures containing TRV1 + TRV2::PhUGT78A22 or TRV1 + TRV2 (as the negative control) were mixed by 1:1, then incubated in the dark at room temperature for 4-6 h before inoculation. P. 'He Xie' bud scales of the same size and color were collected, submerged in the agrobacterium suspensions, and exposed to a vacuum of −0.9 atm twice, each for 5 min. The infiltrated bud scales were briefly washed with distilled water and placed on solid Murashige and Skoog (MS) medium (41.74 g/L MS, 6.5 g/L agar, pH 5.8; Solarbio, Beijing, China) and cultured in the dark at 8°C for 3 d. Phenotypes were observed, and the expression levels of PhUGT78A22 and the types and content of anthocyanins in silenced and controlled bud scales was determined following the above procedure. The primers used for VIGS are listed in Table S5.

Color measurements (CIELAB system)
Chromatic analyses were performed as previously described [12,43]. The colors were represented as L * , a * , and b * values. The value of L * is from black (0) to white (100), which represents lightness. The value of a * is from