Skip to main content

Comprehensive analysis of the MYB transcription factor gene family in Morus alba

Abstract

Background

The V-myb myeloblastosis viral oncogene homolog (MYB) family of proteins is large, containing functionally diverse transcription factors. However, MYBs in Morus are still poorly annotated and a comprehensive functional analysis of these transcription factors is lacking.

Results

In the present study, a genome-wide identification of MYBs in Morus alba was performed. In total 166 MaMYBs were identified, including 103 R2R3-MYBs and four 3R-MaMYBs. Comprehensive analyses, including the phylogenetic analysis with putative functional annotation, motif and structure analysis, gene structure organization, promoter analysis, chromosomal localization, and syntenic relationships of R2R3-MaMYBs and 3R-MaMYBs, provided primary characterization for these MaMYBs. R2R3-MaMYBs covered the subgroups reported for R2R3-MYBs in Arabidopsis and Populus, and had two Morus-specific subgroups, indicating the high retention of MYBs in Morus. Motif analysis revealed high conservative residues at the start and end of each helix and residues consisting of the third helix in R2 and R3 repeats. Thirteen intron/exon patterns (a–m) were summarized, and the intron/exon pattern of two introns with phase numbers of 0 and 2 was the prevalent pattern for R2R3-MaMYBs. Various cis-elements in promoter regions were identified, and were mainly related to light response, development, phytohormone response, and abiotic and biotic stress response and secondary metabolite production. Expression patterns of R2R3-MaMYBs in different organs showed that MaMYBs involved in secondary cell wall components and stress responsiveness were preferentially expressed in roots or stems. R2R3-MaMYBs involved in flavonoid biosynthesis and anthocyanin accumulation were identified and characterized based on functional annotation and correlation of their expression levels with anthocyanin contents.

Conclusion

Based on a comprehensive analysis, this work provided functional annotation for R2R3-MYBs and an informative reference for further functional dissection of MYBs in Morus.

Peer Review reports

Background

V-myb myeloblastosis viral oncogene homolog (MYB) proteins comprise one of the largest families of transcriptional regulators in plants [1]. The MYB transcription factors have been identified as important regulators that work as activators or repressors in diverse processes, including development, stress responses, and metabolism. The “oncogene” v-Myb derived from the avian myeloblastosis virus was the first MYB gene identified and further study showed that three genes related to v-Myb-c-Myb, A-Myb and B-Myb were distributed in many vertebrates [2]. Similar genes have also been widely reported in fungi, plants, and insects [3]. The first plant MYB transcription factor the c1 gene was reported in maize (Zea mays) [4]. Currently, many MYB genes have been reported in diverse species [5].

MYB proteins are characterized by sharing a highly conserved DNA-binding domain (DBD) in their N-terminus This domain is generally composed of up to four imperfect repeats [1, 6]. These repeat units are named R1, R2, and R3 according to their sequence similarity with the c-MYB protein [6]. Each repeat comprises approximately 50–53 amino acids with a similar folding architecture containing three well-defined α-helices. When bound to specific promoter sequences, portions of the second and third helices form a helix-turn-helix (HTH) structure [6, 7]; The third helices of R2 and R3 play a recognition role in binding to a specific DNA sequence, while R1 is not essential for specific DNA recognition [6, 7]. In addition, each repeat contains regularly spaced tryptophan residues (WX18–19WX18–19W), forming a tryptophan cluster in the three-dimensional (3D) HTH structure. The C terminus, which is responsible for the protein’s regulatory activity, is more variable [1, 8]. MYB transcription factors have been divided into four different classes depending on the number of adjacent repeats: 1R (R1/2, R3-MYB), 2R (R2R3-MYB), 3R (R1R2R3-MYB), and 4R, which harbor four R1/R2-like repeats. MYBs with two repeats R2R3-MYBs which is the largest class of MYB factors in plants [9].

R2R3-MYBs have undergone extensive expansion and are thought to perform diverse functions in plant-specific processes [5]. Comprehensive analysis of the MYB gene family have been widely reported in many plants that have available genome information. To date, 126 2R-MYB genes are reported in Arabidopsis thaliana [1], 198 in Populus trichocarpa [10], 108 in Vitis vinifera [11], 157 in Z. mays [12], and 185 in Pyrus bretschneideri [13]. R2R3-MYBs have been classified into 28 (S1–S28) subgroups in Arabidopsis and 45 subgroups (C1–C45) in Populus trichocarpa [1, 10]. Most R2R3-MYBs in Arabidopsis have been well functionally characterized and their roles in diverse biological processes have been identified. The functions of R2R3-MYBs in the regulation of the phenylpropanoid, flavonoid, and lignin biosynthesis pathways are particularly well studied [9, 14]. The first plant MYB transcription factor, the c1 was found to regulate anthocyanin biosynthesis in the aleurone of maize kernels [4]. Arabidopsis R2R3-MYBs in subgroups 6 and 7 were also reported to control the flavonoid biosynthesis pathway, with differences in tissue preference [1]. AtMYB75/ PAP1, AtMYB90/PAP2, AtMYB113 and AtMYB114 in subgroup 6 control anthocyanin biosynthesis in vegetative tissues and AtMYB11/PFG1, AtMYB12/PFG1 and AtMYB111/PFG3 in subgroup 7 control flavonol biosynthesis in all tissues [15]. AtMYB123/TT2 in subgroup 2 controls the biosynthesis of proanthocyanidins (PAs) and regulates tannin biosynthesis [1]. In addition, some R2R3-MYBs also act as repressors for controlling the lignin and flavonoids biosynthesis pathway. AtMYB3 and 4 in subgroup 4 suppress the cinnamate 4-hydroxylase (C4H) gene, and AtMYB4 represses the synthesis of sinapoyl malate [14]. AtMYB32 represses lignin and flavonoid biosynthesis in pollen. AtMYB7 is responsible for reducing flavonol by regulating flavonol synthase (FLS) [14]. PtMYB165, 181, 182, and 194 in C18 were also reported as repressors of flavonoid biosynthesis [14]. In addition, R2R3-MYBs also serve as key factors in the control of plant development and cell fate, as well as various abiotic stresses and defense responses in plants [1]. AtMYB0 and AtMYBB66 are involved in epidermal cell patterning and AtMYB125 is involved in the regulation of male gametophyte development by controlling the cell cycle [16, 17]. AtMYB1, AtMYB2, AtMYB30, AtMYB41, and AtMYB44 were reported to respond to different stresses [18,19,20,21].

Mulberry (Morus, Moraceae) is an economically important plant with nutritional, medicinal, and ecological value. A draft genome of Morus notabilis was released by He et al. (2013) [22], after which genome-wide analyses of several gene families, including the R2R3-MYB gene family, were performed [23]. The functions of four R2R3-MYBs in flavonoid biosynthesis were also characterized [24]. However, MYBs in Morus are still poorly annotated. In the present study, we performed a comprehensive analysis of R2R3 and 3R-MYBs using the latest chromosome-level genome assembly of Morus alba (Ma), which has a basic chromosome number of 14 and total genome assembly size of 338.27 Mb [25].. A strict workflow was adopted to perform a genome-wide identification of MYBs in M. alba. Comprehensive analyses, including phylogenetic analysis, putative functional annotation, gene structure, motif and promoter analysis, chromosomal localization and the syntenic relationships of R2R3 and 3R-MaMYBs, were performed. The expression patterns of R2R3 and 3R-MaMYBs in different organs and fruit development stages were analyzed using RNA-seq data. Real-time quantitative PCR (RT-qPCR) was used to validate selected differentially expressed R2R3-MYBs in different fruit development stages. This study contributed basic functional annotation of R2R3-MaMYBs in Morus and provided candidate MYB regulators as targets for genetic modification.

Materials and methods

Plant materials

Roots, stems and leaves were collected from one-year-old M. alba plants. Fruits at four different development stages (S0, inflorescence; S1, green fruits; S2, reddish fruits; S3, purple fruits) were collected from Zhongsang 5801 plants. All of the above samples were collected from the plantation of National Mulberry Genebank (NMGB), Zhenjiang, China, and immediately stored at − 80 °C. Three biological replicates growing nearby were considered. At least six mulberry fruits were collected from each plant.

Data collection and workflow for searching MYBs in M. alba

Genome sequences were collected from different databases. The Arabidopsis genome sequences (.fasta) and annotation file (.gff) were downloaded from The Arabidopsis Information Resource (TAIR, https://www.arabidopsis.org/), and the Populus genome information was obtained from Phytozome v13 (https://phytozome-next.jgi.doe.gov/). The M. alba genome sequences (.fasta) and annotation file (.gff) were generously provided by Professor Jiao, who released this genome information. The MYBs in Arabidopsis and Populus were extracted using Tbtools v1.098665 [26] based on the information provided in previous reports [1, 8, 10].

To search for MYBs in mulberry, firstly, HMMER search was performed using a hidden Markov model (HMM) profile of the MYB binding domain (PF00249) from the Pfam database (http://pfam.xfam.org/). The hits with E values < 1 were screened, and the short open reading frames (length < 100) were filtered out and confirmed with manual speculation. The filtered sequences were screened as primary candidate MYBs. Secondly, the putative protein sequences of primary candidate MYBs were submitted to a simple MEME wrapper assembled in Tbtools v1.098665 using the following parameters: number of motifs to find 3, minimum motif width 6, and maximum motif width 55. MEME motifs were extracted and the MYB DNA binding domains were further validated manually. The primary candidate MYBs with MYB DNA binding domain were selected as MYBs in mulberry.

Sequence alignment, motif analysis and homologous modeling

Morus alba (Ma) R2R3-MYBs and 3R-MYBs were aligned using clustal W assembled in MEGA7.0. The alignment result was exported and manually speculated for scanning the R1, R2 and R3 repeats. Seqlogos of this domain were generated using TBtools v1.098665. Protein sequence of DBD domain of R2R3-MYB, MaMYB29, was submitted to SWISS-MODEL (https://swissmodel.expasy.org/) to obtain the 3-D structure model using AtMYB66 (WER, PDB accession number: 6kks) as template. The models were submitted to SAVES (http://services.mbi.ucla.edu/SAVES/) and Chiron (http://troll.med.unc.edu/chiron/login.php) for further evaluation and modification. In addition, PSVS (https://montelionelab.chem.rpi.edu/PSVS/) was also adopted to further evaluate the quality of modeled structure and calculate the Ramachandran plot. Pymol was used to visualize the 3-D structure [27].

Gene structure and promoter analysis of R2R3 and 3R-MYBs

The gene structure of each MaMYB was displayed based on the genome sequence and its annotation file using Gene Structure View assembled in Tbtools v1.098665. The upstream 2000 bp sequences were extracted for in silico promoter region analysis. Cis-acting elements were predicted using PlantCARE (http://bioinformatics.psb.ugent.be/webtools/plantcare/html/) (Lescot et al., 2002).

Phylogenetic analysis and functional annotation of R2R3 and 3R-MYBs

A neighbor-joining (NJ) phylogenetic tree was constructed using full-length R2R3-MYB protein sequences from A. thaliana, Populus and Morus using MEGA7.0 [28] with JTT + G model and bootstrap test with 1000 replicates. The other NJ tree was also constructed using only R2R3-MaMYB proteins and 3R-MaMYBs with the same parameters.

Chromosomal location and synteny analysis of R2R3 and 3R-MYBs

Chromosome location information of R2R3 and 3R-MYBs was extracted from Morus alba genome annotation file. BLASTP and Multiple Collinearity Scan toolkit (MCScanX) assembled in Tbtools v1.098665 were used to identify syntenic blocks, tandem duplication, and distinct duplication events using default parameters [29, 30]. R2R3 and 3R-MaMYBs were mapped to Morus chromosomes and displayed using Tbtools v1.098665 and both the tandem duplication and block duplication gene pairs were marked.

Transcriptome based expression profile analysis

Our previous reported RAN-seq data (Accession number: PRJNA660559) was used to obtain the expression profiles of R2R3 and 3R-MaMYBs in roots, stems and leaves [31]. Expression profile of R2R3 and 3R-MaMYBs in different fruit development stages was obtained using RNA-seq data with accession number: CRA006074 in national genomics data center (NGDC). Both above RNA-seq datasets were reanalyzed. Chromosome-level M.alba genome was used as reference genome for alignment using bowtie2 [32]. The genome annotation file was used for calculating the expression matrix using StringTie v2.15 [33]. Differential expressed genes (DEGs) were obtained using DEseq2 by comparing expression levels of each two stages [34]. Organ expression preference genes were identified while ABS (log2EXPi - log2EXPj) > =1 (i, j indicating different organs) were obtained.

RNA extraction and RT-qPCR analysis

RNA extraction and cDNA synthesis were performed as our previous report using Plant RN52 Kit (Aidlab, Beijing, China) and PC54-TRUEscript RT kit (Aidlab, Beijing, China) according to the manual [35]. qRT-PCR (quantitative real-time PCR) was performed to validate the expression patterns of R2R3-MaMYBs in different fruit development stages using ABI StepOnePlus™ Real-Time PCR System (USA). The primers are available in Additional file 1. Actin was used as reference gene [36]. Graphpad Prism8.0 was used to visualize the qRT-PCR results. SPSS19.0 was used to perform T-test and ANOVA, p < 0.05 was marked as significance. Three biological replicates were firstly mixed and three technical replicates respectively were performed for qRT-PCR.

Measurement of anthocyanins content in mulberry fruits

Extraction and measurement of anthocyanins content were performed according to our previous study [35]. The anthocyanins content was given in cyanidin-3-glucoside equivalents.

Results

Identification of MYBs in M. alba

Based on our workflow, 166 genes encoding MaMYBs were identified in M. alba. Among 166 MYB genes, there were 59 1R-MYBs (named MaMYBR1–59), 103 R2R3-MYBs (named MaMYB1–103) and 4 3R-MYBs (named MaMYB3R1–4); and 4R-MYB was not found in M. alba (Table 1 and Additional file 2). Some MYB genes had alternative splicing and generated more than one transcript. The alternative splicing occurred more frequently for 1R-MYBs in M. alba. Similar alternative splicing events were also reported in Arabidopsis and Populus MYBs. A previous study identified 124 MYB-related genes, 116 R2R3-MYBs, and 4 3R-MYBs using a de novo transcriptome of M. alba [23]. This study used a chromosome-level genome and provided stricter and accurate recognition of MYBs in M. alba. Both R2R3-MYBs and 3R-MYBs contained DNA-specific recognition helices, and R2R3-MYBs were the largest class of MYBs in plants, playing diverse roles in plant specific processes. Therefore, we mainly focused on the investigation of 103 R2R3-MYB and 4 3R-MYB family members in this study.

Table 1 MYBs in different plants

Characterization of R2R3 and 3R-MaMYBs in M. alba

Alignment was performed using 115 R2R3 and 3R-MYBs, including the alternative splicing products. The DBD domain was recognized and analyzed based on the alignment result. It was evident that all of these MaMYBs contained DBD domains and two or three repeats could be identified easily (Additional file 2). The DBD 3D structure of MaMYB29 was constructed using AtMYB66 (PDB accession number: 6kks) as a template to identify the helix regions (Fig. 1A). Helices 1, 2, and 3 are shown in Fig. 1. The highly conserved spaced W residues formed the WX19WX19W motif in R2 repeats (Fig. 1B, Additional file 2). Several R2R3-MYBs showed minor differences in the number of spaced amino acids. MaMYB5, 32, 63, 64, 70, and100 had additional G, forming WX18GXWX19W, and MaMYB13 had additional P and L residues, forming WX19WX9PLX10W. The additional amino acids were beyond the helix structures according to the helix regions identified (Fig. 1 and Additional file 2). The spaced W residues in R3 repeats were variable especially at the first W (site 54 in Fig. 1C), although W is still dominant amino acid. The (W/F/I/ M)X18WX18W was recognized in MaMYB R3 repeats (Fig. 1C). The replacement of the first W with F, (less frequently) I, or M in R3 was also reported in Populus, Z. may, Arabidopsis, and Vitis [1, 10,11,12]. MaMYB35 and 37 had replacement of the third W with Y and F, respectively. More conservative amino acids were observed in the third helix region of R2 and R3 repeats than those consisting of the first and second helices (Fig. 1B, C). In addition to the conserved W, there were also several highly conserved amino acids including the amino acids at the start and the end of each helix and amino acids consisting of the third helix in R2 and R3 repeats. In addition, the important MYB function related residues K35, L39, N86, K89, and 90, indicated by red boxes, were also highly conserved. Therefore, we could further modify the three helix regions to W(T/S)X2EDX10GX3WX7(G/L/I)X2RX (G/S)K(S/Q)CR(L/E) WXN(Y/Q/H) for R2 and (W/F/I/M)(T/S)X2E(D/E)X10 GX2W(S/A)X2AX5RTDNX(I/V)KN(Y/H/F)W(N/ H/R)(T/S/V) for R3 (Fig. 1B, C). Interestingly, it was clear that the patterns, W(T/S)X2EDX10G for R2 and (W/F /I/M)(T/S)X2E(D/E)X10G for R3 that formed the first helix were almost same for both R2 and R3 repeats. These patterns were annotated as InterPro MYB domain signature patterns (PS00037). In contrast, the patterns, (G/S)K(S/Q)CRWXN(Y/Q/H) for R2 and TDNX(I/V)KN(Y/H/F)W(N/H/R) (T/S/V) for R3, formed the third helices and were quite different for R2 and R3 repeats. The above results indicated that the conservation of the third helix in R2 or R3 preserved the DNA binding function and the difference in the third helix between R2 and R3 guaranteed recognition for specific DNA sequences. The WX19W18W box was detected for MaMYB3Rs and formed the R1 repeat.

Fig. 1
figure 1

Motif analysis and structural characterization of R2R3-MaMYBs. A The three-dimensional structure of MaMYB29. Helices in R2 and R3 are indicated, and residues at the start and the end of each helix are shown as sticks and balls. B Sequence logos of the R2 repeat of R2R3-MaMYBs. C Sequence logos of the R3 repeat of R2R3-MaMYBs. Helices 1, 2, and 3 are indicated with colored lines below in purple, red, and blue, respectively, and the modified patterns for R2 and R3 are also indicated. Conserved W and F/I/M are indicated by solid purple triangles

Phylogenetic analysis and functional classification of R2R3 and 3R-MaMYBs

Phylogenetic analysis of R2R3 and 3R-MYBs in Arabidopsis, Populus, and Morus showed that MaMYBs clustered well with orthologs in Arabidopsis or Populus. Previous studies have divided R2R3-AtMYBs into 28 subgroups (S1–S28) and R2R3-PtrMYBs into 45 subgroups (C1–C48). R2R3-MaMYBs covered all 28 R2R3-AtMYB subgroups and 44 R2R3-PtrMYB subgroups, except for C13 (Table 2, Additional file 3). In addition, MaMYB65 and 101 in addition to MaMYB53 and 54 (TT2L3) were classified as two Morus specific subgroups. MaMYB54, previously named as TT2L3, was reported to show a similar expression pattern with flavan-3-ol specific gene LAR encoding leucoanthocyanidin reductase during fruit ripening in HG2 [24]. MaMYB59, 73, and 76 belonging to C17 (S4) and MaMYB39 (MYB4) belonging to C18 were annotated as repressors of the phenylpropanoid pathway. AtMYB3, 4, and 32 in S4 were reported to be involved in suppressing lignin or flavonoid biosynthesis [14]. PtrMYB165, 182, and 194 in C18 were characterized as repressors for anthocyanin and proanthocyanidin biosynthesis [14]. MaMYB39, previously named MYB4, can negatively affect both anthocyanin and proanthocyanidin accumulation [24]. MaMYB42, 61, and 75 belonging to C19 (S7), MaMYB83 belonging to C12, MaMYB50, 81, 82, and 99 belonging to C25 (S5), and MaMYB58 belonging to C27 (S6) are annotated as flavonoid related genes based on the functions of their orthologs in Arabidopsis and Populus (Table 2). Other R2R3-MaMYBs with different annotated functions such as responsiveness to abiotic and biotic stresses are also displayed in Table 2.

Table 2 Classification of MaMYBs based on phylogenetic analysis

Promoter analysis and gene structures of R2R3 and 3R-MYBs

Promoter regions (2000 bp upstream) of R2R3 and 3R-MaMYBs were extracted and the cis-elements were predicted. Light-response related cis-elements made up the greatest proportion (48%) of detected cis-elements (Additional file 4). Thirty-one classes of cis-elements were identified and the hormone related cis-elements were widely detected in the promoter regions of MaMYBs (Fig. 2). R2R3-MaMYBs with annotated functions of stress-responsiveness (blue boxes) had more abscisic acid (ABA), salicylic (SA) acid or methyl jasmonate (MeJA) related cis-elements. In particular, MaMYB78, 79, and 80, which were annotated as MYBs involved in stress responses and leaf senescence, had responsive cis-elements for all ABA, SA, MeJA, auxin, and gibberellin. MaMYBs with putative functions of regulating phenylpropanoid pathway were indicated by reddish shadows and blue shadows (for repressors). These R2R3-MaMYBs also possessed MeJA, ABA and stresses related cis-elements. The intron/exon organizations of R2R3 and 3R-MaMYBs were detected, and the exon phases were also indicated (Fig. 3). Thirteen intron/exon patterns (a–m) were summarized, among which, an “a” pattern (two introns with phase numbers of 0 and 2) was the prevalent pattern for MaMYBs at 63.5%. All 3R-MaMYBs belonged to “m” pattern and possessed more than five introns.

Fig. 2
figure 2

Phylogenetic relationships and cis-elements in promoter regions of R2R3 and 3R-MaMYBs. A Phylogenetic tree using 107 R2R3 and 3R-MaMYBs. B Cis-element distribution in the promoter regions of R2R3 and 3R-MaMYBs

Fig. 3
figure 3

Phylogenetic relationships and gene organization of R2R3 and 3R-MaMYBs. A Phylogenetic tree using 107 R2R3 and 3R-MaMYBs. B Exon/intron structures of Populus R2R3-MYBs. C Schematic of intron distribution patterns of R2R3 and 3R-MaMYBs

Chromosomal distribution and synteny analyses of R2R3 and 3R-MaMYBs

All 107 R2R3 and 3R-MaMYBs were mapped to 14 chromosomes based on the genome information. Each chromosome contained several MaMYBs with nonuniform distribution (Fig. 4). Chromosomes 1 and 2, the top two longest chromosomes occupying 30.37% of the whole genome, contain only five and three MaMYBs, respectively (7.17% of total R2R3 and 3R-MaMYBs). Chromosomes 5, 8, and 10 held 13, 14 and 14 members each, respectively, and had the densest distribution of MaMYBs. This nonuniform distribution of MYBs was also observed in other plants including Z. may and Populus [10, 12]. Whole genome duplication (WGD) is the main force driving gene family duplication and is important for neo-function gene occurrence. The duplication events for MaMYBs were detected. Duplicated gene pairs resulting from intra-chromosome duplication and inter-chromosome block duplication were revealed (Additional file 5). Six tandem events in 16 MaMYBs and six block duplication gene pairs in 12 MaMYBs were found for R2R3-MaMYBs (Figs. 4 and 5). Compared with 145 segmental duplication events with 156 R2R3-MYB in Populus, fewer duplication events for MaMYBs occurred because the M. alba lineage underwent no WGDs after its separation from the Eurosid I clade, while Populus underwent a recent WGD event [25, 84].

Fig. 4
figure 4

Distribution of R2R3 and 3R-MaMYBs in Morus alba chromosomes. The tandem gene pairs are linked by blue lines

Fig. 5
figure 5

Schematic representations of collinear relationships of R2R3 and 3R-MaMYBs. Gray and red lines represent all synteny blocks and duplicated R2R2-MYB gene pairs in the Morus genome, respectively. The gene density is indicated in the heatmap by the inner circle. The corresponding chromosome number is shown on each chromosome

Expression profiles of R2R3 and 3R-MaMYBs

A spatial expression profile based on RNA-seq data of roots, stems and leaves showed that 66 of 107 R2R3 and 3R-MaMYBs exhibited organ preference in expression. Among these organ-preferential differentially expressed genes, only 14 MaMYBs (marked with blue colors) showed higher expression levels in leaves (Fig. 6A and Additional file 6). These MaMYBs may be responsible for specific metabolite accumulation in leaves. For example, MaMYB11, 38 and 94 (C2/S9) were annotated as MYBs involved in cuticular wax biosynthesis and showed significant preference in leaves. Flavonoid biosynthesis related MYBs, including MaMYB36, 42, 61, and 102 had the highest expression levels in leaves rather than in stems and roots. MaMYB91, which was annotated as an aliphatic glucosinolate biosynthesis related MYB, was also preferentially expressed in leaves. Most R2R3-MaMYBs with higher expression levels in roots or stems were MYBs involved in secondary cell wall components and stress responsiveness. For example, cell wall component biosynthesis related MYBs, including MaMYB35, 55, and 103 (C42/S21) as well as MaMYB22 and 31 (C7) and stress responsiveness related MYBs, including MaMYB33, 35, 57, 62, 78, 79, and 80 (C40/S22) and MaMYB4 (C22), showed higher expression levels in roots or stems (Fig. 6 A). It was obvious that R2R3-MaMYBs with similar annotated functions always maintained similar expression patterns in different organs.

Fig. 6
figure 6

Expression profiles of R2R3 and 3R-MaMYBs in Morus. A Heatmap of R2R3 and 3R-MaMYB expression levels in Morus roots, stems, and leaves. MaMYBs with higher expression levels in leaves are shown in blue, and MaMYBs with higher expression levels in roots or stems are shown in red. MaMYBs referred to in the text are also indicated by stars. B Heatmap of R2R3 and 3R-MaMYB expression levels in fruits at different developmental stages. The MaMYBs with higher expression levels in S0 or S1 are colored with blue, and the MaMYBs with higher expression levels in S2 or S3 are colored with red. Differentially expressed MaMYBs involved in the phenylpropanoid pathway are indicated by stars

Mulberry fruits are rich in flavonoids and anthocyanins and are used for wine, nutrient food, and medicine. The expression patterns of R2R3-MaMYBs and 3R-MaMYBs were revealed, and 72 of 107 R2R3 and 3R-MaMYBs were identified as DEGs during mulberry ripening (Fig. 6B and Additional file 6). Two clusters of R2R3 and 3R-MaMYBs based on the expression patterns, 27 early-expression MaMYBs, MaMYBs with higher expression levels in S0 or S1 and 45 late-expression MaMYBs, MaMYBs with higher expression levels in S2 or S3, were marked using red and blue respectively. Twenty differentially expressed R2R3-MaMYBs involved in flavonoid biosynthesis, anthocyanin biosynthesis, fruit development, and stress responsiveness and with relatively high overall expression levels were selected for qRT-PCR validation (Fig. 6B, Fig. 7A and Additional files 7 and 8). The mulberry fruit ripening process occurs along with the accumulation of anthocyanin and color changes (Fig. 7B). Both RNA-seq and qRT-PCR showed that R2R3-MaMYBs with putative roles in the phenylpropanoid pathway had different expression patterns during fruit ripening. The correlation relationships of the expression levels of these MaMYBs and anthocyanin contents were also revealed (Fig. 7C). It was clear that MaMYBs with late-expression patterns were positively correlated with anthocyanin accumulation, while MaMYBs with early-expression pattern are negatively correlated with anthocyanin accumulation (Fig. 7C). R2R3-MaMYBs such as MaMYB43, 77, 83, 98, and 58, which may play direct roles in PA or anthocyanin accumulation, showed late-expression patterns. These MaMYBs are likely to positively regulate anthocyanin biosynthesis. For instance, MaMYB58 (MYBA) has been reported to interact with bHLH3 to activate the expression of anthocyanin biosynthetic genes to control anthocyanin biosynthesis. Moreover, some phenylpropanoid pathway repressors, such as MaMYB59 and MaMYB39, also showed increasing expression levels along with fruit ripening. MaMYB59 was annotated as a repressor of lignin biosynthesis, and MaMYB39 was annotated as a repressor of flavonoids biosynthesis. In addition, there were also early-expression pattern R2R3-MaMYBs involved in flavonoid biosynthesis or anthocyanin biosynthesis. MaMYB75 (MYBF), which was reported to be responsible for flavonol accumulation and negatively regulate anthocyanin biosynthesis, showed a decreasing expression level along the fruit ripening. MaMYB36 and 89 were annotated as having functions in anthocyanin biosynthesis and trichome cell differentiation, and also exhibited early-expression patterns, indicating that MaMYB36 and 89 may have been responsible for mulberry fruit trichome development rather than anthocyanin biosynthesis. The different expression patterns of flavonoid related R2R3-MaMYBs along with mulberry fruit ripening suggested that different R2R3-MaMYBs coordinated to maintain homeostasis and avoid the overaccumulation of anthocyanins during fruit ripening.

Fig. 7
figure 7

Relationships between the expression levels of these MaMYBs and anthocyanin accumulation during mulberry fruit ripening. A Expression levels of selected MaMYBs during fruit ripening using qRT-PCR. B Changes in anthocyanin contents of mulberry fruits during fruit ripening. C Correlation between the expression levels of selected MaMYBs and anthocyanin accumulation. The pearson correlation coefficients were marked

Discussion

Genome-wide identification of R2R3-MYB TFs is influenced by quality of genome annotation and assembly [5]. A previous study identified 116 R2R3-MYBs based on de novo transcriptome data [23]. However, in the present study, we identified 103 R2R3-MYBs in M. alba using the latest M. alba genome assembly with a stricter workflow. Sequence length and MEME motif extraction validation were considered as additional two filtering steps to further screen MYBs in this study. The numbers of different kinds of MYBs are close in different plant species, except for Populus (Table 1). Moreover, the number of R2R3-MYBs (103) identified in Morus was close to that of V. vinifera (108), but far less than that of Populus (198). Although the number of R2R3-MaMYBs in Morus was less than that of Populus, the R2R3-MaMYBs covered almost all subgroups of Populus R2R3-MYBs and Arabidopsis MYBs (Table 2). It has been suggested that WGDs and tandem duplication events have contributed to the expansion of R2R3-MYBs in land plants [85]. It was reported that no lineage-specific WGD occurred in M. alba after the γhexaploidization event [25]. Compared with duplication events identified for R2R3-MYBs in Populus which experienced a recent lineage-specific WGD [84], fewer duplication events were found in Morus for the R2R3-MYBs and only 28 MaMYBs resulting from tandem duplication and block duplication were found for the R2R3-MaMYBs (Figs. 4 and 5). The above differences further support the hypothesis that the expansion of R2R3-MYB TFs in land plants has been subfamily/clade asymmetric and lineage specific [5].

The conserved DBD domain in the N-terminal of MYBs was identified in R2R3-MaMYBs. The spaced W forming WX19WX19W was highly conserved in R2 repeat, while the replacement of the first W with F, I, or M at position 54 for R3 was generally reported in plants [10,11,12,13]. The conserved spaced W motifs in R2 and R3 are signature sequences for the DBD domain of MYBs. In addition to the conserved W, there are several highly conserved residues in the DBD domain. These conserved residues mainly contained residues located at the start and end of helices and residues consisting of the third helix (Fig. 1). Residues located at the start and end of helices may be responsible for maintaining the helix structure. The conserved third helices in R2 and R3 are responsible for the specific base recognition in the major groove of the DNA [6]. In addition, the residues consisting of the third helix in R2 and those consisting of the third helix in R3 are greatly different which might be important for recognition of diverse DNA sequences. It has been reported that the residues including L39, N86, K89 and 90 in the third helices of R2 and R3 are important for maintaining MYB functions [7]. In addition, K35 was reported to play an important role in sensing DNA methylation at the fifth position of cytosine (5 mC) [7]. The InterPro MYB domain signature pattern (PS00037) was detected and conserved in the first helices of both R2 and R3 repeats. Given that the first helices were beyond the DNA binding regions, it was possible that the first helix functioned as structure helices.

A total of 103 R2R3-MaMYBs, except for MaMYB53, 54, 65, and 101, were grouped with the R2R3-MYBs from Arabidopsis and Populus, indicating the high retention of R2R3-MYBs in M. alba afterγpaleohexaploidy [25]. MaMYB53, 54, 65, and 101 formed two Morus specific subgroups (Ma1–2). Because R2R3-MYB gene retention after duplication events is biased among angiosperm taxa, and the pervasive duplications of R2R3-MYBs in core eudicots are thought to be gamma-triplication derived [5], R2R3-MYBs in Morus seem to exhibit relatively conserved R2R3-MYBs from before the divergence of the core eudicots. Most R2R3-MYBs in Arabidopsis have been well annotated and functionally characterized [1, 9, 14]. Phylogenetic analysis based functional annotation suggested that R2R3-MaMYBs were mainly involved in responses to abiotic and biotic stresses, development and secondary metabolite biosynthesis (Table 2). AtMYB20, 41 and 44 were reported to involve in response to abiotic stresses and AtMYB44 was reported to enhance stomatal closure to confer abiotic stress tolerance [38, 45, 86, 87]. Wheat TaMYB31 was reported to involve in drought stress responses [81] MaMYB14, 48, 79 and 80 were annotated as MYBs involved in abiotic stress responses in mulberry. AtMYB30 was reported to act as positive regulator of hypersensitive cell death program in plants in response to pathogen attack [50]. MdMYB30 in apple and SlMYB28 in tomato regulated plant resistance to pathogen attack [88, 89]. MaMYB10, 44 and 90 as homologs clustered with AtMYB30 may also function in response to pathogen attack. Mulberry is rich in phenylpropanoid-derived bioactive compounds, such as flavonoids and polyphenols, and mulberry fruits are known for their abundance of anthocyanins [90, 91]. Possible anthocyanin homeostasis was maintained by both cooperative and antagonistic interactions of phenylpropanoid pathway related R2R3-MaMYBs. Both the increasing expression of activators such as MaMYB43, 77, 83, 98 and 58 and decreasing expression levels of MaMYB54 (TT2L3), which was involved in flavonol biosynthesis along with fruit ripening, positively affected anthocyanin accumulation. In contrast, increasing the expression levels of repressors such as MaMYB39 (MYB4) and MaMYB59 helped to avoid the overaccumulation of anthocyanins and preserve the balance between lignin biosynthesis and flavonoid biosynthesis (Fig. 6). Both our expression profiles and a previous study on MaMYB54 (TT2L3) and MaMYB39 (MYB4) support this postulation [24]. The homeostasis between anthocyanin and lignin biosynthesis regulated by cooperation of repressor MdMYB16 and activator MdMYB1 was also reported in apples [92].

Conclusion

A comprehensive analysis of the R2R3-MYB gene family using the latest M. alba chromosome-level genome was conducted in the present study. In total, 166 MYB genes were identified and 103 R2R3-MaMYBs and four 3R-MaMYBs were further analyzed. Functional annotation for R2R3-MaMYBs based on phylogenetic analysis as well as functional annotation of orthologs from Arabidopsis and Populus suggested the possible functional subgroups of R2R3-MaMYBs. Conserved gene organization, promoter analysis, and expression profiles also provided evidence for putative functions. Moreover, R2R3-MYBs involved in the phenylpropanoid pathway were investigated, and MaMYBs involving in flavonoid biosynthesis and anthocyanins accumulation were identified based on functional annotation and the correlation of their expression levels with anthocyanin contents. These findings will be valuable for future genetic improvement of specific biological processes, such as flavonoid or anthocyanin biosynthesis, and provide a basic reference for research on the functional characterization of MaMYB genes involved in specific biological process.

Ethical statement for experimental research and field studies on plants

Experimental research and field studies on plants complies with relevant institutional, national, and international guidelines and legislation.

Availability of data and materials

The datasets generated during the current study are available in the national genomics data center (NGDC) with accession number: CRA006074 (https://ngdc.cncb.ac.cn/ search/?dbId = gsa&q = CRA006074).

References

  1. Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L. MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010;15(10):573–81.

    Article  CAS  PubMed  Google Scholar 

  2. Klempnauer KH, Gonda TJ, Bishop JM. Nucleotide sequence of the retroviral leukemia gene v-myb and its cellular progenitor c-myb: the architecture of a transduced oncogene. Cell. 1982;31(2 Pt 1):453–63.

    Article  CAS  PubMed  Google Scholar 

  3. Rosinski JA, Atchley WR. Molecular evolution of the Myb family of transcription factors: evidence for polyphyletic origin. J Mol Evol. 1998;46(1):74–83.

    Article  CAS  PubMed  Google Scholar 

  4. Paz-Ares J, Ghosal D, Wienand U, Peterson PA, Sa Ed Ler H. The regulatory c1 locus of Zea mays encodes a protein with homology to myb proto-oncogene products and with structural similarities to transcriptional activators. EMBO J. 1987;6(12):3553–8.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  5. Jiang CK, Rao GY. Insights into the diversification and evolution of R2R3-MYB transcription factors in plants. Plant Physiol. 2020;183(2):637–55.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Ogata K, Kanei-Ishii C, Sasaki M, Hatanaka H, Nagadoi A, Enari M, et al. The cavity in the hydrophobic core of Myb DNA-binding domain is reserved for DNA recognition and trans-activation. Nat Struct Mol Biol. 1996;3(2):178–87.

    Article  CAS  Google Scholar 

  7. Wang B, Luo Q, Li Y, Yin L, Zhou N, Li X, et al. Structural insights into target DNA recognition by R2R3-MYB transcription factors. Nucleic Acids Res. 2020;48(1):460–71.

    CAS  PubMed  Google Scholar 

  8. Stracke R, Werber M, Weisshaar B. The R2R3-MYB gene family in Arabidopsis thaliana. Curr Opin Plant Biol. 2001;4(5):447–56.

    Article  CAS  PubMed  Google Scholar 

  9. Liu J, Osbourn A, Ma P. MYB transcription factors as regulators of Phenylpropanoid metabolism in plants. Mol Plant. 2015;8(5):689–708.

    Article  CAS  PubMed  Google Scholar 

  10. Yang X, Li J, Guo T, Guo B, Chen Z, An X. Comprehensive analysis of the R2R3-MYB transcription factor gene family in Populus trichocarpa. Ind Crop Prod. 2021;168:113614.

    Article  CAS  Google Scholar 

  11. Matus JT, Aquea F, Arce-Johnson P. Analysis of the grape MYB R2R3 subfamily reveals expanded wine quality-related clades and conserved gene structure organization across Vitis and Arabidopsis genomes. BMC Plant Biol. 2008;8:83.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. Du H, Feng BR, Yang SS, Huang YB, Tang YX. The R2R3-MYB transcription factor gene family in maize. PLoS One. 2012;7(6):e37463.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Li X, Xue C, Li J, Qiao X, Li L, Li’ang Y, et al. Genome-wide identification, evolution and functional divergence of MYB transcription factors in Chinese white pear (Pyrus bretschneideri). Plant Cell Physiol. 2016;4:824.

    Article  CAS  Google Scholar 

  14. Ma D, Constabel CP. MYB repressors as regulators of Phenylpropanoid metabolism in plants. Trends Plant Sci. 2019;24(3):275–89.

    Article  CAS  PubMed  Google Scholar 

  15. Gonzalez A, Zhao M, Leavitt JM, Lloyd AM. Regulation of the anthocyanin biosynthetic pathway by the TTG1/bHLH/Myb transcriptional complex in Arabidopsis seedlings. Plant J. 2010;53(5):814–27.

    Article  CAS  Google Scholar 

  16. Payne C, Thomas F, Zhang L, Alan M. GL3 Encodes a bHLH Protein That Regulates Trichome Development in Arabidopsis Through Interaction. Genetics. 2000;156(3):1349–62.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Cominelli E, Tonelli C. A new role for plant R2R3-MYB transcription factors in cell cycle regulation. Cell Res. 2009;19(11):1231–2.

    Article  PubMed  Google Scholar 

  18. Ambawat S, Sharma P, Yadav NR, Yadav RC. MYB transcription factor genes as regulators for plant responses: an overview. Physiol Mol Biol Plants. 2013;19(3):307–21.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Raffaele S, Rivas S, Roby D. An essential role for salicylic acid in AtMYB30-mediated control of the hypersensitive cell death program in Arabidopsis. FEBS Lett. 2006;580(14):3498–504.

    Article  CAS  PubMed  Google Scholar 

  20. Lu BB, Li XJ, Sun WW, Li L, Gao R, Zhu Q, et al. AtMYB44 regulates resistance to the green peach aphid and diamondback moth by activating EIN2-affected defences in Arabidopsis. Plant Biol (Stuttg). 2013;15(5):841–50.

    Article  CAS  Google Scholar 

  21. Kosma DK, Murmu J, Razeq FM, Santos P, Bourgault R, Molina I, et al. AtMYB41 activates ectopic suberin synthesis and assembly in multiple plant species and cell types. Plant J. 2014;80(2):216–29.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  22. He N, Zhang C, Qi X, Zhao S, Tao Y, Yang G, et al. Draft genome sequence of the mulberry tree Morus notabilis. Nat Commun. 2013;4(1):2445.

    Article  PubMed  CAS  Google Scholar 

  23. Du XL, Cao X, Yin CR, Tang Z, Du W, Ban YY, et al. Comprehensive analysis of R2R3-MYB genes during adventitious root formation in cuttings of Morus alba. J Plant Growth Regul. 2016;36(2):290–9.

    Article  CAS  Google Scholar 

  24. Li H, Yang Z, Zeng Q, Wang S, Luo Y, Huang Y, et al. Abnormal expression of bHLH3 disrupts a flavonoid homeostasis network, causing differences in pigment composition among mulberry fruits. Hortic Res. 2020;7:83.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Jiao F, Luo R, Dai X, Liu H, Yu G, Han S, et al. Chromosome-level reference genome and population genomic analysis provide insight into the evolution and improvement of domesticated mulberry (Morus alba L). Mol Plant. 2020;13(7):1001–12.

    Article  CAS  PubMed  Google Scholar 

  26. Chen C, Chen H, Zhang Y, Thomas HR, Frank MH, He Y, et al. TBtools: An integrative toolkit developed for interactive analyses of big biological data. Mol Plant. 2020;13(8):1194–202.

    Article  CAS  PubMed  Google Scholar 

  27. Delano WL. The PyMOL molecular graphics system. My Publications. 2014;30:442–54.

    Google Scholar 

  28. Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30(12):2725–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J. BLAST+: architecture and applications. BMC Bioinformatics. 2009;10:421.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  30. Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, et al. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Chao N, Yu T, Hou C, Liu L, Zhang L. Genome-wide analysis of the lignin toolbox for morus and the roles of lignin related genes in response to zinc stress. PeerJ. 2021;9:e11964.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Langdon BW. Performance of genetic programming optimised Bowtie2 on genome comparison and analytic testing (GCAT) benchmarks. BioData Min. 2015;8:1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  33. Pertea M, Pertea GM, Antonescu CM, Chang TC, Mendell JT, Salzberg SL. StringTie enables improved reconstruction of a transcriptome from RNA-seq reads. Nat Biotechnol. 2015;33(3):290–5.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  34. Anders S, Huber W. Differential expression of RNA-Seq data at the gene level-the DESeq package. Heidelberg, Germany: European Molecular Biology Laboratory (EMBL). 2012;10:f1000research.

  35. Chao N, Wang RF, Hou C, Yu T, Miao K, Cao FY, et al. Functional characterization of two chalcone isomerase (CHI) revealing their responsibility for anthocyanins accumulation in mulberry. Plant Physiol Biochem. 2021;161:65–73.

    Article  CAS  PubMed  Google Scholar 

  36. Shukla P, Reddy RA, Ponnuvel KM, Rohela GK, Shabnam AA, Ghosh MK, et al. Selection of suitable reference genes for quantitative real-time PCR gene expression analysis in mulberry (Morus alba L.) under different abiotic stresses. Mol Biol Rep. 2019;46(2):1809–17.

    Article  CAS  PubMed  Google Scholar 

  37. Hai D, Yong-Bin W, Yi X, Zhe L, San-Jie J, Shuang-Shuang Z, et al. Genome-wide identification and evolutionary and expression analyses of MYB-related genes in land plants. DNA Res. 2013;20(5):437–48.

    Article  CAS  Google Scholar 

  38. Cominelli E, Sala T, Calvi D, Gusmaroli G, Tonelli C. Over-expression of the Arabidopsis AtMYB41 gene alters cell expansion and leaf surface permeability. Plant J. 2008;53(1):53–64.

    Article  CAS  PubMed  Google Scholar 

  39. Hoang MH, Nguyen XC, Lee K, Kwon YS, Pham HT, Park HC, et al. Phosphorylation by AtMPK6 is required for the biological function of AtMYB41 in Arabidopsis. Biochem Biophys Res Commun. 2012;422(1):181–6.

    Article  CAS  PubMed  Google Scholar 

  40. Denekamp M, Smeekens SC. Integration of wounding and osmotic stress signals determines the expression of the AtMYB102 transcription factor gene. Plant Physiol. 2003;132(3):1415–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  41. Oshima Y, Mitsuda N. The MIXTA-like transcription factor MYB16 is a major regulator of cuticle formation in vegetative organs. Plant Signal Behav. 2013;8(11):e26826.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  42. Lashbrooke J, Cohen H, Levy-Samocha D, Tzfadia O, Panizel I, Zeisler V, et al. MYB107 and MYB9 homologs regulate Suberin deposition in angiosperms. Plant Cell. 2016;28(9):2097–116.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. To A, Joubes J, Thueux J, Kazaz S, Lepiniec L, Baud S. AtMYB92 enhances fatty acid synthesis and suberin deposition in leaves of Nicotiana benthamiana. Plant J. 2020;103(2):660–76.

    Article  CAS  PubMed  Google Scholar 

  44. Chen Y, Wang H-Y, Chen Y-F. The transcription factor MYB40 is a central regulator in arsenic resistance in Arabidopsis. Plant Commun. 2021;2:100234.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Cui MH, Yoo KS, Hyoung S, Nguyen HT, Kim YY, Kim HJ, et al. An Arabidopsis R2R3-MYB transcription factor, AtMYB20, negatively regulates type 2C serine/threonine protein phosphatases to enhance salt tolerance. FEBS Lett. 2013;587(12):1773–8.

    Article  CAS  PubMed  Google Scholar 

  46. Gao S, Zhang YL, Yang L, Song JB, Yang ZM. AtMYB20 is negatively involved in plant adaptive response to drought stress. Plant Soil. 2014;376(1):433–43.

    Article  CAS  Google Scholar 

  47. Wang S, Li E, Porth I, Chen JG, Mansfield SD, Douglas CJ. Regulation of secondary cell wall biosynthesis by poplar R2R3 MYB transcription factor PtrMYB152 in Arabidopsis. Sci Rep. 2014;4:5054.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  48. Geng P, Zhang S, Liu J, Zhao C, Wu J, Cao Y, et al. MYB20, MYB42, MYB43, and MYB85 regulate phenylalanine and lignin biosynthesis during secondary Cell Wall formation. Plant Physiol. 2020;182(3):1272–83.

    Article  CAS  PubMed  Google Scholar 

  49. Zhang Y, Cao G, Qu LJ, Gu H. Characterization of Arabidopsis MYB transcription factor gene AtMYB17 and its possible regulation by LEAFY and AGL15. J Genet Genom. 2009;36(2):99–107.

    Article  CAS  Google Scholar 

  50. Vailleau F, Daniel X, Tronchet M, Montillet JL, Triantaphylides C, Roby D. A R2R3-MYB gene, AtMYB30, acts as a positive regulator of the hypersensitive cell death program in plants in response to pathogen attack. Proc Natl Acad Sci U S A. 2002;99(15):10179–84.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Cominelli E, Galbiati M, Albertini A, Fornara F, Conti L, Coupland G, et al. DOF-binding sites additively contribute to guard cell-specificity of AtMYB60 promoter. BMC Plant Biol. 2011;11:162.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  52. Chen Y, Chen Z, Kang J, Kang D, Gu H, Qin G. AtMYB14 regulates cold tolerance in Arabidopsis. Plant Mol Biol Report. 2013;31:87–97.

    Article  CAS  PubMed  Google Scholar 

  53. Chen Y, Zhang X, Wu W, Chen Z, Gu H, Qu LJ. Overexpression of the wounding-responsive gene AtMYB15 activates the shikimate pathway in Arabidopsis. J Integr Plant Biol. 2006;48(9):1084–95.

    Article  CAS  Google Scholar 

  54. Zhou J, Lee C, Zhong R, Ye ZH. MYB58 and MYB63 are transcriptional activators of the lignin biosynthetic pathway during secondary cell wall formation in Arabidopsis. Plant Cell. 2009;21(1):248–66.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  55. Gu JN, Zhu J, Yu Y, Teng XD, Lou Y, Xu XF, et al. DYT1 directly regulates the expression of TDF1 for tapetum development and pollen wall formation in Arabidopsis. Plant J. 2014;80(6):1005–13.

    Article  CAS  PubMed  Google Scholar 

  56. Li DD, Xue JS, Zhu J, Yang ZN. Gene regulatory network for Tapetum development in Arabidopsis thaliana. Front Plant Sci. 2017;8:1559.

    Article  PubMed  PubMed Central  Google Scholar 

  57. Fernandez-Marcos M, Desvoyes B, Manzano C, Liberman LM, Benfey PN, Del Pozo JC, et al. Control of Arabidopsis lateral root primordium boundaries by MYB36. New Phytol. 2017;213(1):105–12.

    Article  CAS  PubMed  Google Scholar 

  58. Liberman LM, Sparks EE, Moreno-Risueno MA, Petricka JJ, Benfey PN. MYB36 regulates the transition from proliferation to differentiation in the Arabidopsis root. Proc Natl Acad Sci U S A. 2015;112(39):12099–104.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  59. Liang G, He H, Li Y, Ai Q, Yu D. MYB82 functions in regulation of trichome development in Arabidopsis. J Exp Bot. 2014;65(12):3215–23.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  60. Allan AC, Hellens RP, Laing WA. MYB transcription factors that colour our fruit. Trends Plant Sci. 2008;13(3):99–102.

    Article  CAS  PubMed  Google Scholar 

  61. Diana Lucia Zuluaga ASGA, Elena Loreti B, Chiara Pucciariello A, Elena Degl’Innocenti C, Lucia Guidi C, Amedeo Alpi D, et al. Arabidopsis thaliana MYB75/PAP1 transcription factor induces anthocyanin production in transgenic tomato plants. Funct Plant Biol. 2008;35(7):606–18.

    Article  PubMed  Google Scholar 

  62. Qiao Z, Dixon RA. Transcriptional networks for lignin biosynthesis: more complex than we thought? Trends Plant Sci. 2011;16(4):227–33.

    Article  CAS  Google Scholar 

  63. Steiner-Lange S, Unte US, Eckstein L, Yang C, Saedler H. Disruption of Arabidopsis thaliana MYB26 results in male sterility due to non-dehiscent anthers. Plant J. 2010;34(4):519–28.

    Article  Google Scholar 

  64. Zhang Z, Zhu J, Gao J, Chen W, Hui L, Hong L, et al. Transcription factor AtMYB103 is required for anther development by regulating tapetum development, callose dissolution and exine formation in Arabidopsis. Plant J. 2010;52(3):528–38.

    Article  CAS  Google Scholar 

  65. Zhu J, Zhang GQ, Chang YH, Li XC, Yang J, Huang XY, et al. AtMYB103 is a crucial regulator of several pathways affecting Arabidopsis anther development. Sci China Life Sci. 2010;53(9):1112–22.

    Article  CAS  PubMed  Google Scholar 

  66. Liang YK, Dubos C, Dodd IC, Holroyd GH, Campbell MM. AtMYB61, an R2R3-MYB transcription factor controlling stomatal aperture in Arabidopsis thaliana. Curr Biol. 2005;15(13):1201–6.

    Article  CAS  PubMed  Google Scholar 

  67. Newmant LJ, Perazza DE, Judas L, Campbell MM. Involvement of the R2R3-MYB, AtMYB61, in the ectopic lignification and dark-photomorphogenic components of the det3 mutant phenotype. Plant J. 2004;37(2):239–50.

    Article  CAS  Google Scholar 

  68. Guo Y, Gan S. AtMYB2 regulates whole plant senescence by inhibiting cytokinin-mediated branching at late stages of development in Arabidopsis. Plant Physiol. 2011;156(3):1612–9.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  69. Vos MD, Denekamp M, Dicke M, Vuylsteke M, Loon LV, Smeekens SC, et al. The Arabidopsis thaliana transcription factor AtMYB102 functions in defense against the insect herbivore Pieris rapae. Plant Signal Behav. 2006;1(6):305–11.

    Article  PubMed  PubMed Central  Google Scholar 

  70. Shin B, Choi G, Yi H, Yang S, Choi G. AtMYB21, a gene encoding a flower-specific transcription factor, is regulated by COP1. Plant J. 2010;30(1):23–32.

    Article  Google Scholar 

  71. Yang X, Li J, Pei M, Gu H, Chen Z, Qu L-J. Over-expression of a flower-specific transcription factor gene AtMYB24 causes aberrant anther development. Plant Cell Rep. 2007;26(2):219–28.

    Article  CAS  PubMed  Google Scholar 

  72. Kranz HD, Denekamp M, Greco R, Jin H, Leyva A, Meissner RC, et al. Towards functional characterisation of the members of theR2R3-MYBgene family fromArabidopsis thaliana. Plant J. 2010;16(2):263–76.

    Article  Google Scholar 

  73. Albert NW, Davies KM, Lewis DH, Zhang H, Montefiori M, Brendolise C, et al. A conserved network of transcriptional activators and repressors regulates anthocyanin pigmentation in eudicots. Plant Cell. 2014;26(3):962–80.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  74. Winiewska A, Wojszko K, Róańska E, Lenarczyk K, Sobczak M. Arabidopsis thaliana Myb59 gene is involved in the response to Heterodera schachtii infestation, and its overexpression disturbs regular development of nematode-induced syncytia. Int J Mol Sci. 2021;22(12):6450.

    Article  CAS  Google Scholar 

  75. Fasani E, Dalcorso G, Costa A, Zenoni S, Furini A. The Arabidopsis thaliana transcription factor MYB59 regulates calcium signalling during plant growth and stress response. Plant Mol Biol. 2019;99(6):1–18.

    Article  CAS  Google Scholar 

  76. Imran QM, Hussain A, Lee SU, Mun BG, Falak N, Loake GJ, et al. Transcriptome profile of NO-induced Arabidopsis transcription factor genes suggests their putative regulatory role in multiple biological processes. Sci Rep. 2018;8(1):771.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  77. Nguyen NH, Cheong JJ. H2A.Z-containing nucleosomes are evicted to activate AtMYB44 transcription in response to salt stress. Biochem Biophys Res Commun. 2018;499(4):1039–43.

    Article  CAS  PubMed  Google Scholar 

  78. Shim JS, Choi YD. Direct regulation of WRKY70 by AtMYB44 in plant defense responses. Plant Signal Behav. 2013;8(6):e20783.

    PubMed  Google Scholar 

  79. Zou B, Jia Z, Tian S, Wang X, Gou Z, L B, et al. AtMYB44 positively modulates disease resistance to pseudomonas syringae through the salicylic acid signalling pathway in Arabidopsis. Funct Plant Biol. 2013;40(3):304–13.

    Article  CAS  PubMed  Google Scholar 

  80. Zhong R, Ye ZH. Transcriptional regulation of lignin biosynthesis. Plant Signal Behav. 2009;4(11):1028–34.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  81. Xie Y, Chen P, Yan Y, Bao C, Li X, Wang L, et al. An atypical R2R3 MYB transcription factor increases cold hardiness by CBF-dependent and CBF-independent pathways in apple. New Phytol. 2018;218(1):201–18.

    Article  CAS  PubMed  Google Scholar 

  82. Yin L, Chen H, Cao B, Lei J, Chen G. Molecular characterization of MYB28 involved in aliphatic Glucosinolate biosynthesis in Chinese kale (Brassica oleracea var. alboglabra bailey). Front Plant Sci. 2017;8:1083.

    Article  PubMed  PubMed Central  Google Scholar 

  83. Wang W, Sijacic P, Xu P, Lian H, Liu Z. Arabidopsis TSO1 and MYB3R1 form a regulatory module to coordinate cell proliferation with differentiation in shoot and root. Proc Natl Acad Sci U S A. 2018;115(13):E3045–54.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  84. Tuskan G, Difazio S, Jansson S, Bohlmann J, Grigoriev I, Hellsten U, et al. The genome of black cottonwood, Populus trichocarpa (Torr. & gray). Science. 2006;313(5793):1596–604.

    Article  CAS  PubMed  Google Scholar 

  85. Du H, Liang Z, Zhao S, Nan MG, Li JN. The evolutionary history of R2R3-MYB proteins across 50 eukaryotes: new insights into subfamily classification and expansion. Sci Rep. 2015;5:11037.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  86. Lippold F, Sanchez DH, Musialak M, Schlereth A, Scheible WR, Hincha DK, et al. AtMyb41 regulates transcriptional and metabolic responses to osmotic stress in Arabidopsis. Plant Physiol. 2009;149(4):1761–72.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  87. Jung C, Seo JS, Han SW, Koo YJ, Kim CH, Song SI, et al. Overexpression of AtMYB44 enhances stomatal closure to confer abiotic stress tolerance in transgenic Arabidopsis. Plant Physiol. 2008;146(2):623–35.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  88. Zhang YL, Zhang CL, Wang GL, Wang YX, Qi CH, Zhao Q, et al. The R2R3 MYB transcription factor MdMYB30 modulates plant resistance against pathogens by regulating cuticular wax biosynthesis. BMC Plant Biol. 2019;19(1):362.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  89. Li T, Zhang X-Y, Huang Y, Xu Z-S, Wang F, Xiong A-S. An R2R3-MYB transcription factor, SlMYB28, involved in the regulation of TYLCV infection in tomato. Sci Hortic. 2018;237:192–200.

    Article  CAS  Google Scholar 

  90. He X, Chen X, Ou X, Ma L, Xu W, Huang K. Evaluation of flavonoid and polyphenol constituents in mulberry leaves using HPLC fingerprint analysis. Int J Food Sci Tech. 2019;55(2):526–33.

    Article  CAS  Google Scholar 

  91. Zhang D-Y, Wan Y, Hao J-Y, Hu R-Z, Chen C, Yao X-H, et al. Evaluation of the alkaloid, polyphenols, and antioxidant contents of various mulberry cultivars from different planting areas in eastern China. Ind Crop Prod. 2018;122:298–307.

    Article  CAS  Google Scholar 

  92. Hu Y, Cheng H, Zhang Y, Zhang J, Niu S, Wang X, et al. The MdMYB16/MdMYB1-miR7125-MdCCR module regulates the homeostasis between anthocyanin and lignin biosynthesis during light induction in apple. New Phytol. 2021;231(3):1105–22.

    Article  CAS  PubMed  Google Scholar 

Download references

Acknowledgements

Great thanks for professor Jiao who provided us the genome annotation file of Morus alba. We thank LetPub (www.letpub.com) for its linguistic assistance during the preparation of this manuscript.

Funding

This work was jointly supported by the Natural Science Foundation of Jiangsu Province [BK20210879 to Nan Chao], Crop Germplasm Resources Protection Project of the Ministry of Agriculture and Rural Affairs of the People’s Republic of China (19200382), National Infrastructure for Crop Germplasm Resources (NCGRC-2020-041), and China Agriculture Research System of MOF and MARA (CARS-18).

Author information

Authors and Affiliations

Authors

Contributions

Li Liu guided the work and provided advices; Nan Chao, Yidilisi Keermula and Xiaoru Kang performed the experiments and analyzed the data; Li Liu and Nan Chao organized the figures and wrote the manuscript. Xu Cao provided advises and revised the manuscript. The author(s) read and approved the final manuscript.

Corresponding author

Correspondence to Li Liu.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Primers used in this study.

Additional file 2: Table S2.

MYBs in Morus alba.

Additional file 3: Fig. S1.

Phylogenetic tree of R2R3-MYBs from Arabidopsis, Populus, and Morus. The subgroups are indicated based on both the classification in Arabidopsis and Populus.

Additional file 4: Table S3.

Predicted cis-elements in promoter regions of MaMYBs.

Additional file 5: Table S4.

Gene duplication in Morus alba.

Additional file 6: Table S5.

Expression matrix of MaMYBs based on RNA-seq.

Additional file 7: Fig. S2.

qRT-PCR results for 20 differentially expressed R2R3-MaMYBs during fruit ripening. The significance was indicated by different letters(p < 0.01).

Additional file 8: Table S6.

Relative expression matrix for 20 differentially expressed R2R3-MaMYBs during fruit ripening.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, L., Chao, N., Yidilisi, K. et al. Comprehensive analysis of the MYB transcription factor gene family in Morus alba. BMC Plant Biol 22, 281 (2022). https://doi.org/10.1186/s12870-022-03626-5

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12870-022-03626-5

Keywords