Diversification of plant SUPPRESSOR OF MAX2 1 (SMAX1)-like genes and genome-wide identification and characterization of cotton SMXL gene family
BMC Plant Biology volume 23, Article number: 419 (2023)
Strigolactones (SLs) are a recently discovered class of plant hormones. SUPPRESSOR OF MAX2 1 (SMAX1)-like proteins, key component of the SL signaling pathway, have been studied extensively for their roles in regulating plant growth and development, such as plant branching. However, systematic identification and functional characterization of SMXL genes in cotton (Gossypium sp.), an important fiber and oil crop, has rarely been conducted.
We identified 210 SMXL genes from 21 plant genomes and examined their evolutionary relationships. The structural characteristics of the SMXL genes and their encoded proteins exhibited both consistency and diversity. All plant SMXL proteins possess a conserved Clp-N domain, P-loop NTPase, and EAR motif. We identified 63 SMXL genes in cotton and classified these into four evolutionary branches. Gene expression analysis revealed tissue-specific expression patterns of GhSMXL genes, with some upregulated in response to GR24 treatment. Protein co-expression network analysis showed that GhSMXL6, GhSMXL7-1, and GhSMXL7-2 mainly interact with proteins functioning in growth and development, while virus-induced gene silencing revealed that GhSMAX1-1 and GhSMAX1-2 suppress the growth and development of axillary buds.
SMXL gene family members show evolutionary diversification through the green plant lineage. GhSMXL6/7–1/7–2 genes play critical roles in the SL signaling pathway, while GhSMXL1-1 and GhSMXL1-2 function redundantly in growth of axillary buds. Characterization of the cotton SMXL gene family provides new insights into their roles in responding to SL signals and in plant growth and development. Genes identified in this study could be used as the candidate genes for improvement of plant architecture and crop yield.
Plant growth and development are regulated by a variety of endogenous and exogenous signals. Plant hormones are one of the major endogenous signals that can respond rapidly to environmental stimuli . Strigolactones (SLs), initially found to be secreted from plant roots, act as a germination stimulant for weed seeds and also function in parasitic interactions [2, 3]. SLs are also involved in the development of algae and monocot and dicot plants [4, 5]. In plant growth, SLs regulate stem branching (tillering), leaf senescence, stem secondary wall thickening, photomorphogenesis, and stem elongation, among other [6,7,8,9]. Karrikin (KAR)/karrikin-like (KL) is a chemical signal discovered in wildfire smoke, that can effectively stimulate seed germination and seedling growth in Arabidopsis. KAR/KL activity depends on the F-box protein MORE AXILLARY BRANCHING (MAX2) in the Skp-Cullin-F-box (SCF) complex [10, 11]. The SCFMAX2 complex is involved in regulation of both the SL and KAR signaling pathways, and SL and KAR compounds exhibit partially similar molecular structures and share common biological functions . The receptors for SL and KAR are α/β-hydrolases DWARF14 (D14) and KARRIKIN INSENSITIVE2 (KAI2), respectively . Studies of Arabidopsis d14 and kai2 mutants have shown that MAX2 mediates the development processes regulated by SL and KAR/KL . SUPPRESSOR OF MAX2 1 (SMAX1)-LIKE (SMXL) is a candidate target protein of the SCFMAX2 complex, which may act downstream of MAX2 and respond to KAR and SL signals .
A homolog of SMAX1, named D53, was identified in a SL-insensitive rice (Oryza sativa) mutant and acts as a negative regulator of tillering . The rice D53 homologs, SMAX1, and SMXL6, 7, and 8, are negative regulators of stem branching in the SL signaling pathway, inhibiting the transmission of KAR and SL signals [14, 16]. Exogenous application of the SL analog GR24 leads to the ubiquitination and degradation of SMXL6, 7, and 8. A complex consisting of SMXL, MAX2, and TOPLESS-RELATED PROTEIN2 (TPR2) interacts with D14 in response to GR24 . Treatment of Arabidopsis seedlings with GR24 promotes the degradation of SMAX1 and SMXL2 through the D14-SCFMAX2 signaling pathway, resulting in MAX2-dependent degradation of different members of the SMXL protein family . The 26S proteasome specifically recognizes the SMXL protein and then is degrades it, thereby preventing the inhibitory effect of downstream transcription factor BRANCHED1 (BRC1) . AtSMAX1 and AtSMXL2 regulate the development of roots and root hairs as well as the elongation of hypocotyls in Arabidopsis. The complex formed by SMAX1/SMXL2 and MAX2 binds to KAI2, which lead to SMAX1/SMXL2 ubiquitination and degradation of when perceiving KAR signals [20, 21]. Previous research has shown that SMXL6, SMXL7, and SMXL8 genes are regulated by KAI2 and are involved in the suppression of abscisic acid (ABA) by inhibiting the BRC1 gene through the ethylene response factor-associated amphiphilic repression (EAR) motif. This inhibition ultimately promotes branching in plants . KAI2 is not capable of breaking down SMXL6, SMXL7, or SMXL8, but it is able to target SMXL2 . Overall, these studies suggest that D53-like SMXL proteins are part of the SL signaling pathway and play a significant role in plant growth and development, particularly in regulating branching, by responding to KAR and SL signals.
The phosphatidylethanolamine binding proteins, FLOWERING LOCUS T (FT) and TERMINAL FLOWERING L (TFL1), antagonistically regulate flowering transition and the establishment of plant architecture . In Arabidopsis, the FT gene and the bZIP transcription factor gene FD work together to respond to inductive photoperiods and overcome TFL1-FD inhibition of branching. The TFL1-FD complex is identified as a central regulator that restrains reproductive development and endogenous signaling pathways. Recent research has identified the TFL1-FD complex as a hub that plays key roles in inhibiting plant reproductive development . Through large-scale ChIP-seq and functional analysis of downstream genes, TFL1 and FT have been shown to share several important target genes in multiple phytohormone pathways through competition for FD [25, 26]. Components of SL signaling, SMXL6 and SMXL8, may be target genes of the TFL1-FD complex . In particular, TFL1 protein inhibits the target protein SMXL by competing with FT for FD, thus promoting axillary bud branching [24, 27].
Cotton (Gossypium sp.) is an important economic crop, and its fruiting branch length and branching pattern have a direct impact on plant architecture, mechanical harvesting, and yield [28,29,30]. Previous research has identified the involvement of the SL signal in lateral branch development of cotton. For example, ectopic expression of GhMAX2a and GhMAX2b rescues the dwarf and multi-branch phenotypes of max2 mutants in Arabidopsis . Moreover, protein interaction studies have demonstrated that GhMAX2b interacts with Skp and other proteins to form a SCF-E3 complex, which plays a critical role in lateral branch development . Silencing of GhMAX2 in cotton results in a dwarf plant phenotype, with slow growth and shortened fruiting internodes and fibers . However, the biological functions and mechanisms of SMXL genes in the SL signaling pathway in cotton require further elucidation, and systematic identification and characterization of the SMXL gene family in cotton is still lacking.
In this study, we used a homology search to comprehensively explore the evolution of the SMXL gene family in 21 green plant species using recently updated genome sequences of Chlorophyta, Bryophyta, and angiosperm groups. We also focused on extensive identification and comparative analysis of SMXL gene family members in the genomes of tetraploid (Gossypium hirsutum and Gossypium barbadense) and diploid (Gossypium raimondii, Gossypium arboreum, and Gossypium herbaceum) cotton genomes, performing comprehensive characterization. Our results lay a foundation for further research on the evolution and function of plant, especially cotton, SMXL genes.
Identification and evolution of plant SMXL gene family members
We identified 210 SMXL genes across 21 plant genomes by combining homology comparisons and structural domain searches (Table S1). Phylogenetic analysis of these species indicated that closely related plants were grouped together. Genome-wide analysis revealed a lack of encoded SMXL proteins in green algae, but four potential SMAX1 proteins in moss (Fig. 1a). The information of genome version used in this study was listed in Table S1. Among angiosperms, the number of SMXL genes ranged from six in Vitis vinifera to 26 in Glycine max. Monocot species possessed a similar number of SMXL genes: nine in rice (Oryza sativa), 10 in millet (Setaria italica), and 11 in maize (Zea mays). However, the number of SMXL genes in dicots varied greatly: for example, six in cocoa (Theobroma cacao; Malvaceae), seven in tomato (Solanum lycopersicum; Solanaceae), and 14 in potato (Solanum tuberosum). Similarly, diploid cotton had nine SMXL genes, while tetraploid cotton had 18. In Fabaceae plants, the number of SMXL genes also varied greatly, with 26 in soybean (Glycine max) and 13 in alfalfa (Medicago truncatula). We identified 18, 18, 9, 9, and 9 SMXL genes from G. hirsutum, G. barbadense, G. herbaceum, G. raymondii, and G. arboreum, respectively. The molecular weights and isoelectric points of 63 SMXL proteins are shown in Table S2. We reconstructed a phylogenetic tree using 89 SMXL genes identified in ten species genomes: Arabidopsis (8), Physcomitrella patens (4), Vitis vinifera (6), O. sativa (9), Z. mays (11), T. cacao (6), G. hirsutum (18), G. raymondii (9), G. arboreum (9), and G. herbaceum (9) (Fig. 1b). This showed that the 89 SMXL genes clustered into four distinct branches (I, II, III, and IV) (Fig. 1b), with clade I containing SMAX1 and SMXL2, while SMXL6, SMXL7, and SMXL8 were grouped in clade II. Clade III included only SMXL3, and clade IV contained SMXL4 and SMXL5. Moss, an ancient species in evolutionary terms, had only one evolutionary branch of the SMAX1 gene belonging to clade I. The 11 SMXL genes identified in the monocot maize belonged to three distinct evolutionary branches, namely clades I, III, and IV, while the six SMXL genes in dicotyledon cocoa and the 45 SMXL genes present in cotton species could be classified into four evolutionary branches. Further analysis showed that the SMXL6, 7, and 8 genes in clade II were unique to dicotyledons, with no homologous genes found in monocots. Tetraploid cotton possessed twice as many SMXL genes as diploid cotton, which evolutionarily belonged to four branches (Figs. 1b and 2a).
Cotton SMXL gene structure, and amino acid motifs and 3D structure of GhSMXL proteins
We used 45 SMXL proteins from G. hirsutum, G. herbaceum, G. raymondii, and G. arboreum for phylogenetic analysis. These were clearly clustered into four groups (Fig. 2a), consistent with the above results (Fig. 1b). Gene structure analysis demonstrated a high degree of consistency in gene structure among members of the same subgroup in cotton (Fig. 2b). Most SMXL genes contained three exons and two introns, but individual SMXL genes had specific structures. For example, in the first subgroup, GaSMAX1-1 had four exons and three introns, in the second subgroup, GaSMXL7-1 had five exons and four introns, and in the third subgroup, GaSMXL3-3 and GheSMXL3-3, had four exons and three introns (Fig. 2b). During motif analysis, we select 15 motifs for analysis, which can provide more comprehensive insights into the structural and functional characteristics of SMXL proteins, with a particular emphasis on subgroup’s distinctions. Subsequently, we predicted 15 motif structures between and within GoSMXL subgroups in different ploidy cotton species. The available cotton genome includes a tetraploid species, G. hirsutum (AD1), and three diploid species: G. raimondii (D5), G. herbaceum (A1), and G. arboreum (A2), which showed high conservation among and within subgroups of the SMXL protein family (Fig. 2c, Table S3). Despite the deletion of motifs in individual proteins, most members displayed highly conserved motif structures. Motifs 1–4 represented the double Clp-N domain. Motifs 12–15 were located in the SMXL C-terminal region, showing correspondence with the P-loop NTPase domain. All identified EAR motifs were located within motif 12. Clp-N domains and P-loop NTPase were highly conserved in G. hirsutum, with the exception of GhSMXL3-2-A, which displayed a conserved EAR motif (Fig. S1). We further predicted the three-dimensional (3D) structure of the GhSMXL proteins based on the AlphaFold2 program  (Fig. 2d). Nine pairs of GhSMXL proteins were classified into four types based on their 3D structures: GhSMAX1-1/GhSMAX1-2, GhSMXL3-1/GhSMXL3-2/GhSMXL3-3, GhSMXL4, and GhSMXL6/GhSMXL7-1/GhSMXL7-2. Additionally, five core domains of GhSMXL protein models were defined according to the naming of Shabek et al. (2018) and Zhou et al. (2013): N-terminal Double Clp-N motif (N), hypothetical ATPase domain 1 (D1), middle region (M), and C-terminal hypothetical ATPase domain 2 (D2), which is further divided into D2a and D2b subdomains . These findings suggest that the higher the amino acid similarity between proteins, the more similar their 3D structures, and possibly the more similar their functions.
Chromosomal distribution, collinearity, and selection pressure analysis of the cotton SMXL gene family
The SMXL gene family members were found to be distributed on specific chromosomes in tetraploid cotton species G. hirsutum (AD1) and G. barbadense (AD2) (Fig. S2), consistent with their distribution in AD1 and AD2. Specifically, SMXL genes were found on chromosomes D04, A05, D05, A08, D08, A09, D09, A10, D10, A13, and D13 in both tetraploid species. In G. raimondii (D5), these genes were located on chromosomes 6, 7, 9, 11, and 13, whereas in G. arboreum (A2), they were located on chromosomes 04, 05, 07, 08, 09, 10, and 13. Similarly, in G. herbaceum (A1), these genes were located on chromosomes 5, 7, 8, 9, 10, and 13. Notably, chromosome A05 in both tetraploid species had the highest number of SMXL gene family members, with four genes located on each chromosome. Diploid species, represented by G. arboreum, G. herbaceum, and G. raimondii, had three SMXL genes located on chromosomes 5 and 13. Notably, GheSMXL3-1 (GheUnG00160) in G. herbaceum was not located on a specific chromosome, but on unassembled Contig1004. In addition, most genes except GhSMXL7-2 were distributed at the ends of chromosomes. However, one homeologous gene pair, GhSMXL3-1-A/D, was located on distinct A and D chromosomes. GhSMXL3-1-A locates at the end of chromosome 5, while GhSMXL3-1-D locates at the end of chromosome 4. The same phenomenon was observed for the G. barbadense GbSMXL3-1-A/D gene pair. These findings suggest that the chromosome segment of these genes may have undergone translocation during the evolution of tetraploid cotton.
The Dt genome of allotetraploid cotton originated from G. raimondii, while the At genome is thought to be derived from the common ancestor of G. arboreum and G. herbaceum . To investigate the expansion and contraction of the SMXL gene family during evolution, we identified the collinear gene pairs of diploid cotton in tetraploid cotton. We further analyzed information on 63 SMXL gene pairs discovered in tetraploid cotton (AD1 and AD2) and diploid cotton (D5, A1, and A2) (Table S4). Using MCScanX software , we subsequently reconstructed the collinear relationships between tetraploid and diploid cottons (Fig. 3a) and found that nine SMXL orthologs of upland (tetraploid) cotton were present in the genomes of diploid cotton (A1 and A2). Similarly, analysis of collinear relationships between SMXL genes of island (tetraploid) and diploid cotton revealed that there were also nine homologs of G. barbadense SMXL genes in diploid cotton genomes (Fig. S3). These results suggest that the SMXL genes in tetraploid and diploid cottons have been highly conserved during evolution.
We further identified orthologous genes between tetraploid and diploid cottons, and then analyzed selection pressures on the SMXL gene family in cotton by calculating the ratio of non-synonymous to synonymous substitution rates (Ka/Ks) of orthologous gene pairs. The Ka/Ks values of all upland cotton and diploid cotton colinear gene pairs were between 0.2 and 0.7 (Fig. 3b), except for GhSMXL3-3-D and GrSMXL3-3, which was 1.30336 (Table S5); in addition, the Ka/Ks values of all colinear gene pairs of island cotton and diploid cotton were less than 1 (Fig. 3c), indicating that most SMXL gene family members were mainly subjected to purifying selection during evolution. Duplication type analysis revealed that segmental duplications were the main mechanisms for producing the expansion of the SMXL gene family in cotton (Table S5).
Expression patterns of GhSMXL genes in G. hirsutum
To investigate the expression patterns and potential biological roles of GhSMXL genes in upland cotton, we performed comprehensive tissue-specific expression analysis using RNA-seq data (Fig. 4a). The expression heatmap revealed three distinct classes among the 18 GhSMXL genes, according to their expression patterns across 15 different tissues. The first group of genes (GhSMAX1-1-A/D, GhSMXL7-1-A/D, and GhSMXL7-2-A/D) exhibited high expression levels in six different tissues and organs (root, stem, leaf, petal, receptacle, and sepal), as well as in fiber-bearing ovules at four time points [0, 1, 3, and 5 d post-anthesis (DPA)], ovules of 10 and 20 DPA, and fibers at three time points (10, 20, and 25 DPA). The second group of genes, consisting of GhSMXL3-2-A/D and GhSMXL3-3-A/D, was expressed at high levels primarily in roots, with minimal expression observed in other tissues. By contrast, GhSMXL6-D was detected in multiple tissues, including roots, stems, petals, receptacles, and sepals, with the highest expression level observed in sepals. The third class of genes contained seven genes, consisting of GhSMXL1-2-A/D, GhSMXL3-1-A/D, GhSMXL4-A/D, and GhSMXL6-A, most of which were mainly expressed in roots and stems, except for GhSMXL1-2-A/D. GhSMAX1-2-A showed the highest expression level in fibers at 10 DPA, followed by high expression in 3- and 5-day-old ovules, and moderate expression in stems, leaves, and receptacles, with low or no expression in other tissues. Additionally, GhSMXL6-A was highly expressed in sepals, and GhSMXL3-1-A was highly expressed in receptacles and fibers at 20 DPA. In general, the 18 GhSMXL genes displayed different expression patterns in various tissues, organs, and fiber developmental stages. Some homeologous gene pairs showed distinct tissue-specific expression differences, suggesting the potential functional diversification of the GhSMXL gene family.
Subsequently, we performed qRT-PCR experiments to further verify the reliability of the transcriptome data. We analyzed nine GhSMXL homeologous gene pairs (without distinguishing between gene pairs on the A and D donor). We used roots, stems, leaves, flowers, shoot apical meristems (SAMs), fiber-bearing ovules ( -3, 0, 3, and 5 DPA), and fibers (8, 12, 20, and 30 DPA) for expression validation. The expression levels of most genes were consistent with the RNA-seq data (Fig. S4). GhSMAX1-1 was highly expressed in leaves and its expression level in 20 DPA fibers showed a rapid increase, corresponding to RNA-seq data. GhSMXL3-1 had relatively high expression levels in stems, but its expression level in roots was the highest in the RNA-seq data, with relatively low expression in other tissues. GhSMXL3-2 had relatively high expression in roots, with the lowest expression at the beginning of fiber development, consistent with the RNA-seq data. GhSMXL3-3 had relatively high expression in roots, stems, and leaves, and low expression in ovules and at different periods of fiber development, whereas the RNA-seq data showed high expression only in roots. GhSMXL4 was highly expressed in roots and SAM, with low expression in ovules and during fiber development, consistent with RNA-seq data. The expression patterns of GhSMXL6, GhSMXL7-1, and GhSMXL7-2 were similar, with high expression only in stems and relatively low expression in other tissues and at different developmental stages. In short, the expression characteristics of GhSMXL genes revealed by RNA-seq and qRT-PCR assays suggested that GhSMXL genes regulate multiple aspects of cotton growth and development and may show functional differentiation.
We analyzed the expression of GhSMXL genes at different time points using qRT-PCR after treatment with the SL analogue GR24 for 48 h. Nine GhSMXL genes showed substantial responses to GR24 treatment and exhibited different expression patterns (Fig. 4b). In the first subgroup, the expression levels of GhSMAX1-1 and GhSMAX1-2 genes increased after GR24 treatment. Among them, GhSMAX1-1 gene expression decreased at 4 h after rising and peaked at 24 h, then declined slowly. However, GhSMAX1-2 expression first rose and reached its peak at 4 h, then declined slowly. In the second subgroup, the expression levels of GhSMXL6, GhSMXL7-1, and GhSMXL7-2 gradually increased within 48 h after GR24 treatment (Fig. 4b), reaching their peak of 6–8 times that of the control at 48 h. The third subgroup displayed differential expression patterns. GhSMXL3-1 expression gradually increased at first, peaked at 12 h, but remained high. GhSMXL3-2 expression decreased sharply after GR24 treatment and showed a decreasing trend over time. GhSMXL3-3 expression rose at first, peaking at 4 h, then declined, increasing slightly at 24 h, and reached its lowest level at 48 h, showing a decline in general. However, the expression level of GhSMXL4 in the fourth subgroup was similar to that of GhSMXL3-2, showing a sharp decline at first after GR24 treatment, then a slow rise, and finally a decline. These findings suggested that GhSMXL genes respond to SL signals. We speculate that GhSMXL6, GhSMXL7-1, and GhSMXL7-2 genes are regulated by SL phytohormone signals and may play important roles in the growth and development and phytohormone pathways of cotton.
Co-expression network of SMXL6/7–1/7–2 proteins in G. hirsutum and G. arboreum
In previous studies, Arabidopsis SMXL6/7/8 and rice D53 proteins have been shown to promote branching in the D14-MAX2 signaling pathway activated by SL through ubiquitination-mediated protein degradation [14, 17]. To investigate the potential biological functions of SMXL6, 7, and 8 in cotton, we further explored the co-expression networks of GhSMXL6, GhSMXL7-1, and GhSMXL7-2 with their homologs GaSMXL6, GaSMXL7-1, and GaSMXL7-2 in G. hirsutum and G. arboreum (Fig. 5). The SMXL6, SMXL7-1, and SMXL7-2 proteins of G. hirsutum and G. arboreum and proteins encoded by their co-expressed genes were annotated and functionally classified (Table S6). GhSMXL6/7–1/7–2 and GaSMXL6/7–1/7–2 proteins contain two Clp-N motifs and a P-loop structure of hydrolases (Table S6). Functional predictions indicated that the proteins interacting with GhSMXL6/7–1/7–2 and GaSMXL6/7–1/7–2 are mainly involved in bud development and branching, including WUSCHEL (WUS) , BELL1 (BEL1) , and myb domain protein 5 (MYB5) ; transcription factors involved in the regulation of plant growth and development, such as basic leucine zipper (bZIP) , GRAS , and AP2/ERF ; proteins related to phytohormone signaling pathways such as gibberellin 2-beta-dioxygenase 2 (GA2OX2)  and GA-stimulated transcript 1 (GAST1) , involved in the gibberellin pathway; proteins that respond to red/far-red light and anthocyanin formation, such as cryptochrome circadian regulator 1 (CRY1)  and cytochrome P450 (P450) ; proteins involved in metabolic biosynthesis pathways, such as trehalose-6-phosphate synthase (TPS) , hexokinase 2 (HXK2) , and dehydration-responsive element-binding (DREB) . These results suggest that the target proteins of GhSMXL6/7–1/7–2 are highly expressed in buds and floral organs and participate in biological processes related to branching development, phytohormone signaling, light response, and metabolic biosynthesis pathways, which provides a basis for further investigations into the functional mechanism of SMXL6/7–1/7–2 genes.
Silencing of GhSMXL genes inhibits cotton growth
To investigate the roles of GhSMXL genes in regulating cotton growth and development, we used the virus-induced gene silencing (VIGS) method to silence GhSMAX1-1 and GhSMAX1-2 genes. After silencing of two GhSMXL genes, phenotypic observations showed that growth of the apical meristem was slow or stagnant, development of axillary buds was inhibited, and plant height was dwarfed compared with the control (Fig. 6, Fig. S5). The TRV: SMAX1-1 plants showed an obvious dwarfing phenotype, and growth of the main stem was inhibited (Fig. 6a). The TRV: SMAX1-2 plants also showed dwarfing and delayed growth (Fig. 6b). These results suggest that GhSMXL genes play crucial roles in regulating cotton growth and development. Silencing of the GhSMAX1-1 or GhSMAX1-2 genes had a substantial impact on the growth of the apical meristem and the elongation of the stem, ultimately causing dwarfism in cotton plants. qRT-PCR assays revealed that the expression levels of GhSMAX1-1/2 were significantly reduced in silenced plants compared with the control group (Fig. 6c and d). Taken together, these results suggest that GhSMAX1-1/2 genes play important roles in promoting stem elongation and axillary bud development in cotton.
Evolution of the SMXL gene family in plants
In recent years, owing to the development of genome sequencing, comprehensive genome-wide analyses of SMXL genes haves been conducted in several species, with 8, 10, 31, and 12 SMXL gene family members identified in Arabidopsis, apple, soybean and poplar, respectively [50,51,52,53]. Moturu et al. (2018) dissected the expansion of the SMXL gene family in 58 plant genomes and found that the oldest SMAX1 gene underwent three duplications during the evolutionary process from bryophytes to land plants, resulting in the development of new genes or functional differentiation. Moturu et al. (2018) revealed that the expansion of SMXL genes likely involved 128 duplications and 276–284 loss events . Whole-genome duplication (WGD) and whole-genome triplication (WGT) mechanisms may have played a critical role in this process. It is postulated that the SMXL genes first appeared in liverworts. Previous studies have shown that the SMAX1 gene family emerged in a terrestrial plant (moss), and it is speculated that the SMXL gene family originated from Bryophyta . The results of molecular clock tests indicate that the members of the four branches of the SMXL gene family have undergone functional differentiation or neo-functionalization, indicated by their different evolutionary rates . Using fine-assembled plant genomes and diverse plant species, this study further introduced diploid and tetraploid cotton genomes, identifying 210 SMXL genes in 21 representative species from Chlorophyta, Bryophyta and angiosperms. Aquatic Chlorophyta do not contain any SMXL genes, and only four SMAX1 genes were identified in bryophytes. Bryophytes are considered to be the earliest plants to differentiate into stems and leaves, suggesting that the SMAX1 genes may play an important role in plant morphology formation and adaptation to environmental changes. Our results suggest that gene duplication is the primary driver of SMXL gene evolution in cotton, as the number of SMXL gene families in tetraploid cotton is twice that of diploid cotton. Phylogenetic analysis showed that plant SMXL genes can be classified into four branches (I, II, III, and IV) (Fig. 1b, Fig. 2a), with only one SMAX1 branch identified in bryophytes, consistent with previous research . Moreover, we found that SMXL6, SMXL7, and SMXL8 in the second subgroup of angiosperms are unique to dicotyledonous plants, suggesting that cotton SMXL6, SMXL7-1, and SMXL7-2 may play vital roles in species differentiation.
Structural characteristics and evolution of the SMXL gene family in cotton
Since the successful completion of the first G. raimondii genome sequencing and assembly in 2012 , 41 cotton genomes have been assembled and published in the past decade . These genomic resources have significantly advanced cotton functional genomics research. The D subgenome of G. hirsutum originates from G. raimondii D5, while the A subgenome comes from the common ancestor A0 genome of G. herbaceum and G. arboreum. G. arboreum did not originate from G. herbaceum but was independently domesticated from an unknown A0 genome . In this study, we identified 63 SMXL genes from the genomes of two tetraploid and three diploid cotton species. The presence of nine SMXL genes in diploid cotton and 18 SMXL genes in tetraploid cotton indicates that all SMXL genes in tetraploid cotton were generated by genome duplication. The distribution of SMXL genes on chromosomes is consistent between G. hirsutum and G. barbadense (Fig. S2). Notably, SMXL3-1 in the A and D subgenomes of tetraploid cotton is found at the ends of chromosomes 5 and 4, respectively. We hypothesize that the chromosome segment containing the SMXL3-1 gene underwent translocation during the process of cotton genome duplication. No tandem duplication was found in the cotton SMXL gene family, with most family members arising from segmental duplication. This suggests that segmental duplication events were the primary driver of expansion in the SMXL gene family in cotton. All diploid cotton species have nine SMXL genes that correspond to those in tetraploid cotton, suggesting that the SMXL gene family is highly conserved in cotton. Amino acid changes caused by nonsynonymous substitutions can result in alterations in protein conformation and function . The majority of SMXL genes have a Ka/Ks value of less than 1, indicating that members of the SMXL gene family were under selective pressure for adaptation to environmental changes during cotton tetraploidization. Most SMXL genes have been retained by purifying selection, suggesting their crucial roles in species differentiation.
The evolutionary characteristics and functional differentiation of homologous SMXL genes can be attributed to the presence of consistent and differential functional motifs in their conserved structure and peptide sequences . Within the same subgroup, the structure of genes and protein motifs are similar, with a few exceptions (Fig. 2b and c). Apart from GaSMAX1-1, GaSMXL7-1, GaSMXL3-3, and GheSMXL3-3, all other SMXL genes possess three exons and two introns. There are differences in the number of motifs and amino acids in different subgroups, but the Clp-N domains, P-loop NTPase, and EAR-like motifs of their amino acid sequences are relatively conserved (Fig. S1), which may reflect different biological functions between each subgroup. 3-D structures of SMXL proteins in G. hirsutum have revealed that proteins of the same subgroup exhibit a high degree of structural similarity. Moreover, the N, D1, M, and D2 domains of these proteins are likely to play a crucial role in performing specific functions . Previous studies demonstrated that the Arabidopsis SMXL7N domain contains a nuclear localization signal. The D2 domain of SMAX1 serves as a target for degradation induced by KAR and SL [34, 58]. Yeast two-hybrid experiments with SMAX1 and SMXL7 indicate that the SMXL protein interacts with KAI2 and D14 via the D1M domain, while the N domain is unnecessary . A conserved Arg-Gly-Lys-Thr (RGKT) motif contained in the D2a domain is crucial for MAX2-mediated D53/SMXL7 protein degradation. SMXL3, SMXL4, and SMXL5 proteins cannot be degraded because they lack a RGKT motif . Moreover, SMAX1 and SMXL2 lose their SCFMAX2-induced degradation activity after RGKT motif deletion . We found that protein structures in the same subgroup are highly similar, and the main domain (N, D1, M, and D2) of these proteins may play a key role in their specific functions. This study therefore provides valuable insights into the evolution of the plant SMXL gene family and our understanding of the structural characteristics of this family, which will facilitate further exploration of the biological functions of SMXL proteins.
Cotton SMXL genes regulate various aspects of growth and development
Previous research revealed that members of sub-group I (AtSMAX1/AtSMXL2) play a critical role in the KAR signaling pathway and directly participate in the regulation of seed germination and leaf development in Arabidopsis [14, 59]. These proteins, together with SMXL2, also function downstream in the KAI signaling pathway to regulate root and root hair development in conjunction with SMXL2 . They also promote hypocotyl elongation, resulting in crosstalk between the D14-SMAX1 signal and targeted degradation of SMAX1 protein under GR24 treatment and osmotic stress . By contrast, members of sub-group II (AtSMXL6, 7, 8, and rice D53) encode key inhibitors of the SL signaling pathway. Upon activation of the SL signaling pathway, SMXL6/7/8 proteins are degraded by proteasomes, leading to the inhibition of branching in Arabidopsis and tillering in rice [14, 17, 57]. Members of sub-groups III and IV (AtSMXL3, AtSMXL4, and AtSMXL5) exhibit different characteristics from other members of the SMXL family; they are not involved in the SL or KAR signaling pathways and do not rely on MAX2-mediated protein degradation . Recent research shows that SMXL genes have a wide range of functions in plant growth, development, and stress responses . However, the molecular mechanisms underlying phytohormone signaling and abiotic stress responses remain poorly understood. Analysis of the upstream 2000-bp sequence of the GhSMXL gene transcription start site revealed the presence of various cis-acting elements associated with light, growth and development, phytohormones, and stress (Fig. S6). This suggests that GhSMXL genes may execute functional diversity. GhSMXL genes display diverse patterns of expression across various tissues, organs, and stages of fiber development, with homologous genes exhibiting similar expression characteristics but individual genes showing differential expression. GR24 treatment of cotton seedlings produces a stable increase in the expression levels of SMXL6/7–1/7–2 genes, and rising trends in the expression levels of GhSMAX1-1 and GhSMAX1-2 genes, albeit with fluctuating expression levels. Co-expression network analysis identified proteins that may interact to perform a specific function; for example, the anti-florigen protein TFL1, Cryptochrome Circadian Regulator 1 (CRY1), cytochrome enzyme P450, meristem-associated WUSCHEL protein, gibberellin biosynthesis-related gibberellin 2-beta-dioxygenase 2 (GA2OX2), and GA-stimulated transcript 1 (GAST1) proteins, all of which are involved in phytohormonal signaling pathways. According to the expression characteristics of GhSMXL genes in different tissues and after GR24 treatment, we speculate that cotton SMXL6/7–1/7–2 genes play critical roles in the SL signaling pathway with some functional redundancy. These results imply that the members of the cotton SMXL gene family respond to SL signals and participate in plant growth and development.
GhSMAX1-1/2 play an important role in the regulation of cotton growth
The D14/KAI-SMXL complex interacts with SCFMAX2, resulting in degradation of SMXL proteins . D14 and KAI receptors need F-box protein MAX2 to perceive signaling molecules. Arabidopsis SMAX1 and its homologous protein rice D53 serve as downstream targets of MAX2. The Arabidopsis max2 mutant has a dwarf stature and increased lateral branching phenotype, while SMXL6, 7, and 8 can restore max2 to the wild-type phenotype . Thus, SMXL6, 7, and 8, as target proteins of MAX2, act as suppressors of SL signals, functioning in the regulation of plant height and branching. The Arabidopsis smax1 smxl2 mutant has a very short hypocotyl. Genetic evidence shows that degradation of SMXL6, 7, and 8 proteins does not affect hypocotyl growth, while interactions between SMAX1, SMXL2, and D14 promote hypocotyl elongation in response to GR24 in Arabidopsis . We found that downregulation of GhSMAX1-1 and GhSMAX1-2 genes through VIGS inhibits stem elongation and axillary bud development, resulting in a significant decline in cotton plant height. These results further confirm that SMAX1 homologs play critical roles in regulating hypocotyl elongation to change plant architecture by affecting plant height.
In this study, we identified 210 SMXL genes from 21 plant species and divided these into four different phylogenetic clades with functional diversities. Homologs of SMAX1 have been well conserved during the evolution of terrestrial plants. Among them, 63 SMXL genes identified from five Gossypium species were clustered into four clades, and all cotton SMXL proteins contained conserved Clp-N domains, P-loop NTPases, and EAR motifs. The GhSMXL gene is highly expressed in root and stem tissues, and the expression level of GhSMXL6/7–1/7–2 is considerably responsive to GR24 treatment. Protein network analysis showed that the target proteins of SMXL6, SMXL7-1, and SMXL7-2 in G. hirsutum and G. arboreum are involved in the development of shoots and floral organs. Silencing of GhSMAX1-1 and GhSMAX1-2 genes resulted in plant dwarfism and inhibition of axillary bud development. The present study suggests functional diversities of the plant SMXL gene family and the important roles of GhSMXL in the SL signaling pathway. Rapid technological progress should enable us to dissect the functional specificity of plant SMXL proteins to understand their importance in cotton development.
In summer of 2019, G. hirsutum L. cv. ‘XLZ 33’ were field-grown under natural conditions at Shihezi University's experimental farm in Shihezi City, Xinjiang, China (44°20′ N, 86°0′ E), as previously described . We have obtained the permission to collect plant material G. hirsutum. Samples of roots, stems, true leaves, and SAM at 40 days post-planting and flowers at -3, 0, 3, and 5 days post-anthesis (DPA), as well as fibers at 8, 12, 20, and 30 DPA, were collected, respectively. Seeds of G. hirsutum L. standard line TM-1 were grown in Petri dishes with nutrient soil and vermiculite mixture (1:1). Petri dishes were placed in a growth chamber (16-h light/8-h dark, 200 µmol m–2 s–1). Plants growing for 14 days with 2 days of 50 ml Hoagland nutrient solution irrigation were treated with rac-GR24 (10 μM) (purchased from Solarbio) by wiping the cotton leaves . Leaves from three consistent, young seedlings were collected as three biological replicates at 0, 1, 4, 12, 24, and 48 h after GR24 treatment. All tissues samples were frozen immediately in liquid nitrogen and stored at -80 ℃ for RNA extraction.
Identification of SMXL gene family members in 21 plant species
Genome sequences of five cotton species, G. hirsutum, G. barbadense, G. raimondii, G. arboreum, and G. herbaceum, were obtained from the cotton genome database (table S7). Genome sequences of other Viridiplantae—P. patens, V. vinifera, O. sativa, Z. mays, T. cacao, O. lucimarinus, C. reinhardtii, S. italica, S. lycopersicum, S. tuberosum, M. esculenta, M. truncatula, Brassica rapa, Cucumis sativus, Glycine max, and C. clementina—were downloaded from the Phytozome database (https://phytozome.jgi.doe.gov/pz/portal.html). The protein sequences of SMXLs in A. thaliana were obtained from the TAIR website (https://www.Arabidopsis.org) and used as probes for homology searches with the BLASTP program (E-value < 1 × 10−5). Protein sequences identified were submitted to the PfamScan program (https://pfam.xfam.org/) to obtain candidate SMXL proteins using domain analysis. SMXL homologs in cotton were classified and named according to their similarity to A. thaliana and the structural characteristics of genes. Molecular weight and isoelectric point of proteins were estimated using the ExPASy protein analysis tool (https://web.expasy.org/), and subcellular localization was predicted using the ProtComp 9.0 program (http://linux1.softberry.com/berry.phtml).
Phylogenetic tree reconstruction, gene structure, and protein motif distribution
Divergence times of the above 21 plant species on a scale of MYA (million years ago) with confidence intervals were estimated using the TIMETREE website (http://www.timetree.org/). A Newick file was generated, and a species phylogenetic tree was reconstructed using MEGA11 software . Members of the SMXL gene family were identified and clustered using ClustalW software for multiple sequence alignment of protein sequences with default parameters . A phylogenetic tree of the SMXL genes was reconstructed using the neighbor-joining (NJ) method in MEGA11 software  to analyze their evolutionary relationships. The positions of introns and exons in the SMXL genes in cotton were extracted from the gff file of the cotton database. Multiple sequence alignment was performed using MUSCLE software (https://www.ebi.ac.uk/Tools/msa/muscle/), and conserved motifs in the protein sequences were identified using the online MEME website (http://meme-suite.org/meme) with 15 motifs and default settings. Gene structure diagrams were drawn using TBtools software .
Analysis of cis-acting elements and prediction of 3D protein structure
Promoter sequences were obtained by considering the 2000 bp upstream of the start codon. Plant Cis-acting Regulatory Element (Plant CARE) software  was used to search for cis-acting regulatory elements, and the results were visualized using the R package pheatmap. Protein structure prediction was performed using the AlphaFold2 program  based on known amino acid sequences of the protein family; an initial model was generated, followed by outputting the protein 3D structure in the PyMOL software (https://pymol.org/2/).
Chromosome positions of SMXL genes were determined by identifying their start and end positions from the cotton genome database gff3 file. A chromosome location map was designed using MapGene2C online software (http://mg2c.iask.in/mg2c_v2.1/) .
Comparative evolutionary analysis
Collinearity analysis was conducted to predict homologous genes of SMXL between species using MCScanX  with default parameters. Orthologous genes between diploid and tetraploid cotton were identified using BLASTP alignment with E-value < 1 × 10 − 5. Duplication types of the cotton SMXL gene family were analyzed using the downstream analysis program of MCScanX software, duplicate_gene_classifier . Circos software  was used to visualize collinearity relationships of homologous genes. Homologous genes were aligned using the Muscle method in paraAT2.0 software . The non-synonymous substitution rate (Ka), synonymous substitution rate (Ks), and Ka/Ks ratio were calculated using KaKs_Calculator2.0  to evaluate the selective pressures on SMXL genes during the evolutionary process. A Ka/Ks ratio > 1, < 1, or = 1 indicates positive, negative, or neutral evolution, respectively .
Expression profiles of GhSMXL genes
RNA-seq data of the different tissues of G. hirsutum TM-1—roots, stems, leaves, petals, receptacles, sepals, and ovules at 0, 1, 3, 5, 10, and 20 DPA and fibers at 10, 20, and 25 DPA . The transcriptome data were aligned to the cotton genome using TopHat2 with default settings . Cufflinks software was used to calculate the FPKM (fragments per kilobase of exon model per million mapped reads) value for each gene. Expression levels of GhSMXL genes in nine tissues were represented by log-transformed FPKM (log2[FPKM + 1]) values, and the R package pheatmap was used to visualize data.
Total RNA for each sample was isolated using a FastPure Plant Total RNA Isolation Kit (Polysaccharides & Polyphenolics–rich) (Vazyme, Nanjing) according to the manufacturer’s guidance. Total RNA was reverse transcribed into cDNA using a HiScript III RT SuperMix for qRT-PCR (Vazyme, Nanjing) kit. qRT-PCR was performed using a ChamQ Universal SYBR qPCR (Vazyme, Nanjing) kit on an Applied Biosystems 7500 Fast Real-Time PCR System (Life Technologies, Foster City, CA, USA). Gene-specific primers were designed using the online primer3 software (https://bioinfo.ut.ee/primer3-0.4.0/) (Table S8). qRT-PCR conditions were as previously described , with an initial denaturation step at 95 °C for 20 s, followed by 40 cycles of denaturation at 95 °C for 3 s and annealing/extension at 60 °C for 30 s. GhUBQ7 was used as a reference gene . Three independent qRT-PCR experiments were carried out, each with three mechanical repeats, and relative expression levels were determined using the 2–∆Ct method .
Protein co-expression network analysis
The ccNET database (http://structuralbiology.cau.edu.cn/gossypium) was utilized, which integrates the cotton genome, transcriptome, epigenome, and functional annotations. Protein sequences of SMXL6, SMXL7-1, and SMXL7-2 from G. hirsutum and G. arboreum were submitted to the website to compare the co-expression networks of SMXL proteins in diploid and tetraploid cottons. Key target proteins of SMXL6, SMXL7-1, and SMXL7-2 were identified, and functional annotations were performed.
For VIGS assays, approximately 200-bp nucleotide sequences of the open reading frames of SMAX1-1 and SMAX1-2 genes were amplified by PCR using gene-specific primers designed using the SGN VIGS Tool online software (https://vigs.solgenomics.net/)  (Table S8). The fragments obtained were then inserted into the pTRV2 vector to create pTRV2: GhSMXL constructs. The VIGS experiment was performed as described by Liu et al. (2021). Each pTRV: GhSMXL plasmid was introduced into Agrobacterium tumefaciens strain GV3101 and infiltrated into the cotyledons of 2-week-old G. hirsutum L. cv. ‘XLZ 33’ .
Availability of data and materials
All data sheets and codes to process data are available upon request to the corresponding author, Xianzhong Huang (Huangxz@ahstu.edu.cn). The datasets generated and/or analysed during the current study are available in the NCBI Sequence Read Archive under BioProject ID: PRJNA490626.
SUPPRESSOR OF MAX2 1 (SMAX1)-like
Ethylene response factor-associated amphiphilic repression
More axillary branching
Million years ago
Quantitative real-time PCR
Fragments per kilobase of exon model per million mapped reads
Virus-induced gene silencing
Gray WM. Hormonal regulation of Plant Growth and Development. PLOS Biol. 2004;2(9):e311.
Cook CE, Whichard LP, Turner B, Wall ME, Egley GH. Germination of Witchweed (Striga lutea Lour.): isolation and Properties of a potent stimulant. Science. 1966;154(3753):1189–90.
Akiyama K, Hayashi H. Strigolactones: chemical signals for fungal symbionts and parasitic weeds in plant roots. Ann Bot. 2006;97(6):925–31.
Brewer PB, Koltai H, Beveridge CA. Diverse roles of strigolactones in plant development. Mol Plant. 2013;6(1):18–28.
Delaux PM, Xie X, Timme RE, Puech-Pages V, Dunand C, Lecompte E, Delwiche CF, Yoneyama K, Becard G, Sejalon-Delmas N. Origin of strigolactones in the green lineage. New Phytol. 2012;195(4):857–71.
Gomez-Roldan V, Fermas S, Brewer PB, Puech-Pages V, Dun EA, Pillot JP, Letisse F, Matusova R, Danoun S, Portais JC, Bouwmeester H, Becard G, Beveridge CA, Rameau C, RochangeSF. Strigolactone inhibition of shoot branching. Nature. 2008;455(7210):189–94.
Agusti J, Herold S, Schwarz M, Sanchez P, Ljung K, Dun EA, Brewer PB, Beveridge CA, Sieberer T, Sehr EM, Greb T. Strigolactone signaling is required for auxin-dependent stimulation of secondary growth in plants. Proc Natl Acad Sci U S A. 2011;108(50):20242–7.
Rasmussen A, Mason MG, De Cuyper C, Brewer PB, Herold S, Agusti J, Geelen D, Greb T, Goormachtig S, Beeckman T. Strigolactones suppress adventitious rooting in Arabidopsis and pea. Plant Physiol. 2012;158(4):1976–87.
Ueda H, Kusaba M. Strigolactone regulates leaf senescence in concert with ethylene in Arabidopsis. Plant Physiol. 2015;169(1):138–47.
Nelson DC, Flematti GR, Ghisalberti EL, Dixon KW, Smith SM. Regulation of seed germination and seedling growth by chemical signals from burning vegetation. Annu Rev Plant Biol. 2012;63:107–30.
Nelson DC, Scaffidi A, Dun EA, Waters MT, Flematti GR, Dixon KW, Beveridge CA, Ghisalberti EL, Smith SM. F-box protein MAX2 has dual roles in karrikin and strigolactone signaling in Arabidopsis thaliana. P Natl Acad Sci USA. 2011;108(21):8897–902.
Zheng J, Hong K, Zeng L, Wang L, Kang S, Qu M, Dai J, Zou L, Zhu L, Tang Z, Meng X, Wang B, Hu J, Zeng D, Zhao Y, Cui P, Wang Q, Qian Q, Wang Y, Li J, Xiong G. Karrikin signaling acts parallel to and additively with strigolactone signaling to regulate rice mesocotyl elongation in darkness. Plant Cell. 2020;32(9):2780–805.
Waters MT, Nelson DC, Scaffidi A, Flematti GR, Sun YK, Dixon KW, Smith SM. Specialisation within the DWARF14 protein family confers distinct responses to karrikins and strigolactones in Arabidopsis. Development. 2012;139(7):1285–95.
Soundappan I, Bennett T, Morffy N, Liang Y, Stanga JP, Abbas A, Leyser O, Nelson DC. SMAX1-LIKE/D53 family members enable distinct MAX2-Dependent responses to Strigolactones and Karrikins in Arabidopsis. Plant Cell. 2015;27(11):3143–59.
Jiang L, Liu X, Xiong G, Liu H, Chen F, Wang L, Meng X, Liu G, Yu H, Yuan Y, Yi W, Zhao L, Ma H, He Y, Wu Z, Melcher K, Qian Q, Xu HE, Wang Y, Li J. DWARF 53 acts as a repressor of strigolactone signalling in rice. Nature. 2013;504(7480):401–5.
Kerr SC, Patil SB, de Saint Germain A, Pillot JP, Saffar J, Ligerot Y, Aubert G, Citerne S, Bellec Y, Dun EA, Beveridge CA, Rameau C. Integration of the SMXL/D53 strigolactone signalling repressors in the model of shoot branching regulation in Pisum sativum. Plant J. 2021;107(6):1756–70.
Wang L, Wang B, Jiang L, Liu X, Li X, Lu Z, Meng X, Wang Y, Smith SM, Li J. Strigolactone Signaling in Arabidopsis regulates shoot development by Targeting D53-Like SMXL Repressor Proteins for Ubiquitination and Degradation. Plant Cell. 2015;27(11):3128–42.
Wallner ES, López-Salmerón V, Belevich I, Poschet G, Jung I, Grünwald K, Sevilem I, Jokitalo E, Hell R, Helariutta Y, Agustí J, Lebovka I, Greb T. Strigolactone- and Karrikin-Independent SMXL Proteins are central regulators of phloem formation. Curr Biol. 2017;27(8):1241–7.
Wang L, Xu Q, Yu H, Ma H, Li X, Yang J, Chu J, Xie Q, Wang Y, Smith SM, Li J, Xiong G, Wang B. Strigolactone and Karrikin Signaling Pathways Elicit Ubiquitination and Proteolysis of SMXL2 to regulate hypocotyl elongation in Arabidopsis. Plant Cell. 2020;32(7):2251–70.
Li YG, Ren H, Zhang YJ, Li RN, Ai H, Huang XZ. Analysis of the molecular evolution of the PEBP gene family in cruciferous plants. Sheng Wu Duo Yang Xing. 2022;30(6):21545.
Wang Y, Yao R, Du X, Guo L, Chen L, Xie D, Smith SM. Molecular basis for high ligand sensitivity and selectivity of strigolactone receptors in Striga. Plant Physiol. 2021;185(4):1411–28.
Xie Y, Liu Y, Ma M, Zhou Q, Zhao Y, Zhao B, Wang B, Wei H, Wang H. Arabidopsis FHY3 and FAR1 integrate light and strigolactone signaling to regulate branching. Nat Commun. 2020;11(1):1955.
Wickland DP, Hanzawa Y. The FLOWERING LOCUS T/TERMINAL FLOWER 1 Gene Family: functional evolution and molecular mechanisms. Mol Plant. 2015;8(7):983–97.
Zhu Y, Klasfeld S, Jeong CW, Jin R, Goto K, Yamaguchi N, Wagner D. TERMINAL FLOWER 1-FD complex target genes and competition with FLOWERING LOCUS T. Nat Commun. 2020;11(1):5118.
Collani S, Neumann M, Yant L, Schmid M. FT modulates genome-wide DNA-Binding of the bZIP transcription factor FD. Plant Physiol. 2019;180(1):367–80.
Goretti D, Silvestre M, Collani S, Langenecker T, Schmid M. TERMINAL FLOWER 1 functions as a mobile transcriptional cofactor in the shoot apical meristem. Plant Physiol. 2020;182(4):0086702019.
Zhu Y, Klasfeld S, Wagner D. Molecular regulation of plant developmental transitions and plant architecture via PEPB family proteins: an update on mechanism of action. J Exp Bot. 2021;72(7):2301–11.
Wang Y, Li J. Molecular basis of Plant Architecture. Annu Rev Plant Biol. 2008;59(1):253–79.
Chen W, Yao J, Chu L, Yuan Z, Li Y, Zhang Y. Genetic mapping of the nulliplex-branch gene (gb_nb1) in cotton using next-generation sequencing. Theor Appl Genet. 2015;128(3):539–47.
Huang X, Liu H, Ma B. The current Progresses in the genes and networks regulating Cotton Plant Architecture. Front Plant Sci. 2022;13: 882583.
Peng HE, Zhang HZ, Zhang L, Jiang B, Xiao GH, Jia-Ning YU. The GhMAX2 gene regulates plant growth and fiber development in cotton. J Integr Agr. 2022;21(6):13.
Zhao L, Fang J, Xing J, Liu W, Peng P, Long H, Zhao JF, Zhang WL. Identification and functional analysis of two cotton orthologs of MAX2 which Control shoot lateral branching. Plant Mol Biol. 2017;35(5):480–90.
Jumper J, Evans R, Pritzel A, Green T, Figurnov M, Ronneberger O, Tunyasuvunakool K, Bates R, Žídek A, Potapenko A, Bridgland A, Meyer C, Kohl SAA, Ballard AJ, Cowie A, Romera-Paredes B, Nikolov S, Jain R, Adler J, Back T, Petersen S, Reiman D, Clancy E, Zielinski M, Steinegger M, Pacholska M, Berghammer T, Bodenstein S, Silver D, Vinyals O, Senior AW, Kavukcuoglu K, Kohli P, Hassabis D. Highly accurate protein structure prediction with AlphaFold. Nature. 2021;596(7873):583–9.
Khosla A, Morffy N, Li Q, Faure L, Chang SH, Yao J, Zheng J, Cai ML, Stanga J, Flematti GR, Waters MT, Nelson DC. Structure-function analysis of SMAX1 reveals domains that mediate its Karrikin-Induced Proteolysis and Interaction with the receptor KAI2. Plant Cell. 2020;32:tpc0075202019.
Huang G, Wu Z, Percy RG, Bai M, Li Y, Frelichowski JE, Hu J, Wang K, Yu JZ, Zhu Y. Genome sequence of Gossypium herbaceum and genome updates of Gossypium arboreum and Gossypium hirsutum provide insights into cotton A-genome evolution. Nat Genet. 2020;52(5):516–24.
Wang K, Wang Z, Li F, Ye W, Wang J, Song G, Yue Z, Cong L, Shang H, Zhu S, Zou C, Li Q, Yuan Y, Lu C, Wei H, Gou C, Zheng Z, Yin Y, Zhang X, Liu K, Wang B, Song C, Shi N, Kohel RJ, Percy RG, Yu JZ, Zhu YX, Wang J, Yu S. The draft genome of a diploid cotton Gossypium raimondii. Nat Genet. 2012;44:1098–103.
Wang J, Tian C, Zhang C, Shi B, Cao X, Zhang TQ, Zhao Z, Wang JW, Jiao Y. Cytokinin Signaling activates WUSCHEL expression during Axillary Meristem initiation. Plant Cell. 2017;29:1373–87.
Ikeda T, Tanaka W, Toriba T, Suzuki C, Maeno A, Tsuda K, Shiroishi T, Kurata T, Sakamoto T, Murai M, Matsusaka H, Kumamaru T, Hirano HY. BELL1-like homeobox genes regulate inflorescence architecture and meristem maintenance in rice. Plant J. 2019;98(3):465–78.
Li SF, Milliken ON, Pham H, Seyit R, Napoli R, Preston J, Koltunow AM, Parish RW. The Arabidopsis MYB5 transcription factor regulates mucilage synthesis, seed Coat Development, and trichome morphogenesis. Plant cell. 2009;21:72–89.
Li H, Chen J, Zhao Q, Han Y, Li L, Sun C, Wang K, Wang Y, Zhao M, Chen P, Lei J, Wang Y, Zhang M. Basic leucine zipper (bZIP) transcription factor genes and their responses to drought stress in ginseng, Panax ginseng C.A. Meyer. BMC Genom. 2021;22(1):316.
Waseem M, Nkurikiyimfura O, Niyitanga S, Jakada B, Shaheen I, Aslam M. GRAS transcription factors emerging regulator in plants growth, development, and multiple stresses. Mol Biol Rep. 2022;49(10):9673–85.
Xie W, Ding C, Hu H, Dong G, Zhang G, Qian Q, Ren D. Molecular events of Rice AP2/ERF transcription factors. Int J Mol Sci. 2022;23: 12013.
Sakamoto T, Kobayashi M, Itoh H, Tagiri A, Kayano T, Tanaka H, Iwahori S, Matsuoka M. Expression of a gibberellin 2-Oxidase gene around the shoot apex is related to phase transition in Rice1. Plant Physiol. 2001;125(3):1508–16.
Shi LF, Olszewski N. Gibberellin and abscisic acid regulate GAST1 expression at the level of transcription. Plant Mol Biol. 1999;38:1053–60.
Gao L, Liu Q, Zhong M, Zeng N, Deng W, Li Y, Wang D, Liu S, Wang Q. Blue light-induced phosphorylation of Arabidopsis cryptochrome 1 is essential for its photosensitivity. J Integr Plant Biol. 2022;64:1724–38.
Cornet F, Pillot JP, Le Bris P, Pouvreau JB, Arnaud N, de Saint Germain A, Rameau C. Strigolactones (SLs) modulate the plastochron by regulating KLUH (KLU) transcript abundance in Arabidopsis. Plant Physiol. 2021;232(5):1909–16.
Wang X, Du Y, Yu D. Trehalose phosphate synthase 5-dependent trehalose metabolism modulates basal defense responses in Arabidopsis thaliana. J Integr Plant Biol. 2019;61(4):509–27.
Xiao W, Sheen J, Jang JC. The role of hexokinase in plant sugar signal transduction and growth and development. Plant Mol Biol. 2000;44(4):451–61.
Taylor-Teeples M, Lin L, de Lucas M, Turco G, Toal TW, Gaudinier A, Young NF, Trabucco GM, Veling MT, Lamothe R, Handakumbura PP, Xiong G, Wang C, Corwin J, Tsoukalas A, Zhang L, Ware D, Pauly M, Kliebenstein DJ, Dehesh K, Tagkopoulos I, Breton G, Pruneda-Paz JL, Ahnert SE, Kay SA, Hazen SP, Brady SM. An Arabidopsis gene regulatory network for secondary cell wall synthesis. Nature. 2015;517(7536):571–5.
Moturu TR, Thula S, Singh RK, Nodzynski T, Vareková RS, Friml J, Simon S. Molecular evolution and diversification of the SMXL gene family. J Exp Bot. 2018;69(9):2367–78.
Li R, An JP, You CX, Wang XF, Hao YJ. Genome-wide analysis and identification of the SMXL gene family in apple (Malus x domestica). Tree Genet Genomes. 2018;14(4):61.
Zhang H, Wang L, Gao Y, Guo Y, Zheng N, Xu X, Xu M, Wang W, Liu C, Liu W, Yang W. Genome-wide identification of SMXL Gene family in soybean and expression analysis of GmSMXLs under Shade stress. Plants. 2022;11(18): 2410.
Sun M, Wang D, Liu C, Liu Y, Niu M, Wang J, Li J. Genome-wide identification and analysis of the SUPPRESSOR of MAX2 1-LIKE gene family and its interaction with DWARF14 in poplar. BMC Plant Biol. 2023;23(1):105.
Yang Z, Gao C, Zhang Y, Yan Q, Hu W, Yang L, Wang Z, Li F. Recent progression and future perspectives in cotton genomic breeding. J Integr Plant Biol. 2022;0:1–22.
Wu W, Liu Y, Wang Y, Li H, Liu J, Tan J, He J, Bai J, Ma H. Evolution analysis of the Aux/IAA gene family in plants shows dual Origins and Variable Nuclear localization signals. Int J Mol Sci. 2017;18(10): 2107.
Xu G, Guo C, Shan H, Kong H. Divergence of duplicate genes in exon-intron structure. Proc Natl Acad Sci U S A. 2012;109(4):1187–92.
Liang Y, Ward S, Li P, Bennett T, Leyser O. SMAX1-LIKE7 Signals from the Nucleus to Regulate Shoot Development in Arabidopsis via Partially EAR Motif-Independent Mechanisms. Plant Cell. 2016;28:tpc .00286.02016.
Shabek N, Ticchiarelli F, Mao H, Hinds TR, Leyser O, Zheng N. Structural plasticity of D3–D14 ubiquitin ligase in strigolactone signalling. Nature. 2018;563(7733):652–6.
Stanga J, Smith S, Briggs W, Nelson D. SUPPRESSOR OF MAX2 1 (SMAX1) controls seed germination and seedling development in Arabidopsis thaliana. Plant Physiol. 2013;163(1):318–30.
Villaécija-Aguilar J, Hamon-Josse M, Carbonnel S, Kretschmar A, Gutjahr C. SMAX1/SMXL2 regulate root and root hair development downstream of KAI2-mediated signalling in Arabidopsis. PLoS Genet. 2019;15(8):e1008327.
Li Q, Martín-Fontecha ES, Khosla A, White ARF, Chang S, Cubas P, Nelson DC. The strigolactone receptor D14 targets SMAX1 for degradation in response to GR24 treatment and osmotic stress. Plant Commun. 2022;3(2):100303.
Temmerman A, Guillory A, Bonhomme S, Goormachtig S, Struk S. Masks start to Drop: suppressor of MAX2 1-Like proteins reveal their many faces. Front Plant Sci. 2022;13: 887232.
Guo DL, Li C, Dong R, Li XB, Xiao XW, Huang XZ. Molecular cloning and functional analysis of the FLOWERING LOCUS T (FT) homolog GhFT1 from Gossypium hirsutum. J Integr Plant Biol. 2015;57(6):522–33.
Nakamura H, Xue YL, Miyakawa T, Hou F, Qin HM, Fukui K, Shi X, Ito E, Ito S, Park SH, Miyauchi Y, Asano A, Totsuka N, Ueda T, Tanokura M, Asami T. Molecular mechanism of strigolactone perception by DWARF14. Nat Commun. 2013;4:2613.
Tamura K, Stecher G, Kumar S. MEGA11: Molecular Evolutionary Genetics Analysis Version 11. Mol Biol Evol. 2021;38(7):3022–7.
Thompson JD, Gibson TJ, Higgins DG. Multiple sequence alignment using ClustalW and ClustalX. Curr Protoc Bioinformatics. 2003;00(1):231–222.
Chen C, Chen H, Zhang Y, Thomas HR, Xia R. TBtools: an integrative Toolkit developed for interactive analyses of big Biological Data. Mol Plant. 2020;13(8):1194–202.
Lescot M. PlantCARE, a database of plant cis-acting regulatory elements and a portal to tools for in silico analysis of promoter sequences. Nucleic Acids Res. 2002;30(1):325–7.
Jiangtao C, Yingzhen K, Qian W, Yuhe S, Daping G, Jing L, Guanshan L. MapGene2Chrom, a tool to draw gene physical map based on Perl and SVG languages. Hereditas. 2015;37(1):91–7.
Wang Y, Tang H, Debarry JD, Tan X, Li J, Wang X, Lee TH, Jin H, Marler B, Guo H, Kissinger JC, Paterson AH. MCScanX: a toolkit for detection and evolutionary analysis of gene synteny and collinearity. Nucleic Acids Res. 2012;40(7):e49–9.
Rasche H, Hiltemann S. Galactic Circos: user-friendly Circos plots within the Galaxy platform. GigaScience. 2020;9(6): giaa065.
Zhang Z, Xiao J, Wu J, Zhang H, Liu G, Wang X, Dai L. ParaAT: a parallel tool for constructing multiple protein-coding DNA alignments. Biochem Biophys Res Commun. 2012;419(4):779–81.
Wang D, Zhang Y, Zhang Z, Zhu J, Yu J. KaKs_Calculator 2.0: a Toolkit incorporating Gamma-Series methods and sliding window strategies. Genom Proteom Bioinf. 2010;8(1):77–80.
Yadav C, Bonthala VS, Muthamilarasan M, Pandey G, Khan Y, Prasad M. Genome-wide development of transposable elements-based markers in foxtail millet and construction of an integrated database. DNA Res. 2014;22(1):79–90.
Hu Y, Chen J, Fang L, Zhang Z, Ma W, Niu Y, Ju L, Deng J, Zhao T, Lian J, Baruch K, Fang D, Liu X, Ruan YL, Rahman MU, Han J, Wang K, Wang Q, Wu H, Mei G, Zang Y, Han Z, Xu C, Shen W, Yang D, Si Z, Dai F, Zou L, Huang F, Bai Y, Zhang Y, Brodt A, Ben-Hamo H, Zhu X, Zhou B, Guan X, Zhu S, Chen X, Zhang T. Gossypium barbadense and Gossypium hirsutum genomes provide insights into the origin and evolution of allotetraploid cotton. Nat Genet. 2019;51(4):739–48.
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;25(4):R36.
Cai D, Liu H, Sang N, Huang X. Identification and characterization of CONSTANS-like (COL) gene family in upland cotton (Gossypium hirsutum L). PLoS ONE. 2017;12: e0179038.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2(-Delta Delta C(T)) method. Methods. 2001;25(4):402–8.
Femandez-Pozo N, Rosli HG, Martin GB, Mueller LA. The SGN VIGS Tool: user-friendly Software to Design Virus-Induced Gene silencing (VIGS) constructs for Functional Genomics. Mol Plant. 2015;8(3):486–8.
Liu H, Huang X, Ma B, Zhang T, Sang N, Zhuo L, Zhu J. Components and functional diversification of Florigen Activation Complexes in Cotton. Plant Cell Physiol. 2021;62(10):1542–55.
This research was supported by the National Natural Science Foundation of China (No. 31860393, 31360366), and the University Discipline (Major) Top Talent Cultivation Funding Project (No. gxbjZD2021072), and the Academic Leaders of Anhui University of Science and Technology Introduction Starting Fund (No. NXYJ202001).
Ethics approval and consent to participate
We confirmed that all methods were performed in accordance with the relevant guidelines and regulations. We complied with all relevant institutional, national, and international guide and legislation.
Consent for publication
The authors declare no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary table S1. Identification of SMXL gene family members from 21 plants. Supplementary table S2. Information of SMXL gene family in cotton (Gossypium sp.). Supplementary table S3. Amino acid sequences of motif 1-15 of GhSMXL proteins. Supplementary table S4. Information of the homologus·SMXL proteins in cotton (Gossypium sp.). Supplementary table S5. Syntenic SMXL gene pairs between allotetraploid and diploid cottons. Supplementary table S6. Analysis of network of SMXL6, 7-1, and 7-2 proteins in G.hirsutum and G.arboreum. Supplementary table S7. Information of the cotton database versions and websites used in this study. Supplementary table S8. Primer sequence information used in this study.
Fig. S1. Multi nucleotide sequence alignment of GhSMXL genes. Fig. S2. Chromosomal distributions of SMXL genes in Gossypium spp. Fig. S3. Collinearity analysis of G. barbadense (At and Dt) orthologs in the genomes G. raimondii, G. arboreum, and G. herbaceum. Fig. S4. Expression characteristics of nine GhSMXL genes in 13 tissues determined using qRT-PCR. Fig. S5. Plant heights of the GhSMAX1-1 and GhSMAX1-2 silenced plants. Fig. S6. Information of cis-acting elements of GhSMXL genes.
About this article
Cite this article
Ma, B., Zhu, J. & Huang, X. Diversification of plant SUPPRESSOR OF MAX2 1 (SMAX1)-like genes and genome-wide identification and characterization of cotton SMXL gene family. BMC Plant Biol 23, 419 (2023). https://doi.org/10.1186/s12870-023-04421-6
- Plant architecture