Structural and functional similarities and differences in nucleolar Pumilio RNA-binding proteins between Arabidopsis and the charophyte Chara corallina
BMC Plant Biology volume 20, Article number: 230 (2020)
Pumilio RNA-binding proteins are evolutionarily conserved throughout eukaryotes and are involved in RNA decay, transport, and translation repression in the cytoplasm. Although a majority of Pumilio proteins function in the cytoplasm, two nucleolar forms have been reported to have a function in rRNA processing in Arabidopsis. The species of the genus Chara have been known to be most closely related to land plants, as they share several characteristics with modern Embryophyta.
In this study, we identified two putative nucleolar Pumilio protein genes, namely, ChPUM2 and ChPUM3, from the transcriptome of Chara corallina. Of the two ChPUM proteins, ChPUM2 was most similar in amino acid sequence (27% identity and 45% homology) and predicted protein structure to Arabidopsis APUM23, while ChPUM3 was similar to APUM24 (35% identity and 54% homology). The transient expression of 35S:ChPUM2-RFP and 35S:ChPUM3-RFP showed nucleolar localization of fusion proteins in tobacco leaf cells, similar to the expression of 35S:APUM23-GFP and 35S:APUM24-GFP. Moreover, 35S:ChPUM2 complemented the morphological defects of the apum23 phenotypes but not those of apum24, while 35S:ChPUM3 could not complement the apum23 and apum24 mutants. Similarly, the 35S:ChPUM2/apum23 plants rescued the pre-rRNA processing defect of apum23, but 35S:ChPUM3/apum24+/− plants did not rescue that of apum24. Consistent with these complementation results, a known target RNA-binding sequence at the end of the 18S rRNA (5′-GGAAUUGACGG) for APUM23 was conserved in Arabidopsis and C. corallina, whereas a target region of ITS2 pre-rRNA for APUM24 was 156 nt longer in C. corallina than in A. thaliana. Moreover, ChPUM2 and APUM23 were predicted to have nearly identical structures, but ChPUM3 and APUM24 have different structures in the 5th C-terminal Puf RNA-binding domain, which had a longer random coil in ChPUM3 than in APUM24.
ChPUM2 of C. corallina was functional in Arabidopsis, similar to APUM23, but ChPUM3 did not substitute for APUM24 in Arabidopsis. Protein homology modeling showed high coverage between APUM23 and ChPUM2, but displayed structural differences between APUM24 and ChPUM3. Together with the protein structure of ChPUM3 itself, a short ITS2 of Arabidopsis pre-rRNA may interrupt the binding of ChPUM3 to 3′-extended 5.8S pre-rRNA.
Pumilio proteins are a family of RNA-binding proteins that are evolutionarily conserved in eukaryotes . Typical Pumilio proteins have tandem repeats of 8 Puf domains that recognize 8 RNA bases, and each Puf domain contains 35–39 amino acids that form three α-helical structures [2, 3]. The basis of RNA recognition by these proteins is the crescent-shaped structure [4, 5]. The conserved aromatic and basic amino acids on the concave side of the crescent structure interact with RNA, whereas the amino acids on the convex side interact with partner proteins. Although Pumilio proteins have a variety of biological roles, their major molecular functions are mRNA decay and localization, translational repression [6, 7], and rRNA processing . Most of the Pumilio proteins are localized in the cytoplasm and are involved in the posttranscriptional regulation of mRNA. However, a small subset of these proteins is localized in the nucleolus and participates in rRNA processing. For instance, nucleolar Nop9 of yeast  and TbPUF7 of trypanosomes  are involved in 18S rRNA biosynthesis and ribosome maturation through proper pre-rRNA processing. In plants, two nucleolar Pumilio proteins have been implicated in rRNA processing [10,11,12,13,14,15], including Arabidopsis APUM23, a homolog of yeast Nop9, and APUM24, a homolog of human Puf-A and yeast Puf6. APUM23 not only is required for normal growth patterning, such as leaf development and organ polarity [10, 16] but also is involved in ABA signaling . APUM24 is essential for plant development, as its homozygous mutant displays embryo lethality . APUM24 is implicated in the maturation of 5.8S and 25S rRNAs, while APUM23 participates in the processing of 18S and 5.8S rRNAs.
Pre-rRNA is a long single-stranded RNA transcribed from the rDNA repeat in the nucleolus. This transcript is subsequently cleaved to three mature rRNAs (5.8S, 18S, and 25S) by endoribonucleolytic activities [18, 19]. Misprocessed rRNA byproducts that are produced during rRNA processing are degraded by 5′-to-3′ and 3′-to-5′ exoribonucleolytic activities. These two pre-rRNA processing activities require additional accessory proteins, such as RNA exosome components, Pumilio proteins, and many RNA-binding proteins. It has been reported that Arabidopsis and rice show similar pre-rRNA processing pathways, probably due to the similar flanking sequences around the endocleavage sites of A2 and A3 in ITS1 , suggesting that RNA binding specificity is essential for the selection of cleavage sites. Recently, APUM23 was found to bind 11 nt in the 18S rRNA at positions 1141–1151 , and APUM24 interacts with rRNA segments encompassing the 5.8S and ITS2 regions . Therefore, it is likely that APUM23 and APUM24 play crucial roles in the recruitment of target RNA sequences and interacting proteins for the maturation of 18S and 5.8S rRNAs in Arabidopsis.
Approximately 450 million years ago, land plants evolved from Charophyta living in freshwater and adapted to the terrestrial environment [21,22,23,24]. Charophyta shares numerous molecular and physiological characteristics with living land plants that are not found in Chlorophyta [25, 26]. Charophyta shows high sequence similarities to land plants in plastidal atpB and rbcL, mitochondrial nad5, and nuclear-encoded small subunit rRNA genes .
In this study, we found that all the green plants (Viridiplantae) examined have two putative nucleolar Pumilio proteins homologous to Arabidopsis APUM23 and APUM24. Consistent with this, two nucleolar Pumilio genes, namely, ChPUM2 and ChPUM3, were identified in Chara corallina by transcriptome analysis. We postulated that two Pumilio proteins encoded by these genes might be evolutionarily and functionally conserved, as their Arabidopsis homologs play crucial roles in pre-rRNA processing required for proper protein synthesis. Transiently expressed ChPUM2-RFP and ChPUM3-RFP were localized in the nucleoli of Nicotiana benthamiana leaf cells, suggesting the nucleolar function of ChPUM2 and ChPUM3. The apum23 mutant transformed with 35S:ChPUM2 recovered its defective rRNA processing and morphological phenotypes to normal levels. However, the rRNA processing defects and embryo lethality of the apum24 mutant were not rescued by 35S:ChPUM3 or ChPUM2. Consistent with the failure of complementation of apum24 with 35S:ChPUM3, APUM24 has different domain structures at the C-terminus from ChPUM3. Moreover, the target ITS2 region of Arabidopsis pre-rRNA is 156 nt shorter than that of C. corallina and might not be sufficient for the binding of ChPUM3.
Phylogeny of nucleolar Pumilio proteins
Pumilio proteins are ubiquitous in eukaryotic organisms, albeit in different numbers [1, 5]. Among the organisms whose whole genome sequences are available, higher plants have a higher number of Pumilio proteins than photosynthetic single-cell organisms and nonplant organisms; for example, 25 Pumilio proteins are found in Arabidopsis thaliana, 20 in Oryza sativa, 14 in Physcomitrella patens, 5 in Chlamydomonas reinhardtii, 11 in Caenorhabditis elegans, 7 in Saccharomyces cerevisiae, and 2 in humans . Based on a similarity search using Arabidopsis nucleolar Pumilio proteins (APUM23 and APUM24) as queries and the existence of a nucleolar localization signal(s) (NoLS) as a requirement , the green plants whose genomes have been sequenced (Phytozome v12.1; https://phytozome.jgi.doe.gov) were shown to have two putative nucleolar Pumilio proteins. Using PacBio Iso-Seq analysis, we also identified two putative nucleolar Pumilio proteins out of four Pumilio proteins in C. corallina. Consistent with our transcriptome analysis, four Pumilio proteins were predicted in the Chara genome data, including 2 nucleolar forms . When compared with Arabidopsis APUMs, comprising 25 Pumilio proteins, ChPUM2 and ChPUM3 displayed high homology with APUM23 and APUM24, respectively, while ChPUM1 and ChPUM4 belonged to other distinct clades (Additional file 1: Figure S1).
To gain insight into the evolutionary relationship of putative nucleolar Pumilio proteins, we constructed a phylogenetic tree with the homologous proteins of 14 species of green plants together with 5 outgroup species that contain a NoLS(s)  (Fig. 1a and b). All green plants analyzed in this study had two proteins belonging to the APUM23 and APUM24 clades. The ChPUM2 of C. corallina was closer to APUM23 than APUM24, whereas ChPUM3 was categorized in the APUM24 clade. Phylogenetic analyses using ChPUM2 and ChPUM3 indicated that C. corallina is closer to land plants than other green algae examined in this study, suggesting that the evolution of nucleolar Pumilio proteins is consistent with previously determined phylogenetic positioning [21,22,23,24,25].
We then compared the number and position of Puf domains in ChPUM2, ChPUM3, APUM23, and APUM24 using the SMART web program (http://smart.embl-heidelberg.de) . APUM23 and APUM24 have six and five Puf domains, respectively, and ChPUM2 and ChPUM3 have one and five Puf domains, respectively. As each Puf domain has been known to recognize a single RNA base , this observation raised the possibility that ChPUM2 may have a distinct RNA binding property from APUM23 and that ChPUM3 may bind similar, if not identical, RNA motifs as APUM24 (Fig. 1c).
Structure of ChPUM2 and ChPUM3
Classic structural analysis of Pumilio proteins shows a tandem repeat of 8 Puf domains in the C-terminal region [31, 32]. However, a recent analysis of human Puf-A and yeast Puf6 identified 11 Puf domains, including 3 additional domains, in these Pumilio proteins involved in pre-rRNA processing . A similar analysis previously performed for APUM23 showed 10 Puf domains instead of the six previously known domains [10, 12]. Consistent with a close phylogenetic relationship between APUM23 and ChPUM2 (Fig. 1a and Additional file 1: Figure S1), ChPUM2 also contained 10 Puf domains that showed an identical distribution as in APUM23 (Fig. 2a and Additional file 2: Figure S2a). Each domain of ChPUM2 showed an average 26% identity and 41% homology with the corresponding domain of APUM23. Notably, a high degree of homology was found in the 1st, 2nd, and 5th residues of the 2nd α-helix of each Puf domain (red boxes in Fig. 2a and Additional file 2: Figure S2a). These three residues have been known to play a pivotal role in the recognition of RNA bases .
Using the same approach, ChPUM3 was shown to possess 11 Puf domains (N-R1 to N-R3 at the N-terminus and C-R1 to C-R8 at the C-terminus) (Fig. 2b and Additional file 2: Figure S2b), as reported in human Puf-A and APUM24 . Comparison of amino acids in 3 N-terminal Puf domains displayed an average identity of 47% and homology of 67% between APUM24 and ChPUM3, and that of 8 C-terminal domains showed 39% identity and 55% homology. Out of the 11 Puf domains, two domains (C-R5 and C-R7) had lower identities than the other domains (Fig. 2b).
Comparison of amino acid sequences among Puf domains suggested that ChPUM2 and ChPUM3 may be functional homologs of Arabidopsis APUM23 and APUM24, respectively (Fig. 2a and b), which is consistent with the observation obtained from phylogenetic analysis. However, since ChPUM2 and APUM23 contain different numbers of Puf domains, unlike ChPUM3 and APUM24, in the classic domain analysis (Fig. 1c), they may bind distinct RNA substrates. Therefore, to determine the structural relationship between nucleolar ChPUM and APUM proteins, we predicted the tertiary structure of these proteins using the SWISS-MODEL web server (https://swissmodel.expasy.org/) . A previous high-resolution structural study demonstrated that the C-shaped structure of APUM23 has a long chain between the 2nd and 3rd α-helix of the R3 domain that participates in the recognition of RNA bases . Homology modeling revealed a high similarity between ChPUM2 and the APUM23 reference protein, as well as the C-shaped structure similar to APUM23, in the 3-dimensional structure (Fig. 3a). Compared with APUM23, ChPUM2 has a long random coil in the R3 domain (red colored lines in the bottom panels of Fig. 3a), but it maintains uninterrupted 2nd and 3rd α-helical structures in this domain, similar to APUM23. Therefore, analyses of consensus amino acid sequences and homology modeling suggest that ChPUM2 may recognize similar, if not identical, RNA bases.
In contrast to the C-shaped configuration of ChPUM2 and APUM23, an L-shaped structure was predicted for ChPUM3 and APUM24, similar to the human Puf-A reference protein  (Fig. 3b). The most marked structural differences between ChPUM3 and APUM24 were found in the C-R5, and N-R2 and N-R3 domains. ChPUM3 had a longer random coil in the C-R5 domain than APUM24 (Fig. 2b and the dotted circles in Fig. 3b). Additionally, ChPUM3 contained negatively charged (E210) and uncharged (Q249) amino acids in the N-R2 and N-R3 domains (Fig. 2b), instead of the basic amino acids that are known to be involved in RNA binding and found at both sites of APUM24 and the human Puf-A reference . Thus, it appeared that ChPUM3 has different RNA binding specificity from APUM24, considering the chain length of C-R5 and the lack of basic amino acids in 2 N-terminal Puf domains.
Subcellular localization of ChPUM2 and ChPUM3
Next, we examined the subcellular localization of ChPUM2-RFP and ChPUM3-RFP. Previously, the GFP fusions of APUM23 and APUM24 were known to preferentially localize in the nucleoli of Arabidopsis root and tobacco leaf cells [10, 15]. We performed Agrobacterium-mediated coinfiltration into N. benthamiana leaf cells using 35S:APUM23-GFP and 35S:ChPUM2-RFP and 35S:APUM24-GFP and 35S:ChPUM3-RFP. All GFP- and RFP-tagged Pumilio proteins were found in the nucleoli and were weakly detected in the nucleoplasm (Fig. 4). The colocalization results suggest that ChPUM2 and ChPUM3 may play similar roles in pre-rRNA recognition and processing to APUM23 and APUM24.
ChPUM2 rescued the apum23 mutant phenotype, but ChPUM3 did not rescue apum24
To assess whether ChPUM2 and ChPUM3 are involved in nucleolar functions similar to APUM23 and APUM24, complementation assays were performed on homozygous apum23 (Fig. 5) and heterozygous apum24 (Fig. 6) mutants by transforming the 35S:ChPUM2 and 35S:ChPUM3 constructs. While the apum23–2 mutant showed the phenotypes of delayed germination (Fig. 5a and b), short roots (bottom panel in Fig. 5b), short inflorescence stems (upper panel in Fig. 5c), light green leaves (bottom panel in Fig. 5c), and streptomycin resistance (Fig. 5d), the 35S:ChPUM2/apum23–2−/− plants exhibited normal phenotypes (Fig. 5a-d). Notably, the recovered streptomycin susceptibility of 35S:ChPUM2/apum23–2−/− plants indicates the normal ribosomal functions of complemented plants (Fig. 5d). In contrast to 35S:ChPUM2/apum23–2−/− plants, 35S:ChPUM3/apum23–2−/− plants maintained an apum23–2 mutant phenotype (Fig. 5a-d). The failure to restore the apum23–2 phenotype with 35S:ChPUM3 excluded the possibility that ChPUM3 and APUM23 are orthologous proteins.
In addition to the restoration of morphological phenotypes, 35S:ChPUM2/apum23–2−/− rescued the defects observed in the apum23–2−/− mutant that accumulates poly (A)-tailed 5′ETS-18S-ITS1 and 5.8S-ITS2 pre-rRNAs (Fig. 5e). The poly (A) pre-rRNAs were detected using quantitative reverse transcriptase-PCR (qRT-PCR) and three combinations of primers. In the apum23–2−/− mutant, all three qRT-PCR products (5′ETS-18S, 18S-ITS1, and 5.8S-ITS2) were accumulated, but in 35S:ChPUM2/apum23–2−/− plants, the amounts of poly(A)-tailed 18S-ITS1 and 5.8S-ITS2 pre-rRNAs were greatly reduced compared with those in apum23–2−/−, although the amount of poly(A) 5.8S-ITS2 was slightly decreased. In contrast to 35S:ChPUM2/apum23–2−/−, 35S:ChPUM3/apum23–2−/− showed a nearly identical amount of poly(A) pre-rRNAs to the apum23–2−/− mutant (Fig. 5e, right panel). The qRT-PCR results indicate that ChPUM2 is involved in a similar pre-rRNA processing pathway as APUM23, although it was not fully functional in the removal of the 5.8S pre-rRNA byproducts.
In contrast to the restoration of apum23 by the ChPUM2 transgene, the apum24 phenotype was not restored by the ChPUM2 or ChPUM3 transgenes. As the homozygous apum24−/− mutant is lethal [13, 15, 34], the heterozygous apum24–1+/− mutant was used for complementation analysis. The 35S:ChPUM3/apum24–1+/− plants set normal and abnormal seeds at similar rates as the apum24+/− mutant (Fig. 6a-c and Table 1), showing 30.7 and 33.2% abnormal seeds for the 35S:ChPUM3/apum24–1+/− and apum24–1+/− plants, respectively. As expected, the 35S:ChPUM2/apum24–1+/− plants produced abnormal seeds (32.5%) at a similar ratio to the 35S:ChPUM3/apum24–1+/− plants. Consistent with this result for the morphological phenotype, 35S:ChPUM2/apum24–1+/− and 35S:ChPUM3/apum24–1+/− plants accumulated poly(A)-tailed 18S-ITS1 and 5.8S-ITS2 pre-rRNA similar to the apum24–1+/− plants (Fig. 6d), indicating that ChPUM3 is not a functional ortholog of Arabidopsis APUM24.
ChPUM2 restored the salt- and glucose-hypersensitive phenotypes of apum23, but ChPUM3 did not restore the apum23 and apum24 phenotypes
apum23 and apum24 showed changes in the expression levels of ribosomal biogenesis-related genes, which in turn resulted in hypersensitivity to high concentrations of salt in apum23  and glucose in a weak apum24 mutant . We therefore examined whether ChPUM2 and ChPUM3 could restore the altered physiological phenotypes of apum23−/− and apum24+/−. The 35S:ChPUM2/apum23–2−/− seedlings exhibited a similar degree of resistance to 150 mM NaCl and 200 mM glucose as wild-type Col-0 seedlings, while the 35S:ChPUM3/apum23–2−/− seedlings showed NaCl and glucose susceptibility similar to that of apum23–2−/− seedlings (left panel in Fig. 7a). However, 35S:ChPUM2/apum24–1+/− and 35S:ChPUM3/apum24–1+/− failed to recover the salt sensitivity of apum24–1+/− (right panel in Fig. 7a). Unexpectedly, when 35S:ChPUM2 or 35S:ChPUM3 was overexpressed in apum24–1+/−, their transgenic seedlings showed a similar glucose resistance as that found in wild-type and apum24–1+/− seedlings (right panel in Fig. 7a). It was previously reported that APUM24 gene expression was greatly increased in wild-type plants by exogenously supplied glucose [13, 15]. Our data showed similar expression levels of the APUM24 transcript in the Col-0 (pB2GW7) control, apum24–1+/−, 35S:ChPUM2/apum24–1+/−, and 35S:ChPUM3/apum24–1+/− in the presence of 200 mM glucose (Fig. 7b). The normal growth of apum24–1+/− suggests that heterozygotic expression of APUM24 might be sufficient for the glucose-induced phenotype. Similar to apum24–1+/−, the apum24–1+/− plants transformed with 35S:ChPUM2 or 35S:ChPUM3 grew normally under glucose treatment, probably owing to the expression of heterozygotic APUM24. Moreover, all the plants showed similar amounts of unprocessed 5.8S rRNAs as Col-0 (pB2GW7) control under 200 mM glucose treatment (Fig. 7c), while they displayed 5.08- to 6.26-fold higher amounts of unprocessed 5.8S rRNAs than the Col-0 (pB2GW7) control (Fig. 6d). This result suggests that glucose supplementation of heterozygous apum24+/− increased the level of heterozygous APUM24 up to the homozygous APUM24 level. Therefore, the normal phenotype and 5.8S pre-rRNA processing that were observed in 35S:ChPUM2/apum24–1+/− and 35S:ChPUM3/apum24–1+/− plants resulted from increased levels of Arabidopsis APUM24 caused by exogenously supplied glucose.
Based on our transcriptome data, public databases, and previous results [10, 15, 35], we found that the green plants examined in this study have two nucleolar Pumilio proteins. Protein phylogeny showed a closer relationship of nucleolar Pumilio proteins of land plants with multicellular C. corallina than with single-celled green algae (Chlorophyta). This result is consistent with a previous report showing that the genus Chara is more closely related to land plants than other green plants . Although the possibility that certain putative nucleolar Pumilio proteins do not have a nucleolar function has not been ruled out, land plant species appear to have evolved two Pumilio proteins for the removal of aberrant pre-rRNAs.
We demonstrated that ChPUM2 is a functional ortholog of APUM23 in Arabidopsis cells, as evident from the restoration of defective pre-rRNA processing and morphology of apum23 in 35S:ChPUM2/apum23 plants. Arabidopsis nucleolar Pumilio proteins are known to play a role in recognizing the target sequences on pre-rRNA and recruiting catalytic proteins such as exoribonuclease [10, 12, 13, 15]. Comparison of pre-rRNA identified a target sequence (5′-GGAAUUGACGG-3′) of APUM23  in the 18S rRNA of C. corallina at positions 1148–1158 (Additional file 6: Figure S6). Therefore, it is likely that ChPUM2 may bind to this sequence in C. corallina. This assumption is supported by the primary structure of ChPUM2 and APUM23, each Puf domain of which is highly conserved between ChPUM2 and APUM23 except the fourth domain (Fig. 2a). Therefore, the common target rRNA sequence and similar amino acid composition of Puf domains might enable the restoration of apum23 phenotypes, including morphological and rRNA processing defects, to normal in 35S:ChPUM2/apum23. In apum23 complementation analysis using 35S:ChPUM2, the poly(A)-tailed 5.8S pre-rRNAs that accumulated in the apum23 mutant were not completely removed (Fig. 5e). It seems possible that this might be due to a weak interaction of ChPUM2 with other unknown proteins that belong to the partners of intrinsic APUM23 specifically required for 5.8S rRNA processing. Indeed, the predicted structure of ChPUM2 has more unfolded chains than APUM23 (Fig. 3a), which may interfere with the interaction of ChPUM2 with other protein components. To verify this possibility in planta, it is worthwhile to identify and compare the components interacting with CPUM2 in C. corallina and APUM23 in Arabidopsis.
Although ChPUM3 appeared structurally similar to APUM24 (Fig. 3b), ChPUM3 did not functionally replace APUM24 in Arabidopsis. We assume that this result is due to fine structural differences between ChPUM3 and APUM24. Typical Pumilio proteins bind to a specific RNA base with the second α-helix of the Puf domain, but APUM24 and its homologs are not capable of binding to a specific RNA base through this α-helix domain [14, 15]. ChPUM3 does not complement the apum24 mutant perhaps due to (1) the very long random coil in the C-R5 domain, (2) the negatively charged and uncharged amino acids in two N-terminal domains, and (3) the long ITS2 sequence in C. corallina. First, a very long random chain at the C-R5 domain of ChPUM3 would interrupt the interaction of other C-terminal domains with RNA bases in the 5.8S-ITS2 junction of Arabidopsis pre-rRNA. Indeed, it was reported that human Puf-A and its homolog APUM24 have a long random coil in C-R5 that prevents the C-terminal Puf domains from binding to RNA . ChPUM3 has a random coil that is an 80 aa longer than that of APUM24 (Fig. 2b); thus, ChPUM3 may not recognize Arabidopsis pre-rRNA. Second, in ChPUM3, the N-R2 and N-R3 of patch 1B include negatively charged (E210) and uncharged (Q249) amino acids, unlike the positive amino acids (K) at both positions of APUM24, which may result in differential binding characteristics from APUM24 toward the 5.8S-ITS2 region. The N-R2 and N-R3 domains of the human Puf-A protein are essential for RNA binding . Third, ChPUM3 might be optimized for the recognition of long ITS2 sequences. ITS2 of C. corallina pre-rRNA is 156 nt longer than that of Arabidopsis (Additional file 7: Figure S7). In addition to the long side chain of the C-R5 domain in ChPUM3, the relatively short ITS2 sequence of Arabidopsis pre-rRNA may prevent ChPUM3 from binding to its substrate. Indeed, ITS2 evolved rapidly and has been used to evaluate genetic divergence [36, 37].
In this study, we identified two nucleolar Pumilio proteins, namely, ChPUM2 and ChPUM3, from C. corallina that are phylogenetically and structurally close to the Arabidopsis nucleolar Pumilio proteins APUM23 and APUM24, respectively. Complementation analyses using 35S:ChPUM2 and 35S:ChPUM3 showed that ChPUM2 rescued the defective phenotypes of the apum23 mutant, but ChPUM3 did not restore the phenotypes of the apum24 mutant. Consistent with these complementation results, ChPUM2 showed similar features of Puf domains as APUM23 in the primary amino acid sequence and a predicted 3-D protein structure. ChPUM3 has a long random coil in the C-R5 domain and contains distinct amino acids from those in APUM24 in the N-terminal domain. In addition to the structural difference between ChPUM3 and APUM24, a short ITS2 sequence of Arabidopsis pre-rRNA might prevent ChPUM3 from properly processing Arabidopsis 5.8S pre-rRNA. Taken together, the results show that ChPUM2 was functional in Arabidopsis, similar to APUM23, but ChPUM3 could not substitute for APUM24 in Arabidopsis. Further studies on the nucleolar functions of ChPUM2 and ChPUM3 in Charophyta will help us understand the evolution of rRNA processing in green plants.
Plant materials and growth conditions
C. corallina was collected at the private land (38°33′N, 128°50′E) in South Korea with the kind permission of land owner, and grown at room temperature in a small aquarium. Genomic DNA and voucher specimens of C. corallina were formally identified by Dr. Min Ha Kim at the Plant Resources Division of the National Institute of Biological Resources (https://www.nibr.go.kr/) and deposited under the number NIBRGR0000609814. The apum23  and apum24  mutants, obtained from Arabidopsis Biological Resources Center and reported previously, were used for complementation analyses. The 35S:ChPUM2 and 35S:ChPUM3 transgenic Arabidopsis plants in the apum23 and apum24 backgrounds were produced by transformation using Agrobacterium tumefaciens GV3101 with the floral dipping method . A. thaliana wild-type Col-0 and control (Col-0 transformed with pB2GW7) and the apum23–2−/−, apum24–1+/−, 35S:ChPUM2, and 35S:ChPUM3 overexpression lines were grown on MS medium or in the soil at 22 °C under 16 h light (120 μmol photons m− 2 s− 2) and 8 h dark cycles. For testing antibiotic, salt, and glucose resistance, seeds were germinated on 1/2 MS plates supplemented with 50 mg L− 1 streptomycin, 150 mM NaCl, and 200 mM glucose, respectively, in a growth room for 10 to 12 days. All seeds were stratified at 4 °C for 3 days before sowing.
Identification of ChPUM2 and ChPUM3 transcripts
The cDNA sequences of Pumilio proteins of C. corallina (ChPUMs) were obtained by searching our PacBio Iso-Seq transcriptome data that were generated from whole plants, including thallus, rhizoids, globules (antheridia), and nucules (archegonia). The expression of ChPUMs was verified using RT-PCR. Nucleolar ChPUMs were identified by the comparison of Arabidopsis nucleolar Pumilio proteins (APUM23 and APUM24) with Pumilio proteins of C. corallina and shown to have a NoLS .
Phylogenic analysis of ChPUM2 and ChPUM3 homologs
To perform the phylogenetic analysis, COGs (Clusters of Orthologous Groups) of amino acid sequences of the proteins of ChPUM2, ChPUM3, APUM23, and APUM24 were obtained from representative species of Viridiplantae (green plants) in Phytozome (v 12.1) , Klebsormidium flaccidum in the Klebsormidium genome database (http://www.plantmorphogenesis.bio.titech.ac.jp/~algae_genome_project/klebsormidium/), and two red algae species in the Ensembl Plant database (http://plants.ensembl.org). A phylogenetic tree was constructed by using the maximum likelihood method based on the LG + G model with MEGA7 software [29, 39]. Amino acid sequence alignments were performed using ClustalW (http://www.clustal.org/) and edited using BioEdit software (https://bioedit.software.informer.com/).
For the construction of 35S:ChPUM plasmids, coding sequences (CDSs) for ChPUM2 and ChPUM3 were amplified by RT-PCR using the primers ChPUM2-F and ChPUM2-R1 and ChPUM3-F and ChPUM3-R1, respectively (see Additional file 8: Table S1 for primer sequences). The PCR products were inserted into the pENTR-D-TOPO vector (Invitrogen) and then transferred to pB2GW7 or pK2GW7 (Vlaams Instituut voor Biotechnologie, Ghent University) by Gateway™ LR Cloase II (Invitrogen). For the construction of 35S:ChPUM-RFP and 35S:APUM-GFP plasmids, CDSs of ChPUM2, ChPUM3, APUM23, and APUM24 were amplified by RT-PCR using the primer combinations ChPUM2-F/ChPUM2-R2, ChPUM3-F/ChPUM3-R2, APUM23-F/APUM23-R, and APUM24-F/APUM24-R, respectively. PCR products were inserted into the pENTR-D-TOPO vector and then transferred to the pB7RWG2 or pK7FWG2 vector  by Gateway™ LR Clonase II.
Colocalization assay of ChPUM and APUM fusion proteins
The C-terminal RFP fusion proteins of ChPUM2 and ChPUM3 and C-terminal GFP fusions of APUM23 and APUM24 were transiently expressed in N. benthamiana leaves using agroinfiltration . Briefly, cultures of Agrobacterium carrying fusion constructs were harvested at the stationary phase and resuspended in MMA buffer (10 mM MES, 10 mM MgCl2, and 150 μM acetosyringone) to OD600 = 0.8. For coexpression of ChPUM-RFP and APUM-GFP, equal volumes of two Agrobacterium cultures that had either the 35S:ChPUM-RFP or 35S:APUM-GFP construct were mixed before infiltration. Infiltration was performed on the abaxial side of tobacco leaves using a needleless syringe. Plants were kept in the dark at 22 °C under high humidity for 30–34 h, and the infiltrated leaves were observed under a fluorescence microscope.
Quantitative reverse transcriptase-PCR (qRT-PCR) for analyzing unprocessed rRNA
Total RNA was isolated using the RNeasy Plant Mini Kit (Qiagen, cat. # 74904) from 100 mg seedling and treated with 2 units of RNase-free TURBO™ DNase (Ambion, cat. # AM2238) in 50 μL reaction at 37 °C for 50 min. First-strand cDNA was synthesized from 5 μg of total RNA using the oligo (dT)18 primer in a 20 μL reaction and diluted 3-fold. Then, one μL of cDNA was mixed with 0.6 μL of 10 mM primers and 10 μL of 2 x SYBR® Green Supermix (Bio-Rad, cat. # 172–5261) in a 20 μL reaction and subjected to PCR according to the manufacturer’s instructions. For the detection of unprocessed poly(A) rRNAs, three different combinations of primers (5′ETS/18S, 18S/ITS1, and 5.8S/ITS2) were used. Tubulin (Tub4, At5g44340) cDNA was used as an internal control. For qPCR measurements, two technical and three biological replicates were used. Data were calculated using the 2-ΔΔCT method .
Availability of data and materials
Sequence data for the cDNAs of ChPUM2 and ChPUM3 described in this study can be found in the GenBank database (https://www.ncbi.nlm.nih.gov/) under the accession numbers MN652915 and MN652916, respectively. The transcriptome dataset analyzed during the identification of ChPUM2 and ChPUM3 transcripts is available in the GenBank database (NCBI SRA accession number; SRP249259).
Internal transcribed sequence
Murashige and Skoog
Spassov DS, Jurecic R. The PUF family of RNA-binding proteins: does evolutionarily conserved structure equal conserved function? IUBMB Life. 2003;55:359–66.
Zhang B, Gallegos M, Puoti A, Durkin E, Fields S, Kimble J, et al. A conserved RNA-binding protein that regulates sexual fates in the C. elegans hermaphrodite germ line. Nature. 1997;390:477–84.
Miller MT, Higgin JJ, Hall TM. Basis of altered RNA-binding specificity by PUF proteins revealed by crystal structures of yeast Puf4p. Nat Struct Mol Biol. 2008;15:397–402.
Edwards TA, Pyle SE, Wharton RP, Aggarwal AK. Structure of Pumilio reveals similarity between RNA and peptide binding motifs. Cell. 2001;105:281–9.
Quenault T, Lithgow T, Traven A. PUF proteins: repression, activation and mRNA localization. Trends Cell Biol. 2011;21:104–12.
Murata Y, Wharton RP. Binding of pumilio to maternal hunchback mRNA is required for posterior patterning in drosophila embryos. Cell. 1995;80:747–56.
Sonoda J, Wharton RP. Recruitment of Nanos to hunchback mRNA by Pumilio. Genes Dev. 1999;13:2704–12.
Droll D, Archer S, Fenn K, Delhi P, Matthews K, Clayton C. The trypanosome Pumilio-domain protein PUF7 associates with a nuclear cyclophilin and is involved in ribosomal RNA maturation. FEBS Lett. 2010;584:1156–62.
Thomson E, Rappsilber J, Tollervey D. Nop9 is an RNA binding protein present in pre-40S ribosomes and required for 18S rRNA synthesis in yeast. RNA. 2007;13:2165–74.
Abbasi N, Kim HB, Park NI, Kim HS, Kim YK, Park YI, et al. APUM23, a nucleolar Puf domain protein, is involved in pre-ribosomal RNA processing and normal growth patterning in Arabidopsis. Plant J. 2010;64:960–76.
Abbasi N, Park YI, Choi SB. Pumilio Puf domain RNA-binding proteins in Arabidopsis. Plant Signal Behav. 2011;6:364–8.
Bao H, Wang N, Wang C, Jiang Y, Liu J, Xu L, et al. Structural basis for the specific recognition of 18S rRNA by APUM23. Nucleic Acids Res. 2017;45:12005–14.
Maekawa S, Ishida T, Yanagisawa S. Reduced expression of APUM24, encoding a novel rRNA processing factor, induces sugar-dependent nucleolar stress and altered sugar responses in Arabidopsis thaliana. Plant Cell. 2018;30:209–27.
Qiu C, McCann KL, Wine RN, Baserga SJ, Hall TM. A divergent Pumilio repeat protein family for pre-rRNA processing and mRNA localization. Proc Natl Acad Sci U S A. 2014;111:18554–9.
Shanmugam T, Abbasi N, Kim HS, Kim HB, Park NI, Park GT, et al. An Arabidopsis divergent pumilio protein, APUM24, is essential for embryogenesis and required for faithful pre-rRNA processing. Plant J. 2017;92:1092–105.
Huang T, Kerstetter RA, Irish VF. APUM23, a PUF family protein, functions in leaf development and organ polarity in Arabidopsis. J Exp Bot. 2014;65:1181–91.
Huang KC, Lin WC, Cheng WH. Salt hypersensitive mutant 9, a nucleolar APUM23 protein, is essential for salt sensitivity in association with the ABA signaling pathway in Arabidopsis. BMC Plant Biol. 2018;18:40.
Henras AK, Plisson-Chastang C, O’Donohue M-F, Chakraborty A, Gleizes P-E. An overview of pre-ribosomal RNA processing in eukaryotes. Wiley Interdiscip Rev RNA. 2015;6:225–42.
Weis BL, Kovacevic J, Missbach S, Schleiff E. Plant-specific features of ribosome biogenesis. Trends Plant Sci. 2015;20:729–40.
Hang R, Wang Z, Deng X, Liu C, Yan B, Yang C, et al. Ribosomal RNA biogenesis and its response to chilling stress in Oryza sativa. Plant Physiol. 2018;177:381.
Domozych D, Ciancia M, Fangel J, Mikkelsen M, Ulvskov P, Willats W. The cell walls of green algae: a journey through evolution and diversity. Front Plant Sci. 2012;3:82.
Sørensen I, Domozych D, Willats WGT. How have plant cell walls evolved? Plant Physiol. 2010;153:366.
Sørensen I, Pettolino FA, Bacic A, Ralph J, Lu F, O'Neill MA, et al. The charophycean green algae provide insights into the early origins of plant cell walls. Plant J. 2011;68:201–11.
Domozych D, Sørensen I, Popper ZA. Editorial: Charophytes: evolutionary ancestors of plants and emerging models for plant research. Front Plant Sci. 2017;8:338.
Lewis LA, McCourt RM. Green algae and the origin of land plants. Am J Bot. 2004;91:1535–56.
Nishiyama T, Sakayama H, de Vries J, Buschmann H, Saint-Marcoux D, Ullrich KK, et al. The Chara genome: secondary complexity and implications for plant terrestrialization. Cell. 2018;174:448–64 e24.
Karol KG, McCourt RM, Cimino MT, Delwiche CF. The closest living relatives of land plants. Science. 2001;294:2351–3.
Scott MS, Troshin PV, Barton GJ. NoD: a nucleolar localization sequence detector for eukaryotic and viral proteins. BMC Bioinformatics. 2011;12:317.
Kumar S, Stecher G, Tamura K. MEGA7: molecular evolutionary genetics analysis version 7.0 for bigger datasets. Mol Biol Evol. 2016;33:1870–4.
Schultz J, Copley RR, Doerks T, Ponting CP, Bork P. SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res. 2000;28:231–4.
Miller MA, Olivas WM. Roles of Puf proteins in mRNA degradation and translation. Wiley Interdiscip Rev RNA. 2011;2:471–92.
Wang X, McLachlan J, Zamore PD, Hall TM. Modular recognition of RNA by a human pumilio-homology domain. Cell. 2002;110:501–12.
Waterhouse A, Bertoni M, Bienert S, Studer G, Tauriello G, Gumienny R, et al. SWISS-MODEL: homology modelling of protein structures and complexes. Nucleic Acids Res. 2018;46:W296–303.
Maekawa S, Yanagisawa S. Nucleolar stress and sugar response in plants. Plant Signal Behav. 2018;13:e1442975.
Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, et al. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40:D1178–86.
Caisova L, Marin B, Melkonian M. A close-up view on ITS2 evolution and speciation - a case study in the Ulvophyceae (Chlorophyta, Viridiplantae). BMC Evol Biol. 2011;11:262.
Qin Y, Li M, Cao Y, Gao Y, Zhang W. Molecular thresholds of ITS2 and their implications for molecular evolution and species identification in seed plants. Sci Rep. 2017;7:17316.
Clough SJ, Bent AF. Floral dip: a simplified method for agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998;16:735–43.
Le SQ, Gascuel O. An improved general amino acid replacement matrix. Mol Biol Evol. 2008;25:1307–20.
Karimi M, Inzé D, Depicker A. GATEWAY™ vectors for Agrobacterium-mediated plant transformation. Trends Plant Sci. 2002;7:193–5.
Goodin MM, Dietzgen RG, Schichnes D, Ruzin S, Jackson AO. pGD vectors: versatile tools for the expression of green and red fluorescent protein fusions in agroinfiltrated plant leaves. Plant J. 2002;31:375–83.
Livak KJ, Schmittgen TD. Analysis of relative gene expression data using real-time quantitative PCR and the 2−ΔΔCT method. Methods. 2001;25:402–8.
We thank Dr. Jungho Lee for advice on the culture conditions.
This study was financially supported by grants from the Next-Generation BioGreen21 Program, Rural Development Administration (no. PJ013663), South Korea, to SBC. The funding agencies supported this research project but played no role in the design of the study, data analysis, interpretation, and writing of the manuscript. These were the sole responsibilities of the authors.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Phylogenetic tree of the Pumilio proteins using 25 APUMs from A. thaliana APUMs and 4 ChPUMs from C. corallina. The maximum likelihood tree was generated using the JTT + F + G model with 1000 bootstrapping replicates.
Alignment of five RNA recognition residues in each Pumilio repeat from nucleolar APUMs and ChPUMs. Five residues of each Pumilio repeat in APUM23 and APUM24 known as classically important for RNA recognition are aligned with those of their homologous ChPUMs, namely, ChPUM2 and ChPUM3, respectively.
Alignment of 18S rRNA sequences of A. thaliana and C. corallina. The red box indicates the Arabidopsis rRNA sequence at nt positions 1141–1151 to which APUM23 binds, and the blue box shows the identical sequence at nt positions 1148–1158 of 18S rRNA in C. corallina.
Alignment of 5.8S rRNA and ITS2 sequences of A. thaliana and C. corallina. (a) Alignment of 5.8S rRNA sequences. (b) Alignment of ITS2 sequences.
Primers used in this study.
About this article
Cite this article
Park, S.H., Kim, HS., Kalita, P.J. et al. Structural and functional similarities and differences in nucleolar Pumilio RNA-binding proteins between Arabidopsis and the charophyte Chara corallina. BMC Plant Biol 20, 230 (2020). https://doi.org/10.1186/s12870-020-02444-x