- Research
- Open access
- Published:
Comparative plastome analysis of the sister genera Ceratocephala and Myosurus (Ranunculaceae) reveals signals of adaptive evolution to arid and aquatic environments
BMC Plant Biology volume 24, Article number: 202 (2024)
Abstract
Background
Expansion and contraction of inverted repeats can cause considerable variation of plastid genomes (plastomes) in angiosperms. However, little is known about whether structural variations of plastomes are associated with adaptation to or occupancy of new environments. Moreover, adaptive evolution of angiosperm plastid genes remains poorly understood. Here, we sequenced the complete plastomes for four species of xerophytic Ceratocephala and hydrophytic Myosurus, as well as Ficaria verna. By an integration of phylogenomic, comparative genomic, and selection pressure analyses, we investigated evolutionary patterns of plastomes in Ranunculeae and their relationships with adaptation to dry and aquatic habitats.
Results
Owing to the significant contraction of the boundary of IRA/LSC towards the IRA, plastome sizes and IR lengths of Myosurus and Ceratocephala are smaller within Ranunculeae. Compared to other Ranunculeae, the Myosurus plastome lost clpP and rps16, one copy of rpl2 and rpl23, and one intron of rpoC1 and rpl16, and the Ceratocephala plastome added an infA gene and lost one copy of rpl2 and two introns of clpP. A total of 11 plastid genes (14%) showed positive selection, two genes common to Myosurus and Ceratocephala, seven in Ceratocephala only, and two in Myosurus only. Four genes showed strong signals of episodic positive selection. The rps7 gene of Ceratocephala and the rpl32 and ycf4 genes of Myosurus showed an increase in the rate of variation close to 3.3Â Ma.
Conclusions
The plastomic structure variations as well as the positive selection of two plastid genes might be related to the colonization of new environments by the common ancestor of Ceratocephala and Myosurus. The seven and two genes under positive selection might be related to the adaptation to dry and aquatic habitats in Ceratocephala and Myosurus, respectively. Moreover, intensified aridity and frequent sea-level fluctuations, as well as global cooling, might have favored an increased rate of change in some genes at about 3.3Â Ma, associated with adaptation to dry and aquatic environments, respectively. These findings suggest that changing environments might have influenced structural variations of plastomes and fixed new mutations arising on some plastid genes owing to adaptation to specific habitats.
Background
The plastid is an organelle with an essential role in the photosynthesis of green plants [1]. Typically, angiosperm plastid genomes (plastomes) display a quadripartite circular structure with sequences ranging from 130 to 170Â kb in length [2]. The complete plastome consists of one large single copy (LSC) and one small single copy (SSC) regions, flanked by two inverted repeats (IRA and IRB) [2, 3]. Generally, the plastomes of angiosperms are highly conserved in terms of gene content and structure, but there is considerable variation resulting from the expansion and contraction of IRs [4], the addition, loss and pseudogenization of genes [5], the inversion of genes and regions [6], and polymorphic simple sequence repeats (SSRs) [7]. Some studies have indicated that plastomic variations of heterotrophic plants (parasites or mycoheterotrophs) are correlated with their unique lifestyle [8]. For example, mycoheterotrophic Neottieae (Orchidaceae) have lost plastid NADH dehydrogenase-like complex and photosynthesis-related genes [9]. To date, we know little about whether plastomic variations of an angiosperm group are associated with adaptation or occupancy of new environments, such as dry and aquatic habitats.
Protein-coding genes of a complete angiosperm plastome can be divided into five functional groups, i.e., plastid NADH dehydrogenase-like complex, photosynthesis-related, plastid-encoded RNA polymerase, plastid ATP synthase, and housekeeping [8, 9]. Some plastid protein-coding genes have been reported under positive selection in different taxa, such as seven NADH dehydrogenase-like complex genes (ndhA, ndhB, ndhE, ndhD, ndhF, ndhG, and ndhK) in Allium (Amaryllidaceae) [10], four phyotosynthesis-related genes (psaA, psbB, psbD, and psbH) in Oryza (Poaceae) [11], one plastid-encoded RNA polymerase gene (rpoC2) in Cardamine resedifolia (Brassicaceae) [12], three ATP synthase genes (atpA, atpB, and atpI) in Dipsacales [13], and three housekeeping genes (clpP, ycf2 and rps14) in Euphrasia (Orobanchaceae) [14]. The identification of positively selected genes has been a powerful tool for investigating organismal adaptation to climate changes [10, 15, 16]. In particular, identifying episodic positive selection can be used to determine when adaptation to a specifically environmental condition occurred. As an example, Zecca et al. [17] show that the psbK, rpl20, rpoB, and rps11 genes in Vitaceae had episodic signatures of positive selection and experienced an increase in the rate of variation close to the Cretaceous–Palaeogene transition, which might be influenced by intense environmental perturbations during the transition. It is well-known that extant angiosperms occupy various environments. To gain a better understanding of adaptive evolution of angiosperm plastid genes to environmental changes, we need to examine more taxa growing in diverse environments in a phylogenetic context.
The angiosperm family Ranunculaceae, well known as the buttercup family, is an important herbaceous element of mountain ecosystems in the Northern Hemisphere [18]. Ranunculeae is the largest tribe in this family and consists of 19 genera with approximately 650 species [19, 20]. Within Ranunculeae, most of genera grow in mesophytic habitats except Ceratocephala, Myosurus and some species of Ranunculus [19]. Ceratocephala contains three or four species that are mainly distributed in dry regions of Central Asia with one in New Zealand, and Myosurus has about fifteen species that inhabit wet or seasonally wet regions of all continents. Phylogenetic analyses support the sister relationship between xerophytic Ceratocephala and hydrophytic Myosurus [20,21,22]. Molecular clock estimates suggest a stem age of 31–42 Ma for the Ceratocephala-Myosurus clade, and the split time of these two genera as 25–35 Ma [22]. From the late Eocene onwards, global climate has been altered dramatically [23]. Hence, Ceratocephala and Myosurus provide a good opportunity to explore evolutionary adaptation of plastomes to arid and aquatic environments.
In this study, we first sequenced the complete plastomes for two species of Ceratocephala and two species of Myosurus, as well as one of Ficaria verna. Then, we performed phylogenetic analyses and estimated divergence times for Ranunculeae in combination with the plastomic data of seven other species of Ranunculeae. Within the dated phylogenetic framework, we finally inferred the patterns of structural variations of the plastomes, identified genes that are under positive selection, and explored the changes through time in the rates of variation of plastid genes of Ceratocephala and Myosurus under episodic positive selection. These analyses will contribute to a better understanding of how angiosperm plastomes have evolved and adapted to arid and aquatic environments through large-scale environmental changes.
Results
Genome feature
The complete plastomes of five species of Ranunculeae were de novo assembled. Combining them with seven previously published plastomes (Table S1), we performed comparative analysis of 12 Ranunculeae plastomes with a genome size ranging from 150.4 kb in Myosurus to 157.3 kb in Halerpestes (Figs. 1, S1, S2; Table S1). All plastomes exhibited the typical angiosperm quadripartite structure, which comprises a LSC (83.6–86.7 kb) and a SSC (18.4–22.0 kb) region, separated by two IR regions (23.3–25.8 kb). The plastome sizes of Ceratocephala (150.8–151.4 kb) and Myosurus (ca.150.4 kb) were smaller than that of other Ranunculeae genera (154.2–157.3 kb). The GC content of the plastomes in Ceratocephala (38.4%) and Myosurus (39.8%) was higher than that of other Ranunculeae genera, similar trends in GC content variations were found in the LSC, SSC, and IR regions in Ranunculeae.
We performed high-quality gene annotation for the 12 plastomes, and identified 76–79 plastid protein-coding genes, 29–30 tRNA genes, and 4 rRNA genes (Table S1). Within Ranunculeae, Ceratocephala contains the highest number of plastid protein-coding genes (79), and Myosurus has the lowest number of plastid protein-coding genes (76). The GC3s value of protein-coding genes in Myosurus and Ceratocephala was generally higher than that of other Ranunculeae genera (Fig. S3).
Phylogenetic analysis and divergence time estimation
Phylogenetic relationships within Ranunculeae were reconstructed based on the 78 plastid protein-coding gene data (Figs. 2a and S4). Halerpestes and Oxygraphis formed a clade, sister to the other Ranunculeae. Ceratocephala and Myosurus formed a clade with strong support (bootstrap support (BS) = 100%), sister to Ranunculus. The monophyly of each of Ceratocephala, Myosurus and Ranunculus is strongly supported.
Divergence time estimates for Ranunculeae are shown in Fig. 3. The stem and crown group ages of Ranunculeae are 67.69 Ma (95% highest posterior density (HPD): 60.01–74.64) and 49.13 Ma (95% HPD: 42.71–55.6), respectively. The Ceratocephala-Myosurus clade originated at 35.53 Ma (95% HPD: 25.44–44.99). The split of Ceratocephala and Myosurus occurred at 28.27 Ma (95% HPD: 23.08–37.28). Ceratocephala and Myosurus began to diversify at 1.34 Ma (95% HPD: 0.06–3.72) and 2.46 Ma (95% HPD: 0.15–7.58), respectively.
Plastome structural variations
To assess the plastomic synteny among the Ranunculeae species distributed in different habitats, structural variations of Ranunculeae plastomes were investigated (Figs. 2 and S5; Table S2). A total of 88,069 nucleotide variations were detected in all sampled Ranunculeae species, in which the majority of variants (92.85%) were found in non-coding regions. For the coding regions, 38.94% of the nucleotide variants were functionally missense variations distributed in 56 protein-coding genes. Extensive functional variations were identified among the plastomes of Ceratocephala, Myosurus, and Ranunculus, including 3,156 missense variants in 71 genes between Ceratocephala and Myosurus, 2,533 missense variants in 71 genes between Ceratocephala and Ranunculus, and 3,351 missense variants in 70 genes between Myosurus and Ranunculus (Table S2). Accordingly, these three genera possessed distinct plastid gene components, i.e., the Ceratocephala plastome added an infA and lost two introns of clpP, the Myosurus plastome lost clpP, infA, rps16, one copy of rpl2 and rpl23, and one intron of rpl16 and rpoC1, and the Ranunculus plastomes presents a more diverse variations in different species (Fig. 2a; Table S3).
To investigate the contraction and expansion events of the Ranunculeae plastomes, we compared the exact borders of the IR/SC regions (Figs. 2 and S5; Table S1). For the LSC/IRA region, rpl2 spanned the region while a pseudogene fragment, namely ψrpl2 was located at the IRB region in Ceratocephala with 590 bp and in Oxygraphis with 1,036 bp; rps19 spanned the same region while a pseudogene fragment (ψrps19) was located at the IRB region in Ficaria with 78 bp and Halerpestes with 174 bp. For the SSC/IRB region, ycf1 spanned the region while a pseudogene fragment (ψycf1) was located at the IRA region with a length range of 57–988 bp in Myosurus, Oxygraphis, and three species of Ranunculus (i.e., R. cantoniensis, R. macranthus, and R. repens). In the Ficaria plastome, ycf1 was located in trnNGUU-ndhF at the SSC region, which was unique in the Ranunculeae. Meanwhile, ψycf1 was lost in the plastomes of Ficaria, Halerpestes, and Ceratocephala. For the LSC/IRB region, trnHGUG spanned the region while a pseudogene fragment (ψtrnHGUG) was located at the IRA region with 23 and 61 bp in R. cantoniensis and R. repens, respectively. Additionally, in the LSC region, there was a difference among Myosurus and other genera, the former having the ψclpP and rpl20 relocated to the region of psaI-rps12 (Fig. 1).
Selection pressure analysis
To determine potential genes that could show signals of adaptive evolution to different habitats, we used the likelihood ratio test to search the optimal model in selection pressure analysis. For all 78 plastid protein-coding genes, the free-ratio model (m2), assuming three independent ω (the ratio of dN/dS), was identified as best-fit to Ceratocephala, Myosurus, and other Ranunculeae (Tables S4–S6). In the Ceratocephala plastome, we identified nine positively selected genes (ω > 1), including a photosystem II factor gene (psbJ), a protein-modifying enzyme gene (clpP), six ribosome small subunit genes (rps7, rps12, rps14, rps16, rps18, and rps19), and an unknown function gene (ycf2). In the Myosurus plastome, four genes showed signatures of positive selection (ω > 1), including a photosystem II factor gene (psbJ), a ribosome small subunit gene (rps18), a ribosome large subunit gene (rpl32) and a photosystem assembly factor gene (ycf4).
Investigating trait changes through time
To determine the period when the gene underwent positive selection, we coded the different amino acid sites as character states for Ceratocephala, Myosurus, and Ranunculus (Fig. S6; Table S7). All eleven genes under positive selection fitted the all-rates-different (ARD) likelihood model for discrete character evolution. Changes through time plots generated from sampled stochastic character maps for each gene are shown in Fig. S7.
As obviously temperature and/or precipitation changes occurred at 34 Ma, 23.03 Ma, 17 Ma, 13.9 Ma, and 3.3 Ma, we chose these five periods to generate and fit models in which the rate of change of the composite trait varied between different times on the reference chronogram (Fig. 3; Table S8). By comparing the equal-rates (ER), the symmetrical (SYM), and ARD likelihood models at five periods, we selected the periods in which the rate variation of genes was most likely to change. The ARD model was identified as best-fit for all genes except clpP and rps16 (for which no best-fit model was identified); and the rpl32, rps7, rps18, and ycf4 genes showed strong signals of episodic positive selection (P < 0.05; Table S8). The changes through time plots indicate that the rpl32, rps7, and ycf4 genes showed an increase in the rate of variation at about 3.3 Ma, whereas the increases of rate of the rps18 gene did not occur in any of the five periods (Fig. 3; Fig. S7).
Discussion
Structural variations of plastomes in Ceratocephala and Myosurus
Our results show that Ceratocephala and Myosurus formed a clade, sister to Ranunculus (Figs. 2a and S4), consistent with previous studies [21, 22, 24]. Within Ranunculeae, the plastome size of Myosurus is the smallest, followed by Ceratocephala. Similarly, the IR regions of Myosurus and Ceratocephala are shorter than that of other Ranunculeae (Table S1). The rpl2 gene is located in the LSC/IRA region in Ceratocephala and in the LSC region in Myosurus, respectively, whereas in other Ranunculeae genera, this gene is located in the IR region. In most angiosperms, the rpl2 gene is located in the IR region, such as in Bletilla (Orchidaceae) [25], Piper (Piperaceae) [26], and Mahonia (Berberidaceae) [27]. Thus, the contraction of the IRA towards the LSC resulted in the smaller plastome size and shorter IR length of Ceratocephala and Myosurus (Fig. 2b).
The expansion or contraction of the IR-LSC boundary regions is considered as a primary mechanism causing the length variation of angiosperm plastomes [28]. The expansions or contractions of the IRs into or out of adjacent single-copy regions are frequently observed in angiosperm plastomes [29], which may be related to the colonization of particular habitats. The species of Corydalis (Papaveraceae) occur from sea level to more than 6,000Â m altitude, often in forest understoreys, alpine meadows and screes, deserts, and steppes, and the expansion of IR region were reported widely in this genus [30]. Among the four species of Echinacanthus (Acanthaceae), the one distributed in the western Himalaya has the plastome with the contraction of the IR region, compared to the other three restricted to the Sino-Vietnamese karst region [31]. Thus, the contraction of IR regions might have occurred in the most recent common ancestor (MRCA) of Ceratocephala and Myosurus, in association with the colonization of new habitats and the divergence from the mesophytic Ranunculeae.
Although the IR regions became smaller in Ceratocephala and Myosurus, their GC average contents are higher than in other Ranunculeae genera (Table S1). We also find that the GC3s values of 78 protein-coding genes in Ceratocephala and Myosurus were higher than that of other Ranunculeae genera (Fig. S3). Generally, high GC content imparts more stability to the genome [32]. Compared to mesophytic Ranunculeae, the plastome of xerophytic Ceratocephala has a higher GC content. Genomic DNA with high GC content is considered to be more thermostable in Welwitschia, a unique desert xerophytic genus [33]. The plastome of hydrophytic Myosurus also has a relatively higher GC content than that of mesophytic Ranunculeae. Thus, the higher GC content may have facilitated Ceratocephala and Myosurus to adapt to new environments, such as dry and aquatic habitats. Similarly, Jabbari and Bernardi [34] have suggested that the larger amount of GC base pairs in the genome of vertebrates has an important role in their adaptations to various environments.
Ceratocephala and Myosurus are sister to each other, and contain 79 and 76 plastid protein-coding genes, respectively (Table S1). Compared to other Ranunculeae genera, Ceratocephala added one infA and lost two introns of clpP and ψycf1, whereas in the Myosurus plastome, rps16 was lacking, rpoC1 and rpl16 each lost an intron, and clpP was changed into ψclpP (Fig. 2; Table S3). Variations of plastomic gene contents, including loss, addition, and pseudogenization are often related to environmental selection pressure. For example, in the plastome of Azadirachta indica (Meliaceae), the ten ndh genes (ndhA, ndhC, ndhD, ndhE, ndhF, ndhG, ndhH, ndhI, ndhJ, and ndhK) were lost and ndhB became a pseudogene, which might be correlated with holoparasitic habit and lower light habitats [35]. Whether the variations of plastomic gene contents in Ceratocephala and Myosurus are associated with their respective dry and aquatic habitats need to be studied in the future.
Plastomic genes under positive selection in Ceratocephala and Myosurus
A total of eleven genes are under positive selection in the plastomes of Ceratocephala and Myosurus (Fig. 3). Among them, the psbJ and rps18 genes are positively selected both in Ceratocephala and Myosurus, suggesting that these two genes might have been related to the colonization of new environments by the MRCA of these two genera, and thereby to the divergence from their mesophytic ancestor. In particular, the seven genes (clpP, rps7, rps12, rps14, rps16, rps19, and ycf2) are only found to have signals of positive selection in the Ceratocephala plastome, implying that these seven genes might have been associated with the adaptation of Ceratocephala to arid environments. Zhong et al. [36] also found that the ycf2 was gene under positive selection in Helianthus tuberosus (Asteraceae), which grows in saline, alkaline and dry conditions as a widely cultivated plant in Northwest China. The two genes (rpl32 and ycf4) are only found to have signals of positive selection in the Myosurus plastome, implying that they might have been associated with the adaptation of Myosurus to aquatic environments. In Nicotiana tabacum (Solanaceae), the ycf4 gene has been document to be essential for transcriptional gene regulation and plant photoautotrophic growth [37].
Our divergence time estimation shows that Ceratocephala and Myosurus originated in the late Eocene (35.53 Ma, 95% HPD: 25.44–44.99) and diverged in the late Oligocene (28.27 Ma, 95% HPD: 23.08–37.28; Fig. 3). Since the late Eocene, dramatic changes in global temperature and/or precipitation occurred at least during five periods (Fig. 3) [23, 38]: the Eocene–Oligocene Transition (EOT; ca. 34 Ma) [39], the Oligocene–Miocene boundary (OMB; ca. 23.03 Ma) [40], the onset of the Miocene Climatic Optimum (oMCO; ca. 17 Ma) [41], the mid Miocene Climate Transition (mMCT; ca. 13.9 Ma) [42], and the first major glacial event in the Northern Hemisphere (M2; ca. 3.3 Ma) [23]. Our results indicate that among the eleven genes under positive selection, the four genes (rpl32, rps7, rps18, and ycf4) showed strong signals of episodic positive selection (P < 0.05; Table S8). Importantly, we identified the three genes (rpl32, rps7, and ycf4) showing an increase in the rate of variation at about 3.3 Ma, a timing coinciding with one of the five periods (Fig. 3). These three genes belong to housekeeping genes: rpl32 is a ribosome large subunit gene, rps7 is a ribosome small subunit gene, and ycf4 is a photosystem assembly factor.
The change through time plot indicates that for Ceratocephala, the rate of variation of the rps7 gene increased around 3.3 Ma (Fig. 3), a timing when the first major glacial event in the Northern Hemisphere occurred and accordingly global temperature dropped drastically [23]. During this period, the northeastern and southeastern Tibet Plateau experienced a rapid uplift and outward growth [43, 44]. These events might have resulted in the increasing aridification in central Eurasia. Paleoclimate modeling indicates that the precipitation markedly decreased at 4 Ma [38]. Moreover, the Central Asian arid region was beyond the scope of the Asian monsoon precipitation [45]. Thus, the cooling and aridification in Central Asia during the M2 might have promoted the rps7 gene to arise new favorable mutations in xerophytic and annual Ceratocephala, which could further be fixed by natural selection in arid environments.
For Myosurus, both the rpl32 and ycf4 genes showed an increase in the rate of variation also probably at about 3.3 Ma (Fig. 3). In the last 5 Ma, highly frequent oscillations in sea-level occurred [46], including at least 58 rapid rises over 40 m [47, 48]. Frequent sea-level fluctuations, as well as global cooling (Fig. 3), might have led to annual Myosurus to adapt to changing aquatic habitats. Thus, the rpl32 and ycf4 genes could have evolved rapidly in the Pliocene and further fixed favorable mutations, possibly in association with the adaptation to wet environments.
Conclusions
In this study, we de novo assembled the complete plastome sequences of five species from Ceratocephala, Ficaria, and Myosurus, and updated the annotation of plastomes for seven other Ranunculeae species. Plastid phylogenomic analysis strongly supports a sister relationship between xerophytic Ceratocephala and hydrophytic Myosurus. We discover that the plastome sizes of Ceratocephala and Myosurus are smaller than that of other mesophytic Ranunculeae, mainly due to the contraction of the IR region. The addition, loss and pseudogenization of plastid genes were found in these two genera. Eleven plastid genes showed positive selection in Ceratocephala and Myosurus. Among them, four genes showed strong signals of episodic positive selection. Importantly, the rate of variation of the rps7 gene in Ceratocephala increased around 3.3Â Ma, possibly associated with the adaptation to dry habitats owing to the cooling and aridification in Central Asia; the rpl32 and ycf4 genes in Myosurus showed an increase in the rate of variation at about 3.3Â Ma, possibly associated with the adaptation to aquatic habitats resulting from sea-level fluctuations and global cooling. Our integration of phylogenomic, comparative genomic, and selection pressure analyses can be used to explore adaptive evolution of plastomes of other plant groups to specific habitats.
Materials and methods
Sample sequencing, data assembly and annotation
We sampled three species of Ceratocephala and two of Myosurus. The other three mesophytic genera (Ficaria, Halerpestes, Oxygraphis) and four mesophytic species of Ranunculus in Ranunculeae were also included (Table S1). Based on the results of previous studies [22, 24], we selected Anemone tomentosa of Anemoneae, sister to Ranunculeae, as outgroup. The samples of Ceratocephala orthoceras, C. testiculata, Myosurus apetalus, M. minimus, and F. verna were newly collected from Altay (Xinjiang, China), Huocheng (Xinjiang, China), Nevada (USA), Malkow (Poland), Wieliczka (Poland), respectively, and were deposited in Herbarium, Institute of Botany, Chinese Academy of Sciences, Beijing (PE). Their formal identification was undertaken by Wei Wang and Andrey S. Erst. No specific permissions or licenses were required for our collections and experiments.
Genomic DNA of the five species was extracted from silica gel-dried leaves or herbarium specimens and purified using the Tiangen Isolation/Extraction/Purification Kit (Tiangen Biotech (Beijing) Co., Ltd.). Short insert of 300–500 bp libraries were prepared for sequencing on the Illumina HiSeq X-Ten platform.
The plastome was de novo assembled using GetOrganelle v1.7.6.1 [49] and was annotated by PGA [50] with the plastomes of Ceratocephala, Halerpestes, Oxygraphis, and R. macranthus as references. We used OGDRAW v1.3.1 [51] to visualize the circular plastome map with subsequent manual editing. The amino acid sequences of 78 plastid protein-coding gene regions were extracted and each was aligned in MAFFT v7 [52]. DNA sequences were then aligned using PAL2NAL v14 [53]. We also updated the annotation of plastomes for the other seven species sampled in the study. We used the program DNAsp v6.12.03 [54] to analyze the synonymous codon usage of 78 protein-coding genes by calculating the values of GC3s.
Phylogenetic analyses and divergence time estimation
The maximum likelihood (ML) analysis was performed using RAxML v8.2.12 [55] with 1,000 replicates under the GTRGAMMA model. The analysis was carried out based on the concatenation of coding regions of 78 protein-coding genes.
Divergence times were estimated in BEAST v2.1.3 [56]. Fossil achenes of Myosurus sp. were found from the Oligocene [57]. We set a 23.03 Ma constraint for the split between Ceratocephala and Myosurus, with a lognormal distribution. The offset (minimum age constraint) was set to be equal to the age of the fossil. The 95% upper bound of the distribution (soft maximum) was set by adjusting the standard deviation with 1.25. We also used two calibration points, taking ages estimated in the recent broader study of Ranunculeae [22]: 49.4 Ma (95% HPD: 42.71–55.60) for the crown group age of Ranunculeae and 68.95 Ma (95% HPD: 60.01–74.64) for the root age. Prior normal distributions were assigned for these two calibrations with standard deviations of 3.5 and 3.7, respectively. We used an uncorrelated lognormal relaxed clock model of rate variation across branches, a Yule prior, and the GTR model for each gene partition separately.
Parameters were estimated using four independent runs of 100,000,000 Markov chain Monte Carlo (MCMC) generations each, with sampling every 2,000 generations. Convergence was evaluated in Tracer v1.7.1 [58]. After a burn-in of 25%, we used TreeAnnotator v2.1.2 [56] to generate the maximum clade credibility (MCC) tree with mean ages and 95% HPD intervals on nodes.
Plastomic variation analysis
To assess the expansion/contraction of the IR regions, we compared the boundaries between the SC/IR and their adjacent genes by IRscope [59], and then used the results to further manually modify the annotations. To further detect the location of structural variation, we used snpEff v4.3 [60] to conduct functional annotations for the nucleotide variations in different genera distributed in different habitats.
Positive selection analyses
We used the CODEML program in PAML v4.9 [61] to infer positive selection for 78 protein-coding genes. Pseudogenes and partial genes were excluded. Changes in the selective regime can be detected by calculating the ratio of nonsynonymous (dN) substitutions to synonymous (dS) substitutions of each plastid protein-coding gene, ω (dN/dS). We compared two branch models, H0: the one-ratio model (m0) that assumes to evolve under the same ω ratio for all branches in the phylogeny, and HA: the free-ratio model (m2) for the alternative model that assumes three independent ω ratio in different habitats. We used likelihood ratio tests to test each model’s fit.
Investigating trait changes through time
We selected the 11 genes that were under positive selection (ω > 1) under the optimal model [62]. We coded the different amino acid sequences as characteristic states for Ceratocephala and Myosurus to infer when the genes were positively selected. To avoid the high number of species in Ranunculus to have an influence on the changes through time plots, a single species only was coded in Ranunculus as outgroup. We applied the fitDiscrete function in the R package ‘geiger’ [63] to decide which likelihood models (ER, SYM, or ARD) should be used. Simulated stochastic character maps were obtained using the make.simmap function in the R package ‘phytools’ under the optimal likelihood model [64]. Then we used sampled stochastic map character histories to generate ‘changes through time’ plots showing the mean number of changes and the mean rate of changes per time unit.
We further applied the ‘phytools’ function make.era.map to generate and fit models in which the rate of change of the composite trait varied between different times on the reference tree. Based on Westerhold et al. [23], temporal boundaries were chosen: (1) the EOT (ca. 34 Ma), (2) the OMB (ca. 23.03 Ma), (3) the oMCO (ca. 17 Ma), (4) the mMCT (ca. 13.9 Ma), (5) the first major glacial event in the Northern Hemisphere (M2; ca. 3.3 Ma). By examining the three models (ARD, ER, SYM) and analyzing the significant differences, we selected when the rate of variation in genes changed. The likelihood ratio test (LRT) was used to compare different heterogeneous rate models for each of the genes under positive selection.
Availability of data and materials
All sequences in this study are available in the National Center for Biotechnology Information (NCBI) (https://www.ncbi.nlm.nih.gov/nuccore/), with GenBank accession numbers (PP155434–PP155438) shown in Table S1.
Abbreviations
- ARD:
-
All-rates-different likelihood model
- BS:
-
Bootstrap support
- EOT:
-
Eocene–Oligocene Transition
- ER:
-
Equal-rates likelihood model
- GC3s:
-
GC-content at third codon position
- HPD:
-
Highest posterior density
- IRs:
-
Inverted repeats
- LRT:
-
Likehood Ratio Test
- oMCO:
-
Onset of Miocene Climate Optimum
- LSC:
-
Long single copy
- M2:
-
The first major glacial event in the Northern Hemisphere
- MCC:
-
Maximum clade credibility
- MCMC:
-
Markov chain Monte Carlo
- ML:
-
Maximum likelihood
- OMB:
-
Oligocene–Miocene boundary
- SSC:
-
Small single copy
- SYM:
-
Symmetrical likelihood model
- ca.:
-
Circa
- mMCT:
-
Middle Miocene Climate Transition
References
Gray MW. The evolutionary origins of organelles. Trends Genet. 1989;5:294–9.
Green BR. Chloroplast genomes of photosynthetic eukaryotes. Plant J. 2011;66:34–44.
Palmer JD. Chloroplast DNA evolution and biosystematic uses of chloroplast DNA variation. Am Nat. 1987;130:S6-29.
Sun YX, Moore MJ, Meng AP, Soltis PS, Soltis DE, Li JQ, et al. Complete plastid genome sequencing of Trochodendraceae reveals a significant expansion of the inverted repeat and suggests a Paleogene divergence between the two extant species. PLoS One. 2013;8:e60429.
McNeal JR, Kuehl JV, Boore JL, de Pamphilis CW. Complete plastid genome sequences suggest strong selection for retention of photosynthetic genes in the parasitic plant genus Cuscuta. BMC Plant Biol. 2007;7:57.
Park S, An B, Park S. Reconfiguration of the plastid genome in Lamprocapnos spectabilis: IR boundary shifting, inversion, and intraspecific variation. Sci Rep. 2018;8:13568.
Cheng J, Zhao Z, Li B, Qin C, Wu Z, Trejo-Saavedra DL, et al. A comprehensive characterization of simple sequence repeats in pepper genomes provides valuable resources for marker development in Capsicum. Sci Rep. 2016;6:18919.
Wicke S, Schneeweiss GM, dePamphilis CW, Muller KF, Quandt D. The evolution of the plastid chromosome in land plants: Gene content, gene order, gene function. Plant Mol Biol. 2011;76:273–97.
Peng HW, Lian L, Zhang J, Erst AS, Wang W. Phylogenomics, plastome degradation and mycoheterotrophy evolution of Neottieae (Orchidaceae), with emphasis on the systematic position and Loess Plateau-Changbai Mountains disjunction of Diplandrorchis. BMC Plant Biol. 2022;22:507.
Scobeyeva VA, Artyushin IV, Krinitsina AA, Nikitin PA, Antipin MI, Kuptsov SV, et al. Gene loss, pseudogenization in plastomes of genus Allium (Amaryllidaceae), and putative selection for adaptation to environmental conditions. Front Genet. 2021;12:674783.
Gao LZ, Liu YL, Zhang D, Li W, Gao J, Liu Y, et al. Evolution of Oryza chloroplast genomes promoted adaptation to diverse ecological habitats. Commun Biol. 2019;2:278.
Hu S, Sablok G, Wang B, Qu D, Barbaro E, Viola R, et al. Plastome organization and evolution of chloroplast genes in Cardamine species adapted to contrasting habitats. BMC Genomics. 2015;16:306.
Fan WB, Wu Y, Yang J, Shahzad K, Li ZH. Comparative chloroplast genomics of Dipsacales species: Insights into sequence variation, adaptive evolution, and phylogenetic relationships. Front Plant Sci. 2018;9:689.
Zhou T, Ruhsam M, Wang J, Zhu H, Li W, Zhang X, et al. The complete chloroplast genome of Euphrasia regelii, pseudogenization of ndh Genes and the phylogenetic relationships within Orobanchaceae. Front Genet. 2019;10:444.
Anderson J, Song BH. Plant adaptation to climate change—Where are we? J Syst Evol. 2020;58:533–45.
Raman G, Park SJ. The complete chloroplast genome sequence of the Speirantha gardenii: comparative and adaptive evolutionary analysis. Agronomy. 2020;10:1405.
Zecca G, Panzeri D, Grassi F. Detecting signals of adaptive evolution in grape plastomes with a focus on the cretaceous-palaeogene (K/Pg) transition. Ann Bot. 2022;130:965–80.
Ziman SN, Keener CS. A geographical analysis of the family ranunculaceae. Ann Mo Bot Gard. 1989;76:1012–49.
Tamura M. Ranunculaceae. In: Hiepko P, editor. Die NatürlichenPflanzenfamilien, vol. 17a IV. Berlin: Duncker and Humblot; 1995. p. 223–555.
Emadzade K, Lehnebach C, Lockhart P, Hörandl E. A molecular phylogeny, morphology and classification of genera of Ranunculeae (Ranunculaceae). Taxon. 2010;59:809–28.
Wang W, Li HL, Xiang XG, Chen ZD. Revisiting the phylogeny of Ranunculeae: Implications for divergence time estimation and historical biogeography. J Syst Evol. 2014;52:551–65.
Wang W, Lin L, Xiang XG, Ortiz Rdel C, Liu Y, Xiang KL, et al. The rise of angiosperm-dominated herbaceous floras: Insights from Ranunculaceae. Sci Rep. 2016;6:27259.
Westerhold T, Marwan N, Drury AJ, Liebrand D, Agnini C, Anagnostou E, et al. An astronomically dated record of Earth’s climate and its predictability over the last 66 million years. Science. 2020;369:1383–7.
Zhai W, Duan X, Zhang R, Guo C, Li L, Xu G, et al. Chloroplast genomic data provide new and robust insights into the phylogeny and evolution of the Ranunculaceae. Mol Phylogenet Evol. 2019;135:12–21.
Han S, Wang R, Hong X, Wu C, Zhang S, Kan X. Plastomes of Bletilla (Orchidaceae) and phylogenetic implications. Int J Mol Sci. 2022;23:10151.
Simmonds SE, Smith JF, Davidson C, Buerki S. Phylogenetics and comparative plastome genomics of two of the largest genera of angiosperms, Piper and Peperomia (Piperaceae). Mol Phylogenet Evol. 2021;163:107229.
Tong R, Gui C, Zhang Y, Su N, Hou X, Liu M, et al. Phylogenomics, plastome structure and species identification in Mahonia (Berberidaceae). BMC Genomics. 2022;23:766.
Kim KJ, Lee HL. Complete chloroplast genome sequences from Korean ginseng (Panax schinseng Nees) and comparative analysis of sequence evolution among 17 vascular plants. DNA Res. 2004;11:247–61.
Zhang Y, Du L, Liu A, Chen J, Wu L, Hu W, et al. The complete chloroplast genome sequences of five Epimedium species: Lights into phylogenetic and taxonomic analyses. Front Plant Sci. 2016;7:306.
Raman G, Nam GH, Park S. Extensive reorganization of the chloroplast genome of Corydalis platycarpa: a comparative analysis of their organization and evolution with other Corydalis plastomes. Front Plant Sci. 2022;13:1043740.
Gao C, Deng Y, Wang J. The complete chloroplast genomes of Echinacanthus species (Acanthaceae): phylogenetic relationships, adaptive evolution, and screening of molecular markers. Front Plant Sci. 2019;9:1989.
Vinogradov AE. DNA helix: The importance of being GC-rich. Nucleic Acids Res. 2003;31:1838–44.
Wan T, Liu ZM, Leitch IJ, Xin HP, Maggs-Kölling G, Gong YB, et al. The Welwitschia genome reveals a unique biology underpinning extreme longevity in deserts. Nat Commun. 2021;12:4247.
Jabbari K, Bernardi G. Body temperature and evolutionary genomics of vertebrates: a lesson from the genomes of Takifugu rubripes and Tetraodon nigroviridis. Gene. 2004;333:179–81.
Chen J, Yu R, Dai J, Liu Y, Zhou R. The loss of photosynthesis pathway and genomic locations of the lost plastid genes in a holoparasitic plant Aeginetia indica. BMC Plant Biol. 2020;20:199.
Khan MS, Riaz R, Majid M, Mehmood K, Mustafa G, Joyia FA. The tobacco chloroplast YCF4 gene is essential for transcriptional gene regulation and plants photoautotrophic growth. Front Plant Sci. 2022;13:1014236.
Zhong Q, Yang S, Sun X, Wang L, Li Y. The complete chloroplast genome of the Jerusalem artichoke (Helianthus tuberosus L.) and an adaptive evolutionary analysis of the ycf2 gene. Peer J. 2019;7:e7596.
Farnsworth A, Lunt DJ, Robinson SA, Valdes PJ, Roberts WHG, Clift PD, et al. Past East Asian monsoon evolution controlled by paleogeography, not CO2. Sci Adv. 2019;5:eaax1697.
Sun J, Windley BF. Onset of aridification by 34 ma across the eocene-oligocene transition in Central Asia. Geology. 2015;43:1015–8.
Kim B, Zhang YG. Methane hydrate dissociation across the Oligocene-miocene boundary. Nat Geosci. 2022;15:203–9.
Holbourn A, Kuhnt W, Kochhann KGD, Andersen N, Meier KJS. Global perturbation of the carbon cycle at the onset of the miocene climatic optimum. Geology. 2015;43:123–6.
Flower BP, Kennett JP. The middle Miocene climatic transition: East Antarctic ice sheet development, deep ocean circulation and global carbon cycling. Palaeogeogr Palaeoclimatol Palaeoecol. 1994;108:537–55.
An ZS, Zhang PZ, Wang EC, Wang SM, Qiang XK, Li L, Zhou WJ. Changes of the monsoon-arid environment in China and growth of the Tibetan Plateau since the Miocene. Quatern Sci. 2006;26:678–93.
Li J, Zhou S, Zhao Z, Zhang J. The Qingzang movement: The major uplift of the Qinghai-Tibetan Plateau. Sci China Earth Sci. 2015;58:2113–22.
Ao H, Rohling EJ, Zhang R, Roberts AP, Holbourn AE, Ladant JB, et al. Global warming-induced Asian hydrological climate transition across the Miocene-Pliocene boundary. Nat Commun. 2021;12:6935.
Miller KG, Browning JV, Schmelz WJ, Kopp RE, Mountain GS, Wright JD. Cenozoic sea-level and cryospheric evolution from deep-sea geochemical and continental margin records. Sci Adv. 2020;6:eaaz1346.
Miller KG, Kominz MA, Browning JV, Wright JD, Mountain GS, Katz ME, Sugarman PJ, Cramer BS, Christie-Blick N, Pekar SF. The phanerozoic record of global sea-level change. Science. 2005;310:1293–8.
Woodruff DS, Turner LM. The Indochinese-Sundaic zoogeographic transition: a description and analysis of terrestrial mammal species distributions. J Biogeogr. 2009;36:803–21.
Jin JJ, Yu WB, Yang JB, Song Y, dePamphilis CW, Yi TS, et al. GetOrganelle: a fast and versatile toolkit for accurate de novo assembly of organelle genomes. Genome Biol. 2020;21:241.
Qu XJ, Moore MJ, Li DZ, Yi TS. PGA: A software package for rapid, accurate, and flexible batch annotation of plastomes. Plant Methods. 2019;15:50.
Greiner S, Lehwark P, Bock R. OrganellarGenomeDRAW (OGDRAW) version 1.3.1: Expanded toolkit for the graphical visualization of organellar genomes. Nucleic Acids Res. 2019;47:W59-64.
Katoh K, Toh H. Parallelization of the MAFFT multiple sequence alignment program. Bioinformatics. 2010;26:1899–900.
Suyama M, Torrents D, Bork P. PAL2NAL: Robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609-612.
Rozas J, Ferrer-Mata A, Sanchez-DelBarrio JC, Guirao-Rico S, Librado P, Ramos-Onsins SE, et al. DnaSP 6: DNA sequence polymorphism analysis of large data sets. Mol Biol Evol. 2017;34:3299–302.
Stamatakis A. RaxML version 8: A tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.
Bouckaert R, Heled J, Kuhnert D, Vaughan T, Wu CH, Xie D, et al. BEAST 2: A software platform for Bayesian evolutionary analysis. PloS Comput Biol. 2014;10:e1003537.
Mai DH, Walter H. Die Floren der Haselbacher Serie im Weißelster-Becken (Bezirk Leipzig, DDR). Abh Staatl Mus Mineral Geol Dresden. 1978;28:1–200.
Rambaut A, Drummond AJ, Xie D, Baele G, Suchard MA. Posterior summarization in Bayesian phylogenetics using Tracer 1.7. Syst Biol. 2018;67:901–4.
Amiryousefi A, Hyvönen J, Poczai P. IRscope: An online program to visualize the junction sites of chloroplast genomes. Bioinformatics. 2018;34:3030–1.
Cingolani P, Platts A, le Wang L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly. 2012;6:80–92.
Yang Z. PAML 4: Phylogenetic analysis by maximum likelihood. Mol Biol Evol. 2007;24:1586–91.
Anisimova M, Bielawski JP, Yang Z. Accuracy and power of Bayes prediction of amino acid sites under positive selection. Mol Biol Evol. 2002;19:950–8.
Pennell MW, Eastman JM, Slater GJ, Brown JW, Uyeda JC, FitzJohn RG, et al. geiger v2.0: An expanded suite of methods for fitting macroevolutionary models to phylogenetic trees. Bioinformatics. 2014;30:2216–8.
Revell LJ. Phytools: An R package for phylogenetic comparative biology (and other things). Methods Ecol Evol. 2012;3:217–23.
Acknowledgements
We sincerely thank Florian Jabbour for carefully reading the manuscript and helpful suggestions.
Funding
This research was partially funded by the National Key Research and Development Program of China (2023YFF0805800), the National Natural Science Foundation of China (32361133549, 31770231, 31770233, and 32011530072), the Beijing Natural Science Foundation (5244047), and K.C. Wong Education Foundation (GJTD-2020–05) to W.W. Specimen collection and identification were partially supported by the Russian Science Foundation (24–44-00027) and the state assignments performed by the CSBG SB RAS (AAAA-A21-121011290024–05) to A.S.E.
Author information
Authors and Affiliations
Contributions
WW and KLX planned and designed the research; HWP, ASE and WW collected samples; LJ, WCH and KLX analyzed the data; LJ, WW and KLX wrote the paper. All authors read and approved the manuscript.
Corresponding authors
Ethics declarations
Ethics approval and consent to participate
We complied with all relevant institutional, national and international guide lines with permissions from Institute of Botany, Chinese Academy of Sciences. No materials from animal or human were used in this research.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
About this article
Cite this article
Long, J., He, WC., Peng, HW. et al. Comparative plastome analysis of the sister genera Ceratocephala and Myosurus (Ranunculaceae) reveals signals of adaptive evolution to arid and aquatic environments. BMC Plant Biol 24, 202 (2024). https://doi.org/10.1186/s12870-024-04891-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12870-024-04891-2