De novo assembly of the carrot mitochondrial genome using next generation sequencing of whole genomic DNA provides first evidence of DNA transfer into an angiosperm plastid genome
© Iorizzo et al.; licensee BioMed Central Ltd. 2012
Received: 23 December 2011
Accepted: 1 May 2012
Published: 1 May 2012
Sequence analysis of organelle genomes has revealed important aspects of plant cell evolution. The scope of this study was to develop an approach for de novo assembly of the carrot mitochondrial genome using next generation sequence data from total genomic DNA.
Sequencing data from a carrot 454 whole genome library were used to develop a de novo assembly of the mitochondrial genome. Development of a new bioinformatic tool allowed visualizing contig connections and elucidation of the de novo assembly. Southern hybridization demonstrated recombination across two large repeats. Genome annotation allowed identification of 44 protein coding genes, three rRNA and 17 tRNA. Identification of the plastid genome sequence allowed organelle genome comparison. Mitochondrial intergenic sequence analysis allowed detection of a fragment of DNA specific to the carrot plastid genome. PCR amplification and sequence analysis across different Apiaceae species revealed consistent conservation of this fragment in the mitochondrial genomes and an insertion in Daucus plastid genomes, giving evidence of a mitochondrial to plastid transfer of DNA. Sequence similarity with a retrotransposon element suggests a possibility that a transposon-like event transferred this sequence into the plastid genome.
This study confirmed that whole genome sequencing is a practical approach for de novo assembly of higher plant mitochondrial genomes. In addition, a new aspect of intercompartmental genome interaction was reported providing the first evidence for DNA transfer into an angiosperm plastid genome. The approach used here could be used more broadly to sequence and assemble mitochondrial genomes of diverse species. This information will allow us to better understand intercompartmental interactions and cell evolution.
Keywordsde novo assembly Next generation sequencing Mitochondrial genome Plastid genome Assembly structure visualization Daucus carota Intercompartmental DNA transfer
To date, 23 mitochondrial genomes in seed plants have been fully sequenced and analyzed http://www.ncbi.nlm.nih.gov/Genomes/. These mitochondrial genomes are extremely variable in size, ranging from 221 kb (Brassica napus) to 2,740 kb (Cucumis melo). Sequence analysis revealed that the most abundant portion of the mitochondrial genomes is non-coding , which includes “promiscuous” DNA of plastid and nuclear origin , as well as sequences of horizontal origin from foreign genomes [3–6]. Structural analysis, through use of Southern hybridization or paired-end data, revealed a high frequency of intra- and intermolecular recombination due to accumulation of repetitive sequences. This process has generated a structurally dynamic assemblage of genome configurations within a species [7–9] and a scrambling of gene order within closely related species . This dynamic organization of the plant mitochondrial genome provides a powerful model for the study of genome structure and evolution. In addition, the increasing availability of plant organelle and nuclear genome sequence data provides an understanding of the mechanisms driving plant genome evolution. Indeed, there is a strong structural and functional interaction among plastid, mitochondrial, and nuclear genomes [11, 12]. Transfer of DNA among these three compartments in higher plants has been reported, with exception of transfer into the plastid genome [13, 14].
Despite its importance, technical obstacles of DNA isolation and sequence assembly limit the sequencing of mitochondrial genomes. Conventional approaches to mitochondrial genome sequencing involve extraction and enrichment of mitochondrial DNA, cloning, and sequencing. Large repeats and the dynamic mitochondrial genome organization complicate sequence assembly.
The development of next generation sequencing technologies (NGS), such as the Roche and Illumina platforms, provides a new opportunity for rapid characterization of mitochondrial genomes. Non-enriched whole genome DNA libraries, both shotgun and paired-end, include plastid and mitochondrial DNA that is sequenced along with the nuclear DNA during the sequencing run. NGS technologies have already been used for sequencing the small mitochondrial genome of nematodes [15, 16], human  and fish  with no library enrichment. Recently, sequencing data from non-enriched libraries has been successfully used to assemble plastid genomes of wild and domesticated rice, mung bean, date palm, and milkweed [19–22]. The major limitations for use of this approach on de-novo assembly of mitochondrial genomes are the ability to overcome assembly problems related to large repeat regions, presence of promiscuous DNA, and sequence ambiguity due to sequencing technologies.
The aim of this study was to demonstrate how next generation sequence data from total genomic DNA can be used to de-novo assemble the mitochondrial genome of carrot (Daucus carota). In addition, intergenic sequence analysis provides evidence of a rare transfer of DNA into the plastid genome. This is the first report of mitochondrial genome transfer into an angiosperm plastid genome. The strategy used in this study has broad application to explore more mitochondrial genomes, to further investigate intra-cellular genome interaction and genome evolution.
Summary of assemblies and consensus sequences of 454 whole genome sequences (WGS), plastid sequences (pt) and mitochondrial sequences (mt)
# of reads
# of contigs
# of contigs
# of contigs
Mitochondrial assemblies were based on contig connections. In order to visualize these contig connections we developed and used bb.454contignet [http://www.vcru.wisc.edu/simonlab/sdata/software/]. The tool allows visualization of connections between gsAssembler contigs, along with contig size and average read coverage. This visualization allowed us to establish single and repeat contigs.
In the case of sequences of ambiguous source, common to both mitochondrial and plastid genomes, we determined the true mitochondrial-version of these common sequences. We amplified and sequenced three regions with size of 1,298, 1,545 and 2,257 nt using the Sanger method. PCR results confirmed the size of the expected amplicons and these sequences were then incorporated into the mitochondrial assembly.
Corrections and error rate
Illumina read mapping and corrections
Number of homopolymers
Cumulative homopolymers length (nt)
Number of errorsa
Cumulative errors length (nt)
Percent of errorb
In addition to homopolymers, we also looked for ambiguity due to SNPs or single base insertions/deletions. Overall, no ambiguities were detected resulting in a final assembly of 281,132 nt.
Assemblies of mitochondrial and plastid genomes in this study were carried out using 454 sequence data from whole genome libraries (see Materials and Methods). In order to evaluate plastid and mitochondrial genome coverage obtained from all sets of shotgun 454 and Illumina reads used in this study, we mapped sequences against the assembled plastid and mitochondrial genomes using GNUMAP .
Mapping of five sets of 454 shotgun reads (813,770, 814,668, 771,864, 704,918 and 692,688 reads) gave a coverage ranging from 123× to 173× for plastid, and from 16× and 23× for mitochondrial genomes (Additional file 3, Table S1). Overall, 249,302 and 50,802 reads mapped in the plastid and mitochondrial genome suggesting that about 6.5% and 1.3% of the sequenced DNA was of plastid and mitochondrial origin, respectively.
For Illumina sequences, out of 51 million reads (corresponding to a 10× nuclear genome coverage), 6,949,870 sequences were successfully mapped to the plastid genome and 1,365,825 sequences were mapped to the mitochondria giving a 4,385x and 452x coverage, respectively (Additional file 3: Table S1). This suggests that about 13.7% and 2.7% of the sequenced DNA was of plastid and mitochondrial origin, respectively.
Analysis of GC content across the genomes indicate that in regions where the GC content is high the Illumina read coverage is reduced (Additional file 4: Figure S3 A1, A2). By contrast, high GC content did not affect 454 read coverage (Additional file 4: Figure S3 B1, B2).
These results confirmed that unenriched whole genome sequencing is a practical approach for de novo assembly of higher plant plastid and mitochondrial genomes.
Southern blot analysis
Genome size and content
Summary of mitochondrial genome content
tRNA coding genes
rRNA coding genes
A BLAST search against a local mitochondrial protein and nucleotide database identified 44 putatively functional protein-coding genes and 3 genes coding for ribosomal RNAs (Additional file 6: Table S2). For all of these genes, open reading frames (ORFs) were detected, confirming the quality of the consensus sequence. Distribution of genes is illustrated in Figure 4. As in other angiosperm mitochondrial genomes, most of the known genes encode for proteins of the electron transport chain, with 9 subunits of complex I (nad1 2 3 4 4 L 5 6 7 and 9), one subunit of complex III (cob), three subunits of complex IV (cox1 2 and 3), five subunits of complex V (atp1 4 6 8 and 9) and four genes involved in the biogenesis of cytochrome c (ccmB ccmC ccmFc and ccmFn). Truncated copies of atp1 and atp9 were detected, confirming observations previously reported by [29, 30]. In addition, two conserved ORFs, coding for a maturase (matR) mapped within the nad1 intron and another ORF named mttB coding for a transporter protein, were detected.
The number of ribosomal and succinate dehydrogenase proteins is usually variable among different species (Additional file 6: Table S2). Annotation of the carrot mitochondrial genome allowed identification of 9 putatively functional genes coding for ribosomal proteins (rpl5, rpl10, rpl16, rps1, rps3, rps4, rps7, rps12 and rps13) of mitochondrial origin and one, rpl2 of plastid origin. The genome appears to lack functional copies of the succinate dehydrogenase genes sdh3 and sdh4.
Because they are in repeat regions, one gene (atp8) is duplicated, and six genes occur in triplicate (ccmB, cox3, nad3, rpl5, rps1 and rps12). Detection of ORFs allowed identification of 19 group II introns, 7 of which are trans-spliced.
tRNA content of carrot mitochondrial genome
trnF(gaa) x3 b
trnfM(cau) x2, trnM (cau) x2, trnM(cau)d
In order to investigate the presence of the seven missing mitochondrial genes (sdh3, sdh4, rpl2, rps2, rps10, rps14 and rps19) in the nuclear genome we aligned the corresponding sequences from the mitochondrial genome of the species most closely related to carrot (see Materials and Methods) against a local database containing all the assembled sequences and raw reads used in this study. For three genes, sdh3, rps10 and rps14, a corresponding contig with conserved ORFs was detected in the nuclear genome (data not shown). None or insignificant matches were identified for the remaining four genes, sdh4, rpl2, rps2 and rps19 suggesting that these genes were not present in this dataset.
After annotation of conserved genes, we searched for other ORFs and plastid regions in the intergenic sequences and ORFs longer than 300 nt were annotated (Figure 4, orf26–60). In total, 35 ORFs ranging from 300 to 1,122 nt in length were detected (Figure 4). Out of these, 3 ORFs were conserved among mitochondrial genomes, with the best match of orf58 to orf147 of Nicotiana tabacum, of orf59 to orf115b of Brassica napus and of orf60 to orf122 of Beta vulgaris. Out of the remaining ORFs, 28 (orf26–53) had unknown function, and 4 (orf 54–57) were similar to truncated copies of hypothetical proteins from Vitis vinifera and Arabidopsis thaliana nuclear genes.
BLAST analysis of intergenic sequences against a local database containing all published plant plastid genomes allowed detection of 16 fragments with similarity to plastid sequences, with length ranging from 45 to 1,298 nt, accounting for 5,701 nt (2.0% of the genome) (Figure 4, green box).
Distribution of repeats in the carrot mitochondrial genome
Repeat length (nt)
Number of repeats
Genome coverage (%)
Evidence of DNA transfer into the plastid genome
Here we confirm the presence of this region in the carrot plastid genome in the inbred carrot line B493B analyzed in this study (Figure 5ADcMP region 1, 2, 3). In addition, analysis of intergenic sequences of the carrot mitochondrial genome revealed that this entire sequence is present in the mitochondrial genome as three separate pieces. Comparison of D. carota to Petroselium crispum [GenBank: NC 015821] revealed that the carrot plastid genome lacks a 339 nt sequence (Figure 5A, position 101,297-101,636 of P. crispum plastid genome) present in the P. crispum and other more taxonomically distant species because of the replacement with the DcMP region.
In order to clarify the origin of this region, we first designed primers specific of flanking regions 1, 2 and 3 of the plastid genome or region 3 of the mitochondrial genome. We then amplified and sequenced these regions in different Apiaceae species. PCR results revealed a consistent conservation of the region 3 in the mitochondrial genomes; in fact an expected amplicon of 1,935 nt was amplified across all species (Figure 5B, left). By contrast, a polymorphic pattern has been obtained when the plastid flanking sequences were used to anchor the primers (Figure 5B, right). A short fragment of 500 nt was amplified in all non Daucus species including parsley (Petroselium crispum), fennel (Foeniculum vulgare), celery (Apium graveolens), plains eryngo (Eryngium planum) and 400 nt in coriander (Coriandrum sativum). The only exception was for cumin (Cuminum cyminum) which has the same pattern as carrot. With the exception of D. aureus, D. crinitus and D. muricatus, where amplicons of about 2,000, 2,000 and 1,100 nt were obtained, respectively, other Daucus species and cumin produced the amplicon of expected size, 1,618 nt.
Sanger sequence of above described PCR products confirmed the presence of the large region 3 in the mitochondrial genome of all Daucus and non Daucus species. It also confirmed, the absence of the insertion of regions 1, 2 and 3 in the plastid genome of non Daucus species (Figure 5C). These sequences and its taxonomic distribution in Daucus and close relative cumin [64; see Discussion] provide strong evidence of a transfer of DNA into the plastome. In addition a few variants were observed. An insertion observed in the plastid DcMP region of wild species D. aureus, D. crinitus and D. glochidiatus is similar to a mitochondrial sequence contiguous to the DcMP region 3 (Figure 5 A and C, region 4). This suggests that this region now identified as region 4 was part of the original insertion in the plastid genome of the ancestral progenitor of carrot. Sequence analyses of the plastid D. murucatus DcMP region revealed that the shorter amplicon is due to a deletion of regions 1 and 2.
To investigate the sequence characteristics that might provide a possible explanation of this transfer in the carrot plastid genome, we aligned the plastid and mitochondrial sequences of regions 1, 2 and 3 against NCBI nucleotide and protein databases. The alignment demonstrated that region 1 was an intergenic sequence with no similarity to any other nucleotide or protein sequence present in databases. As previously reported by Goremykin and colleagues , region 2 is part of the cox1 gene. It is more interesting that blastx alignment of region 3 revealed multiple matches with nuclear retrotransposable elements from different species including Vitis vinifera [GenBank: CAN77047 and CAN79321] and Populus trichocarpa [GenBank: XP 002329638 and XP 002334157] (Additional file 7: Figure S5).
Implications for mitochondrial genome study
Next generation sequencing technologies have revolutionized molecular biology by making whole genome sequencing projects possible for any species. Low coverage whole genome shotgun sequencing has proven a valuable approach for rapid and easy acquisition of a large amount of sequence data at relatively low cost. Low coverage data is useful for marker acquisition as well as the assembly of plastid genomes [20, 21, 31–33]. Despite its power, few examples of the application of NGS on de novo assembly of plant mitochondrial genomes have been reported. Recently Straub and colleagues  attempted to assemble the milkweed mitochondrial genome using Illumina data from unenriched whole genome DNA. The assembly resulted in a partial mitochondrial genome assembly of 115 contigs. Here we demonstrate how 454 data from whole genome DNA library can be used for a complete de novo assembly of the mitochondrial genome of Daucus carota. Adequate coverage, sufficient read length, and application of a bioinformatics analysis using a tool such as bb.454contignet were crucial for the final assembly.
Our sequencing library was prepared from whole genome DNA. The fraction of reads of plastid and mitochondrial genome origin obtained from such a library can vary depending on the age of tissue, DNA extraction method, relative size of the organelle and nuclear genomes, and species . Adequate coverage is always necessary for a complete plastid and mitochondrial genome assembly, but this coverage likely can be obtained, even with a low nuclear genome coverage. The fact that we were near the lower limit can be seen from a comparison of the five assembly replicates, which suggests that if the coverage is too low, the complete assembly of the mitochondrial genome can begin to fail apparently due to isolated gaps in the sequence due to random chance.
In our study, leaf tissue was collected from plants without any pretreatment (such as darkness) that could reduce the amount of plastid or mitochondrial DNA. The same DNA extraction was sequenced by both platforms. We observed approximately twice the organellar fraction from Illumina sequencing, relative to 454. The interpretation of this observation is difficult as many factors could be involved. Here we observed that Illumina is more sensitive to regions with variable GC content than is 454. In a range of GC content of 20–50% other authors reported a positive correlation between GC contents and sequence coverage [35, 36]. Since organellar DNA is typically higher in GC content than nuclear DNA, we suggest this as one of possible cause of the higher representation of organellar fraction from Illumina sequencing.
Development of new bioinformatic tools is often a crucial point for interpreting DNA/RNA sequence information into biological information . Repeat regions are the primary difficulty in genome assembly, and the length of reads relative to the length of repeats is a key factor in the ability to assemble a genome . Reads from repeats longer than the read length could assemble to multiple possible sequences, some of which may not exist in vivo. These cannot be collapsed into a single unambiguous sequence, thereby causing breaks in contigs, where one end of a contig can then lead to two or more adjacent contigs which continue the sequence. Some assemblers such as gsAssembler make this connection information available. We developed and used bb.454contignet http://www.vcru.wisc.edu/simonlab/sdata/software/ to parse, filter, and then graphically visualize these contig connections. This tool was very useful for our small-scale assembly project, and it could be suitable for other applications, such as assembly of Bacterial Artificial Chromosome sequences, or specific regions in a whole genome assembly. As confirmation of the validity of our procedure, another group has successfully assembled another genome using this tool . A similar tool, ABySS-Explorer has been developed for assemblies generated by ABySS . However, our results demonstrate the utility of shotgun 454 sequences with a size of 300–500 nt, making the experiment cost efficient, and we confirmed the assembly by PCR, Southern blot and mapping 3 kb paired end data.
Full de novo assembly of the mitochondrial genome can be very difficult due to repeat regions as well as its dynamic organization due to frequent recombination events. Recent advances in Illumina and Roche 454 sequencing technologies have allowed increasing read length up to 150 and 1000 nt, respectively [http://www.illumina.com, http://my454.com]. In addition, both NGS technologies officially support paired sequence data of up to 5 kb for Illumina platform and up 20 kb for Roche 454 platform. Comparison of the distribution of the number and the length of the repeat sequences among sequenced mitochondrial genomes and the whole range of longest sequencing data achievable by both technologies demonstrates the power of NGS to overcome most of the assembly ambiguities due to repeats across these genomes. Moreover, use of long shotgun sequences alone, such as 454 reads, would resolve assembly ambiguity across a large portion of repeat regions detected among these mitochondrial genomes (Additional file 8: Figure S6). By contrast, use of short shotgun reads such as Illumina could still result in a high number of contigs and connections that could make the assembly unreliable. Paired sequence data can support and confirm assembly, and supply information about structural variants due to recombination across repeats. The ongoing development of third generation sequencing technologies such as PacBio [http://www.pacificbiosciences.com/], able to produce longer reads, will facilitate sequencing of new organelle genomes.
The usefulness of new organelle genome sequences will be reduced if their quality is reduced by sequence errors. Next generation sequencing technology is known to generate a sequence error rate higher than the traditional Sanger method . Most of these errors can be corrected with high sequence coverage. However, errors due to inaccurate determination of homopolymer length, particularly when ≥5 nt, are characteristic of pyrosequencing (Roche 454), and are not resolvable by higher coverage. These errors can result in an inaccurate sequence annotation if the error occurs in a reading frame. In our study, homopolymer ambiguity was detected in both coding and non coding sequences. Availability of Illumina sequences allowed us to correct these homopolymer errors as described above. After corrections, ORF detection confirmed contiguous reading frames of all of the genes predicted by BLAST alignment, an additional verification of the correctness of the finished sequences. Combining these two technologies for this type of correction has already been described . The high number (1,593) of homopolymer regions observed in this study would make verification of every homopolymer region by Sanger sequencing uneconomical. By contrast, the low number of ambiguities in coding regions  in such a small scale sequencing project would allow the use of a Sanger sequencing approach as an alternative cost-efficient way to correct ambiguity in those regions, and insure correct annotation of genes.
Carrot is the first mitochondrial genome sequenced among the large euasterid II clade, and only the second, along with N. tabacum, among asterids. The size of the genome (281,132 nt) is similar to a previous estimation (255,000 nt) made by Robinson and Wolyn  based on restriction digestion mapping. Sequence analysis revealed that the closest mitochondrial genome in terms of similarity was tobacco, confirming their phylogenetic relationship in the euasterid II clade. The gene content of the carrot mitochondrial genome confirms the previous report of Adams et al.  where the authors used Southern hybridization to survey mitochondrial gene presence or loss across 280 angiosperms. The major factor accounting for gene loss in the mitochondrial genomes is transfer to the nuclear genome. Presence or absence of mitochondrial genes in the nuclear genome have been reported for several species [45–48], including the transfer of rps10 and a lack of sdh3 and sdh4 in the nuclear genome of carrot. In contrast with these data, our results revealed the presence of a nuclear copy of sdh4 in the carrot genome. In addition, we confirmed the transfer of rps10 from the mitochondria into the nucleus and detected an additional nuclear copy of rps14. We also found a replacement of the original mitochondrial rpl2 with a copy of the plastid rpl2 in the mitochondria. The high level of similarity between the rpl2 nucleotide sequences in these two organellar genomes (99%) suggests a recent translocation. The lack of any sequence similar to the mitochondrial rpl2 gene in the nuclear genome suggests that the original mitochondrial version has been lost. According to Kubo and Arimura , the replacement of a mitochondrial gene by a duplicated plastid counterpart could be due to the complete loss of the gene in either mitochondrial or nuclear genomes. However, detection of the ORFs for nuclear sdh3 rps10 rps14 and plastid rpl2 suggests the possibility of functionality. Further work will be needed to determine whether these nuclear copies are expressed.
The structure of angiosperm mitochondrial genomes is frequently characterized by repeat sequences . The number and the size of these repeats is important, as they influence the size of the genome, and they are the sites of intragenomic recombination, underlining evolutionary changes in mitochondrial genome organization and structural dynamism in vivo[52, 53]. A high proportion of smaller repeat sequences explained (<1 kb) the size of the largest mitochondrial genome of Cucumis melo. In contrast, in maize , variable genome size among different genotypes is mainly caused by large (>1 kb) sequence duplication. The structure of the carrot mitochondrial genome presented here revealed that nearly half (46%) of the genome is composed of large repeated sequences. The fact that carrot has the least number of repeat sequences among sequenced mitochondrial genomes and the presence of four large repeats (1–14 kb), we deduced that the increase of the carrot mitochondrial genome size mainly occurred by large sequence duplication.
Intragenomic recombination is an active phenomenon in the mitochondrial genome of angiosperms [53, 54]. Recombination frequency seems to depend on the size of the repeats. Large size direct repeats (>1 kb) are associated with high frequency of homologous recombination producing sub genomic molecules observed in other mitochondrial genomes [53, 54]. Our Southern blot results provide evidence of recombination across a large repeat 4 (Figure 3), and the in vivo co-existence of sub-genomic circles. Smaller repeats, however, suggest very low recombination frequency, producing aberrant asymmetric recombinant molecules that are generally present at very low copy number and hard to detect [55, 56]. In this study, we tested a possible recombination across short repeats using next generation sequencing data from Roche 454 and Illumina platforms and our results did not support any recombination events (data not shown). It was shown recently that mitochondrial intergenomic recombination required stretches of similarity longer than 500 nt and remained under control of a nuclear gene msh1. Inactivation of msh1 lead to increased frequency of recombination and allowed recombination between repeats only 50 nt-long .
DNA insertion in the plastid genome
Intercompartmental (plastid, mitochondrion and nucleus) DNA migration is a known phenomenon in plant cell evolution. DNA transfer among these compartments resulted in a functional relocation of organellar genes in the early phase of organelle evolution [13, 56–60]. Theoretically, six intercompartmental DNA migrations are possible, and at least four have been observed in angiosperms (Additional file 9: Figure S7). Transfers of DNA from mitochondrion to nucleus and from plastid to mitochondrion are discussed above (see Introduction and Results section). DNA migration from plastid to nucleus has been described in Arabidopsis, soybean and other species [61, 62]. Nuclear DNA has been transferred to the mitochondrial genome of Cucumis melo and other species [10, 63]. In contrast, no evidence of transfer of DNA from nucleus or mitochondria to plastid has been reported in angiosperms . Smith  investigated the presence of mitochondrial DNA in the plastid genome of 42 species including 11 angiosperm, and found a complete absence of mtDNA like sequences in any of the plastid genome of these species. Recently Goremykin and colleagues  reported a fragment of mitochondrial DNA in Vitis with similarity to a sequence of 1452 nt present only in the carrot plastid genome.
In this study, we confirmed the presence of this sequence in the plastid genome of carrot, and discovered that it was also present as three non-contiguous sequences in the mitochondrial genome. PCR results and sequence analysis clearly document conservation of a large portion of this sequence (DcMP region 3, Figure 5B) across all mitochondrial genomes of the diverse Apiaceae species examined, and a “gain pattern” only in Daucus and Cuminum plastid genomes. This evidence strongly suggests a transfer of DNA into the plastid genome. In addition, the observation of the insertion of the DcMP region 4 in the plastid genome of wild Daucus species D. aureus D. crinitus and D. glochidiatus led us to hypothesize that in the ancestor of carrot the DcMP regions 1, 2, 3 and 4 were contiguous when the transfer event occurred. Assuming a single transfer event, the loss of the sequence contiguity in the mitochondrial genome could be due to a recombination of that genome, as is often observed in plant mitochondrial genomes. According to Downie et al. , Cuminum is the closest relative to carrot of the taxa examined here. The PCR patterns obtained across species for the plastid copy of DcMP concur with that classification and suggest that the presence of the DcMP region can be a distinctive feature of species from the Daucus clade.
The fact that the DcMP3 sequence has no matches in sequence data bases makes interpretation of its origin difficult. Limited similarity with a retrotransposon element (in the range of 50–60%; Additional file 7: Figure S5) of Vitis vinifera and other species suggests that a retrotransposable element might have been moved from the nuclear genome into the mitochondrial and then into the plastid genome, or directly and independently from the nuclear genome to plastid and mitochondrial genomes. The fragment of 125 nt at the 5′ end of DcMP3 is very conserved across several even more diverse angiosperm mitochondrial genomes, which suggests a mitochondrial origin, and also suggests that the transfer occurred from mitochondrial to plastid genome. However, considering the exceptional DNA up take system of plant mitochondrial genomes [4, 65], other sources of DNA donors via horizontal transfer or transfer from the nuclear genome cannot be excluded. Rare insertions of retrotransposon elements into the plastid genome have been observed in the alga Chlamydomonas reinhardtii, but based on our knowledge this is the first evidence of transfer of DNA from mitochondrial genome to plastid genome, and the potential presence of a possible retrotransposon in the plastid genome of a flowering plant.
Our results confirmed that unenriched whole genome sequencing is a practical approach for de novo assembly of higher plant mitochondrial genomes. Sequence analysis confirmed the dynamic organization of the mitochondrial genomes in carrot as in other mitochondrial genomes. This first report of a DNA insertion from the mitochondria in the angiosperm plastid genome suggests that new and unknown mechanisms in intercompartimental genome interactions may exist. The new approach developed here for assembly of the mitochondrial genome used here could be used more broadly to sequence and assemble mitochondrial genomes of diverse plant species. This would supplement the currently limited number of sequenced mitochondrial genomes, and may allow us to better understand intra- and inter-genomic DNA transfers and recombination, and this new instance of transfer of DNA to plastid genomes could possibly have occurred in other angiosperms.
Plant material and DNA extraction
A single male fertile plant of USDA carrot inbred line B493B  was used in this study (Additional file 10: Table S3). Leaf tissue was lyophilized, and extracted with DNeasy Plant Maxi Kit (Qiagen). The extracted DNA was analyzed for potential degradation by gel electrophoresis, and DNA concentration was quantified using Pico Green (Invitrogen, Pisley, UK).
DNA samples used for PCR verification of DNA transfer (Coriandrum sativum Petroselium crispum cv “Hamburg”, Cuminum cyminum Apium graveolens ssp. repaceum cv ‘Brilliant’, Anethum graveolens Foeniculum vulgare Daucus carota ssp. sativus Daucus muricatus Daucus glochidiatus Daucus crinitus Daucus capillifolius Daucus aureus Daucus sahariensis Daucus pusillus, and Eryngium planum) (Additional file 10) were extracted as described by Murray and Thompson  and quantified using a NanoDrop spectrophotometer (NanoDrop Technologies).
Sequencing, assembly and finishing
454 sequencing was performed with a GS-FLX platform (Roche, CT, USA), and Illumina sequencing was performed with the HiSeq 2000 platform (Illumina, San Diego, CA). In both cases, the sequencing was done at the University of Wisconsin, Biotechnology Center (University of Wisconsin, Madison, USA). Shotgun libraries were prepared and sequenced according to the manufacturers’ instructions. For 454 sequences sets of 813,770, 814,668, 771,864, 704,918 and 692,688 shotgun reads corresponding to an estimated nuclear genome coverage of 0.6×/set were used for initial assembly and 570,590 3 kb paired-end reads were used for connections verification. In addition, 50,598,879 Illumina reads were used to correct homopolymer ambiguity.
Each set of 454 sequences was independently assembled using gsAssembler v.2.6 (Newbler) (454 Life Sciences Corp, CT, USA). Parameter settings are listed in additional file 11: Table S4. In order to identify contigs of plastid and mitochondrial origin, assembled sequences were aligned against the carrot plastid genome [GenBank: DQ898156] and tobacco mitochondrial genome [GenBank: NC006581] using MUMmer 3.22 ; http://mummer.sourceforge.net/. Contigs with similarity to atpA for plastid, or atp1 for mitochondrion, were used as starting points for a de novo assembly of contigs. In order to visualize the contig connection information in the 454ContigGraph.txt file produced by gsAssembler, we created the program bb.454contignet (http://www.vcru.wisc.edu/simonlab/sdata/software/). Parameters are listed in additional file 11: Table S4. In order to generate a consensus sequence the five replicate assemblies, sequences were aligned using kalign  and a consensus generated using a custom Perl program.
To verify contig and repeat connections, sequences spanning those connections were amplified and sequenced using primer pair 1–12 listed in Additional file 11: Table S5. Fragments were amplified in a 20 μl PCR reaction: 13 μl water, 2 μl 10x DNA polymerase buffer, 0.8 μl dNTPs (2.5 mM each), 1 μl 5 μM of each primer, 0.2 μl Taq polymerase (MBI, Fermentas, USA) and 2 μl of genomic DNA (~50 ng). Amplification conditions were: initial denaturation at 94°C for 2 min., followed by 35 cycles of 94°C for 45 sec., Tm (°C) for 1.0 min., 72°C for 1.0 min. and 20 sec., and a final step at 72°C for 10.0 min. Electrophoresis was carried out for 2–3 hours at 100 V on 2% agarose TAE gels supplemented with 0.2 μg/ml of ethidium bromide. PCR products were then sequenced in a 5 μl reaction including, 1.75 μl of water, 1 μl of 5 μM primer, 0.75 μl 5× BigDye®3.1 sequencing buffer, 0.5 μl of BigDye®3.1 ready reaction mix and 1 μl of PCR product, previously diluted 1:10 with water. Amplification conditions were: 25 cycles of 96°C for 10 sec., and 58°C for 2 min., and a final step at 72°C for 5.0 min. The sequences were analyzed on an ABI 3730xl DNA Analyzer and analyzed using Sequencher software version 4.8 (GeneCodes Corporation, Ann Arbor, MI).
Sequences similar to plastid and mitochondrial genomes longer than 358 nt (which corresponds to the average read length) were amplified using primer pairs 13–18 reported in additional file 12: Table S5. PCR assay and sequence analysis were carried out as described above.
In addition, the consensus sequence was compared using blastn to a local database containing a set of 3 kb 454 paired-end reads and results were filtered and visualized with Circos (http://circos.ca/).
To correct for homopolymer length errors and evaluate genome coverage, both Illumina and 454 shotgun reads were mapped to the mitochondrial consensus sequence using GNUMAP v.3.0.0 , http://dna.cs.byu.edu/gnumap/. Homopolymer regions of ≥5nt were corrected by a custom Perl program when a majority of Illumina reads indicated a different length, in most cases of correction this became 1 nt longer.
Southern blot hybridization
The DNA was digested in separate tubes with the Fermentas enzymes XhoI, Eam1105I (AhdI) and KspAI (HpaI). Each digestion was performed overnight in 37°C in 40 μl containing approx. 1 μg of DNA. After restriction the samples were supplemented with 8 μl of the 6X Loading Dye Solution (Fermentas) and run in a 0.7% agarose gel in TAE buffer, during the first hour at 0.9 V/cm and then for the next 15 h at 0.5 V/cm. After electrophoresis the gel was subjected to depurination, denaturation, neutralization and capillary blotting using procedures described in Roche DIG Application Manual. The probes were synthesized using long PCR carried out in 25 μl containing: 2.5 μl 10x Long PCR Buffer with MgCl2, 2.5 μl dNTPs (2.5 mM each) , 1 μl 1 μM of each primer, 0.2 μl of Long PCR Enzyme Mix (MBI, Fermentas, USA), and 2 μl (70 μM) of alkali-labile DIG-11-dUTP (Roche, Applied Science) and 2 μl of genomic DNA (~4 ng). Probes were synthesized using primer pairs 20–22 listed in additional file 12: Table S5 with the following PCR conditions: initial denaturation at 94°C for 2 min.; 10 cycles of 94°C for 20 sec., 57°C for 30 sec., 68°C for 15 min.; 25 cycles of 94°C for 20 sec., 57°C for 30 sec., 68°C for 15 min. with an automatic 10 sec. extension in each consecutive cycle and finally 10 min. in 68°C. Three μl of the labeling reaction were examined in the standard agarose gel against the unlabelled control (the reaction without DIG-11-dUTP). The remaining portion of the labeling reaction was added to 16 ml of the standard hybridization buffer, the resulting solution was incubated in a water bath at 95°C for 10 min. (probe denaturation) and afterwards immediately applied to the membrane of ca. 400 cm2.
A preliminary annotation of proteins, rRNA, and tRNA was carried out using BLAST with a local database of nucleotide and protein sequences of all published mitochondrial genomes. Consistency of ORFs was then tested using Open Reading Frame Finder http://www.ncbi.nlm.nih.gov/gorf/gorf.html. The tRNA genes were detected with tRNAscan-SE , http://lowelab.ucsc.edu/tRNAscan-SE/. Any ORFs longer than 300 nt and not overlapping conserved genes were searched with the NCBI nucleotide and protein databases and annotated on the mitochondrial sequence. Sequence of sdh3 sdh4 rpl2 rps10 and rps19 from the Nicotiana tabacum mitochondrial genome [GenBank: NC_006581] and rps2 rps14 from the Vitis vinifera mitochondrial genome [GenBank: NC_012119] were used to investigate the presence of those genes in the nuclear genome of carrot. In order to annotate sequences common to plastid and mitochondrial genomes, known conserved mitochondrial genes were first masked in our mitochondrial consensus sequence, which was then queried against a local database containing sequences of all published plastid genomes. Sequences with minimum length of 40 nt and 90% identity not interrupted by masked sequences were added to our annotation.
Regions repeated within the mitochondrial genome were detected with blastn with the following parameters: minimum length 40 nt; percent identity 90.
The complete sequence of the annotated carrot mitochondrial genome was deposited in the NCBI organelle genome database with accession number JQ248574.
Verification of DNA insertion into plastid genome
Mitochondrial and plastid DNA insertions were amplified using primer pairs 15, 16 and 19 listed in additional file 12: Table S5. PCR assay and sequence analysis were carried out as described above. Sequences were then searched against the NCBI nucleotide and protein database using default parameters.
This work was supported by the Agricultural Research Service, United States Department of Agriculture, vegetable seed companies and production industry.
- Kubo T, Newton KJ: Angiosperm mitochondrial genomes and mutations. Mitochondrion. 2008, 8: 5-14. 10.1016/j.mito.2007.10.006.PubMedView ArticleGoogle Scholar
- Kubo T, Mikami T: Organization and variation of angiosperm mitochondrial genome. Physiol Plantarum. 2007, 129: 6-13. 10.1111/j.1399-3054.2006.00768.x.View ArticleGoogle Scholar
- Richardson AO, Palmer JD: Horizontal gene transfer in plants. J Exp Bot. 2007, 58: 1-9.PubMedView ArticleGoogle Scholar
- Archibald J, Richards T: Gene transfer: anything goes in plant mitochondria. BMC Biol. 2010, 8: 147-10.1186/1741-7007-8-147.PubMedView ArticleGoogle Scholar
- Goremykin VV, Salamini F, Velasco R, Viola R: Mitochondrial DNA of Vitis vinifera and the issue of rampant horizontal gene transfer. Mol Biol Evol. 2009, 26: 99-110.PubMedView ArticleGoogle Scholar
- Mower J, Stefanovic S, Hao W, Gummow J, Jain K, Ahmed D, Palmer J: Horizontal acquisition of multiple mitochondrial genes from a parasitic plant followed by gene conversion with host mitochondrial genes. BMC Biol. 2010, 8 (1): 150-10.1186/1741-7007-8-150.PubMedView ArticleGoogle Scholar
- Ogihara Y, Yamazaki Y, Murai K, Kanno A, Terachi T, Shiina T, Miyashita N, Nasuda S, Nakamura C, Mori N, Takumi S, Murata M, Futo S, Tsunewaki K: Structural dynamics of cereal mitochondrial genomes as revealed by complete nucleotide sequencing of the wheat mitochondrial genome. Nucleic Acids Res. 2005, 33: 6235-6250. 10.1093/nar/gki925.PubMedView ArticleGoogle Scholar
- Chang S, Yang T, Du T, Huang Y, Chen J, Yan J, He J, Guan R: Mitochondrial genome sequencing helps show the evolutionary mechanism of mitochondrial genome formation in Brassica. BMC Genomics. 2011, 12: 497-10.1186/1471-2164-12-497.PubMedView ArticleGoogle Scholar
- Allen JO, Fauron CM, Minx P, Roark L, Oddiraju S, Lin GN, Meyer L, Sun H, Kim K, Wang C, Du F, Xu D, Gibson M, Cifrese J, Clifton SW, Newton KJ: Comparisons among two fertile and three male-sterile mitochondrial genomes of maize. Genetics. 2007, 177: 1173-1192. 10.1534/genetics.107.073312.PubMedView ArticleGoogle Scholar
- Alverson AJ, Wei X, Rice DW, Stern DB, Barry K, Palmer JD: Insights into the Evolution of Mitochondrial Genome Size from Complete Sequences of Citrullus lanatus and Cucurbita pepo (Cucurbitaceae). Mol Biol Evol. 2010, 27: 1436-1448. 10.1093/molbev/msq029.PubMedView ArticleGoogle Scholar
- Woodson JD, Chory J: Coordination of gene expression between organellar and nuclear genomes. Nat Rev Genet. 2008, 9: 383-395. 10.1038/nrg2348.PubMedView ArticleGoogle Scholar
- Wolf JB: Cytonuclear interactions can favor the evolution of genomic imprinting. Evolution. 2009, 63: 1364-1371. 10.1111/j.1558-5646.2009.00632.x.PubMedView ArticleGoogle Scholar
- Kleine T, Maier UG, Leister D: DNA transfer from organelles to the nucleus: the idiosyncratic genetics of endosymbiosis. Annu Rev Plant Biol. 2009, 60: 115-138. 10.1146/annurev.arplant.043008.092119.PubMedView ArticleGoogle Scholar
- Smith DR: Extending the limited transfer window hypothesis to inter-organelle DNA migration. Genome Biol Evol. 2011, 3: 743-748. 10.1093/gbe/evr068.PubMedView ArticleGoogle Scholar
- Jex AR, Hall RS, Littlewood DT, Gasser RB: An integrated pipeline for next-generation sequencing and annotation of mitochondrial genomes. Nucleic Acids Res. 2010, 38: 522-533. 10.1093/nar/gkp883.PubMedView ArticleGoogle Scholar
- Webb KM, Rosenthal BM: Next-generation sequencing of the Trichinella murrelli mitochondrial genome allows comprehensive comparison of its divergence from the principal agent of human trichinellosis, Trichinella spiralis. Infect Genet Evol. 2011, 11: 116-123. 10.1016/j.meegid.2010.10.001.PubMedView ArticleGoogle Scholar
- Gunnarsdóttir ED, Li M, Bauchet M, Finstermeier K, Stoneking M: High-throughput sequencing of complete human mtDNA genomes from the Philippines. Genome Res. 2011, 21: 1-11. 10.1101/gr.107615.110.PubMedView ArticleGoogle Scholar
- Cui Z, Liu Y, Li CP, You F, Chu KH: The complete mitochondrial genome of the large yellow croaker, Larimichthys crocea (Perciformes, Sciaenidae): Unusual features of its control region and the phylogenetic position of the Sciaenidae. Gene. 2009, 432: 33-43. 10.1016/j.gene.2008.11.024.PubMedView ArticleGoogle Scholar
- Yang M, Zhang X, Liu G, Yin Y, Chen K, Yun Q, Zhao D, Al-Mssallem IS, Yu J: The complete chloroplast genome sequence of date palm (Phoenix dactylifera L.). PLoS One. 2010, 5: e12762-10.1371/journal.pone.0012762.PubMedView ArticleGoogle Scholar
- Tangphatsornruang S, Sangsrakru D, Chanprasert J, Uthaipaisanwong P, Yoocha T, Jomchai N, Tragoonrung S: The chloroplast genome sequence of mungbean (Vigna radiata) determined by high-throughput pyrosequencing: structural organization and phylogenetic relationships. DNA Res. 2010, 17: 11-22. 10.1093/dnares/dsp025.PubMedView ArticleGoogle Scholar
- Nock CJ, Waters DL, Edwards MA, Bowen SG, Rice N, Cordeiro GM, Henry RJ: Chloroplast genome sequences from total DNA for plant identification. Plant Biotechnol J. 2011, 9: 328-333. 10.1111/j.1467-7652.2010.00558.x.PubMedView ArticleGoogle Scholar
- Straub SC, Fishbein M, Livshultz T, Foster Z, Parks M, Weitemier K, Cronn RC, Liston A: Building a model: developing genomic resources for common milkweed (Asclepias syriaca) with low coverage genome sequencing. BMC Genomics. 2011, 12: 211-10.1186/1471-2164-12-211.PubMedView ArticleGoogle Scholar
- Lassmann T, Sonnhammer EL: Kalign–an accurate and fast multiple sequence alignment algorithm. BMC Bioinforma. 2005, 6: 298-10.1186/1471-2105-6-298.View ArticleGoogle Scholar
- Ruhlman T, Lee SB, Jansen RK, Hostetler JB, Tallon LJ, Town CD, Daniell H: Complete plastid genome sequence of Daucus carota: implications for biotechnology and phylogeny of angiosperms. BMC Genomics. 2006, 7: 222-10.1186/1471-2164-7-222.PubMedView ArticleGoogle Scholar
- Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, Lohman KL, Lu H, Makhijani VB, McDade KE, McKenna MP, Myers EW, Nickerson E, Nobile JR, Plant R, Puc BP, Ronan MT, Roth GT, Sarkis GJ, Simons JF, Simpson JW, Srinivasan M, Tartaro KR, Tomasz A, Vogt KA, Volkmer GA, Wang SH, Wang Y, Weiner MP, Yu P, Begley RF, Rothberg JM: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437: 376-380.PubMedGoogle Scholar
- Clement NL, Snell Q, Clement MJ, Hollenhorst PC, Purwar J, Graves BJ, Cairns BR, Johnson WE: The GNUMAP algorithm: unbiased probabilistic mapping of oligonucleotides from next-generation sequencing. Bioinformatics. 2010, 26: 38-45. 10.1093/bioinformatics/btp614.PubMedView ArticleGoogle Scholar
- Rodriguez-Moreno L, Gonzalez VM, Benjak A, Marti MC, Puigdomenech P, Aranda MA, Garcia-Mas J: Determination of the melon chloroplast and mitochondrial genome sequences reveals that the largest reported mitochondrial genome in plants contains a significant amount of DNA having a nuclear origin. BMC Genomics. 2011, 12: 424-10.1186/1471-2164-12-424.PubMedView ArticleGoogle Scholar
- Alverson AJ, Zhuo S, Rice DW, Sloan DB, Palmer JD: The mitochondrial genome of the legume Vigna radiata and the analysis of recombination across short mitochondrial repeats. PLoS One. 2011, 6: e16404-10.1371/journal.pone.0016404.PubMedView ArticleGoogle Scholar
- Bach I, Olesen A, Simon P: PCR-based markers to differentiate the mitochondrial genomes of petaloid and male fertile carrot (Daucus carota L.). Euphytica. 2002, 127: 353-365. 10.1023/A:1020314802236.View ArticleGoogle Scholar
- Szklarczyk M, Oczkowski M, Augustyniak H, Börner T, Linke B, Michalik B: Organisation and expression of mitochondrial atp9 genes from CMS and fertile carrots. Theor Appl Genet. 2000, 100 (2): 263-270. 10.1007/s001220050035.View ArticleGoogle Scholar
- Macas J, Neumann P, Navratilova A: Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula. BMC Genomics. 2007, 8: 427-10.1186/1471-2164-8-427.PubMedView ArticleGoogle Scholar
- Tangphatsornruang S, Somta P, Uthaipaisanwong P, Chanprasert J, Sangsrakru D, Seehalak W, Sommanas W, Tragoonrung S, Srinives P: Characterization of microsatellites and gene contents from genome shotgun sequences of mungbean (Vigna radiata (L.) Wilczek). BMC Plant Biol. 2009, 9: 137-10.1186/1471-2229-9-137.PubMedView ArticleGoogle Scholar
- Zhu H, Senalik D, McCown BH, Zeldin EL, Speers J, Hyman J, Bassil N, Hummer K, Simon PW, Zalapa JE: Mining and validation of pyrosequenced simple sequence repeats (SSRs) from American cranberry (Vaccinium macrocarponAit.). Theor Appl. Genet, in press.
- Lutz KA, Wang W, Zdepski A, Michael TP: Isolation and analysis of high quality nuclear DNA with reduced organellar DNA for plant genome sequencing and resequencing. BMC Biotechnol. 2011, 11: 54-10.1186/1472-6750-11-54.PubMedView ArticleGoogle Scholar
- Aird D, Ross MG, Chen WS, Danielsson M, Fennell T, Russ C, Jaffe DB, Nusbaum C, Gnirke A: Analyzing and minimizing PCR amplification bias in Illumina sequencing libraries. Genome Biol. 2011, 12: R18-10.1186/gb-2011-12-2-r18.PubMedView ArticleGoogle Scholar
- Ossowski S, Schneeberger K, Clark RM, Lanz C, Warthmann N, Weigel D: Sequencing of natural strains of Arabidopsis thaliana with short reads. Genome Res. 2008, 18: 2024-2033. 10.1101/gr.080200.108.PubMedView ArticleGoogle Scholar
- Nielsen CB, Cantor M, Dubchak I, Gordon D, Wang T: Visualizing genomes: techniques and challenges. Nat Methods. 2010, 7 (Suppl 3): S5-S15.PubMedView ArticleGoogle Scholar
- Cahill MJ, Koser CU, Ross NE, Archer JA: Read length and repeat resolution: exploring prokaryote genomes using next-generation sequencing technologies. PLoS One. 2010, 5: e11518-10.1371/journal.pone.0011518.PubMedView ArticleGoogle Scholar
- Zhang T, Zhang X, Hu S, Jun Y: An efficient procedure for plant organellar genome assembly, based on whole genome data from the 454 GS FLX sequencing platform. Plant methods. 2011, 7: 38-10.1186/1746-4811-7-38.PubMedView ArticleGoogle Scholar
- Nielsen CB, Jackman SD, Birol I, Jones SJ: ABySS-Explorer: visualizing genome sequence assemblies. IEEE Trans Vis Comput Graph. 2009, 15: 881-888.PubMedView ArticleGoogle Scholar
- Kircher M, Kelso J: High-throughput DNA sequencing concepts and limitations. Bioessays. 2010, 32: 524-536. 10.1002/bies.200900181.PubMedView ArticleGoogle Scholar
- Aury JM, Cruaud C, Barbe V, Rogier O, Mangenot S, Samson G, Poulain J, Anthouard V, Scarpelli C, Artiguenave F, Wincker P: High quality draft sequences for prokaryotic genomes using a mix of new sequencing technologies. BMC Genomics. 2008, 9: 603-10.1186/1471-2164-9-603.PubMedView ArticleGoogle Scholar
- Robison MM, Wolyn DJ: Complex organization of the mitochondrial genome of petaloid CMS carrot. Mol Genet Genomics. 2002, 268: 232-239. 10.1007/s00438-002-0740-z.PubMedView ArticleGoogle Scholar
- Adams KL, Qiu YL, Stoutemyer M, Palmer JD: Punctuated evolution of mitochondrial gene content: high and variable rates of mitochondrial gene loss and transfer to the nucleus during angiosperm evolution. Proc Natl Acad Sci USA. 2002, 99: 9905-9912. 10.1073/pnas.042694899.PubMedView ArticleGoogle Scholar
- Figueroa P, Gómez I, Holuigue L, Araya A, Jordana X: Transfer of rps14 from the mitochondrion to the nucleus in maize implied integration within a gene encoding the iron sulphur subunit of succinate dehydrogenase and expression by alternative splicing. Plant J. 1999, 18: 601-609. 10.1046/j.1365-313x.1999.00485.x.PubMedView ArticleGoogle Scholar
- Adams KL, Daley DO, Qiu YL, Whelan J, Palmer JD: Repeated, recent and diverse transfers of a mitochondrial gene to the nucleus in flowering plants. Nature. 2000, 408: 354-357. 10.1038/35042567.PubMedView ArticleGoogle Scholar
- Adams KL, Rosenblueth M, Qiu Y, Palmer JD: Multiple losses and transfers to the nucleus of two mitochondrial succinate dehydrogenase genes during angiosperm evolution. Genetics. 2001, 158: 1289-1300.PubMedGoogle Scholar
- Adams KL, Daley DO, Whelan J, Palmer JD: Genes for two mitochondrial ribosomal proteins in flowering plants are derived from their chloroplast or cytosolic counterparts. Plant Cell. 2002, 14: 931-943. 10.1105/tpc.010483.PubMedView ArticleGoogle Scholar
- Liu S, Zhuang Y, Zhang P, Adams KL: Comparative analysis of structural diversity and sequence evolution in plant mitochondrial genes transferred to the nucleus. Mol Biol Evol. 2009, 26: 875-891. 10.1093/molbev/msp011.PubMedView ArticleGoogle Scholar
- Kubo N, Arimura S: Discovery of the rpl10 gene in diverse plant mitochondrial genomes and its probable replacement by the nuclear gene for chloroplast RPL10 in two lineages of angiosperms. DNA Res. 2010, 17: 1-9. 10.1093/dnares/dsp024.PubMedView ArticleGoogle Scholar
- Knoop V, Volkmar U, Hecht J, Grewe F: Mitochondrial genome evolution inthe plant lineage. In Plant Mitochondria. Volume 1. Edited by Kempken F.New York: Springer; 2011:3–29.View ArticleGoogle Scholar
- Marechal A, Brisson N: Recombination and the maintenance of plant organelle genome stability. New Phytol. 2010, 186: 299-317. 10.1111/j.1469-8137.2010.03195.x.PubMedView ArticleGoogle Scholar
- Alverson AJ, Rice DW, Dickinson S, Barry K, Palmer JD: Origins and recombination of the bacterial-sized multichromosomal mitochondrial genome of cucumber. Plant Cell. 2011, 23: 2499-2513. 10.1105/tpc.111.087189.PubMedView ArticleGoogle Scholar
- Sloan D, Alverson A, Storchova H, Palmer J, Taylor D: Extensive loss of translational genes in the structurally dynamic mitochondrial genome of the angiosperm Silene latifolia. BMC Evol Biol. 2010, 10: 274-10.1186/1471-2148-10-274.PubMedView ArticleGoogle Scholar
- Woloszynska M, Trojanowski D: Counting mtDNA molecules in Phaseolus vulgaris: sublimons are constantly produced by recombination via short repeats and undergo rigorous selection during substoichiometric shifting. Plant Mol Biol. 2009, 70: 511-521. 10.1007/s11103-009-9488-8.PubMedView ArticleGoogle Scholar
- Woloszynska M: Heteroplasmy and stoichiometric complexity of plant mitochondrial genomes though this be madness, yet there’s method in’t. J Exp Bot. 2010, 61: 657-671. 10.1093/jxb/erp361.PubMedView ArticleGoogle Scholar
- Davila J, Arrieta-Montiel M, Wamboldt Y, Cao J, Hagmann J, Shedge V, Xu Y, Weigel D, Mackenzie S: Double-strand break repair processes drive evolution of the mitochondrial genome in Arabidopsis. BMC Biol. 2011, 9: 64-10.1186/1741-7007-9-64.PubMedView ArticleGoogle Scholar
- Gray MW, Burger G, Lang BF: Mitochondrial Evolution. Science. 1999, 283 (5407): 1476-1481. 10.1126/science.283.5407.1476.PubMedView ArticleGoogle Scholar
- Timmis JN, Ayliffe MA, Huang CY, Martin W: Endosymbiotic gene transfer: organelle genomes forge eukaryotic chromosomes. Nat Rev Genet. 2004, 5: 123-135.PubMedView ArticleGoogle Scholar
- Archibald JM: The puzzle of plastid evolution. Curr Biol. 2009, 19: R81-R88. 10.1016/j.cub.2008.11.067.PubMedView ArticleGoogle Scholar
- Gantt JS, Baldauf SL, Calie PJ, Weeden NF, Palmer JD: Transfer of rpl22 to the nucleus greatly preceded its loss from the chloroplast and involved the gain of an intron. EMBO J. 1991, 10: 3073-3078.PubMedGoogle Scholar
- Millen RS, Olmstead RG, Adams KL, Palmer JD, Lao NT, Heggie L, Kavanagh TA, Hibberd JM, Gray JC, Morden CW, Calie PJ, Jermiin LS, Wolfe KH: Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus. Plant Cell. 2001, 13: 645-658.PubMedView ArticleGoogle Scholar
- Notsu YN, Masood SM, Nishikawa TN, Kubo NK, Akiduki GA, Nakazono MN, Hirai AH, Kadowaki KK: The complete sequence of the rice (Oryza sativa L.) mitochondrial genome: frequent DNA sequence acquisition and loss during the evolution of flowering plants. Mol Genet Genomics. 2002, 268: 434-445. 10.1007/s00438-002-0767-1.PubMedView ArticleGoogle Scholar
- Downie SR, Katz-Downie DS, Spalik K: A phylogeny of Apiaceae tribe Scandiceae: evidence from nuclear ribosomal DNA internal transcribed spacer sequences. Am J Bot. 2000, 87: 76-95. 10.2307/2656687.PubMedView ArticleGoogle Scholar
- Burger G, Gray MW, Franz Lang B: Mitochondrial genomes: anything goes. Trends Genet. 2003, 19: 709-716. 10.1016/j.tig.2003.10.012.PubMedView ArticleGoogle Scholar
- Fan W, Woelfle MA, Mosig G: Two copies of a DNA element, “Wendy”, in the chloroplast chromosome of Chlamydomonas reinhardtii between rearranged gene clusters. Plant Mol Biol. 1995, 29: 63-80. 10.1007/BF00019119.PubMedView ArticleGoogle Scholar
- Simon PW, Peterson PE: B493 and B9304, carrot inbreds for use in breeding, genetics, and tissue culture. Hortscience. 1990, 25: 815.Google Scholar
- Murray MG, Thompson WF: Rapid isolation of high molecular weight plant DNA. Nucleic Acids Res. 1980, 8: 4321-4326. 10.1093/nar/8.19.4321.PubMedView ArticleGoogle Scholar
- Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL: Versatile and open software for comparing large genomes. Genome Biol. 2004, 5 (2): R12-10.1186/gb-2004-5-2-r12.PubMedView ArticleGoogle Scholar
- Schattner P, Brooks AN, Lowe TM: The tRNAscan-SE, snoscan and snoGPS web servers for the detection of tRNAs and snoRNAs. Nucleic Acids Res. 2005, 33: W686-W689. 10.1093/nar/gki366.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.