The complete chloroplast genome sequence of Citrus sinensis(L.) Osbeck var 'Ridge Pineapple': organization and phylogenetic relationships to other angiosperms
© Bausher et al; licensee BioMed Central Ltd. 2006
Received: 09 April 2006
Accepted: 30 September 2006
Published: 30 September 2006
The production of Citrus, the largest fruit crop of international economic value, has recently been imperiled due to the introduction of the bacterial disease Citrus canker. No significant improvements have been made to combat this disease by plant breeding and nuclear transgenic approaches. Chloroplast genetic engineering has a number of advantages over nuclear transformation; it not only increases transgene expression but also facilitates transgene containment, which is one of the major impediments for development of transgenic trees. We have sequenced the Citrus chloroplast genome to facilitate genetic improvement of this crop and to assess phylogenetic relationships among major lineages of angiosperms.
The complete chloroplast genome sequence of Citrus sinensis is 160,129 bp in length, and contains 133 genes (89 protein-coding, 4 rRNAs and 30 distinct tRNAs). Genome organization is very similar to the inferred ancestral angiosperm chloroplast genome. However, in Citrus the infA gene is absent. The inverted repeat region has expanded to duplicate rps19 and the first 84 amino acids of rpl22. The rpl22 gene in the IRb region has a nonsense mutation resulting in 9 stop codons. This was confirmed by PCR amplification and sequencing using primers that flank the IR/LSC boundaries. Repeat analysis identified 29 direct and inverted repeats 30 bp or longer with a sequence identity ≥ 90%. Comparison of protein-coding sequences with expressed sequence tags revealed six putative RNA edits, five of which resulted in non-synonymous modifications in petL, psbH, ycf2 and ndhA. Phylogenetic analyses using maximum parsimony (MP) and maximum likelihood (ML) methods of a dataset composed of 61 protein-coding genes for 30 taxa provide strong support for the monophyly of several major clades of angiosperms, including monocots, eudicots, rosids and asterids. The MP and ML trees are incongruent in three areas: the position of Amborella and Nymphaeales, relationship of the magnoliid genus Calycanthus, and the monophyly of the eurosid I clade. Both MP and ML trees provide strong support for the monophyly of eurosids II and for the placement of Citrus (Sapindales) sister to a clade including the Malvales/Brassicales.
This is the first complete chloroplast genome sequence for a member of the Rutaceae and Sapindales. Expansion of the inverted repeat region to include rps19 and part of rpl22 and presence of two truncated copies of rpl22 is unusual among sequenced chloroplast genomes. Availability of a complete Citrus chloroplast genome sequence provides valuable information on intergenic spacer regions and endogenous regulatory sequences for chloroplast genetic engineering. Phylogenetic analyses resolve relationships among several major clades of angiosperms and provide strong support for the monophyly of the eurosid II clade and the position of the Sapindales sister to the Brassicales/Malvales.
Chloroplasts are dynamic organelles of prokaryotic origin within the plant cell that house the photosynthetic apparatus. In addition to photosynthesis, other important metabolic activities take place within chloroplasts including the production of starch, certain amino acids and lipids, some of the colorful pigments in flowers, vitamins and several key aspects of sulfur and nitrogen metabolism. Chloroplasts possess their own genome and a full complement of transcriptional and translation machinery to express their genetic information. In particular, chloroplast gene expression machinery is a distinctive assembly of prokaryotic, eukaryotic, and phage-like components–likely the result of acquisition of a great number of regulatory proteins during evolution. The presence of nucleic acids within chloroplasts was established in 1963 . This subsequently led to the selection of cpDNA as one of the first candidates for complete genome sequencing . Studies of the organization and evolution of chloroplast genomes have been rapidly expanding due to the availability of the number of completely sequenced genomes published in the past decade. Fifty-four completed genomes are available from various land plant lineages, with the best representation (36 species) from flowering plants. Comparative studies indicate that chloroplast genomes of land plants are highly conserved in both gene order and gene content . Moreover, the substitution rate in cpDNA is much lower than in nuclear DNA and significantly reduced in the inverted repeat regions as compared to the single copy regions .
Chloroplast bioengineering offers a number of advantages over nuclear transformation including high levels of transgene expression and gene containment . In addition, chloroplast genetic engineering has also become a powerful tool for basic research in biogenesis and function of this organelle. This approach has helped unveil a wealth of information about cpDNA replication origins, introns, maturases, translation elements, proteolysis, import of proteins and several other processes . However, this technology is readily feasible only in tobacco. Lack of complete chloroplast genome sequence is still one of the major limitations preventing the expansion of chloroplast bioengineering to other useful crops. Transgene integration into the chloroplast genome occurs exclusively by homologous recombination of chloroplast DNA flanking sequences. Therefore, chloroplast genome sequence analysis is crucial for identification of spacer regions to integrate transgenes at optimal positions as well as the identification of endogenous regulatory sequences that support optimal expression of transgenes . Prior to 2004 only seven published crop chloroplast genomes were available and this number has increased to 23 during the past two years . Furthermore, the availability of genome sequence information has also made it possible to study evolutionary relationships among chloroplast and nuclear genomes .
Citrus is the largest fruit crop of international economic value because of its many uses including its value as a nutritive food source and for its valuable essential oils utilized by the food, pharmaceutical, and cosmetic industries. The valuable Citrus industry in Florida (USA) has recently been put in peril because of the accidental introduction of the exotic disease Citrus canker. This bacterial disease, which can infect all cultivars of Citrus, is the result of infection by Xanthomonas pv citri . Elimination of this disease by eradication has resulted in a cost of $1.2 billion (US) and the destruction of 7 million commercial and 5 million nursery and residential trees (pers. comm. T.R. Gottwald). Attempts at resistance breeding in Citrus are impeded by many biological characteristics, such as juvenility, incompatibility, heterozygosity, a narrow genetic basis, and nucellar embryony. In this context, genetic engineering of the chloroplast genome with non-host resistance traits would be an effective alternative for transferring desirable traits because of its many advantages over nuclear transformation . However, for Citrus, genetic improvement through chloroplast transformation has been limited due to the lack of available chloroplast genome sequence, not only in the genus Citrus but also in the entire family Rutaceae.
In this article, we report on the complete sequence of the chloroplast genome of Citrus sinensis (L.) Osbeck var. 'Ridge Pineapple', which is the first published whole genome sequence of a member of the family Rutaceae and order Sapindales. We describe the organization of this genome and we present a phylogenetic analysis of Citrus and 27 other angiosperm chloroplast genomes based on 61 shared protein-coding genes. The phylogenetic comparisons enable an examination of relationships among several major clades of angiosperms.
Size, gene content, order and organization of the Citruschloroplast genome
Repeated sequences in the Citrus sinensis chloroplast genome.
IGS (trnS-GCU, trnS-GGA)
IGS (rpl33 – rps18)
IGS (rrn5 – rrn4.5)
IGS (petN – psbM)
IGS (trnG-GCC – trnR-UCU, rpl32 – trnL-UAG)
IGS (trnsG-GCC – trnR-UCU, rpl32 – trnL-UAG)
IGS (trnF-GAA – ndhJ, rps12_3end – trnV-GAC)
IGS (trnF-GAA – ndhJ, trnV-GAC – rps12_3end)
IGS (trnG-GCC – trnR-UCU)
IGS (atpF – atpH)
IGS (psbZ – trnG-GCC)
IGS (trnS-GCU – trnG-GCC, psbM – trnD-GUC)
IGS (rrn4.5 – rrn5)
IGS (rrn4.5 – rrn5)
IGS (rrn4.5 – rrn5)
IGS (rrn5 – rrn4.5)
IGS (rpoB – trnC-GCA)
IGS (rpl32 – trnL-UAG)
Intron (ndhA), IGS (trnV-GAC – rps12_3end)
IGS (rps12_3end – trnV-GAC), Intron (ndhA – ndhA)
IGS (ycf4 – cemA)
IGS (psbT – psbN)
IGS (rps12_3end – trnV-GAC), Intron (ndhA – ndhA)
Intron (ndhA – ndhA), IGS (trnV-GAC – rps12_3end)
IGS (petN – psbM)
IGS (trnG-GCC – trnR-UCU)
IGS (trnS-GCU – trnG-GCC, psbM – trnD-GUC)
Variation between coding sequences and cDNAs
Comparison of the sweet orange chloroplast genome with EST sequences obtained from GenBank and in-house database of Citrus sinensis source.
Gene size (bp)
Sequence analyzed a
Number of variable sites
Variation type b
Amino Acid change
Amino acid characteristics d
Implications for integration of transgenes
Chloroplast genetic engineering offers several advantages, including a high-level of transgene expression , multi-gene engineering in a single transformation event , transgene containment via maternal inheritance  or cytoplasmic male sterility , lack of gene silencing [9, 13], position effect due to site specific transgene integration , and lack of pleiotropic effects due to sub-cellular compartmentalization of transgene products [15–17]. Apart from expressing therapeutic agents , biopolymers  or transgenes to confer valuable agronomic traits, including herbicide resistance , disease resistance , insect resistance , drought tolerance , salt tolerance , and phytoremediation , chloroplast genetic engineering has been used to study chloroplast biogenesis and function, revealing the mechanisms of DNA replication origins, intron maturases, translation elements and proteolysis, import of proteins, and several other processes . Despite the potential of chloroplast genetic engineering, this technology has only recently been extended to the major crops, including soybean , carrot , lettuce , and cotton .
The availability of complete sequences of chloroplast genomes enhances their use for genetic engineering. In chloroplast transformation, finding appropriate intergenic spacer regions is very important for efficient integration of transgenes. In tomato and potato, researchers have used trnfM-trnG, rbcL-accD, trnV-3'-rps12, and 16S rRNA-orf 70B intergenic spacer regions of tobacco to integrate transgenes [29–31]. Unfortunately, none of these regions have 100% sequence identity . For example, the intergenic spacer region between rbcL and accD of potato and tobacco shows only 94% sequence identity. Subsequently, potato chloroplast transformants are generated at 10–30 times lower frequencies than tobacco . Similarly, the trnfM and trnG intergenic spacer region used for tomato chloroplast transformation has only 82% sequence identity with tobacco, resulting in inefficient transgene integration. There are major deletions in the tomato chloroplast genome in this intergenic spacer region when compared to tobacco, which was used for transformation . Therefore, the development of species-specific vectors for transgene integration would enable the use of any of the intergenic spacer regions within the respective chloroplast genomes . Moreover, genome organization is different among some species. For instance the rbcL and accD genes are adjacent in tobacco and most other angiosperm chloroplast genomes, including Citrus. However, they are not adjacent in the soybean chloroplast genome because an inversion has altered gene order . These examples emphasize the importance of choosing appropriate intergenic spacer regions for chloroplast transformation.
Gene order of the Citrus genome is identical to the published genome sequences of the Solanaceae , which have the inferred ancestral angiosperm genome organization . The rps19 gene and the first 84 amino acids of rpl22, which generally are single copy in the LSC on the IRb side, have been duplicated in Citrus. Thus, there is a complete, second copy of rps19 and a truncated copy of rpl22 adjacent to trnH. This duplication is likely due to an expansion of IRb at the LSC junction, a common process in chloroplast genomes . The gene content of Citrus is also very similar to most other angiosperm chloroplast genomes. However, infA, a gene coding for a translation initiation factor in other plant species, is absent in the Citrus genome, and rpl22 is apparently not functional due to a frame shift mutation. Millen et al.  demonstrated at least 24 independent losses of infA in angiosperms, and in four lineages this gene has been shown to be transferred to the nucleus. Three of these losses are evident in our phylogeny based on cpDNA sequences (indicated by bars in Figs. 2, 3). Among the rosid genomes sequenced the infA loss has occurred only once and this change supports the basal split between Vitis and the rest of the rosids (Figs. 2, 3). The rpl22 gene in the IRb region has a nonsense mutation resulting in 9 stop codons indicating that this gene is not functional. This was confirmed by PCR amplification and sequencing using primers that flank the IR/LSC boundaries. The rpl22 gene has been reported to be missing in legume chloroplast genomes and the import of nuclear encoded protein has been demonstrated [32, 35]. Our group recently reported that rpl22 was also missing in the cotton chloroplast genome  but it turns out that this was an annotation error. The lack of a functional copy of rpl22 in Citrus should be investigated further, including an expanded sampling of members of the Rutaceae and Sapindales.
Repeat analysis identified 29 direct and inverted repeats 30 bp or longer with a sequence identity ≥ 90% in the Citrus chloroplast genome with the longest repeat, other than the IR, 53 bp in length (Table 1). The presence of dispersed repeats in chloroplast genomes, especially in intergenic spacer regions, has been reported in a number of angiosperm lineages, including other rosids .
Phylogenies based on 61 protein-coding genes (Figs. 2, 3) generally agree with several recent studies based on multiple genes or complete chloroplast genomes [37–39]. Areas of congruence that are strongly supported include the monophyly of monocots and their sister relationship to eudicots, monophyly of rosids and asterids, and the sister relationship between Caryophyllales (represented by Spinacia) and asterids.
Our chloroplast genome trees (Figs. 2, 3) indicate that the earliest diverging angiosperm lineage is either Amborella or Amborella + Nymphaeales. This incongruence between MP and ML trees was noted previously [37, 39]. This same incongruence was observed in a multigene phylogeny that includes nine genes from the chloroplast, mitochondrial and nuclear genomes . In this case, phylogenies for chloroplast genes supported the Amborella basal hypothesis, whereas mitochondrial genes supported Amborella + Nymphaeales as the earliest angiosperm lineage.
A second incongruence between MP and ML trees concerns the position of the magnoliid Calycanthus, although bootstrap support for the different relationships is weak (Figs. 2, 3). The MP tree places Calycanthus sister to eudicots, whereas the ML tree positions this taxon sister to a clade that includes both monocots and eudicots. This same incongruence was observed in previous phylogenetic analyses based on the 61 protein-coding chloroplast genes [37, 39]. The position of magnoliids continues to be controversial. Several molecular phylogenies have suggested different sets of relationships among magnoliids, monocots, and eudicots. Phylogenies based on phytochrome  and 17 chloroplast  genes placed magnoliids sister to monocots + eudicots but bootstrap support was weak. Several studies supported monocots as the sister group of magnoliids + eudicots [43–45] but bootstrap support was again weak. Both matK  and three gene  phylogenies suggested that eudicots are sister to magnoliids + monocots. Finally, the nine-gene phylogeny of Qiu et al.  recovered all three of these sets of relationships depending on the phylogenetic methods (MP or ML) and the genes used but support was very weak in each case. The different resolutions of relationships of magnoliids are greatly affected by taxon sampling and phylogenetic methodology. The affects of both of these phenomena have been discussed in several recent papers on the utility of whole chloroplast genomes for phylogenetic reconstruction of angiosperms [37, 39, 47–52]. Clearly, additional complete chloroplast genome sequences are needed to resolve the relationships among magnoliids, monocots, and eudicots.
A third incongruence between the MP and ML trees concerns the monophyly of the eurosid I clade (Figs. 2, 3). The MP tree (Fig. 2) strongly supports the monophyly of eurosid I (100% bootstrap), whereas in the ML tree the eurosid I clade in not monophyletic because Cucumis is strongly sister to the Myrtales instead of the Fabales. This same incongruence was detected in Jansen et al.  and was attributed to limited taxon sampling and model misspecification in ML analyses, two phenomena that are known to have adverse effects on phylogenetic reconstruction [53–57]. Expanded taxon sampling of rosids is needed to critically evaluate the monophyly of the eurosid I clade, especially since there is only moderate support for monophyly of eurosid I in previous phylogenies based on a single or few genes [reviewed in 58].
Both MP and ML trees are congruent with regard to the phylogenetic placement of Citrus. The genus is positioned as a member of the eurosid II clade, which has very strong bootstrap support in both MP (98%) and ML (100%) trees (Fig. 2). The eurosid II clade, which currently includes the four groups Brassicales, Malvales, Sapindales, and Tapisciaceae, has received strong support in previous DNA sequence phylogenies based on one to three genes , although relationships among these groups remain unresolved. Previous phylogenies based on whole chloroplast genomes [36, 37, 39, 59] have included only one or two groups (Arabidopsis, Brassicales and/or Gossypium, Malvales). The addition of Citrus from the Sapindales expands the sampling to three of four currently recognized groups of eurosids II. Both MP and ML trees (Figs. 2, 3) provide strong support (98 – 100% bootstrap) for a sister relationship between the Brassicales and Malvales. This same relationship was weakly supported based on phylogenies using one or two chloroplast genes [46, 60]. In contrast, the three gene phylogeny of Soltis et al.  weakly supported a sister relationship between the Malvales and Sapindales. Although taxon sampling is still somewhat limited, our 61-gene phylogeny provides very strong support for a close relationship between the Brassicales and Malvales. Expanded taxon sampling of the eurosid II clade is needed to confirm these results.
Complete chloroplast genome sequences provide valuable information on spacer regions for integration of transgenes at optimal sites via homologous recombination, as well as endogenous regulatory sequences for optimal expression of transgenes and should help in extending this technology to other useful crops. Availability of complete chloroplast genome sequence should pave the way for genetic manipulation of Citrus and other members of the Rutaceae. Furthermore, the addition of the Citrus genome sequence to phylogenetic analyses provides strong support for the monophyly of the eurosid II clade, and the sister group relationship between the Sapindales and the Brassicales/Malvales clade.
Source of DNA
Citrus sinensis (L.) Osbeck var 'Ridge Pineapple' leaf tissue was chosen as the source plant material because it is being used in the US and international effort to sequence the Citrus genome. The lamellar tissue used was obtained from field-grown mature trees. Chloroplast DNA was isolated as described Jansen et al. . Chloroplast DNA was subjected to rolling circle amplification (RCA) using the Repli-g kit following the manufacturers instructions (Molecular Staging Inc, New Haven, CT.).
DNA sequencing and genome assembly
Purified RCA products were subjected to nebulization, followed by end repair and size-fractionated by agarose gel electrophoresis to obtain fragment lengths ranging from 2.0–3.5 kb. Repaired products were blunt-end cloned into pCR®-4Blunt-TOPO and then transformed into ElectroMax™ DH5alpha cells by electroporation (TOPO® shotgun cloning kit, Invitrogen, Carlsbad, CA). Transformed cells were selected on LB agar containing 100 μg/μL ampicillin and arrayed into 30 × 96-well microtitre plates. Sequencing reactions were carried out in both the forward and reverse direction using the BigDye® Terminator v3.1 Cycle sequencing kit and separated by a 3730xL DNA sequence analyzer (Applied Biosystems, Foster City, CA). Sequence data were assembled using Sequencher v4.5 (GeneCodes Ann Arbor, MI) following quality and vector trimming. Gap regions were filled by sequencing PCR fragments generated from primers designed to flank the gaps. The assembly was considered complete when sequence with confidence scores of ≥ 20 as judged by KB Basecaller software (Applied Biosystems) was accumulated at every base position with at least 4X coverage.
Confirmation of IR expansion
To confirm the IR expansion that results in duplication of the genes rps19 and rpl22, PCR amplicons were generated that overlapped the junction of IRa and IRb with the LSC region. Primer sequences were as follows: rpl22F 5'-CAAAGCCCGCCAGGTAATTG-3' and psbAR 5'-CATTTCTTCCTGGCTGCTTG-3' for the amplicon overlapping IRa and LSC region and rpl22R 5'-GGAGAATTTGCGCCCACTAT-3' and rpsF 5'-CTATCCGTGCAATTCCCTCA-3' for the amplicon overlapping IRb and LSC region. Following PCR, the amplicons were cloned into the pCR®4-TOPO vector following the manufacturer's instructions (Invitrogen), then sequenced using methods described above.
The Citrus sinensis genome was annotated using DOGMA [Dual Organellar GenoMe Annotator, 62]. Further, searches against a custom database of the previously published chloroplast genomic sequences using BLASTX were used to identify additional putative protein-coding genes. Both tRNAs and rRNAs were identified by searches against the same database using BLASTN.
To determine the repeat structure of the Citrus chloroplast genome, REPuter  was used to identify the number and location of direct and inverted (palindromic) repeats using a minimum repeat size of 30 bp and a Hamming distance of 3 (i.e., repfind -f -p -l 30 -h 3 -best 10000).
Variation between coding sequences and cDNAs
Positional determination of potential RNA edits was accomplished using 1505 cp sequences from GenBank without chromatographic traces in addition to in-house Citrus sinensis ESTs that contained chromatograms . Only regions having a redundancy of at least four ESTs at each position were considered in the analysis. Differences were counted only when a base change was observed in the consensus sequence based on plurality. All assembly comparisons were made with the help of Sequencher v4.5.
Phylogenetic analysis was performed by using PAUP* version 4.10 b10 . Phylogenetic analyses excluded gap regions to avoid ambiguity in regions where alignment was problematic. All MP searches included 100 random addition replicates and TBR branch swapping with the Multrees option. Modeltest 3.7  was used to determine the most appropriate model of DNA sequence evolution for the combined 61-gene dataset. Hierarchical likelihood ratio tests and the Akaike information criterion were used to assess which of the 56 models best fit the data, which was determined to be GTR + G + I by both criteria. For ML analyses we performed an initial parsimony search with 100 random addition sequence replicates and TBR branch swapping, which resulted in a single tree. Model parameters were optimized onto the parsimony tree. We fixed these parameters and performed a ML analysis with three random addition sequence replicates and TBR branch swapping. The resulting ML tree was used to re-optimize model parameters, which then were fixed for another ML search with three random addition sequence replicates and TBR branch swapping. This successive approximation procedure  was repeated until the same tree topology and model parameters were recovered in multiple, consecutive iterations. Successive approximation has been shown to perform as well as full-optimization for both empirical and simulated datasets . Non-parametric bootstrap analyses  were performed for MP analyses with 1000 replicates with TBR branch swapping, 1 random addition replicate, and the Multrees option and for ML analyses with 100 replicates with NNI branch swapping, 1 random addition replicate, and the Multrees option.
small single copy
large single copy
expressed sequence tags
Polymerase Chain Reaction.
Investigations reported in this article were supported in part by grants from USDA 3611-21000-017-00D to Henry Daniell and from NSF DEB 0120709 to Robert K. Jansen. The authors would like to thank Jerry Mozoruk for technical assistance in sample preparation, the initial genome assembly & DNA preparation, Dr. Phat Dang for sequencing support at the US Horticultural Research Laboratory sequencing facility, and Dr. Kenneth H. Wolfe for alerting us to an annotation error in the cotton chloroplast genome.
- Sager R, Ishida MR: Chloroplast DNA in Chlamydomonas. Proc Natl Acad Sci USA. 1963, 50: 725-730. 10.1073/pnas.50.4.725.PubMedPubMed CentralView Article
- Sugiura M: History of chloroplast genomics. Photosynthesis Research. 2003, 76: 371-377. 10.1023/A:1024913304263.PubMedView Article
- Raubeson LA, Jansen RK: Chloroplast genomes of plants. In Diversity and Evolution of Plants-Genotypic and Phenotypic Variation in Higher Plants Edited by: Henry H. Wallingford: CABI Publishing;2005:45-68.View Article
- Wolfe KH, Li WH, Sharp PM: Rates of nucleotide substitution vary greatly among plant mitochondrial, chloroplast, and nuclear DNAs. Proc Natl Acad Sci USA. 1987, 84: 9054-9058. 10.1073/pnas.84.24.9054.PubMedPubMed CentralView Article
- Daniell H, Cohill PR, Kumar S, Dufourmantel N: Chloroplast Genetic Engineering. In Molecular Biology and Biotechnology of Plant Organelles Edited by: Daniell H, Chase CD. Netherlands: Springer Publishers; 2004:443-490.View Article
- Daniell H, Lee SB, Grevich J, Saski C, Quesada-Vargas T, Guda C, Tomkins J, Jansen RK: Complete chloroplast genome sequences of Solanum bulbocastanum, Solanum lycopersicum and comparative analyses with other Solanaceae genomes. Theor Applied Genet. 2006, 112: 1503-1518. 10.1007/s00122-006-0254-x.View Article
- Martin W, Stoebe B, Goremykin V, Hansmann S, Hasegawa M, Kowallik KV: Gene transfers to the nucleus and the evolution of chloroplasts. Nature. 1998, 393: 162-165. 10.1038/30234.PubMedView Article
- Gabriel DW: Citrus canker. Encyclopedia of Plant Pathology. Edited by: Maloy OC, Murray TD. New York: John Wiley & Sons; 2001:215-217.
- DeCosa B, Moar W, Lee SB, Miller M, Daniell H: Overexpression of the Bt cry2Aa2 operon in chloroplasts leads to formation of insecticidal crystals. Nat Biotechnol. 2001, 19: 71-74. 10.1038/83559.View Article
- Quesada-Vargas T, Ruiz ON, Daniell H: Characterization of heterologous multigene operons in transgenic chloroplasts: transcription, processing, translation. Plant Physiol. 2005, 138: 1746-1762. 10.1104/pp.105.063040.PubMedPubMed CentralView Article
- Daniell H, Khan M, Allison L: Milestones in chloroplast genetic engineering: an environmentally friendly era in biotechnology. Trends Plant Sci. 2002, 7: 84-91. 10.1016/S1360-1385(01)02193-8.PubMedPubMed CentralView Article
- Ruiz ON, Daniell H: Engineering Cytoplasmic Male Sterility via the Chloroplast Genome by expression of β-ketothiolase. Plant Physiol. 2005, 138: 1232-1246. 10.1104/pp.104.057729.PubMedPubMed CentralView Article
- Dhingra A, Portis AR, Daniell H: Enhanced translation of a chloroplast expressed RbcS gene restores small subunit levels and photosynthesis in nuclear RbcS antisense plants. Proc Natl Acad Sci USA. 2004, 101: 6315-6320. 10.1073/pnas.0400981101.PubMedPubMed CentralView Article
- Daniell H, Kumar S, Duformantel N: Breakthrough in chloroplast genetic engineering of agronomically important crops. Trends Biotechnol. 2005, 23: 238-245. 10.1016/j.tibtech.2005.03.008.PubMedPubMed CentralView Article
- Daniell H, Lee SB, Panchal T, Wiebe PO: Expression of cholera toxin B subunit gene and assembly as functional oligomers in transgenic tobacco chloroplasts. J Mol Biol. 2001, 311: 1001-1009. 10.1006/jmbi.2001.4921.PubMedPubMed CentralView Article
- Lee SB, Kwon HB, Kwon SJ, Park SC, Jeong MJ, Han SE, Daniell H: Accumulation of trehalose within transgenic chloroplasts confers drought tolerance. Mol Breed. 2003, 11: 1-13. 10.1023/A:1022100404542.View Article
- Leelavathi S, Reddy VS: Chloroplast expression of His-tagged GUS-fusions: a general strategy to overproduce and purify foreign proteins using transplastomic plants as bioreactors. Mol Breed. 2003, 11: 49-58. 10.1023/A:1022114427971.View Article
- Daniell H, Chebolu S, Kumar S, Singleton M, Falconer R: Chloroplast-derived vaccine antigens and other therapeutic proteins. Vaccine. 2005, 23: 1779-1783. 10.1016/j.vaccine.2004.11.004.PubMedView Article
- Vitanen PV, Devine AL, Khan S, Deuel DL, Van Dyk DE, Daniell H: Metabolic engineering of the chloroplast genome using the E. coli ubiC gene reveals that chorismate is a readily abundant precursor for p-hydroxybenzoic acid synthesis in plants. Plant Physiol. 2004, 136: 4048-4060. 10.1104/pp.104.050054.View Article
- Daniell H, Datta R, Varma S, Gray S, Lee SB: Containment of herbicide resistance through genetic engineering of the chloroplast genome. Nat Biotechnol. 1998, 16: 345-348. 10.1038/nbt0498-345.PubMedView Article
- DeGray G, Rajasekaran K, Smith F, Sanford J, Daniell H: Expression of an antimicrobial peptide via the chloroplast genome to control phytopathogenic bacteria and fungi. Plant Physiol. 2001, 127: 852-862. 10.1104/pp.127.3.852.PubMedPubMed CentralView Article
- Kota M, Daniel H, Varma S, Garczynski SF, Gould F, William MJ: Overexpression of the Bacillus thuringiensis (Bt) Cry2Aa2 protein in chloroplasts confers resistance to plants against susceptible and Bt-resistant insects. Proc Natl Acad Sci USA. 1999, 96: 1840-1845. 10.1073/pnas.96.5.1840.PubMedPubMed CentralView Article
- Kumar S, Dhingra A, Daniell H: Plastid expressed betaine aldehyde dehydrogenase gene in carrot cultured cells, roots and leaves confers enhanced salt tolerance. Plant Physiol. 2004, 136: 2843-2854. 10.1104/pp.104.045187.PubMedPubMed CentralView Article
- Ruiz ON, Hussein HS, Terry N, Daniell H: Phytoremediation of organomercurial compounds via chloroplast genetic engineering. Plant Physiol. 2003, 132: 1344-1352. 10.1104/pp.103.020958.PubMedPubMed CentralView Article
- Grevich J, Daniell H: Chloroplast genetic engineering: Recent advances and perspectives. Crit Rev Plant Sci. 2005, 24: 1-25.View Article
- Dufourmantel N, Pelissier B, Garçon F, Peltier G, Ferullo JM, Tissot G: Generation of fertile transplastomic soybean. Plant Mol Biol. 2004, 55: 479-89. 10.1007/s11103-004-0192-4.PubMedView Article
- Lelivelt CLC, McCabe MS, Newell CA, deSnoo CB, van Dun KMP, Birch-Machin I, Gray JC, Mills KHG, Nugent JM: Stable chloroplast transformation in lettuce (Lactuca sativa L.). Plant Mol Biol. 2005, 58: 763-774. 10.1007/s11103-005-7704-8.PubMedView Article
- Kumar S, Dhingra A, Daniell H: Stable transformation of the cotton plastid genome and maternal inheritance of transgenes. Plant Mol Biol. 2004, 56: 203-216. 10.1007/s11103-004-2907-y.PubMedPubMed CentralView Article
- Sidorov VA, Kasten D, Pang SZ, Hajdukiewicz PT, Staub JM, Nehra NS: Technical advance: stable chloroplast transformation in potato: use of green fluorescent protein as a plastid marker. Plant J. 1999, 19: 209-216. 10.1046/j.1365-313X.1999.00508.x.PubMedView Article
- Ruf S, Hermann M, Berger I, Carrer H, Bock R: Stable genetic transformation of tomato plastids and expression of a foreign protein in fruit. Nat Biotechnol. 2001, 19: 870-875. 10.1038/nbt0901-870.PubMedView Article
- Nguyen TT, Nugent G, Cardi T, Dix PJ: Generation of homoplasmic plastid transformants of a commercial cultivar of potato (Solanum tuberosum L.). Plant Sci. 2005, 168: 1495-1500. 10.1016/j.plantsci.2005.01.023.View Article
- Saski C, Lee S, Daniell H, Wood T, Tomkins J, Kim HG, Jansen RK: Complete chloroplast genome sequence of Glycine max and comparative analyses with other legume genomes. Plant Mol Biol. 2005, 59: 309-322. 10.1007/s11103-005-8882-0.PubMedView Article
- Goulding SE, Olmstead RG, Morden CW, Wolfe KH: Ebb and flow of the chloroplast inverted repeat. Mol Gen Genet. 1996, 252: 195-206. 10.1007/BF02173220.PubMedView Article
- Millen RS, Olmstead RG, Adams KL, Palmer JD, Lao NT, Heggie L, Kavanagh TA, Hibberd JM, Gray JC, Morden CW, Calie PJ, Jermiin LS, Wolfe KH: Many parallel losses of infA from chloroplast DNA during angiosperm evolution with multiple independent transfers to the nucleus. The Plant Cell. 2001, 13: 645-658. 10.1105/tpc.13.3.645.PubMedPubMed CentralView Article
- Gantt JS, Baldauf SL, Caile PJ, Weeden NF, Palmer JD: Transfer of rpl22 to the nucleus greatly preceded its loss from the chloroplast and involved the gain of an intron. The Embo J. 1991, 10: 3073-3078.PubMed
- Lee SB, Kaittanis C, Jansen RK, Hostetler JB, Tallon LJ, Town CD, Daniell H: The complete chloroplast genome sequence of Gossypium hirsutum : organization and phylogenetic relationships to other angiosperms. BMC Genomics. 2006, 7: 61-10.1186/1471-2164-7-61.PubMedPubMed CentralView Article
- Jansen RK, Kaittanis C, Saski C, Lee SB, Tomkins J, Alverson AJ, Daniell H: Phylogenetic analyses of Vitis (Vitaceae) based on complete chloroplast genome sequences: effects of taxon sampling and phylogenetic methods on resolving relationships among rosids. BMC Evol Biol. 2006, 6: 32-10.1186/1471-2148-6-32.PubMedPubMed CentralView Article
- Soltis DE, Soltis PS, Chase MW, Mort ME, Albach DC, Zanis M, Savolainen V, Hahn WJ, Hoot SB, Fay MF, Axtell M, Swensen SM, Prince LM, Kress WJ, Nixon KC, Farris JS: Angiosperm phylogeny inferred from 18S rDNA, rbcL, and atpB sequences. Bot J Linn Soc. 2000, 133: 381-461. 10.1006/bojl.2000.0380.View Article
- Leebens-Mack J, Raubeson LA, Cui L, Kuehl JV, Fourcade MH, Chumley TW, Boore JL, Jansen RK, dePamphilis CW: Identifying the basal angiosperms node in chloroplast genome phylogenies: Sampling one's way out of the Felsenstein zone. Mol Biol Evol. 2005, 22: 1948-1963. 10.1093/molbev/msi191.PubMedView Article
- Qiu Y-L, Li L, Hendry T, Li R, Taylor DW, Issa MJ, Ronen AJ, Vekaria ML, White AM: Reconstructing the basal angiosperm phylogeny: evaluating information content of the mitochondrial genes. Taxon. 2006.
- Mathews S, Donoghue MJ: The root of angiosperm phylogeny inferred from duplicate phytochrome genes. Science. 1999, 286: 947-950. 10.1126/science.286.5441.947.PubMedView Article
- Graham SW, Olmstead RG: Utility of 17 chloroplast genes for inferring the phylogeny of the basal angiosperms. Am J Bot. 2000, 87: 1712-1730. 10.2307/2656749.PubMedView Article
- Zanis MJ, Soltis DE, Soltis PS, Mathews S, Donoghue MJ: The root of the angiosperms revisited. Proc Natl Acad Sci. 2002, 99: 6848-6853. 10.1073/pnas.092136399.PubMedPubMed CentralView Article
- Qiu Y-L, Dombrovska O, Lee J, Li L, Whitlock BA, Bernasconi-Quadroni F, Rest JS, Davis CC, Borsch T, Hilu KW, Renner SS, Soltis DE, Soltis PS, Zanis MJ, Cannone JJ, Gutell RR, Powell M, Savolainen V, Chatrou LW, Chase MW: Phylogenetic analysis of basal angiosperms based on nine plastid, mitochondrial, and nuclear genes. Int J Plt Sci. 2005, 166: 815-842. 10.1086/431800.View Article
- Nickrent DL, Blarer A, Qiu Y-L, Soltis DE, Soltis PS, Zanis M: Molecular data place Hydnoraceae with Aristolochiaceae. Amer J Bot. 2002, 89: 1809-1817.View Article
- Hilu KW, Borsch T, Muller K, Soltis DE, Soltis PS, Savolainen V, Chase M, Powell M, Alice L, Evans R, Sauquet H, Neinhuis C, Slotta T, Rohwer J, Chatrou L: Inference of angiosperm phylogeny based on matK sequence information. Amer J Bot. 2003, 90: 1758-1776.View Article
- Soltis DE, Soltis PS: Amborella not a "basal angiosperm"? Not so fast. Amer J Bot. 2004, 91: 997-1001.View Article
- Soltis DE, Albert VA, Savolainen V, Hilu K, Qiu Y-L, Chase MW, Farris JS, Stefanoviæ S, Rice DW, Palmer JD, Soltis PS: Genome-scale data, angiosperm relationships, and 'ending incongruence': a cautionary tale in phylogenetics. Trends Plant Sci. 2004, 9: 477-483. 10.1016/j.tplants.2004.08.008.PubMedView Article
- Stefanovic S, Rice DW, Palmer JD: Long branch attraction, taxon sampling, and the earliest angiosperms: Amborella or monocots?. BMC Evol Biol. 2004, 4: 35-10.1186/1471-2148-4-35.PubMedPubMed CentralView Article
- Goremykin VV, Holland B, Hirsch-Ernst KI, Hellwig FH: Analysis of Acorus calamus chloroplast genome and its phylogenetic implications. Mol Biol Evol. 2005, 22: 1813-1822. 10.1093/molbev/msi173.PubMedView Article
- Martin W, Deusch O, Stawski N, Grunheit N, Goremykin V: Chloroplast genome phylogenetics: why we need independent approaches to plant molecular evolution. Trends Plant Sci. 2005, 10: 203-209. 10.1016/j.tplants.2005.03.007.PubMedView Article
- Lockhart PJ, Penny D: The place of Amborella within the radiation of angiosperms. Trends Plant Sci. 2005, 10: 201-202. 10.1016/j.tplants.2005.03.006.PubMedView Article
- Bruno WJ, Halpern AL: Topological bias and inconsistency of maximum likelihood using wrong models. Mol Biol Evol. 1999, 16: 564-566.PubMedView Article
- Swofford DL, Waddell PJ, Huelsenbeck JP, Foster PG, Lewis PO, Rogers JS: Bias in phylogenetic estimation and its relevance to the choice between parsimony and likelihood methods. Syst Biol. 2001, 50: 525-539. 10.1080/106351501750435086.PubMedView Article
- Poe S: ensitivity of phylogeny estimation to taxonomic sampling. Syst Biol. 1998, 47: 18-31. 10.1080/106351598261003.PubMedView Article
- Hillis DM: Taxonomic sampling, phylogenetic accuracy, and investigator bias. Syst Biol. 1998, 47: 3-8. 10.1080/106351598260987.PubMedView Article
- Zwickl DJ, Hillis DM: Increased taxon sampling greatly reduces phylogenetic error. Syst Biol. 2002, 51: 588-598. 10.1080/10635150290102339.PubMedView Article
- Soltis DE, Soltis PS, Endress PK, Chase MW: Phylogeny and evolution of AngiospermsSunderland Massachusetts: Sinauer Associates Inc;2005.
- Goremykin VV, Hirsch-Ernst KI, Wolfl S, Hellwig FH: Analysis of the Amborella trichopoda chloroplast genome sequence suggests that Amborella is not a basal angiosperm. Mol Biol Evol. 2003, 20: 1499-1505. 10.1093/molbev/msg159.PubMedView Article
- Savolainen V, Chase MW, Hoot SB, Morton CM, Soltis DE, Bayer C, Fay MF, De Bruijn AY, Sullivan S, Qiu Y-L: Phylogenetics of flowering plants based upon a combined analysis of plastid atpB and rbcL gene sequences. Syst Biol. 2000, 49: 306-362. 10.1080/10635159950173861.PubMedView Article
- Jansen RK, Raubeson LA, Boore JL, dePamphilis CW, Chumley TW, Haberle RC, Wyman SK, Alverson AJ, Peery R, Herman SJ, Fourcade HM, Kuehl JV, McNeal JR, Leebens-Mack J, Cui L: Methods for obtaining and analyzing chloroplast genome sequences. Meth Enzymol. 2005, 395: 348-384. 10.1016/S0076-6879(05)95020-9.PubMedView Article
- Wyman SK, Jansen RK, Boore JL: Automatic annotation of organellar genomes with DOGMA. Bioinformatics. 2004, 20: 3252-3255. 10.1093/bioinformatics/bth352. [http://www.evogen.jgi-psf.org/dogma].PubMedView Article
- Kurtz S, Choudhuri JV, Ohlebusch E, Schleiermacher C, Stoye J, Giegerich R: REPuter: the manifold applications of repeat analysis on a genomic scale. Nucl Acids Res. 2001, 29: 4633-4642. 10.1093/nar/29.22.4633.PubMedPubMed CentralView Article
- Bausher M, Shatters R, Chapparo J, Dang P, Hunter W, Niedz R: An expressed sequence tag (EST) set from Citrus sinensis L. Osbeck whole seedlings and the implications of further perennial source investigations. Plant Science. 2003, 165: 415-422. 10.1016/S0168-9452(03)00202-4.View Article
- Swofford DL: PAUP*: Phylogenetic analysis using parsimony (*and other methods), ver. 4.0. Sunderland MA: Sinauer Associates;2003.
- Posada D, Crandall KA: MODELTEST: testing the model of DNA substitution. Bioinformatics. 1998, 14: 817-818. 10.1093/bioinformatics/14.9.817.PubMedView Article
- Sullivan J, Abdo Z, Joyce P, Swofford DL: Evaluating the performance of a successive-approximations approach to parameter optimization in maximum-likelihood phylogeny estimation. Mol Biol Evol. 2005, 22: 1386-1392. 10.1093/molbev/msi129.PubMedView Article
- Felsenstein J: Confidence limits on phylogenies: an approach using the bootstrap. Evolution. 1985, 39: 783-791. 10.2307/2408678.View Article
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.