Construction of a bacterial artificial chromosome library from the spikemoss Selaginella moellendorffii: a new resource for plant comparative genomics
© Wang et al; licensee BioMed Central Ltd. 2005
Received: 06 January 2005
Accepted: 14 June 2005
Published: 14 June 2005
The lycophytes are an ancient lineage of vascular plants that diverged from the seed plant lineage about 400 Myr ago. Although the lycophytes occupy an important phylogenetic position for understanding the evolution of plants and their genomes, no genomic resources exist for this group of plants.
Here we describe the construction of a large-insert bacterial artificial chromosome (BAC) library from the lycophyte Selaginella moellendorffii. Based on cell flow cytometry, this species has the smallest genome size among the different lycophytes tested, including Huperzia lucidula, Diphaiastrum digita, Isoetes engelmanii and S. kraussiana. The arrayed BAC library consists of 9126 clones; the average insert size is estimated to be 122 kb. Inserts of chloroplast origin account for 2.3% of the clones. The BAC library contains an estimated ten genome-equivalents based on DNA hybridizations using five single-copy and two duplicated S. moellendorffii genes as probes.
The S. moellenforffii BAC library, the first to be constructed from a lycophyte, will be useful to the scientific community as a resource for comparative plant genomics and evolution.
The lycophytes (class Lycopsida) are an ancient group of vascular plants that dominated the earth's flora during the Carboniferous period. The three orders of lycophytes that remain from this period include the homosporous Lycopodiales, the heterosporous Selaginellales and the heterosporous Isoetales. All of these plants are distinguishable from ferns and flowering plants by the presence of microphylls (as opposed to euphylls), the absence of leaf gaps and the absence of lateral roots. In common with ferns but not flowering plants, all lycophytes produce free-living spores, an independent gametophyte generation and non-integumented sporangia. Based upon the fossil record, the lycophytes are thought to have emerged during the early Devonian about 400 Myr ago prior to the evolution of leaves and roots in vascular plants [1, 2]. Based on recent DNA-based phylogenetic analyses, the Lycopsida clade is monophyletic and sister to the fern/seed plant, or euphyllophyte, clade . As representatives of the earliest and still-surviving vascular plant lineage, the lycophytes are an important group of plants for providing insights into the early evolution of land plants.
While genomic resources are available for many species of flowering plants, including the sequences of the Arabidopsis thaliana , rice [5, 6] and poplar  genomes, very few resources exist for plants other than angiosperms. A draft genome sequence is available for Chlamydomonas reinhardtii , a chlorophytic green alga that is a distant relative of the charophytic algal group that gave rise to land plants . The huge phylogenetic gap between the characterized genomes of Chlamydomonas reinhardtii and flowering plants greatly limits our ability to study how important features in plants originated and diversified at a genetic level. To help fill this gap, the genome sizes of several species of lycophytes were determined by cell flow cytometry in order to identify a lycophyte with a relatively small genome. Of those surveyed, the spikemoss Selaginella moellendorffii was found to have the smallest, with a nuclear genome size less than 127 Mbp. Here we describe the construction and characterization of a large insert BAC library for this species.
Results and discussion
Genome size estimates
Results of cell flow cytometry to determine the nuclear genome sizes of some lycophytes.
Internal standard (pg/2C)
DNA content of sample (pg/2C); n1; SD
DNA content of sample species (Mbp/1C)2
Chick red blood cells (2.333)
11.40; 4; 0.036
Chick red blood cells (2.333)
5.45; 4; 0.017
Glycine max leaf (2.254)
3.49; 4; 0.005
Arabidopsis thaliana flower buds (0.365)
Oryza sativa leaf (1.06)
Glycine max leaf (2.254)
0.18; 8; 0.009
0.25; 4; 0.25
0.26; 4; 0.006
Oryza sativa leaf (1.06)
Glycine max leaf (2.254)
0.43; 4; 0.006
0.49; 4; 0.003
BAC library construction and characterization
To estimate the extent of chloroplast and mitochondrial DNA contamination of the BAC library, the arrayed library was probed with two S. moellendorffi DNA fragments that contain either chloroplast- or mitochondrial-encoded genes (Table 1). The S. moellendorffii DNA fragment containing the chloroplast encoded ribosomal proteins S8, L2 and S19 hybridized to 207 BAC clones. The S. moellendorffii DNA fragment containing the mitochondria-encoded NADH DEHYDROGENASE SUBUNIT 5 gene did not hybridize to any BAC clones but did hybridize to itself (data not shown). It is unlikely that either fragment is of nuclear origin given that these genes are encoded by organelles in every plant species studied so far. The results of these hybridizations demonstrate that a very small proportion (2.3%) of the BAC inserts are of chloroplast origin. This is within the expected range for organellar DNA contamination in large insert DNA libraries . The inability to detect clones that hybridize to mitochondrial DNA might reflect a mitochondrial genome that is small enough to be efficiently removed from the nuclear DNA preparation. However, the sizes of mitochondrial genomes in the lycophytes have yet to be reported.
Results of DNA hybridizations to determine gene copy number and the number of genome equivalents represented in the arrayed BAC library.
Homologous to following plant genes (e value; % amino acid identity)1:
Genome copy number:
Number of positive BAC clones from arrayed library:
GIBBERELLIC ACID INSENSITIVE (8e-34; 43)
CYTOCHROME P450 98A3 (1e-40; 66)
OXYANION TRANSLOCATION PROTEIN (1e-36; 53)
SHORTROOT (3e-45; 47)
Zn-FINGER PROTEIN (1e-69; 90)
Mg CHELATASE SUBUNIT H (1e-130; 85)
SYNTAXIN (9e-55; 54)
S. moellendorffii chloroplast fragment2
Ribosomal protein L22 (2e-33; 56)
Ribosomal protein L2 (1e-33; 71)
Ribosomal protein S19 (2e-31; 71)
S. moellendorffii mitochondrial fragment2
Contains NADH DEHYDROGENASE subunit 5 (6e-61; 71)
To our knowledge, this is the first Lycopsida BAC library constructed. A BAC library was recently published for Physcomitrella patens , a moss that has become a popular model system for functional genomics, biochemistry, evolutionary and developmental genetic studies (reviewed by ). As a representative of the early branching vascular plants, the S. moellendorffii library described here will link genomic resources from algae (Chlamydomonas reinhardtii) and moss (P. patens) to seed plants, including important crop species. The library also will be a useful resource for readily identifying genes that are involved in developmental, physiological and biochemical processes in the lycophytes and provide an important tool for the study of plant evolution. The nuclear genome of S. moellendorffi is currently being sequenced by the Department of Energy Joint Genome Institute using a shotgun sequencing approach . The BAC library described here is currently available to the scientific community through the Arizona Genomics Institute .
We have shown that the lycophyte S. moellendorffii has a very small genome size, as small or smaller than that of Arabidopsis thaliana based on cell flow cytometry, and have constructed from this species a large insert BAC library that contains about 10 genome equivalents and has an average insert size of 122 kb.
Selaginella moellendorffii plants were obtained from Plant Delights Nursery, Inc., Raleigh, NC. Huperzia lucidula plants were obtained from Carolina Biological Supply Company (Burlington, NC; referred to there as Lycopodium lucidulum). Diphaiastrum digita plants were obtained from Gar Rothwell (Ohio University, Athens, OH). Isoetes engelmannii plants were obtained from Gerald Gastony (Indiana University, Bloomington, IN). Once obtained, all plants were grown in a local greenhouse under 50% shade cloth.
Nuclear DNA content determination
The procedure used to analyze nuclear DNA content in plant cells was modified from Arumuganathan and Earle . Glycine max, Oryza sativa cv Nipponbare or Arabidopsis thaliana or chicken red blood cell nuclei were used as internal standards. For flow cytometric analysis, 50 mg of fresh leaf tissue was placed on ice in a sterile 35 × 10 mm plastic petri dish. The tissue was sliced into 0.25 mm to 1 mm segments in a solution containing 10 mM MgSO4, 50 mM KCl, 5 mM Hepes, pH 8.0, 3 mM dithiothreitol, 0.1 mg ml-1 propidium iodide, 1.5 mg ml-1 DNase free RNase (Rhoche, Indianapolis, IN) and 0.25% (v/v) Triton X-100. The suspended nuclei were filtered through 30 μm nylon mesh and incubated at 37 C for 30 min before flow cytometric analysis. Suspensions of sample nuclei were each spiked with a suspension of standard nuclei (prepared in above solution) and analyzed with a FAC calibur flow cytometer (Becton-Dickinson, San Jose, CA). For each measurement, the propidium iodide fluorescence area signals (FL2-A) from 1000 nuclei were collected and analyzed by CellQuest software (Becton-Dickinson, San Jose, CA) on a Macintosh computer. The mean position of the G0/G1 (Nuclei) peak of the sample and the internal standard were determined by CellQuest software. The mean nuclear DNA content of each plant sample, measured in pg, was based on 1000 scanned nuclei.
BAC library construction
The growing tips (1 cm) of plants were harvested and flash frozen in liquid nitrogen prior to nuclei preparation. Purified nuclei were prepared according to Luo and Wing . The embedding of nuclei, Hind III restriction enzyme digestion of DNA and the preparation of high molecular weight DNA fragments were performed according to Luo and Wing . The Hind III cloning-ready single copy pIndigoBAC536 vector was prepared from the high copy pCUGIBAC1 plasmid as described by Luo et al. . High molecular weight genomic DNA fragments were ligated to the vector and transformed into E. coli stain DH10B T1 phage-resistant cells (Invitrogen, Carlsbad, CA). Transformed colonies were picked and transferred into individual wells of 384 microtiter plates, grown and then stored at -80C. The BAC library was gridded onto 11.25 × 22.5 cm filters in high density, double spots and 4 × 4 patterns with a Genetix QB (Genetix, UK). To characterize the BAC inserts, BAC DNA samples were prepared with a Tomtec Quadra 96 model 320 (Tomtec, Hamden, CT) in a 96-well format, digested with Not I, separated on 1% agarose CHEF (Bio-Rad, Hercules, CA) gels at 5–15 sec linear ramp time, 6 V/cm, 14C in 0.5 × TBE buffer for 16 hours and stained with ethidium bromide.
Genomic DNA for gel blot analysis and PCR was isolated from S. moellendorffii plants using the Nucleon Phytopure kit (Amersham Biosciences, Piscataway, NJ). RNA was isolated using the RNeasy Plant Mini Kit (Qiagen, Valencia, CA); cDNA was synthesized and RT-PCR performed using the cMaster RTplusPCR System (Eppendorf, Westbury, NY). The 479 bp SmGAI cDNA fragment (GenBank accession AY874058) was initially obtained by RT-PCR using the primers 5'cayttyacigciaaycargci3' and 5' tcraaiarigcrctrtartartg3'. The 473 bp SmCYP98 cDNA fragment (GenBank accession AY843208) was obtained by RT-PCR using the primers 5'gtdgcvttcaacaacatwac3' and 5'ccatnccwgchgtgatcat3'. In all cases, y = c or t, r = a or g, d = a, g or t; v = a, c or g; w = a or t; n = a, t, g or c. All PCR products were cloned into the pGEM-T EASY vector (Promega, Madison, WI) and sequenced. Several genes used as probes were generated by PCR using genomic DNA as template. The 302 bp genomic SmSHR fragment (GenBank accession AY877259) was obtained using the primers 5'ggtggacctctcctctcctc3' and 5'atccaggtttgtagcgcttg3', the 254 bp genomic SmZNF fragment (GenBank accession AY877260) was obtained using the primers 5'gaggtcgtctccttgtcacc3' and 5'cggcgaaagtgtttcttgat3', the 395 bp genomic SmChlH fragment (GenBank accession AY877262) was obtained using the primers 5'ggatcgccttcatatccaaa3' and 5'aaactcgcggtcacagtctt3', and the 375 bp genomic SmSNT fragment (GenBank accession AY877261) was obtained using the primers 5'gatccaggccaagatgaaga3' and 5'tgccagtgaccgtgaagtag3'. The SmOTP gene (accession number AY877263) was identified from a S. moellendorffii EST library; the entire insert was used as a probe. The S. moellendorffii chloroplast (GenBank accession AY877264) and mitochondrial (GenBank accession AY877265) fragment sequences were identified from a partially sequenced, small-insert, sheared genomic library. For DNA blot hybridizations, each cloned insert was gel purified and labeled with 32P using the Megaprime DNA Labelling System (Amersham Biosciences, Piscataway, NJ). For DNA gel blots, 4 μg of S. moellendorffii genomic DNA was digested with restriction enzymes, fractionated and alkaline transferred to nylon membranes according to Sambrook and Russell . All filters were hybridized at 65C in a solution containing 0.5 M phosphate buffer (pH 7.2), 7% (w/v) SDS, and 1 mM EDTA. All membranes were washed under stringent conditions (0.1X SSC, 0.1% SDS, 65C).
We thank Kiran Rao, Cari Soderlund, Angelina Angelova, Amber Hopf and Luke Gumaelius for their assistance in developing the BAC library resource. This work was supported by the Purdue Agricultural Research Programs and National Science Foundation grant numbers 0207110 (Banks), 0211611 (Wing), and 0208502 (Mandoli). This is manuscript number 2005-17581 of the Purdue Agricultural Research Program.
- Kenrick P, Crane PR: The origin and early diversification of plants on land. Nature. 1997, 389: 33-39. 10.1038/37918.View ArticleGoogle Scholar
- Stewart W, Rothwell GW: Paleobotany and the Evolution of Plants. 1993, Cambridge, Cambridge University Press, 2nd.Google Scholar
- Pryer KM, Schneider H, Smith AR, Cranfill R, Wolf PG, Hunt JS, Sipes SD: Horsetails and ferns are a monophyletic group and the closest living relatives to seed plants. Nature. 2001, 409: 618-622. 10.1038/35054555.PubMedView ArticleGoogle Scholar
- Arabdiopsis Genome Initiative: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408: 796-815. 10.1038/35048692.View ArticleGoogle Scholar
- Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, Hadley D, Hutchison D, Martin C, Katagiri F, Lange BM, Moughamer T, Xia Y, Budworth P, Zhong J, Miguel T, Paszkowski U, Zhang S, Colbert M, Sun WL, Chen L, Cooper B, Park S, Wood TC, Mao L, Quail P, Wing R, Dean R, Yu Y, Zharkikh A, Shen R, Sahasrabudhe S, Thomas A, Cannings R, Gutin A, Pruss D, Reid J, Tavtigian S, Mitchell J, Eldredge G, Scholl T, Miller RM, Bhatnagar S, Adey N, Rubano T, Tusneem N, Robinson R, Feldhaus J, Macalma T, Oliphant A, Briggs S: A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science. 2002, 296: 92-100. 10.1126/science.1068275.PubMedView ArticleGoogle Scholar
- Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, Cao M, Liu J, Sun J, Tang J, Chen Y, Huang X, Lin W, Ye C, Tong W, Cong L, Geng J, Han Y, Li L, Li W, Hu G, Li J, Liu Z, Qi Q, Li T, Wang X, Lu H, Wu T, Zhu M, Ni P, Han H, Dong W, Ren X, Feng X, Cui P, Li X, Wang H, Xu X, Zhai W, Xu Z, Zhang J, He S, Xu J, Zhang K, Zheng X, Dong J, Zeng W, Tao L, Ye J, Tan J, Chen X, He J, Liu D, Tian W, Tian C, Xia H, Bao Q, Li G, Gao H, Cao T, Zhao W, Li P, Chen W, Zhang Y, Hu J, Liu S, Yang J, Zhang G, Xiong Y, Li Z, Mao L, Zhou C, Zhu Z, Chen R, Hao B, Zheng W, Chen S, Guo W, Tao M, Zhu L, Yuan L, Yang H: A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002, 296: 79-92. 10.1126/science.1068037.PubMedView ArticleGoogle Scholar
- JGI Populus trichocarpa v1.0. [http://genome.jgi-psf.org/Poptr1/Poptr1.home.html].
- JGI Chlamydomonas reinhardtii v2.0. 2005
- Qui YL, Palmer JD: Phylogeny of early land plants: insights from genes and genomes. Trends in Plant Science. 1999, 4: 26-30. 10.1016/S1360-1385(98)01361-2.Google Scholar
- Obermayer R, Leitch IJ, Hanson L, Bennett MD: Nuclear DNA C-values in 30 species double the familial representation in pteridophytes. Ann Bot (Lond). 2002, 90: 209-217. 10.1093/aob/mcf167.View ArticleGoogle Scholar
- Luo M, Wing R: An improved method for plant BAC library construction. Methods in Molecular Biology. 2003, 236: 3-20.PubMedGoogle Scholar
- Zhang X, Choi S, Woo S, Li Z, RA W: Construction and characterization of two rice bacterial artificial chromosome libraries from the parents of a permanent recombinant inbred mapping population. Molecular Breeding. 1996, 2: 11-24.View ArticleGoogle Scholar
- Liang C, Xi Y, Shu J, Li J, Yang J, Che K, Jin D, Liu X, Weng M, He Y, Wang B: Construction of a BAC library of Physcomitrella patens and isolation of a LEA gene. Plant Science. 2004, 167: 491-498. 10.1016/j.plantsci.2004.04.015.View ArticleGoogle Scholar
- Schaefer DG, Zryd JP: The moss Physcomitrella patens, now and then. Plant Physiology. 2001, 127: 1430-1438. 10.1104/pp.127.4.1430.PubMedPubMed CentralView ArticleGoogle Scholar
- JGI Community Sequencing Program Sequencing Plans for 2005. [http://www.jgi.doe.gov/sequencing/cspseqplans.html].
- Arizona Genomics Institute Ordering Website. [http://www.genome.arizona.edu/orders/direct.html?library=SM__Ba].
- Arumuganathan E, Earle ED: Estimation of nuclear DNA content of plants by flow cytometry. Plant Molecular Biology Reporter. 1991, 9: 229-233.View ArticleGoogle Scholar
- Luo M, Wang Y, Frisch D, Joobeur T, Wing RA, Dean RA: Melon bacterial artificial chromosome (BAC) library construction using improved methods and identification of clones linked to the locus conferring resistance to melon Fusarium Wilt. Genome. 2001, 44: 154-162. 10.1139/gen-44-2-154.PubMedView ArticleGoogle Scholar
- Sambrook J, Russell DW: Molecular cloning : a laboratory manual. 2001, Cold Spring Harbor, N.Y, Cold Spring Harbor Laboratory Press, 1: 3Google Scholar
- Galbraith DW, Harkins KR, Maddox JM, Ayres NM, Sharma DP, Firoozabady E: Rapid flow cytometric analysis of the cell cycle in intact plant tissues. Science. 1983, 220: 1049-1051.PubMedView ArticleGoogle Scholar
- Bennett MD, Leitch IJ: Nuclear DNA amounts in angiosperms - 583 new estimates. Annals of Botany. 1997, 8: 169-196. 10.1006/anbo.1997.0415.View ArticleGoogle Scholar
- Bennett MD, Leitch IJ, Price HJ, Johnston JS: Comparisons with Caenorhabditis (approximately 100 Mb) and Drosophila (approximately 175 Mb) using flow cytometry show genome size in Arabidopsis to be approximately 157 Mb and thus approximately 25% larger than the Arabidopsis genome initiative estimate of approximately 125 Mb. Annals of Botany. 2003, 91: 547-557. 10.1093/aob/mcg057.PubMedPubMed CentralView ArticleGoogle Scholar
- Bennett MD, Smith JB: Nuclear DNA amounts in angiosperms. Philosophical Transactions of the Royal Society of London B. 1991, 334: 309-345.View ArticleGoogle Scholar
- Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. Journal of Molecular Biology. 1990, 215: 403-410. 10.1006/jmbi.1990.9999.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.