Construction of a bacterial artificial chromosome library from the spikemoss Selaginella moellendorffii: a new resource for plant comparative genomics
BMC Plant Biology volume 5, Article number: 10 (2005)
The lycophytes are an ancient lineage of vascular plants that diverged from the seed plant lineage about 400 Myr ago. Although the lycophytes occupy an important phylogenetic position for understanding the evolution of plants and their genomes, no genomic resources exist for this group of plants.
Here we describe the construction of a large-insert bacterial artificial chromosome (BAC) library from the lycophyte Selaginella moellendorffii. Based on cell flow cytometry, this species has the smallest genome size among the different lycophytes tested, including Huperzia lucidula, Diphaiastrum digita, Isoetes engelmanii and S. kraussiana. The arrayed BAC library consists of 9126 clones; the average insert size is estimated to be 122 kb. Inserts of chloroplast origin account for 2.3% of the clones. The BAC library contains an estimated ten genome-equivalents based on DNA hybridizations using five single-copy and two duplicated S. moellendorffii genes as probes.
The S. moellenforffii BAC library, the first to be constructed from a lycophyte, will be useful to the scientific community as a resource for comparative plant genomics and evolution.
The lycophytes (class Lycopsida) are an ancient group of vascular plants that dominated the earth's flora during the Carboniferous period. The three orders of lycophytes that remain from this period include the homosporous Lycopodiales, the heterosporous Selaginellales and the heterosporous Isoetales. All of these plants are distinguishable from ferns and flowering plants by the presence of microphylls (as opposed to euphylls), the absence of leaf gaps and the absence of lateral roots. In common with ferns but not flowering plants, all lycophytes produce free-living spores, an independent gametophyte generation and non-integumented sporangia. Based upon the fossil record, the lycophytes are thought to have emerged during the early Devonian about 400 Myr ago prior to the evolution of leaves and roots in vascular plants [1, 2]. Based on recent DNA-based phylogenetic analyses, the Lycopsida clade is monophyletic and sister to the fern/seed plant, or euphyllophyte, clade . As representatives of the earliest and still-surviving vascular plant lineage, the lycophytes are an important group of plants for providing insights into the early evolution of land plants.
While genomic resources are available for many species of flowering plants, including the sequences of the Arabidopsis thaliana , rice [5, 6] and poplar  genomes, very few resources exist for plants other than angiosperms. A draft genome sequence is available for Chlamydomonas reinhardtii , a chlorophytic green alga that is a distant relative of the charophytic algal group that gave rise to land plants . The huge phylogenetic gap between the characterized genomes of Chlamydomonas reinhardtii and flowering plants greatly limits our ability to study how important features in plants originated and diversified at a genetic level. To help fill this gap, the genome sizes of several species of lycophytes were determined by cell flow cytometry in order to identify a lycophyte with a relatively small genome. Of those surveyed, the spikemoss Selaginella moellendorffii was found to have the smallest, with a nuclear genome size less than 127 Mbp. Here we describe the construction and characterization of a large insert BAC library for this species.
Results and discussion
Genome size estimates
The approximate genome sizes of several lycophyte species were determined by cell flow cytometry using as an internal standard the nuclei of other plants or cells with genomes of known sizes (Table 1). The homosporous Huperzia lucidula and Diphaiastrum digita (Lycopodiales) have the largest genomes of those surveyed, estimated to be 5585 and 2670 Mbp/1C, respectively. Isoetes engelmanii (Isoetales), a heterosporous lycophyte, has a genome size of 1710 Mbp/1C. The heterosporous Selaginella moellendorffii (Selaginellales) has the smallest genome of those surveyed, between 88 and 127 Mbp/1C. The three estimates of S. moellendorffii genome size (Table 1) vary depending on species used as the internal standard. We also confirmed that the genome size of S. moellendorffii is smaller than that of S. kraussiana (Table 1), which was previously reported to have a genome size ranging from 0.32–0.72 pg/2C, or 157–320 Mbp/1C . The decision to construct a BAC library from the heterosporous S. moellendorffii was largely based on its having the smallest genome size of all lycophyte accessions examined here or reported previously.
BAC library construction and characterization
In constructing the BAC library, nuclear DNA was isolated from the growing tips of S. mollendorffii sporophytes, partially digested with Hind III and the library constructed and processed essentially according to Luo and Wing  as described in the Methods section. A total of 9126 BAC clones were picked, arrayed and processed for long-term storage. To estimate the insert sizes of this library, BAC DNAs were prepared from 410 randomly selected clones, digested with Not I and size-separated by pulse field gel electrophoresis. The Not I restriction patterns of 43 randomly selected clones are illustrated in Figure 1. Of the 410 BAC DNAs prepared, 23 yielded no DNA and 9 lacked an insert. Of the remaining 378 clones with inserts, the inserts ranged in size from 9 to 292 kb, with 75% having inserts sizes 90–159 kb and 84% with inserts greater than 90 kb. The average insert size of the clones with inserts was 122 +/- 44 kb (SD). The distribution of insert sizes is shown in Figure 2.
To estimate the extent of chloroplast and mitochondrial DNA contamination of the BAC library, the arrayed library was probed with two S. moellendorffi DNA fragments that contain either chloroplast- or mitochondrial-encoded genes (Table 1). The S. moellendorffii DNA fragment containing the chloroplast encoded ribosomal proteins S8, L2 and S19 hybridized to 207 BAC clones. The S. moellendorffii DNA fragment containing the mitochondria-encoded NADH DEHYDROGENASE SUBUNIT 5 gene did not hybridize to any BAC clones but did hybridize to itself (data not shown). It is unlikely that either fragment is of nuclear origin given that these genes are encoded by organelles in every plant species studied so far. The results of these hybridizations demonstrate that a very small proportion (2.3%) of the BAC inserts are of chloroplast origin. This is within the expected range for organellar DNA contamination in large insert DNA libraries . The inability to detect clones that hybridize to mitochondrial DNA might reflect a mitochondrial genome that is small enough to be efficiently removed from the nuclear DNA preparation. However, the sizes of mitochondrial genomes in the lycophytes have yet to be reported.
To determine the number of genome equivalents represented by the BAC library, the arrayed library was probed with the seven S. moellendorffii genes listed in Table 2. All seven genes potentially encode proteins that are at least 43% identical in amino acid sequence to similar genes in angiosperms. The two genes homologous to GIBBERELLIN INSENSITIVE (SmGAI) and CYTOCHROME P450 98 (SmCYP98) were obtained by RT-PCR. The gene homologous to OXYANION TRANSLOCATION PROTEIN (SmOTP) was identified from an EST library (C. Chapple, unpub. obs.), while the genes homologous to the SHORTROOT (SmSHR), MAGNESIUM CHELATASE SUBUNIT H (SmChlH), ZINC-FINGER (SmZNF) and SYNTAXIN (SmSNT) gene fragments used as probes were identified from the sequences generated from the ends a sheared S. moellendorffii genomic library (J. Banks, unpub. obs.). Based upon the genomic DNA hybridization results shown in Figure 3, these genes are present in either one or two copies in the genome. The number of BAC clones that hybridized to the single-copy genes ranged from 7–12, and to about twice that number of clones (16 and 23) using the two duplicated genes as probes (Table 2). Based on the results of these experiments, the estimated number of genome equivalents represented by the BAC library is ~10. Assuming that one nuclear genome equivalent is represented by 871 BAC clones with nuclear DNA inserts and each BAC insert is 122 kb, we estimate that the S. moellendorffii genome is 106 Mbp. This estimate coincides well with the estimated size of the nuclear genome of S. moellendorffii based upon cell flow cytometry.
To our knowledge, this is the first Lycopsida BAC library constructed. A BAC library was recently published for Physcomitrella patens , a moss that has become a popular model system for functional genomics, biochemistry, evolutionary and developmental genetic studies (reviewed by ). As a representative of the early branching vascular plants, the S. moellendorffii library described here will link genomic resources from algae (Chlamydomonas reinhardtii) and moss (P. patens) to seed plants, including important crop species. The library also will be a useful resource for readily identifying genes that are involved in developmental, physiological and biochemical processes in the lycophytes and provide an important tool for the study of plant evolution. The nuclear genome of S. moellendorffi is currently being sequenced by the Department of Energy Joint Genome Institute using a shotgun sequencing approach . The BAC library described here is currently available to the scientific community through the Arizona Genomics Institute .
We have shown that the lycophyte S. moellendorffii has a very small genome size, as small or smaller than that of Arabidopsis thaliana based on cell flow cytometry, and have constructed from this species a large insert BAC library that contains about 10 genome equivalents and has an average insert size of 122 kb.
Selaginella moellendorffii plants were obtained from Plant Delights Nursery, Inc., Raleigh, NC. Huperzia lucidula plants were obtained from Carolina Biological Supply Company (Burlington, NC; referred to there as Lycopodium lucidulum). Diphaiastrum digita plants were obtained from Gar Rothwell (Ohio University, Athens, OH). Isoetes engelmannii plants were obtained from Gerald Gastony (Indiana University, Bloomington, IN). Once obtained, all plants were grown in a local greenhouse under 50% shade cloth.
Nuclear DNA content determination
The procedure used to analyze nuclear DNA content in plant cells was modified from Arumuganathan and Earle . Glycine max, Oryza sativa cv Nipponbare or Arabidopsis thaliana or chicken red blood cell nuclei were used as internal standards. For flow cytometric analysis, 50 mg of fresh leaf tissue was placed on ice in a sterile 35 × 10 mm plastic petri dish. The tissue was sliced into 0.25 mm to 1 mm segments in a solution containing 10 mM MgSO4, 50 mM KCl, 5 mM Hepes, pH 8.0, 3 mM dithiothreitol, 0.1 mg ml-1 propidium iodide, 1.5 mg ml-1 DNase free RNase (Rhoche, Indianapolis, IN) and 0.25% (v/v) Triton X-100. The suspended nuclei were filtered through 30 μm nylon mesh and incubated at 37 C for 30 min before flow cytometric analysis. Suspensions of sample nuclei were each spiked with a suspension of standard nuclei (prepared in above solution) and analyzed with a FAC calibur flow cytometer (Becton-Dickinson, San Jose, CA). For each measurement, the propidium iodide fluorescence area signals (FL2-A) from 1000 nuclei were collected and analyzed by CellQuest software (Becton-Dickinson, San Jose, CA) on a Macintosh computer. The mean position of the G0/G1 (Nuclei) peak of the sample and the internal standard were determined by CellQuest software. The mean nuclear DNA content of each plant sample, measured in pg, was based on 1000 scanned nuclei.
BAC library construction
The growing tips (1 cm) of plants were harvested and flash frozen in liquid nitrogen prior to nuclei preparation. Purified nuclei were prepared according to Luo and Wing . The embedding of nuclei, Hind III restriction enzyme digestion of DNA and the preparation of high molecular weight DNA fragments were performed according to Luo and Wing . The Hind III cloning-ready single copy pIndigoBAC536 vector was prepared from the high copy pCUGIBAC1 plasmid as described by Luo et al. . High molecular weight genomic DNA fragments were ligated to the vector and transformed into E. coli stain DH10B T1 phage-resistant cells (Invitrogen, Carlsbad, CA). Transformed colonies were picked and transferred into individual wells of 384 microtiter plates, grown and then stored at -80C. The BAC library was gridded onto 11.25 × 22.5 cm filters in high density, double spots and 4 × 4 patterns with a Genetix QB (Genetix, UK). To characterize the BAC inserts, BAC DNA samples were prepared with a Tomtec Quadra 96 model 320 (Tomtec, Hamden, CT) in a 96-well format, digested with Not I, separated on 1% agarose CHEF (Bio-Rad, Hercules, CA) gels at 5–15 sec linear ramp time, 6 V/cm, 14C in 0.5 × TBE buffer for 16 hours and stained with ethidium bromide.
Genomic DNA for gel blot analysis and PCR was isolated from S. moellendorffii plants using the Nucleon Phytopure kit (Amersham Biosciences, Piscataway, NJ). RNA was isolated using the RNeasy Plant Mini Kit (Qiagen, Valencia, CA); cDNA was synthesized and RT-PCR performed using the cMaster RTplusPCR System (Eppendorf, Westbury, NY). The 479 bp SmGAI cDNA fragment (GenBank accession AY874058) was initially obtained by RT-PCR using the primers 5'cayttyacigciaaycargci3' and 5' tcraaiarigcrctrtartartg3'. The 473 bp SmCYP98 cDNA fragment (GenBank accession AY843208) was obtained by RT-PCR using the primers 5'gtdgcvttcaacaacatwac3' and 5'ccatnccwgchgtgatcat3'. In all cases, y = c or t, r = a or g, d = a, g or t; v = a, c or g; w = a or t; n = a, t, g or c. All PCR products were cloned into the pGEM-T EASY vector (Promega, Madison, WI) and sequenced. Several genes used as probes were generated by PCR using genomic DNA as template. The 302 bp genomic SmSHR fragment (GenBank accession AY877259) was obtained using the primers 5'ggtggacctctcctctcctc3' and 5'atccaggtttgtagcgcttg3', the 254 bp genomic SmZNF fragment (GenBank accession AY877260) was obtained using the primers 5'gaggtcgtctccttgtcacc3' and 5'cggcgaaagtgtttcttgat3', the 395 bp genomic SmChlH fragment (GenBank accession AY877262) was obtained using the primers 5'ggatcgccttcatatccaaa3' and 5'aaactcgcggtcacagtctt3', and the 375 bp genomic SmSNT fragment (GenBank accession AY877261) was obtained using the primers 5'gatccaggccaagatgaaga3' and 5'tgccagtgaccgtgaagtag3'. The SmOTP gene (accession number AY877263) was identified from a S. moellendorffii EST library; the entire insert was used as a probe. The S. moellendorffii chloroplast (GenBank accession AY877264) and mitochondrial (GenBank accession AY877265) fragment sequences were identified from a partially sequenced, small-insert, sheared genomic library. For DNA blot hybridizations, each cloned insert was gel purified and labeled with 32P using the Megaprime DNA Labelling System (Amersham Biosciences, Piscataway, NJ). For DNA gel blots, 4 μg of S. moellendorffii genomic DNA was digested with restriction enzymes, fractionated and alkaline transferred to nylon membranes according to Sambrook and Russell . All filters were hybridized at 65C in a solution containing 0.5 M phosphate buffer (pH 7.2), 7% (w/v) SDS, and 1 mM EDTA. All membranes were washed under stringent conditions (0.1X SSC, 0.1% SDS, 65C).
Kenrick P, Crane PR: The origin and early diversification of plants on land. Nature. 1997, 389: 33-39. 10.1038/37918.
Stewart W, Rothwell GW: Paleobotany and the Evolution of Plants. 1993, Cambridge, Cambridge University Press, 2nd.
Pryer KM, Schneider H, Smith AR, Cranfill R, Wolf PG, Hunt JS, Sipes SD: Horsetails and ferns are a monophyletic group and the closest living relatives to seed plants. Nature. 2001, 409: 618-622. 10.1038/35054555.
Arabdiopsis Genome Initiative: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2000, 408: 796-815. 10.1038/35048692.
Goff SA, Ricke D, Lan TH, Presting G, Wang R, Dunn M, Glazebrook J, Sessions A, Oeller P, Varma H, Hadley D, Hutchison D, Martin C, Katagiri F, Lange BM, Moughamer T, Xia Y, Budworth P, Zhong J, Miguel T, Paszkowski U, Zhang S, Colbert M, Sun WL, Chen L, Cooper B, Park S, Wood TC, Mao L, Quail P, Wing R, Dean R, Yu Y, Zharkikh A, Shen R, Sahasrabudhe S, Thomas A, Cannings R, Gutin A, Pruss D, Reid J, Tavtigian S, Mitchell J, Eldredge G, Scholl T, Miller RM, Bhatnagar S, Adey N, Rubano T, Tusneem N, Robinson R, Feldhaus J, Macalma T, Oliphant A, Briggs S: A draft sequence of the rice genome (Oryza sativa L. ssp. japonica). Science. 2002, 296: 92-100. 10.1126/science.1068275.
Yu J, Hu S, Wang J, Wong GK, Li S, Liu B, Deng Y, Dai L, Zhou Y, Zhang X, Cao M, Liu J, Sun J, Tang J, Chen Y, Huang X, Lin W, Ye C, Tong W, Cong L, Geng J, Han Y, Li L, Li W, Hu G, Li J, Liu Z, Qi Q, Li T, Wang X, Lu H, Wu T, Zhu M, Ni P, Han H, Dong W, Ren X, Feng X, Cui P, Li X, Wang H, Xu X, Zhai W, Xu Z, Zhang J, He S, Xu J, Zhang K, Zheng X, Dong J, Zeng W, Tao L, Ye J, Tan J, Chen X, He J, Liu D, Tian W, Tian C, Xia H, Bao Q, Li G, Gao H, Cao T, Zhao W, Li P, Chen W, Zhang Y, Hu J, Liu S, Yang J, Zhang G, Xiong Y, Li Z, Mao L, Zhou C, Zhu Z, Chen R, Hao B, Zheng W, Chen S, Guo W, Tao M, Zhu L, Yuan L, Yang H: A draft sequence of the rice genome (Oryza sativa L. ssp. indica). Science. 2002, 296: 79-92. 10.1126/science.1068037.
JGI Populus trichocarpa v1.0. [http://genome.jgi-psf.org/Poptr1/Poptr1.home.html].
JGI Chlamydomonas reinhardtii v2.0. 2005
Qui YL, Palmer JD: Phylogeny of early land plants: insights from genes and genomes. Trends in Plant Science. 1999, 4: 26-30. 10.1016/S1360-1385(98)01361-2.
Obermayer R, Leitch IJ, Hanson L, Bennett MD: Nuclear DNA C-values in 30 species double the familial representation in pteridophytes. Ann Bot (Lond). 2002, 90: 209-217. 10.1093/aob/mcf167.
Luo M, Wing R: An improved method for plant BAC library construction. Methods in Molecular Biology. 2003, 236: 3-20.
Zhang X, Choi S, Woo S, Li Z, RA W: Construction and characterization of two rice bacterial artificial chromosome libraries from the parents of a permanent recombinant inbred mapping population. Molecular Breeding. 1996, 2: 11-24.
Liang C, Xi Y, Shu J, Li J, Yang J, Che K, Jin D, Liu X, Weng M, He Y, Wang B: Construction of a BAC library of Physcomitrella patens and isolation of a LEA gene. Plant Science. 2004, 167: 491-498. 10.1016/j.plantsci.2004.04.015.
Schaefer DG, Zryd JP: The moss Physcomitrella patens, now and then. Plant Physiology. 2001, 127: 1430-1438. 10.1104/pp.127.4.1430.
JGI Community Sequencing Program Sequencing Plans for 2005. [http://www.jgi.doe.gov/sequencing/cspseqplans.html].
Arizona Genomics Institute Ordering Website. [http://www.genome.arizona.edu/orders/direct.html?library=SM__Ba].
Arumuganathan E, Earle ED: Estimation of nuclear DNA content of plants by flow cytometry. Plant Molecular Biology Reporter. 1991, 9: 229-233.
Luo M, Wang Y, Frisch D, Joobeur T, Wing RA, Dean RA: Melon bacterial artificial chromosome (BAC) library construction using improved methods and identification of clones linked to the locus conferring resistance to melon Fusarium Wilt. Genome. 2001, 44: 154-162. 10.1139/gen-44-2-154.
Sambrook J, Russell DW: Molecular cloning : a laboratory manual. 2001, Cold Spring Harbor, N.Y, Cold Spring Harbor Laboratory Press, 1: 3
Galbraith DW, Harkins KR, Maddox JM, Ayres NM, Sharma DP, Firoozabady E: Rapid flow cytometric analysis of the cell cycle in intact plant tissues. Science. 1983, 220: 1049-1051.
Bennett MD, Leitch IJ: Nuclear DNA amounts in angiosperms - 583 new estimates. Annals of Botany. 1997, 8: 169-196. 10.1006/anbo.1997.0415.
Bennett MD, Leitch IJ, Price HJ, Johnston JS: Comparisons with Caenorhabditis (approximately 100 Mb) and Drosophila (approximately 175 Mb) using flow cytometry show genome size in Arabidopsis to be approximately 157 Mb and thus approximately 25% larger than the Arabidopsis genome initiative estimate of approximately 125 Mb. Annals of Botany. 2003, 91: 547-557. 10.1093/aob/mcg057.
Bennett MD, Smith JB: Nuclear DNA amounts in angiosperms. Philosophical Transactions of the Royal Society of London B. 1991, 334: 309-345.
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ: Basic local alignment search tool. Journal of Molecular Biology. 1990, 215: 403-410. 10.1006/jmbi.1990.9999.
We thank Kiran Rao, Cari Soderlund, Angelina Angelova, Amber Hopf and Luke Gumaelius for their assistance in developing the BAC library resource. This work was supported by the Purdue Agricultural Research Programs and National Science Foundation grant numbers 0207110 (Banks), 0211611 (Wing), and 0208502 (Mandoli). This is manuscript number 2005-17581 of the Purdue Agricultural Research Program.
JAB, JC, CC, CD, DM, JT and RAW participated in the design of this study. KA performed the cell flow cytometry genome size determination. ML and WW constructed the BAC library and DK, CM, HK, NS, ML, MT, WW and JW participated in library characterization. All authors have read and approved of this study.
About this article
Cite this article
Wang, W., Tanurdzic, M., Luo, M. et al. Construction of a bacterial artificial chromosome library from the spikemoss Selaginella moellendorffii: a new resource for plant comparative genomics. BMC Plant Biol 5, 10 (2005). https://doi.org/10.1186/1471-2229-5-10