MoccaDB - an integrative database for functional, comparative and diversity studies in the Rubiaceae family
BMC Plant Biology volume 9, Article number: 123 (2009)
In the past few years, functional genomics information has been rapidly accumulating on Rubiaceae species, especially those belonging to the Coffea genus (coffee trees). A growing body of expressed sequence tag (EST) data and of EST- or genomic-derived microsatellite markers has been generated, together with Conserved Ortholog Set (COS) markers. This considerably facilitates comparative genomics and map-based genetic studies through the common use of orthologous loci across different species. Similar genomic information is available for other species such as tomato and potato, members of the Solanaceae family. Since both Rubiaceae and Solanaceae belong to the Euasterids I (lamiids), integrating information on genetic markers would be possible and would lead to more efficient analyses and to the discovery of key loci involved in important traits such as fruit development, quality, and maturation, or adaptation. Our goal was to develop a comprehensive web data source for integrated information on validated orthologous markers in Rubiaceae.
MoccaDB is an online MySQL-PHP driven relational database that houses annotated and/or mapped microsatellite markers in Rubiaceae. In its current release, the database stores 638 markers that have been defined on 259 ESTs and 379 genomic sequences. Marker information was retrieved from 11 published works and completed with original data on 132 microsatellite markers validated in our laboratory. DNA sequences were derived from three Coffea species/hybrids. Microsatellite markers were checked for similarity and tested in vitro for cross-amplification and diversity/polymorphism status in up to 38 Rubiaceae species belonging to the Cinchonoideae and Rubioideae subfamilies. Functional annotation was provided, and some markers associated with described metabolic pathways were also integrated. Users can search the database for marker, sequence, map or diversity information through multi-option query forms. The retrieved data can be browsed and downloaded, along with the protocols used, using a standard web browser. MoccaDB also integrates bioinformatics tools (CMap viewer and local BLAST) and hyperlinks to related external data sources (NCBI GenBank and PubMed, SOL Genomics Network database).
We believe that MoccaDB will be extremely useful for all researchers working in the areas of comparative and functional genomics and molecular evolution, in general, and population analysis and association mapping of Rubiaceae and Solanaceae species, in particular.
The accumulation of available genetic markers directly contributes to advances in marker-assisted genetic studies with a wide range of applications, such as the detection and identification of individual genes and/or quantitative trait loci (QTL), or the exploration of genetic diversity and population structure with regard to natural variation [1–3]. The recent and rapid accumulation of sequence resources, mainly from crop species, strengthens genetic approaches when they are combined with comparative genomics. Extending these genome resources to close relatives as well as to more distant genera greatly facilitates the elucidation of evolutionary histories, which involves the discovery and study of key orthologous loci, phylogeny reconstruction and a variety of other biological questions.
The Rubiaceae family is the fourth largest family of flowering plants but, except for rare species such as Kadua centranthoides Hook. & Arn. [as Hedyotis centranthoides] and Kadua affinis Cham. & Schltdl. [as Hedyotis terminalis] (Levesque MP, Twigg RW, Motley T, Katari MS, Dedhia NN, O'Shaughnessy AL, Balija V, Martienssen RA, McCombie RW, Benfey P et al: Expressed tag sequences from Hedyotis centranthoides and Hedyotis terminalis flowers - Stage 2 (NYBG), accessions available from http://www.ncbi.nlm.nih.gov 2003), most of the genomic information has been generated from the major economic crop species of the Coffea genus, cultivated throughout the tropics: C. arabica L. and C. canephora Pierre ex A.Froehner, the Arabica and Robusta coffee trees, respectively. They are thus used as molecular models for the Rubiaceae. The genomic and genetic knowledge acquired for these plants can be extended to other Coffea species and also to other economically important Rubiaceae genera used in medicine (e.g. Cinchona, which produces quinine, used as a cure for malaria) and in horticulture (e.g. many genera, including Gardenia, Ixora, Pentas, Mussaenda and Sherardia, are well-known ornamentals).
Among PCR-amplified markers, microsatellite (or simple sequence repeat, SSR) markers are commonly used in large-scale genomic studies owing to their ubiquitous distribution in both protein-coding and non-coding regions and their high degree of length polymorphism among individuals. The C. canephora microsatellites were screened in a leaf and fruit EST database and in a C. canephora BAC sequence. The overall SSR density has been estimated at one SSR every 7.73 kb in the ESTs and one SSR every 4.1 kb in the genomic sequences [2, 6]. However, although microsatellites are distributed ubiquitously throughout the Coffea genome, only a few of them are suitable for designing informative markers, that is, markers giving a strong and specific amplified fragment after PCR, easy scoring of allele sizes, high heterozygosity and/or a known position on a linkage map.
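As an illustration of how SSR screening and density figures of this kind can be obtained, the following minimal Python sketch detects perfect di-, tri- and tetranucleotide repeats with a regular expression and converts a count into an average spacing in kb. The repeat-count thresholds and the function names are assumptions made for this example; they are not the criteria or the software used in the cited studies.

```python
import re

# Minimum repeat counts per motif length. These thresholds are illustrative
# assumptions, not the criteria used in the studies cited above.
MIN_REPEATS = {2: 6, 3: 5, 4: 4}


def is_compound(motif):
    """True if the motif is itself a repeat of a shorter unit (e.g. ATAT, AAA)."""
    size = len(motif)
    return any(
        size % k == 0 and motif == motif[:k] * (size // k)
        for k in range(1, size)
    )


def find_ssrs(seq, min_repeats=MIN_REPEATS):
    """Return (start, motif, repeat_count) for each perfect SSR in seq."""
    seq = seq.upper()
    hits = []
    for size, min_n in sorted(min_repeats.items()):
        # One motif of `size` bases followed by at least min_n - 1 more copies.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (size, min_n - 1))
        for match in pattern.finditer(seq):
            motif = match.group(1)
            if not is_compound(motif):
                hits.append((match.start(), motif, len(match.group(0)) // size))
    return hits


def kb_per_ssr(n_ssrs, total_bp):
    """Average spacing in kb, so 7.73 means one SSR every 7.73 kb."""
    return total_bp / 1000.0 / n_ssrs
```

Motifs that are themselves repeats of a shorter unit are skipped so that, for example, an (AT)n tract is not reported a second time as (ATAT)n.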
Functional genomics is particularly promising for identifying genes involved in a variety of biological functions, which include pathways related to the coffee beverage quality such as synthesis of caffeine, sugars, lipids and chlorogenic acids, but also those related to fruit development. The use of markers directly targeting expressed genes important for each specific trait would be beneficial to these studies. Due to the ongoing sequencing of expressed genes from different plant organs, it is now possible to develop EST-SSR markers for important traits, like fruit properties.
Previous publications [1, 2, 7, 8] and the present study have revealed that coffee EST-SSR and SSR markers show a high level of transferability across distantly related species, thereby providing additional markers for orphan Rubiaceae species.
Although the genomic data available on coffee plants are rapidly increasing, they are often isolated, scattered and rarely available online. In the present study, an effort has been made to create centralized access to both published and original new data on evolutionarily conserved and validated markers. An integrated, comprehensive information system and bioinformatics tools are provided, which will be useful for the research community working on the genetics and evolution of plants related to coffee trees.
Construction and content
The data retrieval and compilation for MoccaDB has involved the following steps: (1) extraction of data from various sources (publications, public databases etc.); (2) development and testing of additional new markers in Rubiaceae species; (3) compilation, elimination of marker redundancy, BLAST annotation; (4) insertion into the database.
Marker and sequence source
The current version of MoccaDB provides information regarding Coffea EST and genomic SSR markers retrieved from 11 published studies as well as original data (Table 1). The database stores 638 markers, defined on 259 ESTs and 379 genomic sequences.
Complete information on the origin of the data was reported such as laboratory, DNA library description, and, finally, reference of the published work. Polymerase chain reaction (PCR) primers, amplification conditions, and expected product sizes were directly retrieved from the publications, when available.
For most of the markers, nucleotide sequences were downloaded from the GenBank databases (http://www.ncbi.nlm.nih.gov) and stored in the database.
A unique set of markers
Most of the retrieved markers had been declared by their authors as designed on unigenes or, at least, on non-redundant DNA sequences. Nevertheless, to identify any redundancy due to the multiple origin of the data, all DNA sequences were checked for homology using the DNASTAR software package (Lasergene, Madison, WI, USA). The markers designed on sequences having a similarity percentage >90% were defined as "similar markers" in the database.
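The redundancy check described above can be sketched in Python as follows. This is only an approximation under stated assumptions: the similarity score comes from the standard-library `difflib.SequenceMatcher`, which is not an alignment-based identity and is not the DNASTAR procedure used for MoccaDB, and the marker names are hypothetical.

```python
from difflib import SequenceMatcher
from itertools import combinations


def similar_marker_pairs(sequences, threshold=0.90):
    """Return marker-name pairs whose source sequences exceed the threshold.

    `sequences` maps a marker name to the DNA sequence it was designed on.
    The score is SequenceMatcher.ratio(), a stand-in for a real alignment.
    """
    pairs = []
    for (name_a, seq_a), (name_b, seq_b) in combinations(sorted(sequences.items()), 2):
        # autojunk=False keeps frequent bases from being ignored as "junk".
        ratio = SequenceMatcher(None, seq_a, seq_b, autojunk=False).ratio()
        if ratio > threshold:
            pairs.append((name_a, name_b))
    return pairs
```

Comparing all pairs is quadratic in the number of sequences, which remains practical for a few hundred markers; a larger collection would call for a clustering or BLAST-based approach instead.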
Markers stored in the database are provided with a general SSR description: repeat motif and number, corresponding amino acid repeat if any, and, if known, SSR position on the sequence (coding region or UTR).
Markers associated with experimentally described metabolic pathways (e.g. sucrose metabolism during coffee fruit development) were integrated. Putative functions were predicted for all DNA sequences through similarity searches using BLASTx against the GenBank protein databases (http://www.ncbi.nlm.nih.gov).
Maps, transferability and diversity
The high transferability of SSR markers at evolutionarily conserved (orthologous) loci within the Coffea genus has been previously reported by different authors. For example, the percentage of transferability of SSR markers developed on C. arabica genomic DNA ranged from 72.7% for C. liberica Hiern to 86.4% for C. pseudozanguebariae Bridson.
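A transferability percentage of this kind is simply the share of tested markers that amplify in a target species. The short Python sketch below computes it per species from a marker-by-species table of amplification results; the data layout and function name are assumptions for illustration, and any values passed to it in an example are invented rather than taken from the cited studies.

```python
def transferability_by_species(results):
    """Percentage of tested markers that amplified, per target species.

    `results` maps marker name -> {species: True/False}; a species missing
    from a marker's entry means that marker was not tested on it.
    """
    tested, amplified = {}, {}
    for per_species in results.values():
        for species, ok in per_species.items():
            tested[species] = tested.get(species, 0) + 1
            amplified[species] = amplified.get(species, 0) + (1 if ok else 0)
    return {sp: 100.0 * amplified[sp] / tested[sp] for sp in tested}
```

Counting only the markers actually tested on each species keeps the denominators honest when the panel differs from one marker to the next.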
A total of 99 EST-SSR markers, both previously published and newly designed (Table 1), were tested for amplification on a panel consisting of up to 21 Rubiaceae species belonging to the Cinchonoideae and Rubioideae subfamilies (Table 1). A new set of EST-SSR markers, provided by Crouzillat et al. (Table 1), was also tested on the following Coffea species: C. canephora, C. heterocalyx Stoff., and C. pseudozanguebariae. Only those showing good and specific PCR amplification with easy scoring of allele sizes were retained.
Both for markers retrieved from publications and for those designed and/or tested in this study, a maximum of available transferability-associated information was stored in the database: transferability status, amplification quality, information on the polymorphism (number and sizes of alleles within a given species, polymorphism information content (PIC) value).
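The text does not state which formula was used for the PIC value. Assuming the commonly used definition of Botstein et al. (1980), PIC = 1 - sum(p_i^2) - sum over i<j of 2 p_i^2 p_j^2, where p_i is the frequency of allele i, a minimal Python sketch could look like this (function names are hypothetical):

```python
from collections import Counter
from itertools import combinations


def allele_frequencies(allele_sizes):
    """Turn a list of scored allele sizes (bp) into {size: frequency}."""
    counts = Counter(allele_sizes)
    total = sum(counts.values())
    return {size: n / total for size, n in counts.items()}


def pic(frequencies):
    """Polymorphism information content for one marker.

    Assumes the Botstein et al. (1980) definition; `frequencies` is an
    iterable of allele frequencies that sum to 1.
    """
    p = list(frequencies)
    homozygosity = sum(x * x for x in p)
    correction = sum(2 * (a * a) * (b * b) for a, b in combinations(p, 2))
    return 1.0 - homozygosity - correction
```

For a marker with two equally frequent alleles this gives 1 - 0.5 - 0.125 = 0.375, and a monomorphic marker gives 0.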
Database and Web application
MoccaDB has been designed for simple and efficient information search and retrieval. It is currently housed on a Linux Red Hat Enterprise server but is generally platform-independent. The database design has been carried out using the Unified Modeling Language (UML). MoccaDB is composed of two major components: a relational database created using open-source MySQL 5.0 and a PHP web application that communicates with the database. The web interface runs on the Apache 2 Web server. The PHP scripts dynamically execute complex SQL queries to retrieve data from the database according to user criteria and display them as standard HTML output using CSS style sheets. MoccaDB also integrates bioinformatics tools such as BLAST and CMap. For an overview of the MoccaDB structure and interaction with the bioinformatics tools and external data sources, see Fig. 1.
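The actual MoccaDB schema is not reproduced in this article. To make the relational design concrete, the following Python sketch builds a deliberately reduced, hypothetical schema (sequences, markers designed on them, and per-species transferability results) and runs one join query. It uses the standard-library `sqlite3` module instead of MySQL so that it runs without a server; all table names, column names and rows are invented placeholders.

```python
import sqlite3

# Hypothetical, reduced schema; not the real MoccaDB tables.
SCHEMA = """
CREATE TABLE sequence (
    seq_id     INTEGER PRIMARY KEY,
    accession  TEXT UNIQUE,
    species    TEXT NOT NULL,
    origin     TEXT NOT NULL CHECK (origin IN ('EST', 'genomic')),
    annotation TEXT
);
CREATE TABLE marker (
    marker_id  INTEGER PRIMARY KEY,
    name       TEXT UNIQUE NOT NULL,
    seq_id     INTEGER NOT NULL REFERENCES sequence(seq_id),
    motif      TEXT,
    repeats    INTEGER
);
CREATE TABLE transferability (
    marker_id  INTEGER NOT NULL REFERENCES marker(marker_id),
    species    TEXT NOT NULL,
    amplified  INTEGER NOT NULL,
    n_alleles  INTEGER,
    PRIMARY KEY (marker_id, species)
);
"""


def build_demo_db():
    """Create an in-memory database filled with placeholder rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(SCHEMA)
    conn.execute(
        "INSERT INTO sequence VALUES (1, 'DEMO0001', 'species A', 'EST', 'putative transferase')"
    )
    conn.execute("INSERT INTO marker VALUES (1, 'DemoSSR01', 1, 'AT', 8)")
    conn.executemany(
        "INSERT INTO transferability VALUES (?, ?, ?, ?)",
        [(1, "species B", 1, 2), (1, "species C", 1, 3), (1, "species D", 0, None)],
    )
    conn.commit()
    return conn


def species_amplified(conn, marker_name):
    """List the species in which a given marker amplified."""
    rows = conn.execute(
        "SELECT t.species FROM marker m "
        "JOIN transferability t ON t.marker_id = m.marker_id "
        "WHERE m.name = ? AND t.amplified = 1 ORDER BY t.species",
        (marker_name,),
    )
    return [row[0] for row in rows]
```

Keeping transferability in its own table, keyed by marker and species, lets one marker carry results for any number of tested species without changing the marker table.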
The database contains mainly public but also some private data. The public data are accessible to anyone connected to the MoccaDB Web site. To access the private data of some scientific projects, or to insert one's own data (markers, DNA sequences or mapping data) in the database, the user must open an account, which is created with the permission of the scientific project manager. Several supplementary Web interfaces have thus been developed to give the user administrative access and to allow data to be fed into the database.
A user-friendly web interface has been developed to facilitate data retrieval according to specific user needs. One can search for markers, DNA sequences, maps and diversity data by using the corresponding multi-option query forms. The data can be viewed at different levels of detail, either as an overview (a list of search results) or as a detailed result page for a selected marker, sequence or map, with information on marker transferability, diversity and mapping. The experimental conditions, sequences and other relevant data are easily downloadable in different formats. Some additional information, such as the construction of DNA libraries or the description of the marker types, can be displayed in pop-up windows. Extensive, mostly bi-directional, hyperlinks are provided between the different data pages, thus facilitating navigation within the web site (Fig. 1).
Synthetic and downloadable information on annotated markers
Through the marker search page, markers can be directly searched by their names but the query can also be filtered by marker type, species and sequence origin, as well as by the availability of experimental data on their transferability and mapping.
The search results are displayed in the form of a table providing general information on each marker. The users can select any number of markers from this table and download them as an Excel file, together with additional optional information such as PCR experimental conditions, original DNA sequence, diversity/transferability or mapping data, depending on their scientific interests and future data utilisation. They can also access the detailed individual marker pages via the hyperlinks associated with each marker.
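A multi-option search form of this kind is typically translated into a SQL statement that contains only the filters the user filled in. The Python sketch below shows one way to assemble such a query with bound parameters; the table and column names (`marker`, `sequence`, `map_position` and so on) are hypothetical and do not describe the real MoccaDB schema or its PHP code.

```python
def build_marker_query(name=None, marker_type=None, species=None,
                       origin=None, mapped_only=False):
    """Return (sql, params) for the filters that were actually supplied.

    Values are passed as bound parameters rather than pasted into the SQL
    string, which avoids SQL injection from form input.
    """
    clauses, params = [], []
    if name:
        clauses.append("m.name LIKE ?")
        params.append("%" + name + "%")
    if marker_type:
        clauses.append("m.marker_type = ?")
        params.append(marker_type)
    if species:
        clauses.append("s.species = ?")
        params.append(species)
    if origin:
        clauses.append("s.origin = ?")
        params.append(origin)
    if mapped_only:
        # Keep only markers with at least one recorded map position.
        clauses.append(
            "EXISTS (SELECT 1 FROM map_position p WHERE p.marker_id = m.marker_id)"
        )
    sql = "SELECT m.name FROM marker m JOIN sequence s ON s.seq_id = m.seq_id"
    if clauses:
        sql += " WHERE " + " AND ".join(clauses)
    return sql + " ORDER BY m.name", params
```

An empty form yields a query without a WHERE clause, that is, the full marker list.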
A typical individual marker page (Fig. 2) displays detailed information on diverse marker aspects: original sequence information, map location, transferability and/or intra- and inter-diversity, existence of "similar" markers developed on the same locus by other researchers, etc.
The genetically mapped markers can also be searched through the map search page. For each map, linkage groups can be displayed separately or together thanks to the CMap tool. A link associated with each SSR marker on the map brings the user back to the marker data page (Fig. 2).
Functional markers directly targeting the expressed genes
A user can search for sequences used to design the markers through the sequence search page. The query can optionally take into account the sequence name, species origin, sequence or marker type, and, more specifically, its putative function, namely a keyword in the BLAST annotation (e.g. transferase, Fig. 3). The sequence search page is especially useful when searching for "functional" markers linked to a particular metabolic pathway. Among different functions, the database hosts markers associated with sucrose metabolism.
The retrieved sequences are displayed in a summary table with hyperlinks that give further access to sequence or marker data pages (Fig. 3). From this table, sequences selected by the user can be downloaded in a multi-FASTA file to facilitate subsequent external analyses (BLAST search, clustering, etc.).
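For readers unfamiliar with the format, a multi-FASTA file is a plain-text concatenation of records, each made of a header line starting with ">" followed by the sequence wrapped over one or more lines. The minimal Python sketch below writes such a file from selected records; the record layout and the 60-character line width are assumptions for this example, not a description of the MoccaDB export code.

```python
def to_multifasta(records, width=60):
    """Format (name, description, sequence) tuples as multi-FASTA text."""
    lines = []
    for name, description, seq in records:
        header = ">" + name
        if description:
            header += " " + description
        lines.append(header)
        # Wrap the sequence into fixed-width lines.
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"
```

The resulting text can be saved to a file and passed directly to tools that accept FASTA input, such as a local BLAST search.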
These functional markers could also be used in such studies as functional mapping, population analyses or association mapping.
Transferable markers and polymorphism status
Transfer of genomic tools across species boundaries is crucial to assess variation in relevant germplasm and constitutes a unique tool to study orphan related species.
In its current release, MoccaDB already gives access to valuable transferability data. In particular, of the C. canephora and C. arabica markers screened for cross-amplification and polymorphism, a minimum of 83% amplified alleles from any wild Coffea species, independently of its genetic relationship to both cultivated species (Fig. 4). Across the Rubiaceae family, many coffee markers were transferable to wild relatives of the Cinchonoideae subfamily, but only a fraction, maximum 12%, was transferable to distantly related genera in the Rubioideae subfamily (Table 2).
When working on one or more given species, the biologist can thus use the diversity query page to search for markers that amplify in these species and that may reveal inter-specific polymorphism (such as species-specific alleles) or intra-specific polymorphism (through the PIC parameter). Results for the retrieved markers are displayed in the form of a summary table (Fig. 5) with details on marker transferability: species tested, amplification status, polymorphism, amplified allele range. These data will be particularly useful for researchers looking for an optimal polymorphic marker set for genotyping populations of a given species.
If the objective is the selection of markers for refining mapping in an inter-specific cross, or for discriminating two or more species, the user can identify diagnostic markers (i.e. with species-specific allele range) with known genetic map location or not.
A synthetic results table of these data can be obtained and downloaded from the marker search query page (Fig. 5).
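The notion of a diagnostic marker used above (species-specific allele ranges) can be expressed as a simple test: a marker discriminates a set of species if no two of their allele-size ranges overlap. A minimal Python sketch, with a hypothetical data layout and invented allele sizes in any example call, is:

```python
def is_diagnostic(allele_ranges):
    """True if no two species share any part of their allele-size range.

    `allele_ranges` maps species -> (smallest_allele_bp, largest_allele_bp).
    """
    spans = sorted(allele_ranges.values())
    # After sorting by lower bound, checking neighbours is sufficient.
    return all(prev_hi < lo for (_, prev_hi), (lo, _) in zip(spans, spans[1:]))


def diagnostic_markers(markers):
    """Names of markers whose allele ranges separate all listed species."""
    return sorted(name for name, ranges in markers.items() if is_diagnostic(ranges))
```

This treats touching ranges (one species ending at the size where another begins) as overlapping, which is the conservative choice when allele sizes are scored with some uncertainty.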
Bioinformatics tools and external links
CMap and NCBI BLAST 2.0 were integrated into MoccaDB. Any given sequence can be searched for similarities against the MoccaDB sequences or against updated public GenBank Coffea databases: (1) all C. arabica and/or C. canephora sequences; (2) C. arabica and/or C. canephora EST sequences; (3) C. arabica and/or C. canephora Genome Survey Sequences (GSS); (4) C. arabica and/or C. canephora "CoreNucleotide" sequences (EST and GSS sequences not included).
External links connect MoccaDB to the NCBI GenBank and PubMed data, and to the SOL Genomics Network database for some of the sequences developed on C. canephora by Crouzillat et al. (see Table 1).
Conclusion and perspectives
Unlike some existing plant marker databases that contain predicted molecular markers, MoccaDB stores only validated markers, each provided with experimental protocols and related data. Indeed, we intended to centralize information on markers associated with single-copy loci, which can be reproducibly used for genetic analysis within the Coffea genus and related species.
A few open and freely accessible database resources (Trieste, CIRAD) have made some Coffea genetic markers available, but these resources are mostly limited to SSR data generated by their own hosting institute.
MoccaDB includes most of the publicly available data in addition to original data. As compared to the previously released databases, MoccaDB provides greater integrated information and specific features:
Multiple options for data search and retrieval;
Complete description of the markers, going from in vitro PCR amplification conditions, SSR and functional annotation of original DNA sequences and marker location on genetic maps, to cross amplification and diversity data;
Synthetic and downloadable cross-amplification and diversity spreadsheet results to help the user in designing an optimal set of orthologous markers for genotyping or mapping studies in selected species and populations;
Data selected by the user can be easily downloaded and used in laboratory experiments (PCR conditions, expected sizes, etc.) or for further analysis such as BLAST similarity searches of SSR-associated sequences (sequences provided in FASTA format, etc.);
Access is provided to integrated bioinformatics tools (CMap, BLAST), as well as to external hyperlinks to various public data sources (NCBI GenBank and PubMed, SOL Genomics Network).
In MoccaDB, a large amount of information is centralized and freely accessible to all users. A login system exists only for private project access and for data submission. To facilitate data integration, comma-separated values (CSV) submission forms have been defined to allow automated submission of data. More markers will be included in the database as and when they are made publicly available.
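The layout of the MoccaDB submission forms is not given in this article. As a sketch of how a CSV submission might be checked before insertion, the following Python example assumes a hypothetical set of required columns and reports rows with missing values instead of loading them.

```python
import csv
import io

# Hypothetical required columns; the real submission form may differ.
REQUIRED = ("marker_name", "forward_primer", "reverse_primer", "species")


def parse_submission(csv_text):
    """Return (valid_rows, errors) for a CSV submission given as text."""
    reader = csv.DictReader(io.StringIO(csv_text))
    missing = [col for col in REQUIRED if col not in (reader.fieldnames or [])]
    if missing:
        return [], ["missing column(s): " + ", ".join(missing)]
    rows, errors = [], []
    # Data rows start on line 2, after the header line.
    for line_no, row in enumerate(reader, start=2):
        empty = [col for col in REQUIRED if not (row[col] or "").strip()]
        if empty:
            errors.append("line %d: empty %s" % (line_no, ", ".join(empty)))
        else:
            rows.append({col: row[col].strip() for col in REQUIRED})
    return rows, errors
```

Validating the whole file first and returning the errors together lets a submitter correct every problem in one pass rather than one rejected row at a time.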
The database currently houses SSR markers from both genic and non-genic regions of the genome. Markers whose polymorphism is due to single-nucleotide polymorphisms (SNPs), insertions/deletions (indels) or transposable elements are in the process of being developed and will be stored in MoccaDB in the near future.
Coffee has increasingly rich genetic and genomic resources, including expressed sequence tags (ESTs) and bacterial artificial chromosome (BAC) libraries [6, 19]. Whole genome sequencing, genetic, physical and comparative maps are being developed. MoccaDB will be extended to include new data types, as well as links to cytological maps and morphological data.
Systematic efforts have been initiated to generate PCR-based comparative genetic maps in several clades of plants, particularly in Solanaceae using Conserved Ortholog Set (COS) markers. Data obtained in this family could be of benefit for wide comparative genomics studies, including those of Rubiaceae species.
Availability and requirements
The database is open and freely available
Project name: MoccaDB
Project home page: http://moccadb.mpl.ird.fr/
Operating system: Linux but functions also on Windows
Other requirements: none
License: None required
References
1. Poncet V, Dufour M, Hamon P, Hamon S, de Kochko A, Leroy T: Development of genomic microsatellite markers in Coffea canephora and their transferability to other coffee species. Genome. 2007, 50 (12): 1156-1161. 10.1139/G07-073.
2. Poncet V, Rondeau M, Tranchant C, Cayrel A, Hamon S, de Kochko A, Hamon P: SSR mining in coffee tree EST databases: potential use of EST-SSRs as markers for the Coffea genus. Mol Genet Genomics. 2006, 276 (5): 436-449. 10.1007/s00438-006-0153-5.
3. Varshney RK, Graner A, Sorrells ME: Genic microsatellite markers in plants: features and applications. Trends Biotechnol. 2005, 23 (1): 48-55. 10.1016/j.tibtech.2004.11.005.
4. Davis A, Bridson D: Rubiaceae. Flowering Plant Families of the World. Edited by: Heywood V, Brummitt R, Culham A, Seberg O. 2007, RBG Kew.
5. Sharma PC, Grover A, Kahl G: Mining microsatellites in eukaryotic genomes. Trends Biotechnol. 2007, 25 (11): 490-498. 10.1016/j.tibtech.2007.07.013.
6. Guyot R, de la Mare M, Viader V, Hamon P, Coriton O, Bustamante-Porras J, Poncet V, Campa C, Hamon S, de Kochko A: Microcollinearity in an ethylene receptor coding gene region of the Coffea canephora genome is extensively conserved with Vitis vinifera and other distant dicotyledonous sequenced genomes. BMC Plant Biol. 2009, 9 (1): 22. 10.1186/1471-2229-9-22.
7. Aggarwal RK, Hendre PS, Varshney RK, Bhat PR, Krishnakumar V, Singh L: Identification, characterization and utilization of EST-derived genic microsatellite markers for genome analyses of coffee and related species. Theor Appl Genet. 2007, 114 (2): 359-372. 10.1007/s00122-006-0440-x.
8. Cubry P, Musoli P, Legnate H, Pot D, de Bellis F, Poncet V, Anthony F, Dufour M, Leroy T: Diversity in coffee using SSR markers: structure of the Coffea genus and perspectives for breeding. Genome. 2008, 51 (1): 50-63. 10.1139/G07-096.
9. Geromel C, Ferreira LP, Guerreiro SM, Cavalari AA, Pot D, Pereira LF, Leroy T, Vieira LG, Mazzafera P, Marraccini P: Biochemical and genomic analysis of sucrose metabolism during coffee (Coffea arabica) fruit development. J Exp Bot. 2006, 57 (12): 3243-3258. 10.1093/jxb/erl084.
10. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.
11. Coulibaly I, Revol B, Noirot M, Poncet V, Lorieux M, Carasco-Lacombe C, Minier J, Dufour M, Hamon P: AFLP and SSR polymorphism in a Coffea interspecific backcross progeny [(C. canephora × C. heterocalyx) × C. canephora]. Theor Appl Genet. 2003, 107 (6): 1148-1155. 10.1007/s00122-003-1355-4.
12. Martin NH, Willis JH: Ecological divergence associated with mating system causes nearly complete reproductive isolation between sympatric Mimulus species. Evolution. 2007, 61 (1): 68-82. 10.1111/j.1558-5646.2007.00006.x.
13. Poncet V, Hamon P, Minier J, Carasco-Lacombe C, Hamon S, Noirot M: SSR cross-amplification and variation within coffee trees (Coffea spp.). Genome. 2004, 47 (6): 1071-1081. 10.1139/g04-064.
14. Robbrecht E, Manen J-F: The major evolutionary lineages of the coffee family (Rubiaceae, angiosperms). Syst Geogr Pl. 2006, 76: 85-146.
15. Mueller LA, Solow TH, Taylor N, Skwarecki B, Buels R, Binns J, Lin C, Wright MH, Ahrens R, Wang Y, et al: The SOL Genomics Network. A Comparative Resource for Solanaceae Biology and Beyond. Plant Physiol. 2005, 138 (3): 1310-1317. 10.1104/pp.105.060707.
16. Rudd S, Schoof H, Mayer K: PlantMarkers--a database of predicted molecular markers from plants. Nucleic Acids Res. 2005, 33 (Database issue): D628-632.
17. Coffee DNA. [http://www.coffeedna.net/]
18. Ruiz M, Rouard M, Raboin LM, Lartaud M, Lagoda P, Courtois B: TropGENE-DB, a multi-tropical crop information system. Nucleic Acids Res. 2004, 32 (Database issue): D364-367. 10.1093/nar/gkh105.
19. Leroy T, Marraccini P, Dufour M, Montagnon C, Lashermes P, Sabau X, Ferreira LP, Jourdan I, Pot D, Andrade AC, et al: Construction and characterization of a Coffea canephora BAC library to study the organization of sucrose biosynthesis genes. Theor Appl Genet. 2005, 111 (6): 1032-1041. 10.1007/s00122-005-0018-z.
20. Wu F, Mueller LA, Crouzillat D, Petiard V, Tanksley SD: Combining bioinformatics and phylogenetics to identify large sets of single-copy orthologous genes (COSII) for comparative, evolutionary and systematic studies: a test case in the euasterid plant clade. Genetics. 2006, 174 (3): 1407-1420. 10.1534/genetics.106.062455.
21. Lashermes P, Combes MC, Trouslot P, Charrier A: Phylogenetic relationships of coffee-tree species (Coffea L.) as inferred from ITS sequences of nuclear ribosomal DNA. Theor Appl Genet. 1997, 94 (6-7): 947-955. 10.1007/s001220050500.
22. Stoffelen P, Noirot M, Couturon E, Bontems S, De Block P, Anthony F: Coffea anthonyi, a new self-compatible Central African coffee species, closely related to an ancestor of Coffea arabica. Taxon. 2009, 58 (1): 133-140.
23. Davis AP, Govaerts R, Bridson DM, Stoffelen P: An annotated taxonomic conspectus of the genus Coffea (Rubiaceae). Bot J Linn Soc. 2006, 152 (4): 465-512. 10.1111/j.1095-8339.2006.00584.x.
24. Bhat PR, Krishnakumar V, Hendre PS, Rajendrakumar P, Varshney RK, Aggarwal RK: Identification and characterization of expressed sequence tags-derived simple sequence repeats, markers from robusta coffee variety 'CxR' (an interspecific hybrid of Coffea canephora × Coffea congensis). Mol Ecol Notes. 2005, 5 (1): 80-83. 10.1111/j.1471-8286.2004.00839.x.
25. Combes MC, Andrzejewski S, Anthony F, Bertrand B, Rovelli P, Graziosi G, Lashermes P: Characterization of microsatellite loci in Coffea arabica and related coffee species. Mol Ecol. 2000, 9 (8): 1178-1180. 10.1046/j.1365-294x.2000.00954-5.x.
26. Baruah A, Naik P, Hendre S, Rajkumar R, Rajendrakumar P, Aggarwal RK: Isolation and characterization of nine microsatellite markers from Coffea arabica L., showing wide cross-species amplifications. Mol Ecol Notes. 2003, 3 (4): 647-650. 10.1046/j.1471-8286.2003.00544.x.
27. Moncada P, McCouch S: Simple sequence repeat diversity in diploid and tetraploid Coffea species. Genome. 2004, 47 (3): 501-509. 10.1139/g03-129.
We gratefully acknowledge the financial support from the IRD-SPIRALES-2007 grant funding. The authors thank D. Crouzillat of Nestlé for permitting the integration of Nestlé's primer data into MoccaDB and Y. Pournin and A. Egorov (system administrators) for technical support. The authors also thank D. Pot for valuable comments on the initial project and L. Mueller and R. Guyot for their advice on the manuscript.
The authors declare that they have no competing interests.
OP designed the project, designed and implemented the database, and developed the web interfaces. FB designed the web interface. MC, AT and VV helped in analyzing the published markers and carried out the PCR amplification experiments and the genotyping. PDB identified/supplied plant material of Rubiaceae species from the greenhouses of the National Botanic Garden of Belgium. PDB and PH helped with the cross-amplification experiments and diversity analyses. CC helped in designing the database. AdK secured partial funding from the IRD-SPIRALES Board. AdK and SH coordinated the project. CT managed the project development, assisted in designing the database, performed database system administration, and integrated the bioinformatics tools in the application. VP served as the principal investigator of the project, performed the data analysis, assisted in designing the database, and drafted the manuscript. All authors contributed to the writing of the manuscript and have read and approved the final submitted version.
Cite this article
Plechakova, O., Tranchant-Dubreuil, C., Benedet, F. et al. MoccaDB - an integrative database for functional, comparative and diversity studies in the Rubiaceae family. BMC Plant Biol 9, 123 (2009). https://doi.org/10.1186/1471-2229-9-123