An archived activation tagged population of Arabidopsis thalianato facilitate forward genetics approaches
© Robinson et al; licensee BioMed Central Ltd. 2009
Received: 7 May 2009
Accepted: 31 July 2009
Published: 31 July 2009
Functional genomics tools provide researchers with the ability to apply high-throughput techniques to determine the function and interaction of a diverse range of genes. Mutagenised plant populations are one such resource that facilitate gene characterisation. They allow complex physiological responses to be correlated with the expression of single genes in planta, through either reverse genetics where target genes are mutagenised to assay the affect, or through forward genetics where populations of mutant lines are screened to identify those whose phenotype diverges from wild type for a particular trait. One limitation of these types of populations is the prevalence of gene redundancy within plant genomes, which can mask the affect of individual genes. Activation or enhancer populations, which not only provide knock-out but also dominant activation mutations, can facilitate the study of such genes.
We have developed a population of almost 50,000 activation tagged A. thaliana lines that have been archived as individual lines to the T3 generation. The population is an excellent tool for both reverse and forward genetic screens and has been used successfully to identify a number of novel mutants. Insertion site sequences have been generated and mapped for 15,507 lines to enable further application of the population, while providing a clear distribution of T-DNA insertions across the genome. The population is being screened for a number of biochemical and developmental phenotypes, provisional data identifying novel alleles and genes controlling steps in proanthocyanidin biosynthesis and trichome development is presented.
This publicly available population provides an additional tool for plant researcher's to assist with determining gene function for the many as yet uncharacterised genes annotated within the Arabidopsis genome sequence http://aafc-aac.usask.ca/FST. The presence of enhancer elements on the inserted T-DNA molecule allows both knock-out and dominant activation phenotypes to be identified for traits of interest.
The adoption of Arabidopsis thaliana as a model plant was suggested as early as 1943, yet its prominence in the study of plant genetics and physiology did not emerge until the 1980's with the recognition that its small genome and ease of manipulation offered the opportunity to mutate and study every gene within the genome . The ability to fully realise this objective has been facilitated through the development of an elegantly simple transformation system  and the completion of the genome sequence . The most recent annotation of the genome sequence has identified a total of 33,282 genes comprising 27,235 protein coding genes, 4,759 pseudogenes or transposable elements and 1,288 non coding RNAs (TAIR8 release; http://www.arabidopsis.org). Computational biology tools allow the potential function of almost half of these proteins to be inferred, which provides an enormous resource for hypothesis driven research, while the remaining unknown proteins present an intriguing palette for curious researchers.
The development of tools to elucidate the function of the inferred genes is required in order to exploit the potential wealth of information provided by the annotated genome sequence. Large scale random mutagenesis has been utilised to successfully address the knowledge gap between sequence and function in a number of plant species [4–6] and has been widely applied in A. thaliana . Numerous strategies have been employed to saturate the genome, including exposure to chemical mutagens such as ethyl methanesulphonate (EMS) , transposon tagging , fast neutron deletion  and agrobacterium-mediated T-DNA mutagenesis . While EMS mutagenesis has the advantages of ease of application, non-biased distribution across the genome and generation of subtle phenotypes, its utility has been somewhat limited by the time-consuming map-based cloning required to verify the underlying gene responsible. The use of specific DNA insertional elements, such as transposons and T-DNAs, allows the rapid identification of the point of entry in the genome using PCR based protocols, which have been optimised for high throughput sequencing [11, 12]. The generation of large collections of mutagenised lines and the concurrent sequencing of insertion sites to develop readily searchable databases for these populations has revolutionised gene characterisation by providing 'in silico' access to thousands of mutant alleles.
The Arabidopsis community is fortunate that a number of populations are readily available for reverse genetics applications and can be accessed through The Arabidopsis Information Resource (TAIR: http://www.arabidopsis.org). In total, three publicly available T-DNA flanking-sequence tag (FST) databases provide access to over 200,000 insertion sites; SIGnAL, FLAGdb and GABI-Kat [11, 13, 14], which have been estimated to interrupt the transcription of 80% of the annotated protein coding genes .
Although the utility of T-DNA mutagenesis has been enhanced through the use of vectors that can facilitate gene, enhancer or promoter trapping , there is an inherent limitation to simple insertional mutagenesis due to functional redundancy within the genome. Approximately 17% of A. thaliana genes are found in direct tandem repeats and 58% of the genome is thought to be duplicated, providing the plant with the ability to compensate for many null mutations . The development of vectors which can generate gain-of-function as well as loss-of-function alleles, so called activation tagging, has led to the discovery of a number of novel alleles controlling important functions in plant development, metabolism and stress responses . Activation tagging exploits a tetrameric repeat of the enhancer element of the cauliflower mosaic virus (CaMV) 35S gene to direct the transcription of adjacent genes generating dominant phenotypes . Although a number of resources have been developed for A. thaliana using this strategy [18, 19], access to these lines is generally via pooled seed samples or through databases of predetermined visual phenotypes (http://www.arabidopsis.org; http://amber.gsc.riken.jp/act/). In addition, Ulker et al (2008)  recently observed unanticipated activation and anomalous expression events in what would traditionally be considered knock-out populations suggesting that such populations may harbour novel phenotypes.
This study describes the generation of an archived activation tagged T-DNA A. thaliana (ecotype Columbia) population derived from almost 50,000 individual T1 lines, where to date at least 19,000 flanking sequence tags (FSTs) have been identified to facilitate reverse and forward genetics applications http://aafc-aac.usask.ca/FST. The distribution of the integration events in the genome was investigated and found to be closely correlated with gene density and not with recombination frequency although a reduction in frequency was observed across all datasets in centromeric regions. The analyses identified the presence of novel alleles, multiple insertions sites, complex Ti plasmid integrations and the somewhat unexpected assimilation of agrobacterium sequences into the genome. The utility of the described population for identifying new mutations controlling a number of physiological traits is being explored and preliminary phenotypes are presented for trichome development and proanthocyanidin metabolism.
Generation of the SK Population
An A. thaliana T-DNA mutagenised population, named SK, was developed and archived as T2 seed derived from 49,160 individual herbicide resistant T1 lines with a T-DNA transformation efficiency estimated to be ~0.05%. Single seed descent with continued selection was employed to generate a population of 44,383 T3 families that will be enriched for homozygous mutant genotypes.
The number of independent insertion events per line was estimated initially by assessing the segregation ratio for herbicide resistance scored in the progeny from 100 T1 plants. This resulted in an estimate of 1.35 insertion loci/line suggesting the entire population may contain ~70,000 independent T-DNA integration events. However, Southern analysis of 102 lines suggested a greater number of actual integration events (3.1 T-DNA insertions/line) with a high percentage (~82%) of the insertion alleles being the result of complex T-DNA integrations events (data not shown). This was later confirmed through sequence analysis of the DNA flanking the T-DNA left border (see below), which is in contrast with the lower frequency of T-DNA integration reported in previously characterised populations [11, 21].
Genomic Distribution of Flanking Sequence Tags (FSTs)
TAIL-PCR was employed as a relatively efficient high-throughput strategy to amplify the sequence flanking the T-DNA insertion events (FST) present in the SK mutagenised population . The genetic origin of 16,428 FST sequences derived from DNA flanking the left border of stably inherited T-DNA molecules was determined by analysing the sequence from amplification products generated from 28,908 individual T2lines. Additional sequencing is on-going to characterise further SK lines.
Position and number of SK FST Integrations in the A. thaliana genome.
Nearing a mutation saturated Arabidopsis thalianagenome
Summary of the publicly available A. thaliana T-DNA insertion events.
No. of FST's b
FSTs in genes including promoters
FSTs in transcribed regions
FSTs in exons
11,219 c (8,324)d
42,788 (17, 230)
83,226 (23,556) e
Characterisation of the A. thalianagenes without insertions
There remain 6,004 A. thaliana genes with no identified T-DNA insertion event when all available populations are considered. After removing 1,550 annotated gene codes that were less than 200 bp in length (largely consisting of tRNAs, microRNAs, and retrotransposons), a number of basic characteristics were assessed for each of the remaining genes. These included gene expression level from carpel tissue, position relative to the centromere, annotated length, and gene copy number (Additional file 2).
A significant bias in gene length was observed with the median length for genes with and without an insert being 2,418 bp and 1,132 bp, respectively (z <-100, p < 0.0001). The distributions of gene expression levels for genes with and without insertions were also distinct (z = -21.99, p < 0.0001). The median absolute expression level was seven-fold lower for those genes without an insertion compared to those having a T-DNA integration event. This observation correlated with the position of the genes relative to the centromere, where gene expression is repressed, since those genes lacking an insertion event were found to be demonstrably closer to the centromeric region (z = -30.76, p < 0.0001). Similarly, pseudogenes that are generally not expressed or expressed at low levels were three-fold over-represented among the gene annotations for gene codes with no observed T-DNA integration.
Identification of complex T-DNA and non-Ti plasmid integration
SK FST data handling and visualisation
Forward Genetic Screens reveal novel mutations
Aberrant morphological variation was observed in individual lines throughout the generation of the SK population and a number of these were confirmed as alleles of previously characterised mutations through the mapping of the FSTs. Some examples of these included mutations in APETALA1 (At1g69120; SK295), LEAFY (At5g61850; SK14914), and CABBAGE (At5g05690; SK4745). In addition to loss-of-function alleles, gain-of-function mutants should also be discovered since the SK population was developed using a vector carrying multiple enhancer elements. Activation of genes adjacent to the insertion site was confirmed for at least two phenotypic variants, one leading to ectopic expression of a gibberellin oxidase resulting in a dwarf phenotype  and the second to activation of an adjacent microRNA resulting in enhanced seed carotenoid levels (Wei et al, submitted).
To fully realise the potential of this genetic resource, a number of forward genetic screens were initiated to identify lesions in targeted developmental and biochemical pathways. The preliminary results from two screens dissecting trichome development and proanthocyanidin accumulation in the seed coat are presented.
Functional genomics tools are used to elucidate the role each gene plays within an organism. Due to its comparatively small size and the breadth of resources available, A. thaliana was a prime target to attempt a holistic assault on the genome (Arabidopsis 2010 Program: http://www.arabidopsis.org/portals/masc/FG_projects.jsp). The Arabidopsis community and indeed related species such as the important crop Brassica species have benefited greatly from the ambitious goal of assigning function to each of the ~30,000 annotated Arabidopsis genes. A number of T-DNA mutagenised populations of A. thaliana have been developed and released into the public domain [11–13, 35, 36], which greatly facilitate reverse genetic analysis of target genes through the identification of knock-out alleles.
The SK population of almost 50,000 activation tagged A. thaliana lines was generated and archived as T3 seed through single seed descent to provide a resource for forward and reverse genetic screens. The activity of the enhancer element present within the integrated T-DNA was expected to produce novel alleles and to increase the likelihood of affecting phenotypes for genes previously masked through the inherent redundancy in the A. thaliana genome. The SK lines carried an average of 1.35 independently segregating insertions per line. Sequencing of DNA flanking insertion sites has genetically characterised 16,428 T-DNA integration events in 15,507 SK lines. The distribution of insertion sites closely mirrored the gene content and gene expression level observed along the A. thaliana chromosomes, with a dearth of insertions in centromeric regions.
A comparison with previously characterised populations determined that the SK population provides 327 unique insertion events in previously untagged A. thaliana genes. Including the SK lines, the available populations provide multiple mutagenic alleles for 27,324 loci. Since the background mutation rate in such populations has been estimated to be as high as 60%  the availability of independent alleles for each gene is essential to confirm functional assignment.
Mutagenic saturation of the A. thaliana gene complement has yet to be achieved, since 6,004 loci still do not have a characterised T-DNA insertion event. An assessment of the loci without insertion events supports the previous analysis which suggested that T-DNA integration preferentially targets transcriptionally active regions of the genome . Among the genes lacking an insertion event there was a bias towards short loci that lacked introns and were expressed at very low levels in carpel tissue (Additional file 2). This bias could explain the prevalence of transcription factors which were found among the non-mutagenised loci. Single copy genes were not over-represented among the untagged loci, which might have been expected for essential non-redundant loci. However, it is possible that such loci are being maintained within the populations in the hemizygous state.
The apparent necessity for accessible or open chromatin regions for T-DNA integration is in conflict with the observed bias of insertion events to intergenic genomic sequence compared to annotated genic regions (χ2 = 1,457, p < 0.0001). There is increasing evidence that there are additional unannotated A. thaliana loci present in the genome [37, 38] that could explain the apparent 'intergenic' insertion events. However, only 275 of the 5,209 intergenic insertion events within the SK population were associated with either the recently described 7,160 sORFs predicted from whole genome expression TILING arrays or 2,263 newly annotated proteins determined from extensive peptide sequencing [37, 38]. The observed discrepancy could be accounted for by insufficient annotation of distal regulatory regions, which have been erroneously classified as intergenic sequence.
Based on the resolvable FST data, a notable number of the T-DNA integration events were found to be complex in nature (11%), predominantly indicating inverted or direct tandem insertion events. Although this implies that single genetic loci are affected, such loci complicate downstream cloning efforts and can potentially lead to additional chromosomal rearrangements [39–41].
In recent years, collections of Arabidopsis mutants (tds and tt lines) have been identified by screening for alterations in seed coat colour, flavonoid biosynthesis and proanthocyanidin accumulation [42–45]. These lines have been used to investigate the flavonoid and proanthocyanidin pathways (reviewed in [46, 47]), yet the biochemical characterization of the latter stages of the pathway has been inadequate and the relative functional position of some proteins remains obscure [48–51]. The poorly characterised steps in flavonoid synthesis could be elucidated further through exploitation of the SK lines. Similarly, questions remaining on the development and regulation of trichome formation  could also be addressed using the described genetic resource.
The SK population is the first A. thaliana activation tagged population to be screened for seed coat colour, proanthocyanidin patterning, and trichome variation. To date, we have recovered a broad assortment of mutants including 12 sk trichome variants, seven sk-tt lines defining new alleles of proanthocyanidin genes and four sk-tt lines with as yet uncharacterised proanthocyanidin phenotypes and genes (Cui et al., Li et al., Gao et al., manuscripts in preparation). These mutant lines will add a wealth of information to our understanding of the flavonoid pathway and trichome development. The SK population already has proven its value by yielding new phenotypes and genes previously unknown to be involved in proanthocyanidin biosynthesis and regulation (Gao et al, manuscript in preparation). The population should be an excellent resource for exploring additional key processes controlling a multitude of traits and is currently being screened for mutants affected in abiotic stress tolerance and caretonoid biosynthesis.
An additional resource of almost 50,000 T-DNA tagged A. thaliana lines has been developed enabling the continuing efforts to assign function to the entire gene complement of a plant http://aafc-aac.usask.ca/FST. This population can be screened for both loss and gain-of-function phenotypes due to the presence of enhancers on the integrated T-DNA molecule. Mapping of the FSTs for 15,507 SK lines has identified insertion events in 327 genes with no previously recorded T-DNA mutation and a second allele for 940 additional genes. The utility of this population for capturing novel phenotypic variation has been demonstrated through initial screens assaying trichome development and proanthocyanidin accumulation. The potential of this resource to genetically dissect complex physiological responses is being exploited in the study of the plant's response to abiotic stress.
T-DNA mediated mutagenesis
All plants were grown in a 50:50 combination of coconut fibre and soil-less mix containing slow release fertilizer under greenhouse conditions at 20°C with 16 hours light. The wild-type A. thaliana accession Columbia-4 (Col-4) was transformed with the binary vector pSKI015  using the A. tumefaciens strain GV3101 according to the floral dip protocol described in Clough et al  with the addition of a vacuum infiltration step subsequent to submersion of the plants in the GV301 culture, when a vacuum of 500 mm/Hg was applied and held for 3 minutes (Gast model DOA-P104-AA vacuum pump, Fisher Scientific, Ottawa, Canada). Primary transformants (T1) were selected using the herbicide glufosinate ammonium (Liberty, Syngenta, Canada). The primary transformants were transplanted into Arasystem trays (BetaTech, Belgium) to allow maintenance of individual lines. In total, 49,160 T1 activation tagged lines (SK lines) were generated from which T2 seed was archived and T3 lines were obtained by single-seed descent. Plant tissue (50–100 mg) was collected from each T2 line into 96-well microtube racks (Qiagen Inc., USA) and stored at -80 C. Samples were freeze-dried, homogenized in a Retsch MM300 mill and DNA was extracted using a CTAB method based on Doyle and Doyle (1990).
Estimation of insertion site number
The segregation ratio of herbicide resistance seedlings were obtained for 100 T2 lines by growing ~50 seeds on Petri plates using the methods described in Robinson et al., (2004) with the addition of 7.5 mg/litre glufosinate ammonium (Bayer CropScience, Regina, SK). The segregation ratio for herbicide resistance was scored 14 days after plating, which was assumed to reflect the number of independent insertions.
Amplification and sequencing of flanking sequence tags (FST)
The genomic sequence flanking the T-DNA insertion sites (FST) was amplified by mTAIL-PCR as described in  using the T-DNA specific primers pSKTAIL-L1: TTCTCATCTAAGCCCCCATTTGG and pSKTAIL-L2: TGGACGTGAATGTAGACACGTCG. The FST sequence was generated using primer pSKTAIL-L3: ATACGACGGATCGTAATTTGTCG. DNA sequence was obtained using 2 μl of the purified product and Big Dye v3.0 chemistry in accordance with the manufacturer's instructions and resolved on an Applied Biosystem 3700 sequencer (Applied Biosystems, Foster City, US).
Custom Perl scripts were developed to delimit the boundaries and orientation of each gene feature (exon, intron, promoter and untranslated regions sequence) from each annotated gene in the TAIR8 release http://www.arabidopsis.org, and the proportion of the genome for each of the summed values for these gene features was determined: exon (35.9%); intron (22.2%); promoter (24.5%); 5' UTR (2.2%); 3' UTR (3.8%); and intergenic sequence (11.4%). For each SK-FST line the DNA sequencing trace files were warehoused in APED http://sourceforge.net/projects/aped. All FST sequences obtained were aligned to the five nuclear and two organellar pseudochromosomes of A. thaliana using the BLAST algorithm  with default BLAST parameters except that low-complexity filtering was disabled. These sequences have been submitted to GenBank (Accession Nos. FI978382 – FI994028). FSTs with no homology to the Arabidopsis genome were aligned to the pSKI015 T-DNA vector (AF187951) and the circular, linear and AT plasmid genome sequences of A. tumefaciens (AE007869; AE007870; AE007872). FSTs with ambiguous genome assignments and those containing complex repetitive elements were excluded from further analysis.
The pseudochromosome coordinate for each FST in the five Arabidopsis FST datasets (SK, SALK, GABI, SAIL, FLAG) was used to identify multiple alleles for each Arabidopsis gene identifier (AGI). Where multiple FSTs from the same dataset were assigned to a single AGI, those FSTs separated by at least 500 bp were considered to be distinct loci.
Distribution of T-DNA insertion events
The number of FSTs in windows of 100 Kb along each pseudochromosome was determined. Expression levels for each gene was calculated as the mean signal value from three replicate measures from available Affymetrix data from carpel tissue obtained from stage 12 floral tissue as described for slides ATGE_37_A-C at http://affymetrix.arabidopsis.info/narrays/experimentpage.pl?experimentid=152. Only those genes present in three replicate samples were included in the analysis. These data were selected since the ovules are considered the target for heritable T-DNA integration events . The recombination frequencies between 676 markers used to develop high resolution A. thaliana linkage groups and their position on each pseudochromosome was obtained from Singer et al (2006) . These data were used to assign recombination frequencies to windows of 100 Kb.
Identification of complex insertion events
The expectation was that each FST sequence would contain the T-DNA left border sequence and the adjacent A. thaliana genomic sequence. FST sequences that aligned to multiple, non-repetitive genomic regions and/or additional regions of the pSKI015 plasmid indicated multiple, imprecise or complex T-DNA integrations (Figure 3). For each FST the number and orientation of additional plasmid sequence was extracted from the alignment data and used to infer the arrangement of each T-DNA integration event. An example of the sequence alignment data for each different integration event identified is shown in Additional file 3.
Analysis of genes lacking T-DNA integrations
Considering all five FST datasets (SK, SALK, GABI, SAIL, FLAG), a set of 6,004 AGIs were identified as having no T-DNA insertion. For each gene, six basic gene parameters were examined, including transcript length, expression level, proximity to the centromere, the presence of introns, gene copy number and annotation as a pseudogene. The position of each centromere was taken as: 15,088,987 bp on chromosome 1; 3,608,427 bp on chromosome 2; 13,599,567 bp on chromosome 3; 3,456,519 bp on chromosome 4 and 11,742,755 bp on chromosome 5. Affymetrix gene expression data was analysed as described above, although only 1,913 of the genes lacking an insertion were present on the array. The proximity to the centromere was determined by subtracting the position of the 5' most coordinate of each AGI model from the centromeric coordinate. The distribution for each parameter was not normal according to the Jarque-Bera goodness-of-fit test  thus data were analysed according to the non-parametric Wilcoxon-Mann-Whitney test . These data were compared to a distribution of the same size generated by random selection from those genes with insertions.
Seed Coat Colour and Proanthocyanidin Screens
Variability in seed colour density, colour hue, and proanthocyanidin distribution patterns was observed under a stereo-compound microscope. Particular attention was given to recovering subtle changes in seed colour patterns. Seed colour variants were analyzed for variability in proanthocyanidins and flavan-3-ols using the histochemical stain dimethylaminocinnamylaldehyde (Sigma, Oakville, Canada) (2% DMACA in 3 N HCl 50% [w/v] methanol) . Seed colour variants were grown for three additional generations to confirm trait stability.
Screening for variation in trichome morphology
Seeds from each line were surface sterilized with chlorine gas and plated onto 1/2 MS media, stratified at 4°C for 3 days in the dark, and then incubated in an Enconair AC-60 at 22°C, under long-day conditions. Trichome morphology was viewed with a Zeiss Stemi 2000-C dissection microscope. Leaves were frozen in liquid nitrogen and micrographs were taken on a JOEL 6400 cry-scanning electron microscope. The average number of branches for trichomes of wild type A. thaliana was calculated from 798 plants.
Additional datasets used
The genome coordinate position data for the FSTs represented in the SALK, FLAGdb, SAIL and GABI-Kat collections were obtained from http://natural.salk.edu/database/transcriptome/T-DNA.SALK, http://natural.salk.edu/database/transcriptome/T-DNA.FLAG, http://natural.salk.edu/database/transcriptome/T-DNA.SAIL and http://natural.salk.edu/database/transcriptome/T-DNA.GABI respectively.
The microarray data used in this study was obtained from slides ATGE_37_A, ATGE_37_B and ATGE_37_C available at http://affymetrix.arabidopsis.info/narrays/experimentpage.pl?experimentid=152.
The work described was supported by funding from AAFC Canadian Crop Genomics Initiative grants to IAPP and MYG, in addition YYW was partially supported by an NSERC discovery grant to SR.
- Meyerowitz EM: Prehistory and History of Arabidopsis Research. Plant Physiol. 2001, 125 (1): 15-19. 10.1104/pp.125.1.15.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang X, Henriques R, Lin S-S, Niu Q-W, Chua N-H: Agrobacterium-mediated transformation of Arabidopsis thaliana using the floral dip method. Nat Protocols. 2006, 1 (2): 641-646. 10.1038/nprot.2006.97.PubMedView ArticleGoogle Scholar
- AGI: Analysis of the genome sequence of the flowering plant Arabidopsis thaliana. Nature. 2007, 408 (6814): 796-815. 10.1038/35048692.Google Scholar
- Ayliffe MA, Pallotta M, Langridge P, Pryor AJ: A barley activation tagging system. Plant Mol Biol. 2007, 64 (3): 329-347. 10.1007/s11103-007-9157-8.PubMedView ArticleGoogle Scholar
- An S, Park S, Jeong DH, Lee DY, Kang HG, Yu JH, Hur J, Kim SR, Kim YH, Lee M, et al: Generation and analysis of end sequence database for T-DNA tagging lines in rice. Plant Physiol. 2003, 133 (4): 2040-2047. 10.1104/pp.103.030478.PubMedPubMed CentralView ArticleGoogle Scholar
- Mathieu M, Winters EK, Kong F, Wan J, Wang S, Eckert H, Luth D, Paz M, Donovan C, Zhang Z, et al: Establishment of a soybean (Glycine max Merr. L) transposon-based mutagenesis repository. Planta. 2009, 229 (2): 279-289. 10.1007/s00425-008-0827-9.PubMedView ArticleGoogle Scholar
- Ostergaard L, Yanofsky MF: Establishing gene function by mutagenesis in Arabidopsis thaliana. Plant J. 2004, 39 (5): 682-696. 10.1111/j.1365-313X.2004.02149.x.PubMedView ArticleGoogle Scholar
- Greene EA, Codomo CA, Taylor NE, Henikoff JG, Till BJ, Reynolds SH, Enns LC, Burtner C, Johnson JE, Odden AR, et al: Spectrum of Chemically Induced Mutations From a Large-Scale Reverse-Genetic Screen in Arabidopsis. Genetics. 2003, 164 (2): 731-740.PubMedPubMed CentralGoogle Scholar
- Sundaresan V, Springer P, Volpe T, Haward S, Jones JD, Dean C, Ma H, Martienssen R: Patterns of gene action in plant development revealed by enhancer trap and gene trap transposable elements. Genes Dev. 1995, 9 (14): 1797-1810. 10.1101/gad.9.14.1797.PubMedView ArticleGoogle Scholar
- Li X, Song Y, Century K, Straight S, Ronald P, Dong X, Lassner M, Zhang Y: A fast neutron deletion mutagenesis-based reverse genetics system for plants. Plant J. 2001, 27 (3): 235-242. 10.1046/j.1365-313x.2001.01084.x.PubMedView ArticleGoogle Scholar
- Alonso JM, Stepanova AN, Leisse TJ, Kim CJ, Chen H, Shinn P, Stevenson DK, Zimmerman J, Barajas P, Cheuk R, et al: Genome-wide insertional mutagenesis of Arabidopsis thaliana. Science. 2003, 301: 653-657. 10.1126/science.1086391.PubMedView ArticleGoogle Scholar
- Sessions A, Burke E, Presting G, Aux G, McElver J, Patton D, Dietrich B, Ho P, Bacwaden J, Ko C, et al: A high-throughput Arabidopsis reverse genetics system. Plant Cell. 2002, 14 (12): 2985-2994. 10.1105/tpc.004630.PubMedPubMed CentralView ArticleGoogle Scholar
- Samson F, Brunaud V, Duchene S, De Oliveira Y, Caboche M, Lecharny A, Aubourg S: FLAGdb++: a database for the functional analysis of the Arabidopsis genome. Nucleic Acids Res. 2004, D347-350. 10.1093/nar/gkh134. 32 Database.
- Li Y, Rosso MG, Viehoever P, Weisshaar B: GABI-Kat SimpleSearch: an Arabidopsis thaliana T-DNA mutant database with detailed information for confirmed insertions. Nucleic Acids Res. 2007, D874-878. 10.1093/nar/gkl753. 35 Database
- Li Y, Rosso MG, Ulker B, Weisshaar B: Analysis of T-DNA insertion site distribution patterns in Arabidopsis thaliana reveals special features of genes without insertions. Genomics. 2006, 87 (5): 645-652. 10.1016/j.ygeno.2005.12.010.PubMedView ArticleGoogle Scholar
- Springer PS: Gene Traps: Tools for Plant Development and Genomics. Plant Cell. 2000, 12 (7): 1007-1020. 10.1105/tpc.12.7.1007.PubMedPubMed CentralView ArticleGoogle Scholar
- Tani H, Chen X, Nurmberg P, Grant JJ, SantaMaria M, Chini A, Gilroy E, Birch PR, Loake GJ: Activation tagging in plants: a tool for gene discovery. Funct Integr Genomics. 2004, 4 (4): 258-266. 10.1007/s10142-004-0112-3.PubMedView ArticleGoogle Scholar
- Weigel D, Ahn JH, Blazquez MA, Borevitz JO, Christensen SK, Fankhauser C, Ferrandiz C, Kardailsky I, Malancharuvil EJ, Neff MM, et al: Activation tagging in Arabidopsis. Plant Physiol. 2000, 122 (4): 1003-1013. 10.1104/pp.122.4.1003.PubMedPubMed CentralView ArticleGoogle Scholar
- Koiwa H, Bressan RA, Hasegawa PM: Identification of plant stress-responsive determinants in Arabidopsis by large-scale forward genetic screens. J Exp Bot. 2006, 57 (5): 1119-1128. 10.1093/jxb/erj093.PubMedView ArticleGoogle Scholar
- Ulker B, Peiter E, Dixon DP, Moffat C, Capper R, Bouche N, Edwards R, Sanders D, Knight H, Knight MR: Getting the most out of publicly available T-DNA insertion lines. Plant J. 2008, 56 (4): 665-677. 10.1111/j.1365-313X.2008.03608.x.PubMedView ArticleGoogle Scholar
- McElver J, Tzafrir I, Aux G, Rogers R, Ashby C, Smith K, Thomas C, Schetter A, Zhou Q, Cushman MA, et al: Insertional mutagenesis of genes required for seed development in Arabidopsis thaliana. Genetics. 2001, 159 (4): 1751-1763.PubMedPubMed CentralGoogle Scholar
- Singer T, Fan Y, Chang HS, Zhu T, Hazen SP, Briggs SP: A high-resolution map of Arabidopsis recombinant inbred lines by whole-genome exon array hybridization. PLoS Genet. 2006, 2 (9): e144-10.1371/journal.pgen.0020144.PubMedPubMed CentralView ArticleGoogle Scholar
- Ulker B, Li Y, Rosso MG, Logemann E, Somssich IE, Weisshaar B: T-DNA-mediated transfer of Agrobacterium tumefaciens chromosomal DNA into plants. Nat Biotechnol. 2008, 26 (9): 1015-1017. 10.1038/nbt.1491.PubMedView ArticleGoogle Scholar
- Stein LD, Mungall C, Shu S, Caudy M, Mangone M, Day A, Nickerson E, Stajich JE, Harris TW, Arva A, et al: The generic genome browser: a building block for a model organism system database. Genome Res. 2002, 12 (10): 1599-1610. 10.1101/gr.403602.PubMedPubMed CentralView ArticleGoogle Scholar
- Robinson SJ, Parkin IAP: Bridging the Gene-to-Function Knowledge Gap Through Functional Genomics. Plant Genomics. 2009, 153-173. full_text.View ArticleGoogle Scholar
- Esch JJ, Oppenheimer DG, Marks MD: Characterization of a weak allele of the GL1 gene of Arabidopsis thaliana. Plant Mol Biol. 1994, 24 (1): 203-207. 10.1007/BF00040586.PubMedView ArticleGoogle Scholar
- Rerie WG, Feldmann KA, Marks MD: The GLABRA2 gene encodes a homeo domain protein required for normal trichome development in Arabidopsis. Genes Dev. 1994, 8 (12): 1388-1399. 10.1101/gad.8.12.1388.PubMedView ArticleGoogle Scholar
- Payne CT, Zhang F, Lloyd AM: GL3 encodes a bHLH protein that regulates trichome development in arabidopsis through interaction with GL1 and TTG1. Genetics. 2000, 156 (3): 1349-1362.PubMedPubMed CentralGoogle Scholar
- El Refy A, Perazza D, Zekraoui L, Valay JG, Bechtold N, Brown S, Hulskamp M, Herzog M, Bonneville JM: The Arabidopsis KAKTUS gene encodes a HECT protein and controls the number of endoreduplication cycles. Mol Genet Genomics. 2003, 270 (5): 403-414. 10.1007/s00438-003-0932-1.PubMedView ArticleGoogle Scholar
- Mathur J, Spielhofer P, Kost B, Chua N: The actin cytoskeleton is required to elaborate and maintain spatial patterning during trichome cell morphogenesis in Arabidopsis thaliana. Development. 1999, 126 (24): 5559-5568.PubMedGoogle Scholar
- Saedler R, Mathur N, Srinivas BP, Kernebeck B, Hulskamp M, Mathur J: Actin control over microtubules suggested by DISTORTED2 encoding the Arabidopsis ARPC2 subunit homolog. Plant Cell Physiol. 2004, 45 (7): 813-822. 10.1093/pcp/pch103.PubMedView ArticleGoogle Scholar
- Potikha T, Delmer DP: A mutant of Arabidopsis thaliana displaying altered patterns of cellulose deposition. Plant J. 1995, 7 (3): 453-460. 10.1046/j.1365-313X.1995.7030453.x.View ArticleGoogle Scholar
- Ilgenfritz H, Bouyer D, Schnittger A, Mathur J, Kirik V, Schwab B, Chua NH, Jurgens G, Hulskamp M: The Arabidopsis STICHEL gene is a regulator of trichome branch number and encodes a novel protein. Plant Physiol. 2003, 131 (2): 643-655. 10.1104/pp.014209.PubMedPubMed CentralView ArticleGoogle Scholar
- Oppenheimer DG, Pollock MA, Vacik J, Szymanski DB, Ericson B, Feldmann K, Marks MD: Essential role of a kinesin-like protein in Arabidopsis trichome morphogenesis. Proc Natl Acad Sci USA. 1997, 94 (12): 6261-6266. 10.1073/pnas.94.12.6261.PubMedPubMed CentralView ArticleGoogle Scholar
- Li Y, Rosso MG, Strizhov N, Viehoever P, Weisshaar B: GABI-Kat SimpleSearch: a flanking sequence tag (FST) database for the identification of T-DNA insertion mutants in Arabidopsis thaliana. Bioinformatics. 2003, 19 (11): 1441-1442. 10.1093/bioinformatics/btg170.PubMedView ArticleGoogle Scholar
- Sakurai T, Satou M, Akiyama K, Iida K, Seki M, Kuromori T, Ito T,Konagaya A, Toyoda T, Shinozaki K: RARGE: a large-scale databaseof RIKEN Arabidopsis resources ranging from transcriptometo phenome. Nucleic Acids Res 2005:D647-650.
- Hanada K, Zhang X, Borevitz JO, Li WH, Shiu SH: A large number of novel coding small open reading frames in the intergenic regions of the Arabidopsis thaliana genome are transcribed and/or under purifying selection. Genome Res. 2007, 17 (5): 632-640. 10.1101/gr.5836207.PubMedPubMed CentralView ArticleGoogle Scholar
- Castellana NE, Payne SH, Shen Z, Stanke M, Bafna V, Briggs SP: Discovery and revision of Arabidopsis genes by proteogenomics. Proc Natl Acad Sci USA. 2008, 105 (52): 21034-21038. 10.1073/pnas.0811066106.PubMedPubMed CentralView ArticleGoogle Scholar
- Nacry P, Camilleri C, Courtial B, Caboche M, Bouchez D: Major chromosomal rearrangements induced by T-DNA transformation in Arabidopsis. Genetics. 1998, 149 (2): 641-650.PubMedPubMed CentralGoogle Scholar
- Laufs P, Autran D, Traas J: A chromosomal paracentric inversion associated with T-DNA integration in Arabidopsis. Plant J. 1999, 18 (2): 131-139. 10.1046/j.1365-313X.1999.00436.x.PubMedView ArticleGoogle Scholar
- Tax FE, Vernon DM: T-DNA-associated duplication/translocations in Arabidopsis. Implications for mutant analysis and functional genomics. Plant physiology. 2001, 126 (4): 1527-1538. 10.1104/pp.126.4.1527.PubMedPubMed CentralView ArticleGoogle Scholar
- Abrahams S, Tanner GJ, Larkin PJ, Ashton AR: Identification and biochemical characterization of mutants in the proanthocyanidin pathway in Arabidopsis. Plant Physiol. 2002, 130 (2): 561-576. 10.1104/pp.006189.PubMedPubMed CentralView ArticleGoogle Scholar
- Koornneef M: Mutations affecting the testa colour in Arabidopsis. Arabidopsis Information Service. 1990, 27: 1-4.Google Scholar
- Shikazono N, Yokota Y, Kitamura S, Suzuki C, Watanabe H, Tano S, Tanaka A: Mutation rate and novel tt mutants of Arabidopsis thaliana induced by carbon ions. Genetics. 2003, 163 (4): 1449-1455.PubMedPubMed CentralGoogle Scholar
- Shirley BW, Kubasek WL, Storz G, Bruggemann E, Koornneef M, Ausubel FM, Goodman HM: Analysis of Arabidopsis mutants deficient in flavonoid biosynthesis. Plant J. 1995, 8 (5): 659-671. 10.1046/j.1365-313X.1995.08050659.x.PubMedView ArticleGoogle Scholar
- Winkel-Shirley B: Flavonoid biosynthesis. A colorful model for genetics, biochemistry, cell biology, and biotechnology. Plant Physiol. 2001, 126 (2): 485-493. 10.1104/pp.126.2.485.PubMedPubMed CentralView ArticleGoogle Scholar
- Marles MA, Ray H, Gruber MY: New perspectives on proanthocyanidin biochemistry and molecular regulation. Phytochem. 2003, 64 (2): 367-383. 10.1016/S0031-9422(03)00377-7.View ArticleGoogle Scholar
- Baxter IR, Young JC, Armstrong G, Foster N, Bogenschutz N, Cordova T, Peer WA, Hazen SP, Murphy AS, Harper JF: A plasma membrane H+-ATPase is required for the formation of proanthocyanidins in the seed coat endothelium of Arabidopsis thaliana. Proc Natl Acad Sci USA. 2005, 102 (7): 2649-2654. 10.1073/pnas.0406377102.PubMedPubMed CentralView ArticleGoogle Scholar
- Kitamura S, Shikazono N, Tanaka A: TRANSPARENT TESTA 19 is involved in the accumulation of both anthocyanins and proanthocyanidins in Arabidopsis. Plant J. 2004, 37 (1): 104-114. 10.1046/j.1365-313X.2003.01943.x.PubMedView ArticleGoogle Scholar
- Marinova K, Pourcel L, Weder B, Schwarz M, Barron D, Routaboul JM, Debeaujon I, Klein M: The Arabidopsis MATE transporter TT12 acts as a vacuolar flavonoid/H+ -antiporter active in proanthocyanidin-accumulating cells of the seed coat. Plant Cell. 2007, 19 (6): 2023-2038. 10.1105/tpc.106.046029.PubMedPubMed CentralView ArticleGoogle Scholar
- Pourcel L, Routaboul JM, Kerhoas L, Caboche M, Lepiniec L, Debeaujon I: TRANSPARENT TESTA10 encodes a laccase-like enzyme involved in oxidative polymerization of flavonoids in Arabidopsis seed coat. Plant Cell. 2005, 17 (11): 2966-2980. 10.1105/tpc.105.035154.PubMedPubMed CentralView ArticleGoogle Scholar
- Ishida T, Kurata T, Okada K, Wada T: A Genetic Regulatory Network in the Development of Trichomes and Root Hairs. Ann Rev Plant Biol. 2008, 59 (1): 365-386. 10.1146/annurev.arplant.59.032607.092949.View ArticleGoogle Scholar
- Clough SJ, Bent AF: Floral dip: a simplified method for Agrobacterium-mediated transformation of Arabidopsis thaliana. Plant J. 1998, 16 (6): 735-743. 10.1046/j.1365-313x.1998.00343.x.PubMedView ArticleGoogle Scholar
- Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997, 25 (17): 3389-3402. 10.1093/nar/25.17.3389.PubMedPubMed CentralView ArticleGoogle Scholar
- Ye G-N, Stone D, Pang S-Z, Creely W, Gonzalez K, Hinchee M: Arabidopsis ovule is the target for Agrobacterium in planta vacuum infiltration transformation. Plant J. 1999, 19 (3): 249-257. 10.1046/j.1365-313X.1999.00520.x.PubMedView ArticleGoogle Scholar
- Jarque CM, Bera AK: Efficient tests for normality, homoscedasticity and serial independence of regression residuals. Econom Letts. 1980, 6 (3): 255-259. 10.1016/0165-1765(80)90024-5.View ArticleGoogle Scholar
- Conover WC: Practical Nonparametric statistics. 2nd edition. New York John Wiley &Sons; 1980.Google Scholar
- Li Y, Tanner G, Larkin P: The DMACA-HCl Protocol and the Threshold Proanthocyanidin Content for Bloat Safety in Forage Legumes. J Sci Food Agri. 1996, 70 (1): 89-101. 10.1002/(SICI)1097-0010(199601)70:1<89::AID-JSFA470>3.0.CO;2-N.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.