Generation and analysis of blueberry transcriptome sequences from leaves, developing fruit, and flower buds from cold acclimation through deacclimation
© Rowland et al; licensee BioMed Central Ltd. 2012
Received: 30 November 2011
Accepted: 2 April 2012
Published: 2 April 2012
There has been increased consumption of blueberries in recent years, fueled in part by their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease. However, very few genomic resources have been available for blueberry. Further development of genomic resources like expressed sequence tags (ESTs), molecular markers, and genetic linkage maps could lead to more rapid genetic improvement. Marker-assisted selection could be used to combine traits for climatic adaptation with fruit and nutritional quality traits.
Efforts to sequence the transcriptome of the commercial highbush blueberry (Vaccinium corymbosum) cultivar Bluecrop and use the sequences to identify genes associated with cold acclimation and fruit development and develop SSR markers for mapping studies are presented here. Transcriptome sequences were generated from blueberry fruit at different stages of development, flower buds at different stages of cold acclimation, and leaves by next-generation Roche 454 sequencing. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The assembled sequences were annotated and functionally mapped to Gene Ontology (GO) terms. Frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR was performed to confirm their differential expression patterns. Overall, 14 out of 17 of the genes examined had differential expression patterns similar to what was predicted from their reads alone. The assembled sequences were also mined for SSRs. From these sequences, 15,886 blueberry EST-SSR loci were identified. Primers were designed from 7,705 of the SSR-containing sequences with adequate flanking sequence. One hundred primer pairs were tested for amplification and polymorphism among parents of two blueberry populations currently being used for genetic linkage map construction. The tetraploid mapping population was based on a cross between the highbush cultivars Draper and Jewel (V. darrowii is also in the background of 'Jewel'). The diploid mapping population was based on a cross between an F1 hybrid of V. darrowii and diploid V. corymbosum and another diploid V. corymbosum. The overall amplification rate of the SSR primers was 68% and the polymorphism rate was 43%.
These results indicate that this large collection of 454 ESTs will be a valuable resource for identifying genes that are potentially differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development in blueberry and related species. In addition, the ESTs have already proved useful for the development of SSR and EST-PCR markers, and are currently being used for construction of genetic linkage maps in blueberry.
Blueberry (Vaccinium section Cyanococcus) is an economically important small fruit crop adapted to acidic, sandy soils that otherwise might be considered worthless from an agronomic standpoint. Blueberry is a member of the Ericaceae family, which includes many acid-loving species, such as the commercially important berry crops, blueberry, cranberry, and lingonberry, and the floral and nursery crops, rhododendron, azalea, and mountain laurel. The commercial blueberries are derived principally from four species, the tetraploid highbush blueberry (Vaccinium corymbosum), the diploid and tetraploid lowbush blueberry (V. myrtilloides and V. angustifolium, respectively), and the hexaploid rabbiteye blueberry (V. virgatum), and hybrids thereof. North America is the major producer of blueberries, although production of blueberries is on the rise worldwide. About 2/3 of blueberry production in the U.S. is from improved cultivars of the tetraploid highbush blueberry (V. corymbosum) and about 1/3 is from wild, managed fields of the tetraploid lowbush blueberry (V. angustifolium).
There has been increased demand for and consumption of blueberries in recent years because of their many recognized health benefits. Blueberry fruit is very high in anthocyanins, which have been linked to improved night vision, prevention of macular degeneration, anti-cancer activity, and reduced risk of heart disease [3, 4]. The compound resveratrol, found in blueberries, has been linked to reduced risk of heart disease and cancer, and another compound, pterostilbene, has been shown to lower cholesterol.
Much progress has been made in traditional breeding of highbush and rabbiteye cultivars since their domestication in the early twentieth century. Breeding efforts have focused on development of cultivars with broader climatic adaptation (increased freezing tolerance for northern regions and reduced chilling requirements for southern regions), broader soil adaptation (ability to grow in higher pH soils), disease and pest resistance, mechanical handling tolerance, and high fruit quality. Lowbush blueberry, on the other hand, is a managed wild crop, and little effort has been made to breed improved varieties. Improved cultural practices, however, have resulted in dramatic increases in yields of lowbush blueberry over the last two decades [6, 7].
Until now, few genomic resources have been available for blueberry, or for the Ericaceae family in general. The availability of genomic tools for molecular breeding could lead to more rapid genetic improvement of blueberry, particularly when combining traits for climatic adaptation with other important traits like fruit and nutritional quality. Blueberry is especially suitable for improvement via marker-assisted breeding because of its long generation time, high heterozygosity, self-infertility especially of diploids, inbreeding depression, and polyploidy of commercial types. In the last 6-7 years, the first few thousand expressed sequence tags (ESTs) were generated and made publicly available for this family, about 5,000 from blueberry [8, 9] and about 1,200 from rhododendron. A limited number of robust molecular markers like simple sequence repeats (SSRs) and expressed sequence tag-polymerase chain reaction markers (EST-PCRs) [12–14] were developed from some of the publicly available blueberry ESTs and are being used in genetic diversity and mapping studies. Current mapping studies are focused on identifying quantitative trait loci (QTL) associated with chilling requirement, freezing tolerance, and fruit quality traits in blueberry. The first microarray experiments have been carried out in blueberry and have successfully identified many transcripts whose abundances increase with cold acclimation [9, 15]. However, more gene expression studies based on a larger collection of gene sequences are needed to sort out genes that are expressed in response to various stimuli and during development.
Because freezing tolerance (especially of flower buds) and fruit quality are important traits that could be improved upon using marker-assisted selection, we report here the generation and analysis of a large collection of publicly available ESTs from flower buds at different stages of cold acclimation, fruit at different stages of ripening, and leaves of highbush blueberry using a high-throughput pyrosequencing approach, based on Roche's 454 Genome Sequencer (GS) FLX Titanium platform. Next generation 454 EST sequencing has been shown to be a very efficient, cost-effective approach for transcriptome analysis of non-model species and for the discovery of rare and novel transcripts [16–21]. In our study, over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons. The contigs and singletons have been annotated and functionally mapped to Gene Ontology (GO) terms. A database was developed to house these sequences and their annotations, and a web-based interface was developed to allow others to search/browse the data (http://bioinformatics.towson.edu/BBGD454). In addition, the frequency of the most abundant sequences in each of the libraries was compared across all libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Differential expression of most of these genes was confirmed through real-time PCR. The assembled ESTs were also screened for simple sequence repeat (SSR) motifs in order to design SSR primers for ongoing mapping studies. These sequences constitute an important genomic tool for the scientific community, particularly for those interested in gene discovery, expression, and mapping/relationship studies in the Ericaceae family.
Results and discussion
454 EST sequencing and assembly
Nine cDNA libraries were constructed from mRNA from various organs collected from plants of the highbush blueberry (V. corymbosum) variety Bluecrop. Organs included young, fully expanded leaves, flower buds collected at various stages of cold acclimation (0, 397, 789, and 1333 chill units), and fruit collected at various stages of ripening (green, white, pink, and blue). From many years of cold hardiness research, we have shown that 'Bluecrop' flower buds (from plants grown in the Mid Atlantic region) have a cold hardiness level or LT50 of about -13°C in late September or early October (0 chill units). Cold hardiness reaches a maximum level (or minimum LT50) of about -25 to -27°C by about mid to late December (~600 chill units), and buds begin to deacclimate in February and reach a cold hardiness level of about -13 to -14°C by late March (~1300 chill units) [22, 23]. Thus, the four flower bud libraries corresponded to time points when (1) flower buds were being formed and large enough to first collect--here plants were still essentially non-acclimated or in the very early stages of acclimation (0 chill units), (2) plants were approaching maximum cold hardiness (397 chill units), (3) plants had reached and were maintaining maximum cold hardiness (789 chill units), and (4) plants had nearly completely deacclimated and buds were beginning to open (1333 chill units).
Table 1. Summary of 454 blueberry EST data. Columns: total number of reads assembled; total number of reads in contigs; number of contigs/singletons; average contig/singleton length (nt). Rows: Flower bud 0'; Flower bud 397'; Flower bud 789'; Flower bud 1333'; All flower bud samples; All berry samples; Leaves and stems.
Annotation of sequences
Annotation of unique sequences (contigs and singletons) from all the various assemblies was attempted based on searches of specific databases for sequence similarity. Blast2Go, an annotation and visualization tool, was used to BLAST the contig and singleton sequences from each assembly against the non-redundant database (nr) of the National Center for Biotechnology Information (NCBI). Domain-finding tools, such as InterProScan, were also used to help annotate those sequences that had no good BLAST hits. Because of their greater length, the percentage of contigs from the 'all' assembly that had significant homology (E-values ≤ 1E-5) to other sequences in GenBank was high (86.3%), much higher than the percentage of singletons with significant homology (18.5%). Of these contig sequences, 84.8% had significant homology to known plant proteins and 1.5% had homology to unknown/hypothetical plant proteins (together making up the 86.3%), whereas 13.7% had no significant homology to other sequences in GenBank. The species hit most often in the top BLASTX searches of the contigs (BLASTX searches the protein databases using a translated nucleotide query) was Vitis vinifera (grape), followed by Ricinus communis (the castor oil plant), Populus trichocarpa (black cottonwood), and Glycine max (soybean), in descending order. Of the various plant species for which we have at least a draft sequence available, Vitis vinifera is the most closely related to blueberry and is also a berry crop.
Comparison of sequence abundance across libraries and real-time PCR results
In addition to comparing the libraries based on annotation of the sequences, the most highly abundant transcripts (contigs with the greatest number of reads) were identified in each of the libraries. Their sequences were then BLASTed against the other eight libraries to determine the number of homologous reads in each of the other libraries. In this way, highly abundant transcripts that were potentially differentially expressed during cold acclimation and during fruit development were identified. Additional file 1 lists the ~30 most highly abundant transcripts from all nine libraries and their total numbers of reads (and percentages of total reads) across each of the libraries. In the last columns, we give our predictions, from the percentages of reads, as to whether each of the transcript levels goes up, down, up then down, down then up, etc., or stays fairly constant across the four bud time points/stages of cold acclimation (0, 397, 789, and 1333 chill units) and the four stages of fruit development (green, white, pink, and blue).
From Additional file 1, it is clear that many of the transcripts appear to be differentially expressed during cold acclimation. Levels of most of the highly abundant transcripts from the Bud 0' library appeared to either decline as cold acclimation progressed and then rise again during deacclimation (down/up designation), or to decline at all the time points past the 0' point (down). Levels of many of the most highly abundant transcripts from the Bud 397' and Bud 789' libraries appeared to rise during acclimation (from 0' to 397' or from 0' to 789') and then decline during deacclimation (1333'). Thus, they were given an up/down designation. Levels of many of the most highly abundant transcripts from the Bud 1333' library appeared to decline during cold acclimation and then rise again during deacclimation (down/up), as with the Bud 0' library. Levels of about 1/3 of the top 30 most highly abundant transcripts in the Bud 1333' library appeared to be low at all the time points prior to deacclimation and then to rise sharply during deacclimation (up).
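The designations used above (up, down, up/down, down/up, or fairly constant) can be illustrated with a minimal Python sketch. This is not the procedure used in this study, where predictions were made by inspecting the percentages of reads; the function name `classify_trend`, the "complex" label, and the 25% `tolerance` threshold are hypothetical choices made only for illustration.

```python
def classify_trend(pcts, tolerance=0.25):
    """Label read percentages across ordered time points.

    pcts: list of percentages of total reads, one per library, in time order.
    tolerance: hypothetical relative range below which the series is
               treated as fairly constant (an assumption, not from the study).
    """
    peak, trough = max(pcts), min(pcts)
    if peak == 0 or (peak - trough) / peak < tolerance:
        return "constant"
    ends = {0, len(pcts) - 1}
    peak_inside = pcts.index(peak) not in ends      # maximum at a middle time point
    trough_inside = pcts.index(trough) not in ends  # minimum at a middle time point
    if peak_inside and trough_inside:
        return "complex"
    if peak_inside:
        return "up/down"
    if trough_inside:
        return "down/up"
    return "up" if pcts[-1] > pcts[0] else "down"


# Made-up percentages for the four bud libraries (0, 397, 789, 1333 chill units).
print(classify_trend([0.10, 0.40, 0.35, 0.12]))
print(classify_trend([0.50, 0.10, 0.08, 0.45]))
```

Because read counts from unreplicated libraries are noisy, any such labels would be predictions only, which is why the trends in this study were checked by real-time PCR.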
These results were very similar to what we found previously using a small microarray to identify differentially expressed transcripts during cold acclimation/deacclimation. Although the microarray was based on only about 1,500 unique genes [9, 15], many of the transcripts identified as differentially expressed in that study were also identified as potentially differentially expressed in this study. Some of the transcripts identified as potentially downregulated during cold acclimation in both studies encoded dehydration-induced RD22-like protein/BURP domain-containing protein, certain heat shock proteins, certain dehydrins, beta-glucosidase, UDP-glucose dehydrogenase, and ascorbate peroxidase (Additional file 1). Transcripts identified as potentially upregulated during cold acclimation in both studies encoded DNA-binding domain proteins, certain LEAs (late embryogenesis abundant)/dehydrins, protease inhibitors, various ribosomal proteins, galactinol synthase, granule-bound starch synthase, and S-adenosylmethionine decarboxylase (Additional file 1). Obtaining similar results from the microarray study and the deeper transcriptome sequencing study suggests that this new database will be very useful for further elucidating the cold acclimation response pathway in blueberry.
Furthermore, this study allowed identification of more potentially differentially expressed transcripts than were discovered in the microarray study, because of the deeper sequencing done here. Many of the transcript sequences obtained in this study are new, so they were not included on the previous microarray. For example, from the comparison of reads across libraries, transcripts encoding pyrophosphate-dependent phosphofructokinase, arginine decarboxylase, lipoxygenase, abscisic stress ripening protein, and a hypothetical protein all appeared to be high at the earliest stage of cold acclimation (0' buds), then downregulated as cold acclimation progressed, then upregulated as deacclimation commenced, except for lipoxygenase which appeared to remain at low levels during deacclimation. On the other hand, transcripts encoding a lipid transfer protein, amino acid selective channel protein, Pointed First Leaf, and a high mobility group protein appeared to be low in 0' buds, upregulated as cold acclimation progressed, and then downregulated during deacclimation, except for Pointed First Leaf which actually appeared to peak during deacclimation. These genes are just a few examples of genes not identified previously in our microarray study as being differentially expressed.
Figure 6B shows transcript levels across the four time points (0, 397, 789, and 1333') for four genes (lipid transfer protein, amino acid selective channel protein, Pointed First Leaf, and a high mobility group protein) that appeared to be upregulated during cold acclimation based on their number of reads, and Figure 6C shows transcript levels for two genes (galactinol synthase and 14 kDa dehydrin) that were already known to be upregulated in flower buds during cold acclimation [9, 29]. Again, all showed the expected trends in gene expression, except for the gene encoding a lipid transfer protein. Instead of being upregulated during cold acclimation, its transcript level was higher in the early stages of cold acclimation, then downregulated as cold acclimation progressed, then upregulated during deacclimation. It is possible that the primers designed from its sequence may have had strong homology to another gene encoding a lipid transfer protein that is, in fact, downregulated during cold acclimation. Lipid transfer proteins have been found to be associated with cold acclimation ability in some Solanum species that are able to acclimate to cold. Based on work done in Arabidopsis, Pointed First Leaf encodes a ribosomal protein S18. Activity of the S18A promoter appears to be restricted to meristems; plants activate an extra copy of this gene in tissues with cell division activity. It is, therefore, notable that transcripts of Pointed First Leaf appear to peak during deacclimation of blueberry flower buds, when buds are resuming growth. Furthermore, expression of a gene encoding a chloroplastic amino acid selective channel protein has been correlated with cold acclimation in cereals.
From Additional file 1, it is also clear that many of the highly abundant transcripts appear to have a complex, differential expression pattern during fruit development. Levels of about half of the most highly abundant transcripts from the green and white fruit libraries appeared to either decline during fruit development or to decline initially and then rise again later during development. The other half appeared to rise, or rise and then decline, or rise, decline, and rise again. Levels of some of the transcripts appearing to decline throughout the ripening period encoded metallothionein-like, lipid transfer, dehydrin, ribulose-bisphosphate carboxylase oxygenase small subunit, and light harvesting complex II proteins. Levels of transcripts encoding a burp domain-containing protein and thioredoxin h appeared to decline until reaching the blue/ripe fruit stage, where they then rose sharply. Levels of transcripts encoding cysteine proteinase precursor appeared to rise during development and then decline at the blue/ripe fruit stage.
Levels of many of the most highly abundant transcripts from the pink fruit library appeared to rise, peaking at the pink fruit stage, and then to decline afterwards at the blue/ripe fruit stage. These included transcripts for pectate lyase, cytochrome b5, cysteine protease-like protein, cyclophilin, glutathione peroxidase, and chalcone synthase. Levels of many of the most highly abundant transcripts from the blue/ripe fruit library appeared to rise during fruit development, and peak at the blue/ripe fruit stage. These included transcripts for aspartic proteinase, flavonoid 3-hydroxylase, and 1-aminocyclopropane-1-carboxylate oxidase.
Figure 7B shows transcript levels across the four stages of fruit ripening (green, white, pink, and blue) for three genes (aspartic proteinase, 1-aminocyclopropane-1-carboxylate oxidase, and flavonoid 3-hydroxylase) whose expression levels appeared to peak at the blue fruit stage based on their number of reads. From real-time PCR, two of the genes (1-aminocyclopropane-1-carboxylate oxidase and flavonoid 3-hydroxylase) showed the expected trends in terms of their gene expression patterns, with highest levels of expression at the blue fruit stage. The gene for aspartic proteinase, however, actually had low levels of expression throughout the different stages of fruit development, with somewhat higher levels at the white fruit stage than at the other stages. It makes sense that the level of transcripts encoding flavonoid 3-hydroxylase, another enzyme involved in anthocyanin biosynthesis, should peak at the blue fruit stage. Finding that transcripts encoding 1-aminocyclopropane-1-carboxylate oxidase, an enzyme involved in ethylene biosynthesis, peak at the pink and blue fruit stages is consistent with findings in strawberry, which have demonstrated increased synthesis of ethylene and ethylene receptors during ripening, even though it, like blueberry, is a non-climacteric fruit.
Overall, we found 14 out of 17 (if we include the gene for pectate lyase), or 82%, of the genes examined to have differential expression patterns similar to what would be predicted from their reads alone. In addition, we BLASTed genes identified in Arabidopsis, as either being part of the CBF-regulon (cold-response pathway turned on by the transcription factor CBF) or as being cold-response regulatory genes [38, 39], against our various 454 sequence assemblies (data not shown). We found homologs to many of these genes among the sequences in our database, such as genes for galactinol synthase, dehydrin cor47, dehydrin erd10, pyruvate decarboxylase, CBF, and ICE (inducer of CBF expression). We also searched our fruit assemblies for enzymes known to be involved in anthocyanin biosynthesis in other plants, and found homologs to all of these genes in our database as well (data not shown). These results suggest that this new database will be very useful for identifying genes that are differentially expressed and play important roles in flower bud development, cold acclimation, chilling unit accumulation, and fruit development.
Mining database for SSRs
Table 2. Summary of microsatellite repeats in unique 'Bluecrop' ESTs. Columns: percent of category; percent of total. Rows include: total no. of SSRs; total dinucleotide repeats; total trinucleotide repeats.
Among the trinucleotide (TNR) motifs found in blueberry ESTs, AAG/CTT was the most frequently occurring (31.8%), followed by ACC/GGT (14.6%), and AGC/GCT and AGG/CCT (15.0%) (Table 2). Our findings agree with other reports that AAG/CTT is the most prevalent TNR and CCG/CGG is relatively rare in dicotyledonous plants [40, 42].
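Motif classes such as AAG/CTT pool every rotation of a repeat unit on either strand, so that AAG, AGA, GAA, CTT, TTC, and TCT are all counted together. The Python sketch below shows one way such grouping could be done; it is only an illustration, the function names are hypothetical, and it is not claimed to be how SSR Server labels motifs.

```python
# Translation table for complementing DNA bases.
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def rotations(motif):
    """Return all cyclic rotations of a repeat unit."""
    return [motif[i:] + motif[:i] for i in range(len(motif))]


def motif_class(motif):
    """Name the strand- and rotation-independent class of a repeat unit.

    The alphabetically smallest rotation over both strands is used as the
    canonical unit and is paired with its reverse complement.
    """
    motif = motif.upper()
    candidates = rotations(motif) + rotations(reverse_complement(motif))
    canonical = min(candidates)
    return canonical + "/" + reverse_complement(canonical)


for unit in ("GAA", "TTC", "GCT", "CGG"):
    print(unit, "->", motif_class(unit))
```

Choosing the alphabetically smallest rotation is an arbitrary but common convention; it happens to reproduce the class names used in this section (AAG/CTT, ACC/GGT, AGC/GCT, AGG/CCT, and CCG/CGG).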
Primers were designed from 7,705 of the SSR-containing sequences. Lack of adequate flanking sequence was the most common reason for not designing primers from the remaining 8,181 SSR-containing sequences. One hundred primer pairs were tested for amplification and polymorphism in a tetraploid V. corymbosum mapping population (F1 population resulting from a cross between 'Draper' and 'Jewel') and in an interspecific diploid mapping population (true testcross population resulting from a cross between F1 hybrid #10 [Fla4B (V. darrowii) × W85-20 (diploid V. corymbosum)] and W85-23 (diploid V. corymbosum)). Details of results are shown in Additional file 2. In summary, 32 primer pairs failed to generate a product in all accessions, while one, VCB-C-13051, failed only in the diploid accessions, which is not surprising given that the ESTs were developed from a tetraploid V. corymbosum cultivar, Bluecrop. This actually indicates a very high rate of cross transference into V. darrowii and diploid V. corymbosum, which agrees with the previous report of Boches et al. Of the 68 SSR primer pairs that generated a product in at least some accessions (the remaining 67 plus VCB-C-13051), 25 were monomorphic in the tetraploid and diploid accessions tested, while 43 resulted in polymorphic products. Six of these 43 were polymorphic only in parents and the two progeny individuals of the tetraploid mapping population (including VCB-C-13051), and 11 were polymorphic only in parents of the diploid mapping population. Therefore, the overall amplification rate was 68% and the polymorphism rate was 43%. Because very few genotypes were assayed, this polymorphism rate is likely an underestimate and is expected to be higher when a large number of diverse blueberry accessions are evaluated.
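The rates reported above follow directly from the primer counts, as the short Python check below shows. All input counts come from the text; the rate among amplifying primer pairs in the last line is derived here and is not reported in the study.

```python
tested = 100             # primer pairs tested
failed_everywhere = 32   # no product in any accession
polymorphic = 43         # polymorphic in at least one population

# Pairs giving a product in at least some accessions. This count includes
# VCB-C-13051, which failed only in the diploid accessions.
amplified = tested - failed_everywhere
monomorphic = amplified - polymorphic

amplification_rate = 100 * amplified / tested
polymorphism_rate = 100 * polymorphic / tested

# Derived here, not reported in the study: polymorphism among amplifying pairs.
polymorphism_among_amplified = 100 * polymorphic / amplified

print(amplified, monomorphic)
print(amplification_rate, polymorphism_rate)
print(round(polymorphism_among_amplified, 1))
```

Both reported rates use all 100 tested primer pairs as the denominator, so the 43% figure is a conservative measure of how informative the amplifying markers are.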
We have generated a large collection of transcript sequences from the commercial highbush blueberry (V. corymbosum L.) using next generation 454 sequencing technology. Transcriptome sequences were obtained from nine different libraries including fruit at four different stages of development, flower buds at four different stages of cold acclimation, and leaves. Over 600,000 reads were assembled into approximately 15,000 contigs and 124,000 singletons, which were annotated and functionally mapped to GO terms. Frequency of the most abundant sequences in each of the libraries was compared across the other libraries to identify genes that are potentially differentially expressed during cold acclimation and fruit development. Real-time PCR confirmed the differential expression patterns of most of the genes that were analyzed. The assembled sequences were also mined for SSRs and over 15,000 blueberry EST-SSR loci were identified. This collection of ESTs should prove to be an important resource for the scientific community particularly for those interested in biological processes such as flower bud development, cold acclimation, chilling unit accumulation/vernalization, flowering, and fruit development, and for those interested in development of molecular markers and genetic linkage maps in blueberry and related species.
From this EST database, we are currently identifying candidate genes for several horticulturally significant traits, such as cold hardiness, chilling requirement, fruit color, etc. based on predicted or real gene expression patterns in blueberry or other plants. We are attempting to map these candidate genes as EST-PCR markers in our mapping populations to determine if they map to the same regions as QTL for these traits. If the genes themselves cannot be mapped due to lack of length polymorphisms, then we are attempting to identify and map SSRs near the genes. This work is being done through a collaboration with Dr. Allan Brown (North Carolina State University), who is heading up an effort to sequence and assemble the whole blueberry genome.
Leaves, flower buds, and fruit were collected from multiple plants of the highbush blueberry cultivar Bluecrop grown at the USDA/ARS, Beltsville Agricultural Research Center, Beltsville, MD. 'Bluecrop' was chosen because it is relatively cold hardy and is the "industry standard" of highbush cultivars. Flower buds were collected from field plants during the fall and winter of 2006-2007 with increasing exposure to chilling temperatures, measured as chill units (hours between 0-7°C). Buds were harvested at 0 (9/7/06), 397 (11/30/06), 789 (1/16/07) and 1333 (3/27/07) chill units. Leaf and fruit samples were collected from plants during the 2008 growing season. Fruit was harvested from field plants at four stages of ripening: green (6/12/08), white (6/27/08), pink (7/8/08) and blue or ripe (7/8/08). All tissues were frozen in liquid nitrogen immediately after harvest and stored at -80°C.
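Chill units as defined here (hours between 0 and 7°C) could be tallied from hourly temperature records roughly as follows. This is a sketch only: the function name, the choice to treat both bounds as inclusive, and the sample temperatures are assumptions, and the study does not describe how its temperature records were processed.

```python
def count_chill_units(hourly_temps_c, low=0.0, high=7.0):
    """Count hours whose temperature falls within [low, high] degrees C.

    hourly_temps_c: one temperature reading per hour.
    Treating both bounds as inclusive is an assumption.
    """
    return sum(1 for temp in hourly_temps_c if low <= temp <= high)


# Made-up readings for six consecutive hours.
readings = [-2.0, 0.0, 3.5, 7.0, 7.1, 12.0]
print(count_chill_units(readings))
```

Summing this count from the start of the season to each collection date would give cumulative totals comparable to the 0, 397, 789, and 1333 chill units used to name the bud samples.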
RNA extraction and cDNA preparation
Total RNA was isolated from the leaf and four bud (0, 397, 789, and 1333 chill units) samples by a modification of the method of Chang et al. Essentially, two grams of each tissue was ground finely in liquid nitrogen and incubated at 65°C in pre-warmed CTAB extraction buffer. Two chloroform:IAA (24:1) extractions were performed, followed by overnight precipitation with LiCl. RNA pellets were resuspended in DEPC water, precipitated again with ethanol and NaOAc, washed, and finally resuspended in 1 ml DEPC water. For isolation of total RNA from the four fruit (green, white, pink, and blue) samples, the same procedure was used with four grams of tissue and the incorporation of a centrifugation step after initial incubation to remove cell debris. Supernatants were transferred to fresh tubes and chloroform:IAA extractions were performed. Additionally, after overnight LiCl precipitation, pellets were washed two to three times with 70% ethanol to clear pigments and resuspended in 500 μl DEPC water. RNA quality was checked on 1% agarose gels stained with ethidium bromide, and concentration was measured with a NanoDrop ND-1000 (NanoDrop Technologies, USA).
The Promega PolyATtract mRNA kit (Promega Corp., USA) was used to isolate mRNA from the nine total RNAs. Poly(A) RNA was ethanol precipitated, and quality and concentration were assessed with the NanoDrop ND-1000. Clontech's SMART cDNA Library Construction kit (Clontech, USA) was used for cDNA synthesis with a protocol adapted by K.V. Donohue (personal communication), and provided by the Genomic Sciences Laboratory, North Carolina State University. The cDNAs were precipitated with ammonium acetate and ethanol, and resuspended in TE. Quality of the cDNAs was assessed on 1% agarose gels stained with ethidium bromide; quantity was determined with the NanoDrop ND-1000.
Library construction and 454 sequencing
The nine cDNAs (from leaves, flower buds collected at 0, 397, 789, and 1333 chill units, and fruit collected at green, white, pink, and blue fruit stages) were provided to the Genomic Sciences Laboratory, North Carolina State University, for library construction and 454 sequencing. The nine 454 libraries were constructed essentially as described in Poinar et al. and multiplexed. Generally, the cDNAs were sheared by nebulization to give fragments of about 500 bp. The fragmented cDNAs were ligated to adaptor sequences and immobilized on beads. DNA fragments were denatured to generate single-stranded DNA libraries and amplified by emulsion PCR. Sequencing of the libraries was performed using the 454-GS FLX Titanium sequencing platform (454 Life Sciences, Roche Diagnostics, USA). All raw 454 sequence data were deposited in the Sequence Read Archive of the NCBI, accession numbers SRX100856, SRX100859, and SRX100861-SRX100867.
Sequences were analyzed with the GS FLX software v2.0.01.14 package (454 Life Sciences, Roche). Using normalization, correction, and quality-filtering algorithms, weak signals and low quality sequences were removed; read ends were also screened and trimmed for 454 adaptor sequences. Another filtering step masked SMART PCR primer sequences (Clontech) and removed sequences shorter than 50 nucleotides. The remaining 454 sequences were then assembled into unique putative transcripts (contigs and singletons) using the GS De Novo Assembler, an application of the GS FLX software. Default parameters were used, except that the minimum overlap was reduced from 40 bases to 30 bases. Reads were assembled in several ways. First, reads from each library were assembled separately. Then, all reads from buds were assembled together, as were all reads from fruit. Finally, all reads from the nine libraries were assembled together.
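The length filter described above (removal of sequences shorter than 50 nucleotides) can be sketched in a few lines of Python. In this study the filtering was part of a dedicated pipeline around the GS FLX software, so the FASTA parser and the function names below are illustrative assumptions rather than the code actually used.

```python
def read_fasta(lines):
    """Yield (name, sequence) pairs from an iterable of FASTA-formatted lines."""
    name, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                yield name, "".join(chunks)
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        yield name, "".join(chunks)


def drop_short(records, min_length=50):
    """Keep only records whose sequence has at least min_length nucleotides."""
    return [(name, seq) for name, seq in records if len(seq) >= min_length]


# Made-up reads: one of 49 nt (dropped) and one of 50 nt (kept).
example = [">read1", "A" * 49, ">read2", "ACGT" * 10, "ACGTACGTAC"]
for name, seq in drop_short(read_fasta(example)):
    print(name, len(seq))
```

The same helper could be reused with a threshold of 120 to mirror the cutoff applied before SSR mining.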
Sequence annotation and determination of number of reads
The 454 contigs and singletons from all the various assemblies of the nine libraries were annotated by the Bioinformatics Laboratory at Towson University. The contig and singleton sequences from the assemblies were batch BLASTed to identify the genes expressed in the respective tissues. A domain-finding tool, InterProScan, was used to help annotate the sequences. Blast2Go and other custom programs, written in the scripting language Perl, were used to create annotated, tab-delimited tables, which included information on taxonomy, gene function, tissue specificity, and GO terms. Contigs with the highest number of reads from each of the libraries were pairwise aligned using BLAST against all the other eight libraries to determine the number of reads with homology from the other libraries. A cut-off E-value of 1E-5 was used. In this way, first, the most highly abundant sequences were identified in each of the libraries. Second, of these, transcripts that were potentially differentially expressed during cold acclimation and during fruit development were identified. The sequences, along with their respective annotations, were stored in a custom-built relational database. The web-based database was built using SQL Server 2008 and ASP.NET (Microsoft, Redmond, WA, USA). Search and browse capabilities were added to allow scientists to access and search the data from the internet. The database can be accessed from: http://bioinformatics.towson.edu/BBGD454.
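The read-counting step can be illustrated with a short Python sketch. It assumes the BLAST results are available in the standard 12-column tabular layout (as produced by `-outfmt 6` in NCBI BLAST+), in which columns 1, 2, and 11 hold the query ID, subject ID, and E-value. The study does not state which output format or scripts were used, so the function names and example records below are hypothetical.

```python
from collections import defaultdict


def count_homologous_reads(tabular_lines, max_evalue=1e-5):
    """Count distinct subject reads per query contig at or below an E-value cutoff."""
    hits = defaultdict(set)
    for line in tabular_lines:
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        query, subject, evalue = fields[0], fields[1], float(fields[10])
        if evalue <= max_evalue:
            hits[query].add(subject)  # a set, so each read is counted once
    return {query: len(reads) for query, reads in hits.items()}


def percent_of_library(read_count, library_size):
    """Express a read count as a percentage of all reads in a library."""
    return 100.0 * read_count / library_size


# Made-up hits of two contigs against the reads of one library.
example = [
    "contig1\tread_a\t98.0\t200\t4\t0\t1\t200\t1\t200\t1e-50\t350",
    "contig1\tread_a\t95.0\t100\t5\t0\t201\t300\t1\t100\t1e-20\t150",
    "contig1\tread_b\t90.0\t60\t6\t0\t1\t60\t1\t60\t0.001\t40",
    "contig2\tread_c\t99.0\t150\t1\t0\t1\t150\t1\t150\t2e-06\t200",
]
print(count_homologous_reads(example))
```

Dividing by library size matters because the nine libraries differ in their total numbers of reads; comparing percentages rather than raw counts keeps the libraries comparable.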
Real-time PCR analysis
Total RNA for real-time PCR analysis was isolated from the four bud (0, 397, 789, and 1333 chill units) and four fruit samples (green, white, pink, and blue/ripe) as described above under the "RNA extraction and cDNA preparation" section. RNA quality was checked on 1% agarose gels stained with ethidium bromide, and concentration was measured with a NanoDrop ND-1000 (NanoDrop Technologies, USA). RNA extracts were treated with DNase I, amplification grade (Invitrogen, USA) prior to cDNA synthesis. Complementary DNAs were synthesized using SuperScript III Platinum Two-Step qRT-PCR Kit (Invitrogen).
Primer sequences for quantitative real-time PCR
Primer sequences-forward (F) and reverse (R)
Abscisic stress ripening
Amino acid selective channel protein
Cysteine protease-like protein
High mobility group family protein
Bud/Fruit housekeeping gene
Bud/Fruit housekeeping gene
Lipid transfer protein
Bud housekeeping gene
Pointed first leaf
Fruit housekeeping gene
Sequences shorter than 120 nt were removed from the total assembled 454 sequences, leaving 87,071 unique 454 transcript sequences, which were analyzed using the online SSR tool (SSR Server) available at the Genome Database for Vaccinium (GDV, http://www.vaccinium.org). SSR Server identifies simple sequence repeats using user-specified motif parameters and generates an Excel file containing the identified SSRs and their coordinates in the sequence, primers designed with Primer3, and open reading frame (ORF) coordinates obtained with getORF (http://emboss.sourceforge.net/apps/cvs/emboss/apps/getorf.html). SSRs recorded for the final dataset included dinucleotide repeats (DNR) with at least 5 repeats, trinucleotide repeats (TNR) with at least 4 repeats, tetramers with at least 3 repeats, and pentamers with at least 3 repeats. Of these SSR-containing sequences, those that had a GC content between 40 and 60% and a minimum of 20 bases of sequence on either side of the repeat motif were selected as optimal candidates for primer development.
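The repeat thresholds and candidate criteria above can be expressed as a short Python sketch. This is not the SSR Server algorithm; it is an illustrative regular-expression scan under two stated assumptions: motifs that are themselves repeats of a shorter unit (for example AAAA or ATAT) are not reported under the longer motif length, and GC content is computed over the whole transcript sequence. All function names are hypothetical.

```python
import re

# Minimum repeat counts per motif length, as stated in the text:
# di >= 5, tri >= 4, tetra >= 3, penta >= 3.
MIN_REPEATS = {2: 5, 3: 4, 4: 3, 5: 3}


def is_primitive(motif):
    """True if `motif` is not itself a repetition of a shorter unit."""
    n = len(motif)
    return not any(
        n % k == 0 and motif == motif[:k] * (n // k) for k in range(1, n)
    )


def find_ssrs(seq):
    """Return (motif, repeat_count, start, end) tuples, 0-based, end exclusive."""
    seq = seq.upper()
    found = []
    for k, min_rep in sorted(MIN_REPEATS.items()):
        # A k-mer followed by at least (min_rep - 1) further copies of itself.
        pattern = re.compile(r"([ACGT]{%d})\1{%d,}" % (k, min_rep - 1))
        pos = 0
        while pos < len(seq):
            m = pattern.match(seq, pos)
            if m and is_primitive(m.group(1)):
                found.append((m.group(1), (m.end() - m.start()) // k,
                              m.start(), m.end()))
                pos = m.end()
            else:
                pos += 1
    return found


def gc_percent(seq):
    """Percentage of G and C bases in `seq`."""
    seq = seq.upper()
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq) if seq else 0.0


def is_primer_candidate(seq, start, end, min_flank=20, gc_range=(40.0, 60.0)):
    """Apply the flank and GC criteria to one SSR located at seq[start:end]."""
    has_flanks = start >= min_flank and len(seq) - end >= min_flank
    return has_flanks and gc_range[0] <= gc_percent(seq) <= gc_range[1]


if __name__ == "__main__":
    demo = "GATTCAGCTAGGCATCCGAT" + "AG" * 6 + "CTGAACGTTCAGGCTTAGCA"
    for motif, count, start, end in find_ssrs(demo):
        print(motif, count, start, end, is_primer_candidate(demo, start, end))
```

Compound and imperfect repeats are ignored in this sketch, so its output should not be expected to match the SSR Server results exactly.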
M13-tailed forward primers, as described by Schuelke, and standard reverse primers were obtained for 100 SSR-containing sequences. These primers were tested for amplification and polymorphism in a tetraploid and a diploid mapping population of blueberry. Individuals tested in the highbush tetraploid (V. corymbosum) mapping population were the parents 'Draper' and 'Jewel' and two progeny individuals, BB 05-61-1 and BB 05-61-2. Individuals tested in the interspecific diploid testcross mapping population included the parents #10 [Fla4B (V. darrowii) × W85-20 (diploid V. corymbosum)] and W85-23 (another diploid V. corymbosum), along with the original parents of #10, Fla4B and W85-20. PCR reactions (15 μL total volume) contained 1X reaction buffer, 2 mM MgCl2, 0.2 mM dNTPs, 0.5 μM of the fluorescent M13 primer, 0.12 μM forward primer, 0.50 μM reverse primer, 0.075 units of GoTaq® DNA Polymerase (Promega, USA), and 4.5 ng genomic DNA. The touchdown PCR temperature profile used for amplification was as follows: one cycle of 94°C for 3 min; 10 cycles of 94°C for 40 s, 65°C for 45 s (-1.0°C per cycle), and 72°C for 45 s; 20 cycles of 94°C for 40 s, 52°C for 45 s, and 72°C for 45 s; eight cycles of 94°C for 40 s, 53°C for 45 s, and 72°C for 45 s; and one cycle at 72°C for 30 min. After PCR success was assessed by 2% agarose gel electrophoresis, PCR products generated from up to four primer pairs were pooled and separated by capillary electrophoresis using the Beckman CEQ 8000 genetic analyzer (Beckman Coulter, USA) for all eight samples.
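The arithmetic behind the touchdown profile and the primer amounts can be checked with a brief Python sketch. It assumes that the first touchdown cycle anneals at 65°C and that the 1.0°C decrement applies to each subsequent cycle of that stage; the function names are hypothetical.

```python
def annealing_schedule(td_start=65.0, td_step=1.0, td_cycles=10,
                       stage2=(52.0, 20), stage3=(53.0, 8)):
    """Return the annealing temperature (°C) for every amplification cycle.

    The touchdown stage starts at `td_start` and drops `td_step` per cycle
    (assumption: the drop begins with the second cycle). Each of `stage2`
    and `stage3` is a (temperature, number_of_cycles) pair.
    """
    temps = [td_start - td_step * i for i in range(td_cycles)]
    temps += [stage2[0]] * stage2[1]
    temps += [stage3[0]] * stage3[1]
    return temps


def picomoles(conc_uM, volume_uL):
    """Amount of primer in a reaction: 1 µM in 1 µL corresponds to 1 pmol."""
    return conc_uM * volume_uL


if __name__ == "__main__":
    temps = annealing_schedule()
    print("amplification cycles:", len(temps))
    print("touchdown stage:", temps[:10])
    for label, conc in [("M13 primer", 0.5), ("forward", 0.12), ("reverse", 0.50)]:
        print(label, round(picomoles(conc, 15.0), 2), "pmol per 15 µL reaction")
```

Under these assumptions the touchdown stage steps from 65°C down to 56°C, and the three stages total 38 amplification cycles, excluding the initial denaturation and final extension.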
Abscisic Stress Ripening
C-Repeat Binding Factor
Expressed Sequence Tag
Inducer of CBF
Lethal Temperature resulting in 50% death
National Center for Biotechnology Information
Open Reading Frame
Quantitative Real-Time PCR
Quantitative Trait Loci
Simple Sequence Repeat
We would like to acknowledge Jeremy Jones for his technical support in evaluating SSR amplification and polymorphism and Brittany McCullough for her bioinformatic support. This work was supported by grant 2008-51180-04861 from the USDA/CSREES Specialty Crop Research Initiative program.
- Galletta GJ, Ballington JR: Blueberries, cranberries and lingonberries. Fruit Breeding. Volume II. Vine and small fruit crops. Edited by: Janick J, Moore JN. New York: John Wiley and Sons, Inc; 1996:1-107.
- United States Department of Agriculture National Agricultural Statistics Service. [http://usda.mannlib.cornell.edu/usda/nass/NoncFruiNu//2010s/2011/NoncFruiNu-07-07-2011.pdf].
- Cho E, Seddon JM, Rosner B, Willett WC, Hankinson SE: Prospective study of intake of fruits, vegetables, vitamins, and carotenoids and risk of age-related maculopathy. Arch Ophthalmol. 2004, 122: 883-892. 10.1001/archopht.122.6.883.
- Kalt W, Joseph JA, Shukitt-Hale B: Blueberries and human health. A review of the current research. J Am Pomol Soc. 2007, 61: 151-160.
- Rimando AM, Kalt W, Magee JB, Dewey J, Ballington JR: Resveratrol, pterostilbene, and piceatannol in Vaccinium berries. J Agric Food Chem. 2004, 52: 4713-4719. 10.1021/jf040095e.
- Yarborough D: Factors contributing to the increase in productivity in the wild blueberry industry. Small Fruits Rev. 2004, 3: 33-43. 10.1300/J301v03n01_05.
- Yarborough D: Wild blueberry culture in Maine. Wild Blueberry Fact Sheet No. 220. University of Maine; 2009.
- Dhanaraj AL, Slovin JP, Rowland LJ: Analysis of gene expression associated with cold acclimation in blueberry floral buds using expressed sequence tags. Plant Sci. 2004, 166: 863-872. 10.1016/j.plantsci.2003.11.013.
- Dhanaraj AL, Alkharouf NW, Beard HS, Chouikha IB, Matthews BF, Wei H, Arora R, Rowland LJ: Major differences observed in transcript profiles of blueberry during cold acclimation under field and cold room conditions. Planta. 2007, 225: 735-751. 10.1007/s00425-006-0382-1.
- Wei H, Dhanaraj AL, Rowland LJ, Fu Y, Krebs SL, Arora R: Comparative analysis of expressed sequence tags from cold-acclimated and non-acclimated leaves of Rhododendron catawbiense Michx. Planta. 2005, 221: 406-416. 10.1007/s00425-004-1440-1.
- Boches PS, Bassil NV, Rowland LJ: Microsatellite markers for Vaccinium from EST and genomic libraries. Mol Ecol Notes. 2005, 5: 657-660. 10.1111/j.1471-8286.2005.01025.x.
- Rowland LJ, Dhanaraj AL, Polashock JJ, Arora R: Utility of blueberry-derived EST-PCR primers in related Ericaceae species. HortSci. 2003, 38: 1428-1432.
- Rowland LJ, Mehra S, Dhanaraj A, Ogden EL, Arora R: Identification of molecular markers associated with cold tolerance in blueberry. Acta Hort. 2003, 625: 59-69.
- Rowland LJ, Mehra S, Dhanaraj AL, Ogden EL, Slovin JP, Ehlenfeldt MK: Development of EST-PCR markers for DNA fingerprinting and genetic relationship studies in blueberry (Vaccinium, section Cyanococcus). J Amer Soc Hort Sci. 2003, 128: 682-690.
- Rowland LJ, Dhanaraj AL, Naik D, Alkharouf N, Matthews B, Arora R: Study of cold tolerance in blueberry using EST libraries, cDNA microarrays, and subtractive hybridization. HortSci. 2008, 43: 1975-1981.
- Cheung F, Haas BJ, Goldberg SMD, May GD, Xiao Y, Town CD: Sequencing Medicago truncatula expressed sequenced tags using 454 Life Sciences technology. BMC Genomics. 2006, 7: 272. 10.1186/1471-2164-7-272.
- Vera JC, Wheat CW, Fescemyer HW, Frilander MJ, Crawford DL, Hanski I, Marden JH: Rapid transcriptome characterization for a nonmodel organism using 454 pyrosequencing. Mol Ecol. 2008, 17: 1636-1647. 10.1111/j.1365-294X.2008.03666.x.
- Barakat A, DiLoreto DS, Zhang Y, Smith C, Baier K, Power WA, Wheeler N, Sederoff R, Carlson JE: Comparison of the transcriptomes of American chestnut (Castanea dentata) and Chinese chestnut (Castanea mollissima) in response to the chestnut blight infection. BMC Plant Biol. 2009, 9: 51. 10.1186/1471-2229-9-51.
- Li Y, Luo HM, Sun C, Song JY, Wu Q, Wang N, Yao H, Steinmetz A, Chen SL: EST analysis reveals putative genes involved in glycyrrhizin biosynthesis. BMC Genomics. 2010, 11: 268. 10.1186/1471-2164-11-268.
- Luo H, Li Y, Sun C, Wu Q, Song J, Sun Y, Steinmetz A, Chen S: Comparison of 454-ESTs from Huperzia serrata and Phlegmariurus carinatus reveals putative genes involved in lycopodium alkaloid biosynthesis and developmental regulation. BMC Plant Biol. 2010, 10: 209. 10.1186/1471-2229-10-209.
- Sun C, Li Y, Wu Q, Luo HM, Sun YZ, Song JY, Lui E, Chen SL: Sequencing and de novo analysis of American ginseng root transcriptome using a GS FLX Titanium platform to discover putative genes involved in ginsenoside biosynthesis. BMC Genomics. 2010, 11: 262. 10.1186/1471-2164-11-262.
- Rowland LJ, Ogden EL, Ehlenfeldt MK, Vinyard B: Cold hardiness, deacclimation kinetics, and bud development among 12 diverse blueberry genotypes under field conditions. J Amer Soc Hort Sci. 2005, 130 (4): 508-514.
- Rowland LJ, Ogden EL, Ehlenfeldt MK, Arora R: Cold tolerance of blueberry genotypes throughout the dormant period from acclimation to deacclimation. HortScience. 2008, 43 (7): 1970-1974.
- Conesa A, Götz S, García-Gómez JM, Terol J, Talón M, Robles M: Blast2GO. A universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics. 2005, 21: 3674-3676. 10.1093/bioinformatics/bti610.
- Mulder N, Apweiler R: InterPro and InterProScan: tools for protein sequence classification and comparison. Methods Mol Biol. 2007, 396: 59-70. 10.1007/978-1-59745-515-2_5.
- Bhardwaj PK, Kaur J, Sobti RC, Ahuja PS, Kumar S: Lipoxygenase in Caragana jubata responds to low temperature, abscisic acid, methyl jasmonate and salicylic acid. Gene. 2011, 483: 49-53. 10.1016/j.gene.2011.05.014.
- Wang J, Sun PP, Chen CL, Wang Y, Fu XZ, Liu JH: An arginine decarboxylase gene PtADC from Poncirus trifoliata confers abiotic stress tolerance and promotes primary root growth in Arabidopsis. J Exp Bot. 2011, 62: 2899-2914. 10.1093/jxb/erq463.
- Hsu YF, Yu SC, Yang CY, Wang CS: Lily ASR protein-conferred cold and freezing resistance in Arabidopsis. Plant Physiol Biochem. 2011.
- Dhanaraj AL, Slovin JP, Rowland LJ: Isolation of a cDNA clone and characterization of expression of the highly abundant, cold acclimation-associated 14 kDa dehydrin of blueberry. Plant Sci. 2005, 168: 949-957. 10.1016/j.plantsci.2004.11.007.
- Kielbowicz-Matuk A, Rey P, Rorat T: The organ-dependent abundance of a Solanum lipid transfer protein is up-regulated upon osmotic constraints and associated with cold acclimation ability. J Exp Bot. 2008, 59: 2191-2203. 10.1093/jxb/ern088.
- Lijsebettens M, Vanderhaeghen R, De Block M, Bauw G, Villarroel R, Van Montagu M: An S18 ribosomal protein gene copy at the Arabidopsis PFL locus affects plant development by its specific expression in meristems. EMBO J. 1994, 13: 3378-3388.
- Baldi P, Grossi M, Pecchioni N, Vale G, Cattivelli L: High expression level of a gene coding for a chloroplastic amino acid selective channel protein is correlated to cold acclimation in cereals. Plant Mol Biol. 1999, 41: 233-243. 10.1023/A:1006375332677.
- Jimenez A, Creissen G, Kular B, Firmin J, Robinson S, Verhoeyen M, Mullineaux P: Changes in oxidative processes and components of the antioxidant system during tomato fruit ripening. Planta. 2002, 214: 751-758. 10.1007/s004250100667.
- Jaakola L, Maatta K, Pirttila AM, Torronen R, Karenlampi S, Hohtola A: Expression of genes involved in anthocyanin biosynthesis in relation to anthocyanin, proanthocyanin, and flavonol levels during bilberry fruit development. Plant Physiol. 2002, 130: 729-739. 10.1104/pp.006957.
- Marin-Rodriguez MC, Orchard J, Seymour GB: Pectate lyases, cell wall degradation and fruit softening. J Exp Bot. 2002, 53: 2115-2119. 10.1093/jxb/erf089.
- Santiago-Domenech N, Jimenez-Bemudez S, Matas AJ, Rose JKC, Munoz-Blanco J, Mercado JA, Quesada MA: Antisense inhibition of a pectate lyase gene supports a role for pectin depolymerization in strawberry fruit softening. J Exp Bot. 2008, 59: 2769-2779. 10.1093/jxb/ern142.
- Trainotti L, Pavanello A, Casadoro G: Different ethylene receptors show an increased expression during the ripening of strawberries: does such an increment imply a role for ethylene in the ripening of these non-climacteric fruits?. J Exp Bot. 2005, 56: 2037-2046. 10.1093/jxb/eri202.
- Maruyama K, Sakuma Y, Kasuga M, Ito Y, Seki M, Goda H, Shimada Y, Yoshida S, Shinozaki K, Yamaguchi-Shinozaki K: Identification of cold-inducible downstream genes of the Arabidopsis DREB1A/CBF3 transcriptional factor using two microarray systems. Plant J. 2004, 38: 982-993. 10.1111/j.1365-313X.2004.02100.x.
- Vogel JT, Zarka DG, Van Buskirk HA, Fowler SG, Thomashow MF: Roles of the CBF2 and ZAT12 transcription factors in configuring the low temperature transcriptome of Arabidopsis. Plant J. 2005, 41: 195-211.
- Qiu L, Yang C, Tian B, Yang J-B, Liu A: Exploiting EST databases for the development and characterization of EST-SSR markers in castor bean (Ricinus communis L.). BMC Plant Biol. 2010, 10: 278-288. 10.1186/1471-2229-10-278.
- Kantety RV, La Rota M, Matthews DE, Sorrells ME: Data mining for simple sequence repeats in expressed sequence tags from barley, maize, rice, sorghum and wheat. Plant Mol Biol. 2002, 48: 501-510. 10.1023/A:1014875206165.
- Morgante M, Hanafey M, Powell W: Microsatellites are preferentially associated with nonrepetitive DNA in plant genomes. Nat Genet. 2002, 30: 194-200. 10.1038/ng822.
- Boches P, Rowland J, Hummer K, Bassil N: Cross-species amplification of SSR loci in the genus Vaccinium. Acta Hort. 2006, 715: 119-128.
- Chang S, Puryear J, Cairney J: A simple and efficient method for isolating RNA from pine trees. Plant Mol Biol Rep. 1993, 11: 113-116. 10.1007/BF02670468.
- Liu JJ, Goh CJ, Loh CS, Liu P, Pua EC: A method for isolation of total RNA from fruit tissues of banana. Plant Mol Biol Rep. 1998, 16: 1-6.
- Jaakola L, Pirttila AM, Halonen M, Hohtola A: Isolation of high quality RNA from bilberry (Vaccinium myrtillus L.) fruit. Mol Biotech. 2001, 19: 201-203. 10.1385/MB:19:2:201.
- Poinar HN, Schwarz C, Qi J, Shapiro B, Macphee RD, Buigues B, Tikhonov A, Huson DH, Tomsho LP, Auch A, et al: Metagenomics to paleogenomics: large-scale sequencing of mammoth DNA. Science. 2006, 311 (5759): 392-394. 10.1126/science.1123360.
- Polashock JJ, Arora R, Yanhui P, Naik D, Rowland LJ: Functional identification of a C-repeat binding factor transcriptional activator from blueberry associated with cold acclimation and freezing tolerance. J Amer Soc Hort Sci. 2010, 135: 40-48.
- Rozen S, Skaletsky H: Primer3 on the WWW for general users and for biologist programmers. Methods Mol Biol. 2000, 132: 365-386.
- Schuelke M: An economic method for the fluorescent labeling of PCR fragments. Nature Biotech. 2000, 18: 233-234. 10.1038/72708.