Evidence of cryptic introgression in tomato (Solanum lycopersicum L.) based on wild tomato species alleles

  • Joanne A Labate1Email author and

    Affiliated with

    • Larry D Robertson1

      Affiliated with

      BMC Plant Biology201212:133

      DOI: 10.1186/1471-2229-12-133

      Received: 22 March 2012

      Accepted: 30 July 2012

      Published: 7 August 2012

      Abstract

      Background

      Many highly beneficial traits (e.g. disease or abiotic stress resistance) have been transferred into crops through crosses with their wild relatives. The 13 recognized species of tomato (Solanum section Lycopersicon) are closely related to each other and wild species genes have been extensively used for improvement of the crop, Solanum lycopersicum L. In addition, the lack of geographical barriers has permitted natural hybridization between S. lycopersicum and its closest wild relative Solanum pimpinellifolium in Ecuador, Peru and northern Chile. In order to better understand patterns of S. lycopersicum diversity, we sequenced 47 markers ranging in length from 130 to 1200 bp (total of 24 kb) in genotypes of S. lycopersicum and wild tomato species S. pimpinellifolium, Solanum arcanum, Solanum peruvianum, Solanum pennellii and Solanum habrochaites. Between six and twelve genotypes were comparatively analyzed per marker. Several of the markers had previously been hypothesized as carrying wild species alleles within S. lycopersicum, i.e., cryptic introgressions.

      Results

      Each marker was mapped with high confidence (e<1 x 10-30) to a single genomic location using BLASTN against tomato whole genome shotgun chromosomes (SL2.40) database. Neighbor-joining trees showed high mean bootstrap support (86.8 ± 2.34%) for distinguishing red-fruited from green-fruited taxa for 38 of the markers. Hybridization and parsimony splits networks, genomic map positions of markers relative to documented introgressions, and historical origins of accessions were used to interpret evolutionary patterns at nine markers with putatively introgressed alleles.

      Conclusion

      Of the 47 genetic markers surveyed in this study, four were involved in linkage drag on chromosome 9 during introgression breeding, while alleles at five markers apparently originated from natural hybridization with S. pimpinellifolium and were associated with primitive genotypes of S. lycopersicum. The positive identification of introgressed genes within crop species such as S. lycopersicum will help inform conservation and utilization of crop germplasm diversity, for example, facilitating the purging of undesirable linkage drag or the exploitation of novel, favorable alleles.

      Keywords

      Cryptic introgression Linkage drag Breeding DNA sequence Solanum species

      Background

      Introgression is the transfer of genes of one species into the gene pool of another via hybridization. As a phenomenon, it has been an important topic in animal and plant genetics research for many different reasons. For example, introgression has been implicated in the adaptation of modern humans [1] and is of concern to conservation biologists due to loss of integrity of wild bird and mammal populations [2, 3]. In plants, introgression is a key concept in studies of the risks of contamination of natural populations by genetically modified (GM) crops. More commonly in crops, favorable genes from wild relatives are intentionally transferred into breeding lines for cultivar development. This has been particularly valuable in crop species that are relatively low in genetic diversity.

      According to a review of crop introgression breeding [4] the major functional categories of beneficial traits transferred from wild species are resistance or tolerance to abiotic stress or disease, yield, cytoplasmic male sterility or fertility restorers for hybrid production, and quality traits. Among the pioneering uses of wild crop relatives during the late 19th to early 20th centuries were the transfer of disease resistances into grape (Vitis vinifera) [5] and sugarcane (Saccarum officinarum) [6]. A 1986 review of 23 crops estimated that 6% of total annual economic value in the US was contributed by crop wild relatives [7]. For 13 major crops of global importance, it was estimated that 46 wild species have been used in released cultivars, and that furthermore, the introgression breeding approach is increasing [4]. Lack of information on pedigrees, unpublished activities within the private sector and changes in taxonomy are some of the factors that contribute to the uncertainty of the collective impacts of crop introgression breeding [4].

      Some of the earliest tomato introgression breeding in the US may have been done indirectly and unwittingly via the French variety Merville des Marchés. Recent phenotypic data collected for Merville des Marchés PI 109834 showed it to be variable in fruit size and smoothness (http://​www.​ars-grin.​gov/​cgi-bin/​npgs/​acc/​display.​pl?​1129442); its genotype was segregating, showed population admixture, and was an outlier based on genetic distance relative to many other S. lycopersicum accessions [8, 9]. We postulated that these were indications of S. pimpinellifolium in its ancestry (this idea was examined in the current study). The Fusarium wilt-resistant processing variety Marvel [10] was selected from Merville des Marchés in the early 1900s, and Marvel was a parent of Marglobe released in 1925 [11], which in turn can be found in the pedigree of many important varieties from the 1930s through the late 1950s (H.M. Munger’s tomato pedigree chart provided by E.D. Cobb, Cornell University, 2012). Direct introgression of tomato with wild species in the US commenced in the 1930s concurrent with collection expeditions to geographic centers of origin. The first released cultivar, developed from Marglobe x S. pimpinellifolium, was aptly named Pan American [12]. Introgression breeding efforts of tomato increased globally post World War II, involving the screening of a wide range of traits and all wild tomato species [13]. Such efforts continue to be of utmost priority today using sophisticated tools such as introgression libraries for gene discovery [14, 15].

      Compellingly, of 96 introgressed traits tallied in released crop cultivars for 11 species (cassava, wheat, millet, rice, maize, sunflower, lettuce, banana, potato, groundnut, tomato), 55 of them were in tomato (Solanum lycopersicum L.); the next highest numbers were found in rice and potato with 12 traits each [4]. The emphasis and success of introgression breeding in tomato encompasses several factors including its intrinsically narrow genetic base, relative ease of crossing with several wild taxa, production demands based on growing conditions and market niche, its susceptibility to pests and pathogens, and its sensitivity to abiotic factors. In addition to resistance or tolerance to dozens of bacterial, viral, fungal, insect, and nematode pathogens, hundreds of favorable genes or quantitative trait loci (QTL) for abiotic stress resistance, flower and fruit traits, yield, and plant architecture have been mapped in wild tomato species [16] and thus hold the potential to be exploited.

      Introgression breeding carries a cost, namely, genetic linkage of non-targeted loci that are eliminated through repeated backcrossing. Linkage drag can persist within a genome despite backcrossing, especially if recombination is suppressed. Several examples of linkage drag in tomato and other crops have been quantified using molecular markers [1721]. Linkage drag can denote favorable, deleterious or neutral alleles that become inadvertently incorporated into breeding lines or cultivars.

      In this study we apply the term ‘cryptic introgression’ [22] to describe latent genetic variation in S. lycopersicum that originated from wild tomato species. Various scenarios can be evoked for its origins ranging from linkage drag, hybridization between feral S. lycopersicum and wild relatives, to crossing in open-pollinated populations by wind or insect vectors with pollen of introgressed cultivars [23]. Cryptic introgression is of interest in germplasm collections such as those conserved at United States Department of Agriculture, Agricultural Research Service (USDA, ARS) Plant Genetic Resources Unit (PGRU) because it can indicate novel genetic variation for exploitation by end-users, or conversely, reveal unfavorable and hence undesirable alleles with respect to crop improvement.

      In previous reports we hypothesized the detection of cryptic introgression in 5% to 10% of DNA markers that were resequenced in tomato germplasm panels [9, 24, 25]. The aim of the current study was to gather additional evidence on these alleles by resequencing and analyzing the same markers in several accessions of wild tomato species and one accession of weedy S. lycopersicum (Table 1). Although variation within wild species gene pools made it impracticable to attempt to discover the 100% identical homologous allele, the assumption was that introgressed alleles would be more closely related to the alleles of a particular wild species than to their S. lycopersicum homologs. To identify introgressed alleles we also used evidence from mapped locations of markers, phenotypic descriptions, and historical origins of lines and accessions.
      Table 1

      Tomato samples analyzed in this study

      Species

      Accession or line

      Descriptiona

      Solanum habrochaites

      PI 126445

      collected from Peru in 1937, source of Cyc-B, green-fruited

      Solanum pennellii

      PI 414773

      collected from Peru in 1976, source of I-1, I-3, green-fruited

      Solanum peruvianum

      G 32592 (LA4125)

      naturally selfing, collected from Chile in 2001, green-fruited

      Solanum peruvianum

      LA1537

      artificially inbred from PI 128650 collected from Chile in 1938 and source of Tm-2 a , green-fruited

      Solanum arcanum b

      G 32591 (LA2157)

      naturally selfing, collected from Peru in 1980, source of Cm QTLs, green-fruited

      Solanum pimpinellifolium

      PI 370093

      traces back to Vaughan Seed Co., Chicago, USA, circa 1930, source of Cf-2, Cf-3, Pto, red-fruited

      Solanum lycopersicum

      PI 303801

      Peru Wild (syn. with Utah 665) from Utah Agr. Expt. Sta., source of Ve, red-fruited

      Solanum lycopersicum b

      PI 99782

      Tomate, collected in 1932 from Peru, red-fruited

      Solanum lycopersicum b

      PI 109834

      Merville des Marchés, collected in 1935 from France, red-fruited

      Solanum lycopersicum b

      PI 129026

      unnamed, collected in 1938 from Ecuador, red-fruited

      Solanum lycopersicum b

      PI 129128

      unnamed, collected in 1938 from Panama, red-fruited

      Solanum lycopersicum b

      PI 196297

      unnamed, collected in 1951 from Nicaragua, red-fruited

      Solanum lycopersicum b

      PI 258474

      unnamed, collected in 1959 from Ecuador, red-fruited

      Solanum lycopersicum b

      PI 258478

      unnamed, collected in 1959 from Peru, red-fruited

      Solanum lycopersicum b

      PI 390510

      unnamed, collected in 1974 from Ecuador, red-fruited

      Solanum lycopersicum b

      TA496

      from S. Tanksley, Cornell Univ., developed in 1990s from E6203 x Vendor-Tm-2 a , red-fruited

      Germplasm sources of sequenced alleles in wild and cultivated tomato accessions.

      a For complete references tracing history of these germplasm sources used for introgression breeding see [26].

      b Sequences published in [25].

      Results and discussion

      Markers and in silico mapping

      For the 47 markers used in this study (Additional file 1: Table S1), nucleotide primers for PCR and sequencing were originally designed against S. lycopersicum sequences except for the Conserved Ortholog Set II (COSII or C2) and unigene (U) markers, which were designed against Euasterids [27, 28]. Molecular markers have shown good success rates of transferability among distantly related wild tomato species such as S. lycopersicum and S. pennellii (for examples see [2931]). In the current study, some primer pairs did not amplify or give clean or homologous reads in every wild tomato sample. The COSII and U markers did not outperform the other markers in terms of successfully generating high quality sequence data because most of them contained introns, which sometimes carried small indels in a heterozyogus condition. Such heterozygous indels were major contributors to poor quality reads and hence missing data.

      In our current germplasm panel (Table 1), S. lycopersicum and S. pimpinellifolium had no missing markers; sequences were available for at least two of the four green-fruited taxa for the final set of 47 markers. S. peruvianum LA1537, S. peruvianum G32592, S. arcanum, S. pennellii and S. habrochaites gave data for 32, 42, 40, 35 and 38 markers, respectively. The latter two species are usually self-incompatible (SI) and carried greater numbers of polymorphic markers within accessions than the inbred accessions of S. peruvianum and S. arcanum that were sampled. Mean SNP frequency across the polymorphic markers was 0.0127 (n = 17) for S. habrochaites, 0.0082 (n = 17) for S. pennellii, 0.0126 (n = 5) for S. peruvianum LA1537, 0.0058 (n = 2) for S. peruvianum G32592 and 0.0078 (n = 7) for Peru Wild. TA496, Tomate and S. arcanum had no polymorphic sites in any of the markers.

      S. pimpinellifolium showed unusually high polymorphism of 0.0055 (n = 17) relative to the other self-compatible taxa. As one explanation, this accession was categorized as an admixture population in a simple sequence repeat (SSR) genotyping study of S. pimpinellifolium population structure [32], so naturally represents two dissimilar S. pimpinellifolium genomes. Because the seed source traces back to Vaughan Seed Co. in USA and Horticultural Experiment Station, Ontario, Canada [33], another possibility is that S. lycopersicum was incorporated into its pedigree by one of those entities. This may be evidenced by comparing the previously estimated D, number of mutations per kb, [34] between the two species [35] which was approximately four-fold greater than our estimate reported below.

      The majority of markers were mapped with high confidence to a single map location within the genome (Figure 1, Additional file 1: Table S1). This is the first report in which a whole-genome sequence [36] and web based tools were available with which to do this for the expressed sequence tag (EST) -based markers [24]. Accordingly, 12 of the EST-based markers were newly mapped and markers 437_2, 2189_1 and 2819_5 were revised with respect to previously predicted chromosomal location based on identities to restriction fragment length polymorphism (RFLP) markers [25]. All chromosomes were represented by the 47 markers; numbers of markers per chromosome ranged from one on chromosome 12 to eight on chromosome 6. Small gaps were observed in the alignments but close examination revealed that these were in masked regions rich in runs of poly A or poly T. Only marker 1675_1 did not initially provide any hits. A lowered stringency of 1 x 10-30 subsequently found the probable position.
      http://static-content.springer.com/image/art%3A10.1186%2F1471-2229-12-133/MediaObjects/12870_2012_1075_Fig1_HTML.jpg
      Figure 1

      Chromosomal map locations of 47 markers sequenced in this study. Nine markers with dashed outlines showed cryptic introgressions. Also shown on the map (color) are documented introgressions used in tomato breeding that were mentioned in this report, S. habrochaites: blue, S, lycopersicum Peru Wild: red, S. peruvianum: purple, S. pimpinellifolium: orange, S. pennellii: green.

      Four markers had two BLASTN hits each. These were 1260_2 for two tightly linked (2,849 nt apart) sequences on chromosome 6, 2325_3 on chromosomes 1 and 3, 2582_1 on chromosomes 4 and 5, and 2819_5 for two linked (14,114 nt apart) regions on chromosome 6. A check of the primer regions found multiple mismatches in predicted primer binding sites of the secondary hit for three of the markers; these were assumed to have amplified as single-copy. For marker 2819_5 the forward primer had a single mismatch and the reverse had no mismatches. The predicted amplicon in the mismatch region was 86% identical to the reference TA496 sequence. When the mismatch sequence was included in MEGA cluster analysis, it separated from all other (wild and cultivated) alleles with 87% bootstrap support, and the mean D among sequences increased 8-fold (D = 1.5 for n = 10 alleles versus D = 12 for n = 11 alleles). Therefore, it is unlikely that this paralog amplified and confounded the results. All markers had previously been screened in our lab for amplification of single bands and highly homozygous sequences within S. lycopersicum[24, 25]. Results of in silico mapping confirmed that this set of markers provides robust results in sampling single-copy S. lycopersicum genes. All sequences were deposited into the European Molecular Biology Laboratory (EMBL), European Nucleotide Archive (ENA) data base as accession numbers HE977919-HE978211.

      Clustering patterns and divergence estimates among taxa

      PI 99782 Tomate was chosen for the current study to represent a ‘pre-introgression breeding’ genotype. It bears small, slightly ribbed and unimproved fruit with scarring and cracking (http://​www.​ars-grin.​gov/​cgi-bin/​npgs/​acc/​display.​pl?​1127604) and was homozygous for the common S. lycopersicum haplotype at 48 of 50 markers for which it had been sequenced [25]. For 38 of the markers there was high bootstrap support for the red-fruited clade that consisted of S. lycopersicum including Tomate and S. pimpinellifolium alleles (Additional file 1: Table S2). Bootstrap values ranged from 54% to 100% (mean ± standard error = 86.8 ± 2.34%). These bootstrap values subtracted from 100% can be interpreted as estimates of the probability of Type I error, i.e., falsely accepting a cluster that is not signified by the data (see [37] for discussion).

      Results underscored the close relationship of S. lycopersicum to S. pimpinellifolium because their alleles were frequently identical or highly similar. The placement of green-fruited species with respect to the red-fruited clade and each other varied among loci. This can result from incomplete lineage sorting or introgression [38]. An example was the placement of S. habrochaites near S. pimpinellifolium for marker TG11 (Additional file 2: Figure S1); this example was also reported by Nesbitt and Tanksley [35].

      Of the nine markers that did not support the red-fruited clade, seven had previously been hypothesized as carrying introgressions (Additional file 1: Table S2). These are discussed below. The other two markers, 1287_1 and 2280_1, showed patterns that contained S. peruvianum within the red-fruited clade (Additional file 2: Figure S1) seemingly due to lack of resolution. At marker 1287_1 S. peruvianum haplotypes were only one or two mutational steps from Tomate. These mutations were not shared with any other taxa. At marker 2280_1 only four unique haplotypes were observed. These were S. habrochaites S. pennellii S. arcanum and {TA496, Tomate, S. pimpinellifolium, Peru Wild, G32592, LA1537}. For the 38 markers that supported the red-fruited clade, accepted taxonomic relationships among tomato species were generally supported [39, 40].

      Divergence estimates (D) among loci are associated with a high variance over evolutionary time due to differences in mutation and recombination rates, selective constraints, and influences of various factors such as random sampling of gametes and demography. Average D ranged from 5 for marker 4301_3 to 52 mutations per kb for marker C2_At1g44575 (mean ± standard error = 16.5 ± 1.29). At the aggregate scale the mean D ± standard error from PI 99782 Tomate was as follows: TA496 D = 0.001 ± 0.0005, Peru Wild D = 0.002 ± 0.0006, S. pimpinellifolium D = 0.002 ± 0.0004, S. arcanum D = 0.020 ± 0.0022, S. peruvianum LA1537 D = 0.020 ± 0.0025, S. peruvianum G32592 D = 0.022 ± 0.0024, S. habrochaites D = 0.021 ± 0.0024 and S. pennellii D = 0.022 ± 0.033.

      The lack of precise resolution in distinguishing taxa by clustering, or by average divergence from Tomate in the case of the green-fruited taxa was a function of shared polymorphisms in many instances, e.g., at 24 markers at least one single nucleotide polymorphism (SNP) within a species was also segregating between other species pairs. This has been observed in other studies of crop species and their closely related wild relatives ([41] and references therein). In addition, random sampling contributed to low resolution, e.g., S. peruvianum haplotypes appeared to be derived from Tomate at marker 1287_1 based on a small number of noninformative SNPs.

      Evidence for introgression

      In previous studies we reported nine markers (Table 2) with highly diverged alleles within S. lycopersicum and hypothesized that this was due to introgression from wild species. These were 220_1, 437_2, 2325_3, 2534_1R (redesigned into 2534_1b), 2486_1 [24], 2819_5, C2_At1g73180 [25], U146140 and C2_At1g44575 [9]. Of the 47 markers in the current study, seven of these nine showed patterns in cladograms that did not cluster together members of the red-fruited clade (Additional file 2: Figure S1). Hybridization networks (Additional file 2: Figure S1), descriptive information of the accessions, map positions of markers, and species origins of documented introgressed disease resistance alleles (e.g. [16, 26, 4244]) were used as total evidence to categorize the divergent markers into putative linkage drag during introgression breeding versus natural out crossing with S. pimpinellifolium (Table 2). This categorization did not constitute proof of natural hybridization versus introgression breeding. However, it was a useful concept from which to synthesize independent lines of evidence and can serve as a basis for future hypothesis testing of the two scenarios.
      Table 2

      Tomato markers tested for cryptic introgression

      Marker

      SGN gene model, predicted protein

      Chromosome location (MB)

      Divergent line or accession

      Genetic-distance based clustering resultsa

      Observations

      Introgression from crop improvement

      220_1

      Solyc09g014280, hydroxycinnamoyl transferase

      ch09, 5.77

      TA496

      TA496 intermediate between LA1537 and S. pennelli

      Major disease resistance genes on ch09 include:

      Ve2, 0.05 MB, (11.002 cMa), from Peru Wild

      2486_1

      Solyc09g014350, glycerol-3-phosphate acyltransferase 6

      ch09, 5.90

      TA496

      low bootstrap values overall except {TA496, LA1537}

      Ve1, 0.06 MB, (11.002 cM), from Peru Wild

      2534_1b

      Solyc09g018790, succinic semialdehyde reductase isofom1

      ch09, 17.00

      TA496

      TA496 clustered with two S. peruvianum accessions

      Cm9.1, (4.0 – 24.0 cM), from S. peruvianum

      Frl, (27.0 – 37.0 cM), from S. peruvianum

      437_2

      Solyc09g061440, uncharacterized protein

      ch09, 54.71

      TA496

      TA496 intermediate between {S. peruvianum, S. arcanum, Peru Wild, S. pimpinellifolium, Tomate} and {S. pennellii, S. habrochaites}

      Tm2 b , 13.62 MB, (32.002 cM), from S. peruvianum

      Sw-5, 67.30 MB, (78.001 cM), from S. peruvianum

      Ph-3, 66.71 – 66.78 MB, (63.0 – 78.0 cM), from S. pimpinellifolium

      Introgression from natural hybridization with S. pimpinellifolium

      2325_3c

      Solyc01g073640, alcohol dehydrogenase-3

      ch01, 70.26

      TA496

      red-fruited clade was supported

      PI 258478 was collected from Peru in 1959, highly variable, fasciated fruit.

      PI 258478

      C2_At1g44575

      Solyc06g060340, chloroplast photosystem II-associated protein

      ch06, 34.71

      PI 258478

      red-fruited clade was supported

      Introgressions on ch06 include:

      Cyc-B, 42.29 MB, (106 cM), from S. habrochaites

      2819_5c

      Solyc06g082670, ribosomal protein L10

      ch06, 44.70

      PI 258478

      red-fruited clade was split into {Peru Wild-1, Tomate, TA496} and {Peru Wild-2, S. pimpinellifolium, PI 258478}

       

      U146140

      Solyc06g083360, DNA-directed RNA polymerase II subunit

      ch06, 45.08

      PI 109834

      red-fruited clade was split into {PI 109834, S. pimpinellifolium} and {TA496, Peru Wild, Tomate}

      PI 109834 Merville des Marchés was collected from France in 1935.

      C2_At1g73180

      Solyc08g014060, eukaryotic translation initiation factor 3 subunit 9-like protein

      ch08, 3.57

      PI 129026

      {PI 196297, PI 390510} were divergent from other members of red-fruited clade

      PI 196297 was collected in Nicaragua in 1951, fasciated fruit, reported as introgressed by Rick [23]; carries same allele as PI 129026 (from Ecuador, 1938, fasciated fruit), PI 129128 (from Panama, 1938, fasciated fruit), PI 258474 (from Ecuador, 1959, fasciated fruit). PI 390510 was collected in Ecuador in 1974, described as a wild cherry tomato.

      PI 129128

       

      PI 196297

       

      PI 258474

       
         

      PI 390510

        

      a Nine tomato markers previously identified as carrying highly divergent alleles within Solanum lycopersicum.

      b Genetic linkage map positions from [45] or [46].

      c Sequence mapped to two locations, see Results and discussion.

      (Additional file 2: Figure S1).

      Linkage drag was inferred for at least three of the four markers at which TA496 carried an allele that was highly divergent from all other members of the red-fruited clade. All four mapped to chromosome 9 and spanned from 5.77 MB to 54.71 MB (Table 2). Introgressed disease resistance loci documented on chromosome 9 [42] include Ve1 (0.06 MB) and Ve2 (0.05 MB) both from Peru Wild, Frl (physical map position not annotated), Tm-2 a (13.62 MB) and Sw-5 (67.30 MB), all three from S. peruvianum, and Ph-3 (66.71-66.78 MB) from S. pimpinellifolium (Figure 1, Table 2). LA1537 was the most closely related allele to TA496 in cladograms for the three markers spanning 5.77 – 17.00 MB on chromosome 9, which encompasses the Tm-2 a locus at position 13.62 MB. LA1537 is an inbred accession that was derived from PI 128650, the original source of Tm-2 a (Table 1).

      Hybridization networks placed TA496 in various positions for each of the three markers surrounding Tm-2 a : near the red-fruited alleles with reticulation back towards S. peruvianum for marker 2486_1 (Figure 2a and Additional file 2: Figure S1), between the red-fruited alleles and S. peruvianum with reticulation back towards both for marker 2534_1b (Figure 2b and Additional file 2: Figure S1), and within the green-fruited wild species alleles between the two S. peruvianum accessions, with no reticulation for marker 220_1 (Figure 2c and Additional file 2: Figure S1). The size of the Tm-2 a introgressed region was estimated by mapping in segregating F2 populations [19]. RFLP marker TG101 at chromosome 9 location 50.41 MB was tightly linked (≤ 1 cM) to Tm-2 a . The chromosomal position of Tm-2 a has been characterized as very near the centromere with extremely repressed recombination [47]. It is therefore probable that markers 220_1 (5.77 MB), 2486_1 (5.90 MB) and 2534_1b (17.00 MB) were part of the introgressed segment in TA496.
      http://static-content.springer.com/image/art%3A10.1186%2F1471-2229-12-133/MediaObjects/12870_2012_1075_Fig2_HTML.jpg
      Figure 2

      Examples of splits networks at markers with cryptic introgressions in S. lycopersicum. Dashed boxes indicate accessions with introgressed alleles (a-d: hybridization networks using marker U146437 as a control, e: parsimony splits network); a) TA496 showed reticulation with S. peruvianum LA1537 and other green-fruited species, b) TA496 showed reticulation with red-fruited and green-fruited species, c) TA496 nested within green-fruited species, d) Merville des Marchés PI 109834 was closely related to S. pimpinellifolium, which showed reticulation to Tomate PI 99782, e) parsimony splits network used only conservative, global signal within the marker to illustrate the hybrid origin of Merville des Marchés PI 109834.

      The TA496 allele at marker 437_2 (54.71 MB) was not as definitively related to S. peruvianum and its origin was more difficult to interpret. The cladogram placed TA496 in an intermediate position between two clades, namely, {S. pimpinellifolium, Tomate, S. arcanum, Peru Wild, S. peruvianum} versus {S. habrochaites S. pennellii} (Additional file 2: Figure S1). The hybridization network showed reticulation with S. habrochaites and the red-fruited alleles (Additional file 2: Figure S1). Several genes commonly found in tomato varieties have originated from S. habrochaites (Cf-4 Tm-1 Ol-1 Cyc-B and Del) or S. pennellii (I 1 and I 3) but none of these map to chromosome 9 [43, 45]. The chromosome 9 introgressions listed in Table 2 were judged to be commonly used in cultivars based on an informal survey of descriptions from online seed catalogs (unpublished observations) as well as comprehensive reviews of tomato breeding (e.g. [16, 26, 4244]).

      Linkage drag of 437_2 in TA496 with Ph-3 (66.71 MB) or Sw-5 (67.30) was rejected because they originated from S. pimpinellifolium and S. peruvianum, respectively. Based on a BLASTN search of TA496 marker 437_2 against the Solanaceae PlantGDB-assembled Unique Transcipts (PUTs) in the Solanaceae Genomics Resource database (http://​solanaceae.​plantbiology.​msu.​edu/​), TA496 and S. habrochaites shared a sympleisiomorphy at nucleotide position 46 that was absent from all other alleles that we resequenced. This provided tentative but inconclusive evidence of a direct relationship between the introgression and S. habrochaites. As alternative evidence, FM6203, a progenitor of TA496, purportedly carries Asc resistance (pers comm. S. Loewen, University of Guelph, 2005) for which S. pennellii has served as one of the original sources on chromosome 3 in tomato [48].

      The remaining five markers with divergent alleles showed patterns consistent with introgression from S. pimpinellifolium in natural populations. At four markers (2325_3, C2_At1g44575, 2819_5 and U146140) the divergent alleles were more closely related to red-fruited rather than green-fruited taxa with bootstrap values ranging from 82% – 100% (Additional file 1: Table S2), although markers 2819_5 and U146140 did not support monophyly of red-fruited alleles (Additional file 2: Figure S1). Hybridization networks showed red-fruited species alleles to be distinct from green-fruited species alleles for 2325_3 and C2_At1g44575, while 2819_5 and U146140 showed connections between red-fruited and green-fruited species (Additional file 2: Figure S1). Marker U146140 nicely illustrated the S. lycopersicum x S. pimpinellifolium hybrid origin of Merville des Marchés PI 109834 (Figures 2d2e). Marker C2_At1g73180 showed an unusual pattern in that PI 196297 (and three additional accessions with the identical allele, Table 2) and PI 390510 were divergent from all other alleles in the cluster analysis with 99% bootstrap support (Figure 3a and Additional file 2: Figure S1). This suggested a potential paralog. However, no heterozygotes were observed, the marker did not map to more than one genomic location, and COSII markers were designed to amplify highly conserved single copy genes [27].
      http://static-content.springer.com/image/art%3A10.1186%2F1471-2229-12-133/MediaObjects/12870_2012_1075_Fig3_HTML.jpg
      Figure 3

      Extreme divergence of marker C2_At1g73180. a) Clustering at marker C2_At1g73180 showed the distinctiveness of PI 129026 and PI 196297 with 99% bootstrap support, b) the splits network showed a combination of introgression of PI 129026 and PI 196297 (among others, see Table 2) with S. pimpinellifolium and retention of ancient polymorphisms that reticulated down to the root; the latter supported genetic hitchhiking and rejection of selective neutrality.

      The divergent C2_At1g73180 alleles were unlikely to have originated from a green-fruited species because chromosome 8 does not carry any introgressed disease resistance alleles [16, 4244]. The hybridization network depicted PI 196297 and PI 390510 as branching from S. pimpinellifolium, with complex reticulations at the base of the red-fruited clade that extended through the green-fruited taxa, down to S. pennellii at the root (Figure 3b). Among the seven unique polymorphisms carried by PI 196297 and PI 390510, two were non-conservative amino acid substitutions, two were synonymous and three were intronic. One possibility is that selection at or near this locus has caused ancient polymorphism to have been retained. A significant HKA test [49] (χ 2 = 7.36, P = 0.007) strengthened this interpretation. Therefore, diversity at this marker showed patterns of both natural selection and introgression.

      Additional evidence that these five markers represent S. pimpinellifolium introgressions in natural populations includes geographical origins of three accessions from Ecuador where the two species hybridize extensively [50], original collection of three accessions dating to 1935 and 1938 (likely precluding the influence of direct introgression breeding), the primitive fruit phenotype of fasciation (four accessions) or wild cherry tomato (one accession, n.b., cherry tomato was described as an admixture of S. lycopersicum and S. pimpinellifolium by [35]), and previously reported introgression of PI 196297 with S. pimpinellifolium by Rick [23]. In estimates of population structure [51] PI 109834, PI 129128, PI 258474 and PI 258478 all showed high probability of membership in the second of two populations inferred for S. lycopersicum genotypes, consistent with interspecific hybridization [9].

      Conclusions

      It was useful to delineate cryptic introgression within S. lycopersicum into linkage drag stemming from breeding versus natural hybridization with S. pimpinellifolium, although this categorization was not definitive and should be subjected to further scrutiny. Sequences of the wild tomato species markers in the context of their physical map locations have strengthened our previous interpretations of detection of introgressed alleles in domesticated tomato [24, 25]. Genomic tools for fine resolution of introgressed regions in crop species are increasingly available. The strengths and weaknesses of comparative genotyping to verify introgression have been illustrated here using Solanum section Lycopersicon taxa.

      In the current study the implications of linkage drag of introgressed alleles on phenotype remain unknown. At least one marker (2534_1b) codes for an enzyme involved in fruit ripening (Table 2). In cultivars or breeding lines it will be more useful to estimate the proportion of a genome that harbors linkage drag. This should be feasible for TA496 using bioinformatics given the vast amount of public EST sequence data available for this particular line and wild tomato species (http://​solgenomics.​net/​tools/​blast/​dbinfo.​pl as of June 2012 reported 323,465 Lycopersicon mRNAs). Importantly, an understanding of linkage drag will help to distinguish it from selection during crop improvement. Four of six markers that previously rejected neutrality tests (437_2, 2486_1, 2534_1b and 2819_5) (Table 2 in [25]) were found to be introgressed rather than selected. It is anticipated that next-generation sequencing will be utilized to more rapidly eliminate linkage drag in crops [52].

      Natural hybrids will carry high proportions of wild alleles making them somewhat easy to detect using large numbers of molecular markers. The intrinsic value of naturally introgressed germplasm was recognized in common bean (Phaseolus vulgaris) as a source of new alleles for traits such as disease resistance [53]. In S. lycopersicum, if horticultural effects of introgression are subtle then accessions such as Merville des Marchés may be prime sources to screen for new alleles. A search of the literature for accessions that were part of the current study (Table 2) found that PI 129128 showed high lycopene content, similar to lines containing pigment mutations such as og c , hp, and dg[54].

      Finally, it is worth noting that Rick [23] reported the potential of natural hybridization of S. lycopersicum with Solanum chilense S. habrochaites (formerly Lycopersicon hirsutum f. glabratum), and Lycopersicon peruvianum (now revised into four species, [55]) in regions of Chile, Ecuador and Peru where sympatric populations grow in close contact, although he found no evidence of this based on phenotypes and severe postzygotic barriers are well known. Genotyping of populations sampled from these regions would provide evidence to reexamine whether introgression from these wild tomato species into S. lycopersicum has played a role in the crop’s evolutionary history.

      Methods

      Plant material

      Marker genotypes used in this study were previously reported for Solanum arcanum and S. lycopersicum[9, 25] or were newly collected from each of five wild tomato accessions and one weedy S. lycopersicum accession Peru Wild. The specific accessions of S. habrochaites S. pennellii S. pimpinellifolium and Peru Wild have served historically as sources of important disease resistance alleles for tomato cultivars (Table 1). The two S. peruvianum accessions were chosen because they were known to be inbred and predicted to be highly homozygous. One of these, S. peruvianum LA1537, originated from accession PI 128650 which was the original source of Tm-2 a [56, 57]. Two plants per each of the six accessions were sampled as seedlings for genomic DNA isolation and sequencing of markers. Three accessions included in this study had previously published sequences for the markers [25] (Table 1). These were − a breeding line with documented multiple introgressions in its pedigree including Tm-2 a (TA496) [57], an accession that predated tomato introgression breeding (Tomate, PI 99782), and S. arcanum (G 32591) a naturally self-fertilizing accession formerly classified as Lycopersicon peruvianum. For a few markers, published sequences from additional S. lycopersicum accessions (PI 109834 Merville des Marchés, PI 129026, PI 129128, PI 196297, PI 258474, PI 258478, PI 390510, Table 1) were included in analyses because they were previously reported as carrying highly divergent alleles, i.e. putative introgressions, at those loci [9, 25].

      DNA sequences

      Genomic DNA extraction from seedlings, PCR amplification and two-pass sequencing were as described in Labate et al. [28]. Initially, 49 of 50 markers from Labate et al. [9] were sequenced. These represent random loci including expressed genes (expressed sequence tag, EST-based), highly conserved genes (COSII and U) and arbitrary loci. Marker 1523_4 was excluded without testing because it tended to give poor quality sequence within S. lycopersicum. Markers 175_1 and 1909_2 were dropped during this study because they did not consistently amplify S. lycopersicum homologs in wild tomato species, leaving 47 markers representing approximately 24 kb in total (Additional file 1: Table S1). Software packages phred, phrap and Consed [58, 59] and Staden [60] were used for assembly and base calling of reads. Pregap4 (ver. 1.5) of the Staden package was configured to apply a base-calling algorithm “Estimate Base Accuracies“ that is different from phred in order to independently verify the data. Sequence data were trimmed to remove primer binding sites and low quality ends (phred<40), and manually aligned in BioEdit [61]. All SNPs and heterozygous positions were confirmed by visual examination of trace files by two people. If the two plants from one accession had different sequences they were kept distinct; if they were identical they were treated as a single representative sequence of that accession. Heterozygous sites were manually edited to use IUPAC nucleotide ambiguity codes. GeneSeqer (ver. 08 Oct. 2008) [62] was used to compare exon and intron prediction for all markers against previous annotations [28].

      All sequences were mapped using BLASTN [63] against tomato whole genome shotgun chromosomes (SL2.40) database with an e-value threshold of 1 x 10-40 on the SGN web site [64]. Gene models within markers and adjacent regions were identified in the SGN genome browser using ITAG2.3 Release: genomic annotations. For marker 437_2, tomato sequences were compared to transcribed sequences from an evolutionary out group (potato, Solanum tuberosum) by BLASTN searches of the Solanaceae Genomics Resource database at Michigan State University (http://​solanaceae.​plantbiology.​msu.​edu/​).

      Statistical analyses

      For each of the 47 markers, relationships among genotypes (also referred to as taxa) were first examined by applying the neighbor-joining (NJ) clustering method [65] as implemented in MEGA 4.0.2 [66] with 1,000 bootstrap replicates. Genetic distance and average evolutionary divergence (D) were estimated using the Jukes-Cantor method [34]; positions with alignment gaps or missing data were eliminated in pairwise sequence comparisons. For each of 11 markers that did not support the red-fruited clade based on MEGA results, consensus trees were generated by Phylip ver. 3.69 using Seqboot to produce 100 datasets by bootstrap resampling, Dnadist to estimate genetic distances using the Jukes-Cantor method, Neighbor to produce unrooted NJ trees and Consense to compute a consensus tree by the majority-rule consensus tree method [67]. SplitsTree4 ver. 4.12.3 [68] was used to create splits networks from DNA sequences or NJ trees [69]. Hybridization splits networks were created using the consensus tree of marker U146437 as a control that highly supported the red-fruited clade (99%) plus the consensus tree of the marker being tested for an introgression, with balanced sets of taxa (same taxa in each tree). A neutrality test [49] of marker C2_At1g73180 was carried out in DnaSP v. 5.10 [70] using marker TG11 as a control and S. arcanum as the out group.

      Abbreviations

      ARS: 

      Agricultural research service

      COSII: 

      Conserved Ortholog Set II

      EMBL: 

      European Molecular Biology Laboratory

      ENA: 

      European Nucleotide Archive

      EST: 

      Expressed sequence tag

      NJ: 

      Neighbor joining

      GM: 

      Genetically modified

      PGRU: 

      Plant Genetic Resources Unit

      PUT: 

      PlantGDB-assembled Unique Transcipts

      QTL: 

      Quantitative trait loci

      RFLP: 

      Restriction fragment length polymorphism

      SGN: 

      Sol genomics network

      SI: 

      Self-incompatible

      SNP: 

      Single nucleotide polymorphism

      U: 

      Unigene

      USDA: 

      United States Department of Agriculture.

      Declarations

      Acknowledgments

      We thank S. Sheffer and P. Kisly for excellent technical support and E. Cobb for stimulating discussions. This work was funded by CRIS project 1910-21000-019-00D. USDA is an equal opportunity provider and employer.

      The use of trade, firm, or corporation names in this publication is for the information and convenience of the reader. Such use does not constitute an official endorsement or approval by the United States Department of Agriculture or the Agricultural Research Service of any product or service to the exclusion of others that may be suitable.

      Authors’ Affiliations

      (1)
      USDA-ARS Plant Genetic Resources Unit

      References

      1. Evans PD, Mekel-Bobrov N, Vallender EJ, Hudson RR, Lahn BT: Evidence that the adaptive allele of the brain size gene microcephalin introgressed into Homo sapiens from an archaic Homo lineage. P Natl Acad Sci USA 2006,103(48):18178–18183.View Article
      2. Randi E: Detecting hybridization between wild species and their domesticated relatives. Mol Ecol 2008,17(1):285–293.PubMedView Article
      3. Tosi AJ, Detwiler KM, Clifford SL: X-chromosomal synapomorphies provide a non-invasive test for introgression among Cercopithecus monkeys. Conserv Genet 2006,7(5):803–805.View Article
      4. Hajjar R, Hodgkin T: The use of wild relatives in crop improvement: a survey of developments over the last 20 years. Euphytica 2007,156(1):1–13.View Article
      5. Dearing C: Muscadine grape breeding. J Hered 1917,8(9):409–424.
      6. Bremer G: A cytological investigation of some species and species hybrids within the genus Saccharum. Genetica 1923,5(2):97–148.View Article
      7. Prescott-Allen C, Prescott-Allen R: The first resource: Wild species in the North American economy. Yale University, New Haven; 1986.
      8. Baldo AM, Francis DM, Caramante M, Robertson LD, Labate JA: AlleleCoder: a PERL script for coding codominant polymorphism data for PCA analysis. Plant Genetic Resources: Characterization and Utilization 2011, 10:1–3.
      9. Labate JA, Sheffer SM, Balch T, Robertson LD: Diversity and population structure in a geographic sample of tomato accessions. Crop Sci 2011,51(3):1068–1079.View Article
      10. Gardner M: Indiana plant diseases, 1920. In Proceedings of the Indiana Academy of Science. Edited by: Breeze F. Fort Wayne Printing Co., Fort Wayne, Indiana; 1920:187–208.
      11. Boswell VR, Pearson OH, Work P, Brown HD, MacGillivray JH, Seaton HL, Starr GE, Bayles JJ, Friend WH, Hawthorn LR, et al.: Descriptions of types of principal American varieties of tomatoes. In: Misc Pub No 160. USDA, Washington, DC; 1933:176–206.
      12. Porte WS, Walker HB: The Pan American tomato, a new red variety highly resistant to Fusarium wilt. USDA Circ 611, Washington, DC; 1941.
      13. Rick CM, Chetelat RT: Utilization of related wild species for tomato improvement. Acta Hort 1995, 412:21–38.
      14. Zamir D: Improving plant breeding with exotic genetic libraries. Nat Rev Genet 2001,2(12):983–989.PubMedView Article
      15. Zamir D: Plant breeders go back to nature. Nat Genet 2008,40(3):269–270.PubMedView Article
      16. Foolad MR: Genome mapping and molecular breeding of tomato. Int J of Plant Genomics 2007, 2007:64358.
      17. Haanstra JPW, Laugé R, Meijer-Dekens F, Bonnema G, de Wit PJGM, Lindhout P: The Cf-ECP2 gene is linked to, but not part of, the Cf-4/Cf-9 cluster on the short arm of chromosome 1 in tomato. Mol Gen Genet 1999,262(4/5):839–845.PubMedView Article
      18. van der Beek JG, Verkerk R, Zabel P, Lindhout P: Mapping strategy for resistance genes in tomato based on RFLPs between cultivars: Cf9 (resistance to Cladosporium fulvum) on chromosome 1. Theor Appl Genet 1992,84(1/2):106–112.
      19. Young ND, Zamir D, Ganal MW, Tanksley SD: Use of isogenic lines and simultaneous probing to identify DNA markers tightly linked to the Tm-2a gene in tomato. Genetics 1988, 120:579–585.PubMed
      20. Randhawa HS, Mutti JS, Kidwell K, Morris CF, Chen X, Gill KS: Rapid and targeted introgression of genes into popular wheat cultivars using marker-assisted background selection. PLoS One 2009,4(6):e5752.PubMedView Article
      21. Klein RR, Mullet JE, Jordan DR, Miller FR, Rooney WL, Menz MA, Franks CD, Klein PE: The effect of tropical Sorghum conversion and inbred development on genome diversity as revealed by high-resolution genotyping. Crop Sci 2008, 48:S12-S26.View Article
      22. Rieseberg L, Baird S, Gardner K: Hybridization, introgression, and linkage evolution. Plant Mol Biol 2000, 42:205–224.PubMedView Article
      23. Rick CM: The role of natural hybridization in the derivation of cultivated tomatoes of western South America. Econ Bot 1958, 12:346–367.View Article
      24. Labate JA, Baldo AM: Tomato SNP discovery by EST mining and resequencing. Mol Breed 2005, 16:343–349.View Article
      25. Labate JA, Robertson LD, Baldo AM: Multilocus sequence data reveal extensive departures from equilibrium in domesticated tomato (Solanum lycopersicum L.). Heredity 2009, 103:257–267.PubMedView Article
      26. Robertson LD, Labate JA: Genetic resources of tomato (Lycopersicon esculentum Mill.) and wild relatives. In Genetic improvement of Solanaceous crops. Edited by: Razdan MK, Mattoo AK. Science Publishers, Enfield, NH; 2007:25–75. [Tomato, Volume 2]
      27. Wu F, Mueller LA, Crouzillat D, Petiard V, Tanksley SD: Combining bioinformatics and phylogenetics to identify large sets of single-copy orthologous genes (COSII) for comparative, evolutionary and systematic studies: A test case in the Euasterid plant clade. Genetics 2006,174(3):1407–1420.PubMedView Article
      28. Labate JA, Robertson LD, Wu F, Tanksley SD, Baldo AM: EST, COSII, and arbitrary gene markers give similar estimates of nucleotide diversity in cultivated tomato (Solanum lycopersicum L.). Theor Appl Genet 2009, 118:1005–1014.PubMedView Article
      29. Fulton TM, Van der Hoeven R, Eannetta NT, Tanksley S: Identification, analysis, and utilization of conserved ortholog set markers for comparative genomics in higher plants. Plant Cell 2002,14(7):1457–1467.PubMedView Article
      30. Jiménez-Gómez JM, Maloof JN: Sequence diversity in three tomato species: SNPs, markers, and molecular evolution. BMC Plant Biology 2009,9(1):85.PubMedView Article
      31. Shirasawa K, Asamizu E, Fukuoka H, Ohyama A, Sato S, Nakamura Y, Tabata S, Sasamoto S, Wada T, Kishida Y, et al.: An interspecific linkage map of SSR and intronic polymorphism markers in tomato. Theor Appl Genet 2010,121(4):731–739.PubMedView Article
      32. Rao ES, Kadirvel P, Symonds RC, Geethanjali S, Ebert AW: Using SSR markers to map genetic diversity and population structure of Solanum pimpinellifolium for development of a core collection. Plant Genetic Resources: Characterization and Utilization 2012.
      33. Pitblado RE, Kerr EA: Resistance to bacterial speck (Pseudomonas tomato) in tomato. Acta Hort (ISHS) 1980, 100:379–382.
      34. Jukes TH, Cantor CR: Evolution of protein molecules. In Mammalian protein metabolism. Edited by: Munro H. Academic, New York; 1969:21–132.
      35. Nesbitt TC, Tanksley SD: Comparative sequencing in the genus Lycopersicon: Implications for the evolution of fruit size in the domestication of cultivated tomatoes. Genetics 2002,162(1):365–379.PubMed
      36. The Tomato Genome Consortium: The tomato genome sequence provides insights into fleshy fruit evolution. Nature 2012,485(7400):635–641.View Article
      37. Soltis PS, Soltis DE: Applying the bootstrap in phylogeny reconstruction. Stat Sci 2003, 18:256–267.View Article
      38. Zhou YF, Abbott RJ, Jiang ZY, Du FK, Milne RI, Liu JQ: Gene flow and species delimitation: a case study of two pine species with overlapping distributions in southeast China. Evolution 2010,64(8):2342–2352.PubMed
      39. Rodriguez F, Wu F, Ané C, Tanksley S, Spooner DM: Do potatoes and tomatoes have a single evolutionary history, and what proportion of the genome supports this history? BMC Evol Biol 2009,9(1):191.PubMedView Article
      40. Zuriaga E, Blanca J, Nuez F: Classification and phylogenetic relationships in Solanum section Lycopersicon based on AFLP and two nuclear gene sequences. Genet Resour Crop Ev 2009,56(5):663–678.View Article
      41. Zhu Q, Zheng X, Luo J, Gaut BS, Ge S: Multilocus analysis of nucleotide variation of Oryza sativa and its wild relatives: Severe bottleneck during domestication of rice. Mol Biol Evol 2007,24(3):875–888.PubMedView Article
      42. Bai Y, Lindhout P: Domestication and breeding of tomatoes: What have we gained and what can we gain in the future? Ann Bot 2007,100(5):1085–1094.PubMedView Article
      43. Causse M, Caranta C, Saliba-Colombani V, Moretti A, Damidaux R, Rousselle P: Enhancement of tomato genetic resources via molecular markers. Cahiers Agricultures 2000, 9:197–210.
      44. Grube RC, Radwanski ER, Jahn M: Comparative genetics of disease resistance within the Solanaceae. Genetics 2000, 155:873–887.PubMed
      45. Riaño-Pachón D, Nagel A, Neigenfind J, Wagner R, Basekow R, Weber E, Mueller-Roeber B, Diehl S, Kersten B: GabiPD: the GABI primary database – a plant integrative ‘omics‘database. Nucleic Acids Res 2009,37(database issue):D954-D959.PubMedView Article
      46. Mutschler MA, Tanksley SD, Rick CM: 1987 Linkage maps of the tomato (Lycopersicon esculentum). TGC Report 1987, 37:5–34.
      47. Pillen K, Ganal M, Tanksley S: Construction of a high-resolution genetic map and YAC-contigs in the tomato Tm-2a region. Theor Appl Genet 1996,93(1–2):228–233.View Article
      48. van der Biezen EA, Glagotskaya T, Overduin B, John H, Nijkamp J, Hille J: Inheritance and genetic mapping of resistance to Alternaria alternata f. sp. lycopersici in Lycopersicon pennellii. Mol Gen Genet 1995, 247:453–461.PubMedView Article
      49. Hudson RR, Kreitman M, Aguadé M: A test of neutral molecular evolution based on nucleotide data. Genetics 1987,116(1):153–160.PubMed
      50. Nakazato T, Housworth EA: Spatial genetics of wild tomato species reveals roles of the Andean geography on demographic history. Am J Bot 2011,98(1):88–98.PubMedView Article
      51. Gao H, Williamson S, Bustamante CD: An MCMC approach for joint inference of population structure and inbreeding rates from multilocus genotype data. Genetics 2007, 176:1635–1651.PubMedView Article
      52. Berkman PJ, Lai K, Lorenc MT, Edwards D: Next-generation sequencing applications for wheat crop improvement. Am J Bot 2012,99(2):365–371.PubMedView Article
      53. Islam FMA, Beebe S, Munoz M, Tohme J, Redden RJ, Basford KE: Using molecular markers to assess the effect of introgression on quantitative attributes of common bean in the Andean gene pool. Theor Appl Genet 2004,108(2):243–252.PubMedView Article
      54. Hanson PM, Yang R, Wu J, Chen J, Ledesma D, Tsou SCS, Lee TC: Variation for antioxidant activity and antioxidants in tomato. J Am Soc Hortic Sci 2004,129(5):704–711.
      55. Peralta IE, Knapp S, Spooner D: The taxonomy of tomatoes: a revision of wild tomatoes (Solanum section Lycopersicon) and their outgroup relatives (Solanum sections Juglandifolium and Lycopersicoides). Syst Bot Mngr 2008, 84:1–186.
      56. Hall TJ: Resistance at the Tm-2 locus in the tomato to tomato mosaic virus. Euphytica 1980,29(1):189–197.View Article
      57. Tanksley SD, Bernachi D, Beck-Bunn T, Emmatty D, Eshed Y, Inai S, Lopez J, Petiard V, Sayama H, Uhlig J, et al.: Yield and quality evaluations on a pair of processing tomato lines nearly isogenic for the Tm-2a gene for resistance to the tobacco mosaic virus. Euphytica 1998, 99:77–83.View Article
      58. Ewing B, Green P: Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res 1998, 8:186–194.PubMed
      59. Ewing B, Hillier L, Wendl MC, Green P: Base-calling of automated sequencer traces using phred. I. Accuracy assessment. Genome Res 1998, 8:175–185.PubMed
      60. Staden R: The Staden sequence analysis package. Mol Biotechnol 1996,5(3):233–241.PubMedView Article
      61. Hall TA: BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucl Acids Symp Ser 1999, 41:95–98.
      62. Schlueter SD, Dong Q, Brendel V: GeneSeqer@ PlantGDB: Gene structure prediction in plant genomes. Nucleic Acids Res 2003,31(13):3597–3600.PubMedView Article
      63. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, Lipman DJ: Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res 1997, 25:3389–3402.PubMedView Article
      64. Bombarely A, Menda N, Tecle IY, Buels RM, Strickler S, Fischer-York T, Pujar A, Leto J, Gosselin J, Mueller LA: The Sol Genomics Network (solgenomics.net): growing tomatoes using Perl. Nucleic Acids Res 2011,39(Database issue):D1149-D1155.PubMedView Article
      65. Saitou N, Nei M: The neighbor-joining method: A new method for reconstructing phylogenetic trees. Mol Biol Evol 1987, 4:406–425.PubMed
      66. Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 2007, 24:1596–1599.PubMedView Article
      67. Felsenstein J: PHYLIP -- Phylogeny Inference Package (Version 3.2). Cladistics 1989, 5:164–166.
      68. Huson DH, Bryant D: Application of phylogenetic networks in evolutionary studies. Mol Biol Evol 2006, 23:254–267.PubMedView Article
      69. Huson DH, Klopper T, Lockhart PJ, Steel MA: Reconstruction of reticulate networks from gene trees. In Research in computational biology, Lecture notes in computer science. Edited by: Miyano JM S, Kasif S, Istrail S, Pevzner P, Waterman M. Springer, Berlin; 2005:233–249.
      70. Librado P, Rozas J: DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009, 25:1451–1452.PubMedView Article

      Copyright

      © Labate and Robertson; licensee BioMed Central Ltd. 2012

      This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://​creativecommons.​org/​licenses/​by/​2.​0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.