An association mapping approach to identify favourable alleles for tomato fruit quality breeding
© Ruggieri et al.; licensee BioMed Central Ltd. 2014
Received: 9 July 2014
Accepted: 17 November 2014
Published: 3 December 2014
Genome Wide Association Studies (GWAS) have been recently used to dissect complex quantitative traits and identify candidate genes affecting phenotype variation of polygenic traits. In order to map loci controlling variation in tomato marketable and nutritional fruit traits, we used a collection of 96 cultivated genotypes, including Italian, Latin American, and other worldwide-spread landraces and varieties. Phenotyping was carried out by measuring ten quality traits and metabolites in red ripe fruits. In parallel, genotyping was carried out by using the Illumina Infinium SolCAP array, which allows data to be collected from 7,720 single nucleotide polymorphism (SNP) markers.
The Mixed Linear Model used to detect associations between markers and traits allowed population structure and relatedness to be evidenced within our collection, which have been taken into consideration for association analysis. GWAS identified 20 SNPs that were significantly associated with seven out of ten traits considered. In particular, our analysis revealed two markers associated with phenolic compounds, three with ascorbic acid, β-carotene and trans-lycopene, six with titratable acidity, and only one with pH and fresh weight. Co-localization of a group of associated loci with candidate genes/QTLs previously reported in other studies validated the approach. Moreover, 19 putative genes in linkage disequilibrium with markers were found. These genes might be involved in the biosynthetic pathways of the traits analyzed or might be implied in their transcriptional regulation. Finally, favourable allelic combinations between associated loci were identified that could be pyramided to obtain new improved genotypes.
Our results led to the identification of promising candidate loci controlling fruit quality that, in the future, might be transferred into tomato genotypes by Marker Assisted Selection or genetic engineering, and highlighted that intraspecific variability might be still exploited for enhancing tomato fruit quality.
The genetic architecture of nutritional and quality traits in tomato has been extensively investigated due to the economic importance of this species worldwide. However, the genetic dissection of such traits is a challenging task due to their quantitative inheritance. To assist in this effort, an increasing number of genomic and genetic resources are today exploitable, including genome and transcriptome sequences, dense SNP maps, germplasm collections and public databases of genomic information -. The availability of these resources, the recent advances in high-throughput genomic platforms and the increasing interest in exploring natural genetic diversity, make association mapping an appealing and affordable approach to identify genes responsible for quantitative variation of complex traits. In the recent years, in order to dissect complex quantitative traits and identify candidate genes affecting such traits, the association mapping approach has been widely used -. This strategy relies on detecting linkage disequilibrium (LD) between genetic markers and genes controlling the phenotype of interest by exploiting the recombination events accumulating over many generations and thus increasing the accuracy of the associations detected. It offers several advantages over traditional linkage mapping, including an increased resolution, a reduced research time and a higher allele number detection ,. In addition, genome-wide association studies (GWAS) make it possible to simultaneously screen a large number of accessions for genetic variation, thus allowing identification of novel and superior alleles underlying diverse complex traits .
Many association studies have been published to date for studying morpho-physical and fruit quality traits in tomato. Mazzucato et al.  studied associations for 15 morpho-physiological traits using 29 Simple Sequence Repeat (SSR) markers in a collection of 61 accessions including mainly Italian tomato landraces. Recently, Ranc et al.  and Xu et al.  investigated morphological and fruit quality traits in cultivated tomato and its related wild species by using 352 and 192 markers, respectively. Shirasawa et al.  studied the association with agronomical traits, such as fruit size, shape and plant architecture, using an Illumina GoldenGate assay for 1,536 SNPs.
Association mapping requires high-density oligonucleotide arrays to efficiently identify SNPs distributed across the genome at a density that accurately reflects genome-wide LD structure and haplotype diversity. For tomato, a high-density single nucleotide polymorphism (SNP) array was recently built, which resulted suitable for genome-wide association analysis. The SolCAP array, with 7,720 SNPs based on polymorphic transcriptome sequences from six tomato accessions , is actually the largest platform to genotype tomato collections. The SNP distribution on the array reflects their origin, since they mostly derive from ESTs and thus from the euchromatic genomic regions, which in tomato have a very typical sub-telomeric distribution. The SolCAP platform was recently used to infer SNP effects on gene functions in tomato , to map two suppressors of OVATE (ov) loci , to reveal detailed representation of the molecular variation and structure of S. lycopersicum , to investigate the effect of contemporary breeding on the tomato genome  and to identify candidate loci for fruit metabolic traits . Here, a genome-wide association study in a collection of 96 tomato genotypes was undertaken using this high-quality custom-designed genotyping array. Phenotypic data for ten nutritional and quality traits were recorded over two consecutive field seasons. Using this strategy, additional associations and putative novel candidate genes were detected, compared to previous association studies that were carried out for some of the traits analysed in this study ,,,.
The tomato collection was phenotyped for five nutritional and five fruit quality traits. The former group included metabolites with antioxidant activity, such as ascorbic acid (AsA), β-carotene (β-C) cis-lycopene (c-LYC), trans-lycopene (t-LYC) and phenolics (PHE), whereas the latter consisted of dry matter (DMW) and fresh fruit weight (FW), pH, soluble solids content (SSC) and titratable acidity (TA). Detailed information on phenotyping performed for each trait and genotype is reported in Additional file 1.
Phenotypic variation of traits analysed in the whole collection
Ascorbic Acid (mg 100 g−1 FW)
β-carotene (μg g−1 FW)a
Trans-lycopene (μg g−1 FW)
Cis-lycopene (μg g−1 FW)
Phenolics (mg GAE 100 g−1 FW)
Dry matter weight (g 100 g−1 FW)
Fresh Weight (g)
pH (pH units)
Soluble solids content (Brix)
Titratable acidity (g c.a. 100 mL−1 juice)
The Pearson correlation coefficients (r) among traits (Additional file 2) showed a positive value between t-LYC and c-LYC (r = 0.89) and a negative value between pH and TA (r = −0.70). In addition, AsA, PHE and TA were positively correlated with SSC and negatively correlated with FW. PHE content was also negatively correlated with t-LYC and c-LYC (r = −0.38 and −0.49, respectively), whereas it was positively correlated with AsA (r = 0.55).
Genotyping and population structure
Multiple regression analysis between phenotypic traits and population structure
Dry matter weight
Soluble solids content
Association statistics of markers significantly associated with seven traits by Mixed Linear Model (MLM) with two different MAF thresholds (5% and 10%)
AsA content was associated with markers 2383 and 7588, which map on chromosome 3 spanning a region of 150 kbp, and with marker 1241 on chromosome 5. For markers on chromosome 3, genotypes with major alleles showed an increasing AsA level, compared to genotypes with minor alleles (Additional file 9). By contrast, for marker 1241 the minor allele incremented the phenotype. For β-carotene, the analysis revealed significant associations for markers 2022, 2025 and 2028 mapping on chromosome 1. Each markers explained approximately 20% of the phenotypic variation and the minor alleles in all cases contributed to enhance values. Markers 3525 and 3526, co-localized on chromosome 3, and marker 3104 mapping on chromosome 10, were associated with t-LYC with R2 values of 0.175 and 0.150, respectively. In all cases, the major alleles showed a very high effect with respect to the corresponding minor alleles. PHE was associated with markers 354 on chromosomes 8 and 4365 on chromosome 11. In both cases, the minor alleles increased the metabolite content.
FW was associated with six markers when the MLM was applied using MAF > 5%: the first was 2992 on chromosome 2, which explained about 17% of the phenotypic variation. Three markers co-segregated on chromosome 8, and mapped in the same gene (Solyc08g006170.1.1). The fifth marker 2275 explained the largest phenotypic variation (R2 = 0.239), and mapped 300 bp downstream to Solyc08g006170.1.1. The last marker was 1081 on chromosome 11, and it was the only confirmed using MAF >10%. Moreover, since the multiple regression analysis evidenced a great impact of the genetic structure only on FW, for this trait the association analysis was carried out also on the three separate Q sub-populations. Results confirmed that association of marker 1081 is maintained within the sub-populations (Q1, p-value = 1.90 E-05; Q2, p-value = 4.88E-05; Q3, p-value = 4.7E-02), suggesting that the association of this marker could be considered adequately robust. TA was associated with marker 955 on chromosome 1, which explained about 25% of the phenotypic variation (R2). The other five markers explaining the remaining part of phenotypic variation were marker 2032 mapping on chromosome 2, markers 443 and 3999 co-localized on chromosome 3, and markers 1010 and 1210 on chromosome 4. Finally, only one significant SNP was associated with pH, and explained 16.8% of phenotypic variation. Genotypes exhibiting minor allele for all markers associated with TA and pH had significantly higher value than genotypes with the major allele.
Results of association mapping studies depend on different factors, including type and size of mapping population, trait investigated, number of environments and years used for phenotyping, and type and genome coverage of molecular markers. The present study took into account a collection of cultivated tomato genotypes, including mainly Italian landraces but also Latin American and other worldwide-spread landraces and varieties. Genotypes were selected for the high variability of fruit morphological traits, such as size, shape, skin and flesh colour (data not shown), whereas little or no information was available regarding their nutritional and quality traits. Population structure and familial relationships, likely due to local adaptation, selection and breeding history, were found in the collection. Large populations are desirable for association mapping studies in order to obtain a high power to detect genetic effects of moderate size ,; however, there is a high cost associated with genotyping and phenotyping such populations, particularly for traits requiring extensive field trials, chemical or biochemical assays and a number of replications for measures’ reliability. Therefore, we assumed that the size of our tomato collection was adequate for association mapping studies, as previously reported for bean , peanut  and barley , as well as for tomato , in analyses that involved approximately 90 genotypes.
Concerning antioxidants traits, markers associated with AsA, β-C, t-LYC and PHE were searched for, since these are bioactive compounds exhibiting beneficial effects on human health . In particular, three markers (2383, 7588 and 1241) associated with AsA were identified, which differed from those detected by Sauvage et al.  using a similar GWAS approach, but exploiting accessions belonging to different tomato species. Two markers we identified corresponded to genes Solyc03g112630.2.1 and Solyc03g112670.2.1 mapping on chromosome 3 and were annotated as Fas-associated factor 1-like and Genomic DNA chromosome 5 P1, respectively. The other gene (Solyc05g052410.1.1) was located on chromosome 5 and annotated as Ethylene-responsive transcription factor 1 (ERF1). The Fas-associated factor 1-like protein is involved in an apoplastic mechanism and no direct evidence was reported to correlate its function with AsA accumulation. Since no specific functions were also assigned to Solyc03g112670.2.1, it was thought that the polymorphisms identified in this region of chromosome 3 could be in LD with other candidate genes. In order to verify this hypothesis, a scan was performed of the surrounding genomic area in LD with markers 2383 and 7588. A cluster of pectinesterases (120 kbp from marker 7588), one pectate lyase (240 kbp from marker 7588) and one polygalacturonase (350 kbp from marker 7588) were detected in LD block 23 on chromosome 3. These findings suggest that the alternative D-galacturonic biosynthetic pathway could contributes to regulate AsA variation in the tomato population under study, as previously reported in tomato  and other species ,. In addition, concerning the ERF1 gene, Di Matteo and colleagues  showed that in one S. pennellii introgression line a different expression of genes associated with ethylene biosynthesis might trigger pectin degradation resulting in AsA accumulation. Taken together, these results suggest a possible regulation of genes associated with markers 2383 and 7588 (related to pectin degradation) via Ethylene Responsive Factor 1 associated with marker 1241.
Cis and trans isomers of lycopene derive from a cascade of enzymatic reactions taking place in plastids . Intermediates in the first part of the pathway are cis-configured. A pro-lycopene isomerase (CrtISO) then produces all-trans-lycopenes from tetra-cis-lycopene. Subsequent reactions convert trans-lycopene into β-carotene by the action of a lycopene β-cyclase (β-Lcy). Although no associations were detected for c-LYC, three significant associations with t-LYC were identified. Markers 3525 and 3526, co-localized on chromosome 3, matched a putative metallocarboxypeptidase inhibitor (Solyc03g031480.2.1) and tyrosyl-DNA phosphodiesterase (Solyc03g031820.2.1) whereas marker 3104 on chromosome 10 did not match annotated genes. Interestingly, even if they are not directly linked to the trans-lycopene content, analysis of the genomic area highlighted the presence of a phytoene synthase 1 (Solyc03g031860.2.1) close to markers 3525 and 3526 (at 315 and 24 kbp, respectively) and a lycopene β-cyclase 2 (Solyc10g079480.1.1) at 9 kbp from marker 3104. This showed that the association mapping approach used was able to validate two candidate genes already known to be involved in the carotenoid pathway. In fact, the identified phytoene synthase 1, which catalyzes a rate-limiting step in the carotenoids pathway, corresponds to the locus “r”  that carries a recessive mutation conferring a characteristic yellow flesh phenotype. Four accessions in the population showed a genotype associated to the locus “r” and all have yellow flesh fruit as a consequence of a low trans-lycopene content. In addition, on chromosome 10, besides the lycopene β-cyclase, also a carotene isomerase (CrtISO, Solyc10g081650.1.1), which converts pro-lycopene to trans-lycopene , was localized 1.2 Mbp downstream marker 3104 (Figure 3). As concerns β-C, significant associations were found on chromosome 1 with Solyc01g087600.2.1 annotated as Protein E03H4.4, Solyc01g087670.2.1 annotated as a guanine nucleotide-binding protein, involved in blue light perception signal pathways , and Solyc01g087880.2.1, which has no homology with any gene of known function. These results prompted to investigate alternative genes in this region. Scanning the genomic area associated to these three markers (LD block 13 on chromosome 1), a 24-C-sterol-methyltransferase was found that is involved in steroid biosynthesis. Moreover, a cluster of three carotenoid cleavage dioxygenase 1 (CCD1) genes was also identified at 300 kbp from marker 2022. CCD1 genes cleave the carotenoid substrate at different double bonds to produce terpenoid flavour volatiles (apocarotenoids) that contribute to the overall aroma and taste of tomato fruit . It is hypothesized that the variation in the carotenoid pool may depend on the metabolic flux towards the cleavage reactions to produce apocarotenoids, but further functional experiments that will validate this hypothesis are required.
Finally, two polymorphic markers significantly associated with PHE were identified: marker 354 (Solyc08g082350.2.1), which encodes for a protein of unknown function, and marker 4365 (Solyc11g010170.1.1), encoding for a LanC-like protein2, which is involved in the modification and transport of peptides in bacteria . However, no well-defined functions were reported for the latter gene in plants , even if a probable involvement as one receptor for abscisic acid (ABA) was hypothesized . No gene of the phenolics pathways was detected in the putative region in association with marker 354. By contrast, significant co-localizations (LD block 6) found with marker 4365 included transport genes encoding two 14-3-3 proteins (Solyc11g010200 and Solyc11g010470), an ABC-2 transporter (Solyc11g009100) and a MATE efflux family protein (Solyc11g010380). The involvement of these transporters in enhancing the vacuolar compartmentalization of phenolic compounds was previously reported by Gomez et al.  and Di Matteo et al. , suggesting their probable role in the metabolism of this trait. A previous work  also identified QTLs for phenolic content in regions of chromosome 8 and 11 close to the markers detected, confirming the involvement of these regions in phenolics control.
Major fruit quality traits of interest for both the fresh market and processing tomatoes include fruit size, shape, total solids, colour, firmness, ripening, pH, titratable acidity, soluble solids content and dry matter. In this study, a large number of associations with FW and TA were found, only one association with pH and no association with SSC and DWM.
FW is a quantitatively inherited trait controlled by up to 28 QTLs, even though QTL analyses in previous studies revealed that most (67%) phenotypic variation in fruit size could be attributed to six major loci (fw1.1, fw1.2, fw2.1, fw2.2, fw3.2 and fw11.3) localized on chromosomes 1, 2, 3 and 11 -. The present study confirmed only one of the above loci (fw11.3).
Indeed, on chromosome 11 marker 1081 matched Solyc11g071840.1.1, annotated as a calmodulin binding protein, and was located in the LD block 40 that spans an interval of 167 kbp. This region contains both a portion of the fw11.3 locus (starting 20 kbp downstream of marker 1081) and the fas-YABBY locus (24 kbp upstream of the marker), previously hypothesized to determine fruit size . Therefore, the findings here reported not only confirmed the involvement of locus fw11.3 in FW variation but also restricted the region of putative candidate genes with respect to the previously identified region of 149 kbp, which included 22 predicted genes . Indeed, considering the LD block strategy used, marker 1081 was strongly associated only with a portion of fw11.3 of around 70 kbp (52,301,894 – 52,365,467), including eight predicted genes. This region must be enriched in SNPs to locate precisely one or more responsible polymorphisms associated to the trait and further investigation should be carried out to fine map the putative gene responsible for the trait variation.
Among important quality traits in tomato, TA influences shelf-life of processed tomato and low pH values reduce the risk of pathogen growth in tomato products . One locus associated with pH and five loci associated with TA were evidenced. Only marker 2246 that matches Solyc11g017070.1.1 on chromosome 11 (encoding for an eukaryotic translation initiation factor 3 subunit 2) proved associated with pH trait. An ATPase and a Sucrose_H+ symporter co-localized with this marker; they are proton pumps responsible for acidifying cellular compartment and might correlate with this trait. Moreover, previous studies identified a pH QTL in this region , supporting our hypothesis and confirming the possible involvement of this region in regulating the pH level in tomato fruit.
The most significant association with TA was with marker 955, which explains around 25% of the variation. The marker matched the predicted gene Solyc01g107550.2.1, which encodes for a methylthioribose kinase, an enzyme involved in recycling of methionine through the methylthioadenosine (MTA) cycle. The marker did not fall in any significant LD block on chromosome 1 and for this reason a narrow area around the marker was investigated to look for candidate genes. Four annotated genes not related to the trait under study and various unknown genes were identified, leaving the mechanism responsible for TA variation of this locus still obscure. Other significant associations were found on chromosome 3 for markers 443 and 3999 matching Solyc03g083440.2.1 (Glutamate synthase) and Solyc03g093310.2.1 (F-box family protein), respectively, in a region where previous studies detected QTLs for TA ,,. Markers 443 and 3999 are 1 Mbp away from one another and do not fall in the same LD block. Marker 443 falls in the 280 kbp LD block 15 that includes at least 40 genes, none of which might be so far involved in TA determination. On the other hand, for marker 3999 no significant LD block was inferred and few putative co-localizations could be highlighted. In particular, an aquaporin (Solyc03g093230.2.1) was identified as a potential candidate. The involvement of the aquaporin gene family in the modulation of fruit acidity was hypothesized in tomato antisense experiments, indicating a strong effect of this protein on the sugar/organic acid ratio in fruit . An additional locus for TA was found on chromosome 4 between markers 1210 and 1010. This region includes about 30 genes and we pointed out a purple acid phosphatase (Solyc04g005450), a metalloenzyme that hydrolyses phosphate esters and anhydrides under acidic conditions . No relevant co-localizations were instead found for marker 2032 on chromosome 2.
The association mapping approach undertaken allowed detection of 20 SNPs associated with seven traits that are essential for breeding work aimed at improving nutritional and quality traits in both fresh market and processing tomatoes. The findings suggest that the use of a high marker density array and a highly efficient statistical model (K + Q) were suitable for detecting associations with the traits considered. Indeed, the co-localization of a group of associated loci with formerly identified candidate genes/QTLs validated the approach chosen, as evidenced for markers related to t-LYC, β-C content and FW. Consequently, it can be argued that all the SNPs identified might be exploited in the future as markers targeting the specific desirable phenotype for assisted selection. This is noteworthy if considering the different allelic combinations we identified for some traits, whose pyramiding would further enhance tomato fruit quality of the improved genotypes. A further validation in independent accessions panels or bi-parental populations would be in any case desirable.
In addition, a number of new putative candidate genes were detected in the genomic area in linkage disequilibrium with markers that were not functionally congruent with the trait to which they were associated with. These promising genes might be involved in the pathways controlling the biosynthesis and accumulation of the metabolite/trait analysed, as in the case of a group of genes related to cell wall metabolism that might be hypothesized to contribute to a higher AsA content in tomato fruit or a group of vacuolar transporters that might regulate the accumulation of phenolic compounds. In order to detect the functional SNPs determining the different phenotypes, the identified candidate genes are being investigated by a target resequencing approach in a group of varieties belonging to the same collection. Finally the role of these new candidate genes will be validated in the future by functional genomics approaches.
Plant material consisted of 96 tomato genotypes, including Italian and Latin American landraces, and vintage and modern varieties collected from seed banks in Italy and worldwide. In detail, accessions were derived from Italian breeders’ collections, Regional and National Italian Institutions (Regione Campania, ARCA2010 Cooperative for Agriculture, MiPAF-CRA Centre for Research in Agriculture, University of Naples Federico II), and International Institutions (Plant Genetic Resources Unit, USDA, Tomato Genetics Resource Center, Davis, USA, Hebrew University of Jerusalem, Israel). All genotypes were grown during the seasons 2011 and 2012, according to a randomized complete block design with three replicates (10 plants per replicate), in field plots at the Agricultural Experiment Station of CRA-ORT in Battipaglia (Salerno, Italy). Field trials were conducted in accordance with the Italian legislation. All genotypes were subject to phenotypic and genotypic analyses. Out of 96 genotypes, six were excluded from phenotyping, since they produced few fruits or segregated for some morphological traits. Five additional genotypes were filtered out since were considered unreliable due to the large number of low quality SNP scores. Therefore, the number of samples finally used for association mapping was reduced to 85.
Chemical and physical traits were evaluated on ten fruits harvested at red ripe stage for each biological replicate as recommended in the SCAR Agro-Food Tomato Working Group . Traits included fresh weight (FW), total dry matter weight (DMW), soluble solids content (SSC), titratable acidity (TA) and pH.
Metabolic analyses were carried out on collected fruits stored at −80°C (three replicates of 5–8 fruits per accession): ascorbic acid (AsA), β-carotene, trans- and cis-lycopene, and phenolics content were measured as described below. AsA content was determined as reported by Di Matteo et al.  with minor modifications. Briefly, 500 mg of frozen powder were added to 300 μL of ice-cold 6% trichloroacetic acid (TCA) in 2 mL Eppendorf tubes. Samples were vortexed and left on ice for 15 min, and then centrifuged for 15 min at 25,000 g at 4°C. The supernatant was transferred to a clean tube and 20 μL were used for the assay, as described in the manuscript cited above. The AsA content was expressed as mg per 100 g−1 of FW.
Extraction and analysis of carotenoids were carried out on 5 g of fruit pericarp, according to Ishida et al. . Reversed Phase-High Performance Liquid Chromatography (RP-HPLC) analysis was performed through a Waters E-Alliance HPLC system constituted by a 2695 separations module with quaternary pump, autosampler, and a 2996 photodiode array detector; data were acquired and analyzed with Waters Empower software. The chromatographic separations were performed at a flow rate of 0.8 mL min−1 and at 0.005 AUFS (Absorbance Units Full Scale) by using a reversed phase, analytical polymeric C30 column (250 × 4.6 mm i.d.; 3 μm particle diameter; YMC, Wilmington, NC, USA). Results were expressed as μg g−1 of FW. Solvents used for sample preparation and extractions were of analytical grade, while those for HPLC analysis (methyl-t-butyl ether, methanol, ethyl acetate and tetrahydrofuran) were of HPLC grade; all were obtained from Merck (Darmstadt, Germany). Trans-carotenoid standards (lycopene and β-carotene) used in HPLC analyses were purchased from Sigma Chemical Co (Sigma-Aldrich Company, St. Louis, MO, USA).
Total phenolics content was assayed using a modified procedure of the Folin–Ciocalteu test . In brief, 250 mg of frozen ground tissue were homogenized in a mortar with pestle and extracted using 1 mL of 60% methanol. Samples were left on ice for 3 min in the dark. Crude extracts were transferred into a 15 mL tube and volume was increased to 5 mL by adding 60% methanol. The samples were centrifuged at 3000 g for 5 min; afterwards, 62.5 μL of the supernatant, 62.5 μL of Folin–Ciocalteu’s reagent (Sigma) and 250 μl of deionized water were mixed and incubated for 6 min; 625 μL of 7.5% sodium carbonate and 500 μL of deionized water were added to the samples and incubated for 90 min at room temperature in the dark. Absorbance was measured at 760 nm. The concentration of total phenolics was expressed in terms of mg of gallic acid equivalents (GAE) per 100 g−1 of FW.
For each year, the normal distribution of data was verified using a Shapiro-Wilk test. Six (β-C, PHE, FW, pH, SSC, TA) out of 10 traits were not normally distributed and were log10 transformed before performing the association mapping analysis. In order to test the consistency of phenotypic characterization over years, heritability values were calculated as reported in . Moreover, year and genotype effects were assessed by two-factor analysis of variance. Since the genetic effect over the two years was more significant than the genotype x year interaction for all traits, associations were calculated by using means over years. Pearson correlation coefficients were calculated among all pairs of traits. Analyses were carried out with the R program .
Samples were genotyped using a tomato array built in the framework of the Solanaceae Coordinated Agricultural Project (SolCAP) from NIFA/USDA and based on the ILLUMINA Infinium Technology. The SolCAP tomato panel includes 7,720 markers constructed on eSNPs deriving from six tomato genome sequences. Details of the SolCAP SNP discovery pipeline are described in Hamilton et al.  and Sim et al. . Information about SolCAP SNPs are available in the SGN database (http://solgenomics.net/).
For each accession, genomic DNA was extracted from fresh, young leaf tissue using the DNeasy Plant Mini kit (QIAGEN, Valencia, CA) according to the manufacturer’s recommendations. DNA quality and concentration were evaluated on agarose gel and spectrophotometrically using the Nanodrop instrument (Thermo Fisher Scientific, Wilmington, USA). Genotyping was conducted at the Genomix4Life S.r.l (http://www.genomix4life.com) using 250 ng of DNA per accession following the manufacturer’s protocol for the Illumina Infinium assay. Intensity data and SNP calls were performed by GenomeStudio version 1.7.4 (Illumina Inc., San Diego, CA, USA). SNPs were called using the Infinium chip cluster file based on the SolCAP tomato collection and a manual classification was implemented when the default clustering was not clearly defined. In addition, quality and reproducibility were tested using duplicated DNA samples.
The set of SNPs was filtered in order to perform molecular analyses. Markers with more than 10% missing genotypes and with minor allele frequency (MAF) <5% were removed. The physical position of all SNPs on the 12 tomato chromosomes was obtained from the SGN database (http://solgenomics.net).
Linkage disequilibrium (LD) values were calculated on SNP set with MAF greater than 5%. Pairwise r 2 between markers was calculated for each chromosome using TASSEL v.4.0 . Values of r 2 were plotted against physical distance and LD decay was inferred via locally weighted scatterplot smoothing (LOWESS) using R software and testing smoothing parameters fixed to 0.2. In order to determine the distance of LD decay, for each chromosome different r 2 baseline values (0.2, 0.3 and 0.5) were tested . A Bayesian population classification was carried out using STRUCTURE 2.3.3 . Since population structure estimates assume unlinked markers, a sub-set of 600 markers with an average distance of 600 kbp was chosen to perform STRUCTURE analysis. STRUCTURE runs were carried out with a length of burn-in and MCMC (Markov chain Monte Carlo) of 100,000 each. Twelve independent runs were conducted allowing K (number of populations) varying from 2 to 13. Optimal K was inferred by using the Evanno et al.  transformation method. The influence of the population structure on the phenotypic variation of considered traits was assessed by multiple regression analysis. Associations between genotypes and phenotypes were determined using the General Linear Model (GLM) and the Mixed Linear Model (MLM) in TASSEL v4.0. For both models the structured association model (Q model) was performed  and two different thresholds of MAF (>5% and >10%) were used. In the association analysis, we considered the kinship matrix based on the SNP data in the model of MLM, and the population structure covariates detected in the tomato accessions with the STRUCTURE analysis. The significance of association between traits and markers was estimated by using an adjusted P value (Bonferroni correction) and the threshold for the association was set to <5.88×10−4 (0.05/85). In addition, in order to estimate LD blocks according to the definition of Gabriel et al. , the HAPLOVIEW software  was used with the following parameters: MAF >0.05; Hardy–Weinberg P-value cut-off, 0; percentage of genotyped lines >0.75.
The list of genotypes including details on source and distribution, and the SNP genotype dataset are deposited on the LabArchives repository hosted http://dx.doi.org/10.6070/H4TT4NXN.
The authors wish to thank Dr. Mark Walters for editing the manuscript. Thanks are due to Ms. Giovanna Festa, Mr. Alberto Senatore and Dr. Domenico Perrone, CRA-ORT, for assistance in field trials and phenotyping. This research was supported by the Italian Ministry of Agricultural and Forestry Policy (MiPAF) [grants AGRONANOTECH, ESPLORA] and by the Italian Ministry of University and Research (MIUR) [grant MIUR-PON02-GenoPOMpro].
- The tomato genome sequence provides insights into fleshy fruit evolution. Nature. 2012, 485 (7400): 635-641. 10.1038/nature11119.Google Scholar
- Hamilton JP, Sim S-C, Stoffel K, Van Deynze A, Buell CR, Francis DM: Single nucleotide polymorphism discovery in cultivated tomato via sequencing by synthesis. Plant Genome. 2012, 5 (1): 17-29. 10.3835/plantgenome2011.12.0033.View ArticleGoogle Scholar
- Merk HL, Yarnes SC, Van Deynze A, Tong N, Menda N, Mueller LA, Mutschler MA, Loewen SA, Myers JR, Francis DM: Trait diversity and potential for selection indices based on variation among regionally adapted processing tomato germplasm. J Am Soc Horticultural Sci. 2012, 137 (6): 427-437.Google Scholar
- Robbins MD, Sim S-C, Yang W, Van Deynze A, van der Knaap E, Joobeur T, Francis DM: Mapping and linkage disequilibrium analysis with a genome-wide collection of SNPs that detect polymorphism in cultivated tomato. J Exp Bot. 2011, 62 (6): 1831-1845. 10.1093/jxb/erq367.PubMed CentralView ArticlePubMedGoogle Scholar
- Sim SC, Van Deynze A, Stoffel K, Douches DS, Zarka D, Ganal MW, Chetelat RT, Hutton SF, Scott JW, Gardner RG, Panthee DR, Mutschler M, Myers JR, Francis DM: High-density SNP genotyping of tomato (Solanum lycopersicum L.) reveals patterns of genetic variation due to breeding. PLoS One. 2012, 7 (9): e45520-10.1371/journal.pone.0045520.PubMed CentralView ArticlePubMedGoogle Scholar
- Sim S-C, Durstewitz G, Plieske J, Wieseke R, Ganal MW, Van Deynze A, Hamilton JP, Buell CR, Causse M, Wijeratne S, Francis DM: Development of a large SNP genotyping array and generation of high-density genetic maps in tomato. PLoS One. 2012, 7 (7): e40563-10.1371/journal.pone.0040563.PubMed CentralView ArticlePubMedGoogle Scholar
- Hall D, Tegstrom C, Ingvarsson PK: Using association mapping to dissect the genetic basis of complex traits in plants. Brief Funct Genomics. 2010, 9 (2): 157-165. 10.1093/bfgp/elp048.View ArticlePubMedGoogle Scholar
- Myles S, Peiffer J, Brown PJ, Ersoz ES, Zhang Z, Costich DE, Buckler ES: Association mapping: critical considerations shift from genotyping to experimental design. Plant Cell. 2009, 21 (8): 2194-2202. 10.1105/tpc.109.068437.PubMed CentralView ArticlePubMedGoogle Scholar
- Semagn K, Bjørnstad Å, Xu Y: The genetic dissection of quantitative traits in crops. Electron J Biotechnol. 2010, 13: 16-17. 10.2225/vol13-issue5-fulltext-14.View ArticleGoogle Scholar
- Zhu C, Gore M, Buckler ES, Yu J: Status and prospects of association mapping in plants. Plant Genome J. 2008, 1 (1): 5-10.3835/plantgenome2008.02.0089.View ArticleGoogle Scholar
- Yu J, Buckler ES: Genetic association mapping and genome organization of maize. Curr Opin Biotechnol. 2006, 17 (2): 155-160. 10.1016/j.copbio.2006.02.003.View ArticlePubMedGoogle Scholar
- Zhao K, Tung CW, Eizenga GC, Wright MH, Ali ML, Price AH, Norton GJ, Islam MR, Reynolds A, Mezey J, McClung AM, Bustamante CD, McCouch SR: Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oryza sativa. Nat Commun. 2011, 2: 467-10.1038/ncomms1467.PubMed CentralView ArticlePubMedGoogle Scholar
- Mazzucato A, Papa R, Bitocchi E, Mosconi P, Nanni L, Negri V, Picarella ME, Siligato F, Soressi GP, Tiranti B, Veronesi F: Genetic diversity, structure and marker-trait associations in a collection of Italian tomato (Solanum lycopersicum L.) landraces. Theor Appl Genet. 2008, 116 (5): 657-669. 10.1007/s00122-007-0699-6.View ArticlePubMedGoogle Scholar
- Ranc N, Muños S, Xu J, Le Paslier M-C, Chauveau A, Bounon R, Rolland S, Bouchet J-P, Brunel D, Causse M: Genome-wide association mapping in tomato (Solanum lycopersicum) is possible using genome admixture of solanum lycopersicum var. cerasiforme. G3: Genes Genomes Genet. 2012, 2 (8): 853-864. 10.1534/g3.112.002667.View ArticleGoogle Scholar
- Xu J, Ranc N, Munos S, Rolland S, Bouchet JP, Desplat N, Le Paslier MC, Liang Y, Brunel D, Causse M: Phenotypic diversity and association mapping for fruit quality traits in cultivated tomato and related species. Theor Appl Genet. 2013, 126 (3): 567-581. 10.1007/s00122-012-2002-8.View ArticlePubMedGoogle Scholar
- Shirasawa K, Fukuoka H, Matsunaga H, Kobayashi Y, Kobayashi I, Hirakawa H, Isobe S, Tabata S: Genome-wide association studies using single nucleotide polymorphism markers developed by re-sequencing of the genomes of cultivated tomato. DNA Res. 2013, 20 (6): 593-603. 10.1093/dnares/dst033.PubMed CentralView ArticlePubMedGoogle Scholar
- Hirakawa H, Shirasawa K, Ohyama A, Fukuoka H, Aoki K, Rothan C, Sato S, Isobe S, Tabata S: Genome-wide SNP genotyping to infer the effects on gene functions in tomato. DNA Res. 2013, 20 (3): 221-233. 10.1093/dnares/dst005.PubMed CentralView ArticlePubMedGoogle Scholar
- Rodríguez GR, Kim HJ, van der Knaap E: Mapping of two suppressors of OVATE (sov) loci in tomato. Heredity. 2013, 111 (3): 256-264. 10.1038/hdy.2013.45.PubMed CentralView ArticlePubMedGoogle Scholar
- Blanca J, Canizares J, Cordero L, Pascual L, Diez MJ, Nuez F: Variation revealed by SNP genotyping and morphology provides insight into the origin of the tomato. PLoS One. 2012, 7 (10): e48198-10.1371/journal.pone.0048198.PubMed CentralView ArticlePubMedGoogle Scholar
- Sauvage C, Segura V, Bauchet G, Stevens R, Thi Do P, Nikoloski Z, Fernie AR, Causse M: Genome wide association in tomato reveals 44 candidate loci for fruit metabolic traits. Plant Physiol. 2014, 165 (3): 1120-1132. 10.1104/pp.114.241521.PubMed CentralView ArticlePubMedGoogle Scholar
- Zhang N, Brewer MT, van der Knaap E: Fine mapping of fw3.2 controlling fruit weight in tomato. Theor Appl Genet. 2012, 125 (2): 273-284. 10.1007/s00122-012-1832-8.View ArticlePubMedGoogle Scholar
- Tomes ML, Quackenbush FW: Caro-red, a new provitamin a rich tomato. Economic Botany. 1958, 12: 256-260. 10.1007/BF02859771.View ArticleGoogle Scholar
- Bernardo R: Molecular markers and selection for complex traits in plants: learning from the last 20 years. Crop Sci. 2008, 48 (5): 1649-1664. 10.2135/cropsci2008.03.0131.View ArticleGoogle Scholar
- Galeano C, Cortes A, Fernandez A, Soler A, Franco-Herrera N, Makunde G, Vanderleyden J, Blair M: Gene-based single nucleotide polymorphism markers for genetic and association mapping in common bean. BMC Genet. 2012, 13 (1): 48-10.1186/1471-2156-13-48.PubMed CentralView ArticlePubMedGoogle Scholar
- Wang M, Sukumaran S, Barkley N, Chen Z, Chen C, Guo B, Pittman R, Stalker H, Holbrook C, Pederson G: Population structure and marker-trait association analysis of the US peanut (Arachis hypogaea L.) mini-core collection. Theor Appl Genet. 2011, 123 (8): 1307-1317. 10.1007/s00122-011-1668-7.View ArticlePubMedGoogle Scholar
- Gutierrez L, Cuesta-Marcos A, Castro A, von Zitzewitz J, Schmitt M, Hayes P: Association mapping of malting quality quantitative trait loci in winter barley: positive signals from small germplasm arrays. Plant Genome. 2011, 4 (3): 256-272. 10.3835/plantgenome2011.07.0020.View ArticleGoogle Scholar
- Atwell S, Huang YS, Vilhjalmsson BJ, Willems G, Horton M, Li Y, Meng D, Platt A, Tarone AM, Hu TT, Jiang R, Muliyati NW, Zhang X, Amer MA, Baxter I, Brachi B, Chory J, Dean C, Debieu M, de Meaux J, Ecker JR, Faure N, Kniskern JM, Jones JD, Michael T, Nemri A, Roux F, Salt DE, Tang C, Todesco M, et al: Genome-wide association study of 107 phenotypes in Arabidopsis thaliana inbred lines. Nature. 2010, 465 (7298): 627-631. 10.1038/nature08800.PubMed CentralView ArticlePubMedGoogle Scholar
- Raggi L, Tissi C, Mazzucato A, Negri V: Molecular polymorphism related to flowering trait variation in a Phaseolus vulgaris L. collection. Plant Sci. 2014, 215–216: 180-189. 10.1016/j.plantsci.2013.11.001.View ArticlePubMedGoogle Scholar
- Raiola A, Rigano MM, Calafiore R, Frusciante L, Barone A: Enhancing the health-promoting effects of tomato fruit for biofortified food. Mediators Inflamm. 2014, 2014: 16-10.1155/2014/139873.View ArticleGoogle Scholar
- Di Matteo A, Sacco A, Anacleria M, Pezzotti M, Delledonne M, Ferrarini A, Frusciante L, Barone A: The ascorbic acid content of tomato fruits is associated with the expression of genes involved in pectin degradation. BMC Plant Biol. 2010, 10: 163-10.1186/1471-2229-10-163.PubMed CentralView ArticlePubMedGoogle Scholar
- Agius F, Gonzalez-Lamothe R, Caballero JL, Munoz-Blanco J, Botella MA, Valpuesta V: Engineering increased vitamin C levels in plants by overexpression of a D-galacturonic acid reductase. Nat Biotechnol. 2003, 21 (2): 177-181. 10.1038/nbt777.View ArticlePubMedGoogle Scholar
- Hemavathi , Upadhyaya CP, Young KE, Akula N, Kim H, Heung JJ, Oh OM, Aswath CR, Chun SC, Kim DH, Park SW: Over-expression of strawberry d-galacturonic acid reductase in potato leads to accumulation of vitamin C with enhanced abiotic stress tolerance. Plant Sci. 2009, 177 (6): 659-667. 10.1016/j.plantsci.2009.08.004.View ArticleGoogle Scholar
- Fantini E, Falcone G, Frusciante S, Giliberto L, Giuliano G: Dissection of tomato lycopene biosynthesis through virus-induced gene silencing. Plant Physiol. 2013, 163 (2): 986-998. 10.1104/pp.113.224733.PubMed CentralView ArticlePubMedGoogle Scholar
- Rodriguez-Villalon A, Gas E, Rodriguez-Concepcion M: Phytoene synthase activity controls the biosynthesis of carotenoids and the supply of their metabolic precursors in dark-grown Arabidopsis seedlings. Plant J. 2009, 60 (3): 424-435. 10.1111/j.1365-313X.2009.03966.x.View ArticlePubMedGoogle Scholar
- Liu YS, Gur A, Ronen G, Causse M, Damidaux R, Buret M, Hirschberg J, Zamir D: There is more to tomato fruit colour than candidate carotenoid genes. Plant Biotechnol J. 2003, 1 (3): 195-207. 10.1046/j.1467-7652.2003.00018.x.View ArticlePubMedGoogle Scholar
- Warpeha KM, Lateef SS, Lapik Y, Anderson M, Lee B-S, Kaufman LS: G-Protein-coupled receptor 1, G-protein Gα-subunit 1, and prephenate dehydratase 1 are required for blue light-induced production of phenylalanine in etiolated arabidopsis. Plant Physiol. 2006, 140 (3): 844-855. 10.1104/pp.105.071282.PubMed CentralView ArticlePubMedGoogle Scholar
- Floss DS, Walter MH: Role of carotenoid cleavage dioxygenase 1 (CCD1) in apocarotenoid biogenesis revisited. Plant Signal Behav. 2009, 4 (3): 172-175. 10.4161/psb.4.3.7840.PubMed CentralView ArticlePubMedGoogle Scholar
- Chatterjee C, Paul M, Xie L, van der Donk WA: Biosynthesis and mode of action of lantibiotics. Chem Rev. 2005, 105 (2): 633-684. 10.1021/cr030105v.View ArticlePubMedGoogle Scholar
- Gao Y, Zeng Q, Guo J, Cheng J, Ellis BE, Chen J-G: Genetic characterization reveals no role for the reported ABA receptor, GCR2, in ABA control of seed germination and early seedling development in Arabidopsis. Plant J. 2007, 52 (6): 1001-1013. 10.1111/j.1365-313X.2007.03291.x.View ArticlePubMedGoogle Scholar
- Chen JG, Ellis BE: GCR2 is a new member of the eukaryotic lanthionine synthetase component C-like protein family. Plant Signal Behav. 2008, 3 (5): 307-310. 10.4161/psb.3.5.5292.PubMed CentralView ArticlePubMedGoogle Scholar
- Gomez C, Conejero G, Torregrosa L, Cheynier V, Terrier N, Ageorges A: In vivo grapevine anthocyanin transport involves vesicle-mediated trafficking and the contribution of anthoMATE transporters and GST. Plant J. 2011, 67 (6): 960-970. 10.1111/j.1365-313X.2011.04648.x.View ArticlePubMedGoogle Scholar
- Di Matteo A, Ruggieri V, Sacco A, Rigano MM, Carriero F, Bolger A, Fernie AR, Frusciante L, Barone A: Identification of candidate genes for phenolics accumulation in tomato fruit. Plant Sci. 2013, 205–206: 87-96. 10.1016/j.plantsci.2013.02.001.View ArticlePubMedGoogle Scholar
- Frary A, Gol D, Keles D, Okmen B, Pinar H, Sigva H, Yemenicioglu A, Doganlar S: Salt tolerance in Solanum pennellii: antioxidant response and related QTL. BMC Plant Biol. 2010, 10 (1): 58-10.1186/1471-2229-10-58.PubMed CentralView ArticlePubMedGoogle Scholar
- Causse M, Duffe P, Gomez MC, Buret M, Damidaux R, Zamir D, Gur A, Chevalier C, Lemaire-Chamley M, Rothan C: A genetic map of candidate genes and QTLs involved in tomato fruit size and composition. J Exp Bot. 2004, 55 (403): 1671-1685. 10.1093/jxb/erh207.View ArticlePubMedGoogle Scholar
- Grandillo S, Ku HM, Tanksley SD: Identifying the loci responsible for natural variation in fruit size and shape in tomato. Theor Appl Genet. 1999, 99 (6): 978-987. 10.1007/s001220051405.View ArticleGoogle Scholar
- Lippman Z, Tanksley SD: Dissecting the genetic pathway to extreme fruit size in tomato using a cross between the small-fruited wild species Lycopersicon pimpinellifolium and L. esculentum var. Giant Heirloom. Genetics. 2001, 158 (1): 413-422.PubMed CentralPubMedGoogle Scholar
- Munos S, Ranc N, Botton E, Berard A, Rolland S, Duffe P, Carretero Y, Le Paslier MC, Delalande C, Bouzayen M, Brunel D, Causse M: Increase in tomato locule number is controlled by two single-nucleotide polymorphisms located near WUSCHEL. Plant Physiol. 2011, 156 (4): 2244-2254. 10.1104/pp.111.173997.PubMed CentralView ArticlePubMedGoogle Scholar
- Huang Z, van der Knaap E: Tomato fruit weight 11.3 maps close to fasciated on the bottom of chromosome 11. Theor Appl Genet. 2011, 123 (3): 465-474. 10.1007/s00122-011-1599-3.View ArticlePubMedGoogle Scholar
- Foolad MR: Genome mapping and molecular breeding of tomato. Int J Plant Genomics 2007, 2007: Article ID 64358, 52 pages. doi: 10.1155/2007/64358.,Google Scholar
- Saliba-Colombani V, Causse M, Langlois D, Philouze J, Buret M: Genetic analysis of organoleptic quality in fresh market tomato. 1. mapping QTLs for physical and chemical traits. Theor Appl Genet. 2001, 102 (2–3): 259-272. 10.1007/s001220051643.View ArticleGoogle Scholar
- Fulton TM, Bucheli P, Voirol E, López J, Pétiard V, Tanksley SD: Quantitative trait loci (QTL) affecting sugars, organic acids and other biochemical properties possibly contributing to flavor, identified in four advanced backcross populations of tomato. Euphytica. 2002, 127 (2): 163-177. 10.1023/A:1020209930031.View ArticleGoogle Scholar
- Chen GP, Wilson ID, Kim SH, Grierson D: Inhibiting expression of a tomato ripening-associated membrane protein increases organic acids and reduces sugar levels of fruit. Planta. 2001, 212 (5–6): 799-807. 10.1007/s004250000431.View ArticlePubMedGoogle Scholar
- Antanaitis BC, Aisen P, Lilienthal HR, Roberts RM, Bazer FW: The novel “g’ =1.74” EPR spectrum of pink and purple uteroferrin. J Biol Chem. 1980, 255 (23): 11204-11209.PubMedGoogle Scholar
- Measurement of the quality of tomatoes. Recommendations of an EEC working group. Edited by: Eccher Zerbini P, Gorini F, Polesello A. 1991, IVTPA, MilanGoogle Scholar
- Ishida BK, Ma J, Chan B: A simple, rapid method for HPLC analysis of lycopene isomers. Phytochem Anal. 2001, 12 (3): 194-198. 10.1002/pca.576.View ArticlePubMedGoogle Scholar
- Singleton VL, Rossi JA: Colorimetry of total phenolics with phosphomolybdic-phosphotungstic acid reagents. Am J Enology Viticulture. 1965, 16 (3): 144-158.Google Scholar
- Team RDC: R: A Language and Environment for Statistical Computing. 2005Google Scholar
- Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES: TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007, 23 (19): 2633-2635. 10.1093/bioinformatics/btm308.View ArticlePubMedGoogle Scholar
- Breseghello F, Sorrells ME: Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics. 2006, 172 (2): 1165-1177. 10.1534/genetics.105.044586.PubMed CentralView ArticlePubMedGoogle Scholar
- Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155 (2): 945-959.PubMed CentralPubMedGoogle Scholar
- Evanno G, Regnaut S, Goudet J: Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005, 14 (8): 2611-2620. 10.1111/j.1365-294X.2005.02553.x.View ArticlePubMedGoogle Scholar
- Yu J, Pressoir G, Briggs WH, Vroh Bi I, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, Kresovich S, Buckler ES: A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006, 38 (2): 203-208. 10.1038/ng1702.View ArticlePubMedGoogle Scholar
- Gabriel SB, Schaffner SF, Nguyen H, Moore JM, Roy J, Blumenstiel B, Higgins J, DeFelice M, Lochner A, Faggart M, Liu-Cordero SN, Rotimi C, Adeyemo A, Cooper R, Ward R, Lander ES, Daly MJ, Altshuler D: The structure of haplotype blocks in the human genome. Science. 2002, 296 (5576): 2225-2229. 10.1126/science.1069424.View ArticlePubMedGoogle Scholar
- Barrett JC, Fry B, Maller J, Daly MJ: Haploview: analysis and visualization of LD and haplotype maps. Bioinformatics. 2005, 21 (2): 263-265. 10.1093/bioinformatics/bth457.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.