Skip to main content

Genetic diversity analysis of cultivated and wild grapevine (Vitis vinifera L.) accessions around the Mediterranean basin and Central Asia



The mountainous region between the Caucasus and China is considered to be the center of domestication for grapevine. Despite the importance of Central Asia in the history of grape growing, information about the extent and distribution of grape genetic variation in this region is limited in comparison to wild and cultivated grapevines from around the Mediterranean basin. The principal goal of this work was to survey the genetic diversity and relationships among wild and cultivated grape germplasm from the Caucasus, Central Asia, and the Mediterranean basin collectively to understand gene flow, possible domestication events and adaptive introgression.


A total of 1378 wild and cultivated grapevines collected around the Mediterranean basin and from Central Asia were tested with a set of 20 nuclear SSR markers. Genetic data were analyzed (Cluster analysis, Principal Coordinate Analysis and STRUCTURE) to identify groups, and the results were validated by Nei’s genetic distance, pairwise FST analysis and assignment tests. All of these analyses identified three genetic groups: G1, wild accessions from Croatia, France, Italy and Spain; G2, wild accessions from Armenia, Azerbaijan and Georgia; and G3, cultivars from Spain, France, Italy, Georgia, Iran, Pakistan and Turkmenistan, which included a small group of wild accessions from Georgia and Croatia. Wild accessions from Georgia clustered with cultivated grape from the same area (proles pontica), but also with Western Europe (proles occidentalis), supporting Georgia as the ancient center of grapevine domestication. In addition, cluster analysis indicated that Western European wild grapes grouped with cultivated grapes from the same area, suggesting that the cultivated proles occidentalis contributed more to the early development of wine grapes than the wild vines from Eastern Europe.


The analysis of genetic relationships among the tested genotypes provided evidence of genetic relationships between wild and cultivated accessions in the Mediterranean basin and Central Asia. The genetic structure indicated a considerable amount of gene flow, which limited the differentiation between the two subspecies. The results also indicated that grapes with mixed ancestry occur in the regions where wild grapevines were domesticated.


Vitis vinifera L., the commonly cultivated grapevine, is one of the most widely grown fruit plants in the world [1]. It has subspecies with West Asiatic and European origins, and ranges from Central Asia to the Mediterranean Basin [2]. Within the genus Vitis, V. vinifera is the primary species used in the global wine industry, which occupied 7.5 million hectares in 2012 and produced more than 67 million tons of grapes ( Within this species, two subspecies have been described, V. vinifera subsp. sylvestris, which includes the wild populations, and V. vinifera subsp. sativa, which includes the cultivated varieties that resulted from the domestication of the wild relatives [3]. The main phenotypic traits that distinguish the subspecies are: flower sex (dioecious for wild populations and hermaphroditic, or rarely female, for cultivated grapevines); and the seed morphology (spherical seeds with a small beak for sylvestris and pyriform seeds with a well-developed beak for the domesticated cultivars) [4, 5]. The two subspecies form a genetic and taxonomic continuum without breeding barriers resulting in spontaneous hybrids where they occur sympatrically or paripatrically [6,7,8,9,10,11,12].

Pioneering work of Negrul [13] divided the grapevine cultivars into three groups or proles: occidentalis, pontica and orientalis depending on geographic distribution and morphological and ecological differences. Grapevines found in the wide area extending from eastern Georgia, Armenia, Azerbaijan, and the former Soviet republics in Central Asia to the Near East have clear distinguishing features and were placed in the proles orientalis. Negrul recognized two sub-proles within this main group: caspica, composed of ancient vines used for vinification before the advent of Islam (from CE 500–1100), and the antasiatica including table and raisin grape cultivars of more recent origin. Varietal ecotypes found from Georgia to the Balkans were designated P. pontica sub-proles georgica and sub-proles balkanica, respectively.

Grape domestication occurred about 8000 years ago, during the Neolithic Age and was closely related to advances in winemaking in the Near East and area around Northern Mesopotamia [14,15,16]. The dissemination of grapevines from the primary domestication center into neighboring regions of Europe and Northern Africa followed three main pathways, first toward Mesopotamia, reaching the Southern Balkans and East Mediterranean Basin (end of the fifth millennium BCE), then toward Sicily to Western Europe and, finally, domesticated grapes were introduced to Central Europe during the first millennium BCE [16]. Meanwhile, during the fourth century BCE grapevine cultivation reached Central Asia, and near the second century BCE domesticated grapes were introduced into China and Japan [14, 15].

The cultivated grape V. vinifera subsp. sativa has played an important economic and cultural role throughout human history in different parts of the world. However, its ancestor the European wild grape V. vinifera subsp. sylvestris, is close to extinction. To capture and maintain the existing genetic diversity, researchers from East and West European countries under the framework of COST Action FA1003 (East-West collaboration for grapevine diversity exploration and mobilization of adaptive traits for breeding) initiated efforts to collect and preserve germplasm from a wide range of countries, including regions where autochthonous germplasm had not been investigated by genetic and ampelographic methods [17, 18].

The wild relatives of crop species have great importance to breeders as unique sources of genetic variation for breeding programs [19]. Wild grapevines are normally found in riparian ravines where they have access to water and can climb into the tree canopies. One impact of increased human population pressure is the destruction of natural habitats of wild flora and rapid erosion of genetic diversity. There is urgent need to characterize and conserve this valuable germplasm for future generations, and to design a strategy to preserve this species ex situ through extensive collections of wild grape that capture the genetic variation present in the Mediterranean basin and Central Asian regions. A closer analysis of Central Asian collections revealed that many genotypes resist fungal disease, such as downy mildew (Plasmopara viticola), powdery mildew (Erysiphe necator), and black rot (Guignardia bidwellii); all of which were supposedly introduced from North America about 150 years ago [20]. Other studies found that plants of V. vinifera subsp. sylvestris located in an area of Spain with heavy metal contamination exhibited high tolerance to copper stress [21]. Biotic and abiotic stresses from new pathogens, pests and a changing climate have spurred the creation of better-adapted varieties. Adequate genetic variation is the key to breeding crops capable of resisting these challenges.

Molecular analysis has provided insights into the genetic diversity of V. vinifera in relation to wild relatives, the genealogy of cultivars and the specific alleles linked to selected traits [15, 22, 23]. Although Central Asia is one of the centers of grapevine diversity, the majority of information about this region’s germplasm has emerged from accessions maintained in European and USA germplasm repositories [10, 12, 24]. The genotyping of wild and cultivated accessions from a broad range of viticultural areas at two large grapevine repositories provided a significant dataset capable of elucidating relationships within and between the two subspecies at the global level [10, 25]. Results from these studies suggest that grapevine spread from East-to-West after the first domestication process. The results also provide evidence of introgression from local sylvestris individuals with cultivated accessions [25], and the impact on genetic structure related to geographic origin and human use [10].

A limitation of previous examinations of grape genetic diversity was unbalanced sampling resulting in a germplasm collection set that was limited to one or more countries and was not broadly representative. In addition, the sylvestris and wild germplasm from the Caucasus Mountains and Central Asia was poorly represented or not analyzed in these studies. Although genetic, archeological and linguistic evidence suggests that southern Anatolia was the cradle of grape domestication, Transcaucasian remains a serious candidate as evidenced by ancient grape remains that were excavated from Neolithic archaeological sites in Azerbaijan as well as in Georgia [5]. Therefore, the results of previous studies may not present a complete picture of relationships between the wild and cultivated grapevine groups in that region and their association with the rest of world. The first large-scale characterization of both wild and domesticated grapevines, was done by Imazio et al. [12], utilizing SSR (Simple Sequence Repeats) fingerprint data from a set of 382 wild and 130 cultivated grapevine samples collected from Georgia. The results found four genetic groups, two for wild accessions and two for cultivated genotypes. The accessions from Georgia were included in a separate clade that highlighted the uniqueness of Georgian germplasm. Two other studies of grape germplasm from the Caucasus region also found that both wild and cultivated grapes had high genetic and morphological diversity [26, 27].

A previous study by Bacilieri et al. [10] analyzed genetic diversity of 2096 cultivated genotypes maintained in the Vassal germplasm collection and suggested the original center of grapevine domestication extended into many Central Asian countries. A comprehensive study that includes samples from the wild and cultivated groups, collected from opposing sides of an East-West gradient, and samples from Central Asian countries would provide a better understanding of the impact of geography and human selection on grapevine domestication and adaptive introgression. It would further allow us to determine the overall relationships of germplasm within the centers of domestication and with their wild progenitors. With these objectives, data were pooled from six previous studies {Laucou et al. [7], De Andrès et al. [8], Imazio et al. [12], Riaz et al. [24], Biagini et al. [28], Zdunić et al. [29]} and new data were generated for wild accessions collected from Croatia, Georgia, Armenia and Azerbaijan, to develop a well-balanced set that represented both subspecies and provided maximum representation of key geographical regions [Mediterranean basin and Central Asia (Spain, France, Italy, Croatia, Georgia, Armenia, Azerbaijan, Iran, Turkmenistan and Pakistan)]. SSR data were analyzed to infer the genetic structure of populations in wild and cultivated grapevines and to determine the role of Central Asian grapevine germplasm in the diversification of the cultivated gene pool. Results are discussed with emphasis on the conservation of wild germplasm tolerant to biotic and abiotic stress and its use in breeding programs.


Plant materials

A total of 1378 wild (V. vinifera spp. sylvestris) and cultivated (V. vinifera spp. sativa) samples from Transcaucasia (Armenia, Azerbaijan and Georgia), the Caspian Sea region (Turkmenistan and Pakistan), and Europe (Croatia, France, Italy and Spain) were included in the study. Table 1 and Additional file 1: Table S1 present a detailed list of the analyzed accessions based on their geographical origin and habitats. This list includes 975 samples of sativa and sylvestris germplasm from France, the Iberian Peninsula, Georgia, Turkmenistan, Pakistan, Italy, and Croatia that were genotyped in previous studies by Laucou et al. [7], De Andrès et al. [8], Imazio et al. [12], Riaz et al. [24], Biagini et al. [28] and Zdunić et al. [29]. In this work, 403 new accessions of V. vinifera spp. sylvestris from Armenia, Azerbaijan, Georgia and Croatia were genotyped. The wild germplasm from Armenia, Azerbaijan, and Georgia was collected as seeds from female vines gathered on two different collection trips. Seedling plants from a total of 17 seed lots are maintained in the USDA National Clonal Germplasm Repository in Davis, California, USA. The sylvestris samples from Croatia were collected from plants located in their natural habitats mostly along the Krka and Neretva rivers in 2013. Care was taken to select plants that were dioecious and notes were made for the flower phenotype and leaf morphology [29]. The Spanish accessions collected from natural habitats are maintained in the “El Encín” germplasm repository (Madrid, Spain). The French sylvestris accessions are maintained in the INRA “Domaine de Vassal” germplasm collection, and the Italian [28] and Georgian [12] samples are maintained in the germplasm repository of the University of Milan (Milano, Italy).

Table 1 List of cultivated and wild accessions of Vitis vinifera (1378) grouped into countries based on their geographic origin and analyzed by 20 SSR markers. Number of samples for each country is presented in brackets

DNA extraction and genotyping

Total genomic DNA was extracted from young leaves using DNeasy Plant Mini Kits (Qiagen, Valencia, CA, USA). Genotyping was carried out by amplifying 20 nuclear SSR loci: VMC1b11, VMC4f3.1, VVIb01, VVIh54, VVIn16, VVIn73, VVIp31, VVIp60, VVIq52, VVIv37, VVIv67, VVMD21, VVMD24, VVMD25, VVMD27, VVMD28, VVMD32, VVMD5, VVMD7, VVS2 [7]. The amplifications were performed as reported in [7]. The amplified loci were detected on an automated ABI 3500 Genetic Analyzer (Applied Biosystems, Life Technologies, Foster City, CA, USA). Allele sizes were scored using GeneMapper 4.0 software (Applied Biosystems, Life Technologies) and recorded in base pairs.

Data analysis

Determination of flower phenotype

The flower phenotype of the V. vinifera subsp. sylvestris samples collected from Armenia, Azerbaijan, Georgia and Croatia was determined by a combination of a specifically designed marker from gene APT3 (adenine phosphoribosyl transferase) that is capable of distinguishing female plants from males or hermaphrodites [24]. We also used a specific allele of the SSR marker VVIb23 that is closely linked with the sex locus on chromosome 2, and is capable of distinguishing hermaphrodites from female or male plants. The VVIb23 locus polymorphism has been detected and reported in [30]. A total of 403 accessions were analyzed with these two markers to assign flower phenotype. The flower phenotypes of additional wild accessions from other countries were determined either during the time of collection or from plants maintainted in the germplasm repositories.

Genetic diversity

In order to combine the fingerprint data of new genotypes with previous data sets [7, 8, 12, 24, 28, 29], genetic profiles of eight reference cultivars (Cabernet Sauvignon, Chardonnay, Dolcetto, Pinot noir, Riesling, Thompson Seedless, Zinfandel, and Sangiovese) were used as references to standardize the allele calls.

The genetic diversity among groups and over all the groups was estimated. The normalized SSR genotyping data were used to determine the number of different alleles (Na), the effective number of alleles (Ne), Shannon’s Information Index (I), observed heterozygosity (Ho) and expected heterozygosity (He; [31]). The parameters were estimated by GenAlEx 6.5 software [32]. Weir and Cockerham’s F-statistics (FIS, FIT, FST; [33]) per locus and FIS values per each population were detected via FSTAT 2.9.3 and Arlequin softwares, respectively [34, 35] p-values were evaluated over 1000 permutations. Allelic richness (AR) and private allelic richness (PAR) for each population were estimated using the rarefaction method, which compensates for differences in sample size (i.e. rarified allelic richness) among populations as implemented in HP-Rare 1.1 [36]. The effective number of migrants per generation (Nm) among the 12 grapevine populations and between the two subspecies was estimated using the private allele method of Barton and Slatkin [37] (1986) using GENEPOP 3.4 software [38].

Genetic relationships and differentiation

Poppr [39] package implemented in R 3.1 software [40] was used to design a phylogenetic tree with Neighbor-Joining. The distance matrix used in Poppr was calculated based on the Nei’s distance [41]. The unrooted dendrogram was plotted with the R package ape [42]. To measure how well the hierarchical structure from the dendrogram represents the actual distances, the cophenetic correlation coefficient (CCC) has been calculated performing the cophenetic function implemented in R software. Hclust R function was used to perfume hierarchical clustering using a neighbor-joining agglomeration method. In order to elucidate the genetic relationships within and among geographic groups, principal coordinate analysis (PCoA) was performed on the multilocus microsatellite data, which was then arranged into geographic groups using the package adegenet implemented in R [43]. Clustering validation and multivariate analysis was carried out using pairwise Nei’s genetic distance [44] and pairwise FST in GenAlEx 6.5 software. Finally, an analysis of molecular variance (AMOVA, [45]) was performed to characterize the partition of the observed genetic variation among and within populations and genetic groups using Arlequin software. The significance test was performed over 1000 permutations.

Analysis of population structure

The microsatellite data were subjected to a Bayesian model-based cluster analysis using STRUCTURE 2.0 [46] to determine the optimum number of genetically supported groupings. STRUCTURE allocates individuals into a number of clusters (K) independent of population information based on genotypic data, so as to minimize deviations from Hardy-Weinberg and linkage equilibrium. The program uses a Markov Chain Monte Carlo (MCMC) procedure to estimate P(X|K), the posterior probability that the data fit the hypothesis of K clusters. The analysis assigns individuals to each of the K clusters based on the membership coefficient (Q-value), which sums to unity over the number of clusters (K) assumed. STRUCTURE was set to ignore population information, and to use an admixture model with correlated allele frequencies as it is considered to be the best option for subtle population structure [47]. The degree of admixture, alpha, was allowed to be inferred from the data. Alpha is close to zero when most individuals are from one population or another, while alpha is greater than one when most individuals are admixed [48]. The allele frequency parameter (lambda) was set to one. During a pilot study, it was found that a burn-in and MCMC (Markov Chain Monte Carlo) simulation lengths of 100,000 replicate runs were optimum to produce accurate parameter estimates. The number of clusters (K) varied from 2 to 10, and 20 replicate runs were carried out to quantify the variation of the likelihood for each K. The K value that provides the maximum likelihood (Ln P(D) in STRUCTURE) across runs is generally inferred as the most probable number of clusters. Nevertheless, the interpretation of K should be treated with care as it merely provides an ad hoc approximation [46] and sometimes genuine and fine population structure may be missed by STRUCTURE. Therefore, we used an ad hoc statistic ΔK to choose the optimum number of clusters (K) based on the second order rate of change in the log probability of data between successive K values as proposed by Evanno et al. [48].


Flower phenotype in the wild accessions

Flower sex phenotype and seed morphology are key criteria normally used to differentiate subsp. sylvestris (dioecious vines, seeds with short beaks) from cultivated sativa forms (predominantly hermaphroditic flowers, seeds with larger beaks). The search for wild accessions was focused on collecting dioecious individuals because most cultivated genotypes are hermaphrodites. Flower phenotype data from the wild samples from Spain and Italy were recorded in the field and previously reported by Benito et al. [49] and Biagini et al. [28, 50]. The sylvestris samples from France, Georgia (University of Milan repository) and Croatia were collected from natural habitats and flower phenotypes were recorded based on the presence of fruit (female) and flower rachis without fruit (male) during collection. Only samples that met the basic dioecious phenotypic profile and leaf morphology of wild grapevines were included in the study. The flower phenotype of the subsp. sylvestris accessions collected from Armenia, Azerbaijan and Georgia (USDA repository) could not be determined because these plants were maintained in small containers. A combination of two DNA markers was used to differentiate the male, hermaphrodite and female flower phenotype for the set of 403 accessions from Armenia, Azerbaijan, Georgia and Croatia (Additional file 2: Table S2). Field phenotypic observations for the 38 accessions from Croatia matched the flower phenotype predicted by DNA analysis. Flower phenotypes assessed by DNA-based flower sex markers and field phenotyping of the wild forms of all the accessions of V. vinifera subsp. sylvestris are presented in Additional file 2: Table S2.

Genetic diversity for sativa and sylvestris germplasm

Genetic data from 20 SSR loci and across 1378 grapevine samples, originating from Asia to Europe (Table 1) and representing both subspecies of V. vinifera (sativa and sylvestris), were used in this study. Additional file 1: Table S1 provides the allelic profiles of all analyzed samples. The number of alleles ranged from 11 for VVIq52 to 38 for VMC4f3.1 with an average of 20.95 alleles/locus. The number of effective alleles ranged from 2.192 for VVIn73 to 7.004 for VVIp31 with an overall average of 4.651. Both observed and expected heterozygosity varied greatly among loci and results of the fixation index with most loci suggested high levels of inbreeding (Table 2). The He values ranged from 0.477 (VVIn73 locus) to 0.803 (VVS2), with a mean value equal to 0.678. While, the Ho values varied from 0.535 (VVIn73) to 0.845 (VVIp31) and the mean overall value was 0.742. The locus with the lowest F value was VVIb01 (0.021), while the highest was VVIq52 (0.189). The mean F value for the dataset was 0.088.

Table 2 Diversity indices* calculated for 1378 distinct genotypes including sativa and sylvestris accessions from Asia to Europe

Allelic profiles were used to calculate statistical indices and determine the genetic diversity of the cultivated and wild genotypes (Table 3). The number of alleles per locus (Na) was 9.120 for sativa and 9.164 for sylvestris samples. The Italian cultivars had the lowest Na value (4.900) of the cultivated accessions and the highest Na value (12.600) was detected in the Georgian cultivars. The number of alleles per locus for the wild accessions varied between 7.050 (Armenia) and 12.850 (Georgia). The Ne value over the whole dataset was 4.441. The sativa accessions from Italy (3.688) and sylvestris accessions from France (2.792) had the lowest Ne values. The highest Ne values were detected in cultivated accessions (5.751) and wild individuals (6.016, Table 3) from Georgia. Within sativa, the allelic richness, adjusted to a minimum sample size of 42 genes, ranged from 6.200 alleles for Spanish accessions to 9.330 for Italian accessions, with an overall mean of 7.848 alleles across loci. Within the sylvestris accessions, allelic richness ranged from 5.870 for the Armenian group to 10.200 for the Georgian group with an overall mean of 7.089 across loci. The private allelic richness for sativa ranged from 0.020 for the Spanish and French groups to 0.520 for the Italian and Turkmenistan/Pakistani groups with an overall mean frequency of 0.314 alleles across loci. Within sylvestris, this richness ranged from 0.020 for the Azerbaijani accessions to 0.980 for Georgian wild grapes with an overall mean of 0.344 private alleles per locus. The mean Shannon’s Information Index (I) value for the wild accessions was slightly lower than that for the cultivars (1.60 vs. 1.641), with an overall value of 1.619 (Table 3). In general, the Ho values were lower than He values for each group, except for cultivated samples from France (0.765 vs. 0.708) and Italy (0.798 vs. 0.682). The Ho value for sativa was higher than sylvestris (0.754 vs. 0.649), while the overall mean value (0.692) was more similar to the sylvestris value than the sativa value. The He value for sativa (0.735) was higher than the sylvestris value (0.722).

Table 3 Genetic diversity estimates in wild and cultivated grapevines for each analyzed population. Results are arranged based on the geographical origin and habitat

The samples were arranged in 12 groups based on their origin and subspecies, and FIS values were calculated (Table 3). The values ranged from − 0.166 (Italian sativa samples) to 0.138 (Georgian sylvestris samples). The values for the sylvestris populations were generally higher than the sativa populations. Among the wild accessions, populations from Georgia and Spain had the highest FIS values (0.138 and 0.131, respectively). The populations of cultivated accessions with the highest inbreeding coefficient were from France (0.057) and Georgia (0.066). The FIS value over all loci and populations was 0.151 and the sativa value was lower than that for sylvestris (0.039 versus 0.169). Most of the FIS values had a p-value lower than 0.1.

Cluster analysis

The neighbor-joining (NJ) cluster analysis based on the pair-wise distance matrix showed clear differentiation between the two subspecies (Fig. 1). A number of wild individuals clustered with the cultivated samples and vice versa. The dendrogram showed three main groups with cophenetic correlation coefficient (CCC) value of 0.75 (Fig. 1). The sylvestris accessions divided into two groups and sativa accessions formed a third major group. The first group of wild germplasm contained most of the Transcaucasian sylvestris accessions from Armenia (#1), Azerbaijan (#2) and Georgia (#5) and the second group consisted of the European wild accessions from Croatia (#3), France (#4), Italy (#6) and Spain (#7). The Spanish wild accessions were further split into two groups, one of them including the French wild samples (#4). There were two sub-groupings within the sativa cluster, one containing the French (#8), Italian (#10), Spanish (#11) and Turkmenistan-Pakistan samples (#12), and the other containing some of the Georgian samples (#9). Two additional minor clusters were identified, both containing Georgian samples. One of these contained the wild samples (#5) and the other both wild and cultivated samples (#5 and #9). The latter cluster also contained a small group of Italian cultivars (#10).

Fig. 1
figure 1

NJ dendrogram showing relationships among 1378 cultivated and wild grapevine accessions obtained by data analysis from 20 SSR loci. Samples are arranged based on their origin and membership in the sativa and sylvestris subspecies

Population structure analysis and differentiation

In order to identify the structure of populations and the correlations among samples, two different analyses were performed. PCoA was based on the genetic distance matrix obtained by the SSR profiles. Projections of the PCoA were plotted in a 2-dimension scatter plot (Fig. 2). The PCoA 2D projection of the first two principal axes accounted for ~ 32% of the total molecular variation (Fig. 2). Significant differentiation between the two subspecies and the European and Transcaucasian sylvestris groups was observed. The sylvestris samples from Armenia (#1), Azerbaijan (#2) and Georgia (#5) were clearly differentiated from the rest of the sativa and sylvestris groups. The European sylvestris groups (#3, #4, #6 and #7) formed overlapping clusters, as did the accessions from Armenia (#1) and Georgia (#5). All five groups of sativa from Europe (#8, #10 and #11), Georgia (#9), Turkmenistan and Pakistan (#12) were closely associated. The sativa groups were closely associated with sylvestris accessions from Europe (#3, 4, 6, 7) and Transcaucasia (#1, 5), with the exception of the sylvestris accessions from Azerbaijan (#2). There was large variability within each of these groups and subspecies. The second method used to evaluate the relationship among genotypes was a clustering algorithm implemented in the program STRUCTURE. The Bayesian analysis results of genetic structure for the wild (sylvestris) and cultivated grapevines (sativa) were roughly comparable with the NJ cluster analysis and PCoA results, but STRUCTURE did not detect subtle differentiation among some of the populations. The estimated log probability values [Ln Pr (X|K)] for different K gradually increased reaching a maximum value at K = 3 with non-significant variation among replicate runs, beyond which the rate of increase between successive K decreased and variance among runs increased (Fig. 3). Plotting the second order rate of change of the log probability of data (∆K) with respect to the number of clusters, against K predicts the true K according to Evanno et al. [48], and such analysis produced a clear peak at K = 2, but the second order rate of change of likelihood distribution showed that the rate of change is bigger between K = 3 and 4, therefore, K = 3 is the most likely number of clusters in the genetic structure of these grape populations. About 84% of genotypes were assigned to a cluster at K = 3, with a percentage of assignment higher than 80%. The proportion of admixed genotypes was about 16% (Additional file 3: Table S3). Plotting the Q matrix values (the estimated membership coefficients for each individual in each K clusters) for K = 3 (Fig. 3), revealed clusters roughly corresponding to the two major groups within sylvestris, one from the Caucasus (Armenia, Azerbaijan and Georgia; G2) and the other from Europe (Croatia, France, Italy and Spain; G3), and one group with the French, Georgian, Italian and Spanish sativa accessions (G1). As observed in the NJ cluster analysis and PCoA, there were genotypes with mixed ancestry in all three groups. The populations with the highest percentage of admixed samples were Armenia (39%) and Georgia (49%) for wild groups and France (32%) for sativa accessions (Additional file 3: Table S3).

Fig. 2
figure 2

Relationships between wild and cultivated grapevine genotypes (1378) as represented by the first two principal coordinates of a PCoA using allelic profiles from 20 SSR molecular markers. Samples are arranged based on their origin and membership in the sativa and sylvestris subspecies

Fig. 3
figure 3

Barplot displaying the admixture proportions of wild and cultivated grapevine genotypes as estimated by STRUCTURE analysis at K = 3 and 7. The Evanno’s ΔKs statistics indicated K = 3 as the best supported level of population subdivision using simulation model with K values ranging from 2 to 10

Population structure among the 12 tested populations, irrespective of the subspecies, was summarized by the Wright’s F-statistics (FIT, FST and FIS) (Additional file 4: Table S4). The VVMD21 locus had the highest value for FIT, FST and FIS (0.380, 0.235 and 0.189, respectively), while the lowest FIT and FIS values were detected for the VMC4f3.1 locus (0.095 and 0.005, respectively), and VVMD25 had the lowest FST value (0.056). The number of migrants (Nm) after correction for sample size was 1.33, when samples were arranged in 12 populations. When the samples were arranged in two subpopulations (sativa and sylvestris), Nm was 4.88.

Nei’s genetic distance and FST were calculated to validate the results obtained from cluster analysis and PCoA. The pairwise values for the 12 geographic groups are listed in Table 4. Nei’s genetic distance had a wide range of values, from 0.116 recorded for the pairwise French and Spanish sativa samples, to 0.830 for the sylvestris samples from Georgia and France. The FST values varied from a low of 0.021 detected for the French and Spanish cultivated accessions to a high of 0.125 for the sylvestris individuals from Azerbaijan and France. Nei’s genetic distance and FST values for sativa and sylvestris groups were 0.159 and 0.023, respectively.

Table 4 Estimates of pairwise Nei’s genetic distance (below the diagonal) and FST values (above the diagonal) within overall wild and cultivated grapevine groups

The AMOVA analysis is presented in Additional file 5: Table S5. When the total genetic variation was partitioned, 9.54% was attributed to the differences among populations, 6.68% to the differences among individuals within populations and 83.78% to the differences within individuals, with levels of significance estimated over 1000 permutations lower than 0.05. FST, FIS and FIT parameters overall the loci and populations were 0.095, 0.073 and 0.162, respectively (p ≤ 0.05).


The main objective of this study was to analyze the pattern of genetic diversity within and between wild and cultivated grapes from the Mediterranean basin and Central Asia – considered to be the center of domestication. We pooled information from six previous studies that examined both wild and cultivated accessions, and genotyped an additiopnal 403 wild accessions from the Caucases region and Croatia at 20 microsatellite loci. The microsatellite marker data from 1378 accessions was subjected to NJ clustering and Bayesian methods to elucidate groupings of wild grapevine populations and to infer gene flow and gene frequency changes that occurred during domestication.

Assessment of flower sex within sylvestris populations

Taxonomic distinctions between the two subspecies, sylvestris and sativa, are based on leaf morphology and the dioecious state of wild forms. According to the model of Antcliff [51], the flower phenotype is controlled by a single major locus with three alleles: male (M) dominant to hermaphrodite (H), which is dominant to the female (F). In the wild, only male and female vines exist in the absence of gene flow from hermaphroditic cultivated varieties. However, the possibility of hybridization and seed dispersion increases where wild vines are in close proximity to cultivated types. The wild accessions from earlier studies were collected with careful consideration of flower phenotype and leaf morphology [7, 8, 12, 28, 29]. The samples from Armenia, Azerbaijan, and Georgia were collected as seed lots. Analyses of flower phenotype based on linked markers found that the Georgia populations had more female than male vines, and that seed lot DVIT3357 consisted of only female and hermaphrodite vines indicating gene flow from cultivated to wild types (Additional file 2: Table S2). However, the Armenian, and Azerbaijan populations had a higher proportion of male plants. Heterogeneous plant sex distribution was also observed in earlier study of Spanish sylvestris samples [49] with a majority of the plants being male.

Pattern of genetic diversity distribution within and among the subspecies

The two subspecies of V. vinifera included in this study exhibited high levels of polymorphism and heterozygosity across the 20 microsatellite loci and significant diversity was observed within and between the subspecies (Tables 2 and 3). This trend was expected in a divergent gene pool composed of subspecies and hermaphroditic cultivars that have undergone intensive human selection during domestication. Data obtained in other studies [10, 11] are similar to the results from our survey. Genetic diversity within and among the different geographic groups in both subspecies, as demonstrated by the effective number of alleles and allelic richness, suggests that there is significant diversity both within and between the subspecies (Table 3). The sativa and sylvestris accessions from Georgia had the highest number of effective alleles and allele richness suggesting that this region is the center of diversity for V. vinifera [2].

In general, we expected to see higher levels of heterozygosity in sylvestris because of its obligate out-crossing nature compared to its domesticated counterpart sativa. The Ho value of the sativa group appeared slightly higher than the He values; while the trend was the opposite for the sylvestris accessions. These differences correspond with the positive FIS values in sylvestris, particularly in the populations from Spain and Georgia, which suggests a high level of genetic relationship among the individuals from the same wild populations (Table 3). Such matings can affect individual and population dynamics and increase inbreeding. However, the FIS values of some wild populations were close to zero as expected in randomly mating populations (Table 3). These opposing results may be explained by random genetic drift of alleles among subpopulations due to sample size. The reduced level of diversity that we observed in sylvestris samples has also been noted in other studies [10,11,12]. The sylvestris accessions in many parts of the world are considered endangered and fragmented due to deforestation and urbanization. Man-made and natural geographical barriers can also lead to the isolation of wild populations in their native habitat, and could lead to significant inbreeding, reduced gene flow within and among different geographic groups and, hence, lower levels of heterozygosity.

The FIS values were close to zero in the cultivated accessions suggesting random mating, except the Italian accessions. The negative FIS values for Italian populations indicated an excess of heterozygotes, but it was not statistically significant (Table 3). The deficiency of homozygotes in the majority of the cultivated groupings suggests that they are made up of germplasm with divergent demographic (founder effects, bottlenecks, dispersal) and selection histories. Germplasm collections are usually mixtures of genotypes. Thus, geographic groups in these collections exhibit relatively high levels of differentiation, resulting in higher than expected levels of heterozygosity. This is commonly observed in woody perennial crops where cultivars are selected for their vigor, which indirectly favors high levels of heterozygosity [52,53,54].

The results of the AMOVA and F-stat analysis confirmed that high levels of diversity were present within populations, while low levels of genetic diversity were found among populations. These results are consistent with the findings from other studies [10,11,12].

Genetic structure and differentiation within and between the subspecies

A significant differentiation within and between the two subspecies was detected by cluster analysis and PCoA (Figs. 1 and 2). Both analyses found clear differentiation between the Western European wild grapevines and the wild samples collected from the Caucasus. The French and Spanish wild grapes were closely allied and had a close genetic relationship. These results were in agreement with Arroyo et al. [22], who used chloroplast markers to find that these populations had the same haplotype. The Spanish wild grapevines showed hierarchical differentiation, suggesting that gene flow among neighboring populations caused a stepping-stone model of population structure. Alternatively, the hierarchical differentiation could be the result of climatic differences across diverse geographic regions. The Croatian sylvestris accessions were related to the European sylvestris individuals and formed a basal sister group indicating a common gene pool. The wild grapevines from Transcaucasia, including Armenia, Azerbaijan, and Georgia, formed a distinct sub-group that contained several accessions of Azerbaijani wild grapevines. Similarly, the Georgian and Armenian wild grapes split into two subgroups each, however they shared a common Transcaucasia gene pool. The sylvestris vines in the Transcaucasia region grow in a wide range of isolated habitats created by the Greater and Lesser Caucasus Mountain systems where they are differentially adapted to local environments [12, 54]. Some of the sylvestris individuals, both in Caucasian and European germplasm, clustered with the cultivated samples. These accessions are most likely feral hybrids of sativa and sylvestris, which may have been used in breeding programs or as cultivated selections (Figs 1 and 2).

Within sativa, two distinct groups of cultivars from Georgia were observed, one appeared as a sister clade of Italian, French and Spanish cultivars (Fig. 1), while the other group was closely related to an Italian sativa and Georgian sylvestris sub-group. This result could suggest that the first domesticated cultivars in Central Asia and Caucasus (proles pontica), left a genetic footprint in the Western European proles occidentalis accessions. This genetic kinship could also be a reflection of early breeding programs in the Mediterranean region where sylvestris or hybrid feral vines with superior fruit were utilized in crosses with domesticated lines.

The overall pattern of differentiation depicted by the PCoA is very similar to the NJ cluster analysis (Figs. 1 and 2). Clusters within sylvestris accessions from Georgia and Armenia overlapped and were closely associated with cultivated forms from Georgia, Pakistan and Turkmenistan. The close association of Georgian wild grapevines with Georgian cultivated accessions strongly supports their involvement in the initial domestication of grapevine [55,56,57]. Evaluation with NJ cluster analysis and PCoA, indicates that local European sylvestris vines might have contributed to the selection and introgression of genes into Western European grapevines in the later part of the domestication process (Fig. 2). The Bayesian STRUCTURE analysis supported differentiation among the major groups only, while the fine-scale differentiation between some of the groups, especially those with mixed ancestry, was not evident (Fig. 3). Bayesian inference of genetic structure indicated considerable gene flow with moderate differentiation between the two subspecies. These results suggest that wine grape cultivation and wine making promoted the domestication of wild grapevines, creation of new varieties, and advancement of growing techniques early in grapevine’s history. Further introgression and mixing of wild germplasm in localized communities would have contributed to the high proportion of grapevines with mixed ancestry. Interestingly, analyses of ancestry values of tested western cultivars identify some with a high ancestry values in Group 3 (Additional file 3: Table S3). These grapevine cultivars correspond to the Spanish cultivars; Albariño, Caiño Blanco, Ferrón, Maturana, Ondarrabi Betlza and the European cultivars Arvine Petite, Cot, Chenin Blanc, Petit Verdot. Pinot Meunier and Sauvignon Blanc. These cultivars have been described as more closely related to wild accessions [8] and our results support the introgression of western sylvestris into some of the current Western European cultivars.

It is difficult to suggest that wild grape forms homogeneous populations considering the vast geographic expanse and the often fragmented and isolated populations that occur under heterogeneous climatic conditions. However, our results suggest Georgia as an ancient center of grapevine domestication with its wild grapes closely related to the cultivated grapes of the same region (proles pontica), and Western European (proles occidentalis). This observation confirms earlier studies that suggested that proles pontica were gradually introduced by human migration towards Western Europe [10, 25, 58, 59]. Cluster analysis shows a relationship between Western European wild grapes and cultivated grapes, suggesting that proles occidentalis grapevines contributed to the early development of wine grapes to a much greater extent than the wild vines from Eastern Europe. Previous studies using SNPs markers [25] proposed a Near East origin of vinifera and presented evidence of introgression from local sylvestris as the grape moved into Europe, but the degree to which local Western European wild sylvestris genetically contributed to Western European vinifera cultivars remains a contentious issue. Our results suggest and support at least two separate domestication events that gave raise to cultivated grape; one derived from the Transcaucasia wild grape, and another from the wild grapes of Western Europe.

Scientific interest in the highly endangered ancestor of cultivated grapevine, V. vinifera subsp. sylvestris, has centered on questions of conservation genetics, and deepening our understanding of the domestication history of the cultivated crop [22]. However, since domestication traits such as higher yield, larger berries, higher sugar content are often accompanied by a loss of resistance to abiotic and biotic stress, it is beneficial to search for such factors in the wild forms of the crop’s ancestors. In fact, salt-tolerant grape accessions can be found in the North African sylvestris population [60], and the recent identification of wild and cultivated accessions from Germany, Iran and Georgia with tolerance to mildew diseases supports the potential of this wild ancestor as a genetic resource for disease resistance breeding [24, 61,62,63]. Given that wild Eurasian and North Africa wild V. vinifera germplasm and Asian Vitis germplasm are largely unexplored, their identification, preservation, and characterization for biotic and abiotic resistance and berry quality [64, 65] traits are very important for the future of the wine and grape industry.


The two sub-species of V. vinifera, subsp. sativa and subsp. sylvestris, are distinct based on analysis of SSR data, but extensive gene flow was observed in regions where these two taxa came in contact. Our results suggest that Georgia is an ancient center of grape domestication based on a genetic affinity between wild accessions from Georgia and cultivated grapes from Georgia (pontica) and Western Europe (occidentalis). Results also suggest that Western European wild grapes were related to cultivated grapes, and that Western European sylvestris contributed to the development of Western European wine grapes. The results also support at least two separate domestication events that gave raise to cultivated grape; one derived from the Transcaucasian wild grape and another from the wild grape of Western Europe.

Finally, wild grape germplasm can contribute many useful traits such as resistance to damaging pests and diseases, and better adaptation to climate change. Thus, we must intensify efforts to collect, characterize and preserve not only the Western and Eurasian wild V. vinifera germplasm, but also sylvestris accessions from North Africa and Central Asia. These wild grape relatives will have a key role in future grape and wine industry.



Analysis molecular of variance


Before current era


Current era




Inbreeding coefficient


Fixation index




Expected heterozygosity


Observed heterozygosity




Sample size


Number of effective alleles


Principal coordinate analysis


Single Nucleotide Polymorphism


Simple sequence repeat


University of Milan


United States Department of Agriculture


  1. Vivier MA, Pretorius IS. Genetically tailored grapevines for the wine industry. Trends Biotechnol. 2002;20:472–8.

    Article  PubMed  CAS  Google Scholar 

  2. Zohary D, Hopf M. Domestication of plants in the old world. 3rd ed. New York: Oxford University Press; 2000.

    Google Scholar 

  3. Levadoux L. Les populations sauvages et cultivées de Vitis vinifera L. Annl Amellior Plantes. 1956;1:59–118.

    Google Scholar 

  4. Zohary D. Domestication of the grapevine Vitis vinifera L. in the Near East. In: Mc Govern PE, Fleming SJ, Katz SH, editors. The Origins and Ancient History of Wine. New York: Gordon and Breach Sciences Publisher; 1995. p. 23–30.

    Google Scholar 

  5. Terral JF, Tabard E, Bouby L, Ivorra S, Pastor T, Figueiral I, Picq S, Chevance JB, Jung C, et al. Evolution and history of grapevine (Vitis vinifera) under domestication: new morphometric perspectives to understand seed domestication syndrome and reveal origins of ancient European cultivars. Ann Bot. 2010;105:443–55.

    Article  PubMed  Google Scholar 

  6. Cipriani G, Spadotto A, Jurman I, Di Gaspero G, Crespan M, Meneghetti S, Frare E, Vignani R, Cresti M, Morgante M, Pezzotti M, Pe E, Policriti A, Testolin R. The SSR-based molecular profile of 1005 grapevine (Vitis vinifera L.) accessions uncovers new synonymy and parentages, and reveals a large admixture amongst varieties of different geographic origin. Theor Appl Genet. 2010;121:1569–85.

    Article  PubMed  Google Scholar 

  7. Laucou V, Lacombe T, Dechesne F, Siret R, Bruno JP, Dessup M, Dessup T, Ortigosa P, Parra P, Roux C, Santoni S, Varés D, Pèros JP, Boursiquot JM, This P. High throughput analysis of grape genetic diversity as a tool for germplasm collection management. Theor Appl Genet. 2011;122:1233–45.

    Article  PubMed  CAS  Google Scholar 

  8. De Andrés MT, Benito A, Pérez-Rivera G, et al. Genetic diversity of wild grapevine populations in Spain and their genetic relationships with cultivated grapevines. Mol Ecol. 2012;21:800–16.

    Article  PubMed  Google Scholar 

  9. Lacombe T, Boursiquot JM, Laucou V, Di Vecchi-Staraz M, Péros JP, This P. Large-scale parentage analysis in an extended set of grapevine cultivars (Vitis vinifera L.). Theor Appl Genet. 2013;126:401–14.

    Article  PubMed  Google Scholar 

  10. Bacilieri R, Lacombe T, Le Cunff L, Di Vecchi-Staraz M, Laucou V, Genna B, Peros JP, This P, Boursiquot JM. Genetic structure in cultivated grapevines is linked to geography and human selection. BMC Plant Biol. 2013;

  11. Emanuelli F, Lorenzi S, Grzeskowiak L, Catalano V, Stefanini M, Troggio M, et al. Genetic diversity and population structure assessed by SSR and SNP markers in large germplasm collection of grape. BMC Plant Biol. 2013;13:39.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  12. Imazio S, Maghradze D, Lorenzis G, Bacilieri R, Laucou V, This P, et al. From the cradle of grapevine domestication: molecular overview and description of Georgian grapevine (Vitis vinifera L.) germplasm. Tree Genet Genomes. 2013;9:641–58.

    Article  Google Scholar 

  13. Negrul AM. Origin of cultivated grapevine and its classification. In: Frolov-Bagreev AM, editor. Ampelography of the Soviet Union. Moscow: Publisher “Pishchepromizdat”; 1946. p. 159–216. (In Russian).

    Google Scholar 

  14. McGovern PE. Ancient Wine.The search for the origins of viniculture. 1st ed. Princeton: Princeton University Press; 2003.

    Google Scholar 

  15. This P, Lacombe T, Thomas MR. Historical origins and genetic diversity of wine grapes. Trends Genet. 2006;22:511–9.

    Article  PubMed  CAS  Google Scholar 

  16. Forni G. The origin of “old world” viticulture. In: Maghradze D, Rustioni L, Scienza A, Turok J, Failla O, editors. Caucasus and northern Black Sea region Ampelography. Vitis, special issue; 2012. p. 27–38.

    Google Scholar 

  17. Failla O. East-west collaboration for grapevine diversity exploration and mobilization of adaptive traits for breeding: a four years story. Vitis. 2015;54:1–4.

    Google Scholar 

  18. Maul E, Töpfer R, Carka F, Cornea V, Crespan M, Dallakyan M, et al. Identification and characterization of grapevine genetic resources maintained in eastern European collections. Vitis. 2015;54:5–12.

    Google Scholar 

  19. Ellstrand NC, Heredia SM, Leak-Garcia JA, Heraty JM, Burger JC, Yao L, Nohzadeh-Malakshah S, Ridley CE. Crops gone wild: evolution of weeds and invasive from domesticated ancestors. Evol Appl. 2010;3:494–504.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Coleman C, Copetti D, Cipriani G, Hoffmann S, Kozma P, Kovács L, Morgante M, Testolin R, Di Gaspero G. The powdery mildew resistance gene REN1 co-segregates with an NBS-LRR gene cluster in two central Asian grapevines. BMC Genet. 2009;10:89.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  21. Cambrollé J, García JL, Ocete R, Figueroa ME, Cantos M. Growth and photosynthetic responses to copper in wild grapevine. Chemosphere. 2013;93:294–301.

    Article  PubMed  CAS  Google Scholar 

  22. Arroyo-García R, Ruiz-García L, Bolling L, Ocete R, Lopez MA, et al. Multiple origins of cultivated grapevine (Vitis vinifera L. ssp sativa) based on chloroplast DNA polymorphims. Mol Ecol. 2006;15:3707–14.

    Article  PubMed  CAS  Google Scholar 

  23. Emanuelli F, Battilana J, Costantini L, Le Cunff L, This P, Grando MS. A candidate gene association study for Muscat flavor in grapevine Vitis vinifera L. BMC Plant Biol. 2010;10:241.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  24. Riaz S, Boursiquot JM, Dangl GS, Lacombe T, Laucou V, Tenscher AC, Walker MA. Identification of mildew resistance in wild and cultivated central Asian grape germplasm. BMC Plant Biol. 2013;13:149.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  25. Myles S, Boyko AR, Owens CL, Brown PJ, Grassi F, Aradhya MK, et al. Genetic structure and domestication history of the grape. PNAS. 2011;108:3457–8.

    Article  Google Scholar 

  26. Grassi F, Labra M, Imazio S, Ocete R, Failla O, Scienza A, Sala F. Phylogeographical structure and conservation genetics of wild grapevine. Conserv Genet. 2006;7:837–45.

    Article  Google Scholar 

  27. Ekhvaia J, Akhalkatsi M. Morphological variation and relationships of Georgian populations of Vitis vinifera L. subsp.sylvestris (C.C. Gmel.) Hegi. Flora. 2010;205:608–17.

    Article  Google Scholar 

  28. Biagini B, De Lorenzis G, Imazio S, Failla O, Scienza A. Italian wild grapevine population: insights into eco-geographical aspects and genetic structure. Tree Genet Genomes. 2014;10:1369.

    Article  Google Scholar 

  29. Zdunić J, Maul E, Hančević K, Leko M, Butorac L, Mucalo A, Radić T, Šimon S, Budić-Leto I, Mihaljević M-Z, Maletić E. Genetic diversity of wild grapevine (Vitis vinifera subsp. sylvestris Gmel Hegi) in the eastern Adriatic region. Am J Enol Vitic. 2017;68:252–7.

    Article  Google Scholar 

  30. Battilana J, Lorenzi S, Moreira FM, Moreno-Sanz P, Failla O, Emanuelli F, Grando MS. Linkage mapping and molecular diversity at the flower sex locus in wild and cultivated grapevine reveal a prominent SSR haplotype in hermaphrodite plants. Mol Biotechnol. 2013;54:1031–7.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  31. Nei M. Analysis of gene diversity in subdivided populations. PNAS. 1973;70:3321–3.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  32. Peakall R, Smouse PE. GenAlEx 6: genetic analysis in excel. Population genetic software for teaching and research. Mol Ecol Notes. 2006;6:288–95.

    Article  Google Scholar 

  33. Weir BS, Cockerham C. Estimating F-statistics for the analysis of population genetics. Evolution. 1984;38:1358–70.

    PubMed  CAS  Google Scholar 

  34. Goudet J. FSTAT, a program to estimate and test gene diversities and fixation indices (version 29.3). 2001.

    Google Scholar 

  35. Excoffier L, Lischer HEL. Arlequin suite ver 3.5: a new series of programs to perform population genetics analyses under Linux and windows. Mol Ecol Resour. 2010;10:564–7.

    Article  PubMed  Google Scholar 

  36. Kalinowski ST. Hp-rare 1.0: a computer program for performing rarefaction on measures of allelic richness. Mol Ecol Notes. 2005;5:187–9.

    Article  CAS  Google Scholar 

  37. Barton NH, Slatkin M. A quasi-equilibrium theory of the distribution of rare alleles in a subdivided population. Heredity. 1986;56:409–15.

    Article  PubMed  Google Scholar 

  38. Raymond M, Rousset F. GENEPOP (version 1.2): population genetics software for exact tests and ecumenicism. J Heredity. 1995;86:248–9.

    Article  Google Scholar 

  39. Kamvar ZN, Brooks JC, Grünwald NJ. Novel R tools for analysis of genome-wide population genetic data with emphasis on clonality. Front Genet. 2015;6:208.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  40. R Core Team. R: A language and environment for statistical computing. Vienna. R Foundation for Statistical Computing; 2014.

    Google Scholar 

  41. Nei M. Genetic distance between populations. Amer Nat. 1972;106:283–392.

    Article  Google Scholar 

  42. Paradis E, Claude J, Strimmer K. APE: analyses of phylogenetics and evolution in R language. Bioinformatics. 2004;20:289–90.

    Article  PubMed  CAS  Google Scholar 

  43. Jombart T. Adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008;24:1403–5.

    Article  PubMed  CAS  Google Scholar 

  44. Nei M. Estimation of average heterozygosity and genetic distance from a small number of individuals. Genetics. 1978;89:583–90.

    PubMed  PubMed Central  CAS  Google Scholar 

  45. Excoffier L, Smouse PE, Quattro JM. Analysis of molecular variance inferred from metric distance among DNA haplotypes: application to human mitochondrial DNA restriction data. Genetics. 1992;131:479–91.

    PubMed  PubMed Central  CAS  Google Scholar 

  46. Pritchard JK, Stephens M, Donnelly P. Inference of population structure using multilocus genotype data. Genetics. 2000;155:945–59.

    PubMed  PubMed Central  CAS  Google Scholar 

  47. Falush D, Stephens M, Pritchard JK. Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. Genetics. 2003;164:1567–87.

    PubMed  PubMed Central  CAS  Google Scholar 

  48. Evanno G, Regnaut S, Goudet J. Detecting the number of clusters of individuals using the software STRUCTURE: a simulation study. Mol Ecol. 2005;14:2611–20.

    Article  PubMed  CAS  Google Scholar 

  49. Benito A, Muñoz-Organero G, Ocete R, García-Muñoz S, López MA, Arroyo-Garcia R, Cabello F. An ex situ ampelographical characterization of wild Vitis vinifera from fifty-one Spanish populations. Austral J Grape Wine Res. 2017;23:143–52.

    Article  CAS  Google Scholar 

  50. Biagini B, Imazio S, Scienza A, Failla O, De Lorenzis G. Renewal of wild grapevine (Vitis vinifera L. subsp. sylvestris (Gmelin) Hegi) populations through sexual pathway: some Italian case studies. Flora. 2016;219:85–93.

    Article  Google Scholar 

  51. Antcliff AJ. Inheritance of sex in Vitis. Ann Amel Plant. 1980;30:113–22.

    Google Scholar 

  52. Sefc KM, Lopes MS, Lefort F, Botta R, Roubelakis-Angelakis KA, Ibáñez J, Pejić I, Wagner HW, Glössl J, Steinkellner H. Microsatellite variability in grapevine cultivars from different European regions and evaluation of assignment testing to assess the geographic origin of cultivars. Theor Appl Genet. 2000;100:498–505.

    Article  Google Scholar 

  53. Aradhya MK, Dang GS, Prins BH, Boursiquot JM, Walker MA, Meredith CP, Simon CJ. Genetic structure and differentiation in cultivated grape Vitis vinifera L. Genet Res. 2003;81:179–92.

    Article  PubMed  CAS  Google Scholar 

  54. De Lorenzis G, Chipashvili R, Failla O, Maghradze D. Study of genetic variability in Vitis vinifera L. germplasm by high-throughput Vitis18kSNP array: the case of Georgian genetic resources. BMC Plant Biol. 2015;15:154.

    Article  PubMed  PubMed Central  CAS  Google Scholar 

  55. Marrano A, Grzeskowiak L, Moreno Sanz P, Lorenzi S, Prazzoli ML, Arzumanov A, Amanova A, Failla O, Maghradze D, Grando MS. Genetic diversity and relationships in the grapevine germplasm collection from Central Asia. Vitis. 2015;54:233–7.

    Google Scholar 

  56. Grassi F, Labra M, Imazio S, Spada A, Sgorbati S, Scienza A, Sala F. Evidence of a secondary grapevine domestication Centre detected by SSR analysis. Theor Appl Genet. 2003;107:1315–20.

    Article  PubMed  CAS  Google Scholar 

  57. Ergul A, Perez-Rivera G, Soybelezoglu G, Kazan K, Arroyo-Garcia R. Genetic diversity in Anatolian wild grapes (Vitis vinifera subsp. sylvestris) estimated by SSR markers. Plant Genet Resour. 2011;9:375–83.

    Article  Google Scholar 

  58. Regner F, Stadlbauer A, Eisenheld C, Kaserer H. Genetic relationships among pinots and related cultivars. Am J Enol Vitic. 2000;51:7–14.

    CAS  Google Scholar 

  59. Lopes MS, Mendonça D, Rodrigues dos Santos M, Eiras-Dias JE, da Câmara Machado A. New insights on the genetic basis of Portuguese grapevine and on grapevine domestication. Genome. 2009;52:790–800.

    Article  PubMed  CAS  Google Scholar 

  60. Askri H, Daldoul S, Ben Ammar A, Rejeb S, Jardak R, Rejeb MN, Mliki A, Ghorbel A. Short-term response of wild grapevines (Vitis vinifera L. ssp. sylvestris) to NaCl salinity exposure: changes of some physiological and molecular characteristics. Acta Physiol Plant. 2012;

  61. Duan D, Halter D, Baltenweck R, Tisch C, Tröster V, Kortekamp A, Hugueney P, Nick P. Genetic diversity of stilbene metabolism in Vitis sylvestris. J Exp Bot. 2015;

  62. Bitsadze N, Aznarashvili M, Vercesi A, Chipashvili R, Failla O, Maghradze D. Screening of Georgian grapevine germplasm for susceptibility to downy mildew (Plasmopara viticola). Vitis. 2015;54:193–6.

    Google Scholar 

  63. Toffolatti SL, Maddalena G, Salomoni D, Maghradze D, Bianco PA, Failla O. Evidence of resistance to the downy mildew agent Plasmopara viticola in the Georgian Vitis vinifera germplasm. Vitis. 2016;55:121–8.

    Google Scholar 

  64. Revilla E, Carrasco D, Benito A, Arroyo-Garcia R. Anthocyanin composition of several wild grape accessions. Am J Enol Vitic. 2010;61:636–42.

    Article  CAS  Google Scholar 

  65. Maghradze D, Rustioni L, Scienza A, Failla O. Phenological diversity of Georgian grapevine cultivars in northern Italy. J Amer Pomolog Soc. 2012;66:56–67.

    Google Scholar 

Download references


We thank the USDA National Clonal Germplasm Repository in Davis, California, USA, for providing the accessions from Armenia, Azerbaijan, Georgia and Croatia.

Ethics approval and consent participate

Not applicable


This publication was financially supported by the grants RTA2014–00016-C03–01 Ministry of Economy and Competitiveness (Spain) and the fellowship program Salvador de Madariaga PRX12/00036 from the Ministry of Education (Spain). Join publication of the COST Action FA1003 “East–West Collaboration for Grapevine Diversity Exploration and Mobilization of Adaptive Traits for Breeding”.

Availability of data and materials

A table of all genotypes and SSR allele calls used for statistical analysis in this study is submitted as Additional file 1: Table S1.

Author information

Authors and Affiliations



RA conceived the idea and design of the study and wrote the manuscript; SR and GDL developed fingerprint data, carried out genetic population analysis and assisted with drafting the manuscript, AK assisted in developing fingerprint profiles; MA assisted in the interpretation of the genetic diversity and population analysis and participated in the drafting of the manuscript; GZ provided the Croatian grape germplasm fingerprint database, DM, ZB and MM provided the Georgian germplasm; MAW, OF and JP were involved in discussion and interpretation of the results and oversaw the final draft and revisions. All authors have read and approved the final manuscript.

Corresponding author

Correspondence to Rosa Arroyo-Garcia.

Ethics declarations

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional files

Additional file 1:

Table S1. SSR allelic profile of 1378 sativa and sylvestris grapevine samples genotyped at 20 SSR molecular markers. (XLSX 643 kb)

Additional file 2:

Table S2. Determination of flower phenotype and genotype. The APT3 gene marker distinguished females (F) from males (M) or hermaphrodites (H). SSR marker VVIb23 is tightly linked to the flower sex locus and the unique allele 282 is linked to hermaphrodism. (XLSX 57 kb)

Additional file 3:

Table S3. Ancestry values (mean and standard deviation values over 20 iterations) for the genetic groups inferred by STRUCTURE analysis of 1378 sativa and sylvestris grapevine samples genotyped at 20 SSR loci. (XLSX 145 kb)

Additional file 4:

Table S4. Locus-wise genetic differentiation parameter comparisons of 12 populations from both V. vinifera subspecies sativa and sylvestris. (DOCX 19 kb)

Additional file 5:

Table S5. Results of the AMOVA analysis carried out among and within 12 populations of wild and cultivated grapevine. (DOCX 16 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Riaz, S., De Lorenzis, G., Velasco, D. et al. Genetic diversity analysis of cultivated and wild grapevine (Vitis vinifera L.) accessions around the Mediterranean basin and Central Asia. BMC Plant Biol 18, 137 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: