Genetic structure in cultivated grapevines is linked to geography and human selection
© Bacilieri et al; licensee BioMed Central Ltd. 2013
Received: 7 January 2013
Accepted: 31 January 2013
Published: 8 February 2013
Grapevine (Vitis vinifera subsp. vinifera) is one of the most important and ancient horticultural plants in the world. Domesticated about 8,000–10,000 years ago in the Eurasian region, grapevine evolved from its wild relative (V. vinifera subsp. sylvestris) into very diverse and heterozygous cultivated forms. In this work we study grapevine genetic structure in a large sample of cultivated varieties, to interpret the wide diversity at morphological and molecular levels and link it to cultivar utilization, putative geographic origin and historical events.
We analyzed the genetic structure of cultivated grapevine using a dataset of 2,096 multi-locus genotypes defined by 20 microsatellite markers. We used the Bayesian approach implemented in the STRUCTURE program and a hierarchical clustering procedure based on Ward’s method to assign individuals to sub-groups. The analysis revealed three main genetic groups defined by human use and geographic origin: a) wine cultivars from western regions, b) wine cultivars from the Balkans and East Europe, and c) a group mainly composed of table grape cultivars from the Eastern Mediterranean, Caucasus, Middle and Far East countries. A second structure level revealed two additional groups, a geographic group from the Iberian Peninsula and Maghreb, and a group comprising table grapes of recent origin from Italy and Central Europe. A large number of admixed genotypes were also identified. STRUCTURE clusters grouped together a large proportion of family-related genotypes. In addition, Ward’s method revealed a third level of structure, corresponding either to limited geographic areas, to particular grape use or to family groups created through artificial selection and breeding.
This study provides evidence that the cultivated compartment of Vitis vinifera L. is genetically structured. Genetic relatedness of cultivars has been shaped mostly by human uses, in combination with a geographical effect. The finding of a large portion of admixed genotypes may be the trace of both large human-mediated exchanges between grape-growing regions throughout history and recent breeding.
Cultivated grapevine, Vitis vinifera subsp. vinifera, is one of the major horticultural crops worldwide. Domesticated from the dioecious taxon V. vinifera subsp. sylvestris [1–4], the modern grape is today more diverse and heterozygous than its wild relative [5–8].
The combined action of selection, breeding, admixture and migration is believed to have shaped the cultivated compartment, possibly starting from multiple gene pools during domestication [9, 10]. Humans certainly selected traits related to fertility, blossom drop (coulure), productivity, berry size, sugar and acidity content [4, 11, 12], since these are key to successful grape production. Similarly, hermaphroditism has been strongly selected for, almost to complete fixation, as self-pollinating plants achieve higher fruit production. Other traits were probably also selected, such as shoot habit, tolerance to biotic and abiotic stress, adaptation to the local environment, and the ability to propagate from cuttings. Vegetative propagation is indeed believed to have been adopted early in the domestication process. Large-scale grafting, in contrast, was only implemented at the end of the 19th century, after the introduction of Phylloxera in Europe. A slow selection process of promising local landraces early in domestication, followed by direct breeding and selection from the end of the Middle Ages onwards, may have favored the emergence of family clusters of cultivars [14, 15]. Finally, human-mediated movements of seeds and cuttings occurred even over long distances and were the means by which grape was extended to the New World, followed by selection and adaptation to local conditions. The combination of such factors has most probably structured modern grapevine genetic diversity.
Up to now, studies searching for genetic structure in grapevine have been based on samples that were either relatively small (up to a few hundred) [5, 9, 10, 16] or geographically limited. Myles et al., exploring SNP polymorphism in a set of 583 cultivars (404 of which had known geographic origin), found a weak East–West structure gradient; however, their sample under-represented some important regions (Caucasus, Spain, North Africa) and therefore could not be fully conclusive about the genetic structure of the entire cultivated grapevine gene pool. Thus, the analysis of genetic structure in the largest grapevine collection available worldwide (INRA Vassal, France) could enable us to better understand the extent and distribution of grape diversity and how mankind shaped it.
In addition, structure analysis is a prerequisite for deciphering complex traits in genetic resources using association genetics, a methodology that has already yielded interesting results in grape [19, 20]. The extensive diversity of grapevine and recent progress in DNA analysis technologies make genome-wide association genetics over hundreds of accessions the next target for grapevine research. Therefore, population structure analysis is now all the more pivotal.
The genetic material maintained at the French grapevine collection of Vassal has been built up over the last 140 years through a large network of international partnerships. More than 5,000 accessions of Vitis vinifera have been morphologically and agronomically characterized and historically documented. Using twenty microsatellite markers at linkage equilibrium, Laucou et al. identified 2,323 unique genotypes in this collection. Microsatellite markers (or Simple Sequence Repeats, SSR) are indeed suitable for better understanding the genetic structure of cultivated plants [24–26].
The main objectives of the present study were to understand the genetic diversity of cultivated grapevine and how humans molded it over the years, and to provide a description of genetic structure that could be used to select genotype samples appropriate for further genetic association studies. We used a subset of 2,096 microsatellite genotypes from the Vassal database, without missing data and differing from one another by at least three alleles. Statistical tests were conducted to control for the possible effects of sampling, in particular the bias that could arise from oversampling one geographical region or one family group.
We analyzed the structure using two different methods of clustering, the first based on a Bayesian approach, and the second using Ward’s hierarchical clustering method [28, 29]. We also compared the genetic diversity and family relatedness within the clusters at different levels of ancestry. Finally, to interpret population structure in relation to both cultivar use and history, we characterized the clusters using the geographic origin of cultivars and their phenotypic characteristics.
Our sample of 2,096 cultivated genotypes (Additional file 1: Table S1) displayed from 5 to 34 alleles per locus, with a total of 324 alleles over the 20 loci, an unbiased expected heterozygosity (He) of 77% and a mean polymorphism information content (PIC) of 0.740 (Additional file 2: Table S2). Owing to the large number of loci with moderate allele frequencies, the single parent exclusion probability was quite high (7 × 10⁻⁶).
Using a threshold of >85% for group assignation, 1,001 genotypes (out of the 2,096) were assigned to a cluster at Ks = 3 and 817 at Ks = 5. The proportion of admixed genotypes was thus large, i.e. 52% and 61% of the total number of cultivars, at Ks = 3 and Ks = 5 respectively.
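The assignment rule above can be illustrated with a minimal Python sketch. It assumes each genotype is described by a row of cluster membership coefficients summing to 1 (as reported by STRUCTURE); the function names and the toy coefficients are hypothetical and are not taken from the study.

```python
def assign_clusters(q_matrix, threshold=0.85):
    """Return the 0-based cluster index per genotype, or None when admixed.

    A genotype is assigned only if its largest membership coefficient
    is strictly greater than the threshold (>85% in the text).
    """
    assignments = []
    for row in q_matrix:
        best = max(range(len(row)), key=row.__getitem__)
        assignments.append(best if row[best] > threshold else None)
    return assignments


def admixed_fraction(assignments):
    """Fraction of genotypes left unassigned (admixed)."""
    return sum(a is None for a in assignments) / len(assignments)


# Hypothetical membership coefficients for four genotypes at Ks = 3.
q = [
    [0.95, 0.03, 0.02],
    [0.10, 0.88, 0.02],
    [0.50, 0.30, 0.20],
    [0.40, 0.45, 0.15],
]
groups = assign_clusters(q)
print(groups)                    # [0, 1, None, None]
print(admixed_fraction(groups))  # 0.5

# Percentages implied by the counts reported in the text.
print(round(100 * (2096 - 1001) / 2096))  # 52
print(round(100 * (2096 - 817) / 2096))   # 61
```

The last two lines only re-derive the admixed percentages from the reported numbers of assigned genotypes.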
Composition of the cultivar groups used in the analysis
Column headings: Number of cultivars; Number of cultivars for testing sampling effect (b)
Group labels and country codes, in table order:
DZA, MAR, TUN
Western & Central Europe
BEL, FRA, GBR, NLD
AUT, DEU, CHE, CZE, SVK, TCH
BGR, HUN, ROU
BIH, CYP, GRC, HRV, SCG, YUG
Russia & Ukraine
MDA, RUS, UKR, URS
Eastern Mediterranean & Caucasus
Caucasus & Turkey
ARM, AZE, GEO, TUR
EGY, ISR, LBN, SYR
Middle & Far East
Central Asia & Far East
AFG, CHN, IND, JPN, KAZ, TJK, TKM, UZB
New World Vineyard
ARG, AUS, CHL, MEX, PER, USA, ZAF
Geographic origin and viticultural traits of the identified subgroups
a Western Europe group (S-3.1) of wine cultivars (93%) containing 55% of the Western and Central Europe genotypes, without any Asian, Balkans or Maghreb cultivars;
an East group (S-3.2) mostly composed of table cultivars (71% of table grapes and 9% of ‘double-use’ cultivars), including 96% of the Far- and Middle-East genotypes, notably all genotypes from Uzbekistan (n = 33), Afghanistan (8), Tajikistan (4), Turkmenistan (4) and Iran (23), as well as 66% of the Eastern Mediterranean and Caucasus cultivars, and almost no Western and Central Europe cultivars (less than 2%). Interestingly, 43% of the cultivars from Maghreb were positioned within this group;
The clustering at Ks = 5 (Figure 2, Additional file 4: Table S3) identified in addition an Iberian and Maghreb group (S-5.1), and a group comprising mostly table grapes (80%) of recent origin, also called “obtentions”, from Italy and Central Europe (S-5.4). The group S-5.1 derived partially from the S-3.2 group (41% of the varieties), with Iberian varieties composing 69% of the group. The group S-5.4 mostly derived from the admixed Ks = 3 group (78% of the varieties).
Genetic diversity and family structure within and among clusters
Statistics of genetic diversity for the STRUCTURE sub-populations at Ks = 3 and 5
Column headings: Mean number of alleles / locus; Private alleles (absolute count) (a); Private alleles (equal samples) (a); Standard deviation He
Ks = 3: S-3.1 (Wine - West & Central Europe); S-3.2 (Table - East); S-3.3 (Wine - Balkans & East Europe)
Ks = 5: S-5.1 (Wine & Table - Iberian Peninsula & Maghreb); S-5.2 (Table - East); S-5.3 (Wine - West & Central Europe); S-5.4 (Italy & Central Europe); S-5.5 (Wine - Balkans & East Europe)
The average genetic differentiation among STRUCTURE groups for Ks = 3 and Ks = 5 was Dest = 0.166 and 0.213 respectively (harmonic means; in both cases, the standard deviation of Dest was 0.005). The largest differentiation between pairs of subpopulations was found between the Western-Central Europe and the Eastern groups, for both Ks = 3 and 5, with Dest = 0.217 and 0.256 respectively. The Eastern group and the group from the Iberian Peninsula and Maghreb displayed a Dest of 0.139, the lowest of all comparisons (Additional file 8: Table S6).
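For readers unfamiliar with the differentiation statistic, the following Python sketch computes Jost's D for one locus from known allele frequencies and combines loci with a harmonic mean. It is only an illustration: it uses the parametric form of D, not the sample-size-corrected estimator Dest computed by SMOGD, and the allele labels and frequencies are hypothetical.

```python
from statistics import harmonic_mean


def jost_d(freqs_by_pop):
    """Jost's D for one locus.

    freqs_by_pop: list of dicts, one per subpopulation, mapping
    allele label -> frequency (each dict sums to 1).
    """
    n = len(freqs_by_pop)
    alleles = set().union(*freqs_by_pop)
    # Mean within-subpopulation heterozygosity.
    h_s = sum(1.0 - sum(p * p for p in pop.values()) for pop in freqs_by_pop) / n
    # Heterozygosity of the pooled population (equal subpopulation weights).
    h_t = 1.0 - sum(
        (sum(pop.get(a, 0.0) for pop in freqs_by_pop) / n) ** 2 for a in alleles
    )
    return (h_t - h_s) / (1.0 - h_s) * n / (n - 1)


def mean_d_across_loci(loci):
    """Harmonic mean of per-locus D values (loci: list of freqs_by_pop)."""
    return harmonic_mean([jost_d(locus) for locus in loci])


# Hypothetical frequencies at one microsatellite locus in two groups.
west = {"180": 0.6, "184": 0.4}
east = {"180": 0.1, "184": 0.3, "190": 0.6}
print(jost_d([west, east]))
```

D equals 0 when the subpopulations share identical allele frequencies and 1 when they share no alleles, which is why it is convenient for highly polymorphic markers such as microsatellites.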
Since the presence of family groups may affect population genetic structure, we explored the distribution of cultivar family relationships within and among STRUCTURE groups. By comparing all possible genotype pairs, the ML-relatedness software outlined in total 1,069 likely parent pairs involving 1,099 putative parents.
Distribution of family relationship among genotype pairs, classified within and between STRUCTURE subgroups
Column heading: Related pairs (HS, half-sibs; FS, full-sibs; PO, parent-offspring)
Clustering levels: Ks = 3; Ks = 5; Kw = 12
Characterization of the clusters at Kw = 12, based on their genetic diversity, family relationship and phenotypic characteristics
Column headings: Percent of genotypes with family relations in the same group; Average relatedness "r" (st dev); Description of the main family composing the group; General group description
W-12.1. Main family: family group of Gouais blanc. Group: black and white wine cultivars from Western Europe.
W-12.2. Main family: family groups of Savagnin and Cabernet franc. Group: black and white wine cultivars from the South West of France and Portugal.
W-12.3. Group: wine cultivars from Central Europe and the Balkan peninsula, with mostly white, red, grey or pink berry skin color.
W-12.4. Group: black and white wine cultivars from the Iberian Peninsula and Maghreb.
W-12.5. Group: black and white cultivars (mostly for wine but also for table grape) from Italy and the Alps.
W-12.6. Main family: mix of several recently bred families. Group: table grape cultivars, new breeding or modern selections, with mostly white berry skin color and muscat flavor.
W-12.7. Main family: several families obtained by breeding in the XIXth and early XXth century. Group: traditional breeding of table grape (including some cultivars with muscat flavor) selected in several countries during the XIXth and early XXth century.
W-12.8. Main family: family groups of Chasselas and Muscat à petits grains. Group: wine cultivars with white, red, grey or pink berry skin color; presence of several cultivars with muscat flavor.
W-12.9. Main family: family group of Pinot and Riesling. Group: a majority of white wine cultivars from Western Europe.
W-12.10. Group: wine cultivars with mostly black berry skin color, from Italy (Centre and North), France (Alps and South), Spain (North) and Portugal.
W-12.11. Group: traditional table grape cultivars originating from the Far, Middle and Near East, with mainly white, red, grey or pink berry skin color; this group gathers most of the seedless grapevine cultivars.
W-12.12. Group: diverse wine and table grape cultivars, in particular from the Caucasus, Russia and Maghreb.
All the analyses above allowed us to characterize the different groups (Table 4 and Additional file 9: Table S7) for their genetic diversity, family relationships and phenotypic characteristics, and finally propose an interpretation of grapevine genetic history and geographic partitioning.
The main objective of this study was to explore the genetic diversity and structure of cultivated grapevine and link them to cultivar utilization, putative geographic origin and historical events. Microsatellite marker data were available for 2,323 unique cultivars collected and maintained at the French grapevine collection of Vassal (INRA, France). Inferences of population structure were derived with both a Bayesian and a hierarchical clustering method. Since clustering methods may be sensitive to sampling bias, we followed three strategies to improve our chances of detecting true structure patterns: i) first, we focused only on the 2,096 genotypes (out of the 2,323 unique cultivars) without missing SSR data, excluding putative clones and close mutants (with only one or two allele differences over the 40 alleles); indeed, missing data may bias the clustering procedure, and nearly identical SSR genotypes can be considered redundant for our purposes; ii) second, we evaluated the possible bias due to unbalanced geographical representation in our sample by running the STRUCTURE analysis on two data sets, one with the entire sample and the other balanced in terms of cultivar geographical origin (cultivars being randomly picked within each geographical group). STRUCTURE provided a very consistent attribution of genotypes to clusters independently of the data set, thus only the full set of genotypes was further analyzed; iii) third, since the STRUCTURE clustering method can be disputed because human manipulation of cultivars (displacements, breeding, clonal propagation) could have generated a deviation from Hardy-Weinberg equilibrium, we complemented the STRUCTURE analysis with the hierarchical clustering method developed by Ward, which does not depend on any assumptions about population dynamics. According to Odong et al., the two methods are complementary, so they can conveniently be used together and compared.
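The second strategy, building a geographically balanced data set by random picking within each group, can be sketched in Python as follows. The cap on the number of cultivars per group (`per_group`), the fixed random seed and the cultivar names are assumptions made for illustration; the text does not state how many cultivars were drawn from each group.

```python
import random
from collections import defaultdict


def balanced_subsample(origin_by_cultivar, per_group, seed=0):
    """Randomly pick at most `per_group` cultivars from each geographic group.

    origin_by_cultivar: dict mapping cultivar name -> geographic group label.
    """
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    by_group = defaultdict(list)
    for cultivar, group in origin_by_cultivar.items():
        by_group[group].append(cultivar)
    picked = []
    for group in sorted(by_group):
        members = sorted(by_group[group])
        picked.extend(rng.sample(members, min(per_group, len(members))))
    return picked


# Hypothetical cultivars: one over-represented group and one small group.
origins = {
    "cv1": "Western & Central Europe",
    "cv2": "Western & Central Europe",
    "cv3": "Western & Central Europe",
    "cv4": "Western & Central Europe",
    "cv5": "Middle & Far East",
}
print(balanced_subsample(origins, per_group=2))
```

Running the clustering on both the full and the subsampled data, and comparing the resulting assignments, is one way to check that an over-represented region does not drive the inferred structure.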
The 2,096 cultivars of the Vassal collection studied here originate from 52 countries around the world, making our sample highly representative of the cultivated grapevine gene pool. Our data confirmed the high levels of diversity and heterozygosity of the cultivated grapevine, in agreement with a number of previous studies [4, 5, 8, 17, 18]. This may be due to a combination of factors: i) a weak bottleneck effect during domestication, as observed for maize and wheat [31, 32], probably in relation to ii) vegetative propagation and diffusion of cuttings across geographic regions, iii) several putative domestication events from different gene pools [9, 10], later intermixed by man through breeding and selection, and iv) diversifying selection in plant breeding. The large diversity found in grapevine opens an avenue for further selection and breeding. Among the 2,096 genotypes studied here, more than half are still poorly known from a viticultural and oenological point of view and may potentially carry new genes and traits of interest for new breeding and selection.
STRUCTURE identified one main level of population subdivision at Ks = 3 and a secondary subdivision at Ks = 5. A PCA analysis and Ward’s hierarchical clustering confirmed this finding. Both the STRUCTURE and Ward methods indicated inconsistencies in clustering for K = 4 and 6, suggesting that these two levels are not appropriate for subdividing the grapevine gene pool. While confirming the main subdivision, Ward’s clustering also pointed to a finer structure linked to grapevine uses, family structure or local geographic groups.
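Ward's criterion can be made concrete with a small pure-Python sketch. It assumes each genotype has already been encoded as a numeric vector (for example, allele counts per marker) and uses squared Euclidean distances; this encoding and the toy data are assumptions, since the dissimilarity measure used in the study is not given in this section. At each step the two clusters whose merger least increases the within-cluster sum of squares are joined.

```python
def ward_clusters(points, k):
    """Naive agglomerative clustering with Ward's minimum-variance criterion.

    points: list of equal-length numeric vectors.
    k: number of clusters to stop at.
    Returns a list of clusters, each a sorted list of point indices.
    """
    # Each cluster is stored as (member indices, centroid).
    clusters = [([i], list(p)) for i, p in enumerate(points)]
    while len(clusters) > k:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                (ma, ca), (mb, cb) = clusters[a], clusters[b]
                na, nb = len(ma), len(mb)
                d2 = sum((x - y) ** 2 for x, y in zip(ca, cb))
                # Increase in within-cluster sum of squares if a and b merge.
                cost = na * nb / (na + nb) * d2
                if best is None or cost < best[0]:
                    best = (cost, a, b)
        _, a, b = best
        (ma, ca), (mb, cb) = clusters[a], clusters[b]
        na, nb = len(ma), len(mb)
        centroid = [(na * x + nb * y) / (na + nb) for x, y in zip(ca, cb)]
        merged = (ma + mb, centroid)
        clusters = [c for i, c in enumerate(clusters) if i not in (a, b)] + [merged]
    return [sorted(members) for members, _ in clusters]


# Toy data: two well-separated pairs of points.
data = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0), (10.0, 11.0)]
print(sorted(ward_clusters(data, 2)))  # [[0, 1], [2, 3]]
```

Cutting the same merge sequence at different values of k gives nested partitions, which is how a hierarchical method can expose several levels of structure (here K = 3, 5 and 12) from a single analysis. This quadratic-per-merge loop is only meant for small examples.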
The analysis of family relationships also revealed that STRUCTURE clustered together a significant portion of family-related genotypes, nearly double the fraction found in the admixed group. By contrast, almost no parentage was found among genotypes from different Ks = 3 groups (inter-group level). These findings are probably the result of the history of grapevine, with the practice of breeding focusing mostly on local varieties.
In the admixed group we could identify approximately 3% of genotypes with parents classified in two different STRUCTURE clusters, such as the wine grape Tarrango, known to be a cross between Touriga (a wine grape from Portugal, S-3.1 group) and Sultanina (a seedless table grape from Turkey, S-3.2 group). Crosses between genotypes from different STRUCTURE groups probably correspond to recent breeding activity in search of novelty and hybrid vigor, which nevertheless remains proportionally marginal.
We also detected significantly more family relationships within the already known grapevine kin groups of i) Gouais [15, 34, 35], ii) Savagnin and Cabernet franc, iii) Chasselas and Muscat, and iv) Pinot and Riesling, and found evidence of two additional groups, each composed of a mix of several families, namely the W-12.6 and W-12.7 groups, comprising family-related table grapes with muscat flavor released by modern breeding.
The interaction of genetic structure and family relationship is known to be difficult to resolve, and 20 microsatellite loci are probably not sufficient to avoid false positives, despite the large number of alleles. Nevertheless, our family relationship analysis, intended as an attempt to understand large-scale population patterns and not to precisely detect each single family pair, provided a coherent global picture. This analysis was also consistent with a more specific paper by Lacombe et al. in 2012, who explored direct parentage using an exclusion probabilities algorithm with a slightly different sample, which explains the minor differences.
Geography and history
The three main clusters revealed by our study, both with STRUCTURE and Ward’s methods, confirmed previously obtained molecular results [5, 9] and the eco-geographic grouping proposed by Negrul, in particular the correspondences between the “proles” occidentalis and S-3.1/W-3.1 groups, the pontica and S-3.3/W-3.3 groups, and the orientalis and S-3.2/W-3.2 groups. Our results allow us to subdivide these clusters according to cultivar putative geographical origins: i) West and Central Europe (S-3.1), ii) East Mediterranean, Caucasus, Middle and Far East (S-3.2), and iii) Balkans and East Europe (S-3.3). Clustering at K = 5 identified two new groups, an Iberian Peninsula group and a group of table grape obtentions with Italian Peninsula and Central Europe origins.
Genetic characterization of the groups clearly showed the East table grape group (S-3.2 and S-5.2 for K = 3 and 5 respectively) as the most diverse in terms of mean number of alleles, number of private alleles, and non-biased heterozygosity. This is consistent with the hypothesis that grapevine domestication initially occurred in Eastern regions (Caucasus and Fertile Crescent) as suggested earlier [2–4, 9], repeatedly introducing genes from the wild. The high frequency of private alleles in S-3.2 and S-5.2 could also be explained by a history of limited exchanges from East to West, as attested by the high differentiation values (Dest) between these regions, and a slower development of grape breeding in the East, as indicated by the low frequency of family-related genotypes in that region as compared to other regions, revealing a weaker selection bottleneck effect there. However, given the high genetic diversity of grapevine at all subdivision levels, the selection and breeding bottlenecks seem in general weak for this crop.
The second most diverse group was the West and Central Europe wine grape group, probably as a result of this area’s long history of grapevine cultivation and development, in combination, as already stated by other authors, with gene flow from local wild or primo-domesticated grapevines [9, 10, 18]. The Balkans and East Europe cluster also formed a well identified STRUCTURE group with an intermediate diversity. The two additional groups at Ks = 5 (the Iberian Peninsula group and the group of table grape obtentions) appeared as secondary groups with a lower overall diversity.
More generally, the full hierarchical partitioning obtained with the STRUCTURE and Ward methods as well as the Dest differentiation statistics appeared consistent with historical data, such as the diffusion of viticulture around the Mediterranean Sea, with one route connecting Eastern (W-3.2) to Western Europe through the Balkans and Central Europe (W-3.3, W-3.1) [2, 9], and a Southern route to the Maghreb and Iberian Peninsula (W-3.2 / W-5.1 / W-12.4).
The Balkans and Eastern Europe group and the Western and Central Europe group were both characterized by a large proportion of genotypes belonging to one STRUCTURE group only, probably corresponding to separate regional grapevine cultivar development and selection. In contrast, other regions such as Russia and Ukraine, the Iberian Peninsula, and the New World countries contain a mix of two or three STRUCTURE groups, in relation to their regional position. In particular, varieties found in Russia and Ukraine appear to have either East (S-3.2) or Balkans and East Europe (S-3.3) origins, consistent with what we know of the centralizing impact that Russian agricultural research had during the Soviet period. Similarly, the Iberian Peninsula group includes cultivars from West Europe (S-3.1), East (S-3.2) and Maghreb (S-5.1) as well as a high proportion of admixed genotypes, consistent with the long history of exchange this region had both with Europe and North Africa. Based on maternally inherited chloroplast markers, Arroyo-Garcia et al. suggested that the Iberian Peninsula could be a secondary center of domestication. Our results add a new view of Spain and Portugal as platforms of centralization, intermixing and exchange of varieties throughout history.
Finally, at Kw = 12, the genotypes from the eastern regions (proles orientalis) further subdivided into two sub-groups, one mainly composed of wine cultivars of Caucasian origin (including Georgia, Armenia, Azerbaijan and Turkey, W-12.12), and the other comprising table cultivars from Central Asia (Tajikistan, Uzbekistan, Turkmenistan) together with Iran and Afghanistan (W-12.11). The separation of these two groups may be a trace of divergent selection for the main local use of grapevine (table vs. wine). On the other hand, the absence of admixture in the Middle and Far East group, in particular for the 72 cultivars from Uzbekistan, Afghanistan, Tajikistan, Turkmenistan and Iran, and the high K scores of its genotypes may be an indication that the corresponding center of domestication was larger than formerly believed (several authors indeed placed it in a geographic region between the Black Sea and Iran [2, 3, 40, 41]), a hypothesis already proposed in 1976 by Olmo, but not confirmed by later studies. It is difficult to decide between these two scenarios since the information available on grapevine crop development is quite limited for Central Asian countries.
A large proportion of admixed genotypes was found by STRUCTURE, both at Ks = 3 and Ks = 5. A previous study on maize indicated that, in crops, STRUCTURE grouping is generally coherent for first cycle inbreds with simple parentage relationships, while the presence of multiple levels of family relationships and cohort overlapping in more advanced breeding systems leads to different grouping possibilities and low STRUCTURE stability. We can infer that our sample contains both types of material, with a number of ancient varieties anchoring the main clusters (founders), and recent breeds complicating structure resolution. The stability of the Ks = 3 and Ks = 5 groupings and the individual percentage of cluster ancestry allowed us to discriminate between these two types of material. The geographic distribution of the admixed genotypes is not random (Table 1): the Middle-Far East is the region displaying the lowest level of admixture, while Italy in particular, and secondly the Iberian Peninsula, display the largest proportions of admixed genotypes. We were unable to find other traits characterizing the admixed group: it is composed of even proportions of the phenotypic classes of grape use, berry color, flavor, berry seed number, and sex.
While confirming and reinforcing the observation of geographic structure of the cultivated gene pool already described by other authors [5, 9, 16, 18, 38], our results are also consistent with the study of Cipriani et al. suggesting that Italian varieties present weak or no structure: indeed in our study the Italian cultivars appear to be admixed, probably as a result of the inter-regional exchange role that Roman culture has certainly played.
Our results also provide information about how human selection on morphological traits shaped the genetic diversity of cultivated grapevine. Table and wine grapes have different berry sizes and bunch shapes, both important traits used for cultivar classification. Table and wine grapes are clearly separated by STRUCTURE at Ks = 3. At Ks = 5, only the group including Iberian and Maghreb cultivars (S-5.1) is composed of a mix of table and wine cultivars, which is likely the result of artificial selection and intimate intermixing of cultivars in this area.
The black color of berries is considered an ancestral trait relative to the other colors, at both the phenotypic and molecular levels. The molecular basis of the appearance of red, rose, grey and white berry colors has been previously documented [43–45] and the diffusion of the major causal mutations (Gret1 insertion and K980 mutation) within the cultivated compartment was described by Fournier-Level et al. In the present STRUCTURE analysis, the Central and West Europe subgroup (S-5.3) is composed of a majority of black cultivars. This can be explained by the isolation of these regions from the Eastern cultivars, by local domestication and gene flow from endemic black-berried V. v. sylvestris, or by human selection. All other subgroups include a large number of white cultivars, reinforcing the idea of a wide and strong diffusion of Gret1 over the whole geographic range of grapevine. Most of the intermediate phenotypes (red, rose and grey) are concentrated within two groups, Balkans and Central Europe (S-5.5) and East (S-5.2), confirming these regions as putative sources of color variation.
The geographical origin of Muscat flavor is assumed to be Greece or the Balkan Peninsula [46, 47]. Thereafter, human selection aimed to spread this desirable trait in both table and wine grapes. With STRUCTURE, we found the majority of Muscat founders within the Central Europe table group (S-5.4). Only a small number of them were involved in breeding, essentially in the Balkans, forming kin groups with other known parents such as Chasselas.
Seedless cultivars clustered essentially with cultivars of Turkish, Caucasian and Asian origins, belonging to the proles orientalis, consistent with available historical data about their origins in Turkey and the Near East.
The array of analytical methods used here contributed coherent information to interpret the geographic and phenotypic structure, pointing to the main differentiation axes that exist within the cultivated compartment of Vitis vinifera, as has been shown in other species, such as potato, poplar, and maize. Archaeological and historical data suggest that domestication of grapevine took place in the region spanning from the Fertile Crescent to South Caucasus [2, 3, 40, 41], and from there spread in three directions: a Northern route, through Greece and the Roman empire to its western borders; a Southern route, through Egypt and the Arab territories all the way to Spain during the last Arab invasions; and a third route towards Asia. The results of our analysis, in particular the three groups defined by Ks = 3 and the finding of intermixed resources in focal regions of grapevine development, are consistent with this historical scenario. New elements provided by our study are: i) the identification of the Iberian and Italian Peninsulas as regions of intermixing and exchange of varieties; ii) an East–West bottleneck effect due to limited cultivar migration, complemented by local selection and breeding; iii) a more extended center for grapevine primo-domestication, expanded to Central Asian countries.
Our results also suggest that the signs of weak genetic structure in grapevine found by earlier studies were probably due to regionally and/or numerically limited sampling. By examining the molecular data on the large INRA grape collection of Vassal, we have shown that grape diversity is structured into groups that reflect historical evolution, migration and human selection.
The subpopulations identified in this study, in particular at Ks = 3 or Ks = 5, will be very useful to define samples for linkage disequilibrium and genetic association studies, especially for traits of significance for local adaptation or sensitive to local human selection. The finding of large proportions of family relationships within structure groups should however be taken into account. Genetic association studies may certainly be optimized by the use of the structure and parentage matrices as covariates in the analysis.
In the near future, the study of additional cultivars from regions less represented in our sample (such as the Caucasus area), the study of wild genotypes from around the distribution area including Central Asian countries, and the use of other markers associated with genes of interest linked to domestication and selection processes, will provide more precise information about the evolution of Vitis vinifera.
The plant material was composed of cultivated grapevine varieties belonging to V. vinifera subsp. vinifera held in the INRA grape repository at Vassal (France). This collection includes 3,727 accessions available as field-grown plants and genotyped with 20 microsatellite markers.
Geographic origin of referenced cultivars was derived from general bibliography on ampelography and viticulture [53, 54]. For non-referenced cultivars, the origins were estimated on the basis of the accession origin. Recently bred cultivars (e.g., Tarrango) were assigned to the breeder’s country (in this example, Australia) and not to the countries of origin of their progenitors (in this case, Portugal and Turkey). Countries of origin were grouped as shown in Table 1.
List of the phenotypic traits recorded for group characterization and coded according to the OIV (2009) notation system
Trait: Level of expression
Utilization of the fruit: Wine grape (W); Table and/or raisin grape (T).
Berry skin color: White (B); Black (N); Rose (Rs); Grey (G); Red (Rg).
Berry flesh color: Non-colored (NoC); Colored (Col).
Flavor: None (No); Muscat (Mus); Herbaceous (Herb); Other (Oth).
Presence of seeds: Seeded berry (SD); Seedless berry (SL).
Sex of flower: Hermaphrodite (H); Female (F).
DNA extraction and genotyping
DNA extraction, PCR amplification and genotyping of microsatellites were carried out according to This et al. and Laucou et al. The 20 nuclear microsatellite loci were chosen according to their polymorphism level and their position on the linkage groups. Differences of one base pair between alleles at one given locus were double-checked by re-amplification and re-analysis; a test for the presence of null alleles was also carried out.
Since genotypes with only one or two allele differences represent closely related material, such as clones or recent mutants, and provide very little additional information to the analysis, these were considered redundant and not taken into account. Thus, out of the 2,323 single genotypes identified in Laucou et al., 2,096 genotypes presenting no missing data and at least three allele differences were analyzed in our study for further structure and clustering analyses (full list and characteristics are given in Additional file 1: Table S1).
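The redundancy filter described above can be illustrated with a minimal sketch. It assumes diploid profiles stored as one allele pair per locus and a greedy, order-dependent rule for deciding which of two near-identical genotypes is kept; the function names, the sample profiles and the greedy rule are illustrative assumptions, not the procedure actually used to build the dataset.

```python
from collections import Counter

def allele_differences(g1, g2):
    """Count alleles of g1 that are not matched in g2, summed over loci.

    Each genotype is a list of (allele, allele) pairs, one pair per locus,
    with both genotypes listing loci in the same order.
    """
    return sum(
        sum((Counter(a) - Counter(b)).values())
        for a, b in zip(g1, g2)
    )

def drop_redundant(genotypes, min_diff=3):
    """Greedy filter (an assumption): keep a genotype only if it differs
    from every genotype already kept by at least min_diff alleles."""
    kept = []
    for name, profile in genotypes:
        if all(allele_differences(profile, k) >= min_diff for _, k in kept):
            kept.append((name, profile))
    return kept

# Hypothetical three-locus profiles (allele sizes in base pairs).
sample = [
    ("cultivar_A", [(180, 184), (200, 200), (150, 156)]),
    ("clone_of_A", [(180, 184), (200, 202), (150, 156)]),  # 1 difference from A
    ("cultivar_B", [(182, 186), (204, 206), (150, 156)]),  # 4 differences from A
]
unique = drop_redundant(sample)
```

With these made-up profiles, the second entry differs from the first by a single allele and is discarded, while the third is retained.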
Main diversity statistics for the 20 microsatellite markers, such as the total number of alleles, expected heterozygosity and total gene diversity, were calculated using Genetix. A private allele index adjusted for sample bias was estimated with ADZE, following a generalized rarefaction approach. Genetic differentiation Dest was computed using the SMOGD software [61, 62], based on the method of Jost. Confidence intervals were calculated using 1,000 bootstraps. Polymorphism Information Content (PIC) was calculated according to Botstein et al., and the single-parent exclusion probabilities according to Jamieson & Taylor.
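As an illustration of two of these per-locus statistics, the sketch below computes expected heterozygosity (1 minus the sum of squared allele frequencies, without small-sample correction) and the PIC of Botstein et al. from the allele frequencies at one locus. It is not the Genetix implementation; the function names and the example genotypes are assumptions made for the example.

```python
from collections import Counter

def allele_frequencies(genotypes):
    """genotypes: list of (allele, allele) pairs observed at one diploid locus."""
    counts = Counter(allele for pair in genotypes for allele in pair)
    total = sum(counts.values())
    return {allele: n / total for allele, n in counts.items()}

def expected_heterozygosity(freqs):
    """He = 1 - sum(p_i^2), uncorrected for sample size."""
    return 1.0 - sum(p * p for p in freqs.values())

def pic(freqs):
    """PIC = 1 - sum(p_i^2) - sum_{i<j} 2 * p_i^2 * p_j^2."""
    p = list(freqs.values())
    homozygosity = sum(x * x for x in p)
    cross = sum(
        2 * p[i] ** 2 * p[j] ** 2
        for i in range(len(p))
        for j in range(i + 1, len(p))
    )
    return 1.0 - homozygosity - cross

# Hypothetical locus with two equally frequent alleles.
freqs = allele_frequencies([(180, 184), (180, 184)])
he = expected_heterozygosity(freqs)
pic_value = pic(freqs)
```

For two alleles at frequency 0.5 each, He is 0.5 and PIC is 0.375, which shows that PIC is always at most He.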
The dataset of 2,096 unique genotypes was used to run a series of tests with the Bayesian method implemented in STRUCTURE, in order to find the best model to infer population structure (with or without admixture, correlated allele frequencies, or prior information about sampling locations) and the best level of population subdivision Ks, with Ks varying from 2 to 12. Within STRUCTURE, we allowed an iterative process with a burn-in phase of 5 × 10^4 iterations and a sampling phase of 5 × 10^4 replicates. Ten replicates of each assumed Ks-level subdivision were compared to estimate the stability of group assignment.
We first evaluated the different models of admixture, allele frequencies and prior population information available in STRUCTURE. The most appropriate model to interpret our data appeared to be the uncorrelated allele frequencies and prior geographic information model, which showed a better stability between runs for Ks = 3 and 5, and a lower variance for Ks = 5, as compared to other possible STRUCTURE models (Additional file 3: Figure S1).
Since the geographic groups are not equally represented in Vassal, a second analysis was run to measure possible sampling effects, in particular the bias that could arise from oversampling one region or one family group. This was tested by running STRUCTURE on two different sets of genotypes (Table 1): the full set of 2,096 genotypes (set1) and a sub-set of 888 genotypes randomly drawn to constitute equally-sized, geographic origin-based groups (set2).
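Drawing an equally-sized random sample per geographic group can be sketched as follows. The group labels, the per-group size and the handling of groups smaller than the target (kept whole here) are assumptions made for the example; the actual composition of set2 is the one given in Table 1.

```python
import random
from collections import defaultdict

def balanced_subsample(origin_by_genotype, per_group, seed=0):
    """Randomly draw up to per_group genotypes from each geographic group.

    origin_by_genotype: {genotype_id: group_label}
    Groups with fewer than per_group members are kept whole (an assumption).
    """
    rng = random.Random(seed)  # fixed seed so the draw is reproducible
    groups = defaultdict(list)
    for genotype_id, origin in origin_by_genotype.items():
        groups[origin].append(genotype_id)
    sample = []
    for origin in sorted(groups):
        members = sorted(groups[origin])
        sample.extend(rng.sample(members, min(per_group, len(members))))
    return sample

# Hypothetical, deliberately unbalanced collection.
origins = {f"W{i}": "West" for i in range(10)}
origins.update({f"E{i}": "East" for i in range(3)})
subset = balanced_subsample(origins, per_group=3)
```

Running STRUCTURE on both the full and the balanced set then shows whether the inferred groups depend on the over-represented regions.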
Finally, the most probable uppermost level of structure subdivision between the successive Ks values was estimated with two methods: 1) the calculation of Evanno’s delta-K as the second order change in the likelihood function divided by the standard deviation of the likelihood [66, 67], and 2) the similarity coefficient between each pair of runs, which provides an evaluation of the stability of the solutions between runs.
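The delta-K statistic can be sketched from the log-likelihoods of replicate runs. The version below takes the second-order difference of the mean log-likelihood across replicates at each K and divides its absolute value by the standard deviation at that K, which is one common way of computing it; the function name and the example values are invented for illustration and are not results of this study.

```python
from statistics import mean, stdev

def delta_k(lnp_by_k):
    """lnp_by_k: {K: [log-likelihood of each replicate run at that K]}.

    Returns {K: delta-K} for every K that has both neighbours K-1 and K+1.
    """
    means = {k: mean(values) for k, values in lnp_by_k.items()}
    result = {}
    for k in sorted(lnp_by_k):
        if k - 1 in means and k + 1 in means:
            second_order = abs(means[k + 1] - 2 * means[k] + means[k - 1])
            sd = stdev(lnp_by_k[k])
            result[k] = second_order / sd if sd > 0 else float("inf")
    return result

# Made-up log-likelihoods for two replicates at K = 2, 3, 4.
example = {2: [-100.0, -102.0], 3: [-80.0, -82.0], 4: [-78.0, -80.0]}
dk = delta_k(example)
```

Delta-K is undefined at the smallest and largest K tested, so with Ks varying from 2 to 12 it can only be evaluated for Ks from 3 to 11.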
Genotypes were assigned to a cluster when 85% or more of their inferred genome belonged to the cluster, the genotypes with a lower score being considered as “admixed”. The chosen clusters for each Ks level were then labeled according to a three digit code (e.g. S-2.1, S-2.2,…, S-5.5) for further geographic and phenotypic characterization. A graphical display of the individual and group distances was obtained with a Principal Component Analysis (PCA) using the package adegenet implemented in R [68, 69].
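The 85% assignment rule can be written as a short function over the membership coefficients that STRUCTURE reports for each genotype. The function names are illustrative, and the mapping between a column of coefficients and the cluster number in the label is arbitrary here.

```python
def assign_cluster(q, threshold=0.85):
    """q: membership coefficients of one genotype, one value per cluster.

    Returns the 1-based index of the cluster holding at least `threshold`
    of the inferred genome, or None when the genotype is admixed.
    """
    best = max(range(len(q)), key=lambda i: q[i])
    return best + 1 if q[best] >= threshold else None

def cluster_label(q, threshold=0.85):
    """Build a label such as 'S-3.1' (Ks = 3, cluster 1) or 'admixed'."""
    cluster = assign_cluster(q, threshold)
    return "admixed" if cluster is None else f"S-{len(q)}.{cluster}"

# Hypothetical coefficients at Ks = 3.
label_pure = cluster_label([0.90, 0.06, 0.04])
label_mixed = cluster_label([0.60, 0.30, 0.10])
```

A stricter or looser threshold changes only the share of genotypes classified as admixed, not the clusters themselves.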
To validate the STRUCTURE clustering, we compared its output with that obtained using a less constrained method of clustering. Odong et al. highlighted that STRUCTURE and Ward’s method are convergent and complementary. We therefore used Ward’s method, which at each step merges the two clusters whose union gives the smallest increase in the within-cluster sum of squares. Using the Ward dissimilarity matrix, we built a dendrogram with the DARwin software. The advantage of Ward clustering is that it provides details of the relationships at any level, down to the family level. One disadvantage is that it does not deal with admixed genotypes. We indexed the Ward subdivision levels as Kw, and labeled the subgroups accordingly (W-2.1, W-2.2,…, W-12.12).
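Ward's agglomeration can be sketched in a few lines using the Lance-Williams update on squared dissimilarities. This is a didactic version, not the DARwin implementation; squaring the input dissimilarities and the tie-breaking order are assumptions of the sketch, and the example matrix is invented.

```python
def ward_merges(dist):
    """dist: symmetric matrix (list of lists) of pairwise dissimilarities.

    Returns the merge history as (cluster_a, cluster_b, new_cluster_id).
    Original items are numbered 0..n-1; merged clusters get ids n, n+1, ...
    """
    n = len(dist)
    size = {i: 1 for i in range(n)}
    # Squared dissimilarities, keyed by (smaller_id, larger_id).
    d2 = {(i, j): dist[i][j] ** 2 for i in range(n) for j in range(i + 1, n)}
    merges = []
    next_id = n
    while len(size) > 1:
        # The closest pair is the one whose merge least increases the
        # within-cluster sum of squares.
        (a, b), dab = min(d2.items(), key=lambda item: item[1])
        na, nb = size[a], size[b]
        for k in size:
            if k in (a, b):
                continue
            nk = size[k]
            dka = d2.pop((min(k, a), max(k, a)))
            dkb = d2.pop((min(k, b), max(k, b)))
            # Lance-Williams update for Ward's criterion.
            d2[(k, next_id)] = (
                (na + nk) * dka + (nb + nk) * dkb - nk * dab
            ) / (na + nb + nk)
        del d2[(a, b)]
        del size[a], size[b]
        size[next_id] = na + nb
        merges.append((a, b, next_id))
        next_id += 1
    return merges

# Four hypothetical items lying at positions 0, 1, 10 and 11 on a line.
points = [0, 1, 10, 11]
matrix = [[abs(p - q) for q in points] for p in points]
history = ward_merges(matrix)
```

Cutting the merge history at successive steps yields the nested subdivision levels that a dendrogram displays, which is why every item always falls in exactly one group and admixture cannot be represented.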
Genetic structure partitioning between and within groups - Family relationships
To estimate the part of the population genetic structure due to parentage, we first calculated the most probable family relationship between each pair of genotypes using the ML-Relate software. Genotype pairs (half of a 2,096 × 2,096 matrix minus the diagonal, (n² − n)/2 = 2,195,560 pairs) were declared either unrelated or family-related; the latter category groups full sibs, half-sibs and parent-offspring pairs, in order to lower the chance of false attribution. Only non-ambiguous relationship assignments (according to 99.9% confidence intervals calculated in 100 mating population simulations) with an experimentally determined LOD score > 9 were taken into account. For each of the above subdivisions we also calculated a weighted average relatedness r 2.
The results of this analysis were then assigned to categories of STRUCTURE subdivision (e.g. within or among subgroups), according to the group of each parent. Within-group average relatedness was estimated with the formula of Queller and Goodnight, and its standard deviation with 100 jackknife replicates over loci, using the RERAT software.
Phenotypic evaluation of the different K levels
To interpret the population structure in terms of cultivar utilization, movement and history, each subpopulation was finally characterized for its flower and fruit traits and for its geographic origin. Group names were ultimately based on their main characteristics.
Thierry Lacombe, Blaise Genna: (http://www1.montpellier.inra.fr/vassal/)
The authors thank the staff of the INRA Vassal grapevine collection and more specifically Thierry Dessup, Sandrine Lalet and Mireille Dessup. We thank Philippe Chatelet for helpful comments and text editing. This work was supported in part by a grant from the French Ministry of Agriculture, Fisheries and Forests (Centres de Ressources Biologiques).
- Levadoux L: Les populations sauvages et cultivées de Vitis vinifera L. Ann. Amélior Plantes. 1956, 6: 59-117.
- Zohary D: Domestication of the Grapevine Vitis vinifera L. in the Near East. The origins and ancient history of wine. Edited by: McGovern P, Fleming S, Katz S. 1996, Gordon and Breach, New York, 23-30.
- McGovern P: Ancient wine: the search for the origins of viniculture. 2004, Princeton University Press, Princeton
- This P, Lacombe T, Thomas M: Historical origins and genetic diversity of wine grapes. Trends Genet. 2004, 22: 511-519.
- Aradhya MK, Dangl GS, Prins BH, Boursiquot JM, Walker MA, Meredith CP, Simon CJ: Genetic structure and differentiation in cultivated grape, Vitis vinifera L. Gene Res. 2003, 81: 179-192. 10.1017/S0016672303006177.
- Salmaso M, Faes G, Segala C, Stefanini M, Salakhutdinov L, Zyprian E, Toepfer R, Grando MS, Velasco R: Genome diversity and gene haplotypes in the grapevine (Vitis vinifera L.), as revealed by single nucleotide polymorphisms. Mol Breed. 2004, 14: 385-395. 10.1007/s11032-004-0261-z.
- Di Vecchi-Staraz M, Laucou V, Bruno G, Lacombe T, Gerber S, Bourse T, Boselli M, This P: Low level of pollen-mediated gene flow from cultivated to wild grapevine: consequences for the evolution of the endangered subspecies Vitis vinifera L. subsp. sylvestris. J Hered. 2008, 100: 66-75. 10.1093/jhered/esn084.
- Laucou V, Lacombe T, Dechesne F, Siret R, Bruno JP, Dessup M, Dessup T, Ortigosa P, Parra P, Roux C, Santoni S, Varès D, Peros JP, Boursiquot JM, This P: High throughput analysis of grape genetic diversity as a tool for germplasm collection management. Theor Appl Genet. 2011, 122: 1233-1245. 10.1007/s00122-010-1527-y.
- Arroyo-Garcia R, Ruiz-Garcia L, Bolling L, Ocete R, Lopez MA, Arnold C, Ergul A, Soylemezoglu G, Uzun HI, Cabello F, Ibanez J, Aradhya MK, Atanassov A, Atanassov I, Balint S, Cenis JL, Costantini L, Goris-Lavets S, Grando MS, Klein BY, McGovern PE, Merdinoglu D, Pejic I, Pelsy F, Primikirios N, Risovannaya V, Roubelakis-Angelakis KA, Snoussi H, Sotiri P, Tamhankar S, This P, Troshin L, Malpica JM, Lefort F, Martinez-Zapater JM: Multiple origins of cultivated grapevine (Vitis vinifera L. ssp. sativa) based on chloroplast DNA polymorphisms. Mol Ecol. 2006, 15: 3707-3714. 10.1111/j.1365-294X.2006.03049.x.
- Grassi F, Labra M, Imazio S, Spada A, Sgorbati S, Scienza A, Sala F: Evidence of a secondary grapevine domestication centre detected by SSR analysis. Theor Appl Genet. 2003, 107: 1315-1320. 10.1007/s00122-003-1321-1.
- Zohary D, Hopf M: Domestication of plants in the Old World: the origin and spread of cultivated plants in West Asia, Europe and the Nile Valley. 2000, Oxford University Press, Oxford
- Olmo HP: Grapes. Vitis, Muscadinia. Evolution of crop plants. Edited by: Simmonds NW. 1976, Longman, London. UK, 294-298.
- Pouget R: The history of the control of grapevine phylloxera in France (1868–1895). 1990, INRA, Paris, 157-
- Boursiquot JM, Lacombe T, Laucou V, Julliard S, Perrin FX, Lanier N, Legrand D, Meredith C, This P: Parentage of Merlot and related winegrape cultivars of Southwestern France: discovery of the missing link. Aust J Grape Wine Res. 2008, 15: 144-155.
- Bowers J, Boursiquot JM, This P, Chu K, Johansson H, Meredith C: Historical genetics: the parentage of Chardonnay, Gamay, and other winegrapes of Northeastern France. Science. 1999, 285: 1562-1565. 10.1126/science.285.5433.1562.
- Imazio S, Labra M, Grassi F, Scienza A, Failla O: Chloroplast microsatellites to investigate the origin of grapevine. Genet Resour Crop Evol. 2006, 53: 1003-1011. 10.1007/s10722-004-6896-0.
- Cipriani G, Spadotto A, Jurman I, Di Gaspero G, Crespan M, Meneghetti S, Frare E, Vignani R, Cresti M, Morgante M, Pezzotti M, Pe E, Policriti A, Testolin R: The SSR-based molecular profile of 1005 grapevine (Vitis vinifera L.) accessions uncovers new synonymy and parentages, and reveals a large admixture amongst varieties of different geographic origin. Theor Appl Genet. 2010, 121: 1569-1585. 10.1007/s00122-010-1411-9.
- Myles SBA, Owens CL, Brown PJ, Grassi F, Aradhya MK, Prins B, Reynolds A, Jer-Ming C, Wareh D, Bustamanteb CD, Buckler ES: Genetic structure and domestication history of the grape. PNAS. 2010, 108: 3530-3535.
- Fournier-Level L, Le Cunff L, Boursiquot JM, This P: Evolution of the VvMybA gene family, the major determinant of berry colour in cultivated grapevine (Vitis vinifera L.). Heredity. 2010, 104: 351-362. 10.1038/hdy.2009.148.
- Emanuelli F, Battilana J, Costantini L, Le Cunff L, Boursiquot JM, This P, Grando MS: A candidate gene association study on muscat flavor in grapevine (Vitis vinifera L.). BMC Plant Biol. 2010, 10: 241-10.1186/1471-2229-10-241.
- Centre de resources génétiques de la vigne, Collection de Vassal: INRA, France, http://www1.montpellier.inra.fr/vassal/collections/presentation.html,
- Boursiquot JM, Dessup M, Rennes C: Distribution of the main phenological, agronomical and technological characters of Vitis vinifera L. Vitis. 1995, 34: 31-35.
- Doligez A, Adam-Blondon AF, Cipriani G, Laucou V, Merdinoglu D, Meredith CP, Riaz S, Roux C, This P, Di Gaspero G: An integrated SSR map of grapevine based on five mapping populations. Theor Appl Genet. 2006, 113: 369-382. 10.1007/s00122-006-0295-1.
- Harter AV, Gardner KA, Falush D, Lentz DL, Bye RA, Rieseberg LH: Origin of extant domesticated sunflowers in Eastern North America. Nature. 2004, 430: 201-205. 10.1038/nature02710.
- Vigouroux Y, Mitchell S, Matsuoka Y, Hamblin M, Kresovich S, Smith JSC, Jaqueth J, Smith OS, Doebley J: An analysis of genetic diversity across the maize genome using microsatellites. Genetics. 2005, 169: 1617-1630.
- Hamblin MT, Warburton ML, Buckler ES: Empirical comparison of simple sequence repeats and single nucleotide polymorphisms in assessment of maize diversity and relatedness. PLoS One. 2007, 12: e1367.
- Pritchard JK, Stephens M, Donnelly P: Inference of population structure using multilocus genotype data. Genetics. 2000, 155: 945-959.
- Ward JH: Hierarchical grouping to optimize an objective function. J Am Stat Assoc. 1963, 58: 236-244.
- Perrier X, Jacquemoud-Collet JP: DARwin software. 2006, Cirad, http://darwin.cirad.fr/darwin,
- Odong TL, van Heerwaarden J, Jansen J, van Hintum TJL, van Eeuwijk FA: Determination of genetic structure of germplasm collections: are traditional hierarchical clustering methods appropriate for molecular marker data?. Theor Appl Genet. 2011, 123: 195-205. 10.1007/s00122-011-1576-x.
- Eyre-Walker A, Gaut RL, Hilton H, Feldman DL, Gaut BS: Investigation of the bottleneck leading to the domestication of maize. Proc Natl Acad Sci USA. 1998, 95: 4441-4446. 10.1073/pnas.95.8.4441.
- Kilian B, Ozkan H, Walther A, Kohl J, Dagan T, Salamini F, Martin W: Molecular diversity at 18 loci in 321 wild and 92 domesticated lines reveal no reduction of nucleotide diversity during Triticum monococcum (einkorn) domestication: Implications for the origin of agriculture. Mol Biol Evol. 2007, 24: 2657-2668. 10.1093/molbev/msm192.
- McCouch S: Diversifying selection in plant breeding. PLoS Biol. 2004, 2 (10): e347-10.1371/journal.pbio.0020347.
- Boursiquot JM, Lacombe L, Bowers J, Meredith C: Gouais, a key vine of the European wine heritage. Bulletin de l'OIV. 2004, 77: 875-876.
- Bowers JE, Siret R, Meredith CP, This P, Boursiquot JM: A single pair of parents proposed for a group of grapevine varieties in northeastern France. Acta Horticulturae. 2000, 528: 129-132.
- Vouillamoz JF, Grando MS: Genealogy of wine grape cultivars: ‘Pinot’ is related to ‘Syrah’. Heredity. 2006, 97: 102-110. 10.1038/sj.hdy.6800842.
- Lacombe T, Boursiquot JM, Laucou V, Di Vecchi-Staraz M, Peros JP, This P: Large-scale parentage analysis in an extended set of grapevine cultivars (Vitis vinifera L.). Theor Appl Genet. 10.1007/s00122-012-1988-2. Published online 27 September 2012
- Negrul AM: Origin and classification of cultured grape. The Ampelography of the USSR. Edited by: Baranov A, Kai YF, Lazarevski MA, Palibin TV, Prosmoserdov NN. 1946, Pischepromizdat, Moscow, 159-216.
- Pistorius R: Scientists, plants and politics: a history of the plant genetic resources movement. 1997, International Plant Genetic Resources Institute, Rome, Italy
- McGovern PE, Rudolph HM: The analytical and archaeological challenge of detecting ancient wine: two case studies from the ancient Near East. The origins and ancient history of wine. Edited by: McGovern P, Fleming S, Katz S. 1996, Gordon and Breach, New York, 57-67.
- McGovern PE, Glusker DL, Exner LJ, Voigt MM: Neolithic resinated wine. Nature. 1996, 381: 480-481. 10.1038/381480a0.
- Camus-Kulandaivelu LJ, Veyrieras B, Gouesnard B, Charcosset A, Manicacci D: Evaluating the reliability of Structure outputs in case of relatedness between individuals. Crop Science. 2007, 47: 887-892. 10.2135/cropsci2006.06.0366N.
- Kobayashi S, Goto-Yamamoto N, Hirochika H: Association of VvMybA1 gene expression with anthocyanin production in grape (Vitis vinifera) skin-color mutants. J Jap Soc Hortic Sci. 2005, 74: 196-203. 10.2503/jjshs.74.196.
- This P, Lacombe T, Cadle-Davidson M, Owens CL: Wine grape (Vitis vinifera L.) color associates with allelic variation in the domestication gene VvMybA1. Theor Appl Genet. 2007, 114: 723-730. 10.1007/s00122-006-0472-2.
- Walker AR, Lee E, Bogs J, McDavid DAJ, Thomas MR, Robinson SP: White grapes arose through the mutation of two similar and adjacent regulatory genes. Plant J. 2007, 49: 772-785. 10.1111/j.1365-313X.2006.02997.x.
- Bronner A: Muscats and muscat varieties: survey and universal synonyms, from the origin to the present. INRA Editions, Versailles France:Hachette, Paris: INRA editions, Montpellier:2003..
- Crespan M, Milani N: The Muscats: A molecular analysis of synonyms, homonyms and genetic relationships within a large family of grapevine cultivars. Vitis. 2001, 40: 23-30.
- Bioletti FT: The Seedless Raisin Grapes. Bulletin of the California Agricultural Experimental Station. 1921, 298: 75-86.
- Spooner DM, McLean K, Ramsay G, Waugh R, Bryan GJ: A single domestication for potato based on multilocus amplified fragment length polymorphism genotyping. Proc Nat Acad Sci. 2005, 102: 14694-14699. 10.1073/pnas.0507400102.
- Smulders MJM, Cottrell JE, Lefèvre F, van der Schoot J, Arens P, Vosman B, Tabbener HE, Grassi F, Fossati T, Castiglione S, Krystufek V, Fluch S, Burg K, Vornam B, Pohl A, Gebhardt K, Alba N, Agúndez D, Maestro C, Notivol E, Volosyanchuk R, Pospíšková M, Bordács S, Bovenschen J, van Dam BC, Koelewijn HP, Halfmaerten D, Ivens B, van Slycken J, Vanden Broeck A, Storme V, Boerjan W: Structure of the genetic diversity in black poplar (Populus nigra L.) populations across European river systems: Consequences for conservation and restoration. For Ecol Manag. 2008, 255: 1388-1399. 10.1016/j.foreco.2007.10.063.
- Vigouroux Y, Glaubitz JC, Matsuoka Y, Goodman MM, Jesus SG, Doebley J: Population structure and genetic diversity of new world maize races assessed by DNA microsatellites. Am J Bot. 2008, 95: 1240-1253. 10.3732/ajb.0800097.
- Yu JM, Pressoir G, Briggs WH, Bi IV, Yamasaki M, Doebley JF, McMullen MD, Gaut BS, Nielsen DM, Holland JB, Kresovich S, Buckler ES: A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet. 2006, 38: 203-208. 10.1038/ng1702.
- Galet P: Dictionnaire encyclopédique des cépages. 2000, Hachette, Paris
- The Vitis International Variety Catalogue (VIVC): 2010, http://www.vivc.de/index.php,
- Truel P: Catalogue des variétés de vigne en collection. 1985, INRA editions, Montpellier, 129-
- Organisation Internationale de la vigne et du vin: OIV descriptor list for grape varieties and Vitis species 2nd edition. Office International de la Vigne et du Vin, Paris: 2009.
- van Oosterhout C, Hutchinson HF, Derek P, Wills M, Shipley P: MICRO-CHECKER: software for identifying and correcting genotyping errors in microsatellite data. Molecular Ecology Notes. 2004, 4: 535-538. 10.1111/j.1471-8286.2004.00684.x.
- Nei M: Molecular Evolutionary Genetics. Columbia University Press, New York: 1987,
- Belkhir K, Borsa P, Chikhi L, Raufaste N, Bonhomme F: GENETIX 4.05. CNRS UMR. 5171, Université de Montpellier II, Montpellier, http://www.genetix.univ-montp2.fr/genetix/intro.htm,
- Zachary A, Szpiech ZA, Jakobsson M, Rosenberg NA: ADZE: a rarefaction approach for counting alleles private to combinations of populations. Bioinformatics. 2008, 24: 2498-2504. 10.1093/bioinformatics/btn478.
- Crawford NG: SMOGD: software for the measurement of genetic diversity. Mol Ecol Resourc. 2010, 10: 556-557. 10.1111/j.1755-0998.2009.02801.x.
- SMOGD: Software for the Measurement of Genetic Diversity: (vsn. 1.2.5). http://www.ngcrawford.com/django/jost/,
- Jost L: GST and its relatives do not measure differentiation. Mol Ecol. 2008, 17: 4015-4026. 10.1111/j.1365-294X.2008.03887.x.
- Botstein D, White RL, Skolnick M, Davis RW: Construction of a genetic linkage map in man using restriction fragment length polymorphism. Am J Hum Genet. 1980, 32: 314-331.
- Jamieson A, Taylor SS: Comparisons of three probability formulae for parentage exclusion. Anim Genet. 1997, 28: 397-400. 10.1111/j.1365-2052.1997.00186.x.
- Evanno G, Regnaut S, Goudet J: Detecting the number of clusters of individuals using the software structure: a simulation study. Mol Ecol. 2005, 14: 2611-2620. 10.1111/j.1365-294X.2005.02553.x.
- Rosenberg NA, Pritchard JK, Weber JL, Cann HM, Kidd KK, Zhivotovsky LA, Feldman MW: Genetic structure of human populations. Science. 2002, 298: 2381-2385. 10.1126/science.1078311.
- Jombart T: adegenet: a R package for the multivariate analysis of genetic markers. Bioinformatics. 2008, 24: 1403-1405. 10.1093/bioinformatics/btn129.
- R Core Team: R: A Language and Environment for Statistical Computing. 2012, R Foundation for Statistical Computing, Vienna, Austria, http://www.R-project.org,
- Kalinowski ST, Wagner AP, Taper ML: ML-Relate: a computer program for maximum likelihood estimation of relatedness and relationship. Mol Ecol Notes. 2006, 6: 576-579. 10.1111/j.1471-8286.2006.01256.x.
- Queller DC, Goodnight KF: Estimating relatedness using genetic markers. Evolution. 1989, 43: 258-275. 10.2307/2409206.
- Schwacke L, Schwacke J, Rosel P: RERAT: relatedness estimation and rarefaction analysis tool. 2005, Available from: http://people.musc.edu/~schwaclh/
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.