- Research article
- Open Access
Dissecting genetic architecture of grape proanthocyanidin composition through quantitative trait locus mapping
BMC Plant Biology volume 12, Article number: 30 (2012)
Proanthocyanidins (PAs), or condensed tannins, are flavonoid polymers, widespread throughout the plant kingdom, which provide protection against herbivores while conferring organoleptic and nutritive values to plant-derived foods, such as wine. However, the genetic basis of qualitative and quantitative PA composition variation is still poorly understood. To elucidate the genetic architecture of the complex grape PA composition, we first carried out quantitative trait locus (QTL) analysis on a 191-individual pseudo-F1 progeny. Three categories of PA variables were assessed: total content, percentages of constitutive subunits and composite ratio variables. For nine functional candidate genes, among which eight co-located with QTLs, we performed association analyses using a diversity panel of 141 grapevine cultivars in order to identify causal SNPs.
Multiple QTL analysis revealed a total of 103 and 43 QTLs, respectively for seed and skin PA variables. Loci were mainly of additive effect while some loci were primarily of dominant effect. Results also showed a large involvement of pairwise epistatic interactions in shaping PA composition. QTLs for PA variables in skin and seeds differed in number, position, involvement of epistatic interaction and allelic effect, thus revealing different genetic determinisms for grape PA composition in seeds and skin. Association results were consistent with QTL analyses in most cases: four out of nine tested candidate genes (VvLAR1, VvMYBPA2, VvCHI1, VvMYBPA1) showed at least one significant association with PA variables, especially VvLAR1 revealed as of great interest for further functional investigation. Some SNP-phenotype associations were observed only in the diversity panel.
This study presents the first QTL analysis on grape berry PA composition with a comparison between skin and seeds, together with an association study. Our results suggest a complex genetic control for PA traits and different genetic architectures for grape PA composition between berry skin and seeds. This work also uncovers novel genomic regions for further investigation in order to increase our knowledge of the genetic basis of PA composition.
Proanthocyanidins (PAs), or condensed tannins, are flavonoid polymers widespread throughout the plant kingdom. They accumulate in many organs and tissues to provide protection against pests . They are also determinant in food quality and their beneficial effects on human health are increasingly investigated [1, 2]. These diverse qualities are directly linked to PA chemical structures. As polymers, PA structure varies depending on the degree of polymerisation and the nature of building blocks, the flavan-3-ols (differences in stereochemistry, hydroxylation pattern on the B-ring and presence/absence of a galloyl group, Figure 1). Our understanding of PA biosynthesis has been significantly improved through the isolation of two genes coding for leucoanthocyanidin reductase (LAR, ) and anthocyanidin reductase (ANR, [4, 5]), two specific enzymes for the formation of flavan-3-ols, respectively (+)-(gallo)catechin and (-)-epi(gallo)catechin. However, several issues concerning PA composition require further study, such as the synthesis of galloylated units, the genetic mechanism of polymerisation, and the origin of extension units, since all flavonoid intermediates are believed to assume a 2,3-trans configuration, similar to the 2,3-configuration of (+)-(gallo)catechin (Figure 1), while major PA extension blocks assume a 2,3-cis configuration (e.g. (-)-epicatechin, Figure 1). Moreover, few studies are available on the genetic basis of PA composition quantitative variation [6, 7].
It is of great interest to understand PA genetics in grape since PAs are involved in grapevine self-defence mechanisms and are responsible for major organoleptic properties of red wine [8–10]. Because of its rich PA composition and the multiple genetic and genomic tools available for this species, such as the whole genome sequence [11, 12], grape could represent also an interesting model for PA genetic study. Indeed, in Arabidopsis, a major model for PA studies, PAs are only detected in the seed coat with the presence of (-)-epicatechin as sole building block. By contrast, PAs are present in different organs of grapevine and are composed of four major building blocks: (+)-catechin, (-)-epicatechin, (-)-epigallocatechin and (-)-epicatechin-3-O-gallate [13–16] while (+)-gallocatechin and (-)-epigallocatechin-3-O-gallate are present in trace amounts only . PAs are abundant in grape berries with drastic differences in composition between skin and seeds: total content is usually higher in seeds while polymer size is much larger in skin [15, 16]. In terms of constitutive building blocks, (-)-epigallocatechin is a major component of grape skin PAs  while it is not detectable in seeds ; (-)-epicatechin-3-O-gallate is present in large proportion as both extension and terminal subunits in seeds while it is present only in small amounts in skin [15, 16]. Advances in understanding grape PA synthesis have been mainly obtained through homologous cloning [14, 17–21]. However, the complex PA composition within a tissue and the contrasted composition between tissues suggest a complex interaction of many factors in the determinism of grape PA composition.
One way to assess genetic determinism of trait variation without a priori knowledge is quantitative trait loci (QTL) mapping. QTL mapping makes use of segregating populations and gives global insights into the genetic architecture of the target phenotype, i.e. the number, position and effects of genomic regions . Among all available mapping methods, the multiple-QTL approach is particularly suitable for complex trait analysis since it uses simultaneously multiple marker intervals with possible inclusion of epistasis terms in QTL mapping model . Instead of creating segregating populations, one can also explore the existing diversity through association mapping to identify loci involved in phenotypic variation . According to the genetic architecture of target traits, one can build appropriate breeding strategy and/or develop further gene function studies.
The aim of this work was to investigate the genetic determinism of PA composition variation in both skin and seeds of grape. For this purpose, we first characterised skin and seed grape PA composition in a pseudo-F1 progeny derived from a cross between Syrah and Grenache cultivars. Three categories of PA variables were constructed in order to capture the complex PA profiles: total content variables, percentages of constitutive subunits, which assessed the biosynthesis efficiency among building blocks, and composite ratio variables, which included estimation of polymer size and metabolite flux between building blocks. We then applied a multiple-QTL genome scan to identify main effect QTLs and pairwise epistatic interactions for PA variables. Nine functional candidate genes, among which eight co-located with QTLs, were sequenced and their SNP-phenotype associations were investigated in a grapevine diversity panel. We present here the first extensive study of genetic architecture of PA composition in grape and confirm the involvement of some candidate genes in PA composition variation.
The two grapevine populations used in this study have been previously described [25–27]. Briefly, the QTL mapping population (S × G) consisted of a pseudo-F1 progeny of 191 individuals from a cross between two wine grape cultivars, Syrah (S) and Grenache (G) and was maintained under classical local training system (3300 plants/ha plant density) at Montpellier SupAgro Domaine du Chapitre (Hérault, France). The S × G population was planted in two blocks. Each individual from the progeny was planted in two elementary plots (one per block) comprising five plants each. Parental cultivars were also planted in each block with nine and 43 elementary plants for Syrah and Grenache, respectively. The association mapping population (CC) consisted of a core-collection of 141 cultivars maximising agro-morphological diversity for 50 quantitative and qualitative traits, and maintained at INRA Domaine de Vassal (Hérault, France) .
DNA extraction, marker genotyping and gene sequencing
DNA extraction and marker genotyping were already described in Adam-Blondon et al. . In order to densify a previous 97-SSR linkage map of the S × G population , additional SSR markers, heterozygous both in Syrah and Grenache, were chosen from recent grapevine reference maps . Based on the 12X grapevine reference sequence (Grape Genome Browser http://www.genoscope.cns.fr), we designed primers for candidate gene amplification using Primer3 (http://www.bioinformatics.nl/cgi-bin/primer3plus/primer3plus.cgi) with default parameters. Primers used in this study are listed in Additional file 1. For SNP analysis, gene fragments were amplified, sequenced and analysed as described in .
Linkage map construction
Framework maps were constructed based on the 97-marker linkage map of  with 56 additional SSRs. All 153 markers had a genotypic error rate lower than 1.5%, using Tmap as check . Linkage maps were constructed using CarthaGène 0.999R  as described in  with Haldane mapping function. The "Syrah" and "Grenache" framework maps were composed of 121 SSRs (total length 1118.8 cM), and 133 SSRs (1349.4 cM total length) respectively. The "Consensus" framework map spanned 1256.4 cM based on 153 SSRs among which over 70% allowed segregation in four genotypic classes in the pseudo F1 progeny (ab × ac and ab × cd). Marker order reliability was ensured at LOD 2 threshold. Segregation distortion on genotypic classes was verified by a χ2 test according to the segregating type of each marker for the different maps (e.g. for markers segregating as ab × cd in the consensus map, the H0 hypothesis was ac:bc:ad:bd = 1:1:1:1). Twenty-five markers out of 153 exhibited distorted segregation (P < 0.05) and were mainly grouped on chromosomes 3 (4 markers), 4 (6 markers) and 10 (5 markers). Markers on chromosome 4 exhibited the most significant allelic deviation (P < 0.001) due to segregating distortion between Syrah alleles (aa:ab ~2/3:1/3).
Phenotyping and PA variable construction
Grapes were harvested at maturity (20° Brix). For each genotype, eight representative berry clusters were harvested from the five plants of the elementary plot. Sample homogenisation was based on the accumulation of total solutes (principally sucrose), a major marker of berry development during ripening. Berry density was assessed by floatation in salt solutions . Twenty-five berries with a density between 130 and 160 g NaCl/L were randomly selected. In the present study, we focused on the analysis of berry skin and seeds since PA concentration is quite low in flesh, flesh PAs accounting for only 2-6% of the total berry PA content . Berry skin and seeds were separated, ground in liquid nitrogen and stored at-80°C until analysis. PAs were extracted and analysed by high performance liquid chromatography (HPLC) after acid-catalysed cleavage in the presence of phloroglucinol according to . For the S × G population, both skin and seeds were analysed for 2 consecutive years: skin was analysed in 2005 (1 block) and 2006 (2 blocks) while seeds were analysed in 2006 (2 blocks) and 2007 (2 blocks). For the CC population, skin was analysed in 2005 and 2006 and seeds in 2006.
In order to obtain an exhaustive view of PA composition, three categories of PA variables were studied in this work: total PA content, subunit percentage and composite ratio variables. For total content variables, concP (mg/g fresh weight) reflects the biosynthesis intensity in each tissue, concB (mg/berry) brings total content to single berry level by taking into account berry size while concK (mg/kg berries) is a common enological measurement taking into account yield-related traits. Since all PA building blocks are derived from the same intermediate structure, naringenin chalcone, we used the percentage of each PA subunit to total subunit quantity to assess partitioning efficiency between PA building blocks. Our PA characterisation method did not distinguish between terminal units of polymers and flavan-3-ol monomers. The notation ending with "T" corresponds thus to the sum of terminal subunits and monomers. We also constructed composite ratio variables, including mean degree of polymerisation (mDP), ratio of 2,3-trans- to 2,3-cis-subunits in extension position (Ftranscis_Ex), terminal position (Ftranscis_T) and overall subunits (Ftranscis_all) and the ratio of B-ring di-hydroxylated to B-ring tri-hydroxylated subunits (F3pr35). Variables studied in this work are summarised in Table 1.
Phenotypic data analysis
All statistical analyses were performed with R software . We identified the best-fit mixed model for each PA variable through Bayesian information criterion (BIC) in order to extract the best linear unbiased predictors (BLUPs) for genotypic values and to estimate the broad sense heritability (H2 ). Mixed model fit was performed with lme4 package . The mixed model assumption of normality of residual and BLUPs was checked after model fitting by quantile-quantile plot comparing the distribution of residual and random effect predictors to a theoretical normal distribution (Additional file 2). No data transformation was performed for PA variables measured in the two populations. More information regarding phenotypic data analysis and best-fit model for each PA variable is in Additional file 3.
QTL analysis was performed on the genotypic BLUPs with R/qtl package . Multiple QTL regression was carried out with "stepwiseqtl" function. This approach uses forward/backward selection to identify a multiple-QTL model with inclusion of both main effect QTLs and pairwise interactions. Maximum QTL number was set to 10 for forward selection (max.qtl = 10). Model choice was made via a penalized LOD score (pLOD) which is the LOD score for the model (the log likelihood ratio comparing the full model to the null model without QTL) with penalties on the number of QTLs and pairwise QTL × QTL interactions . For each PA variable, specific penalties for main effect and digenic pairwise interaction terms were derived from 1000 permutations of two-dimensional scan (the "scantwo" function, method = "hk", n.perm = 1000) and penalties at genome-wide error rate of 0.05 were used for multiple-QTL model fitting. The QTL model with the largest pLOD was identified as the most probable one. Once determined the multiple QTL model, we refined QTL position ("refineqtl" function) and estimate R2 for the whole model and each term of the model, the individual LOD score of each term and the genotypic effect ("fitqtl" function). The "lodint" function was used to derive LOD-1 QTL location confidence interval. Allelic effects for consensus QTLs were estimated as described by Segura et al. . Genome scan was performed with a 1 cM step.
Nine candidate genes were selected for association test according to their function and co-localisation with QTLs. Prior to association test, we used R kinship package  to perform model comparison among different nested models according to  in order to select the best fitted model for association test for each PA variable. Ancestry structure and kinship matrix were estimated based on 20 SSR markers located throughout the whole genome as described in .
After model comparisons, we used TASSEL package to perform association tests . Two models were used: one accounting for ancestry structure effect (with General Linear Model, or GLM in TASSEL) the other for both ancestry structure effect and random genetic background effect (with Mixed Linear Model, or MLM in TASSEL). Association tests were performed on BLUPs for skin variables and raw data for seed variables since seed data were available for 2006 only. For GLM analyses, tests were run with 1000 permutations allowing the determination of site-wise P value, whi ch is the probability of a greater F value under the null hypothesis that polymorphism was independent of the phenotype. The adjusted P value (called p_adj_Marker in TASSEL), is the site-wise P value adjusted for multiple tests which takes into account the dependence between SNPs due to linkage disequilibrium. Because each gene was tested independently, we used an additional Bonferroni correction to correct for the number of studied genes (nine) which led to a threshold of 0.0056 for the adjusted P value. As the permutation method is not available for MLM, we used the threshold proposed by Benjamini and Hochberg  with q equal to 0.05 which led to a threshold of 0.0039. The effect of minor genotypic frequency and non-normality of observed trait distribution was checked (details in Additional file 4).
PA variable distribution and heritability
For the S × G population, all PA variables showed continuous distribution and transgressive segregation and variation extent was equivalent in the S × G and CC populations (Figure 2, Additional file 5). In agreement with previous studies [15, 16, 46], samples taken in 2006, for which both berry skin and seeds were analysed, displayed different PA composition between tissues as illustrated in Figure 2A with the mean values in the S × G population. For both S × G and CC populations, (-)-epicatechin (epiEx) was the predominant extension subunit in all tissues while (-)-epigallocatechin (egcEx) was only detected in skin. (+)-catechin (catT) was the predominant terminal subunit/monomer in both skin and seed while galloylated units (galEx and galT) were more abundant in seed PA. Each subunit exhibited large variation according to genotype. For instance in skin, (-)-epigallocatechin (instead of (-)-epicatechin) could be the predominant subunit in the extension position (67.9%) (Additional file 5).
PA content variables (concP, concB, concK) reached higher values in seeds than in skin regardless of the unit, as illustrated for concP (Figure 2B), which exhibited the largest difference between tissues. Comparison of composite PA ratio variables showed different range of variation between skin and seed. PAs were on average 8-times shorter in seeds than in skin with wide variation in skin (Figure 2C). All three ratio variables assessing the flux between 2,3-trans and 2,3-cis forms (Ftrancis series) pointed to different kinetics for extension and terminal positions: trans subunits were more abundant in skin for terminal units/monomers (Ftranscis_T) while they were much reduced in seeds when considering extension positions alone (Ftranscis_Ex) or extension plus terminal subunits/monomers (Ftranscis_all, Figure 2D-F). Since the major extension blocks were in cis-configuration, i.e. (-)-epi(gallo)catechin, Ftranscis_Ex was always less than 1 both in skin and seeds. The higher Ftranscis_T in skin conformed to the fact that (+)-catechin was the predominant terminal subunits/monomer in skin. (Figure 2B). Means of each PA variable measured in both skin and seeds were systematically different (paired t-test, P < 0.001, data not shown).
For S × G population, average H2 of PA variables was 0.56 (from 0.24 to 0.82) and 0.44 (from 0.26 to 0.54) in skin and seeds, respectively. No significant difference in H2 magnitude was detected between skin and seeds (t-test, P = 0.053). Nevertheless, higher H2 were observed for skin variables, especially catT and mDP (0.76 and 0.82, respectively, Additional file 5). A high H2 value was also found for these two traits in CC (0.86 and 0.72 for catT and mDP, respectively, Additional file 5).
PA variable correlation
We performed PA variable correlation on genotypic values (BLUPs) from S × G population because the two-year data available both in skin and seeds allowed us to work with a much reduced environmental effect. All three total content variables were highly correlated within a given tissue while significant correlations between tissues were only observed for concB and concK (Figure 3). Among subunit percentage variables in skin (Figure 3), the most noticeable features were the significant negative correlation between egcEx (B-ring tri-hydroxylated subunit) and all other units (B-ring di-hydoxylated subunits) and the positive correlation between (-)-epicatechin and (+)-catechin either in extension position (Ex) or in terminal/monomer position (T). In seeds, the most noticeable feature was the significant correlation of epiT with all other variables (negative with extension units and positive with terminal units) while epiEx was negatively correlated with other subunits. For the same subunit in a given tissue, no highly significant correlation was observed between extension and terminal position, except the negative correlation between epiEx and epiT in seeds (P < 0.001). Significant correlation between subunit percentage variable and composite variables inside a tissue reflected the variable construction (Table 1). Between tissues, significant positive correlations were observed for concK, most of terminal subunits/monomers pairs, galEx and also mDP and Ftranscis_all.
Global features of PA QTLs
We performed QTL detection with genotypic BLUPs both on consensus and parental maps. In total, 103 vs 43 QTLs and 24 vs 2 digenic epistatic interactions were identified on the consensus map for seed and skin PA variables, respectively (Figure 4). QTLs detected on parental maps were generally also detected on the consensus map except for skin concP, concK, catEx, galEx, epiT, F3pr35 and seed concP where additional QTLs were identified through parental detection (Additional file 6). More QTLs and digenic pairwise interactions were identified on the consensus map than on parental maps, allowing some QTL models to explain more than 80% of the BLUP variance in consensus mapping (Additional file 6), as illustrated in the case of epiT in seeds (Figure 5D). Some loci were involved in phenotypic variation almost exclusively through digenic epistasis such as locus 10@32 for seed concB or locus firstname.lastname@example.org for seed epiT (Figure 5). Loci were mainly of additive effect while dominance was predominant at some loci for concK, epiEx, mDP and Ftranscis_T in skin and galEx, epiT, Ftranscis_T and Ftranscis_all in seeds (Figure 4). Among all detected QTLs, only 10 main effect loci overlapped for the same variable in both tissues: 1 for concB, 2 for concK, 1 for epiEx, 1 for galEx, 2 for catT, 1 for epiT, 1 for mDP and 1 for Ftranscis_T (see Figure 5 for some examples). Parental alleles contribution to these common loci was not always consistent across tissues (Additional file 6), which could be an indication of tissue-specific genetic mechanisms. Different genetic architectures were observed for the same PA variable between berry skin and seeds as illustrated in Figure 5: few moderate QTLs (< 3) or no QTL in skin vs several (> 5) small to moderate QTLs with possible involvement of epistasis in seeds (for concP, concB, catEx, galEx, Ftranscis_Ex and Ftranscis_T, illustrated by concB in Figure 5A); many QTLs with involvement of epistasis in skin vs a small number (2) of main effect QTLs in seeds (epiEx in Figure 5B); a major QTL (R2 > 50%) and some QTLs of moderate effect in skin v.s. many QTLs of small to moderate effect in seeds (for catT and Ftranscis_all, illustrated by catT in Figure 5C), or few moderate QTLs in skin vs many moderate QTLs in the presence of a QTL of large effect and epistasis in seeds (epiT, Figure 5D). Conversely, similar genetic architecture between skin and seeds was observed for concK (only moderate main effect QTLs, Figure 5E) and mDP (a major QTL and a few QTLs of moderate effect, Figure 5F). Details regarding position, major allelic effect, LOD score, LOD-1 confidence interval and percentage of explained variation (R2 ) for each QTL are given in Additional file 6.
PA total content
In skin, 1, 1 and 3 QTLs were identified on the consensus map for concP, concB and concK, respectively. One additional QTL for concP was identified through parental detection on the Syrah map. Conversely, for concK, all QTLs exhibited a major Grenache allelic effect and one additional QTL was identified on chromosome 9 on the Grenache map.
In seeds, 6, 10 and 5 QTLs were identified for concP, concB, and concK, respectively. The locus positioned at 40-50 cM on chromosome 2 was identified for all three variables. Epistasis was strongly involved in genetic architecture of concB and in total accounted for around 30% of the BLUP variance (Figure 1).
In summary, fewer total content QTLs were detected in skin than in seeds. Common loci between skin and seeds for concB and concK were identified on chromosomes 8, 13, and 17. For each tissue, one locus was identified to be common to the three total content variables: the QTL on chromosome 8 for skin and the QTL on chromosome 2 for seeds.
Simple variables: percentage of constitutive units
In skin, 1, 9, 3 and 5 QTLs respectively were identified on consensus map for catEx, epiEx, galEx, and egcEx, with several overlapping QTLs (Figure 4). For catEx, two additional QTLs on chromosomes 14 and 18, were specifically identified on the Grenache map. Pairwise interactions were identified for epiEx. For terminal subunits/monomers, 2 and 4 consensus QTLs were detected for epiT and catT, respectively. These two traits had co-locating loci on chromosomes 8 and 17 with an especially large R2 for the locus on chromosome 17 (55.8% for catT).
In seeds, 2 QTLs for epiEx were identified on the consensus map while two additional loci on chromosomes 8 and 12 were identified solely on Grenache and Syrah maps, respectively (Figure 4, Additional file 6). For all other simple variables, at least 7 QTLs and 1 pairwise interaction were involved in multiple-QTL models on the consensus map. For galEx, additional loci were identified on chromosomes 3, 5 and 9 through parental detection.
Joint consideration of the results obtained from both tissues showed that different QTLs were identified for subunits of the same nature but with different positions in the polymer (e.g. epiEx and epiT). The locus positioned at approx. 7 cM on chromosome 17 was identified for all PA simple variables in both tissues except for egcEx, galEx and catEx in skin.
Composite ratio variables
In skin, the best QTL model for mDP, Ftranscis_T and Ftranscis_all included only a few main effect QTLs (2 to 4 QTL, Additional file 6) without digenic interaction. The major locus on chromosome 17 was also identified for these 3 variables: in the case of mDP, it explained more than 50% of total BLUP variance (Figure 5F). Six QTLs were detected for F3pr35 on the consensus map while one additional Syrah-specific QTL was identified through parental detections on chromosome 13. For Ftranscis_Ex, QTLs were solely identified through parental detections (1 QTL for Syrah map and 2 QTLs for Grenache map).
In seeds, a three-additive-QTL model was identified for mDP while models with 10 QTLs and one to four digenic interactions were the best ones for Ftranscis_Ex, Ftranscis_T and Ftranscis_all. In addition, digenic interaction accounted for about 30% of BLUP variance for Ftranscis_Ex and Ftranscis_all (Figure 5 and Additional file 6).
In summary, different QTLs were identified for the same variables, depending on berry tissues. Among all composite variables, only the large effect QTL for mDP and Ftranscis_T on chromosome 17 was common to skin and seeds. Comparison of multiple-QTL models between both tissues showed that more digenic interactions were involved in seed variables than in skin variables.
Association analyses on candidate genes
We positioned 21 known grape PA functional candidate genes on the genetic map using their relative position to SSR markers on the grape genome (, http://www.genoscope.cns.fr, Figure 4, see Additional file 7 for the names of flanking markers of candidate genes). Association tests were performed for nine functional candidate genes, eight of them co-locating with QTLs. Among candidates, there were both genes encoding flavonoid pathway enzymes and putative regulators. Genes were partially to totally sequenced (gene coverage from 25 to 100%), mainly in exons (Table 2). Two models were used for association studies since model comparison showed an equivalent fit (Additional file 8): one accounted for fixed ancestry structure effect (GLM in TASSEL), the other for both fixed ancestry structure effect and random genetic background effect (MLM in TASSEL). four out of nine genes showed at least one significant association with PA variables with consistent results between GLM and MLM (Table 3). Seventy-eight percent of significant tests (21 out of 27 tests) were common to GLM and MLM while 6 additional associations were only significant with MLM model. The reason for this discrepancy is probably that the adjusted P-value in GLM was estimated by taking into account dependence between tests due to linkage disequilibrium  while in MLM, each SNP is tested under an hypothesis of independence. Association results were consistent with QTL analyses for following gene-phenotype pairs: VvLAR1-skin catT and VvLAR1-skin mDP (Table 3). In particular, several SNPs in linkage disequilibrium for VvLAR1 (data not shown) were significantly associated to catT and mDP in skin while the confidence interval of the QTLs for these two variables overlapped. Conversely, we observed some SNP-phenotype associations only in the diversity panel: VvLAR1-skin Ftranscis_all, VvMYBPA2-skin concP, VvMYBPA2-skin concK, VvMYBPA2-skin mDP, VvMYBPA2-seed galT, VvCHI1-skin concP, VvMYBPA1-skin epiT and VvMYBPA1-seed Ftranscis_T.
PA variation extent compared to previous studies
A first characterisation of PA composition in a grapevine pseudo-F1 population was provided by Hernandez-Jimenez and co-workers . Their population was composed of 42 offsprings, derived from a cross between Syrah and Monastrell. In all tissues, the subunit percentage in extension position and Ftranscis-series variables of the Syrah × Monastrell population were of a magnitude and extent equivalent to those of the present study. More divergent results were observed for 1) epiT, which is more abundant in the Syrah × Monastrell population; 2) mDP, which is higher in our study and 3) total content variables, for which the population mean and variation extent was three-to two-fold larger in the present study than in . Syrah, the common parent, behaved similarly in both studies although we observed 10-fold and two-fold higher total PA contents in our study for skin and seeds, respectively. The difference observed in offsprings may result from the fact that the two populations differed by one parent but environmental differences as well as PA extraction and quantification methods might also have affected PA variables, as suggested by the different PA composition of the same parental cultivar. Indeed, mixed model fit suggested that year had a major effect on PA-related variables for both tissues, which was consistent with a previous study where PA content and composition were measured in two cultivars for two consecutive years . The large quantitative variation in PA variables in S × G, of equivalent extent in the CC diversity panel, underlines the interest to implement a quantitative genetic approach on a F1 population for grape PA studies.
Two studies only have characterised PA composition in different grape cultivars [54, 55] with at most a 37-cultivar sample . Different biochemical analyses did not allow for result comparison between this latter study and the present work. Nevertheless, CC in our work was composed of 141 grapevine cultivars of broad geographical origin (from East to West Europe) and was initially defined to maximise the diversity of 50 agro-morphological traits . The PA composition variation in the diversity panel provides thus the potential to refine PA QTLs in a population of larger genetic background.
Multiple QTL mapping in a pseudo-F1 population for grape PA composition
To our knowledge, this study presents the first QTL analysis on grape PA composition with comparisons between skin and seeds of grape berry. This is also the first work on grape using multiple QTL models taking into account both main effects and digenic epistasis during the mapping procedure. QTL mapping in animals has shown that epistasis effects are often large enough to be detected and thus merit a systematic scan regardless of population size, although larger populations (> 500 individuals) allow a more powerful epistasis detection . By employing the multiple QTL mapping approach, we actually showed the important involvement of epistatic interaction in shaping PA composition variation; indeed, some loci were involved in phenotypic variation almost exclusively through pairwise interaction. Our mapping population is of sufficient size (191 individuals) to allow identification of small effect QTLs. However, one should keep in mind that the R2 estimate of individual QTLs is usually overestimated  and may have a wide confidence interval . Some of the identified QTLs may therefore be of smaller effect in reality. One should thus be cautious in result interpretation and further identification of causal polymorphism although we did check initially the genome-wide first type error rate.
Allele contribution to individual QTL was mainly due to additive effects between Syrah and/or Grenache alleles (Additional file 6). Parental detection allowed the identification of smaller additive QTLs because of a greater power due to more individuals in each genotypic class compared to consensus detection (i.e., 2 and 4 genotypic classes for parental and consensus detection, respectively). On the other hand, QTL detection on the consensus map allowed us to estimate QTL dominance effect s, i.e. the interaction between allelic classes, but not necessarily with the assumption of a dominant-recessive relationship . In the present work, 9.8% and 30% of QTLs detected in skin and seeds, respectively, had dominance as the major allelic effect (D in Figure 4 and Additional file 6). For example, the locus 8@69 of concB in seed was involved in phenotypic variation almost exclusively through dominance (Additional file 6) and this information would have been overlooked if parental mapping only had been performed.
Genetic architecture of grape PA composition
PA total content
QTL results were consistent with the results of PA variable correlation: no significant correlation was observed for concP between tissues and no co-locating QTL was identified for this variable between skin and seeds, while common QTLs were identified for both concB and concK across tissues. Since concB and concK take into account berry size- and yield-related traits, these co-located QTLs for concB and concK may be involved indirectly in PA total content through alteration of berry development or yield-related traits. Indeed, several QTLs for concK co-located with QTLs for yield-related traits, especially loci on chromosomes 8, 13, 17 and 18 where QTLs for yield related and berry size related-trais were also identified in the same S × G population (Doligez et al., unpublished data). Unlike these yield-related loci, the loci identified for concP, which were also identified for concB and concK could be specific targets for a better understanding of the contrasting PA content in berry compartments.
Association tests were in accordance with the involvement of VvMYBPA2 in PA content variation, as suggested by a previous study . Two VvMYBPA2 SNPs were significantly associated with skin PA content variables. (Table 3). VvMYBPA2 is mainly expressed in berry skin at green stage and its overexpression in grape hairy root significantly increased PA production . The significant associations of VvMYBPA2 were positioned in promoter and intron and might be involved either in transcription level alteration or through linkage disequilibrium with other causal mutations. A minor association was identified between a non-synonymous polymorphism of VvCHI1 and skin concP (Table 3). VvCHI encodes an upstream enzyme in the grape flavonoid pathway. This gene may be involved in PA content variation through the control of the flux of intermediate substrate. However, precise involvement of VvCHI1 in PA content variation needs further genetic and functional confirmation.
Among tested candidate genes, VvMYB5a, located on chromosome 8, is under several QTLs, especially total content-related QTLs. A previous physiological study showed that ectopic expression of VvMYB5a in tobacco induced expression of flavonoid genes and significantly increased both PA and anthocyanin production . The authors therefore proposed VvMYB5a as an upstream regulator of flavonoid pathway. In our work, no significant association was found for VvMYB5a while the whole gene was sequenced (Table 2). Further investigations would be necessary to figure out if this gene is involved in grape PA content variation.
PA subunit synthesis: the hydroxylation patterns of flavan-3-ols
All flavonoids carry a hydroxyl group at the 4' position of B-ring (Figure 1). The flavonoid hydroxylation patterns of B-ring were first studied in ornamental plants for colour engineering because it is a major colour determinant for anthocyanins, another class of flavonoids sharing a similar C6-C3-C6 skeleton with PA monomeric subunits [60, 61]. Links between F3'H and F3'5'H gene activities and their relative flavan-3-ols are less obvious due to the lack of easily assessed reporters. Our results for grape skin variables showed that five genomic regions (on chromosomes 3, 6, 8, 10 and 18) harboured co-located QTLs for epiEx, egcEx and F3pr35. This co-localisation is not surprising since epiEx and egcEx were the major components for F3pr35 variable construction and were therefore highly correlated (Figure 2). These loci are probably involved in the flux between di-hydroxylated and tri-hydroxylated PA building blocks. An interesting point is that the QTL on chromosome 6 for both egcEx and F3pr35 co-located with a genomic region corresponding to the F3'5'H gene family.
However, no significant association was detected between the two tested VvF3'5'H isogenes and hydroxylation pattern variables in this work. F3'5'H is present as a multigenic family in the grapevine genome in which at least 15 isogenes have been identified . For duplicated genes, neofunctionalisation and/or subfunctionalisation could conduct to specialisation of each isoform in a spatio-temporal manner [63–65]. Actually, the isogene VvF3'5'H 1.1 (or VvF3'5'H n in ) was shown to be expressed only in vegetative organs, while VvF3'5'H 2.1 (or VvF3'5'H f in ) is expressed in berry skin. The assessment of the polymorphisms of all isogenes may give more insights for links between F3'5'H and hydroxylation variation.
PA subunit synthesis: the galloylated flavan-3-ols
To date, the underlying genetic determinism for the production of PA galloylated building blocks is still unclear. Our results in both grape berry skin and seeds showed that the quantitative variation of (-)-epicatechin-3-O-gallate was probably under the control of many genomic regions and digenic epistatic interactions (Figures 4 and Additional file 6). Additional information was provided by association tests which revealed 4 weak but significant associations between galT in seeds and SNPs of VvMYBPA2. These associated SNPs are located in introns or in C-terminal of the proteins which could contain protein-protein interaction domains  (Table 2). This result suggests that the associated SNPs might lead to alteration of transcriptional complex recruitment or interaction with other proteins. . Glucosyltransferases were recently identified as putative candidates involved in the first enzymatic step of PA galloylation . Since they are located on chromosome 3 where QTLs for galEx in skin and seed are positioned (Figure 4 and Additional file 7), they may be good candidates to be tested by association genetics in the next future.
PA subunit synthesis: the trans- and cis- subunits
Synthesis of PA trans-and cis-subunits is tightly related to PA polymerisation since intermediate substrates in the flavonoid pathway are assumed to take up a trans-configuration while major extension subunits assume a cis-configuration (e.g. (-)-epicatechin) . Major advances in understanding PA subunits biosynthesis were made through the isolation of two genes coding for specific enzyme activities for the formation of terminal/monomers: 2,3-trans-(gallo)catechin and 2,3-cis-epi(gallo)catechin [3–5]. Recently, another dynamic view of the flux between trans-and cis-terminal units/monomers was provided by Gargouri and co-workers who demonstrated the ability of grape ANR to epimerise (+)-catechin to (-)-epicatechin . Our results seem to be in accordance with this work since a major locus on chromosome 17 was identified for catT and epiT, the two chiral flavan-3-ols, which was also the major locus for Ftranscis_T and Ftranscis_all in skin. This locus furthermore co-located with VvLAR2, an isogene of LAR, which belongs to the Reductase-Epimerase-Dehydrogenase (RED) family, as ANR, and thus might display both epimerase and reductase activity. Similarly, three VvLAR1 SNPs were significantly associated to Ftranscis_all in skin and therefore merit further functional investigation to understand its involvement in in vivo PA subunit synthesis.
On the other hand, the origin of extension subunits is still uncertain: are extension subunits derived from intermediate substrates in the pathway or from end products such as (-)-epicatechin and (+)-catechin? [1, 68, 70]. In our work, significant correlation was not systematically observed for subunits of the same nature but differing in position in the polymer, and few QTLs co-located. In addition, the QTLs for flux between trans-and cis-subunits were most often different between extension position (Ftranscis_Ex) and terminal subunits/monomes (Ftranscis_T). All these results argue in favour of the involvement of different loci in PA building blocks synthesis and in the control of flux between trans-and cis-subunits according to their position in the polymer. Stafford et al. already suggested from radioactive labelling experiments that upper and lower units arise from different steps of the pathway rather than from the condensation of similar units .
An aspect of the debate about PA polymerisation concerns the enzymatic or nonenzymatic polymerisation (reviewed by [1, 70]). The existence of a polymerase is supported by the barley PA mutant ant26, containing amounts of (+)-catechin equivalent to wild-type content but only trace amounts of PAs . On the other hand, in vitro chemical synthesis of PAs has also been reported  and these authors observed a modulation of polymer size through the modulation of the relative amounts of extension unit intermediates and monomers. One can thus hypothesise that instead of a polymerase, the ant26 mutation could directly affect the suitable conditions for spontaneous PA polymerisation, such as appropriate pH . Further investigation of the QTLs identified in this study would bring more insights into this polymerisation issue. Indeed, in the case of skin mDP, H2 was high (0.82) and the multiple QTL model accounted for 70% of the genotypic variance, corresponding to 57% of the total phenotypic variance. The largest QTL on chromosome 17 explained alone 55% of genotypic variance and was also the major locus for seed mDP (Figure 5F). This QTL is therefore an interesting target for mDP genetic mechanism investigation. Another mDP QTL located on chromosome 1 might also be an interesting target for understanding PA polymerisation. This QTL co-located with a gene encoding a PA-specific synthetic enzyme, VvLAR1, for which several SNPs in linkage disequilibrium (data not presented) were significantly associated to mDP and catT in skin, consistent with the corresponding QTL. Interestingly, VvLAR1 is highly polymorphic in Grenache while almost homozygous in Syrah (1 SNP in the coding region, data not presented), in accordance with the fact that the QTL for mDP in this region is mainly due to a Grenache allelic effect.
Tissue-specific genetic architecture for PA composition
In accordance with a previous study which demonstrated the tissue specificity of transcriptional profiles in grape berry , the present work illustrates different genetic mechanisms for grape PA composition between skin and seeds: QTLs differed in terms of number, position, R2 and allelic effects. For total content variables, the major QTLs differed in skin and seeds. For subunit percentage and composite variables, important differences between tissues were also observed. Globally, for the same PA subunit percentage variables in both tissues, only 5 QTLs among 74 had overlapping intervals between skin and seeds. Another contrasting feature was observed: in skin, 54% of all QTLs accounted for Syrah additive effect and 78% for Grenache effect whereas in seeds, 74% of all QTLs accounted for Syrah additive effect and 53% for Grenache effect (Additional file 6 and Figure 4). Even for loci identified in both tissues for a given variable, major allelic effect and R2 differed (e.g. the QTL in chromosome 17 for catT, Additional file 6). Our results suggest that seed PA variation is controlled by QTLs of moderate and equivalent magnitude with involvement of epistasis. On the other hand, skin variables are mainly under the genetic control of a few large effect loci with a fluctuating variance unexplained by QTLs.
The different genetic architectures between tissues could result from divergent functional evolution of PAs in these two berry compartments. For fruits in general, ensuring protection of the embryos is essential. Because of their abundance and their ability to protect plants against biotic stresses, PAs and flavan-3-ols might be the major molecules involved in grape embryo protection. In fact, their influence in maintaining seed dormancy has been demonstrated in Arabidopsis  and their interaction with phytohormones has also been reported [76, 77]. Therefore, to prevent biological fluctuation due to a single polymorphism mutation, a network with multiple cross-talking actors as a product of evolution without human selection could be postulated in the case of seed PAs, as suggested by the identification of numerous small effect QTLs and the involvement of epistasis. Conversely, skin is the first protective barrier of the grape berry against its environment. In plants, PAs are thought to be involved in self-defence mechanisms . For berry consumers, skin PAs confer flavour to berries and are also responsible for major organoleptic qualities of wine. Consumers in turn could help the plant in seed dispersion or vegetative propagation. However, high quantities of PAs would confer to berries too much astringency and bitterness, which would lead consumers (specially humans) to reject grapevines producing such berries for direct consumption (or wine-making). Human selection in particular could therefore have narrowed down the genetic basis of skin PAs over time and consequently led to a specific genetic architecture with a few large effect QTLs for skin PA variables.
QTL mapping and association analysis as complementary approaches for candidate locus identification
In this work, we used both QTL mapping and association analyses to identify phenotype-marker associations. For identification of phenotype-associated markers, grapevine segregating populations have a greater diversity than populations derived from inbred lines due to heterozygous parental cultivars. However, their genetic background remains relatively narrow compared to diversity panels. On the other hand, QTL mapping may reveal associations undetected in diversity panels due to low allelic frequency. The inconsistency between both approaches sometimes encountered in this work could therefore result from the fact that the available genetic polymorphisms were different between the two populations: causal polymorphisms in one population might be monomorphic in the other. In addition, our analysis focused on genes of known function co-locating with QTL while other candidates could underlie QTL intervals. Besides time-consuming fine mapping, candidate genes can also be selected by combining QTL results with other data such as transcriptomics. Nevertheless, LAR1 gene evoked a particular interest through association test. Complementing functional studies performed on a single cultivar , we provide here additional confirmation of LAR1 gene involvement in grape PA composition through a diversity panel study.
The present work confirmed presumptions about the complex genetic architecture of PA composition in grape berries. QTLs for PA total content, PA building blocks, degree of polymerisation and ratio between building blocks were identified. Berry PA composition offers a case study for tissue-specific genetic architecture: in skin, the same major loci were involved in several PA variables while multiple and moderate QTLs with strong epistasis were the principal genetic factors for seed PA composition. These differences might be due to human selection on skin PA leading to a reduced diversity of related genes while a multi-factor network controlling seed PA synthesis would be necessary to protect grapevine embryos. Association tests confirmed the interest of VvLAR1 as a candidate gene in modulating catT and mDP in berry skin, as well as VvMYBPA2 for total content and probably for subunit galloylation. This study provides the first assessment of genetic mechanism underlying grape PA composition which opens doors for further PA genetic studies.
Best linear unbiased predictor
4-coumaroyl CoA ligase
(+)-catechin extension subunits
(+)-catechin terminal subunits/monomers
Total content in mg/g fresh weight
Total content in mg/berry
Total content in mg/kg berries
(-)-epigallocatechin extension units
(-)-epicatechin extension subunits
(-)-epicatechin terminal subunits/monomers
Flavonoid 3' hydroxylase
Flavonoid 3'-5' hydroxylase
(-)-epicatechin-3-O-gallate extension subunits
(-)-epicatechin-3-O-gallate terminal subunits/monomers
Logarithm of odds
Quantitative trait locus
Single nucleotide polymorphism
Simple sequence repeat.
Dixon RA, Xie DY, Sharma SB: Proanthocyanidins--a final frontier in flavonoid research?. New Phytol. 2005, 165 (1): 9-28.
Aron PM, Kennedy JA: Flavan-3-ols: Nature, occurrence and biological activity. Mol Nutr Food Res. 2008, 52 (1): 79-104. 10.1002/mnfr.200700137.
Tanner GJ, Francki KT, Abrahams S, Watson JM, Larkin PJ, Ashton AR: Proanthocyanidin biosynthesis in plants--Purification of legume leucoanthocyanidin reductase and molecular cloning of its cDNA. J Biol Chem. 2003, 278 (34): 31647-31656. 10.1074/jbc.M302783200.
Devic M, Guilleminot J, Debeaujon I, Bechtold N, Bensaude E, Koornneef M, Pelletier G, Delseny M: The BANYULS gene encodes a DFR-like protein and is a marker of early seed coat development. Plant J. 1999, 19 (4): 387-398. 10.1046/j.1365-313X.1999.00529.x.
Xie DY, Sharma SB, Paiva NL, Ferreira D, Dixon RA: Role of anthocyanidin reductase, encoded by BANYULS in plant flavonoid biosynthesis. Science. 2003, 299 (5605): 396-399. 10.1126/science.1078540.
Guzman-Maldonado SH, Martinez O, Acosta-Gallegos JA, Guevara-Lara F, Paredes-Lopez O: Putative quantitative trait Loci for physical and chemical components of common bean. Crop Sci. 2003, 43 (3): 1029-1035. 10.2135/cropsci2003.1029.
Caldas G, Blair M: Inheritance of seed condensed tannins and their relationship with seed-coat color and pattern genes in common bean (Phaseolus vulgaris L.). TAG Theor Appl Genet. 2009, 119 (1): 131-142. 10.1007/s00122-009-1023-4.
Dai GH, Andary C, Mondolot-Cosson L, Boubals D: Involvement of phenolic compounds in the resistance of grapevine callus to downy mildew (Plasmopara viticol). Eur J Plant Pathol. 1995, 101 (5): 541-547. 10.1007/BF01874479.
del Río JA, Gómez P, Báidez A, Fuster MD, Ortuño A, Frías V: Phenolic compounds have a role in the defence mechanism protecting grapevine against the fungi involved in Petri disease. Phytopathologia Mediterranea. 2004, 43 (1): 87-94.
Arnold RA, Noble AC, Singleton VL: Bitterness and astringency of phenolic fractions in wine. J Agric Food Chem. 1980, 28 (3): 675-678. 10.1021/jf60229a026.
Jaillon O, Aury JM, Noel B, Policriti A, Clepet C, Casagrande A, Choisne N, Aubourg S, Vitulo N, Jubin C, et al: The grapevine genome sequence suggests ancestral hexaploidization in major angiosperm phyla. Nature. 2007, 449 (7161): 463-U465. 10.1038/nature06148.
Velasco R, Zharkikh A, Troggio M, Cartwright DA, Cestaro A, Pruss D, Pindo M, FitzGerald LM, Vezzulli S, Reid J, et al: A high quality draft consensus sequence of the genome of a heterozygous grapevine variety. PLoS One. 2007, 2 (12): e1326-10.1371/journal.pone.0001326.
Souquet J-M, Labarbe B, Le Guernevé C, Cheynier V, Moutounet M: Phenolic composition of grape stems. J Agric Food Chem. 2000, 48 (4): 1076-1080. 10.1021/jf991171u.
Bogs J, Downey MO, Harvey JS, Ashton AR, Tanner GJ, Robinson SP: Proanthocyanidin synthesis and expression of genes encoding leucoanthocyanidin reductase and anthocyanidin reductase in developing grape berries and grapevine leaves. Plant Physiol. 2005, 139 (2): 652-663. 10.1104/pp.105.064238.
Souquet J-M, Cheynier V, Brossaud F, Moutounet M: Polymeric proanthocyanidins from grape skins. Phytochemistry. 1996, 43 (2): 509-512. 10.1016/0031-9422(96)00301-9.
Prieur C, Rigaud J, Cheynier V, Moutounet M: Oligomeric and polymeric procyanidins from grape seeds. Phytochemistry. 1994, 36: 781-784. 10.1016/S0031-9422(00)89817-9.
Sparvoli F, Martin C, Scienza A, Gavazzi G, Tonelli C: Cloning and molecular analysis of structural genes involved in flavonoid and stilbene biosynthesis in grape (Vitis-Vinifera L). Plant Mol Biol. 1994, 24 (5): 743-755. 10.1007/BF00029856.
Bogs J, Ebadi A, McDavid D, Robinson SP: Identification of the flavonoid hydroxylases from grapevine and their regulation during fruit development. Plant Physiol. 2006, 140 (1): 279-291.
Castellarin SD, Di Gaspero G, Marconi R, Nonis A, Peterlunger E, Paillard S, Adam-Blondon AF, Testolin R: Colour variation in red grapevines (Vitis vinifera L.): genomic organisation, expression of flavonoid 3'-hydroxylase, flavonoid 3',5'-hydroxylase genes and related metabolite profiling of red cyanidin-/blue delphinidin-based anthocyanins in berry skin. Bmc Genomics. 2006, 7: 12-10.1186/1471-2164-7-12.
Jeong ST, Goto-Yamamoto N, Hashizume K, Esaka M: Expression of the flavonoid 3'-hydroxylase and flavonoid 3',5'-hydroxylase genes and flavonoid composition in grape (Vitis vinifera). Plant Sci. 2006, 170 (1): 61-69. 10.1016/j.plantsci.2005.07.025.
Jeong ST, Goto-Yamamoto N, Kobayashi S, Esaka A: Effects of plant hormones and shading on the accumulation of anthocyanins and the expression of anthocyanin biosynthetic genes in grape berry skins. Plant Sci. 2004, 167 (2): 247-252. 10.1016/j.plantsci.2004.03.021.
Lynch M, Walsh B: Genetics and analysis of quantitative traits. Sunderland: Sinauer Associates; 1998.
Kao C, Zeng Z, Teasdale R: Multiple interval mapping for quantitative trait Loci. Genetics. 1999, 152: 1203-1216.
Buckler ES, Thornsberry JM: Plant molecular diversity and applications to genomics. Curr Opin Plant Biol. 2002, 5 (2): 107-111. 10.1016/S1369-5266(02)00238-8.
Fournier-Level A, Le Cunff L, Gomez C, Doligez A, Ageorges A, Roux C, Bertrand Y, Souquet J-M, Cheynier V, This P: Quantitative genetic bases of anthocyanin variation in grape (Vitis vinifera L. ssp. sativa) berry: a quantitative trait locus to quantitative trait nucleotide integrated study. Genetics. 2009, 183 (3): 1127-1139. 10.1534/genetics.109.103929.
Barnaud A, Lacombe T, Doligez A: Linkage disequilibrium in cultivated grapevine, Vitis vinifera L. Theor Appl Genet. 2006, 112 (4): 708-716. 10.1007/s00122-005-0174-1.
Doligez A, Adam-Blondon A, Cipriani G, Di Gaspero G, Laucou V, Merdinoglu D, Meredith C, Riaz S, Roux C, This P: An integrated SSR map of grapevine based on five mapping populations. TAG Theor Appl Genet. 2006, 113 (3): 369-382. 10.1007/s00122-006-0295-1.
Adam-Blondon AF, Roux C, Claux D, Butterlin G, Merdinoglu D, This P: Mapping 245 SSR markers on the Vitis vinifera genome: a tool for grape genetics. Theor Appl Genet. 2004, 109 (5): 1017-1027. 10.1007/s00122-004-1704-y.
Cipriani G, Di Gaspero G, Canaguier A, Jusseaume J, Tassin J, Lemainque A, Thareau V, Adam-Blondon A, Testolin R: Molecular linkage maps: strategies, resources and achievements. Enfield, USA: Science Publishers; 2011.
Le Cunff L, Fournier-Level A, Laucou V, Vezzulli S, Lacombe T, Adam-Blondon A-F, Boursiquot J-M, This P: Construction of nested genetic core collections to optimize the exploitation of natural diversity in Vitis vinifera L. subsp. sativa. BMC Plant Biol. 2008, 8 (1): 31-10.1186/1471-2229-8-31.
Cartwright DA, Troggio M, Velasco R, Gutin A: Genetic mapping in the presence of genotyping errors. Genetics. 2007, 176 (4): 2521-2527. 10.1534/genetics.106.063982.
de Givry S, Bouchez M, Chabrier P, Milan D, Schiex T: CAR(H)(T)AGene: multipopulation integrated genetic and radiation hybrid mapping. Bioinformatics. 2005, 21 (8): 1703-1704. 10.1093/bioinformatics/bti222.
Doligez A, Audiot E, Baumes R, This P: QTLs for muscat flavor and monoterpenic odorant content in grapevine (Vitis vinifer L.). Mol Breed. 2006, 18: 109-125. 10.1007/s11032-006-9016-3.
Singleton VL, Ough CS, Nelson KE: Density separations of wine grape berries and ripeness distribution. Am J Enol Vitic. 1966, 17 (2): 95-105.
Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y: RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008, 18: 1509-1517. 10.1101/gr.079558.108.
Fournand D, Vicens A, Sidhoum L, Souquet JM, Moutounet M, Cheynier V: Accumulation and extractability of grape skin tannins and anthocyanins at different advanced physiological stages. J Agric Food Chem. 2006, 54 (19): 7331-7338. 10.1021/jf061467h.
R-Development-Core-Team: R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2008,[http://www.R-project.org/]. ISBN 3-900051-07-0.
Bates D, Maechler M, Dai B: lme4: Linear mixed-effects models using S4 classes. 2008, [http://lme4.r-forge.r-project.org/].
Broman KW, Wu H, Sen Å, Churchill GA: R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003, 19 (7): 889-890. 10.1093/bioinformatics/btg112.
Manichaikul A, Moon JY, Sen S, Yandell BS, Broman KW: A model selection approach for the identification of quantitative trait Loci in experimental crosses, allowing epistasis. Genetics. 2009, 181 (3): 1077-1086. 10.1534/genetics.108.094565.
Segura V, Denance C, Durel CE, Costes E: Wide range QTL analysis for complex architectural traits in a 1-year-old apple progeny. Genome. 2007, 50 (2): 159-171. 10.1139/G07-002.
Atkinson B, Therneau T: Kinship: mixed kinship: mixed-effects Cox models, sparse matrices, and modeling data from large pedigrees. R package, Version 1.1.0-23. 2009, [http://cran.r-project.org]1.1.0-23.
Saïdou A-A, Mariac C, Luong V, Pham J-L, Bezançon G, Vigouroux Y: Association studies identify natural variation at PHYC linked to flowering time and morphological variation in pearl millet. Genetics. 2009, 182 (3): 899-910. 10.1534/genetics.109.102756.
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES: TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics. 2007, 23 (19): 2633-2635. 10.1093/bioinformatics/btm308.
Benjamini Y, Hochberg Y: Controlling the false discovery rate: a pratical and powerful approach to multiple testing. Journal of the Royal Statistical Society Series B (Statistical Methodology). 1995, 57 (1): 289-300.
Hernandez-Jimenez A, Gomez-Plaza E, Martinez-Cutillas A, Kennedy JA: Grape skin and seed proanthocyanidins from Monastrell × Syrah Grapes. J Agric Food Chem. 2009, 57 (22): 10798-10803. 10.1021/jf903465p.
Deluc L, Bogs J, Walker AR, Ferrier T, Decendit A, Merillon JM, Robinson SP, Barrieu F: The transcription factor VvMYB5b contributes to the regulation of anthocyanin and proanthocyanidin biosynthesis in developing grape berries. Plant Physiol. 2008, 147 (4): 2041-2053. 10.1104/pp.108.118919.
Hichri I, Heppel SC, Pillet J, Leon C, Czemmel S, Delrot S, Lauvergeat V, Bogs J: The basic Helix-Loop-Helix transcription factor MYC1 is involved in the regulation of the flavonoid biosynthesis pathway in grapevine. Mol Plant. 2010, 3 (3): 509-523. 10.1093/mp/ssp118.
Deluc L, Barrieu F, Marchive C, Lauvergeat V, Decendit A, Richard T, Carde J-P, Merillon J-M, Hamdi S: Characterization of a grapevine R2R3-MYB transcription factor that regulates the Phenylpropanoid pathway. Plant Physiol. 2006, 140 (2): 499-511. 10.1104/pp.105.067231.
Terrier N, Torregrosa L, Ageorges A, Vialet S, Verries C, Cheynier V, Romieu C: Ectopic expression of VvMybPA2 promotes proanthocyanidin biosynthesis in grapevine and suggests additional targets in the pathway. Plant Physiol. 2009, 149 (2): 1028-1041.
Matus J, Poupin M, Cañón P, Bordeu E, Alcalde J, Arce-Johnson P: Isolation of WDR and bHLH genes related to flavonoid synthesis in grapevine (Vitis vinifera L.). Plant Mol Biol. 2010, 72 (6): 607-620. 10.1007/s11103-010-9597-4.
Bogs J, Jaffe FW, Takos AM, Walker AR, Robinson SP: The grapevine transcription factor VvMYBPA1 regulates proanthocyanidin synthesis during fruit development. Plant Physiol. 2007, 143 (3): 1347-1361. 10.1104/pp.106.093203.
Chira K, Schmauch G, Saucier Cd, Fabre S, Teissedre P-L: Grape variety effect on proanthocyanidin composition and sensory perception of skin and seed tannin extracts from Bordeaux wine grapes (Cabernet Sauvignon and Merlot) for two consecutive vintages (2006 and 2007). J Agric Food Chem. 2008, 57 (2): 545-553.
Travaglia F, Bordiga M, Locatelli M, Coïsson JD, Arlorio M: Polymeric proanthocyanidins in skins and seeds of 37 Vitis vinifera L. Cultivars: a methodological comparative study. J Food Sci. 2011, 76 (5): C742-C749. 10.1111/j.1750-3841.2011.02194.x.
Bordiga M, Travaglia F, Locatelli M, Coisson JD, Arlorio M: Characterisation of polymeric skin and seed proanthocyanidins during ripening in six Vitis vinifera L. cv. Food Chem. 2011, 127 (1): 180-187. 10.1016/j.foodchem.2010.12.141.
Carlborg O, Haley CS: Epistasis: too often neglected in complex trait studies?. Nat Rev Genet. 2004, 5 (8): 618-625. 10.1038/nrg1407.
Beavis W: The power and deceit of QTL experiments: lessons from comparative QTL studies. Proceedings of the Forty-Ninth Annual Corn and Sorghum Industry Research Conference. 1994, 250-266.
Charcosset A, Gallais A: Estimation of the contribution of quantitative trait loci (QTL) to the variance of a quantitative trait by means of genetic markers. TAG Theor Appl Genet. 1996, 93 (8): 1193-1201. 10.1007/BF00223450.
Lippman ZB, Zamir D: Heterosis: revisiting the magic. Trends Genet. 2007, 23 (2): 60-66. 10.1016/j.tig.2006.12.006.
Mol J, Grotewold E, Koes R: How genes paint flowers and seeds. Trends Plant Sci. 1998, 3 (6): 212-217. 10.1016/S1360-1385(98)01242-4.
Holton TA, Cornish EC: Genetics and biochemistry of anthocyanin biosynthesis. The Plant Cell Online. 1995, 7 (7): 1071-1083. 10.1105/tpc.7.7.1071.
Falginella L, Castellarin S, Testolin R, Gambetta G, Morgante M, Di Gaspero G: Expansion and subfunctionalisation of flavonoid 3',5'-hydroxylases in the grapevine lineage. BMC Genomics. 2010, 11 (1): 562-10.1186/1471-2164-11-562.
Ruzicka DR, Kandasamy MK, McKinney EC, Burgos-Rivera B, Meagher RB: The ancient subclasses of Arabidopsis actin depolymerizing factor genes exhibit novel and differential expression. Plant J. 2007, 52 (3): 460-472. 10.1111/j.1365-313X.2007.03257.x.
Taylor JS, Raes J: Duplication and divergence: the evolution of new genes and old ideas. Annu Rev Genet. 2004, 38 (1): 615-643. 10.1146/annurev.genet.38.072902.092831.
Rodin S, Riggs A: Epigenetic Silencing May Aid Evolution by Gene Duplication. J Mol Evol. 2003, 56 (6): 718-729. 10.1007/s00239-002-2446-6.
Dubos C, Stracke R, Grotewold E, Weisshaar B, Martin C, Lepiniec L: MYB transcription factors in Arabidopsis. Trends Plant Sci. 2010, 15 (10): 573-581. 10.1016/j.tplants.2010.06.005.
Khater F, Fournand D, Vialet S, Meudec E, Cheynier V, Terrier N: Identification and functional characterization of cDNAs coding for hydroxybenzoate/hydroxycinnamate glucosyltransferases co-expressed with genes related to proanthocyanidin biosynthesis. J Exp Bot. 2012, 63: 1201-1214. 10.1093/jxb/err340.
Xie DY, Dixon RA: Proanthocyanidin biosynthesis--still more questions than answers?. Phytochemistry. 2005, 66 (18): 2127-2144. 10.1016/j.phytochem.2005.01.008.
Gargouri M, Manigand C, Mauge C, Granier T, Langlois d'Estaintot B, Cala O, Pianet I, Bathany K, Chaudiere J, Gallois B: Structure and epimerase activity of anthocyanidin reductase from Vitis vinifera. Acta Crystallographica Section D. 2009, 65 (9): 989-1000.
Terrier N, Ollé D, Verriès C, Cheynier V: Biochemical and molecular aspects of flavan-3-OL synthesis during berry development. Grapevine Molecular Physiology and Biotechnology. Edited by: Roubelakis-Angelakis KA, Roubelakis-Angelakis KA. Netherlands: Springer; 2009:365-388.
Stafford HA, Shimamoto M, Lester HH: Incorporation of [14 C]Phenylalanine into Flavan-3-ols and Procyanidins in Cell Suspension Cultures of Douglas Fir. Plant Physiol. 1982, 69: 1055-1059. 10.1104/pp.69.5.1055.
Jende-Strid B: Genetic Control of Flavonoid Biosynthesis in Barley. Hereditas. 1993, 119 (2): 187-204.
Delcour JA, Ferreira D, Roux DG: Synthesis of condensed tannins part 9. The condensation sequence of leucocyanidin with (+)-catechin and with the resultant procyanidins. Journal of the Chemical Society, Perkin Transactions. 1983, 1: 1711-1717.
Grimplet J, Deluc L, Tillett R, Wheatley M, Schlauch K, Cramer G, Cushman J: Tissue-specific mRNA expression profiling in grape berry tissues. BMC Genomics. 2007, 8 (1): 187-10.1186/1471-2164-8-187.
Debeaujon I, Peeters AJM, Léon-Kloosterziel KM, Koornneef M: The TRANSPARENT TESTA12 Gene of Arabidopsis Encodes a Multidrug Secondary Transporter-like Protein Required for Flavonoid Sequestration in Vacuoles of the Seed Coat Endothelium. The Plant Cell Online. 2001, 13 (4): 853-872. 10.1105/tpc.13.4.853.
Corcoran MR, Geissman TA, Phinney BO: Tannins as Gibberellin Antagonists. Plant Physiol. 1972, 49 (3): 323-330. 10.1104/pp.49.3.323.
Green FB, Corcoran MR: Inhibitory action of five tannins on growth induced by several Gibberellins. Plant Physiol. 1975, 56 (6): 801-806. 10.1104/pp.56.6.801.
We thank A.-A. Saïdou for the helpful discussions and kind assistance in association model selection. We thank all the research team members for their participation in sample preparation. We thank M. Farnos and P. Ortigosa for collecting diversity panel samples and the staff of Domaine du Chapitre and Domaine de Vassal for grapevine cultivation. We thank three anonymous reviewers for helpful suggestions and P. Chatelet for helpful English revision. This work was funded in part by the European project, FLAVO (no. 513960) and a Ph.D. grant for YFH from INRA and Languedoc-Roussillon Region.
YFH carried out the gene sequencing and alignment, performed data analyses, prepared tables and figures and drafted the manuscript. AD checked phenotyping and genotyping data, performed linkage map construction and first statistical analysis and participated in manuscript preparation. AFL participated in gene sequencing, sequence alignment, association test analysis and in manuscript preparation. LLC participated in upstream data analyses, data interpretation and in manuscript preparation. YB conducted field experimentation and sample collection. AC performed genotyping for linkage map construction. FV, VM, CM, and JMS carried out biochemical analysis. NT directed PA variable conception, interpreted the result and participated in manuscript preparation. VC and PT conceived the study, participated in its design, coordination, data interpretation and manuscript preparation. All authors read and approved the final manuscript.
Electronic supplementary material
Additional file 2: Distribution of residuals and BLUPs and Quantile-quantile plot of residual and BLUPs of the best fitted model for S × G population. For each PA variable, 4 panels are shown: distribution of residuals of the best fitted model (box-and-whisker plot, topleft), quantile-quantile plot of model residuals against a theoretical normal distribution (topright), distribution of BLUPs of the best fitted model (box-and-whisker plot, bottomleft), quantile-quantile plot of BLUPs against a theoretical normal distribution (bottomright). (PDF 1 MB)
Additional file 4: Effect of minor genotypic frequency and non-normalty of observed phenotype on the association test. Two sections are in this file. 1. Test for the enrichment of low frequency polymorphisms among associated markers. 2. Test for the effect of the non-normalty of the trait in the association tests. (PDF 48 KB)
Additional file 5: H2) in S × G and CC populations. Two tables inside: Table A, summary of S × G population; Table B, summary of CC population. Skin data were collected in 2005 and 2006 and seed data were collected in 2006 and 2007. Parental values are indicated as mean ± standard error. Broad sense heritability (H2 ) was estimated based on the best fitted model as the percentage of phenotypic variance explained by the genotypic variance. (PDF 25 KB)
Additional file 6: . Summary for consensus detection and parental detection are in two separate sheets. Term: main effect QTLs and pairwise epistatic interactions. Main effect QTLs are indicated by chromosome@position of LOD peak while interaction terms are indicated by ":" linking main effect QTLs. Map (for QTL summary of parental detection): the parental map used for QTL detection. LOD score and R2 were estimated by dropping the considered term from the full model. For loci involved in pairwise interaction, their LOD and R2 were estimated by dropping both the main effect and the associated interaction effect. df: degree of freedom dropped for QTL effect estimation. Type III SS: type III sum of squares. CI: LOD-1 confidence interval. For consensus detection, As, Ag and D indicate additive effect from Syrah alleles, additive effect from Grenache alleles and dominance effect which were estimated according to Segura et al. : As: 1/4[(μad+μac)-(μbd+μbc)], Ag: 1/4[(μac+μbc)-(μad+μbd)], D: 1/4[(μac+μbd)-(μbc+μad)], where μbd, μbc and μad are phenotypic means for corresponding genotypes relative to phenotypic mean of ac genotype. Thus here, μac = 0. effect column in QTL summary for consensus detection indicates major effects of the considered locus involved in phenotype variation, satisfying the following condition: (|As| or |Ag| or |D|)/(|As|+|Ag|+|D|) > 0.30. "Estimate" column in QTL summary for parental detection gives the allelic effect between parental alleles or the corresponding interaction effect, the sign is arbitrary. The type III sum of squares, df, LOD and R2 of the QTL model are indicated at the last row of each variables, highlighted on yellow. The loci identified in both skin and seeds for the same variable (i.e. having overlapping LOD-1 confidence interval) are highlighted in pink. (XLS 108 KB)
Additional file 7: . Parental and consensus maps are presented in parallel: left, Syrah map, indicated by S; centre, consensus map, indicated by C and right, Grenache map, indicated by G. QTLs are presented as vertical lines at the right side of each map: the line length corresponds to LOD-1 confidence interval and the LOD peak is indicated by a small horizontal bar in the confidence interval. Known candidate genes are positioned between flanking markers, indicated by red-filled bars, according to 12X grape genome sequence (http://www.genoscope.cns.fr). Red-filled bars indicated flanking marker interval of regulatory genes and green-filled bars for enzyme-coding genes. (PDF 42 KB)
Additional file 8: . -2lnlikelihood is shown. Model comparison was performed by likelihood ratio comparing each model to the most complete model. Significance was assessed using the χ2 distribution with degree of freedom as the difference in the number of parameter between two models. Significance level is indicated as *, P < 0.05, **, P < 0.01, ***, P < 0.001. (PDF 13 KB)
About this article
Cite this article
Huang, YF., Doligez, A., Fournier-Level, A. et al. Dissecting genetic architecture of grape proanthocyanidin composition through quantitative trait locus mapping. BMC Plant Biol 12, 30 (2012). https://doi.org/10.1186/1471-2229-12-30
- Quantitative Trait Locus
- Quantitative Trait Locus Analysis
- Quantitative Trait Locus Mapping
- Grape Berry