- Research article
- Open Access
Genetic analysis of safflower domestication
BMC Plant Biologyvolume 14, Article number: 43 (2014)
Safflower (Carthamus tinctorius L.) is an oilseed crop in the Compositae (a.k.a. Asteraceae) that is valued for its oils rich in unsaturated fatty acids. Here, we present an analysis of the genetic architecture of safflower domestication and compare our findings to those from sunflower (Helianthus annuus L.), an independently domesticated oilseed crop within the same family.
We mapped quantitative trait loci (QTL) underlying 24 domestication-related traits in progeny from a cross between safflower and its wild progenitor, Carthamus palaestinus Eig. Also, we compared QTL positions in safflower against those that have been previously identified in cultivated x wild sunflower crosses to identify instances of colocalization.
We mapped 61 QTL, the vast majority of which (59) exhibited minor or moderate phenotypic effects. The two large-effect QTL corresponded to one each for flower color and leaf spininess. A total of 14 safflower QTL colocalized with previously reported sunflower QTL for the same traits. Of these, QTL for three traits (days to flower, achene length, and number of selfed seed) had cultivar alleles that conferred effects in the same direction in both species.
As has been observed in sunflower, and unlike many other crops, our results suggest that the genetics of safflower domestication is quite complex. Moreover, our comparative mapping results indicate that safflower and sunflower exhibit numerous instances of QTL colocalization, suggesting that parallel trait transitions during domestication may have been driven, at least in part, by parallel genotypic evolution at some of the same underlying genes.
The process of domestication, which has long been considered to be a form of “applied evolution,” inspired some of the earliest studies of evolution in response to natural selection . Indeed, given the parallels between the adaptation of domesticated species to human-disturbed environments and the adaptation of wild populations to survival in natural environments , evolution under domestication is viewed by many as a valuable opportunity for studying the genetics of adaptation. Because many crop species share a common suite of traits (e.g., loss of seed dormancy, uniform flowering time, and fruit size) that evolved in response to selection during domestication (referred to as the “domestication syndrome”; ), comparative analyses across independent crop lineages also hold great promise for studying the genetic basis of parallel phenotypic evolution.
Over the years, quantitative trait locus (QTL) mapping has been used to investigate the genetic architecture of traits comprising the domestication syndrome in numerous crop species. Although early QTL-based studies suggested that domestication traits were predominantly controlled by a small number of large-effect QTL (e.g. [4–6]), other studies have revealed a higher level of genetic complexity (e.g. [7, 8]). Comparisons among QTL analyses can also provide insight into the extent to which parallel phenotypic changes across independent crop lineages are driven by selection on homologous genes, or at least genomic regions. For example, comparative QTL mapping across crops in the Fabaceae , Poaceae [10, 11], and Solanaceae  has provided evidence that many domestication traits, including increased seed weight, increased fruit size, and changes in flowering time and life history may be conditioned by independent changes in homologous genes in different lineages. Beyond providing fundamental evolutionary insights, such comparative genomic analyses also have the potential to aid in the improvement of other crops about which less is known. For example, knowledge that the Arabidopsis dwarfing gene, GAI, is structurally and functionally homologous to the wheat and maize dwarfing genes RHT-B1, RHT-D1, and D8, led to the transformation of the GAI gene into basmati rice to produce dwarf varieties .
In the present study, we investigate the genetic basis of the domestication syndrome in the oilseed crop safflower (Carthamus tinctorius L.; Carduoideae). Safflower is an annual, self-compatible, diploid (2n = 2x = 24; ) crop believed to have had a single origin of domestication in the Fertile Crescent region approximately 4000 years ago . This species is well-adapted to growth in dry environments, having a long taproot (reported to grow over 1.5 m; [16, 17]) that enables water uptake even when surface moisture is limiting. Originally, safflower was cultivated for its floral pigments (carthamine; ). Since its initial domestication, safflower cultivation has spread to other parts of the world, including many underdeveloped countries (e.g., Ethiopia, Afghanistan, and Sudan). Commercialization of safflower in the Americas began in the 1950s, where it has largely been used as an oilseed crop, in bird seed mixes, and as an ornamental species. Safflower is especially attractive as an oilseed crop, given that its seed oils are rich in mono- and polyunsaturated fatty acids. Though safflower possesses many of the standard traits that comprise the domestication syndrome (e.g., loss of seed dormancy, uniform flowering time, increased seed production, and increased seed oil quality and content), most cultivated safflower varieties have retained certain weed-like characteristics of their wild relatives (e.g., branching and leaf spines).
Safflower is a member of the Compositae (a.k.a. Asteraceae), which is currently recognized as the largest family of flowering plants [19, 20]. This family contains ca. 10% of all flowering plant species  and includes over 40 economically important crops grown for a variety of uses, such as safflower, lettuce (Lactuca sativa L.; Cichorioideae), and sunflower (Helianthus annuus L.; Asteroideae). These three crops represent the three major subfamilies within the Compositae, which collectively account for 95% of the species diversity within the family. Like safflower, sunflower is primarily grown as an oilseed crop. Given this, along with the wealth of available information on the origin and evolution of cultivated sunflower (e.g. [7, 8, 21–25]), our work also provides an opportunity to study the genetic basis of parallel phenotypic changes during domestication within this important family.
Here, we describe a genetic map-based study of domestication-related traits in a population derived from a cross between safflower and its wild progenitor (C. palaestinus Eig.; see below). Our results indicate that the genetic architecture of safflower domestication is complex, with the majority of traits being controlled by multiple QTL with small to moderate phenotypic effects. Moreover, a comparison of our results to those derived from similar analyses in sunflower provides evidence of QTL colocalization, highlighting possible parallels in genetic architecture between safflower and sunflower and, in some cases, suggesting that parallel trait transitions may have been driven by parallel genotypic changes in these lineages.
Seeds obtained from the USDA for both safflower (cv AC Sunset; PI 592391) and C. palaestinus (PI 235663) were germinated in the University of Georgia greenhouses during the summer of 2009. AC Sunset is an inbred, dual-purpose (i.e., birdseed and oilseed) cultivar developed in Canada . Like many other high oil varieties, the leaf margins of AC Sunset plants have prominent spines. Genetic analyses based on nuclear and chloroplast markers  as well as archaeological  and geographic evidence [18, 29] all point to the predominantly selfing C. palaestinus (2n = 2x = 24; ) as the wild progenitor of safflower. This species is native to the Middle East in the area around Israel and is fully cross-compatible with safflower. Though it exhibits considerable morphological variation for a variety of traits including leaf spininess, leaf shape, duration of rosette habit, and flower color, C. palaestinus can be distinguished from safflower based on its tendency toward non-uniform germination, an extended rosette habit, and smaller seed size. Also, contrary to the expectation based on most crop-wild comparisons, C. palaestinus exhibits more limited branching than safflower (; unpublished observation).
A single safflower plant served as a pollen donor in a cross between safflower and its wild progenitor. The F1 seeds from this cross were germinated and the resultant plants were selfed to produce F2 families, the largest of which was chosen for use in the QTL analysis described herein. A mapping population consisting of 276 F2 individuals was grown and phenotyped in the greenhouse during the summer of 2010. Additionally, nine plants of the inbred AC Sunset and nine selfed offspring of the C. palaestinus mapping parent were grown in the greenhouse alongside the mapping population to provide estimates of parental trait means under the same conditions as the mapping population. The mapping population and parental plants were transplanted into 12-inch tall treepots (Stuewe and Sons, Tangent, OR) and grown in a completely randomized fashion within a single greenhouse room. All pots were moved weekly throughout the duration of the study to minimize the effects of micro-environmental variations within the greenhouse.
Plants were checked daily and dates were recorded for estimates of root growth rate and the initiation of flowering. Root growth rate was based on the number of days until the roots reached the bottom of each pot. Leaf size, shape, perimeter, and spininess were estimated and averaged across three leaves collected from each plant (the most recent fully expanded leaf, the leaf directly below the primary capitulum, and the longest rosette leaf) and scanned for analysis with ImageJ v1.43u ). Spininess was measured using a modified version of the spine index, which was initially described in  as the number of spines on a leaf multiplied by the length of the longest spine on that leaf. Here, a “standardized spine index” was used, taken as a measure of the number of spines per centimeter of leaf margin multiplied by the length of the longest spine on that leaf.
Heads were bagged on the day of anthesis to prevent cross-pollination and potential seed loss. The height and diameter of the primary head and disc were measured using digital calipers on the day of first flowering of each plant. Stem height was measured as the length of the stem from the base of the plant to the base of the primary head. Fresh florets were collected from the third flowering head on the day that it opened and mature florets were collected from the primary head after all flowering had ceased. These florets were flash frozen in liquid nitrogen to preserve their pigments, which were later measured with a Gardner XL20 colorimeter (Bethesda, MD) using coordinates from CIE L*a*b* color space (in which L* represents luminosity and a* and b* represent the coordinates of each pigment’s hue on the red/green and blue/yellow axes, respectively; [34, 35]). For each measurement, differences in hue due to differences in light intensity were controlled for by holding L* constant at 30 units. Because floret color changes from yellow (anthesis) to red (senescence) in AC Sunset (but not in the wild progenitor), we recorded the magnitude of floret color change in each plant by calculating the difference between the a* value at flowering and the a* value at maturity. Smaller a* values correspond to yellower flowers and larger values correspond to redder flowers, and b* values changed marginally between these two colors.
Heads were harvested at physiological maturity (i.e., when the bracts were no longer green). Seven days after harvest, 12-16 achenes were sterilized using a 10% bleach solution and planted at a 1.5 cm depth into small pots maintained in a growth chamber for the assessment of seed dormancy and viability. Plants with primary heads lacking sufficient seed set (n = 119) were omitted from this analysis. Pots were monitored daily for up to 60 days and dates of seedling emergence were recorded and used to estimate the fraction of achenes that germinated within the 60 day window and calculate the average time to germination for each F3 family.
At the conclusion of flowering, the number of senesced heads, the height (above the soil) of the lowest branching point, the number of internodes, and the length of the second internode were recorded for each plant. As the remaining heads reached maturity, selfed achenes were harvested, counted, weighed, and measured. Average achene weight was based on a random subset of 50 achenes. However, for plants that produced less than 50 achenes, the average achene weight was based on all achenes produced.
Seed oil content and composition were estimated for all plants with sufficient seed set. These measurements were taken following previously established protocols (percent oil content: [21, 36]; seed oil composition: ). Briefly, a Bruker MQ20 minispec NMR analyzer (The Woodlands, TX) was used to determine percent oil content. The standard protocol was modified to accommodate measurements based on small seed sets by placing ca. one cm of tissue paper into the flat-bottomed tubes and adding ca. one cm of cleaned safflower seeds on top of the tissue paper. Percent oil content was estimated using a calibration curve using commercial safflower oil as a standard (Hollywood, Boulder, CO). Oil composition was determined based on gas chromatography of fatty acid methyl esters. A total of ten achenes from each plant were hand ground and fatty acids were extracted and then analyzed using an Agilent 6890 N gas chromatograph (Santa Clara, CA).
Single nucleotide polymorphisms (SNPs) that differentiated the parents of our population were identified from expressed sequence tag (EST) and transcriptome data generated from each parent. For the cultivated mapping parent, we used the AC Sunset EST data produced via Sanger sequencing of cDNAs described in Chapman et al. . For the wild mapping parent, we produced transcriptome sequence data via 454 sequencing (454 Life Sciences, Branford, CT) as follows: RNA was extracted from mature leaves, bracts, florets, and developing ovules collected from a single C. palaestinus plant using a combined trizol (Invitrogen, Carlsbad, CA) and RNeasy mini column (Qiagen, Valencia, CA) method. RNA extracted from each tissue type was pooled in equal proportions, normalized prior to 454 library preparation following the protocols described by Lai et al.  and Meyer et al. , sequenced, and assembled using MIRA v3.0.3  (see Additional file 1).
The assembled sequences from the mapping parent assemblies were aligned to each other using Mosaik . SNPs were identified using SAMtools  and run through the Illumina GoldenGate “assay design tool” (San Diego, CA; http://illumina.com/), which identified SNPs free of other polymorphisms within 60 bp of the targeted SNP site and assigned a quality score predicting the success with which a SNP would be assayed. To facilitate downstream comparative genomic analyses, SNPs used in this study were preferentially selected from Carthamus unigenes with mapped homologs in the high-density sunflower “consensus” map . A total of 384 SNPs meeting design requirements were targeted for genotyping, and a subset of these were validated via genetic mapping (see below).
Genotyping and map construction
Whole genomic DNA was extracted from leaf tissue from each F2 plant as well as the mapping parents using a modified CTAB protocol . DNA concentrations were estimated using the Quant-iT PicoGreen kit (Invitrogen) using a Biotek Synergy 2 plate reader (Winooski, VT). The Illumina GoldenGate Assay described above was then used to genotype each sample on the BeadXpress Reader (Illumina) at the Georgia Genomics Facility. Allele calls were obtained using the Illumina GenomeStudio software v2011.1.
A genetic linkage map was constructed using MapMaker 3.0/EXP [45, 46]. Briefly, initial linkage groups (LGs) were identified using the “group” command with a minimum LOD score of 5.0 and a maximum frequency of recombination of 0.4 between adjacent markers. Preliminary map orders were determined using the “compare” command on a subset of markers within LGs and the remaining markers were placed using the “try” command. For each LG, marker orders were confirmed using the “ripple” command and the final marker orders, presented here, represent the most likely marker orders given the data.
Statistical analyses and QTL mapping
Histograms and trait means of the mapping population and mapping parents were plotted for visualization using the R Statistical Package . Estimated parental trait values were further analyzed to test for significant differences. This was done using either Welch’s t-tests or, when trait distributions deviated significantly from normality (as determined by the Shapiro-Wilk test for normality), Wilcoxon signed-rank tests. Spearman correlation coefficients were calculated among all traits measured in the F2 mapping population using the “hmisc” package  in R . Significance was determined using the sequential Bonferroni method with α = 0.05 .
QTL were identified using QTL Cartographer v1.17j [50, 51] following established approaches (e.g. [7, 8, 21, 24]). Briefly, composite interval mapping (CIM) was performed in ZmapQTL with a 10 cM window and a maximum of five background cofactors identified using SRmapQTL with forward/backward stepwise regression, and tests were performed at 2 cM intervals. Permutation thresholds (α = 0.05 and 0.1) for declaring QTL significance were estimated based on 1000 permutations for each trait [52, 53]. Secondary peaks were not considered as separate QTL unless there was a 2-LOD decline between adjacent peaks.
The results generated from CIM were then used as an initial model for multiple interval mapping (MIM), as implemented by MIMapQTL . This analysis was used to confirm QTL identified via CIM. Following the authors’ recommendations, the information criterion was set as IC(k) = -2(log(L)-kc(n)/2), where c(n) = log(n) was the penalty function and the threshold was set at 0. Epistasis was investigated at a genome-wide level using EPISTACY v2  to test for interactions between all possible pairs of codominant markers that exhibited unique segregation patterns (i.e., redundant markers that showed identical segregation patterns were joined into a single haplotype to reduce the number of pairwise comparisons). As suggested by the author, significance was determined by dividing the comparison-wise error rate (α = 0.05) by g(g-1)/2, where g is the haploid number of LGs in safflower (n = 12).
The mode of gene action for each QTL was estimated by dividing the dominance effect of the cultivar allele by its additive effect (d/a), such that cultivar alleles that are completely recessive have a value of -1 and those completely dominant have a value of +1. Following the cutoffs employed by Burke et al. , the mode of gene action of the cultivar allele at each locus was categorized as follows: underdominant ≤ -1.25 < recessive ≤ -0.75 < partially recessive ≤ -0.25 < additive < 0.25 ≤ partially dominant < 0.75 ≤ dominant < 1.25 ≤ overdominant. Additionally, the magnitude of the effect of each QTL was considered to be “large” if the percentage of segregating phenotypic variance explained (PVE) was greater than 25%, “small” if the PVE was less than 10%, and “intermediate” if in between these values.
Comparative genomic analyses
In order to identify homologous loci between the safflower and sunflower genomes, all ESTs harboring safflower SNPs mapped in this study as well as all loci from the 10,000 feature sunflower consensus map  were compared via BLAST to the lettuce genome, v4 . As noted above, lettuce is a member of the Cichorioideae, which falls at an intermediate phylogenetic position between the Carduoideae and the Asteroideae. The use of the sequenced lettuce genome as an intermediary greatly simplified these analyses because it is the same ploidy level as safflower (though functionally diploid, sunflower is a paleopolyploid [i.e., tetraploid relative to safflower and lettuce] due to an ancient whole genome duplication at the base of the Heliantheae; ) and because it dramatically increased the number of mapped loci bridging the safflower and sunflower maps (i.e., virtually all of the ESTs from which the safflower and sunflower markers were derived could be matched to corresponding sequences in the lettuce genome). The top two BLASTN hits with an e-value better than 1x10-6 were recorded and, to facilitate comparative QTL mapping, efforts were focused on establishing homology across genomes in regions of the sunflower genome containing relevant QTL (see below). Pairs of linkage groups across species that had three or more homologous loci (as determined by BLAST) were considered to be putatively homologous (i.e., syntenic) chromosomal regions. Instances where multiple chromosomal regions containing relevant sunflower QTL exhibited homology to one or more regions in lettuce, perhaps due to the duplication history of these species, were retained for further analysis, as were cases in which multiple lettuce LGs exhibited homology to one or more safflower LGs. Finally, QTL-containing regions exhibiting 1:1:1 homology across the three genomes were also retained.
We then surveyed the literature to identify previously mapped sunflower domestication QTL for traits homologous to those investigated in this study (Additional file 2). Because many of the markers used to map these QTL were included in the sunflower consensus map , it was possible to project the positions of these QTL onto that map for comparative QTL mapping. For instances in which the bounds of 1-LOD intervals could not be directly projected onto the consensus map (due to an absence of shared markers at the 1-LOD boundaries), we estimated the distance from shared markers within the 1-LOD interval to the boundaries based on relative map lengths. To determine the probability that instances of QTL colocalization were due to chance alone, we used the hypergeometric probability distribution function (‘sampling without replacement’; ); as described in Paterson et al.  and Paterson  as follows:
where n is the number of intervals that can be compared (estimated here as the genome size divided by average QTL size for a given trait), m is the number of colocalizing QTL, l is the total number of QTL in the larger sample, and s is the number of QTL in the smaller sample for a given trait.
Of the 24 traits analyzed, 15 differed significantly between the mapping parents when grown alongside the mapping population (Table 1). Comparisons of the means and standard deviations of the mapping parent representatives to the F2 trait distributions revealed transgressive segregation for the majority of the traits analyzed (i.e., there are F2 individuals with trait values exceeding one standard deviation in either direction of the mapping parents; Figure 1). The most extreme examples of transgressive segregation were for traits related to vegetative growth: capitulum height, disc diameter, stem height, leaf size, and achene size. In other words, many of the F2 plants and their achenes were larger than either of the mapping parents.
Approximately one-fourth of the mapping population had florets that developed a deep red color at maturity. The flower color ratio did not differ significantly from a 3:1 distribution (χ2 = 0.16, P = 0.6; data not shown). This suggests that flower color variation in the mapping population is controlled by a single gene, with the ability to turn red being recessive.
Many of the traits under study were correlated within the F2 mapping population (Additional file 3) in a way that was largely consistent with the observed parental trait combinations. Several of these correlations, however, were no longer significant after Bonferonni correction. Notably, the total number of selfed achenes produced was positively correlated with achene oil content (ρ = 0.573, P < 0.001) and stem height (ρ = 0.420, P < 0.001) and negatively correlated with achene dimensions (achene weight: ρ = -0.562, P < 0.001; achene width: ρ = -0.558, P < 0.001; achene length: ρ = - 0.730, P < 0.001).
Of the 384 SNPs designed for the Illumina GoldenGate assay, 244 exhibited interpretable polymorphisms that could be used for genetic mapping (Additional file 4). The remaining 140 markers were omitted due to monomorphism in the mapping population, overly complex segregation patterns (presumably due to paralogy), or failure of the assay probes to hybridize with the target DNA. Although SNPs are typically scored as codominant markers, 26 of the SNPs included in this study were scored as dominant markers due to the lack of a clear distinction between one of the homozygote classes and the heterozygote class (these markers are flagged with an asterisk in Figure 2).
The 244 markers coalesced into 12 LGs, consistent with the haploid number of n = 12 chromosomes that has previously been reported for safflower (Figure 2; ). These LGs ranged in size from 30.7 to 105.3 cM (average = 71.5), with each group containing 6 to 40 markers (average = 20.3). A Poisson goodness-of-fit test revealed that the mapped markers were non-randomly distributed along the linkage groups (P < 0.0001). The total map length summed to 858.2 cM, resulting in an average intermarker distance of 3.7 cM (range = 0.0-39.6 cM).
The segregation patterns of 14 out of the 244 mapped markers deviated from Mendelian expectations (i.e., they exhibited significant segregation distortion after Bonferroni correction; [60–62]. Eleven of these loci were located in two distorted regions spanning 13 cM (on LG K) and 12 cM (on LG L; Figure 2). Distortion occurred in both directions: in some cases, the wild allele was overrepresented in the mapping population while in other cases, the cultivar allele was overrepresented. Two markers exhibited an underrepresentation of both homozygote classes, yielding a heterozygote excess. Within each of the two aforementioned distorted regions, however, the direction of deviation remained consistent. The segregation of the markers on LG K was consistently skewed in favor of the wild allele, whereas the segregation of the markers on LG L was consistently skewed in favor of the cultivar allele. Further, the magnitude of the distortion was significantly greater on LG L (P = 0.01).
A total of 61 QTL were identified for 21 of the 24 traits studied (Table 2, Figure 2). Only LG F lacked QTL. Eight of the sixty-one QTL were marginally significant, having been identified at the α = 0.1 permutation threshold during CIM; the remainder exceeded the α = 0.05 permutation threshold and all of these QTL were confirmed via MIM. The 1-LOD confidence intervals for these QTL averaged 13.5 cM, ranging from 1.5 to 31.9 cM. Of the 21 traits, 16 traits had multiple QTL (range = 2-7). Nearly all mapped QTL 1-LOD intervals overlapped at least 1 of the 244 mapped markers, with the lone exception of a QTL for% oil that mapped between markers on LG L. One trait had two instances of antagonistic QTL on the same LG (capitulum height; on both LGs A and D), where the cultivar-derived allele for one QTL conferred a cultivar-like phenotype and the cultivar-derived allele for the other QTL conferred a wild-like phenotype. The majority of all QTL identified mapped to one of seven QTL clusters on seven different chromosomes, with each cluster harboring three to twelve QTL, and a Poisson goodness-of-fit test revealed that the observed QTL were non-randomly distributed (P < 0.0001). Also, 23 of the QTL identified in this study mapped to 3 of 7 genomic regions that exhibited marker clustering (LGs C, E, and H), and a bionomial goodness-of-fit test (P < 0.0001) indicates that the distribution of markers and QTL are significantly non-independent.
The PVE for the identified QTL ranged from 4.2% to 63.4% (Table 2), with the majority of the QTL having small effects (PVE <10%). There were 13 QTL with intermediate effects and just 2 QTL with large effects (spininess and flower color, 32.7% and 63.4% respectively). For traits that differed significantly between the mapping parents, it was possible to investigate whether the respective QTL had allelic effects in the expected direction (i.e., whether or not the cultivar allele produced a more cultivar-like phenotype). Examination of the 44 QTL identified for the 15 traits with means differing significantly in the parents revealed 12 QTL for 9 traits conferring phenotypes in the “wrong” direction. For the remaining three traits for which multiple QTL were identified, the effects of all QTL were consistent with the observed parental trait differences, even if those differences were not significant (average leaf size, spininess, and number of internodes; Table 2). Interestingly, the majority of the seed width QTL had cultivar alleles conferring a wild-like phenotype. The mode of gene action of each QTL ranged from -21.7 (average seed weight) to 7.3 (total number of seeds per plant), though the majority fell in the -1 to 1.25 range (average ± S.E. = 0.72 ± 0.40). Traits with overdominant QTL included flowering time, total number of seeds, seed weight, seed length, seed width, and seed oil content; those with underdominant QTL included leaf size, number of capitula, and seed weight.
The genome-wide scan for epistasis detected a total of 105 significant epistatic interactions at the α < 0.05 level (after correcting for multiple comparisons) for 19 traits (Additional file 5). Note that multiple interactions detected among loci on the same linkage groups were counted as a single interaction, accounting for the non-independence of these loci due to linkage. Not all traits with QTL were found to be influenced by epistatic loci and some traits only appeared to be influenced by loci with epistatic effects. On average, 5.5 ± 1.27 interactions were identified for the traits with epistasis, ranging from 1 to 19 significant interactions per trait. Additive-by-additive and additive-by-dominant interactions comprised the majority of the interactions (40 and 42 interactions, respectively). Upon closer inspection of the EPISTACY results, we noticed two traits that had QTL × QTL interactions: leaf shape (LG H × L, additive-by-additive) and seed length (LG K × C, additive-by-dominant and dominant-by-dominant).
We were somewhat limited in our ability to directly identify instances of QTL colocalization between safflower and sunflower due to the relative paucity of markers in safflower and, consequently, a limited number of shared markers between the maps. The lettuce genome sequence, however, served as an effective intermediary in bridging the gaps between these maps. We ultimately identified 14 QTL corresponding to 10 different domestication-related traits in safflower (1 to 3 QTL per trait) that colocalized with previously identified QTL in sunflower (Figure 3, Additional file 6; [7, 8, 22–25]). Five of these traits in safflower and sunflower were bridged by at least one lettuce scaffold (days to flower, disc diameter, number of selfed seed, and achene length and width) and an additional two traits had QTL sharing a homologous gene sequence in all three species (number of heads and achene weight). As expected based on the history of genome duplication within the family, we observed a number of duplicated regions between species and, in some cases, these regions contained relevant QTL. Four QTL corresponding to four traits (achene weight, number of selfed seed, number of heads, and days to flower) in safflower colocalized with multiple QTL in sunflower (i.e., a one-to-several relationship existed for these traits in safflower vs. sunflower QTL comparisons; Additional file 6; [7, 8, 24]). Interestingly, we identified colocalized QTL for days to flower, achene length, and number of selfed seeds in which the cultivar allele conferred phenotypic effects in the same direction in both species (Figures 2 and 3, Additional file 6), suggesting that the parallel domestication of these traits is the result of parallel selective pressures.
We saw highly significant evidence of QTL colocalization for % oil content (LG I; P = 0.001; Figure 3; ) as well as marginally significant evidence of QTL colocalization for days to flower (LG I; P = 0.08), achene weight (LGs C and H; P = 0.07), % linoleic acid (LG H; P = 0.08), and achene width (LG C; P = 0.09). The significance of the remaining traits that exhibited QTL colocalization between safflower and sunflower was less compelling (P-values ranged from 0.12 to 0.21 for disc diameter, number of heads, number of selfed seeds, achene length, and % oleic acid). Nonetheless, when applied across all ten traits with evidence of QTL colocalization, Fisher’s combined probability test was highly significant (P = 0.0001) and remained significant even when excluding the highly significant result for % oil content (P = 0.004).
Genetic architecture of safflower domestication
Our results indicate that domestication-related traits in safflower are largely controlled by multiple genes of small to moderate effect. Only two traits (flower color and spininess) had “major” QTL (i.e., PVE > 25%). As such, the genetic architecture of safflower domestication appears similar to that of sunflower, which is the only other crop in which QTL analyses have revealed such a clear paucity of major effect QTL [7, 8, 24]. More commonly, QTL mapping has suggested that domestication-related traits are conditioned by a relatively small number of loci with large phenotypic effects (reviewed in [63, 64]). Recent population genomic analyses have, however, revealed that much larger numbers of genes are typically under selection during crop domestication and/or improvement ([65, 66]; reviewed in [67, 68]). These findings suggest that the genetic architecture of domestication traits is likely to be complex, even for crops in which initial QTL-based approaches have suggested otherwise.
The single largest QTL identified in our study explained 63.4% of the phenotypic variance in flower color. Further, our observation of 3:1 segregation of flower color suggests that differences in the production of carthamine (the quinochalcone pigment responsible for the production of red florets) within our mapping population are due to the effects of a single locus. Earlier crossing studies of safflower suggested that multiple genes influence flower color [69, 70] and more recent studies have shown that there are at least two interacting genes differentiating orange and yellow florets . The fact that we identified just a single QTL suggests that the mapping parents in our population differ primarily in terms of the production of carthamine as opposed to the other floral pigments. More generally, the findings of single, large effect QTL for the presence of a particular floral pigment as well as for leaf spines are consistent with the views of Gottlieb , who argued that presence/absence characters and major or structural differences in plants are commonly controlled by just one or two genes.
Map features and QTL distribution
All but 14 of the markers analyzed exhibited Mendelian segregation ratios, with eleven of these markers occurring within two distorted regions. Though the cause of this distortion remains unknown, it may be due to genomic divergence between the mapping parents in these regions. In this light, it is worth noting the distorted region on LG K harbors QTL for seed-related traits and the distorted region on LG L harbors the large effect QTL for spininess as well as other QTL for internode length, number of internodes, disc diameter, capitulum height, and leaf shape.
In terms of overall marker distribution, we observed numerous tight clusters across multiple LGs. These marker clusters could be a byproduct of an uneven distribution of genes across the genome (recall that all SNPs employed in this study were selected from transcribed sequences), chromosomal rearrangements that differentiate the mapping parents, and suppress recombination in affected regions (though the F1 hybrids did not seem to suffer reduced fertility), or – perhaps more likely – the suppression of recombination in and near centromeres. Clustering has also been reported in other genetic maps generated from C. tinctorius × C. tinctorius crosses, though to a lesser extent [73, 74].
We likewise observed a number of QTL clusters across the genome. In some cases, these clusters appeared to co-occur with the aforementioned marker clusters, suggesting that they may be mapping to gene dense regions or to regions with suppressed recombination. It has been argued that species in which the genes underlying domestication-related traits are clustered may be inherently easier to domesticate . In this context, it is worth noting that clustering of domestication-related loci has previously been documented in a number of other crops, including maize , common bean , pearl millet , pepper  and sunflower  (reviewed in ). While Pernès  predicted that the linkage of domestication genes can aid cross-pollinated crops in maintaining trait complexes that comprise the domestication syndrome, and further modeling has supported this prediction , empirical studies (including the present study) have indicated that these QTL clusters are also found in highly selfing crops [6, 81–83]. It is, however, possible that these crops are more allogamous than they seem, or perhaps that increased allogamy occurred earlier in the domestication process, thereby helping to “assemble the domestication syndrome” .
In general terms, transgressive segregation can be produced by complementary gene action, overdominance, and/or epistasis. The former, in which the parents possess alleles with opposing (i.e., antagonistic) effects at complementary genes and thus have the potential to produce segregating progeny carrying an excess of alleles with effects in the same direction [84, 85], has been implicated as the most common cause of this phenomenon (summarized in [85, 86]). Consistent with this view, the traits with the most extreme transgressive segregation in our population (capitulum height, disc diameter, achene length and width, and achene oil content), were conditioned by multiple QTL (two to seven per trait) and, in many cases, each parental individual carried a mix of alleles with positive and negative effects for these traits. However, we also detected evidence of overdominant QTL effects for three of the traits exhibiting transgressive segregation (total seeds, achene length, achene width, and seed weight). Therefore, neither overdominance nor pseudo-overdominance can be discounted. Finally, many of these same traits were influenced by multiple genetic interactions, suggesting that epistasis could have played a role in producing the observed transgressive segregation – though instances of epistasis were certainly not limited to traits exhibiting transgressive segregation.
The numerous instances of QTL colocalization that we observed suggests that there are many parallels in the genetic architecture of domestication-related traits between safflower and sunflower. While additional work, including fine-mapping, positional cloning, and functional analyses, will be required to establish with certainty that the same underlying genes are responsible for these instances of colocalization, our findings are consistent with the view that selection may, in some cases, have acted in on the same genes during the independent domestications of safflower and sunflower. Such parallel genotypic evolution has been observed in other animal and plant systems, including the evolution of red and green color vision in multiple vertebrate species  (reviewed in ), the ability of bats, dolphins, and whales to echolocate [89, 90] (reviewed in ), herbicide resistance in maize and cocklebur  (reviewed in ), and the glutinous phenotype in rice [93–95], Chinese waxy maize , and foxtail millet  (reviewed in ). Going forward, an improved understanding of the genes underlying parallel trait transitions will provide key insights into the repeatability of evolution, helping us to better predict the phenotypic effects of genotypic changes across a broad array of crops.
Here, we have presented data demonstrating the complex nature of domestication-related traits in safflower. Our work thus contributes to a growing body of literature showing that crop origins are genetically more complex than once thought. By comparing our QTL mapping results to those from previous studies in sunflower, we have further documented the existence of numerous apparent parallels in the genetic architecture of domestication-related traits within the Compositae. Taken together, these results suggest that selection targeting some of the same genes in safflower and sunflower may have contributed to the parallel trait transitions that occurred during their independent domestications. This work also sets the stage for future analyses aimed at identifying the genes underlying these agronomically important traits, making it possible to further test hypotheses regarding the genetic basis of parallel phenotypic evolution. Such efforts will not only provide critical insights into the repeatability of evolution, but will also facilitate the continued development of safflower as an oilseed crop.
Availability of supporting data
The data sets supporting the results of the article are available in the NCBI dbEST (C. tinctorius ESTs; http://www.ncbi.nlm.nih.gov/dbEST) [GenBank:EL372565-EL412381 and EL511108-EL511145] and will become available in the NCBI SRA (C. palaestinus 454 reads; accession number SRR953755) and the Dryad Digital Repository (C. palaestinus assembly; http://dx.doi.org/10.5061/ 783 Q4dryad [NNNN]).
Darwin C: The Variation of Animals and Plants Under Domestication. 1868.
Ross-Ibarra J, Morrell PL, Gaut BS: Plant domestication, a unique opportunity to identify the genetic basis of adaptation. Proc Natl Acad Sci. 2007, 104: 8641-8648. 10.1073/pnas.0700643104.
Harlan JR: Crops and Man. 2nd edition. Madison, WI: American Society ofAgronomy; 1992.
Doebley J: Molecular evidence and the evolution of maize. Econ Bot. 1990, 44: 6-27. 10.1007/BF02860472.
Doebley J, Stec A: Genetic analysis of the morphological differences between maize and teosinte. Genetics. 1991, 129: 285-295.
Koinange EMK, Singh SP, Gepts P: Genetic control of the domestication syndrome in common bean. Crop Sci. 1996, 36: 1037-1045. 10.2135/cropsci1996.0011183X003600040037x.
Burke JM, Tang S, Knapp SJ, Rieseberg LH: Genetic analysis of sunflower domestication. Genetics. 2002, 161: 1257-1267.
Wills DM, Burke JM: Quantitative trait locus analysis of the early domestication of sunflower. Genetics. 2007, 176: 2589-2599. 10.1534/genetics.107.075333.
Fatokun CA, Menancio-hautea DI, Danesh D, Young ND: Evidence for orthologous seed weight genes in cowpea and mung bean based on RFLP mapping. Genetics. 1992, 132: 841-846.
Hu FY, Tao DY, Sacks E, Fu BY, Xu P, Li J, Yang Y, McNally K, Khush GS, Paterson A, Li Z-K: Convergent evolution of perenniality in rice and sorghum. Proc Natl Acad Sci. 2003, 100: 4050-4054. 10.1073/pnas.0630531100.
Paterson A, Lin YR, Li Z, Schertz KF, Doebley JF, Pinson SR, Liu SC, Stansel JW, Irvine JE: Convergent domestication of cereal crops by independent mutations at corresponding genetic loci. Science. 1995, 269: 1714-1718. 10.1126/science.269.5231.1714.
Doganlar S, Frary A, Daunay M-C, Lester RN, Tanksley SD: Conservation of gene function in the Solanaceae as revealed by comparative mapping of domestication traits in eggplant. Genetics. 2002, 161: 1713-1726.
Peng J, Richards DE, Hartley NM, Murphy GP, Devos KM, Flintham JE, Beales J, Fish LJ, Worland AJ, Pelica F, Sudhakar D, Christou P, Centre JI, Snape JW, Gale MD, Harberd NP: “Green revolution” genes encode mutant gibberellin response modulators. Nature. 1999, 400: 256-261. 10.1038/22307.
Patel J, Narayana G: Chromosome numbers in safflower. Curr Sci. 1935, 4: 412.
Weiss EA: Safflower. In Oilseed Crops. 2nd edition. London: Blackwell Science Ltd; 2000:93–129.
Kar G, Kumar A, Martha M: Water use efficiency and crop coefficients of dry season oilseed crops. Agric Water Manag. 2007, 87: 73-82. 10.1016/j.agwat.2006.06.002.
Merrill SD, Tanaka DL, Hanson JD: Root length growth of eight crop species in haplustoll soils. Soil Sci Soc Am J. 2002, 66: 913-923. 10.2136/sssaj2002.0913.
Knowles P, Ashri A: Safflower -- Carthamus Tinctorius (Compositae). Evolution of Crop Plants. Edited by: Smartt J, Simmonds NW. 1995, Harlow, UK: Longman, 47-50.
Angiosperm phylogeny website. http://www.mobot.org/MOBOT/research/APweb/.
Funk VA, Bayer RJ, Keely S, Chan R, Watson L, Gemeinholzer B, Schilling E, Panero JL, Baldwin BG, Garcia-Jacas N, Susanna A, Jansen RK: Everywhere but Antarctica: using a supertree to understand the diversity and distribution of the Compositae. Biologiske Skrifter. 2005, 55: 343-374.
Burke JM, Knapp SJ, Rieseberg LH: Genetic consequences of selection during the evolution of cultivated sunflower. Genetics. 2005, 171: 1933-1940. 10.1534/genetics.104.039057.
Dechaine JM, Burger JC, Chapman MA, Seiler GJ, Brunick R, Knapp SJ, Burke JM: Fitness effects and genetic architecture of plant-herbivore interactions in sunflower crop-wild hybrids. New Phytol. 2009, 184: 828-841. 10.1111/j.1469-8137.2009.02964.x.
Dechaine JM, Burger JC, Burke JM: Ecological patterns and genetic analysis of post-dispersal seed predation in sunflower (Helianthus annuus) crop-wild hybrids. Mol Ecol. 2010, 19: 3477-3488. 10.1111/j.1365-294X.2010.04740.x.
Wills DM, Abdel-Haleem H, Knapp SJ, Burke JM: Genetic architecture of novel traits in the hopi sunflower. J Hered. 2010, 101: 727-736. 10.1093/jhered/esq089.
Chapman MA, Burke JM: Evidence of selection on fatty acid biosynthetic genes during the evolution of cultivated sunflower. Theor Appl Genet. 2012, 125: 828-841.
Mündel H, Huang HC, Braun JP, Kiehn F: AC sunset safflower. Can J Plant Sci. 1996, 76: 469-471. 10.4141/cjps96-083.
Chapman MA, Burke JM: DNA sequence diversity and the origin of cultivated safflower (Carthamus tinctorius L.; Asteraceae). BMC Plant Biol. 2007, 7: 60-10.1186/1471-2229-7-60.
Van Zeist W, Rooijen Waterbolk-Van W: Two interesting floral finds from third millenium BC Tell Hammam et-Turkman, northern Syria. Veg Hist Archaeobot. 1992, 1: 157-161.
Ashri A, Rudich J: Unequal reciprocal natural hybridization rates between two Carthamus L. species. Crop Sci. 1965, 5: 190-191. 10.2135/cropsci1965.0011183X000500020032x.
Ashri A, Knowles PF: Cytogenetics of safflower (Carthamus L.) species and their hybrids. Agron J. 1960, 52: 11-17. 10.2134/agronj1960.00021962005200010004x.
Hanelt P: Monographische ubersicht der gattung Carthamus L. (Compositae). Feddes Repertorium. 1963, 67: 41-180.
Schneider CA, Rasband WS, Eliceiri KW: NIH Image to ImageJ: 25 years of image analysis. Nat Methods. 2012, 9: 671-675. 10.1038/nmeth.2089.
Claassen CE, Ekdahl WG, Severson GM: The estimation of oil percentage in safflower seed and the association of oil percentage with hull and nitrogen percentages, seed size, and degree of spininess of the plant. Agron J. 1950, 42: 478-482. 10.2134/agronj1950.00021962004200100003x.
Colorimetric fundamentals: CIE 1976 L*a*b* (CIELAB). http://industrial.datacolor.com/support/wp-content/uploads/2013/01/Color-Fundamentals-Part-II.pdf.
Technical report: colorimetry. https://law.resource.org/pub/us/cfr/ibr/003/cie.15.2004.pdf.
Tang S, Leon A, Bridges WC, Knapp SJ: Quantitative trait loci for genetically correlated seed traits are tightly linked to branching and pericarp pigment loci in sunflower. Crop Sci. 2006, 46: 721-734. 10.2135/cropsci2005.0006-7.
Chapman MA, Hvala J, Strever J, Matvienko M, Kozik A, Michelmore RW, Tang S, Knapp SJ, Burke JM: Development, polymorphism, and cross-taxon utility of EST-SSR markers from safflower (Carthamus tinctorius L.). Theor Appl Genet. 2009, 120: 85-91. 10.1007/s00122-009-1161-8.
Lai Z, Kane NC, Kozik A, Hodgins KA, Dlugosch KM, Barker MS, Matvienko M, Yu Q, Turner KG, Pearl SA, Bell GDM, Zou Y, Grassa C, Guggisberg A, Adams KL, Anderson JV, Horvath DP, Kesseli RV, Burke JM, Michelmore RW, Rieseberg LH: Genomics of Compositae weeds: EST libraries, microarrays, and evidence of introgression. Am J Bot. 2012, 99: 209-218. 10.3732/ajb.1100313.
Meyer E, Aglyamova G, Wang S, Buchanan-Carter J, Abrego D, Colbourne JK, Willis BL, Matz M: Sequencing and de novo analysis of a coral larval transcriptome using 454 GS-Flx. BMC Genomics. 2009, 10: 219-10.1186/1471-2164-10-219.
Chevreux B, Wetter T, Suhai S: Genome Sequence Assembly Using Trace Signals and Additional Sequence Information. German Conference on Bioinformatics. 1999.
Lee W-P, Stromberg M, Ward A, Stewart C, Garrison E, Marth G: MOSAIK: a hash-based algorithm for accurate next-generation sequencing read mapping. 2013, arXiv: 1309-1149.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, Marth G, Abecasis G, Durbin R: The sequence alignment/Map format and SAMtools. Bioinformatics. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352. http://samtools.sourceforge.net/.
Bowers JE, Nambeesan S, Corbi J, Barker MS, Rieseberg LH, Knapp SJ, Burke JM: Development of an ultra-dense genetic map of the sunflower genome based on single-feature polymorphisms. PLoS ONE. 2012, 7: e51360-10.1371/journal.pone.0051360.
Doyle JL, Doyle JR: A rapid DNA isolation procedure for small quantities of fresh leaf tissue. Phytochemistry Bulletin. 1987, 19: 11-15.
Lander E, Green P, Abrahamson J, Barlow A, Daly MJ, Lincoln SE, Newburg L: MAPMAKER: an interactive computer package for constructing primary genetic linkage maps of experimental and natural populations. Genomics. 1987, 1: 174-181. 10.1016/0888-7543(87)90010-3.
Lincoln SE, Lander ES: Systematic detection of errors in genetic linkage data. Genomics. 1992, 14: 604-610. 10.1016/S0888-7543(05)80158-2.
R: A language and environment for statistical computing. http://www.r-project.org.
Harrell FE: Hmisc: harrell miscellaneous. 2012, http://cran.r-project.org/package=Hmisc.
Rice WR: Analyzing tables of statistical tests. Evolution. 1989, 43: 223-225. 10.2307/2409177.
Basten CJ, Weir BS, Zeng Z-B: QTL cartographer. Raleigh NC. 2004, http://statgen.ncsu.edu/qtlcart/.
Basten C, Weir B, Zeng Z-B: Zmap-a QTL Cartographer. In Proceedings of the 5thWorld Congress on Genetics Applied to Livestock Production: Computing Strategiesand Software. Edited by Smith C, Gavora J, Benkel B, Chesnais J, Fairfull W,Gibson J, Kennedy B, Burnside E. Guelph, Ontario, Canada: OrganizingCommittee, 5th World Congress on Genetics Applied to LivestockProduction; 1994:65–66.
Churchill GA, Doerge RW: Empirical threshold values for quantitative trait mapping. Genetics. 1994, 138: 963-971.
Doerge RW, Churchill GA: Permutation tests for multiple loci affecting a quantitative character. Genetics. 1996, 142: 285-294.
Kao CH, Zeng ZB, Teasdale RD: Multiple interval mapping for quantitative trait loci. Genetics. 1999, 152: 1203-1216.
Holland JB: EPISTACY: a SAS program for detecting Two-locus epistatic interactions using genetic marker information. J Hered. 1998, 89: 374-375. 10.1093/jhered/89.4.374. http://www4.ncsu.edu/~jholland/Epistacy/epistacy.htm.
Lettuce genome resource. http://lgr.genomecenter.ucdavis.edu/.
Barker MS, Kane NC, Matvienko M, Kozik A, Michelmore RW, Knapp SJ, Rieseberg LH: Multiple paleopolyploidizations during the evolution of the Compositae reveal parallel patterns of duplicate gene retention after millions of years. Mol Biol Evol. 2008, 25: 2445-2455. 10.1093/molbev/msn187.
Larsen RJ, Marx ML: An Introduction to Probability and Its Applications.Englewood Cliffs, NJ: Prentice Hall Inc.; 1985.
Paterson AH: What has QTL mapping taught us about plant domestication?. New Phytol. 2002, 154: 591-608. 10.1046/j.1469-8137.2002.00420.x.
Sandler L, Novitski E: Meiotic drive as an evolutionary force. Am Nat. 1957, 91: 105-110. 10.1086/281969.
Sandler L, Golic K: Segregation distortion in Drosophila. Trends Genet. 1985, 1: 181-185.
Lyttle TW: Segregation distorters. Annu Rev Genet. 1991, 25: 511-557. 10.1146/annurev.ge.25.120191.002455.
Burger JC, Chapman MA, Burke JM: Molecular insights into the evolution of crop plants. Am J Bot. 2008, 95: 113-122. 10.3732/ajb.95.2.113.
Gross BL, Olsen KM: Genetic perspectives on crop domestication. Trends Plant Sci. 2010, 15: 529-537. 10.1016/j.tplants.2010.05.008.
Wright SI, Bi IV, Schroeder SG, Yamasaki M, Doebley JF, McMullen MD, Gaut BS: The effects of artificial selection on the maize genome. Science. 2005, 308: 1310-1314. 10.1126/science.1107891.
Hufford MB, Xu X, Van Heerwaarden J, Pyhäjärvi T, Chia J-M, Cartwright RA, Elshire RJ, Glaubitz JC, Guill KE, Kaeppler SM, Lai J, Morrell PL, Shannon LM, Song C, Springer NM, Swanson-Wagner RA, Tiffin P, Wang J, Zhang G, Doebley J, McMullen MD, Ware D, Buckler ES, Yang S, Ross-Ibarra J: Comparative population genomics of maize domestication and improvement. Nat Genet. 2012, 44: 808-811. 10.1038/ng.2309.
Morrell PL, Buckler ES, Ross-Ibarra J: Crop genomics: advances and applications. Nat Rev Genet. 2012, 13: 85-96.
Wallace JG, Larsson SJ, Buckler ES: Entering the second century of maize quantitative genetics. Heredity. 2013, 1-9. 00.
Narkhede BN, Deokar AB: Inheritance of corolla colour in safflower. Journal of Maharashtra Agricultural Universities. 1986, 11: 278-281.
Rao M: Inheritance of characters in safflower–Carthamus tinctorius L. Madras Agricultural Journal. 1943, 31: 141-148.
Pahlavani MH, Mirlohi AF, Saeidi G: Inheritance of flower color and spininess in safflower (Carthamus tinctorius L.). J Hered. 2004, 95: 265-267. 10.1093/jhered/esh030.
Gottlieb LB: Genetics and morphological evolution in plants. Am Nat. 1984, 123: 681-709. 10.1086/284231.
Mayerhofer R, Archibald C, Bowles V, Good AG: Development of molecular markers and linkage maps for the Carthamus species C. tinctorius and C. oxyacanthus. Genome. 2010, 53: 266-276. 10.1139/G10-002.
Hamdan YA, García-Moreno MJ, Fernández-Martínez JM, Velasco L, Pérez-Vich B: Mapping of major and modifying genes for high oleic acid content in safflower. Mol Breed. 2012, 30: 1279-1293. 10.1007/s11032-012-9714-y.
Gepts P: Crop domestication as a long-term selection experiment. Plant Breed Rev. 2004, 24 (2): 1-44.
Poncet V, Lamy F, Devos KM, Gale MD, Sarr A, Robert T: Genetic control of domestication traits in pearl millet (Pennisetum glaucum L., Poaceae). Theor Appl Genet. 2000, 100: 147-159. 10.1007/s001220050020.
Ben Chaim A, Paran I, Grube RC, Jahn M, Van Wijk R, Peleman J: QTL mapping of fruit-related traits in pepper (Capsicum annuum). Theor Appl Genet. 2001, 102: 1016-1028. 10.1007/s001220000461.
Poncet V, Sarr TRA, Gepts P: Quantitative Trait Locus Analyses of theDomestication Syndrome and Domestication Process. In Encyclopedia ofPlant and Crop Science. New York: Marcel Dekker, Inc; 2004:1069–1074.
Pernès J: La genetique de la domestication des cereales. La Recherche. 1983, 14: 910-919.
Le Thierry D’Ennequin M, Toupance B, Robert T, Godelle B, Gouyon PH: Plant domestication: a model for studying the selection of linkage. J Evol Biol. 1999, 12: 1138-1147. 10.1046/j.1420-9101.1999.00115.x.
Claassen CE: Natural and controlled crossing in safflower, Carthamus tinctorius L. Agron J. 1950, 42: 381-384. 10.2134/agronj1950.00021962004200080002x.
Ibarra-Perez FJ, Ehdaie B, Waines JG: Estimation of outcrossing rate in common bean. Crop Sci. 1997, 37: 60-65. 10.2135/cropsci1997.0011183X003700010009x.
Verdugo-Hernández S, Reyes-Luna R, Oyama K: Genetic structure and differentiation of wild and domesticated populations of Capsicum annuum (Solanaceae) from Mexico. Plant Syst Evol. 2001, 226: 129-142. 10.1007/s006060170061.
DeVicente MC, Tanksley SD: QTL analysis of transgressive segregation in an interspecific tomato cross. Genetics. 1993, 134: 585-596.
Rieseberg LH, Archer MA, Wayne RK: Transgressive segregation, adaptation and speciation. Heredity. 1999, 83: 363-372. 10.1038/sj.hdy.6886170.
Rieseberg LH, Widmer A, Arntz AM, Burke JM: The genetic architecture necessary for transgressive segregation is common in both natural and domesticated populations. Philos Trans R Soc Lond B. 2003, 358: 1141-1147. 10.1098/rstb.2003.1283.
Yokoyama S, Radlwimmer FB: The molecular genetics and evolution of red and green color vision in vertebrates. Genetics. 2001, 158: 1697-1710.
Christin P-A, Weinreich DM, Besnard G: Causes and evolutionary significance of genetic convergence. Trends Genet. 2010, 26: 400-405. 10.1016/j.tig.2010.06.005.
Li Y, Liu Z, Shi P, Zhang J: The hearing gene Prestin unites echolocating bats and whales. Curr Biol. 2010, 20: R55-R56. 10.1016/j.cub.2009.11.042.
Liu Y, Cotton JA, Shen B, Han X, Rossiter SJ, Zhang S: Convergent sequence evolution between echolocating bats and dolphins. Curr Biol. 2010, 20: R53-R54. 10.1016/j.cub.2009.11.058.
Bernasconi P, Woodworth AR, Rosen BA, Subramanian M, Siehl DL: A naturally occurring point mutation confers broad range tolerance to herbicides that target acetolactate synthase. J Biol Chem. 1995, 270: 17381-17385. 10.1074/jbc.270.29.17381.
Wood TE, Burke JM, Rieseberg LH: Parallel genotypic adaptation: when evolution repeats itself. Genetica. 2005, 123: 157-170. 10.1007/s10709-003-2738-9.
Hirano HY, Eiguchi M, Sano Y: A single base change altered the regulation of the Waxy gene at the posttranscriptional level during the domestication of rice. Mol Biol Evol. 1998, 15: 157-170.
Olsen KM, Purugganan MD: Molecular evidence on the origin and evolution of glutinous rice. Genetics. 2002, 162: 941-950.
Olsen KM, Caicedo AL, Polato N, McClung A, McCouch S, Purugganan MD: Selection under domestication: evidence for a sweep in the rice waxy genomic region. Genetics. 2006, 173: 975-983. 10.1534/genetics.106.056473.
Fan L, Quan L, Leng X, Guo X, Hu W, Ruan S, Ma H, Zeng M: Molecular evidence for post-domestication selection in the Waxy gene of Chinese waxy maize. Mol Breed. 2008, 22: 329-338. 10.1007/s11032-008-9178-2.
Fukunaga K, Kawase M, Kato K: Structural variation in the Waxy gene and differentiation in foxtail millet [Setaria italica (L.) P. Beauv.]: implications for multiple origins of the waxy phenotype. Mol Genet Genomics. 2002, 268: 214-222. 10.1007/s00438-002-0728-8.
This work was supported by funding for the Compositae Genome Project from the NSF Plant Genome Research Program (DBI-0820451 to JMB et al.) and the NSF Doctoral Dissertation Improvement Grant (DEB-1110350 to JMB and SAP). We thank Zhao Lai for assistance with preparation of the 454 transcriptome libraries; Yi Zou, Evan Staton, Jeong-Hyeon Choi, Raj Ayaampalayam, Nolan Kane, Sebastian Reyes Chin-Wo, and the GACRC for bioinformatics assistance; Roger Nilsen and Myriam Belanger at the Georgia Genomics Facility for assistance with the SNP genotyping; Melanie Smith, Mike Boyd, and Jeff Dadisman for their help in the greenhouse; Jennifer Wood, David Sanders, and Hussein Abdel-Haleem for their help with phenotyping; and Lori Schmidt for her advice on how to grow safflowers in the greenhouse. Finally, we are grateful to members of the Burke Lab for their advice on an earlier version of this manuscript.
The authors declare that they have no competing interests.
JMB and SAP conceived the study. SAP produced the mapping population, generated the phenotypic data, designed the genotyping assay, and generated the genotypic data. SR-C-W and RWM were involved in the sequencing and assembly of the lettuce genome. SAP, JMB, and JEB analyzed the safflower data and performed the comparative genomic analyses. SAP and JMB drafted the manuscript with input from the other authors. All authors read and approved the final manuscript.