Linkage mapping and quantitative trait loci analysis of sweetness and other fruit quality traits in papaya
BMC Plant Biology volume 19, Article number: 449 (2019)
The identification and characterisation of quantitative trait loci (QTL) is an important step towards identifying functional sequences underpinning important crop traits and for developing accurate markers for selective breeding strategies. In this study, a genotyping-by-sequencing (GBS) approach detected QTL conditioning desirable fruit quality traits in papaya.
For this, a linkage map was constructed comprising 219 single nucleotide polymorphism (SNP) loci across 10 linkage groups and covering 509 centiMorgans (cM). In total, 21 QTL were identified for seven key fruit quality traits: flesh sweetness, fruit weight, fruit length, fruit width, skin freckle, flesh thickness and fruit firmness. Several QTL for flesh sweetness, fruit weight, length, width and firmness were stable across harvest years and individually explained up to 19.8% of the phenotypic variance of a particular trait. Where possible, candidate genes were proposed and explored further for their application to marker-assisted breeding.
This study has extended knowledge on the inheritance and genetic control for key papaya physiological and fruit quality traits. Candidate genes together with associated SNP markers represent a valuable resource for the future of strategic selective breeding of elite Australian papaya cultivars.
Papaya (Carica papaya L.) is one of the top five produced tropical fruit crops, listed as a super fruit in the fight against vitamin deficiency [24, 51]. Global annual production of papaya is approximately 11.22 million metric tons (Mt), increasing 4.35% per year. In Australia, papaya is an important domestic fresh fruit crop, although just 6.5 thousand tons are grown annually. The industry is currently relatively small but has large potential to expand to meet the growing global market demand.
Novel and advanced breeding tools will enable faster and more accurate selection of key consumer-driven traits. As such, marker-assisted selection (MAS) has been introduced in papaya breeding programs elsewhere to efficiently develop superior varieties with desired traits [6, 64, 66]. However, progress has been limited by a dearth of genomic information and few identified quantitative trait loci (QTL) associated with markers/sequences for trait selection.
Success in robust QTL identification is dependent on molecular marker map density, which directly affects map resolution, and on accurate placement of qualitative data. Previous maps have varied in coverage and resolution. The ‘Sunrise Solo’ x Line UH356 map comprised 61 random amplified polymorphic DNA (RAPD) markers distributed in 11 linkage groups (LG) over 999 cM. The subsequent ‘Kapoho’ x ‘Sunup’ map of Ma et al. comprised 1498 amplified fragment length polymorphism (AFLP) loci in 12 LG over 3294 cM. Later, the ‘AU9’ x ‘Sunup’ map of Chen et al. comprised 706 simple sequence repeat (SSR) markers in 12 LG over 1070 cM, within which elongated fruit shape was associated with a QTL in LG 1. Blas et al. then exploited the same mapping population and constructed a map comprising 712 SSR and 277 AFLP markers in 14 LG over 945 cM. Meanwhile, the whole genome sequence of papaya ‘Sunup’ was released by Ming et al. (http://www.plantgdb.org/CpGDB), making the integration of physical and high-density genetic maps possible. Because of the narrow genetic base of papaya within the cultigen, a preliminary investigation of these SSR markers on our selected parental lines showed only 16.67% polymorphism and was predicted to cover only 120 loci (unpublished data). Therefore, single nucleotide polymorphism (SNP) based mapping was adopted to speed up the development of linkage maps and the identification of key genomic locations underlying complex traits, including flesh sweetness and other fruit quality traits in papaya.
Once aligned within the linkage map, the identification of putative candidate genes that underlie the major QTL and potentially contribute towards trait expression may be possible. Functionally validated markers may then represent sequences useful in selective breeding strategies. Previously in papaya, QTL for plant height, stem diameter and number of nodes at first flowering were mapped using RAPD markers in a population of 253 F2 plants (‘Sunrise Solo’ x Line UH356). Two to four QTL were identified for each trait, which explained 42, 37 and 30% of the total phenotypic variance observed in plant height, stem diameter and number of nodes at first flowering, respectively. Blas et al. subsequently identified 14 QTL controlling fruit weight, length, width and shape with phenotypic effects ranging from 5 to 23%. These were mapped on LG 2, 3, 7 and 9 using a population of 219 F2 ‘Khaek Dum’ x Line 2H94 plants.
The identification of reliable markers for selective breeding purposes that are associated with major QTL conditioning a trait of interest is reliant on the genetic stability of the markers with which the QTL has been associated. Indeed, through mutation and/or selective evolution, the sequences residing in close proximity to major QTL may vary among genetic backgrounds. Also, recombination events among different populations, even produced from the same parents, may not be conserved and hence marker transferability is not assured among genotypes or populations [33, 65]. Therefore, individual high-density genetic linkage maps are required for the identification of the genetic loci conditioning key fruit quality traits of a particular genotype.
High-density maps can be generated via a genotyping-by-sequencing (GBS) approach for rapid and cost-effective high-throughput SNP marker discovery. This approach has been applied for uncovering fruit quality trait QTL in zucchini and tomato. Both studies found GBS to be a highly efficient technology for QTL analysis and candidate gene mining. The genetic map of zucchini was constructed using 120 F8 progeny from an inter-subspecific cross between zucchini and scallop (ssp. pepo x ssp. ovifera). In total, 48 consistent QTL for vine, flowering and fruit quality traits were detected based on analyses across three environments. These QTL were distributed across 33 independent positions in 15 LG and each QTL explained from 1.5 to 62.9% of the phenotypic variance. Eight stable QTL related to leaf incision, fruit shape and length, and rind and flesh colour of zucchini were reported along with their underlying candidate genes. In tomato, Celik et al. utilised a genetic map of 93 individuals from a backcross of Solanum lycopersicum ‘Tueza’ and Solanum pimpinellifolium (LA1589) for QTL mapping and selection of favourable alleles for 11 desired fruit quality traits. A total of 37 QTL affecting fruit quality of tomato were detected, explaining from 3 to 47% of the phenotypic variation. Among these, three were detected for fruit weight, nine for flesh colour, two for skin colour and four for each of fruit firmness, fruit shape and sugar content.
The advantages of GBS technology hold great promise for simplifying the construction of high-density maps and identifying QTL linked to fruit quality traits in papaya, which has a narrow genetic base and a low rate of sequence diversity [34, 55, 67, 72]. With the increase in information available in the sequence databases, GBS and candidate gene approaches can be combined to speed up the development of new markers for marker-assisted breeding programs.
This study focused on linkage mapping and QTL analysis for fruit quality traits in a papaya F2 population developed from the cross ‘RB2’ x ‘Sunrise Solo’. The aims were 1) to identify the locations of the major genetic components conditioning sweetness, fruit weight, fruit length, fruit width, skin freckle, flesh thickness and fruit firmness and 2) to identify and characterise the putative sweetness candidate genes to determine their potential for use in future marker-assisted selection strategies.
Sequence data and SNP discovery
A total of 57.78 Gb of sequence data, comprising 577.7 million reads, was generated from the parents and 226 F2 samples. Following mapping to the ‘Sunup’ reference genome of Ming et al., 44,030 SNPs were identified. After filtration to remove SNPs with more than 80% missing data and/or low read depth, 1701 high quality SNPs remained (3.86%). Subsequently, duplicated and monomorphic SNPs were excluded, leaving 1302 SNPs (2.95%) for map construction at a density of 1 SNP per 285.7 kb.
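The filtering funnel above can be checked with a few lines of arithmetic. The sketch below only reuses the counts reported in the text; the genome span it derives from the stated marker density is an inference rather than a figure given by the authors, and the second percentage may differ in the last digit from the reported value depending on rounding.

```python
# Counts and marker density as reported in the text.
raw_snps = 44_030
after_quality_filter = 1_701
after_dedup = 1_302
kb_per_snp = 285.7


def percent_of(part: int, whole: int) -> float:
    """Return `part` as a percentage of `whole`."""
    return 100.0 * part / whole


pct_quality = percent_of(after_quality_filter, raw_snps)
pct_mapped = percent_of(after_dedup, raw_snps)

# Genome span implied by the reported density (an inference, not a
# value stated in the article).
implied_span_mb = after_dedup * kb_per_snp / 1000.0

print(f"retained after quality filter: {pct_quality:.2f}%")
print(f"retained for map construction: {pct_mapped:.2f}%")
print(f"implied genome span: {implied_span_mb:.0f} Mb")
```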
Linkage map construction
Of the resultant sub-set of 1302 high quality SNPs, a total of 1153 were used to create the initial map of ‘RB2’ x ‘Sunrise Solo’ (Additional file 4: Table S3, Additional file 6: Figure S2). This comprised 23 LG, 15 major and 6 minor, spanning 3096.93 cM with an average marker interval of 2.7 cM. However, 882 (76.4%) of the markers were distorted in their expected segregation ratio (1:2:1) within the F2 population. Among these, 187 (21.3%) were skewed towards the female parent (‘RB2’) and 98 (11.2%) were skewed towards the male parent (‘Sunrise Solo’). The remaining 597 distorted markers were skewed towards a heterozygous genotype (Additional file 5: Table S4).
Of the 1153 initially mapped SNP markers, only 271 segregated as expected (p-value ≥ 0.05) and, following revision of the linkage analyses, 52 of these remained unlinked. Therefore, the final map consisted of 219 SNP loci within 10 LG (I to X; Table 1 and Fig. 1). Each LG comprised from 3 to 75 SNPs and ranged from 2.2 cM to 134.6 cM in length, with average gaps between SNPs of 3.5 to 27.6 cM. The final map spanned 509.7 cM, approximately six times smaller than the initial map.
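To illustrate how a marker can be tested against the expected 1:2:1 F2 ratio, the following is a minimal sketch of a chi-square goodness-of-fit test. It is not the authors' pipeline: the function names and genotype counts are hypothetical, and a plain 0.05 cutoff without multiple-testing correction is assumed. With three genotype classes the test has two degrees of freedom, for which the chi-square tail probability has the closed form exp(-x/2), so no external library is needed.

```python
import math


def segregation_chi_square(n_aa: int, n_ab: int, n_bb: int) -> tuple[float, float]:
    """Chi-square goodness-of-fit of F2 genotype counts against 1:2:1.

    Returns (statistic, p_value). The p-value uses the closed-form
    survival function of the chi-square distribution with 2 degrees
    of freedom, exp(-x / 2).
    """
    total = n_aa + n_ab + n_bb
    if total == 0:
        raise ValueError("no genotype calls for this marker")
    expected = (0.25 * total, 0.50 * total, 0.25 * total)
    observed = (n_aa, n_ab, n_bb)
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    return stat, math.exp(-stat / 2.0)


def is_distorted(n_aa: int, n_ab: int, n_bb: int, alpha: float = 0.05) -> bool:
    """Flag a marker whose counts deviate significantly from 1:2:1."""
    _, p_value = segregation_chi_square(n_aa, n_ab, n_bb)
    return p_value < alpha


# Hypothetical genotype counts for 226 F2 plants (not data from the study).
balanced = (56, 114, 56)   # close to 1:2:1
skewed = (40, 150, 36)     # excess of heterozygotes

for name, counts in (("balanced", balanced), ("skewed", skewed)):
    stat, p = segregation_chi_square(*counts)
    print(f"{name}: chi2={stat:.2f}, p={p:.3g}, distorted={is_distorted(*counts)}")
```

Under these assumptions, the first marker would be retained for the final map and the second would be excluded as distorted.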
Composite interval mapping with a sliding window size of 10 cM detected QTL for sweetness and the other fruit quality traits within the two harvest years (2016 and 2017). In total, 21 QTL were distributed across nine LG (all except LG VIII) (Fig. 1). The proportion of phenotypic variance explained by a single QTL ranged from 3.1 to 19.8% (Table 2). The highest percentage of explained phenotypic variance by a single QTL was observed for fruit length (19.8%), followed by fruit width (19.5%) and fruit firmness (15.5%, LG I; Year 2017), while the lowest was detected for fruit firmness (3.1%, LG IX; Year 2016). In general, QTL for individual traits were observed at a similar map location in both 2016 and 2017. The number of QTL detected for each trait varied from 2 to 5 loci. The largest number of QTL was observed for fruit firmness (5 loci). In contrast, the lowest number of QTL was observed for flesh thickness (2 loci), followed by flesh sweetness (3 loci). The relationship among fruit quality traits was evidenced by co-location of QTL on LG I, III, IV, VI, IX and X. For example, QTL for flesh sweetness were clustered together with QTL for fruit firmness and fruit length on LG III. Also, QTL for skin freckle were clustered with QTL for fruit firmness on LG IX. QTL for fruit size characteristics (fruit weight, length and width) and fruit firmness clustered on several LG including I, IV, VI, IX and X.
Candidate genes for flesh sweetness and other fruit quality traits
The regions within major QTL intervals were annotated according to the ‘Sunup’ reference genome. Three candidate genes, responsible for regulation of developmental growth (non-canonical poly(A) RNA polymerase and KIN17-like protein; accession numbers XP_021903675 and XP_021907879) and protein transmembrane transporter activity (accession number XP_021887112), were detected within the flesh sweetness QTL peaks (Additional file 7: Table S5). The regions of fruit weight, length and width QTL contained candidate genes involved in cell wall organisation (protein trichome birefringence-like 12 and fatty acid amide hydrolase-like), protein metabolic process (glutamate receptor 3, IST1-like protein, prolyl 4-hydroxylase 9 and bifunctional nuclease 2) and carbohydrate metabolic process (exopolygalacturonase and NAC domain-containing protein 41). The previously identified Carica papaya chromosome Y sequence on LG 1 [15, 41] was also found near the fruit length QTL. Two candidate genes (Ultraviolet-B receptor and putative disease resistance protein RGA1) were observed within the skin freckle QTL. Fruit firmness QTL regions contained one candidate gene involved in the pectin catabolic process (pectin acetyl esterase 12-like) and three candidate genes related to transcription factor activity (UPF0553 protein-like, DNA-directed RNA polymerase III subunit 1 and MYB-like protein X). Candidate genes responsible for lignin biosynthetic processes and ethylene-activated signalling pathways were identified within the QTL regions for flesh thickness.
For the first time, genotyping-by-sequencing (GBS) was successfully used to develop a SNP linkage map and identify key genomic locations underlying flesh sweetness and other fruit quality traits in papaya. Also, in conjunction with the existing reference genome, several QTL-linked SNP loci were associated with putative candidate genes.
The frequency and number of SNPs obtained by GBS in the ‘RB2’ x ‘Sunrise Solo’ population were comparable to those reported in sweet cherry, zucchini and tomato using the same approach. However, the majority of identified SNPs (96%) were excluded from map construction, resulting in far fewer SNPs in the final linkage map than in those studies. After stringent filtering of all loci for minimum read depth, missing data and identifiable parental alleles, the number of SNP loci fell below that typically reported in other species. In zucchini, Montero-Pau et al. validated 25% (16,222 markers) of the SNPs derived from GBS. In tomato, approximately 13% of the SNPs discovered by GBS (3125 markers) were of high quality. The variation in the percentage of validated SNPs between the current study and other studies could be attributed to a number of factors, including selection of restriction enzymes, sequencing depth, sample library preparation, genetic background of plant materials and conditions of data analysis [16, 43, 71]. Strategies such as adjusting the level of multiplexing, changing the choice of restriction enzyme(s) and increasing sequencing depth could be investigated to increase the capture rate of SNPs in the population [4, 71]. Among these factors, the conditions of GBS data analysis were reported to have a major impact on the amount and quality of the resulting genotypic information. The number of called SNPs, the amount of missing data and genotypic accuracy varied vastly with the choice of analytical method and the reference genome used for SNP mapping [4, 71]. Under the conditions used in this study, the detection of a polymorphism was reliant on the existing ‘Sunup’ reference genome, which was incomplete in terms of assembly contiguity, number of gap sequences and genome coverage (~75%).
It is entirely possible that the quality of the reference genome affected SNP calling through an inability to align raw sequencing output with the existing reference assembly, resulting in the relatively low number of validated SNPs for mapping. In future, high coverage genome sequences of both parents (‘RB2’ and ‘Sunrise Solo’; GenBank SRA accession: PRJNA507836) should be used as reference genomes for SNP discovery and the mapping of their recombinants [29, 30, 39]. Alternatively, if a high quality reference genome is not available, a de novo SNP discovery approach could be considered (described in Catchen et al. [9, 54, 60]).
Linkage map construction
An extremely high percentage of marker segregation distortion was detected (76.4%, P < 0.05), consistent with previous studies such as Blas et al., who reported 79% marker segregation distortion in a ‘Khaek Dum’ x ‘2H94’ cross population. Similarly, 66% segregation distortion was observed among markers in an ‘AU9’ x ‘Sunup’ cross population. The underlying reasons for segregation distortion include genetic interaction among loci, the predominance of parental or recombinant genotypes in the population, environmental factors and experimental errors [2, 75, 76]. The high number of distorted loci in this study is likely attributable to dominance of one parental genotype, with twice as many maternal (‘RB2’) as paternal (‘Sunrise Solo’) alleles identified, as well as to missing genotypic data.
Although the final map was not as dense as the linkage map of Blas et al., the marker placement and alignment were robust, with adequate resolution for QTL mapping. The quality and applicability of a linkage map with similar density was demonstrated previously by Bielenberg et al., who used 33 SSR and 201 SNP markers identified from a GBS pipeline to construct a genetic map with an average marker interval of 2.85 cM to detect QTL for chilling requirement and bloom date in peach.
Chromosome-specific cytogenetic markers have been developed and merged with linkage groups of papaya using fluorescence in situ hybridisation (FISH) with BAC clones harbouring mapped SSR markers as probes. Nine linkage groups were proposed, corresponding to the haploid chromosome number of papaya. However, we were unable to integrate these maps with ours because they share no anchor markers, as different parents were used to construct the mapping populations.
QTL and candidate genes for individual fruit quality traits
QTL mapping is useful for dissecting the genetic components of complex traits. The QTL analysis in the F2 population of ‘RB2’ x ‘Sunrise Solo’ detected 21 QTL affecting fruit quality in papaya. Most of the traits were associated with two to five QTL, indicating their polygenic nature [26, 45, 77]. Ten of the 21 QTL detected in this study had > 10% effect on the phenotypic variance and were characterised as major QTL. Several of these were stable over two harvest years, indicating their potential for investigation in future trait selection.
Co-location of QTL for different fruit quality traits was indicated in several genome regions as similarly reported in other species [13, 80]. QTL identified in the same location may contain shared and/or distinct genes with potential pleiotropic effects. Multiple QTL with large effects were shown responsible for fruit sweetness in other species including in peach and apple. These were located close to QTL associated with fruit weight and size but with opposite allelic effects, again suggesting pleiotropic activity [22, 26, 32]. Further studies with near-isogenic lines are required to tease apart the QTL in the current study and to identify possible individual candidate genes for further functional validation of association with each of the specific traits.
In the present study, the exploration of genetic variation and transferability of key fruit quality traits within the parental and progeny population of ‘RB2’ x ‘Sunrise Solo’ genotypes indicated high heritability (> 60%) for flesh sweetness, fruit width and fruit firmness (Additional file 2: Table S2). This confirmed the high heritability previously described for flesh sweetness, flesh colour, flesh firmness, fruit firmness and fruit size in papaya [53, 63] and other fruit crops [7, 58]. The remaining traits showed low to moderate heritability (30–60%), with the lowest heritability found for fruit weight (32%). The likelihood of success in QTL identification and mapping depends on the heritability of the trait, its genetic nature (dominant, recessive or additive) and the number of genes involved. Theoretically, QTL for high heritability traits should be easier to detect and likely to explain more of the phenotypic variation, as they should be less influenced by environmental factors. This assumption appeared to hold for flesh sweetness, fruit width and fruit firmness. The QTL analysis clearly identified their major governing genetic loci across two harvest seasons, with relatively large effects (11.6 to 19.5% of the phenotypic variance). Meanwhile, QTL with large effects were also identified for fruit weight and fruit length, traits with low to moderate heritability. It is possible that these traits are closely correlated with the high heritability traits fruit width and fruit firmness, so that the clustering of QTL among these fruit morphology traits may produce large effect size estimates through co-location of the detected QTL. In contrast, most of the QTL identified for skin freckle and flesh thickness were minor QTL.
These occurrences are commonly observed for QTL of fruit quality in other species, reflecting their polygenic nature and the high influence of environmental conditions [5, 12, 26, 32].
Flesh sweetness is quantitatively inherited, with many studies revealing multiple responsible QTL, including in Rosaceae species such as peach, apple and strawberry [22, 26, 38]. In those species, QTL for flesh sweetness were detected across multiple genome locations with a range of effects (up to 84%). Several QTL were associated with the sucrose synthase gene (SUSY1) family and a gene encoding vacuolar H+-pyrophosphatase, which catalyses solute accumulation [22, 28]. The current study is the first for papaya and proposes that flesh sweetness is under polygenic control in the ‘RB2’ x ‘Sunrise Solo’ cross. At least two genomic regions were identified and associated with genes responsible for growth development and protein transmembrane transporter activity. As expected, alleles of ‘Sunrise Solo’ (the sweeter parent) contributed to an increase of sweetness in the progeny. The major sweetness QTL on LG VII, which contained growth development and protein transmembrane transporter activity genes directly linked with the SNP loci sCT_80_454708 and sCT_12_1083429, requires further exploration. These loci should be assessed for stability and functional association, potentially through targeted amplification across a wider range of genotypes and reverse genetics approaches [50, 70].
The genetic governance of fruit weight, length and width has been widely studied in many fruit crops including tomato, pepper and melon. Accordingly, members of the ovate, sun and fw2.2 gene families were detected within the related QTL [40, 81]. In papaya, QTL for fruit weight and size were previously identified in F2 populations of ‘Sunrise Solo’ x Line UH356 and ‘Khaek Dum’ x ‘2H94’ but, as in the current study, were not associated with any ovate, sun or fw2.2 genes. Rather, the fruit weight, length and width QTL on LG I in this study were in close proximity to a papaya male-specific region previously associated with elongated fruit. The four SNP markers sCT_6_2754743, sCT_6_2392635, sCT_50_1447788 and sCT_6_2331252, which were mapped within 1 cM of the major QTL for these traits, should be explored further for functional association.
Skin freckle is one of the major issues affecting fruit quality of papaya and its genetic basis has not been well understood. Eloisa et al. reported that skin freckle of papaya fruit was highly influenced by weather conditions, fruit growth and fruit sugar content. In the present study, QTL analysis did not detect any relationship between skin freckle and flesh sweetness QTL; however, co-localisation of QTL for skin freckle, fruit firmness, fruit width and length was observed. Indeed, skin freckle was shown to be conditioned by several minor QTL on LG II, VI and IX (each accounting for 3.23 to 8.5% of the phenotypic variance). These accounted for relatively little of the trait variation, again likely due to the missing genome coverage and potential epistatic interactions that reduce detection of small effect QTL. Therefore, targeting the three loci identified in this study may be insufficient for improving skin quality of papaya.
The genetic basis of variation in fruit firmness and flesh thickness has been studied most extensively in tomato, cucurbits and apple [13, 36, 68, 78]. Most QTL for fruit firmness and flesh thickness have been described in association with ethylene response factor and members of the expansin, pectin methylesterase and protein-lysine methyltransferase gene families [14, 78]. Similarly, genes involved in the pectin catabolic process and the ethylene-activated signalling pathway were found in this study within locations of stable QTL in the ‘RB2’ x ‘Sunrise Solo’ map, suggesting similar functions for these genes in papaya. Five markers (sCT_751_466, sCT_751_404, sCT_6_237757, sCT_48_1243956, sCT_6_1666511) associated with the QTL for fruit firmness and flesh thickness were mapped within a 3 cM window. These markers may be useful for future breeding selection.
In summary, this study demonstrated the use of GBS technology for efficient QTL detection in papaya (F2 population of ‘RB2’ x ‘Sunrise Solo’). The SNP-based genetic map, the QTL for flesh sweetness, fruit weight, width, length, skin freckle, firmness and flesh thickness detected in two successive years, and the associated SNPs provide target regions for candidate gene exploration and selective marker development.
Plant materials and phenotyping of fruit quality characters
Parental lines and 226 segregating F2 progeny of the ‘RB2’ x ‘Sunrise Solo’ cross were planted in Mareeba, Australia and evaluated for fruit quality traits across two harvests, in December 2016 and April 2017. The two parental lines used in the experiments are Australian commercial varieties. These were produced by Papaya Seed Australia, who provided permission for their use in this scientific research. The plant experiment was performed in the School of Environment and Science, Griffith University, according to a plant protocol approved by the Research Committee of Griffith University. At each harvest, three fruit from each individual plant were harvested and measured for quantitative phenotypic data of flesh sweetness, fruit weight, fruit length, fruit width, skin freckle, flesh thickness and fruit firmness in accordance with the methods outlined in the Papaya Handbook (Additional file 1: Table S1, Additional file 2: Table S2, Additional file 3: Figure S1).
Genotyping-by-sequencing (GBS) and SNP identification
A GBS approach was used to detect single nucleotide polymorphisms (SNP) between the parental genomes and among the F2 genomes. For this, gDNA was extracted using the modified CTAB protocol of Dellaporta et al. from individual leaf samples of one-year-old trees of parents and F2 progeny. Quality and quantity of gDNA were assessed with a NanoDrop 1000c (Thermo Fisher Scientific, Australia) and samples were diluted to 100 ng/μl. DNA samples were sent for GBS at the Australian Genome Research Facility, Melbourne, Australia, using a ddRAD-based library preparation protocol, as described in Peterson et al. The DNA was digested using a combination of restriction enzymes (PstI and MseI) and only tags with both RE sites (one at each end) were selected for library preparation and sequenced on an Illumina HiSeq2500 sequencing platform, producing 100 bp single-end reads. Parental DNA was sequenced three times and F2 individuals were sequenced once each to generate SNP catalogues (GenBank SRA accession: PRJNA544124). Raw GBS reads were de-multiplexed and sorted according to their barcoded sequences using Stacks software v1.46. The resultant filtered reads (high-quality sequences from each sample) were aligned to the papaya ‘Sunup’ reference genome using Bowtie2 version 2.3.2.
SNP identification was carried out using the gstacks command in Stacks2 v2.00beta5 to obtain only bi-allelic SNPs polymorphic between the parents. Subsequently, SNPs were filtered using SnpSift v4.3p with the following parameter settings: minimum read depth larger than five (DP > 5) and Phred genotype quality score of more than 20 (GQ > 20). In addition, the genomic positions of the SNPs were determined according to the ‘Sunup’ reference genome and used to assign the SNP ID. Further SNP filtration was performed using an in-house R script. Loci with > 80% missing data were discarded. The imputation of missing genotypes was performed using LinkImputeR v1.1.1 and resulted in 1701 high quality SNP loci for linkage map construction.
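The thresholds above can be expressed as a small filter. The sketch below is a plain-Python illustration, not the SnpSift command or the in-house R script used in the study; the record layout, function names and SNP identifiers are hypothetical, and it assumes that the DP and GQ thresholds are applied per genotype call, with failing calls treated as missing before the 80% missing-data rule is applied.

```python
from typing import Dict, List, Optional

MIN_DP = 5          # keep calls with DP > 5
MIN_GQ = 20         # keep calls with GQ > 20
MAX_MISSING = 0.80  # discard loci with > 80% missing calls

# A genotype call is a dict such as {"gt": "A/B", "dp": 12, "gq": 40},
# or None when the sample has no call at that locus (assumed layout).
Call = Optional[dict]


def mask_low_quality(calls: List[Call]) -> List[Call]:
    """Replace calls that fail the depth or quality threshold with None."""
    return [
        c if c is not None and c["dp"] > MIN_DP and c["gq"] > MIN_GQ else None
        for c in calls
    ]


def missing_fraction(calls: List[Call]) -> float:
    """Fraction of samples without a usable call."""
    return sum(c is None for c in calls) / len(calls)


def filter_loci(loci: Dict[str, List[Call]]) -> Dict[str, List[Call]]:
    """Mask low-quality calls, then drop loci exceeding the missing-data limit."""
    kept = {}
    for snp_id, calls in loci.items():
        masked = mask_low_quality(calls)
        if missing_fraction(masked) <= MAX_MISSING:
            kept[snp_id] = masked
    return kept


good = {"gt": "A/B", "dp": 12, "gq": 40}
shallow = {"gt": "A/A", "dp": 3, "gq": 40}   # fails DP > 5

# Hypothetical loci scored in five samples.
loci = {
    "snp_a": [good, good, None, None, None],           # 60% missing: kept
    "snp_b": [shallow, shallow, shallow, None, None],  # 100% missing after masking
    "snp_c": [good, None, None, None, shallow],        # exactly 80% missing: kept
}
kept = filter_loci(loci)
print(sorted(kept))
```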
Linkage map construction
An initial linkage map was constructed after removal of duplicated and monomorphic markers using the Onemap R package, with a logarithm of the odds (LOD) threshold of 5.0 and a maximum recombination fraction (max.rf) threshold of 0.25. Subsequently, linkage groups (LG) containing fewer than four loci and any unlinked markers were excluded. The Rapid Chain Delineation (RCD) algorithm was used to order markers within each LG. Then, 10 equally spaced markers in a LG were selected to create a framework of ordered markers using the “make_seq” and “compare” functions. The remaining markers were added to the framework with the “order_seq” function, using a minimum LOD of 3.0 for positioning a marker. The combination of markers was then inspected (within a window size of four markers) using the “ripple” function to obtain the final marker order. Map distance in centiMorgans (cM) was estimated by the Kosambi mapping function.
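For readers unfamiliar with the Kosambi mapping function, the sketch below shows the standard conversion between recombination fraction and map distance. It is a stand-alone illustration with hypothetical function names, not a call into Onemap.

```python
import math


def kosambi_cm(r: float) -> float:
    """Kosambi map distance in cM for a recombination fraction r (0 <= r < 0.5).

    d = 25 * ln((1 + 2r) / (1 - 2r)), i.e. 1/4 * ln(...) Morgans times 100.
    """
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must be in [0, 0.5)")
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


def kosambi_rf(d_cm: float) -> float:
    """Inverse of kosambi_cm: recombination fraction for a distance in cM."""
    return 0.5 * math.tanh(2.0 * d_cm / 100.0)


for r in (0.01, 0.10, 0.25):
    print(f"r = {r:.2f} -> {kosambi_cm(r):.2f} cM")
```

Under this function, small recombination fractions map almost one-to-one onto cM, while the max.rf threshold of 0.25 used for grouping corresponds to roughly 27.5 cM.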
The final linkage map was created after removal of markers with significant deviation from the expected segregation ratio using the “select_segreg” function and the remaining markers were again clustered into LG and ordered as described above. Initial and final maps were visualised using MapChart. The R/qtl package was used to generate input files for QTL analyses.
QTL analyses were performed using WinQTLCart software version 2.5 [75, 76]. First, single marker analysis was performed using the nonparametric Kruskal-Wallis test to individually associate markers and traits. Then, interval mapping analyses were undertaken to locate QTL position on the map. Composite Interval Mapping (CIM) was selected as the mapping method for sensitivity and to enable multiple potential QTL detection for each trait. The standard CIM Model was used (model number 6 with a value of 5 for control markers and a forward regression). The LOD threshold was determined by a 1000 permutation test with a significance level (p) set at 0.05. Two sets of fruit quality trait data (harvest years 2016 and 2017) were analysed separately for all tested traits to assess QTL stability and detect additional seasonal QTL. QTL that had a LOD > 3 and a phenotypic variance contribution > 10% were classified as major QTL. In addition, a QTL that appeared in both harvests was classified ‘stable’. Additive effects were estimated where a positive value indicated that alleles contributed from ‘RB2’ increased the trait score and a negative value indicated that alleles contributed from ‘Sunrise Solo’ increased the trait score.
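The classification rules described above can be sketched as follows. This is an illustration only: the record type, function names and example values are hypothetical, and matching a QTL across harvests by trait and linkage group is an assumption, since the text does not state how positions were compared between years.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class QtlHit:
    trait: str
    linkage_group: str
    year: int
    lod: float
    pve: float       # percent of phenotypic variance explained
    additive: float  # additive effect estimate


def is_major(hit: QtlHit) -> bool:
    """Major QTL: LOD > 3 and more than 10% of phenotypic variance explained."""
    return hit.lod > 3.0 and hit.pve > 10.0


def is_stable(hit: QtlHit, hits: List[QtlHit]) -> bool:
    """Stable QTL: same trait and linkage group detected in another harvest
    year (matching on linkage group is an assumption of this sketch)."""
    return any(
        other.trait == hit.trait
        and other.linkage_group == hit.linkage_group
        and other.year != hit.year
        for other in hits
    )


def increasing_parent(hit: QtlHit) -> str:
    """Parent whose alleles increase the trait score, from the additive sign."""
    if hit.additive > 0:
        return "RB2"
    if hit.additive < 0:
        return "Sunrise Solo"
    return "neither"


# Hypothetical records, not values taken from Table 2.
hits = [
    QtlHit("fruit length", "I", 2016, 6.2, 18.0, 1.4),
    QtlHit("fruit length", "I", 2017, 7.0, 19.8, 1.6),
    QtlHit("skin freckle", "II", 2016, 3.4, 5.0, -0.3),
]

for h in hits:
    print(h.trait, h.year, is_major(h), is_stable(h, hits), increasing_parent(h))
```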
Identification of linked markers and putative candidate genes
Significant association of SNP markers with QTL peak regions was determined by the Kruskal-Wallis test with 95% confidence (p ≤ 0.05). Subsequently, the gene annotation database from the ‘Sunup’ reference genome (http://www.plantgdb.org/XGDB/phplib/download.php?GDB=Cp) together with the database of the National Center for Biotechnology Information (NCBI; https://blast.ncbi.nlm.nih.gov/Blast.cgi) and Phytozome (https://phytozome.jgi.doe.gov/pz/portal.html) were utilised to search for location information of the identified markers and candidate genes within the major QTL peak regions. Flanking sequences on both sides of the significant SNP positions were used as queries in BLAST searches against the DNA database and the Carica papaya genome sequence, ASGPBv0.4, with an E-value ≤ 1e−15, identity ≥ 70% and coverage ≥ 50%. Gene Ontology (GO) terms associated with each BLAST hit were annotated using the GO Consortium BLAST server (http://www.geneontology.org).
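The BLAST acceptance criteria above amount to a three-way threshold on each hit. The sketch below shows one way to apply them to parsed hits; the record fields, function names and subject identifiers are hypothetical, and coverage is assumed to mean alignment length as a percentage of query length, which the text does not specify.

```python
from dataclasses import dataclass
from typing import Iterable, List

MAX_EVALUE = 1e-15
MIN_IDENTITY = 70.0   # percent identity
MIN_COVERAGE = 50.0   # percent of the query covered by the alignment


@dataclass(frozen=True)
class BlastHit:
    query_id: str
    subject_id: str
    evalue: float
    percent_identity: float
    alignment_length: int
    query_length: int


def query_coverage(hit: BlastHit) -> float:
    """Alignment length as a percentage of query length (assumed definition)."""
    return 100.0 * hit.alignment_length / hit.query_length


def passes(hit: BlastHit) -> bool:
    """True when the hit meets all three thresholds."""
    return (
        hit.evalue <= MAX_EVALUE
        and hit.percent_identity >= MIN_IDENTITY
        and query_coverage(hit) >= MIN_COVERAGE
    )


def filter_hits(hits: Iterable[BlastHit]) -> List[BlastHit]:
    return [h for h in hits if passes(h)]


# Query labels reuse marker names from the text; the hits themselves are invented.
example_hits = [
    BlastHit("sCT_80_454708", "hypothetical_gene_1", 1e-40, 95.0, 180, 200),
    BlastHit("sCT_80_454708", "hypothetical_gene_2", 1e-5, 99.0, 200, 200),    # E-value too high
    BlastHit("sCT_12_1083429", "hypothetical_gene_3", 1e-20, 65.0, 150, 200),  # identity too low
    BlastHit("sCT_12_1083429", "hypothetical_gene_4", 1e-30, 88.0, 60, 200),   # coverage too low
]
kept_hits = filter_hits(example_hits)
print([h.subject_id for h in kept_hits])
```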
Availability of data and materials
The sequence data generated during this study have been deposited in the GenBank repository with accession codes PRJNA507836 (https://www.ncbi.nlm.nih.gov/sra/PRJNA507836) and PRJNA544124 (https://www.ncbi.nlm.nih.gov/sra/PRJNA544124). The other data that support the findings of this study are available within the article and its supplementary information files.
AFLP: Amplified fragment length polymorphism
CIM: Composite Interval Mapping
DP: Minimum read depth
GQ: Phred genotype quality score
LOD: Logarithm of the odds
QTL: Quantitative trait loci
RAPD: Random amplified polymorphic DNA
RCD: Rapid Chain Delineation
SNP: Single nucleotide polymorphism
SSR: Simple sequence repeat
Abiola O, Angel JM, Avner P, et al. The nature and identification of quantitative trait loci: a community's view. Nat Rev Genet. 2003;4(11):911–6.
Alheit KV, Reif JC, Maurer HP, Hahn V, Weissmann EA, Miedaner T, Würschum T. Detection of segregation distortion loci in triticale (x Triticosecale Wittmack) based on a high-density DArT marker consensus genetic linkage map. BMC Genomics. 2011;12(1):380.
Asins MJ. Present and future of quantitative trait locus analysis in plant breeding. Plant Breed. 2002;121(4):281–91.
Bielenberg DG, Rauh B, Fan S, Gasic K, Abbott AG, Reighard GL, Okie WR, Wells CE. Genotyping by sequencing for SNP-based linkage map construction and QTL analysis of chilling requirement and bloom date in peach [Prunus persica (L.) Batsch]. PLoS One. 2015;10(10):e0139406.
Blas AL, Yu Q, Chen C, Veatch O, Moore PH, Paull RE, Ming R. Enrichment of a papaya high-density genetic map with AFLP markers. Genome. 2009;52(8):716–25.
Blas AL, Yu Q, Veatch OJ, Paull RE, Moore PH, Ming R. Genetic mapping of quantitative trait loci controlling fruit size and shape in papaya. Mol Breed. 2012;29(2):457–66.
Brettell RI, Johnson PR, Kulkarni VJ, Müller W, Bally IS. Inheritance of fruit characters in hybrid mangoes produced through controlled pollination. In: VII International Mango Symposium, vol. 645; 2002. p. 319–26.
Broman KW, Wu H, Sen Ś, Churchill GA. R/QTL: QTL mapping in experimental crosses. Bioinformatics. 2003;19(7):889–90.
Catchen JM, Amores A, Hohenlohe P, Cresko W, Postlethwait JH. Stacks: building and genotyping loci de novo from short-read sequences. G3 (Bethesda). 2011;1(3):171–82.
Catchen J, Hohenlohe PA, Bassham S, Amores A, Cresko WA. Stacks: an analysis tool set for population genomics. Mol Ecol. 2013;22(11):3124–40.
Celik I, Gurbuz N, Uncu AT, Frary A, Doganlar S. Genome-wide SNP discovery and QTL mapping for fruit quality traits in inbred backcross lines (IBLs) of Solanum pimpinellifolium using genotyping by sequencing. BMC Genomics. 2017;18(1):1.
Chaib J, Lecomte L, Buret M, Causse M. Stability over genetic backgrounds, generations and years of quantitative trait locus (QTLs) for organoleptic quality in tomato. Theor Appl Genet. 2006;112(5):934–44.
Chaim AB, Borovsky Y, Rao GU, Gur A, Zamir D, Paran I. Comparative QTL mapping of fruit size and shape in tomato and pepper. Israel J Plant Sci. 2006;54(3):191–203.
Chapman NH, Bonnet J, Grivet L, Lynn J, Graham N, Smith R, Sun G, Walley PG, Poole M, Causse M, King GJ. High-resolution mapping of a fruit firmness-related quantitative trait locus in tomato reveals epistatic interactions associated with a complex combinatorial locus. Plant Physiol. 2012;159(4):1644–57.
Chen C, Yu Q, Hou S, Li Y, Eustice M, Skelton RL, Veatch O, Herdes RE, Diebold L, Saw J, Feng Y. Construction of a sequence-tagged high-density genetic map of papaya for comparative structural and evolutionary genomics in brassicales. Genetics. 2007;177(4):2481–91.
Davey JW, Hohenlohe PA, Etter PD, Boone JQ, Catchen JM, Blaxter ML. Genome-wide genetic marker discovery and genotyping using next-generation sequencing. Nat Rev Genet. 2011;12(7):499.
Dellaporta SL, Wood J, Hicks JB. A plant DNA minipreparation: version II. Plant Mol Biol Report. 1983;1(4):19–21.
Doerge RW. Constructing genetic maps by rapid chain delineation. J Quantitative Trait Loci. 1996;2:121–32.
Dole J, Weber DF. Detection of quantitative trait loci influencing recombination using recombinant inbred lines. Genetics. 2007;177(4):2309–19.
Eloisa M, Reyes Q, Paull RE. Skin freckles on solo papaya fruit. Sci Hortic. 1994;58(1–2):31–9.
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE. A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS One. 2011;6:e19379.
Etienne C, Rothan C, Moing A, Plomion C, Bodenes C, Svanella-Dumas L, Cosson P, Pronier V, Monet R, Dirlewanger E. Candidate genes and QTLs for sugar and organic acid content in peach [Prunus persica (L.) Batsch]. Theor Appl Genet. 2002;105:145–59.
Evans EA, Ballen FH. An overview of global papaya production, trade, and consumption. Publication No. FE914. Gainesville: University of Florida; 2012. https://edis.ifas.ufl.edu/pdffiles/FE/FE91400.pdf. Accessed 30 Sept 2018.
FAOSTAT. Crop production. 2014. http://www.fao.org/faostat/en/#data/QC. Accessed 14 Oct 2018.
Guajardo V, Solís S, Sagredo B, Gainza F, Muñoz C, Gasic K, Hinrichsen P. Construction of high density sweet cherry (Prunus avium L.) linkage maps using microsatellite markers and SNPs detected by genotyping-by-sequencing (GBS). PLoS One. 2015;10(5):e0127750.
Guan Y, Peace C, Rudell D, Verma S, Evans K. QTLs detected for individual sugars and soluble solids content in apple. Mol Breed. 2015;35(6):135.
Hall D, Hallingbäck HR, Wu HX. Estimation of number and size of QTL effects in forest tree traits. Tree Genet Genomes. 2016;12:110.
Harel-Beja R, Tzuri G, Portnoy V, Lotan-Pompan M, Lev S, Cohen S, Dai N, Yeselson L, Meir A, Libhaber SE, Avisar E. A genetic map of melon highly enriched with fruit quality QTLs and EST markers, including sugar and carotenoid metabolism genes. Theor Appl Genet. 2010;121(3):511–33.
Howie B, Marchini J, Stephens M. Genotype imputation with thousands of genomes. G3 (Bethesda). 2011;1(6):457–70.
Huang YF, Poland JA, Wight CP, Jackson EW, Tinker NA. Using genotyping-by-sequencing (GBS) for genomic discovery in cultivated oat. PLoS One. 2014;9(7):e102448.
Hussain W, Baenziger PS, Belamkar V, Guttieri MJ, Venegas JP, Easterly A, Sallam A, Poland J. Genotyping-by-sequencing derived high-density linkage map and its application to QTL mapping of flag leaf traits in bread wheat. Sci Rep. 2017;7(1):16394.
Kenis K, Keulemans J, Davey MW. Identification and stability of QTLs for fruit quality traits in apple. Tree Genet Genomes. 2008;4(4):647–61.
Khan MA, Korban SS. Association mapping in forest trees and fruit crops. J Exp Bot. 2012;63(11):4045–60.
Kim MS, Moore PH, Zee F, Fitch MM, Steiger DL, Manshardt RM, Paull RE, Drew RA, Sekioka T, Ming R. Genetic diversity of Carica papaya as revealed by AFLP markers. Genome. 2002;45(3):503–12.
Kosambi DD. The estimation of map distances from recombination values. Ann Eugenics. 1944;12:172–5.
Lahaye M, Devaux MF, Poole M, Seymour GB, Causse M. Pericarp tissue microstructure and cell wall polysaccharide chemistry are differently affected in lines of tomato with contrasted firmness. Postharvest Biol Technol. 2013;76:83–90.
Langmead B, Salzberg SL. Fast gapped-read alignment with bowtie 2. Nat Methods. 2012;9(4):357.
Lerceteau-Köhler E, Moing A, Guérin G, Renaud C, Maucourt M, Rolin D. QTL analysis for sugars and organic acids in strawberry fruits. Acta Hortic. 2006;708:573–7.
Liu H, Bayer M, Druka A, Russell JR, Hackett CA, Poland J, Ramsay L, Hedley PE, Waugh R. An evaluation of genotyping by sequencing (GBS) to map the Breviaristatum-e (ari-e) locus in cultivated barley. BMC Genomics. 2014;15(1):104.
Liu J, Van Eck J, Cong B, Tanksley SD. A new class of regulatory genes underlying the cause of pear-shaped tomato fruit. Proc Natl Acad Sci. 2002;99(20):13302–6.
Ma H, Moore PH, Liu Z, Kim MS, Yu Q, Fitch MM, Sekioka T, Paterson AH, Ming R. High-density linkage mapping revealed suppression of recombination at the sex determination locus in papaya. Genetics. 2004;166(1):419–36.
McDaniel SF, Willis JH, Shaw AJ. A linkage map reveals a complex basis for segregation distortion in an interpopulation cross in the moss Ceratodon purpureus. Genetics. 2007;176(4):2489–500.
McCormack JE, Hird SM, Zellmer AJ, Carstens BC, Brumfield RT. Applications of next-generation sequencing to phylogeography and phylogenetics. Mol Phylogenet Evol. 2013;66(2):526–38.
Margarido GR, Souza AP, Garcia AA. OneMap: software for genetic mapping in outcrossing species. Hereditas. 2007;144(3):78–9.
Martínez-García PJ, Parfitt DE, Ogundiwin EA, Fass J, Chan HM, Ahmad R, Lurie S, Dandekar A, Gradziel TM, Crisosto CH. High density SNP mapping and QTL analysis for fruit quality characteristics in peach (Prunus persica L.). Tree Genet Genomes. 2013;9(1):19–36.
Ming R, Hou S, Feng Y, Yu Q, Dionne-Laporte A, Saw JH, Senin P, Wang W, Ly BV, Lewis KL, Salzberg SL. The draft genome of the transgenic tropical fruit tree papaya (Carica papaya Linnaeus). Nature. 2008;452(7190):991.
Money D, Gardner K, Migicovsky Z, Schwaninger H, Zhong G, Myles S. LinkImpute: fast and accurate genotype imputation for non-model organisms. G3 (Bethesda). 2015;5:2383–90.
Montero-Pau J, Blanca J, Esteras C, Martínez-Pérez EM, Gómez P, Monforte AJ, Cañizares J, Picó B. An SNP-based saturated genetic map and QTL analysis of fruit-related traits in zucchini using genotyping-by-sequencing. BMC Genomics. 2017;18(1):94.
Nantawan M, Kanchana-udomkan C, Ford R. Papaya evaluation handbook: productivity and fruit quality traits. 2017. https://www.horticulture.com.au/globalassets/hort-innovation/resource-assets/pp15000-papaya-evaluation-handbook.pdf. Accessed 1 Sept 2017.
Nishitani C, Hirai N, Komori S, Wada M, Okada K, Osakabe K, Yamamoto T, Osakabe Y. Efficient genome editing in apple using a CRISPR/Cas9 system. Sci Rep. 2016;6:31481.
OECD. Consensus document on compositional considerations for new varieties of papaya (Carica papaya L.): key food and feed nutrients, anti-nutrients, toxicants and allergens. Organisation for Economic Co-operation and Development; 2009. https://www.oecd.org/science/biotrack/46815336.pdf. Accessed 14 Oct 2018.
Oliveira EJ, Amorim VBO, Matos ELS, Costa JL, Silva Castellen M, Pádua JG, Dantas JLL. Polymorphism of microsatellite markers in papaya (Carica papaya L.). Plant Mol Biol Report. 2010;28(3):519–30.
Oliveira EJ, Fraife FGA, Freitas JPX, Dantas JLL, Resende MDV. Plant selection in F2 segregating populations of papaya from commercial hybrids. Crop Breeding Appl Biotechnol. 2012;12:191–8.
Paris JR, Stevens JR, Catchen JM. Lost in parameter space: a road map for stacks. Methods Ecol Evol. 2017;8(10):1360–73.
Pérez J, Coppens d’Eeckenbrugge G, Risterucci AM, Dambier D, Ollitrault P. Papaya genetic diversity assessed with microsatellite markers in germplasm from the Caribbean region. In: International Symposium on Papaya. Kuala Lumpur: ISHS; 2005. p. 93–101.
Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE. Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS One. 2012;7(5):e37135.
Poland JA, Rife TW. Genotyping-by-sequencing for plant breeding and genetics. Plant Genome. 2012;5(3):92–102.
Praveen KRB, Hameedunnisa B, Sunil N, Thirupathi RM. Variance component analysis of quantitative traits in muskmelon (Cucumis melo L.). Int J Curr Microbiol App Sci. 2017;6(6):2277–85.
R Core Team. R: a language and environment for statistical computing. 2017. https://www.r-project.org. Accessed 1 Feb 2017.
Rochette NC, Catchen JM. Deriving genotypes from RAD-seq short-read data using stacks. Nat Protoc. 2017;12:2640–59.
Ruden DM, Cingolani P, Patel VM, Coon M, Nguyen T, Land SJ, Lu X. Using Drosophila melanogaster as a model for genotoxic chemical mutational studies with a new program, SnpSift. Front Genet. 2012;3:35.
Semagn K, Bjørnstad Å, Ndjiondjop MN. Principles, requirements and prospects of genetic mapping in plants. Afr J Biotechnol. 2006;5:2569–87.
Silva FF, Pereira MG, Ramos HCC, Junior PCD, Pereira TNS, Viana AP, Daher RF, Ferreguetti GA. Estimation of genetic parameters related to morphoagronomic and fruit quality traits of papaya. Crop Breeding Appl Biotechnol. 2008;8:65–73.
Sondur SN, Manshardt RM, Stiles JI. A genetic linkage map of papaya based on randomly amplified polymorphic DNA markers. Theor Appl Genet. 1996;93(4):547–53.
Sorkheh K, Malysheva-Otto LV, Wirthensohn MG, Tarkesh-Esfahani S, Martínez-Gómez P. Linkage disequilibrium, genetic association mapping and gene localization in crop plants. Genet Mol Biol. 2008;31(4):805–14.
Srinivasan R, Manshardt R. Genetic linkage mapping and QTL analysis of economic traits in papaya (Carica papaya L.). HortScience. 2004;39(4):8880–9.
Stiles JI, Lemme C, Sondur S, Morshidi MB, Manshardt R. Using randomly amplified polymorphic DNA for evaluating genetic relationships among papaya cultivars. Theor Appl Genet. 1993;85(6–7):697–701.
Sun R, Chang Y, Yang F, Wang Y, Li H, Zhao Y, Chen D, Wu T, Zhang X, Han Z. A dense SNP genetic map constructed using restriction site-associated DNA sequencing enables detection of QTLs controlling apple fruit quality. BMC Genomics. 2015;16(1):747.
Tanksley SD. Mapping polygenes. Annu Rev Genet. 1993;27(1):205–33.
Tian S, Jiang L, Gao Q, Zhang J, Zong M, Zhang H, Ren Y, Guo S, Gong G, Liu F, Xu Y. Efficient CRISPR/Cas9-based gene knockout in watermelon. Plant Cell Rep. 2017;36(3):399–406.
Torkamaneh D, Laroche J, Belzile F. Genome-wide SNP calling from genotyping by sequencing (GBS) data: a comparison of seven pipelines and two sequencing technologies. PLoS One. 2016;11(8):e0161333.
Van Droogenbroeck B, Breyne P, Goetghebeur P, Romeijn-Peeters E, Kyndt T, Gheysen GA. AFLP analysis of genetic relationships among papaya and its wild relatives (Caricaceae) from Ecuador. Theor Appl Genet. 2002;105(2–3):289–97.
Voorrips RE. MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered. 2002;93(1):77–8.
Wai CM, Ming R, Moore PH, Paull RE, Yu Q. Development of chromosome-specific cytogenetic markers and merging of linkage fragments in papaya. Trop Plant Biol. 2010;3:171–81.
Wang S, Basten CJ, Zeng ZB. Windows QTL Cartographer 2.5. 2012a. http://statgen.ncsu.edu/qtlcart/WQTLCart.htm. Accessed 1 Feb 2017.
Wang W, Huang S, Liu Y, Fang Z, Yang L, Hua W, Yuan S, Liu S, Sun J, Zhuang M, Zhang Y. Construction and analysis of a high-density genetic linkage map in cabbage (Brassica oleracea L. var. capitata). BMC Genomics. 2012b;13(1):523.
Wu J, Li LT, Li M, Khan MA, Li XG, Chen H, Yin H, Zhang SL. High-density genetic linkage map construction and identification of fruit-related QTLs in pear using SNP and SSR markers. J Exp Bot. 2014;65(20):5771–81.
Xu X, Lu L, Zhu B, Xu Q, Qi X, Chen X. QTL mapping of cucumber fruit flesh thickness by SLAF-seq. Sci Rep. 2015;5:15829.
Yu Q, Tong E, Skelton RL, Bowers JE, Jones MR, Murray JE, Hou S, Guan P, Acob RA, Luo MC, Moore PH. A physical map of the papaya genome with integrated genetic map and genome sequence. BMC Genomics. 2009;10(1):371.
Yuan XJ, Li XZ, Pan JS, Wang G, Jiang S, Li XH, Deng SL, He HL, Si MX, Lai L, Wu AZ. Genetic linkage map construction and location of QTLs for fruit-related traits in cucumber. Plant Breed. 2008;127(2):180–8.
Zygier S, Chaim AB, Efrati A, Kaluzky G, Borovsky Y, Paran I. QTLs mapping for fruit size and shape in chromosomes 2 and 4 in pepper and a comparison of the pepper QTL map with that of tomato. Theor Appl Genet. 2005;111(3):437–45.
We would like to acknowledge Lecker Farming Limited for providing planting facilities and logistical support for field work in Mareeba, Australia. The authors declare that they have no competing interests.
This project was funded by Horticulture Innovation Australia Limited using the papaya industry levy and funds from the Australian Government (Project number PP15000: “New genetic targets to improve quality in papaya”). The funding body had no role in the design of the study; the collection, analysis, and interpretation of data; or the writing of the manuscript.
Ethics approval and consent to participate
Consent for publication
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Table S1. Mean and standard deviation of fruit quality traits of parental lines and their F1 and F2 progeny populations in 2016 and 2017. This table presents the phenotypic evaluation of seven fruit quality traits.
Table S2. Phenotypic variances by generation in 2016 and 2017 and heritability estimates of each fruit quality trait.
Figure S1. Phenotypic variation of fruit quality traits (A-G) among parents, F1 and F2 populations. Within each box, mean and median values are represented by black solid lines (−) and red crosses (+), respectively. The mid-parent values are indicated by horizontal dashed lines.
Table S3. Summary of the initial linkage map from the F2 population of ‘RB2’ x ‘Sunrise Solo’.
Table S4. Summary of SNP markers and segregation.
Figure S2. Genetic map of ‘RB2’ x ‘Sunrise Solo’ and QTL for fruit quality traits. The linkage groups (LGs) from the initial map and the final map are labelled LG1-LG23 and I-X, respectively. The left pane indicates the genetic map position (cM) of each SNP. Homology between the two maps is highlighted in turquoise. Colour bars to the right of the final map indicate QTL positions and LOD intervals at 95% confidence: flesh sweetness (SWE), red; fruit weight (WEI), brown; fruit length (LEN), green; fruit width (WID), olive; skin freckle (FRE), pink; flesh thickness (THI), black; fruit firmness (FIR), blue. Data from harvest years 2016 and 2017 are represented by solid and diagonal-stripe bars, respectively.
Table S5. Associated SNPs and candidate genes for flesh sweetness and other fruit quality traits.
Cite this article
Nantawan, U., Kanchana-udomkan, C., Bar, I. et al. Linkage mapping and quantitative trait loci analysis of sweetness and other fruit quality traits in papaya. BMC Plant Biol 19, 449 (2019). https://doi.org/10.1186/s12870-019-2043-0