QTL mapping for growth-related traits by constructing the first genetic linkage map in Simao pine

Background Simao pine is one of the primary economic tree species for resin and timber production in southwest China. Its exploitation and utilization are constrained by a relative lack of genetic information. Constructing a fine genetic linkage map and detecting quantitative trait loci (QTLs) for growth-related traits are prerequisites for the molecular breeding program of Simao pine. Results In this study, a high-resolution genetic map of Simao pine was constructed using specific locus amplified fragment sequencing (SLAF-seq) technology and an F1 pseudo-testcross population. A total of 11,544 SNPs were assigned to 12 linkage groups (LGs); the total map length was 2,062.85 cM, with a mean distance of 0.37 cM between markers. Based on phenotypic variation analysis over three consecutive years, a total of 17 QTLs for four traits were detected: six for plant height (Dh.16.1, Dh16.2, Dh17.1, Dh18.1–3), five for basal diameter (Dbd.17.1–5), four for needle length (Dnl17.1–3, Dnl18.1) and two for needle diameter (Dnd17.1 and Dnd18.1). Individual QTLs explained 11.0–16.3% of the phenotypic variance, with logarithm of odds (LOD) values ranging from 2.52 to 3.87. Conclusions A fine genetic map of Simao pine was constructed for the first time using SLAF-seq technology. Based on this map, a total of 17 QTLs for four growth-related traits were identified. The map and QTLs provide helpful information for genomic studies and marker-assisted selection (MAS) in Simao pine. Supplementary Information The online version contains supplementary material available at 10.1186/s12870-022-03425-y.

for perennial woody plants had been constructed [18,26,52], and various QTLs associated with essential traits had been identified based on these maps [12,36,42]. However, the great majority of them were low-saturation framework maps, which lowered the accuracy of QTL mapping [28,31]. Single nucleotide polymorphisms (SNPs) are molecular markers that can be developed by high-throughput sequencing. They are convenient, abundant, highly polymorphic and commonly used in genetic map construction [34,45,46]. With the rapid development of next-generation sequencing (NGS), SLAF-seq has become one of the popular methods for SNP marker development and high-resolution genetic map construction [9,44,69]. To date, genetic linkage maps for various plants have been established by SLAF-seq, which has greatly improved the efficiency and accuracy of QTL mapping [11,37,68,70].
Construction of a genetic linkage map for Simao pine will provide helpful information for genomic studies and facilitate breeding applications. Growth-related traits are important economic traits in woody tree breeding, and detecting QTLs for these traits is a crucial step in the molecular breeding program for Simao pine. Therefore, we employed SLAF-seq technology for rapid SNP development and constructed a high-density linkage map. QTLs linked to growth-related traits were then identified based on this map. The map will provide a powerful tool for the future detection of QTLs for other economic traits and for MAS in Simao pine breeding.

Analysis of sequencing data and SLAF markers
The Simao pine SLAF libraries were constructed successfully. A total of 461.41 M reads (GC content of 40.23% and Q30 of 92.25%) were obtained. The numbers of reads for the maternal and paternal parents were 22,947,158 and 18,534,664, respectively, and the mean for the F1 individuals was 4,659,126 (Table 1). After filtering out the low-quality reads, the numbers of SLAFs for the maternal parent, the paternal parent and the F1 progeny (on average) were 535,598, 482,851 and 375,315, respectively. The average SLAF depth was 10.48-fold for the maternal parent and 9.77-fold for the paternal parent, and the average for each F1 individual was 3.47-fold (Table 1). A total of 633,086 high-quality SLAFs were obtained.

Construction and evaluation of genetic linkage map
After discarding the unsuitable markers, a total of 5,643 SLAFs were successfully used for linkage map construction. These SLAFs contained 11,544 SNP markers (Table 2). Based on these markers, we constructed a highly saturated genetic linkage map covering 2,062.85 cM and comprising 12 LGs, with a mean marker distance of 0.37 cM (Fig. 2, Table 2). The genetic length of individual LGs varied from 147.38 cM (LG4) to 194.85 cM (LG9), with a mean of 171.90 cM. Among the 12 LGs, LG8 was the largest linkage group (523 SLAFs), while LG7 was the smallest (367 SLAFs). The average number of markers per LG was 470. In terms of density, LG5 was the densest linkage group, with the smallest mean marker distance (0.32 cM), whereas LG1, LG3 and LG7 were the least dense (0.40 cM). The maximum gap in the map was 11.17 cM, located in LG2 and LG3.
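The summary figures above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming that the mean marker distance is taken as the total map length divided by the number of mapped SLAF markers; the function and variable names are illustrative and are not part of the original analysis.

```python
# Totals quoted in the text (Table 2).
TOTAL_LENGTH_CM = 2062.85   # total map length in cM
N_SLAF_MARKERS = 5643       # SLAF markers placed on the map
N_LINKAGE_GROUPS = 12       # number of linkage groups


def map_summary(total_cm, n_markers, n_groups):
    """Return simple whole-map averages (assumed definitions, see lead-in)."""
    return {
        "mean_marker_distance_cM": total_cm / n_markers,
        "mean_lg_length_cM": total_cm / n_groups,
        "mean_markers_per_lg": n_markers / n_groups,
    }


summary = map_summary(TOTAL_LENGTH_CM, N_SLAF_MARKERS, N_LINKAGE_GROUPS)
for name, value in summary.items():
    print(f"{name}: {value:.2f}")
```

Under these assumptions the three averages round to the values reported in the text (0.37 cM, 171.90 cM and 470 markers per LG).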
Three approaches were used to assess the quality of the Simao pine genetic map. (1) The marker integrity analysis showed that the integrity of the mapped markers across individuals was 99.99% (Fig. 3), and the average depth for the parents was more than five times that of the offspring (Table 3). This suggests that genotyping was accurate and that the mapping population was suitable for further analysis.
(2) The haplotype map analysis revealed that most of the recombination blocks were distinctly defined (Supplementary Fig. 1), suggesting that the highly saturated genetic map was suitable for subsequent genetic analysis. (3) The heat map analysis showed that the markers were well ordered in most linkage groups, indicating that the constructed Simao pine genetic map was highly accurate (Supplementary Fig. 2).

Phenotypic variation analysis
The phenotypic data and summary statistics for the growth-related traits over the three years are summarized in Table 4. The four traits followed a normal distribution in all three years (Fig. 4), and a relatively high degree of genetic variation was found (Table 5).

QTL mapping
Using the constructed map and the phenotypic data from the mapping population, 17 QTLs linked to four traits were identified (Table 6, Supplementary Fig. 3). The individual QTL explained the

Discussion
Construction of a high-resolution map for Simao pine provides helpful information for genomic studies and facilitates breeding applications. In this study, a fine genetic map of Simao pine was constructed using SLAF-seq technology. It contains 12 LGs and 11,544 SNPs and spans 2,062.85 cM with a mean marker distance of 0.37 cM, representing a significant improvement over previous linkage maps in coniferous plants [8,10,14,15,39,60]. To our knowledge, this is one of the most saturated genetic maps reported to date for a coniferous tree species. Furthermore, a total of 17 QTLs for four growth-related traits were identified based on the constructed map, and these QTLs are valuable resources for genetic breeding and MAS in Simao pine. An appropriate mapping population lays a solid foundation for genetic map construction [75]. For perennial woody trees, backcross (BC), recombinant inbred line (RIL) and F2 populations are difficult to obtain in the short term because of long generation times. The pseudo-testcross strategy, in which an F1 population replaces these population types, was proposed to address this problem [20]. This strategy has been successfully applied to various forest trees, especially non-model and unsequenced species [32,35,47,61,62,65]. In this study, nine F1 populations were obtained by artificial hybridization. Based on the variation in field phenotypic characteristics among populations and the genetic similarity coefficients among parents, the superior clones SM11 (high resin content) and JG1 (fast growth) were chosen as the maternal and paternal parents, respectively, and their F1 hybrid population was used as the mapping population. Because the parents differ markedly in resin content and growth rate, clear variation is expected in the segregating population, which should facilitate QTL mapping for these traits.
Molecular markers are powerful tools for genetic map construction [2]. The mainstream molecular markers used for genetic linkage map construction in heterozygous perennial forest tree species include SNP, simple sequence repeat (SSR), inter-simple sequence repeat (ISSR), amplified fragment length polymorphism (AFLP) and random amplified polymorphic DNA (RAPD) markers, among others [7,21,28,31,38,56]. Among these, SNPs are considered one of the ideal marker types for genetic map construction because they are abundant, fast to develop and cover the whole genome [3,16]. Genetic map construction has changed significantly with the development of high-throughput sequencing technology [15]. Recently, SLAF-seq has become one of the most popular assays for SNP marker development [45]. A high-density genetic linkage map for Simao pine was successfully constructed using this approach, indicating that SNP markers can be applied efficiently to genetic linkage map construction in Simao pine.
High-quality genetic maps can increase the accuracy of QTL mapping [28,31,32,35]. The number of markers in a genetic map is one of the essential indicators of its quality. A genetic map with a large number of markers tends to have suitable marker spacing and high resolution [70]. The map constructed here is the first in a coniferous tree species to contain more than ten thousand SNP markers, which supports its high quality. In addition, three further approaches were used to evaluate the quality of the Simao pine genetic map. All results indicated that the current high-accuracy map provides sufficient information for QTL mapping.
Growth-related traits are important economic traits in woody tree breeding, and detecting QTLs for growth-related traits is an initial step in the molecular breeding program for Simao pine. In this study, a total of 17 QTLs for four growth-related traits were identified. Individual QTLs explained 11.0% to 16.3% of the phenotypic variation, indicating that several genes with sizeable effects might control the growth-related traits of Simao pine (Rönnberg et al., 2005) [28,31]. Among the four traits, only the QTLs for plant height were detected in all three years; QTLs for the other three traits were not detected consecutively. This suggests that different genes/QTLs might influence the growth-related traits of Simao pine in different seasons or at different ages, or that QTL stability varied with environmental change. In agreement with previous studies in woody trees, growth-related traits were mainly quantitative traits, which are controlled by multiple genes, are easily affected by the environment, and probably change as the tree matures [27,67]. Accordingly, multi-environment QTL analysis is more accurate than a single-environment experiment for QTL mapping of growth-related traits in heterozygous perennial woody trees [17]. Thus, to reduce environmental interference and improve QTL accuracy, multi-environment QTL tests at other sites and in different genetic backgrounds should be carried out for Simao pine in the future [1,28,31]. The size of the mapping population is decisive for the success of genetic mapping and QTL analysis, and the influence of missing genotypes is more pronounced in small populations than in large ones. In general, a mapping population of 50-250 individuals may be sufficient to construct an initial skeletal linkage map [4,24]. However, a larger population is needed for high-resolution or fine mapping [40,54,55].
A main limitation of the Simao pine genetic linkage map in this study is the small mapping population (100 individuals), which may produce fragmented linkage groups and inaccurate locus order and thereby reduce the accuracy of QTL mapping [40]. Artificially controlled pollination is known to be more difficult in conifers, and the number of hybrid offspring obtained is smaller than in other tree species [23]. Nevertheless, high-density genetic maps constructed with small mapping populations have been used successfully for QTL fine mapping [61,62,75]. To further improve the accuracy of the map and the QTLs, the size of the Simao pine mapping population should be increased in the future.

Conclusions
We report the first high-density genetic map for Simao pine. The map was constructed using an F1 population and was based on SNP markers developed by using the SLAF-seq approach, which allowed the efficient development of a large number of markers in a short time. A total of 17 QTLs for growth-related traits were identified based on the constructed genetic map. The results of this study will provide a platform for map-based gene isolation and molecular breeding for Simao pine.

Mapping population and DNA extraction
Following a factorial mating design, eleven superior clones with rapid growth and high resin content were selected from 105 clones as hybrid parents. Five clones served as male parents (JG1, NR7, LC3, ZY1 and PW2) and the other six as female parents (PW12, LC9, JG7, JD5, PW3 and SM11). In the spring of 2014, artificially controlled pollination was conducted for a total of 30 cross combinations. Two years later, after harvesting, sowing and seedling culture, a total of nine full-sib families were obtained and grown at the farm of the Pu'er City Institute of Forestry Sciences (22°47′ N, 100°59′ E). Next, family 9 was selected as the mapping population for Simao pine genetic linkage map construction, based on an analysis of population phenotypic variation and the genetic similarity coefficient of the parents [57]. The parents of the mapping population were the superior clones SM11 (maternal, characterized by high resin content) and JG1 (paternal, characterized by rapid growth). Fresh, young and healthy needles from the parents and the mapping population (100 hybrid individuals) were collected and immediately frozen in liquid nitrogen. Genomic DNA was isolated using an improved cetyl trimethyl ammonium bromide (CTAB) method [53].

SLAF library establishment and sequencing
SLAF library construction and high-throughput sequencing for the mapping population followed the procedure of Zhang et al. [68] with minor modifications. Briefly, two steps were applied. First, all genomic DNA for SLAF library construction was digested with a single enzyme, HaeIII (New England Biolabs, NEB, USA). Second, only SLAF fragments ranging from 414 to 464 bp in length were excised and diluted for paired-end sequencing on the Illumina HiSeq 2500 platform (Illumina, Inc.; San Diego, CA, USA).

Analysing and genotyping for sequence data
SLAF-seq data grouping and SNP genotyping followed Wang et al. [56]. After discarding the low-quality reads, the remaining reads with more than 90% similarity were clustered into the same SLAF locus. Because Simao pine is a diploid plant, only SLAFs with two to four alleles were designated as potential polymorphic markers. Markers with the aa × bb segregation pattern were not used to construct the genetic map, because the mapping population was obtained from a cross between two heterozygous Simao pine parents [22,25].
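The two marker-selection rules described above (two to four alleles per SLAF, and exclusion of the aa × bb pattern) can be expressed as a small filter. This is a minimal sketch, not the pipeline used in the study: the `SlafLocus` record, its field names, the marker identifiers and the segregation codes other than `aaxbb` are hypothetical and serve only as toy input.

```python
from dataclasses import dataclass


@dataclass
class SlafLocus:
    locus_id: str      # hypothetical marker identifier
    n_alleles: int     # number of distinct alleles observed at the locus
    segregation: str   # segregation pattern code, e.g. "aaxbb" (others illustrative)


def is_candidate_marker(locus: SlafLocus) -> bool:
    """Keep loci with 2-4 alleles whose pattern is not aa x bb."""
    has_valid_allele_count = 2 <= locus.n_alleles <= 4
    is_excluded_pattern = locus.segregation == "aaxbb"
    return has_valid_allele_count and not is_excluded_pattern


# Toy input: only the allele count and the aa x bb rule come from the text.
loci = [
    SlafLocus("Marker_A", 2, "lmxll"),
    SlafLocus("Marker_B", 2, "aaxbb"),   # excluded: aa x bb pattern
    SlafLocus("Marker_C", 5, "abxcd"),   # excluded: more than four alleles
    SlafLocus("Marker_D", 3, "efxeg"),
    SlafLocus("Marker_E", 1, "none"),    # excluded: not polymorphic
]
kept = [locus.locus_id for locus in loci if is_candidate_marker(locus)]
print(kept)
```

In a real workflow further filters (for example on depth, missing data and segregation distortion) would normally follow; they are omitted here because the text does not specify them.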

Linkage map construction and evaluation
The HighMap software [22] with the cross-pollination (CP) option was used to construct the Simao pine genetic linkage map. The parameters were set to a minimum LOD threshold of 5.0 for linkage grouping and a maximum recombination fraction of 0.4, and map distances in centiMorgans (cM) were calculated with the Kosambi mapping function [51]. After linkage grouping, the maximum likelihood method was used to order the SLAF markers within each LG [37,50]. The SMOOTH algorithm was used to correct genotyping errors [49]. To evaluate the quality of the constructed Simao pine genetic map, the integrity of the mapped markers was analyzed, and haplotype maps and heat maps were constructed for each LG [33,59,63].
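For reference, the Kosambi mapping function converts a recombination fraction r into a map distance d = 25 · ln((1 + 2r) / (1 − 2r)) cM. The sketch below implements this standard formula and its inverse; the function names are illustrative, and the code is independent of HighMap, whose internal implementation is not described in the text.

```python
import math


def kosambi_cm(r: float) -> float:
    """Map distance in cM for a recombination fraction r (0 <= r < 0.5)."""
    if not 0.0 <= r < 0.5:
        raise ValueError("recombination fraction must satisfy 0 <= r < 0.5")
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


def kosambi_r(d_cm: float) -> float:
    """Inverse mapping: recombination fraction for a distance in cM."""
    return 0.5 * math.tanh(2.0 * d_cm / 100.0)


# Distance corresponding to the maximum recombination fraction used for grouping.
print(f"r = 0.40 -> {kosambi_cm(0.40):.2f} cM")
```

For small r the distance is close to 100 · r cM, and it grows without bound as r approaches 0.5, which reflects the interference assumption built into the Kosambi function.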

Growth-related traits assessment and QTL analysis
Growth-related traits of the progeny, including plant height, basal diameter, needle length and needle diameter, were measured over three consecutive years. Plant height and needle length were measured with a tape measure, while basal diameter and needle diameter were measured with a vernier caliper, in December of each year from 2016 to 2018. The phenotypic variation analysis, including coefficients of variation (CV) and correlation coefficients between all investigated traits, was performed with SPSS 20.0. QTLs underlying the growth-related traits were detected by interval mapping in MapQTL 6.0 [48]. The 95% Bayesian credible interval method was used to calculate the confidence intervals for all QTLs [43]. The LOD threshold was determined by 1,000 permutations; based on these permutations, a minimum LOD score of 2.5 was adopted in this study. The percentage of phenotypic variance explained by each detected QTL was calculated relative to the phenotypic variance in the population [65].
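The relationship between a LOD score and the proportion of phenotypic variance explained (PVE) can be illustrated with a widely used approximation for single-QTL models, PVE ≈ 1 − 10^(−2·LOD/n), where n is the number of individuals. This is a sketch under that assumption; the text does not state the exact formula applied by MapQTL, and the function name is illustrative.

```python
def pve_from_lod(lod: float, n_individuals: int) -> float:
    """Approximate fraction of phenotypic variance explained by one QTL.

    Assumes the single-QTL approximation PVE = 1 - 10 ** (-2 * LOD / n);
    this is not necessarily the computation performed by MapQTL.
    """
    if n_individuals <= 0:
        raise ValueError("n_individuals must be positive")
    return 1.0 - 10.0 ** (-2.0 * lod / n_individuals)


N_PROGENY = 100  # mapping population size stated in the text

# Smallest and largest LOD values reported for the detected QTLs.
for lod in (2.52, 3.87):
    print(f"LOD {lod:.2f} -> PVE {100 * pve_from_lod(lod, N_PROGENY):.1f}%")
```

With n = 100, the reported LOD range of 2.52 to 3.87 corresponds under this approximation to about 11.0% and 16.3%, which agrees with the PVE range reported for the detected QTLs.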