- Research article
- Open Access
Construction of a high-density genetic map and QTL mapping of leaf traits and plant growth in an interspecific F1 population of Catalpa bungei × Catalpa duclouxii Dode
BMC Plant Biology volume 19, Article number: 596 (2019)
Catalpa bungei is an important tree species used for timber in China and widely cultivated for economic and ornamental purposes. A high-density linkage map of C. bungei would be an efficient tool not only for identifying key quantitative trait loci (QTLs) that affect important traits, such as plant growth and leaf traits, but also for other genetic studies.
Restriction site-associated DNA sequencing (RAD-seq) was used to identify molecular markers and construct a genetic map. Approximately 280.77 Gb of clean data were obtained after sequencing, and in total, 25,614,295 single nucleotide polymorphisms (SNPs) and 2,871,647 insertions-deletions (InDels) were initially identified in the genomes of 200 individuals of a C. bungei (7080) × Catalpa duclouxii (16-PJ-3) F1 population and their parents. Finally, 9072 SNP and 521 InDel markers that satisfied the requirements for constructing a genetic map were obtained. The integrated genetic map contained 9593 pleomorphic markers in 20 linkage groups and spanned 3151.63 cM, with an average distance between adjacent markers of 0.32 cM. Twenty QTLs for seven leaf traits and 13 QTLs for plant height at five successive time points were identified using our genetic map by inclusive composite interval mapping (ICIM). Q16–60 was identified as a QTL for five leaf traits, and three significant QTLs (Q9–1, Q18–66 and Q18–73) associated with plant growth were detected at least twice. Genome annotation suggested that a cyclin gene participates in leaf trait development, while the growth of C. bungei may be influenced by CDC48C and genes associated with phytohormone synthesis.
This is the first genetic map constructed in C. bungei and will be a useful tool for further genetic study, molecular marker-assisted breeding and genome assembly.
Catalpa bungei (2n = 2 × =40) is a woody plant belonging to the genus Catalpa, family Bignoniaceae , and an important ornamental tree species widely used in urban forests in central and northern cities in China due to its beautiful flowers, straight stems and moderate efficiency in particulate matter removal [2, 3]. C. bungei is native to China and remains mainly distributed in China. According to the records, people in ancient China started to cultivate and utilize C. bungei as early as the Han dynasty (202 BC to 220 AD) . In addition to its value in landscaping, C. bungei has wood with excellent mechanical properties and high durability that can resist the corrosion caused by microorganisms and insects. It is usually used to make coffins, musical instruments, boats and other upscale wooden products in ancient China [5,6,7,8]. Even today, it is still a popular material for upmarket furniture making and house decorating. To satisfy the vast demand for timber supply and urban landscaping in China, efforts have been made towards C. bungei hybridization for economic and ecological purposes [9, 10]. However, the lack of genetic information for the target traits has made the formulation of a highly efficient breeding strategy in C. bungei much more difficult than in crops.
As C. bungei is a timber and urban forest tree, its growth traits are its most important economic traits, and recent studies have demonstrated that leaf traits such as leaf area (LA), petiole length (PL) and others are associated with its capacity to capture particles from the air . These traits are complex quantitative traits that may be determined by several factors, including cell division and expansion, phenology, and photosynthesis efficiency . An understanding of quantitative trait loci (QTLs) would be beneficial to reveal the genetic architecture of these important traits.
Genetic mapping is one of the main methods used to identify QTLs and genes that regulate complex but important traits, such as plant growth , flowering time  and abiotic stress resistance [14, 15], in plant breeding. Genetic mapping in forest studies mostly employs F1 populations, and because of the long lifespan and large size of trees, building a genetic mapping population usually requires large fields and a long duration of land use, which makes the application of genetic mapping much less common in perennial forest species than in annual crops. Despite the difficulties in population construction, limited studies have demonstrated that linkage mapping is still a powerful tool for dissecting complex quantitative traits in forest species. For example, Xia et al. identified nine QTLs and candidate genes regulating leaf shape using a genetic map constructed from an F1 population of Populus deltoides × Populus simonii Carr . Du et al. revealed the genetic architecture of growth traits in poplars using linkage analysis and association studies . Other traits, such as bud burst timing , lignin content , reproduction-related traits , and fruit-related traits [21, 22], have also been subjected to linkage-based QTL mapping in forest species and other woody plants. According to QTL studies, genetic mapping is still one of the most efficient methods for studying genetic characteristics. In addition, genetic maps are also basis for map-based cloning and other genetic analyses [23, 24].
Genetic maps are constructed according to the linkage relationships between molecular markers in the genome, including random amplified polymorphic DNA (RAPD) [25, 26], amplified fragment length polymorphisms (AFLPs) [26,27,28], simple sequence repeats (SSRs) [26, 28,29,30], sequence-related amplified polymorphisms (SRAPs) , single nucleotide polymorphisms (SNPs) , and insertions-deletions (InDels) . Among these types of markers, SNPs and InDels are considered potential applied molecular markers . Currently, next-generation sequencing (NGS) technology, such as whole-genome resequencing and reduced-representation genome sequencing (RRGS), has facilitated the identification of SNP and InDel markers and the construction of high-density genetic maps with molecular markers throughout the plant genome . Restriction site-associated DNA sequencing (RAD-seq) is an RRGS method and has been effectively applied in high-throughput molecular marker discovery and QTL mapping of important traits in woody plants [36,37,38]. Consequently, the construction of a C. bungei high-density genetic map using RAD-seq will not only aid in the development of markers for genetic studies but also help accelerate the breeding process in C. bungei; however, such work has not yet been reported.
In this study, an F1 segregating population derived from two Catalpa cultivars, namely, C. bungei “7080” (female parent) and Catalpa duclouxii “16-PJ-3” (male parent), was generated. A high-density genetic map was constructed using RAD technology based on the F1 population. Subsequently, we located and analysed QTLs associated with leaf traits using the genetic map. In addition, we also studied QTLs associated with plant height at six time points during the growing season. This is the first genetic map in C. bungei and lays a foundation for future genetic studies and marker-assisted selection (MAS) of C. bungei.
Construction of the genetic map
Nearly 288.75 Gb (288,747,675,610 bp) of raw data containing paired-end reads was generated by Illumina sequencing of the 200 F1 progeny and their parents using RAD-sequencing (for offspring individuals) and resequencing (for parents). After data filtering, we obtained 963,326,642 clean reads totalling more than 280.72 Gb of clean data with an average Q30 (%) value of 93.0% and a guanine-cytosine (GC, %) content of 37.0%. For the two parents, approximately 9.90 and 9.97 Gb of resequenced clean data was obtained from 16-PJ-3 and 0708, with resequencing coverage of 10.09× and 10.47×, respectively. For the offspring, an average of approximately 1.30 Gb of clean data (ranging from approximately 0.83 to 2.17 Gb) was obtained (Table 1). The clean reads were aligned to the C. bungei genome (Additional file 1). Clean reads aligned to multiple positions or no position in the reference genome were discarded. Consequently, 90.76% clean reads for the female parent and 93.74% clean reads for the male parent were obtained. For F1 individuals, an average of 94.79% clean reads were aligned to unique positions in the reference genome. All clean reads aligned to unique positions in the reference genome were kept for subsequent SNP calling and genotype determination.
SNPs and InDels were identified using the filtered clean reads with the Unified Genotyper. A total of 25,614,295 SNPs and 2,871,647 InDels were initially identified in 200 F1 individuals (Table 1). After removal of the markers with low quality and shallow sequencing depth, 712,786 polymorphic sites were retained and classified into eight segregation patterns. Then, we removed the SNP markers with abnormal bases, a low calling rate in progeny individuals and significant segregation distortion. Finally, 9593 polymorphic sites (including 9072 SNPs and 521 InDels) with four segregation patterns, namely, “nn × np” (3570), “hk × hk” (1119), “lm × ll” (4883) and “ef × eg” (21), were used for linkage analysis. The read counts of individual alleles at the 9593 polymorphic loci were showed in Additional file 2: Table S1.
Female and male maps were first constructed using the selected markers. In the female map (7080), a total of 6023 polymorphic markers fell into 20 linkage groups (LGs) with a 3440.02 cM total distance and a 0.57 cM average marker interval distance. LG2 was the largest LG with a total distance of 212.84 cM and 179 markers (Table 2). LG18 had 662 markers, the maximum number of markers among the 20 LGs. The average distance ranged from 0.22 (LG12) to 1.81 (LG20) cM. Among the 6003 gaps, no gap was less than 5 cM in length, and the length of the largest gap, which was located in LG2, was approximately 4.53 cM. In the male map, 4710 markers fell into 20 LGs, and the total genetic distance was 3226.28 cM, with an average marker interval distance of 0.73 cM. Among the 20 LGs, LG1 had the maximum number of markers (413), with a 173.48-cM total distance, and LG11 had the longest total distance (198.42 cM), with 119 SNPs. The average marker distance ranged from 0.37 (LG18) to 1.8 (LG10) cM. In the male map (16-PJ-3), the longest gap was 3.35 cM in LG11, and no gap was less than 5 cM in length (Table 3). Subsequently, the male and female maps were merged into an integrated map. The final map spanned 3151.63 cM and contained 9593 markers in 20 LGs. Among the 20 LGs, LG5 and LG14 were the longest and shortest groups, spanning 198.05 cM and 125.5 cM and containing 450 and 441 polymorphic sites, respectively. The average distance between markers was 0.32 cM, with a range from 0.16 cM (LG18) to 1.08 cM (LG10). All gaps were fewer than 5 cM in length, and the longest gap was 4.12 cM in LG2 (Table 3). Detailed information on the markers used for genetic map construction and the distances between adjacent markers are provided in Additional file 3: Table S2.
Haplotype analysis and heat maps are two effective methods for evaluating the quality of genetic maps . In our study, we constructed haplotype maps of the 20 LGs with polymorphic markers to reflect the recombination events of each offspring. The haplotype maps indicated that the missing marker ratio in the genetic map was 0.19%, suggesting high quality (Additional file 4: Figure S1 and Additional file 5: Table S3). Heat maps can reflect the recombination relationships between all the markers in the same LG. Heat maps of 20 LGs indicated that the adjacent markers in the linkage groups were strongly linked and became gradually less linked with increasing distance, suggesting a correct order of the markers in most LGs (Additional file 6: Figure S2). In addition, for most LGs, the Spearman correlation coefficient between the genetic and physical locations was 0.99, with an average physical coverage of 99%, suggesting a relatively high level of genetic collinearity (Additional file 7: Table S4).
Analysis of leaf and growth traits
A wide range of variation in the seven leaf traits and plant height at six time points was observed (Table 4). The leaves of “7080” were larger and wider than those of “16-PJ-3”, but the remaining five leaf traits did not show significant differences between the parents. In addition, “7080” grew faster than “16-PJ-3” at all six time points, which may be partly due to the higher efficiency of nitrogen utilization and distribution in the photosynthetic system, according to our previous study . The frequency distribution analysis (Fig. 1) indicated that most of the phenotypic values were generally normally distributed, which suggested that the phenotypic data could be used for further QTL analysis. The coefficient of variation (CV, %) in the seven leaf traits ranged from 11.18 to 25.61%. The leaf perimeter (LP) and leaf length (LL) had similar CVs: 17.93 and 17.03%, respectively. Plant height varied from 1.78 to 3.75 m on the 10th of October, and the CV at the six time points ranged from 10.02 to 14.42%. The plant heights on the 31st of July, 15th of August, 31st of August and 10th of October had very similar CVs (10.07, 10.15, 10.02 and 10.20%, respectively), which suggested only minor differences between these growth time points.
Correlation analysis of the seven leaf traits and plant height (10.10) showed that leaf traits have no correlations with plant height, suggesting that the two types of traits may develop independently of each other (Fig. 1). Similarly, the SPAD readings had no correlations with the other six leaf traits, which implied that the chlorophyll content of leaves may not significantly influence their development. LA exhibited strong positive correlations with leaf width (LW) (0.87) and moderate positive correlations with LL (0.69) and LP (0.68) and weak positive correlations with PL (0.31) but no correlations with L/W. Four leaf traits, LA, LL, LW and LP, had certain positive correlations with each other; PL had weak positive correlations with LA (0.31), LL (0.24), LW (0.35) and LP (0.29), suggesting a possible minor association between the development of petioles and leaves. In addition, no significant negative correlations were found, except for the L/W ratio and LW (−0.35).
QTL mapping of leaf and growth traits
A total of 33 QTLs for seven leaf traits and plant height at five time points were successfully identified using the integrated genetic map and the phenotypic data (Additional file 8: Figure S3 and Additional file 9: Figure S4). Twenty QTLs for leaf traits, including six LA associations, five LL associations, one LW association, one LP association, one L/W ratio association, four PL associations and two SPAD value associations, explained 2.33 to 16.51% of the phenotypic variation. Two QTLs, Q3–172 (LA) and Q17–84 (SPAD), exhibited over-dominance, while six QTLs, Q16–60 (LA), Q16–67 (LA), Q16–60 (LL), Q16–60 (LP), Q16–60 (L/W) and Q16–60 (PL), showed partial dominance. Among the 20 QTLs for leaf traits, Q16–60, Q16–67 and Q16–97 were mapped to chromosome 16, and Q16–60, which was identified for LA, LL, LP, the L/W ratio and PL, explained 5.16, 16.51, 14.0, and 6.21% of the phenotypic variation in LA, LL, LP and L/W with LOD scores of 3.28, 17.64, 5.47 and 4.27, respectively, and the high phenotypic variance explained (PVE) of LA and LL at Q16–60 suggested that this site may be highly associated with these two traits. In addition, four QTLs were detected on chromosome 19, three of which (Q19–106, Q19–116 and Q19–126) were associated with LA and another of which (Q19–137) was associated with SPAD value.
We also detected 13 QTLs for plant height at five time points, except for time point 7.31. The QTLs explained 5.81 (Q18–60) to 9.02% (Q18–73) of the phenotypic variation. Eight QTL sites (13 associations) were mapped to LG3 (1), 9 (1), 16 (2), 18 (2) and 19 (1), and eight QTLs were detected at more than one time point: Q9–1 (at time points 6.30 and 7.15), Q18–66 (at time points 8.15 and 8.31) and Q18–73 (at time points 6.30, 7.15, 8.15 and 8.31). Three of the 13 QTLs, Q9–1 (6.30), Q9–1 (7.15) and Q19–59 (7.15), showed over-dominance, and Q16–56 (8.15) showed partial dominance. Detailed information on the QTLs has been provided in Additional file 10: Table S5 and Additional file 11: Table S6.
Candidate gene prediction in a subset of QTLs
To further test the accuracy and usability of our genetic map, the leaf trait QTL Q16–60 and plant height QTLs Q18–66 and Q18–73 were used for gene prediction because they were identified at more than one time point or for more than one leaf trait (Fig. 2, Additional file 12: Table S7). Q18–66 was found as a QTL for plant height (8.15) and plant height (8.31). Similarly, Q18–73 was found as a QTL for plant height (6.30), plant height (7.15), plant height (8.15) and plant height (8.31). Although Q9–1 was also identified as a QTL at more than two time points (plant height 6.30 and 7.15), no genes were found between marker sca9_231994 and sca9_261968 in the physical map. The QTL region of Q16–60 was 0.45 cM in length, exhibited a physical distance of approximately 58 Kb and contained 5 putative predicted genes in the reference genome of C. bungei. Four of these genes were annotated in the Gene Ontology (GO) database, but none were identified in the Kyoto Encyclopedia of Genes and Genomes (KEGG) pathway database. GO annotation suggested that the genes were involved in DNA binding (evm.model.group5.473), cyclin regulation (evm.model.group5.471), and embryo development (evm.model.group5.475), among other processes.
The QTL regions of Q18–66 and Q18–73 were mapped to 0.320 and 0.359 cM, respectively, and their physical distances were approximately 213 Kb and 52 bp in the reference genome. The 52-bp sequence of Q18–73 was located in the coding region of one putative gene, evm.model.group7.1784, which may encode a cleavage and polyadenylation specificity factor 3 (CPSF) subunit 3-II isoform X3 according to its annotation. GO annotation suggested that this gene mainly participates in catalytic activity, mRNA processing, polar nucleus fusion, and protein binding, among other processes. Fifteen putative predicted genes were found in Q18–66. Except for one unknown gene, GO annotation suggested that the remaining 14 genes in Q18–66 mainly participate in flavonoid biosynthesis, DNA binding, and magnesium ion binding, among other processes. KEGG annotation suggested that two genes, namely, evm.model.group7.1478 and evm.model.group7.1479, mainly participate in glycolysis, carbon metabolism, and biosynthesis of amino acids, among other processes, suggesting possible functions in plant growth. Other genes in Q18–66 may be involved in anthocyanin biosynthesis, alpha-linolenic acid metabolism, aminobenzoate degradation, and the mRNA surveillance pathway, among others.
Using the RAD-seq strategy to identify molecular markers
Genetic maps have been used as an important tool to assist in plant breeding and to elucidate the genetic architecture of complex quantitative traits of interest to breeders. Genetic maps with a high marker density are essential for marker discovery and precise QTL location. RRGS methods, such as RAD-seq, genotyping-by-sequencing (GBS) and specific locus amplified fragment (SLAF) sequencing, have strongly facilitated molecular marker identification. For example, by using 3868 SNPs identified with GBS, Zhang et al. constructed a high-density genetic map of tree peony (Paeonia suffruticosa Andr.) with a much longer total genetic distance (13,175.5 cM) and shorter average marker interval distance (3.40 cM) than the genetic map containing 35 SSR markers with a 9.70 cM average marker interval and 338.2 cM total genetic distance obtained by Guo et al. for the same mapping population [40, 41]. A similar outcome was observed for genetic maps of groundnut, in which SSR-based genetic maps contained approximately 135 to 191 markers and an RRGS-based map contained 1685 SNPs . RAD-seq has proven to be an effective method for identifying large numbers of polymorphic sites in plants. For example, in a report on Vitis plant genetic map construction, RAD-seq identified 8,481,484 SNPs and 1,646,131 InDels in the parents and 176 F1 plants, 65,299 and 4832 of which, respectively, were used to construct a high-density genetic map spanning 3014 cM, with an average coverage of 99.83% and with 99.99% of gaps fewer than 5 cM in length . Using the same method, Guo et al. constructed a genetic map of peach (Prunus persica) that included 1310 SNPs spanning 454.2 cM with an average marker distance of 0.347 cM . This map was of much higher quality than maps constructed with SSRs or other markers [44, 45] because the former included many more markers and was of a larger scale. Although the process of RAD library construction is more complicated than double digest-RAD (dd-RAD), SLAF and GBS, longer genome fragments can be obtained for other studies . In total, 260.85 Gb RAD reads with an average length of nearly 290 bp were obtained from the 200 progeny, and these data could also be used for other studies. The GC content of the RAD sequences was approximately 37.11%, which is slightly different from that of the reference genome (34.12%); this difference may be due to the selection of restriction enzymes. We initially found 22,319,406 SNPs and 1,563,074 InDels according to the RAD sequences. For the SNPs, approximately 66.85% were transition-type, which is similar to the results for Taxodium distichum (64.52%) , Juglans regia (68.32%) , Corchorus capsularis (69.07%)  and other species. In addition, of the initial variants, more than 50% SNPs showed polymorphic between the parents, which suggested a high genetic diversity of C. bungei and provided various genetic and genomic information for further studies.
Analysis of the genetic map
We constructed a genetic map using an F1 population, which is usually used for genetic map construction in forest trees due to their long life cycle, of C. bungei × C. duclouxii. The final genetic map contained 9593 polymorphic markers and 20 LGs with an average of nearly 500 markers on each LG. Compared with the genetic maps of other woody species constructed using the RRGS strategy, the number of mapped markers was greater than those of Ziziphus jujube , Juglans regia  and Paeonia suffruticosa , but less than those of Taxodium distichum , Vitis , and Actinidia chinensis . The smaller number of mapped markers compared to a few previous studies may be partly due to the smaller depth of sequencing in the offspring individuals in our study, which may have led to the elimination of markers that did not satisfy the sequencing depth threshold. To further optimize our genetic map, more markers will be added in the future. The total genetic distance of the genetic map was 3151.63 cM, with a 0.32 cM average interval distance. Moreover, all gaps (distances between adjacent markers) were less than 5 cM, suggesting a high density. Increasing the number of individuals in the mapping population could effectively improve the resolution of the genetic map by detecting the homologous recombination of chromosomes with lower recombination rates during meiosis . Our results suggested that a mapping population with 200 individuals may be enough for the construction of a high-density genetic map. The markers on the 20 LGs were not evenly distributed in our study: a maximum of 996 markers was found on LG 18 and a minimum of 165 markers on LG 10. More than half of the LGs (11) contained fewer than 500 markers (fewer than the average level), which may indicate a lack of genetic information on these LGs. RRGS strategies using double enzyme digestion, such as dd-RAD and SLAF, could obtain markers that are more uniformly distributed, and these techniques could be considered in later studies.
QTL analysis of leaf and growth traits
Concerning leaf and growth trait variation, we found high repeatability for all measured traits. The repeatability ranged from 0.86 (L/W) to 0.94 (PL). Our results suggested that the phenotypic stability of the offspring clones was high. The high-quality genetic map developed in our study allowed for high-resolution QTL mapping, and a total of 16 QTLs for seven leaf traits were mapped, including Q16–60, which was identified as a QTL for LA, LL, LP, the L/W ratio and PL, for a total of five traits. According to the correlation analysis, most of the six leaf traits (LA, LL, LW, LP, L/W ratio and PL) were correlated with each other; however, no QTL associated with all six leaf traits was identified. Similar results can also be found in the QTL mapping of leaf or growth traits in other species [12, 52, 53]. This may be because these traits are complex quantitative characters that are controlled by multiple QTLs, and it is not easy to precisely identify all the QTLs for the target traits. Moreover, several previous studies have found that the results of QTL mapping for flag leaf size in crops could be influenced by the environment, which suggested QTL mapping in different environments could be used to enhance accuracy, and in fact, this strategy has been tried in other studies [54, 55]. In a recent study, Xia et al. mapped 42 QTLs for nine leaf traits in an F1 poplar population at three time points, and 9 of these QTLs were found at two or more time points. This repeatability at different time points suggests high QTL mapping confidence . However, in our study, only one time point was mapped; in a future study, QTLs for leaf traits will be mapped at multiple time points and locations to enhance QTL mapping accuracy. Leaf size is controlled by the interconnection between cell division and cell expansion . According to a previous study, changing the expression of cyclin B1;1 and other cell cycle-related genes significantly influences the cell division rate of leaves [57, 58], and a moderate increase in CYCD3 expression in Arabidopsis increases the cell number and LA [59, 60]. In addition, cyclin genes also influence leaf flatness and erectness [61, 62]. In our study, a possible cyclin gene (evm.model.group5.471) was found in Q16–60, which implied that cyclin-mediated cell division may participate in the formation of leaf traits.
In recent years, examination of the dynamic QTLs for plant growth characteristics in a continuous set of time points has been performed; for example, Du et al. mapped the growth traits of poplar at 12 successive time points and found that some QTLs were specific to one time point, while some QTLs were found at several continuous or discontinuous time points, which may be due to their specific functions in plant growth at different growth stages . In the present study, QTLs for plant height at five time points were mapped, and 8 QTLs were detected at more than one time point for a cumulative PVE of 61.4%. Thus, these loci were further examined to analyse the candidate genes. In our study, the inclusive composite interval mapping (ICIM) method was used to map the QTLs associated with leaf traits and plant height. Yang et al. mapped QTLs of growth traits of Taxodium ‘Zhongshansha’ and identified 5 common QTLs by the CIM and ICIM methods to ensure the reliability of the QTLs. In their study, the number of QTLs detected by the CIM method was only half that identified by the ICIM method, which may have been due to the higher detection power of the ICIM method . Similarly, Dodia et al. used the CIM and ICIM additive (ICIM-ADD) methods to map QTLs for stem rot disease resistance and plant architecture in groundnut and obtained different results: the QTLs obtained with the CIM method explained more phenotypic variance . Improper mapping methods may lead to false results, and different methods may influence QTL mapping results; for this reason, using more mapping methods may be an effective strategy for improving QTL mapping accuracy , and more mapping methods should be used to precisely identify QTLs.
By using QTL mapping, we identified two genomic regions of 213.76 Kb and 52 bp on Chr18 that contained QTLs for plant height. The 213.76-Kb genomic region with the flanking markers sca18_15484674 and sca18_15698439 (0.32 cM in the genetic map) had a PVE of ~ 6.6%. A total of 15 genes were identified in this region according to the genome annotation. The gene evm.model.group7.1464 located in this region is an orthologue of AT3G01610.1 in the genome of Arabidopsis thaliana. AT3G01610.1 encodes the cell division control protein 48 homologue C-like (CDC48C), which is involved in cell division, cell expansion, and cell differentiation, among other processes . CDC48 is reportedly involved in plant growth: Bae et al. suppressed the expression of the CDC48 homologue gene in tobacco by virus-induced gene silencing, and antisense RNA interference caused not only severe growth cessation in the shoot and leaf but also flower sterility . Another gene located in this region, evm.model.group7.1477, is an orthologue of AT1G76680.2 in A. thaliana, which encodes a 12-oxophytodienoate reductase that participates in jasmonic acid (JA) biosynthesis [66, 67]. Although a high level of JA inhibits the growth of plant stems , low concentrations of JA can promote cell expansion and shoot elongation [69, 70], which indicates that endogenous JA synthesis may influence plant height in our mapping population; however, this possibility requires further verification. Additionally, we found an AT5G04660.1 orthologue, namely, evm.model.group7.1470, which may encode a cytochrome P450 family member. Several cytochrome P450 family members have been shown to participate in plant growth. For example, EUI1, a cytochrome P450 monooxygenase cloned in rice, regulates internode elongation by modulating the gibberellin response [71, 72]. In addition, cytochrome P450 members also participate in brassinosteroid  and indole-3-acetic acid  biosynthesis and influence cell elongation or plant height [75, 76]. The 52-bp sequence in Q18–73 was located in evm.model.group7.1784 in the physical map, which may encode a possible cleavage and polyadenylation specificity factor 73 (CPSF73) homologue subunit. CPSF73 is an endonuclease that participates in small nuclear RNA (snRNA) 3′-end processing and plays roles in flower and embryo development . However, it is not clear whether it influences plant height.
To increase the precision of candidate gene prediction, transcriptomics or quantitative real-time PCR analyses can be used to identify RNA variants and the expression of genes within the mapped QTL regions. Other omics analyses, such as metabolomics, can also provide some information on the chemical compounds related to phenotypic variations. In future studies, these analyses will be used to further verify the candidate genes in our study.
A high-density genetic map of C. bungei × C. duclouxii was constructed using the RAD-seq strategy, with the help of which 20 and 13 QTLs were identified that were associated with leaf and growth traits, respectively, which explained moderate phenotypic variation. Our study has laid a foundation for molecular marker-assisted breeding in C. bungei. Moreover, the genome sequences we obtained have enriched the resources for the public to study the evolution and functional genomics of C. bungei. The candidate genes identified within the QTLs may be promising genes for regulating leaf traits and increasing plant growth in C. bungei and will be further studied.
Mapping population and DNA extraction
C. bungei “7080” (female parent, entire leaf, Additional file 13: Figure S5) is an excellent clone selected by Luoyang Academy of Agriculture and Forestry, Luoyang, Henan Province and we have got the permission to applicate “7080” as breeding material from Luoyang Academy of Agriculture and Forestry and cultivated this C. bungei clone in an artificial forest belonging to Research Institute of Forestry, Chinese Academy of Forestry in Luoyang (34.71°N, 112.54°E) in the year 2006. C. duclouxii Dode “16-PJ-3” (male parent, lobed leaf, Additional file 13: Figure S5) is a wild individual grown in Panjiang town, Guizhou, China (25.75°N, 103.83°E). In the year 2016, Dr. Wenjun Ma collected the pollen and shoots of “16-PJ-3” (the collection of “16-PJ-3” did not need any necessary permission after we consulting to the relevant department) and carried out the hybridization. Finally, a total of 681 F1 progenies were obtained by crossing “7080” and “16-PJ-3”. The seeds of F1 progeny were sown and grown into seedlings in the greenhouse of Chinese Academy of Forestry in 2017. In the year 2018, 200 randomly selected F1 individuals and the two parents were asexually propagated and planted in the experimental field of Luoyang Academy of Agricultural and Forestry Science (Luoyang, China, N 112.55°, E 34.71°). A randomized block design was applied, with two ramets per clone in each plot and 5 replicates. All the plant material collections in our study were complied with national guidelines. The field experiment we made were in accordance with local legislation. The voucher specimens were deposited in Research Institute of Forestry, Chinese Academy of Forestry. Dr. Wenjun Ma and Dr. Junhui Wang undertook the formal identification of the samples.
Young and healthy leaf samples (second or third leaves from the apex) of both parents and 200 F1 individuals were collected in June 2018. All leaf samples were frozen in liquid nitrogen immediately and stored at −80 °C. We extracted genomic DNA using a modified cetyltrimethylammonium bromide (CTAB) method [78, 79]. To eliminate residual RNA, all extracted DNA samples were treated with RNase (Takara, Shuzo, Otsu, Japan). Finally, DNA concentration and purity were determined by a NanoDrop 2000 UV-vis spectrophotometer and checked on 1% agarose gels.
RAD library construction and high-throughput sequencing
We prepared the 200 RAD libraries according to the RAD protocol  with minor modification. Briefly, 200 ng of qualified genomic DNA from each sample (200 F1 individuals) was fully digested by 20 U of restriction endonuclease EcoRI (New England Biolabs, Ipswich, MA, USA) at 37 °C in a 50 μl reaction mixture based on an evaluation of the reference genome and the result of a DNA digestion pre-experiment (Additional file 14: Figure S6). After digestion, barcoded P1 adapters were ligated to the EcoRI restriction site for each sample individually. Then, all the samples were sheared to an average size of 500 bp using a Bioruptor (Diagenode, Liège, Belgium). DNA fragments 350 to 450 bp in size were collected using 2% agarose gel for library construction. Thereafter, the fragments were blunt end repaired, and a 3′ adenine overhang was added to the sequences. Finally, a P2 adapter containing unique Illumina barcodes (San Diego, CA, USA) was added to each library. All libraries were amplified using PCR with high-fidelity thermostable DNA polymerase (New England Biolabs, Ipswich, MA, USA) and purified before sequencing. The resequencing libraries of the two parents, “7080” and “16-PJ-3”, were constructed at the same time. The RAD libraries and two resequencing libraries were sequenced using the Illumina HiSeq X Ten platform using 150-bp paired-end reads by Shanghai Major Biological Medicine Technology Co., Ltd.
SNP and InDel calling and genetic map construction
To ensure read quality for later analysis, all raw reads were filtered using Trimmomatic  to discard low-quality reads (quality score below 30), reads with more than 10% unidentified nucleotides and reads aligned to the adapter. Next, the clean data were analysed using a standard SNP and InDel calling pipeline. Briefly, all the clean reads were first mapped to the C. bungei reference genome using Burrows-Wheeler Aligner (BWA) software  with the setting of “mem -t 4-k 32-M”. To avoid false mapping results, only reads with a unique mapping position in the genome were sorted using SAMtools . Subsequently, variants were called and filtered using the Genome Analysis Toolkit (GATK) unified  using standard filtering parameters according to the GATK Best Practices pipeline . Then, the variants were more precisely filtered based on the following three strict criteria: (1) a mapping quality less than 37; (2) a quality depth less than 24; and (3) a sequence depth less than 10-fold (in parents) or 3-fold (in offspring). The screened variant markers were further divided into eight segregation patterns: “ab×cd” (four alleles), “nn × np” (two alleles and one parent heterozygous), “hk × hk” (two alleles and double heterozygous), “ef × eg” (three alleles and double heterozygous), “cc × ab” (three alleles and maternal homozygous), “aa×bb” (two alleles and double homozygous), “ab×cc” (three alleles and parental homozygous) and “lm × ll” (two alleles and maternal heterozygous). Finally, unqualified molecular markers were further removed based on the following three strict criteria: (1) abnormal bases; (2) a variant call rate (missing rate) less than 70%; and (3) significant segregation distortion (chi-square test, P value<0.05). The SNP calling parameters were determined after we overall considering the results of several parameter combinations to guarantee the us enough credible markers for later study (Additional file 15: Table S8).
Because an F1 population was used in our study, markers with the segregation patterns “ab×cd”, “ab×cc”, “cc × ab”, “ef × eg”, “nn × np”, “hk × hk”, and “lm × ll” were selected to construct the genetic map. All filtered markers were first divided into 20 groups according to their physical locations on the same chromosome, and the markers were then ordered using MSTmap software . Next, the SMOOTH algorithm  was used to correct the genotyping errors or deletions according to the relationship between the ordered markers. The genetic distance between markers was calculated using the Kosambi mapping function. Furthermore, a haplotype map and heat map analysis were used to evaluate the quality of the genetic map.
Phenotyping of leaf traits and dynamic plant height
The leaf trait parameters in the “7080 × 16-PJ-3” F1 population were measured on 2018/9/5. We chose the 3rd whorl of fully expanded leaves below the apex to detect the leaf traits (according to our previous study, the 3rd whorl of fully expanded leaves of C. bungei were mature functional leaves, and their characters were stable at the collection time chosen for our study). The chlorophyll content was measured five times at different positions on the surface of each leaf using a SPAD-502 Plus chlorophyll meter (Konica Minolta Holdings, Inc., Chiyoda-ku, Tokyo, Japan), and the average value was calculated to represent the chlorophyll content. After we measured the chlorophyll content, the leaves were harvested and scanned by a CI-203 Portable Laser Leaf Area Meter (CID Inc., Washington, USA) for a total of five leaf parameters: leaf length (LL), leaf width (LW), the leaf length/width (L/W) ratio, leaf area (LA) and leaf perimeter (LP). The petiole length (PL) was measured using a ruler.
The development of trees is a complex dynamic process that is regulated by both gene networks and the environment. A traditional mapping strategy using phenotypic data measured at only one time point may not exactly reflect the genetic control of developmental processes , so growth data at multiple time points were used to map QTLs. According to our previous observations, the growth of C. bungei in Luoyang usually started at the end of April, proceeded rapidly in early July, and finished at the end of September. The height of all plants was measured on 2018/6/30 (before the rapid growth period), 2018/7/15 (during the rapid growth period), 2018/7/31 (during the rapid growth period), 2018/8/15 (during the rapid growth period), 2018/8/31 (the end of the rapid growth period), and 2018/10/10 (the end of the growing season), for a total of six time points, which included the entire rapid growth period (from July to August), in a growth season.
We had two plants of each clone in each block, and the average values of all parameters in each block were calculated from the individual seedlings of the same clone and used as the trait data. The five blocks of plants were measured as five replicates. The correlations between different traits and frequency distribution of the F1 population were calculated using R software with the Pearson method . The repeatability of plant height and leaf traits was calculated using the R ASReml package . The phenotypic data were analysed by a one-way ANOVA to discriminate the seven leaf traits and six growth traits between “7080” and “16-PJ-3” using SPSS version 19.0 (SPSS Inc., Chicago, IL, USA).
QTL mapping and candidate gene selection
Quantitative trait locus (QTL) analysis was conducted using GACD software  with the inclusive composite interval mapping (ICIM) method . A significant logarithm of odds (LOD) threshold was determined using 1000 permutation tests (P < 0.05) for all traits. Finally, because the average LOD significance thresholds of both the seven leaf traits and the six growth traits were 3.0, a LOD significance threshold value of 3.0 with a 95% confidence interval was determined for the traits . The potential locations of the QTLs were described according to their LOD peak locations and their surrounding regions. Additive (a) and dominance (d) effects were calculated based on the formulation of Muchero , according to the computed results by GACD, which has been introduced in detail by Nzuki . The QTL mode of action was calculated as the ratio of dominance to the absolute additive value (d/|a|), where d/|a| ratios larger than 1 were regarded as over-dominance; ratios between 0 and 1 were regarded as partial dominance and ratios less than 1 were regarded as under-dominance . Information about the candidate genes in the mapped QTL regions was obtained according to the annotation of the reference genome.
Availability of data and materials
The clean data of 200 offspring individuals and parents has being uploaded to the NCBI SRA database (http://www.ncbi.nlm.nih.gov/sra), with the accession number: PRJNA551333, Other data that supporting the conclusions of the article have been uploaded as additional files.
Composite interval mapping;
Genotyping by sequencing
Inclusive composite interval mapping
Leaf length/leaf width
Logarithm of odds
Quantitative trait loci
Restriction-site associated DNA sequencing
Reduced-representation genome sequencing
Specific-locus amplified fragment sequencing
Single nucleotide polymorphisms
Soil and plant analyzer development
Olsen R, Kirkbride J. Taxonomic revision of the genus Catalpa (Bignoniaceae). Brittonia. 2017;69(3):387–421.
Jing D, Xia Y, Chen F, Wang Z, Zhang S, Wang J. Ectopic expression of a Catalpa bungei (Bignoniaceae) PISTILLATA homologue rescues the petal and stamen identities in Arabidopsis pi-1 mutant. Plant Sci. 2015;231:40–51.
Baraldi R, Chieco C, Neri L, Facini O, Rapparini F, Morrone L, et al. An integrated study on air mitigation potential of urban vegetation: from a multi-trait approach to modeling. Urban For Urban Green. 2019;41:127–38.
Li T, Wei D, Chen S, Cai Y, Wu Q, Ma Q. Research achievements and prospects of tree culture in China. J Fujian Forestry Sci and Tech. 2019;46(1):134–40.
Lu N, Mei F, Wang Z, Wang N, Xiao Y, Kong L, et al. Single-nucleotide polymorphisms (SNPs) in a sucrose synthase gene are associated with wood properties in Catalpa fargesii bur. BMC Genet. 2018;19(1):99.
Lu N, Ma W, Han D, Liu Y, Wang Z, Wang N, et al. Genome-wide analysis of the Catalpa bungei caffeic acid O-methyltransferase (COMT) gene family: identification and expression profiles in normal, tension, and opposite wood. PeerJ. 2019;7:e6520.
Xiao Y, Ma W, Lu N, Wang Z, Wang N, Zhai W, et al. Genetic Variation of Growth Traits and Genotype-by-Environment Interactions in Clones of Catalpa bungei and Catalpa fargesii f. duclouxii. Forests. 2019;10(1):57.
Liu W, Wang C, Shen X, Liang H, Wang Y, He Z, et al. Comparative transcriptome analysis highlights the hormone effects on somatic embryogenesis in Catalpa bungei. Plant Reprod. 2019;32(2):141–51.
Jia J, Wang J, Zhang J, Zhang S, Zhang JG, Zhao K. Interspecific Hybridization of Catalpa bungei and Catalpa fargesii f. duclouxii. Forest Res. 2010;23(3):382.
Xiao Y, Yi F, Han D, Lu N, Yang G, Zhao K, et al. Difference analysis of growth and nitrogen utilization and distribution in photosyntheic system of Catalpa bungei intraspecific and interspecific hybrids. Scientia Silvae Sci. 2019;55(5):55–64.
Zhang L, Zhang Z, Chen L, McNulty S. An investigation on the leaf accumulation-removal efficiency of atmospheric particulate matter for five urban plant species under different rainfall regimes. Atmos Environ. 2019;208:123–32.
Yang Y, Xuan L, Yu C, Wang Z, Xu J, Fa W, et al. High-density genetic map construction and quantitative trait loci identification for growth traits in (Taxodium distichum var. distichum× T. mucronatum)× T. mucronatum. BMC Plant Biol. 2018;18(1):263.
Samad S, Kurokura T, Koskela E, Toivainen T, Patel V, Mouhu K, et al. Additive QTLs on three chromosomes control flowering time in woodland strawberry (Fragaria vesca L). Horticulture Res. 2017;4:17020.
Jaganathan D, Thudi M, Kale S, Azam S, Roorkiwal M, Gaur P, et al. Genotyping-by-sequencing based intra-specific genetic map refines a “QTL-hotspot” region for drought tolerance in chickpea. Mol Gen Genomics. 2015;290(2):559–71.
Huang Z, Zhao N, Qin M, Xu A. Mapping of quantitative trait loci related to cold resistance in Brassica napus L. J Plant Physiol. 2018;231:147–54.
Xia W, Cao P, Zhang Y, Du K, Wang N. Construction of a high-density genetic map and its application for leaf shape QTL mapping in poplar. Planta. 2018;248(5):1173–85.
Du Q, Gong C, Wang Q, Zhou D, Yang H, Pan W, et al. Genetic architecture of growth traits in Populus revealed by integrated quantitative trait locus (QTL) analysis and association studies. New Phytol. 2016;209(3):1067–82.
Beltramo C, Valentini N, Portis E, Marinoni D, Boccacci P, Prando M. Genetic mapping and QTL analysis in European hazelnut (Corylus avellana L). Mol Breed. 2016;36(3):27.
Muchero W, Guo J, DiFazio S, Chen J, Ranjan P, Slavov G, et al. High-resolution genetic mapping of allelic variants associated with cell wall chemistry in Populus. BMC Genomics. 2015;16(1):24.
Caignard T, Delzon S, Bodénès C, Dencausse B, Kremer A. Heritability and genetic architecture of reproduction-related traits in a temperate oak species. Tree Genet Genomes. 2019;15(1):1.
Sun R, Chang Y, Yang F, Wang Y, Li H, Zhao Y, et al. A dense SNP genetic map constructed using restriction site-associated DNA sequencing enables detection of QTLs controlling apple fruit quality. BMC Genomics. 2015;16(1):747.
Nuñez-Lillo G, Balladares C, Pavez C, Urra C, Sanhueza D, Vendramin E, et al. High-density genetic map and QTL analysis of soluble solid content, maturity date, and mealiness in peach using genotyping by sequencing. Sci Hortic. 2019;257:108734.
Staub J, Serquen F, Gupta M. Genetic markers, map construction, and their application in plant breeding. HortScience. 1996;31(5):729–41.
Liu S, An Y, Li F, Li S, Liu L, Zhou Q, et al. Genome-wide identification of simple sequence repeats and development of polymorphic SSR markers for genetic studies in tea plant (Camellia sinensis). Mol Breed. 2018;38(5):59.
Chen Y, Zhang L, Qi J, Chen H, Tao A, Xu J, et al. Genetic linkage map construction for white jute (Corchorus capsularis L.) using SRAP, ISSR and RAPD markers. Plant Breed. 2014;133(6):777–81.
Chang Y, Oh E, Lee M, Kim H, Moon D, Song K. Construction of a genetic linkage map based on RAPD, AFLP, and SSR markers for tea plant (Camellia sinensis). Euphytica. 2017;213(8):190.
Marques C, Araujo J, Ferreira J, Whetten R, O’malley D, Liu B, Sederoff R. AFLP genetic maps of Eucalyptus globulus and E. tereticornis. Theor Appl Genet. 1998;96(6–7):727–37.
Arias M, Hernandez M, Remondegui N, Huvenaars K, Van Dijk P, Ritter E. First genetic linkage map of Taraxacum koksaghyz Rodin based on AFLP, SSR, COS and EST-SSR markers. Sci Rep. 2016;6:31031.
Khodaeiaminjan M, Kafkas S, Motalebipour E, Coban N. In silico polymorphic novel SSR marker development and the first SSR-based genetic linkage map in pistachio. Tree Genet Genomes. 2018;14(4):45.
Kirungu J, Deng Y, Cai X, Magwanga R, Zhou Z, Wang X, et al. Simple sequence repeat (SSR) genetic linkage map of D genome diploid cotton derived from an interspecific cross between Gossypium davidsonii and Gossypium klotzschianum. Int J Mol Sci. 2018;19(1):204.
Lin Z, Zhang X, Nie Y, He D, Wu M. Construction of a genetic linkage map for cotton based on SRAP. Chin Sci Bull. 2003;48(19):2064–8.
Webb A, Cottage A, Wood T, Khamassi K, Hobbs D, Gostkiewicz K, et al. A SNP-based consensus genetic map for synteny-based trait targeting in faba bean (Vicia faba L.). Plant Biotechnol J. 2016;14(1):177–85.
Srivastava R, Singh M, Bajaj D, Parida S. A high-resolution InDel (insertion–deletion) markers-anchored consensus genetic map identifies major QTLs governing pod number and seed yield in chickpea. Front Plant Sci. 2016;7:1362.
Du Q, Lu W, Quan M, Xiao L, Song F, Li P, et al. Genome-wide association studies to improve wood properties: challenges and prospects. Front Plant Sci. 2018;9:1.
Zhu J, Guo Y, Su K, Liu Z, Ren Z, Li K, et al. Construction of a highly saturated genetic map for Vitis by next-generation restriction site-associated DNA sequencing. BMC Plant Biol. 2018;18(1):347.
Mousavi M, Tong C, Liu F, Tao S, Wu J, Li H, et al. De novo SNP discovery and genetic linkage mapping in poplar using restriction site associated DNA and whole-genome sequencing technologies. BMC Genomics. 2016;17(1):656.26.
Guo F, Yu H, Tang Z, Jiang X, Wang L, Wang X, et al. Construction of a SNP-based high-density genetic map for pummelo using RAD sequencing. Tree Genet Genomes. 2015;11(1):2.
Zhao J, Jian J, Liu G, Wang J, Lin M, Ming Y, et al. Rapid SNP discovery and a RAD-based high-density linkage map in jujube (Ziziphus Mill). PLoS One. 2014;9(10):e109850.
Wang M, Xu Y, Wang W, Wu Z, Xing W, Zhang H. Quantitative trait locus (QTL) mapping of sugar yield-related traits in sugar beet (Beta vulgaris L.). Sugar Tech. 2019;21(1):135–44.
Zhang L, Guo D, Guo L, Guo Q, Wang H, Hou X. Construction of a high-density genetic map and QTLs mapping with GBS from the interspecific F1 population of P. ostii ‘Fengdan Bai’and P. suffruticosa ‘Xin Riyuejin’. Sci Hortic. 2019;246:190–200.
Guo Q, Guo L, Zhang L, Zhang LX, Ma H, Guo D, et al. Construction of a genetic linkage map in tree peony (Paeonia sect. Moutan) using simple sequence repeat (SSR) markers. Sci Hortic. 2017;219:294–301.
Dodia S, Joshi B, Gangurde S, Thirumalaisamy P, Mishra G, Narandrakumar D, et al. Genotyping-by-sequencing based genetic mapping reveals large number of epistatic interactions for stem rot resistance in groundnut. Theor Appl Genet. 2019;132(4):1001–16.
Guo S, Iqbal S, Ma R, Song J, Yu M, Gao Z. High-density genetic map construction and quantitative trait loci analysis of the stony hard phenotype in peach based on restriction-site associated DNA sequencing. BMC Genomics. 2018;19(1):612.
Blenda A, Verde I, Georgi L, Reighard G, Forrest S, Muñoz-Torres M, et al. Construction of a genetic linkage map and identification of molecular markers in peach rootstocks for response to peach tree short life syndrome. Tree Genet Genomes. 2007;3(4):341–50.
Shen Z, Confolent C, Lambert P, Poëssel J, Quilot-Turion B, Yu M, et al. Characterization and genetic mapping of a new blood-flesh trait controlled by the single dominant locus DBF in peach. Tree Genet Genomes. 2013;9(6):1435–46.
Andrews K, Good J, Miller M, Luikart G, Hohenlohe P. Harnessing the power of RADseq for ecological and evolutionary genomics. Nat Rev Genet. 2016;17(2):81.
Zhu Y, Yin Y, Yang K, Li J, Sang Y, Huang L, et al. Construction of a high-density genetic map using specific length amplified fragment markers and identification of a quantitative trait locus for anthracnose resistance in walnut (Juglans regia L). BMC Genomics. 2015;16(1):614.
Tao A, Huang L, Wu G, Afshar R, Qi J, Xu J, et al. High-density genetic map construction and QTLs identification for plant height in white jute (Corchorus capsularis L.) using specific locus amplified fragment (SLAF) sequencing. BMC Genomics. 2017;18:355.
Zhang Z, Wei T, Zhong Y, Li X, Huang J. Construction of a high-density genetic map of Ziziphus jujuba Mill. using genotyping by sequencing technology. Tree Genet Genomes. 2016;12(4):76.
Li S, Lv S, Yu K, Wang Z, Li Y, Ni X, et al. Construction of a high-density genetic map of tree peony (Paeonia suffruticosa Andr. Moutan) using restriction site associated DNA sequencing (RADseq) approach. Tree Genet Genomes. 2019;15(4):63.
Scaglione D, Fornasiero A, Pinto C, Cattonaro F, Spadotto A, Infante R, et al. A RAD-based linkage map of kiwifruit (Actinidia chinensis Pl.) as a tool to improve the genome assembly and to scan the genomic region of the gender determinant for the marker-assisted breeding. Tree Genet Genomes. 2015;11(6):115.
Hussain W, Baenziger P, Belamkar V, Guttieri M, Venegas J, Easterly A, et al. Genotyping-by-sequencing derived high-density linkage map and its application to QTL mapping of flag leaf traits in bread wheat. Sci Rep. 2017;7(1):16394.
Fan X, Cui F, Zhao C, Zhang W, Yang L, Zhao X, et al. QTLs for flag leaf size and their influence on yield-related traits in wheat (Triticum aestivum L.). Mol Breed. 2015;35(1):24.
Wei X, Wang X, Guo S, Zhou J, Shi Y, Wang H, et al. Epistatic and QTL×environment interaction effects on leaf area-associated traits in maize. Plant Breed. 2016;135(6):671–6.
Zhang B, Ye W, Ren D, Tian P, Peng Y, Gao Y, et al. Genetic analysis of flag leaf size and candidate genes determination of a major QTL for flag leaf width in rice. Rice. 2015;8(1):2.
Gonzalez N, Vanhaeren H, Inzé D. Leaf size control: complex coordination of cell division and expansion. Trends Plant Sci. 2012;17(6):332–40.
Lee B, Ko J, Lee S, Lee Y, Pak J, Kim J. The Arabidopsis GRF-INTERACTING FACTOR gene family performs an overlapping function in determining organ size as well as multiple developmental properties. Plant Physiol. 2009;151(2):655–68.
Eloy N, de Freitas LM, Van Damme D, Vanhaeren H, Gonzalez N, De Milde L, et al. The APC/C subunit 10 plays an essential role in cell proliferation during leaf development. Plant J. 2011;68(2):351–63.
Horiguchi G, Gonzalez N, Beemster G, Inzé D, Tsukaya H. Impact of segmental chromosomal duplications on leaf size in the grandifolia-D mutants of Arabidopsis thaliana. Plant J. 2009;60(1):122–33.
Horiguchi G, Tsukaya H. Organ size regulation in plants: insights from compensation. Front Plant Sci. 2011;2:24.
Baekelandt A, Pauwels L, Wang Z, Li N, De Milde L, Natran A, et al. Arabidopsis leaf flatness is regulated by PPD2 and NINJA through repression of CYCLIN D3 genes. Plant Physiol. 2018;178(1):217–32.
Sun S, Chen D, Li X, Qiao S, Shi C, Li C, et al. Brassinosteroid signaling regulates leaf erectness in Oryza sativa via the control of a specific U-type cyclin and cell proliferation. Dev Cell. 2015;34(2):220–8.
Du Q, Yang X, Xie J, Quan M, Xiao L, Lu W, et al. Time-specific and pleiotropic quantitative trait loci coordinately modulate stem growth in Populus. Plant Biotechnol J. 2019;17(3):608–24.
Park S, Rancour D, Bednarek S. In planta analysis of the cell cycle-dependent localization of AtCDC48A and its critical roles in cell division, expansion, and differentiation. Plant Physiol. 2008;148(1):246–58.
Bae H, Choi S, Yang S, Pai H, Kim W. Suppression of the ER-localized AAA ATPase NgCDC48 inhibits tobacco growth and development. Mol Cell. 2009;28(1):57–65.
Kim Y, Khan A, Waqas M, Jeong H, Kim D, Shin J, et al. Regulation of jasmonic acid biosynthesis by silicon application during physical injury to Oryza sativa L. J Plant Res. 2014;127(4):525–32.
Wang Y, Yuan G, Yuan S, Duan W, Wang P, Bai J, et al. TaOPR2 encodes a 12-oxo-phytodienoic acid reductase involved in the biosynthesis of jasmonic acid in wheat (Triticum aestivum L.). Biochem Biophys Res Commun. 2016;470(1):233–8.
Heinrich M, Hettenhausen C, Lange T, Wünsche H, Fang J, Baldwin I, et al. High levels of jasmonic acid antagonize the biosynthesis of gibberellins and inhibit the growth of Nicotiana attenuata stems. Plant J. 2013;73(4):591–606.
Ulloa R, Raíces M, MacIntosh G, Maldonado S, Téllez-Iñón M. Jasmonic acid affects plant morphology and calcium-dependent protein kinase expression and activity in Solanum tuberosum. Physiol Plant. 2002;115(3):417–27.
Zhang Z, Zhou W, Li H, Zhang G, Subrahmaniyan K, Yu J. Effect of jasmonic acid on in vitro explant growth and microtuberization in potato. Biol Plant. 2006;50(3):453–6.
Luo A, Qian Q, Yin H, Liu X, Yin C, Lan Y, et al. EUI1, encoding a putative cytochrome P450 monooxygenase, regulates internode elongation by modulating gibberellin responses in rice. Plant Cell Physiol. 2006;47(2):181–91.
Zhang Y, Zhang B, Yan D, Dong W, Yang W, Li Q, et al. Two Arabidopsis cytochrome P450 monooxygenases, CYP714A1 and CYP714A2, function redundantly in plant development through gibberellin deactivation. Plant J. 2011;67(2):342–53.
Hong Z, Ueguchi-Tanaka M, Umemura K, Uozu S, Fujioka S, Takatsuto S, et al. A rice brassinosteroid-deficient mutant, ebisu dwarf (d2), is caused by a loss of function of a new member of cytochrome P450. Plant Cell. 2003;15(12):2900–10.
Nafisi M, Sønderby I, Hansen B, Geu-Flores F, Nour-Eldin H, Nørholm M, et al. Cytochromes P450 in the biosynthesis of glucosinolates and indole alkaloids. Phytochem Rev. 2006;5(2–3):331–46.
Ramamoorthy R, Jiang S, Ramachandran S. Oryza sativa cytochrome P450 family member OsCYP96B4 reduces plant height in a transcript dosage dependent manner. PLoS One. 2011;6(11):e28069.
Chakrabarti M, Zhang N, Sauvage C, Muños S, Blanca J, Cañizares J, et al. A cytochrome P450 regulates a domestication trait in cultivated tomato. Proc Natl Acad Sci. 2013;110(42):17125–30.
Liu Y, Li S, Chen Y, Kimberlin A, Cahoon E, Yu B. snRNA 3’ end processing by a CPSF73-containing complex essential for development in Arabidopsis. PLoS Biol. 2016;14(10):e1002571.
Kabelka E, Franchino B, Francis M. Two loci from Lycopersicon hirsutum LA407 confer resistance to strains of Clavibacter michiganensis subsp. michiganensis. Phytopathology. 2002;92(5):504–10.
Wang P, Ma Y, Ma L, Li Y, Wang S, Li L, et al. Development and characterization of EST-SSR markers for Catalpa bungei (Bignoniaceae). Appl Plant Sci. 2016;4(4):1500117.
Baird N, Etter P, Atwood T, Currey M, Shiver A, Lewis Z, et al. Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS One. 2008;3(10):e3376.
Bolger A, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30(15):2114–20.
Li H, Durbin R. Fast and accurate short read alignment with burrows-wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25(16):2078–9.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The genome analysis toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20(9):1297–303.
Van der Auwera G, Carneiro M, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high-confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;43(1):11–0.
Wu Y, Bhat P, Close T, Lonardi S. Efficient and accurate construction of genetic linkage maps from the minimum spanning tree of a graph. PLoS Genet. 2008;4(10):e1000212.
van Os H, Stam P, Visser R, van Eck H. SMOOTH: a statistical method for successful removal of genotyping errors from high-density genetic linkage data. Theor Appl Genet. 2005;112(1):187–94.
Wu R, Lin M. Functional mapping—how to map and study the genetic architecture of dynamic complex traits. Nat Rev Genet. 2006;7(3):229.
Zhang M, Bo W, Xu F, Li H, Ye M, Jiang L, et al. The genetic architecture of shoot–root covariation during seedling emergence of a desert tree, Populus euphratica. Plant J. 2017;90(5):918–28.
Gilmour A, Gogel B, Cullis B, Thompson R, Butler D. ASReml user guide release 3.0. VSN International Ltd. UK: Hemel Hempstead; 2009.
Zhang L, Meng L, Wu W, Wang J. GACD: integrated software for genetic analysis in clonal F1 and double cross populations. J Hered. 2015;106(6):741–4.
Li H, Ribaut J, Li Z, Wang J. Inclusive composite interval mapping (ICIM) for digenic epistasis of quantitative traits in biparental populations. Theor Appl Genet. 2008;116(2):243–60.
Muchero W, Sewell M, Ranjan P, Gunter L, Tschaplinski T, Yin T, et al. Genome anchored QTLs for biomass productivity in hybrid Populus grown under contrasting environments. PLoS One. 2013;8(1):e54468.
Nzuki I, Katari M, Bredeson J, Masumba E, Kapinga F, Salum K, et al. QTL mapping for pest and disease resistance in cassava and coincidence of some QTL with introgression regions derived from Manihot glaziovii. Front Plant Sci. 2017;8:1168.
Hua J, Xing Y, Wu W, Xu C, Sun X, Yu S, et al. Single-locus heterotic effects and dominance by dominance interactions can adequately explain the genetic basis of heterosis in an elite rice hybrid. Proc Natl Acad Sci. 2003;100(5):2574–9.
We appreciate Luoyang Academy of Agriculture and Forestry for providing plant materials; We appreciate Guizhou Academy of Forestry for the help of plant material collection.
This work was financially supported by The National Key Research and Development Program of China (2017YFD0600201). The funders had no role in the study design, data analysis and interpretation, and manuscript writing, but just provided the financial.
Ethics approval and consent to participate
This article does not contain any studies with human participants or animals performed by any of the authors. All the plant materials used in this study were provided by Research Institute of Forestry, Chinese Academy of Forestry. The field experiments were conducted under local legislation and permissions.
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Brief information of C. bungei reference genome.
The read counts of individual alleles (including both parents and F1 alleles) at the 9,593 polymorphic loci.
Information for molecular markers and their locations in each LG of “7080”, “16-PJ-3” and the integrated map.
Haplotype maps of the integrated genetic map. Blue and green represent markers originating from the female (7080) and male (16-PJ-3) parent, respectively. Grey represents the “hk×hk” markers, and white represents missing data. Vertical axis indicates the markers in the LG, and horizontal axis indicates the individuals.
Percentages of missing data in each LG.
Heat maps of the integrated genetic map. Each cell represents the pairwise LOD scores of two markers.
Description of correlation coefficients between the genetic and physical positions of 20 LGs in the integrated genetic map.
QTL analysis of the seven leaf traits using the ICIM method in GACD. The x-axis indicates the map position (cM) in the 20 LGs, while the y-axis represents the LOD score. The horizontal line in the chart represents the LOD threshold.
QTL analysis of the plant growth traits at different time points using the ICIM method in GACD. The x-axis indicates the map position (cM) in the 20 LGs, while the y-axis represents the LOD score. The horizontal line in the chart is the LOD threshold.
Summary of QTLs for leaf traits mapped by ICIM.
Summary of QTLs for plant growth mapped by ICIM.
The protein sequences and annotation of genes identified in Q16-60, Q18-66 and Q18-73.
The leaf of “7080” (A) and “16-PJ-3” (B).
The gel electrophoresis of DNA digestion pre-experiment using TaqI, EcoRI and MseI. The “OS” stands for original DNA sample without enzyme treatment; Marker 1 and marker 2 are 1Kb DNA Ladder (TransGen Biotech, Beijing, China) and Marker I (Tiangen, Beijing, China), respectively.
The number of variants we obtained using different selecting parameter combinations.
About this article
Cite this article
Lu, N., Zhang, M., Xiao, Y. et al. Construction of a high-density genetic map and QTL mapping of leaf traits and plant growth in an interspecific F1 population of Catalpa bungei × Catalpa duclouxii Dode. BMC Plant Biol 19, 596 (2019). https://doi.org/10.1186/s12870-019-2207-y
- Catalpa bungei
- Genetic map
- QTL mapping
- Leaf traits
- Plant growth