Targeted amplicon sequencing + next-generation sequencing–based bulked segregant analysis identified genetic loci associated with preharvest sprouting tolerance in common buckwheat (Fagopyrum esculentum)
BMC Plant Biology volume 21, Article number: 18 (2021)
Common buckwheat (2n = 2x = 16) is an outcrossing pseudocereal whose seeds contain abundant nutrients and potential antioxidants. As these beneficial compounds are damaged by preharvest sprouting (PHS) and PHS is likely to increase with global warming, it is important to find efficient ways to develop new PHS-tolerant lines. However, genetic loci and selection markers associated with PHS in buckwheat have not been reported.
By next-generation sequencing (NGS) of whole-genome of parental lines, we developed a genome-wide set of 300 markers. By NGS- based bulked segregant analysis (NGS-BSA), we developed 100 markers linked to PHS tolerance. To confirm the effectiveness of marker development from NGS-BSA data, we developed 100 markers linked to the self-compatibility (SC) trait from previous NGS-BSA data. Using these markers, we developed genetic maps with AmpliSeq technology, which can quickly detect polymorphisms by amplicon-based multiplex targeted NGS, and performed quantitative trait locus (QTL) analysis for PHS tolerance in combination with NGS-BSA. QTL analysis detected two major and two minor QTLs for PHS tolerance in a segregating population developed from a cross between the PHS-tolerant ‘Kyukei 29’ and the self-compatible susceptible ‘Kyukei SC7’. We found different major and minor QTLs in other segregating populations developed from the PHS-tolerant lines ‘Kyukei 28’ and ‘NARO-FE-1’. Candidate markers linked to PHS developed by NGS-BSA were located near these QTL regions. We also investigated the effectiveness of markers linked to these QTLs for selection of PHS-tolerant lines among other segregating populations.
We efficiently developed genetic maps using a method combined with AmpliSeq technology and NGS-BSA, and detected QTLs associated with preharvest sprouting tolerance in common buckwheat. This is the first report to identify QTLs for PHS tolerance in buckwheat. Our marker development system will accelerate genetic research and breeding in common buckwheat.
Common buckwheat (Fagopyrum esculentum Moench; 2n = 2x = 16) is an outcrossing pseudocereal owing to heterostylous self-incompatibility (SI). It is widely grown in the temperate zones of the world. Buckwheat seeds contain health-promoting compounds with antioxidative, antihypertensive, and anti-obesity properties [1, 2], in addition to high levels of starch and high-quality protein with a well balanced amino acid profile [3, 4]. However, these beneficial compounds are strongly influenced by external effects; in particular, germination on the plant, called preharvest sprouting (PHS), severely degrades seed quality.
PHS often occurs under the humid and warm conditions common before harvest (Fig. 1). It degrades the pasting viscosity and quality of buckwheat flour, and decreases the total content of starch and crude fat [5,6,7]. Global warming might extend the range where PHS occurs . Therefore, improving PHS tolerance of buckwheat is a major breeding target worldwide.
We developed four PHS-tolerant cultivars/breeding lines—‘Harunoibuki’, ‘NARO-FE-1’ (NF1), ‘Kyukei 28’ (KY28), and ‘Kyukei 29’ (KY29)—by mass selection of low-PHS individuals [8, 9]. To clarify the inheritance of the PHS tolerance of these lines, we performed genetic analysis using segregating populations derived from crosses between KY28 or KY29 and the self-compatible PHS-susceptible ‘Kyukei SC7’ (KSC7) ; KSC7 was developed by the introduction of the self-compatibility (SC) allele of a wild relative, F. homotropicum . In the F2 progeny derived from KY28 × KSC7, the segregation pattern of the frequency of PHS suggested that several recessive genes regulate PHS tolerance in KY28. On the other hand, in the F2 progeny derived from KY29 × KSC7, the segregation pattern fitted the expected ratio of two dominant genes (15:1), suggesting that the PHS tolerance of KY29 is controlled by two major genes . Because buckwheat is an outcrossing plant, it is difficult to fix favorable traits such as PHS tolerance. Although marker-assisted selection (MAS) is an efficient way to do so , quantitative trait locus (QTL) and selection markers for PHS tolerance in buckwheat have not been reported until now.
Recently, we developed the buckwheat genome database (BGDB, http://buckwheat.kazusa.or.jp/)  to support genetic analysis and marker development in buckwheat . We developed codominant markers linked to the region flanking the gene for self-incompatibility/compatibility (SI/SC) by next-generation sequencing (NGS)-based bulked-segregant analysis (BSA) in F2 progeny . QTL-seq, which is used for whole-genome resequencing (WGS) with BSA, is a powerful tool for the rapid detection of QTLs . However, it needs a database of long scaffolds from which physical maps can be drawn. Unfortunately, the usable reference sequence in BGDB are short (N50 = 25.1 kb) , so it is necessary to construct a genetic linkage map for detecting QTLs.
To construct a genetic linkage map efficiently, we need a genome-wide marker set and an efficient genotyping system. The Ion AmpliSeq Targeted Sequencing technology (Thermo Fisher Scientific, Waltham, MA, USA) can quickly detect polymorphisms by amplicon-based multiplex targeted NGS [15, 16]. Here, we developed genetic maps with AmpliSeq and sought QTLs for PHS tolerance in buckwheat by NGS-BSA. We developed genome-wide markers for QTL analysis and detected several QTLs related to PHS tolerance. In addition, we developed linked markers and investigated the effect of selection with the markers. Furthermore, we demonstrated the effectiveness of NGS-BSA by developing linkage maps from AmpliSeq data of markers linked to the SC allele. Our findings and marker development system will be useful for advancing genetic research for buckwheat breeding.
Distribution frequencies of PHS in three populations
We investigated the distribution frequencies of PHS tolerance in parental lines and five F2 populations developed by self-pollination of each F1 plant derived from crosses between the highly PHS-tolerant lines KY29, KY28, and NF1 and the PHS-susceptible SC line KSC7 (Fig. 2; Additional files 1 and 2: Table S1, S2). The F2 populations derived from KY29 × KSC7 (A_1, A_2) and from KY28 × KSC7 (B_2) are already reported . We investigated those derived from KY28 × KSC7 (B_1) and NF1 × KSC7 (C) here (Additional file 1: Table S1). High rates of PHS tolerance in the progeny of cross A_1 suggest that major dominant tolerance genes in KY29 are involved (Fig. 2) . Low rates of tolerance in the progeny of cross B_1 suggest that major recessive tolerance genes in KY28 are involved (Fig. 2) . To see whether the newly developed PHS-tolerant line NF1 has a different pattern of inheritance, we developed cross C. Its F2 progeny showed mostly low tolerance to PHS, suggesting that major recessive tolerance genes in NF1 are involved, as in KY28 (Fig. 2). However, to determine which other genes are related to PHS tolerance in each population, QTL analysis is needed.
Development of marker sets to cover the whole genome and linkage markers
To develop genome-wide AmpliSeq markers, we first performed local BLAST searches of all microarray probe sequences of Yabe et al.  with BGDB genome data to find scaffolds with the matching sequences. Among all 1129 probe sequences, 1063 probes matched 387 scaffolds distributed in eight linkage groups (Additional file 3: Table S3). We used these 387 scaffolds as genome-wide scaffolds. From these, we selected 300 SNP sites as genome-wide SNPs for the AmpliSeq analysis (Additional file 4: Table S4).
To develop the PHS-linked marker set, we compared sequences between a high-PHS-tolerance bulk and a low-PHS-tolerance bulk of DNA of plants in F2 population of KY29 × KSC7 (A_1) and obtained 535 scaffolds with high values of the PHS-linked SNP index (≥0.700; Additional file 5: Table S5). Here, we selected 100 scaffolds and investigated the linkage relations.
To investigate the effect and certainty of NGS-BSA, we made an AmpliSeq marker set that is linked to the SC trait, Sh. We had already performed NGS-BSA at the locus and developed codominant markers, but we used only the top 50 candidate scaffolds . Here, we selected 100 scaffolds with high SNP-index in previously published WGS data of KSC7 (LH) and KY29 (pin)  and investigated the linkage relations.
Finally, we developed a custom panel of 500 markers which amplify 100 PHS-linked SNPs, 100 Sh-linked SNPs, and 300 genome-wide SNPs (Additional file 6: Table S6).
Construction of linkage maps by AmpliSeq in three F2 populations
Using this 500-marker set, we genotyped progeny derived from crosses A_1, B_1, and C by AmpliSeq. All genotyping data were filtered by R/qtl and linkage maps were constructed (Fig. 3). Unfortunately, several of the markers did not show SNPs and so could not be used for mapping. This may be because the BGDB reference genome contains a large number of unidentified nucleotide sequences ‘N’, which prevented elimination of the off-target hybridization in Ion AmpliSeq Designer (Additional file 10: Fig. S1). All detected polymorphisms and usable markers are summarized in Table 1. All three populations gave eight linkage groups (LGs) with a total map length of 550.1 cM in A_1, 408.3 cM in B_1, and 386.2 cM in C (Fig. 3).
The flower morphology (LH/pin) marker, labeled “Flower_type” on the maps, was located on LG1 (Fig. 3). As expected, most Sh-linked markers developed by NGS-BSA were mapped around it in all populations. As we omitted 45 Sh-linked markers as duplicates in cross A_1, 40 in B_1, and 38 in C during filtering (Table 1), the resultant Sh-linked markers were tightly linked to the Sh locus, indicating the efficiency and accuracy of NGS-BSA for a trait in buckwheat controlled by a single major gene.
The results of mapping with all markers and the Sh-linked markers suggest that the AmpliSeq marker development system functioned well and can be used for PHS analysis. Two-thirds of PHS-linked markers (32/48) were clustered and mapped on LG6 in A_1, and several markers were mapped separately on LG1, LG3, and LG4 in each population (Fig. 3; Table 1). Furthermore, the deletion of 25 of 32 PHS-linked markers as duplicates on LG6 in cross A during filtering suggests that the main genetic locus controlling the PHS tolerance of KY29 could be located on LG6.
QTL analysis for PHS tolerance
We performed QTL analysis for PHS tolerance by composite interval mapping (CIM). The thresholds of log-likelihood (LOD) significance (P < 0.05) were 3.9 in A_1, 3.2 in B_1, and 3.1 in C. In A_1, two major and two minor QTLs (LOD threshold ± ≥2.0) were detected (Table 2, Fig. 4), and explained 42.2% of phenotypic variation (Table 2). QTLs qPHS1_KY29, qPHS6_KY29, and qPHS4_KY29 confer tolerance with the KY29 genotype, and qPHS7_KY29 confers tolerance with the KSC7 genotype. qPHS6_KY29 had the largest effect (15.6%) and was strongly dominant. In B_1, four major and one minor QTLs were detected (Table 2, Fig. 4), and explained 66.4% of phenotypic variation. QTLs qPHS3_KY28 and qPHS4_KY28 confer tolerance with the KY28 genotype, and qPHS1_KY28, qPHS5_KY28, and qPHS8_KY28 confer tolerance with the KSC7 genotype. qPHS4_KY28 had the largest effect (22.0%) and appeared to be recessive. In C, two major and one minor QTLs were detected (Table 2, Fig. 4), and explained 38.0% of phenotypic variation. QTLs qPHS2_NF1 and qPHS3_NF1 confer tolerance with the NF1 genotype, and qPHS8_NF1 confers tolerance with the KSC7 genotype. qPHS3_NF1 and qPHS8_NF1 are located near the marker regions (WG_1572 and WG_1135) for qPHS3_KY28 and qPHS8_KY28 (Table 2).
Our PHS-linked markers in F2 progeny of A_1 were located on LGs 1, 3, 4, and 6 (Table 1). QTLs which provide PHS tolerance with the KY29 genotype were detected near these markers (Fig. 2, Table 2), except on LG3. In addition, many PHS-linked markers were clustered and mapped on LG6, and the QTL with the largest effect was detected on LG6 (qPHS6_KY29; Table 2) in A_1. These results show that the PHS-linked markers developed by NGS-BSA effectively detected genetic regions for PHS tolerance.
Confirmation of the effect of QTLs on PHS tolerance by development of sequence-tagged-site markers
To confirm the association between the major QTLs and germination rate, we converted the markers nearest to sequence-tagged-site (STS) markers and investigated the relations between their genotype and the germination rate in segregating populations of crosses A_1, A_2, B_1, and B_2 (Table 3). The markers nearest to qPHS1_KY29 (LG1) and qPHS6_KY29 (LG6) were PHS_17515 and WG_26859 (Fig. 3, Table 2); they were converted to STS markers qPHS1_KY29_PHS_17515 and qPHS6_KY29_WG_26859 (Additional file 7: Table S7). Alleles of each marker derived from KSC7 and KY29 were designated as A and B, respectively (Table 3). In A_1, plants homozygous for qPHS1_KY29_B and qPHS6_KY29_B had a lower germination rate than plants homozygous for qPHS1_KY29_A and qPHS6_KY29_A (Table 3). Although the effects in A_2 were not significant, the effect was similar (Table 3). In A_1, the average germination rate of plants heterozygous for qPHS6_KY29 was almost the same as that of plants homozygous for the KY29 allele, suggesting that the KY29 allele at qPHS6_KY29 was dominant (Table 3). On the other hand, the KY29 allele at qPHS1_KY29 was partially dominant (Table 3).
In B_1 and B_2, the markers nearest to qPHS1_KY28 (LG1), qPHS3_KY28 (LG3), qPHS4_KY28 (LG4), and qPHS8_KY28 (LG8) were converted to STS markers (Additional file 7: Table S7). Plants homozygous for the KY28 alleles at qPHS3_KY28 (qPHS3_KY28_B) had a lower germination rate than plants homozygous for the KSC7 alleles (qPHS3_KY28_A) both in B_1 and B_2. The KY28 alleles at qPHS4_KY28 (qPHS4_KY28_B) also had a lower germination rate than plants homozygous for the KSC7 (qPHS4_KY28_A) in B_1, but not in B_2 (Table 3). On the other hand, plants homozygous for qPHS8_KY28_B had a higher germination rate (Table 3). qPHS3_KY28 and qPHS8_KY28 decreased germination significantly in B_1 and B_2 (Table 3). In B_1, the average germination rates in plants heterozygous at qPHS3_KY28 and qPHS4_KY28 were slightly lower than those in plants homozygous for the KSC7 allele, suggesting that the KY28 alleles at those loci were recessive or partially dominant (Table 3).
Rapid construction of genetic maps in buckwheat
The development of genome databases makes it easy to do QTL analysis by BSA (QTL-seq) in major crops and model plant species, such as rice, barley, and Arabidopsis [18,19,20]. By WGS in buckwheat, we developed the BGDB , but its small scaffold size so far prevents its use in QTL-seq analysis. In addition, to use genetic maps for QTL analysis, the targeted trait has to segregate in segregating populations. Thus, the construction of genetic maps is still the first step for QTL analysis in buckwheat. However, the construction of maps that cover the whole genome requires much effort, especially in outcrossing plant species.
AmpliSeq technology is often used in cancer research  and is beginning to be used in agronomy. Sato et al.  developed a highly flexible and repeatable AmpliSeq-based genome-wide genotyping system for aquaculture studies to enhance population genetic studies and genome-wide association study. However, there is no report yet of whole-genome maps constructed using AmpliSeq in crops. Here, we developed a 300-marker genome-wide set by using WGS data of the parents of cross A_1 (KY29 and KSC7) in buckwheat. These markers could detect SNPs not only in A_1, but also in B_1 and C, which were located on 8 LGs in all three populations (Table 1), probably owing to the high nucleotide diversity within cultivated buckwheat (π = 0.0065) . This result suggests that this AmpliSeq-based genome-wide genotyping system allows the efficient and rapid construction of genetic linkage maps in buckwheat.
Furthermore, constructed maps and markers could be useful for QTL analysis of other traits, such as photoperiod sensitivity and flavonoid contents, although the traits should segregate in the segregating population. In addition, increasing the number of markers will be important for fine mapping and more detailed QTL analysis through the comparison of WGS data of SNP/indel information among lines and landraces.
NGS-BSA + AmpliSeq offers an efficient way to identify QTLs and to develop selection markers in common buckwheat
As the buckwheat draft genome sequence that we have developed is still divided into 387,594 scaffolds, it is impossible to show QTL peaks on a physical map. Here, we used NGS-BSA combined with AmpliSeq to identify QTLs for PHS efficiently and rapidly and to develop tightly linked markers.
The 48 PHS-linked markers developed by NGS-BSA were located on LGs 1, 3, 4, and 6 (Fig. 3, Table 1). Two-thirds of them (32/48) were clustered and mapped on LG6, including one linked to the QTL with the largest effect (qPHS6_KY29) in cross A_1 (Tables 1 and 2). Since PHS-linked markers were developed in the F2 progeny of cross A_1, we expected that they would be detected in regions derived from KY29 that contribute strongly to PHS resistance. QTLs that provide PHS tolerance with the KY29 genotype were detected on LGs 1 (qPHS1_KY29), 4 (qPHS4_KY29), and 6 (qPHS6_KY29) in cross A_1 (Table 2). On the other hand, a QTL that provides tolerance with the KSC7 genotype was detected on LG7 (qPHS7_KY29) (Table 2). Thus, QTL analysis is consistent with the results of NGS-BSA and indicates that AmpliSeq sequencing worked well to detect QTLs for PHS tolerance that originated from KY29. If we had used only NGS-BSA to develop linkage markers, we would not know the linkage relations of these markers and could not select efficient markers, because we could not select markers from each region linked to different QTLs.
We also developed 100 Sh-linked markers by NGS-BSA and investigated their genetic regions by mapping them on the same maps as used for PHS QTL analyses. All Sh-linked markers were mapped near the region of the floral morphology marker on LG1 in all three populations (Fig. 3, Table 1). This NGS-BSA targeting of one major locus indicates that the method is very effective for developing tightly linked markers. Thus, the combination of NGS-BSA with AmpliSeq is an efficient way to identify genetic regions for both qualitative and quantitative traits in common buckwheat.
Detection and origin of novel dominant and recessive QTLs for PHS tolerance
We previously analyzed the segregation of PHS tolerance in F2 populations derived from KY29 × KSC7 and KY28 × KSC7 and identified different modes of inheritance between KY29 and KY28 . As we expected, KY29 and KY28 have different QTLs that provide PHS tolerance (Fig. 3, Table 2). QTL analysis and association analysis suggested that the PHS tolerance in KY29 is controlled by dominant alleles at qHS6-KY29 and partially dominant alleles at qHS1-KY29, fitting our expectation . As KY29 was developed from a cross between ‘Kanoya-Zairai’ and ‘Hitachiakisoba’, its PHS tolerance alleles might originate from ‘Kanoya-Zairai’, a Japanese landrace with high tolerance .
On the other hand, the PHS tolerance in KY28 is controlled by recessive or partially dominant alleles at qPHS3_KY28 and qPHS4_KY28 (Tables 2 and 3). In crosses B_1 and C, two major QTLs were detected on LG3 (qPHS3_KY28 and qPHS3_NF1) and LG8 (qPHS8_KY28 and qPHS8_NF1). KY28 was developed from PHS-tolerant ‘Harunoibuki’  × ‘Hitachiakisoba’, and NF1 was developed from composite crosses among ‘Kitawasesoba’ , ‘Yaita-Zairai’, ‘Asahimura-Zairai 3’, ‘Hashikamiwase’, ‘Hitachiakisoba’, ‘Chushinkei VII’, ‘Kyukei 30’, and ‘Kyukei 10’. ‘Harunoibuki’ was developed by the mass selection of low-PHS individuals of ‘Hashikamiwase’ . So the PHS tolerance alleles of qPHS3_KY28 and qPHS3_NF1 might originate from ‘Hashikamiwase’.
We detected QTLs which provide PHS tolerance with the KSC7 genotype in three crosses (qPHS7_KY29, qPHS1_KY28, qPHS5_KY28, qPHS8_KY28, and qPHS8_NF1; Table 2). KSC7 have been developed from ‘Norin-PL1’  produced from F. esculentum × F. homotropicum; the latter species has strong seed dormancy . However, we selected lines with a low germination rate to avoid strong seed dormancy for acceleration of the generations during the development of KSC7. Genetic regions contributing strongly to low germination rate would have been eliminated during breeding, but some weak PHS tolerance genes might remain. These QTLs might be useful to improve PHS tolerance. Furthermore, to find new PHS tolerance alleles in F. homotropicum, it may be useful to develop lines with strong PHS tolerance, although undesirable traits such as shattering habit would have to be removed.
Use of PHS tolerance genes in breeding programs
Extensive work has identified genes or major dominant and recessive alleles for PHS tolerance in many plant species, such as wheat, barley, rice, sorghum, and Arabidopsis [28,29,30,31,32,33,34,35]. For example, QPhs.ocs-3A.1 (identified as MOTHER-OF-FT-AND-TFL1) in wheat, Sdr4 (Os07g0585700) in rice, and DELAY OF GERMINATION1 (AT5G45830) in Arabidopsis are dominant or semi-dominant [28,29,30]. On the other hand, Qsd1 (alanine aminotransferase) in barley, Qsd2-AK (mitogen-activated protein kinase kinase kinase: MKK3) in barley, and Phs1 (MKK3) in wheat are recessive [31, 36, 37]. In wheat and barley, QPhs.ocs-3A.1, Qsd1, Qsd2-AK, and Phs1 are available for MAS .
Using BGDB, we investigated whether there are candidate genes near the QTLs that we identified (Additional files 8 and 9: Tables S8, S9). All scaffolds where QTLs were located contained several open reading frames (ORFs), except Fes_sc0026859.1 (Additional file 8: Table S8). Some ORFs were annotated by BGDB (Additional file 9: Table S9), but no candidate gene seemed to be directly related to PHS. However, because we could not develop markers for all of the scaffolds in this experiment, other scaffolds may exist in the QTL vicinity. Further study will be needed to identify genes controlling each QTL and their function.
Because common buckwheat is an SI plant, dominant alleles are more useful for selecting favorable traits. On the other hand, recessive alleles also can be useful if the favorable genotype can be fixed efficiently by MAS. For example, marker qPHS3_KY28_WG_1572 may be able to select PHS-tolerant progeny efficiently so as to select the homozygous KY28 allele (Table 3). That means that pyramiding by MAS may be useful. This information and markers will be useful for accelerating genetic breeding to improve PHS tolerance in buckwheat.
We efficiently constructed linkage maps with AmpliSeq technology and performed QTL analysis for PHS tolerance in combination with NGS-BSA in common buckwheat. Candidate markers linked to PHS developed by NGS-BSA were located near these QTL regions. Furthermore, all candidate markers linked to the single genetic locus Sh developed by NGS-BSA were also mapped to near that locus. Thus, we have shown that the combination of NGS-BSA with AmpliSeq is an efficient way to identify genetic regions for both qualitative and quantitative traits in common buckwheat. The QTLs we detected here possess tolerance alleles with different genetic modes: dominant, partially dominant, and recessive. Those alleles could be efficiently selected for in breeding programs by MAS with the STS markers. This is the first report to identify QTLs for PHS tolerance in buckwheat. Our marker development system will accelerate genetic research and breeding in common buckwheat.
Buckwheat is a heteromorphic SI species with two types of floral architecture: thrum (short style) and pin (long style) . This SI system is controlled by a single genetic locus, S; thrum is heterozygous (Ss) and pin is homozygous recessive (ss). We developed SC buckwheat lines from an interspecific cross between common buckwheat, F. esculentum, and a self-compatible wild relative, F. homotropicum . The SC line has a long homostyle (LH) controlled by a single allele, Sh, in the dominance relationship S > Sh > s . For the development of segregating populations, we used the SC line ‘Kyukei SC7’ (KSC7), which was developed by our research group from the SC line ‘Norin-PL1’ .
We used three PHS-tolerant cultivars/breeding lines: ‘Kyukei 29’ (KY29), ‘Kyukei 28’ (KY28), and ‘NARO-FE-1’ (NF1); those were developed by our research group. KY29 and KY28 had significantly higher PHS tolerance than many leading cultivars in Japan and KSC7, and the evaluation is stable among them [5, 8, 24].
We developed five segregating populations (Additional file 1: Table S1) from independent crosses between five PHS-tolerant lines (KY29_1, KY29_2, KY28_1, KY28_2, and NF1) and KSC7. The segregation patterns of the PHS tolerance of crosses A_1 (KY29_1 × KSC7), A_2 (KY29_2 × KSC7), and B_2 (KY28_2 × KSC7) are reported in Hara et al. , and those of crosses B_1 (KY28_1 × KSC7) and C (NF1 × KSC7) are newly reported here. For linkage map construction and QTL analysis, segregating F2 populations derived from A_1, B_1, and C were used. For association testing of the nearest markers to each QTL, segregating F2 populations derived from A_1, A_2, B_1, and B_2 were used (Additional file 1: Table S1).
Evaluation of PHS tolerance
The F2 populations of crosses A_1 (n = 132), A_2 (n = 106), B_1 (n = 100), B_2 (n = 106), and C (n = 93) and each parental line were grown in a field of the Institute of Crop Science, NARO, Tsukuba, Japan, in 2016 (KY28, NF1, KSC7, A_1, A_2) and 2017 (KY28, KY29, NF1, B_1, B_2, C). Because PHS tolerance evaluated in the field is highly correlated with that evaluated in a Petri dish  and the value is stable among lines, we tested PHS tolerance in Petri dishes as described . In brief, 20 freshly collected seeds of each plant were immediately placed on filter paper saturated with distilled water in a Petri dish. The dishes were incubated in a germination cabinet at a constant 25 °C in the dark. Dishes were checked once a day for 7 days, and germinating seeds were counted and removed. Results are percentages. The significance of the differences among parental lines was tested with Tukey–Kramer’s test using the multcomp R package (Additional file 2: Table S2).
Marker development covering whole genome for AmpliSeq analysis
Genomic DNA was isolated from young leaves of each plant with a DNeasy Plant Mini Kit (Qiagen, Hilden, Germany). We developed genome-wide markers covering the whole genome, PHS-related markers (next section), and SC-related markers (next section). To develop genome-wide markers, we used the sequence information of the markers developed by Yabe et al.  on a high-density linkage map of DNA microarrays. We performed local BLAST searches of all microarray probe sequences used for constructing the linkage map as queries for the BGDB reference genome (FES_r1.0). The searches were performed by Galaxy BLASTN v. 2.7.1 with the default setting. After determining the scaffold number of each microarray probe, we evaluated these scaffolds as genome-wide scaffolds. From these and resequencing data between KY29 and KSC7 (parents of A_1), we selected 300 SNP sites.
Marker development by NGS-BSA for PHS and SI/SC
To develop PHS-linked markers, we used bulked DNA from 46 highly PHS-tolerant plants (0% PHS) and bulked DNA from 10 plants with low PHS tolerance (> 50% PHS) from the F2 population of cross A_1. Paired-end reads of 100 bp from the two bulks were obtained on an Illumina HiSeq-X System at Macrogen Japan (Kyoto, Japan; DDBJ/EMBL/NCBI accession number PRJDB9892). To investigate the effect and certainty of NGS-BSA, we made a set of markers linked to the SC allele Sh from previously published WGS data of KSC7 and KY29 (accession number DRX178921) .
Low-quality reads and adaptors (CACGACGCTCTTCCGATCT and ACCGCTCTTCCGATCTGTAA) were trimmed in Trimmomatic v. 0.32 software  with settings of HEADCROP 2, SLIDINGWINDOW 4:25, LEADING 25, TRAILING 25, and MINLEN: 40. Trimmed reads were mapped to the reference sequences in BWA v. 0.7.15 software  with the ‘bwa aln’ option (−l 32, −k 2, −n 5) and the ‘bwa sample’ option (default settings). Only genomic sequences  of ≥1 kb were selected as reference sequences. Mapping results were processed in SAMtools v. 0.1.18 software .
SNPs were detected with the UnifiedGenotyper tool in GATK v. 3.7 software  with the -glm BOTH option. PHS resistance–linked SNPs were detected with the following criteria: (1) SNPs between KSC7 and KY29 with the depth > 10; (2) heterozygous in high-PHS-tolerance bulked DNA; (3) homozygous with the same nucleotide as KSC7 in low-PHS-tolerance bulked DNA. The reference sequence in the BGDB is divided into 387,594 short scaffolds (N50 = 25.1 kb) , and it is difficult to depict changes in the number of high-PHS-tolerance–linked SNP sites through the scaffolds. Hence, we counted those SNPs in all reference sequences and calculated the ratio of the number of PHS-linked SNPs to the number of all SNPs as the “PHS-linked SNP index” for each scaffold. The top 100 markers of candidate genes in PHS and SI/SC were used for mapping. A custom panel targeting 500 regions (300 as the whole genome, 100 as PHS-linked, and 100 as Sh -linked) was designed based on the KSC7 custom reference genome using the Ion AmpliSeq Designer (https://ampliseq.com/help/startDesign.action)  version 6.0 using the standard DNA (125–275 bp amplicon target sizes) option.
Construction of AmpliSeq library, sequencing, variant detection, and genotyping
The AmpliSeq library was prepared with the Ion AmpliSeq Library Kit 2.0 and the IonCode Barcode Adapter 1–384 Kit (both from Thermo Fisher Scientific) as described in Ogiso-Tanaka et al. . Multiplex PCR amplification was performed in a total reaction volume of 5 μL manually or 4.8 μL robotically on a Mosquito HV instrument (TTP Labtech, Royston, UK) , with 10 ng of each genomic DNA (n = 100 in A and B crosses, n = 93 in C cross). The Ion Library Equalizer Kit was used to normalize the library concentration to 100 pM, and libraries were pooled and sequenced on an Ion Torrent S5 system (Thermo Fisher Scientific). Template preparation (emulsion PCR, enrichment of beads containing template, and chip loading) was performed with the Ion Chef instrument and Ion S5 Kit-Chef according to the manufacturer’s instructions. After the preparation of ion sphere particles, sequencing for 500 cycles was performed on an Ion Torrent S5 system using an Ion 540 Chip according to the manufacturer’s instructions. The sequence data were mapped to the KSC7 custom reference genome by the Ion Torrent Mapping Alignment Program v. 5.8.0 in Torrent Suite v. 5.8.0 software. Coverage analysis and variant detection were performed in Coverage Analysis v. 18.104.22.168 and variantCaller v. 22.214.171.124 software with default parameters (Germ Line with low stringency). All detected variants were listed in a hotspot VCF file. Variants at hotspot sites were detected in variantCaller with default parameters. Finally, the genotype file for the R/qtl package (http://www.rqtl.org/)  was obtained by converting from the output file of variantCaller in the IonBreeders ABH plugin .
Construction of genetic linkage map and QTL analysis
Before constructing a linkage map, we preprocessed data in R/qtl as follows: (1) Find the duplicate markers which show the same genotyping in all individuals except for missing value, select the markers with the fewest missing values, and drop duplicate markers. (2) Drop samples with < 80 genotypes. (3) Drop marker subsets in which < 90% of the population is genotyped. (4) Drop markers with an abnormal genotype distribution. The genetic map was constructed in AntMap v. 1.2 software . QTL analysis was performed in WinQTL Cartographer v. 2.5 software using the composite interval mapping (CIM) model . The significance threshold of the log-likelihood (LOD) score was based on 1000 permutations (P = 0.05).
Development of STS markers linked to QTLs
According to the resequencing data between KY29 and KSC7 or sequencing data from AmpliSeq, we converted QTL nearest AmpliSeq markers to STS markers. Amplification with genomic DNA as a template was performed with the designed specific primers by ExTaq (TaKaRa, Shiga, Japan) as follows: 32 cycles at 94 °C for 30 s, 58 °C for 30 s, and 72 °C for 30 s. Amplification was confirmed by agarose gel electrophoresis, and the DNA fragments were digested with restriction enzymes (AfaI, AluI, EcoRI, or MspI (all from TaKaRa)). The primer sequences and the combination of the restriction enzymes are listed in Additional file 7: Table S7.
Availability of data and materials
All data generated or analyzed during this study are included in the manuscript and its Additional files 1–10. The raw reads of next-generation sequencing-based bulked-segregant analysis obtained in this study is available from the DDBJ/EMBL/NCBI under the accession number PRJDB9892 (http://trace.ddbj.nig.ac.jp/BPSearch/bioproject?acc=PRJDB9892). The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.
Buckwheat genome database
Composite interval mapping
Open reading frame
Quantitative trait locus
Kreft I, Zhou M, Golob A, Germ M, Likar M, Dziedzic K, et al. Breeding buckwheat for nutritional quality. Breed Sci. 2020;70:67–73.
Matsui K, Walker AR. Biosynthesis and regulation of flavonoids in buckwheat. Breed Sci. 2020;70:74–84.
Bonafaccia G, Marocchini M, Kreft I. Composition and technological properties of the flour and bran from common and tartary buckwheat. Food Chem. 2003;80:9–15.
Suzuki T, Noda T, Morishita T, Ishiguro K, Otsuka S, Brunori A. Present status and future perspectives of breeding for buckwheat quality. Breed Sci. 2020;70:48–66.
Hara T, Matsui K, Noda T, Tetsuka T. Effects of preharvest sprouting on flour pasting viscosity in common buckwheat (Fagopyrum esculentum Moench). Plant Prod Sci. 2007;10:361–6.
Hara T, Sasaki T, Tetsuka T, Ikoma H, Kohyama K. Effects of sprouting on texture of cooked buckwheat (Fagopyrum esculentum Moench) noodles. Plant Prod Sci. 2009;12:492–6.
Gao L, Xia M, Li Z, Wang P, Wang M, Gao J. Changes of physicochemical properties and correlation analysis of common buckwheat starch during germination. Fagopyrum. 2019;36:43.
Hara T, Takeshima R, Matsui K. Genes with different modes of inheritance regulate seed germination in preharvest-sprouting-tolerant lines of buckwheat (Fagopyrum esculentum). Jpn Agric Res Q. 2020;54:137–43.
Hara T, Takahisa T, Matsui K. New buckwheat cultivar ‘Harunoibuki’. Bull NARO Kyushu Okinawa Agric Res Cent. 2012;58:37–47.
Matsui K, Yasui Y. Genetic and genomic research for the development of an efficient breeding system in heterostylous self-incompatible common buckwheat (Fagopyrum esculentum). Theor Appl Genet. 2020;133:1641–53.
Yasui Y, Hirakawa H, Ueno M, Matsui K, Katsube-Tanaka T, Yang SJ, et al. Assembly of the draft genome of buckwheat and its applications in identifying agronomically useful genes. DNA Res. 2016;23:215–24.
Yasui Y. History of the progressive development of genetic marker systems for common buckwheat. Breed Sci. 2020;70:13–8.
Matsui K, Mizuno N, Ueno M, Takeshima R, Yasui Y. Development of co-dominant markers linked to a hemizygous region that is related to the self-compatibility locus (S) in buckwheat (Fagopyrum esculentum). Breed Sci. 2020;70:112–7.
Takagi H, Abe A, Yoshida K, Kosugi S, Natsume S, Mitsuoka C, et al. QTL-seq: rapid mapping of quantitative trait loci in rice by whole genome resequencing of DNA from two bulked populations. Plant J. 2013;74:174–83.
Ogiso-Tanaka E, Shimizu T, Hajika M, Kaga A, Ishimoto M. Highly multiplexed AmpliSeq technology identifies novel variation of flowering time-related genes in soybean (Glycine max). DNA Res. 2019;26:243–60.
Tanaka T, Ishikawa G, Ogiso-Tanaka E, Yanagisawa T, Sato K. Development of genome-wide SNP markers for barley via reference- based RNA-Seq analysis. Front Plant Sci. 2019;10:1–9.
Yabe S, Hara T, Ueno M, Enoki H, Kimura T, Nishimura S, et al. Rapid genotyping with DNA micro-arrays for high-density linkage mapping and QTL mapping in common buckwheat (Fagopyrum esculentum Moench). Breed Sci. 2014;64:291–9.
Daware A, Das S, Srivastava R, Badoni S, Singh AK, Agarwal P, et al. An efficient strategy combining SSR markers- and advanced QTL-seq-driven QTL mapping unravels candidate genes regulating grain weight in rice. Front Plant Sci. 2016;7:1–17.
Zhang X, Wang W, Guo N, Zhang Y, Bu Y, Zhao J, et al. Combining QTL-seq and linkage mapping to fine map a wild soybean allele characteristic of greater plant height. BMC Genomics. 2018;19:226.
Liu T, Wang J, Wu C, Zhang Y, Zhang X, Li X, et al. Combined QTL-Seq and traditional linkage analysis to identify candidate genes for purple skin of radish fleshy taproots. Front Genet. 2019;10:1–11.
Tsongalis GJ, Peterson JD, de Abreu FB, Tunkey CD, Gallagher TL, Strausbaugh LD, et al. Routine use of the ion torrent AmpliSeq™ Cancer hotspot panel for identification of clinically actionable somatic mutations. Clin Chem Lab Med. 2014;52:707–14.
Sato M, Hosoya S, Yoshikawa S, Ohki S, Kobayashi Y, Itou T, et al. A highly flexible and repeatable genotyping method for aquaculture studies based on target amplicon sequencing using next-generation sequencing technology. Sci Rep. 2019;9:6904.
Mizuno N, Yasui Y. Gene flow signature in the S-allele region of cultivated buckwheat. BMC Plant Biol. 2019;19:125.
Hara T, Tetsuka T, Matsui K, Ikoma H, Sugimoto A, Shiratsuchi H. Evaluation of cultivar differences in preharvest sprouting of common buckwheat (Fagopyrum esculentum Moench). Plant Prod Sci. 2008;11:82–7.
Inuyama S, Honda Y, Furuyama S, Kimura M, Kasano H. The breeding and characteristics of a buckwheat cultivar, ‘Kitawasesoba’. Res Bull Hokkaido Natl Agric Exp Stn. 1994;159:1–10.
Matsui K, Tetsuka T, Hara T, Morishita T. Breeding and characterization of a new self-compatible common buckwheat parental line, “buckwheat Norin-PL1”. Bull Natl Agric Res Cent Kyushu Okinawa Reg. 2008;49:1–17.
Wang YJ, Campbell CG. Breaking dormancy in buckwheat 2000. Fagopyrum. 2000;17:45–50.
Bentsink L, Jowett J, Hanhart CJ, Koornneef M. Cloning of DOG1, a quantitative trait locus controlling seed dormancy in Arabidopsis. Proc Natl Acad Sci. 2006;103:17042–7.
Sugimoto K, Takeuchi Y, Ebana K, Miyao A, Hirochika H, Hara N, et al. Molecular cloning of Sdr4, a regulator involved in seed dormancy and domestication of rice. Proc Natl Acad Sci. 2010;107:5792–7.
Nakamura S, Abe F, Kawahigashi H, Nakazono K, Tagiri A, Matsumoto T, et al. A wheat homolog of MOTHER OF FT AND TFL1 acts in the regulation of germination. Plant Cell. 2011;23:3215–29.
Sato K, Yamane M, Yamaji N, Kanamori H, Tagiri A, Schwerdt JG, et al. Alanine aminotransferase controls seed dormancy in barley. Nat Commun. 2016;7:1–9.
Lee GA, Jeon YA, Lee HS, Hyun DY, Lee JR, Lee MC, et al. New genetic loci associated with preharvest sprouting and its evaluation based on the model equation in rice. Front Plant Sci. 2017;8:1–9.
Benech-Arnold RL, Rodríguez MV. Pre-harvest sprouting and grain dormancy in Sorghum bicolor: what have we learned? Front Plant Sci. 2018;9:1–8.
Nakamura S. Grain dormancy genes responsible for preventing pre-harvest sprouting in barley and wheat. Breed Sci. 2018;68:295–304.
Vetch JM, Stougaard RN, Martin JM, Giroux MJ. Review: revealing the genetic mechanisms of pre-harvest sprouting in hexaploid wheat (Triticum aestivum L.). Plant Sci. 2019;281:180–5.
Nakamura S, Pourkheirandish M, Morishige H, Kubo Y, Nakamura M, Ichimura K, et al. Mitogen-activated protein kinase kinase 3 regulates seed dormancy in barley. Curr Biol. 2016;26:775–81.
Torada A, Koike M, Ogawa T, Takenouchi Y, Tadamura K, Wu J, et al. A causal gene for seed dormancy on wheat chromosome 4A encodes a MAP kinase kinase. Curr Biol. 2016;26:782–7.
Garber RJ, Quisenberry KS. The inheritance of length of style in buckwheat. J Agric Res. 1927;34:181–3.
Woo SH, Adachi T, Jong SK, Campbell CG. Inheritance of self-compatibility and flower morphology in an inter-specific buckwheat hybrid. Can J Plant Sci. 1999;79:483–90.
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Li H, Durbin R. Fast and accurate short read alignment with burrows–wheeler transform. Bioinformatics. 2009;25:1754–60.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.
Ion AmpliSeq Designer. https://ampliseq.com/help/startDesign.action.
Ogiso-Tanaka E, Kaga A, Hajika M. Miniaturization technologies for cost-effective AmpliSeq library preparation for next generation sequencing. J Appl Microbiol Res. 2018;1:43–9.
Broman KW, Wu H, Sen S, Churchill GA. R/QTL: QTL mapping in experimental crosses. Bioinformatics. 2003;19:889–90.
Ogiso-Tanaka E, Yabe S, Tanaka T. IonBreeders: automated bioinformatics plugins toward genomics-assisted breeding. Breed Sci. 2020; https://github.com/DEMETER298/IonBreeders.
Iwata H, Ninomiya S. AntMap: constructing genetic linkage maps using an ant colony optimization algorithm. Breed Sci. 2006;56:371–7.
Wang SC, Basten J, Zeng Z-B. Windows QTL Cartographer 2.5. Raleigh: Department of Statistics, North Carolina State University; 2012. http://statgen.ncsu.edu/qtlcart/WQTLCart.htm.
We thank Toru Yoshioka and Mineko Aburano in NARO for growing buckwheat plants and supporting extracting DNA samples. Computations were partially performed on the NIG supercomputer at ROIS National Institute of Genetics.
This work was supported by NARO and partly supported by JSPS KAKENHI Grant Nos. 18 K14448 to RT and 18H02177 to YY and KM. The authors declare that the funding bodies had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
List of plant materials.
Ratio of pre-harvest sprouting of the parent lines.
Association with array marker sequences (Yabe et al., ) and BGDB scaffolds.
Selected 300 SNP sites between micro array marker sequences (Yabe et al., ) and BGDB scaffolds.
Identification of scaffolds linked to PHS tolerance based on SNP index.
Summary information for AmpliSeq custom panel.
Primer information for sequence-tagged-site markers linked to QTLs.
Scaffolds where QTLs are located and closest marker to each QTL.
Genes annotated near QTLs.
Box plots of on-target rate and uniformity. On-target rate is shown as percentage of aligned reads. Coverage uniformity score is shown as percentage of area covered at ≥0.2× mean coverage depth. Box plots show median and interquartile range; red, Cross A (n = 94); green, Cross B (n = 87); blue, Cross C (n = 84).
About this article
Cite this article
Takeshima, R., Ogiso-Tanaka, E., Yasui, Y. et al. Targeted amplicon sequencing + next-generation sequencing–based bulked segregant analysis identified genetic loci associated with preharvest sprouting tolerance in common buckwheat (Fagopyrum esculentum). BMC Plant Biol 21, 18 (2021). https://doi.org/10.1186/s12870-020-02790-w