- Research
- Open access
- Published:
A GWAS study highlights significant associations between a series of indels in a FLOWERING LOCUS T gene promoter and flowering time in white lupin (Lupinus albus L.)
BMC Plant Biology volume 24, Article number: 722 (2024)
Abstract
Background
White lupin (Lupinus albus L.) is a high-protein Old World grain legume with remarkable food and feed production interest. It is sown in autumn or early spring, depending on the local agroclimatic conditions. This study aimed to identify allelic variants associated with vernalization responsiveness, in order to improve our knowledge of legume flowering regulatory pathways and develop molecular selection tools for the desired phenology as required for current breeding and adaptation to the changing climate.
Results
Some 120 white lupin accessions originating from a wide range of environments of Europe, Africa, and Asia were phenotyped under field conditions in three environments with different intensities of vernalization, namely, a Mediterranean and a subcontinental climate sites of Italy under autumn sowing, and a suboceanic climate site of France under spring sowing. Two hundred sixty-two individual genotypes extracted from them were phenotyped in a greenhouse under long-day photoperiod without vernalization. Phenology data, and marker data generated by Diversity Arrays Technology sequencing (DArT-seq) and by PCR-based screening targeting published quantitative trait loci (QTLs) from linkage map and newly identified insertion/deletion polymorphisms in the promoter region of the FLOWERING LOCUS T homolog, LalbFTc1 gene (Lalb_Chr14g0364281), were subjected to a genome-wide association study (GWAS). Population structure followed differences in phenology and isolation by distance pattern. The GWAS highlighted numerous loci significantly associated with flowering time, including four LalbFTc1 gene promoter deletions: 2388 bp and 2126 bp deletions at the 5’ end, a 264 bp deletion in the middle and a 28 bp deletion at the 3’ end of the promoter. Besides LalbFTc1 deletions, this set contained DArT-seq markers that matched previously published major QTLs in chromosomes Lalb_Chr02, Lalb_Chr13 and Lalb_Chr16, and newly discovered QTLs in other chromosomes.
Conclusions
This study highlighted novel QTLs for flowering time and validated those already published, thereby providing novel evidence on the convergence of FTc1 gene functional evolution into the vernalization pathway in Old World lupin species. Moreover, this research provided the set of loci specific for extreme phenotypes (the earliest or the latest) awaiting further implementation in marker-assisted selection for spring- or winter sowing.
Background
White lupin (Lupinus albus L.) is a cool-season grain legume originating putatively from the eastern Mediterranean region, where wild Greacus-type accessions are still present [1, 2]. White lupin was primarily domesticated in ancient Greece and Egypt more than three thousand years ago [3]. Its cultivars feature high protein (38–42%) [4] and moderate oil (10–13%) seed content with favorable fatty acid composition for human consumption [5]. White lupin cultivation positively contributes to soil fertility through efficient mobilization of soil phosphorus and – like other legumes – symbiotic nitrogen fixation [6, 7].
White lupin may be cultivated as a spring-sown or an autumn-sown crop. Spring sowing is preferred in colder cropping areas of temperate climate, including also central Europe, whereas autumn sowing is carried out in warmer regions (Australia, the Mediterranean basin, and western Europe). A prolonged cold period during germination and juvenile growth, that induces vernalization, usually accelerates the transition from vegetative to generative growth phase in white lupin landraces, except those adapted to spring sowing and drought escape by rapid flowering [8,9,10]. A similar observation was done also for two other Old World lupin crop species, narrow-leafed and yellow lupins, where Palestinian accessions gained significant vernalization independence [11,12,13]. In the latter two species, abolition of vernalization requirements is conferred by large deletions in regulatory regions of one of the four FLOWERING LOCUS T (FT) homologs present in their genomes: LanFTc1 in the narrow-leafed lupin, and LlutFTc1 in the yellow lupin [11, 13, 14]. All three Old World lupin crop species delay flowering under short day photoperiod (as compared to long days). Nevertheless, this response is much more significant in vernalization-requiring accessions than in the thermoneutral genotypes [9, 11, 12, 15,16,17].
The effect of vernalization on growth and phenology in white lupin is proportional to the amount of cold received by a plant but only within a specific range of temperatures [18]. The lower limit is about 0 °C, whereas the upper is about 12–17 °C [18,19,20]. Effective vernalization in spring ecotypes occurs relatively quickly, after about 10 days with temperatures up to 12 °C. In contrast, winter types require at least two weeks (optimum 3–4 weeks) with reported temperatures from 1 to 6 °C [8, 9, 20,21,22]. The timing of flowering is also controlled by the total amount of temperature (growth degree days, GDDs) [22]. Depending on the growth stage, the base temperature is between 0 and 3 °C, with a consensus of 3 °C used for GDDs calculations [18, 19, 22,23,24]. Flowering occurs after about 500–1000 GDDs, depending on the sowing period (autumn vs. spring), ecotype (winter type or thermoneutral) and fulfillment of vernalization requirements [22]. Vernalization dependence is a key trait for white lupin adaptation to autumn sowing in areas with some freezing temperatures during winter, given the modest frost tolerance of this species [24, 25]. In these areas, moderate or high vernalization requirement prevents flowering before winter frost occurs. Moreover, a positive correlation exists between vernalization requirements and frost tolerance during winter [22]. Vernalization-dependent genotypes also offer some flexibility in selection autumn sowing dates before the onset of winter [18, 22]. However, vernalization dependence is unfavorable under spring sowing, where it unnecessarily delays the crop flowering and maturity in the absence of any risk of frost damage. Moreover, early phenology based on vernalization independence is also an essential component of a drought escape strategy [26,27,28], which breeders could exploited to cope with the observed substantial increase in warm-season droughts in white lupin cultivation areas. An ecological classification of white lupin genetic resources for European breeding programs highlighted the immense significance of the different phenological types [29]. The long-term shift in climate zones for agriculture is expected to move northward by 500 to 1200 km, creating new opportunities for crop cultivation and expanding acreage for autumn-sown crops [30]. Nevertheless, the higher temperature in winter may result in incomplete fulfillment of vernalization in autumn-sown crops, resulting in shift of floral induction date [31]. This issue can be compensated by developing white lupin cultivars with phenology adapted to local agroclimatic conditions, however, it requires improving our knowledge of heritable components of vernalization responsiveness present in germplasm collections as well as the development of tools for molecular selection of the desired (early, intermediate or late) phenology.
To facilitate studies on the genetic inheritance of domestication-related traits in white lupin, a recombinant inbred line (RIL) mapping population was developed from a cross between the early flowering vernalization-independent cultivar Kiev Mutant from Ukraine and the late flowering vernalization-responsive landrace P27174 from Ethiopia [32]. Quantitative trait loci (QTL) mapping in this RIL population revealed the presence of several QTLs for time to flowering [32,33,34,35]. Polymorphic loci from the genetic map were transformed into PCR-based markers and used for correlation analysis in germplasm collection carrying domesticated and wild accessions [36]. Although that study confirmed the significance of major QTLs from the RIL population, it highlighted the lack of loci associated with the most contrasting phenotypes (very early or very late). Recently, based on the constructed high-density consensus linkage map [34], two chromosome-scale genome sequences were established [37, 38], supplemented with a pangenome assembly carrying 40 accessions [39]. These resources have enabled us to conduct genome-wide association study (GWAS) addressing phenological diversity in white lupin germplasm collection carrying genotypes from three main climate zones. Recently, GWAS was used in white lupin to highlight significant SNPs associated with anthracnose resistance, seed alkaloids, protein content, yield under moist and dry conditions, and onset of flowering [28, 40, 41].
The study aimed to find genetic variants linked to vernalization independence or responsiveness for breeding and adaptation to climate change. The germplasm diversity panel included 262 white lupin genotypes that were subjected to genotyping by whole-genome Diversity Arrays Technology sequencing (DArT-seq) and PCR-based screening of published flowering time QTLs and novel insertion/deletion (indel) polymorphisms found in the regulatory region of white lupin FTc1 gene homolog, LalbFTc1. Obtained molecular data were used for association analysis encompassing greenhouse observations in Poland under ambient long-day photoperiod (spring sowing without vernalization) and field observations collected in three test environments representing diverse agroclimatic conditions and intensity of vernalization, i.e. Mediterranean and subcontinental climate regions in Italy (autumn sowing) and suboceanic climate in France (spring sowing).
Results
Phenotyping of the white lupin diversity panel revealed high variability in flowering time and vernalization-responsiveness
The white lupin diversity panel [10] originating from three main climate zones (tropical, subtropical and temperate) [42, 43] was evaluated in four environments: spring sowing in greenhouse with absolute lack of vernalization under long day photoperiod (Poznań) as well as in field conditions by autumn sowing with strong vernalization (Lodi), autumn sowing with moderate vernalization (Sanluri) and spring sowing with mild vernalization (Saint Sauvant).
Time from sowing to the start of flowering significantly differed between environments (Supplementary File S1). As expected, it was the longest in autumn sowing, ranging from 173 to 210 days in Lodi and from 89 to 170 days in Sanluri, followed by spring sowing in Saint Sauvant, ranging from 64 to 106 days, and spring sowing in a greenhouse in Poznań, ranging from 41 to 127 days in 2020 and from 41 to 121 days in 2021. As in greenhouse surveys we counted also time from sowing to bud emergence and to end of flowering, it was possible to compare broad sense heritability for these three traits in this environment. It reached the highest value for days to bud emergence (0.968) and days to start of flowering (0.967), followed by days to end of flowering (0.956). It should be noted that average standard deviation of flowering time within accessions in greenhouse reached 2.3 days, whereas for the whole dataset in this environment it was 15.9 days. It indicates that variability in flowering time between accessions in controlled environments was significantly higher than between genotypes within accessions. The correlation of flowering time between greenhouse and field-based phenotyping was the highest for spring sowing in Saint Sauvant (0.89), followed by autumn sowing in Sanluri (0.88) and in Lodi (0.60). All values were statistically significant, nevertheless lower value of correlation between greenhouse and Lodi reflects the influence of vernalization-responsiveness for flowering time in white lupin (see Discussion). Despite thermal differences between environments, the earliest genotypes in Lodi were also very early flowering in other environments, and similar consistency was found for very late genotypes. Indeed, fourteen genotypes from Turkey, three from Syria and one from Jordan revealed rapid flowering irrespective of the length and severity of vernalization. Total photoperiod (daylight) hours from sowing to the start of flowering (Supplementary File S2) ranged from 522.6 to 2247.9 h. It was the lowest in greenhouse (mean value 984.4 h) and the highest in Lodi (mean value 1741.8 h). This trait was highly correlated (as expected) with the number of days from sowing to the start of flowering in all environments (r value above 0.999). The cumulative number of growing degree days from sowing to start of flowering (GDDs) (Supplementary File S3) was the lowest in Saint Sauvant (517.5-1100.3), followed by Lodi (650.5–1048.0), Sanluri (816.0-1435.2), whereas the highest in greenhouse (725.7-2396.6 in 2020 and 698.1-2282.9 in 2021). Differences between environments in GDDs were the highest for vernalization-dependent accessions. Cumulative vernalization effectiveness of daily temperature (VF) significantly differed between environments, reaching at flowering stage 0 days in greenhouse (no pre-sowing vernalization), 18.23–18.67 days in Saint Sauvant, 18.11–55.73 days in Sanluri, and 117.27-121.97 days in Lodi. The number of accessions that initiated flowering before the end the vernalization period ranged from 18 in Saint Sauvant and 33 in Lodi to 158 in Sanluri (Supplementary File S4).
DArT-seq genotyping of white lupin diversity panel provided 6735 high-quality markers
DArT-seq protocol [44, 45] executed for 262 genotypes yielded 16,713 polymorphic presence/absence (dominant) markers (SilicoDArTs) and 11,506 single-nucleotide polymorphism (SNP) markers. SilicoDArT markers achieved an average read depth of 32.8 with an average reference genome alignment e-value of 4.2E-09. Taking into consideration SNP markers, the mean read count was 21.3 for the reference allele and 16.8 for the alternative allele, with an average reference genome alignment e-value of 1.4E-09, whereas the mean genotype call count per line was 10,096, with values ranging from 7266 (LAP106d) to 10,775 (LAP047b). After missing data imputation, duplicated loci removal and minor allele frequency (MAF) filtering, 4971 SNP and 1764 SilicoDArT markers were retained for further analysis (Supplementary File S5).
Screening of the white lupin diversity panel with PCR markers for QTLs from the linkage map identified rare LalbFTa1 gene indel
The analysis of fifteen PCR-based markers (Table 1), associated with QTLs for flowering time derived from the linkage mapping studies [35, 36], revealed the polymorphism in the set of 262 white lupin genotypes for all markers except TP47110. This marker yielded an alternative allele only in a control sample (P27174 landrace). Two other markers, FTa1-F2 and SKIP1-F2, revealed very low MAF values (1.5% and 3.0%, respectively). The FTa1-F2 marker recognizes a deletion localized in the third intron from one of the four Arabidopsis thaliana FT homologs present in the white lupin genome, LalbFTa1 gene (Lalb_Chr02g0156991). This marker co-segregated in mapping studies with one major QTL for flowering time in a RIL population descending from Ethiopian parent germplasm [34, 35]. In the present study, the Ethiopian LalbFTa1 allele was found only in four genotypes, three originating from Ethiopia (LAP079c, LAP084d and LAP084c) and one from Italy (LAP098a). These genotypes differed in phenology. Namely LAP079c was quite early flowering and low-responsive to vernalization, whereas the remaining showed delayed flowering without severe vernalization. Interestingly, other tested Ethiopian germplasm (6 genotypes) carried the reference LalbFTa1 allele and were rather early flowering in all studied environments. The other major QTL marker with low MAF value, SKIP1-F2, targets non-synonymous (H to P) SNP locus (A to C) present in the coding sequence of F-box protein SKIP1 (Lalb_Chr13g0300781). Eight accessions that carried this mutation originated from 6 countries (Egypt, France, Italy, Portugal, Turkey, and Jordan) and showed high variability in flowering time in a controlled environment (greenhouse) and Sanluri, suggesting a possible lack of association between SKIP1-F2 allele and vernalization-independent flowering. Agarose gel electrophoregrams showing polymorphism of PCR-based markers from flowering time QTLs are provided in Supplementary File S6.
White lupin pangenome alignment unveiled a series of indels in the LalbFTc1 gene promoter
Recent studies on the narrow-leafed and yellow lupin genomes highlighted the association between insertion-deletion (indel) polymorphism in the promoter region of FTc1 genes (LanFTc1 and LlutFTc1, respectively) and vernalization-independent early flowering [11,12,13,14]. Therefore, we investigated the potential structural polymorphism of the orthologous FTc1 region in white lupin. Multiple alignment of genome sequences derived from 40 white lupin accessions [39] revealed a complex pattern of indel polymorphism in the regulatory region of the LalbFTc1 gene, Lalb_Chr14g0364281 (Table 2). Considering cv. Amiga genome as a reference [38], seven deletions and two insertions were identified. Two short deletions (7 bp and 28 bp) were found in the region localized about 800 bp upstream of the transcription start site (TSS). Another two short and one longer deletions (5 bp, 24 bp and 264 bp) and one short insertion (25 bp), were identified in the region localized 3.9–4.9 kbp upstream of the TSS. Between these two regions, a remarkably long (3936 bp) insertion was found at ~ 2.5 kbp upstream of the TSS, raising a question about the length of the functional LalbFTc1 promoter to be considered in polymorphism analysis of genotypes carrying this insertion. In the most external region, localized 5.6-8.0 kbp upstream of the TSS, two overlapping large deletions (2388 bp and 2126 bp) were localized, having the same distal starting point (Table 2).
PCR-based screening evidenced the presence of several LalbFTc1 gene promoter indel variants in the white lupin diversity panel
The set of 18 PCR primer pairs was designed and successfully implemented for screening 262 white lupin genotypes towards indel polymorphism and selected SNP polymorphism in the regulatory region of the LalbFTc1 gene (Table 3). As some primer pairs generated more than two alleles corresponding to the different lengths of deletions, several scoring schemes were implemented for these products to maintain bi-allelic segregation in the GWAS dataset, and such markers were named with a, b, c and d suffixes (i.e., markers PR_58a-c, 35a-b, 36a-b, 42a-b and 71a-d). In total, 26 PCR-based markers targeting polymorphism in the LalbFTc1 gene promoter were analyzed in the white lupin diversity panel. Only one marker (PR_43) turned out to be monomorphic in the analyzed set of genotypes. However, another ten markers (PR_32, PR_34, PR_35a, PR_36a, PR_61, PR_37, PR_38, PR_40, PR_42b, PR_71c) revealed MAF values below 5%. Consequently, they were excluded from population structure analysis and further calculations. Moreover, two other markers, PR_30 and PR_31, revealed identical segregation pattern, so only PR_30 was retained. Observed patterns of polymorphism confirmed all indel variants extracted from pangenome sequence alignment. They also designated a novel large candidate (above 6 kbp) deletion in the LalbFTc1 gene promoter region of three Turkish genotypes LAP027a, LAP027b and LAP027c, highlighted by the lack of amplification of any PCR product from markers localized between about 1 and 8 kbp upstream of the TSS. The detected polymorphism for the studied PCR-based markers for LalbFTc1 gene promoter indels is visualized on sample agarose gel electrophoregrams in Supplementary File S7.
Population structure analysis of the white lupin diversity panel uncovered associations with phenology and geographic origin
In total 6765 markers, including 13 QTL-based PCR markers, 17 LalbFTc1 gene indel PCR markers, 4971 SNP and 1764 SilicoDArT markers were used for population structure, principal component analysis (PCA) and GWAS calculations (Supplementary File S5). The mean heterozygosity of markers was 4.7%, ranging from 0 to 61.8%. The heterozygosity of genotypes ranged from 0.5 to 16.2%. Within the investigated germplasm panel, 74 genotypes had heterozygosity below 1%, whereas 39 genotypes above 10%. This result was largely expected, as white lupin is a predominantly inbred crop with a cross-pollination rate of about 10% [48]. Analysis of cross-entropy graph for clustering within the range from K2 to K30 revealed two major inflection points at K = 5 and K = 12 and a cross-entropy minimum at K = 17 (Supplementary File S8). K-values within the range of 4–8 were screened for population structure analysis (Supplementary File S9).
Since the first tested K-value (K4), grouping started to follow differences in flowering time and geographical origin of accessions (Fig. 1). Two genotypes from Syria, seven from Lebanon, two from Israel and two from Jordan formed a cluster (No. 2) that was perfectly stable in all tested K-values. Genotypes within this cluster had very similar phenology, highlighted by low standard deviation of flowering time in all tested environments (0.6 to 4.1 days). The time to flowering of this cluster was close to the mean value observed in these environments. The separation of this cluster from other genotypes was also highlighted by the most distant position on the plot showing the first two principal components (Fig. 2).
Another relatively stable cluster (No. 4) was formed at K4 from a few genotypes originating from each of the East African resources (Ethiopia, Kenya and Sudan), Egypt, Maghreb (Morocco) and West Asia (Israel and Syria) as well as all France cultivars (both spring-type and winter-type) and one Italian genotype (Fig. 1). That cluster can be found on the distant right position on the PCA graph supporting its significant separation from the other groups (Fig. 2). This cluster revealed very high variability in flowering time, and, consequently, at higher K-values it divided into two clusters (No. 4 and 7) that differred by phenology.
A set of 26–27 genotypes from Egypt was initially grouped together with a large amount of Greek, Italian, and Turkish germplasm (cluster No. 3). However, as the analysis progressed, this association diminished at higher K-values, and the genotypes were ultimately divided into three clusters: the first consisted of Egyptian genotypes characterized by early phenology, the second one mostly comprised Italian genotypes and the third included late flowering, vernalization-responsive Greek germplasm, along with very rapid flowering and thermoneutral Turkish, Jordanian, and Syrian genotypes. (Fig. 1). These three clusters can be found at overlapping positions in the middle section of the PCA graph (Fig. 2).
Most Azorean genotypes grouped together with Madeira & Canaries germplasm across all analyzed K-values. This cluster (No. 6) revealed late or very late flowering time in all tested environments and localized at the left side of the PCA (Fig. 2). Spanish and Portuguese accessions formed a cluster with a few Maghreb (Morocco) and Italian genotypes. This group revealed intermediate flowering time with low variability between accessions. Five Ethiopian genotypes (from nine analyzed) clustered together across all tested K-values and revealed quite early flowering phenotype in all environments except Lodi.
Newly discovered LalbFTc1 indels revealed significant associations with flowering time in white lupin diversity panel
Based on the PCA and cross-entropy analysis, K = 5 was selected as the representative number of clusters for population structure in GWAS. Five environments (Greenhouse 2020, Greenhouse 2021, Lodi, Sanluri, and Saint Sauvant) and four traits counted from sowing to start of flowering were analyzed: the number of days, the cumulative number of growing degree days (GDDs), the total photoperiod hours and the cumulative vernalization effectiveness of daily temperature (VF). As the temperature in the greenhouse was always above the vernalization threshold, 18 environment × trait combinations were included in GWAS. The set of 195 markers revealed at least one marker-trait association with false discovery rate (FDR)-corrected P-value below the 0.05 threshold (Supplementary File S10), including 38 markers significantly associated in at least three environment × trait combinations (Table 4, Fig. 3). This set included, among others, four PCR markers recognizing LalbFTc1 promoter indels: 2388 bp and 2126 bp deletions at the 5’ end of the promoter (marker PR_58c), a 264 bp deletion in the middle region (marker PR_36b) and a 28 bp deletion at the 3’ end of the promoter (markers PR_42a and PR_71d) (Figs. 3 and 4).
Considering the number environment × trait combinations with significant associations, PR_36b and co-segregating PR_58c were among the most frequently associated markers in the whole dataset used for GWAS (together with the Chr25_4002891_D marker). PR_36b and PR_58c markers revealed high versatility across different agroclimatic conditions, highlighted by significant associations with GDDs and total photoperiod hours in all tested environments and with days to flowering in all environments except Lodi. The latter two LalbFTc1 indel PCR markers revealed significant associations with days to flowering, GDDs, and total photoperiod hours in Greenhouse 2020 and Lodi (PR_42a) or only in Lodi (PR_71d). In a given scoring scheme (Table 3), alternative alleles (shorter products) were always selective for earliness (Fig. 5). It should be noted that these markers differed from each other by the frequency of alternative alleles, which ranged from 8.2% (PR_42a), through 21.0% (PR_71d) to 79.4% (PR_36b and PR_58c) (Table 4). Therefore, these four markers could constitute a ready-to-use mini array for future PCR-based selection towards early or late flowering in white lupin breeding.
To survey potential indel polymorphism in promoter regions of the remaining three FT homologs present in the white lupin genome, markers for LalbFTa1, LalbFTa2, and LalbFTc2 genes were analyzed in a sub-population of 86 accessions representing observed diversity of flowering time in the studied diversity panel. From 43 primer pairs tested, 17 yielded polymorphic products: 5 for LalbFTa1, 9 for LalbFTa2, and 3 for LalbFTc2 (Supplementary File S11). Seven markers (2 for LalbFTa1 and 5 for LalbFTa2) had MAF values above 0.05. Polymorphic markers revealed remarkably less significant correlations with flowering time (r values between − 0.25 and 0.29) than four LalbFTc1 markers (PR_36b, PR_42a, PR_58c, and PR_71d), highlighted by GWAS as significantly associated (r values between − 0.68 and − 0.45 in the same germplasm panel) (Supplementary File S12).
GWAS confirmed the significance of three major flowering time QTL regions reported by linkage mapping studies
The 38 highly associated markers (Table 4) also encompassed seven DArT-seq markers originating from the chromosome segments overlapping with three major QTL regions identified by the linkage mapping studies [34, 35]. Namely, the Chr02_2625514_D and Chr02_2625564_D markers corresponding to a QTL localized on the linkage group ALB02 at 2.2 cM, the Chr13_12561729_D and Chr13_13913452_D markers matching a QTL from the linkage group ALB13 at 96.2 or 99.3 cM, and the Chr16_366800, Chr16_572706, and Chr16_788665 markers tagging a QTL from the linkage group ALB16 at 0.2 or 2.2 cM (Figs. 3 and 4). Alternative alleles of all these markers were selective towards late flowering, whereas heterozygotes revealed an intermediate phenotypic effect (Fig. 6). In both years, the highest number of environment × trait combinations with significant associations were revealed for the greenhouse. Nevertheless, two markers, Chr13_12561729_D and Chr16_366800, revealed significant associations with at least one trait in all studied environments, whereas the Chr16_572706 marker was particularly selective to Lodi and Chr02_2625514_D to Saint Sauvant. Markers Chr16_366800, Chr02_2625564_D and Chr13_12561729_D were the fourth, fifth, and sixth most frequently associated markers in this GWAS study (Fig. 3). On the other hand, PCR-based markers from the linkage map (Table 1), designed to track these QTLs, did not reveal significant associations with studied traits.
GWAS highlighted novel candidate regulatory regions for flowering time in white lupin
The remaining 27 highly associated markers highlighted novel loci localized on 16 chromosomes (Table 4). The most remarkable in this subset was the Chr25_4002891_D marker, because it was, together with the PR_36b and PR_58c markers, the most frequently associated marker in this study, showing significant associations with at least two traits in every environment (Fig. 3, Supplementary File S10). Moreover, this marker was remarkably associated with vernalization progress (VF) in Sanluri (FDR-corrected P-value 8.7E-34), revealing a very strong negative effect of an alternative allele (-5.1) as compared to other markers (Fig. 3). Six other markers may also attract special attention, as they were very selective in particular environments: Chr04_2175652 and Chr11_18778542 in Greenhouse, Chr06_14434379 in Lodi, Chr08_3090141_D in Sanluri, Chr08_3090075_D and Chr10_13080319 in Saint Sauvant. The high selectivity of these markers in particular environments was reflected by extreme values of allelic effects. Markers Chr04_2175652, Chr10_13080319 and Chr11_18778542 revealed maximum values of an alternative allele effect for at least two environment × trait combinations in the whole dataset, whereas markers Chr06_14434379, Chr08_3090075_D, Chr08_3090141_D – the minimum values (Supplementary File S10, Fig. 7).
Rapid linkage disequilibrium decay
Five genome regions (around 2–4 Mbp) carrying significantly associated markers, localized in chromosomes Lalb_Chr02, Lalb_Chr08, Lalb_Chr13, Lalb_Chr14 (Lalb_FTc1 gene) and Lalb_Chr16, were subjected to linkage disequilibrium (LD) analysis. This analysis revealed the presence of an LD block in the promoter region of the Lalb_FTc1 gene, carrying, among others, PR_36b, PR_42a, PR_58c and PR_71d PCR indel markers (Fig. 8A). Nevertheless, associations quickly diminished with distance between sites in distal part of the promoter and in the coding sequence. For regions localized in chromosomes Lalb_Chr02, Lalb_Chr08 and Lalb_Chr16 only very small LD blocks carrying 3 loci were identified in the proximity of markers highlighted by GWAS as significant (Fig. 8B-D), whereas for the region in chromosome Lalb_Chr13 no LD block was found (Fig. 8E).
Discussion
Germplasm resources for improvement of white lupin adaptation towards higher latitudes
The population structure analysis revealed a germplasm separation originating from geographically isolated regions or belonging to different phenological types (spring- or winter). This finding agrees with the results of previous phylogenetic studies that grouped white lupin accessions into several clades, highlighting the high diversity of graecus-type accessions and the genetic distinctiveness of some Ethiopian landraces that putatively evolved in high isolation [39, 49, 50]. In general, the grouping pattern in our study agreed with the existing variability in flowering time and vernalization responsiveness. We could identify at least four different clusters providing sources of early flowering germplasm for spring-sowing breeding, including one group carrying several rapid flowering genotypes from Turkey, Syria and Jordan that outperformed current French spring-type cultivars by 10–20 days. This finding has high perspective interest in white lupin adaptation to the expected global northward shift of agricultural climate zones [30] and the current emphasis on breeding of legume plants adapted to spring sowing in high latitudes [51]. As reported previously, autumn-sown genotypes with strong vernalization requirement (winter type) formed a separate clade [39]. The difference in mean flowering time between the two most extreme clades was remarkable, reaching 67 days for Sanluri (autumn sowing with moderate vernalization) and 61 days for the greenhouse experiment (spring sowing without vernalization). Plants in the greenhouse were cultivated under temperatures not lower than 18 °C, i.e. at least several degrees above the vernalization threshold which is about 12 °C [20]. In contrast, in Lodi the mean daily temperature from October to February lowered to 5.7 °C, resulting in effective vernalization for about four months (VF value of 121.97 days). Vernalization conditions in Sanluri and Saint Sauvant were only partially fulfilled, due to higher mean temperature in the juvenile growth phase (12.6 °C and 10.7 °C, respectively) and a lower number of days with vernalization-enabling conditions, resulting in VF values of 55.73 and 18.67 days, respectively. The numbers of GDDs from sowing to flowering observed in Lodi and Sanluri matched the range reported for white lupin [18, 22] whereas in Saint Sauvant and both greenhouse experiments were significantly higher, especially for late flowering genotypes. It highlighted the great vernalization requirements of those accessions. Four significantly associated SNPs identified in our study (Chr01_5255422, Chr01_5921843, Chr13_1140246 and Chr13_1469866) originate from similar genome regions as two of seven SNPs significantly associated with days to flowering during drought tolerance experiment in Lodi [28]. Interestingly, plants in the reported drought study were sown in mid-February, allowing partial vernalization, whereas one of those SNPs, Chr13_1140246, in our experiment, was significantly associated with cumulative vernalization effectiveness of daily temperature until flowering in Lodi. Both observations emphasize the vernalization-responsive component of the white lupin flowering regulatory network.
On the other hand, numerous genotypes with intermediate phenology revealed in this study suggests a complex genetic architecture of flowering time control in white lupin. This situation differs from that described for narrow-leafed lupin, where the number of intermediate genotypes is very low, and just a few key mutations controlling flowering time were identified hitherto [12,13,14, 52]. The low number of germplasm accessions with early flowering that were available for narrow-leafed lupin breeding in the mid-1960s (just two) caused a significant genetic bottleneck that resulted in a large loss of genetic diversity in the domesticated germplasm [53,54,55]. The identification of untapped germplasm resources with early phenology from different phylogenetic clades could prevent such as genetic erosion in white lupin breeding. The third Old World cultivated lupin species, Lupinus luteus, revealed a genetic architecture of flowering time similar to white lupin, with a continuous trait distribution and several mutations associated with extreme phenology (early or late) identified and supplemented with PCR markers for molecular selection [11]. The present study provided four PCR indel markers enabling selection towards earliness or vernalization responsiveness, along with several candidate SNP markers that await further implementation.
Indel polymorphism in FT regulatory regions and environmental adaptation
Our study revealed a significant association between flowering time and indel polymorphism in three regions in the LalbFTc1 promoter, the first located from − 7996 bp to -5609 bp in relation to the TSS, the second from − 4408 bp to -4145 bp, and the third from − 780 bp to -753 bp. LalbFTc1 in one of the four FT homologs present in the white lupin genome [11]. Research on the model plant, Arabidopsis thaliana, evidenced that FT is a key component of the flowering regulatory network that integrates signals from various pathways, including those related to ambient temperature, vernalization and photoperiod [56,57,58]. The A. thaliana genome encodes only two FT-like genes, FT and a one close homolog, TWIN SISTER OF FT (TSF), that are partially sub-functionalized into photoperiod signaling (long vs. short day) and ambient temperature response [59, 60]. Regulation of FT expression is performed at two major levels: epigenetic (mainly by histone modifications) and genetic (by transcription factor binding) [61,62,63,64]. Regulatory regions of FT in A. thaliana include an enhancer located about 5.5 kilobases (kb) upstream of the transcriptional start site (TSS), a relatively long promoter sequence (~ 5 kb) with a few conserved blocks, three FT gene introns, and an enhancer located 1 kb downstream of the gene [57, 62, 64,65,66]. A physical convergence of cis-regulatory elements from all major environment-responsive biological pathways controlling flowering induction at the FT gene [57] opened up opportunities for advancement in environmental adaptation during evolution or domestication and breeding by fixation of occasional sequence polymorphism. For example, the major QTL for flowering time in a A. thaliana intercross-recombinant inbred line population was located 6.7-kb upstream of the FT coding sequence, in a region carrying two large overlapping indels and several other polymorphisms [67]. Structural variants of FT promoter (a deletion of ~ 250 bp and an insertion of ~ 1.1 kb) were also found in natural A. thaliana populations in Eurasia and were associated with geographic distribution into lower and higher latitudes, respectively [68].
Legume genomes typically contain more FT-like genes than the genome of A. thaliana, which were putatively preserved after ancestral and lineage-specific whole-genome duplication (WGD) events [59, 69, 70]. These genes were assigned into three subclades: FTa, FTb and FTc [71]. In general, gene duplication can be considered as a mechanism of genomic adaptation to a changing environment [72]. Thereby, selection pressures might be relaxed, providing some opportunity for independent evolution and functional novelty [73]. The occurrence of an ancestral legume WGD event highly converges with the Cretaceous–Paleogene (K–Pg) mass extinction event [74, 75]. Evolution of vernalization responsiveness in temperate plants could be facilitated both by rapid short-term cooling during K–Pg extinction and consecutive global cooling during Eocene–Oligocene transition [76,77,78]. Dating of major climatic events provides a possible explanation of the reason why vernalization pathways differ between plant families: they do, because plant families were already separated at the time of those climate changes [57].
Among legumes, the molecular control of flowering has mainly been studied in soybean, as this species is natively a short day plant, whereas many current cultivation areas are located under long day photoperiod. More than a dozen early maturity loci have been identified in soybean hitherto. Corresponding genes were identified for most of them, including three FLOWERING LOCUS T (FT) homologs for the loci E9, E10 and qDTF-J1, namely GmFT2a, GmFT4 and GmFT5a, that differ for functional mutations and provide adaptation to low and high latitudes, respectively [79,80,81]. Moreover, the geographic distribution of sequence polymorphism at soybean GmFT2a and GmFT5a genes revealed strong latitudinal gradient, highlighting ongoing selection for long juvenile alleles in the tropics [82]. In white lupin such an adaptive mutation for low latitudes could be attributed to a 683 bp deletion in the third intron of LalbFTa1 gene, associated with delayed flowering of some Ethiopian landraces [34,35,36].
FTc1 promoter indels confer vernalization independence in domesticated Old World lupin species
Vernalization independence in Medicago truncatula spring mutants was found to be associated with retroelement insertions at MtFTa1, causing overexpression of this gene [83]. In chickpea, a major QTL for vernalization response explaining the majority of phenotypic variance was localized at LG3, overlapping with the main domestication locus controlling growth habit and early flowering under short days [84, 85]. Phenotypic effects were associated with a high expression of the FT gene cluster (FTa1, FTa2, and FTc), hypothetically driven by unknown cis-acting genetic change [84]. In lentil (Lens culinaris), vernalization independence was reportedly linked with de-repression of the LcFTa1 gene, putatively due to identified sequence polymorphism at LcFTa1–LcFTa2 cluster: a large 7441 bp deletion in the LcFTa1–LcFTa2 intergenic region and a 245 bp deletion located in the LcFTa1 promoter, 3712 bp upstream of the start codon [86]. Interestingly, the coordinates of the latter indel match three overlapping LanFTc1 promoter indel variants found in narrow-leafed lupin and the LLutFTc1 promoter indel found in yellow lupin, all of them conferring overexpression of promoted genes and vernalization independence in those species [11,12,13,14]. Moreover, indel coordinates in these lupin species overlap with a 264 bp deletion in the middle region (-4145 bp) of LalbFTc1 promoter discovered in the present study (marker PR_36b). The direction of effects (early flowering of shorter alleles) was the same for a 28 bp deletion (markers PR_42a and PR_71d) associated with rapid flowering of Turkish, Syrian and Jordanian genotypes (located 780 bp upstream of the LalbFTc1 start codon) and large FTc1 promoter indels in lentil and two lupin species studied hitherto [11,12,13,14, 86]. Rapid LD decay was revealed around the promoter of the LalbFTc1 gene in the white lupin diversity panel. The same phenomenon was observed around the LanFTc1 gene in the narrow-leafed lupin germplasm collection carrying domesticated and wild accessions [13].
Other possible components of flowering time regulatory network in white lupin
Genomic positions of selected DArT-seq markers that revealed significant marker-trait associations (Figs. 6 and 7) were referred to gene annotations reported in the white lupin genome portal [38, 39] (https://www.whitelupin.fr). Six markers were found in exons, three in introns, four in untranslated regions (UTRs) and only one in a region lacking any annotations (Table 5). Taking into consideration expected protein sequences, an alternative allele of Chr06_14434379 marker conferred synonymous mutation, whereas alternative alleles of Chr11_18778542, Chr16_366800 and Chr16_788665 markers – nonsynonymous mutations, namely Ser to Arg, Asn to Asp and Glu to Gly. Interestingly, three gene-based DArT-seq markers were also localized in miRNA clusters. These clusters revealed a remarkably high number of hits to sequences deposited in RNAcentral miRbase [87], namely Cluster_43033 for a presence/absence SilicoDArT marker Chr08_3090075_D − 419, including 362 long non-coding RNAs (lncRNAs), with the best hit e-value of 2.4e-131 (BNAP_LNC001257 from Brassica napus), Cluster_70755 for a presence/absence SilicoDArT marker Chr13_13913452_D − 752, including 646 lncRNAs, with the best hit e-value of 4.3e-153 (PVUL_LNC004086 from Phaseolus vulgaris) and Cluster_127516 for a presence/absence SilicoDArT marker Chr25_4002891_D – 955, including 947 lncRNAs, with the best hit e-value of ~ 0.0 (LANG_LNC003687 from Lupinus angustifolius).
Long non-coding RNAs control developmental processes such as flowering via vernalization and autonomous pathways, seed germination, light- and auxin-regulated development, and are also involved in plant responses to abiotic and biotic stress [88,89,90]. Moreover, long non-coding RNAs drive RNA-dependent DNA methylation to generally suppress transposons in the genome as well as to repress the transcription of specific protein-coding genes, including FLOWERING WAGENINGEN (FWA) conferring a late flowering A. thaliana phenotype [88, 91]. Here, a long non-coding RNA Cluster_43033 was identified in an ATPase gene Lalb_Chr08g0233891. There is little information on the contribution of ATPase genes into flowering pathways, except for the FLOWERING REPRESSOR AAA + ATPase 1 (AaFRAT1) that regulates perennial flowering in a vernalization-dependent manner in Arabis alpine, a mountainous plant from the Brassicaceae family [92]. Taking into consideration that SilicoDArT markers may represent methylation variation at restriction sites, the absence of an allele of the Chr08_3090075_D marker in early flowering accessions from Turkey, Syria and Jordan agrees with the hypothesis of an involvement of RNA-dependent DNA methylation mediated by long non-coding RNAs. Nevertheless, to draw any further conclusions, validation by transcriptomic and epigenetic profiling of germplasm carrying opposite allele phases in contrasting environmental conditions should be performed.
Conclusions
Our study revealed the substantial agreement of population structure with germplasm differences in phenology and geographical origin, and highlighted spring-type germplasm for white lupin breeding targeting high latitudes. We identified a set of markers from PCR-based and DArT-seq datasets that were significantly associated with flowering time in a range of environments and tagged three indels from the LalbFTc1 promoter and several other loci localized in different white lupin chromosomes. Our results supported the convergence of FTc1 promoter indel evolution into vernalization pathway in Old World lupin species, and provided a set of molecular markers for tracking of such indels in white lupin breeding programs. The substantially polygenic control of white lupin flowering time supports the development of genomic selection models exploiting DArT-seq genotyping technology and four LalbFTc1 promoter PCR markers.
Methods
White lupin germplasm diversity panel
The plant material encompassed 262 genotypes randomly selected from 120 accessions representing 11 landrace and 2 cultivar pools [10]. Landraces originated from seven national pools (Azores, Egypt, Greece, Italy, Portugal, Spain, and Turkey) and four transnational pools (East Africa, Madeira & Canaries, Maghreb, and West Asia), each represented by at least eight accessions. Cultivars represented spring-type and winter-type crops included in the French Register of Varieties, each represented by four entries. Collection sites represented diverse climatic conditions such as tropical and subtropical highlands (Ethiopia), cold semi-arid (i.e. some locations in Anatolia and Maghreb), dry-summer subtropical (i.e. hot-summer Mediterranean: the majority of collection sites around Mediterranean Basin), humid subtropical (i.e. warm-summer Mediterranean: Azores), humid temperate (i.e. oceanic: French cultivars). These regions also diverged by photoperiod during the juvenile phase of white lupin growth, ranging from about 9–10 h in winter sowing in northern regions of the Mediterranean Basin to 11–12 h in Ethiopia and 12–14 h in spring sowing in France. The list of accessions is provided in Supplementary File S13. World landrace collection held by Council for Agricultural Research and Economics (CREA) is based one the majority of landrace accessions that were received from INRA Lusignan (now INRAE) in 2002, of which INRA’s accession name is reported in the last column of Supplementary File S13. The material was provided within the framework of the long-standing scientific collaboration between INRA Lusignan and ISCF Lodi (now CREA-ZA Lodi). From each of these accessions, CREA extracted, genotyped, multiplied and characterized up to 4 individual genotypes. CREA’s world landrace collection has been characterized by CREA and Jouffray-Drillaud (currently Cérience).
Phenotyping of flowering time
Field observations were performed during the 2004–2005 growing season as a part of a landrace evaluation study [10] performed under autumn sowing in Lodi (Lombardy, Italy, 45°19’ N, 9°03’ E; representing a temperate subcontinental climate) and Sanluri (Sardinia, Italy, 39°30’ N, 8°50’ E; representative of hot-summer Mediterranean climate) as well as under spring sowing in Saint Sauvant (western France, 46°22’ N 0°5’ E; representative of an oceanic climate). Sowing dates were 14th October in Lodi, 29th October in Sanluri, and 9th March in Saint Sauvant. Data on minimum and maximum air temperature, cumulative growing degree days (GDDs), cumulative vernalization effectiveness of daily temperature (VF) and total photoperiod hours recorded in Lodi, Sanluri, Saint Sauvant and Greenhouse during the course of experiments are provided in Supplementary File S14. Detailed conditions of plot sizes, randomization and plant cultivation were provided in the paper reporting the results of landrace evaluation [10]. Days to onset of flowering from sowing were counted when 50% of plants for a given plot developed first fully colored petals.
Greenhouse observations were performed by spring sowing (19th March 2020 and 11th March 2021) at the Institute of Plant Genetics, Polish Academy of Sciences, Poznań, Poland (52°26′ N 16°54′ E) in Poznań (western Poland, representative of temperate subcontinental climate) under ambient long-day photoperiod, increasing from about 12 to 16 h during plant cultivation. Automatic heating was used to keep the minimum air temperature above 18 °C, whereas cooling was maintained by temperature-dependent window-opening system (activated at 22 °C). Flowering time was recorded as the number of days from sowing to the observation of the first fully colored petal. Observations were made in three biological replicates.
Cumulative growing degree days (GDDs) were calculated using the formula:
Where t and n are days from sowing and the total number of days from sowing to the start of flowering, Td is a daily mean temperature, whereas Tb corresponds to the base temperature of white lupin parameterized in this study as 3 °C [18, 22]. GDD values for fractional days were calculated on a linear scale.
Daily mean temperature was calculated using the formula:
where Tmax and Tmin are daily maximum and minimum temperatures.
Cumulative vernalization effectiveness of daily temperature (VF) was calculated using the formula proposed by Wu et al. 2017 [93]:
where t and n are days from sowing and the total number of days from sowing to the start of flowering, Td, Topt, and Tamp are the mean daily temperature, optimal vernalization temperature and thermal semi-amplitude of vernalizing effect, respectively. Topt and Tamp for white lupin were parameterized in this study as 3 °C and 9 °C corresponding to the range of vernalization-effective temperature between − 6 °C and + 12 °C. VF value of 1 corresponds to 1 day (24 h) of optimal vernalization conditions.
DNA isolation
Two young upper leaves (about 50–100 mg tissue per collection tube) were collected from 5 week-old plants (a single representative per genotype cultivated in the greenhouse) and immediately frozen under liquid nitrogen. Homogenization of frozen plant tissue was performed using TissueLyser II (Qiagen, Hilden, Germany) and two stainless steel beads (ø 5 mm, Qiagen) placed in a 2 mL tube (Eppendorf, Hamburg, Germany) for 45 s at 30 rpm. DNA isolation was performed with Maxwell® RSC PureFood GMO and Authentication Kit (Promega, Mannheim, Germany) harnessing automated station Maxwell® RSC 48 Instrument (Promega) with standard protocol, yielding on average 100 ± 29 µg DNA per sample. DNA concentration and quality were estimated using a NanoDrop 2000 (ThermoFisher Scientific, Warsaw, Poland).
DArT-seq genotyping
DNA isolates were subjected to Lupin DArTseq (1.0) protocol, with 2.5 mln reads sequencing depth (Illumina Novaseq6000). DArT-seq protocol and genotype calling were performed by Diversity Arrays Technology Pty Ltd. (University of Canberra, Bruce, Australia). DArTseq generated two types of data: scores for presence/absence (dominant) markers (SilicoDArTs) and standard single nucleotide polymorphism (SNP) markers. SilicoDArTs represent several possible polymorphisms: SNPs and small indels in restriction enzyme recognition sites, larger insertions/deletions in restriction fragments and methylation variation at restriction sites. The reference allele corresponds to the allele present in white lupin genome assembly GCA_009771035.1.
PCR-based genotyping of indel polymorphism in the FT gene promoters
As in the two other Old World lupin crop species (i.e., narrow-leafed lupin and yellow lupin) indel polymorphism in the regulatory region of FTc1 homologs (LanFTc1 and LlutFTc1, respectively) revealed association with time to flowering and vernalization responsiveness [11,12,13,14], we decided to supplement DArT-seq dataset with PCR markers targeting structural variation in the promoter region of white lupin FTc1 homolog, LalbFTc1. DNA sequence carrying the LalbFTc1 gene (Lalb_Chr14g0364281) [11] with 8100 bp of 5’ regulatory region (full promoter with proximal and distal CCAAT boxes) was extracted from the white lupin genome assembly [38]. The set of primer pairs (Table 3) was designed to amplify overlapping PCR products spanning the region from the 8065 bp upstream of the transcription start site to the first exon of the LalbFTc1 gene. Moreover, additional PCR primers were designed, targeting particular indels identified in the multiple sequence alignment of white lupin pangenome carrying 40 accessions [39]. The same procedure was performed for the remaining white lupin FT homologs, LalbFTa1 (Lalb_Chr02g0156991), LalbFTa2 (Lalb_Chr21g0317021) and LalbFTc2 (Lalb_Chr09g0331851).
Primers were designed using Primer 3 Plus [94] implemented in Geneious Prime [95], whereas sequence alignment was performed using the progressive Mauve algorithm [106] assuming genome collinearity. Primer sequences are provided in Supplementary File S15. PCR products up to 2 kb in length were amplified using GoTaq G2 Flexi DNA Polymerase (Promega) whereas longer products used GoTaq® Long PCR Master Mix (Promega). Length differences were visualized by agarose gel electrophoresis with the agarose concentration (1–3%) adjusted to follow the size of the expected products. Small indels (shorter than about 50 bp) were resolved using high resolution 3:1 agarose (Serva, Heidelberg, Germany) whereas larger ones with wide range agarose (Serva). Selected polymorphic amplicons were directly Sanger-sequenced using BigDye® Terminator v3.1 Cycle Sequencing Kit (Applied Biosystems) and 96-capillary 3730xl DNA Analyzer (Applied Biosystems) by Genomed (Warsaw, Poland). LalbFTc1 markers were analyzed in the whole panel (262 accessions), whereas LalbFTa1, LalbFTa2 and LalbFTc2 markers in a sub-population of 86 accessions representing contrasting values of flowering time in studied environments and allelic composition of significantly associated markers.
PCR-based genotyping of flowering time QTLs from linkage map
Besides DArT-seq and LalbFTc1 indel markers, the white lupin germplasm panel was subjected to genotyping with the set of fifteen PCR-based markers (Table 1) tagging four major QTLs for flowering time from linkage mapping studies [34, 35] and several candidate genes that in germplasm collection revealed significant correlation between sequence polymorphism and time to flowering [36]. Depending on the availability of restriction enzymes, SNPs were resolved by the cleaved amplified polymorphic sequence (CAPS) [46] or derived CAPS (dCAPS) [47] approaches. Full-length original gel images for cropped gels displayed in Supplementary Files S6 and S7 were provided in Supplementary File SS16.
Calculation of heritability
Heritability in a broad sense was calculated using the formula:
Where:
Imputation of genotypes and population structure analysis
All marker data were transformed to 0, 1, 2 code, where 0 stands for the homozygote, 2 for the alternative allele homozygote, and 1 for the heterozygote. For data transformation, custom python script were used. For missing data imputation was performed using beagle software version 4.1 [96] with its default settings. Duplicated loci with identical segregation were removed leaving a single representative. That prepared data was filtered for minor allele frequency (MAF) with threshold of 0.05. Q matrix with population structure for the estimation of ancestral populations (K) in range 2 to 30 was calculated using ‘snmf’ function from the LAE package [97]. The analysis was done for each K-value, using 3000 replications and 5000 iterations. The best run was obtained using cross-entropy criterion. Based on the obtained results for further analysis, K-values in the range of 4–13, 15 and 17 were used. In parallel, principal component analysis (PCA) analysis of 6 765 SNP markers and 262 genotypes was conducted using the prcomp function from R Software.
Genome-wide association mapping and visualization
GWAS was performed using the Bayesian-information and Linkage-disequilibrium Iteratively Nested Keyway (BLINK) model [98] implemented in the GAPIT R package [99]. In the analysis, we accounted for population structure (Q) through a Qmatrix and for relationships among individuals through a kinship (K) matrix [100], both using the marker data. For each trait, Qmatrix of covariates with K = 5 was used. The significance threshold for marker-trait associations (MTA) was set to p = 0.05 after applying the false discovery rate (FDR) [101] correction. Visualization of population structure, PCA map and violin plots was made in R software using packages ggplot2 [102], GAPIT [99] factoextra [103] and pophelper [104]. LD graphs were prepared using LDheatmap [105], whereas Manhattan plots in qqman package (https://cran.r-project.org/web/packages/qqman/index.html).
Data availability
CREA’s world landrace collection has been characterized by CREA in Lodi and evaluated by CREA and Jouffray-Drillaud (currently Cérience), specifically by two co-authors of this work (PA and NH). No voucher specimen has been deposited anywhere by CREA. However, the original landraces of INRA Lusignan’s world collection (from which individual genotypes were extracted, multiplied and genotyped) have become part of INRAE’s germplasm collection in Dijon and, as such, are available according to INRAE’s regulations for germplasm exchange.
All data generated or analyzed during this study are included in this published article and its supplementary information files. The sequence variant data have been deposited in the European Variation Archive (EVA) at European Molecular Biology Laboratory - European Bioinformatics Institute (EMBL-EBI) as a study “Lupinus albus genome-wide association study for phenology traits” under project PRJNA939025, analysis accession number ERZ16297462.
References
Wolko B, Clements JC, Naganowska B, Nelson MN. Yang Ha: Lupinus. In: Wild Crop Relatives: Genomic and Breeding Resources Edited by Kole C. Heidelberg: Springer Berlin Heidelberg; 2011: 153–206.
Gladstones JS. The Mediterranean white lupin. J Department Agric Western Australia Ser. 1976;4(3):70–4.
Gladstones JS. Lupins as crop plants. Field Crop Abstracts. 1970;23(2):26.
Papineau J, Huyghe C. Le Lupin doux protéagineux. Paris: France agricole; 2004.
Boschin G, D’Agostina A, Annicchiarico P, Arnoldi A. The fatty acid composition of the oil from Lupinus albus Cv. Luxe as affected by environmental and agricultural factors. Eur Food Res Technol. 2007;225(5):769–76.
Vance CP, Uhde-Stone C, Allan DL. Phosphorus acquisition and use: critical adaptations by plants for securing a nonrenewable resource. New Phytol. 2003;157(3):423–47.
Schulze J, Temple G, Temple SJ, Beschow H, Vance CP. Nitrogen fixation by white lupin under phosphorus deficiency. Ann Bot. 2006;98(4):731–40.
Adhikari KN, Buirchell BJ, Sweetingham MW. Length of vernalization period affects flowering time in three lupin species. Plant Breed. 2012;131(5):631–6.
Rahman M, Gladstones J. Control of lupin flower initiation by vernalization, photoperiod and temperature under controlled environment. Aust J Exp Agric. 1972;12(59):638–45.
Annicchiarico P, Harzic N, Carroni AM. Adaptation, diversity, and exploitation of global white lupin (Lupinus albus L.) landrace genetic resources. Field Crops Res. 2010;119(1):114–24.
Plewiński P, Rychel-Bielska S, Kozak B, Maureira-Butler IJ, Iqbal MM, Nelson MN, Książkiewicz M. FLOWERING LOCUS T indel variants confer vernalization-independent and photoperiod-insensitive flowering of yellow lupin (Lupinus luteus L). Hortic Res. 2022;9:uhac180.
Rychel-Bielska S, Plewiński P, Kozak B, Galek R, Książkiewicz M. Photoperiod and vernalization control of flowering-related genes: a case study of the narrow-leafed lupin (Lupinus angustifolius L). Front Plant Sci. 2020;11:572135.
Nelson MN, Książkiewicz M, Rychel S, Besharat N, Taylor CM, Wyrwa K, Jost R, Erskine W, Cowling WA, Berger JD, et al. The loss of vernalization requirement in narrow-leafed lupin is associated with a deletion in the promoter and de-repressed expression of a flowering locus T (FT) homologue. New Phytol. 2017;213:220–32.
Taylor CM, Kamphuis LG, Zhang W, Garg G, Berger JD, Mousavi-Derazmahalleh M, Bayer PE, Edwards D, Singh KB, Cowling WA, et al. INDEL variation in the regulatory region of the major flowering time gene LanFTc1 is associated with vernalization response and flowering time in narrow-leafed lupin (Lupinus angustifolius L). Plant Cell Environ. 2019;42(1):174–87.
Rahman M, Gladstones J. Effects of temperature and photoperiod on flowering and yield components of lupin genotypes in the field. Aust J Exp Agric. 1974;14(67):205–13.
Christiansen JL, J⊘rnsgård B, J⊘rgensen ST, Bibby BM. Photoperiod sensitivity in narrow-leafed lupin (Lupinus angustifolius L). Acta Agriculturae Scand Sect B — Soil Plant Sci. 2008;58(4):365–71.
Christiansen JL, Jørnsgård B. Influence of day length and temperature on number of main stem leaves and time to flowering in lupin. Ann Appl Biol. 2002;140(1):29–35.
Huyghe C. Winter growth of autumn-sown white lupin (Lupinus albus L.) main apex growth model. Ann Bot. 1991;67(4):429–34.
Dracup M, Davies C, Tapscott H. Temperature and water requirements for germination and emergence of lupin. Aust J Exp Agric. 1993;33(6):759–66.
Clapham WM, Willcott JB. Thermosensitivity in spring white lupin. Ann Bot. 1995;76(4):349–57.
Putnam DH, Simmons SR, Hardman LL. Vernalization and seeding date effects on yield and yield components of white lupin. Crop Sci. 1993;33(5):1076–83.
Huyghe C, Papineau J. Winter development of autumn sown white lupin: agronomic and breeding consequences. Agronomie. 1990;10(9):709–16.
Siddons PA, Jones RJA, Hollis JM, Hallett SH, Huyghe C, Day JM, Scott T, Milford GFJ. The use of a land suitability model to predict where autumn-sown, determinate genotypes of the white lupin (Lupinus albus) might be grown in England and Wales. J Agricultural Sci. 2009;123(2):199–205.
Shield IF, Scott T, Stevenson HJ, Leach JE, Todd AD. The causes of over-winter plant losses of autumn-sown white lupins (Lupinus albus) in different regions of the UK over three seasons. J Agric Sci. 2000;135(2):173–83.
Annicchiarico P, Iannucci A. Winter survival of pea, faba bean and white lupin cultivars across contrasting Italian locations and sowing times, and implications for selection. J Agric Sci. 2007;145:611–22.
Annicchiarico P, Romani M, Pecetti L. White lupin (Lupinus albus) variation for adaptation to severe drought stress. Plant Breed. 2018;137:782–9.
Markonis Y, Kumar R, Hanel M, Rakovec O, Máca P, AghaKouchak A. The rise of compound warm-season droughts in Europe. Sci Adv 2021, 7(6).
Pecetti L, Annicchiarico P, Crosta M, Notario T, Ferrari B, Nazzicari N. White lupin drought tolerance: genetic variation, trait genetic architecture, and genome-enabled prediction. Int J Mol Sci. 2023;24(3):2351.
Annicchiarico P, Harzic N, Huyghe C, Carroni AM. Ecological classification of white lupin landrace genetic resources. Euphytica. 2011;180(1):17–25.
King M, Altdorff D, Li P, Galagedara L, Holden J, Unc A. Northward shift of the agricultural climate zone under 21st-century global climate change. Sci Rep. 2018;8(1):7904.
Tun W, Yoon J, Jeon J-S, An G. Influence of climate change on flowering time. J Plant Biology. 2021;64(3):193–203.
Phan HTT, Ellwood SR, Adhikari K, Nelson MN, Oliver RP. The first genetic and comparative map of white lupin (Lupinus albus L.): identification of QTLs for anthracnose resistance and flowering time, and a locus for alkaloid content. DNA Res. 2007;14:59–70.
Vipin CA, Luckett DJ, Harper JDI, Ash GJ, Kilian A, Ellwood SR, Phan HTT, Raman H. Construction of integrated linkage map of a recombinant inbred line population of white lupin (Lupinus albus L). Breed Sci. 2013;63:292–300.
Książkiewicz M, Nazzicari N, Yang H, Nelson MN, Renshaw D, Rychel S, Ferrari B, Carelli M, Tomaszewska M, Stawiński S, et al. A high-density consensus linkage map of white lupin highlights synteny with narrow-leafed lupin and provides markers tagging key agronomic traits. Sci Rep. 2017;7:15335.
Rychel S, Książkiewicz M, Tomaszewska M, Bielski W, Wolko B. FLOWERING LOCUS T, GIGANTEA, SEPALLATA and FRIGIDA homologs are candidate genes involved in white lupin (Lupinus albus L.) early flowering. Mol Breed. 2019;39:43.
Rychel-Bielska S, Surma A, Bielski W, Kozak B, Galek R, Książkiewicz M. Quantitative control of early flowering in white lupin (Lupinus albus L). Int J Mol Sci. 2021;22(8):3856.
Xu W, Zhang Q, Yuan W, Xu F, Muhammad Aslam M, Miao R, Li Y, Wang Q, Li X, Zhang X, et al. The genome evolution and low-phosphorus adaptation in white lupin. Nat Commun. 2020;11(1):1069.
Hufnagel B, Marques A, Soriano A, Marquès L, Divol F, Doumas P, Sallet E, Mancinotti D, Carrere S, Marande W, et al. High-quality genome sequence of white lupin provides insight into soil exploration and seed quality. Nat Commun. 2020;11(1):492.
Hufnagel B, Soriano A, Taylor J, Divol F, Kroc M, Sanders H, Yeheyis L, Nelson M, Péret B. Pangenome of white lupin provides insights into the diversity of the species. Plant Biotechnol J. 2021;19(12):2532–43.
Schwertfirm G, Schneider M, Haase F, Riedel C, Lazzaro M, Ruge-Wehling B, Schweizer G. Genome-wide association study revealed significant SNPs for anthracnose resistance, seed alkaloids and protein content in white lupin. Theor Appl Genet. 2024;137(7):155.
Alkemade JA, Nazzicari N, Messmer MM, Annicchiarico P, Ferrari B, Voegele RT, Finckh MR, Arncken C, Hohmann P. Genome-wide association study reveals white lupin candidate gene involved in anthracnose resistance. Theor Appl Genet. 2022;135(3):1011–24.
Parisi SG, Cola G, Gilioli G, Mariani L. Modeling and improving Ethiopian pasture systems. Int J Biometeorol. 2018;62(5):883–95.
Shimabukuro R, Tomita T, Fukui K-i. Update of global maps of Alisov’s climate classification. Prog Earth Planet Sci. 2023;10(1):19.
Ren R, Ray R, Li P, Xu J, Zhang M, Liu G, Yao X, Kilian A, Yang X. Construction of a high-density DArTseq SNP-based genetic map and identification of genomic regions with segregation distortion in a genetic population derived from a cross between feral and cultivated-type watermelon. Mol Genet Genomics. 2015;290(4):1457–70.
Kilian A, Wenzl P, Huttner E, Carling J, Xia L, Blois H, Caig V, Heller-Uszynska K, Jaccoud D, Hopper C, et al. Diversity arrays technology: a generic genome profiling technology on open platforms. Methods Mol Biol. 2012;888:67–89.
Konieczny A, Ausubel FM. A procedure for mapping Arabidopsis mutations using co-dominant ecotype-specific PCR-based markers. Plant J. 1993;4:403–10.
Neff MM, Neff JD, Chory J, Pepper AE. dCAPS, a simple technique for the genetic analysis of single nucleotide polymorphisms: experimental applications in Arabidopsis thaliana genetics. Plant J. 1998;14:387–92.
Huyghe C. White lupin (Lupinus albus L). Field Crops Res. 1997;53(1):147–60.
Raman R, Cowley R, Raman H, Luckett DJ. Analyses using SSR and DArT molecular markers reveal that Ethiopian accessions of white lupin (Lupinus albus L.) represent a unique genepool. Open J Genet 2014:87–98.
Atnaf M, Yao N, Martina K, Dagne K, Wegary D, Tesfaye K. Molecular genetic diversity and population structure of Ethiopian white lupin landraces: implications for breeding and conservation. PLoS ONE. 2017;12(11):e0188696.
Dong L, Li S, Wang L, Su T, Zhang C, Bi Y, Lai Y, Kong L, Wang F, Pei X et al. The genetic basis of high-latitude adaptation in wild soybean. Curr Biol 2022.
Taylor CM, Garg G, Berger JD, Ribalta FM, Croser JS, Singh KB, Cowling WA, Kamphuis LG, Nelson MN. A trimethylguanosine Synthase1-like (TGS1) homologue is implicated in vernalisation and flowering time control. Theor Appl Genet. 2021;134(10):3411–26.
Mousavi-Derazmahalleh M, Bayer PE, Nevado B, Hurgobin B, Filatov D, Kilian A, Kamphuis LG, Singh KB, Berger JD, Hane JK, et al. Exploring the genetic and adaptive diversity of a pan-mediterranean crop wild relative: narrow-leafed lupin. Theor Appl Genet. 2018;131(4):887–901.
Berger JD, Buirchell BJ, Luckett DJ, Nelson MN. Domestication bottlenecks limit genetic diversity and constrain adaptation in narrow-leafed lupin (Lupinus angustifolius L). Theor Appl Genet. 2012;124(4):637–52.
Plewiński P, Ćwiek-Kupczyńska H, Rudy E, Bielski W, Rychel-Bielska S, Stawiński S, Barzyk P, Krajewski P, Naganowska B, Wolko B, et al. Innovative transcriptome-based genotyping highlights environmentally responsive genes for phenology, growth and yield in a non-model grain legume. Plant Cell Environ. 2020;43:2680–98.
Turck F, Fornara F, Coupland G. Regulation and identity of florigen: FLOWERING LOCUS T moves center stage. Annu Rev Plant Biol. 2008;59:573–94.
Bratzel F, Turck F. Molecular memories in the regulation of seasonal flowering: from competence to cessation. Genome Biol. 2015;16:192.
Lee JH, Yoo SJ, Park SH, Hwang I, Lee JS, Ahn JH. Role of SVP in the control of flowering time by ambient temperature in Arabidopsis. Genes Dev. 2007;21(4):397–402.
Yamaguchi A, Kobayashi Y, Goto K, Abe M, Araki T. TWIN SISTER OF FT (TSF) acts as a floral pathway integrator redundantly with FT. Plant Cell Physiol. 2005;46(8):1175–89.
Kim W, Park TI, Yoo SJ, Jun AR, Ahn JH. Generation and analysis of a complete mutant set for the Arabidopsis FT/TFL1 family shows specific effects on thermo-sensitive flowering regulation. J Exp Bot. 2013;64(6):1715–29.
Cao S, Kumimoto RW, Gnesutta N, Calogero AM, Mantovani R, Holt BF 3. A distal CCAAT/NUCLEAR FACTOR Y complex promotes chromatin looping at the FLOWERING LOCUS T promoter and regulates the timing of flowering in Arabidopsis. Plant Cell. 2014;26(3):1009–17.
Adrian J, Farrona S, Reimer JJ, Albani MC, Coupland G, Turck F. cis-Regulatory elements and chromatin state coordinately control temporal and spatial expression of FLOWERING LOCUS T in Arabidopsis. Plant Cell. 2010;22(5):1425–40.
Luo X, Gao Z, Wang Y, Chen Z, Zhang W, Huang J, Yu H, He Y. The NUCLEAR FACTOR-CONSTANS complex antagonizes polycomb repression to de-repress FLOWERING LOCUS T expression in response to inductive long days in Arabidopsis. Plant J. 2018;95(1):17–29.
Kinoshita A, Richter R. Genetic and molecular basis of floral induction in Arabidopsis thaliana. J Exp Bot. 2020;71(9):2490–504.
Pandey SP, Benstein RM, Wang Y, Schmid M. Epigenetic regulation of temperature responses: past successes and future challenges. J Exp Bot. 2021;72(21):7482–97.
Zicola J, Liu L, Tänzler P, Turck F. Targeted DNA methylation represses two enhancers of FLOWERING LOCUS T in Arabidopsis thaliana. Nat Plants. 2019;5(3):300–7.
Schwartz C, Balasubramanian S, Warthmann N, Michael TP, Lempe J, Sureshkumar S, Kobayashi Y, Maloof JN, Borevitz JO, Chory J, et al. Cis-regulatory changes at FLOWERING LOCUS T mediate natural variation in flowering responses of Arabidopsis thaliana. Genetics. 2009;183(2):723–32.
Liu L, Adrian J, Pankin A, Hu J, Dong X, von Korff M, Turck F. Induced and natural variation of promoter length modulates the photoperiodic response of FLOWERING LOCUS T. Nat Commun. 2014;5:4558.
Laurie RE, Diwadkar P, Jaudal M, Zhang L, Hecht V, Wen J, Tadege M, Mysore KS, Putterill J, Weller JL, et al. The Medicago FLOWERING LOCUS T homolog, MtFTa1, is a key regulator of flowering time. Plant Physiol. 2011;156(4):2207–24.
Ren L, Huang W, Cannon SB. Reconstruction of ancestral genome reveals chromosome evolution history for selected legume species. New Phytol. 2019;223(4):2090–103.
Hecht V, Laurie RE, Vander Schoor JK, Ridge S, Knowles CL, Liew LC, Sussmilch FC, Murfet IC, Macknight RC, Weller JL. The pea GIGAS gene is a FLOWERING LOCUS T homolog necessary for graft-transmissible specification of flowering but not for responsiveness to photoperiod. Plant Cell. 2011;23(1):147–61.
Kondrashov FA. Gene duplication as a mechanism of genomic adaptation to a changing environment. Proceedings of the Royal Society B: Biological Sciences 2012, 279(1749):5048–5057.
Conant GC, Wolfe KH. Turning a hobby into a job: how duplicated genes find new functions. Nat Rev Genet. 2008;9(12):938–50.
Koenen EJM, Ojeda DI, Steeves R, Migliore J, Bakker FT, Wieringa JJ, Kidner C, Hardy OJ, Pennington RT, Bruneau A, et al. Large-scale genomic sequence data resolve the deepest divergences in the legume phylogeny and support a near-simultaneous evolutionary origin of all six subfamilies. New Phytol. 2020;225(3):1355–69.
Koenen EJM, Ojeda DI, Bakker FT, Wieringa JJ, Kidner C, Hardy OJ, Pennington RT, Herendeen PS, Bruneau A, Hughes CE. The origin of the legumes is a complex paleopolyploid phylogenomic tangle closely associated with the cretaceous–paleogene (K–Pg) mass extinction event. Syst Biol. 2020;70(3):508–26.
Linnert C, Robinson SA, Lees JA, Bown PR, Pérez-Rodríguez I, Petrizzo MR, Falzoni F, Littler K, Arz JA, Russell EE. Evidence for global cooling in the late cretaceous. Nat Commun. 2014;5(1):4194.
Vellekoop J, Sluijs A, Smit J, Schouten S, Weijers JW, Sinninghe Damsté JS, Brinkhuis H. Rapid short-term cooling following the Chicxulub impact at the Cretaceous-Paleogene boundary. Proc Natl Acad Sci U S A. 2014;111(21):7537–41.
Hutchinson DK, Coxall HK, Lunt DJ, Steinthorsdottir M, de Boer AM, Baatsen M, von der Heydt A, Huber M, Kennedy-Asser AT, Kunzmann L, et al. The eocene–oligocene transition: a review of marine and terrestrial proxy data, models and model–data comparisons. Clim Past. 2021;17(1):269–315.
Zhao C, Takeshima R, Zhu J, Xu M, Sato M, Watanabe S, Kanazawa A, Liu B, Kong F, Yamada T, et al. A recessive allele for delayed flowering at the soybean maturity locus E9 is a leaky allele of FT2a, a FLOWERING LOCUS T ortholog. BMC Plant Biol. 2016;16(1):20.
Samanfar B, Molnar SJ, Charette M, Schoenrock A, Dehne F, Golshani A, Belzile F, Cober ER. Mapping and identification of a potential candidate gene for a novel maturity locus, E10, in soybean. Theor Appl Genet. 2017;130(2):377–90.
Takeshima R, Hayashi T, Zhu J, Zhao C, Xu M, Yamaguchi N, Sayama T, Ishimoto M, Kong L, Shi X, et al. A soybean quantitative trait locus that promotes flowering under long days is identified as FT5a, a FLOWERING LOCUS T ortholog. J Exp Bot. 2016;67(17):5247–58.
Li X, Fang C, Yang Y, Lv T, Su T, Chen L, Nan H, Li S, Zhao X, Lu S, et al. Overcoming the genetic compensation response of soybean florigens to improve adaptation and yield at low latitudes. Curr Biol. 2021;31(17):3755–e37673754.
Jaudal M, Yeoh CC, Zhang L, Stockum C, Mysore KS, Ratet P, Putterill J. Retroelement insertions at the Medicago FTa1 locus in spring mutants eliminate vernalisation but not long-day requirements for early flowering. Plant J. 2013;76(4):580–91.
Ortega R, Hecht VFG, Freeman JS, Rubio J, Carrasquilla-Garcia N, Mir RR, Penmetsa RV, Cook DR, Millan T, Weller JL. Altered expression of an FT Cluster underlies a major locus controlling domestication-related changes to chickpea phenology and growth habit. Front Plant Sci. 2019;10:824–824.
Samineni S, Kamatam S, Thudi M, Varshney RK, Gaur PM. Vernalization response in chickpea is controlled by a major QTL. Euphytica. 2016;207(2):453–61.
Rajandran V, Ortega R, Vander Schoor JK, Butler JB, Freeman JS, Hecht VFG, Erskine W, Murfet IC, Bett KE, Weller JL. Genetic analysis of early phenology in lentil identifies distinct loci controlling component traits. J Exp Bot. 2022;73(12):3963–77.
Consortium R. RNAcentral 2021: secondary structure integration, improved sequence search and new member databases. Nucleic Acids Res. 2021;49(D1):D212–20.
Jampala P, Garhewal A, Lodha M. Functions of long non-coding RNA in Arabidopsis thaliana. Plant Signal Behav. 2021;16(9):1925440.
Zhao Z, Zang S, Zou W, Pan Y-B, Yao W, You C, Que Y. Long non-coding RNAs: new players in plants. Int J Mol Sci. 2022;23(16):9301.
Jha UC, Nayyar H, Jha R, Khurshid M, Zhou M, Mantri N, Siddique KHM. Long non-coding RNAs: emerging players regulating plant abiotic stress response and adaptation. BMC Plant Biol. 2020;20(1):466.
Fujimoto R, Sasaki T, Kudoh H, Taylor JM, Kakutani T, Dennis ES. Epigenetic variation in the FWA gene within the genus Arabidopsis. Plant J. 2011;66(5):831–43.
de la Viñegra N, Vayssières A, Obeng-Hinneh E, Neumann U, Zhou Y, Lázaro A, Roggen A, Sun H, Stolze SC, Nakagami H et al. FLOWERING REPRESSOR AAA + ATPase 1 is a novel regulator of perennial flowering in Arabis alpina. New Phytol 2022, 236(2):729–744.
Wu X, Liu H, Li X, Tian Y, Mahecha MD. Responses of winter wheat yields to warming-mediated vernalization variations across temperate Europe. Front Ecol Evol 2017, 5.
Untergasser A, Nijveen H, Rao X, Bisseling T, Geurts R, Leunissen JAM. Primer3Plus, an enhanced web interface to Primer3. Nucleic Acids Res. 2007;35:W71–74.
Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, et al. Geneious Basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–9.
Browning SR, Browning BL. Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering. Am J Hum Genet. 2007;81(5):1084–97.
Frichot E, François O. LEA: an R package for landscape and ecological association studies. Methods Ecol Evol. 2015;6(8):925–9.
Huang M, Liu X, Zhou Y, Summers RM, Zhang Z. BLINK: a package for the next level of genome-wide association studies with both individuals and markers in the millions. GigaScience 2019, 8(2).
Lipka AE, Tian F, Wang Q, Peiffer J, Li M, Bradbury PJ, Gore MA, Buckler ES, Zhang Z. GAPIT: genome association and prediction integrated tool. Bioinformatics. 2012;28(18):2397–9.
VanRaden PM. Efficient methods to compute genomic predictions. J Dairy Sci. 2008;91(11):4414–23.
Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J Royal Stat Soc Ser B (Methodological). 1995;57(1):289–300.
Wickham H. ggplot2 elegant graphics for data analysis. New York: Springer; 2009.
Kassambara A, Mundt F. Factoextra: Extract and visualize the results of multivariate data analyses. In.: R Package Version 1.0.7; 2020.
Francis RM. Pophelper: an R package and web app to analyse and visualize population structure. Mol Ecol Resour. 2017;17(1):27–32.
Shin J-H, Blay S, McNeney B, Graham J. LDheatmap: an R function for graphical Display of pairwise linkage Disequilibria between single nucleotide polymorphisms. J Stat Softw Code Snippets. 2006;16(3):1–9.
Darling AC, Mau B, Blattner FR, Perna NT. Mauve: multiple alignment of conserved genomic sequence with rearrangements. Genome Res. 2004;14(7):1394–1403.
Acknowledgements
Authors would like to acknowledge National Research Institute for Agriculture, Food and Environment (INRAE) for providing the landrace accessions of white lupin subjected to multi-environment phenotyping in the study. We would like to thank Antonio Carroni for collection of phenology and climatic data in Sanluri.
Funding
This research was funded by National Science Centre, Poland (SONATINA3, 2019/32/C/NZ9/00055 and SONATA17, 2021/43/D/NZ9/00293) and the Ministry of Agricultural, Food and Forestry Policies of Italy (project RGV-FAO Treaty). An open access charge was co-financed by Wroclaw University of Environmental and Life Sciences as well as by National Science Centre, Poland. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to publish the results.
Author information
Authors and Affiliations
Contributions
SR-B designed the experiments and obtained funding in Poland, whereas PA obtained funding for phenotyping in Italy and maintenance of the germplasm collection. SR-B, AS, JB and WB performed phenotypic observations and plant sampling in Poland. PA was responsible for the collection of phenological data in Italy, while NH was responsible for data collection in France. SR-B, MK designed PCR markers. SR-B, WB and AS performed DNA isolation and PCR-marker screening. BK and MK performed data analysis and interpretation. BK and MK visualized the results and prepared supplementary files. MK drafted the manuscript. RG and PA contributed to the interpretation of the results and to manuscript writing.
Corresponding author
Ethics declarations
Ethics approval and consent to participate
All experimental studies on plants were complied with relevant institutional, national, and international guidelines and legislation.
Consent for publication
Not applicable.
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
12870_2024_5438_MOESM1_ESM.xlsx
Supplementary Material 1: Supplementary_File_S1.xlsx: Phenotypic observations recorded in studied environments for white lupin germplasm diversity panel.
12870_2024_5438_MOESM2_ESM.xlsx
Supplementary Material 2: Supplementary_File_S2.xlsx: Total photoperiod (day light) hours from sowing to start of flowering recorded in studied environments for white lupin germplasm diversity panel.
12870_2024_5438_MOESM3_ESM.xlsx
Supplementary Material 3: Supplementary_File_S3.xlsx: Cumulative number of growing degree days from sowing to start of flowering (GDDs) recorded in studied environments for white lupin germplasm diversity panel.
12870_2024_5438_MOESM4_ESM.xlsx
Supplementary Material 4: Supplementary_File_S4.xlsx: Cumulative vernalization effectiveness of daily temperature (VF) from sowing to start of flowering recorded in studied environments for white lupin germplasm diversity panel.
12870_2024_5438_MOESM5_ESM.xlsx
Supplementary Material 5: Supplementary_File_S5.xlsx: DarT-seq and PCR-based markers used for population structure analysis and genome-wide association study in white lupin germplasm diversity panel.
12870_2024_5438_MOESM6_ESM.pdf
Supplementary Material 6: Supplementary_File_S6.pdf: Agarose gel electrophoregrams showing polymorphism of PCR-based markers tagging white lupin flowering time quantitative trait loci (QTLs) from linkage mapping studies.
12870_2024_5438_MOESM7_ESM.pdf
Supplementary Material 7: Supplementary_File_S7.pdf: Agarose gel electrophoregrams showing polymorphism of PCR-based markers developed for white lupin LalbFTc1 gene promoter indels.
12870_2024_5438_MOESM8_ESM.pdf
Supplementary Material 8: Supplementary_File_S8.pdf: Values of the cross-entropy criterion for a number clusters ranging from K1 to K30.
12870_2024_5438_MOESM9_ESM.xlsx
Supplementary Material 9: Supplementary_File_S9.xlsx: Results of population structure analysis in white lupin germplasm diversity panel.
12870_2024_5438_MOESM10_ESM.xlsx
Supplementary Material 10: Supplementary_File_S10.xlsx: FDR-corrected P-values and phenotypic effects of markers analyzed in genome-wide association study of white lupin germplasm diversity panel for flowering time in controlled environment and field conditions.
12870_2024_5438_MOESM11_ESM.xlsx
Supplementary Material 11: Supplementary_File_S11.xlsx: Results of PCR-based screening of indel polymorphism in promoter regions of LalbFTa1, LalbFTa2, LalbFTc1 and LalbFTc2 genes in white lupin germplasm diversity panel.
12870_2024_5438_MOESM12_ESM.xlsx
Supplementary Material 12: Supplementary_File_S12.xlsx: Correlations between studied traits and PCR-based markers tagging indel polymorphism in promoter regions of LalbFTa1, LalbFTa2, LalbFTc1 and LalbFTc2 genes in white lupin germplasm diversity panel.
12870_2024_5438_MOESM13_ESM.xlsx
Supplementary Material 13: Supplementary_File_S13.xlsx: The list of genotypes from white lupin germplasm diversity panel analyzed in the study.
12870_2024_5438_MOESM14_ESM.xlsx
Supplementary Material 14: Supplementary_File_S14.xlsx: Minimum and maximum air temperature, cumulative growing degree days (GDDs), cumulative vernalization effectiveness of daily temperature (VF) and total photoperiod hours recorded in Lodi, Sanluri, Saint Sauvant and Greenhouse during the course of experiments.
12870_2024_5438_MOESM15_ESM.xlsx
Supplementary Material 15: Supplementary_File_S15.xlsx: Primer sequences of PCR-based markers developed for white lupin LalbFTa1, LalbFTa2, LalbFTc1 and LalbFTc2 gene promoter indels and QTL screening.
12870_2024_5438_MOESM16_ESM.pdf
Supplementary Material 16: Supplementary_File_S16.pdf: Full-length original gel images for cropped gels displayed in Supplementary Files.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Rychel-Bielska, S., Bielski, W., Surma, A. et al. A GWAS study highlights significant associations between a series of indels in a FLOWERING LOCUS T gene promoter and flowering time in white lupin (Lupinus albus L.). BMC Plant Biol 24, 722 (2024). https://doi.org/10.1186/s12870-024-05438-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1186/s12870-024-05438-1