Mapping non-host resistance to the stem rust pathogen in an interspecific barberry hybrid

Bartaula, Radhika; Melo, Arthur T. O.; Kingan, Sarah; Jin, Yue; Hale, Iago

doi:10.1186/s12870-019-1893-9

Research article
Open access
Published: 16 July 2019

Mapping non-host resistance to the stem rust pathogen in an interspecific barberry hybrid

Radhika Bartaula¹,
Arthur T. O. Melo²,
Sarah Kingan³,
Yue Jin⁴ &
…
Iago Hale ORCID: orcid.org/0000-0002-8558-7861²

BMC Plant Biology volume 19, Article number: 319 (2019) Cite this article

2430 Accesses
5 Citations
10 Altmetric
Metrics details

Abstract

Background

Non-host resistance (NHR) presents a compelling long-term plant protection strategy for global food security, yet the genetic basis of NHR remains poorly understood. For many diseases, including stem rust of wheat [causal organism Puccinia graminis (Pg)], NHR is largely unexplored due to the inherent challenge of developing a genetically tractable system within which the resistance segregates. The present study turns to the pathogen’s alternate host, barberry (Berberis spp.), to overcome this challenge.

Results

In this study, an interspecific mapping population derived from a cross between Pg-resistant Berberis thunbergii (Bt) and Pg-susceptible B. vulgaris was developed to investigate the Pg-NHR exhibited by Bt. To facilitate QTL analysis and subsequent trait dissection, the first genetic linkage maps for the two parental species were constructed and a chromosome-scale reference genome for Bt was assembled (PacBio + Hi-C). QTL analysis resulted in the identification of a single 13 cM region (~ 5.1 Mbp spanning 13 physical contigs) on the short arm of Bt chromosome 3. Differential gene expression analysis, combined with sequence variation analysis between the two parental species, led to the prioritization of several candidate genes within the QTL region, some of which belong to gene families previously implicated in disease resistance.

Conclusions

Foundational genetic and genomic resources developed for Berberis spp. enabled the identification and annotation of a QTL associated with Pg-NHR. Although subsequent validation and fine mapping studies are needed, this study demonstrates the feasibility of and lays the groundwork for dissecting Pg-NHR in the alternate host of one of agriculture’s most devastating pathogens.

Background

Stem rust, caused by the fungal pathogen Puccinia graminis (Pg), has for millennia been one of the most destructive diseases of wheat and related small grains [1,2,3]. Effective control of the disease was realized in the middle of the twentieth century through the concerted development of resistant wheat varieties and the removal of Pg’s alternate host, common barberry (Berberis vulgaris L.), from major wheat growing areas [3, 4]. In the last 20 years, however, the emergence of new virulent stem rust races has rendered some long-used resistance genes ineffective [5, 6]. For example, when the wheat stem rust race Ug99 was first detected in East Africa in 1998, more than 80% of the world’s wheat germplasm was estimated to be vulnerable to its unprecedented virulence on the widely-deployed resistance gene Sr31 [7]. The rapid distribution and continued evolution of the Ug99 family of races, combined with recent stem rust outbreaks in Europe [8], underscore the need for new sources of resistance [9]. Traditionally, such new sources have been sought almost entirely from within the diverse Triticum genepool. Although translatability to wheat improvement may be less straightforward, or potentially even unachievable, a complementary approach may to look beyond this genepool for potential mechanisms of non-host resistance (NHR) to the complex Pg pathogen.

NHR is a form of resistance in which all individuals of a potential host species exhibit immunity to all individuals (e.g. races) of a potential pathogen [10]. As the most common form of disease resistance and one that possesses intrinsic durability, NHR presents a compelling strategy for achieving broad-spectrum, durable protection against many plant pathogens, including the causal organism of wheat stem rust [11, 12]. The genetic mechanisms underlying Pg-NHR remain largely unknown, especially in comparison to the relatively well-studied mechanisms of race specific and quantitative, race non-specific host resistance. Over the past decade, however, efforts have mounted to understand NHR to rust pathogens using various model and non-model plants. Many plant species, including Arabidopsis thaliana, Brachypodium distachyon, rice, barley, and cowpea [13,14,15,16,17,18], have been used to study NHR to P. striiformis f. sp. tritici, the causal organism of wheat stripe rust. In contrast, NHR to the wheat stem rust pathogen Pg has thus far been studied only in rice [13], as distinct from the studies of intermediate Pg resistance conducted in barley and B. distachyon [19, 20].

As the only globally important small grain immune to all known rust diseases, rice (Oryza spp.) presents a logical potential source of Pg-NHR genes. Genetic studies of Pg-NHR in rice are difficult, however, precisely because populations of non-hosts fail, by definition, to segregate for resistance. Although some limited progression of Pg infection has been shown in rice, thus raising the possibility of dissecting Pg-NHR in that system, the infection process exhibits little variation, requires tedious microscopic studies to characterize, and ultimately fails to complete [13]. As an alternative to rice, the Berberis-Pg system was recently proposed as a tractable pathosystem for studying the genetics of Pg-NHR [21]. Numerous species within the highly diverse Berberis, or barberry, genus are susceptible to Pg infection (e.g. European barberry B. vulgaris L., the target of massive eradication efforts from wheat-growing regions in the twentieth century) [22, 23]. Others, however, are considered non-hosts. Japanese barberry B. thunbergii DC., for example, is considered a non-host of Pg due to two lines of evidence: 1) Over nearly a century of extensive testing at the USDA Cereal Disease Lab, no Pg infection has ever been observed in the species [24,25,26,27,28,29,30,31,32,33], and 2) No Pg infection has been observed on B. thunbergii under natural conditions, despite rampant proliferation of the species in the landscape. Because hybridization between such host and non-host species are known to occur in nature (e.g. B. ×ottawensis C.K. Scheid) [34], populations of interspecific barberry hybrids present a potential means of mapping and dissecting the genetic basis of Pg-NHR.

The barberries are a compelling model for other reasons as well. Unlike rice, which has no known co-evolutionary relationship with Pg, barberries are thought to be one of the first eudicots parasitized by the rusts (Fig. 1). Indeed, multiple lines of evidence support the idea that the barberries may have played an important role in the evolution of the rust fungi. First, Berberis spp. host a wide diversity of rusts, including numerous macrocyclic, heteroecious species of Puccinia (e.g. Pg, P. striiformis, P. montanensis, P. brachypodii, P. pigmea, P. koeleriae, and P. arrhenatheri), a number of autoecious rusts (e.g. Cumminsiella spp., belonging to Pucciniaceae; Edythea spp., belonging to Uropyxidaceae; and Pucciniosira spp., belonging to Pucciniosiraceae), and even some anamorphic rusts (e.g. Acedidium and Uredo spp.). Second, only slight morphological differences exist among the teliospores of the various macrocyclic rusts [35], suggesting a single evolutionary origin of these pathogens. Third, a recent palaeobotanical finding of B. wuyunensis from a sediment layer between 55 to 65 million years ago in northeastern China suggests that the barberries are one of the earliest groups of angiosperms [36].

More specific to the grass rusts, there are eight known Puccinia spp. that complete their sexual (aecial) stage on barberry and their asexual (uredinial and telial) stages on graminaceous plants from the Poaceae family. This relationship, in combination with the relative ages of these two plant families, suggests that Puccinia spp. likely parasitized the Berberidaceae prior to their host expansion to the grasses. Today, the genus Puccinia is comprised of more than 2000 species; and within that diverse genus, host jump rather than co-speciation is believed to be the primary means of speciation [37]. As more recent examples, a host jump from Poaceae to Ranunculaceae likely produced the P. recondita complex and its aligned species, a jump to Liliaceae likely produced P. hordei and its aligned species, and a jump to Oxalidaceae likely produced P. sorghi and its aligned species. Because the relationship between barberries and the rusts likely predates such speciation (Fig. 1), it is of fundamental interest to probe the mechanism(s) of NHR exhibited by some contemporary species of barberry.

In this study, an interspecific B. ×ottawensis mapping population was created to study the inheritance of the gene(s) underlying the putative Pg-NHR of B. thunbergii. To support this work, necessary genetic and genomic resources were developed, including genetic linkage maps for the two parental species (B. thunbergii and B. vulgaris) and a chromosome-scale reference genome for B. thunbergii. This study not only establishes foundational resources for the Berberis-Pg pathosystem but also demonstrates their use in an initial dissection of Pg-NHR, with the long-term hope of contributing insight into possible novel mechanisms of durable resistance to the stem rust pathogen.

Results

Variant detection and linkage map construction

Genotyping-by-sequencing (GBS) libraries were constructed for the two parental lines (B. vulgaris accession ‘Wagon Hill’ and B. thunbergii accession ‘BtUCONN1’) and their 182 interspecific B. ×ottawensis F₁ progeny, generating a total of 60 Gb of data [~ 401 million 150-bp paired end (PE) reads]. After quality parsing and demultiplexing, an average of 3 million high quality reads per genotype were retained by the GBS-SNP-CROP pipeline [38] (Additional file 1). Using the high quality reads from the two parents, a mock reference (MR) comprised of 87,089 centroids (i.e. consensus GBS fragments) was generated, comprising a total length of approximately 15.4 Mbp.

A total of 15,411 polymorphic markers, including 14,043 SNPs (average depth D_SNPs = 41.5) and 1368 indels (D_indels = 36.4), were identified by mapping all high-quality reads from the population to the MR. A detailed account of the winnowing of these markers via a progression of filters to obtain the final sets of markers for linkage map construction is provided in Table 1. Separate genetic linkage maps were constructed for each parental species, using a two-way pseudo-testcross mapping strategy [39]. After culling individual F₁ progeny with > 30% missing data, 161 and 162 individuals were retained for B. thunbergii and B. vulgaris linkage map construction, respectively. The B. thunbergii map was constructed using a total of 1757 markers (1497 and 260 from Marker Sets 1 and 2, respectively; see Table 1), and the B. vulgaris map was constructed using a total of 706 markers (600 and 106 from Marker Sets 3 and 4, respectively). For both parental species, the remaining markers coalesced into 14 distinct linkage groups, in agreement with the reported chromosomal number in these Berberis spp. (Additional file 2: Figure S1).

Table 1 Description of the sequence of filters applied to obtain the final marker sets for linkage map construction

Full size table

Summary statistics of the two genetic linkage maps are detailed in Table 2. The B. thunbergii map consists of 598 recombination bins (i.e. mapped loci) and has a total length of 1474 cM. The numbers of bins in each of the 14 linkage groups (LGs) range from 23 (LG14) to 60 (LG2), with an average distance between adjacent bins of 2.6 cM. In comparison, the B. vulgaris map consists of 347 bins and a total length of 1714 cM. The numbers of bins in each of these 14 LGs range from 13 (LG14) to 37 (LG2), with an average distance between adjacent bins of 5.5 cM. Marker names, alleles, and genetic positions (cM), as well as a color-coded visualization of the recombination events within all members of the mapping population are provided in Additional file 3 (B. thunbergii) and Additional file 4 (B. vulgaris).

Table 2 Comparative summary statistics of the genetic linkage maps for B. thunbergii accession ‘BtUCONN1’ (Bt) and B. vulgaris accession ‘Wagon Hill’ (Bv)

Full size table

Disease phenotyping

To determine disease responses to Pg, the parents and all F₁ progeny were inoculated with basidiospores ejected from germinated teliospores produced by overwintered telia of Pg found on naturally infected Elymus repens. The progeny segregated into four clear phenotypic classes, ranging from resistant to susceptible (Fig. 2, Table 3). Disease phenotypes were successfully obtained for 153 progeny used for linkage map construction. Of those, 25 exhibited a clear resistant reaction similar to that of the B. thunbergii parent (Fig. 2c) and 61 exhibited a clear susceptible reaction similar to that of the B. vulgaris parent (Fig. 2f). Of the remaining 67 lines, 38 exhibited moderate resistance (Fig. 2d) and 29 exhibited moderate susceptibility (Fig. 2e).

Table 3 Descriptions of the disease reactions of the B. ×ottawensis progeny comprising the F₁ mapping population

Full size table

QTL analysis

To map regions associated with Pg-NHR in B. thunbergii, composite interval mapping (CIM) analysis was conducted using the linkage maps of both parents and the 4-point stem rust reaction type described above. Based on the LOD threshold score of 3.9 declared via permutation analysis, CIM analysis resulted in the identification of a single significant QTL (peak LOD value = 28.2) located 25 cM from the telomere of the short arm of B. thunbergii chromosome 3 (Fig. 3). The flanking markers for this 13 cM QTL region, hereafter referred to as QPgr-3S, were determined via a detailed characterizations of the F₁ individuals with recombination events on either side of peak QTL marker M1128. The distal flanking marker M441 is set by Pg-resistant individual WH15–192, and the proximal flanking marker M969 is set by Pg-resistant individual WH15–101 (Additional file 3). No significant QTL was detected in the B. vulgaris map.

Building a reference genome for B. thunbergii cv. ‘Kobold’

Approximately 129 Gb of sequence data was generated from 115 PacBio Single Molecule Real Time (SMRT) cells (P6-C4 chemistry on RS II), with an average read length of 10,409 bp and a read length N50 of 15,021 bp (Additional file 2: Table S1). The haploid genome size of Kobold, a widespread green-leafed B. thunbergii ornamental cultivar, was estimated to be 1.37 Gbp based on k-mer analysis and 1.72 Gb based on flow cytometry (data not shown), two values which bound the previously published B. thunbergii haploid genome size (1C) of 1.51 Gb [40]. The FALCON-Unzip pipeline [41] resulted in a 1.36 Gb assembly consisting of 4671 primary contigs with contig length N50 of 0.67 Mbp (Table 4). Their corresponding 7144 phased haplotigs had a total length of 0.88 Gb, approximately 64% of the primary contig space. Further curation, in the form of chimera breaking and cryptic haplotig identification (see Materials and Methods), resulted in a final 1.23 Gbp assembly consisting of 2698 primary contigs with contig length N50 of 0.76 Mbp (Table 4). The number of haplotigs in the final assembly increased to 8790, with a combined length of 0.99 Gb (> 80% of the primary contig space).

Table 4 Summary statistics of the B. thunbergii cv. ‘Kobold’ genome assembly, by stage

Full size table

Genome completeness and contamination analyses revealed a final genome assembly of acceptable quality, featuring complete representation of 80.9% of the BUSCO core plant gene set and only 15.1% missing BUSCO genes. 83.0% of the BtUCONN1 GBS fragments, 80.71% of the PacBio preads, and 92.2% of the RNA-seq data (in proper pair) aligned to the final assembly. After the initial FALCON-Unzip assembly, 119 primary contigs showed significant sequence similarity to plant cpDNA and mtDNA sequence; but this number dropped to only one primary contig in the final assembly as a result of intensive haplotig purging and curation.

The primary contigs from the final assembly were guided into chromosome-level scaffolds (pseudo-molecules) on the basis of three-dimensional proximity information obtained via chromosome conformation capture analysis (Hi-C) [42]. Of the 2698 primary contigs, 97% (2611 contigs, 1.20 Gbp) successfully assembled into 14 pseudo-molecules representing the 14 chromosomes of B. thunbergii, as shown in the Hi-C heatmap (Additional file 2: Figure S2). The remaining 3% (156 contigs, 33.5 Mbp) were designated as unscaffolded contigs. Detailed summary statistics of the 14 pseudo-molecules comprising the B. thunbergii cv. ‘Kobold’ reference assembly can be found in Additional file 2: Table S2.

Anchoring the genetic linkage maps to the physical assembly and assigning chromosome numbers

Using BLASTn with MR centroids as queries, the positions of the mapped GBS markers within the final Hi-C assembly were used to anchor the genetic linkage maps of both parental species to the Kobold physical map. As illustrated in Fig. 4, a very high degree of synteny is observed between the two species, with co-linearity to the Kobold physical map being 95.1 and 92.9% for the B. thunbergii and B. vulgaris linkage maps, respectively. The physical positions of a small percentage of loci in both linkage maps (3.9% in B. thunbergii and 5.1% in B. vulgaris) were ambiguous, in that they could not be assigned to unique positions in the physical assembly. Another small percentage of loci (0.93% in B. thunbergii and 1.12% in B. vulgaris) exhibited unambiguous BLAST hits to different chromosomes than in the linkage map, as indicated by dots in Fig. 4. The approximate centromere positions were visually inferred from the Hi-C heatmap (Additional file 2: Figure S2).

To assign chromosome numbers to linkage groups, the pseudo-molecules from the Kobold physical assembly were sorted, longest to shortest. The linkage group (LG) that anchored to the longest pseudo-molecule in the Kobold assembly (99.76 Mbp) was designated LG1; the next longest pseudo-molecule was designated LG2 (99.56 Mbp); and so on to LG14 (54.72 Mbp) (see Additional file 2: Table S2). Because there was perfect agreement between the number of observed linkage groups and the expected chromosome number for the species [40], LG1 was simply re-assigned as Chromosome 1 and so on.

Transcriptome assembly

A total of 59.6 Gb of data, comprised of ~ 198 million 150-bp PE reads, was obtained by sequencing a library of 10 different tissues from the B. thunbergii reference accession ‘Kobold’, including immature leaf tissues sampled at various time points following inoculation with Pg (Additional file 2: Table S3). Using the Trinity pipeline [43] and the final Kobold assembly as a guide, a 189.3 Mbp transcriptome was assembled, containing 122,872 putative transcripts and 55,186 cDNA sequences (complete ORFs) (see Table 5 for summary statistics). Quality and completeness of the transcriptome assembly were assessed via TransRate [44] and BUSCO analysis [45]. To date, a TransRate score of 0.22 exceeds 50% of the published de novo assembled transcriptomes deposited in the NCBI TSA [44]. In comparison, the TransRate score of the Kobold transcriptome is 0.40, indicating its relative quality. Completeness statistics are also acceptable, as indicated by the fact that, of the BUSCO set of 1440 core plant genes, 1286 (89.3%) were represented in the transcriptome, of which 651 (45.2%) were single copy and 635 (44.1%) were duplicated.

Table 5 Descriptive statistics of the B. thunbergii cv. ‘Kobold’ reference-guided transcriptome assembly

Full size table

Identification of candidate genes

The 13 cM QPgr-3S region was found to correspond to a 5.35 Mbp region in the physical assembly, implicating 20 contigs (length N50 = 389.7 kbp). In an effort to refine the assembly within the QTL region, these 20 contigs were locally re-assembled using canu [46], resulting in a final set of 13 contigs with a reduced total length of 5.10 Mbp and an increased contig length N50 of 508.5 kbp. Using RepeatMasker [47], 5.6% (~ 373 kbp) of the Qpgr-3S region was masked as repetitive elements using A. thaliana as the model. A total of 219 retroelements were found, of which 178 are LTRs (79 Ty1/Copia and 99 Gypsy/DIRS1) and 41 are LINEs (L1/CIN4). Another approximately 9 kbp of sequence were found to correspond to DNA transposons. Regions of simple sequence repeats occupy a total length of 130 kbp, and 32 small RNAs were found.

Functional annotation of the QPgr-3S region resulted in the identification of 576 high confidence (HC) genes. Of these, 450 were annotated based on the reference transcriptome (evidence-based) and 126 were annotated based on gene prediction models (ab initio). To help identify a short list of candidate genes potentially associated with Pg-NHR and prioritized for ongoing investigation, the list of HC genes was cross-referenced to the results of two other analyses: Differential gene expression (DGE) and presence/absence analysis (see Materials and Methods). Time course DGE analysis led to the identification of five genes (TR27614, TR9306, TR20791, TR5393, and TR12856) that express differentially under Pg inoculation (Additional file 2: Figures S3 and S4). Genes TR27614 and TR9306 exhibit a similar pattern of gradual down-regulation starting around 48 h post-inoculation (hpi). Gene TR20791 exhibits up-regulation during the first 48 hpi, followed by down regulation after 72 hpi. In contrast, genes TR5339 and TR12856 appear initially down-regulated before gradually climbing back to their original levels after 72 hpi. Presence/absence analysis identified two genes that are present in the B. thunbergii reference but appear to be either completely absent (MA26) or are missing whole exons (MA262) in B. vulgaris (Additional file 2: Figure S5). The evidence for possible absence in B. vulgaris is particularly strong with MA026 due to the high coverage of B. vulgaris reads in the immediate vicinity of the gene (Additional file 2: Figure S5).

Combined with the linkage evidence from the QTL analysis, the results of the time course DGE and presence/absence analyses elevate the seven genes identified above to a status of candidate genes associated with Pg-NHR. As such, these candidates were selected for detailed functional annotation; and orthologous sequences were found for three of them (TR20791, TR27614, and TR12856) in the UNIPROT and Phytozome databases. Specifically, gene TR20791 is associated with a dormancy-related auxin repressor protein family; TR27614 exhibits high sequence similarity with zinc finger DNA-binding proteins; and TR12856 belongs to the glutamine synthetase (glutamate-ammonia ligase activity) protein family (Additional file 5). The other four candidate genes had no hits in any public database used for functional annotation and thus are potentially Berberis-specific genes, or at the very least are novel genes previously uncharacterized in other species. As the application of next-generation sequencing has become routine in genomic studies, identification of high numbers of completely novel transcripts has been found to be common in both model and non-model species (e.g. see [48,49,50,51]).

Discussion

Genetic and genomic resource development

Familiar, commonly used mapping populations for genetic linkage map construction in plants include segregating F₂ lines, backcross populations, doubled haploids, and recombinant inbred lines. In self-incompatible perennial plant species, however, particularly those with long generation times like barberries, such typical mapping populations are difficult, if not impossible, to produce. To overcome such challenges, the so-called “pseudo-testcross” strategy was first proposed by Grattapaglia and Sederoff (1994) and successfully applied to construct a genetic linkage map in forest trees [39]. According to this strategy, a mapping population of full-sib F₁ progeny is developed by crossing two unrelated and highly heterozygous (i.e. not inbred) individuals. Gametic recombinations can be tracked in such a population because strategically-chosen sets of markers obey the segregation patterns found in typical testcrosses. The strategy has been widely used in plant species for which other approaches are unsuitable [52,53,54].

In this study, using a pseudo-testcross strategy, genetic linkage maps were developed for both B. thunbergii and B. vulgaris from a single interspecific F₁ mapping population. As a result of the stringent quality filters applied to the set of de novo GBS markers used, nearly 100% of the markers were placed successfully in the linkage maps of the two species. Although flow cytometry analysis indicates comparable genome sizes between the two parents (B. thunbergii: 1.72 Gbp; B. vulgaris: 1.69 Gbp), the total length of the BtUCONN1 (B. thunbergii) linkage map obtained in this study is roughly 15% smaller than that of the Wagon Hill (B. vulgaris) map (1474 cM vs. 1714 cM). This incongruity with the expected differences in physical genome sizes is likely due to the significantly fewer markers available for the B. vulgaris map as compared to those available for B. thunbergii (706 vs. 1757). Low marker density often results in inflated genetic distances [55], so it is expected that additional markers would reduce the overall length of B. vulgaris linkage map. The significantly lower number of markers available for B. vulgaris is likely a result of the relatively lower level of diversity observed in this species as a result of the severe genetic bottleneck presumed during its colonial introduction from Europe into North America [21].

The two linkage maps developed in this study are the first for any species within the plant order Ranunculales. The relatively even distribution of markers across the 14 chromosomes of both species permits initial QTL analysis of acceptable resolution, with approximately 87 and 65% of the inter-marker distances being less than 5 cM for B. thunbergii and B. vulgaris, respectively. In addition, the strong synteny observed between the two independent maps is strong evidence of their reliability (Fig. 4).

As a complement to genetic resources like mapping populations and linkage maps, a high-quality reference genome can serve as an invaluable resource in dissecting QTLs, identifying underlying candidate genes, and facilitating their detailed characterization. In this study, contemporary sequencing and scaffolding technologies were used to develop a highly contiguous de novo reference genome for B. thunbergii. Using PacBio SMRT sequencing and chromosome conformation capture data, a 1.2 Gb haploid assembly of B. thunbergii cv. ‘Kobold’ was successfully assembled into 14 chromosome-scale pseudo-molecules. As with the linkage maps, this reference is the first of its kind for a member of both the Berberidaceae family as well as the order Ranunculales, more broadly. Given the previous lack of molecular resources for barberries, the reference genome assembled in this study exemplifies the power of recent technologies to make rapid progress even in non-model systems and establishes a benchmark for the de novo assembly of a highly heterozygous plant species with a moderately sized genome.

In conclusion, the development of foundational genetic and genomic resources, including a genotyped interspecific mapping population, linkage maps for its two parental species, a chromosome-scale reference genome, and a multiple-tissue transcriptome establishes Berberis spp. as a viable research model for studying Pg-NHR. Furthermore, such resources promise to facilitate related endeavors, including global rust surveillance work and ornamental horticulture breeding.

QPgr-3S and the identification of candidate genes for Pg-NHR

The long-term goal of this research is to identify candidate gene(s) governing Pg-NHR in B. thunbergii. As an initial step in that direction, the genetic and genomic resources developed here enabled the identification of a single QTL of large effect (LOD > 28) on the short arm of B. thunbergii chromosome 3 (Fig. 3). This 13 cM QTL region, dubbed Qpgr-3S, was found to span 13 physical contigs and contain a total of 576 high-confidence genes. Of these, seven were short-listed as relatively high priority candidate genes for follow up studies, including three exhibiting homology to genes in public databases, including dormancy-associated auxin repressor proteins (TR20791), zinc ion binding proteins (TR27614), and glutamine synthetase proteins (TR12856).

The current model of disease resistance suggests that plant immune responses can be grouped broadly into two major classes, namely pre-invasion defense triggered by pathogen-associated molecular patterns (PAMP-triggered immunity) and post-invasion defense triggered by pathogen effectors (effector-triggered immunity) [56, 57], both of which have been shown to implicate a wide range of defense-related proteins. Three of the seven candidate genes identified here in this study exhibit homology to gene families implicated in disease resistance in the literature. For example, auxin is known to function as a modulator of salicylic acid, a phyto-hormone essential to the induction of systemic acquired resistance in plants [58]; zinc finger transcription factors have been implicated in the regulation of a gene affecting rust germ tube differentiation [59]; and glutamine synthetase proteins are known to play key roles in plant defense against pathogens via amino acid metabolism [60].

The identification of both the QPgr-3S region and a set of high-priority candidate genes demonstrates the utility of the genetic and genomic resources developed in the study to probe the genes underlying Pg-NHR exhibited by B. thunbergii. Such results, however, are but the first step toward identifying the genes governing Pg-NHR; and further work is required to validate and dissect the QTL region, in addition to testing candidate gene hypotheses.

Possible modes of inheritance of Pg-NHR

From the practical standpoint of breeding for improved resistance to wheat stem rust, the central questions regarding Pg-NHR concern the nature and modes of inheritance of the underlying genes. As previously observed in a natural interspecific barberry hybrid population [21], F₁ interspecific hybrids exhibit a range of reactions to Pg, from fully resistant to fully susceptible, with various intermediate forms. This range of reactions was similarly observed in the F₁ mapping population developed for this study (Fig. 2c-f and Table 3). If one assumes that the Pg-resistance in B. thunbergii is governed by a single gene, independent assortment during meioses would invariably result in homozygous Pg-susceptible B. thunbergii progeny. To date, however, no accession of B. thunbergii has exhibited such susceptibility, despite extensive investigation (see Background); thus a single gene governing the Pg-resistance in B. thunbergii is unlikely. Polygenic NHR has been suggested in other studies as well, including rice NHR to wheat stem rust and barley NHR to powdery mildews, oat stem rust, and other non-adapted rust species [19, 61, 62].

If indeed the QPgr-3S region plays a role in Pg-NHR, the data suggest that its underlying gene(s) are necessary but not sufficient for resistance. In other words, this study at most provides a first insight into a larger gene network regulating Pg-NHR in B. thunbergii. Indeed, in light of the lack of segregation in the non-host parental species B. thunbergii, the segregation of resistance among F₁ hybrids suggests the possible existence of some critical gene(s), by definition fixed within the B. thunbergii genepool, upstream of QPgr-3S. Because of their fixed state within B. thunbergii, such gene(s) cannot be mapped in an F₁ population; but if recessive, their single dosage in an F₁ would permit susceptibility to Pg, thus allowing the detection of background resistance genes (e.g. QPgr-3S). In all likelihood, QPgr-3S is not a critical region conferring Pg-NHR but is rather a region contributing to Pg resistance. Strategic crosses among the F₁ progeny and/or backcrosses to B. thunbergii will be necessary to test this hypothesis and identify those critical gene(s) regulating Pg-NHR in B. thunbergii, work shown to be feasible by the current study.

Conclusions

In this paper, we report the development of publicly-available foundational genetic and genomic resources for the novel Berberis-Pg pathosystem, including the first genetic maps for two Berberis species (B. thunbergii and B. vulgaris), a chromosome-scale reference genome for B. thunbergii, and a related transcriptome to facilitate the characterization of genetic mechanism(s) of Pg-NHR. Future work should focus on the validation, further characterization, and dissection of the identified QTL, including testing of candidate gene hypotheses. Beyond this, now that the Berberis-Pg pathosystem has been shown to be a viable means of probing the mechanism of Pg-NHR in B. thunbergii, future work must also wrestle with the significant question of potential translatability of such resistance to wheat. Such translatability is certainly not a given, particularly in light of the fact that the infecting spores are different for Berberis (basidiospores) and grass (urediniospores) hosts. However, because the two life stages in question belong to the same pathogenic organism and because Berberis is the likely ancestral host of that organism prior to its host expansion to the grasses (see Background), the possibility exists that the mechanism of Pg-NHR in B. thunbergii may provide relevant insight into breeding durable resistance in wheat. With this study, the foundation is laid to eventually answer this question.

Methods

Mapping population development

A B. ×ottawensis mapping population consisting of 182 F₁ individuals was derived from an interspecific cross between B. thunbergii accession ‘BtUCONN1’ (pollen parent) and B. vulgaris accession ‘Wagon Hill’ (female parent). True to its species, BtUCONN1 is a non-host to the stem rust pathogen and is a small shrub (0.5–2.5 m tall) that displays 1.3–3.8 cm long entire leaves and 1–2 cm long inflorescences with few umbellate but mostly solitary flowers. In contrast, Wagon Hill is susceptible to stem rust and is a relatively taller shrub (~ 3 m tall) that displays 2–5 cm long obovate to obovate-oblong leaves with highly serrated margins (> 50 serrations) and has 5–8 cm long pendant racemes of bright yellow flowers. The pollen parent BtUCONN1 was a feral plant maintained in the barberry collection at the research farm of the University of Connecticut (N41°47′40.63″, W072°13′39.61″), and the female parent Wagon Hill is a feral plant growing along the shoreline of the Great Bay Estuary in Durham, New Hampshire (N43°07′30.64″, W70°52′17.95″).

To make the interspecific cross, pollen was harvested from mature flowers of BtUCONN1 using the previously described N-pentane method [63] and stored at 4 °C until flowers of Wagon Hill reached reproductive maturity. Emasculation and hand pollination of female flowers were performed at the so-called balloon stage, when the petals begin to part slightly at the top, giving the appearance of an inflated balloon prior to opening. To break dormancy before sowing, seeds from successful crosses were stratified in wet sand in a petri dish at 4 °C for three months. Propagated cuttings of the two parents were maintained along with the F₁ mapping population in plastic pots (11.5 cm diameter; 6.5 cm tall) filled with PRO-MIX HP growth media in the Macfarlane Greenhouse facility at the University of New Hampshire.

To verify the putative F₁ status of the individuals in the mapping population, a PCR-based species-specific marker was designed based on available GBS data [21]. A universal primer pair was designed to amplify a short genomic sequence exhibiting a length polymorphism between the two parents. Specifically, the primers (F: 5′-CCTGATTGGGGCTCATTATC-3′; R: 5′-AGTGAGGAATTCCGAGCTGA-3′) amplified a 208 bp fragment in Wagon Hill but only a 195 bp fragment in BtUCONN1, due to the presence of a 13 bp indel (see Additional file 6: Text S1). PCR was conducted with a total reaction volume of 20 μl (0.25 mM of each primer, 100 μM of each dNTP, 0.75 U Taq DNA Polymerase, 10x standard Taq buffer, and 100 ng of template DNA) subjected to the following cycling conditions: 5 mins at 94 °C; 32 cycles of 30 s at 94 °C, 30 s at 52 °C, and 15 s at 68 °C; and 5 mins at 68 °C. Amplified products were separated on a 3% TBE/EtBr agarose gel for 60 min at 75 V and imaged with UV transillumination. The F₁ status of a putative hybrid individual was considered validated if both bands from the two parental species were detected (Additional file 2: Figure S6).

Genotyping and variant detection

Genomic DNA of the 182 verified F₁ individuals and both parents was extracted from ~ 100 mg of lyophilized leaf tissue using a modified CTAB method [64]. Prior to GBS library preparation, isolated DNA was purified using Zymo Research’s Genomic DNA Clean & Concentrator™^-10 column (Catalog # D4011), following manufacturer’s protocol. Reduced representation libraries were constructed using the two-enzyme (PstI-MspI) GBS protocol described by Poland et al. [65] and sequenced via 150 bp paired-end (PE) reads on an Illumina HiSeq 2500 at the Hubbard Center for Genome Studies, UNH.

Raw FASTQ files were generated by CASAVA 1.8.3 and analyzed using the reference-free bioinformatics pipeline GBS-SNP-CROP [38, 66]. A Mock Reference (MR) was constructed using the high quality PE reads from the two parents; and putative variants, both SNPs and indels, were identified via alignment of high quality PE reads from the parents and all F₁ progeny to the MR, following the pipeline’s recommended parameters for diploid species. Complete details of the GBS-SNP-CROP command lines used in this analysis, including all specified pipeline parameters, are provided in Additional file 6: Text S2.

Genetic linkage map construction

The sequence of filters applied to obtain the final sets of markers for linkage map construction is summarized in Table 1. In short, a marker was culled if it met any of the following criteria: 1) It was unscored for more than 30% of the individuals in the population; 2) It was heterozygous for both parents; 3) It failed to segregate in the population (i.e. all progeny were heterozygous for the marker); 4) Its mean ratio of primary to alternate allele depth deviated significantly from the expected ratio of 1:1; and/or 5) Its segregation ratio deviated significantly from the expected ratio of 1:1, according to its marker class. As a final filter, genotypes with > 30% missing data were removed.

Linkage analysis was performed using the R package ONEMAP v2.0–4 [67], and separate linkage maps were constructed for the two parents according to a two-way pseudo-testcross mapping strategy [30]. The BtUCONN1 linkage map was constructed using Marker Sets 1 and 2, while the Wagon Hill map was constructed using Marker Sets 3 and 4 (see Table 1). For each map, a two-point test was first performed for all marker pairs, using a minimum LOD score of 4 and a maximum recombination fraction of 0.25 to group markers into linkage groups (LGs). Next, markers within each LG were ordered using the ‘try’ algorithm within ONEMAP.

To identify potential genotyping errors, common in GBS data [68], maps were manually inspected for the presence of singletons (apparent double crossovers) [69], which were replaced with missing values. If multiple markers were found to map to the same genetic bin, a consensus of the set of markers was chosen to represent the linkage bin for final mapping iterations, which were made until no alternative orders were generated by the ‘ripple.seq’ function. Final map distances were calculated with the Kosambi mapping function [70], and ideograms were generated using Mapchart 2.0 [71].

Stem rust disease phenotyping

To determine disease responses, the parents and all F₁ individuals in the mapping population were inoculated with basidiospores ejected from germinated teliospores produced by Pg telia found on naturally-infected Elymus repens, as previously described [21]. The pollen parent BtUCONN1 exhibits the clear non-host reaction typical of B. thunbergii. In contrast, the female parent Wagon Hill exhibits the clear susceptible reaction of B. vulgaris, with well-developed mature aecia visible on the abaxial surfaces of leaves. Images of typical reactions of the parents and of individuals in the F₁ mapping population are presented in Fig. 2. As detailed in Table 3, a 4-point scale was developed in response to the particular segregating characteristics observed in this population. The levels of this scale are based on the following symptoms: 1) Degree of flecking; 2) Presence and intensity of necrotic lesions; and 3) Presence and density of pycnia and aecia. All plants were scored for reaction to stem rust 14 days after inoculation.