Grain protein content variation and its association analysis in barley
© Cai et al.; licensee BioMed Central Ltd. 2013
Received: 27 August 2012
Accepted: 27 February 2013
Published: 3 March 2013
Grain protein content (GPC) is an important quality determinant for barley used for malt, feed and food. It is controlled by a complex genetic system. GPC differs greatly among barley genotypes and also varies across environments. It is imperative to understand the genetic control of barley GPC and to identify genotypes whose GPC varies little across environments.
In this study, 59 cultivated and 99 Tibetan wild barley genotypes were used for a genome-wide association study (GWAS) and a multi-platform candidate gene-based association analysis, in order to identify the molecular markers associated with GPC. Tibetan wild barley had higher GPC than cultivated barley. The significant correlations of GPC with diastatic power (DP) and malt extract confirmed the importance of GPC in determining malt quality. Diversity arrays technology (DArT) markers associated with barley GPC were detected by GWAS. In addition, GWAS revealed two HvNAM genes as the candidate genes controlling GPC. No association was detected between HvNAM1 polymorphism and GPC, while a single nucleotide polymorphism (SNP) (798, P < 0.01), located within the second intron of HvNAM2, was associated with GPC. There was a significant correlation between haplotypes of HvNAM1, HvNAM2 and GPC in barley.
The GWAS and candidate gene-based association study may be effectively used to determine the genetic variation of GPC in barley. The DArT markers and the polymorphisms of HvNAM genes identified in this study are useful for developing high quality barley cultivars in the future. HvNAM genes could play a role in controlling barley GPC.
Grain protein content (GPC) is an important quality determinant in cereal crops. In barley, GPC is closely associated with feed and malt quality. Higher protein content is favorable for feed quality, while lower or moderate protein content is expected for malt barley. GPC affects malting quality in many ways, including yeast nutrition, haze formation in beer and enzyme activities [1, 2].
Barley GPC is under polygenic control, with many quantitative trait loci (QTLs) having been mapped on all seven chromosomes, mainly on 2H, 4H, 5H and 6H [3, 4]. All these loci were determined by QTL mapping. Recently, genome-wide association studies (GWAS) have been developed to dissect a variety of complex traits in plants [5, 6]. GWAS has an advantage over conventional QTL mapping in that it can be performed on a large number of genotypes. In contrast, a population used for conventional QTL mapping is developed from a bi-parental cross, which allows detection of only a subset of the loci/alleles within a species and offers limited resolution because of insufficient recombination between linked genetic loci. Hence, GWAS may capture wider genetic variation and provide higher mapping resolution for phenotypes and traits at the population level than conventional QTL mapping. In barley, seven malt quality traits and some important agronomic traits have been effectively analyzed using GWAS [7–9].
The Qinghai-Tibet Plateau, considered one of the centers of origin of cultivated barley, is rich in barley germplasm. The polymorphism information content (PIC) value of Tibetan wild barley is higher than that of Chinese landraces according to analysis of SSR markers, and the wild barley has more unique alleles than the cultivated barley [11–13]. Thus, Tibetan wild barley is assumed to have wider variability in the genes controlling GPC [11–13]. Therefore, a population composed of Tibetan wild barley and cultivated barley from around the world could provide high resolution for GWAS of barley GPC.
A wheat QTL controlling GPC, named Gpc-B1, was cloned, and a transcription factor (NAM-B1) was found to affect GPC by regulating senescence and protein remobilization. Two orthologous genes of TtNAM-B1 (Genbank accession numbers DQ869678 and DQ869679) were identified in barley on chromosomes 6H and 2H, respectively. Single nucleotide polymorphism (SNP) analysis showed that allelic variation of the NAM-1 gene could be associated with GPC variation within the genus Hordeum. GPC variation among barley cultivars or species could be attributed to differences in the expression of HvNAM-1 or other genes. However, little research has been done on barley HvNAM2 to date, except that its sequence has been published.
The objectives of the current study are (1) to examine the correlation between GPC and malt quality; (2) to identify molecular markers associated with GPC in a barley mapping population by GWAS and determine the candidate genes controlling GPC; and (3) to analyze the association between HvNAM genes and GPC.
A collection of 158 barley accessions was used for association mapping and GPC analysis. These accessions included 59 barley cultivars (H. vulgare L.) from different areas of the world and 99 Tibetan wild barley accessions (H. spontaneum L.). All cultivars and accessions were planted at the Huajiachi campus of Zhejiang University (Hangzhou, China, 120.0°E, 30.5°N) in the early winter of 2008 and 2009. Each accession was sown in a two-row plot, with rows 2 m long and 0.24 m apart, and 40 seeds were planted in each row. All plots were supplied with 150 kg/ha of N, including 40 kg/ha of N as compound fertilizer applied before seeding and 110 kg/ha of N as urea supplied in equal amounts at the two-leaf stage and the booting stage. In addition, 180 kg/ha of potassium chloride was applied prior to seeding. The experiments were arranged in a block design with two replications. In each block, the 158 barley accessions were arranged randomly. All other agronomic management, including weed and disease control, was the same as that applied locally. At the seedling stage, leaves of each genotype were collected for DNA extraction. The harvested seeds were stored at 4°C prior to malting. GPC and malt quality were measured for all samples, with three measurements per sample.
Mature grains were ground in a Cyclotec 1093 sample mill (Tecator AB, Hoganas, Sweden) and passed through a 0.5 mm screen. GPC was measured using the Kjeldahl method. Protein content was calculated by multiplying the N content by a factor of 6.25.
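The nitrogen-to-protein conversion described above can be sketched as follows. This is a minimal illustration, not the authors' script: the function names and the example nitrogen readings are hypothetical, and only the factor of 6.25 and the use of three measurements per sample come from the text.

```python
# Conversion factor stated in the text: protein content = N content x 6.25.
N_TO_PROTEIN = 6.25


def protein_content(nitrogen_percent, factor=N_TO_PROTEIN):
    """Return protein content (%) from Kjeldahl nitrogen content (%)."""
    if nitrogen_percent < 0:
        raise ValueError("nitrogen content cannot be negative")
    return nitrogen_percent * factor


def mean_protein(nitrogen_readings, factor=N_TO_PROTEIN):
    """Average the protein content over replicate nitrogen readings."""
    if not nitrogen_readings:
        raise ValueError("at least one reading is required")
    values = [protein_content(n, factor) for n in nitrogen_readings]
    return sum(values) / len(values)


if __name__ == "__main__":
    # Hypothetical triplicate N readings (%) for one grain sample.
    readings = [1.82, 1.84, 1.86]
    print(f"GPC = {mean_protein(readings):.2f}%")
```

Under these assumed readings, a mean N content of 1.84% corresponds to a GPC of about 11.5%.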
Grain samples (around 200 g) were micro-malted in a Micro-malting Apparatus (Phoenix System, Adelaide, Australia) using the following regime: 6 h steep, 14 h air-rest, 8 h steep, 14 h air-rest and 4 h steep, followed by 96 h germination – all performed at 15°C. The malts were then kilned at 65°C for 24 h, de-rooted and milled using a Tecator Cyclone mill fitted with a 0.5 mm screen. The soluble and total protein contents (SPC and TPC) in malt and the malt quality parameters (malt extract, Kolbach index, viscosity and DP) were determined according to the Analytica EBC Official Methods (European Brewery Convention, 1975).
Genomic DNA samples from young leaves of the barley seedlings were isolated as described by Uzunova et al. In brief, the leaf tissues were ground, and the resulting powder was re-suspended in CTAB (hexadecyl trimethylammonium bromide) buffer (pH 5.0). To purify the DNA, insoluble particulates were removed by centrifugation. DNA was precipitated from the aqueous phase and washed thoroughly to remove contaminating salts.
Whole-genome DArT profiling of all the DNA samples was performed using the Barley PstI (BstNI) version 1.7 array at Diversity Arrays Technology Pty Ltd in Australia. There are around 1,500 DArT markers that are polymorphic in a wide range of barley cultivars, and 1,000 markers detected in wild barley accessions (http://www.triticarte.com.au/content/barley_diversity_analysis.html). Among the 1,576 reported markers and 1,319 polymorphic DArT markers, those with P value < 0.05 were used in the current study.
Primer pairs were designed using Primer3 based on the HvNAM1 and HvNAM2 sequences (Genbank accession numbers DQ869678 and DQ869679, NCBI). The primers 5′-atgggcagcccggactcatcctcc-3′ and 5′-tacagggattccagttcacgccggat-3′ were used to amplify HvNAM1, and 5′-atgggcagctcggactcatcttcc-3′ and 5′-tcagggattccagttcacgccgga-3′ were used to amplify HvNAM2. The PCR reaction mixture contained 20 mM Tris–HCl, 50 mM KCl, 2 mM MgCl2, l M of each dNTP, 5 pmol of each primer, 50–100 ng of genomic DNA and one unit of Taq DNA polymerase (Major-bio, Shanghai, China). The reaction was initially denatured at 95°C for 5 min, followed by 35 cycles of 95°C for 45 s, 60°C for 45 s and 72°C for 1.5 min, and was terminated with a final step at 72°C for 10 min. The BigDye Terminator v3.1 cycle sequencing kit (Applied Biosystems, Foster City, CA, USA) was used for sequencing. The complete gene sequence was analyzed using Bioedit software (http://www.mbio.ncsu.edu/bioedit/bioedit.html).
Pearson correlation analysis was conducted between GPC, SPC, TPC and malt quality parameters using SPSS 13.0 and SigmaPlot 10.0. Alignment of all the sequences was performed with ClustalW. Genetic diversity was examined with 1319 barley DArT markers randomly distributed over the genome, genotyped at Diversity Arrays Technology Pty Ltd, Australia. The genetic polymorphism data from the 1319 DArT markers were used to detect population structure with STRUCTURE software version 2.3.3, using an admixture model and five independent replicates of 100,000 Markov Chain iterations [22, 23]. K values ranging from 1 to 10 were tested with a burn-in of 100,000 iterations and 100,000 Markov Chain Monte Carlo (MCMC) iterations according to the software’s instructions. The effect of population structure on GPC was tested using SAS GLM (SAS Institute, Cary, North Carolina, USA). The model included the components of the Q matrix obtained with STRUCTURE 2.2.3, which was used to illustrate population structure. R2 (variance explained by the model) was considered an estimate of the proportion of phenotypic variation explained by population structure. Principal component analysis (PCA) was performed on the genotype data derived from the 1319 DArT markers, which were first standardized, using Unscrambler 9.7 (CAMO PROCESS AS, Oslo, Norway). TASSEL 2.01 was used to calculate linkage disequilibrium (LD) based on the parameter r2, which measures the correlation between a pair of variables. The pair-wise relationship matrix (K-matrix), which was further employed for population correction in the association models, was calculated from the 1319 DArT markers using TASSEL 2.01. The two-year GPC data were averaged for further association analysis. The structure-based association analysis with a K-matrix between DArT markers, HvNAM genes and GPC was carried out using TASSEL 2.01.
Association between DArT markers and the total trait variation was tested using a mixed linear model (MLM), as implemented in TASSEL 2.01. The P values were adjusted by permutation test using the step-down MinP procedure implemented in TASSEL 2.01. An adjusted P value < 0.05 or < 0.01 was used as the criterion for association. The Manhattan plot of DArT markers and P values was drawn with R version 2.14.2 (http://www.r-project.org/). The association map was constructed using MapDraw version 2.1.
Sequences of HvNAM1 and HvNAM2 were aligned using VectorNTI 10.0 (Invitrogen Corporation, Carlsbad, USA) or CLC main workbench 5 (CLC bio, Aarhus, Denmark), and alignments were edited manually using the BioEdit software. Haplotypes were inferred using TASSEL 2.01. One barley accession carried a rare haplotype and was excluded from further analysis. GPC variation among the 59 cultivated and 99 Tibetan wild barley accessions, grouped according to their HvNAM haplotypes, was analyzed using SAS 9.0 (SAS Institute, Cary, North Carolina, USA). For further association analysis between haplotype and GPC in the total 158 accessions, SAS 9.0 was used to conduct analysis of variance (ANOVA) and multiple comparisons based on least significant differences (LSD), with mean differences considered significant at the 0.05 level.
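As an illustration of the r2 statistic mentioned above, the sketch below computes LD between two presence/absence markers as the squared correlation of their 0/1 scores. It is a simplified, assumed version of what LD software reports, not the TASSEL implementation; the function name and the marker vectors are hypothetical, and missing scores are represented as None.

```python
def ld_r2(marker_a, marker_b):
    """Squared correlation (r2) between two 0/1-scored markers.

    Accessions with a missing score (None) at either marker are skipped.
    Returns NaN when either marker is monomorphic in the retained set.
    """
    pairs = [(a, b) for a, b in zip(marker_a, marker_b)
             if a is not None and b is not None]
    n = len(pairs)
    if n == 0:
        raise ValueError("no accessions scored at both markers")
    p_a = sum(a for a, _ in pairs) / n          # frequency of '1' at marker A
    p_b = sum(b for _, b in pairs) / n          # frequency of '1' at marker B
    p_ab = sum(1 for a, b in pairs if a == 1 and b == 1) / n
    d = p_ab - p_a * p_b                        # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        return float("nan")
    return d * d / denom


if __name__ == "__main__":
    # Hypothetical scores for eight accessions at two DArT markers.
    m1 = [1, 1, 1, 0, 0, 0, 1, None]
    m2 = [1, 1, 0, 0, 0, 0, 1, 1]
    print(round(ld_r2(m1, m2), 3))
```

Values close to 1 indicate that the two markers are almost always inherited together in the panel, while values near 0 indicate independence.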
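For orientation, a mixed linear model that corrects for both population structure (Q) and relatedness (K) is commonly written in the following form. The text does not give the equation, so the notation here is an assumption rather than the exact parameterization used by the software, and scaling conventions for the K-matrix differ between implementations.

```latex
\[
\mathbf{y} = \mathbf{X}\boldsymbol{\beta} + \mathbf{Q}\mathbf{v} + \mathbf{Z}\mathbf{u} + \mathbf{e},
\qquad
\mathbf{u} \sim N\!\left(\mathbf{0},\, \sigma_g^{2}\mathbf{K}\right),
\quad
\mathbf{e} \sim N\!\left(\mathbf{0},\, \sigma_e^{2}\mathbf{I}\right)
\]
```

Here y is the vector of GPC values, β the fixed effect of the marker being tested with design matrix X, v the fixed effects of the subpopulation memberships in the Q matrix, u the random polygenic effects with incidence matrix Z and covariance proportional to the pair-wise relationship matrix K, and e the residual.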
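The haplotype-based ANOVA can be illustrated with a short sketch that computes the one-way F statistic and the share of GPC variance explained by haplotype group. This is not the SAS procedure used in the study; the function name and the GPC values are hypothetical, and the LSD comparisons are not reproduced.

```python
def one_way_anova(groups):
    """Return (F statistic, proportion of variance explained) for a
    mapping of group label -> list of trait values."""
    values = [v for g in groups.values() for v in g]
    n, k = len(values), len(groups)
    if k < 2 or n <= k:
        raise ValueError("need at least two groups and n > number of groups")
    grand_mean = sum(values) / n
    ss_between = 0.0
    ss_within = 0.0
    for g in groups.values():
        mean_g = sum(g) / len(g)
        ss_between += len(g) * (mean_g - grand_mean) ** 2
        ss_within += sum((v - mean_g) ** 2 for v in g)
    if ss_within == 0:
        raise ValueError("no within-group variation; F is undefined")
    f_stat = (ss_between / (k - 1)) / (ss_within / (n - k))
    r_squared = ss_between / (ss_between + ss_within)
    return f_stat, r_squared


if __name__ == "__main__":
    # Hypothetical GPC values (%) grouped by haplotype.
    gpc_by_haplotype = {
        "hap1": [11.2, 11.8, 12.1, 11.5],
        "hap2": [12.9, 13.4, 13.1],
        "hap3": [14.0, 13.6, 14.3],
    }
    f_stat, r2 = one_way_anova(gpc_by_haplotype)
    print(f"F = {f_stat:.2f}, variance explained = {r2:.1%}")
```

The proportion returned here (between-group sum of squares over total sum of squares) is one common way to express how much trait variance a haplotype classification explains.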
The values of GPC, SPC and TPC between 2008 and 2009 were significantly and positively correlated (R2 = 0.4435** for GPC; R2 = 0.3937** for SPC; R2 = 0.3937** for TPC) (Figure 3), while the Kolbach index data of 2008 could account for 55.11% of the variation in 2009. This suggests that GPC, SPC, TPC and Kolbach index are mainly controlled by genetic factors and are also affected by environmental variation.
The correlations between SPC (soluble protein content in malt), TPC (total protein content in malt), GPC (grain protein content), Kolbach index, DP (diastatic power), malt extract and viscosity
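The between-year consistency reported above rests on the squared Pearson correlation of the two seasons' values. A minimal sketch of that calculation is given below; the function name and the paired GPC values are hypothetical and are not taken from the study's data.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


if __name__ == "__main__":
    # Hypothetical GPC (%) of the same six accessions in two seasons.
    gpc_2008 = [10.8, 12.1, 13.5, 11.9, 14.2, 12.7]
    gpc_2009 = [11.4, 11.9, 13.0, 12.8, 13.6, 12.2]
    r = pearson_r(gpc_2008, gpc_2009)
    print(f"r = {r:.3f}, R2 = {r * r:.3f}")
```

R2 is read as the fraction of the variation in one year that is accounted for by the values of the other year, which is how the 55.11% figure for the Kolbach index is expressed in the text.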
Generally, a stringent model may cause less spurious background association. In the current study, the structure-based association analysis with a K-matrix was calculated using TASSEL 2.01.
The comparison between previously published and newly identified molecular markers in this study (table body not recovered; surviving column header: proximal markers in this study)
The structure and nucleotide diversity of HvNAM genes in cultivated and Tibetan wild barleys (table body not recovered; surviving row label: Tibetan wild barley)
The amplified HvNAM2 gene was 1528 bp long and contained 3 exons and 2 introns, with a NAM super-family domain between amino acids 28 and 157. The polypeptide sequence of HvNAM2 showed 80% identity to that of HvNAM1. Eight SNPs were located at bases 307, 732, 798, 962, 979, 991, 1034 and 1289 in Tibetan wild barley, while 4 polymorphisms were present at bases 307, 798, 979 and 991 in the cultivated barley. Among these SNPs, SNP307 and SNP798 were within introns, while the others were within the coding sequence. SNP732, SNP979, SNP1034 and SNP1289 led to amino acid substitutions, specifically the replacement of Arg, Ala, Ser and Asn with Lys, Thr, Ala and Tyr, respectively. Six haplotypes could be classified according to the polymorphisms among cultivated and Tibetan wild barley. Moreover, one haplotype in cultivated barley and 2 haplotypes in Tibetan wild barley were unique (Table 3). The presence of new polymorphisms in Tibetan wild barley indicated that it could provide a new genetic resource for the genetic improvement of barley. However, only one SNP (798, P < 0.05), located within the second intron of HvNAM2 (Figure 6 and Additional file 7: Figure S5), was associated with GPC as determined in two consecutive years in the 59 cultivated and 99 Tibetan wild barley genotypes. Moreover, in order to analyze the effect of HvNAM2 haplotypes on GPC in barley, one haplotype represented by a single accession was excluded from the six haplotypes of HvNAM2. The haplotypes of HvNAM2 explained 7.2% of the GPC variance in our population. Haplotype 3 of HvNAM2 showed higher GPC, while haplotype 5 had the lowest GPC in both years (Figure 7).
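The grouping of accessions into haplotypes, and the exclusion of a haplotype carried by a single accession, can be sketched as below. The study inferred haplotypes with TASSEL, so this is only an assumed simplification that treats each distinct combination of SNP alleles as one haplotype; the accession names and allele calls are hypothetical, and only three of the SNP positions from the text are used for brevity.

```python
from collections import defaultdict

# Three of the HvNAM2 SNP positions named in the text (illustrative subset).
SNP_POSITIONS = (307, 798, 979)


def group_haplotypes(snp_calls):
    """Map each distinct allele combination to the accessions carrying it.

    snp_calls: dict of accession name -> sequence of alleles, ordered as
    in SNP_POSITIONS.
    """
    groups = defaultdict(list)
    for accession, alleles in snp_calls.items():
        groups[tuple(alleles)].append(accession)
    return dict(groups)


def drop_rare(groups, min_accessions=2):
    """Remove haplotypes carried by fewer than min_accessions accessions."""
    return {hap: accs for hap, accs in groups.items()
            if len(accs) >= min_accessions}


if __name__ == "__main__":
    # Hypothetical allele calls; not data from the study.
    calls = {
        "acc01": "CGA",
        "acc02": "CGA",
        "acc03": "TGA",
        "acc04": "TAG",
        "acc05": "TGA",
    }
    haplotypes = group_haplotypes(calls)
    kept = drop_rare(haplotypes)
    for hap, accs in kept.items():
        print(dict(zip(SNP_POSITIONS, hap)), accs)
```

Haplotypes retained after this filter are the groups that would then be compared for GPC.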
Barley used for malting should have a GPC lower than 11.5%. GPC is influenced to a large extent by both genotype and environment [31, 32]. In the current study, phenotyping of the diversity panel provided some valuable information about the range and distribution of GPC in barley. Genotype and environment interactions are indeed apparent when comparing the GPC data over the two consecutive years. Our results showed that some Tibetan wild accessions with higher GPC could be useful for breeding both feed and food barley cultivars. Although there were significant differences in GPC, SPC and TPC among genotypes over the two consecutive years, the traits were mainly controlled by genetic factors as indicated by their high consistency over the two years.
A negative correlation between GPC and malt extract and a positive correlation between GPC and DP have been reported. Similarly, in the current study, we found that TPC was negatively correlated with malt extract and positively correlated with DP. Interestingly, SPC was correlated with all malt quality parameters except malt extract. Clearly, the protein content in both grain and malt is closely related to malt quality. Therefore, it is imperative to develop barley varieties with stable GPC in malt barley breeding.
The advantages of GWAS over conventional QTL mapping based on a population from a bi-parental cross have been confirmed. Compared with QTL mapping, GWAS increases the range of natural variation that can be surveyed in a single experiment and the number of significant regions that are likely to be identified. Hence, GWAS could provide higher resolution than QTL mapping, and facilitate fine-mapping and gene discovery. The materials used in our GWAS included 59 worldwide cultivated and 99 Tibetan wild barley accessions, covering representative accessions from most of the barley-growing regions in the world.
According to the correlation analysis, GPC was mainly controlled by genetic factors and was also affected by environmental variation. However, a stringent criterion for significance may bias studies against detection of causal associations that show significant genotype-environment interactions. Thus, we chose 0.01 and 0.05 as the thresholds for association analysis, in order to detect possible markers associated with GPC. As a result, GWAS identified as many as 5, 7, 6, 5, 6 and 8 loci associated with barley GPC on chromosomes 1H, 2H, 3H, 5H, 6H and 7H, respectively. These results showed that many more molecular markers associated with GPC could be detected by GWAS than by conventional QTL mapping.
In addition to the discovery of the DArT markers for GPC, the completion of the association map for GPC is a significant step towards the cloning of GPC-related genes. The identified markers for GPC will be very useful in the evaluation and screening of barley accessions with reasonable GPC. In comparison with previous studies [1, 4, 15, 31], we found more markers in this study, including 3, 3, and 1 marker(s) on chromosomes 6H, 2H and 5H, respectively (Table 2). Three major QTLs were identified on chromosomes 6H and 2H using a barley mapping population developed from a cross between ‘Karl’, a low grain protein six-rowed variety, and ‘Lewis’, a high grain protein two-rowed variety. The three QTLs could explain 56% of the total heritable variance of GPC. Two of them were identified as the HvNAM1 and HvNAM2 genes in barley, the homologs of a NAC transcription factor (NAM-B1) that increases GPC by regulating senescence in wheat. Therefore, we considered HvNAM1 and HvNAM2 as the candidate genes controlling GPC. Because gene-targeted association is effective for identifying SNP markers for use in barley, the association between the two candidate genes, HvNAM1 and HvNAM2, and GPC was analyzed in order to examine the genetic architecture of GPC and to identify GPC loci in barley.
Jamar et al. found that allelic variation of the functional NAM-1 gene could be associated with GPC variation within the genus Hordeum, and the 13 genotypes used in their study could be classified into three haplotypes: 11 European varieties of H. vulgare were grouped as haplotype 1, while one H. spontaneum (Hs) and one Hordeum bulbosum (Hb) were classified as haplotype 2 (Genbank accession number EU908210) and haplotype 3 (Genbank accession number EU908211), respectively. By comparison with the reference sequence (DQ869678), 3 SNPs were identified at bases 355, 483 and 554 of HvNAM1. However, we did not identify these SNPs in the current study. Instead, we found 3 SNPs located at bases 234, 544 and 1433 in the cultivated barley and 3 SNPs at bases 544, 1190 and 1427 in Tibetan wild barley. No association was detected between the individual polymorphisms of HvNAM1 and GPC; however, there was a significant correlation between HvNAM1 haplotypes and GPC. Moreover, eight SNPs within HvNAM2 were located at bases 307, 732, 798, 962, 979, 991, 1034 and 1289 in the Tibetan wild barley, but only 4 SNPs were present at bases 307, 798, 979 and 991 in the cultivated barley. Interestingly, a single SNP (798, P < 0.05) within the HvNAM2 gene, located in the second intron, was associated with GPC. To gain further insight, the correlation between HvNAM2 haplotypes and GPC was analyzed in barley. The DArT markers close to HvNAM1 and HvNAM2 explained 18% and 6.4% of GPC variance, while the haplotypes of HvNAM1 and HvNAM2 accounted for 20.6% and 7.2% of GPC variance, respectively. The comprehensive analysis, including the primary GWAS, the colinearity of the NAM locus between barley and wheat, the best Neighbor Joining tree of NAM genes in Arabidopsis and other crops and the association analysis of HvNAM genes, indicated that HvNAM genes could drive the variation in barley GPC.
Moreover, the results also showed that an adjusted P value < 0.05 could be a reasonable threshold for finding molecular markers associated with traits that are greatly affected by environmental factors. In fact, the threshold of P < 0.05 used in our primary GWAS of GPC allowed identification of DArT markers that were not detected with the threshold of P < 0.01. One of the candidate genes, HvNAM1, detected in the association analysis with adjusted P values < 0.05, was found to be associated with GPC. The current results indicate the suitability of the adjusted P value < 0.05 for identifying the molecular markers associated with GPC. Similarly, an adjusted P value < 0.05 was used as the criterion for association analysis in other research.
Ultimately, the identification of SNPs and haplotypes of HvNAM genes could enable the development of useful molecular markers for GPC. Here, the association analysis may provide some molecular markers of HvNAM genes with potential importance for the early selection in malt barley breeding.
More importantly, it will shed some light on the molecular mechanisms responsible for the genotypic differences in GPC between cultivated and wild barley. Furthermore, the exact chromosome regions of these markers would be of interest to researchers seeking to understand the genetics of GPC, since most of these regions have not been annotated in terms of their function. However, association mapping provides only statistical and indirect evidence for the function of identified genes, so in future research we aim to obtain direct evidence on the molecular mechanisms underlying GPC and malting quality.
This study has demonstrated close correlation between protein content and malt quality parameters, indicating that it is imperative for us to develop barley varieties with a stable GPC. The identified markers for GPC in this study will be very useful in evaluation and screening of barley germplasm with reasonable GPC. Moreover, the haplotypes of HvNAM1 and HvNAM2, SNP and DArT markers, which were associated with GPC in barley, could provide key molecular markers for the selection of malt quality traits. In addition, GWAS is very useful for finding candidate genes and may provide a powerful tool for identifying the different loci influencing GPC in barley.
This research was supported by National Natural Science Foundation of China (No.30800681, 31129005), Zhejiang Provincial Natural Science Foundation of China (Y3100044), the Fundamental Research Funds for the Central Universities (2011FZA6005 and 2012FZA6011) and Qianjiang Talents Project of Zhejiang Province (No. 2011R10079). We also thank Dr. Zhonghua Chen (University of Western Sydney) for his comments and revision on the manuscript.
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.