Skip to main content

Identification and validation of mutation points associated with waxy phenotype in cassava



The granule-bound starch synthase I (GBSSI) enzyme is responsible for the synthesis of amylose, and therefore, its absence results in individuals with a waxy starch phenotype in various amylaceous crops. The validation of mutation points previously associated with the waxy starch phenotype in cassava, as well as the identification of alternative mutant alleles in the GBSSI gene, can allow the development of molecular-assisted selection to introgress the waxy starch mutation into cassava breeding populations.


A waxy cassava allele has been identified previously, associated with several SNPs. A particular SNP (intron 11) was used to develop SNAP markers for screening heterozygote types in cassava germplasm. Although the molecular segregation corresponds to the expected segregation at 3:1 ratio (dominant gene for the presence of amylose), the homozygotes containing the SNP associated with the waxy mutation did not show waxy phenotypes. To identify more markers, we sequenced the GBSS gene from 89 genotypes, including some that were segregated from a cross with a line carrying the known waxy allele. As a result, 17 mutations in the GBSSI gene were identified, in which only the deletion in exon 6 (MeWxEx6-del-C) was correlated with the waxy phenotype. The evaluation of mutation points by discriminant analysis of principal component analysis (DAPC) also did not completely discriminate the waxy individuals. Therefore, we developed Kompetitive Allele Specific PCR (KASP) markers that allowed discrimination between WX and wx alleles. The results demonstrated the non-existence of heterozygous individuals of the MeWxEx6-del-C deletion in the analyzed germplasm. Therefore, the deletion MeWxEx6-del-C should not be used for assisted selection in genetic backgrounds different from the original source of waxy starch. Also, the alternative SNPs identified in this study were not associated with the waxy phenotype when compared to a panel of accessions with high genetic diversity.


Although the GBSSI gene can exhibit several mutations in cassava, only the deletion in exon 6 (MeWxEx6-del-C) was correlated with the waxy phenotype in the original AM206–5 source.


Starch is widely used for multiple commercial purposes. Not only does it provide more than 80% of the calories in the human diet [1], it also has many industrial applications, such as for glues and adhesives. It basically consists of two types of polymers: amylose (essentially α1,4-polyglucans, linear) and amylopectin (α1,4-polyglucans and α-1,6-polyglucans, branched) [2], whose proportions and arrangements confer characteristics that define the starch’s commercial advantages and are the focus of breeding programs. The main commercial sources of starch are maize, cassava, rice, wheat and potato. Cassava is the second most important source of starch worldwide after maize. The functional and specific properties of each of these species define their industrial applications [3,4,5,6].

Amylopectin, the basic unit of the starch granule, is composed of many short chains of glucose molecules. In contrast, amylose is a smaller molecule with longer chains [7]. Starch from different crops typically consists of 20–30% amylose and 70–80% amylopectin [1]. Starches with low or no amylose, known as waxy types, are important because they present lower syneresis and retrogradation, are generally clear and have a more viscoelastic gel form [8]. Retrogradation is a process in which the disrupted amylose and amylopectin chains (usually heated in the presence of water) can gradually re-associate into a different ordered structure. Over time, this reorganization can release the water retained within the structure (syneresis), with detrimental effects on the sensory qualities and storage time of foods derived from this type of starch, primarily because it alters the texture and nutritional properties of the food [9]. Not only does waxy cassava starch present lower syneresis in comparison with waxy starch from other crops [10], the structure of amylopectin is not modified by the mutation [11]. Cassava starch has a neutral taste due to its low contents of lipids and proteins, which gives it competitive advantages over starches from cereals for use in the food industry [12].

Though it is possible to perform chemical and physical modifications of starch to change its functional properties to meet specific market demands, there are certain limitations on these modifications. In addition, regardless of the type, making such changes to starch is always associated with a higher industrial cost. Another aspect to be considered is that consumers are increasingly demanding products that are more natural, with a minimum of industrial processing. Hence, there is a growing market for naturally differentiated starches. Breeding programs can contribute to the discovery of genetic variants of interest and the further incorporation of these characteristics in commercial varieties, and waxy starch is one of these special traits.

From the genetic point of view, a clear understanding of the genes involved in the starch biosynthesis allows the identification of accessions from germplasm carrying the mutations without the need for a laborious phenotyping approach. Among the enzymes related to starch synthesis, GBSSI (granule-bound starch synthase I) is a synthase related to amylose elongation. Waxy mutants of many species exhibit deficient activity of this enzyme [7, 13, 14]. Several allelic forms derived from different molecular mechanisms have been identified as responsible for the waxy phenotype. In maize, the coding region of the waxy gene comprises 3718 bp, which is composed of 14 exons, ranging from 64 to 392 base pairs (bp), and 13 introns of 81 to 139 bp [15]. According to Fan et al. [16] the wx-D7 allele involves a 30-bp deletion at the junction of the 7th exon–intron and generates an abnormal transcript retaining the 7th intron, which introduces an immature stop codon and inactivates GBSSI. In rice, two SNPs were identified in exons 6 and 10 of the GBSSI gene, with associations to the amylose content and paste properties [17]. A strong correlation was also observed between an SNP located at exon1/intron1 and paste properties in rice starch [18]. In common hexaploid wheat (Triticum aestivum L.), each of the genomes A, B and D has a waxy protein, whose respective locus coder (Wx-A1, Wx-B1, and Wx-D1) has several mutations reported in the literature. In another work, the insertion of an SNP and deletion of a single nucleotide in the null allele Wx-A1 induced premature termination codons approximately 55 nucleotides forward of the mutation point in Triticum dicoccoides A. and T. dicoccum S [19]. More recently, a new null allele was characterized by the insertion of a transposon and consequent loss of function in this species [20]. Two new mutations have also recently been discovered in maize, associated with transposable elements, with a 466-bp retrotransposon inserted in exon 6 and a transposable repeating element of 116 bp inserted in exon 7 [21].

In cassava, studies of the genetic control of the waxy phenotype and even its exploitation for the development of varieties with this characteristic are relatively recent. The first waxy clones in cassava were induced by modifications in GBSSI gene expression via transgenic techniques [22, 23] and later by Zhao et al. [24]. Then, a natural mutation was reported by [25] in a series of self-pollinations carried out at the International Center for Tropical Agriculture (CIAT), which evidenced the recessive nature of the waxy gene from genotype AM206–5. Recently, the CRISPR-Cas9 technology was used in cassava to mediate targeted mutagenesis of two genes involved in amylose biosynthesis (GBSSI and Protein Targeting to Starch - PTST1) resulting in waxy clones or clones with reduced amylose content [26].

Aiemnaka et al. [27] performed molecular characterization of the GBSSI gene in segregating progenies from the AM206–5 source and identified potential functional mutations. The authors found an indel in exon 6 with a single base exclusion (cytosine) that creates a premature stop codon; a two-base variant in exon 11 (GC and AT); and a substitution in intron 11 (C to G). Based on this last mutation, the authors developed a SNAP (single-nucleotide-amplified polymorphism) marker whose information would allow breeders to direct self-pollination and backcrossing assisted by molecular markers, with the aim of reducing the costs and time required for breeding programs. In this context, the objectives of the present work were as follows: i) to validate the mutations (SNPs and indels) previously associated with the original source of the waxy starch from AM206–5 in Latin American cassava germplasm; and ii) to identify alternative alleles in the GBSSI gene in a panel of waxy and non-waxy accessions that may be useful in molecular-assisted selection (MAS) for this trait.


Screening of cassava germplasm with SNAP markers

SNAP primers developed previously for intron 11 [27] were used to screen 1529 cassava accessions from four countries. The profile analysis of the electrophoresis gels was based on the presence or absence of the fragment amplified by the primers MeWxI11-G and MeWxI11-C, which indicated the presence of heterozygous individuals, whereas the amplification of only one of the two alleles indicated the presence of homozygous individuals for the waxy (MeWxI11-G - GG genotype) or non-waxy (MeWxI11-C, CC genotype) phenotype. According to the molecular analyses, none of the cassava accessions evaluated were identified as having the recessive allelic condition (GG). This result was expected, since none of the 1529 genotyped accessions had the waxy starch phenotype. However, 206 of the evaluated individuals were identified as heterozygotes (CG), because they had alleles amplified by the primers MeWxI11-C and MeWxI11-G.

Genotypic and phenotypic evaluation of S1 populations with SNAP markers

Twenty-eight heterozygous accessions from the total (206) were self-pollinated in the field with the aim of generating S1 segregation populations for the waxy gene. From this total (28), three heterozygous accessions (BGM0061, BGM0935 and BGM0438) produced 187 plants (61, 101 and 25, respectively) that were used to evaluate the molecular segregation of the alleles and the waxy phenotype. The results of the allelic segregation of the S1 progenies, analyzed with the primers MeWxI11-C and MeWxI11-G, showed no significant deviations from the expected proportion of individuals (1:2:1) in progenies S1-BGM0061, S1-BGM0935 and S1-BGM0438, according to the χ2 test (Table 1). Therefore, even with a small number of individuals in these three progenies, it was possible to observe the Mendelian segregation of the C and G alleles located at intron 11, position 3197 bp of the GBSSI gene, as expected for a recessive trait (Supplement Fig. S1). However, although molecular segregation occurred as expected in progenies S1 - BGM0061, S1 - BGM0935 and S1 - BGM0438, the evaluation of the waxy phenotype in all 28 of the S1 progenies in the field, based on the 2% iodine test, did not identify any individual with the waxy phenotype. Therefore, this result showed that the mutation of the C by G allele at the 3197 bp position of the GBSSI gene is not a suitable molecular marker for use in molecular-assisted selection of waxy individuals when the genetic background is completely different from the original source of the mutation (AM206–5).

Table 1 Molecular segregation of the single nucleotide polymorphism (SNP) at position 3197 bp (Intron 11) of the granule-bound starch synthase I (GBSSI-I) gene in three cassava S1 progenies

Identification of new genetic variation in the GBSSI by gene sequencing

To identify new allelic variants associated with waxy starch, complete GBSSI gene sequencing was performed on 89 cassava genotypes. After the alignment trimming of the sequences, 16 SNPs were identified in the GBSSI gene as well as one deletion, indicating genotypic differences among the 89 cassava accessions evaluated, which included waxy (AM206–5 derived population) and non-waxy genotypes (germplasm bank) (Fig. 1). Six SNPs were found in untranslated regions (UTRs), one in a non-coding region and nine in coding regions. However, the allelic variation identified in the GBSSI gene was not able to identify any SNP that could precisely distinguish the waxy and non-waxy individuals (Fig. 2). Only the deletion of the nucleotide cytosine (MeWxEx6-del-C) previously identified by Aiemnaka et al. [27] was found exclusively in waxy accessions. Therefore, the SNPs identified in the GBSSI gene were not able to individually determine stop codons or errors in the genetic code reading that led to the development of a waxy phenotype in cassava.

Fig. 1
figure 1

Schematic representation of the GBSSI (granule-bound starch synthase I) gene indicating the presence of single nucleotide polymorphisms (SNPs) identified in 90 cassava accessions. The red lines represent the SNPs, and the arrow indicates the deletion

Fig. 2
figure 2

Allelic variation of single nucleotide polymorphisms (SNPs) and indels identified in the sequencing of the GBSSI gene in waxy and non-waxy cassava genotypes. Numbers forward of the SNP coding represent the position in the GBSSI gene (in base pairs). Code: Me - species (M. esculenta); Wx - gene (GBSSI); U’5 - UTR’5 region, E - coding regions (exon); I - non-coding regions (introns); U’3 - UTR’3 region

Discriminant analysis of principal components (DAPC) of the waxy and non-waxy genotypes based on the allelic variation of the GBSSI gene

Additionally, DAPC was performed to determine a discriminant function for clustering the different individuals based on the set of SNPs found in the GBSSI gene and the starch types. Only the 12 SNPs that showed minimum variation between the waxy and non-waxy individuals were used. The SNPs of exon 10 (position 2734 bp) and exon 11 (positions 3025, 3026 and 3046 bp) were the most divergent among individuals with the waxy phenotype (Fig. 2) and were withdrawn from the joint analysis. The deletion MeWxEx6-del-C (position 1701 bp) was also removed from the analysis, since the main objective was to determine how the other mutations could jointly discriminate the waxy phenotype, in order to increase the possibilities of identifying individuals with waxy alleles in the cassava germplasm from Brazil.

The overlap of the discriminant function density and the membership assignment indicated that the combination of SNPs was not able to cluster the waxy and non-waxy individuals with 100% accuracy (Fig. 3). Although the classification model demonstrated good accuracy, there was no complete discrimination (100%) of non-waxy individuals (BGM1444, BGM1288, BGM0399, BGM1140, and BGM0263), according to its membership assignment (Fig. 3).

Fig. 3
figure 3

Discriminant analysis of principal components (DAPC) based on the analysis of 12 single nucleotide polymorphisms (SNPs) identified in the GBSSI gene in waxy and non-waxy starch individuals. a Density graph of the first discriminant functions; b Membership assignment of the individuals to the a priori clusters defined with DAPC

The SNPs that contributed most to the discriminant function are located in the 3’UTR at 4041 and 4195 bp, and exon 13 at 3459 bp. Since the non-waxy individuals BGM1444, BGM1288 and BGM0399 shared these same SNPs with the waxy individuals, the waxy phenotype discrimination could not be performed accurately. The non-waxy individuals BGM1140 and BGM0263 shared the SNPs with small contributions (≤ 0.01) to discriminate the phenotype, located in the 5’UTR at 86 and 93 pb, as well as exon 1 at 381 and 534 bp and exon 3 at 1010 bp. The clone BGM1140 also shared the SNP of 3’UTR region (4195 bp) with waxy individuals.

Genotypic evaluation of the germplasm bank via KASP

After the complete sequencing of the GBSSI gene, we also identified a deletion in exon 6 at position 1701 bp (henceforth referred to as MeWxEx6-del-C), previously described by Aiemnaka et al. [27]. Considering that only the deletion of cytosine at the 1701 bp position of the GBSSI gene was able to distinguish the waxy from non-waxy phenotypes (Fig. 2), the next step was to implement a genotyping system for this deletion on a large scale. Thus, an initial analysis with the KASP (Kompetitive Allele Specific PCR) technique was performed to evaluate the amplification of the target fragments. In the previous genotyping, the SNAP primers MeWxI11-G and MeWxI11-C were enriched with three nucleotides at the 3′ end to allow differential amplification of an SNP at intron 11. The same strategy was used in the other mutations reported by Aiemnaka et al. [27], without success [data not shown]. In contrast, KASP technology could detect only one base difference by the use of primers differentially marked by fluorescence [28].

HEX fluorescence was used to detect waxy homozygous (wxwx) accessions, and FAM fluorescence to detect non-waxy homozygous (WxWx) accessions, while the heterozygous accessions were identified by intermediate fluorescence (Wxwx). In the first analysis of the MeWxEx6-del-C deletion via KASP, we used a set of 94 accessions, and only the 2017wx-02-17 and 7934–1 genotypes did not generate consistent signals for fluorescence detection or did not amplify (Fig. 4), while the genotypes 2017wx-01-02 and 2017wx-02-19 did not present the expected alleles. Therefore, this initial analysis yielded a 96% success rate in identifying the waxy and non-waxy phenotypes. Then the 1529 germplasm accessions were genotyped for validation of the method, and at this stage there were only 18 genotypes (S1–1662-65, BGM-0052, BGM-0056, BGM-0314, BGM-0336, BGM-1037, BGM-1415, BGM-1598, BGM-1698, BGM-1716, BGM-1756, BGM-2245, BGM-2252, BGM-2257, BGM-2278, BGM-0150, BGM-0856, Brasileira) for which we could not identify the alleles associated with the waxy phenotype (Fig. 4). Thus, KASP allowed the precise identification of 98% of the individuals.

Fig. 4
figure 4

Genotyping via Kompetitive Allele Specific PCR (KASP) for the deletion MeWxE6-del-C. a Training population composed of 94 cassava accessions including 31 waxy starch homozygotes (wxwx); 31 non-waxy starch heterozygotes (Wxwx); and 32 non-waxy starch homozygotes (WxWx); and b Validation population composed of 1529 cassava accessions from Latin America

The main results of these analyses were as follows: i) absence of heterozygous cassava accessions for deletion of the C allele at position 1701 bp of the GBSSI gene in the evaluated germplasm collection; and ii) high specificity in the identification of the allelic condition of the cassava genotypes. Therefore, this point mutation can allow the accurate identification of the allelic condition of the waxy gene for any cassava germplasm (Fig. 4).


The main objective of the initial evaluation of cassava germplasm with SNAP markers was to identify the presence of alleles associated with the waxy phenotype and then guide the self-pollination of the accessions once the waxy phenotype is expressed under the recessive condition. So, we used primers related to the waxy mutations previously described by Aiemnaka et al. [27] in the original source AM205–6. However, the inefficiency of the MeWxI11-G/C primers for the identification of individuals with the waxy phenotype in the germplasm from Latin America was demonstrated when evaluating the 28 S1 families derived from self-pollination identified as having the G allele associated with the waxy trait. The reported primers did not identify the waxy phenotype, since the S1 individuals with the G/G genotype had high amylose content according to the iodine test in the field. Therefore, it is possible to speculate that the SNP previously identified by Aiemnaka et al. [27] (C-G at position 3197 bp) can be used for molecular marker-assisted selection only in segregating populations derived from the AM205–6 source, and it is not possible to validate its use for different genetic backgrounds.

Based on the complete GBSSI gene sequencing performed in waxy and non-waxy individuals, all mutations described by Aiemnaka et al. [27], comprising one indel in exon 6 (MeWxE6-del-C), two SNPs in exon 11 (MeWxE11–3025 and 3026), and one substitution in intron 11 (MeWxI11–3046), were identified. In addition, 13 other SNPs in the GBSSI gene were also found (MeWxU’5–86 and 93; MeWxE1–381,435, 543; MeWxE10–2734; MeWxE13–3424, 3459; MeWxU’3–3790, 4041, 4113, 4195). However, out of the 16 SNPs identified in the GBSSI gene, none of them were individually able to distinguish waxy and non-waxy phenotypes.

For the GBSSI gene sequencing, a panel of 90 cassava accessions from different origins in Latin America representing the great genetic diversity of this crop was used. This high diversity was crucial to identify SNPs that were common to waxy and non-waxy genotypes. The data are therefore consistent and can serve as a basis for further investigations of cassava in relation to waxy starch. Similar results have also been reported for Indian rice varieties in Northeast India, where again mutations in the GBSSI gene previously associated with amylose content did not distinguish the waxy phenotype [29]. Of the five varieties of waxy rice evaluated by the authors, two did not have the mutation, and among the non-waxy, one mutation occurred in one variety. Choudhury et al. [29] also investigated the OsC1 locus responsible for the apiculus color in rice, and they found that a 10 bp deletion in the OsC1 gene, attributed to a colorless apiculus, was not detected in five of the 21 varieties with this phenotype. Moreover, one of the nine rice varieties with a colored apiculus presented the deletion. Thus, the mutations considered to be associated with waxy starch and apiculus color in rice did not necessarily correspond to the expected phenotype in different genetic backgrounds.

Only the deletion MeWxE6-delC (position 1701 bp) was exclusive of waxy individuals from the AM206–05 original source and thus able to characterize the waxy phenotype in these populations. This deletion creates a premature TGA stop codon at Exon 8 (position 2069 bp) that prevents complete synthesis of the GBSSI enzyme [27]. GBSS is bound to the granule, most likely by synthesizing the amylose within the granular matrix formed by amylopectin. Therefore, mutants produce less or no amylose, suggesting that no other synthase can replace it in this function [30].

The KASP genotyping for screening of the MeWxE6-del-C deletion showed high accuracy of waxy and non-waxy classification [98%]. Indeed, other authors have reported the great flexibility and high accuracy of the KASP technique for high-performance genotyping [28]. The KASP genotyping allowed a clear distinction between the homozygous and heterozygous accessions in the training population, evidencing its great potential in molecular-assisted selection. In this sense, our work is an important advance for cassava breeding, as it validates KASP genotyping for routine detection of the waxy phenotype by the marker MeWxEx6-del-C, when using alleles from the AM206–05 source. In the future, high-throughput screening (HTS) with several important functional markers for cassava will be possible, as has been the case in other crops such as wheat, for which more than 38 polymorphisms from different agronomic traits have been analyzed with the KASP technique [31]. The speed of the KASP genotyping assays was 45 times higher than that of the gel-based PCR markers.

Analyses in different species support the perception that the waxy phenotype is related to the absence/deficiency of the GBSSI enzyme, associated with partial and complete deletions of the gene, the presence of SNPs, and transposable elements, sometimes leading to stop codons [17,18,19,20,21, 32, 33]. In cassava, a first natural recessive mutation was identified in the GBSSI gene characterizing the inheritance of this phenotype [27]. However, this causal mutation determined by the deletion of a cytokine in exon 6 at position 1701 bp was not identified in the cassava germplasm analyzed here, so it will not be possible to explore the use of this mutation to generate segregating populations for the waxy gene in backgrounds other than the original source AM206–05 [34]. However, it is important to note that throughout the evolutionary process, different mutation points have already been described as responsible for the waxy phenotype in other amylaceous species. Therefore, other points of causal mutations can be discovered in cassava based on gene sequencing GBSSI in the whole cassava germplasm of Latin America. Indeed, unpublished results suggest there are at least two different mutations resulting in these waxy phenotypes in cassava (Ceballos, personal communication). In maize, several mutations related to waxy starch have been reported, including insertions and deletions of varying sizes as well as transposable elements and SNPs [15]. The most recent mutations include transposition of the rf2 gene [35], deletions in exons 7 and 10 [16] and transposable elements in exons 6 and 7 [21] identified as functional mutations in the GBSSI gene and related to the waxy phenotype in maize.

Another important aspect is the phenotyping of individuals for quantitative determination of the amylose content, which could allow the identification of alternative allelic patterns for different amylose contents. According to Dobo et al. [36], three polymorphism in rice: exon 1 (G/T polymorphism), exon 6 (A/C polymorphism), and exon 10 (C/T polymorphism), exhibited five different allelic patterns: TCC, TAC, GCC, GAC and GAT. The allelic forms TAC and TCC were found in low-amylose varieties, GCC in varieties with intermediate amylose content, and GAT and GAC in varieties with high levels of amylose.

In general, starch synthesis is complex and derived from the coordinated action of several enzymes and their isoforms [13]. An example is the Protein Targeting to Starch (PTST) identified in Arabidopsis thaliana (L.), which is related to amylose synthesis. This protein acts on targeting and GBSSI transport [37, 38], so amylose synthesis is not yet completely elucidated. Mutants without the PTST factor in A. thaliana do not produce amylose because the GBSS protein, which normally binds to the starch, cannot bind in the absence of PTST. Recently, the application of CRISPR-Cas9 was demonstrated to generate cassava clones with modified starch by targeted mutagenesis of PTST1 genes. These cassava clones exhibited lower amylose content in comparison with the wild type, showing that PTST also participates in the synthesis of amylose in the cassava storage roots [26]. Therefore, it is possible that in addition to direct mutations at the gene level, other genomic regions may help to explain the waxy phenotype, and therefore be selectable.

The identification of different functional mutations in the PTST gene (Manes.02G075700) already described and mapped on cassava chromosome 2, as well as the other GBSSI isoform located on chromosome 1 (Manes.01G055700) [39], not evaluated in this work, may bring new contributions to a better understanding of the molecular bases of waxy gene expression in the germplasm of Latin America. Considering that the GBSSI gene is the only synthase known to be responsible for amylose synthesis and that the PTST protein is responsible for GBSSI targeting, a complete (paired-end) sequencing of such genes, including promoter regions, in the cassava germplasm would be one of the first steps in the search for alternative alleles for the waxy phenotype. All of the nucleotide diversity of the evaluated genes would allow an extensive in silico evaluation of points of potential mutations for inactivation of amylose production. This information, linked to the sequencing results, would direct the self-pollination of heterozygous accessions. With populations that have already been obtained, a molecular assessment of the segregation preceded by a field assessment using the iodine test would identify promising individuals. Finally, with validated markers in new sources of waxy genotypes, a KASP evaluation with different genetic backgrounds could be carried out, reaffirming the potential of these mutations to select potential cassava parents for waxy breeding.


Previously reported GBSSI-related SNPs and those found in this work are not useful for molecular marker-assisted selection in genetic backgrounds other than the AM206–5 source. Only the deletion in exon 6 (MeWxEx6-del-C) was able to completely discriminate the non-waxy and waxy genotypes. Although the evaluated germplasm did not have the allele associated with the original AM206–5 source, our results contribute to the establishment of a practical model for the use of molecular-assisted selection for allelic variants associated with the waxy phenotype in cassava, with the goal of optimizing the genotyping and early identification of specific genotypes with the desired alleles for generation of segregating populations.


Screening of cassava germplasm with single nucleotide amplified polymorphism [SNAP] markers associated with the waxy phenotype in the source AM206–5

A total of 1529 accessions belonging to the Brazilian Cassava Germplasm Bank of Embrapa Mandioca e Fruticultura (“Brazilian Agricultural Research Corporation – Embrapa Cassava and Fruit”), originating from different ecosystems of Brazil, Colombia, Venezuela and Uganda, were analyzed (Supplement - Table S1). DNA extraction was performed according to the CTAB (cetyltrimethylammonium bromide) protocol [40]. To verify the quality and quantity of extracted DNA, ethidium bromide (1.0 mg. L− 1) was used to stain the DNA in 1% (w/v) agarose gel (Invitrogen, USA) and was visually compared with various concentrations of Lambda DNA (Invitrogen, USA).

The mutations located in exon 6 (deletion of 1 bp at nucleotide 92) and the SNP at exon 11 (GC to AT) found by Aiemnaka et al. [27] were not optimized even after several PCR adjustments. Therefore, the genotyping of cassava germplasm (landraces and improved varieties) was performed using only the single nucleotide amplified polymorphism (SNAP) primers MeWxI11-G and MeWxI11-C developed by Aiemnaka et al. [27] and located at intron 11 of the waxy gene (GBSSI). The polymerase chain reaction (PCR) reactions were performed in a final volume of 15 μL containing 10 ng of DNA, PCR 1x buffer, 1.5 mM of MgCl2 (4G, Brazil), 0.2 mM of dNTP (Promega, USA), 0.2 uM of each primer (Integrated DNA Technologies, USA), and 1 U of Taq DNA Polymerase (Promega, USA). The amplification program consisted of a cycle at 94 °C for 2 min; 30 cycles at 94 °C for 30 s; annealing steps at 56 °C for 30 s and 72 °C for 1 min; and a final extension at 72 °C for 5 min, performed using a Veriti® 96-well model thermocycler (Applied Biosystems, USA). For the development of the amplification products, 2% (w/v) 1000 agarose gel (Invitrogen, USA) containing 1.0 mg. L− 1 of ethidium bromide was used and electrophoresed in TBE 0.5x buffer (45 mM Tris-borate, 1 mM EDTA). The products on the gel were visualized in UV light and recorded with the Gel Logic 212 Pro Photodocumentator (Carestream Molecular Imaging, USA).

Evaluation of S1 progenies using SNAP markers

Twenty-eight genotypes previously identified with the G allele at position 3197 bp of intron 11 (MeWxI11-G - associated with the waxy phenotype) in the heterozygous form [27] were self-fertilized for the generation of homozygous individuals with the waxy phenotype. The selected cassava accessions for self-pollination according to their flowering from August to November 2014 were BGM0061, BGM0131, BGM0132, BGM0222, BGM0463, BGM0505, BGM0614, BGM0650, BGM0726, BGM0729, BGM0741, BGM0872, BGM0935, BGM0941, BGM0962, BGM1023, BGM1041, BGM1120, BGM1143, BGM1148, BGM1253, BGM1284, BGM1288, BGM1335, BGM1378, BGM1383, BGM1413, and BGM1819.

To perform the self-pollination of the cassava accessions, the receptive female flowers were covered before opening in the morning with cloth bags (20 × 15 cm) to protect them from pollen contamination. Male flowers of the same genotype were collected in the morning and placed in pre-labeled, large-cap bottles. In the late morning and early afternoon, self-pollination was carried out through the contact of the anthers with the stigma of the female flower to ensure artificial pollination. Then, the pollinated flowers were covered with voile [light cotton cloth] until the collection of the seeds after natural dehiscence.

The seeds from the self-pollination of the 28 accessions were sown in a greenhouse, and after 30 days, the seedlings were transplanted to the field. The plants were harvested and evaluated 10 months after planting. The S1 plants of each family were evaluated for the presence of waxy starch using the 2% iodine test (2 g KI and 0.2 g I2 in distilled water), applied to a cross section of the roots of all genotypes [25]. The long amylose chains have a high ability to bind to the iodine in the solution, which gives a blue color to the starches containing amylose when stained with iodine. In contrast, amylopectin has a low iodine-binding capacity, and therefore, red-brown stains are characteristic of starches essentially containing only this polymer [7].

After the evaluations of S1 families in the field, 185 genotypes belonging to families S1-BGM0061 (60 individuals), S1-BGM0463 (24 samples), and S1-BGM0935 (101 samples) were genotyped with SNAP markers, as previously described. The molecular segregation for the GBSSI gene in the S1 individuals based on the primers MeWxI11-G and MeWxI11-C was evaluated by the χ2 test \( {\sum}_{i=1}^n\frac{{\left[{f}_o-{f}_e\right]}^2}{f_e} \), where fo and fe are the observed and expected frequencies of the phenotypes.

GBSSI gene sequencing

Complete GBSSI gene sequencing was performed on 89 cassava genotypes. Of this total, 54 accessions were considered homozygous (CC) or heterozygous (CG) using the primers MeWxI11-G and MeWxI11-C (non-waxy phenotype), and 35 genotypes were segregating from AM206–5 derived populations. Of the latter, 3 were homozygous (CC) or heterozygous (CG) (non-waxy phenotype), and 32 were homozygous (GG) genotypes (waxy phenotype) (Table 2). The extraction and quantification of the genomic DNA was performed as described previously.

Table 2 List of cassava accessions used in GBSSI (granule-bound starch synthase I) gene sequencing

For complete amplification of the gene, five primers were developed using the reference sequence of the GBSSI gene (Manes.02G001000) from cassava genome v6.1 [39] deposited in the Phytozome v12.1 database [41] (Table 3). The Primer3 program [42] was used to design primers with the following criteria: product size amplified between 800 and 1100 bp; annealing temperature above 60 °C; and G / C percentage above 40%. The primers were allocated to overlap each of the sequences to ensure complete gene coverage.

Table 3 Primers used for amplification and complete sequencing of the GBSSI (granule-bound starch synthase I) gene in waxy and non-waxy cassava genotypes

For primers optimization, different annealing temperatures (58 to 64 °C), magnesium chloride concentrations (1, 1.5 and 2 mM), and different numbers of PCR extension cycles were tested. PCR reactions were optimized in final volume of 50 μl containing 10 ng of DNA, 1X PCR buffer, 1.5 / 2.0 mM MgCl 2 (Invitrogen, USA), 0.2 mM dNTP (Promega, USA), 2 mM of each primer (Integrated DNA Technologies, USA), and 1 U of Taq DNA High Fidelity Polymerase (Invitrogen, USA). The amplification program was optimized using an initial denaturation of one cycle at 95 °C for 1 min; followed by 30/35 cycles at 95 °C for 15 s, annealing at 62 °C for 15 s and 72 °C for 30 s; and final extension at 72 °C for 7 min, performed in a Veriti® 96-well thermocycler (Applied Biosystems, USA).

The electrophoresis product was purified with ExoSap-IT (Affymetrix, USA), sequenced in both directions using BigDye® Terminator v3.1 Cycle Sequencing (Applied Biosystems, USA) and analyzed with an ABI Prism 3730 XL analyzer (Applied Biosystems, USA). The chromatograms were assigned and trimmed with Phred [43, 44], aligned with the reference sequence Manes.02G001000, and the SNPs were identified with the aid of the newSNP v3.0.1 program [45]. Identification of the coding and non-coding regions was performed by comparison with the GBSSI gene (Manes.02G001000) of the cassava genome v6.1 [39].

Discriminant analysis of principal components

Discriminant analysis of principal components (DAPC) was performed based on the allelic variants identified by the complete GBSSI gene sequencing. DAPC was performed, retaining more than 90% of the variation of the data with the first 10 principal components and one discriminant function. The analysis was performed using the adegenet package [46] of the software R v3.5 [47].

Genotypic evaluation of the cassava germplasm for deletion MeWxEx6-del-C

The Kompetitive Allele Specific PCR (KASP) technology [28] for screening the deletion MeWxEx6-del-C was performed by the company Intertek AgriTech. The KASP genotyping optimization was performed in two stages. In the first step, a set of individuals with known allelic conditions was used, i.e., 31 individuals homozygous for the waxy gene (wxwx); 31 heterozygous individuals (Wxwx) from segregating populations for the waxy genotype derived from the AM206–5 source, and 32 non-waxy individuals (WxWx) from the germplasm bank (Table 4). Allele-specific fluorescence (HEX and FAM) was detected using the SNPviewer2 v4.0.0 program. and clusters were visualized using R v.3.5 [47]. After optimization and validation of the MeWxEx6-del-C marker, genotyping of 1529 cassava accessions was performed.

Table 4 Cassava genotypes used to optimize the genotyping of the cytosine deletion at exon 6, position 1701 bp (MeWxEx6-del-C) by Kompetitive Allele Specific PCR (KASP)

Availability of data and materials

The datasets used and/or analysed during the current study are available from the corresponding author on reasonable request.



Granule-bound starch synthase I (GBSSI)


Discriminant principal component analysis


Kompetitive Allele Specific PCR


Single-nucleotide polymorphism


Single nucleotide polymorphisms in intron 11 indicated as G/C base substitution


1-bp deletion at nucleotide 92 of exon 6 that creates a premature stop codon on GBBSI gene


Molecular-assisted selection


Single nucleotide-amplified polymorphism markers


Untranslated region


  1. Keeling PL, Myers AM. Biochemistry and genetics of starch synthesis. Annu Rev Food Sci Technol. 2010;1:271–303.

    Article  CAS  PubMed  Google Scholar 

  2. Pérez S, Bertoft E. The molecular structures of starch components and their contribution to the architecture of starch granules: a comprehensive review. Starch/Stärke. 2010;62:389–420.

    Article  CAS  Google Scholar 

  3. Copeland L, Blazek J, Salman H, Tang MC. Form and functionality of starch. Food Hydrocoll. 2009;23:1527–34.

    Article  CAS  Google Scholar 

  4. Hoover R. Composition, molecular structure, and physicochemical properties of tuber and root starches: a review. Carbohydr Polym. 2001;45:253–67.

    Article  CAS  Google Scholar 

  5. Singh N, Singh J, Kaur L, Sodhi NS, Gill BS. Morphological, thermal and rheological properties of starches from different botanical sources. Food Chem. 2003;81:219–31.

    Article  CAS  Google Scholar 

  6. Wang K, Henry RJ, Gilbert RG. Causal relations among starch biosynthesis, structure, and properties. Springer Sci Rev. 2014;2:15–33.

    Article  Google Scholar 

  7. Denyer K, Johnson P, Zeeman S, Smith AM. The control of amylose synthesis. J Plant Physiol. 2001;158:479–87.

    Article  CAS  Google Scholar 

  8. Šárka E, Dvořáček V. New processing and applications of waxy starch (a review). J Food Eng. 2017;206:77–87.

    Article  CAS  Google Scholar 

  9. Wang S, Li C, Copeland L, Niu Q, Wang S. Starch retrogradation: a comprehensive review. Compr Rev Food Sci Food Saf. 2015;14:568–85.

    Article  CAS  Google Scholar 

  10. Sánchez T, Dufour D, Moreno IX, Ceballos H. Comparison of pasting and gel stabilities of waxy and normal starches from potato, maize, and rice with those of a novel waxy cassava starch under thermal, chemical, and mechanical stress. J Agric Food Chem. 2010;58:5093–9.

    Article  CAS  PubMed  Google Scholar 

  11. Rolland-Sabaté A, Sánchez T, Buléon A, Colonna P, Jaillais B, Ceballos H, Dufour D. Structural characterization of novel cassava starches with low and high-amylose contents in comparison with other commercial sources. Food Hydrocoll. 2012;27:161–74.

    Article  Google Scholar 

  12. Waterschoot J, Gomand SV, Fierens E, Delcour JA. Production, structure, physicochemical and functional properties of maize, cassava, wheat, potato and rice starches. Starch/Stärke. 2015;67:14–29.

    Article  CAS  Google Scholar 

  13. Tetlow IJ, Morell MK, Emes MJ. Recent developments in understanding the regulation of starch metabolism in higher plants. J Exp Bot. 2004;55:2131–45.

    Article  CAS  PubMed  Google Scholar 

  14. Zeeman SC, Kossmann J, Smith AM. 2010. Starch: its metabolism, evolution, and biotechnological modification in plants. Annu Rev Plant Biol. 2010;61:209–34.

    Article  CAS  PubMed  Google Scholar 

  15. Huang B-Q, Tian M-L, Zhang J-J, Huang Y-B. Waxy locus and its mutant types in maize Zea mays L. Agric Sci China. 2010;9:1–10.

    Article  CAS  Google Scholar 

  16. Fan L, Bao J, Wang Y, Yao J, Gui Y, Hu W, Zhu J, Zeng M, Li Y, Xu Y. Post-domestication selection in the maize starch pathway. PLoS One. 2009;4:e7612.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  17. Larkin PD, Park WD. Association of waxy gene single nucleotide polymorphisms with starch characteristics in rice (Oryza sativa L.). Mol Breed. 2003;12:335–9.

    Article  CAS  Google Scholar 

  18. Kharabian-Masouleh A, Waters DLE, Reinke RF, Ward R, Henry RJ. SNP in starch biosynthesis genes associated with nutritional and functional properties of rice. Sci Rep. 2012;2:557.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  19. Saito M, Nakamura T. Two-point mutations identified in emmer wheat generate null Wx-A1 alleles. Theor Appl Genet. 2005;110:276–82.

    Article  CAS  PubMed  Google Scholar 

  20. Zhang L, Chen H, Luo M, Zhang X, Deng M, Ma J, Qi P, Wang J, Chen G, Liu Y, Pu Z, Li W, Lan X, Wei Y, Zheng Y, Jiang Q. Transposon insertion resulted in the silencing of Wx-B1n in Chinese wheat landraces. Theor Appl Genet. 2015;130:1331.

    Article  Google Scholar 

  21. Xiaoyang W, Dan C, Yuqing L, Weihua L, Xinming Y, Xiuquan L, Juan D, Lihui L. Molecular characteristics of two new waxy mutations in China waxy maize. Mol Breed. 2017.

  22. Raemakers K, Schreuder M, Suurs L, Furrer-Verhorst H, Vincken J-P, de Vetten N, Jacobsen E, Visser RGF. Improved cassava starch by antisense inhibition of granule-bound starch synthase I. Mol Breed. 2005;16:163–72.

    Article  CAS  Google Scholar 

  23. Koehorst-van Putten HJJ, Wolters AM, Pereira-Bertram IM, van den Berg HH, van der Krol AR, Visser RG. Cloning and characterization of a tuberous root-specific promoter from cassava (Manihot esculenta Crantz). Planta. 2012;236:1955–65.

    Article  CAS  PubMed  Google Scholar 

  24. Zhao S-S, Dufour D, Sánchez T, Ceballos H, Zhang P. Development of waxy cassava with different biological and physico-chemical characteristics of starches for industrial applications. Biotechnol Bioeng. 2011;108:1925–35.

    Article  CAS  PubMed  Google Scholar 

  25. Ceballos H, Sánchez T, Morante N, Fregene M, Dufour D, Smith AM, Denyer K, Pérez JC, Calle F, Mestres C. Discovery of an amylose-free starch mutant in cassava (Manihot esculenta Crantz). J Agric Food Chem. 2007;55:7469–76.

    Article  CAS  PubMed  Google Scholar 

  26. Bull SE, Seung D, Chanez C, Mehta D, Kuon J, Truernit E, et al. Accelerated ex situ breeding of GBSS - and PTST1 -edited cassava for modified starch. Sci Adv. 2018;4:eaat6086.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  27. Aiemnaka P, Wongkaew A, Chanthaworn J, Nagashima SK, Boonma S, Authapun J, Jenweerawat S, Kongsila P, Kittipadakul P, Nakasathien S, Sreewongchai T, Wannarat W, Vichukit V, López-Lavalle LAB, Ceballos H, Rojanaridpiched C, Phumichai C. Molecular characterization of a spontaneous waxy starch mutation in cassava. Crop Sci. 2012;52:2121–30.

    Article  CAS  Google Scholar 

  28. Semagn K, Babu R, Hearne S, Olsen M. Single nucleotide polymorphism genotyping using Kompetitive allele specific PCR (KASP): overview of the technology and its application in crop improvement. Mol Breed. 2014;33:1–14.

    Article  CAS  Google Scholar 

  29. Choudhury BI, Khan ML, Dayanandan S. Patterns of nucleotide diversity and phenotypes of two domestication related genes (OsC1 and Wx) in indigenous rice varieties in Northeast India. BMC Genet. 2014;15:71.

    Article  PubMed  PubMed Central  Google Scholar 

  30. Pfister B, Zeeman SC. Formation of starch in plant cells. Cell Mol Life Sci. 2016;73:2781–807.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  31. Rasheed A, Wen W, Gao F, Zhai S, Jin H, Liu J, Guo Q, Zhang Y, Dreisigacker S, Xia X, He Z. Development and validation of KASP assays for genes underpinning key economic traits in bread wheat. Theor Appl Genet. 2016;129:1843–60.

    Article  CAS  PubMed  Google Scholar 

  32. Monari AM, Simeone MC, Urbano M, Margiotta B, Lafiandra D. Molecular characterization of new waxy mutants identified in bread and durum wheat. Theor Appl Genet. 2005;110:1481–9.

    Article  CAS  PubMed  Google Scholar 

  33. Yi X, Jiang Z, Hu W, Zhao Y, Bie T, Gao D, Liu D, Wu R, Cheng X, Cheng S, Zhang Y. Development of a kompetitive allele-specific PCR marker for selection of the mutated Wx-D1d allele in wheat breeding. Plant Breed. 2017;136:460–6.

    Article  CAS  Google Scholar 

  34. Morante N, Ceballos H, Sánchez T, Rolland-Sabaté A, Calle F, Hershey C, Gilbert O, Dufour D. Discovery of new spontaneous sources of amylose-free cassava starch and analysis of their structure and techno-functional properties. Food Hydrocoll. 2016;56:383–95.

    Article  CAS  Google Scholar 

  35. Liu J, Rong T, Li W. Mutation loci and intragenic selection marker of the granule-bound starch synthase gene in waxy maize. Mol Breed. 2007;20:93–102.

    Article  CAS  Google Scholar 

  36. Dobo M, Ayres N, Walker G, Park WD. Polymorphism in the GBSS gene affects amylose content in US and European rice germplasm. J Cereal Sci. 2010;52:450–6.

    Article  CAS  Google Scholar 

  37. Seung D, Boudet J, Monroe J, Schreier TB, David LC, Abt M, Lu K, Zanella M, Zeeman SC. Homologs of protein targeting to starch control starch granule initiation in Arabidopsis leaves. Plant Cell. 2017;29:1657 LP–1677.

    Article  CAS  Google Scholar 

  38. Seung D, Soyk S, Coiro M, Maier BA, Eicke S, Zeeman SC. Protein targeting to starch is required for localising granule-bound starch synthase to starch granules and for normal amylose synthesis in Arabidopsis. PLoS Biol. 2015;13:e1002080.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  39. Bredeson JV, Lyons JB, Prochnik SE, Wu GA, Ha CM, Edsinger-Gonzales E, Grimwood J, Schmutz J, Rabbi IY, Egesi C, Nauluvula P, Lebot V, Ndunguru J, Mkamilo G, Bart RS, Setter TL, Gleadow RM, Kulakow P, Ferguson ME, Rounsley S, Rokhsar DS. Sequencing wild and cultivated cassava and related species reveals extensive interspecific hybridization and genetic diversity. Nat Biotechnol. 2016;34:562–70.

    Article  CAS  PubMed  Google Scholar 

  40. Doyle J, Doyle J. Isolation of plant DNA from fresh tissue. Focus. 1990;12:13–5.

    Google Scholar 

  41. Goodstein DM, Shu S, Howson R, Neupane R, Hayes RD, Fazo J, Mitros T, Dirks W, Hellsten U, Putnam N, Rokhsar DS. Phytozome: a comparative platform for green plant genomics. Nucleic Acids Res. 2012;40:1178–86.

    Article  CAS  Google Scholar 

  42. Untergasser A, Cutcutache I, Koressaar T, Ye J, Faircloth BC, Remm M, Rozen SG. Primer3--new capabilities and interfaces. Nucleic Acids Res. 2012;40:e115.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  43. Ewing B, Green P. Base-calling of automated sequencer traces using Phred. II Error probabilities. Genome Res. 1998;8:186–94.

    Article  CAS  PubMed  Google Scholar 

  44. Ewing B, Hillier L, Wendl MC, Green P. Base-calling of automated sequencer traces using Phred. I Accuracy assessment. Genome Res. 1998;8:175–85.

    Article  CAS  PubMed  Google Scholar 

  45. Weckx S, Del-Favero J, Rademakers R, Claes L, Cruts M, De Jonghe P, Van Broeckhoven C, De Rijk P. novoSNP, a novel computational tool for sequence variation discovery. Genome Res. 2005;15:436–42.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  46. Jombart T, Ahmed I. Adegenet 1.3-1: new tools for the analysis of genome-wide SNP data. Bioinformatics. 2011;27:3070–1.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  47. R Core Team. R: A Language and Environment for Statistical Computing. Vienna: R Foundation for Statistical Computing; 2018. ISBN 3–900051–07-0, URL.

    Google Scholar 

Download references


I would like to thanks the technicians of the laboratory Andresa Priscila de Souza Ramos, Raimundo Pereira da Silva and Vandeson Rodrigues de Sousa, for their help in offering me the support need to develop the research.


We thank the CNPq (Conselho Nacional de Desenvolvimento Científico e Tecnológico) for the research productivity fellowship (E.J.O.) and CAPES (Coordenação de Aperfeiçoamento de Pessoal de Nível Superior) for the doctoral fellowship (C.D.C.). These funding bodies have had no role in the design of the study and collection, analysis, and interpretation of data and in writing the manuscript.

Author information

Authors and Affiliations



CDC, MBS, HC and EJO, conceived, designed and analyzed the data of the experiments. CDC, GAFO and PPSS are involved in GBBSI cloning, KASP analysis, and writing; EJO is involved in supervision of field and lab evaluations and Catia’s thesis. All authors have read and approved the manuscript.

Corresponding author

Correspondence to Eder Jorge de Oliveira.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no conflict of interest.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Additional file 1:

Figure S1. Example of amplification of the primers MeWxI11-G and MeWxI11-C in 2% agarose gel stained with ethidium bromide in the cassava populations S1-BGM0061. HT - Heterozygous (CG), HD - Dominant Homozygous (CC), and HR - Homozygous recessive (GG).

Additional file 2:

Table S1. List of cassava clones used in screening with single nucleotide amplified polymorphism [SNAP] markers.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

do Carmo, C.D., Sousa, M.B.e., dos Santos Silva, P.P. et al. Identification and validation of mutation points associated with waxy phenotype in cassava. BMC Plant Biol 20, 164 (2020).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: