DNA barcoding of the genus Nepenthes (Pitcher plant): a preliminary assessment towards its identification
BMC Plant Biology volume 18, Article number: 153 (2018)
DNA barcoding is impending towards the generation of universal standards for species discrimination with a standard gene region that can be sequenced accurately and within short span of time. In this study, we were successful in developing efficient barcode locus in the Nepenthes genus. A total of 317 accessions were retrieved from GenBank of NCBI which represent 140 different species Nepenthes and evaluated the efficacy of ITS, rbcl and matK barcode candidates using barcode gap, applied distance similarity, and tree-based methods.
Our result indicates that single-locus ITS or combined with plastid regions (matK) showed the best species discrimination with distinctive barcoding gaps. Therefore, we tentatively proposed the combination of ITS+matK as a core barcode for Nepenthes genus.
This study provides a report on DNA barcoding for unique insectivores’ Nepenthes genus. As the different species of Nepenthes are higly endemic and endangered, it would be a useful study to understand the evolutionary relationship, sketched in emigration, mislabeling and can be a probable assessment for its biodiversity.
Taxonomy is the fundamental base for exact nomenclature of a species in an ecosystem. The knowledge gap in taxonomy is increasing due to inadequate taxonomic experts and till today millions of species are still unidentified without proper genetic and biological distribution. Therefore, it is an urgent requirement for definite classification and taxonomy of various delineated species for many theoretical studies and realistic applications . Traditionally morphology-based taxonomy provides ambiguous phylogenetic evidence of large diversified plants genera . To overcome this problem in taxonomy, sequencing of genomic DNA can serve as a standardized method for species identification since, more closely related species hold more homologous DNA sequences in contrast to the distantly associated species . DNA barcoding is regarded as a promising method for proper identification of species using short region of specific DNA sequence efficiently [2, 4]. In animal genomes, mitochondrial cytochrome oxidase I (COI) gene is universally accepted DNA barcode while this region in plants shows insufficient variability caused by its low mutation rate and hence requiring alternative barcoding regions [5,6,7]. As a result, several chloroplast loci and combinations of these loci have been proposed as a promising DNA barcode in plants . In addition to plastid DNA sequence, nuclear ribosomal internal transcribed spacer (ITS) region is also being used in plants [9, 10]. However, it endures complications in amplification that render it feasibility as a universal barcode for land plants. Despite these complications, many researchers proved that ITS can perform better amplification when compared to other coding or non-coding plastids markers [11,12,13]. As limited research is carried out in different genera of angiosperm and Nepenthes being one of the highly endangered genus, so it is imperative to study about its taxonomic classification and diversity.
Nepenthes (Caryophyllales: Nepenthaceae), which includes 170 species around the world, ranging from northern Australia throughout South-east Asia to southern China  and New Caledonia and extending westwards to Seychelles and Malagasy. They exhibit a vast diversity in its growth forms, habitats, prey spectra and pitcher form. Nepenthes sp. protected under Law no. 5 (1990) on Conservation of Biological Resources and Ecosystem and lined with the regulations of the Convention on International Trade in Endangered Species (CITES) where N. rajah and N. khasiana are listed on Appendix-I and the rest in APPENDIX-II [15, 16]. This makes the trading activity restricted for this genus. Human interest in Nepenthes ranges from the utilization to its therapeutic efficacy. Its unique features of habitat and varied pitcher forms made the genus as an object of fascination and fashionable towards the mankind. Moreover, the highly slippery wax surfaces of the pitcher interior also encouraged engineers to develop many unique products based on this feature. The population of this genus is declining rapidly due to overexploitation and if such declination continues then it will lead to decrease in diversity and result into its extinction. The taxonomy of Nepenthes is primarily based on morphology such as shape, color, size and ornamentation [17, 18]. The record on the botanical history of Nepenthes showed that there were various cases of taxonomic confusion such as N. pilosa with N. chaniana until 2006, similarly N. talangensis with N. bongso and N. lamii with N. vieillardii [18, 19]. In addition to this, the evolution of genus is challenging as they have no close relatives/ancestral types or transitional species. But Nepenthes have distant relatives which can provide a clue about the origin of the genus. Previously, molecular phylogenetic studies in Nepenthes were based on chloroplast (trnK and matK gene) and nuclear (PRT1) sequences [20, 21]; however recent studies are based on molecular markers like RAPD, ISSR, etc. [22, 23]. The applicability and effectiveness of DNA barcoding in discriminating the species of Nepenthes were conducted for the first time in this study. On the other hand, it is difficult to collect all the species of this genus throughout the large geographical regions. So, this study focuses on the sequences of Nepenthes species which are reported in the National centre for Biotechnology Information (NCBI) database. Here, we assessed three potential barcodes by sampling 140 species of Nepenthes with the aims of proposing a practical and universal standard barcode region that must be conserved and distinguish the species from the other genera.
The loci of ITS, rbcl and matK were selected as barcode candidates in this study. All the available sequences of Nepenthes were downloaded from GenBank of NCBI. The sequences were chosen based on two criteria: i. appropriate voucher specimens, and ii. more than 300 bp in length. The taxa, authors and GenBank accession numbers used in this study are shown in Additional file 1: Table S1.
The downloaded sequences for each region were aligned using Clustal Xv1.8.7  and synchronized manually in BioEdit v22.214.171.124 . For ITS, we adjusted the regions (ITS1 and ITS2) in two ends of 5.8S rDNA based on parsimony principle . Parsimony principle states that in a given set of possible explanation, the simplest explanations are expected to be accurate. On the basis of phylogeny, parsimony means hypothesis of relationships in which least number of character changes is considered most likely to be correct. Hence, all the ITS sequences were aligned and arranged based on parsimony principle in order to avoid erroneous results.
The genetic pair wise distance was computed with Kimura-2-parameter (K2P) distance in MEGA 7. K2P is one of the optimal models for very small distances . The differences between intra- and inter-specific distances for each pair of three single barcodes were compared using pair wise distance in MEGA 7 software. Barcoding gap is the measure of effective barcode locus that exists when the minimum K2P interspecific distance is larger than the maximum intraspecific distance . Taxon DNA with ‘pairwise summary function’ was used to estimate the barcoding gap comparing the distributions of the pairwise intra- and inter-specific distance for each barcode candidate with an interval distance of 0.05.
In order to analyze the species accurately, each barcode candidate was measured for correct identification proportion using Taxon DNA with Best match, ‘Best close match’ and ‘all species barcodes functions. The ‘Best match’ analyses determine the closest match for a given sequence. If the compared sequences were from the same species then the identification is considered as correct whereas incorrect if the sequences did not belong to the same species .
To access the effectiveness of marker discriminatory performance, we evaluated the origin of monophyletic by conducting tree-based analysis [26, 28, 29]. The phylogenetic trees were estimated using Neighbor-joining (NJ) in MEGA 7, and node support was assessed by a bootstrap test  with 1000 pseudo-replicates of run with the K2P distance as a model of substitution. Triphyophyllum peltatum was used as an outgroup.
Based on the two criteria of screening sequences, we obtained 317 sequences from NCBI, which include 183, 33 and 101 sequences of ITS, rbcl and matK, respectively (Additional file 1: Table S1).
Genetic divergence analysis
The aligned sequence lengths ranged from 1251 bp for rbcl to 951 bp for ITS (Table 1).ITS had the maximum variable sites and parsimony-informative characters followed by matK. The intra-specific distance in the six barcodes ranged from 0.0 to 0.9% and the mean intra-specific distances were least for rbcl+matK (0.02%) and highest for ITS (1.31%). Subsequently, the pairwise inter-specific distances were ranged from 0.0 to 1.18% and the mean inter-specific distance was minimum for ITS+rbcl (0.16%) and maximum for ITS (0.84%). In summary, ITS reveal the highest mean intra- and inter-specific distances (Table 2).
Barcoding gap analysis
The relative distribution of barcoding gap between intra- and inter-specific genetic distances were calculated using K2P distances in Taxon DNA software for three barcode candidates. The inter-specific distances were higher in all subgenera and did not fully overlap with intra-specific distance. Therefore, we analyzed barcoding gap for all datasets and subgenera. Three barcodes i.e. ITS (Fig. 1a), matK (Fig. 1c) and ITS+matK (Fig. 1e) showed relatively clear barcoding gaps. All other barcodes had overlapped between their intra- and inter- specific distances without clear barcoding gaps (Fig. 1).
Discrimination of species
Analysis of discriminating species was performed using Taxon DNA, ITS had the highest success rate for correct identification of species (Best match: 78.12%; Best close match: 77.67%; All species barcodes: 80.76%) followed by ITS+matK and least discrimination success rate was observed in ITS+rbcl (Table 3).
Discriminating sequences of six barcode candidates based on phylogenetic trees were estimated by evaluating the percentage of each species or variety as well as determined to be monophyletic using NJ tree based analysis (Fig. 2). We observed that all single-locus barcodes had low levels of species discrimination varying from 11.76 to 30.68% (Table 1). Among the multilocus barcodes, ITS+matK showed the maximum success rate (83.33%) followed by ITS+rbcl (50.00%). Thus, it can be concluded that species discrimination was higher when ITS was included among three combinations. We accomplished that our result suggests that ITS+matk is preeminent among all the core barcodes.
Several studies were carried out to discover suitable barcodes for different plants but the desired consensus was achieved so far [31, 32]. In the present study, we included Nepenthes sp. sequences obtained from different studies through their GenBank records. Thus, we strongly assumed that all reported sequences of Nepenthes sp. were based on correctly identified plant species. Plastids region were initially proposed as core barcode in plants, but they are not successful in all genus of plants. Moreover, many researchers found ITS as a challenging barcode in plants and thus rejected for incorporation in the core barcode region of plants [9, 33,34,35]. With advanced researches, we observed that the region of ITS was widely used for recovering high rates of correctly assigned species as it posses less intra-specific variation but higher inter-specific divergence . Moreover, the combinations of ITS and plastids loci were found to be the best option in some plant genus. According to our results, ITS and matK had better parsimony informative sites and discriminating power among the proposed barcode loci i.e. ITS, rbcl and matK which relate similarly to the results of previous studies [14, 37, 38]. Discriminating species on the basis of pairwise distances are subjected to be prolific if the inter-specific distances are greater than intra-specific distances  and finally we observed that ITS had the highest intra- and inter-specific sequence divergence based on distance analysis methods. The statistics of “best match”, “best close match” and “all species barcodes” options were used in this study and ITS was again observed with high species discrimination rate followed by ITS+MatK. Based on NJ tree, ITS+matK barcode posse’s maximum and rbcl contain minimum species resolution rate for the genus. On the other hand, several combinations of two or three barcodes are being proposed as core barcodes in plants, including ITS+trnH-psbA , ITS+rbcl , matK+rbcl  and ITS+matK+rbcl  but a consensus regarding its utility has not been achieved yet. matK+rbcl was considered as an universal barcode for all land plants but in Nepenthes sp. matK+rbcl posses low species resolution among the three barcode combinations because of low substitution rates in coding genes where ITS+matK posses the highest percent of species identification as compared to the other single or combinations of barcode candidates which posses well-defined barcode gaps. However, all species of Nepenthes are specific or restricted to different geographical regions. Therefore, a potential solution of identifying the species from illegal transfer and geographical information could be achieved with the application of DNA barcoding. In future, these findings will potentially be helpful in delineating the species of Nepenthes and hence, they could most likely be successful as barcodes for this genus.
The present study evaluates DNA barcoding technique for the taxonomic origin/identification of endangered and endemic plants which are illegally traded. From this study, we can conclude that DNA barcode identification can be made more authentic by relying on integrated approach including prior and a posteriori date. In this study, it depicts that among the six barcode candidates single locus ITS and multiple locus ITS+matK posses high rate of discriminating power which can be further accessed as core barcode for Nepenthes genus. As this genus is unique in different parts of the world, an irrefragable system like DNA barcoding is required for conservation in biodiversity and control in the illegal trade of the species.
Dayrat B. Towards integrative taxonomy. Biol J Linn Soc. 2005;85:407–15.
Hebert PDN, Cywinska A, Ball SL, deWaard JR. Biological identifications through DNA barcodes. Proc R Soc Lond B. 2003;270:313–21.
Laprise S, Rodgers V. Analysis of putative DNA barcodes for identification and distinction of native and invasive plant species Babson Faculty Research Fund Working Papers; 2010. p. Paper 75.
Meyer CP, Paulay G. DNA barcoding: error rates based on comprehensive sampling. PLoS Biol. 2005;3(12):422.
Kress WJ, Wurdack KJ, Zimmer EA, Weigt LA, Janzen DH. Use of DNA barcodes to identify flowering plants. Proc Natl Acad Sci U S A. 2005;102:8369–74.
Chase MW, Salamin N, Wilkinson M, Dunwell JM, Kesanakurthi RP, Haider N, Savolainen V. Land plants and DNA barcodes: short-term and long-term goals. Philos Trans R Soc Lond B. 2005;360:1889–95.
Fazekas AJ, Kesanakurti PR, Burgess KS, Percy DM, Graham SW, Barrett SC, Newmaster SG, Hajibabaei M, Husband BC. Are plant inherently harder to discriminate than animal species using DNA barcoding markers? Mol Ecol Resour. 2009;9:130–9.
CBOL Plant Working Group. A DNA barcode for land plants. Proc Natl Acad Sci U S A. 2009;106:12794–7.
Chase MW, Cowan RS, Hollingsworth PM, Berg CVD, Madriñán S, Petersen G, Seberg O, Jørgsensen T, Cameron KM, Carine M, Pedersen N, Hedderson TAJ, Conrad F, Gerardo GA, Richardson JE, Hollingsworth ML, Barraclough TG, Kelly L, Wilkinson M. A proposal for a standardized protocol to barcode all land plants. Taxon. 2007;56:295–9.
Giudicelli GC, Mader G, Freitas LBD. Efficiency of ITS sequences for DNA barcoding in Passiflora (Passifloraceae). Int J Mol Sci. 2015;16:7289–303.
Muellner AN, Schaefer H, Lahaye R. Evaluation of candidate DNA barcoding loci for economically important timber species of the Mahogany family (Meliaceae). Mol Ecol Resour. 2011;11:450–60.
Yang JB, Wang YP, Möller M, Gao LM, Wu D. Applying plant DNA barcodes to identify species of Parnassia (Parnacciaceae). Mol Ecol Resour. 2012;12:267–75.
Zhang D, Duan L, Zhou N. Application of DNA barcoding in Roscoea (Zingiberaceae) and a primary discussion on taxonomic status of Roscoea cautleoides var. Pubescens. Biochem Syst Ecol. 2014;52:14–9.
Xu S, Li D, Li J, Xiang X, Jin W, Huang W, Jin X, Huang L. Evaluation of the DNA barcodes in Dendrobium (Orchidaceae) from Mainland Asia. PLoS One. 2015;10(1):e0115168.
Clarke CM, Bauer U, Lee CC, Tuen AA, Rembold K, Moran JA. Tree shrew lavatories: a novel nitrogen sequestration strategy in a tropical pitcher plant. Biol Lett. 2009;5(5):632–5.
Lestariningsih N, Setyaningsih D. Explorative study of tropical pitcher plants (Nepenthes sp.) types and insects that trapped inside in Sebangau National Park Palangka Raya Central Kalimantan. J Phys Conf Ser. 2017;795:012062.
Phillipps A, Lamb A. Pitcher-plants of borneo. 1st ed. Kota Kinabalu: Natural History Publications; 1996. p. 12–31.
McPherson SR. Pitcher Plants of the Old World 2 volumes. 1st ed. Poole: Redfern Natural History Productions; 2009;p. 768. ISBN 9780955891830.
Clarke CM. Nepenthes of Sumatra and Peninsular Malaysia. Kota Kinabalu: Natural History Publications; 2002. p. 336.
Soltis PS, Soltis DE. Multiple origins of the allotetraploid Tragopogon mirus (Compositae): rDNA evidence. Syst Bot. 1991;16:407–13.
Meimberg H, Thalhammer S, Brachmann A, Heubl G. Comparative analysis of a translocated copy of the trnK intron in the carnivorous genus Nepenthes (Nepenthaceae). Mol Phylogenet Evol. 2006;39:478–90.
Bhau BS, Medhi K, Sarkar T, Saikia SP. PCR based molecular characterization of Nepenthes khasiana Hook.f. pitcher plant. Genet Resour Crop Evol. 2009;56:1183–93.
Bunawan H, Yen CC, Yaakop S, Noor NM. Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences. BMC Res Notes. 2017;10:67.
Thompson JD, Gibson TJ, Plewniak F, Jeanmougin F, Higgins DG. The CLUSTAL_X windows interface: flexible strategies for multiple sequence alignment aided by quality analysis tools. Nucl Acids Res. 1997;25:4876–82.
Hall TA. BioEdit: a user-friendly biological sequence alignment editor and analysis program for windows 95/98/NT. Nucleic Acids Symp Ser. 1999;41:95–8.
Krawczyk K, Szczecińska M, Sawicki J. Evaluation of 11 single-locus and seven multilocus DNA barcodes in Lamium. L. (Lamiaceae). Mol Ecol Resour. 2014;14:272–85.
Meier R, Shiyang K, Vaidya G, Ng PKL. DNA barcoding and taxonomy in Diptera: a tale of high intraspecific variability and low identification success. Syst Biol. 2006;55:715–28.
Zhang CY, Wang FY, Yan HF, Hao G, Hu CM, Ge XJ. Testing DNA barcoding in closely related groups of Lysimachia. L. (Myrsinaceae). Mol Ecol Resour. 2012;12:98–108.
Alves TLS, Chauveau O, Eggers L, de Souza-Chies TT. Species discriminatory in Sisyrinchium (Iridaceae): assessment of DNA barcodes in a taxonomically challenging genus. Mol Ecol Resour. 2013;14:324–35.
Felsenstein J. Phylogenies from molecular sequences: inference and reliability. Annu Rev Genet. 1988;22:521–65.
Lahaye R, Van der Bank M, Bogarin D, Warner J, Pupulin F, Gigot G, Maurin O, Duthoit S, Barraclough TG, Savolainen V. DNA barcoding the floras of biodiversity hotspots. Proc Natl Acad Sci U S A. 2008;105:2923–8.
Farrington L, MacGillivray P, Faast R, Austin A. Investigating DNA barcoding options for the identification of Caladenia (Orchidaceae) species. Aust J Bot. 2009;57:276–86.
Alvarez I, Wendel JF. Ribosomal ITS sequences and plant phylogenetic inference. Mol Phylogenet Evol. 2003;29:417–34.
Starr JR, Naczi RFC, Chouinard BN. Plant DNA barcodes and species resolution in sedges (Carex, Cyperaceae). Mol Ecol Resour. 2009;9(1):151–63.
Hollingsworth ML, Clark AA, Forrest LL, Richardson J, Pennington RT, Long DG, Cowan R, Chase MW, Gaudeul M, Hollingsworth PM. Selecting barcoding loci for plants: evaluation of seven candidate loci with species-level sampling in three divergent groups of land plants. Mol Ecol Resour. 2009;9:439–57.
Zhu RW, Li YC, Zhong DL, Zhang JQ. Establishment of the most comprehensive ITS2 barcode database to date of the traditional medicinal plant Rhodiola (Crassulacaee). Sci Rep. 2017;7:10051.
Li DZ, Gao LM, Li HT, Wang H, Ge XJ, Liu JQ, Chen ZD, Zhou SL, Chen SL, Yang JB, Fu CX, Zeng CX, Yan CF, Zhu YJ, Sun YS, Chen SY, Zhao L, Wang K, Yang T, Duan GW. Comparative analysis of a large dataset indicates that internal transcribed spacer (ITS) should be incorporated into the core barcode for seed plants. Proc Natl Acad Sci U S A. 2011;108:19641–6.
Cabelin VL, Alejandro GJ. Efficiency of matK, rbcL, trnH-psbA, and trnL-F (cpDNA) to molecularly authenticate Philippine ethnomedicinal Apocynaceae through DNA barcoding. Pharmacogn Mag. 2016;12(3):S384–8.
Yu WB, Huang PH, Ree RH, Liu ML, Ll DZ, Wang H. DNA barcoding of Pedicularis L.(Orobanchaceae): evaluating four universal barcode loci in a large and Hemiparasitic genus. J Syst Evol. 2011;49:425–37.
The authors are thankful to Director, CSIR-North-East Institute of Science and Technology, Jorhat, Assam, India.
This work was supported by CSIR-North-East Institute of Science and Technology, Jorhat, Assam, India under In-house project MLP1000.
Availability of data and materials
Plant material was collected from the CSIR-NEIST germplasm collection.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Gogoi, B., Bhau, B.S. DNA barcoding of the genus Nepenthes (Pitcher plant): a preliminary assessment towards its identification. BMC Plant Biol 18, 153 (2018). https://doi.org/10.1186/s12870-018-1375-5