Skip to main content

Interspecific chloroplast genome sequence diversity and genomic resources in Diospyros



Fruits of persimmon plants are traditional healthy food in China, Korea and Japan. However, due to the shortage of morphological and DNA markers, the development of persimmon industry has been heavily inhibited.


Chloroplast genomes of Diospyros cathayensis, D. virginiana, D. rhombifolia and D. deyangensis were newly sequenced. Comparative analyses of ten chloroplast genomes including six previously published chloroplast genomes of Diospyros provided new insights into the genome sequence diversity and genomic resources of the genus. Eight hyper-variable regions, trnH-psbA, rps16-trnQ, rpoB-trnC, rps4-trnT-trnL, ndhF, ndhF-rpl32-trnL, ycf1a, and ycf1b, were discovered and can be used as chloroplast DNA markers at/above species levels. The complete chloroplast genome sequences provided the best resolution at inter-specific level in comparison with different chloroplast DNA sequence datasets.


Diospyros oleifera, D. deyangensis, D. virginiana, D. glaucifolia, D. lotus and D. jinzaoshi are important wild species closely related to the cultivated persimmon D. kaki. The hyper-variable regions can be used as DNA markers for global genetic diversity detection of Diospyros. Deeper study on these taxa would be helpful for elucidating the origin of D. kaki.


Diospyros is the largest genus in the family Ebenaceae, including more than 400 species all over the world, with a wide distribution and cultivation from tropical to temperate zones. Their fruits have been traditional healthy food source in China, Korea and Japan. Ebony trees are valued for their hard, heavy and dark timber. In addition, the bark, leaves, wood, fruits and seeds of several species are the main sources of medicines. Diospyros kaki is an important economic tree crop and widely cultivated species of the genus [1, 2].

In China, cultivation of persimmon plants (Diospyros kaki) for edible fruits and Chinese medicine is commonly one of the best choices for regional development of characteristic fruit tree industry, with different regions having different cultivars. At present, there are about 1000 persimmon cultivars in China. Most of the extant persimmon cultivars are elite plants from natural seedlings or bud mutation. Among these cultivars, a majority is of the pollination- constant and astringent (PCA) type, pollination-constant and non-astringent (PCNA) type of premature persimmon and disease-resistant cultivars are rarer and are highly desirable. In general, closely related wild species possess elite genetic background for qualitative improvement and high-yield breeding of the persimmon crops [1, 2].

Breeding of woody Diospyros plants is time-consuming, commonly around ten years being necessary for evaluating the performance of a hybrid plant. Due to the difficulty of identifying the Diospyros germplasm resource diversities, directive breeding through artificial hybridization is severely inhibited. Therefore, accumulation of genetic information for revealing the genetic diversity of Diospyros plants is in urgent need.

The germplasm resources of Diospyros plants have been documented in aspects of the use and phenotypic characteristics, such as fruit morphology, fruit color, fruit quality and floral characteristics. Diospyros plants have four ploidy levels (2n = 2× = 30; 2n = 4× = 60; 2n = 6× = 90; 2n = 9× = 135) according to their chromosome number [1,2,3,4,5,6,7]. However, due to the shortage and limited sensitivity of phenotypic and DNA markers, study on genetic differentiation detection techniques for the huge amount of Diospyros plants worldwide is still a global challenge [8,9,10,11,12].

In Diospyros, molecular resources have been developed in recent years for species identification. Some chloroplast genome markers (such as rbcL, matK, trnH-psbA) and nuclear DNA markers (e.g., internal transcribed spacer of ribosomal DNA, ITS) were used to discriminate plants in Diospyros. However, these markers are low in variability, or have limited resolution and cannot reach the goals [3, 13,14,15,16,17,18]. Development of more effective DNA barcodes is also important for Diospyros plants.

The angiosperm chloroplast genomes are uniparentally inherited with relatively stable structure [16, 17]. Accordingly, it is considered to be an informative and valuable resource for plant phylogenetic analyses at family/genus/species levels [16, 19,20,21,22,23,24]. In the past decades, the chloroplast genomes have been proven to be more powerful in revealing phylogeny of plants and resolving previously ambiguous taxonomic and phylogenetic issues [16,17,18,19,20,21,22,23,24,25,26,27,28].

In this study, we report newly sequenced complete chloroplast genomes of D. cathayensis, D. deyangensis, D. rhombifolia and D. virginiana. The aims of our study are: (1) to evaluate the variation in Diospyros; and (2) to develop new and efficient cpDNA markers for species identification in Diospyros.


Sampling and DNA extraction

The fresh leaves of the Diospyros taxa were collected in spring 2016 from the Beijing Botanical Garden of Chinese Academy of Sciences and the National Field Genbank for Persimmons of College of Horticulture, Northwest A&F University, Yangling, Shaanxi, China and were dried immediately using silica gels for DNA extraction. Total genomic DNAs were extracted following the procedure of Plant Genomic DNA Kit (DP305) from Tiangen Biotech (Beijing) Co., Ltd., China.

Chloroplast genome sequencing, assembling and annotation

DNA was sheared to fragments of 400–600 bp with an ultrasonic disruptor. An Illumina paired-end library was constructed with the NEBNext® Ultra™ DNA Library Prep Kit according to the manufacturer’s protocol. Paired-end sequencing (2 × 150 bp) was conducted on an Illumina HiSeq 4000 platform. The high-throughput sequencing data were qualitatively assessed and assembled using SPAdes 3.10.1 [29]. Using the chloroplast genome sequence of Diospyros kaki (GenBank accession No. KT223565) as a reference sequence, we selected chloroplast genome contigs using the BLAST method. The contigs of the chloroplast genomes were assembled using Sequencher (v5.4) with default parameters. Ambiguous nucleotides or gaps and the four junction regions between the IRs and SSC/LSC in the chloroplast genome sequences were further confirmed by PCR amplification and Sanger sequencing with specific primers or the universal primers [20]. After that, all reads were mapped to the spliced chloroplast genome sequence using Geneious 8.1 [30] to avoid assembly errors. Chloroplast genome annotation was performed using the Dual Organellar Genome Annotator (DOGMA) [31]. The chloroplast genome map was drawn using Genome Vx software [32].

Analysis of tandem repeats and single sequence repeats in chloroplast genomes

MISA (MIcroSAtellite; was used to detect simple sequence repeats (SSR) within the chloroplast genomes. Thresholds for a minimum number of repeat units were established as follows: > 10 for mono-nucleotide, > 5 for di-nucleotide, > 4 for tri-nucleotide, and > 3 for tetra-nucleotide, penta-nucleotide, or hexa-nucleotide SSR. Repeating sequences were scanned over the complete chloroplast DNA sequences, taxon by taxon, using the REPuter program.

Sequence divergence analysis in chloroplast genomes

Four newly sequenced chloroplast genomes and six chloroplast genomes from GenBank were aligned using MAFFT [33] and manually adjusted using Se-Al 2.0 [34]. Variable and parsimony-informative base sites and genetic distance across the complete chloroplast genomes were calculated using MEGA 6.0 software [35]. Sliding window analysis was conducted to generate nucleotide diversity (Pi) of the chloroplast genomes using DnaSP (DNA Sequences Polymorphism version 5.10.01) software [36]. The step size was set to 200 bp, with a 600 bp window length.

Chloroplast DNA barcoding analysis

Distance and tree based methods were used to evaluate discriminatory power of detected hyper variable regions and core DNA barcodes rbcL and matK. The function nearNeighbour of SPIDER was used for Distance method barcoding analyses [37]. Neighbor-Joining (NJ) trees were constructed by each hyper-variable marker and the different marker combinations using MEGA 6.0 based on a K2P distance model [35]. Relative support for the clades of the NJ tree was assessed via 1000 bootstrap replicates.

Phylogenetic analysis based on chloroplast genome sequence data

Phylogenetic trees were constructed by Maximum Parsimony (MP), Maximum Likelihood (ML) and Bayesian Inference (BI) using the entire chloroplast genome.

MP analyses were performed using PAUP v4b10 [38] described in Dong et al. [24]. The ML analyses were conducted using RAxML 8.0 [39]. For ML analyses, the best-fit model, General Time Reversible, with gamma distribution (GTR + G) was used as suggested with 1000 bootstrap replicates. BI method was conducted with MrBayes v3.2 [40]. The Markov Chain Monte Carlo (MCMC) analysis was run for 2 × 5,000,000 generations with trees sampled every 1000 generations, with the first 25% discarded as burn-in. The remaining trees were used to build a 50% majority-rule consensus tree. The stationarity was regarded to be reached when the average standard deviation of split frequencies remained below 0.01.


Chloroplast genome characterization

Previous experiments indicated that chloroplast genome sequences are identical among individual plants of the same species. Therefore, we sequenced four taxa of Diospyros (D. cathayensis, D. virginiana, D. rhombifolia and D. deyangensis) using an Illumina Hiseq 4000 system, 20,675,288 to 33,584,779 paired-end raw reads were obtained (average read length was 150 bp). After mapping the paired-end reads of each Diospyros taxon, 357,721 to 1,338,624 reads were extracted, yielding 340 × to 1271 × coverage (Table 1). The inverted repeat (IR) junction regions in assembled sequences were confirmed by using PCR-based sequencing. High quality chloroplast genome sequences were thus obtained and were used for downstream analyses. The four Diospyros chloroplast genome sequences were deposited in GenBank (accession numbers, MF288575- MF288578).

Table 1 Summary of the sequencing data for four Diospyros taxa

The ten complete chloroplast genomes of Diospyros investigated in this study ranged from 157,300 base pairs (D. jinzaoshi) to 157,934 base pairs (D. deyangensis) in length. All the chloroplast genomes possessed the typical quadripartite structure of angiosperms, consisting of a pair of the inverted repeat region (IRs: 25,910–26,119 bp) separated by a large single-copy region (LSC: 86,948–87,246 bp) and a small single-copy region (SSC: 18,076–18,485 bp) (Fig. 1; Table 2). Each of the complete chloroplast genomes possesses 113 unique genes arranged in the same order, including 79 protein-coding, 30 tRNA, and 4 rRNA genes. The GC content of each chloroplast genome is identically 37.4% (Table 2). The genomic structure including gene number and gene order was highly conserved within Diospyros (Fig. 1; Table 2).

Fig. 1

Gene map of Diospyros chloroplast genomes. The genes inside and outside of the circle are transcribed in clockwise and counterclockwise directions, respectively. Genes belonging to different functional groups are shown in different colors. The thick lines indicate the extent of the inverted repeats (IRa and IRb) that separate the genomes into small single-copy (SSC) and large single-copy (LSC) regions

Table 2 Complete chloroplast genome features of ten Diospyros taxa

Comparative analysis of chloroplast genomes

The number of simple sequence repeats (SSRs) ranged from 48 (D. cathayensis or D. rhombifolia) to 82 (D. jinzaoshi) among the ten Diospyros taxa. The homopolymer repeat number with the highest variability, ranged from 29 (D. cathayensis) to 70 (D. jinzaoshi), while the number of di-nucleotide, tri-nucleotide or penta-nucleotide repeats had no significant difference among the ten Diospyros taxa (Fig. 2). The homopolymer repeats represented the major source of genetic diversity in Diospyros. In total, 505 SSRs were detected in LSC region, 141 in SSC region and 26 in IR region (Fig. 2).

Fig. 2

Comparison of simple sequence repeats among ten chloroplast genomes. a. Numbers of SSRs detected in ten Diospyros chloroplast genomes; b. Frequencies of identified SSRs in LSC, IR and SSC regions; c. Numbers of SSR types detected in ten Diospyros chloroplast genomes

Four repeat types were detected in the chloroplast genomes using REPuter software. Forward repeat number ranged from 17 (D. kaki) to 22 (D. deyangensis), which were found to have no significant difference among the ten chloroplast genomes. Tandem repeat number ranged from 20 (D. jinzaoshi) to 29 (D. virginiana). Palindromic repeat number ranged from 18 (D. discolor or D. virginiana) to 33 (D. kaki). The highest value of palindromic repeat number which occurred in D. kaki is significantly different from that of other taxa. The highest tandem repeat number was observed in D. virginiana (Fig. 3).

Fig. 3

Numbers of repeats in comparison among ten chloroplast genomes

The lowest value (31) of nucleotide substitution number was observed between D. glaucifolia and D. lotus, while the highest value (1493) of nucleotide substitution number occurred between D. cathayensis and D. virginiana, showing a wider range of variability according to the sequence alignment of the ten chloroplast genomes (Table 3).

Table 3 Numbers of nucleotide substitutions and sequence distance among the complete chloroplast genomes of ten Diospyros taxa

The sequence distance between Diospyros taxa ranged from 0.0002 to 0.0092. The smallest sequence distance occurred between D. lotus and D. glaucifolia, and the largest sequence distance occurred between D. virginiana and D. cathayensis (Table 3). LSC region was the most rapidly evolving region in the chloroplast genomes, while the IR region was the most slowly evolving region. The evolutionary rate of SSC region is moderate.

Phylogenetic analysis

Phylogenetic analyses indicated that all the ten taxa were clearly discriminated and seven clades were recognized among them (Fig. 4). The cultivars of Diospyros belonged to one clade of D. kaki. The DNA sequence data supported the isolated positions of D. deyangensis and D. jinzaoshi which were regarded previously as cultivars and have been suggested to rank to species level according to morphological and DNA characteristics by other recent researches [2,3,4,5,6]. The clade of D. oleifera and D. deyangensis, and the clade of D. jinzaoshi included the closest wild relatives of the cultivated persimmon plants of D. kaki (Fig. 4). Relatively speaking, the clade of D. lotus and D. glaucifolia, and the clade of D. virginiana have close genetic relationships with D. kaki. D. lotus and D. virginiana produce edible fruits, while D. glaucifolia is mainly used for timber production. D. cathayensis and D. rhombifolia that are genetically close with each other formed one clade. This is identical with their classification based on morphological characteristics. D. discolor is genetically the most distant taxon from the cultivated plants of D. kaki among the taxa investigated in this study.

Fig. 4

Phylogenetic relationships of ten Diospyros taxa inferred from Maximum Parsimony (MP), Bayesian Inference (BI), and Maximum Likelihood (ML) analyses of the complete chloroplast genomes. ML topology shown with ML bootstrap support value/Bayesian posterior probability/MP bootstrap support value given at each node. Nodes with 100 ML bootstrap support value/1.0 Bayesian posterior probability/100 MP bootstrap support value are not marked

Chloroplast DNA marker development

According to the chloroplast genome sequence alignment of the ten Diospyros taxa, eight hyper-variable regions, trnH-psbA, rps16-trnQ, rpoB-trnC, rps4-trnT-trnL, ndhF, ndhF-rpl32-trnL, ycf1a, and ycf1b were discovered (Fig. 5). These eight sequences could be used as DNA markers for classification and revealing the genetic divergence of the Diospyros taxa, with a high discrimination success ranging from 60 to 100% (Table 4), of which, the sequences of the three most rapidly evolving regions (i.e., rps4-trnT-trnL, ndhF-rpl32-trnL, and ycf1a) were able to discriminate all the taxa investigated in this study.

Fig. 5

Sliding window analysis of the whole chloroplast genomes of nine Diospyros taxa (window length: 600 bp, step size: 200 bp). X-axis: position of the midpoint of a window; Y-axis: nucleotide diversity of each window

Table 4 Variability of nine hyper-variable markers and universal chloroplast DNA barcodes (rbcL and matK) in Diospyros

In those most rapidly evolving regions, 72, 122 and 123 variable base sites were detected, respectively, of which, 39, 54 and 60 informative base sites, made up 2.52–2.80% in each of the sequences. Comparatively, the commonly recommended DNA fragments (rbcL and matK) achieved only 40% and 80% of discrimination success respectively (Table 4, Additional file 1: Figure S1).

Similar results were obtained when different methods were used for phylogenetic tree reconstruction (Additional file 1: Figure S1). The combined sequence data from the eight hyper-variable cpDNA regions provided a well-resolved phylogenetic topology of the ten Diospyros taxa, similar to the topology obtained using complete chloroplast genome sequences, but with much lower experimental cost (Fig. 6). All the nodes in the phylogenetic tree received high bootstrap value (100%).

Fig. 6

Phylogenetic relationships of ten Diospyros taxa inferred from Maximum Parsimony (MP), Bayesian Inference (BI), and Maximum Likelihood (ML) analyses using the chloroplast DNA sequences from eight hyper-variable regions


Chloroplast genome variation and evolution

In this study, we sequenced four chloroplast genomes of Diospyros using Illumina Hiseq-4000 platforms and compared these genomes with the other six published chloroplast genomes downloaded from GenBank. The chloroplast genomes of Diospyros displayed the typical quadripartite structure of flowering plants, were conservative in gene order and gene content, in comparison with the most lineages of angiosperms. The chloroplast genomes ranged from 157,300 to 157,934 bp in length. IR expansion and contraction and the occurrence of larger indels (insertion/deletion) are considered to be the primary mechanisms affecting length variation of angiosperm chloroplast genomes. Only minor variations were detected at the SC/IR boundaries of the ten Diospyros chloroplast genomes. Occurrence of indels was the main factor effecting the variation of the length in Diospyros chloroplast genomes. Similar to previously published chloroplast genomes of the angiosperms, the Diospyros chloroplast genomes contained more AT content than GC content.

Simple sequence repeats (SSRs) are genetically variable molecular markers and are used widely in population genetics [41, 42], polymorphism investigations, and phylogenetic analyses [43]. Using MISA, the SSRs in the ten Diospyros chloroplast genomes were identified. The number of SSRs ranged from 48 to 82 in Diospyros, similar to those in Lagerstroemia [28]. SSRs with AT richness have been reported in other plants [44, 45]. Homopolymers are the most common SSRs in chloroplast genomes. Since the structure and organization of chloroplast genomes are conservative, SSR primers are transferable across species, genera, and even families. Information involving SSRs in this study will provide useful sources for estimating the genetic diversity and studying phylogenetic relationships among species and genera.

Potential cpDNA barcodes

Diospyros is the largest genus in its family, including more than 400 species all over the world. For effective exploration, conservation, and domestication, accurately identified wild species would provide a clear genetic background of this genus. However, the taxonomic inventory of Diospyros still has a long way to go, because of the vast amount of species with extensive global distribution and the plasticity of the morphological characteristics. DNA barcoding has been largely used as a new biological tool to facilitate accurate species identification [46]. The two chloroplast DNA regions, rbcL and matK, are recommended as core universal DNA barcodes in plants. Therefore, genomic comparative researches of more complete chloroplast genome sequences have become necessary for developing variable DNA barcodes. These mutation “hotspot” regions could provide adequate genetic information for species identification, and can be used to develop novel DNA barcodes [19]. The eight potential mutational hotspots (trnH-psbA, rps16-trnQ, rpoB-trnC, rps4-trnT-trnL, ndhF, ndhF-rpl32-trnL, ycf1a, and ycf1b) identified in this study could be suitable barcodes for plant classification in Diospyros. In previous study, ycf1 gene showed high divergence in chloroplast genome and was recommended as core DNA barcode for plants [22]. Ycf1 gene has been more and more widely applied in plant phylogeny and DNA barcoding studies [47,48,49,50,51]. TrnH-psbA, ndhF and rps16-trnQ were popular candidates for phylogenetic studies [52, 53]. Rps4-trnT-trnL and ndhF-rpl32-trnL were newly identified in this study.

Recently, using the chloroplast genome as a super-barcode for plant species identification was discussed [49]. The analyses on chloroplast genome sequence divergence and phylogeny showed that the chloroplast genome may indeed be useful as a super-barcode for species identification of Diospyros (Table 3 and Fig. 4). Further research is necessary to investigate whether these hyper-variable regions or complete chloroplast genome sequences could be used as reliable and effective DNA barcodes for species of Diospyros. The results obtained in this study have significant value for future studies on global genetic diversity assessment, phylogeny, and population genetics of Diospyros.

Perspectives of persimmon research in future

It is important and fundamental to develop efficient identification methods and elucidate the genetic relationship of Diospyros taxa for planning breeding strategies, intensive management and conservation of Diospyros germplasm resources. The origin of D. kaki and its relationship to other Diospyros species have been also hot issues attracting scientists. Ng [54] suggested that D. oleifera was a parent of D. kaki based on morphological, geographical and cytological evidences. Yonemori et al. [55] suggested that D. virginiana and D. kaki had close relationship, as revealed by restriction fragment length polymorphism of chloroplast DNA. However, Choi et al. [9] considered the relationship between D. virginiana and D. kaki to be possibly phylogentically more remote than others.

The DNA evidences obtained in this study clearly support the previous taxonomic proposals that D. deyangensis and D. jinzaoshi should be raised to species rank based on morphological, molecular characteristics and chromosome numbers [4, 9,10,11,12]. In the cpDNA–based tree, the D. kaki lineage and the lineage including the dioecious D. deyangensis and the monoecious D. oleifera shared a common furcation. Interestingly, the chromosome number of D. kaki (2n = 6× = 90) is equal to the sum of the chromosome numbers of D. deyangensis (2n = 4× = 60) and D. oleifera (2n = 2× = 30). This strongly implied that cultivated D. kaki might be resulted from a superposition event of genetic materials between D. deyangensis and D. oleifera through certain genetic mechanism during the evolutionary history of Diospyros plants. D. oleifera is naturally distributed south of the Changjiang River in China, has strong adaptability and resistance to wet condition, and has been used as rootstocks of D. kaki in southern China. Its fruits can also be used for extraction of persimmon paint. D. deyangensis, naturally distributed in Sichuan Province, China, produces red flowers and has fine hairs on the surface of the leaves, shoots, fruits, and some parts of the flowers [7].

Diospyros rhombifolia and D. cathayensis are genetically remote from D. kaki, being identical with the classification based on phenotypic characteristics. D. rhombifolia could not be used as rootstocks or inter-stocks because of its dwarfing effect [2]. D. glaucifolia and D. lotus were clustered together based on cpDNA sequences. They are very similar in phenotypic characteristics [2, 5, 6, 55]. D. lotus has made great contributions as rootstocks in persimmon production areas of northern China because of its highest cold resistance in the genus. The small fruits of D. lotus have been used as food and Chinese medicine with a long history. In southern China, the dioecious species D. glaucifolia is used to produce wood and could also be used as rootstocks. The species boundary between D. lotus and D. glaucifolia should be further studied by sampling more individual plants in future [2, 5, 6, 56].

Elucidation of the questions mentioned above would definitely improve our understanding on phylogeny, relationship, and the origin of the cultivated persimmons in Diospyros, and further accelerating directive breeding of Diospyros plants.


Chloroplast DNA sequences can be used for classification of Diospyros plants at inter-specific level. The results obtained in this study implied that D. oleifera, D. deyangensis, D. virginiana, D. glaucifolia, D. lotus and D. jinzaoshi are important wild species closely related to the cultivated persimmon D. kaki and deeper studies on these taxa would be helpful for understanding the origin of D. kaki. Our results have significant value for global genetic diversity assessment, phylogeny, and population genetics in Diospyros in future.



Bayesian Inference


Chloroplast DNA


DNA Sequences Polymorphism


Dual Organellar Genome Annotator


General time reversible




Inverted repeat


Internal transcribed spacer of ribosomal DNA


Large single copy


Markov Chain Monte Carlo


Maximum Likelihood


Maximum Parsimony



PCA type:

Pollination-constant and astringent type

PCNA type:

Pollination-constant and non-astringent type


Nucleotide diversity


Ribosomal RNA


The small single copy


Simple sequence repeats


Transfer RNA


  1. 1.

    Lee S, Michael GG, Frank W. Ebenaceae. In: Wu ZY, Raven PH, Hong DY, editors. . Beijing/Missouri/St. Louis: Flora of China. Science Press/Botanical Garden Press; 1996. p. 215–34.

    Google Scholar 

  2. 2.

    Wang RZ, Yang Y, Li GC. Chinese persimmon germplasm resources. Acta Hort. 1997;(436):43–50.

  3. 3.

    Tang DL, Hu Y, Zhang QL, Yang Y, Luo ZR. Discriminant analysis of ‘Jinzaoshi’ from persimmon (Diospyros kaki Thunb.; Ebenaceae): a comparative study conducted based on morphological as well as ITS and matK sequence analyses. Sci Hortic. 2014;168:168–74.

    CAS  Article  Google Scholar 

  4. 4.

    Yang Y, Jing ZB, Ruan XF, Cheng JM. Development of simple sequence repeat markers in persimmon (Diospyros L.) and their potential use in related species. Genet Mol Res. 2015;14:609–18.

    CAS  Article  Google Scholar 

  5. 5.

    Yang Y, Yang TT, Jing ZB. Genetic diversity and taxonomic studies of date plum (Diospyros lotus L.) using morphological traits and SCoT markers. Biochemical Systematics Ecol. 2015;61:253–9.

    CAS  Article  Google Scholar 

  6. 6.

    Zhang YF, Hu CQ, Yang Y, Zhu RS, Guo J. Wang RZ. Pollen morphology observation of eight resources in Diospyros. Acta Horticulturae Sinica. 2016; 43 (6): 1167–1174, doi:

  7. 7.

    Zhang YF, Yang Y, Guo J, Hu CQ, Zhu RH. Taxonomic status of ‘Deyangshi’ based on chromosome number and SRAP markers. Sci Hortic. 2016;207:57–64.

    Article  Google Scholar 

  8. 8.

    Mallavadhani UV, Panda AK, Rao YR. Pharmacology and chemotaxonomy of Diospyros. Phytochemistry. 1998;49:901–51.

    CAS  Article  Google Scholar 

  9. 9.

    Choi YA, Tao R, Yonemori K, Sugiura A. Genomic distribution of three repetitive DNAs in cultivated hexaploid Diospyros spp. (D. kaki and D. virginiana) and their wild relatives. Genes Genet Syst. 2003;78:301–8.

    CAS  Article  Google Scholar 

  10. 10.

    Geng P, Ruan XF, Yang Y, Zhang HX. Analysis of genetic diversity of Diospyros spp. germplasm resources by using SSR markers. J Northwest A&F University (Natural Science Edition). 2010;38(12):190–6.

  11. 11.

    Jing ZB, Ruan XF, Wang RZ, Yang Y. Genetic diversity and relationships between and within persimmon (Diospyros L.) wild species and cultivated varieties by SRAP markers. Plant Systematics Evol. 2013;299(8):1485–92.

    Article  Google Scholar 

  12. 12.

    Liang JJ, Liang YQ, Fu JM. Genetic relationships of Diospyros kaki and related Diospyros species using chloroplast DNA PCR-RFLP markers. Acta Hortic Sin. 2014;41(12):2474–80.

    CAS  Google Scholar 

  13. 13.

    Duangjai S, Wallnöffer B, Samuel R, Munzinger J, Chase MW. Generic delimitation and relationships in Ebenaceae sensu lato: evidence from six plastid DNA regions. Am J Bot. 2006;93:1808–27.

    CAS  Article  Google Scholar 

  14. 14.

    Duangjai S, Samuel R, Munzinger J, Forest F, Wallnöfer B, Barfuss MH, Fischer G, Chase MW. A multi-locus plastid phylogenetic analysis of the pantropical genus Diospyros (Ebenaceae), with an emphasis on the radiation and biogeographic origins of the new Caledonian endemic species. Mol Phylogenet Evol. 2009;52:602–20.

    CAS  Article  Google Scholar 

  15. 15.

    Du XY, Zhang QL, Luo ZR. Comparison of four molecular markers for genetic analysis in Diospyros L. (Ebenaceae). Plant Syst Evol. 2009;281:171–81.

    CAS  Article  Google Scholar 

  16. 16.

    Suo ZL, Zhang CH, Zheng YQ, He LX, Jin XB, Hou BX, Li JJ. Revealing genetic diversity of tree peonies at micro-evolution level with hyper-variable chloroplast markers and floral traits. Plant Cell Rep. 2012;31:2199–213.

    CAS  Article  Google Scholar 

  17. 17.

    Suo ZL, Chen LN, Pei D, Jin XB, Zhang HJ. A new nuclear DNA marker from ubiquitin ligase gene region for genetic diversity detection of walnut germplasm resources. Biotechnol Reports. 2015;5:40–5.

    Article  Google Scholar 

  18. 18.

    Suo ZL, Li WY, Jin XB, Zhang HJ. A new nuclear DNA marker revealing both microsatellite variations and single nucleotide polymorphic loci: a case study on classification of cultivars in Lagerstroemia indica L. J Microb Biochem Technol. 2016;8:266–71.

  19. 19.

    Dong WP, Liu J, Yu J, Wang L, Zhou SL. Highly variable chloroplast markers for evaluating plant phylogeny at low taxonomic levels and for DNA barcoding. PLoS One. 2012;7:e35071.

    CAS  Article  Google Scholar 

  20. 20.

    Dong WP, Xu C, Cheng T, Lin K, Zhou SL. Sequencing angiosperm plastid genomes made easy: a complete set of universal primers and a case study on the phylogeny of Saxifragales. Genome Biol Evol. 2013;5(5):989–97.

    Article  Google Scholar 

  21. 21.

    Dong WP, Liu H, Xu C, Zuo YJ, Chen ZJ, Zhou SL. A chloroplast genomic strategy for designing taxon specific DNA mini-barcodes: a case study on ginsengs. BMC Genet. 2014;15:138.

  22. 22.

    Dong WP, Dong WP, Xu C, Li CH, Sun JH, Zuo YJ, Shi S, Cheng T, Guo JJ, Zhou SL. ycf1, the most promising plastid DNA barcode of land plants. Sci Rep. 2015;5:8348.

  23. 23.

    Dong WP, Xu C, Li DL, Jin XB, Lu Q, Suo ZL. Comparative analysis of the complete chloroplast genome sequences in psammophytic Haloxylon species (Amaranthaceae). PeerJ. 2016;4:e2699.

    Article  Google Scholar 

  24. 24.

    Dong WP, Xu C, Li WQ, Xie XM, Lu YZ, Liu YL, Jin XB, Suo ZL. Phylogenetic resolution in Juglans based on complete chloroplast genomes and nuclear DNA sequences. Front Plant Sci. 2017;8:1148.

  25. 25.

    He LX, Suo ZL, Zhang CH, Jin XB, Zhao DX, Zhao XQ, Hou BX, Deng CF. Classification of Chinese medicinal tree peony cultivars based on chloroplast DNA sequences. AASRI Procedia. 2012;1:344–52.

    Article  Google Scholar 

  26. 26.

    Li WQ, Yang Y, Xie XM, Lu YZ, Chang Q, Jin XB, Suo ZL. Diospyros oleifera and D. deyangensis are revealed as the closest relatives to D. kaki by E3 ubiquitin-protein ligase UPL3 DNA sequences. Hans J. Agricultural Sci. 2018; 8(6): 657–673. (In Chinese with English abstract).

  27. 27.

    Fu J, Fu JM, Liu HM, Hu JJ, Liang YQ, Liang JJ, Wuyun TN, Tan XF. Five complete chloroplast genome sequences from Diospyros: genome organization and comparative analysis. PLoS One. 2016;11(7):e0159566.

    Article  Google Scholar 

  28. 28.

    Xu C, Dong WP, Li WQ, Lu YZ, Xie XM, Jin XB, Shi JP, He KH, Suo ZL. Comparative analysis of six Lagerstroemia complete chloroplast genomes. Front Plant Sci. 2017;8:15.

  29. 29.

    Bankevich A, Nurk S, Antipov D, Gurevich AA, Dvorkin M, Kulikov AS, Lesin VM, Nikolenko SI, Pham S, Prjibelski AD, Pyshkin AV, Sirotkin AV, Vyahhi N, Tesler G, Alekseyev MA, Pevzner PA. SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing. J Comput Biol. 2012;19:455–77.

    CAS  Article  Google Scholar 

  30. 30.

    Kearse M, Moir R, Wilson A, Stones-Havas S, Cheung M, Sturrock S, Buxton S, Cooper A, Markowitz S, Duran C, Thierer T, Ashton B, Meintjes P, Drummond A. Geneious basic: an integrated and extendable desktop software platform for the organization and analysis of sequence data. Bioinformatics. 2012;28:1647–9.

    Article  Google Scholar 

  31. 31.

    Wyman SK, Jansen RK, Boore JL. Automatic annotation of organellar genomes with DOGMA. Bioinformatics 2004; 20: 3252–3255. doi: PubMed: 15180927.

    CAS  Article  Google Scholar 

  32. 32.

    Conant GC, Wolfe KH. GenomeVx: simple web-based creation of editable circular chromosome maps. Bioinformatics. 2008;24(6):861–2.

    CAS  Article  Google Scholar 

  33. 33.

    Katoh K, Standley DM. MAFFT multiple sequence alignment software version 7: improvements in performance and usability. Mol Biol Evol. 2013;30:772–80.

    CAS  Article  Google Scholar 

  34. 34.

    Rambaut A. Se-Al: sequence alignment editor. Version 2.0. Oxford: University of Oxford, Department of Zoology; 1996.

    Google Scholar 

  35. 35.

    Tamura K, Stecher G, Peterson D, Filipski A, Kumar S. MEGA6: molecular evolutionary genetics analysis version 6.0. Mol Biol Evol. 2013;30:2725–9.

    CAS  Article  Google Scholar 

  36. 36.

    Librado P, Rozas J. DnaSP v5: a software for comprehensive analysis of DNA polymorphism data. Bioinformatics. 2009;25:1451–2.

    CAS  Article  Google Scholar 

  37. 37.

    Brown SD, Collins RA, Boyer S, Lefort MC, Malumbres-Olarte J, Vink CJ, Cruickshank RH. Spider: an R package for the analysis of species identity and evolution, with particular reference to DNA barcoding. Molecular Ecol Resources. 2012;12:562–5.

    Article  Google Scholar 

  38. 38.

    Swofford DL. PAUP*. Phylogenetic Analysis Using Parsimony (*and Other Methods). Version 4b10. Sunderland. Massachusetts: Sinauer; 2003.

    Google Scholar 

  39. 39.

    Stamatakis A. RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models. Bioinformatics. 2006;22:2688–90 PMID: 16928733.

    CAS  Article  Google Scholar 

  40. 40.

    Ronquist F, Huelsenbeck JP. MrBayes 3.2: efficient Bayesian phylogenetic inference and model choice across a large model space. Syst Biol. 2012;61:539–42.

    Article  Google Scholar 

  41. 41.

    Li YC, Korol AB, Fahima T, Beiles A, Nevo E. Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Mol Ecol. 2002;11:2453–65.

    CAS  Article  Google Scholar 

  42. 42.

    Provan J, Soranzo N, Wilson NJ, Goldstein DB, Powell W. A low mutation rate for chloroplast microsatellites. Genetics. 1999;153:943–7.

    CAS  PubMed  PubMed Central  Google Scholar 

  43. 43.

    Perdereau A, Klaas M, Barth S, Hodkinson TR. Plastid genome sequencing reveals biogeographical structure and extensive population genetic variation in wild populations of Phalaris arundinacea L. in North-Western Europe. Glob Change Biol Bioenergy. 2017;9:46–56.

    Article  Google Scholar 

  44. 44.

    Song Y, Wang SJ, Ding YM, Xu J, Li MF, Zhu SF, Chen NZ. Chloroplast genomic resource of Paris for species discrimination. Sci Rep. 2017;7:3427.

    Article  Google Scholar 

  45. 45.

    Yu XQ, Drew BT, Yang JB, Gao LM, Li DZ. Comparative chloroplast genomes of eleven Schima (Theaceae) species: insights into DNA barcoding and phylogeny. PLoS One. 2017;12:e0178026.

    Article  Google Scholar 

  46. 46.

    Hebert PDN, Cywinska A, Ball SL, DeWaard JR. Biological identifications through DNA barcodes. Proc Royal Soc London Series B-Biological Sciences. 2003;270:313–21.

    CAS  Article  Google Scholar 

  47. 47.

    Drew BT, Sytsma KJ. Phylogenetics, biogeography, and staminal evolution in the tribe Mentheae (Lamiaceae). Am J Bot. 2012;99:933–53.

    Article  Google Scholar 

  48. 48.

    Parks M, Liston A, Cronn R. Newly developed primers for complete ycf1 amplification in Pinus (Pinaceae) chloroplasts with possible family-wide utility. Am J Bot. 2011;98:e185–8.

    CAS  Article  Google Scholar 

  49. 49.

    Hernandez-Leon S, Gernandt DS, Perez de la Rosa JA, Jardon-Barbolla L. Phylogenetic relationships and species delimitation in Pinus section Trifoliae inferred from plastid DNA. PLoS One. 2013;8:e70501.

    Article  Google Scholar 

  50. 50.

    Yang J, Vazquez L, Chen X, Li H, Zhang H, Liu Z, Zhao G. Development of chloroplast and nuclear DNA markers for Chinese oaks (Quercus subgenus Quercus) and assessment of their utility as DNA barcodes. Front Plant Sci. 2017;8:816.

    Article  Google Scholar 

  51. 51.

    Dastpak A, Osaloo SK, Maassoumi AA, Safar KN. Molecular phylogeny of Astragalus sect. Ammodendron (Fabaceae) inferred from chloroplast ycf1 gene. Ann Bot Fenn. 2018;55:75–82.

    Article  Google Scholar 

  52. 52.

    Shaw J, Lickey EB, Beck JT, Farmer SB, Liu W, Miller J, Siripun KC, Winder CT, Schilling EE, Small RL. The tortoise and the hare II: relative utility of 21 non-coding chloroplast DNA sequences for phylogenetic analysis. Amer J Bot. 2005;92:142–66.

    CAS  Article  Google Scholar 

  53. 53.

    Shaw J, Lickey EB, Schilling EE, Small RL. Comparison of whole chloroplast genome sequences to choose noncoding regions for phylogenetic studies in angiosperms: the tortoise and the hare III. Amer J Bot. 2007;94:275–88.

    CAS  Article  Google Scholar 

  54. 54.

    Ng FSP. Diospyros roxburghii and the origin of Diospyros kaki. Malays Forest. 1978;41:43–50.

    Google Scholar 

  55. 55.

    Yonemori K, Kanzaki S, Parfitt DE, Utsunomiya N, Subhadrabandhu S, Sugiura A. Phylogenetic relationship of Diospyros kaki (persimmon) to Diospyros spp. (Ebenaceae) of Thailand and four temperate zone Diospyros spp. from an analysis of RFLP variation in amplified cpDNA. Genome. 1998;41:173–82.

  56. 56.

    Zhang HP, Zhang JY, Liu QL. Research progress on the advantages of persimmon germplasm resources and genetic breeding in China. Heilongjiang Agricultural Sci. 2016;9:149–51 (In Chinese with English Abstract).

    Google Scholar 

Download references


The authors thank Chao Xu for his kind help in molecular experiments and data analysis, and Jianping Wang and Renzi Wang for their instructions in field investigations.


The study was financially supported by “Collection, Conservation and Evaluation of Forest Tree Germplasm Resources” (LKZ201496–1-3) of Shandong Provincial Agricultural Elite Varieties Project, grant No. 2017YFD0600604, No. LCNZ2016–36, the National Forest Genetic Resources Platform (2005DKA21003), National Service Sharing Infrastructure of Agricultural Crop Germplasm Resources of Persimmon Resources (NICGR2017–50).

Availability of data and materials

The complete chloroplast genomes of D. cathayensis, D. deyangensis, D. rhombifolia and D. virginiana were submitted to the NCBI database with GenBank accession numbers MF288576 (D. cathayensis), MF288575 (D. deyangensis), MF288578 (D. rhombifolia) and MF288577 (D. virginiana).

Author information




ZS, WD, and WL designed this study; WD and YL designed experiments, sequenced chloroplast genomes; YY, XX, YL, XJ, ZY and ZS conducted field investigations; WD, YL, XJ and ZY analyzed the data; ZS, WD, WL and XJ wrote the manuscript; all authors have read and approved the final manuscript.

Corresponding authors

Correspondence to Wenpan Dong or Zhili Suo.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Additional file

Additional file 1:

Figure S1. NJ trees of Diospyros taxa based on each of the ten chloroplast barcodes, showing the resolutions of each different locus for revealing the phylogeny. The figures above the lines are the bootstrap support values for the clades. (PDF 506 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Li, W., Liu, Y., Yang, Y. et al. Interspecific chloroplast genome sequence diversity and genomic resources in Diospyros. BMC Plant Biol 18, 210 (2018).

Download citation


  • Diospyros
  • Chloroplast genome
  • Hyper-variable region
  • Genetic diversity