Identification of QTLs affecting scopolin and scopoletin biosynthesis in Arabidopsis thaliana
BMC Plant Biology volume 14, Article number: 280 (2014)
Scopoletin and its glucoside scopolin are important secondary metabolites synthesized in plants as a defense mechanism against various environmental stresses. They belong to coumarins, a class of phytochemicals with significant biological activities that is widely used in medical application and cosmetics industry. Although numerous studies showed that a variety of coumarins occurs naturally in several plant species, the details of coumarins biosynthesis and its regulation is not well understood. It was shown previously that coumarins (predominantly scopolin and scopoletin) occur in Arabidopsis thaliana (Arabidopsis) roots, but until now nothing is known about natural variation of their accumulation in this model plant. Therefore, the genetic architecture of coumarins biosynthesis in Arabidopsis has not been studied before.
Here, the variation in scopolin and scopoletin content was assessed by comparing seven Arabidopsis accessions. Subsequently, a quantitative trait locus (QTL) mapping was performed with an Advanced Intercross Recombinant Inbred Lines (AI-RILs) mapping population EstC (Est-1 × Col). In order to reveal the genetic basis of both scopolin and scopoletin biosynthesis, two sets of methanol extracts were made from Arabidopsis roots and one set was additionally subjected to enzymatic hydrolysis prior to quantification done by high-performance liquid chromatography (HPLC). We identified one QTL for scopolin and five QTLs for scopoletin accumulation. The identified QTLs explained 13.86% and 37.60% of the observed phenotypic variation in scopolin and scopoletin content, respectively. In silico analysis of genes located in the associated QTL intervals identified a number of possible candidate genes involved in coumarins biosynthesis.
Together, our results demonstrate for the first time that Arabidopsis is an excellent model for studying the genetic and molecular basis of natural variation in coumarins biosynthesis in plants. It additionally provides a basis for fine mapping and cloning of the genes involved in scopolin and scopoletin biosynthesis. Importantly, we have identified new loci for this biosynthetic process.
Plants produce a great variety of secondary metabolites. It is estimated that between 4000 to 20 000 metabolites per species can be expected . This great biochemical diversity reflects the variety of environments in which plants live, and the way they have to deal with different environmental stimuli. The production of specialized secondary metabolites is assumed to protect plants against biotic and abiotic stresses . Although Arabidopsis is a small plant with short generation time and highly reduced genome, it has a set of secondary metabolites that is as abundant and diverse as those of other plant taxa . In recent years, this model plant was extensively used towards identification of genes and enzymes working in a complex network involved in secondary metabolites biosynthesis and regulation .
Currently, genetic variation found between natural Arabidopsis accessions is an important basic resource for plant biology -. Arabidopsis with its extensive genetic natural variation provides an excellent model to study variation in the biosynthesis of secondary metabolites in natural populations. Recent genetic analysis of natural variation in untargeted metabolic composition uncovered many qualitative and quantitative differences in metabolite accumulation between Arabidopsis accessions -. Numerous studies ,- proved the presence of abundant genetically controlled variation for various classes of secondary metabolites. Coumarins (scopoletin, scopolin, skimmin and esculetin) are one of the secondary metabolite classes found in Arabidopsis' roots -. But up to now, nothing is known about natural variation in coumarins content between Arabidopsis accessions.
Coumarins are a group of important natural compounds that provide for the plant antimicrobial and antioxidative activities, and are produced as a defence mechanism against pathogen attack and abiotic stresses . Importantly, coumarins are widely recognized in the pharmaceutical industry for their wide range of therapeutic activities and are an active source for drug development. Numerous coumarins have medical application in the treatment of burns and rheumatoid diseases. Furanocoumarins, which are coumarin derivatives, are used in the treatment of leucoderma, vitiligo and psoriasis , due to their photoreactive properties. Moreover, they are used in symptomatic treatment of demyelinating diseases, particularly multiple sclerosis . Furanocoumarin-producing plants that are currently studied are non-model organisms  and many approaches to identify the genes underlying genetic variation in coumarins accumulation are not yet available in those species. Scopoletin, which is a major coumarin compound of Arabidopsis, has been found in many plant species -, and was clearly shown to have antifungal and antibacterial activities important for medical purposes . All these properties make coumarins attractive from the commercial point of view.
Coumarins are derived from phenylopropanoid pathway, which serves as a rich source of metabolites in plants ,. It was suggested that in Arabidopsis several branch pathways leading from phenylpropanoid compounds to coumarins are probable . Scopoletin and scopolin biosynthesis was shown to be strongly dependent on the CYP98A3 , which is the cytochrome P450 catalyzing 3'-hydroxylation of p-coumarate units in the phenylpropanoid pathway . The feruloyl-CoA was suggested to be a major precursor in scopoletin biosynthesis . A key enzyme involved in the final step of scopoletin biosynthesis, which is the conversion of feruloyl-CoA into 2-hydroxy-feruloyl-CoA, is encoded by a member of the iron (Fe) II- and 2-oxoglutarate-dependent dioxygenase (2OGD) family, designated as F6'H1 . Despite the advances that have been made in previous years ,- (Figure 1), many questions with regard to coumarins biosynthesis are still open . In particular, the regulation of the biosynthesis of coumarins is not well understood. Up to now, all studies investigating coumarins biosynthesis in the model plant Arabidopsis were done with one laboratory accession Col-0, which was used as the genetic background of all mutant and transgenic plants.
To gain an understanding of the genetic architecture of coumarins biosynthesis, we screened a set of Arabidopsis accessions for variation in scopolin and scopoletin content, and subsequently conducted a quantitative trait locus (QTL) mapping. Our study addressed the following questions. Is there a natural variation in accumulation of scopolin and scopoletin between Arabidopsis accessions and what are genetic regions responsible for the observed differences? What are candidate genes possibly underlying QTLs involved in scopolin and scopoletin biosynthesis?
Phenotypic variation between accessions
A set of seven natural Arabidopsis accessions, which are the parents of existing RIL populations and represent accessions from different locations, were used in the initial screening for variation in scopolin and scopoletin accumulation. Accessions were grown in vitro in liquid cultures in order to obtain the optimal growth of plant roots. Under these conditions, most of the scopoletin is stored in root cells in vacuoles as its glycoside form, scopolin. In order to reveal the content of both scopolin and that of scopoletin, a subset of the methanol extracts made from Arabidopsis roots were subjected to enzymatic hydrolysis in order to hydrolyze the glycoside forms of coumarins. Using high-performance liquid chromatography (HPLC), we detected in the roots scopoletin (sct in Figure 2), as well as scopolin (scl in Figure 2BC). The identification of scopoletin in HPLC fraction (Figure 3A) was further confirmed using gas chromatography/mass spectrometry (GC/MS) by comparison to spectrum library (Figure 3B). The quantification of coumarins in methanol root extracts made from seven Arabidopsis accessions clearly showed the presence of natural variation in scopolin content before enzymatic hydrolysis (Figure 4A) and scopoletin after hydrolysis (Figure 4B). In spite of the fact that scopolin standard was not available and in order to unify further analysis, we measured the amounts of both scopolin and scopoletin as area% of total chromatogram signals. The statistically significant differences between group means for scopolin and scopoletin accumulation were determined by one-way ANOVA (p < 0.001 and p < 0.0001, respectively). Values that are not significantly different based on the post hoc test (least significant differences [LSD]) are indicated by the same letters (Figure 4). Based on the obtained results we have selected an Advanced Intercross Recombinant Inbred Lines (AI-RILs) mapping population derived from the cross between Col-0 and Est-1, because these parents significantly differed in coumarins content. Further genetic analysis was performed using values for the accumulation of scopolin before enzymatic hydrolysis and the content of scopoletin after hydrolysis of methanol extracts.
Genetic analyses of scopolin and scopoletin accumulation
The scopoletin and scopolin content values were determined for three biological replicates of AI-RILs (n = 144 and n = 140, respectively) and parental lines, which were grown in independent flasks in liquid cultures. A set of lines (AI-RILs) showed a wider range of scopolin (Figure 5A) and scopoletin (Figure 5B) values than the ones observed for both parental lines (Col-0 and Est-1), which indicated the presence of transgressive segregation and suggested that multiple loci contribute to variation in the EstC population. The lowest scopolin content within AI-RILs was 1.90 (measured as an area% of total chromatogram signals) that corresponds to 20% of the minimum Col-0 value. The maximal relative value of scopolin was 45.13, which corresponds to 159% of the maximal Est-1 value. For scopoletin content, these values were respectively 7.82 (54% of the minimum Col-0 value) and 54.93 (159% of the maximal Est-1 value) (Table 1). Having a commercially available scopoletin standard, we were able to quantify the scopoletin contents as μg/g fresh weight ( μg/gFW) in both parental lines of the AI-RILs mapping population (Col-0 and Est-1) before and after enzymatic hydrolysis. The scopoletin levels in root samples not subjected to hydrolysis were ~3 μg/gFW and ~10 μg/gFW in Col-0 and Est-1 respectively, and ~16 μg/gFW and ~86 μg/gFW in samples after hydrolysis. These values correspond to ~18, 54, 82 and 449 nmol/gFW respectively that is in the range found in the literature data, which vary from ~1 to 1200 nmol/gFW depending on plant culture being used . The calculated quantities of parental lines (Table 2) can be used as references for the overall quantity of the products in the whole mapping population.
In order to identify the fraction of variation that is genetically determined, the broad sense heritability (H 2) for scopolin and scopoletin content was estimated as described in Methods section. In the AI-RIL population, the broad sense heritability ranged from 0.45 for scopoletin to 0.50 for scopolin content (Table 1). To explore the relationship between scopolin content in methanol root extracts before enzymatic hydrolysis and scopoletin levels in extracts subjected to hydrolysis, the mean values of coumarins for each AI-RILs were used as phenotype values in trait correlation analysis. A relatively strong genetic correlation (R2 = 0.6634) was observed between the level of coumarins measured before and after hydrolysis in the AI-RILs population, indicating genetic co-regulation of scopolin and scopoletin biosynthesis (Figure 6).
Mapping QTLs for scopolin and scopoletin accumulation
Six QTLs were identified, with one QTL being detected for scopolin and five QTLs for scopoletin accumulation (Table 3). The QTL effect sizes ranged from the 7.0% to 16.7% of the phenotypic variance explained by the QTL (PVE), with three of the six QTLs having effect sizes below 10% PVE. One QTL (SCL1) was detected for scopolin accumulation at the bottom of chromosome 5 (Figure 7) explaining the 13.86% PVE (Table 3), and five QTLs (SCT1 - SCT5) for scopoletin accumulation were identified on chromosome 1, 3 and 5 (Figure 8, Table 3). No QTLs were detected on chromosome 2 and 4. To improve the QTL model explaining variation in a scopoletin content, the MQM approach was performed using two QTLs (SCT4 and SCT5) as cofactors. We have included in the model QTL on chromosome 1 (SCT1), despite its LOD score was slightly below the threshold (3.327). The whole model explains 37.6% variance for scopoletin content. No epistasis between the main effect loci were detected.
QTL mapping identifies known and new loci for coumarins biosynthesis
Some of the mapped QTLs underlying variation in scopolin (SCL1) and scopoletin (SCT1 and SCT2) accumulation in the AI-RILs population, co-localize with the genes annotated to be involved in coumarin biosynthetic process (Plant Metabolic Network, http://plantcyc.org/, Figure 1). We detected seven cloned and characterized genes encoding enzymes for scopoletin and scopolin biosynthesis that co-localize with detected QTLs (see Additional file 1). Within the SCL1 interval, which is characterized by one of the highest LOD score values, there are two very good candidates. One of them is At5g48930 encoding a shikimate O-hydroxycinnamoyltransferase (HCT), while the other one (At5g54160) encodes caffeic acid/5-hydroxyferulic acid O-methyltransferase (OMT1). Importantly, both genes are expressed in roots (SCL1 in Table 4). Within the SCT1 and SCT2 intervals underlying variation in scopoletin content more possible candidate genes were detected: At1g33030, At1g51990, At1g67980 and At1g67990 (TSM1) encoding proteins from O-methyltransferase family; At1g51680 and At1g65060 encoding isoforms of 4-coumarate:CoA ligase (4CL1 and 4CL3 respectively); At1g62940 encoding acyl-CoA synthetase (ACOS5); and At1g55290 encoding feruloyl CoA ortho-hydroxylase 2 (F6'H2).
In order to reveal other candidate genes possibly underlying detected QTLs, two QTLs for scopoletin content (SCT4 and SCT5) and one QTL associated with scopolin (SCL1) accumulation were chosen for further in silico analyses. The selected intervals are characterized by the highest percentage of phenotypic variance explained by each QTL and the highest LOD score values. The annotated functions for all genes located in the selected QTL intervals were checked. As a result, we selected genes encoding transcription factors that might be induced by environmental stresses and enzymes that according to the annotation functions could be possibly involved in scopolin and scopoletin biosythensis. Subsequently, we performed in silico analysis of the tissue distribution and level of expression of selected genes. Only genes that were expressed in roots were selected as possible candidates for further studies. As a result, we selected a set of genes that deserve close attention as possible new loci underlying variation in scopolin and scopoletin accumulation (Table 4). Among candidates possibly involved in scopoletin accumulation, a particularly interesting one is a CYP81D11 gene (At3g28740) encoding a member of the cytochrome P450 family, which is located within the QTL on chromosome 3 (SCT4 in Table 4). According to the 1001 Genomes Project database (www.1001genomes.org) and re-sequencing data of Est-1 from our laboratory (see Additional files 2 and 3, indicated as Est-1*), the CYP81D11 gene contains several SNPs and one indel in the coding sequences of the parental lines of EstC mapping population and in the other accessions tested in this study (see Additional file 2). Other interesting candidates are three genes (At5g14340, At5g14750, At5g15130) located within the QTL interval on chromosome 5 (SCT5 in Table 4), which encode members of the MYB and WRKY transcription factor families. These genes are relatively highly expressed in roots and their expression is induced by various environmental stresses . A particularly interesting candidate that could be possibly linked to scopolin accumulation was detected within the QTL on chromosome 5 (SCL1 in Table 4). It is At5g53990 encoding a UDP-glycosyltransferase, which is relatively highly expressed in Arabidopsis roots . According to the 1001 Genomes Project and our re-sequencing data of Est-1, this gene contains several SNPs in the coding sequences of tested accessions including the parental lines (see Additional file 3). Interestingly, the CYP81D11 and UDP-glycosyltransferase sequences originating from Est, Est-1 (both taken from the 1001 Genomes Project database) and Est-1* that was re-sequenced in our laboratory are not identical (see Additional files 2 and 3). This needs to be further verified.
Here, we report a QTL mapping study of variation in scopoletin and scopolin accumulation between two Arabidopsis accessions and thereby we demonstrate the usefulness of Arabidopsis natural variation in elucidating the genetic and molecular basis of coumarins biosynthesis.
A large number of Arabidopsis recombinant inbred line (RIL) populations are available and extensively used for identification of numerous QTLs controlling various traits such as growth, development or resistance to different biotic and abiotic stresses as well as the content of chemical compounds ,,,,. In most studies, the average number of QTLs identified is between one and 10 and at least one major QTL is detected . Here, one QTL for scopolin and five QTLs for scopoletin accumulation were detected, which is in agreement with the average result in the field. Using an AI-RILs mapping population has the advantage in comparison to RILs due to the fact that the opportunity for recombination is increased before genotypes are fixed upon selfing . As a result, using AI-RILs mapping population that captures an increased number of recombination events , enabled us to detect QTLs with effect size as low as 7.0% PVE.
Once QTL has been identified, the next challenge is to identify the gene(s) underlying detected QTL. In most cases, a large number of genes that are present in the QTL interval cannot be directly tested for candidacy. In order to reduce the mapped region, a fine-mapping is performed in which many individuals are genotyped for markers around the QTL. More accurate QTL localization might lead to the selection of candidate genes. Nonetheless, performing a fine mapping may be practically difficult if the QTL effect is relatively small . When multiple data sets are available, which is the case for Arabidopsis, it is possible to improve accuracy and to test the candidacy of genes within mapped QTL intervals  based on the available information. Therefore, it seems like a realistic possibility to identify candidate genes underlying a QTL by using the high throughput expression data and the complete genome sequences of numerous Arabidopsis accessions that were used to construct mapping populations. There are successful examples of using expression arrays in identifying genes causally associated with quantitative traits of interest, both in plants and animals ,. In this study, possible candidate genes were found within mapped QTL intervals for scopolin and scopoletin content, including known and novel loci. Further functional analysis, including re-sequencing, characterization of loss-of-function alleles and conducting gene complementation either by crossing or genetic transformation, are required to prove the role of selected possible candidate genes in coumarins biosynthesis and their regulation.
Expanding molecular understanding of coumarins biosynthesis at an ecological level will be beneficial for the future discovery of the physiological mechanisms of action of genes involved in coumarins biosynthesis. It was suggested recently that some members the 2'-OG dioxygenase family, including the F6'H1 that is a key enzyme in scopoletin biosynthesis, may be involved in Fe deficiency responses and metabolic adjustments linked to Fe homeostasis in plant cells . Other latest studies showed that Fe deficiency induces the secretion of scopoletin and its derivatives by Arabidopsis roots , and that F6'H1 is required for the biosynthesis of coumarins that are released into the rhizosphere as part of the strategy I-type Fe acquisition machinery . Previously, the existence of natural variation in root exudation profiles was clearly detected among eight Arabidopsis accessions . The above mentioned findings make a study of coumarins biosynthesis in Arabidopsis using naturally occurring intraspecific variation even more promising and up-to-date.
In summary, we have presented here for the first time a presence of naturally occurring intraspecies variation in scopoletin and its glucoside, scopolin, accumulation among seven Arabidopsis accessions. Even though, these accessions do not completely represent a wide genetic variation existing in Arabidopsis, it is assumed that these accessions should reflect genetic adaptation to local environmental factors . A QTL mapping study of scopoletin and scopolin variation within EstC mapping population was conducted leading to the identification of new loci. The results presented here suggest that natural variation in coumarins content in Arabidopsis has a complex molecular basis. Importantly, they also provide a basis for fine mapping and cloning of the genes involved in coumarins biosynthesis.
Seven Arabidopsis thaliana accessions Antwerpen (An-1, Belgium), Columbia (Col-0, Germany), Estland (Est-1, Estonia), Kashmir (Kas-2, India), Kondara (Kond, Tadjikistan), Landsberg erecta (Ler, Poland) and Tsu (Tsu-1, Japan), which are the parents of existing RIL populations and represent accessions from different locations, were used in the initial screening for variation in scopoletin and scopolin accumulation. An advanced recombinant inbred lines (AI-RILs) mapping population (EstC) derived from the cross between Columbia (Col-0) and Estland (Est-1) was used in the QTL mapping experiment . All seeds of the Arabidopsis accessions and mapping population were kindly provided by Maarten Koornneef from the Max Planck Institute for Plant Breeding Research in Cologne, Germany. Arabidopsis accessions are available at the stock centre NASC (http://arabidopsis.info/). The EstC mapping population together with the marker data are available at the NASC under the stock number CS39389.
The seeds were surface sterilized by soaking in 70% ethanol for two min and subsequently kept in 5% calcium hypochlorite solution for eight min. Afterwards seeds were rinsed three times in autoclaved millipore water and planted on 0.5 Murashige and Skoog's (MS) medium containing 1% sucrose, 0.8% agar supplemented with 100 mg/l myo-inositol, 1 mg/l thiamine hydrochloride, 0.5 mg/l pyridoxine hydrochloride and 0.5 mg/l nicotinic acid. For stratification, plates were kept in the dark at 4°C for 72 h and then placed under defined growth conditions. All plants were grown in vitro in plant growth chambers under a photoperiod of 16 h light (35 μmol m-2 s-1) at 20°C and 8 h dark at 18°C. After 10 days seedlings were transferred from agar plates into 200 ml glass culture vessels (5.5°Cm diameter × 10°Cm high, glass jars with magenta B caps) containing 8 ml sterile liquid medium. Plants grown in liquid cultures were incubated on rotary platform shakers at 120 rpm. After 17 days plants were harvested (28th day of culture), leaves and roots were frozen separately in liquid nitrogen and stored at -80°C. All genotypes were grown in three biological replicates (in independent flasks). The growth conditions were monitored by a HOBO U12 data logger (Onset Computer Corporation, Bourne, MA) that recorded the parameters (temperature, light intensity and relative humidity) in an interval at every five minutes.
Preparation of methanol extracts from Arabidopsis roots
The root tissue was homogenized using steel beads and sonication. The coumarins were extracted at 4°C with 80% methanol. After 24 h two sets of methanol extracts were centrifuged for 20 min at 13000 rpm, one set was additionally subjected to enzymatic hydrolysis using β-glucosidase from almonds (Sigma-Aldrich) dissolved in acetate buffer according to modified protocol of .
Scopoletin and scopolin quantification by High-Performance Liquid Chromatography (HPLC)
The methanol extracts of Arabidopsis roots with and without enzymatic treatment were analyzed (Figure 2) using a Perkin Elmer series 200 HPLC system comprising of a quaternary LC pump, autosampler, column oven and a UV detector. All samples were filtered with 0.22 μm filters before loading. The volume injected was 10 μl. Gradient elution on Perkin Elmer C18 column SC18 (250×4.6 mm) was performed at flow rate of 0.7 ml/min with the following solvent system: (A) 50 mm ammonium acetate pH 4.5, (B) Methanol: starting from 30% B for 2 min, 30-80% B in 40 min followed by isocratic elution and column regeneration. The fluorescence detector was based on absorbance at 340 nm excitation wavelength and emission at 460 nm. The data analysis consisted of scopoletin and scopolin relative analysis (area percent of total chromatogram).
Scopoletin identification by Gas Chromatography/Mass Spectrometry (GC/MS)
The HPLC fractions containing scopoletin peak were collected and scopoletin identification was confirmed (Figure 3A) with Gas Chromatography/Mass Spectrometry (GC/MS) by comparison to spectrum library (Figure 3B). GC/MS analysis was performed using a Perkin-Elmer GC XL Gas Chromatograph interfaced to a Mass Spectrometer equipped with an Elite-5MS (5% diphenyl/ 95% dimethyl polysiloxane) fused to a capillary column (30 × 0.25 μm ID × 0.25 μm df). For GC/MS detection, an electron ionization system operated in electron impact mode with an ionization energy of 70 eV. Helium gas was used as a carrier gas at a constant flow rate of 1 ml/min, and an injection volume of 2 μl was employed (a split ratio of 10:1). The ion-source temperature was 250°C, the oven temperature was programmed from 100°C (isothermal for 5 min), with an increase of 10°C/min to 300°C. Mass spectra were taken at 70 eV; a scan interval of 0.5 s and fragments from 30 to 450 Da. The solvent delay was 1 to 2 min, and the total GC/MS running time was 38 min. The mass-detector used in this analysis was Turbo-Mass Gold-Perkin-Elmer, and the MS software Turbo-Mass ver-5.1.
Coumarins were quantified in the methanol root extracts of three biological replicates (cultivated in independent flasks) of all AI-RILs individuals. Methanol extracts subjected to enzymatic hydrolysis were used for scopoletin quantification, while scopolin contents were determined in methanol extracts without hydrolysis.
Quantitative genetic analyses
The scopolin and scopoletin mean values for each AI-RILs were used in QTL mapping and trait correlation analysis. The regression equation and R2 were calculated by plotting scopolin and scopoletin mean values against one another in Scatterplot (Microsoft Excel). The broad sense heritability (H 2) was estimated according to the formula H 2 = V G /(V G + V E ), where V G is the among-genotype variance component and V E is the residual (error) variance.
QTL analyses in the AI-RIL population
Statistical analysis of phenotypic data was performed by Shapiro-Wilk normality test. Phenotypic data is normally distributed at the significance level α = 0.05. QTL mapping was performed using R software (A Core Team, 2012, www.R-project.org) with R/qtl package ,; http://www.rqtl.org/). QTL mapping was performed with Simple Interval Mapping (SIM) (data not shown) followed by the Multiple QTL mapping (MQM) procedure. The QTLs with the highest logarithm of odds (LOD) scores detected by SIM were subsequently used to make the QTL model by the MQM. The final QTL model was done with the backward elimination of cofactors with the window size 10°CM and maximum number of cofactors 5. Significance threshold (LOD) values (P <0.05) for the QTL presence was estimated from 10 000 permutations and is 3.4. "Addint" function has been used to add pairwise interaction, one at a time, to a multiple-QTL model. No interaction has been detected.
Candidate genes selection
The physical positions of genes annotated to be involved in coumarin biosynthetic process (Plant Metabolic Network, http://plantcyc.org/) were checked according to TAIR (http://www.arabidopsis.org/). To reveal other candidate genes possibly underlying detected QTLs, a list of candidates was constructed using the following criteria: (1) genes encoding enzymes belonging to families involved in coumarins biosynthesis and genes encoding transcription factors that might be induced by environmental stresses (http://www.arabidopsis.org/); (2) genes that are expressed in roots (http://bar.utoronto.ca/). The list of potential candidates was compiled by searching TAIR (http://www.arabidopsis.org/) and Arabisopsis eFP Browser (http://bar.utoronto.ca/) (Table 4).
All treatments included at least three (or two in case of parental lines used in the genetic mapping) biological replicates. Data processing and statistical analyses (one way ANOVA, post-hoc test: least significant difference test [LSD]) were carried out using Microsoft Excel. Error bars representing standard deviation (SD) are shown in the figures; the data presented are means.
DNA samples preparation and sequencing
The RNeasy® Plant Mini Kit (Qiagen) was used following the instructions of the manufacturer and including on-column DNA digestion step with the RNase-Free DNase Set (Qiagen) to eliminate genomic DNA contamination. 0.5 μg of RNA was used for reverse transcription by Maxima First Strand cDNA Synthesis Kit (Thermo Scientific). The amplification of genes coding sequences was carried out in a 20 μl reaction mixture containing cDNA synthetized from RNA isolated from roots, 0.4 U of Platinum® Taq DNA Polymerase (Invitrogen), 200 μm dNTP, 1 μm primers, and 1 × PCR Buffer and 1.5 mm Mg2+. The reaction mixture was denatured at 94°C for 2 min, and then the PCR amplification was performed using 34°Cycles of 94°C for 30 sec, 52°C for 30 sec, and 72°C for 90 sec in the Thermal Cycler C1000 Touch (Bio-Rad). Gene-specific primers used for AT5G53990 UDP-glycosyltransferase amplification were 5'- ATGGGCCAAAATTTTCACGCT -3' and 5'- TCATTCAAGATTTGTATCGTTGACT-3' and for AT3G28740 CYP81D11 5'- ATGTCATCAACAAAGACAATAATGG-3' and 5'- TTATGGACAAGAAGCATCTAAAACC-3'. PCR products were cloned into pCR8 vector (Invitrogen). For plasmid amplification and maintenance, the Escherichia coli strain One Shot® (Invitrogen) was used. Positive clones were sequenced using vector specific primers M13fwd and M13rev and BigDye® Terminator v3.1 (Life Technologies). Sequencing reaction products were separated and analyzed by 3730xl DNA Analyzer. All sequences were aligned using CLUSTALW .
Availability of supporting data
The data sets supporting the results of this article are included within the article and its additional files.
JS cultivated the plant material, conducted secondary metabolites isolation, performed the QTL mapping and contributed to the in silico analyses, statistical analyses and the results interpretation. LK, RB and BB conducted the coumarins quantification by HPLC. BB and RB contributed to the statistical analyses. AO was involved in the in silico analyses. AGW contributed to the statistical analyses. EL contributed to the results interpretation. AI received grant support for the project, wrote the paper, design the experiments, interpreted the results, performed the in silico and statistical analyses, and participate in optimization of plant growth and secondary metabolites isolation. All authors read and approved the final manuscript.
4-coumarate:CoA ligase 1
4-coumarate:CoA ligase 2
4-coumarate:CoA ligase 3
4-coumarate:CoA ligase 5
Acyl-CoA synthetase 5
Advanced intercross recombinant inbred lines
Caffeoyl coenzyme A dependent O-methyltransferase 1
Caffeoyl coenzyme A dependent O-methyltransferase 7
Cytochrome P450 superfamily of monooxygenases
Feruloyl-CoA 6'-hydroxylase 1
Feruloyl-CoA 6'-hydroxylase 2
Gas Chromatography/Mass Spectrometry
High-performance liquid chromatography
Logarithm of odds
Murashige and Skoog medium
Multiple QTL mapping
Superfamily of transcription factors
Nottingham Arabidopsis stock centre
Caffeate O-methyltransferase 1
Tapetum-specific O- methyltransferase
Phenotypic variance explained
Simple interval mapping
Quantitative trait loci
Superfamily of transcription factors
Fernie AR, Trethewey RN, Krotzky AJ, Willmitzer L: Metabolite profiling: from diagnostics to systems biology. Nat Rev Mol Cell Biol. 2004, 5 (9): 763-769. 10.1038/nrm1451.
Kliebenstein DJ, Osbourn A: Making new molecules - evolution of pathways for novel metabolites in plants. Curr Opin Plant Biol. 2012, 15 (4): 415-423. 10.1016/j.pbi.2012.05.005.
D’Auria JC, Gershenzon J: The secondary metabolism of Arabidopsis thaliana: growing like a weed. Curr Opin Plant Biol. 2005, 8 (3): 308-316. 10.1016/j.pbi.2005.03.012.
Brotman Y, Riewe D, Lisec J, Meyer RC, Willmitzer L, Altmann T: Identification of enzymatic and regulatory genes of plant metabolism through QTL analysis in Arabidopsis. J Plant Physiol. 2011, 168 (12): 1387-1394. 10.1016/j.jplph.2011.03.008.
Alonso-Blanco C, Aarts MG, Bentsink L, Keurentjes JJ, Reymond M, Vreugdenhil D, Koornneef M: What has natural variation taught us about plant development, physiology, and adaptation?. Plant Cell. 2009, 21 (7): 1877-1896. 10.1105/tpc.109.068114.
Koornneef M, onso-Blanco C, Vreugdenhil D: Naturally occurring genetic variation in Arabidopsis thaliana . Annu Rev Plant Biol. 2004, 55: 141-172. 10.1146/annurev.arplant.55.031903.141605.
Weigel D: Natural variation in Arabidopsis: from molecular genetics to ecological genomics. Plant Physiol. 2012, 158 (1): 2-22. 10.1104/pp.111.189845.
Keurentjes JJ, Fu J, de Vos CH, Lommen A, Hall RD, Bino RJ, van der Plas LH, Jansen RC, Vreugdenhil D, Koornneef M: The genetics of plant metabolism. Nat Genet. 2006, 38 (7): 842-849. 10.1038/ng1815.
Lisec J, Meyer RC, Steinfath M, Redestig H, Becher M, Witucka-Wall H, Fiehn O, Torjek O, Selbig J, Altmann T, Willmitzer L: Identification of metabolic and biomass QTL in Arabidopsis thaliana in a parallel analysis of RIL and IL populations. Plant J. 2008, 53 (6): 960-972. 10.1111/j.1365-313X.2007.03383.x.
Rowe HC, Hansen BG, Halkier BA, Kliebenstein DJ: Biochemical networks and epistasis shape the Arabidopsis thaliana metabolome. Plant Cell. 2008, 20 (5): 1199-1216. 10.1105/tpc.108.058131.
Kliebenstein DJ, Gershenzon J, Mitchell-Olds T: Comparative quantitative trait loci mapping of aliphatic, indolic and benzylic glucosinolate production in Arabidopsis thaliana leaves and seeds. Genetics. 2001, 159 (1): 359-370.
Tholl D, Chen F, Petri J, Gershenzon J, Pichersky E: Two sesquiterpene synthases are responsible for the complex mixture of sesquiterpenes emitted from Arabidopsis flowers. Plant J. 2005, 42 (5): 757-771. 10.1111/j.1365-313X.2005.02417.x.
Bednarek P, Schneider B, Svatos A, Oldham NJ, Hahlbrock K: Structural complexity, differential response to infection, and tissue specificity of indolic and phenylpropanoid secondary metabolism in Arabidopsis roots. Plant Physiol. 2005, 138 (2): 1058-1070. 10.1104/pp.104.057794.
Kai K, Shimizu B, Mizutani M, Watanabe K, Sakata K: Accumulation of coumarins in Arabidopsis thaliana . Phytochemistry. 2006, 67 (4): 379-386. 10.1016/j.phytochem.2005.11.006.
Kai K, Mizutani M, Kawamura N, Yamamoto R, Tamai M, Yamaguchi H, Sakata K, Shimizu B: Scopoletin is biosynthesized via ortho-hydroxylation of feruloyl CoA by a 2-oxoglutarate-dependent dioxygenase in Arabidopsis thaliana . Plant J. 2008, 55 (6): 989-999. 10.1111/j.1365-313X.2008.03568.x.
Rohde A, Morreel K, Ralph J, Goeminne G, Hostyn V, De RR, Kushnir S, Van DJ, Joseleau JP, Vuylsteke M, Van DG, Van BJ, Messens E, Boerjan W: Molecular phenotyping of the pal1 and pal2 mutants of Arabidopsis thaliana reveals far-reaching consequences on phenylpropanoid, amino acid, and carbohydrate metabolism. Plant Cell. 2004, 16 (10): 2749-2771. 10.1105/tpc.104.023705.
Baillieul F, de Ruffray P, Kauffmann S: Molecular cloning and biological activity of alpha-, beta-, and gamma-megaspermin, three elicitins secreted by Phytophthora megasperma H20. Plant Physiol. 2003, 131 (1): 155-166. 10.1104/pp.012658.
Stern RS: Psoralen and ultraviolet a light therapy for psoriasis. N Engl J Med. 2007, 357 (7): 682-690. 10.1056/NEJMct072317.
Wulff H, Rauer H, During T, Hanselmann C, Ruff K, Wrisch A, Grissmer S, Hansel W: Alkoxypsoralens, novel nonpeptide blockers of Shaker-type K + channels: synthesis and photoreactivity. J Med Chem. 1998, 41 (23): 4542-4549. 10.1021/jm981032o.
Karamat F, Olry A, Doerper S, Vialart G, Ullmann P, Werck-Reichhart D, Bourgaud F, Hehn A: CYP98A22, a phenolic ester 3'-hydroxylase specialized in the synthesis of chlorogenic acid, as a new tool for enhancing the furanocoumarin concentration in Ruta graveolens . BMC Plant Biol. 2012, 12: 152-10.1186/1471-2229-12-152.
Bertolucci SK, Pereira AB, Pinto JE, Oliveira AB, Braga FC: Seasonal variation on the contents of coumarin and kaurane-type diterpenes in Mikania laevigata and M. glomerata leaves under different shade levels. Chem Biodivers. 2013, 10 (2): 288-295. 10.1002/cbdv.201200166.
Costet L, Fritig B, Kauffmann S: Scopoletin expression in elicitor-treated and tobacco mosaic virus-infected tobacco plants. Physiol Plant. 2002, 115 (2): 228-235. 10.1034/j.1399-3054.2002.1150208.x.
Gnonlonfin BG, Gbaguidi F, Gbenou JD, Sanni A, Brimer L: Changes in scopoletin concentration in cassava chips from four varieties during storage. J Sci Food Agric. 2011, 91 (13): 2344-2347. 10.1002/jsfa.4465.
Matsumoto S, Mizutani M, Sakata K, Shimizu B: Molecular cloning and functional analysis of the ortho-hydroxylases of p-coumaroyl coenzyme A/feruloyl coenzyme A involved in formation of umbelliferone and scopoletin in sweet potato, Ipomoea batatas (L.) Lam. Phytochemistry. 2012, 74: 49-57. 10.1016/j.phytochem.2011.11.009.
Prats E, Galindo JC, Bazzalo ME, Leon A, Macias FA, Rubiales D, Jorrin JV: Antifungal activity of a new phenolic compound from capitulum of a head rot-resistant sunflower genotype. J Chem Ecol. 2007, 33 (12): 2245-2253. 10.1007/s10886-007-9388-9.
Sargent JA, Skoog F: Effects of indoleacetic acid and kinetin on scopoletin-scopolin levels in relation to growth of tobacco tissues in vitro . Plant Physiol. 1960, 35 (6): 934-941. 10.1104/pp.35.6.934.
Schmeda-Hirschmann G, Jordan M, Gerth A, Wilken D, Hormazabal E, Tapia AA: Secondary metabolite content in Fabiana imbricata plants and in vitro cultures. Z Naturforsch C. 2004, 59 (1-2): 48-54.
Taguchi G, Fujikawa S, Yazawa T, Kodaira R, Hayashida N, Shimosaka M, Okazaki M: Scopoletin uptake from culture medium and accumulation in the vacuoles after conversion to scopolin in 2,4-D-treated tobacco cells. Plant Sci. 2000, 151 (2): 153-161. 10.1016/S0168-9452(99)00212-5.
Tal B, Robeson DJ: The metabolism of sunflower phytoalexins ayapin and scopoletin: plant-fungus interactions. Plant Physiol. 1986, 82 (1): 167-172. 10.1104/pp.82.1.167.
Gnonlonfin GJB, Sanni A, Brimer L: Review Scopoletin - a coumarin phytoalexin with medicinal properties. Crit Rev Plant Sci. 2012, 31: 47-56. 10.1080/07352689.2011.616039.
Vogt T: Phenylpropanoid biosynthesis. Mol Plant. 2010, 3 (1): 2-20. 10.1093/mp/ssp106.
Fraser CM, Chapple C: The phenylpropanoid pathway in Arabidopsis. Arabidopsis Book. 2011, 9: e0152-10.1199/tab.0152.
Schoch G, Goepfert S, Morant M, Hehn A, Meyer D, Ullmann P, Werck-Reichhart D: CYP98A3 from Arabidopsis thaliana is a 3'-hydroxylase of phenolic esters, a missing link in the phenylpropanoid pathway. J Biol Chem. 2001, 276 (39): 36566-36574. 10.1074/jbc.M104047200.
Ehlting J, Buttner D, Wang Q, Douglas CJ, Somssich IE, Kombrink E: Three 4-coumarate:coenzyme A ligases in Arabidopsis thaliana represent two evolutionarily divergent classes in angiosperms. Plant J. 1999, 19 (1): 9-20. 10.1046/j.1365-313X.1999.00491.x.
Hamberger B, Hahlbrock K: The 4-coumarate:CoA ligase gene family in Arabidopsis thaliana comprises one rare, sinapate-activating and three commonly occurring isoenzymes. Proc Natl Acad Sci U S A. 2004, 101 (7): 2209-2214. 10.1073/pnas.0307307101.
Hoffmann L, Maury S, Martz F, Geoffroy P, Legrand M: Purification, cloning, and properties of an acyltransferase controlling shikimate and quinate ester intermediates in phenylpropanoid metabolism. J Biol Chem. 2003, 278 (1): 95-103. 10.1074/jbc.M209362200.
Hoffmann L, Besseau S, Geoffroy P, Ritzenthaler C, Meyer D, Lapierre C, Pollet B, Legrand M: Silencing of hydroxycinnamoyl-coenzyme A shikimate/quinate hydroxycinnamoyltransferase affects phenylpropanoid biosynthesis. Plant Cell. 2004, 16 (6): 1446-1465. 10.1105/tpc.020297.
Kuhnl T, Koch U, Heller W, Wellmann E: Chlorogenic acid biosynthesis: characterization of a light-induced microsomal 5-O-(4-coumaroyl)-D-quinate/shikimate 3'-hydroxylase from carrot (Daucus carota L.) cell suspension cultures. Arch Biochem Biophys. 1987, 258 (1): 226-232. 10.1016/0003-9861(87)90339-0.
Goujon T, Sibout R, Pollet B, Maba B, Nussaume L, Bechtold N, Lu F, Ralph J, Mila I, Barriere Y, Lapierre C, Jouanin L: A new Arabidopsis thaliana mutant deficient in the expression of O-methyltransferase impacts lignins and sinapoyl esters. Plant Mol Biol. 2003, 51 (6): 973-989. 10.1023/A:1023022825098.
Wils CR, Brandt W, Manke K, Vogt T: A single amino acid determines position specificity of an Arabidopsis thaliana CCoAOMT-like O-methyltransferase. FEBS Lett. 2013, 587 (6): 683-689. 10.1016/j.febslet.2013.01.040.
Grienenberger E, Besseau S, Geoffroy P, Debayle D, Heintz D, Lapierre C, Pollet B, Heitz T, Legrand M: A BAHD acyltransferase is expressed in the tapetum of Arabidopsis anthers and is involved in the synthesis of hydroxycinnamoyl spermidines. Plant J. 2009, 58 (2): 246-259. 10.1111/j.1365-313X.2008.03773.x.
Hino F, Okazaki M, Miura Y: Effect of 2,4-dichlorophenoxyacetic Acid on glucosylation of scopoletin to scopolin in tobacco tissue culture. Plant Physiol. 1982, 69 (4): 810-813. 10.1104/pp.69.4.810.
Bourgaud F, Hehn A, Larbat R, Doerper S, Gontier E, Kellner S, Matern U: Biosynthesis of coumarins in plants: a major pathway still to be unravelled for cytochrome P450 enzymes. Phytochem Rev. 2006, 5: 293-308. 10.1007/s11101-006-9040-2.
Winter D, Vinegar B, Nahal H, Ammar R, Wilson GV, Provart NJ: An "Electronic Fluorescent PictograpH browser for exploring and analyzing large-scale biological data sets. PLoS One. 2007, 2: e718-10.1371/journal.pone.0000718.
Fernie AR, Klee HJ: The use of natural genetic diversity in the understanding of metabolic organization and regulation. Front Plant Sci. 2011, 2: 59-
Lisec J, Steinfath M, Meyer RC, Selbig J, Melchinger AE, Willmitzer L, Altmann T: Identification of heterotic metabolite QTL in Arabidopsis thaliana RIL and IL populations. Plant J. 2009, 59 (5): 777-788. 10.1111/j.1365-313X.2009.03910.x.
Grillo MA, Li C, Hammond M, Wang L, Schemske DW: Genetic architecture of flowering time differentiation between locally adapted populations of Arabidopsis thaliana . New Phytol. 2013, 197 (4): 1321-1331. 10.1111/nph.12109.
Balasubramanian S, Schwartz C, Singh A, Warthmann N, Kim MC, Maloof JN, Loudet O, Trainer GT, Dabi T, Borevitz JO, Chory J, Weigel D: QTL mapping in new Arabidopsis thaliana advanced intercross-recombinant inbred lines. PLoS One. 2009, 4 (2): e4318-10.1371/journal.pone.0004318.
Price AH: Believe it or not, QTLs are accurate!. Trends Plant Sci. 2006, 11 (5): 213-216. 10.1016/j.tplants.2006.03.006.
Wayne ML, McIntyre LM: Combining mapping and arraying: An approach to candidate gene identification. Proc Natl Acad Sci U S A. 2002, 99 (23): 14903-14906. 10.1073/pnas.222549199.
Werner JD, Borevitz JO, Warthmann N, Trainer GT, Ecker JR, Chory J, Weigel D: Quantitative trait locus mapping and DNA array hybridization identify an FLM deletion as a cause for natural flowering-time variation. Proc Natl Acad Sci U S A. 2005, 102 (7): 2460-2465. 10.1073/pnas.0409474102.
Vigani G, Morandini P, Murgia I: Searching iron sensors in plants by exploring the link among 2'-OG-dependent dioxygenases, the iron deficiency response and metabolic adjustments occurring under iron deficiency. Front Plant Sci. 2013, 4: 169-
Fourcroy P, Siso-Terraza P, Sudre D, Saviron M, Reyt G, Gaymard F, Abadia A, Abadia J, varez-Fernandez A, Briat JF: Involvement of the ABCG37 transporter in secretion of scopoletin and derivatives by Arabidopsis roots in response to iron deficiency. New Phytol. 2014, 201 (1): 155-167. 10.1111/nph.12471.
Schmid NB, Giehl RF, Doll S, Mock HP, Strehmel N, Scheel D, Kong X, Hider RC, von Wiren N: Feruloyl-CoA 6'-hydroxylase1-dependent coumarins mediate iron acquisition from alkaline substrates in Arabidopsis. Plant Physiol. 2014, 164 (1): 160-172. 10.1104/pp.113.228544.
Micallef SA, Shiaris MP, Colon-Carmona A: Influence of Arabidopsis thaliana accessions on rhizobacterial communities and natural variation in root exudates. J Exp Bot. 2009, 60 (6): 1729-1742. 10.1093/jxb/erp053.
Nguyen C, Bouque V, Bourgaud F, Guckert A: Quantification of Daidzein and Furanocoumarin Conjugates of Psoralea cinerea L. (Leguminosae). Phytochem Anal. 1997, 8: 27-31. 10.1002/(SICI)1099-1565(199701)8:1<27::AID-PCA331>3.0.CO;2-A.
Arends D, Prins P, Jansen RC, Broman KW: R/qtl: high-throughput multiple QTL mapping. Bioinformatics. 2010, 26 (23): 2990-2992. 10.1093/bioinformatics/btq565.
Broman KW, Wu H, Sen S, Churchill GA: R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003, 19 (7): 889-890. 10.1093/bioinformatics/btg112.
Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680. 10.1093/nar/22.22.4673.
This research was supported by the National Science Centre (6815/B/P01/2011/40), the Foundation for Polish Science (HOMING Programme) and the LiSMIDoS PhD fellowship (UDA-POKL.04.01.01-00-017/1000). Open access publication cost supported from the project MOBI4Health that has received funding from the European Union s Seventh Framework Programme for research, technological development and demonstration under grant agreement no 316094. We thank Maarten Koornneef from the Max Planck Institute for Plant Breeding Research in Cologne for providing all Arabidopsis seeds used in this study and for critical reading of the manuscript.
The authors declare that they have no competing interests.
Electronic supplementary material
Additional file 1: Figure S1.: The position of known loci involved in scopolin and scopoletin biosynthesis. (PDF 161 KB)
Additional file 2: Figure S2.: Multiple Sequence Alignment of coding sequences of AtCYP81D11 gene produced by CLUSTALW. (PDF 73 KB)
Additional file 3: Figure S3.: Multiple Sequence Alignment of coding sequences of AtUDP-glycosyltransferase gene produced by CLUSTALW. (PDF 67 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Siwinska, J., Kadzinski, L., Banasiuk, R. et al. Identification of QTLs affecting scopolin and scopoletin biosynthesis in Arabidopsis thaliana. BMC Plant Biol 14, 280 (2014). https://doi.org/10.1186/s12870-014-0280-9