Genetic variability and evolutionary diversification of membrane ABC transporters in plants
© Andolfo et al.; licensee BioMed Central. 2015
Received: 6 August 2014
Accepted: 6 November 2014
Published: 13 February 2015
ATP-binding cassette proteins have been recognized as playing a crucial role in the regulation of growth and resistance processes in all kingdoms of life. They have been deeply studied in vertebrates because of their role in drug resistance, but much less is known about ABC superfamily functions in plants.
Recently released plant genome sequences allowed us to identify 803 ABC transporters in four vascular plants (Oryza. sativa, Solanum lycopersicum, Solanum tuberosum and Vitis vinifera) and 76 transporters in the green alga Volvox carteri, by comparing them with those reannotated in Arabidopsis thaliana and the yeast Saccharomyces cerevisiae. Retrieved proteins have been phylogenetically analysed to infer orthologous relationships. Most orthologous relationships in the A, D, E and F subfamilies were found, and interesting expansions within the ABCG subfamily were observed and discussed. A high level of purifying selection is acting in the five ABC subfamilies A, B, C, D and E. However, evolutionary rates of recent duplicate genes could influence vascular plant genome diversification. The transcription profiles of ABC genes within tomato organs revealed a broad functional role for some transporters and a more specific activity for others, suggesting the presence of key ABC regulators in tomato.
The findings achieved in this work could contribute to address several biological questions concerning the evolution of the relationship between genomes of different species. Plant ABC protein inventories obtained could be a valuable tool both for basic and applied studies. Indeed, interpolation of the putative role of gene functions can accelerate the discovering of new ABC superfamily members.
KeywordsATP-binding cassette transporters Multidrug resistance Arabidopsis thaliana Oryza sativa Solanum lycopersicum Solanum tuberosum Vitis vinifera Volvox carteri Saccharomyces cerevisiae Gene duplication Evolutionary dynamics
Life is not possible without the exchange of substances and information between cells, therefore macro- and micro-organisms have developed efficient transport systems to control the molecular interaction processes within the colonized environment. Transportation of many molecules through the cell semipermeable membrane against a concentration gradient requires the use of energy that can be provided, for instance, by ATP hydrolysis. Plants, due to their sessile status, have evolved a very complex movement system for molecules, in which proteins belonging to the ABC superfamily play a major role .
ATP-binding cassette (ABC) superfamily represents one of the largest protein group within the kingdoms of archaea, eubacteria and eukarya. Proteins belonging to this superfamily are ATP powered transporters able to translocate substrates across cellular membranes. The transported molecules, even if secreted from the same ABC protein, may be extremely different, both in terms of chemistry and structure .
The canonical architecture of ABC transporters comprises two transmembrane domains (TMDs) and two cytosolic nucleotide-binding domains (NBDs), also known as ATP-binding cassettes. The structural organisation of the four domains is a dimer of dimers, which can deploy as single polypeptides, or as multisubunit oligomers, reflecting ancient gene duplication events and fusions of the cytosolic catalytic with the membrane-spanning domains . Usually intracytosolic loops are present as extensions of TMDs and function as the interface between the NBDs and TMDs . The NBD contains several highly conserved motifs, including the Walker A and B sequences, the ABC signature motif, the H loop and the Q loop .
The ABC signature (alias C motif or LSGGQ motif, ((LIVMFY)S(SG)GX 3(RKA)(LIVMYA)X(LIVFM)(AG)) is situated between the two Walker boxes , and is the hallmark that distinguish ABC transporters from other ATP binding proteins.
Due to the complexity and dimension of ABC protein superfamily, a precise classification of all the subfamilies is necessary. The proposed categorisations are various. The transporter classification (TC) system, based on incorporation of both functional and phylogenetic information, includes 53 subfamilies of ABC exporters and 34 of ABC importers (http://www.tcdb.org). Based on sequence comparison, [6,7] three classes of ABC systems, that were probably present in the last common ancestor of archaea, bacteria and eukarya, have been proposed: class 1 comprises transporters with fused TMDs and NBDs (exporters); class 2 includes non-transporter ABCs lacking TMDs; class 3 (which is absent in eukaryotes) includes mainly transporters with NBDs and TMDs formed by separate polypeptide chains (canonical importers), and some bacterial exporters.
Plant genomes encode for a high number of ABC proteins with more than 120 found in both Arabidopsis thaliana and Oryza sativa [8,9]. The currently used plant ABC protein classification systems are mainly based on phylogenetic information, domain arrangement or similarities/structure comparison with human and microbial prototypes (eg: Pleiotropic Drug Resistance PDR). Sanchez-Fernandez et al.  combined information from different classification systems and identified 13 plant subfamilies, including also membrane-bound ABCs that consist only of those containing soluble NBD domains. In order to unify plant and animal ABC naming systems, the Human Genome Organization (HUGO) proposed a new subfamily designation for vertebrate and invertebrate ABC communities, which is now widely used [11,12]. This system originally comprised seven ABC subfamilies (A–G) based on sequence homology, phylogenetic relationships and domain organization. Subsequently, following a more recent inventory of Drosophila and fish ABC proteins, an additional subfamily (H) (not containing members from plants) has been defined. For plants, a further subfamily (I) has been created to incorporate ‘prokaryotic’- type ABCs that are not present in many animal genomes . Subfamilies ABCA, ABCB, ABCC and ABCD contain “forward orientation” TMD-NBD transporters. Subfamilies ABCG and ABCH, instead, are characterized by a “reverse organization” domain NBD-TMD. Subfamilies E and F show only two domains NBD and thus they are labelled as “soluble”. These proteins are not transporters but their NBDs clearly cluster with those of other ABC proteins.
Plant genome sequences availability is growing fast, resulting in an almost completely unexplored repository. The aim of the present work was to list, compare and phylogenetically classify the ABC proteins of selected vascular plant genomes (A. thaliana, O. sativa, Solanum lycopersicum, Solanum tuberosum, Vitis vinifera), the green alga Volvox carteri and the yeast Saccharomyces cerevisiae, in order to facilitate future studies on ABC genes and proteins. Identification and classification were based on the work by Verrier and collaborators . Further, a selection pressure study and a customized gene duplication analysis, to identify recent duplication events in vascular plant genomes, have been accomplished. Finally, an expression profile overview of ABC superfamily to detect tissue-specific transporter activation in tomato has been performed as a proof of concept and reported.
Results and Discussion
Identification and characterization of putative ABC proteins
A BLASTp search in O. sativa, S. lycopersicum, S. tuberosum, V. Vinifera and the green alga V. carteri proteomes with Arabidopsis ABC protein dataset allowed us to discover a number of potential ABC sequences. The domain composition of proteins was assessed through a domain detection analysis. A total of 995 proteins (Additional file 1: Table S1) containing one or more ATP-binding cassette domains were identified. Our analysis enlarged number of ABC proteins identified in rice , confirming data on ABCG family refined by Matsuda et al. , added 32 novel ABC transporters in V. vinifera,  and provide manual curated ABC protein catalogues for S. lycopersicum, S. tuberosum. Most of these proteins belong to known plant subfamilies (A-I, except H) (Additional file 2: Table S2 and Additional file 3). Proteins containing a single domain or novel associations were also recorded. The subfamily A is well conserved among plant species (varying from 6 to 13 members), but it is absent in S. cerevisiae. Probably this protein group associated with perturbed cellular lipid transport [8,16], originated after the division of ascomycota and chlorophyta. Subfamily G showed the highest number of members in all the species tested. Subfamilies B and C presented a number of proteins rather stable (ranging from 19 to 38) in the analysed genomes except for V. carteri and S. cerevisiae. Interestingly, ABC proteins of D, E and F subfamilies represent about 6% of the ABC transporters in O. sativa, S. lycopersicum, S. tuberosum, V. Vinifera and A. thaliana but about 25% in V. carteri and S. cerevisiae. The ABCG proteins were found to represent an average of about 40% (ranging from 28% in green alga to 55% in potato) of all annotated transporters in each analysed species. Subfamily G was particularly represented in rice and potato, in which 137 and 93 proteins were annotated, respectively.
Selection pressure acting on plant ABC families
Estimation of non synonymous and synonymous substitutions mean dissimilarity for each sub-family (δ = d N -d S and ω = d N /d S )
n. ABC sequences
Phylogenetic reconstruction of ABC transporters evolutionary dynamics
The ABCA subfamily
ABCA phylogenetic tree (52 protein sequences) shows the presence of three ancestral V. carteri sequences, separated from the rest of proteins, and two clades with four clusters with a good bootstrap value (>90) (Figure 2). Clade 1 comprises members belonging to all the species analysed. It seems that the proteins belonging to cluster 1 are well conserved in all species, even if a swift diversification between Arabidopsis and Solanum spp. was found. Clade1-cluster 2 shows that a gene expansion occurred only in eudicot genomes. The transmembrane region of four proteins belonging to this cluster (Solyc04g015970.2.1-AT2G41700.1), reveals a string of about 26 amino acids with an alignment identity >70%. Members belonging to this subfamily are involved in cellular lipid transport , and a role of full-length ABCA transporter AT2G41700.1, named AtAOH by Sanchez-Fernandez , in sterol metabolism has been demonstrated. A similar function for other proteins included in this sub-cluster could be hypothesized. Clade 2 includes 2 ABC transporters annotated in V. carteri and 27 in vascular plant proteins, which are subjected to a high degree of differentiation in all angiosperm species with the exception of V. vinifera. It is interesting to note that clade 2-cluster 4 contains only 8 Arabidopsis proteins with an alignment identity of about 70%, of which six are on chromosome 3, while clade2-cluster 3 contains 6 Solanum ABCA proteins with an alignment identity of 90%. A potential translocation of ABC transporters between chromosomes 3 and 11 of potato (Sotub11g021420.1.1-Sotub11g021450.1.1 and Sotub03g024890.1.1-Sotub03g024920.1.1) may have occurred since the two chromosome segments show similar gene arrangements (data not shown).
The ABCB subfamily
The ABCB evolutionary history (Additional file 4: Figure S1) was inferred by analysing 169 proteins. Several sub-groups were identified in this subfamily, suggesting a large diversification among the analysed species. The phylogenetic tree displayed 13 main clades, supported from high internal branch bootstrap indexes. Subsequently, modifications of original sequence arrangement produced few sequences in yeast and algae, and a huge number in vascular plants. Phylogenetic analysis suggests that sub-groups evolved differently in each species. However, orthologous of six Arabidopsis proteins present in clade 1 were identified both in monocot and dicot species. Proteins belonging to this subfamily seem to be involved in auxin influx transport in roots, and contribute to the basipetal transport in hypocotyls and root tips by establishing an auxin uptake sink in the root cap. Moreover, they confer sensitivity to 1-N-naphthylphthalamic acid (NPA), regulate root elongation, initiation of lateral roots and development of root hairs, transport IAA, indole-3-propionic acid, NPA syringic acid, vanillic acid and some auxin metabolites, but not 2,4-D and 1-naphthaleneacetic acid [27,28]. In particular, AT2G36910.1 and AT3G28860.1 are involved in auxin transport in stems and root, respectively [29,30]. It is possible to predict similar functions for orthologous proteins and gain insight in species not yet characterized by looking at specific clade arrangements. For instance, clade 8 embraces ten transporters afferent to all the species analysed. S. cerevisiae (YMR301C) and Arabidopsis (AT4G28620.1, AT4G28630.1 and AT5G58270.1) were found to be involved in iron homeostasis  suggesting that this function is well conserved among species. Proteins belonging to clade 11 present a well conserved string of 42 amino acids (alignment identity of 97%) following the Pfam domains (PF00005) (Additional file 6: Figure S2). Interestingly, a member of this group, At5g39040.1 has been reported as involved in aluminium resistance . Finally, in clade 13 (bootstrap index 70%), which groups 40 ABC proteins, three large expansions were observed in tomato (9 ABCBs), potato (9ABCBs) and rice (10 ABCBs). A perfect conservation of orthologous pairs between tomato and potato on chromosomes 11, 6, 12, 3 and 2 [29,33] has been detected.
The ABCC subfamily
ABCC (Additional file 7: Figure S3) is a large subfamily of “full-size”, “forward-orientation” proteins. The phylogenetic tree obtained by comparing 109 proteins displays that three S. cerevisiae and six V. carteri proteins, and one protein from O. sativa (Os04g33700.1) cluster separately. Two distinctive angiosperm clades can be evidenced (bootstrap index >75). Proteins belonging to this subfamily have been found to be involved in cellular processes such as vacuolar transport, detoxification and regulation of guard cell plasma membrane ion channels. Clade 1 encodes 12 proteins, of which four annotated in Arabidopsis (AT1G30400.1, AT1G30410.1, AT1G30420.1 and AT2G34660.1) are involved in detoxification, vacuolar transport of abscisic acid and glucosyl ester, organic anion transport, chlorophyll degradation and modulation of seed phytate content [29,34,35]. A unique orthologous in potato and rice, two members in tomato and four members in grape that are putatively involved in fruit maturation process have been found [36,37]. In clade 2 we underlined four remarkable clusters. In particular, cluster 1 groups orthologous genes of AT2G07680.1 involved in vacuole traffic  and cluster 2 contains nine proteins with an identity of 60% to AT3G62700.1, involved in vacuolar transport of abscisic acid glucosylester [39,33]. In cluster 4 there is an Arabidopsis ABCC protein (AT1G04120.1), involved in the regulation of anion and calcium channel activities , which presents a high sequence similarity (alignment identity of 77%) with other four eudicot proteins. Cluster 4 also contains angiosperm proteins with an average identity of about 75%.
The ABCD subfamily
The ABCD phylogeny tree obtained with proteins belonging to all considered phyla (Figure 3), revealed the evolutionary history of this subfamily. S. cerevisiae YKL188w peptide is separated from the two main clades and could be designated as the ancestral protein. ABC transporters belonging to this subfamily have been found to play a role in the peroxisome transport [40,41]. Clade 1 includes full-size” proteins (average identity 76%) with “forward orientation”. In this clade is present the Arabidopsis protein AT4G39850.1 involved in a wide range of substrates for peroxisome uptake [42-44]. A similar function could be hypothesized for the homologous peptides (Os01g73530.1, Os05g01700.1, Solyc04g055120.2.1, Sotub04g020700.1.1) detected in tomato, potato and rice. Clade 2 includes “half-size” proteins. The transmembrane domain (400 amino acids) is very well conserved (84% average identity) among proteins belonging to this clade as well as the NBD 1 domain. Interestingly, the two Solanum (Solyc12g017420.1.1 and Sotub12g013980.1.1) proteins and the three AT1G54350.1, GSVIVT01036685001 and Os01g11946.1 transporters show a higher identity with NBD motif of green alga Vocar20007372m (about 60% of identity) and Vocar20009192m (about 70%of identity), respectively (Figure 3B).
The ABCE subfamily
ABCE subfamily (Figure 4), with only ten proteins detected, was found to be the smallest among the subfamilies analysed in this work. The structure of the phylogenetic tree was extremely useful in tracking the evolution of these “trasporters”, also known as RNase L inhibitors (RLI)  (Figure 4A). The two ancestral S. cerevisiae (YDR091C) and V. carteri, (Vocar20004039m) proteins were more similar to A. thaliana (AT3G13640.1) and O. sativa, (Os02g18180.1) proteins. Only for Solanum spp proteins, a small expansion was observed (clade 1 of Figure 4). In this group there was a V. vinifera protein (GSVIVT01036876001) that clustered with the Arabidopsis protein AT4G19210.1 which contains N-terminal “ferrodoxin” (4Fe4S-type) motifs and interacts with nucleic acids [11,46,47].
The ABCF subfamily
The phylogenetic tree of ABCF subfamily, obtained by comparing 46 proteins, reveals five clades (Figure 5). Proteins belonging to this subfamily have been found to be involved in stress-associated control  and seem to have an ancestral origin since they are highly represented both in V. carteri and S. cerevisiae (Additional file 1: Table S1). S. cerevisiae proteins are included in all clusters except for clades 1 and 5. These two clades show an alignment identity of 61%and 66% respectively and include two V. carteri (Vocar20008959m and Vocar20013543m) proteins. Interestingly, Arabidopsis proteins present in clades 1 and 5 (AT3G54540.1 and AT5G64840.1) have been found to be involved in root growth and development  and a similar role could be predicted for proteins belonging to such clade. Clade 2, with an alignment identity greater than 75%, includes highly conserved proteins in all species analysed. Clade 4 comprises six proteins belonging to S. cerevisiae and V. carteri, with a low alignment identity (42%). Interestingly, three of these proteins (YPL226W, Vocar20002122, Vocar20002123), one in yeast and two in algae, have an additional chromo-domain (IPR023780).
The ABCG subfamily
ABCG, the largest plant ABC transporter subfamily, includes two groups according to Sanchez-Fernandez nomenclature: WBCs and PDRs. Members of the ABCGWBC consist of approximately 600–750 amino acid residues  and can be involved in the cuticular lipids extrusion [49,50]. ABCG full-size proteins (ABCGPDR) have a NBD domain characterized by four “plant PDR signatures” . Many proteins belonging to this subfamily have been found to be involved in resistance to pathogens, antimicrobial terpenoids and auxinic herbicides, and contribute to the transport of signalling molecules or secretion of volatile compounds [51-53].
The ABCGWBC group
The ABCGWBC evolutionary analysis, obtained by comparing 219 proteins, shows a high diversification (Additional file 5: Figure S4). Eleven clades that encompass a number of proteins varying from 31 (clade 7) to 4 (clades 5, 7 and 10) have been produced. Clades 1, 2 and 9 encompass 33% of the sequences analysed. A putative progenitor of clades 1 and 2 could be the S. cerevisiae protein YCR011C. In the clade 1 is present AT3G55130.1, which appears to be involved in kanamycin resistance when overexpressed in transgenic plants . In clade 2 we found sequences that are highly conserved in eudicot genomes. Clade 3 and 4 encode transporters that could be involved in lipid/sterol homeostasis regulation required for proper vascular development, likewise the AT1G31770.1 and AT4G27420.1 proteins [50,51,55]. Clade 5 groups three sequences similar to AT3G13220.1 (76% identity), which has been found involved in abscisic acid transport . Clade 7 includes a S. cerevisiae sequence (YOL75C) that can be ancestral to the diversification that occurs from clade 7 to 11. Clade 8 includes only 8 rice transporters. Clade 9 includes 21 transporters similar to AT1G17840.1 and AT1G51500.1, which are required for export of wax components such as alkanes .
The ABCGPDR group
Within the cluster 1, 16 Solanum proteins are grouped, with a pairwise identity of 82%, located on chromosomes 5, 9 and 12 in tomato and potato genomes. ABCGPDR sequences of this cluster show a high similarity with an ATP-binding cassette transporter (AT1G15520.1) of A. thaliana PDR12 (AtPDR12)/ABCG40 known to be involved in pleiotropic drug resistance and abscisic acid (ABA) uptake transport [58,59]. Clusters 2 and 3 are specific for V. vinifera while clusters 5 and 6 include only O. sativa sequences, suggesting a recent expansion of ABCGPDR in grape and rice. In cluster 7 is present the A. thalianaAT2G26910.1 gene, that is involved in cutin formation . The five dicot and monocot orthologous belong to this cluster, could be predicted to be putatively involved in cutin formation as well.
Cluster 8 includes eight V. vinifera proteins encoded from genes located on chromosomes 13, 8 and 6 and eleven Solanum proteins encoded from genes located on chromosomes 5 and 11, which show a high identity with the A. thaliana AT1G66950.1 and AT2G36380.1 genes, known to be highly expressed in the root cells for the secretion of several secondary metabolites . Cluster 9 encompasses an Arabidopsis PDR protein (AtABCG36/AT1G59870.1), which seems to be involved into susceptibility/resistance to the barley powdery mildew pathogen . Eight proteins annotated in tomato, potato, grape and rice with a pairwise identity of 67% also belong to this cluster. Thirteen and six transporters were grouped in clusters 10 and 11, respectively. In cluster 10, the six A. thaliana proteins, including AT4G15230 (AtABCG30), a protein involved in root exudation of phytochemicals  show an average identity > 60% with the other proteins of this cluster. In cluster 11, a very strong homology (about 75%) among five Solanum and V. vinifera proteins with A. thaliana AT2G29940.1 protein involved in the modulation of stomata activity  was detected.
Genomic distribution and recent gene duplication events
Identification of recent ABC gene duplication events in all genomes examined
a Blocks of duplication
Total number of duplicate ABCs
Classification of recent ABC transporter gene duplication events
n. ABC genes
More than 90% of gene duplications found in this study concern B, C and G sub-families. Instead, transporters belonging to subfamilies D and E showed to be highly conserved (Table 3). In particular 110 genes, out of 465 annotated in subfamily G, are involved in duplication events (Table 3), indicating a considerable implication of this ABC subfamily expansion in all the vascular plants analysed here.
Tomato expression profile of ABC transporter families
List of ABC transporters expressed in five tissues (bud, flower, leaf, root and fruit) of S. lycopersicum Heinz 1706 subdivided for subfamilies
n. ABCs expressed in bud
n. ABCs expressed in flower
n. ABCs expressed in leaf
n. ABCs expressed in root
n. ABCs expressed in fruit
ABC proteins are firmly established as key players of cellular processes involved in auxin transport, lipid catabolism, xenobiotic detoxification, disease resistance and stomatal function. In this study, 803 ABC transporters were identified by in silico analysis of four plant species (O. sativa, S. lycopersicum, S. tuberosum, V. vinifera) and 76 transporters in the green alga V. carteri, by comparing them with those reannotated in Arabidopsis (A. thaliana) and the yeast S. cerevisiae. The characterization of ABC proteins based on domain annotation allowed the discovering of new subfamily members. Moreover, we ascertained that ABCG represents the largest group of ABC proteins in all plant species analysed. Phylogenetic analysis allowed us to trace the evolutionary history of plant ABCs, evidencing eukarya diversification. It is well known that a large genome datasets accelerate gene discovery in plants. By analysing the expression data of all tomato ABCs identified in this study, we were able to provide an indication of the putative role of these genes. The results from this work offer useful inputs that may help, for instance, to discover ABC genes with broader or more specific roles, and help to address several biological questions concerning the evolution of the relationships between genomes of different species.
Genomes search for ABC transporters identification
Oryza sativa, Vitis vinifera and Volvox carteri genome data were downloaded from website Phytozome portal . Arabidopsis thaliana data were obtained from TAIR database  resource. Tomato and potato sequences were provided by the Tomato Genome Sequencing Consortium . Saccharomyces cerevisiae strain S288C data were taken from Saccharomyces Genome Database . A BLASTp analysis (e-value < 1e-6) to identify potential ABC transporters in different species were performed  (using the entire proteome of each analysed species, starting from 132 ABC protein sequences annotated in A. thaliana previously described [8,10].
Functional prediction of ABC transporters
The set of proteins identified via BLASTp search was further scrutinized using InterProScan software to verify the presence of conserved domains and motifs characteristic of ABC proteins (NBD-TMD). The presence of conserved domains and motifs characteristic of ABC subfamilies (NBD-TMD) allowed us to sort ABC proteins into eight major plant subfamilies (A–I, except subfamily H, which hasn’t members in plants). In this analysis, recovered sequences were compared with the following databases: HMMPanther (Hidden Markov model Panther) to find the characteristic domains for ABC subfamilies, HMMTigr (Hidden Markov model Tigr), patternScan, FPrintScan, HMMPIR, ProfileScan, HAMAP (High-quality Automated and Manual Annotation of Microbial Proteomes), SignalPHMM PROSITE to identify ABC transporters conserved sequences, SuperFamily PRINTS (Fingerprint database), HMMPfam (Protein family) to find “ABC domains”, BlastProDom (Blast protein domain database), and HMMSMART protein motif analyses (Simple Modular Architecture Research Tool,  to find ATPase domains. The TMHMM database was also accessed to verify the presence of transmembrane-regions.
Evolutionary analyses of all subfamilies, except for I (Dataset B), were conducted using MEGA5 . The protein sequences were aligned using ClustalW default parameters (v. 1.74) . The phylogenetic relationships were inferred separately for each ABC subfamily using the Maximum Likelihood method. The best phylogenetic method and evolutionary model was determined among candidate models of protein evolution. Models with the lowest BIC scores (Bayesian Information Criterion) are considered to describe the better substitution pattern. For each model, AICc value (Akaike Information Criterion, corrected), Maximum Likelihood value (lnL), and the number of parameters (including branch lengths) are also presented . The bootstrap consensus tree inferred from 100 replicates was taken to represent the evolutionary history of the sequences analysed . The trees were drawn to scale, with branch lengths measured in terms of number of substitutions per site. We have considered significant clades those that have a bootstrap value not less than ≥ 70, containing at least 4 ABC transporter sequences. ABC protein subgroups described in more detail were labelled as “clusters”.
Recent duplication events of ABC transporter genes
To identify duplicated ABC transporter pairs, we run a phylogenetic analysis using ABC nucleotide sequences of Dataset B, using Maximum Likelihood method and General Time Reversible model.
We defined a gene duplication according to the following criteria: (1) the clade bootstrap index >80, (2) the alignable nucleotide sequence identity ≥70% (3) putative recent duplications were also filtered for physical chromosome co-localization and (4) only one event of duplication is counted for tightly linked genes.
Evolution rates at codon sites
Selective pressure acting on the ABC-subfamilies were investigated by determining the nonsynonymous to synonymous nucleotide substitution (dN-dS) indicated as δ. Tests were conducted to estimate the evolution of each codon: positive (dN > dS); neutral (dN = dS); and negative (dN < dS). The variance of the difference was computed using the bootstrap method (1000 replicates). Analyses were conducted using the Nei-Gojobori method . All positions with less than 80% site coverage were eliminated. All the ABC coding DNA sequences were aligned using ClustalW 1.74 . Evolutionary analyses were conducted in MEGA5 . To clearly depict the proportion of sites under selection, an evolutionary fingerprint analysis was carried out using the SLAC algorithm implemented in the Datamonkey server .
Expression data visualization
The expression data of tomato ABC transporters extracted from dataset of the Tomato Genome Consortium  were processed as reads per kilobase of the exon model per million mapped reads (RPKM), and subsequently normalized with TPM and visualized with R software .
Availability of supporting data
The data sets supporting the results of this article can be found as Additional files.
We sincerely acknowledge Dr. Roberta Marotta for annotation support.
This work was supported by the Ministry of University and Research (GenHORTH project).
- George AM, Jones PM. Perspectives on the structure-function of ABC transporters: the switch and constant contact models. Prog Biophys Mol Bio. 2012;109(3):95–107.View ArticleGoogle Scholar
- Gottesman MM, Paterson JK, Chen KG, Annereau JP, Szakacs G. New ABC transporters associated with multidrug resistance in cancer. Febs J. 2005;272:206.Google Scholar
- Dawson RJP, Hollenstein K, Locher KP. Uptake or extrusion: crystal structures of full ABC transporters suggest a common mechanism. Mol Microbiol. 2007;65(2):250–7.View ArticlePubMedGoogle Scholar
- Higgins CF, Linton KJ. The ATP switch model for ABC transporters. Nat Struct Mol Biol. 2004;11(10):918–26.View ArticlePubMedGoogle Scholar
- Loo TW, Bartlett MC, Clarke DM. The “LSGGQ” motif in each nucleotide-binding domain of human P-glycoprotein is adjacent to the opposing Walker A sequence. J Biol Chem. 2002;277(44):41303–6.View ArticlePubMedGoogle Scholar
- Davidson AL, Dassa E, Orelle C, Chen J. Structure, function, and evolution of bacterial ATP-binding cassette systems. Microbiol Mol Biol R. 2008;72(2):317–64.View ArticleGoogle Scholar
- Licht A, Schneider E. ATP binding cassette systems: structures, mechanisms, and functions. Cent Eur J Biol. 2011;6(5):785–801.View ArticleGoogle Scholar
- Rea PA. Plant ATP-binding cassette transporters. Annu Rev Plant Biol. 2007;58:347–75.View ArticlePubMedGoogle Scholar
- Verrier PJ, Bird D, Buria B, Dassa E, Forestier C, Geisler M, et al. Plant ABC proteins - a unified nomenclature and updated inventory. Trends Plant Sci. 2008;13(4):151–9.View ArticlePubMedGoogle Scholar
- Sanchez-Fernandez R, Davies TGE, Coleman JOD, Rea PA. The Arabidopsis thaliana ABC protein superfamily, a complete inventory. J Biol Chem. 2001;276(32):30231–44.View ArticlePubMedGoogle Scholar
- Dean M, Allikmets R. Complete characterization of the human ABC gene family. J Bioenerg Biomembr. 2001;33(6):475–9.View ArticlePubMedGoogle Scholar
- Dean M. Genetics of ATP-binding cassette transporters. Method Enzymol. 2005;400:409–29.View ArticleGoogle Scholar
- Jasinski M, Ducos E, Martinoia E, Boutry M. The ATP-binding cassette transporters: structure, function, and gene family comparison between. Plant Physiol. 2003;131:1169–77.View ArticlePubMed CentralPubMedGoogle Scholar
- Matsuda S, Funabiki A, Furukawa K, Komori N, Koike M, Tokuji Y, et al. Genome-wide analysis and expression profiling of half-size ABC protein subgroup G in rice in response to abiotic stress and phytohormone treatments. Mol Genet Genomics. 2012;287:819–35.View ArticlePubMedGoogle Scholar
- Çakır B, Kılıçkaya O. Whole-genome survey of the putative ATP-binding cassette transporter family genes in Vitis vinifera. PLoS One. 2013;8:e78860.View ArticlePubMed CentralPubMedGoogle Scholar
- Kaminski WE, Piehler A, Wenzel JJ. ABC A-subfamily transporters: structure, function and disease. Bba-Mol Basis Dis. 2006;1762(5):510–24.View ArticleGoogle Scholar
- Kato T, Tabata S, Sato S. Analyses of expression and phenotypes of knockout lines for Arabidopsis ABCF subfamily members. Plant Biotechnol-Nar. 2009;26(4):409–14.View ArticleGoogle Scholar
- Sato S, Tabata S, Hirakawa H, Asamizu E, Shirasawa K, Isobe S, et al. The tomato genome sequence provides insights into fleshy fruit evolution. Nature. 2012;485(7400):635–41.View ArticleGoogle Scholar
- Andolfo G, Sanseverino W, Rombauts S, Van de Peer Y, Bradeen JM, Carputo D, et al. Overview of tomato (Solanum lycopersicum) candidate pathogen recognition genes reveals important Solanum R locus dynamics. New Phytol. 2013;197(1):223–37.View ArticlePubMedGoogle Scholar
- Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986;3(5):418–26.PubMedGoogle Scholar
- Pond SLK, Frost SDW. A genetic algorithm approach to detecting lineage-specific variation in selection pressure (vol 22, pg 478, 2005). Mol Biol Evol. 2005;22(4):1157.Google Scholar
- Kingsolver JG, Hoekstra HE, Hoekstra JM, Berrigan D, Vignieri SN, Hill CE, et al. The strength of phenotypic selection in natural populations. Am Nat. 2001;157(3):245–61.View ArticlePubMedGoogle Scholar
- Rocha EPC, Smith JM, Hurst LD, Holden MTG, Cooper JE, Smith NH, et al. Comparisons of dN/dS are time dependent for closely related bacterial genomes. J Theor Biol. 2006;239(2):226–35.View ArticlePubMedGoogle Scholar
- He L, Vasiliou K, Nebert DW. Analysis and update of the human solute carrier (SLC) gene superfamily. Hum Genomics. 2009;3(2):195–206.View ArticlePubMed CentralPubMedGoogle Scholar
- Jin W, Wu DD, Zhang X, Irwin DM, Zhang YP. Positive selection on the gene RNASEL: correlation between patterns of evolution and function. Mol Biol Evol. 2012;29(10):3161–8.View ArticlePubMedGoogle Scholar
- Takanashi K, Sugiyama A, Sato S, Tabata S, Yazaki K. LjABCB1, an ATP-binding cassette protein specifically induced in uninfected cells of Lotus japonicus nodules. J Plant Physiol. 2012;169(3):322–6.View ArticlePubMedGoogle Scholar
- Molesini B, Pandolfini T, Pii Y, Korte A, Spena A. Arabidopsis thaliana AUCSIA-1 regulates Auxin biology and physically interacts with a kinesin-related protein. PLoS One. 2012; 7Google Scholar
- Noh B, Murphy AS, Spalding EP. Multidrug resistance-like genes of Arabidopsis required for auxin transport and auxin-mediated development. Plant Cell. 2001;13(11):2441–54.View ArticlePubMed CentralPubMedGoogle Scholar
- Martinoia E, Klein M, Geisler M, Bovet L, Forestier C, Kolukisaoglu U, et al. Multifunctionality of plant ABC transporters - more than just detoxifiers. Planta. 2002;214(3):345–55.View ArticlePubMedGoogle Scholar
- Geisler M, Murphy AS. The ABC of auxin transport: the role of p-glycoproteins in plant development. Febs Lett. 2006;580(4):1094–102.View ArticlePubMedGoogle Scholar
- Chen S, Sánchez-Fernández R, Lyver ER, Dancis A, Rea PA. Functional characterization of AtATM1, AtATM2, and AtATM3, a subfamily of Arabidopsis half-molecule ATP-binding cassette transporters implicated in iron homeostasis. J Biol Chem. 2007;282:21561–71.View ArticlePubMedGoogle Scholar
- Larsen PB, Cancel J, Rounds M, Ochoa V. Arabidopsis ALS1 encodes a root tip and stele localized half type ABC transporter required for root growth in an aluminum toxic environment. Planta. 2007;225(6):1447–58.View ArticlePubMedGoogle Scholar
- Kolukisaoglu HU, Bovet L, Klein M, Eggmann T, Geisler M, Wanke D, et al. Family business: the multidrug-resistance related protein (MRP) ABC transporter genes in Arabidopsis thaliana. Planta. 2002;216(1):107–19.View ArticlePubMedGoogle Scholar
- Shi Z, Peng XX, Kim IW, Shukla S, Si QS, Robey RW, et al. Erlotinib (Tarceva, OSI-774) antagonizes ATP-bInding cassette subfamily B member 1 and ATP-binding cassette subfamily G member 2-mediated drug resistance. Cancer Res. 2007;67(22):11012–20.View ArticlePubMedGoogle Scholar
- van den Brule S, Smart CC. The plant PDR family of ABC transporters. Planta. 2002;216(1):95–106.View ArticlePubMedGoogle Scholar
- Frelet-Barrand A, Kolukisaoglu HU, Plaza S, Ruffer M, Azevedo L, Hortensteiner S, et al. Comparative mutant analysis of arabidopsis ABCC-type ABC transporters: AtMRP2 contributes to detoxification, vacuolar organic anion transport and chlorophyll degradation. Plant Cell Physiol. 2008;49(4):557–69.View ArticlePubMedGoogle Scholar
- Shiratake K, Martinoia E. Transporters in fruit vacuoles. Plant Biotechnol. 2007;127–33Google Scholar
- Jaquinod M, Villiers F, Kieffer-Jaquinod S, Hugouvieu V, Bruley C, Garin J, et al. A proteomics dissection of Arabidopsis thaliana vacuoles isolated from cell culture. Mol Cell Proteomics. 2007;6(3):394–412.View ArticlePubMed CentralPubMedGoogle Scholar
- Suh SJ, Wang YF, Frelet A, Leonhardt N, Klein M, Forestier C, et al. The ATP binding cassette transporter AtMRP5 modulates anion and calcium channel activities in Arabidopsis guard cells. J Biol Chem. 2007;282(3):1916–24.View ArticlePubMedGoogle Scholar
- Shani N, Valle D. Peroxisomal ABC transporters. In: Abc transporters: biochemical, cellular, and molecular aspects. 292nd ed. 1998. p. 753–76.View ArticleGoogle Scholar
- Footitt S, Slocombe SP, Larner V, Kurup S, Wu YS, Larson T, et al. Control of germination and lipid mobilization by COMATOSE, the Arabidopsis homologue of human ALDP. Embo J. 2002;21(12):2912–22.View ArticlePubMed CentralPubMedGoogle Scholar
- Morita M, Shimozawa N, Kashiwayama Y, Suzuki Y, Imanaka T. ABC subfamily D proteins and very long chain fatty acid metabolism as novel targets in adrenoleukodystrophy. Curr Drug Targets. 2011;12(5):694–706.View ArticlePubMedGoogle Scholar
- Theodoulou FL, Holdsworth M, Baker A. Peroxisomal ABC transporters. Febs Lett. 2006;580(4):1139–55.View ArticlePubMedGoogle Scholar
- Hooks MA, Turner JE, Murphy EC, Johnston KA, Burr S, Jaroslawski S. The Arabidopsis ALDP protein homologue COMATOSE is instrumental in peroxisomal acetate metabolism. Biochem J. 2007;406:399–406.View ArticlePubMed CentralPubMedGoogle Scholar
- Braz ASK, Finnegan J, Waterhouse P, Margis R. A plant orthologue of RNase L inhibitor (RLI) is induced in plants showing RNA interference. J Mol Evol. 2004;59(1):20–30.View ArticlePubMedGoogle Scholar
- Bairoch A. Prosite - a dictionary of sites and patterns in proteins. Nucleic Acids Res. 1992;20:2013–8.View ArticlePubMed CentralPubMedGoogle Scholar
- Sarmiento C, Nigul L, Kazantseva J, Buschmann M, Truve E. AtRLI2 is an endogenous suppressor of RNA silencing. Plant Mol Biol. 2006;61(1–2):153–63.View ArticlePubMedGoogle Scholar
- Zeng W, Brutus A, Kremer JM, Withers JC, Gao X, Da Jones AD, et al. A genetic screen reveals Arabidopsis Stomatal and/or apoplastic defenses against pseudomonas syringae pv. tomato DC3000. PLoS Pathog. 2011; 7.Google Scholar
- Bird D, Beisson F, Brigham A, Shin J, Greer S, Jetter R, et al. Characterization of Arabidopsis ABCG11/WBC11, an ATP binding cassette (ABC) transporter that is required for cuticular lipid secretion. Plant J. 2007;52(3):485–98.View ArticlePubMedGoogle Scholar
- Pighin JA, Zheng HQ, Balakshin LJ, Goodman IP, Western TL, Jetter R, et al. Plant cuticular lipid export requires an ABC transporter. Science. 2004;306(5696):702–4.View ArticlePubMedGoogle Scholar
- Kim DY, Bovet L, Maeshima M, Martinoia E, Lee Y. The ABC transporter AtPDR8 is a cadmium extrusion pump conferring heavy metal resistance. Plant J. 2007;50(2):207–18.View ArticlePubMedGoogle Scholar
- Stein M, Dittgen J, Sanchez-Rodriguez C, Hou BH, Molina A, Schulze-Lefert P, et al. Arabidopsis PEN3/PDR8, an ATP binding cassette transporter, contributes to nonhost resistance to inappropriate pathogens that enter by direct penetration. Plant Cell. 2006;18(3):731–46.View ArticlePubMed CentralPubMedGoogle Scholar
- Ruocco M, Ambrosino P, Lanzuise S, Woo SL, Lorito M, Scala F. Four potato (Solanum tuberosum) ABCG transporters and their expression in response to abiotic factors and Phytophthora infestans infection. J Plant Physiol. 2011;168(18):2225–33.View ArticlePubMedGoogle Scholar
- Mentewab A, Stewart CN. Overexpression of an Arabidopsis thaliana ABC transporter confers kanamycin resistance to transgenic plants. Nat Biotechnol. 2005;23:1177–80.View ArticlePubMedGoogle Scholar
- Luo B, Xue XY, Hu WL, Wang LJ, Chen XY. An ABC transporter gene of Arabidopsis thaliana, AtWBC11, is involved in cuticle development and prevention of organ fusion. Plant Cell Physiol. 2007;48(12):1790–802.View ArticlePubMedGoogle Scholar
- Kuromori T, Miyaji T, Yabuuchi H, Shimizu H, Sugimoto E, Kamiya A, et al. ABC transporter AtABCG25 is involved in abscisic acid transport and responses. Proc Natl Acad Sci U S A. 2010;107(5):2361–6.View ArticlePubMed CentralPubMedGoogle Scholar
- Ukitsu H, Kuromori T, Toyooka K, Goto Y, Matsuoka K, Sakuradani E, et al. Cytological and biochemical analysis of COF1, an Arabidopsis mutant of an ABC transporter gene. Plant Cell Physiol. 2007;48(11):1524–33.View ArticlePubMedGoogle Scholar
- Kang J, Hwang JU, Lee M, Kim YY, Assmann SM, Martinoia E, et al. PDR-type ABC transporter mediates cellular uptake of the phytohormone abscisic acid. Proc Natl Acad Sci U S A. 2010;107(5):2355–60.View ArticlePubMed CentralPubMedGoogle Scholar
- Campbell EJ, Schenk PM, Kazan K, Penninckx IAMA, Anderson JP, Maclean DJ, et al. Pathogen-responsive expression of a putative ATP-binding cassette transporter gene conferring resistance to the diterpenoid sclareol is regulated by multiple defense signaling pathways in Arabidopsis. Plant Physiol. 2003;133(3):1272–84.View ArticlePubMed CentralPubMedGoogle Scholar
- Galbiati M, Simoni L, Pavesi G, Cominelli E, Francia P, Vavasseur A, et al. Gene trap lines identify Arabidopsis genes expressed in stomatal guard cells. Plant J. 2008;53(5):750–62.View ArticlePubMedGoogle Scholar
- Zhang JZ. Evolution by gene duplication: an update. Trends Ecol Evol. 2003;18(6):292–8.View ArticleGoogle Scholar
- Taylor JS, Raes J. Duplication and divergence: the evolution of new genes and old ideas. Annu Rev Genet. 2004;38:615–43.View ArticlePubMedGoogle Scholar
- Snider J, Hanif A, Lee ME, Jin K, Yu AR, Graham C, et al. Mapping the functional yeast ABC transporter interactome. Nat Chem Biol. 2013;9(9):565–U564.View ArticlePubMedGoogle Scholar
- Lee M, Lee K, Lee J, Noh EW, Lee Y. AtPDR12 contributes to lead resistance in arabidopsis. Plant Physiol. 2005;138(2):827–36.View ArticlePubMed CentralPubMedGoogle Scholar
- Orsi CH, Tanksley SD. Natural variation in an ABC transporter gene associated with seed size evolution in tomato species. PLoS Genet. 2009; 5Google Scholar
- Takuno S, Nishio T, Satta Y, Innan H. Preservation of a pseudogene by gene conversion and diversifying selection. Genetics. 2008;180(1):517–31.View ArticlePubMed CentralPubMedGoogle Scholar
- Kliebenstein D, Lambrix V, Reichelt M, Gershenzon J, Mitchell-Olds T. Gene duplication and the diversification of secondary metabolism: side chain modification of glucosinolates in Arabidopsis thaliana. Plant Cell. 2001;13:681–93.View ArticlePubMed CentralPubMedGoogle Scholar
- Phytozome v9.1. www.phytozome.net.
- The Arabidopsis Information Resource (TAIR). https://www.arabidopsis.org.
- Tomato Genome Sequencing Consortium (TGC). http://solgenomics.net.
- Saccharomyces Genome Database (SGD). http://www.yeastgenome.org.
- Karlin S, Altschul SF. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci U S A. 1990;87(6):2264–8.View ArticlePubMed CentralPubMedGoogle Scholar
- Schultz J, Milpetz F, Bork P, Ponting CP. SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci U S A. 1998;95:5857–64.View ArticlePubMed CentralPubMedGoogle Scholar
- Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: Molecular Evolutionary Genetics Analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.View ArticlePubMed CentralPubMedGoogle Scholar
- Nei M, Kumar S, Takahashi K. The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small. Proc Natl Acad Sci U S A. 1998;95(21):12390–7.View ArticlePubMed CentralPubMedGoogle Scholar
- Felsenstein J. Confidence-limits on phylogenies - an approach using the bootstrap. Evolution. 1985;39(4):783–91.View ArticleGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.View ArticlePubMed CentralPubMedGoogle Scholar
- Delport W, Poon AFY, Frost SDW, Pond SLK. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010;26(19):2455–7.View ArticlePubMed CentralPubMedGoogle Scholar
- A language and environment for statistical computing. R Foundation for statistical computing. http://www.r-project.org.
This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.