- Research article
- Open Access
Genetic variability and evolutionary diversification of membrane ABC transporters in plants
BMC Plant Biology volume 15, Article number: 51 (2015)
ATP-binding cassette proteins have been recognized as playing a crucial role in the regulation of growth and resistance processes in all kingdoms of life. They have been deeply studied in vertebrates because of their role in drug resistance, but much less is known about ABC superfamily functions in plants.
Recently released plant genome sequences allowed us to identify 803 ABC transporters in four vascular plants (Oryza sativa, Solanum lycopersicum, Solanum tuberosum and Vitis vinifera) and 76 transporters in the green alga Volvox carteri, by comparing them with those reannotated in Arabidopsis thaliana and the yeast Saccharomyces cerevisiae. Retrieved proteins have been phylogenetically analysed to infer orthologous relationships. Most orthologous relationships in the A, D, E and F subfamilies were found, and interesting expansions within the ABCG subfamily were observed and discussed. A high level of purifying selection is acting in the five ABC subfamilies A, B, C, D and E. However, evolutionary rates of recent duplicate genes could influence vascular plant genome diversification. The transcription profiles of ABC genes within tomato organs revealed a broad functional role for some transporters and a more specific activity for others, suggesting the presence of key ABC regulators in tomato.
The findings of this work could help address several biological questions concerning the evolution of the relationship between genomes of different species. The plant ABC protein inventories obtained could be a valuable tool for both basic and applied studies. Indeed, inference of putative gene functions can accelerate the discovery of new ABC superfamily members.
Life is not possible without the exchange of substances and information between cells; therefore, macro- and micro-organisms have developed efficient transport systems to control the molecular interaction processes within the colonized environment. Transport of many molecules across the semipermeable cell membrane against a concentration gradient requires energy, which can be provided, for instance, by ATP hydrolysis. Plants, due to their sessile status, have evolved a very complex movement system for molecules, in which proteins belonging to the ABC superfamily play a major role.
The ATP-binding cassette (ABC) superfamily represents one of the largest protein groups within the kingdoms of archaea, eubacteria and eukarya. Proteins belonging to this superfamily are ATP-powered transporters able to translocate substrates across cellular membranes. The transported molecules, even if secreted by the same ABC protein, may be extremely different, both in terms of chemistry and structure.
The canonical architecture of ABC transporters comprises two transmembrane domains (TMDs) and two cytosolic nucleotide-binding domains (NBDs), also known as ATP-binding cassettes. The structural organisation of the four domains is a dimer of dimers, which can be deployed as single polypeptides or as multisubunit oligomers, reflecting ancient gene duplication events and fusions of the cytosolic catalytic domains with the membrane-spanning domains. Usually intracytosolic loops are present as extensions of TMDs and function as the interface between the NBDs and TMDs. The NBD contains several highly conserved motifs, including the Walker A and B sequences, the ABC signature motif, the H loop and the Q loop.
The ABC signature (alias C motif or LSGGQ motif; (LIVMFY)S(SG)GX3(RKA)(LIVMYA)X(LIVFM)(AG), where X3 denotes any three residues) is situated between the two Walker boxes, and is the hallmark that distinguishes ABC transporters from other ATP-binding proteins.
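As an illustration of how the signature consensus quoted above can be used in practice, the following minimal sketch transcribes it into a regular expression and scans a protein sequence for matches. The function name and the toy sequence are hypothetical, and treating each X as "any single residue" is an assumption about the notation; this is not the detection procedure used in the study, which relied on domain annotation.

```python
import re

# Consensus (LIVMFY)S(SG)GX3(RKA)(LIVMYA)X(LIVFM)(AG) written as a regex.
# Assumption: X stands for any single residue, X3 for any three residues.
ABC_SIGNATURE = re.compile(r"[LIVMFY]S[SG]G.{3}[RKA][LIVMYA].[LIVFM][AG]")


def find_abc_signatures(protein_seq):
    """Return (start_index, matched_text) for each non-overlapping hit."""
    seq = protein_seq.upper()
    return [(m.start(), m.group()) for m in ABC_SIGNATURE.finditer(seq)]


# Toy sequence (hypothetical, not a real protein) carrying one motif.
toy_sequence = "MKTAAGLSGGQKQRVAIAEE"
hits = find_abc_signatures(toy_sequence)
print(hits)
```

A regular expression of this kind only flags candidate sites; a match outside an NBD context would still need confirmation by domain analysis.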
Due to the complexity and size of the ABC protein superfamily, a precise classification of all the subfamilies is necessary. Several categorisations have been proposed. The transporter classification (TC) system, based on incorporation of both functional and phylogenetic information, includes 53 subfamilies of ABC exporters and 34 of ABC importers (http://www.tcdb.org). Based on sequence comparison, three classes of ABC systems that were probably present in the last common ancestor of archaea, bacteria and eukarya have been proposed [6,7]: class 1 comprises transporters with fused TMDs and NBDs (exporters); class 2 includes non-transporter ABCs lacking TMDs; class 3 (which is absent in eukaryotes) includes mainly transporters with NBDs and TMDs formed by separate polypeptide chains (canonical importers), and some bacterial exporters.
Plant genomes encode a large number of ABC proteins, with more than 120 found in both Arabidopsis thaliana and Oryza sativa [8,9]. The currently used plant ABC protein classification systems are mainly based on phylogenetic information, domain arrangement or similarity/structure comparison with human and microbial prototypes (e.g. Pleiotropic Drug Resistance, PDR). Sanchez-Fernandez et al. combined information from different classification systems and identified 13 plant subfamilies, including also membrane-bound ABCs that consist only of those containing soluble NBD domains. In order to unify plant and animal ABC naming systems, the Human Genome Organization (HUGO) proposed a new subfamily designation for vertebrate and invertebrate ABC communities, which is now widely used [11,12]. This system originally comprised seven ABC subfamilies (A–G) based on sequence homology, phylogenetic relationships and domain organization. Subsequently, following a more recent inventory of Drosophila and fish ABC proteins, an additional subfamily (H), not containing members from plants, has been defined. For plants, a further subfamily (I) has been created to incorporate ‘prokaryotic’-type ABCs that are not present in many animal genomes. Subfamilies ABCA, ABCB, ABCC and ABCD contain “forward orientation” TMD-NBD transporters. Subfamilies ABCG and ABCH, instead, are characterized by a “reverse organization” of the domains (NBD-TMD). Subfamilies E and F contain only two NBD domains and are thus labelled as “soluble”. These proteins are not transporters, but their NBDs clearly cluster with those of other ABC proteins.
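The orientation rules described above (forward TMD-NBD, reverse NBD-TMD, soluble NBD-only) can be sketched as a small function that maps an ordered list of domain labels to a topology class. The labels, the function name and the returned strings are illustrative assumptions, not the annotation pipeline used in this work, and topology alone does not assign a subfamily.

```python
def classify_topology(domains):
    """Classify an ABC protein from its N- to C-terminal domain order.

    `domains` is a list such as ["TMD", "NBD", "TMD", "NBD"].
    Returns "forward", "reverse", "soluble" or "TMD only".
    """
    if not domains:
        raise ValueError("at least one domain is required")
    kinds = set(domains)
    if not kinds <= {"TMD", "NBD"}:
        raise ValueError("unknown domain label in %r" % (domains,))
    if kinds == {"NBD"}:
        return "soluble"    # NBD-only proteins, as in subfamilies E and F
    if kinds == {"TMD"}:
        return "TMD only"   # no ATP-binding cassette detected
    # Mixed architectures: the first domain decides the orientation.
    return "forward" if domains[0] == "TMD" else "reverse"


print(classify_topology(["TMD", "NBD", "TMD", "NBD"]))  # full-size, forward
print(classify_topology(["NBD", "TMD"]))                # half-size, reverse
print(classify_topology(["NBD", "NBD"]))                # soluble
```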
The availability of plant genome sequences is growing fast, resulting in an almost completely unexplored repository. The aim of the present work was to list, compare and phylogenetically classify the ABC proteins of selected vascular plant genomes (A. thaliana, O. sativa, Solanum lycopersicum, Solanum tuberosum, Vitis vinifera), the green alga Volvox carteri and the yeast Saccharomyces cerevisiae, in order to facilitate future studies on ABC genes and proteins. Identification and classification were based on the work by Verrier and collaborators. Further, a selection pressure study and a customized gene duplication analysis, to identify recent duplication events in vascular plant genomes, have been carried out. Finally, an expression profile overview of the ABC superfamily, to detect tissue-specific transporter activation in tomato, has been performed as a proof of concept and reported.
Results and Discussion
Identification and characterization of putative ABC proteins
A BLASTp search of the O. sativa, S. lycopersicum, S. tuberosum, V. vinifera and green alga V. carteri proteomes with the Arabidopsis ABC protein dataset allowed us to discover a number of potential ABC sequences. The domain composition of proteins was assessed through a domain detection analysis. A total of 995 proteins (Additional file 1: Table S1) containing one or more ATP-binding cassette domains were identified. Our analysis enlarged the number of ABC proteins identified in rice, confirming data on the ABCG family refined by Matsuda et al., added 32 novel ABC transporters in V. vinifera, and provided manually curated ABC protein catalogues for S. lycopersicum and S. tuberosum. Most of these proteins belong to known plant subfamilies (A-I, except H) (Additional file 2: Table S2 and Additional file 3). Proteins containing a single domain or novel associations were also recorded. Subfamily A is well conserved among plant species (varying from 6 to 13 members), but it is absent in S. cerevisiae. This protein group, associated with perturbed cellular lipid transport [8,16], probably originated after the divergence of Ascomycota and Chlorophyta. Subfamily G showed the highest number of members in all the species tested. Subfamilies B and C had a rather stable number of proteins (ranging from 19 to 38) in the analysed genomes, except for V. carteri and S. cerevisiae. Interestingly, ABC proteins of the D, E and F subfamilies represent about 6% of the ABC transporters in O. sativa, S. lycopersicum, S. tuberosum, V. vinifera and A. thaliana but about 25% in V. carteri and S. cerevisiae. The ABCG proteins were found to represent an average of about 40% (ranging from 28% in the green alga to 55% in potato) of all annotated transporters in each analysed species. Subfamily G was particularly represented in rice and potato, in which 137 and 93 proteins were annotated, respectively.
Normalizing the total number of ABC proteins identified in each species by proteome size, we found a considerable number of ABC proteins in V. vinifera and S. tuberosum and a much lower number in O. sativa. The fraction of single ABC subfamily members in each plant species proteome varies considerably (Figure 1). The A. thaliana profile shows an expansion of proteins belonging to subfamily B. S. tuberosum and V. vinifera have a similar ABC profile, with the exception of a slight expansion of subfamily F, whose members have been found to be involved in growth and development. S. lycopersicum also shows an analogous profile, but in this case a clear contraction of the proteins belonging to subfamily G has been observed. A triplication event, retained in potato and lost in tomato, contributed to modifying the profile of several gene families. Indeed, it has already been demonstrated that the tomato and potato genomes differ significantly in their R gene complement. Subfamily F is highly represented in S. cerevisiae and in V. carteri, suggesting an important role in basic processes.
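The normalization described above can be sketched as follows: ABC counts are scaled by proteome size so that species become comparable, and per-subfamily fractions are taken within each species. The scaling to "per 1,000 proteins", the function names and all numbers below are hypothetical choices made for illustration; they are not values from Table S1 or Figure 1.

```python
def abc_per_thousand(abc_count, proteome_size):
    """ABC proteins per 1,000 annotated proteins in a proteome."""
    if proteome_size <= 0:
        raise ValueError("proteome_size must be positive")
    return 1000.0 * abc_count / proteome_size


def subfamily_fractions(counts):
    """Fraction of a species' ABC proteins falling in each subfamily."""
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no ABC proteins counted")
    return {name: n / total for name, n in counts.items()}


# Hypothetical counts for one species (illustrative only).
example_counts = {"A": 8, "B": 30, "C": 20, "G": 42}
print(abc_per_thousand(sum(example_counts.values()), 25000))
print(subfamily_fractions(example_counts))
```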
Selection pressure acting on plant ABC families
The difference between the non-synonymous substitution (dN) and synonymous substitution (dS) values has been used to infer the direction and magnitude of natural selection acting on protein-coding genes. In order to discover the selection pressure that characterizes the ABC subfamilies in A. thaliana, O. sativa, S. lycopersicum, S. tuberosum and V. vinifera, we used two different approaches based on the Nei-Gojobori and SLAC methods [20,21]. Table 1 shows the results of neutrality tests performed for each ABC subfamily by using coding DNA sequence (cds) alignments. The average δ (dN − dS) and ω (dN/dS) values of subfamilies A, B, C, D and E range from −35.31 to −4.54 and from 0.237 to 0.885, respectively, indicating that negative selection is acting against extreme polymorphic variants. The stabilizing selection that characterizes these subfamilies can be ascribed to the plant's need to preserve important protein functions [22,23]. In particular, subfamily E, whose members encode solute-carrier organic anion transporters, appeared to be subject to a very strong negative selection pressure (δ = −35.31; ω = 0.237), probably because of its role in RNA degradation. For the ABCF subfamily, the SLAC analysis indicates mild purifying selection (p-value < 0.05). The GWBC subgroup is the only group that showed a positive average for the ω value. Indeed, single codon analysis of the ABCGWBC group highlighted 396 positively selected sites (p-value < 0.05). Finally, ABCGPDR showed a negative pressure (δ = −13.448 and ω = 0.691), but the single codon analysis highlighted 93 codons under positive selection, of which about 15% are located in the first two Pfam NBD ABC_transporter-like domains (PF00005). Probably, the global protein structure of ABCGPDR has been conserved, but positive selection at specific sites of the NBD domains has been promoted to generate novel functions.
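The two summary statistics used above, δ = dN − dS and ω = dN/dS, and their conventional reading (ω below 1 suggesting purifying selection, above 1 suggesting positive selection) can be sketched in a few lines. This is only a point-estimate illustration with hypothetical input values; it does not reproduce the Nei-Gojobori or SLAC procedures, which estimate dN and dS from alignments and attach significance tests.

```python
def selection_summary(dn, ds):
    """Return (delta, omega, label) for given dN and dS estimates.

    delta = dN - dS; omega = dN / dS (None when dS is zero).
    The label is a naive reading of omega with no statistical test.
    """
    delta = dn - ds
    if ds == 0:
        return delta, None, "undefined (dS = 0)"
    omega = dn / ds
    if omega < 1:
        label = "purifying"
    elif omega > 1:
        label = "positive"
    else:
        label = "neutral"
    return delta, omega, label


# Hypothetical estimates, for illustration only.
print(selection_summary(0.2, 0.8))
print(selection_summary(1.5, 0.5))
```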
Phylogenetic reconstruction of ABC transporters evolutionary dynamics
In order to address questions about the evolutionary history of ABC proteins in plants, predicted proteins belonging to subfamilies A-G were aligned with one another. Furthermore, we performed a maximum likelihood analysis for each ABC subfamily using only complete ABC protein sequences belonging to all the analysed species (Figures 2, 3, 4, 5, Additional file 4: Figure S1 and Additional file 5: Figure S2). The number of proteins analysed for each subfamily varied greatly. The sequences are grouped into robust clades supported by bootstrap values ≥ 70%; to extract more information from the evolutionary history of each ABC subfamily, we also highlighted selected subgroups, indicated as “clusters”.
The ABCA subfamily
The ABCA phylogenetic tree (52 protein sequences) shows the presence of three ancestral V. carteri sequences, separated from the rest of the proteins, and two clades with four clusters with a good bootstrap value (>90) (Figure 2). Clade 1 comprises members belonging to all the species analysed. It seems that the proteins belonging to cluster 1 are well conserved in all species, even if a swift diversification between Arabidopsis and Solanum spp. was found. Clade 1-cluster 2 shows that a gene expansion occurred only in eudicot genomes. The transmembrane region of four proteins belonging to this cluster (Solyc04g015970.2.1-AT2G41700.1) reveals a string of about 26 amino acids with an alignment identity >70%. Members belonging to this subfamily are involved in cellular lipid transport, and a role of the full-length ABCA transporter AT2G41700.1, named AtAOH by Sanchez-Fernandez, in sterol metabolism has been demonstrated. A similar function for other proteins included in this sub-cluster could be hypothesized. Clade 2 includes 2 ABC transporters annotated in V. carteri and 27 vascular plant proteins, which are subject to a high degree of differentiation in all angiosperm species with the exception of V. vinifera. It is interesting to note that clade 2-cluster 4 contains only 8 Arabidopsis proteins with an alignment identity of about 70%, of which six are on chromosome 3, while clade 2-cluster 3 contains 6 Solanum ABCA proteins with an alignment identity of 90%. A potential translocation of ABC transporters between chromosomes 3 and 11 of potato (Sotub11g021420.1.1-Sotub11g021450.1.1 and Sotub03g024890.1.1-Sotub03g024920.1.1) may have occurred, since the two chromosome segments show similar gene arrangements (data not shown).
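Alignment identity values such as those quoted above are computed from aligned sequences. A minimal sketch of one common definition follows: the percentage of identical residues among alignment columns where neither sequence has a gap. The choice of denominator is an assumption (tools differ on whether gapped columns count), and the toy strings are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent identity over columns where neither sequence has a gap."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a, aligned_b):
        if a == gap or b == gap:
            continue  # skip gapped columns (assumed convention)
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned pair (hypothetical): 4 matches out of 5 ungapped columns.
print(percent_identity("ACDE-F", "ACDQGF"))
```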
The ABCB subfamily
The ABCB evolutionary history (Additional file 4: Figure S1) was inferred by analysing 169 proteins. Several sub-groups were identified in this subfamily, suggesting a large diversification among the analysed species. The phylogenetic tree displayed 13 main clades, supported by high internal branch bootstrap indexes. Subsequently, modifications of the original sequence arrangement produced few sequences in yeast and algae, and a huge number in vascular plants. Phylogenetic analysis suggests that sub-groups evolved differently in each species. However, orthologues of six Arabidopsis proteins present in clade 1 were identified both in monocot and dicot species. Proteins belonging to this subfamily seem to be involved in auxin influx transport in roots, and contribute to the basipetal transport in hypocotyls and root tips by establishing an auxin uptake sink in the root cap. Moreover, they confer sensitivity to 1-N-naphthylphthalamic acid (NPA), regulate root elongation, initiation of lateral roots and development of root hairs, and transport IAA, indole-3-propionic acid, NPA, syringic acid, vanillic acid and some auxin metabolites, but not 2,4-D and 1-naphthaleneacetic acid [27,28]. In particular, AT2G36910.1 and AT3G28860.1 are involved in auxin transport in stems and roots, respectively [29,30]. It is possible to predict similar functions for orthologous proteins and gain insight into species not yet characterized by looking at specific clade arrangements. For instance, clade 8 comprises ten transporters from all the species analysed. S. cerevisiae (YMR301C) and Arabidopsis (AT4G28620.1, AT4G28630.1 and AT5G58270.1) proteins were found to be involved in iron homeostasis, suggesting that this function is well conserved among species. Proteins belonging to clade 11 present a well conserved string of 42 amino acids (alignment identity of 97%) following the Pfam domains (PF00005) (Additional file 6: Figure S2).
Interestingly, a member of this group, At5g39040.1, has been reported to be involved in aluminium resistance. Finally, in clade 13 (bootstrap index 70%), which groups 40 ABC proteins, three large expansions were observed in tomato (9 ABCBs), potato (9 ABCBs) and rice (10 ABCBs). A perfect conservation of orthologous pairs between tomato and potato on chromosomes 11, 6, 12, 3 and 2 [29,33] has been detected.
The ABCC subfamily
ABCC (Additional file 7: Figure S3) is a large subfamily of “full-size”, “forward-orientation” proteins. The phylogenetic tree obtained by comparing 109 proteins shows that three S. cerevisiae and six V. carteri proteins, and one protein from O. sativa (Os04g33700.1), cluster separately. Two distinct angiosperm clades can be identified (bootstrap index >75). Proteins belonging to this subfamily have been found to be involved in cellular processes such as vacuolar transport, detoxification and regulation of guard cell plasma membrane ion channels. Clade 1 comprises 12 proteins, of which four annotated in Arabidopsis (AT1G30400.1, AT1G30410.1, AT1G30420.1 and AT2G34660.1) are involved in detoxification, vacuolar transport of abscisic acid and glucosyl ester, organic anion transport, chlorophyll degradation and modulation of seed phytate content [29,34,35]. A single orthologue in potato and in rice, two members in tomato and four members in grape that are putatively involved in the fruit maturation process have been found [36,37]. In clade 2 we highlighted four remarkable clusters. In particular, cluster 1 groups orthologous genes of AT2G07680.1, involved in vacuole traffic, and cluster 2 contains nine proteins with an identity of 60% to AT3G62700.1, involved in vacuolar transport of abscisic acid glucosyl ester [39,33]. In cluster 4 there is an Arabidopsis ABCC protein (AT1G04120.1), involved in the regulation of anion and calcium channel activities, which shows a high sequence similarity (alignment identity of 77%) with four other eudicot proteins. Cluster 4 also contains angiosperm proteins with an average identity of about 75%.
The ABCD subfamily
The ABCD phylogenetic tree, obtained with proteins belonging to all considered phyla (Figure 3), revealed the evolutionary history of this subfamily. The S. cerevisiae YKL188w peptide is separated from the two main clades and could be designated as the ancestral protein. ABC transporters belonging to this subfamily have been found to play a role in peroxisome transport [40,41]. Clade 1 includes “full-size” proteins (average identity 76%) with “forward orientation”. This clade contains the Arabidopsis protein AT4G39850.1, involved in the peroxisomal uptake of a wide range of substrates [42-44]. A similar function could be hypothesized for the homologous peptides (Os01g73530.1, Os05g01700.1, Solyc04g055120.2.1, Sotub04g020700.1.1) detected in tomato, potato and rice. Clade 2 includes “half-size” proteins. The transmembrane domain (400 amino acids) is very well conserved (84% average identity) among proteins belonging to this clade, as is the NBD 1 domain. Interestingly, the two Solanum proteins (Solyc12g017420.1.1 and Sotub12g013980.1.1) and the three transporters AT1G54350.1, GSVIVT01036685001 and Os01g11946.1 show a higher identity with the NBD motif of green alga Vocar20007372m (about 60% identity) and Vocar20009192m (about 70% identity), respectively (Figure 3B).
The ABCE subfamily
The ABCE subfamily (Figure 4), with only ten proteins detected, was found to be the smallest among the subfamilies analysed in this work. The structure of the phylogenetic tree was extremely useful in tracking the evolution of these “transporters”, also known as RNase L inhibitors (RLI) (Figure 4A). The two ancestral S. cerevisiae (YDR091C) and V. carteri (Vocar20004039m) proteins were more similar to the A. thaliana (AT3G13640.1) and O. sativa (Os02g18180.1) proteins. Only for Solanum spp. proteins was a small expansion observed (clade 1 of Figure 4). In this group there was a V. vinifera protein (GSVIVT01036876001) that clustered with the Arabidopsis protein AT4G19210.1, which contains N-terminal “ferredoxin” (4Fe4S-type) motifs and interacts with nucleic acids [11,46,47].
The ABCF subfamily
The phylogenetic tree of the ABCF subfamily, obtained by comparing 46 proteins, reveals five clades (Figure 5). Proteins belonging to this subfamily have been found to be involved in stress-associated control and seem to have an ancestral origin, since they are highly represented both in V. carteri and S. cerevisiae (Additional file 1: Table S1). S. cerevisiae proteins are included in all clades except clades 1 and 5. These two clades show an alignment identity of 61% and 66%, respectively, and include two V. carteri proteins (Vocar20008959m and Vocar20013543m). Interestingly, Arabidopsis proteins present in clades 1 and 5 (AT3G54540.1 and AT5G64840.1) have been found to be involved in root growth and development, and a similar role could be predicted for the other proteins belonging to these clades. Clade 2, with an alignment identity greater than 75%, includes highly conserved proteins in all species analysed. Clade 4 comprises six proteins belonging to S. cerevisiae and V. carteri, with a low alignment identity (42%). Interestingly, three of these proteins (YPL226W, Vocar20002122, Vocar20002123), one in yeast and two in algae, have an additional chromo-domain (IPR023780).
The ABCG subfamily
ABCG, the largest plant ABC transporter subfamily, includes two groups according to the Sanchez-Fernandez nomenclature: WBCs and PDRs. Members of the ABCGWBC group consist of approximately 600–750 amino acid residues and can be involved in the extrusion of cuticular lipids [49,50]. ABCG full-size proteins (ABCGPDR) have an NBD domain characterized by four “plant PDR signatures”. Many proteins belonging to this subfamily have been found to be involved in resistance to pathogens, antimicrobial terpenoids and auxinic herbicides, and contribute to the transport of signalling molecules or secretion of volatile compounds [51-53].
The ABCGWBC group
The ABCGWBC evolutionary analysis, obtained by comparing 219 proteins, shows a high diversification (Additional file 5: Figure S4). Eleven clades that encompass a number of proteins varying from 31 (clade 7) to 4 (clades 5, 7 and 10) have been produced. Clades 1, 2 and 9 encompass 33% of the sequences analysed. A putative progenitor of clades 1 and 2 could be the S. cerevisiae protein YCR011C. Clade 1 contains AT3G55130.1, which appears to be involved in kanamycin resistance when overexpressed in transgenic plants. In clade 2 we found sequences that are highly conserved in eudicot genomes. Clades 3 and 4 comprise transporters that could be involved in the regulation of lipid/sterol homeostasis required for proper vascular development, like the AT1G31770.1 and AT4G27420.1 proteins [50,51,55]. Clade 5 groups three sequences similar to AT3G13220.1 (76% identity), which has been found to be involved in abscisic acid transport. Clade 7 includes an S. cerevisiae sequence (YOL75C) that may be ancestral to the diversification that occurs from clade 7 to 11. Clade 8 includes only 8 rice transporters. Clade 9 includes 21 transporters similar to AT1G17840.1 and AT1G51500.1, which are required for export of wax components such as alkanes.
The ABCGPDR group
ABC PDR transporters show a highly conserved PDR domain defined by the PFAM database (IPR013581). The phylogenetic analysis, performed by comparing 125 proteins (Figure 6), separated the ABCGPDR proteins into two well-defined groups (bootstrap indexes 70 and 100, respectively). Clade 1 (highlighted in yellow in Figure 6) includes exclusively 8 S. cerevisiae proteins characterized by the PDR domain (PF06422). In yeast, PDR proteins confer resistance to several anti-fungal compounds by actively transporting their substrates out of the cell. Next, two V. carteri proteins (Vocar20011822 and Vocar20005809) separate clade 1 from clade 2. The sequences of clade 2 are collapsed into eleven clusters marked with different colours in Figure 6.
Within cluster 1, 16 Solanum proteins are grouped, with a pairwise identity of 82%, located on chromosomes 5, 9 and 12 in the tomato and potato genomes. ABCGPDR sequences of this cluster show a high similarity with an ATP-binding cassette transporter (AT1G15520.1) of A. thaliana, PDR12 (AtPDR12)/ABCG40, known to be involved in pleiotropic drug resistance and abscisic acid (ABA) uptake transport [58,59]. Clusters 2 and 3 are specific to V. vinifera, while clusters 5 and 6 include only O. sativa sequences, suggesting a recent expansion of ABCGPDR in grape and rice. Cluster 7 contains the A. thaliana AT2G26910.1 gene, which is involved in cutin formation. The five dicot and monocot orthologues belonging to this cluster could be predicted to be involved in cutin formation as well.
Cluster 8 includes eight V. vinifera proteins encoded by genes located on chromosomes 13, 8 and 6 and eleven Solanum proteins encoded by genes located on chromosomes 5 and 11, which show a high identity with the A. thaliana AT1G66950.1 and AT2G36380.1 genes, known to be highly expressed in root cells for the secretion of several secondary metabolites. Cluster 9 encompasses an Arabidopsis PDR protein (AtABCG36/AT1G59870.1), which seems to be involved in susceptibility/resistance to the barley powdery mildew pathogen. Eight proteins annotated in tomato, potato, grape and rice with a pairwise identity of 67% also belong to this cluster. Thirteen and six transporters were grouped in clusters 10 and 11, respectively. In cluster 10, the six A. thaliana proteins, including AT4G15230 (AtABCG30), a protein involved in root exudation of phytochemicals, show an average identity > 60% with the other proteins of this cluster. In cluster 11, a very strong homology (about 75%) of five Solanum and V. vinifera proteins with the A. thaliana AT2G29940.1 protein, involved in the modulation of stomata activity, was detected.
Genomic distribution and recent gene duplication events
The genome-wide distribution of Arabidopsis, tomato, potato, rice and grape ABC transporter genes relative to chromosome size was significantly non-random (Arabidopsis p = 0.02; tomato p = 0.005; potato p = 1e-8; rice p = 0.03 and grape p = 3e-12) (Figure 7). The greatest number of ABCs in Arabidopsis was found on chromosome 3 (about 30% of the annotated genes). In the Solanaceae genomes about 15% of transporters were located on chromosome 12, while the smallest number was found on chromosome 10. In the rice and grape genomes, about 20% and 26% of ABC transporters were positioned on chromosomes 1 and 9, respectively.
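This section does not state which test produced the p-values above. One common way to ask whether gene counts per chromosome depart from what chromosome size alone would predict is a chi-square goodness-of-fit statistic, sketched below under that assumption. The function name and the toy numbers are hypothetical; converting the statistic to a p-value requires the chi-square distribution with (number of chromosomes − 1) degrees of freedom, which a statistics library can supply.

```python
def chi_square_vs_size(gene_counts, chrom_sizes):
    """Chi-square statistic of observed gene counts against counts
    expected if genes were spread in proportion to chromosome size.

    Returns (statistic, expected_counts).
    """
    if len(gene_counts) != len(chrom_sizes):
        raise ValueError("one size is needed per chromosome")
    total_genes = sum(gene_counts)
    total_size = sum(chrom_sizes)
    expected = [total_genes * size / total_size for size in chrom_sizes]
    statistic = sum((obs - exp) ** 2 / exp
                    for obs, exp in zip(gene_counts, expected))
    return statistic, expected


# Toy example (hypothetical): two equally sized chromosomes.
stat, expected = chi_square_vs_size([30, 10], [1.0, 1.0])
print(stat, expected)
```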
Available genomic data provide substantial evidence for an abundance of duplicated genes in all surveyed organisms. Gene duplications play a leading role in the evolution of genomes. The importance of these events is linked to the need of organisms to generate novel functions [61,62]. Detailed computational analysis of individual gene families in different genomic sequences can be used to uncover the mechanisms behind evolution by gene duplication. In order to discover the ABC gene duplications that took place in each analysed genome, we developed a robust system (see Methods) for detecting recent duplication events (Additional file 8: Figure S5, Additional file 9: Figure S6, Additional file 10: Figure S7, Additional file 11: Figure S8, Additional file 12: Figure S9, Additional file 13: Figure S10, Additional file 14: Figure S11). A total of 205 ABC genes were involved in recent duplications, corresponding to 25% (ranging from 17% to 31%) of the annotated transporters in the seven species (Table 2).
Overall, the data shown in Table 3 suggest that gene duplication events in vascular plants generated an expansion of some ABC protein families. Probably, this phenomenon is due to the need for an efficient molecular cell interconnection among and within tissues of vascular plants.
More than 90% of the gene duplications found in this study concern the B, C and G subfamilies. By contrast, transporters belonging to subfamilies D and E were shown to be highly conserved (Table 3). In particular, 110 genes, out of 465 annotated in subfamily G, are involved in duplication events (Table 3), indicating a considerable contribution of this ABC subfamily to expansion in all the vascular plants analysed here.
The following example illustrates a case study of a Solanum spp. ABC transporter locus (Duplication Block 1 in Additional file 11: Figure S8; Solyc05g053570.2.1-Solyc05g053600.2.1) under high evolutionary pressure. This region contains five potato and three tomato transporter genes in the respective genomes, showing a high homology to AT1G15520.1 (average identity about 65%), which is known to be involved in pleiotropic drug resistance, abscisic acid (ABA) uptake transport and lead resistance during Pseudomonas infection [58,59,64,65]. The tomato locus on chromosome 5 showed an average identity of 75% with the orthologous potato locus. The genes found in this Solanum locus are grouped in cluster 1 of the ABCGPDR phylogenetic tree. Figure 8A proposes a phylogenetic reconstruction of recent duplication events and orthologous relationships between tomato and potato ABCGPDR transporters. Moreover, a genomic alignment is shown in panel B, where the collinear genome blocks confirm the high conservation of this genomic region in Solanum species.
Tomato expression profile of ABC transporter families
A tomato genome-wide overview of ABC expression profiles was performed to gain insights into the biological role of ABC proteins in tomato. It has already been demonstrated that ABC genes can exert their control via transcriptional expression and that a synteny approach is a powerful tool to identify candidates in this species. We analysed the expression profiles of our annotated tomato ABC genes in five different tissues (bud, flower, leaf, root and fruit), grouping ABC genes according to their subfamily. Of the 180 ABC transporters annotated in S. lycopersicum Heinz 1706, more than 85% are expressed in at least one of the tissues examined, considering a value normalized as transcripts per million (TPM) > 2 (Table 4). All members of the A, D, E and F subfamilies and about 85% of transporters of subfamilies C and I are expressed, while 72% and 67% of ABCB and ABCG genes, respectively, show a TPM value higher than 2, suggesting that 15-30% of members belonging to these subfamilies could be pseudogenes. Some of these subfamilies (B, C and G) also show a high rate of gene duplication, suggesting that pseudogenization events could have occurred during the diversification.
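The expression criterion used above (a gene counts as expressed if its TPM exceeds 2 in at least one tissue) can be sketched as follows. The data structure, function names, gene identifiers and TPM values are hypothetical and serve only to illustrate the thresholding; they are not taken from Table 4.

```python
TPM_THRESHOLD = 2.0  # cutoff stated in the text


def is_expressed(tpm_by_tissue, threshold=TPM_THRESHOLD):
    """True if TPM exceeds the threshold in at least one tissue."""
    return any(value > threshold for value in tpm_by_tissue.values())


def expressed_fraction(profiles, threshold=TPM_THRESHOLD):
    """Fraction of genes expressed in at least one tissue."""
    if not profiles:
        raise ValueError("no expression profiles supplied")
    n_expressed = sum(is_expressed(p, threshold) for p in profiles.values())
    return n_expressed / len(profiles)


# Hypothetical profiles (gene -> tissue -> TPM), illustration only.
toy_profiles = {
    "geneA": {"bud": 0.5, "flower": 0.1, "leaf": 12.0, "root": 0.0, "fruit": 1.9},
    "geneB": {"bud": 0.0, "flower": 0.0, "leaf": 0.3, "root": 1.0, "fruit": 2.0},
    "geneC": {"bud": 5.0, "flower": 7.5, "leaf": 3.1, "root": 9.9, "fruit": 4.2},
    "geneD": {"bud": 0.0, "flower": 0.0, "leaf": 0.0, "root": 0.0, "fruit": 0.0},
}
print(expressed_fraction(toy_profiles))
```

Note that the comparison is strict, so a gene with a TPM of exactly 2 in its best tissue (geneB above) is not counted as expressed.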
In Figure 9 a diagram of Venn shows the expression profile intersections of the five tissues analysed, evidencing ABCs expressed in specific tissues of tomato. ABCGWBC (Solyc07g053300.1.1) is expressed only in flower and is located in clade 2 of ABCGWBC phylogenetic tree close to AT1G53270.1 (Additional file 5: Figure S2). Two ABCC (Solyc00g283010.1.1and Solyc11g065710.1.1) and an ABCGPDR (Solyc12g019620.1.1), located on chromosome 0, 11 and 12 respectively, are expressed only in leaf. Nine transporters (Solyc03g113690.1.1, Solyc06g036490.1.1, Solyc06g072090.1.1, Solyc07g065770.2.1, Solyc07g065780.1.1, Solyc09g042280.1.1, Solyc09g042300.1.1, Solyc11g065360.1.1, Solyc12g013630.1.1) are expressed specifically in bud tissues: 8 are ABCGWBC members and only Solyc06g036490.1.1 is a member of the C subfamily. Further, nine genes (Solyc03g093650.2.1, Solyc03g113070.2.1, Solyc03g113080.2.1, Solyc05g051540.1.1, Solyc07g018130.1.1, Solyc08g067610.2.1, Solyc11g067300.1.1, Solyc12g019640.1.1) are specifically expressed in root. About 40% (63) of ABC transporters are expressed in all the five tissues. The complexity of ABC subfamilies expression profiles is showed in nine heat-maps (Additional file 15: Figure S12). Genes with expression profiles characterized by high levels of transcription are surrounded in green. Blue boxes indicate groups of ABC transporters with low levels of expression and include subfamilies C, B, GWBC and I. ABCGPDR subfamily members Solyc05g0553302.1 and Solyc11g0670001.1 are found to be highly expressed in root, confirming the high level of activation reported for their homologues AT1G66950.1 and AT2G36380.1 (see paragraph The ABC G PDR group ). Solyc05g0185102.1 is high expressed in fruit and flower and Solyc06g0656702.1 in bud and flowers similarly to the Arabidopsis homologue (AT2G26910.1) found to be involved in cutin formation (see paragraph The ABC G PDR group ). 
The group that shows homology with AT2G26910.1 (see paragraph The ABC G PDR group) shows striking differences in expression among specific tissues. Solyc03g120980.2.1, a homologue of AT1G59870.1 (see paragraph The ABC G PDR group), is highly expressed in fruit, bud and flowers. Solyc01g101070 showed elevated expression in all analysed tissues. Solyc06g076930.1.1, a homologue of the AT2G29940.1 protein, a modulator of stomatal activity, is highly expressed in all tissues, especially in leaf. Analysis of the expression profiles of 32 recently duplicated tomato ABC transporters (Additional file 16: Table S3) revealed a high expression level for 9 genes in one or more plant tissues. In some cases a moderate or comparable expression of other copies belonging to the same duplication block was observed. Gene duplication probably produces a prompt increase in the expression level of this gene subfamily in specific tissues. Five duplication blocks showed a very low level of expression, and the duplication block located on chromosome 12 (Solyc12g013630.1.1-Solyc12g013640.1.1) clearly shows the presence of one active copy.
ABC proteins are firmly established as key players in cellular processes involved in auxin transport, lipid catabolism, xenobiotic detoxification, disease resistance and stomatal function. In this study, 803 ABC transporters were identified by in silico analysis of four plant species (O. sativa, S. lycopersicum, S. tuberosum, V. vinifera) and 76 transporters in the green alga V. carteri, by comparing them with those reannotated in Arabidopsis (A. thaliana) and the yeast S. cerevisiae. The characterization of ABC proteins based on domain annotation allowed the discovery of new subfamily members. Moreover, we ascertained that ABCG represents the largest group of ABC proteins in all plant species analysed. Phylogenetic analysis allowed us to trace the evolutionary history of plant ABCs, highlighting their diversification in eukarya. It is well known that large genome datasets accelerate gene discovery in plants. By analysing the expression data of all tomato ABCs identified in this study, we were able to provide an indication of the putative role of these genes. The results of this work offer useful inputs that may help, for instance, to discover ABC genes with broader or more specific roles, and to address several biological questions concerning the evolution of the relationships between genomes of different species.
Genome search for ABC transporter identification
Oryza sativa, Vitis vinifera and Volvox carteri genome data were downloaded from the Phytozome portal. Arabidopsis thaliana data were obtained from the TAIR database. Tomato and potato sequences were provided by the Tomato Genome Sequencing Consortium. Saccharomyces cerevisiae strain S288C data were taken from the Saccharomyces Genome Database. A BLASTp analysis (e-value < 1e-6) was performed to identify potential ABC transporters in the different species, searching the entire proteome of each analysed species with the 132 ABC protein sequences annotated in A. thaliana and previously described [8,10].
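As an illustration of the e-value filter described above, the sketch below collects the subject proteins whose hits pass the 1e-6 cutoff. It assumes that the search results are available in the standard 12-column tabular BLAST output (query, subject, percent identity, alignment length, mismatches, gap openings, query start and end, subject start and end, e-value, bit score); the output format actually used in the study is not stated, and the identifiers and values are invented.

```python
EVALUE_CUTOFF = 1e-6   # threshold stated in the text
SUBJECT_COLUMN = 1     # 0-based column of the subject identifier
EVALUE_COLUMN = 10     # 0-based column of the e-value in 12-column tabular output


def candidate_subjects(lines, cutoff=EVALUE_CUTOFF):
    """Return the set of subject IDs with at least one hit below the e-value cutoff."""
    hits = set()
    for line in lines:
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comment lines
        fields = line.split("\t")
        if float(fields[EVALUE_COLUMN]) < cutoff:
            hits.add(fields[SUBJECT_COLUMN])
    return hits


def toy_row(query, subject, evalue):
    """Build one tabular line; all columns except IDs and e-value are placeholders."""
    return "\t".join(
        [query, subject, "50.0", "200", "100", "0", "1", "200", "1", "200", evalue, "100"]
    )


# Hypothetical hits: prot_z fails the cutoff, prot_x is hit by two queries.
toy_hits = [
    toy_row("query_1", "prot_x", "1e-50"),
    toy_row("query_1", "prot_y", "3e-7"),
    toy_row("query_2", "prot_z", "0.002"),
    toy_row("query_2", "prot_x", "1e-20"),
]
candidates = candidate_subjects(toy_hits)
print(sorted(candidates))
```

Collecting subjects in a set removes the redundancy that arises when several Arabidopsis queries hit the same protein.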
Functional prediction of ABC transporters
The set of proteins identified via the BLASTp search was further scrutinized using InterProScan software to verify the presence of the conserved domains and motifs characteristic of ABC proteins (NBD-TMD). These domains and motifs allowed us to sort the ABC proteins into eight major plant subfamilies (A–I, except subfamily H, which has no members in plants). In this analysis, recovered sequences were compared with the following databases: HMMPanther (Hidden Markov model Panther) to find the characteristic domains for ABC subfamilies, HMMTigr (Hidden Markov model Tigr), patternScan, FPrintScan, HMMPIR, ProfileScan, HAMAP (High-quality Automated and Manual Annotation of Microbial Proteomes), SignalPHMM, PROSITE to identify conserved sequences of ABC transporters, SuperFamily, PRINTS (Fingerprint database), HMMPfam (Protein family) to find “ABC domains”, BlastProDom (Blast protein domain database), and HMMSMART (Simple Modular Architecture Research Tool) protein motif analyses to find ATPase domains. The TMHMM database was also accessed to verify the presence of transmembrane regions.
Evolutionary analyses of all subfamilies, except for I (Dataset B), were conducted using MEGA5. The protein sequences were aligned using ClustalW (v. 1.74) with default parameters. The phylogenetic relationships were inferred separately for each ABC subfamily using the Maximum Likelihood method. The best phylogenetic method and evolutionary model were determined among candidate models of protein evolution. Models with the lowest BIC (Bayesian Information Criterion) scores are considered to describe the substitution pattern best. For each model, the AICc value (Akaike Information Criterion, corrected), the Maximum Likelihood value (lnL) and the number of parameters (including branch lengths) are also presented. The bootstrap consensus tree inferred from 100 replicates was taken to represent the evolutionary history of the sequences analysed. The trees were drawn to scale, with branch lengths measured as the number of substitutions per site. We considered as significant those clades with a bootstrap value ≥ 70 that contain at least 4 ABC transporter sequences. ABC protein subgroups described in more detail were labelled as “clusters”.
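The model ranking described above can be sketched from the three reported quantities (lnL, number of parameters k, and a sample size n) using the textbook definitions BIC = k ln(n) - 2 lnL and AICc = 2k - 2 lnL + 2k(k + 1)/(n - k - 1). This is only an illustration: the model names and values are invented, and taking n as the number of alignment sites is an assumption, since the exact sample size used by MEGA5 is not stated here.

```python
import math


def bic(ln_l, k, n):
    """Bayesian Information Criterion: k * ln(n) - 2 * lnL."""
    return k * math.log(n) - 2.0 * ln_l


def aicc(ln_l, k, n):
    """Akaike Information Criterion with small-sample correction."""
    return 2.0 * k - 2.0 * ln_l + (2.0 * k * (k + 1)) / (n - k - 1)


# Hypothetical candidates: name -> (lnL, number of parameters incl. branch lengths).
candidate_models = {
    "model_1": (-10250.0, 45),
    "model_2": (-10190.0, 46),
    "model_3": (-10185.0, 65),
}
n_sites = 500  # assumed sample size (number of alignment sites)

# Lower BIC is better; the extra parameters of model_3 outweigh its small lnL gain.
best_model = min(candidate_models, key=lambda m: bic(*candidate_models[m], n_sites))
for name, (ln_l, k) in candidate_models.items():
    print(name, round(bic(ln_l, k, n_sites), 1), round(aicc(ln_l, k, n_sites), 1))
print("lowest BIC:", best_model)
```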
Recent duplication events of ABC transporter genes
To identify duplicated ABC transporter pairs, we ran a phylogenetic analysis of the ABC nucleotide sequences of Dataset B, using the Maximum Likelihood method and the General Time Reversible model.
We defined a gene duplication according to the following criteria: (1) a clade bootstrap index >80; (2) an alignable nucleotide sequence identity ≥70%; (3) physical co-localization of the putative recent duplicates on the chromosome; and (4) only one duplication event counted for tightly linked genes.
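The four criteria can be sketched as a filter followed by a grouping step. The sketch below makes two assumptions that are not specified in the text: physical co-localization is approximated as "same chromosome", and tightly linked genes are merged by chaining accepted pairs that share a gene, so that each resulting block counts as one event. All gene names and values are invented.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CandidatePair:
    gene_a: str
    gene_b: str
    bootstrap: float  # clade bootstrap support (0-100)
    identity: float   # alignable nucleotide identity (%)
    chrom_a: str
    chrom_b: str


def passes_filters(pair):
    """Criteria 1-3: bootstrap > 80, identity >= 70%, same chromosome (assumed)."""
    return pair.bootstrap > 80 and pair.identity >= 70 and pair.chrom_a == pair.chrom_b


def duplication_blocks(pairs):
    """Criterion 4: merge accepted pairs sharing a gene; one event per block."""
    parent = {}

    def find(gene):
        parent.setdefault(gene, gene)
        while parent[gene] != gene:
            parent[gene] = parent[parent[gene]]  # path halving
            gene = parent[gene]
        return gene

    for pair in pairs:
        if passes_filters(pair):
            parent[find(pair.gene_a)] = find(pair.gene_b)

    blocks = {}
    for gene in list(parent):
        blocks.setdefault(find(gene), set()).add(gene)
    return list(blocks.values())


toy_pairs = [
    CandidatePair("g1", "g2", 95, 88, "chr1", "chr1"),
    CandidatePair("g2", "g3", 90, 75, "chr1", "chr1"),  # chained with g1-g2
    CandidatePair("g4", "g5", 85, 65, "chr2", "chr2"),  # identity too low
    CandidatePair("g6", "g7", 99, 90, "chr3", "chr4"),  # not co-localized
    CandidatePair("g8", "g9", 80, 90, "chr5", "chr5"),  # bootstrap not > 80
]
blocks = duplication_blocks(toy_pairs)
print(len(blocks), blocks)
```

A distance threshold between gene coordinates could replace the same-chromosome test if a stricter notion of co-localization is needed.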
Evolution rates at codon sites
The selective pressure acting on the ABC subfamilies was investigated by determining the difference between nonsynonymous and synonymous nucleotide substitutions (dN-dS), indicated as δ. Tests were conducted to estimate the mode of evolution of each codon: positive (dN > dS), neutral (dN = dS) or negative (dN < dS). The variance of the difference was computed using the bootstrap method (1000 replicates). Analyses were conducted using the Nei-Gojobori method. All positions with less than 80% site coverage were eliminated. All the ABC coding DNA sequences were aligned using ClustalW 1.74. Evolutionary analyses were conducted in MEGA5. To clearly depict the proportion of sites under selection, an evolutionary fingerprint analysis was carried out using the SLAC algorithm implemented in the Datamonkey server.
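The sign convention for δ can be illustrated with a short sketch. It starts from the proportions of nonsynonymous and synonymous differences (pN and pS) and applies the Jukes-Cantor correction d = -3/4 ln(1 - 4p/3) of the original Nei-Gojobori formulation; whether the corrected or the uncorrected variant was used in MEGA5 is not stated in the text, and the input proportions are invented. The bootstrap variance estimate and the per-codon SLAC analysis are not reproduced here.

```python
import math


def jukes_cantor(p):
    """Jukes-Cantor corrected distance for a proportion of differences p (< 0.75)."""
    return -0.75 * math.log(1.0 - 4.0 * p / 3.0)


def delta(p_n, p_s):
    """delta = dN - dS, computed from nonsynonymous and synonymous proportions."""
    return jukes_cantor(p_n) - jukes_cantor(p_s)


def classify(d, tolerance=1e-9):
    """Map the sign of delta to the three categories used in the text."""
    if d > tolerance:
        return "positive"
    if d < -tolerance:
        return "negative"
    return "neutral"


# Hypothetical proportions: few nonsynonymous, many synonymous differences.
example_delta = delta(p_n=0.05, p_s=0.30)
print(round(example_delta, 4), classify(example_delta))
```

A negative δ, as in this toy case, corresponds to purifying selection.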
Expression data visualization
The expression data of tomato ABC transporters, extracted from the dataset of the Tomato Genome Consortium, were processed as reads per kilobase of exon model per million mapped reads (RPKM), subsequently normalized to TPM and visualized with R software.
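The RPKM to TPM normalization follows the standard relation TPM_i = RPKM_i / Σ_j RPKM_j × 10^6, computed separately for each sample. The analysis in the text was carried out in R; the Python sketch below, with invented values and a hypothetical function name, only illustrates the arithmetic.

```python
def rpkm_to_tpm(rpkm_by_gene):
    """Rescale the RPKM values of one sample so that they sum to one million."""
    total = sum(rpkm_by_gene.values())
    if total == 0:
        return {gene: 0.0 for gene in rpkm_by_gene}
    return {gene: value / total * 1e6 for gene, value in rpkm_by_gene.items()}


# Toy sample with hypothetical genes; a real sample would include all genes,
# not only the ABC transporters, before rescaling.
toy_rpkm = {"gene_1": 10.0, "gene_2": 30.0, "gene_3": 60.0}
toy_tpm = rpkm_to_tpm(toy_rpkm)
print(toy_tpm)
```

Because every sample sums to the same total after this step, TPM values are directly comparable between tissues, which is what a fixed threshold such as TPM > 2 requires.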
Availability of supporting data
The data sets supporting the results of this article can be found as Additional files.
George AM, Jones PM. Perspectives on the structure-function of ABC transporters: the switch and constant contact models. Prog Biophys Mol Bio. 2012;109(3):95–107.
Gottesman MM, Paterson JK, Chen KG, Annereau JP, Szakacs G. New ABC transporters associated with multidrug resistance in cancer. Febs J. 2005;272:206.
Dawson RJP, Hollenstein K, Locher KP. Uptake or extrusion: crystal structures of full ABC transporters suggest a common mechanism. Mol Microbiol. 2007;65(2):250–7.
Higgins CF, Linton KJ. The ATP switch model for ABC transporters. Nat Struct Mol Biol. 2004;11(10):918–26.
Loo TW, Bartlett MC, Clarke DM. The “LSGGQ” motif in each nucleotide-binding domain of human P-glycoprotein is adjacent to the opposing Walker A sequence. J Biol Chem. 2002;277(44):41303–6.
Davidson AL, Dassa E, Orelle C, Chen J. Structure, function, and evolution of bacterial ATP-binding cassette systems. Microbiol Mol Biol R. 2008;72(2):317–64.
Licht A, Schneider E. ATP binding cassette systems: structures, mechanisms, and functions. Cent Eur J Biol. 2011;6(5):785–801.
Rea PA. Plant ATP-binding cassette transporters. Annu Rev Plant Biol. 2007;58:347–75.
Verrier PJ, Bird D, Buria B, Dassa E, Forestier C, Geisler M, et al. Plant ABC proteins - a unified nomenclature and updated inventory. Trends Plant Sci. 2008;13(4):151–9.
Sanchez-Fernandez R, Davies TGE, Coleman JOD, Rea PA. The Arabidopsis thaliana ABC protein superfamily, a complete inventory. J Biol Chem. 2001;276(32):30231–44.
Dean M, Allikmets R. Complete characterization of the human ABC gene family. J Bioenerg Biomembr. 2001;33(6):475–9.
Dean M. Genetics of ATP-binding cassette transporters. Method Enzymol. 2005;400:409–29.
Jasinski M, Ducos E, Martinoia E, Boutry M. The ATP-binding cassette transporters: structure, function, and gene family comparison between. Plant Physiol. 2003;131:1169–77.
Matsuda S, Funabiki A, Furukawa K, Komori N, Koike M, Tokuji Y, et al. Genome-wide analysis and expression profiling of half-size ABC protein subgroup G in rice in response to abiotic stress and phytohormone treatments. Mol Genet Genomics. 2012;287:819–35.
Çakır B, Kılıçkaya O. Whole-genome survey of the putative ATP-binding cassette transporter family genes in Vitis vinifera. PLoS One. 2013;8:e78860.
Kaminski WE, Piehler A, Wenzel JJ. ABC A-subfamily transporters: structure, function and disease. Bba-Mol Basis Dis. 2006;1762(5):510–24.
Kato T, Tabata S, Sato S. Analyses of expression and phenotypes of knockout lines for Arabidopsis ABCF subfamily members. Plant Biotechnol-Nar. 2009;26(4):409–14.
Sato S, Tabata S, Hirakawa H, Asamizu E, Shirasawa K, Isobe S, et al. The tomato genome sequence provides insights into fleshy fruit evolution. Nature. 2012;485(7400):635–41.
Andolfo G, Sanseverino W, Rombauts S, Van de Peer Y, Bradeen JM, Carputo D, et al. Overview of tomato (Solanum lycopersicum) candidate pathogen recognition genes reveals important Solanum R locus dynamics. New Phytol. 2013;197(1):223–37.
Nei M, Gojobori T. Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol Biol Evol. 1986;3(5):418–26.
Pond SLK, Frost SDW. A genetic algorithm approach to detecting lineage-specific variation in selection pressure (vol 22, pg 478, 2005). Mol Biol Evol. 2005;22(4):1157.
Kingsolver JG, Hoekstra HE, Hoekstra JM, Berrigan D, Vignieri SN, Hill CE, et al. The strength of phenotypic selection in natural populations. Am Nat. 2001;157(3):245–61.
Rocha EPC, Smith JM, Hurst LD, Holden MTG, Cooper JE, Smith NH, et al. Comparisons of dN/dS are time dependent for closely related bacterial genomes. J Theor Biol. 2006;239(2):226–35.
He L, Vasiliou K, Nebert DW. Analysis and update of the human solute carrier (SLC) gene superfamily. Hum Genomics. 2009;3(2):195–206.
Jin W, Wu DD, Zhang X, Irwin DM, Zhang YP. Positive selection on the gene RNASEL: correlation between patterns of evolution and function. Mol Biol Evol. 2012;29(10):3161–8.
Takanashi K, Sugiyama A, Sato S, Tabata S, Yazaki K. LjABCB1, an ATP-binding cassette protein specifically induced in uninfected cells of Lotus japonicus nodules. J Plant Physiol. 2012;169(3):322–6.
Molesini B, Pandolfini T, Pii Y, Korte A, Spena A. Arabidopsis thaliana AUCSIA-1 regulates Auxin biology and physically interacts with a kinesin-related protein. PLoS One. 2012; 7
Noh B, Murphy AS, Spalding EP. Multidrug resistance-like genes of Arabidopsis required for auxin transport and auxin-mediated development. Plant Cell. 2001;13(11):2441–54.
Martinoia E, Klein M, Geisler M, Bovet L, Forestier C, Kolukisaoglu U, et al. Multifunctionality of plant ABC transporters - more than just detoxifiers. Planta. 2002;214(3):345–55.
Geisler M, Murphy AS. The ABC of auxin transport: the role of p-glycoproteins in plant development. Febs Lett. 2006;580(4):1094–102.
Chen S, Sánchez-Fernández R, Lyver ER, Dancis A, Rea PA. Functional characterization of AtATM1, AtATM2, and AtATM3, a subfamily of Arabidopsis half-molecule ATP-binding cassette transporters implicated in iron homeostasis. J Biol Chem. 2007;282:21561–71.
Larsen PB, Cancel J, Rounds M, Ochoa V. Arabidopsis ALS1 encodes a root tip and stele localized half type ABC transporter required for root growth in an aluminum toxic environment. Planta. 2007;225(6):1447–58.
Kolukisaoglu HU, Bovet L, Klein M, Eggmann T, Geisler M, Wanke D, et al. Family business: the multidrug-resistance related protein (MRP) ABC transporter genes in Arabidopsis thaliana. Planta. 2002;216(1):107–19.
Shi Z, Peng XX, Kim IW, Shukla S, Si QS, Robey RW, et al. Erlotinib (Tarceva, OSI-774) antagonizes ATP-binding cassette subfamily B member 1 and ATP-binding cassette subfamily G member 2-mediated drug resistance. Cancer Res. 2007;67(22):11012–20.
van den Brule S, Smart CC. The plant PDR family of ABC transporters. Planta. 2002;216(1):95–106.
Frelet-Barrand A, Kolukisaoglu HU, Plaza S, Ruffer M, Azevedo L, Hortensteiner S, et al. Comparative mutant analysis of arabidopsis ABCC-type ABC transporters: AtMRP2 contributes to detoxification, vacuolar organic anion transport and chlorophyll degradation. Plant Cell Physiol. 2008;49(4):557–69.
Shiratake K, Martinoia E. Transporters in fruit vacuoles. Plant Biotechnol. 2007;127–33
Jaquinod M, Villiers F, Kieffer-Jaquinod S, Hugouvieu V, Bruley C, Garin J, et al. A proteomics dissection of Arabidopsis thaliana vacuoles isolated from cell culture. Mol Cell Proteomics. 2007;6(3):394–412.
Suh SJ, Wang YF, Frelet A, Leonhardt N, Klein M, Forestier C, et al. The ATP binding cassette transporter AtMRP5 modulates anion and calcium channel activities in Arabidopsis guard cells. J Biol Chem. 2007;282(3):1916–24.
Shani N, Valle D. Peroxisomal ABC transporters. In: Abc transporters: biochemical, cellular, and molecular aspects. 292nd ed. 1998. p. 753–76.
Footitt S, Slocombe SP, Larner V, Kurup S, Wu YS, Larson T, et al. Control of germination and lipid mobilization by COMATOSE, the Arabidopsis homologue of human ALDP. Embo J. 2002;21(12):2912–22.
Morita M, Shimozawa N, Kashiwayama Y, Suzuki Y, Imanaka T. ABC subfamily D proteins and very long chain fatty acid metabolism as novel targets in adrenoleukodystrophy. Curr Drug Targets. 2011;12(5):694–706.
Theodoulou FL, Holdsworth M, Baker A. Peroxisomal ABC transporters. Febs Lett. 2006;580(4):1139–55.
Hooks MA, Turner JE, Murphy EC, Johnston KA, Burr S, Jaroslawski S. The Arabidopsis ALDP protein homologue COMATOSE is instrumental in peroxisomal acetate metabolism. Biochem J. 2007;406:399–406.
Braz ASK, Finnegan J, Waterhouse P, Margis R. A plant orthologue of RNase L inhibitor (RLI) is induced in plants showing RNA interference. J Mol Evol. 2004;59(1):20–30.
Bairoch A. Prosite - a dictionary of sites and patterns in proteins. Nucleic Acids Res. 1992;20:2013–8.
Sarmiento C, Nigul L, Kazantseva J, Buschmann M, Truve E. AtRLI2 is an endogenous suppressor of RNA silencing. Plant Mol Biol. 2006;61(1–2):153–63.
Zeng W, Brutus A, Kremer JM, Withers JC, Gao X, Da Jones AD, et al. A genetic screen reveals Arabidopsis Stomatal and/or apoplastic defenses against pseudomonas syringae pv. tomato DC3000. PLoS Pathog. 2011; 7.
Bird D, Beisson F, Brigham A, Shin J, Greer S, Jetter R, et al. Characterization of Arabidopsis ABCG11/WBC11, an ATP binding cassette (ABC) transporter that is required for cuticular lipid secretion. Plant J. 2007;52(3):485–98.
Pighin JA, Zheng HQ, Balakshin LJ, Goodman IP, Western TL, Jetter R, et al. Plant cuticular lipid export requires an ABC transporter. Science. 2004;306(5696):702–4.
Kim DY, Bovet L, Maeshima M, Martinoia E, Lee Y. The ABC transporter AtPDR8 is a cadmium extrusion pump conferring heavy metal resistance. Plant J. 2007;50(2):207–18.
Stein M, Dittgen J, Sanchez-Rodriguez C, Hou BH, Molina A, Schulze-Lefert P, et al. Arabidopsis PEN3/PDR8, an ATP binding cassette transporter, contributes to nonhost resistance to inappropriate pathogens that enter by direct penetration. Plant Cell. 2006;18(3):731–46.
Ruocco M, Ambrosino P, Lanzuise S, Woo SL, Lorito M, Scala F. Four potato (Solanum tuberosum) ABCG transporters and their expression in response to abiotic factors and Phytophthora infestans infection. J Plant Physiol. 2011;168(18):2225–33.
Mentewab A, Stewart CN. Overexpression of an Arabidopsis thaliana ABC transporter confers kanamycin resistance to transgenic plants. Nat Biotechnol. 2005;23:1177–80.
Luo B, Xue XY, Hu WL, Wang LJ, Chen XY. An ABC transporter gene of Arabidopsis thaliana, AtWBC11, is involved in cuticle development and prevention of organ fusion. Plant Cell Physiol. 2007;48(12):1790–802.
Kuromori T, Miyaji T, Yabuuchi H, Shimizu H, Sugimoto E, Kamiya A, et al. ABC transporter AtABCG25 is involved in abscisic acid transport and responses. Proc Natl Acad Sci U S A. 2010;107(5):2361–6.
Ukitsu H, Kuromori T, Toyooka K, Goto Y, Matsuoka K, Sakuradani E, et al. Cytological and biochemical analysis of COF1, an Arabidopsis mutant of an ABC transporter gene. Plant Cell Physiol. 2007;48(11):1524–33.
Kang J, Hwang JU, Lee M, Kim YY, Assmann SM, Martinoia E, et al. PDR-type ABC transporter mediates cellular uptake of the phytohormone abscisic acid. Proc Natl Acad Sci U S A. 2010;107(5):2355–60.
Campbell EJ, Schenk PM, Kazan K, Penninckx IAMA, Anderson JP, Maclean DJ, et al. Pathogen-responsive expression of a putative ATP-binding cassette transporter gene conferring resistance to the diterpenoid sclareol is regulated by multiple defense signaling pathways in Arabidopsis. Plant Physiol. 2003;133(3):1272–84.
Galbiati M, Simoni L, Pavesi G, Cominelli E, Francia P, Vavasseur A, et al. Gene trap lines identify Arabidopsis genes expressed in stomatal guard cells. Plant J. 2008;53(5):750–62.
Zhang JZ. Evolution by gene duplication: an update. Trends Ecol Evol. 2003;18(6):292–8.
Taylor JS, Raes J. Duplication and divergence: the evolution of new genes and old ideas. Annu Rev Genet. 2004;38:615–43.
Snider J, Hanif A, Lee ME, Jin K, Yu AR, Graham C, et al. Mapping the functional yeast ABC transporter interactome. Nat Chem Biol. 2013;9(9):565–U564.
Lee M, Lee K, Lee J, Noh EW, Lee Y. AtPDR12 contributes to lead resistance in arabidopsis. Plant Physiol. 2005;138(2):827–36.
Orsi CH, Tanksley SD. Natural variation in an ABC transporter gene associated with seed size evolution in tomato species. PLoS Genet. 2009; 5
Takuno S, Nishio T, Satta Y, Innan H. Preservation of a pseudogene by gene conversion and diversifying selection. Genetics. 2008;180(1):517–31.
Kliebenstein D, Lambrix V, Reichelt M, Gershenzon J, Mitchell-Olds T. Gene duplication and the diversification of secondary metabolism: side chain modification of glucosinolates in Arabidopsis thaliana. Plant Cell. 2001;13:681–93.
Phytozome v9.1. www.phytozome.net.
The Arabidopsis Information Resource (TAIR). https://www.arabidopsis.org.
Tomato Genome Sequencing Consortium (TGC). http://solgenomics.net.
Saccharomyces Genome Database (SGD). http://www.yeastgenome.org.
Karlin S, Altschul SF. Methods for assessing the statistical significance of molecular sequence features by using general scoring schemes. Proc Natl Acad Sci U S A. 1990;87(6):2264–8.
Schultz J, Milpetz F, Bork P, Ponting CP. SMART, a simple modular architecture research tool: identification of signaling domains. Proc Natl Acad Sci U S A. 1998;95:5857–64.
Tamura K, Peterson D, Peterson N, Stecher G, Nei M, Kumar S. MEGA5: Molecular Evolutionary Genetics Analysis using maximum likelihood, evolutionary distance, and maximum parsimony methods. Mol Biol Evol. 2011;28(10):2731–9.
Nei M, Kumar S, Takahashi K. The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small. Proc Natl Acad Sci U S A. 1998;95(21):12390–7.
Felsenstein J. Confidence-limits on phylogenies - an approach using the bootstrap. Evolution. 1985;39(4):783–91.
Thompson JD, Higgins DG, Gibson TJ. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994;22:4673–80.
Delport W, Poon AFY, Frost SDW, Pond SLK. Datamonkey 2010: a suite of phylogenetic analysis tools for evolutionary biology. Bioinformatics. 2010;26(19):2455–7.
A language and environment for statistical computing. R Foundation for statistical computing. http://www.r-project.org.
We sincerely acknowledge Dr. Roberta Marotta for annotation support.
This work was supported by the Ministry of University and Research (GenHORTH project).
The authors declare that they have no competing interests.
GA was involved in experiment design, analysis and interpretation of data and in manuscript writing; MR was involved in conception of the study, in gene sequence analysis, and in manuscript drafting; AD contributed to the gene annotation process and to phylogenetic analysis; LF was involved in data interpretation and in discussion of results; ML was involved in drafting and in critical revision of the manuscript; FS was involved in experiment design and in critical revision of the manuscript; MRE conceived the study and was mainly involved in interpretation of data and in manuscript writing. All authors read and approved the final manuscript.
ABC proteins identified and classified. The nomenclature of proteins within each subfamily is listed under the Human Genome Organization (HUGO) nomenclature.
List of annotated ATP-binding cassette transporter proteins.
Sequences of annotated ABC transporters in FASTA format. All identified ABC transporter members are included in this file.
Phylogenetic tree of ABCB proteins.
Phylogenetic tree of ABCGWBC proteins.
Highly conserved amino acid region among the ABCB members of clade 11 in the phylogenetic analysis.
Phylogenetic tree of ABC-C proteins.
Reconstruction of ABC gene duplication events in Arabidopsis thaliana.
Reconstruction of ABC gene duplication events in Oryza sativa.
Reconstruction of ABC gene duplication events in Saccharomyces cerevisiae.
Reconstruction of ABC gene duplication events in Solanum lycopersicum.
Reconstruction of ABC gene duplication events in Solanum tuberosum.
Reconstruction of ABC gene duplication events in Volvox carteri.
Reconstruction of ABC gene duplication events in Vitis vinifera.
Expression profiles of tomato ABCs. Heat map of RNA-seq expression data from root, leaf, bud, flower and 3cm_fruit. The expression values are measured as reads per kilobase of the exon model per million mapped reads (RPKM), and subsequently normalized with TPM (transcripts per million). The colour key indicates the level of gene expression, from red (few reads) to yellow (many reads).
Expression levels of tomato ABC genes involved in recent duplication events. For each gene ID are reported: phylogenetic clade name of gene duplication event (GDE); expression values (TPM) from root, leaf, bud, flower and 3cm_fruit and ABC subfamily.
Keywords
- ATP-binding cassette transporters
- Multidrug resistance
- Arabidopsis thaliana
- Oryza sativa
- Solanum lycopersicum
- Solanum tuberosum
- Vitis vinifera
- Volvox carteri
- Saccharomyces cerevisiae
- Gene duplication
- Evolutionary dynamics