Phylogenetic analysis and classification of the Brassica rapa SET-domain protein family
© Huang et al; licensee BioMed Central Ltd. 2011
Received: 4 November 2011
Accepted: 14 December 2011
Published: 14 December 2011
The SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain is an evolutionarily conserved sequence of approximately 130-150 amino acids, and constitutes the catalytic site of lysine methyltransferases (KMTs). KMTs perform many crucial biological functions via histone methylation of chromatin. Histone methylation marks are interpreted differently depending on the histone type (i.e. H3 or H4), the lysine position (e.g. H3K4, H3K9, H3K27, H3K36 or H4K20) and the number of added methyl groups (i.e. me1, me2 or me3). For example, H3K4me3 and H3K36me3 are associated with transcriptional activation, but H3K9me2 and H3K27me3 are associated with gene silencing. The substrate specificity and activity of KMTs are determined by sequences within the SET domain and other regions of the protein.
Here we identified 49 SET-domain proteins from the recently sequenced Brassica rapa genome. We performed sequence similarity and protein domain organization analysis of these proteins, along with the SET-domain proteins from the dicot Arabidopsis thaliana, the monocots Oryza sativa and Brachypodium distachyon, and the green alga Ostreococcus tauri. We showed that plant SET-domain proteins can be grouped into 6 distinct classes, namely KMT1, KMT2, KMT3, KMT6, KMT7 and S-ET. Apart from the S-ET class, which has an interrupted SET domain and may be involved in methylation of nonhistone proteins, the other classes have characteristics of histone methyltransferases exhibiting different substrate specificities: KMT1 for H3K9, KMT2 for H3K4, KMT3 for H3K36, KMT6 for H3K27 and KMT7 also for H3K4. We also propose a coherent and rational nomenclature for plant SET-domain proteins. Comparisons of sequence similarity and synteny of B. rapa and A. thaliana SET-domain proteins revealed recent gene duplication events for some KMTs.
This study provides the first characterization of the SET-domain KMT proteins of B. rapa. Phylogenetic analysis data allowed the development of a coherent and rational nomenclature of this important family of proteins in plants, as in animals. The results obtained in this study will provide a base for nomenclature of KMTs in other plant species and facilitate the functional characterization of these important epigenetic regulatory genes in Brassica crops.
Epigenetic regulation acts through heritable changes in genome function that occur without a change in DNA sequence. One well-known epigenetic mechanism is through posttranslational covalent modifications of histones; these modifications include acetylation, methylation, ubiquitylation and others, and form the basis of the 'histone code' for gene regulation . Histone lysine methylation plays a pivotal role in a wide range of cellular processes including heterochromatin formation, transcriptional regulation, parental imprinting, and cell fate determination . At least six lysine residues, five on histone H3 (K4, K9, K27, K36, K79) and one on H4 (K20), are subject to methylation. Each lysine can carry one, two or three methyl residue(s), known as mono-, di- and tri-methylation, respectively. In general, di-/tri- methylation of H3K4 and H3K36 correlates with transcriptional activation, whereas di-methylation of H3K9 and trimethylation of H3K27 correlates with gene silencing in plants and animals [2, 3].
All known lysine methylation modifications, with the exception of H3K79 methylation, are carried out by methyltransferases that contain an evolutionarily conserved SET domain, named after three Drosophila genes (Su(var), E(z), and Trithorax) . The SET domain encompasses approximately 130-150 amino acids that form a knot-like structure and constitute the enzyme catalytic site for lysine methylation . In addition to the SET domain, flanking sequences, more distant protein domains, and possibly some cofactors are also important for enzyme activity and specificity. The genes encoding SET-domain proteins are ancient, existing in prokaryotes and eukaryotes, but have proliferated and evolved novel functions connected with the appearance of eukaryotes .
The first plant genes encoding SET-domain proteins to be genetically characterized were CURLY LEAF (CLF) and MEDEA (MEA) in Arabidopsis thaliana [7, 8]. Chromatin-binding properties and histone methylation activity of plant SET-domain proteins were first reported for tobacco NtSET1 and Arabidopsis KRYPTONITE (KYP) [9, 10]. Phylogenetic analysis of plant SET domain proteins has proven helpful as a guide for genetic and molecular studies of this large family of proteins [11, 12]. To date, some of the Arabidopsis SET-domain family members have been characterized and shown to play crucial functions in diverse processes including flowering time control, cell fate determination, leaf morphogenesis, floral organogenesis, parental imprinting and seed development [3, 13–15].
Genome sequences of an increasing number of plant species, in addition to the model plants (Arabidopsis thaliana, Oryza sativa, and Brachypodium distachyon), have also been completed. Other Brassica species are of particular interest because of their agro-economical importance and their close relationship with Arabidopsis, thus providing insights into recent SET-domain gene amplification during evolution of Brassica species. Here, we identified and analyzed 49 SET-domain proteins from the recently completed Brassica rapa whole genome sequence . Our data provide a platform for future functional characterization of these important epigenetic regulatory genes in Brassica species.
Identification of SET-domain proteins from the B. rapagenome
List of green lingeage SET-domain proteins analyzed in this study
B. rapaSET-domain proteins can be grouped into six classes
The KMT1A subclass proteins
The plant KMT1A subclass proteins show high sequence similarity to the animal KMT1 proteins both within the SET domain and in the surrounding regions known as the Pre-SET and post-SET domains. Additionally, most of the plant proteins contain a specific domain named SRA (SET and RING associated). Similar to previously studied Arabidopsis proteins [12, 20], most of the BrKMT1A proteins also contain SRA, Pre-SET, SET and post-SET domains (Figure 2). These domains are missing in some of the Group-4 proteins; for example, BrKMT1A.4e, BrKMT1A;4f and BrKMT1A;4 g lack a Post-SET domain, and BrKMT1A.4 h lacks SRA, Pre-SET and post-SET domains (Figure 2). Several functions have been reported for SRA domains, including binding with the N-terminal tail of histone H3 and with DNA cytosine methylation . The crystal structure of AtKMT1A;3a/SDG9/SUVH5 revealed that SRA recognizes the methylation status of CG and CHH sequences . The Pre-SET domain contains 9 conserved cysteines. The Post-SET domain is a small cysteine-rich region often found at the C-terminal side of SET domains. Both Pre-SET and Post-SET domains have been shown to affect histone methyltransferase activity of the SET domain [23, 24].
Members of the plant KMT1A subclass, like animal KMT1 proteins, are likely to be responsible for H3K9 methylation, an epigenetic mark involved in heterochromatin formation and gene silencing. Consistent with this, analysis of AtKMT1A;1/SDG33/SUVH4/KYP, AtKMT1A;2a/SDG3/SUVH2, AtKMT1A;3a/SDG9/SUVH5 and AtKMT1A;3b/SDG23/SUVH6 has revealed their important roles in H3K9 methylation, in heterochromatic gene silencing and in cross-talk between H3K9 and DNA methylation [9, 21, 22, 25–28]. Work in rice also confirmed that several members of this subclass are involved in H3K9 methylation and in transposon silencing [29–31]. Some of the BrKMT1A genes might also have similar functions.
The KMT1B subclass proteins
The KMT1B subclass differs from the KMT1A subclass in protein domain organization; specifically, these proteins lack the SRA domain (Figure 3). A recent study demonstrated that AtKMT1B;1/SDG31/SUVR4 possesses H3K9-methyltransferase activities and its binding with ubiquitin converts H3K9me1 to H3K9me3 deposition on transposon chromatin . Notably, the WIYLD domain, which binds ubiquitin, is conserved in BrKMT1B;1a, BrKMT1B;1b, BrKMT1B;2a and BrKMT1B;2b (Figure 3). It was reported that AtKMT1B;3/SDG6/SUVR5/AtCZS is involved in regulation of flowering time, possibly through deposition of H3K9 methylation at the flowering time repressor FLC . The functions of other members of the KMT1B subclass remain uncharacterized so far.
The KMT2 class proteins
KMT2 Group-1 members contain one PWWP, one FYR and two PHD domains. Only one member belonging to Group-1 is found in O. sativa or B. distachyon, but two members are found in B. rapa and A. thaliana. Our examination revealed that BrKMT2;1a has synteny with AtKMT2;1a/SDG27/ATX1, and BrKMT2;1b with AtKMT2;1b/SDG30/ATX2, suggesting that they are derived from two different ancestral copies before the B. rapa/A. thaliana separation. Consistent with this, the atx1 mutant plants exhibit strong and pleiotropic defects , but the atx2 mutant plants have a normal phenotype . The atx2 mutation can enhance atx1 in reduction of expression of the flowering repressor gene FLC through reduced levels of H3K4me3 at the FLC locus .
The PWWP and FYR domains are absent from the Group-2 members and the PHD domain is found only in some monocot proteins (Figure 4). Only one Group-2 representative member is found in the dicot species B. rapa or A. thaliana, but the monocot O. sativa has two members and B. distachyon has three members. The B. rapa and A. thaliana proteins, as well as one member each from O. sativa and B. distachyon, contain a GYF domain in the N-terminal part of the protein (Figure 4). The fact that this domain is conserved in KMT2;2 proteins from all four higher plant species suggests that the acquisition of the GYF domain occurred before the monocot/dicot separation and may have a conserved function in higher plants. Genetic analysis demonstrated that AtKMT2;2/SDG25/ATXR7 is necessary in preventing early flowering [43, 44]. The recombinant AtKMT2;2/SDG25/ATXR7 protein was shown to methylate histone H3 in vitro and the depletion of AtKMT2;2/SDG25/ATXR7 in planta slightly reduced H3K4 and H3K36 methylation at FLC chromatin [43, 44]. OsKMT2;2b/SDG732 and BdKMT2;2c contain two PHD domains, but BdKMT2;2b, like the yeast protein ScKMT2/Set1, does not contain a recognizable PHD domain. Future study of these monocot proteins will likely provide a deeper understanding of the domain evolution of KMT2 proteins.
The Group-3 KMT2 proteins have a domain organization more similar to Group-1 except that they lack the FYR domain (Figure 4). B. rapa and A. thaliana both have three Group-3 members, but each of the other two higher plant species has only two Group-3 members. Synteny was observed for BrKMT2;3c with AtKMT2;3c/SDG29/ATX5 but not with AtKMT2;3b/SDG16/ATX4, suggesting that AtKMT2;3b/SDG16/ATX4 was derived from a relatively recent duplication event. This is in agreement with a previous study revealing that AtKMT2;3b/SDG16/ATX4 and AtKMT2;3c/SDG29/ATX5 are collinearly duplicated with AtKMT2;1a/SDG27/ATX1 and AtKMT2;1b/SDG30/ATX2 . To date, none of the Group-3 proteins has been functionally characterized.
The KMT3 class proteins
The KMT3 class plant proteins share high sequence similarity and share the AWS (a subdomain of Pre-SET, ), SET and Post-SET domain organization with the yeast H3K36-methyltransferase ScKMT3/Set2 (Figure 5). The Group-1 proteins have a long sequence and contain an additional CW domain specific to this group. The CW domain of AtKMT3;1/SDG8/ASHH2/EFS/CCR1 was recently shown to bind H3K4me1/me2 , suggesting a novel link between H3K4 and H3K36 methylation in plants. AtKMT3;1/SDG8/ASHH2/EFS/CCR1 is the major H3K36-methyltransferase specifically required for H3K36me2 and H3K36me3 deposition, and activates expression of hundreds of genes including FLC and MAFs . Depletion of AtKMT3;1/SDG8/ASHH2/EFS/CCR1 causes pleiotropic phenotypes, including early flowering, reduced organ size, increased shoot branching, perturbed fertility and carotenoid composition, and impaired plant defenses against pathogens [47–54]. The other group of KMT3 plant proteins have a shorter sequence and do not contain the CW domain; interestingly the depletion of AtKMT3;2/SDG26/ASHH1 resulted in a late-flowering phenotype associated with elevated levels of FLC expression . The Group-3 KMT3 proteins, with the exception of BdKMT3;3b and OsKMT3;3b/SDG707, contain a PHD domain; and AtKMT3;3/SDG4/ASHR3 was reported to be involved in pollen and stamen development possibly through mediating H3K4me2 and H3K36me3 deposition [55, 56]. The functions of the Group-4 proteins remain unexamined so far. Examination of this group in B. rapa could be a challenge because of gene multiplication and more diverged sequences (Figure 5).
The KMT6A subclass proteins
The SANT (SWI3, ADA2, N-CoR, and TFIIIB DNA-binding) domain is found in most of the plant KMT6A subclass proteins. This domain is also found in a number of other chromatin remodeling proteins with multiple activities such as DNA-binding, histone tail binding, and protein-protein interactions . Nevertheless, the precise role of the SANT domain in KMT6A proteins is currently unknown. Notably, BrKMT6A;3b does not contain a SANT domain. As expected from restricted AtKMT6A;3/SDG5/MEA expression in only a small number of cells during reproduction, expression of both BrKMT6A;3a and BrKMT6A;3b is barely detectable in the examined tissues (Additional File 3: Figure S2). It will be interesting to investigate BrKMT6A;3a and BrKMT6A;3b expression during reproduction and to examine whether both genes are functionally important.
The KMT6B subclass proteins
The KMT6B subclass includes two members each in A. thaliana, B. distachyon and O. sativa, and four members in B. rapa, which together can be divided into two distinct groups (Figure 6b). The two members from A. thaliana, AtKMT6B;1/SDG15/ATXR5 and AtKMT6B;2/SDG34/ATXR6, were classified as trithorax-related in the first genome analysis of Arabidopsis SET-domain proteins . However, our study as well as two previous studies that included a more complete set of plant SET-domain proteins clearly show that AtKMT6B;1/SDG15/ATXR5 and AtKMT6B;2/SDG34/ATXR6 belong to the KMT6B subclass (Figure 1) [12, 20]. Consistent with this, functional analysis revealed that AtKMT6B;1/SDG15/ATXR5 and AtKMT6B;2/SDG34/ATXR6 are involved in monomethylation of H3K27 . They appear to act redundantly, because depletion of H3K27 monomethylation is only detectable in the atxr5 atxr6 double mutant . KMT6A-mediated H3K27me3 is mainly present in euchromatic regions and is important for gene silencing , but KMT6B-mediated H3K27me1 is found in heterochromatic chromocenters and is important for heterochromatin condensation and replication in Arabidopsis .
Distinct from KMT6A proteins containing a SANT domain, many plant KMT6B subclass proteins contain a PHD domain (Figure 6). The PHD domain of both AtKMT6B;1/SDG15/ATXR5 and AtKMT6B;2/SDG34/ATXR6 strongly bind unmethylated H3 tail peptides (amino acids 1-21), and this binding is negatively affected by methylation on H3K4 . This binding preference may help to assure that these KMT6B proteins are not targeted to euchromatin and active genes enriched in H3K4 methylation. Remarkably, both Group-1 and Group-2 members are duplicated in B. rapa. Both BrKMT6B;1a and BrKMT6B;1b are syntenic with AtKMT6B;1/SDG15/ATXR5, and both BrKMT6B;2a and BrKMT6B;2b with AtKMT6B;2/SDG34/ATXR6. Expression of BrKMT6B;1a, BrKMT6B;2a and BrKMT6B;2b was detected in different tissues, but we failed to detect BrKMT6B;1b expression (Additional File 3: Figure S2). It is reasonable to speculate that BrKMT6B;1a, BrKMT6B;2a and BrKMT6B;2b might have redundant functions.
The KMT7 class proteins
Over last 10 years, a number of SET-domain genes in Arabidopsis and in rice have been characterized and shown to exert crucial chromatin-based functions via histone methylation during plant growth and development [3, 15]. However, the nomenclature of plant SET-domain proteins remains complex, and multiple synonyms exist for many Arabidopsis proteins (Table 1), which could cause considerable confusion in this important field. Nomenclature based on sequence similarity has several advantages, informing the prediction of KMT enzyme substrate specificity for its histone lysine residue and providing a global view of KMT types in an organism once its whole genome sequence becomes available. However, the SDG nomenclature failed to provide information concerning enzyme substrate specificity and the number following "SDG" could be long and difficult to remember [12, 63], e.g., the first SDG from B. rapa (ID = 197) would have been named SDG19701. While the nomenclature by Baumbusch and colleagues provided information about homology to animal proteins, an incomplete list of Arabidopsis SET-domain proteins and the limitation (at that time) of having only one plant species with a genome-wide analysis restricted the precision and correctness of phylogenetic grouping in this study . In addition, animal KMT nomenclature had also been noncoherent; a rational nomenclature was proposed only recently . Therefore, the nomenclature we propose here is in line with the latest advances in the field.
We identified 49 SET-domain proteins from the recently completed whole genome sequence of B. rapa. Among them, 5 proteins belong to the S-ET class likely involved in nonhistone protein methylation, 20 proteins belong to the KMT1 class potentially involved in H3K9 methylation, 6 proteins belong to the KMT2 class potentially involved in H3K4 methylation, 7 proteins belong to the KMT3 class potentially involved in H3K36 methylation, 8 proteins belong to the KMT6 class potentially involved in H3K27 methylation, and 3 belong to the KMT7 class also potentially involved in H3K4 methylation. This in silico survey is useful for future functional analysis of this important family of epigenetic regulators in Brassica. H4K20 methylation was detected in Arabidopsis using antibodies [28, 65], but the catalyzing enzyme(s) involved is(are) not yet known and the current phylogenetic analysis did not allow prediction of a specific KMT class involved in H4K20 methylation. It is possible that some members of the aforementioned KMT classes catalyze H4K20 methylation. The total number of KMTs in B. rapa (49) is slightly higher than that identified in A. thaliana (37), O. sativa (36) and B. distachyon (41). Nevertheless, we could not exclude the possibility that a few more KMTs may be missing from the currently available genome sequence of B. rapa.
Gene duplication is one of the primary driving forces in the evolution of genomes and genetic systems, and is considered to be a major mechanism for the establishment of new gene functions and the generation of evolutionary novelty [66, 67]. Contrary to what would be expected from the chromosome number duplication in B. rapa compared to A. thaliana, the number of KMT genes in B. rapa (49) is much less than double the number of A. thaliana KMTs (74). Many duplicated genes show synteny with their A. thaliana homologues, suggesting that they are derived from chromosome/genome segment duplications. Three alternative outcomes can occur in the evolution of duplicated genes: (i) one copy may simply become silenced by degenerative mutations (nonfunctionalization); (ii) one copy may acquire a novel, beneficial function and become preserved by natural selection (neofunctionalization); (iii) both copies may become partially compromised by mutation so that their total capacity adds up to the capacity of the single-copy ancestral gene (subfunctionalization) . These different outcomes likely apply to different duplicated KMT genes, judging from their expression patterns (Additional File 3: Figure S2). Expression of BrKMT1A;4c and BrKMT6B;1b was undetectable, suggesting that they might have been nonfunctionalized. The duplicated pairs BrKMT1B;2a and BrKMT1B;2b, BrKMT3;4b and BrKMT3;4d, or BrKMT7;1a and BrKMT7;1b are differentially expressed in plant organs, suggesting that they might have acquired distinct tissue-specific functions. Finally, expression of some duplicated genes, e.g. BrKMT1A;2a and BrKMT1A;2c, BrKMT1B;1a and BrKMT1B;1b, or BrKMT6A;3a and BrKMT6A;3b showed similar patterns, suggesting that they might be subfunctionalized and/or have redundant functions.
Among the groups showing gene duplications in B. rapa, it is worth to note that in Arabidopsis, AtKMT6A;3/SDG5/MEA is critical for parental gene imprinting and seed development [8, 68], AtKMT6B;1/SDG15/ATX5 and AtKMT6B;2/SDG34/ATX6 are important for heterochromatin condensation and replication in Arabidopsis [59, 60], and AtKMT7;1/SDG2/ATXR3 is essential for both sporophyte and gametophyte development [61, 62]. It will be of great interest to investigate these groups of genes for their regulation and function in chromatin organization, plant growth and development in B. rapa.
Our study shows that the plant SET-domain KMT proteins can be phylogenetically grouped into distinct classes and that the classes involved in histone methylation can be named in accordance with the nomenclature proposed for animal and yeast SET-domain KMTs. Such a coherent and rational nomenclature in different organisms will help avoid confusion caused by the existence of multiple names for the same protein or gene. The information provided on the B. rapa KMTs will also be beneficial for future research to unravel the mechanisms of epigenetic regulation in Brassica crops.
SET-domain protein identification
Sequences of SET-domain proteins from A. thaliana, O. sativa, O. tauri and S. cerevisiae were retrieved from the Chromatin Database with the key word SDG in species database respectively (ChromDB, http://www.chromdb.org). These sequences, primarily those from A. thaliana and O. sativa, were used as queries to search the B. distachyon genome (http://www.brachypodium.org) and the B. rapa genome (http://brassicadb.org/brad/index.php) by using the BLASTp and tBLASTn tools (http://blast.ncbi.nlm.nih.gov). The Expect threshold was set at 1.0 and other parameters were set at default values. We did not use a strict E-value threshold; rather we examined each of the resulting hits for the presence of the SET or S-ET domain to collect previously unidentified sequences. The synteny analysis was performed using the online viewer tool (http://brassicadb.org/brad/index.php). ESTs of the B. rapa SET-domain protein genes were retrieved from the Brassica Database (http://brassicadb.org/brad/index.php) and from NCBI (http://blast.ncbi.nlm.nih.gov), using an Expect threshold of 1, and a minimum sequence length of 50 bp.
Protein domain organization analysis
The protein sequences were analyzed for domain organization using NCBI-CD searches (http://ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi). The low-complexity filter was turned off, and the Expect value was set at 1.0 to detect short domains or regions of less conservation in this analysis. Domains were also verified and named according to the SMART database (http://smart.embl-heidelberg.de/).
Multiple sequence alignments of SET-domain sequences were performed using the ClustalW program . The resulting file was subjected to phylogenic analysis using the MEGA4.0 program . The trees were constructed with the following settings: Tree Inference as Neighbor-Joining; Include Sites as pairwise deletion option for total sequences analysis and complete deletion option for each class analysis; Substitution Model: Poisson correction; and Bootstrap test of 1,000 replicates for internal branch reliability.
B. rapa plants were grown at 18-22°C under a 12 h light (10,000 Lx)/12 h dark photoperiod. Leaves were collected from 2-, 4-, 6-, 8- or 10-week-old plants; roots and stems were collected from 6-week-old plants; flower buds were collected from 10-week-old plants. Total RNA was extracted using Trizol reagent (Invitrogen, USA) from about 100 mg of collected plant tissue. The RNA preparation was then treated with DNaseI (Promega, USA) for 30 min at 37°C, followed by enzyme inactivation by incubation at 65°C for 5 min. First strand cDNA was made using an RT-PCR Kit (RevertAid™ First Strand cDNA Synthesis Kit, Fermentas, CA). The RT-solution with first strand cDNA was stored at -80°C. Primers used for the RT-PCR reactions are listed in Additional File 4: Table S2. Conditions for the PCR reactions were as follows: 94°C for 3 min; then 30 cycles of 94°C for 30 s, 50-63°C for 30 s, and 72°C for 1 min; and finally 72°C for 8 min. PCR products were separated in a 1.5% (w/v) agarose Tris-borate/EDTA buffer gel and visualized by ethidium bromide staining.
We thank Dr Xiaowu Wang, Jian Wu and Fen Chen, from the Institute of Vegetables and Flowers of the Chinese Academy of Agricultural Sciences, for providing sequences and helping in synteny analysis. This work was supported in part by National Basic Research Program of China (973 Program, 2012CB910500) and National Natural Science Foundation of China (NSFC31071129 and NSFC31071455).
- Strahl BD, Allis CD: The language of covalent histone modifications. Nature. 2000, 403: 41-45.PubMedView ArticleGoogle Scholar
- Martin C, Zhang Y: The diverse functions of histone lysine methylation. Nat Rev Mol Cell Biol. 2005, 6: 838-849.PubMedView ArticleGoogle Scholar
- Liu C, Lu F, Cui X, Cao X: Histone methylation in higher plants. Annu Rev Plant Biol. 2010, 61: 395-420.PubMedView ArticleGoogle Scholar
- Tschiersch B, Hofmann A, Krauss V, Dorn R, Korge G, Reuter G: The protein encoded by the Drosophila position-effect variegation suppressor gene Su(var)3-9 combines domains of antagonistic regulators of homeotic gene complexes. EMBO J. 1994, 13: 3822-3831.PubMedPubMed CentralGoogle Scholar
- Qian C, Zhou MM: SET domain protein lysine methyltransferases: structure, specificity and catalysis. Cell Mol Life Sci. 2006, 63: 2755-2763.PubMedView ArticleGoogle Scholar
- Alvarez-Venegas R, Sadder M, Tikhonov A, Avramova Z: Origin of the bacterial SET domain genes: vertical or horizontal?. Mol Biol Evol. 2007, 24: 482-497.PubMedView ArticleGoogle Scholar
- Goodrich J, Puangsomlee P, Martin M, Long D, Meyerowitz EM, Coupland G: A Polycomb-group gene regulates homeotic gene expression in Arabidopsis. Nature. 1997, 386: 44-51.PubMedView ArticleGoogle Scholar
- Grossniklaus U, Vielle-Calzada JP, Hoeppner MA, Gagliano WB: Maternal control of embryogenesis by MEDEA, a polycomb group gene in Arabidopsis. Science. 1998, 280: 446-450.PubMedView ArticleGoogle Scholar
- Jackson JP, Lindroth AM, Cao X, Jacobsen SE: Control of CpNpG DNA methylation by the KRYPTONITE histone H3 methyltransferase. Nature. 2002, 416: 556-560.PubMedView ArticleGoogle Scholar
- Shen WH: NtSET1, a member of a newly identified subgroup of plant SET-domain-containing proteins, is chromatin-associated and its ectopic overexpression inhibits tobacco plant growth. Plant J. 2001, 28: 371-383.PubMedView ArticleGoogle Scholar
- Baumbusch LO, Thorstensen T, Krauss V, Fischer A, Naumann K, Assalkhou R, Schulz I, Reuter G, Aalen RB: The Arabidopsis thaliana genome contains at least 29 active genes encoding SET domain proteins that can be assigned to four evolutionarily conserved classes. Nucleic Acids Res. 2001, 29: 4319-4333.PubMedPubMed CentralView ArticleGoogle Scholar
- Springer NM, Napoli CA, Selinger DA, Pandey R, Cone KC, Chandler VL, Kaeppler HF, Kaeppler SM: Comparative analysis of SET domain proteins in maize and Arabidopsis reveals multiple duplications preceding the divergence of monocots and dicots. Plant Physiol. 2003, 132: 907-925.PubMedPubMed CentralView ArticleGoogle Scholar
- Pien S, Grossniklaus U: Polycomb group and trithorax group proteins in Arabidopsis. Biochim Biophys Acta. 2007, 1769: 375-382.PubMedView ArticleGoogle Scholar
- Shen WH, Xu L: Chromatin remodeling in stem cell maintenance in Arabidopsis thaliana. Mol Plant. 2009, 2: 600-609.PubMedView ArticleGoogle Scholar
- Berr A, Shafiq S, Shen WH: Histone modifications in transcriptional activation during plant development. Biochim Biophys Acta. 2011, 1809: 567-576.PubMedView ArticleGoogle Scholar
- Wang X, Wang H, Wang J, Sun R, Wu J, Liu S, Bai Y, Mun JH, Bancroft I, Cheng F, Huang S, Li X, Hua W, Wang J, Wang X, Freeling M, Pires JC, Paterson AH, Chalhoub B, Wang B, Hayward A, Sharpe AG, Park BS, Weisshaar B, Liu B, Li B, Liu B, Tong C, Song C, Duran C, Peng C, Geng C, Koh C, Lin C, Edwards D, Mu D, Shen D, Soumpourou E, Li F, Fraser F, Conant G, Lassalle G, King GJ, Bonnema G, Tang H, Wang H, Belcram H, Zhou H, Hirakawa H, Abe H, Guo H, Wang H, Jin H, Parkin IA, Batley J, Kim JS, Just J, Li J, Xu J, Deng J, Kim JA, Li J, Yu J, Meng J, Wang J, Min J, Poulain J, Wang J, Hatakeyama K, Wu K, Wang L, Fang L, Trick M, Links MG, Zhao M, Jin M, Ramchiary N, Drou N, Berkman PJ, Cai Q, Huang Q, Li R, Tabata S, Cheng S, Zhang S, Zhang S, Huang S, Sato S, Sun S, Kwon SJ, Choi SR, Lee TH, Fan W, Zhao X, Tan X, Xu X, Wang Y, Qiu Y, Yin Y, Li Y, Du Y, Liao Y, Lim Y, Narusaka Y, Wang Y, Wang Z, Li Z, Wang Z, Xiong Z, Zhang Z: The genome of the mesopolyploid crop species Brassica rapa. Nat Genet. 2011, 43: 1035-1039.PubMedView ArticleGoogle Scholar
- Allis CD, Berger SL, Cote J, Dent S, Jenuwien T, Kouzarides T, Pillus L, Reinberg D, Shi Y, Shiekhattar R, Shilatifard A, Workman J, Zhang Y: New nomenclature for chromatin-modifying enzymes. Cell. 2007, 131 (4): 633-636.PubMedView ArticleGoogle Scholar
- Briggs SD, Bryk M, Strahl BD, Cheung WL, Davie JK, Dent SY, Winston F, Allis CD: Histone H3 lysine 4 methylation is mediated by set1 and required for cell growth and rDNA silencing in Saccharomyces cerevisiae. Genes Dev. 2001, 15: 3286-3295.PubMedPubMed CentralView ArticleGoogle Scholar
- Strahl BD, Grant PA, Briggs SD, Sun ZW, Bone JR, Caldwell JA, Mollah S, Cook RG, Shabanowitz J, Hunt DF, Allis CD: Set2 is a nucleosomal histone H3-selective methyltransferase that mediates transcriptional repression. Mol Cell Biol. 2002, 22: 1298-1306.PubMedPubMed CentralView ArticleGoogle Scholar
- Ng DW, Wang T, Chandrasekharan MB, Aramayo R, Kertbundit S, Hall TC: Plant SET domain-containing proteins: structure, function and regulation. Biochim Biophys Acta. 2007, 1769: 316-329.PubMedPubMed CentralView ArticleGoogle Scholar
- Johnson LM, Bostick M, Zhang X, Kraft E, Henderson I, Callis J, Jacobsen SE: The SRA methyl-cytosine-binding domain links DNA and histone methylation. Curr Biol. 2007, 17: 379-384.PubMedPubMed CentralView ArticleGoogle Scholar
- Rajakumara E, Law JA, Simanshu DK, Voigt P, Johnson LM, Reinberg D, Patel DJ, Jacobsen SE: A dual flip-out mechanism for 5mC recognition by the Arabidopsis SUVH5 SRA domain and its impact on DNA methylation and H3K9 dimethylation in vivo. Genes Dev. 2011, 25: 137-152.PubMedPubMed CentralView ArticleGoogle Scholar
- Rea S, Eisenhaber F, O'Carroll D, Strahl BD, Sun ZW, Schmid M, Opravil S, Mechtler K, Ponting CP, Allis CD, Jenuwein T: Regulation of chromatin structure by site-specific histone H3 methyltransferases. Nature. 2000, 406: 593-599.PubMedView ArticleGoogle Scholar
- Tachibana M, Sugimoto K, Fukushima T, Shinkai Y: Set domain-containing protein, G9a, is a novel lysine-preferring mammalian histone methyltransferase with hyperactivity and specific selectivity to lysines 9 and 27 of histone H3. J Biol Chem. 2001, 276: 25309-25317.PubMedView ArticleGoogle Scholar
- Malagnac F, Bartee L, Bender J: An Arabidopsis SET domain protein required for maintenance but not establishment of DNA methylation. EMBO J. 2002, 21: 6842-6852.PubMedPubMed CentralView ArticleGoogle Scholar
- Ebbs ML, Bartee L, Bender J: H3 lysine 9 methylation is maintained on a transcribed inverted repeat by combined action of SUVH6 and SUVH4 methyltransferases. Mol Cell Biol. 2005, 25: 10507-10515.PubMedPubMed CentralView ArticleGoogle Scholar
- Ebbs ML, Bender J: Locus-specific control of DNA methylation by the Arabidopsis SUVH5 histone methyltransferase. Plant Cell. 2006, 18: 1166-1176.PubMedPubMed CentralView ArticleGoogle Scholar
- Naumann K, Fischer A, Hofmann I, Krauss V, Phalke S, Irmler K, Hause G, Aurich AC, Dorn R, Jenuwein T, Reuter G: Pivotal role of AtSUVH2 in heterochromatic histone methylation and gene silencing in Arabidopsis. EMBO J. 2005, 24: 1418-1429.PubMedPubMed CentralView ArticleGoogle Scholar
- Ding Y, Wang X, Su L, Zhai J, Cao S, Zhang D, Liu C, Bi Y, Qian Q, Cheng Z, Chu C, Cao X: SDG714, a histone H3K9 methyltransferase, is involved in Tos17 DNA methylation and transposition in rice. Plant Cell. 2007, 19: 9-22.PubMedPubMed CentralView ArticleGoogle Scholar
- Ding B, Zhu Y, Bu ZY, Shen WH, Yu Y, Dong AW: SDG714 regulates specific gene expression and consequently affects plant growth via H3K9 dimethylation. J Integr Plant Biol. 2010, 52: 420-430.PubMedView ArticleGoogle Scholar
- Qin FJ, Sun QW, Huang LM, Chen XS, Zhou DX: Rice SUVH histone methyltransferase genes display specific functions in chromatin modification and retrotransposon repression. Mol Plant. 2010, 3: 773-782.PubMedView ArticleGoogle Scholar
- Veiseth SV, Rahman MA, Yap KL, Fischer A, Egge-Jacobsen W, Reuter G, Zhou MM, Aalen RB, Thorstensen T: The SUVR4 histone lysine methyltransferase binds ubiquitin and converts H3K9me1 to H3K9me3 on transposon chromatin in Arabidopsis. PLoS Genet. 2011, 7: e1001325-PubMedPubMed CentralView ArticleGoogle Scholar
- Krichevsky A, Gutgarts H, Kozlovsky SV, Tzfira T, Sutton A, Sternglanz R, Mandel G, Citovsky V: C2H2 zinc finger-SET histone methyltransferase is a plant-specific chromatin modifier. Dev Biol. 2007, 303: 259-269.PubMedPubMed CentralView ArticleGoogle Scholar
- Qiu C, Sawada K, Zhang X, Cheng X: The PWWP domain of mammalian DNA methyltransferase Dnmt3b defines a new family of DNA-binding folds. Nat Struct Biol. 2002, 9: 217-224.PubMedPubMed CentralGoogle Scholar
- Vezzoli A, Bonadies N, Allen MD, Freund SM, Santiveri CM, Kvinlaug BT, Huntly BJ, Gottgens B, Bycroft M: Molecular basis of histone H3K36me3 recognition by the PWWP domain of Brpf1. Nat Struct Mol Biol. 2010, 17: 617-619.PubMedView ArticleGoogle Scholar
- Aasland R, Gibson TJ, Stewart AF: The PHD finger: implications for chromatin-mediated transcriptional regulation. Trends Biochem Sci. 1995, 20: 56-59.PubMedView ArticleGoogle Scholar
- Yun M, Wu J, Workman JL, Li B: Readers of histone modifications. Cell Res. 2011, 21: 564-578.PubMedPubMed CentralView ArticleGoogle Scholar
- Schultz J, Copley RR, Doerks T, Ponting CP, Bork P: SMART: a web-based tool for the study of genetically mobile domains. Nucleic Acids Res. 2000, 28: 231-234.PubMedPubMed CentralView ArticleGoogle Scholar
- Kofler MM, Freund C: The GYF domain. FEBS J. 2006, 273: 245-256.PubMedView ArticleGoogle Scholar
- Alvarez-Venegas R, Pien S, Sadder M, Witmer X, Grossniklaus U, Avramova Z: ATX-1, an Arabidopsis homolog of trithorax, activates flower homeotic genes. Curr Biol. 2003, 13: 627-637.PubMedView ArticleGoogle Scholar
- Saleh A, Alvarez-Venegas R, Yilmaz M, Le O, Hou G, Sadder M, Al-Abdallat A, Xia Y, Lu G, Ladunga I, Avramova Z: The highly similar Arabidopsis homologs of trithorax ATX1 and ATX2 encode proteins with divergent biochemical functions. Plant Cell. 2008, 20: 568-579.PubMedPubMed CentralView ArticleGoogle Scholar
- Pien S, Fleury D, Mylne JS, Crevillen P, Inze D, Avramova Z, Dean C, Grossniklaus U: ARABIDOPSIS TRITHORAX1 dynamically regulates FLOWERING LOCUS C activation via histone 3 lysine 4 trimethylation. Plant Cell. 2008, 20: 580-588.PubMedPubMed CentralView ArticleGoogle Scholar
- Berr A, Xu L, Gao J, Cognat V, Steinmetz A, Dong A, Shen WH: SET DOMAIN GROUP25 encodes a histone methyltransferase and is involved in FLOWERING LOCUS C activation and repression of flowering. Plant Physiol. 2009, 151: 1476-1485.PubMedPubMed CentralView ArticleGoogle Scholar
- Tamada Y, Yun JY, Woo SC, Amasino RM: ARABIDOPSIS TRITHORAX-RELATED7 is required for methylation of lysine 4 of histone H3 and for transcriptional activation of FLOWERING LOCUS C. Plant Cell. 2009, 21: 3257-3269.PubMedPubMed CentralView ArticleGoogle Scholar
- Doerks T, Copley RR, Schultz J, Ponting CP, Bork P: Systematic identification of novel protein domain families associated with nuclear functions. Genome Res. 2002, 12: 47-56.PubMedPubMed CentralView ArticleGoogle Scholar
- Hoppmann V, Thorstensen T, Kristiansen PE, Veiseth SV, Rahman MA, Finne K, Aalen RB, Aasland R: The CW domain, a new histone recognition module in chromatin proteins. EMBO J. 2011, 30: 1939-1952.PubMedPubMed CentralView ArticleGoogle Scholar
- Xu L, Zhao Z, Dong A, Soubigou-Taconnat L, Renou JP, Steinmetz A, Shen WH: Di- and tri- but not monomethylation on histone H3 lysine 36 marks active transcription of genes involved in flowering time regulation and other processes in Arabidopsis thaliana. Mol Cell Biol. 2008, 28: 1348-1360.PubMedPubMed CentralView ArticleGoogle Scholar
- Kim SY, He Y, Jacob Y, Noh YS, Michaels S, Amasino R: Establishment of the vernalization-responsive, winter-annual habit in Arabidopsis requires a putative histone H3 methyl transferase. Plant Cell. 2005, 17: 3301-3310.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhao Z, Yu Y, Meyer D, Wu C, Shen WH: Prevention of early flowering by expression of FLOWERING LOCUS C requires methylation of histone H3 K36. Nat Cell Biol. 2005, 7: 1256-1260.PubMedView ArticleGoogle Scholar
- Dong G, Ma DP, Li J: The histone methyltransferase SDG8 regulates shoot branching in Arabidopsis. Biochem Biophys Res Commun. 2008, 373: 659-664.PubMedView ArticleGoogle Scholar
- Cazzonelli CI, Cuttriss AJ, Cossetto SB, Pye W, Crisp P, Whelan J, Finnegan EJ, Turnbull C, Pogson BJ: Regulation of carotenoid composition and shoot branching in Arabidopsis by a chromatin modifying histone methyltransferase, SDG8. Plant Cell. 2009, 21: 39-53.PubMedPubMed CentralView ArticleGoogle Scholar
- Grini PE, Thorstensen T, Alm V, Vizcay-Barrena G, Windju SS, Jorstad TS, Wilson ZA, Aalen RB: The ASH1 HOMOLOG 2 (ASHH2) histone H3 methyltransferase is required for ovule and anther development in Arabidopsis. PLoS One. 2009, 4: e7817-PubMedPubMed CentralView ArticleGoogle Scholar
- Berr A, McCallum EJ, Alioua A, Heintz D, Heitz T, Shen WH: Arabidopsis histone methyltransferase SET DOMAIN GROUP8 mediates induction of the jasmonate/ethylene pathway genes in plant defense response to necrotrophic fungi. Plant Physiol. 2010, 154: 1403-1414.PubMedPubMed CentralView ArticleGoogle Scholar
- Palma K, Thorgrimsen S, Malinovsky FG, Fiil BK, Nielsen HB, Brodersen P, Hofius D, Petersen M, Mundy J: Autoimmunity in Arabidopsis acd11 is mediated by epigenetic regulation of an immune receptor. PLoS Pathog. 2010, 6 (10): e1001137-PubMedPubMed CentralView ArticleGoogle Scholar
- Cartagena JA, Matsunaga S, Seki M, Kurihara D, Yokoyama M, Shinozaki K, Fujimoto S, Azumi Y, Uchiyama S, Fukui K: The Arabidopsis SDG4 contributes to the regulation of pollen tube growth by methylation of histone H3 lysines 4 and 36 in mature pollen. Dev Biol. 2008, 315: 355-368.PubMedView ArticleGoogle Scholar
- Thorstensen T, Grini PE, Mercy IS, Alm V, Erdal S, Aasland R, Aalen RB: The Arabidopsis SET-domain protein ASHR3 is involved in stamen development and interacts with the bHLH transcription factor ABORTED MICROSPORES (AMS). Plant Mol Biol. 2008, 66: 47-59.PubMedView ArticleGoogle Scholar
- Zheng B, Chen X: Dynamics of histone H3 lysine 27 trimethylation in plant development. Curr Opin Plant Biol. 2011, 14: 123-129.PubMedPubMed CentralView ArticleGoogle Scholar
- Zhang D, Martyniuk CJ, Trudeau VL: SANTA domain: a novel conserved protein module in Eukaryota with potential involvement in chromatin regulation. Bioinformatics. 2006, 22: 2459-2462.PubMedView ArticleGoogle Scholar
- Jacob Y, Feng S, LeBlanc CA, Bernatavichute YV, Stroud H, Cokus S, Johnson LM, Pellegrini M, Jacobsen SE, Michaels SD: ATXR5 and ATXR6 are H3K27 monomethyltransferases required for chromatin structure and gene silencing. Nat Struct Mol Biol. 2009, 16: 763-768.PubMedPubMed CentralView ArticleGoogle Scholar
- Jacob Y, Stroud H, Leblanc C, Feng S, Zhuo L, Caro E, Hassel C, Gutierrez C, Michaels SD, Jacobsen SE: Regulation of heterochromatic DNA replication by histone H3 lysine 27 methyltransferases. Nature. 2010, 466: 987-991.PubMedPubMed CentralView ArticleGoogle Scholar
- Berr A, McCallum EJ, Menard R, Meyer D, Fuchs J, Dong A, Shen WH: Arabidopsis SET DOMAIN GROUP2 is required for H3K4 trimethylation and is crucial for both sporophyte and gametophyte development. Plant Cell. 2010, 22: 3232-3248.PubMedPubMed CentralView ArticleGoogle Scholar
- Guo L, Yu Y, Law JA, Zhang X: SET DOMAIN GROUP2 is the major histone H3 lysine 4 trimethyltransferase in Arabidopsis. Proc Natl Acad Sci USA. 2010, 107: 18557-18562.PubMedPubMed CentralView ArticleGoogle Scholar
- Aquea F, Vega A, Timmermann T, Poupin MJ, Arce-Johnson P: Genome-wide analysis of the SET DOMAIN GROUP family in Grapevine. Plant Cell Rep. 2011, 30: 1087-1097.PubMedView ArticleGoogle Scholar
- Price CA: Commission on plant gene nomenclature. Plant Mol Biol Rep. 1993, 11: 273-274.View ArticleGoogle Scholar
- Fischer A, Hofmann I, Naumann K, Reuter G: Heterochromatin proteins and the control of heterochromatic gene silencing in Arabidopsis. J Plant Physiol. 2006, 163: 358-368.PubMedView ArticleGoogle Scholar
- Lynch M, Force A: The probability of duplicate gene preservation by subfunctionalization. Genetics. 2000, 154: 459-473.PubMedPubMed CentralGoogle Scholar
- Cannon SB, Mitra A, Baumgarten A, Young ND, May G: The roles of segmental and tandem gene duplication in the evolution of large gene families in Arabidopsis thaliana. BMC Plant Biol. 2004, 4: 10-PubMedPubMed CentralView ArticleGoogle Scholar
- Luo M, Bilodeau P, Koltunow A, Dennis ES, Peacock WJ, Chaudhury AM: Genes controlling fertilization-independent seed development in Arabidopsis thaliana. Proc Natl Acad Sci USA. 1999, 96: 296-301.PubMedPubMed CentralView ArticleGoogle Scholar
- Thompson JD, Higgins DG, Gibson TJ: CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 1994, 22: 4673-4680.PubMedPubMed CentralView ArticleGoogle Scholar
- Tamura K, Dudley J, Nei M, Kumar S: MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol. 2007, 24: 1596-1599.PubMedView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.