Tag |
Content |
WERAM ID |
WERAM-Art-0096 |
Ensembl Protein ID |
AT4G02020.1 |
Uniprot Accession |
Q9ZSM8; EZA1_ARATH; O04246 |
Genbank Protein ID |
NP_567221.1 |
Protein Name |
Histone-lysine N-methyltransferase EZA1 |
Genbank Nucleotide ID |
NM_116433.2 |
Gene Name |
EZA1;SWN |
Ensembl Information |
|
Details |
Type | Family | Domain | Substrates | AA | References (PMIDs) |
HMT | EZ | SET | H3K27 | K | 20703330 |
|
Status |
Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | EZ | 2.30e-64 | 215.4 | 269 | 822 |
HMT | SET1 | 2.60e-30 | 105.7 | 708 | 822 |
|
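The Start and End values in the classification rows above read as 1-based, inclusive residue positions on the 856-residue protein. The sketch below shows one way to work with them: it computes the span of each hit, the overlap between the EZ and SET1 hits, and the matching Python slice. The `DomainHit` class and its method names are hypothetical helpers introduced for illustration; they are not part of WERAM or of any library.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    """One classification row; start/end are assumed 1-based and inclusive."""
    family: str
    evalue: float
    score: float
    start: int
    end: int

    def length(self) -> int:
        # Inclusive coordinates, so add 1.
        return self.end - self.start + 1

    def overlap(self, other: "DomainHit") -> int:
        # Number of residues covered by both hits (0 if they are disjoint).
        lo = max(self.start, other.start)
        hi = min(self.end, other.end)
        return max(0, hi - lo + 1)

    def slice_of(self, sequence: str) -> str:
        # Convert 1-based inclusive coordinates to a 0-based half-open slice.
        return sequence[self.start - 1:self.end]


# Values copied from the Classification rows of this record.
HITS = [
    DomainHit("EZ", 2.30e-64, 215.4, 269, 822),
    DomainHit("SET1", 2.60e-30, 105.7, 708, 822),
]

# The hit with the smallest E-value is the most significant one.
best = min(HITS, key=lambda h: h.evalue)

if __name__ == "__main__":
    for hit in HITS:
        print(hit.family, hit.length())
    print("shared residues:", HITS[0].overlap(HITS[1]))
    print("best family:", best.family)
```

Under these assumptions the SET1 hit lies entirely inside the EZ hit, which is consistent with the entry being assigned to the EZ family.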
Organism |
Arabidopsis thaliana |
NCBI Taxa ID |
3702 |
Functional Description |
Polycomb group (PcG) protein. Catalytic subunit of some PcG multiprotein complex, which methylates 'Lys-27' of histone H3, leading to transcriptional repression of the affected target genes. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development. PcG proteins are not required to initiate repression, but to maintain it during later stages of development (By similarity). |
|
Domain Profile |
HMT EZ
EZ.txt 3 illgksdvaGwGlflkesvekneylgeytGel 34
l+ ++G l + ek+ y ++y G++
AT4G02020.1 269 CLVFDCRLHGCSQPLISASEKQPYWSDYEGDR 300
5666777889999999999*********9975 PP
EZ.txt 1 krillgksdvaGwGlflkesvekneylgeytGelisddeadkrGkiydrakssflfnlndqlvidakrkGnklkfanhsakpncyakvllvaGdhriG 98
+rillgksdvaGwG+flk+sv+kneylgeytGelis++eadkrGkiydra+ssflf+lndq+v+da+rkG+klkfanhsakpncyakv++vaGdhr+G
AT4G02020.1 707 QRILLGKSDVAGWGAFLKNSVSKNEYLGEYTGELISHHEADKRGKIYDRANSSFLFDLNDQYVLDAQRKGDKLKFANHSAKPNCYAKVMFVAGDHRVG 804
79************************************************************************************************ PP
EZ.txt 99 lfakrrieaseelffdyr 116
+fa++rieaseelf+dyr
AT4G02020.1 805 IFANERIEASEELFYDYR 822
*****************7 PP
HMT SET1
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakvvavdgekk 99
++ + ks+++g+g + k++++k+e + EY+Ge+i++++adkr k y++ + +++lf l+++ +v+da++kg++ +f nhs +pNc+akv+ v g+++
AT4G02020.1 708 RILLGKSDVAGWGAFLKNSVSKNEYLGEYTGELISHHEADKRGKIYDRAN-SSFLFDLNDQ--YVLDAQRKGDKLKFANHSAKPNCYAKVMFVAGDHR 802
577889*****************************************995.56********..*********************************** PP
SET1.txt 100 iviyakraIekgeeltydYk 119
++i+a+ +Ie+ eel+ydY+
AT4G02020.1 803 VGIFANERIEASEELFYDYR 822
*******************6 PP
|
Protein Sequence (Fasta) |
>AT4G02020.1|EZA1;SWN|Arabidopsis thaliana MVTDDSNSSGRIKSHVDDDDDGEEEEDRLEGLENRLSELKRKIQGERVRSIKEKFEANRKKVDAHVSPFSSAASSRATAEDNGNSNMLSSRMRMPLCKLNGFSHGVGDRDYVPTKDVISASVKLPIAERIPPYTTWIFLDRNQRMAEDQSVVGRRQIYYEQHGGETLICSDSEEEPEPEEEKREFSEGEDSIIWLIGQEYGMGEEVQDALCQLLSVDASDILERYNELKLKDKQNTEEFSNSGFKLGISLEKGLGAALDSFDNLFCRRCLVFDCRLHGCSQPLISASEKQPYWSDYEGDRKPCSKHCYLQLKAVREVPETCSNFASKAEEKASEEECSKAVSSDVPHAAASGVSLQVEKTDIGIKNVDSSSGVEQEHGIRGKREVPILKDSNDLPNLSNKKQKTAASDTKMSFVNSVPSLDQALDSTKGDQGGTTDNKVNRDSEADAKEVGEPIPDNSVHDGGSSICQPHHGSGNGAIIIAEMSETSRPSTEWNPIEKDLYLKGVEIFGRNSCLIARNLLSGLKTCLDVSNYMRENEVSVFRRSSTPNLLLDDGRTDPGNDNDEVPPRTRLFRRKGKTRKLKYSTKSAGHPSVWKRIAGGKNQSCKQYTPCGCLSMCGKDCPCLTNETCCEKYCGCSKSCKNRFRGCHCAKSQCRSRQCPCFAAGRECDPDVCRNCWVSCGDGSLGEAPRRGEGQCGNMRLLLRQQQRILLGKSDVAGWGAFLKNSVSKNEYLGEYTGELISHHEADKRGKIYDRANSSFLFDLNDQYVLDAQRKGDKLKFANHSAKPNCYAKVMFVAGDHRVGIFANERIEASEELFYDYRYGPDQAPVWARKPEGSKKDDSAITHRRARKHQSH
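The FASTA header above packs three pipe-separated fields: the Ensembl protein ID, the gene names (separated by semicolons), and the organism. The sketch below splits such a header into its parts. It assumes the header sits on its own line, as in a standard FASTA file; the `FastaHeader` type and `parse_header` function are hypothetical names introduced here, not part of any WERAM tooling.

```python
from typing import NamedTuple, Tuple


class FastaHeader(NamedTuple):
    """Fields of a header shaped like '>ID|NAME1;NAME2|Organism'."""
    protein_id: str
    gene_names: Tuple[str, ...]
    organism: str


def parse_header(line: str) -> FastaHeader:
    """Split a header of the form used in this record into its three fields."""
    if not line.startswith(">"):
        raise ValueError("a FASTA header must start with '>'")
    fields = line[1:].strip().split("|")
    if len(fields) != 3:
        raise ValueError(f"expected 3 pipe-separated fields, got {len(fields)}")
    protein_id, names, organism = fields
    # Gene names are separated by ';' in this record (EZA1;SWN).
    gene_names = tuple(name for name in names.split(";") if name)
    return FastaHeader(protein_id, gene_names, organism)


# Header text copied from the protein FASTA entry of this record.
header = parse_header(">AT4G02020.1|EZA1;SWN|Arabidopsis thaliana")

if __name__ == "__main__":
    print(header.protein_id)
    print(header.gene_names)
    print(header.organism)
```

Because the organism name contains a space, splitting on `|` rather than on whitespace keeps "Arabidopsis thaliana" intact.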
|
Nucleotide Sequence (Fasta) |
>AT4G02020.1|SET1|Arabidopsis thaliana CCGTTCGTCCTTCTCACAAGTCTGATTGCGGAAAAAGCAGAGAGAGAGAGAAAGTTCGAGCGGAAGAGAAGCGGAAAGCTCGAGGAGTCATCAATGGTGACGGACGATAGCAACTCCTCTGGACGAATCAAGTCTCATGTAGATGATGATGATGATGGTGAAGAAGAAGAAGATAGACTCGAGGGTTTGGAAAACAGATTAAGTGAGCTTAAAAGGAAAATTCAAGGAGAAAGAGTTAGGTCTATTAAAGAGAAATTTGAGGCTAATAGAAAGAAAGTGGATGCTCATGTTTCTCCCTTTTCATCTGCTGCATCGAGCCGAGCTACCGCAGAGGATAATGGAAATAGCAATATGCTTTCTTCGAGAATGAGAATGCCACTCTGCAAGTTAAATGGTTTTTCTCATGGTGTGGGAGATAGAGACTATGTTCCTACTAAGGATGTTATATCAGCAAGTGTCAAGCTTCCTATTGCTGAGAGAATACCGCCATACACTACCTGGATATTTTTGGACAGAAATCAAAGAATGGCTGAAGATCAGTCTGTGGTTGGTCGAAGACAAATCTACTATGAACAACATGGTGGTGAGACGCTAATATGCAGCGATAGTGAGGAAGAACCAGAACCTGAGGAGGAAAAACGTGAATTTTCCGAGGGTGAAGATTCCATTATATGGTTAATTGGGCAGGAGTATGGCATGGGTGAGGAAGTGCAGGATGCCCTTTGCCAGTTGCTAAGCGTAGATGCTTCTGATATCCTGGAAAGATACAATGAGCTCAAGTTGAAGGATAAGCAGAATACCGAGGAATTTTCTAATTCCGGATTCAAGCTGGGAATATCTCTGGAAAAGGGCCTTGGTGCAGCTCTAGATTCTTTTGATAATCTTTTCTGCCGCCGTTGCTTGGTATTTGACTGTCGTCTGCATGGATGTTCTCAGCCTTTGATTAGTGCTAGTGAAAAACAGCCTTATTGGTCTGATTATGAAGGTGATAGGAAACCCTGCAGCAAACATTGTTACCTCCAGCTCAAGGCGGTCAGAGAAGTACCAGAAACATGCAGTAATTTTGCATCTAAAGCAGAAGAGAAAGCTTCAGAAGAGGAATGCAGCAAGGCTGTCTCCTCTGATGTTCCCCATGCTGCTGCTAGTGGTGTCAGTCTGCAAGTTGAGAAGACTGATATTGGTATCAAGAATGTAGATTCATCCTCTGGTGTAGAACAAGAGCATGGAATTAGAGGAAAGCGTGAGGTCCCAATTCTAAAAGACTCCAATGATCTGCCTAATTTATCGAACAAGAAACAGAAGACCGCAGCCTCAGATACAAAAATGTCATTTGTTAATTCTGTCCCTAGCTTAGATCAGGCATTGGATAGCACAAAGGGTGATCAAGGTGGAACAACTGACAATAAAGTAAACAGAGACTCAGAAGCTGATGCAAAAGAAGTAGGTGAGCCTATTCCAGACAATTCGGTCCATGATGGTGGTTCCTCAATTTGTCAGCCACACCATGGTAGTGGAAACGGAGCAATAATCATTGCAGAAATGTCTGAGACAAGTCGACCATCTACAGAGTGGAATCCTATCGAGAAGGATCTTTACTTGAAGGGAGTCGAAATCTTTGGAAGAAACAGCTGTCTTATTGCAAGAAACCTGCTTTCTGGCTTGAAGACATGCCTAGATGTGTCCAATTACATGCGTGAAAACGAAGTTTCAGTTTTTCGAAGATCTAGTACCCCAAATTTGCTGTTGGATGATGGCAGGACTGACCCAGGGAATGATAATGATGAGGTGCCTCCAAGGACAAGATTGTTCCGTAGAAAAGGCAAAACCCGGAAGCTAAAATACTCTACAAAGTCTGCTGGTCATCCGTCTGTCTGGAAAAGAATAGCTGGTGGCAAAAACCAGTCCTGTAAACAATACACGCCGTGTGGATGCCTGTCAATGTGCGGAAAGGATTGCCCTTG
TCTAACTAATGAAACTTGCTGCGAGAAATATTGCGGGTGCTCAAAAAGCTGTAAAAATCGTTTCCGAGGATGTCATTGTGCAAAGAGTCAATGCAGAAGTAGGCAGTGTCCCTGCTTTGCTGCTGGCAGAGAATGTGATCCAGATGTTTGCAGAAATTGCTGGGTTAGTTGTGGAGATGGTTCTCTCGGTGAAGCACCAAGACGCGGAGAAGGGCAATGCGGAAACATGAGACTTCTCCTGAGGCAACAACAGAGGATCCTATTGGGAAAGTCTGATGTTGCTGGATGGGGTGCTTTTCTAAAGAACTCGGTCAGCAAAAATGAATACCTTGGAGAATACACCGGTGAATTGATCTCACACCATGAGGCGGATAAGCGTGGGAAAATATATGACCGGGCAAATTCGTCCTTCCTCTTTGACTTGAATGATCAGTACGTCCTCGATGCTCAACGCAAAGGTGACAAGCTGAAATTTGCCAATCACTCAGCTAAACCCAATTGCTACGCTAAGGTGATGTTTGTAGCAGGAGATCACAGGGTCGGGATTTTTGCAAACGAACGAATAGAAGCTAGCGAAGAGCTTTTCTATGACTATAGATATGGACCAGACCAAGCACCAGTGTGGGCTCGCAAACCTGAAGGCTCCAAGAAAGATGATTCAGCCATTACTCATCGTAGAGCCAGAAAGCACCAATCTCATTGATGATTACTGGCTAAGAGAAGTAACTTTTATAAAAATAACTTATAGAGTTGTGAGAGATGATATTTGAAGTTTGATAACTTAAGCTTGTCTTTATTAATTAATTATTATAGAGTTGAGATTTTATTTTATTTTGACATCGAGTTTGGACTTTGTATAGGTGATAAAACAATTTATGAATTATTGGGGTCAATAAGTAAAAATGTATCATTTCG
|
Sequence Source |
Ensembl |
Keyword |
KW-0181--Complete proteome KW-0489--Methyltransferase KW-0539--Nucleus KW-1185--Reference proteome KW-0678--Repressor KW-0949--S-adenosyl-L-methionine KW-0804--Transcription KW-0805--Transcription regulation KW-0808--Transferase
|
Interpro |
IPR026489--CXC_dom IPR025778--Hist-Lys_N-MeTrfase_EZ IPR001005--SANT/Myb IPR001214--SET_dom IPR033467--Tesmin/TSO1-like_CXC
|
PROSITE |
PS51633--CXC PS51576--SAM_MT43_EZ PS50280--SET
|
Pfam |
PF00856--SET
|
Gene Ontology |
GO:0005677--C:chromatin silencing complex GO:0005634--C:nucleus GO:0031519--C:PcG protein complex GO:0009506--C:plasmodesma GO:0018024--F:histone-lysine N-methyltransferase activity GO:0003727--F:single-stranded RNA binding GO:0003700--F:transcription factor activity, sequence-specific DNA binding GO:0006349--P:regulation of gene expression by genetic imprinting GO:0006351--P:transcription, DNA-templated
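Each Gene Ontology entry above follows the pattern `GO:<7 digits>--<aspect>:<term name>`, where the aspect letter is C (cellular component), F (molecular function), or P (biological process). Term names may contain spaces and commas, so splitting on whitespace is not enough. The sketch below uses a regular expression that ends each name at the next GO identifier; `parse_go` and `GO_LINE` are hypothetical names introduced for illustration.

```python
import re
from collections import Counter
from typing import List, Tuple

# Annotation line copied from the Gene Ontology field of this record.
GO_LINE = (
    "GO:0005677--C:chromatin silencing complex "
    "GO:0005634--C:nucleus "
    "GO:0031519--C:PcG protein complex "
    "GO:0009506--C:plasmodesma "
    "GO:0018024--F:histone-lysine N-methyltransferase activity "
    "GO:0003727--F:single-stranded RNA binding "
    "GO:0003700--F:transcription factor activity, sequence-specific DNA binding "
    "GO:0006349--P:regulation of gene expression by genetic imprinting "
    "GO:0006351--P:transcription, DNA-templated"
)

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}

# The term name runs lazily up to the next 'GO:NNNNNNN--' or the end of the line.
GO_ENTRY = re.compile(r"(GO:\d{7})--([CFP]):(.+?)(?=\s+GO:\d{7}--|\s*$)")


def parse_go(line: str) -> List[Tuple[str, str, str]]:
    """Return (go_id, aspect_letter, term_name) for every entry in the line."""
    return [(m.group(1), m.group(2), m.group(3)) for m in GO_ENTRY.finditer(line)]


terms = parse_go(GO_LINE)
aspect_counts = Counter(aspect for _, aspect, _ in terms)

if __name__ == "__main__":
    for go_id, aspect, name in terms:
        print(f"{go_id}\t{ASPECTS[aspect]}\t{name}")
```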
|
Orthology |
|
Created Date |
25-Jun-2016 |