Tag |
Content |
WERAM ID |
WERAM-Art-0064 |
Ensembl Protein ID |
AT3G04380.1 |
Uniprot Accession |
Q8W595; SUVR4_ARATH; Q3EBC4; Q9M848 |
Genbank Protein ID |
NP_187088.2 |
Protein Name |
Histone-lysine N-methyltransferase SUVR4 |
Genbank Nucleotide ID |
NM_111309.2 |
Gene Name |
SUVR4 |
Ensembl Information |
|
Details |
Type | Family | Domain | Substrates | AA | References (PMIDs) |
HMT | SUV39 | SET | H3K9 | K | 22549957; 20703330 |
|
Status |
Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | SUV39 | 1.20e-37 | 129.9 | 89 | 434 |
|
Organism |
Arabidopsis thaliana |
NCBI Taxa ID |
3702 |
Functional Description |
Histone methyltransferase that converts monomethylated 'Lys-9' of histone H3 (H3K9me1) to dimethylated 'Lys-9' (H3K9me2) in the absence of bound ubiquitin, and to trimethylated 'Lys-9' (H3K9me3) in the presence of bound ubiquitin. Acts in a locus-specific manner and contributes to the transcriptional silencing of pseudogenes and transposons. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. |
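The substrate and product rule stated in the functional description can be written as a small lookup. This is only an illustrative sketch of that one sentence: the function name `suvr4_product` and the string labels for the methylation states are hypothetical and are not part of WERAM or UniProt.

```python
def suvr4_product(substrate: str, ubiquitin_bound: bool) -> str:
    """Return the H3K9 methylation state produced from H3K9me1.

    Encodes only the rule quoted in the functional description:
    H3K9me1 -> H3K9me2 without bound ubiquitin,
    H3K9me1 -> H3K9me3 with bound ubiquitin.
    """
    if substrate != "H3K9me1":
        # The description names only monomethylated Lys-9 as the substrate.
        raise ValueError(f"unsupported substrate: {substrate!r}")
    return "H3K9me3" if ubiquitin_bound else "H3K9me2"


print(suvr4_product("H3K9me1", ubiquitin_bound=False))
print(suvr4_product("H3K9me1", ubiquitin_bound=True))
```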
Domain Profile |
HMT SUV39
SUV39.txt     46 eegdeyladldskesvenlkegyesdvplssdssn 80
                 +++ + ++l+ ++s + lk++ye++ + s s +
AT3G04380.1   89 SSNGNRGKNLKVIDSPATLKKTYETRSASSGSSIQ 123
                 34444556666666666666666666665555543 PP

SUV39.txt      2 rLqvfktenk.GwGvrclddiakgsFvciyaGeiltddeaekegl...eegdeyladldskesvenlkegyesdvplssdssntrqekdkeeseyiid 95
                 +Lqv+ t++ GwG+r+l+d++kg+F+c+y+Geilt++e ++++ +e+++y ++ld+ + +++ k+e+ +++d
AT3G04380.1  303 QLQVYFTQEGkGWGLRTLQDLPKGTFICEYIGEILTNTELYDRNVrssSERHTYPVTLDADWG---------------------SEKDLKDEEALCLD 379
                 79***999977****************************999966545666667777777554.....................345668899***** PP

SUV39.txt     96 akkegnvgrflnHsc.spNlfvqnvfvdthdlrfprvafFaskrikagtELtwdY 149
                 a+ gnv+rf+nH+c ++N++ ++ ++t d++++++afF+ +++ka++ELtwdY
AT3G04380.1  380 ATICGNVARFINHRCeDANMIDIPIEIETPDRHYYHIAFFTLRDVKAMDELTWDY 434
                 ***************99************************************** PP
|
Protein Sequence (Fasta) |
>AT3G04380.1|SUVR4|Arabidopsis thaliana MISLSGLTSSVESDLDMQQAMLTNKDEKVLKALERTRQLDIPDEKTMPVLMKLLEEAGGNWSYIKLDNYTALVDAIYSVEDENKQSEGSSNGNRGKNLKVIDSPATLKKTYETRSASSGSSIQVVQKQPQLSNGDRKRKYKSRIADITKGSESVKIPLVDDVGSEAVPKFTYIPHNIVYQSAYLHVSLARISDEDCCANCKGNCLSADFPCTCARETSGEYAYTKEGLLKEKFLDTCLKMKKEPDSFPKVYCKDCPLERDHDKGTYGKCDGHLIRKFIKECWRKCGCDMQCGNRVVQRGIRCQLQVYFTQEGKGWGLRTLQDLPKGTFICEYIGEILTNTELYDRNVRSSSERHTYPVTLDADWGSEKDLKDEEALCLDATICGNVARFINHRCEDANMIDIPIEIETPDRHYYHIAFFTLRDVKAMDELTWDYMIDFNDKSHPVKAFRCCCGSESCRDRKIKGSQGKSIERRKIVSAKKQQGSKEVSKKRK
|
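The Start and End coordinates reported for the record are 1-based and inclusive, so they can be checked directly against the FASTA sequence. The sketch below parses a `>id|gene|organism` header and slices residues 89 to 123, the first aligned segment in the domain profile. The helper names `parse_fasta_record` and `slice_1based` are hypothetical, and the embedded record is truncated to the first 130 residues for brevity.

```python
def parse_fasta_record(text):
    """Split one FASTA record into (header fields, sequence)."""
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header starting with '>'")
    fields = lines[0][1:].split("|")      # e.g. [protein ID, gene, organism]
    sequence = "".join(lines[1:]).replace(" ", "")
    return fields, sequence


def slice_1based(sequence, start, end):
    """Return residues start..end using 1-based, inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# First 130 residues of the record only (truncated for this example).
record = """>AT3G04380.1|SUVR4|Arabidopsis thaliana
MISLSGLTSSVESDLDMQQAMLTNKDEKVLKALERTRQLDIPDEKTMPVLMKLLEEAGGN
WSYIKLDNYTALVDAIYSVEDENKQSEGSSNGNRGKNLKVIDSPATLKKTYETRSASSGS
SIQVVQKQPQ
"""

fields, seq = parse_fasta_record(record)
hit = slice_1based(seq, 89, 123)
print(fields[0], fields[1], hit)
```

The printed segment should match the AT3G04380.1 line of the first alignment block in the domain profile.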
Nucleotide Sequence (Fasta) |
>AT3G04380.1|SUV39|Arabidopsis thaliana CGCACGACGCAGTGAAACAGAGATACTGTGGAAAAGTGTGTTTAAAATGGATCTGTTTTAGGGTATTTATCAATTCGAAAATTGATTAATTTATTAATTTCGTAAGCTTTCACAGAAAGTTCGAACTCTTTGCATGCAATCATTCGCCGTCTTCTTCCACAATTGTGTTTTTTAGCCGGCGGCGGCTGGATTCGAGAAGCTTCCGATTGTTCTGTATGATCAGTCTCTCCGGACTAACCAGTTCTGTTGAAAGTGATCTCGATATGCAACAAGCGATGCTCACCAATAAAGACGAGAAGGTACTCAAAGCTTTAGAGAGAACAAGGCAATTGGATATTCCCGATGAAAAGACAATGCCAGTGCTTATGAAGCTCCTAGAAGAGGCTGGTGGCAATTGGTCGTATATAAAGTTGGATAACTATACTGCACTGGTCGACGCTATTTATTCTGTTGAGGATGAGAATAAGCAAAGTGAAGGTTCATCTAATGGTAATAGAGGGAAGAATCTTAAGGTCATTGACTCTCCTGCTACTCTGAAAAAAACTTACGAAACCCGTTCTGCATCCTCAGGTTCCTCCATTCAGGTAGTCCAGAAGCAGCCACAGCTTAGTAATGGTGATCGCAAGAGGAAATATAAAAGCAGAATTGCTGACATAACTAAAGGTTCAGAGAGCGTTAAAATCCCTCTTGTCGATGATGTTGGGAGTGAAGCTGTGCCAAAGTTTACTTACATCCCTCACAACATTGTTTACCAAAGTGCTTATCTCCACGTGTCTCTGGCTCGAATCTCTGATGAAGATTGCTGCGCAAACTGCAAAGGGAACTGTCTTTCAGCTGACTTTCCTTGCACTTGTGCTCGTGAAACCAGTGGAGAATATGCTTATACCAAAGAAGGGCTGCTAAAGGAAAAGTTTCTGGACACCTGTCTTAAGATGAAAAAGGAACCAGATTCGTTCCCTAAAGTTTACTGCAAAGACTGCCCTTTGGAGAGAGATCACGATAAGGGCACATATGGAAAATGTGATGGACACTTAATCCGAAAGTTCATCAAGGAATGCTGGAGAAAGTGTGGATGTGATATGCAGTGTGGAAATCGAGTAGTACAGAGAGGGATAAGGTGCCAACTGCAGGTTTACTTTACTCAAGAAGGGAAAGGATGGGGTCTTAGAACACTGCAAGACTTGCCCAAAGGAACCTTTATCTGTGAATACATTGGTGAAATATTGACCAACACGGAGTTATACGATCGGAATGTTAGGTCTAGTAGTGAACGACATACATATCCTGTAACTCTGGATGCAGACTGGGGTTCTGAAAAGGATCTAAAAGATGAAGAAGCTCTCTGCCTGGATGCCACAATCTGTGGAAATGTCGCAAGGTTTATCAATCACAGATGCGAGGATGCAAACATGATTGATATTCCGATAGAGATAGAGACGCCTGACAGACATTATTATCATATTGCTTTTTTTACCCTACGAGACGTGAAGGCCATGGATGAGTTGACATGGGATTACATGATAGACTTCAATGATAAAAGTCATCCTGTAAAGGCATTTAGATGTTGCTGCGGAAGCGAATCATGCAGAGACAGAAAAATAAAAGGATCTCAAGGCAAGTCTATAGAGAGAAGAAAGATTGTTTCTGCTAAAAAACAACAAGGTTCCAAAGAGGTGTCTAAAAAGCGCAAATGAGACTGGTCATTATTTAGAGTTAAACATTAAAGAAAGCATGAGCTCACAAAACACTATGATCCATCGCGAGCTCTGCATGATTCAACTAGCTTTTCCTTCTTGATGACATATACAACTTTTATGGTCTGCAGTTTTTTCTCATTTACAATCTCTTCTTTAGGTAAAAAAAAGTGTTATGTTGCCTAACCAAAGAAACTCACTGTTTAGTTCTGTTCTTCAAATTTCATGGCATATGTCAAAATAAATAAACCTTTTTTAATAGAGTT
CTAGAGCGTAAAGACTGGATCGAAAAGGAGCCCATCACTGGTAAATTCAGCCAAGAGTGATCCATCGCATCAATGATTTGTAATGTTAAACAAAATTTTAATATTTAGACTTAGAAGTTGTAAT
|
Sequence Source |
Ensembl |
Keyword |
KW-0002--3D-structure; KW-0025--Alternative splicing; KW-0156--Chromatin regulator; KW-0158--Chromosome; KW-0181--Complete proteome; KW-0479--Metal-binding; KW-0489--Methyltransferase; KW-0539--Nucleus; KW-1185--Reference proteome; KW-0949--S-adenosyl-L-methionine; KW-0808--Transferase; KW-0862--Zinc
|
Interpro |
IPR007728--Pre-SET_dom; IPR001214--SET_dom; IPR025776--SUVR4/1/2; IPR018848--WIYLD_domain
|
PROSITE |
PS50867--PRE_SET; PS51580--SAM_MT43_3; PS50280--SET
|
Pfam |
PF05033--Pre-SET; PF00856--SET; PF10440--WIYLD
|
Gene Ontology |
GO:0005694--C:chromosome; GO:0005730--C:nucleolus; GO:0009506--C:plasmodesma; GO:0018024--F:histone-lysine N-methyltransferase activity; GO:0008270--F:zinc ion binding
|
Orthology |
|
Created Date |
25-Jun-2016 |