Tag |
Content |
WERAM ID |
WERAM-Art-0013 |
Ensembl Protein ID |
AT1G17770.1 |
Uniprot Accession |
Q9C5P1; SUVH7_ARATH; Q9LMU9 |
Genbank Protein ID |
NP_564036.1 |
Protein Name |
Histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH7 |
Genbank Nucleotide ID |
NM_101640.1 |
Gene Name |
SUVH7;SET17;SDG17 |
Ensembl Information |
|
Details |
Type | Family | Domain | Substrates | AA | References (PMIDs) |
HMT | SUV39 | SET | H3K9 | K | 20703330; 15659850 |
|
Status |
Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | SUV39 | 8.10e-43 | 146.8 | 368 | 660 |
|
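The Start and End values in the Classification row (368 and 660) read as 1-based, inclusive residue positions on the protein. The sketch below shows one way to turn such a range into a length, a Python slice, and the matching positions on the coding sequence. It assumes the nucleotide record is an uninterrupted CDS that begins at the start codon; the function names are hypothetical and not part of WERAM.

```python
# Minimal sketch: working with a 1-based, inclusive domain range such as
# the SUV39 hit at residues 368..660. All function names are illustrative.

def domain_length(start: int, end: int) -> int:
    """Number of residues covered by a 1-based, inclusive range."""
    return end - start + 1


def extract_region(sequence: str, start: int, end: int) -> str:
    """Return the residues in the 1-based, inclusive range start..end."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("range falls outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


def protein_to_cds(start: int, end: int) -> tuple[int, int]:
    """Map a residue range to 1-based, inclusive CDS nucleotide positions.

    Assumption: codon 1 starts at nucleotide 1 and the CDS has no gaps.
    """
    if not 1 <= start <= end:
        raise ValueError("expected 1 <= start <= end")
    return (start - 1) * 3 + 1, end * 3


if __name__ == "__main__":
    start, end = 368, 660
    print(domain_length(start, end))    # 293
    print(protein_to_cds(start, end))   # (1102, 1980)
    print(extract_region("MDKSIPIKAI", 2, 4))  # DKS
```

Under these assumptions the domain hit spans 293 residues, which would correspond to nucleotides 1102 to 1980 of the coding sequence.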
Organism |
Arabidopsis thaliana |
NCBI Taxa ID |
3702 |
Functional Description |
Histone methyltransferase. Methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. |
|
Domain Profile |
HMT SUV39
SUV39.txt 2 rLqvfktenk..GwGv.rclddiakgsFvciyaGeiltddeaeke 43
r++++++ n+ ++++ ++++++++ + G il+d + e
AT1G17770.1 368 RFKLVRKPNQppAYAIwKTVENLRNHDLIDSRQGFILEDLSFGAE 412
678888888888899989999999999999999999999776665 PP

SUV39.txt 2 rLqvfktenkGwGvrclddiakgsFvciyaGeiltddeaekegleegdeyladldskesvenlkegyesdvplssdssntrqekdkeeseyiidakke 99
+L+vfkt+n GwG+r++d i++g+F+c++aG t++e+e e+d+yl+d++++++ ++ +ye ++ l +ds ++ +e+ + ++++i+ak++
AT1G17770.1 520 HLEVFKTRNCGWGLRSWDPIRAGTFICEFAGLRKTKEEVE-----EDDDYLFDTSKIYQR--FRWNYEPELLL-EDSWEQVSEFINLPTQVLISAKEK 609
79*****************************999998887.....56*********9987..55566666644.667799999999************ PP

SUV39.txt 100 gnvgrflnHscspNlfvqnvfvdthdlrfprvafFaskrikagtELtwdYg 150
gnvgrf+nHscspN+f+q++ ++++ + +++Fa+k+i+++tELt+dYg
AT1G17770.1 610 GNVGRFMNHSCSPNVFWQPIEYENRGDVYLLIGLFAMKHIPPMTELTYDYG 660
**************************************************6 PP
|
Protein Sequence (Fasta) |
>AT1G17770.1|SUVH7;SET17;SDG17|Arabidopsis thaliana MDKSIPIKAIPVACVRPDLVDDVTKNTSTIPTMVSPVLTNMPSATSPLLMVPPLRTIWPSNKEWYDGDAGPSSTGPIKREASDNTNDTAHNTFAPPPEMVIPLITIRPSDDSSNYSCDAGAGPSTGPVKRGRGRPKGSKNSTPTEPKKPKVYDPNSLKVTSRGNFDSEITEAETETGNQEIVDSVMMRFDAVRRRLCQINHPEDILTTASGNCTKMGVKTNTRRRIGAVPGIHVGDIFYYWGEMCLVGLHKSNYGGIDFFTAAESAVEGHAAMCVVTAGQYDGETEGLDTLIYSGQGGTDVYGNARDQEMKGGNLALEASVSKGNDVRVVRGVIHPHENNQKIYIYDGMYLVSKFWTVTGKSGFKEFRFKLVRKPNQPPAYAIWKTVENLRNHDLIDSRQGFILEDLSFGAELLRVPLVNEVDEDDKTIPEDFDYIPSQCHSGMMTHEFHFDRQSLGCQNCRHQPCMHQNCTCVQRNGDLLPYHNNILVCRKPLIYECGGSCPCPDHCPTRLVQTGLKLHLEVFKTRNCGWGLRSWDPIRAGTFICEFAGLRKTKEEVEEDDDYLFDTSKIYQRFRWNYEPELLLEDSWEQVSEFINLPTQVLISAKEKGNVGRFMNHSCSPNVFWQPIEYENRGDVYLLIGLFAMKHIPPMTELTYDYGVSCVERSEEDEVLLYKGKKTCLCGSVKCRGSFT
|
Nucleotide Sequence (Fasta) |
>AT1G17770.1|SUV39|Arabidopsis thaliana ATGGATAAGTCTATTCCAATCAAGGCAATACCGGTTGCATGTGTCAGACCAGATTTGGTAGATGACGTGACCAAAAACACATCAACGATTCCTACAATGGTTTCACCAGTTCTAACCAATATGCCATCTGCAACATCTCCTCTTCTAATGGTCCCACCTCTTCGAACAATCTGGCCATCCAACAAGGAATGGTACGATGGAGATGCTGGTCCTAGTAGTACTGGTCCAATCAAACGAGAAGCGTCCGATAATACTAATGATACAGCACACAACACATTTGCACCTCCTCCAGAAATGGTCATACCACTGATCACCATTAGGCCAAGTGATGACTCTAGCAACTATTCTTGTGATGCGGGTGCTGGTCCTAGTACTGGTCCAGTAAAACGAGGCCGTGGCCGACCAAAAGGTTCAAAAAACTCAACGCCGACGGAGCCGAAGAAGCCAAAAGTATATGATCCCAACAGCTTAAAGGTTACATCTCGTGGGAATTTCGATTCAGAGATAACCGAAGCAGAGACAGAAACTGGAAACCAGGAGATAGTTGATTCCGTTATGATGCGGTTTGATGCGGTTAGACGACGATTATGCCAAATAAACCATCCGGAAGACATCCTTACAACGGCAAGTGGCAATTGCACGAAAATGGGTGTCAAGACAAATACAAGAAGGAGAATTGGTGCAGTTCCTGGAATACACGTCGGAGATATATTCTATTACTGGGGTGAAATGTGCTTAGTGGGGCTTCACAAATCAAATTATGGTGGTATTGATTTTTTTACGGCTGCAGAGAGTGCAGTGGAAGGCCATGCTGCTATGTGTGTGGTAACAGCAGGACAATACGATGGTGAAACCGAGGGGCTTGACACGTTGATCTACAGCGGACAGGGCGGAACGGACGTGTACGGTAACGCTCGTGATCAAGAGATGAAGGGCGGGAATCTTGCACTAGAAGCAAGTGTAAGCAAAGGGAATGACGTTAGAGTCGTGAGAGGAGTGATACATCCTCATGAGAACAATCAGAAGATATATATCTACGATGGGATGTATCTGGTTTCAAAGTTCTGGACAGTGACAGGAAAATCCGGCTTCAAGGAGTTCAGATTCAAATTGGTGAGGAAACCAAACCAACCTCCTGCTTATGCAATCTGGAAAACAGTTGAAAATCTGAGGAACCATGACTTGATTGATTCAAGGCAAGGTTTTATACTTGAAGATCTTTCTTTTGGAGCTGAGCTTTTACGAGTTCCGCTCGTTAATGAAGTTGATGAAGATGACAAAACGATTCCCGAAGATTTTGATTACATCCCCTCTCAGTGTCACTCTGGTATGATGACGCATGAATTTCATTTTGATCGTCAATCACTTGGATGCCAGAATTGTCGACATCAGCCATGCATGCATCAAAACTGCACCTGCGTGCAGAGAAACGGTGACCTGCTACCGTACCATAACAACATTTTGGTTTGTCGTAAACCATTGATTTACGAGTGCGGTGGATCTTGTCCTTGCCCCGACCATTGCCCAACCCGGTTGGTTCAAACCGGTTTGAAACTCCATTTGGAAGTGTTCAAGACAAGAAACTGTGGTTGGGGTTTACGTTCTTGGGATCCAATCCGAGCCGGAACTTTTATCTGCGAGTTTGCTGGTTTGAGAAAGACAAAAGAAGAAGTAGAAGAGGATGATGATTACTTGTTCGACACGTCAAAGATTTATCAGAGGTTCAGATGGAACTACGAACCTGAGCTTTTGCTTGAAGATAGTTGGGAACAAGTCTCTGAATTTATCAATCTTCCAACACAAGTCTTGATAAGTGCTAAGGAAAAAGGGAATGTTGGTCGGTTCATGAATCACAGTTGTTCACCGAATGTTTTCTGGCAGCCTATTGAGTATGAAAACAGAGGTGATGTATATCTTCTTATCGGACTTTTTGCTATGAAGCATATTCCTCCGATGACAG
AGTTAACATATGACTATGGAGTTTCATGTGTGGAGAGGAGCGAAGAAGATGAAGTACTTCTTTATAAAGGCAAGAAGACCTGTCTCTGTGGTTCAGTCAAATGTCGTGGCTCTTTTACCTGA
|
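Both FASTA records in this entry use a pipe-delimited header of the form `>ID|label|organism`, where the label is a semicolon-separated gene name list in the protein record and the family name in the nucleotide record. The sketch below shows one way such a header could be split into fields. It assumes the header sits on its own line, as in standard FASTA; the `FastaHeader` type and `parse_header` function are hypothetical names used only for illustration.

```python
# Minimal sketch: splitting a header such as
#   >AT1G17770.1|SUVH7;SET17;SDG17|Arabidopsis thaliana
# into its three fields. Names below are illustrative, not a WERAM API.
from typing import List, NamedTuple


class FastaHeader(NamedTuple):
    accession: str      # e.g. the Ensembl protein ID
    labels: List[str]   # gene names or family label, split on ';'
    organism: str


def parse_header(line: str) -> FastaHeader:
    """Parse a '>ID|label|organism' header line (assumed layout)."""
    if not line.startswith(">"):
        raise ValueError("not a FASTA header line")
    fields = line[1:].strip().split("|")
    if len(fields) != 3:
        raise ValueError("expected exactly three '|'-separated fields")
    accession, labels, organism = fields
    return FastaHeader(accession, labels.split(";"), organism)


if __name__ == "__main__":
    header = parse_header(">AT1G17770.1|SUVH7;SET17;SDG17|Arabidopsis thaliana")
    print(header.accession)  # AT1G17770.1
    print(header.labels)     # ['SUVH7', 'SET17', 'SDG17']
    print(header.organism)   # Arabidopsis thaliana
```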
Sequence Source |
Ensembl |
Keyword |
KW-0137--Centromere; KW-0156--Chromatin regulator; KW-0158--Chromosome; KW-0181--Complete proteome; KW-0238--DNA-binding; KW-0479--Metal-binding; KW-0489--Methyltransferase; KW-0539--Nucleus; KW-1185--Reference proteome; KW-0949--S-adenosyl-L-methionine; KW-0808--Transferase; KW-0862--Zinc
|
Interpro |
IPR025794--Hist-Lys_N-MeTrfase_plant; IPR003616--Post-SET_dom; IPR007728--Pre-SET_dom; IPR015947--PUA-like_domain; IPR001214--SET_dom; IPR003105--SRA_YDG
|
PROSITE |
PS50868--POST_SET; PS50867--PRE_SET; PS51575--SAM_MT43_SUVAR39_2; PS50280--SET; PS51015--YDG
|
Pfam |
PF05033--Pre-SET; PF02182--SAD_SRA; PF00856--SET
|
Gene Ontology |
GO:0000775--C:chromosome, centromeric region; GO:0005634--C:nucleus; GO:0003677--F:DNA binding; GO:0018024--F:histone-lysine N-methyltransferase activity; GO:0008270--F:zinc ion binding
|
Orthology |
|
Created Date |
25-Jun-2016 |