Tag |
Content |
WERAM ID |
WERAM-Art-0044 |
Ensembl Protein ID |
AT2G24740.1 |
Uniprot Accession |
Q9C5P0; SUVH8_ARATH; Q9TP24 |
Genbank Protein ID |
NP_180049.2 |
Protein Name |
Histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH8 |
Genbank Nucleotide ID |
NM_128034.2 |
Gene Name |
SUVH8;SDG21;SET21 |
Ensembl Information |
|
Details |
Type |
Family |
Domain |
Substrates |
AA |
References (PMIDs) |
HMT |
SUV39 |
SET |
H3K9 |
K |
20703330; 15659850 |
|
Status |
Reviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HMT |
SUV39 |
3.80e-42 |
144.6 |
455 |
723 |
|
Organism |
Arabidopsis thaliana |
NCBI Taxa ID |
3702 |
Functional Description (View)Functional Description
Histone methyltransferase. Methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression. |
Histone methyltransferase. Methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression.
|
Domain Profile |
HMT SUV39
SUV39.txt 12 GwGv.rclddiakgsFvciyaGeiltddeaekegl.........eegdeyladldskesvenlkegyesdvplssds 78 G+++ + ++++++ + + G il d + +egl ee++++ d d+++s ++ +g+++dv++ s+s AT2G24740.1 455 GYAIwKLVENLRNHELIDPRQGFILGDLSFGEEGLrvplvnevdEEDKTIPDDFDYIRS--QCYSGMTNDVNVDSQS 529 77776777888888888888888888888888877777787765555556666777665..5888888888775554 PP SUV39.txt 2 rLqvfktenkGwGvrclddiakgsFvciyaGeiltddeaekegleegdeyladldskesvenlkegyesdvplssdssntrqekdkeeseyiidakke 99 +L+vfkt+n GwG+r++d i++g+F+c+++G t++e+e e+d+yl+d+++++++ ++ +ye ++ + +d +++ +e + ++++i+ak++ AT2G24740.1 582 HLEVFKTSNCGWGLRSWDPIRAGTFICEFTGVSKTKEEVE-----EDDDYLFDTSRIYHS--FRWNYEPELLC-EDACEQVSEDANLPTQVLISAKEK 671 79******************************99999887.....56**********997..55555555544.566699999999************ PP SUV39.txt 100 gnvgrflnHscspNlfvqnvfvdthdlr.fprvafFaskrikagtELtwdYg 150 gnvgrf+nH+c pN+f+q++ +d ++ + + r+++Fa+k+i+++tELt+dYg AT2G24740.1 672 GNVGRFMNHNCWPNVFWQPIEYDDNNGHiYVRIGLFAMKHIPPMTELTYDYG 723 **********************8776551689*******************6 PP
|
Protein Sequence (Fasta) | MVSTPPTLLM LFDDGDAGPS TGLVHREKSD AVNEEAHATS VPPHAPPQTL WLLDNFNIED 60 SYDRDAGPST GPVHRERSDA VNEEAHATSI PPHAPPQTLW LLDNFNIEDS YDRDAGPSTS 120 PIDREASHEV NEDAHATSAP PHVMVSPLQN RRPFDQFNNQ PYDASAGPST GPGKRGRGRP 180 KGSKNGSRKP KKPKAYDNNS TDASAGPSSG LGKRRCGRPK GLKNRSRKPK KPKADDPNSK 240 MVISCPDFDS RITEAERESG NQEIVDSILM RFDAVRRRLC QLNYRKDKIL TASTNCMNLG 300 VRTNMTRRIG PIPGVQVGDI FYYWCEMCLV GLHRNTAGGI DSLLAKESGV DGPAATSVVT 360 SGKYDNETED LETLIYSGHG GKPCDQVLQR GNRALEASVR RRNEVRVIRG ELYNNEKVYI 420 YDGLYLVSDC WQVTGKSGFK EYRFKLLRKP GQPPGYAIWK LVENLRNHEL IDPRQGFILG 480 DLSFGEEGLR VPLVNEVDEE DKTIPDDFDY IRSQCYSGMT NDVNVDSQSL VQSYIHQNCT 540 CILKNCGQLP YHDNILVCRK PLIYECGGSC PTRMVETGLK LHLEVFKTSN CGWGLRSWDP 600 IRAGTFICEF TGVSKTKEEV EEDDDYLFDT SRIYHSFRWN YEPELLCEDA CEQVSEDANL 660 PTQVLISAKE KGNVGRFMNH NCWPNVFWQP IEYDDNNGHI YVRIGLFAMK HIPPMTELTY 720 DYGISCVEKT GEDEVIYKGK KICLCGSVKC RGSFG 755Protein Fasta Sequence
>AT2G24740.1|SUVH8;SDG21;SET21|Arabidopsis thaliana MVSTPPTLLMLFDDGDAGPSTGLVHREKSDAVNEEAHATSVPPHAPPQTLWLLDNFNIEDSYDRDAGPSTGPVHRERSDAVNEEAHATSIPPHAPPQTLWLLDNFNIEDSYDRDAGPSTSPIDREASHEVNEDAHATSAPPHVMVSPLQNRRPFDQFNNQPYDASAGPSTGPGKRGRGRPKGSKNGSRKPKKPKAYDNNSTDASAGPSSGLGKRRCGRPKGLKNRSRKPKKPKADDPNSKMVISCPDFDSRITEAERESGNQEIVDSILMRFDAVRRRLCQLNYRKDKILTASTNCMNLGVRTNMTRRIGPIPGVQVGDIFYYWCEMCLVGLHRNTAGGIDSLLAKESGVDGPAATSVVTSGKYDNETEDLETLIYSGHGGKPCDQVLQRGNRALEASVRRRNEVRVIRGELYNNEKVYIYDGLYLVSDCWQVTGKSGFKEYRFKLLRKPGQPPGYAIWKLVENLRNHELIDPRQGFILGDLSFGEEGLRVPLVNEVDEEDKTIPDDFDYIRSQCYSGMTNDVNVDSQSLVQSYIHQNCTCILKNCGQLPYHDNILVCRKPLIYECGGSCPTRMVETGLKLHLEVFKTSNCGWGLRSWDPIRAGTFICEFTGVSKTKEEVEEDDDYLFDTSRIYHSFRWNYEPELLCEDACEQVSEDANLPTQVLISAKEKGNVGRFMNHNCWPNVFWQPIEYDDNNGHIYVRIGLFAMKHIPPMTELTYDYGISCVEKTGEDEVIYKGKKICLCGSVKCRGSFG
|
Nucleotide Sequence (Fasta) | ATGGTCTCAA CACCTCCAAC CCTATTAATG CTATTTGATG ATGGAGATGC TGGTCCTAGT 60 ACTGGTTTAG TCCACCGAGA AAAATCTGAT GCTGTTAATG AAGAAGCCCA CGCAACATCA 120 GTACCTCCTC ATGCACCGCC ACAAACCCTT TGGCTATTAG ATAACTTCAA CATTGAGGAC 180 TCTTATGACC GAGATGCTGG TCCTAGTACT GGTCCAGTCC ACAGAGAAAG ATCAGATGCT 240 GTTAATGAAG AAGCCCACGC AACATCAATA CCTCCTCATG CACCGCCACA AACCCTTTGG 300 CTATTAGATA ACTTCAACAT TGAGGACTCT TATGACCGAG ATGCCGGTCC TAGTACTAGT 360 CCAATAGACC GAGAAGCATC TCATGAAGTC AATGAAGACG CCCACGCAAC ATCAGCACCT 420 CCTCATGTAA TGGTCTCACC ACTACAAAAC AGAAGACCAT TTGATCAATT CAACAACCAA 480 CCTTATGATG CGAGTGCTGG TCCTAGTACT GGTCCAGGAA AGCGAGGACG TGGCCGACCA 540 AAGGGTTCGA AAAATGGGTC GAGAAAGCCA AAAAAGCCAA AAGCATATGA CAACAACTCT 600 ACTGATGCGA GTGCTGGTCC TAGTTCTGGT CTAGGAAAAC GAAGATGTGG CCGACCAAAG 660 GGTTTAAAAA ACAGGTCGAG AAAGCCAAAG AAGCCAAAAG CAGATGATCC TAATAGCAAA 720 ATGGTTATCT CTTGTCCGGA TTTTGATTCA AGGATAACCG AAGCAGAGAG AGAAAGCGGA 780 AACCAGGAGA TAGTTGATTC GATTTTGATG CGGTTTGACG CGGTTAGACG GCGGTTATGC 840 CAACTAAACT ACCGGAAAGA CAAGATCCTT ACAGCGTCAA CTAATTGCAT GAATTTGGGG 900 GTCCGGACAA ACATGACGAG GAGAATTGGA CCCATTCCCG GAGTACAAGT AGGTGATATA 960 TTCTATTACT GGTGTGAAAT GTGTTTAGTC GGGCTTCACA GAAACACGGC TGGAGGCATT 1020 GATAGTCTAT TGGCTAAAGA GAGTGGAGTG GATGGTCCTG CTGCTACGAG TGTGGTAACA 1080 TCAGGAAAGT ACGACAATGA AACCGAGGAT CTTGAAACGT TGATCTATAG CGGACATGGT 1140 GGTAAACCAT GTGATCAAGT TTTACAAAGA GGAAACCGTG CACTAGAAGC AAGTGTAAGA 1200 AGAAGGAATG AAGTAAGAGT CATAAGAGGG GAGCTCTATA ATAATGAGAA GGTATATATC 1260 TATGATGGAC TTTATCTGGT CTCTGATTGC TGGCAAGTGA CAGGAAAATC CGGTTTCAAG 1320 GAGTATAGGT TCAAACTGCT GAGGAAACCA GGCCAACCTC CTGGTTATGC AATCTGGAAA 1380 TTAGTTGAAA ATTTGAGGAA TCATGAATTG ATTGATCCAA GACAAGGTTT TATACTTGGA 1440 GATCTTTCTT TTGGAGAAGA GGGTTTACGA GTTCCGCTCG TTAATGAAGT CGATGAAGAA 1500 GACAAAACGA TTCCTGACGA TTTTGACTAC ATTAGATCTC AGTGTTACTC AGGTATGACG 1560 AATGATGTTA ATGTTGATAG TCAATCACTT GTTCAGTCAT ACATTCATCA AAACTGCACA 1620 TGCATTCTAA AAAACTGTGG TCAGTTACCA TACCATGACA ACATTTTGGT TTGTCGTAAA 1680 CCATTGATTT ACGAGTGTGG TGGATCTTGT CCAACCCGAA TGGTTGAAAC CGGTTTGAAG 1740 CTCCACTTGG AAGTGTTTAA GACATCAAAC TGTGGTTGGG GTTTACGTTC TTGGGATCCA 1800 ATCCGAGCTG GAACTTTTAT CTGCGAGTTT ACTGGTGTGA GCAAGACGAA AGAAGAAGTA 1860 GAAGAAGATG ACGACTACTT GTTCGATACA TCTCGGATTT ATCATAGCTT CAGATGGAAC 1920 TATGAACCGG AGCTTTTGTG TGAAGATGCT TGCGAACAAG TCTCTGAAGA TGCTAATCTT 1980 CCAACGCAAG TCTTGATAAG CGCCAAGGAG AAAGGAAATG TTGGTCGGTT CATGAACCAC 2040 AACTGTTGGC CTAATGTTTT CTGGCAGCCT ATAGAGTATG ATGATAACAA CGGTCACATA 2100 TATGTTCGCA TTGGACTTTT TGCGATGAAG CACATTCCTC CAATGACAGA GTTAACATAT 2160 GACTATGGAA TTTCTTGTGT GGAGAAGACT GGAGAAGACG AAGTAATTTA TAAAGGAAAG 2220 AAGATTTGTC TCTGTGGTTC AGTCAAATGT CGTGGCTCTT TTGGCTGA
2269Nucleotide Fasta Sequence
>AT2G24740.1|SUV39|Arabidopsis thaliana ATGGTCTCAACACCTCCAACCCTATTAATGCTATTTGATGATGGAGATGCTGGTCCTAGTACTGGTTTAGTCCACCGAGAAAAATCTGATGCTGTTAATGAAGAAGCCCACGCAACATCAGTACCTCCTCATGCACCGCCACAAACCCTTTGGCTATTAGATAACTTCAACATTGAGGACTCTTATGACCGAGATGCTGGTCCTAGTACTGGTCCAGTCCACAGAGAAAGATCAGATGCTGTTAATGAAGAAGCCCACGCAACATCAATACCTCCTCATGCACCGCCACAAACCCTTTGGCTATTAGATAACTTCAACATTGAGGACTCTTATGACCGAGATGCCGGTCCTAGTACTAGTCCAATAGACCGAGAAGCATCTCATGAAGTCAATGAAGACGCCCACGCAACATCAGCACCTCCTCATGTAATGGTCTCACCACTACAAAACAGAAGACCATTTGATCAATTCAACAACCAACCTTATGATGCGAGTGCTGGTCCTAGTACTGGTCCAGGAAAGCGAGGACGTGGCCGACCAAAGGGTTCGAAAAATGGGTCGAGAAAGCCAAAAAAGCCAAAAGCATATGACAACAACTCTACTGATGCGAGTGCTGGTCCTAGTTCTGGTCTAGGAAAACGAAGATGTGGCCGACCAAAGGGTTTAAAAAACAGGTCGAGAAAGCCAAAGAAGCCAAAAGCAGATGATCCTAATAGCAAAATGGTTATCTCTTGTCCGGATTTTGATTCAAGGATAACCGAAGCAGAGAGAGAAAGCGGAAACCAGGAGATAGTTGATTCGATTTTGATGCGGTTTGACGCGGTTAGACGGCGGTTATGCCAACTAAACTACCGGAAAGACAAGATCCTTACAGCGTCAACTAATTGCATGAATTTGGGGGTCCGGACAAACATGACGAGGAGAATTGGACCCATTCCCGGAGTACAAGTAGGTGATATATTCTATTACTGGTGTGAAATGTGTTTAGTCGGGCTTCACAGAAACACGGCTGGAGGCATTGATAGTCTATTGGCTAAAGAGAGTGGAGTGGATGGTCCTGCTGCTACGAGTGTGGTAACATCAGGAAAGTACGACAATGAAACCGAGGATCTTGAAACGTTGATCTATAGCGGACATGGTGGTAAACCATGTGATCAAGTTTTACAAAGAGGAAACCGTGCACTAGAAGCAAGTGTAAGAAGAAGGAATGAAGTAAGAGTCATAAGAGGGGAGCTCTATAATAATGAGAAGGTATATATCTATGATGGACTTTATCTGGTCTCTGATTGCTGGCAAGTGACAGGAAAATCCGGTTTCAAGGAGTATAGGTTCAAACTGCTGAGGAAACCAGGCCAACCTCCTGGTTATGCAATCTGGAAATTAGTTGAAAATTTGAGGAATCATGAATTGATTGATCCAAGACAAGGTTTTATACTTGGAGATCTTTCTTTTGGAGAAGAGGGTTTACGAGTTCCGCTCGTTAATGAAGTCGATGAAGAAGACAAAACGATTCCTGACGATTTTGACTACATTAGATCTCAGTGTTACTCAGGTATGACGAATGATGTTAATGTTGATAGTCAATCACTTGTTCAGTCATACATTCATCAAAACTGCACATGCATTCTAAAAAACTGTGGTCAGTTACCATACCATGACAACATTTTGGTTTGTCGTAAACCATTGATTTACGAGTGTGGTGGATCTTGTCCAACCCGAATGGTTGAAACCGGTTTGAAGCTCCACTTGGAAGTGTTTAAGACATCAAACTGTGGTTGGGGTTTACGTTCTTGGGATCCAATCCGAGCTGGAACTTTTATCTGCGAGTTTACTGGTGTGAGCAAGACGAAAGAAGAAGTAGAAGAAGATGACGACTACTTGTTCGATACATCTCGGATTTATCATAGCTTCAGATGGAACTATGAACCGGAGCTTTTGTGTGAAGATGCTTGCGAACAAGTCTCTGAAGATGCTAATCTTCCAACGCAAGTCTTGATAAGCGCCAAGGAGAAAGGAAATGTTGGTCGGTTCATGAACCACAACTGTTGGCCTAATGTTTTCTGGCAGCCTATAGAGTATGATGATAACAACGGTCACATATATGTTCGCATTGGACTTTTTGCGATGAAGCACATTCCTCCAATGACAGAGTTAACATATGACTATGGAATTTCTTGTGTGGAGAAGACTGGAGAAGACGAAGTAATTTATAAAGGAAAGAAGATTTGTCTCTGTGGTTCAGTCAAATGTCGTGGCTCTTTTGGCTGA
|
Sequence Source |
Ensembl |
Keyword |
KW-0137--Centromere KW-0156--Chromatin regulator KW-0158--Chromosome KW-0181--Complete proteome KW-0238--DNA-binding KW-0479--Metal-binding KW-0489--Methyltransferase KW-0539--Nucleus KW-1185--Reference proteome KW-0949--S-adenosyl-L-methionine KW-0808--Transferase KW-0862--Zinc --
|
Interpro |
IPR025794--Hist-Lys_N-MeTrfase_plant IPR003616--Post-SET_dom IPR007728--Pre-SET_dom IPR015947--PUA-like_domain IPR001214--SET_dom IPR003105--SRA_YDG
|
PROSITE |
PS50868--POST_SET PS51575--SAM_MT43_SUVAR39_2 PS50280--SET PS51015--YDG
|
Pfam |
PF05033--Pre-SET PF02182--SAD_SRA PF00856--SET
|
Gene Ontology |
GO:0000775--C:chromosome, centromeric region GO:0005634--C:nucleus GO:0003677--F:DNA binding GO:0042054--F:histone methyltransferase activity GO:0018024--F:histone-lysine N-methyltransferase activity GO:0008270--F:zinc ion binding GO:0048366--P:leaf development GO:0008361--P:regulation of cell size GO:0040029--P:regulation of gene expression, epigenetic
|
Orthology |
|
Created Date |
25-Jun-2016 |