WERAM Information


Tag Content
WERAM ID WERAM-Art-0044
Ensembl Protein ID AT2G24740.1
Uniprot Accession Q9C5P0; SUVH8_ARATH; Q9TP24
Genbank Protein ID NP_180049.2
Protein Name Histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH8
Genbank Nucleotide ID NM_128034.2
Gene Name SUVH8;SDG21;SET21
Ensembl Information
Ensembl Gene ID    Ensembl Transcript ID    Ensembl Protein ID
AT2G24740          AT2G24740.1              AT2G24740.1
Details
Type    Family    Domain    Substrates    AA    References (PMIDs)
HMT     SUV39     SET       H3K9          K     20703330; 15659850
Status Reviewed
Classification
Type    Family    E-value     Score    Start    End
HMT     SUV39     3.80e-42    144.6    455      723
Organism Arabidopsis thaliana
NCBI Taxa ID 3702
Functional Description
Histone methyltransferase. Methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression.
Domain Profile
  HMT SUV39

    SUV39.txt  12 GwGv.rclddiakgsFvciyaGeiltddeaekegl.........eegdeyladldskesvenlkegyesdvplssds 78 
G+++ + ++++++ + + G il d + +egl ee++++ d d+++s ++ +g+++dv++ s+s
AT2G24740.1 455 GYAIwKLVENLRNHELIDPRQGFILGDLSFGEEGLrvplvnevdEEDKTIPDDFDYIRS--QCYSGMTNDVNVDSQS 529
77776777888888888888888888888888877777787765555556666777665..5888888888775554 PP
SUV39.txt 2 rLqvfktenkGwGvrclddiakgsFvciyaGeiltddeaekegleegdeyladldskesvenlkegyesdvplssdssntrqekdkeeseyiidakke 99
+L+vfkt+n GwG+r++d i++g+F+c+++G t++e+e e+d+yl+d+++++++ ++ +ye ++ + +d +++ +e + ++++i+ak++
AT2G24740.1 582 HLEVFKTSNCGWGLRSWDPIRAGTFICEFTGVSKTKEEVE-----EDDDYLFDTSRIYHS--FRWNYEPELLC-EDACEQVSEDANLPTQVLISAKEK 671
79******************************99999887.....56**********997..55555555544.566699999999************ PP
SUV39.txt 100 gnvgrflnHscspNlfvqnvfvdthdlr.fprvafFaskrikagtELtwdYg 150
gnvgrf+nH+c pN+f+q++ +d ++ + + r+++Fa+k+i+++tELt+dYg
AT2G24740.1 672 GNVGRFMNHNCWPNVFWQPIEYDDNNGHiYVRIGLFAMKHIPPMTELTYDYG 723
**********************8776551689*******************6 PP

Protein Sequence
MVSTPPTLLM LFDDGDAGPS TGLVHREKSD AVNEEAHATS VPPHAPPQTL WLLDNFNIED 60
SYDRDAGPST GPVHRERSDA VNEEAHATSI PPHAPPQTLW LLDNFNIEDS YDRDAGPSTS 120
PIDREASHEV NEDAHATSAP PHVMVSPLQN RRPFDQFNNQ PYDASAGPST GPGKRGRGRP 180
KGSKNGSRKP KKPKAYDNNS TDASAGPSSG LGKRRCGRPK GLKNRSRKPK KPKADDPNSK 240
MVISCPDFDS RITEAERESG NQEIVDSILM RFDAVRRRLC QLNYRKDKIL TASTNCMNLG 300
VRTNMTRRIG PIPGVQVGDI FYYWCEMCLV GLHRNTAGGI DSLLAKESGV DGPAATSVVT 360
SGKYDNETED LETLIYSGHG GKPCDQVLQR GNRALEASVR RRNEVRVIRG ELYNNEKVYI 420
YDGLYLVSDC WQVTGKSGFK EYRFKLLRKP GQPPGYAIWK LVENLRNHEL IDPRQGFILG 480
DLSFGEEGLR VPLVNEVDEE DKTIPDDFDY IRSQCYSGMT NDVNVDSQSL VQSYIHQNCT 540
CILKNCGQLP YHDNILVCRK PLIYECGGSC PTRMVETGLK LHLEVFKTSN CGWGLRSWDP 600
IRAGTFICEF TGVSKTKEEV EEDDDYLFDT SRIYHSFRWN YEPELLCEDA CEQVSEDANL 660
PTQVLISAKE KGNVGRFMNH NCWPNVFWQP IEYDDNNGHI YVRIGLFAMK HIPPMTELTY 720
DYGISCVEKT GEDEVIYKGK KICLCGSVKC RGSFG 755
Nucleotide Sequence
ATGGTCTCAA CACCTCCAAC CCTATTAATG CTATTTGATG ATGGAGATGC TGGTCCTAGT 60
ACTGGTTTAG TCCACCGAGA AAAATCTGAT GCTGTTAATG AAGAAGCCCA CGCAACATCA 120
GTACCTCCTC ATGCACCGCC ACAAACCCTT TGGCTATTAG ATAACTTCAA CATTGAGGAC 180
TCTTATGACC GAGATGCTGG TCCTAGTACT GGTCCAGTCC ACAGAGAAAG ATCAGATGCT 240
GTTAATGAAG AAGCCCACGC AACATCAATA CCTCCTCATG CACCGCCACA AACCCTTTGG 300
CTATTAGATA ACTTCAACAT TGAGGACTCT TATGACCGAG ATGCCGGTCC TAGTACTAGT 360
CCAATAGACC GAGAAGCATC TCATGAAGTC AATGAAGACG CCCACGCAAC ATCAGCACCT 420
CCTCATGTAA TGGTCTCACC ACTACAAAAC AGAAGACCAT TTGATCAATT CAACAACCAA 480
CCTTATGATG CGAGTGCTGG TCCTAGTACT GGTCCAGGAA AGCGAGGACG TGGCCGACCA 540
AAGGGTTCGA AAAATGGGTC GAGAAAGCCA AAAAAGCCAA AAGCATATGA CAACAACTCT 600
ACTGATGCGA GTGCTGGTCC TAGTTCTGGT CTAGGAAAAC GAAGATGTGG CCGACCAAAG 660
GGTTTAAAAA ACAGGTCGAG AAAGCCAAAG AAGCCAAAAG CAGATGATCC TAATAGCAAA 720
ATGGTTATCT CTTGTCCGGA TTTTGATTCA AGGATAACCG AAGCAGAGAG AGAAAGCGGA 780
AACCAGGAGA TAGTTGATTC GATTTTGATG CGGTTTGACG CGGTTAGACG GCGGTTATGC 840
CAACTAAACT ACCGGAAAGA CAAGATCCTT ACAGCGTCAA CTAATTGCAT GAATTTGGGG 900
GTCCGGACAA ACATGACGAG GAGAATTGGA CCCATTCCCG GAGTACAAGT AGGTGATATA 960
TTCTATTACT GGTGTGAAAT GTGTTTAGTC GGGCTTCACA GAAACACGGC TGGAGGCATT 1020
GATAGTCTAT TGGCTAAAGA GAGTGGAGTG GATGGTCCTG CTGCTACGAG TGTGGTAACA 1080
TCAGGAAAGT ACGACAATGA AACCGAGGAT CTTGAAACGT TGATCTATAG CGGACATGGT 1140
GGTAAACCAT GTGATCAAGT TTTACAAAGA GGAAACCGTG CACTAGAAGC AAGTGTAAGA 1200
AGAAGGAATG AAGTAAGAGT CATAAGAGGG GAGCTCTATA ATAATGAGAA GGTATATATC 1260
TATGATGGAC TTTATCTGGT CTCTGATTGC TGGCAAGTGA CAGGAAAATC CGGTTTCAAG 1320
GAGTATAGGT TCAAACTGCT GAGGAAACCA GGCCAACCTC CTGGTTATGC AATCTGGAAA 1380
TTAGTTGAAA ATTTGAGGAA TCATGAATTG ATTGATCCAA GACAAGGTTT TATACTTGGA 1440
GATCTTTCTT TTGGAGAAGA GGGTTTACGA GTTCCGCTCG TTAATGAAGT CGATGAAGAA 1500
GACAAAACGA TTCCTGACGA TTTTGACTAC ATTAGATCTC AGTGTTACTC AGGTATGACG 1560
AATGATGTTA ATGTTGATAG TCAATCACTT GTTCAGTCAT ACATTCATCA AAACTGCACA 1620
TGCATTCTAA AAAACTGTGG TCAGTTACCA TACCATGACA ACATTTTGGT TTGTCGTAAA 1680
CCATTGATTT ACGAGTGTGG TGGATCTTGT CCAACCCGAA TGGTTGAAAC CGGTTTGAAG 1740
CTCCACTTGG AAGTGTTTAA GACATCAAAC TGTGGTTGGG GTTTACGTTC TTGGGATCCA 1800
ATCCGAGCTG GAACTTTTAT CTGCGAGTTT ACTGGTGTGA GCAAGACGAA AGAAGAAGTA 1860
GAAGAAGATG ACGACTACTT GTTCGATACA TCTCGGATTT ATCATAGCTT CAGATGGAAC 1920
TATGAACCGG AGCTTTTGTG TGAAGATGCT TGCGAACAAG TCTCTGAAGA TGCTAATCTT 1980
CCAACGCAAG TCTTGATAAG CGCCAAGGAG AAAGGAAATG TTGGTCGGTT CATGAACCAC 2040
AACTGTTGGC CTAATGTTTT CTGGCAGCCT ATAGAGTATG ATGATAACAA CGGTCACATA 2100
TATGTTCGCA TTGGACTTTT TGCGATGAAG CACATTCCTC CAATGACAGA GTTAACATAT 2160
GACTATGGAA TTTCTTGTGT GGAGAAGACT GGAGAAGACG AAGTAATTTA TAAAGGAAAG 2220
AAGATTTGTC TCTGTGGTTC AGTCAAATGT CGTGGCTCTT TTGGCTGA 2268
Sequence Source Ensembl
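The sequences above are laid out in 10-residue blocks with a running position counter at the end of each line. A quick consistency check for a record like this is that a coding sequence ending in a stop codon should be 3 × (protein length + 1) nucleotides long, which for a 755-residue protein gives 3 × 756 = 2268. The following is a minimal Python sketch of that check, applied only to the tail of each sequence (the last protein line and the last two nucleotide lines). The function names `strip_numbered_blocks` and `cds_length_matches` are hypothetical helpers introduced for illustration and are not part of WERAM or Ensembl.

```python
def strip_numbered_blocks(text: str) -> str:
    """Join 10-residue blocks into one string, dropping position counters."""
    residues = []
    for line in text.splitlines():
        # Purely alphabetic tokens are sequence; integer tokens are counters.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues).upper()


def cds_length_matches(protein_len: int, cds: str, has_stop: bool = True) -> bool:
    """True if the CDS length equals 3 nt per residue, plus a stop codon if present."""
    expected = 3 * (protein_len + (1 if has_stop else 0))
    return len(cds) == expected


# Tail of the protein sequence as printed in the record (counter included).
protein_tail = "DYGISCVEKT GEDEVIYKGK KICLCGSVKC RGSFG 755"

# Last two nucleotide lines; the final line is given here without its counter.
cds_tail = """GACTATGGAA TTTCTTGTGT GGAGAAGACT GGAGAAGACG AAGTAATTTA TAAAGGAAAG 2220
AAGATTTGTC TCTGTGGTTC AGTCAAATGT CGTGGCTCTT TTGGCTGA"""

protein = strip_numbered_blocks(protein_tail)
cds = strip_numbered_blocks(cds_tail)

print(len(protein), len(cds), cds_length_matches(len(protein), cds))
```

The same two helpers can be applied to the full protein and nucleotide blocks. Filtering on alphabetic tokens assumes the sequence uses letters only, with no `*` or `-` symbols.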
Keyword

KW-0137--Centromere
KW-0156--Chromatin regulator
KW-0158--Chromosome
KW-0181--Complete proteome
KW-0238--DNA-binding
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-1185--Reference proteome
KW-0949--S-adenosyl-L-methionine
KW-0808--Transferase
KW-0862--Zinc

Interpro

IPR025794--Hist-Lys_N-MeTrfase_plant
IPR003616--Post-SET_dom
IPR007728--Pre-SET_dom
IPR015947--PUA-like_domain
IPR001214--SET_dom
IPR003105--SRA_YDG

PROSITE

PS50868--POST_SET
PS51575--SAM_MT43_SUVAR39_2
PS50280--SET
PS51015--YDG

Pfam

PF05033--Pre-SET
PF02182--SAD_SRA
PF00856--SET

Gene Ontology

GO:0000775--C:chromosome, centromeric region
GO:0005634--C:nucleus
GO:0003677--F:DNA binding
GO:0042054--F:histone methyltransferase activity
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0008270--F:zinc ion binding
GO:0048366--P:leaf development
GO:0008361--P:regulation of cell size
GO:0040029--P:regulation of gene expression, epigenetic

Orthology
WERAM ID         Ensembl Protein ID     Species                     Identity  E-value  Score
WERAM-Arl-0009   Al_scaffold_0004_519   Arabidopsis lyrata          67        0.0      872
WERAM-Bro-0115   Bo6g105260.1           Brassica oleracea           58        0.0      658
WERAM-Brr-0018   Bra004258.1-P          Brassica rapa               57        0.0      654
WERAM-Pot-0040   POPTR_0003s18740.1     Populus trichocarpa         48        3e-134   476
WERAM-Thc-0121   EOY19472               Theobroma cacao             46        9e-131   465
WERAM-Glm-0034   GLYMA04G15120.1        Glycine max                 43        1e-130   464
WERAM-Prp-0040   EMJ05444               Prunus persica              47        2e-129   460
WERAM-Met-0060   KEH32911               Medicago truncatula         45        4e-125   446
WERAM-Viv-0081   VIT_13s0047g00120.t01  Vitis vinifera              47        8e-125   445
WERAM-Tra-0138   Traes_4AS_6A2EE5A91.2  Triticum aestivum           44        1e-119   427
WERAM-Brd-0028   BRADI1G53840.1         Brachypodium distachyon     46        2e-118   424
WERAM-Hov-0083   MLOC_63544.1           Hordeum vulgare             44        4e-118   423
WERAM-Sei-0106   Si028938m              Setaria italica             43        1e-117   421
WERAM-Orl-0082   KN539783.1_FGP008      Oryza longistaminata        43        2e-117   421
WERAM-Org-0110   ORGLA11G0160500.1      Oryza glaberrima            43        2e-117   421
WERAM-Orb-0118   OBART11G19080.1        Oryza barthii               43        2e-117   421
WERAM-Ors-0105   OS11T0602200-01        Oryza sativa                43        2e-117   421
WERAM-Orr-0118   ORUFI11G20440.1        Oryza rufipogon             43        2e-117   421
WERAM-Zem-0004   AC233961.1_FGP001      Zea mays                    46        3e-117   420
WERAM-Sob-0085   Sb06g001340.1          Sorghum bicolor             44        3e-117   420
WERAM-Orni-0117  ONIVA11G18710.1        Oryza nivara                44        9e-117   418
WERAM-Orp-0070   OPUNC07G07250.1        Oryza punctata              43        1e-116   418
WERAM-Aet-0087   EMT20679               Aegilops tauschii           43        9e-116   415
WERAM-Orbr-0036  OB0348G10010.1         Oryza brachyantha           46        4e-115   413
WERAM-Orgl-0113  OGLUM11G18450.2        Oryza glumaepatula          43        1e-114   411
WERAM-Orm-0054   OMERI05G19070.1        Oryza meridionalis          43        2e-108   390
WERAM-Ori-0115   BGIOSGA033726-PA       Oryza indica                46        8e-107   385
WERAM-Lep-0057   LPERR05G16930.1        Leersia perrieri            44        5e-106   383
WERAM-Tru-0079   TRIUR3_29078-P1        Triticum urartu             40        6e-102   369
WERAM-Sot-0091   PGSC0003DMT400077329   Solanum tuberosum           42        1e-101   368
WERAM-Sol-0126   Solyc10g077070.1.1     Solanum lycopersicum        40        4e-101   366
WERAM-Amt-0085   ERM98215               Amborella trichopoda        42        5e-92    336
WERAM-Sem-0004   EFJ22703               Selaginella moellendorffii  36        6e-72    269
WERAM-Mua-0084   GSMUA_Achr5P13120_001  Musa acuminata              36        2e-64    244
Created Date 25-Jun-2016