WERAM Information


Tag                    Content
WERAM ID               WERAM-Art-0123
Ensembl Protein ID     AT5G13960.1
UniProt Accession      Q8GZB6; SUVH4_ARATH; Q9C5P3; Q9FFX9
GenBank Protein ID     NP_196900.1
Protein Name           Histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH4
GenBank Nucleotide ID  NM_121399.2
Gene Name              SUVH4; SET33; KYP; SDG33
Ensembl Information
Ensembl Gene ID  Ensembl Transcript ID  Ensembl Protein ID
AT5G13960        AT5G13960.1            AT5G13960.1
Details
Type  Family  Domain  Substrates  AA  References (PMIDs)
HMT   SUV39   SET     H3K9        K   20703330; 15659850; 12067650
Status Reviewed
Classification
Type  Family  E-value   Score  Start  End
HMT   SUV39   1.00e-43  149.6  447    594
Organism Arabidopsis thaliana
NCBI Taxa ID 3702
Functional Description
Histone methyltransferase that methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation is a specific tag for epigenetic transcriptional repression. Silencing via DNA CpNpG methylation requires the chromomethylase CMT3 to be targeted to methylated histones, probably through an interaction with an HP1-like adapter. Through this activity, KYP is directly required for the maintenance of DNA CpNpG and asymmetric methylation. It is also involved in the silencing of transposable elements.
Domain Profile
  HMT SUV39

    SUV39.txt   2 rLqvfktenkGwGvrclddiakgsFvciyaGeiltddeaekegleegdeyladldskesv..enlkegyesdvplssdssntrqekdkeeseyiidak 97 
+L+vf++++kGw+vr+++ i++gs vc+y+G + +++++++ +++ey++++d ++++ +++ +dv++ +++ +++++d++ e++ida
AT5G13960.1 447 NLEVFRSAKKGWAVRSWEYIPAGSPVCEYIGVVRRTADVDTI---SDNEYIFEIDCQQTMqgLGGRQRRLRDVAVPMNNGVSQSSEDENAPEFCIDAG 541
79*************************************998...9***********999888999******************************** PP
SUV39.txt 98 kegnvgrflnHscspNlfvqnvfvdthdlrfprvafFaskrikagtELtwdYg 150
++gn++rf+nHsc+pNlfvq v+ +++d r+ rv++Fa+++i++++ELt+dYg
AT5G13960.1 542 STGNFARFINHSCEPNLFVQCVLSSHQDIRLARVVLFAADNISPMQELTYDYG 594
****************************************************6 PP
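The Start and End values in the Classification table (447 and 594) read as 1-based, inclusive residue positions, which matches the coordinates printed beside AT5G13960.1 in the alignment. A minimal Python sketch of pulling such a region out of a sequence follows; the helper name `extract_region` is hypothetical, and the fragment is residues 441-460 copied from the Protein Sequence section of this record.

```python
def extract_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) of a sequence."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    # Python slices are 0-based and end-exclusive, hence start - 1.
    return sequence[start - 1:end]


# Residues 441-460 of AT5G13960.1 (hypothetical working fragment).
fragment = "QKRLRFNLEVFRSAKKGWAV"
offset = 441  # 1-based position of the first residue in `fragment`

# Shift full-protein coordinates into fragment coordinates.
domain_start = extract_region(fragment, 447 - offset + 1, 460 - offset + 1)
print(domain_start)  # NLEVFRSAKKGWAV

# Length of the annotated domain region under inclusive coordinates.
domain_length = 594 - 447 + 1
print(domain_length)  # 148
```

The printed fragment begins with the same residues as the target line of the alignment (NLEVFRSAKKGW...), which is a quick way to check that the coordinates are being interpreted as intended.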

Protein Sequence
MAGKRKRANA PDQTERRSSV RVQKVRQKAL DEKARLVQER VKLLSDRKSE ICVDDTELHE 60
KEEENVDGSP KRRSPPKLTA MQKGKQKLSV SLNGKDVNLE PHLKVTKCLR LFNKQYLLCV 120
QAKLSRPDLK GVTEMIKAKA ILYPRKIIGD LPGIDVGHRF FSRAEMCAVG FHNHWLNGID 180
YMSMEYEKEY SNYKLPLAVS IVMSGQYEDD LDNADTVTYT GQGGHNLTGN KRQIKDQLLE 240
RGNLALKHCC EYNVPVRVTR GHNCKSSYTK RVYTYDGLYK VEKFWAQKGV SGFTVYKYRL 300
KRLEGQPELT TDQVNFVAGR IPTSTSEIEG LVCEDISGGL EFKGIPATNR VDDSPVSPTS 360
GFTYIKSLII EPNVIIPKSS TGCNCRGSCT DSKKCACAKL NGGNFPYVDL NDGRLIESRD 420
VVFECGPHCG CGPKCVNRTS QKRLRFNLEV FRSAKKGWAV RSWEYIPAGS PVCEYIGVVR 480
RTADVDTISD NEYIFEIDCQ QTMQGLGGRQ RRLRDVAVPM NNGVSQSSED ENAPEFCIDA 540
GSTGNFARFI NHSCEPNLFV QCVLSSHQDI RLARVVLFAA DNISPMQELT YDYGYALDSV 600
HGPDGKVKQL ACYCGALNCR KRLY 624
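The sequence above is printed in blocks of ten residues with a running position counter at the end of each line. To use it elsewhere, the blocks usually need to be joined into one string. The sketch below shows one way to do that in Python; `parse_numbered_sequence` is a hypothetical helper, and the sample is the last two lines of the protein sequence in this record.

```python
import re


def parse_numbered_sequence(text: str) -> str:
    """Join a block-formatted sequence into one contiguous string.

    Drops the running position counter at the end of each line and
    all whitespace between blocks.
    """
    parts = []
    for line in text.splitlines():
        # Remove the trailing counter (for example " 600"), if present.
        line = re.sub(r"\s+\d+\s*$", "", line)
        # Remove the spaces that separate the ten-residue blocks.
        parts.append(re.sub(r"\s+", "", line))
    return "".join(parts)


# Last two lines of the protein sequence (residues 541-624).
sample = """GSTGNFARFI NHSCEPNLFV QCVLSSHQDI RLARVVLFAA DNISPMQELT YDYGYALDSV 600
HGPDGKVKQL ACYCGALNCR KRLY 624"""

seq = parse_numbered_sequence(sample)
print(len(seq))  # 84
```

The sample covers 84 residues starting at position 541, which agrees with the final counter of 624.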
Nucleotide Sequence
ATACCAAAAA AAAGAGGAGA AAAAAAAAAC TCTCATCTTC GTCGTCTTCA CAGTTTCAAT 60
CTCGCGCTGC TTAGTCTCCT TCCGTCTCTT CTTCTCCGCT AGCTTTTCCA GGGGAAAAAG 120
AGTGATCGAT GGCTGGAAAA AGGAAACGAG CTAATGCTCC TGACCAAACA GAGCGAAGAT 180
CGAGTGTTCG GGTTCAGAAA GTGAGACAGA AAGCGTTAGA TGAGAAGGCG CGTTTAGTAC 240
AGGAGAGGGT TAAGCTCCTC AGTGACAGAA AGAGTGAAAT TTGTGTCGAT GACACTGAGT 300
TACATGAGAA AGAAGAGGAA AATGTCGATG GGAGCCCTAA ACGAAGAAGC CCTCCAAAGC 360
TAACCGCAAT GCAGAAAGGA AAGCAGAAAT TGAGTGTTTC TCTGAATGGT AAGGACGTGA 420
ACTTGGAACC TCATCTCAAA GTGACAAAGT GTCTGAGGTT ATTTAACAAG CAATATCTCC 480
TCTGTGTCCA GGCTAAGTTG AGCAGGCCTG ATTTGAAGGG TGTAACTGAG ATGATAAAAG 540
CTAAGGCGAT ATTGTACCCA AGAAAAATAA TCGGTGACCT TCCAGGTATA GACGTTGGAC 600
ACCGTTTTTT TTCAAGAGCT GAAATGTGTG CTGTAGGATT CCATAACCAT TGGCTAAATG 660
GCATTGATTA TATGTCAATG GAATACGAAA AAGAGTATAG TAACTACAAA TTACCGCTTG 720
CTGTTTCTAT TGTTATGTCG GGCCAGTACG AGGATGATCT AGACAATGCA GATACAGTGA 780
CTTACACTGG ACAGGGAGGG CATAACTTAA CTGGTAATAA ACGTCAGATA AAGGATCAAC 840
TTTTAGAACG AGGGAATTTG GCGCTAAAGC ACTGCTGCGA ATATAATGTG CCTGTCAGAG 900
TAACTCGTGG TCACAATTGC AAAAGTAGCT ATACCAAACG AGTATACACT TATGATGGAC 960
TGTACAAGGT TGAAAAGTTC TGGGCACAAA AGGGCGTTTC AGGATTTACA GTGTATAAGT 1020
ACCGACTGAA ACGATTGGAG GGGCAACCAG AACTAACTAC TGATCAGGTC AACTTTGTTG 1080
CTGGACGCAT ACCAACGAGT ACTTCAGAAA TTGAGGGTTT GGTATGTGAG GACATCTCCG 1140
GAGGGCTAGA ATTTAAGGGT ATCCCCGCCA CTAATCGTGT TGATGATTCA CCAGTTTCAC 1200
CAACATCTGG TTTCACATAC ATCAAATCTT TGATTATTGA GCCTAATGTC ATAATTCCAA 1260
AGAGTTCAAC TGGGTGTAAC TGCCGAGGCA GCTGCACTGA CTCAAAGAAA TGTGCATGTG 1320
CTAAGCTTAA TGGGGGTAAC TTTCCATATG TTGACCTTAA TGATGGCAGA TTAATTGAGT 1380
CTCGAGATGT TGTATTTGAA TGTGGTCCTC ACTGTGGGTG TGGGCCAAAA TGTGTCAACC 1440
GAACTTCTCA GAAGCGTCTA AGATTCAATC TTGAGGTTTT CCGCTCTGCA AAGAAGGGTT 1500
GGGCAGTTAG ATCATGGGAG TACATACCAG CTGGTTCACC AGTATGTGAG TACATAGGAG 1560
TTGTCAGGAG AACTGCTGAT GTGGATACTA TCTCTGACAA TGAATACATA TTTGAGATTG 1620
ACTGCCAACA GACAATGCAA GGTCTTGGTG GAAGACAGAG AAGACTAAGA GATGTTGCTG 1680
TACCAATGAA TAATGGAGTC AGTCAGAGCA GTGAAGATGA GAATGCGCCA GAGTTCTGCA 1740
TTGATGCTGG TTCAACAGGA AACTTTGCTA GGTTTATAAA TCACAGTTGT GAACCAAACC 1800
TATTTGTTCA GTGCGTCCTG AGTTCTCACC AGGATATAAG GCTTGCCCGT GTGGTTCTTT 1860
TCGCAGCTGA CAACATTTCC CCAATGCAGG AGCTCACTTA CGACTATGGA TATGCGCTTG 1920
ATAGCGTTCA TGGACCGGAT GGGAAGGTGA AGCAGCTCGC TTGCTACTGT GGAGCGCTAA 1980
ATTGTAGGAA ACGCCTTTAC TAACAAAGGC TATTGGTGAA CTTATTTTTT GGTCACTCTA 2040
TTTTGGGGAG TACATAGATA GCAACTATCT CCCAAGTGGA AGAACCCTTC ACATTTTATT 2100
TAAGGGACTT TAGCTTCTTC TGCAGCGTAC GTGCTGCCTT TTTCGGCATT GTTGTTTCCT 2160
GAATTCTATT GTTGCCATTG GCATCTTATC TATCGGGTCA ACAATATTAA TC 2213
Sequence Source Ensembl
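The nucleotide sequence is a transcript that includes untranslated flanks, so the protein sequence corresponds only to the stretch from the start codon to the first in-frame stop codon. As a rough sketch, the Python below translates from the first ATG using the standard genetic code. Treating the first ATG as the start codon is an assumption that holds for the fragment shown but not for transcripts in general; `translate_from_first_atg` is a hypothetical helper, and the fragment is bases 121-160 of this record.

```python
# Standard genetic code, built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_from_first_atg(mrna: str) -> str:
    """Translate from the first ATG until a stop codon or the end."""
    mrna = mrna.upper().replace("U", "T")
    start = mrna.find("ATG")
    if start == -1:
        return ""
    protein = []
    for pos in range(start, len(mrna) - 2, 3):
        # Unknown codons (for example containing N) become "X".
        residue = CODON_TABLE.get(mrna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# Bases 121-160 of the nucleotide sequence above.
fragment = "AGTGATCGATGGCTGGAAAAAGGAAACGAGCTAATGCTCC"
print(fragment.find("ATG") + 121)          # 129
print(translate_from_first_atg(fragment))  # MAGKRKRANA
```

The translated residues match the first ten residues of the protein sequence in this record (MAGKRKRANA), with the ATG falling at base 129 of the transcript.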
Keyword

KW-0002--3D-structure
KW-0137--Centromere
KW-0156--Chromatin regulator
KW-0158--Chromosome
KW-0181--Complete proteome
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-1185--Reference proteome
KW-0949--S-adenosyl-L-methionine
KW-0808--Transferase
KW-0862--Zinc

Interpro

IPR025794--Hist-Lys_N-MeTrfase_plant
IPR003616--Post-SET_dom
IPR007728--Pre-SET_dom
IPR015947--PUA-like_domain
IPR001214--SET_dom
IPR003105--SRA_YDG

PROSITE

PS50868--POST_SET
PS50867--PRE_SET
PS51575--SAM_MT43_SUVAR39_2
PS50280--SET
PS51015--YDG

Pfam

PF05033--Pre-SET
PF02182--SAD_SRA
PF00856--SET

Gene Ontology

GO:0000775--C:chromosome, centromeric region
GO:0005634--C:nucleus
GO:0010385--F:double-stranded methylated DNA binding
GO:0046974--F:histone methyltransferase activity (H3-K9 specific)
GO:0008327--F:methyl-CpG binding
GO:0010428--F:methyl-CpNpG binding
GO:0010429--F:methyl-CpNpN binding
GO:0008270--F:zinc ion binding
GO:0051567--P:histone H3-K9 methylation
GO:0016571--P:histone methylation
GO:0010216--P:maintenance of DNA methylation
GO:0018022--P:peptidyl-lysine methylation
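Each Gene Ontology line above follows the pattern `ID--aspect:term`, where the one-letter aspect code is C (cellular component), F (molecular function) or P (biological process). A small Python sketch for splitting such lines into fields is given here; the `GoAnnotation` type and `parse_go_line` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple

# One-letter aspect codes used in the Gene Ontology lines.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str


def parse_go_line(line: str) -> GoAnnotation:
    """Split a line such as 'GO:0005634--C:nucleus' into its fields."""
    go_id, rest = line.strip().split("--", 1)
    # Split only on the first colon; the term itself may contain commas
    # or parentheses but the aspect code always comes first.
    code, term = rest.split(":", 1)
    return GoAnnotation(go_id, ASPECTS[code], term)


annotation = parse_go_line("GO:0000775--C:chromosome, centromeric region")
print(annotation.go_id)   # GO:0000775
print(annotation.aspect)  # cellular component
print(annotation.term)    # chromosome, centromeric region
```

The Keyword, Interpro, PROSITE and Pfam lists use the same `ID--label` separator, so the first split applies to them as well, without the aspect step.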

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Arl-0091 fgenesh2_kg.6__1356__AT5G13960.1 Arabidopsis lyrata 94 0.0 1187
WERAM-Bro-0044 Bo3g009460.1 Brassica oleracea 76 0.0 946
WERAM-Brr-0031 Bra006226.1-P Brassica rapa 74 0.0 899
WERAM-Thc-0045 EOY02954 Theobroma cacao 57 0.0 724
WERAM-Prp-0046 EMJ18194 Prunus persica 57 0.0 718
WERAM-Met-0152 AES80724 Medicago truncatula 67 0.0 712
WERAM-Viv-0091 VIT_14s0068g01090.t01 Vitis vinifera 67 0.0 712
WERAM-Glm-0162 GLYMA13G23490.1 Glycine max 62 0.0 684
WERAM-Sol-0039 Solyc02g094520.2.1 Solanum lycopersicum 60 0.0 679
WERAM-Zem-0115 GRMZM2G336909_P01 Zea mays 59 1e-179 627
WERAM-Lep-0016 LPERR01G37090.1 Leersia perrieri 57 6e-179 624
WERAM-Orbr-0018 OB01G51530.1 Oryza brachyantha 54 2e-177 619
WERAM-Orp-0015 OPUNC01G41430.1 Oryza punctata 57 6e-176 615
WERAM-Sob-0060 Sb03g044580.1 Sorghum bicolor 61 2e-175 613
WERAM-Orm-0016 OMERI01G39520.1 Oryza meridionalis 57 2e-175 613
WERAM-Org-0049 ORGLA04G0277600.1 Oryza glaberrima 57 2e-175 613
WERAM-Sei-0007 Si000507m Setaria italica 60 2e-175 613
WERAM-Ors-0010 OS01T0927000-01 Oryza sativa 57 2e-175 613
WERAM-Orl-0037 KN538848.1_FGP011 Oryza longistaminata 54 3e-175 612
WERAM-Brd-0060 BRADI2G59430.1 Brachypodium distachyon 55 3e-175 612
WERAM-Tra-0336 TRAES3BF015800070CFD_t1 Triticum aestivum 52 6e-174 608
WERAM-Sem-0055 EFJ24125 Selaginella moellendorffii 55 3e-172 602
WERAM-Hov-0081 MLOC_61815.1 Hordeum vulgare 53 1e-171 600
WERAM-Php-0105 PP1S469_7V6.1 Physcomitrella patens 51 3e-168 589
WERAM-Ori-0015 BGIOSGA005096-PA Oryza indica 60 8e-168 587
WERAM-Orni-0014 ONIVA01G48580.1 Oryza nivara 53 8e-152 534
WERAM-Orr-0014 ORUFI01G45940.1 Oryza rufipogon 53 1e-151 534
WERAM-Orgl-0014 OGLUM01G46790.1 Oryza glumaepatula 53 1e-151 533
WERAM-Orb-0014 OBART01G42520.1 Oryza barthii 53 2e-151 533
WERAM-Amt-0074 ERN19324 Amborella trichopoda 54 1e-109 394
WERAM-Pot-0033 POPTR_0002s23900.1 Populus trichocarpa 69 5e-105 379
WERAM-Sot-0083 PGSC0003DMT400068350 Solanum tuberosum 39 1e-96 351
WERAM-Aet-0032 EMT23957 Aegilops tauschii 40 6e-93 339
WERAM-Tru-0080 TRIUR3_29767-P1 Triticum urartu 40 7e-93 338
WERAM-Mua-0112 GSMUA_Achr8P00500_001 Musa acuminata 58 1e-89 328
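The rows of the Orthology table are space-separated, but the species name itself contains a space, so splitting naively on whitespace misaligns the columns. One way around this, sketched below in Python, is to read the row from both ends: the first token is the WERAM ID, the last three are Identity, E-value and Score, and the two tokens before those are the species. This assumes every species is a two-word binomial, as is the case for the rows listed here; `Ortholog` and `parse_ortholog_row` are hypothetical names.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    e_value: float
    score: int


def parse_ortholog_row(row: str) -> Ortholog:
    """Parse one Orthology row, assuming a two-word species name."""
    tokens = row.split()
    if len(tokens) < 7:
        raise ValueError("row has too few fields")
    identity, e_value, score = tokens[-3:]
    species = " ".join(tokens[-5:-3])
    # Everything between the WERAM ID and the species is the protein ID.
    # Fragments are joined without spaces (an assumption for split IDs).
    protein_id = "".join(tokens[1:-5])
    return Ortholog(tokens[0], protein_id, species,
                    int(identity), float(e_value), int(score))


hit = parse_ortholog_row("WERAM-Zem-0115 GRMZM2G336909_P01 Zea mays 59 1e-179 627")
print(hit.species)   # Zea mays
print(hit.identity)  # 59
print(hit.score)     # 627
```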
Created Date 25-Jun-2016