WERAM Information


Tag Content
WERAM ID WERAM-Art-0116
Ensembl Protein ID AT5G04940.2
Uniprot Accession Q9FF80; SUVH1_ARATH
Genbank Protein ID NP_196113.1; NP_850767.1
Protein Name Histone-lysine N-methyltransferase, H3 lysine-9 specific SUVH1
Genbank Nucleotide ID NM_120576.1; NM_180436.1
Gene Name SUVH1;SET32;SDG32
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
AT5G04940 AT5G04940.2 AT5G04940.2
AT5G04940 AT5G04940.1 AT5G04940.1
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SUV39 SET H3K9 K 20703330; 15659850
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SUV39 1.30e-47 162.4 495 639
Organism Arabidopsis thaliana
NCBI Taxa ID 3702
Functional Description
(View)
Histone methyltransferase. Methylates 'Lys-9' of histone H3. H3 'Lys-9' methylation represents a specific tag for epigenetic transcriptional repression.
Domain Profile
  HMT SUV39

    SUV39.txt   1 vrLqvfktenkGwGvrclddiakgsFvciyaGeiltddeaekegleegdeyladldskesv..enlkegyesdvplssdssntrqekdkeeseyiida 96 
vrL+vfkt+n+GwG+r++d i++gsF+ciy+Ge ++++++++ +++d+y++d++++++ +n+++g ++ ++ ++ +e+++ +ii+a
AT5G04940.2 495 VRLEVFKTANRGWGLRSWDAIRAGSFICIYVGEAKDKSKVQQT--MANDDYTFDTTNVYNPfkWNYEPGLADEDAC-----EEMSEESEIPLPLIISA 585
79**************************************988..99999*******9997778999998888877.....78888999999****** PP
SUV39.txt 97 kkegnvgrflnHscspNlfvqnvfvdthdlrfprvafFaskrikagtELtwdYg 150
k+ gnv+rf+nHscspN+f+q+v ++++++ f +vafFa +i+++tELt+dYg
AT5G04940.2 586 KNVGNVARFMNHSCSPNVFWQPVSYENNSQLFVHVAFFAISHIPPMTELTYDYG 639
*****************************************************6 PP

Protein Sequence
(Fasta)
MERNGGHYTD KTRVLDIKPL RTLRPVFPSG NQAPPFVCAP PFGPFPPGFS SFYPFSSSQA 60
NQHTPDLNQA QYPPQHQQPQ NPPPVYQQQP PQHASEPSLV TPLRSFRSPD VSNGNAELEG 120
STVKRRIPKK RPISRPENMN FESGINVADR ENGNRELVLS VLMRFDALRR RFAQLEDAKE 180
AVSGIIKRPD LKSGSTCMGR GVRTNTKKRP GIVPGVEIGD VFFFRFEMCL VGLHSPSMAG 240
IDYLVVKGET EEEPIATSIV SSGYYDNDEG NPDVLIYTGQ GGNADKDKQS SDQKLERGNL 300
ALEKSLRRDS AVRVIRGLKE ASHNAKIYIY DGLYEIKESW VEKGKSGHNT FKYKLVRAPG 360
QPPAFASWTA IQKWKTGVPS RQGLILPDMT SGVESIPVSL VNEVDTDNGP AYFTYSTTVK 420
YSESFKLMQP SFGCDCANLC KPGNLDCHCI RKNGGDFPYT GNGILVSRKP MIYECSPSCP 480
CSTCKNKVTQ MGVKVRLEVF KTANRGWGLR SWDAIRAGSF ICIYVGEAKD KSKVQQTMAN 540
DDYTFDTTNV YNPFKWNYEP GLADEDACEE MSEESEIPLP LIISAKNVGN VARFMNHSCS 600
PNVFWQPVSY ENNSQLFVHV AFFAISHIPP MTELTYDYGV SRPSGTQNGN PLYGKRKCFC 660
GSAYCRGSFG
Nucleotide Sequence
(Fasta)
GACTTTTCTA CAACAAAATC TCCCTGTTTG TTTCCCTCTC TTTCTCTCTG TCTCTCTCTT 60
TTCGGAAACC CTATTTCGCG AGCAAGCAGC TTGAAAAGGG TGTGTGAATG TGATCTATCC 120
AAAAGCGTAT TTTCAGGTTT CTTCTTTCTG AAAGTTTTAA GCTTTTTGCT TTGTTATTGT 180
AATCTCTGAA TCTGGAATGA TGGTTGTTTC TCTGTGCATG CAATTGGCCT TTTGAGGATC 240
TCTGATTCGT AGTTGTGTTT AGAAAAGGTG GACTTTTTAC TTTACCCAGA TCTTTGATAG 300
CTTAAGGGTT TCATTATCAA TTTGAATTGA GGATTAGATT CAAAGGAATT TGATTTTTAG 360
CTATGGAAAG AAATGGTGGT CACTACACTG ATAAGACGAG AGTGTTGGAT ATTAAACCAT 420
TGCGTACTCT AAGACCTGTG TTTCCCAGTG GAAATCAAGC TCCGCCTTTT GTGTGTGCTC 480
CTCCTTTTGG ACCATTTCCT CCTGGGTTCT CATCGTTTTA TCCGTTTAGT TCGTCTCAAG 540
CGAATCAGCA CACACCAGAT CTTAACCAAG CTCAGTATCC ACCGCAACAT CAGCAGCCTC 600
AGAATCCACC ACCGGTATAT CAGCAGCAGC CTCCTCAGCA TGCATCTGAG CCTTCGTTGG 660
TTACTCCTTT AAGGTCATTT AGATCTCCTG ATGTGTCTAA TGGCAACGCG GAACTTGAGG 720
GGTCAACTGT GAAAAGAAGG ATCCCTAAAA AGCGTCCCAT TTCTCGGCCT GAGAATATGA 780
ATTTCGAGAG TGGGATTAAT GTGGCTGATA GAGAGAATGG CAATAGGGAG TTGGTGTTGA 840
GTGTTCTTAT GCGGTTTGAT GCGTTAAGAA GAAGGTTTGC ACAACTTGAG GATGCTAAGG 900
AAGCAGTTAG TGGGATTATC AAACGCCCTG ATTTGAAATC AGGATCTACT TGTATGGGCA 960
GAGGGGTGCG GACAAACACC AAAAAAAGAC CTGGTATTGT TCCTGGTGTT GAGATTGGGG 1020
ACGTATTCTT CTTCAGGTTT GAGATGTGTT TGGTGGGGTT GCATTCTCCA TCAATGGCTG 1080
GGATTGACTA TCTGGTTGTC AAGGGAGAAA CGGAAGAAGA ACCTATCGCC ACTAGCATTG 1140
TCTCATCTGG ATATTATGAT AATGACGAAG GTAATCCTGA TGTTTTGATT TATACTGGTC 1200
AGGGTGGTAA TGCTGATAAA GATAAGCAAT CTTCTGACCA AAAGCTCGAA AGGGGTAATC 1260
TTGCCTTGGA GAAGAGCTTG CGTAGAGATA GTGCAGTTAG GGTAATAAGG GGCTTGAAAG 1320
AGGCTTCTCA TAATGCTAAG ATCTATATTT ATGATGGACT CTATGAGATT AAAGAGTCAT 1380
GGGTAGAGAA AGGAAAATCT GGACACAACA CCTTCAAGTA TAAACTAGTT AGAGCTCCTG 1440
GTCAACCGCC TGCATTTGCT TCATGGACTG CAATCCAGAA ATGGAAGACG GGTGTGCCTT 1500
CAAGGCAAGG ACTCATTCTT CCCGATATGA CTTCCGGGGT TGAAAGCATA CCTGTTTCAC 1560
TTGTTAACGA AGTTGATACC GACAATGGGC CTGCTTATTT CACCTACTCC ACAACTGTGA 1620
AATACTCAGA GTCGTTTAAG CTGATGCAGC CTTCTTTTGG ATGTGATTGT GCCAACTTAT 1680
GCAAACCAGG GAACTTGGAT TGTCACTGCA TAAGGAAAAA TGGAGGTGAC TTCCCCTACA 1740
CCGGTAATGG AATTCTAGTT AGCCGAAAGC CTATGATATA TGAATGCAGT CCATCTTGCC 1800
CGTGCTCGAC TTGCAAAAAC AAGGTGACTC AAATGGGAGT AAAAGTGAGG CTGGAAGTTT 1860
TCAAGACAGC GAATAGAGGA TGGGGATTGC GGTCATGGGA TGCTATTCGT GCTGGTTCTT 1920
TTATATGTAT CTATGTAGGT GAGGCCAAAG ACAAATCAAA GGTGCAGCAA ACTATGGCTA 1980
ATGATGATTA TACTTTTGAT ACAACCAATG TGTATAACCC TTTCAAGTGG AACTACGAAC 2040
CTGGCTTAGC AGACGAAGAT GCTTGTGAAG AGATGTCTGA AGAATCTGAA ATCCCGCTGC 2100
CACTGATAAT CAGTGCTAAG AATGTTGGGA ACGTTGCCCG ATTCATGAAT CATAGTTGCT 2160
CACCTAATGT TTTCTGGCAG CCGGTTAGTT ATGAAAATAA CAGTCAACTC TTTGTGCATG 2220
TGGCCTTCTT TGCCATTTCT CACATCCCTC CAATGACTGA GTTAACTTAC GACTATGGAG 2280
TATCTAGACC AAGTGGGACT CAAAATGGCA ATCCTTTATA TGGCAAAAGG AAATGCTTCT 2340
GTGGATCAGC GTATTGCCGT GGCTCATTTG GATGATGATG GAGAGAAAGG CGATCTCTGG 2400
TGAACAACTG GAGTCGGATG ATTTTGGTTG CAAAAAGCTG GAGTGTTGAG TCCTGATAGG 2460
TGGAACCGAG TACTTTTGGT AGCAAGTAAA GTAAGTCTCT TTGATCTTCT CTTCAAGAGC 2520
ATAAGACGCA AAGCTGTGAA GGATTTTTAT ACTGCTTCTC CAGCAAAAAC CTTATCTATT 2580
TATCGGATTG TGGAAGATGT CCTCATGCTT TTATCCTGTT GGATATGTAA TCTTATTAGT 2640
CAGTATTCAG ATCGATTTTT AGGACAAATC TCTGGAAACT CTAAGCCTCT TTTGATTAAT 2700
TATTATGTGA ATGTGATAAA GGTTACAAGT CAATTTTAAC 2741
Sequence Source Ensembl
Keyword

KW-0137--Centromere
KW-0156--Chromatin regulator
KW-0158--Chromosome
KW-0181--Complete proteome
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-1185--Reference proteome
KW-0949--S-adenosyl-L-methionine
KW-0808--Transferase
KW-0862--Zinc
--

Interpro

IPR025794--Hist-Lys_N-MeTrfase_plant
IPR003616--Post-SET_dom
IPR007728--Pre-SET_dom
IPR015947--PUA-like_domain
IPR001214--SET_dom
IPR003105--SRA_YDG

PROSITE

PS50868--POST_SET
PS50867--PRE_SET
PS51575--SAM_MT43_SUVAR39_2
PS50280--SET
PS51015--YDG

Pfam

PF05033--Pre-SET
PF02182--SAD_SRA
PF00856--SET

Gene Ontology

GO:0000775--C:chromosome, centromeric region
GO:0005634--C:nucleus
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0008270--F:zinc ion binding
GO:0040029--P:regulation of gene expression, epigenetic

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Arl-0099 fgenesh2_kg.6__ 415__ AT5G04940.2 Arabidopsis lyrata 93 0.0 1166
WERAM-Brr-0048 Bra009409.1-P Brassica rapa 82 0.0 1002
WERAM-Bro-0184 Bo9g177540.1 Brassica oleracea 82 0.0 989
WERAM-Pot-0004 POPTR_0001s07390.1 Populus trichocarpa 58 0.0 765
WERAM-Thc-0121 EOY19472 Theobroma cacao 59 0.0 759
WERAM-Viv-0081 VIT_13s0047g00120.t01 Vitis vinifera 64 0.0 696
WERAM-Prp-0038 EMJ21426 Prunus persica 63 0.0 694
WERAM-Met-0060 KEH32911 Medicago truncatula 57 0.0 682
WERAM-Glm-0034 GLYMA04G15120.1 Glycine max 60 0.0 677
WERAM-Sob-0019 Sb02g006620.1 Sorghum bicolor 54 2e-160 563
WERAM-Sei-0106 Si028938m Setaria italica 54 9e-160 561
WERAM-Org-0110 ORGLA11G0160500.1 Oryza glaberrima 54 5e-159 558
WERAM-Orb-0118 OBART11G19080.1 Oryza barthii 54 5e-159 558
WERAM-Ors-0105 OS11T0602200-01 Oryza sativa 54 6e-159 558
WERAM-Orr-0118 ORUFI11G20440.1 Oryza rufipogon 54 6e-159 558
WERAM-Orl-0082 KN539783.1_FGP008 Oryza longistaminata 54 6e-159 558
WERAM-Zem-0004 AC233961.1_FGP001 Zea mays 53 7e-159 558
WERAM-Orni-0117 ONIVA11G18710.1 Oryza nivara 54 1e-158 557
WERAM-Orp-0070 OPUNC07G07250.1 Oryza punctata 54 3e-158 556
WERAM-Brd-0028 BRADI1G53840.1 Brachypodium distachyon 54 5e-158 555
WERAM-Sol-0126 Solyc10g077070.1.1 Solanum lycopersicum 52 5e-158 555
WERAM-Sot-0021 PGSC0003DMT400018511 Solanum tuberosum 53 6e-158 555
WERAM-Orbr-0036 OB0348G10010.1 Oryza brachyantha 54 6e-157 551
WERAM-Orgl-0113 OGLUM11G18450.2 Oryza glumaepatula 54 1e-156 550
WERAM-Tra-0156 Traes_4DL_7DA2A133B.1 Triticum aestivum 53 2e-155 546
WERAM-Hov-0083 MLOC_63544.1 Hordeum vulgare 53 5e-155 545
WERAM-Orm-0054 OMERI05G19070.1 Oryza meridionalis 51 3e-153 539
WERAM-Aet-0087 EMT20679 Aegilops tauschii 54 5e-152 535
WERAM-Ori-0115 BGIOSGA033726-PA Oryza indica 55 5e-152 535
WERAM-Lep-0057 LPERR05G16930.1 Leersia perrieri 53 1e-151 534
WERAM-Tru-0063 TRIUR3_25291-P1 Triticum urartu 51 2e-146 516
WERAM-Amt-0085 ERM98215 Amborella trichopoda 48 5e-127 452
WERAM-Sem-0004 EFJ22703 Selaginella moellendorffii 38 6e-89 326
WERAM-Php-0105 PP1S469_7V6.1 Physcomitrella patens 40 5e-88 323
Created Date 25-Jun-2016