WERAM Information


Tag                     Content
WERAM ID                WERAM-Mum-0199
Ensembl Protein ID      ENSMUSP00000117410.1
Uniprot Accession       Q9CWR2; SMYD3_MOUSE; Q6P7V6; Q8BG90
Genbank Protein ID      NP_081464.1; XP_006497044.1; XP_006497045.1
Protein Name            Histone-lysine N-methyltransferase SMYD3
Genbank Nucleotide ID   NM_027188.3; XM_006496981.2; XM_006496982.2
Gene Name               SMYD3
Ensembl Information
Ensembl Gene ID         Ensembl Transcript ID   Ensembl Protein ID
ENSMUSG00000055067.15   ENSMUST00000128302.7    ENSMUSP00000117410.1
Details
Type   Family   Domain   Substrates   AA   References (PMIDs)
HMT    SMYD     SET      H3K4         K    25669152; 25918436
Status Reviewed
Classification
Type   Family   E-value     Score   Start   End
HMT    SMYD     5.90e-126   420.4   5       240
Organism Mus musculus
NCBI Taxa ID 10090
Functional Description
Histone methyltransferase. Specifically methylates 'Lys-4' and 'Lys-5' of histone H3, inducing di- and tri-methylation, but not monomethylation. Plays an important role in transcriptional activation as a member of an RNA polymerase complex. Binds DNA containing 5'-CCCTCC-3' or 5'-GAGGGG-3' sequences (By similarity).
Domain Profile
  HMT SMYD

              SMYD.txt   2 kvekftaegkGrGlravkplragdllfasdayayvvtkssrgvvcdrclkrkeklsrcgqckvakycdakcqkeawpdhkrecsalksy 90
                           kvekft++++G+Glrav+plr+g+llf+sd++ay+v+k+srgvvcdrcl++kekl+rc+qc++akyc+akcqk+awpdh+recs+lks+
  ENSMUSP00000117410.1   5 KVEKFTTANRGNGLRAVAPLRPGELLFRSDPLAYTVCKGSRGVVCDRCLLGKEKLMRCSQCRIAKYCSAKCQKKAWPDHRRECSCLKSC 93
                           8**************************************************************************************** PP

              SMYD.txt  91 kkryPsesvrllarilvkleiegersesekllsvkdleshveklteekkedlrsdvatlqhfyreeiq...qlpdqfdlveifakvncn 176
                           k+ryP++svrll+r++vkl +++++sesekl+s++dles+++klte+kke+lr++++t+qhf+reeiq   qlp++fdl+e+fakv+cn
  ENSMUSP00000117410.1  94 KPRYPPDSVRLLGRVIVKL-MDEKPSESEKLYSFYDLESNISKLTEDKKEGLRQLAMTFQHFMREEIQdasQLPPSFDLFEAFAKVICN 181
                           *******************.********************************************************************* PP

              SMYD.txt 177 gftisdeelqevGvgifPdlsllnhscdPnvsvvfnGthlelravreieeGeeltvsyi 235
                           +fti+++e+qevGvg++P++sllnhscdPn+s+vfnG+hl+lravreie+Geelt++y+
  ENSMUSP00000117410.1 182 SFTICNAEMQEVGVGLYPSMSLLNHSCDPNCSIVFNGPHLLLRAVREIEAGEELTICYL 240
                           **********************************************************7 PP

Protein Sequence
MEALKVEKFT TANRGNGLRA VAPLRPGELL FRSDPLAYTV CKGSRGVVCD RCLLGKEKLM 60
RCSQCRIAKY CSAKCQKKAW PDHRRECSCL KSCKPRYPPD SVRLLGRVIV KLMDEKPSES 120
EKLYSFYDLE SNISKLTEDK KEGLRQLAMT FQHFMREEIQ DASQLPPSFD LFEAFAKVIC 180
NSFTICNAEM QEVGVGLYPS MSLLNHSCDP NCSIVFNGPH LLLRAVREIE AGEELTICYL 240
DMLMTSEERR KQLRDQYCFE CDCIRCQTQD KDADMLTGDE QIWKEVQESL KKIEELKAHW 300
KWEQVLALCQ AIINSNSNRL PDINIYQLKV LDCAMDACIN LGMLEEALFY AMRTMEPYRI 360
FFPGSHPVRG VQVMKVGKLQ LHQGMFPQAM KNLRLAFDIM KVTHGREHSL IEDLILLLEE 420
CDANIRAS 428
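The numbered, ten-residue layout above is easy to read but most tools expect a plain residue string. The following is a minimal sketch, assuming Python, of one way to strip the spacing and the running position counters. The helper name `parse_numbered_block` is hypothetical and not part of WERAM, and only the first two lines of the record are embedded to keep the example short.

```python
def parse_numbered_block(text: str) -> str:
    """Join the residue groups of a numbered sequence block into one string.

    Tokens made only of letters are kept; the trailing position counters
    (e.g. "60", "120") are purely numeric and are therefore skipped.
    """
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if token.isalpha():
                residues.append(token.upper())
    return "".join(residues)


# First two lines of the Protein Sequence section (illustrative subset only).
block = """\
MEALKVEKFT TANRGNGLRA VAPLRPGELL FRSDPLAYTV CKGSRGVVCD RCLLGKEKLM 60
RCSQCRIAKY CSAKCQKKAW PDHRRECSCL KSCKPRYPPD SVRLLGRVIV KLMDEKPSES 120
"""

seq = parse_numbered_block(block)
print(len(seq), seq[:10])
```

The same helper should work on the Nucleotide Sequence section, since that section uses the same layout.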
Nucleotide Sequence
GCAGGACGCA GGCGCGCAGT CAGAAAACAC GAGACTCAAG CTGAGGGACG CAGTTCGGAT 60
CTGAGGTGTG GATCTGAGGA AGGATGGAGG CACTGAAGGT GGAAAAGTTC ACGACTGCCA 120
ACCGGGGAAA CGGGCTGCGT GCTGTGGCTC CACTGCGCCC CGGGGAGCTG CTCTTCCGCT 180
CCGACCCCTT GGCTTACACT GTGTGCAAGG GGAGTCGTGG CGTAGTCTGT GATCGCTGCC 240
TACTGGGGAA GGAGAAGCTG ATGCGTTGTT CTCAATGCCG AATTGCCAAA TACTGCAGTG 300
CCAAGTGTCA GAAAAAAGCC TGGCCGGACC ACAGACGGGA ATGCAGTTGT CTTAAAAGCT 360
GCAAGCCCAG ATATCCGCCC GACTCCGTGA GACTTCTGGG CAGGGTTATC GTCAAGCTGA 420
TGGATGAAAA GCCCTCAGAG TCAGAGAAAC TTTATTCATT TTATGATCTG GAGTCCAATA 480
TTAGCAAACT CACTGAAGAT AAGAAAGAGG GCCTGAGGCA ACTTGCAATG ACTTTTCAAC 540
ATTTCATGAG AGAGGAGATC CAGGATGCGT CTCAGCTGCC GCCTTCTTTT GACCTTTTTG 600
AAGCCTTTGC AAAAGTGATC TGCAATTCGT TCACTATCTG CAACGCGGAG ATGCAGGAGG 660
TCGGCGTTGG CCTGTACCCC AGTATGTCTT TGCTGAATCA CAGCTGTGAC CCCAACTGCT 720
CCATCGTATT CAACGGGCCC CACCTCTTAC TGCGTGCAGT GCGGGAAATT GAAGCAGGAG 780
AGGAGCTCAC CATCTGCTAC CTGGACATGC TGATGACCAG CGAGGAACGC CGGAAGCAGC 840
TGAGGGACCA GTACTGCTTT GAGTGTGACT GCATCCGATG CCAAACCCAG GACAAGGATG 900
CTGACATGCT AACGGGTGAC GAGCAAATAT GGAAGGAGGT TCAAGAGTCG CTGAAGAAAA 960
TCGAAGAGCT GAAGGCGCAC TGGAAGTGGG AGCAGGTTCT GGCGCTGTGC CAGGCGATCA 1020
TAAACAGCAA TTCCAACCGG CTTCCCGACA TCAACATCTA CCAGCTGAAG GTGCTCGACT 1080
GTGCCATGGA TGCCTGCATC AACCTGGGGA TGCTGGAGGA GGCGTTGTTC TACGCCATGC 1140
GCACCATGGA GCCGTACCGG ATTTTTTTCC CTGGAAGCCA TCCCGTCAGA GGTGTGCAAG 1200
TGATGAAAGT TGGCAAGCTG CAGCTCCATC AAGGCATGTT TCCTCAAGCC ATGAAGAACC 1260
TGAGGCTGGC CTTTGACATT ATGAAAGTGA CCCACGGCCG AGAGCACAGC CTGATTGAAG 1320
ATTTGATTCT TCTCCTGGAA GAATGTGATG CCAACATACG AGCCTCCTAA GGACCCCCAG 1380
AAGGACGGGG AAGGTGGCGC GCCCTCCATC GAACGCCTTA CTGAGGTCAC ACGCTCTGTG 1440
GTGTTTGCTG TGTGGACCTT CTGTAGAAAT TGCCAATGTG TTTATGTGGG TAAATGTGAT 1500
TCTGTGGCAT GTCGTGAGAA GTCTTACTCG TGATAGAGCA GAACCATTAC AATAAATTAA 1560
AAAGACGAGT CCAAAATGCC ACTGGCGACT TGGCTTTGCC TATTCTCAGG TGGTTTTGAT 1620
GGTTAAGATG TCAACATTGG AAGCAGGAGC TTTCCAAAAG AAGAATGGAA TCCCAGGCAG 1680
CCCCTGAAAG GCAGAGGCAG GTGGATCTCC ATGAGTTCGA GGCCAGTTTT GTCTACACAG 1740
TGAATTACAA GACAGCCATA GCCTTGTAGA GAGACCCTGT TCCAGAAACT GAAAAAAAAT 1800
AGGGTAGAAT ATGTTCAAGT TTCCTCTGTG TGTGTGTGTA TATAGAAGTA AACGGATAAA 1860
AGTAATAATT TTCACATAGA CTTCTAAACA CAAGTGTATA TCAAGAAGAT TTTTTATTGT 1920
ACTTTTATCT TTATTGAGAT GTCTTAATTG TACTTATTAA CATGGTTCAC TGTGATATTT 1980
CTATATCTGA ATACACACAC ATGGCGCCCA GTTGTCACTG AGGAACTCTG GCCTCCATTT 2040
ATTTTCCTGC CTCCATGGAT CACCGTTTCT CTCTCTCTCT CTCTCTCTCT CTCTCTCTCT 2100
CTCTCTCTCT CTCTCACACA CACACACACA CACACACACA CACACACCCT CCTGACTCTT 2160
ACACATGAGA GGGAAAACGC AGTACTTGTC TCTGTCGTCC CCCCTTATTT CTCTTTGTAT 2220
AATTTCCTCC AGTTCTGTCC ATTCGGCTGC AGCTGACAGA GCTTCACTCT TCTTTATGGC 2280
TGAGTAATAG CCTGTTATTT GTATGTACTT GGTAAATGTT AGCAAGCACC TACAGTAACT 2340
GCAGCCTTGG CGATTATGAA TGGCTTTATA GTAGACATGT CAGGCAAGTA TGTTTGGGGT 2400
TTAATTTCCC TTTGACATAC ACCTAGAAGT GGCATGGCTG GGTCCCATGG TGGTACTCTT 2460
TTAACTTCTT AGATGAACCT CCTCCATACT CTCCTTGGCT TTAGGTTCCC ATCAACAGTG 2520
TATGAGAGTT CCTGTTCTCC ATGTTCACGT CAGCATTTGT TATGGGTTTT TTTTTCCTTG 2580
CTTTGCTGTT GGCTTTAATC GCATTTTAAC TGGCGTACAT GATATTTCAT CATAGTTTCG 2640
ATTTCCCTGA TGACCAATGG TATTGATTGT GCCTCATGTT TTTGTGACTG TTCAGAACAG 2700
TTGTTCATTT TGTAATCAGA TTCTTGCCTT TTCAATTCCT TACCTATTAT AAATACACTA 2760
TGTATTGTGG ATATGACTCC TTCACTGGGA GGATGGTTGG CAGACAGTTC TCTAGTTCTG 2820
TAGATTGTCT TTTCCTCTGC TGTTTTGTGT GCTGTACAGA ACCTCAGTGT AATTGTTTGT 2880
CTTCGGTTCT GATCCCCTCC CTTCTATAAT CACATAAAGC CATTGCTTAT TCTGGCTTCT 2940
ATGATACTCC ATTGCCCTTA CTGACGTGTT CCCATGCATG TTCCCAGCTT GATCTTGCAT 3000
TTGGTCTTTG GTCCATCGTG AGCTGAGATC CGTCCGTCGT GAAAAGGTTC GGTTTCCTTC 3060
TCTTGCATGT GGCTGTCTGC CTTCCCTGCA CACTTTGTTG AAGAGGGAGT CACACAGCCT 3120
TTATGGTTTG GGCACGTTTG TCAAGAAGGA GCGGGTTCTA GGTTTGTGGG CTTACTTTTG 3180
GGCTCTCCGT TCAGTCCCAT GGGTCCCTTG CTCTATGGCC ATGCCGTTTC TGCTAGCACG 3240
GCTCTGTAAT ATGGCTTGCA GTCAGGTGCC ATGTTGCCTC TGGCCTCACT GGTTTCACTC 3300
AAGATGATTT TGTCTGTCCT GAGTCTGTTG TGAAGAAGCA GAATTCTGAA ACGTTGCCTT 3360
TACTCCTCTG CAGTGAATAA CTACGTGTTA TTTCCTCCTG AATACCCCTC CAACACAGGA 3420
ATCATCATCT CAAATCCTAG GTCTAGGCCA TTTCCAACAT TACTAGGAAA ACACTGTAAG 3480
GCTCACACAA AAAGAACAAG ATACTGAAGT CAACTCTTTG TATAAATCTT TGCTGTACTT 3540
TATAATTCTT TGTGTGTGTT AAAACTTTCT TAACCCCCTT TCTTCTGTAA TTGCTCTAGT 3600
CTGTGCTTTG CTGTGCTGGT GGGCAGAGCA CAAAGCTCTA TTGCCCTGTG GTAAACTGTG 3660
TTCTCATCCT CATTCACCTT AGCCTGGTTT CCTCTGCTTT TGTTGTCTTT TCCACTTCCC 3720
CAACCCCTTG GCCTGCTTCT CTATCTTTCT GCTTTAGAAA AATAACAAAA ATATTTCCCT 3780
TGTCCAAAAA AAAAAAAAAA GTTGAACCTT GTATGGCTCA CTGGTCCTGG AAGCAGTGCG 3840
GTCCTTCCTG TTCCAGTGAG ATTCTAGTAG TAAATCAGAC CGGTGGGCTG CTTCAGATGA 3900
AATGGCCCTT AGATAGTAGG CTACAATCCA CTCCTCTGTT ACAGGGATAC AGGGTGTGAA 3960
GGCAGACTCC AAAATAGTGG CCTCCAAGGG AGGGAGGCAG AAGACACTGT CAGAATGACA 4020
CATGGTCGTC AGAGCCTGAA GGGGTATACC TTAAGTTAGA GCAAAGAAAG CTGACAGCAA 4080
GAAGCTGGGG CCCACCGGCC GAACATCTGG TGCCATGGAA ACGATGACAT GTGTAGGGTG 4140
TCTGATGCAC ACCCCTGAGC TCCAGGACCT TTTCCTAAGG ATGCTTCTGC TCATGCCCAG 4200
CTGTCATCCA GCCAGCACAC CCCTAGGCAG TGCTGTCTCT GCTAGCCACA GTAGCTTATA 4260
GTGGGCATCT GAAGGCCGTT TCACAATGAG GAAAGAAGTC TGACGTCAGG TGCACTAGAA 4320
CCCCACCCCA GATTTGCTAA GTAACTCTGA GAGGACCTTG ACCTCAGCCT TTTGAAAGGC 4380
CAGTTCTATC CAATTCAACA GATATCTATT AAATGATTCT GATGTTTTCA GGAAACGTGC 4440
ACACAAAGGC CAAATAAAAC TTGCATGACC TACCAAGTGT ATACTGAGAG AACTTGCATC 4500
CATTAGCTTA TGCTGTATGA CAAACAGCCC CAGAACATAA CTTGGTTTAG CTCATAGTGC 4560
TGTAGGTCAG CACTTTGAGC TGGACATTTC TGGTTTGTTC CGGGACTTAT TGGTTGTGAA 4620
TCGTTGCTGT TATTGGTAGG TCACTTGCAG AATCAGAGTG ACTAAATGAT ACAGCACCTT 4680
AAGTTGGCTA ACCAGTTATG TCCACAGGCC AGGAGAGTTC CAGGATCAGT AAGTCAATCA 4740
ACTCCCAGCC AACCAGTGCT TTTCAAGCCT GCTTCTATCA CATTGGCTTT AGTCCGATAG 4800
GTCAATGCCC ACCCCAAGGC ATGGATATGT GTTCCCATCT CTGTAGAAAC AGCTACAGTC 4860
CCTCAGCAGG GGCATAGCCA CAGGAGTGGG GAATGGTGCC ATCTGGGCCC TTCACCATAG 4920
CAGCCCTGAA ACTATCTATA TGCTATGGAA CACTTAACAA ATATGATGAA ATATACACTC 4980
TGTACTTCCG GTAATAGACT AGAATAATTT GCCATTCTAA TCCAGATTCT TAGAACACTC 5040
CAGGGCCCAT CGGTGAATTC TCTGACATTC TCATCAAATG AAAAACAAAC TTCCTACATT 5100
CTGGAAAGAT GGGGGTGGGG GATGGAACCA AGAGATGAGG ACCAAGAGAA AGCTGTATTG 5160
TTTGTCCCCC TTTAACTCAC TGGGGAAATG AAAGACCCCA AGAGAAAGTC AGGGAGTAAC 5220
TTCCATGGCC CATGGGCAGA GTTCGTAGAT AGGATGCCGT GCTTGCCAGC TTTCTGAATG 5280
AAATTTATCA GTCACTGATG ATCCTGGATT AAGAGATTTG TCTTCACGCC TCCTAGAGAA 5340
AATCTGATGG GCTTTGAAAT CCCTCTCTGC CAACTTACAG TGTGTGAGGA TGAAGGCACC 5400
CTGCGTGGAC CCAGTTTTAA TACAAAGCAA ACATTTAAGC AAAGGTAAAT ATTTACCCAA 5460
TAAAATTACA GCAAAATAAT TTTGGCAAAG AGGGAAACGG TCAGAGAACA CAATTTAAAA 5520
TCCGTGTCGC CAGTGACTGA GGATTGATTG CGGCCCCAGC CTCTACGAAG CCTTCCCATT 5580
AATCCATGGT CATTGTAGCG GCAGGCTTTT CTCACGTCAG TAACTTGACG CCATTAGCCG 5640
CTCCTCTGGG ATGAGAACAC ATTTGAACTG AAAATGAACA CGCCGGGCAT AATTTATAGC 5700
CGTGCCACAT GAATGCCAAG AGTGTGAGGA GGAAGCTGCT TGGGAATCCA CTCATGGTGG 5760
CTACCCAGAC CTGACTTCCT CTAAGCAAGA GCCAGACGGC TGCAAACCAC AGGCCCCCAG 5820
TTAGACTTGG CTGCAGGGAG GGAGGGAAAT TGCATACAGG AGCCAGATGT CTTGAAAATT 5880
CCCTCCTAGT CCAGCCAGTC AGGAGTGGAG GCAAGGCTCC CATAGCAGAG CCCACCGGGG 5940
TTCATCAGAG AGAGGGTTCT GGACCTTGTG AAAACCGTGA AATCTTCATG TCAAGCAGTC 6000
ACGGAAGGAA GCCCCTTTCC TCTGAGGTCT GATGGGAGAA CACCCGAATC CAAGCCAGAA 6060
GCCTCCCTAA GACTGGGATC AGAGTGAGGA TGCTCTCGGC GTGTTCACCA CTAATCCAAA 6120
GCTCTGAAAG AGCAGCCGAA GCATCTCTCC TGCCCCTACA TTTACTATTC AAATTTGCTT 6180
TCAATTGGCA TAGAAAGTGT GCCCCTGCCT ATGGACCTAA CCTGCTATTC CAGAGGGACC 6240
AGCTTCCCAG TCCCAGGCCT CAGCCTTTGT CAGGGCCGTG TGTGTGTGTG TGTGTGTGTG 6300
TGTGTGTGAG TGCATGCATA CAAGGATGTT GCTGTAGGAT ATGAGAGGGG GAAATGGAGG 6360
TTTGAGGGAC TGCAGAGAGA ACCTAATAGC ATACATGTGA CATGAAAGTG AAAAGGAGAG 6420
AGGGGGATTG ACCCACATAT GTATAAAGTA CATACAAATA TACCACAGTG AAACCTGCTA 6480
CACACACACA CACACACACA CACATACACA CACACAAATT AAGGACATAC AAAAATACCA 6540
CAGTGAAGCC TGCTTTCTAT GCCAAGTGAA AACAAATGTT AACAGTCAGT GTAGGGGTGG 6600
GAGAGATGGT TCAGCAGTTA AGAGCACTGG CTGCTCTTTC AGAGGTCCCA ACTTCAATCC 6660
CCAGCACTTA GAACGGCAGC TCACAATCGT CTGCAACTCC AGTTCCAGGA GATACGATGC 6720
CCTCTGATGC CCTCTTCTGA CCATCAAGGG TTCCAGGCAC TCACGTGGTG CACAGACATA 6780
ACATGCAGGC AGATGCTTAC ACAGATAAAA TAATTTTTAA AATTAACTGT TTTCTTTTCA 6840
AGTGAATGTA GAATAAAAAC TGTAAGGTGC A 6872
Sequence Source Ensembl
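As a consistency check between the two sequence sections, the start of the transcript can be translated and compared with the protein N-terminus. The sketch below assumes Python and the standard genetic code. The names `translate` and `CODON_TABLE` are hypothetical helpers, and only the first 120 nucleotides of the record are embedded.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(nt: str) -> str:
    """Translate complete codons from the start of nt, stopping at a stop codon."""
    protein = []
    for i in range(0, len(nt) - 2, 3):
        aa = CODON_TABLE[nt[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of the Nucleotide Sequence section (illustrative subset only).
raw = """\
GCAGGACGCA GGCGCGCAGT CAGAAAACAC GAGACTCAAG CTGAGGGACG CAGTTCGGAT 60
CTGAGGTGTG GATCTGAGGA AGGATGGAGG CACTGAAGGT GGAAAAGTTC ACGACTGCCA 120
"""
first_120 = "".join(ch for ch in raw if ch in "ACGT")

start = first_120.find("ATG")          # 0-based index of the first ATG
n_terminus = translate(first_120[start:])
print(start + 1, n_terminus)
```

In this excerpt the first ATG lies at 1-based position 84, and the twelve residues translated from it match the start of the Protein Sequence section (MEALKVEKFT TA).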
Keyword

KW-0007--Acetylation
KW-0156--Chromatin regulator
KW-0181--Complete proteome
KW-0963--Cytoplasm
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-1185--Reference proteome
KW-0949--S-adenosyl-L-methionine
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR025805--Hist-Lys_N-MeTrfase_Smyd3
IPR001214--SET_dom
IPR002893--Znf_MYND

PROSITE

PS51574--SAM_MT43_2
PS50280--SET
PS01360--ZF_MYND_1
PS50865--ZF_MYND_2

Pfam

PF00856--SET
PF01753--zf-MYND

Gene Ontology

GO:0005737--C:cytoplasm
GO:0005634--C:nucleus
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0046872--F:metal ion binding
GO:0000993--F:RNA polymerase II core binding
GO:0000979--F:RNA polymerase II core promoter sequence-specific DNA binding
GO:0001162--F:RNA polymerase II intronic transcription regulatory region sequence-specific DNA binding
GO:0071549--P:cellular response to dexamethasone stimulus
GO:0045184--P:establishment of protein localization
GO:0014904--P:myotube cell development
GO:0006469--P:negative regulation of protein kinase activity
GO:0006334--P:nucleosome assembly
GO:0033138--P:positive regulation of peptidyl-serine phosphorylation
GO:0045944--P:positive regulation of transcription from RNA polymerase II promoter
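The Gene Ontology entries above follow an `ID--aspect:term` pattern, where the single-letter aspect codes C, F and P correspond to the three GO aspects. The following is a minimal sketch, assuming Python, of how such lines could be split into fields. The function name `parse_go_line` and the `ASPECTS` mapping are hypothetical helpers for illustration.

```python
# Single-letter aspect codes as used in the record, mapped to GO aspect names.
ASPECTS = {
    "C": "cellular_component",
    "F": "molecular_function",
    "P": "biological_process",
}


def parse_go_line(line: str) -> tuple[str, str, str]:
    """Split 'GO:0005737--C:cytoplasm' into (GO ID, aspect name, term)."""
    go_id, _, rest = line.strip().partition("--")
    code, _, term = rest.partition(":")
    return go_id, ASPECTS[code], term


examples = [
    "GO:0005737--C:cytoplasm",
    "GO:0018024--F:histone-lysine N-methyltransferase activity",
    "GO:0014904--P:myotube cell development",
]
parsed = [parse_go_line(line) for line in examples]
for row in parsed:
    print(row)
```

Splitting on the first `--` only means that single hyphens inside a term name (for example "histone-lysine") are left intact.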

Orthology
WERAM ID         Ensembl Protein ID     Species               Identity   E-value   Score
WERAM-Ptv-0002   ENSPVAP00000000324.1   Pteropus vampyrus     94         0.0       825
WERAM-Poa-0001   ENSPPYP00000000044.2   Pongo abelii          94         0.0       823
WERAM-Hos-0219   ENSP00000419184.2      Homo sapiens          94         0.0       823
WERAM-Mam-0040   ENSMMUP00000007626.2   Macaca mulatta        93         0.0       818
WERAM-Gog-0225   ENSGGOP00000028735.1   Gorilla gorilla       92         0.0       799
WERAM-Cap-0029   ENSCPOP00000002413.2   Cavia porcellus       91         0.0       799
WERAM-Dio-0005   ENSDORP00000000968.1   Dipodomys ordii       89         0.0       748
WERAM-Bot-0204   ENSBTAP00000044365.2   Bos taurus            83         0.0       714
WERAM-Paa-0169   ENSPANP00000006663.1   Papio anubis          32         9e-59     224
WERAM-Chs-0198   ENSCSAP00000012439.1   Chlorocebus sabaeus   32         1e-58     224
WERAM-Pat-0098   ENSPTRP00000020870.2   Pan troglodytes       33         2e-54     210
WERAM-Caj-0205   ENSCJAP00000035935.2   Callithrix jacchus    30         8e-49     191
Created Date 25-Jun-2016
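Rows of the Orthology table are whitespace-separated, but the species name spans two words, so a naive split gives the wrong number of columns. The sketch below assumes Python and shows one way to read a row by taking the three numeric fields from the right. The name `parse_orthology_row` and the returned dictionary keys are hypothetical.

```python
def parse_orthology_row(line: str) -> dict:
    """Parse one Orthology row; the species name may contain spaces."""
    tokens = line.split()
    identity, evalue, score = tokens[-3:]
    return {
        "weram_id": tokens[0],
        "ensembl_protein_id": tokens[1],
        "species": " ".join(tokens[2:-3]),
        "identity": int(identity),   # percent identity as listed
        "evalue": float(evalue),
        "score": int(score),
    }


rows = [
    "WERAM-Hos-0219 ENSP00000419184.2 Homo sapiens 94 0.0 823",
    "WERAM-Paa-0169 ENSPANP00000006663.1 Papio anubis 32 9e-59 224",
]
records = [parse_orthology_row(r) for r in rows]

# Example filter: keep only hits with at least 80 percent identity.
high_identity = [r["species"] for r in records if r["identity"] >= 80]
print(high_identity)
```

The 80 percent cutoff is an arbitrary illustration and not a threshold defined by WERAM.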