WERAM Information


Tag Content
WERAM ID WERAM-Hos-0088
Ensembl Protein ID ENSP00000393453.2
Uniprot Accession Q8NB12; SMYD1_HUMAN; A0AV30; A6NE13
Genbank Protein ID NP_938015.1
Protein Name Histone-lysine N-methyltransferase SMYD1
Genbank Nucleotide ID NM_198274.3
Gene Name SMYD1;BOP;KMT3D;ZMYND18;ZMYND22
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000115593.14 ENST00000419482.6 ENSP00000393453.2
ENSG00000115593.14 ENST00000444564.2 ENSP00000407888.2
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SMYD SET H3K4 K 25537518
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SMYD 2.30e-118 396.4 7 253
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
(View)
Methylates histone H3 at 'Lys-4' (H3K4me), seems able to perform both mono-, di-, and trimethylation. Acts as a transcriptional repressor. Essential for cardiomyocyte differentiation and cardiac morphogenesis.
Domain Profile
  HMT SMYD

           SMYD.txt   1 ekvekftaegkGrGlravkplragdllfasdayayvvtkssrgvvcdrclkrkeklsrcgqckvakycdakcqkeawpdhkrecsalksykk 92 
e+ve+ftaegkGrGl+a+k+++a+d++fa++ay++vv++s++++vc++c+kr+ekl+rcgqck+a+ycd++cqk+aw++hk+ecsa+k+y+k
ENSP00000393453.2 7 ENVEVFTAEGKGRGLKATKEFWAADIIFAERAYSAVVFDSLVNFVCHTCFKRQEKLHRCGQCKFAHYCDRTCQKDAWLNHKNECSAIKRYGK 98
68****************************************************************************************** PP
SMYD.txt 93 ryPsesvrllarilvkleiegersesekllsvkdleshveklteekkedlrsdvatlqhfyreeiqqlpdqfdlveifakvncngftisdee 184
+ P+e++rl+ari++++e+eg+++++++l+sv+dl++hve+++ee+++dlr+dv+t+++++++++qq+++q+ +++if+++ncngft+sd++
ENSP00000393453.2 99 V-PNENIRLAARIMWRVEREGTGLTEGCLVSVDDLQNHVEHFGEEEQKDLRVDVDTFLQYWPPQSQQFSMQY-ISHIFGVINCNGFTLSDQR 188
*.**********************************************************************.******************* PP
SMYD.txt 185 .lqevGvgifPdlsllnhscdPnvsvvf.nGth............lelravreieeGeeltvsyi 235
lq+vGvgifP+l+l+nh+c+Pn++v+f nG+h +elra+++i+eGeeltvsyi
ENSP00000393453.2 189 gLQAVGVGIFPNLGLVNHDCWPNCTVIFnNGNHeavksmfhtqmrIELRALGKISEGEELTVSYI 253
****************************99**********************************8 PP

Protein Sequence
(Fasta)
MTIGRMENVE VFTAEGKGRG LKATKEFWAA DIIFAERAYS AVVFDSLVNF VCHTCFKRQE 60
KLHRCGQCKF AHYCDRTCQK DAWLNHKNEC SAIKRYGKVP NENIRLAARI MWRVEREGTG 120
LTEGCLVSVD DLQNHVEHFG EEEQKDLRVD VDTFLQYWPP QSQQFSMQYI SHIFGVINCN 180
GFTLSDQRGL QAVGVGIFPN LGLVNHDCWP NCTVIFNNGN HEAVKSMFHT QMRIELRALG 240
KISEGEELTV SYIDFLNVSE ERKRQLKKQY YFDCTCEHCQ KKLKDDLFLG VKDNPKPSQE 300
VVKEMIQFSK DTLEKIDKAR SEGLYHEVVK LCRECLEKQE PVFADTNIYM LRMLSIVSEV 360
LSYLQAFEEA SFYARRMVDG YMKLYHPNNA QLGMAVMRAG LTNWHAGNIE VGHGMICKAY 420
AILLVTHGPS HPITKDLEAM RVQTEMELRM FRQNEFMYYK MREAALNNQP MQVMAEPSNE 480
PSPALFHKKQ
Nucleotide Sequence
(Fasta)
GGGATGCTGA AGGTGCTGAA ATAGCAATGA CAAGAGACTT GGCTCAGTGT TAAATAACTG 60
CCGCGCTGGC CTGACAGTCT CTGAGATGAC AATAGGGAGA ATGGAGAACG TGGAGGTCTT 120
CACCGCTGAG GGCAAAGGAA GGGGTCTGAA GGCCACCAAG GAGTTCTGGG CTGCAGATAT 180
CATCTTTGCT GAGCGGGCTT ATTCCGCAGT GGTTTTTGAC AGCCTTGTTA ATTTTGTGTG 240
CCACACCTGC TTCAAGAGGC AGGAGAAGCT CCATCGCTGT GGGCAGTGCA AGTTTGCCCA 300
TTACTGCGAC CGCACCTGCC AGAAGGATGC TTGGCTGAAC CACAAGAATG AATGTTCGGC 360
CATCAAGAGA TATGGGAAGG TGCCCAATGA GAACATCAGG CTGGCGGCGC GCATCATGTG 420
GCGGGTGGAG AGAGAAGGCA CCGGGCTCAC GGAGGGCTGC CTGGTGTCCG TGGACGACTT 480
GCAGAACCAC GTGGAGCACT TTGGGGAGGA GGAGCAGAAG GACCTGCGGG TGGACGTGGA 540
CACATTCTTG CAGTACTGGC CGCCGCAGAG CCAGCAGTTC AGCATGCAGT ACATCTCGCA 600
CATCTTCGGA GTGATTAACT GCAACGGTTT TACTCTCAGT GATCAGAGAG GCCTGCAGGC 660
CGTGGGCGTA GGCATCTTCC CCAACCTGGG CCTGGTGAAC CATGACTGTT GGCCCAACTG 720
TACTGTCATA TTTAACAATG GCAATCATGA GGCAGTGAAA TCCATGTTTC ATACCCAGAT 780
GAGAATTGAG CTCCGGGCCC TAGGCAAGAT CTCAGAAGGA GAGGAGCTGA CTGTGTCCTA 840
TATTGACTTC CTCAACGTTA GTGAAGAACG CAAGAGGCAG CTGAAGAAGC AGTACTACTT 900
TGACTGCACA TGTGAACACT GCCAGAAAAA ACTGAAGGAT GACCTCTTCC TGGGGGTGAA 960
AGACAACCCC AAGCCCTCTC AGGAAGTGGT GAAGGAGATG ATACAATTCT CCAAGGATAC 1020
ATTGGAAAAG ATAGACAAGG CTCGTTCCGA GGGTTTGTAT CATGAGGTTG TGAAATTATG 1080
CCGGGAGTGC CTGGAGAAGC AGGAGCCAGT GTTTGCTGAC ACCAACATCT ACATGCTGCG 1140
GATGCTGAGC ATTGTTTCGG AGGTCCTTTC CTACCTCCAG GCCTTTGAGG AGGCCTCGTT 1200
CTATGCCAGG AGGATGGTGG ACGGCTATAT GAAGCTCTAC CACCCCAACA ATGCCCAACT 1260
GGGCATGGCC GTGATGCGGG CAGGGCTGAC CAACTGGCAT GCTGGTAACA TTGAGGTGGG 1320
GCACGGGATG ATCTGCAAAG CCTATGCCAT TCTCCTGGTG ACACACGGAC CCTCCCACCC 1380
CATCACTAAG GACTTAGAGG CCATGCGGGT GCAGACGGAG ATGGAGCTAC GCATGTTCCG 1440
CCAGAACGAA TTCATGTACT ACAAGATGCG CGAGGCTGCC CTGAACAACC AGCCCATGCA 1500
GGTCATGGCC GAGCCCAGCA ATGAGCCATC CCCAGCTCTG TTCCACAAGA AGCAATGAGG 1560
ACTGCCCAGT GGAGGAGGGG CGATGTGGCT GGGGAGCTAG GGAGAGACTC TGGAGGTGGT 1620
GGGTCTCTCG GGAGACCCCT AATGAGGAAG TTGAGGTAAT GCTTAACATT GTTGCTGTGA 1680
GAATTTACTG CCCTATGTTT CCCAGAGCCA TTTTGGCTCA ATTCAAGTCT ATTCAATTCA 1740
AGTTAACTCT AGCCCAGCCC AGATCAACTC CTCCTACAAA TATTATTGGA TGATAGGCCC 1800
TAGAACCCAA TAAAGGAGCT CCAAATGTCG TTGGGTGGGG AAGCAAAATG TAGAGAAACA 1860
TTTAAAGCAC ACTGTAATAA TAAATGCAAT TATAAACTAT ATGGAGGAGG GTGCAGAGGA 1920
GGGAATGTGT CTGGTGTGTG ATGTGTGTGT GTGCAGTGGG GGTATCACAG AGAGTATGAC 1980
ATCTGAGTTG AGGGTAGCAG GTGCCTGGAG TCTCAGGTGG CTGCTCACCC ATCTGTGCAG 2040
GTGTCTCTGG GGCTGCTGGT CTCACCTGTG GTCTGCAGTA GACACAATTG GCTGAGCAGG 2100
ATATGTGATA CTGTGTGGTT GGTGTGGAGT TTTGAAGAAG GGGCTGTGTT TGGGCCACGT 2160
AGGCTCTACT CAGAGACCTG AAACCACTTC AGAATGGTGC ATATGTCGAA AGAGCTGGCT 2220
GGGGGCCTTG CCCAAACCAA CTGAGGTCTT AAAGTCCAGG GAAAAAAAGT CTGGGTTCCA 2280
ACTAGAATTC TAGAAATATT TCTAGAACAC ACAGAGAGGG AATAAGTCCC TCTATCACCC 2340
TTATTACCAA GCCTTGTGGT TCCCTGTGAT TTTAGATAAT GTCTGATATT TTTCTGGCTA 2400
TTTGCCTAGT AGGATTTAAA AAATATTTTC AAAGTGAAGC TGAGAGAGAA TCTTGGAAAC 2460
ACACATACCT GTTGATCATG GGCCCTGCAG AATTGGCCCT TGGGGGCTTT ATTTGGTTAC 2520
ATGTGCCTGG GTGGTCTTTA CCAGCTTAGA CTCTATCATG GGCCCCCATG AAGCTCCATT 2580
CTCAATACTG AATAATTATT ACTTCCCTTG TTGAGTTTCT TTTTCTGTCA TGCCCTGGGG 2640
GCTTCTGCTC TTCTCACCAG AAAGAACATT TGAATCTGGA TTCTTGTACA CCTGGGTTAG 2700
ACCCTGTTCA GAGGTGTGGC CAATTTATCC CGATCTCCTG GAAGGCTGTT GTGATTTCCA 2760
TCTAAGAAAT GAGGGTCTTG AGAATCAACC AGTCCCAAGA TTAGCCTGTT ATCCTGTTAT 2820
CTACTGAGAC CCCAAATTTC TCACCAATGT TTTGGGAGAT CCTGGAAAAG ATCCCTTCAG 2880
TTTGGGGTGT CACCAAGACT TCTACACAAC CCAGGACTAC CATTGACCTC AGAGCTGTAC 2940
CCCACATCTT GAAGTAAATT GATCCCACCA GGTCCCACGT TTGTTATCTC TGCCTAAATG 3000
TTAGCTTCTC CATCCTCACC ACATGATGAC CTGCTGTGTC CCTCTGAGCA CTACCCAGTG 3060
GCTGAAAACT CTGCAAATGG GCCACACTTT TGCAAAATAC TTGTATCTGA CACTTAGGTC 3120
TTGTTTGAAG AATTTCCTTT CTGGAAGGTT TTACAAGAAG ACTGATAGTC TTTCAAGCCC 3180
CCACATCACA GGCTTAGGGA CGGCACTAAC TTTCTCCCAG GGATCTAACT GGCTAGTTCA 3240
AATTATCACT CTTTTACCTT CATATAAAAT GTCTCCCCCA AACCTTTTTC CCTTCTTTGT 3300
CATTGTTATC TGCTAAGCCC CTGGTCATTT CCCCATATTC GTAGTCTTTT TTTCCATCCT 3360
ATCTTTCTAA TATTTGTTGT CTTTAACAAA CTGTGTTCTG TGTCTGTGCT CCTCCTTCCC 3420
TCTCAGACCA CTGGAATGCA AGTCCTTCTT CCCTTTGGAA TGTACTCTGG ATCCCTTCCC 3480
CTGCTTTGAC CCCCAGACTT TGCTCCATCT ATTATTGCTT CTCCATCCTG GATCCTTGAC 3540
ATTTGTCACC CCACTGGCCT TCTCAGGTGC AATCAGTAAA AATGCTGAGA ACTCTTGGAT 3600
CTTAATCTTC ATGACTGAGT TTTTTTTAGT TGTATAGTTA TCATCTGCCT TTCTTCACTT 3660
TGCATTTCTT CTTGAATCCA TTGCAGATTG ACTTCCACTC CCACTCCTTC ACTAAAAGGG 3720
CTCTTACCAA GATCAAATCT AATGGGTACA TTTTAGTTCC TATGTGATTT GGCCTTTCGA 3780
TGTCAATCAT CACTCCCAGC CATTGATTTT GGTGACCCAC TTCCCTGTGA TGATCTTCTG 3840
ATCTAGTTTC TCAGGTTCCT TCGCTGGTCC TTTTTCTTTC CCTGCCCCTG ACATATTGAC 3900
ATTTCCTGGA GTTGGTTTTG TCCTTGATTC ATTCTCATGT CATTCTGCAC ACAGTCTCTG 3960
CATGAACTCA GGCAGACCCT TCATTTAATG ACCACCTTAG GGCTGATGAT TCTCAAATCT 4020
GTATTCCCCG ATCTTGCATT TGAGCTCCAG CCCCACTCAT CCTCTCGGAT GTTCTGCAGG 4080
CCCAGCAAAC TCATCATGTC CAAAGTGAAA CTTTTTCTCT TTCCTGTCTC CTCTCCTCTG 4140
ATCTGTTCTT TCTTGGAACA CCACCCAAGA ACGTCACCTC CTCCATCAGA TTGTGAGCTC 4200
CTGGAGGGCA GGAGCTGTGT CCTTCTATTC ATCTTCCTAT CCCCAGAACC TTGCACAGAT 4260
CCTGGAATGT GGTAGGTGCT CAGTAAATGT GTGTTGAATA AATGAATGAA TGAATGAACA 4320
AATGAATGAA TTTGCTTACT TCAAGGCAAA AGAACCATGA AACTGTATTT TGAGTTTCTA 4380
TGTTATAGCA GTCAGCAAAT CCTATTAAAT ACTTTGTGTT TCCAAGCAAA TAA 4434
Sequence Source Ensembl
Keyword

KW-0181--Complete proteome
KW-0963--Cytoplasm
KW-0238--DNA-binding
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0621--Polymorphism
KW-1185--Reference proteome
KW-0678--Repressor
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger
--

Interpro

IPR001214--SET_dom
IPR002893--Znf_MYND

PROSITE

PS50280--SET
PS01360--ZF_MYND_1
PS50865--ZF_MYND_2

Pfam

PF00856--SET
PF01753--zf-MYND

Gene Ontology

GO:0005737--C:cytoplasm
GO:0005634--C:nucleus
GO:0003677--F:DNA binding
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0046872--F:metal ion binding
GO:0003714--F:transcription corepressor activity
GO:0006338--P:chromatin remodeling
GO:0007507--P:heart development
GO:0045892--P:negative regulation of transcription, DNA-templated
GO:0045663--P:positive regulation of myoblast differentiation
GO:0010831--P:positive regulation of myotube differentiation
GO:0035914--P:skeletal muscle cell differentiation
GO:0006351--P:transcription, DNA-templated

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Paa-0169 ENSPANP00000006663.1 Papio anubis 99 0.0 1025
WERAM-Chs-0198 ENSCSAP00000012439.1 Chlorocebus sabaeus 99 0.0 1025
WERAM-Poa-0106 ENSPPYP00000013606.1 Pongo abelii 99 0.0 1023
WERAM-Gog-0109 ENSGGOP00000009296.2 Gorilla gorilla 93 0.0 955
WERAM-Mum-0198 ENSMUSP00000109824.2 Mus musculus 92 0.0 952
WERAM-Caj-0205 ENSCJAP00000035935.2 Callithrix jacchus 89 0.0 914
WERAM-Mam-0062 ENSMMUP00000010559.2 Macaca mulatta 83 0.0 853
WERAM-Pat-0098 ENSPTRP00000020870.2 Pan troglodytes 99 0.0 801
WERAM-Ptv-0002 ENSPVAP00000000324.1 Pteropus vampyrus 32 1e-58 224
WERAM-Dio-0005 ENSDORP00000000968.1 Dipodomys ordii 33 3e-58 223
WERAM-Cap-0029 ENSCPOP00000002413.2 Cavia porcellus 32 5e-57 219
WERAM-Bot-0204 ENSBTAP00000044365.2 Bos taurus 27 3e-42 170
Created Date 25-Jun-2016