WERAM Information


Tag Content
WERAM ID WERAM-Hos-0155
Ensembl Protein ID ENSP00000313983.7
Uniprot Accession Q9BZ95; NSD3_HUMAN; B7ZL11; D3DSX1; Q1RMD3; Q3B796; Q6ZSA5; Q9BYU8; Q9BYU9; Q9H2M8; Q9H9W9; Q9NXA6
Genbank Protein ID NP_060248.2; NP_075447.1; XP_005273604.1; XP_005273605.1
Protein Name Histone-lysine N-methyltransferase NSD3
Genbank Nucleotide ID NM_017778.2; NM_023034.1; XM_005273547.1; XM_005273548.1
Gene Name NSD3; WHSC1L1; pp14328; DC28
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000147548.16 ENST00000317025.12 ENSP00000313983.7
ENSG00000147548.16 ENST00000316985.7 ENSP00000313410.3
ENSG00000147548.16 ENST00000433384.6 ENSP00000393284.2
ENSG00000147548.16 ENST00000527502.5 ENSP00000434730.1
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SET2 SET H3K36; H3K4; H3K27 K 26807165; 20951770; 25537518
Me_Reader PWWP PWWP1 H3K36me2 K 26626481
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.40e-51 175.8 1147 1262
Me_Reader PWWP 7.10e-45 153.3 270 1021
HMT SET1 1.80e-27 98.1 1150 1262
Me_Reader PHD 4.00e-09 38.8 702 1366
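The Start and End columns appear to be 1-based, inclusive residue positions on ENSP00000313983.7: the SET2 hit at 1147 to 1262 matches the first and last aligned residues shown in the Domain Profile. Under that assumption, the following Python sketch cuts a domain span out of a sequence fragment. The names `slice_region`, `FRAGMENT` and `FRAGMENT_START` are hypothetical and used only for illustration; the residues are copied from the Protein Sequence block of this record.

```python
# Residues 1141-1262 of ENSP00000313983.7, copied from the Protein Sequence
# block of this record (10-residue grouping kept so it is easy to compare).
FRAGMENT_START = 1141
FRAGMENT = (
    "KRLYPDAEII KTERRGWGLR TKRSIKKGEF VNEYVGELID EEECRLRIKR AHENSVTNFY "
    "MLTVTKDRII DAGPKGNYSR FMNHSCNPNC ETQKWTVNGD VRVGLFALCD IPAGMELTFN "
    "YN"
).replace(" ", "")


def slice_region(fragment: str, fragment_start: int, start: int, end: int) -> str:
    """Return residues start..end (1-based, inclusive) from a fragment whose
    first residue sits at position fragment_start of the full protein."""
    fragment_end = fragment_start + len(fragment) - 1
    if start > end:
        raise ValueError("start must not exceed end")
    if start < fragment_start or end > fragment_end:
        raise ValueError("requested region lies outside the fragment")
    # Convert 1-based inclusive coordinates to a 0-based half-open slice.
    return fragment[start - fragment_start : end - fragment_start + 1]


# SET2 hit from the Classification table: Start 1147, End 1262.
set2 = slice_region(FRAGMENT, FRAGMENT_START, 1147, 1262)
print(len(set2), set2[:10], set2[-5:])
```

Note that the PWWP and PHD rows report a single Start/End pair although the Domain Profile shows several separate hits, so those spans probably run from the first hit to the last rather than describing one contiguous domain.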
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Histone methyltransferase. Preferentially methylates 'Lys-4' and 'Lys-27' of histone H3. H3 'Lys-4' methylation is a specific tag for epigenetic transcriptional activation, while H3 'Lys-27' methylation is a mark for transcriptional repression.
Domain Profile
  HMT SET2

           SET2.txt    3 veliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncetqkwt 92  
e+ikte++G+Glr+k++ikk+ef++eYvGe+ide+e++ R+k+++e++v++fY+l+++kd++iDa kGn++Rf+nhsC+Pncetqkwt
ENSP00000313983.7 1147 AEIIKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLRIKRAHENSVTNFYMLTVTKDRIIDAGPKGNYSRFMNHSCNPNCETQKWT 1236
689*************************************************************************************** PP
SET2.txt 93 vegelrvglfakkkikkgeeltfdYn 118
v+g++rvglfa ++i++g eltf+Yn
ENSP00000313983.7 1237 VNGDVRVGLFALCDIPAGMELTFNYN 1262
*************************8 PP

  Me_Reader PWWP

           PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 
+gdLVw+K+++YpwWP++v+s+p+ ++++k +++ +++y+V+FF n++erawv++k++ +y++
ENSP00000313983.7 270 VGDLVWSKVGTYPWWPCMVSSDPQLEVHTKINTRGAREYHVQFFSNQPERAWVHEKRVREYKG 332
69***********************************************************86 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
+++Vw+Kl++Y+wWPa++++p++ +++++ +++ ++++V+FFg +h+++wv++ +++py e
ENSP00000313983.7 961 KQIVWVKLGNYRWWPAEICNPRSVPLNIQGLKHDLGDFPVFFFG-SHDYYWVHQGRVFPYVE 1021
689*****************************************.***************76 PP

  HMT SET1

           SET1.txt    6 akskikglglvakkeiekeelviEYvGevirsevadkrek.eyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakvvav 94  
k + +g+gl++k++i+k+e+v EYvGe+i +e+ r k +e++ + y+ + +d ++da kgn++rf+nhsc+pNce++ +v
ENSP00000313983.7 1150 IKTERRGWGLRTKRSIKKGEFVNEYVGELIDEEECRLRIKrAHENSVTNFYMLTVTKD--RIIDAGPKGNYSRFMNHSCNPNCETQKWTV 1237
566778*************************999988888355565555*********..****************************** PP
SET1.txt 95 dgekkiviyakraIekgeeltydYk 119
+g+ +++++a +I++g elt++Y+
ENSP00000313983.7 1238 NGDVRVGLFALCDIPAGMELTFNYN 1262
************************7 PP

  Me_Reader PHD

            PHD.txt   2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52 
t+C++C +++ ++ ++ C++ C + fHl+C++l+ slp++ +++C +Ck+
ENSP00000313983.7 702 TVCQIC-ESSGDS--LIPCEGeCCKHFHLECLGLA--SLPDS-KFICMECKT 747
79****.444443..9*******************..*****.*******96 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
C+ C k++ ++ + C+ C + +H Cv + + e+k ++Cp++
ENSP00000313983.7 751 PCFSC-KVSGKD--VKRCSVgaCGKFYHEACVRKFPTAIFESKGFRCPQH 797
6999*.666654..66899999**********988899999978****98 PP
PHD.txt 2 tiClvCg.kddegeke...mvqCdeCddwfHl..kCvklplsslpegkswy..CpsC 50
+C +C+ ++d + m+ C C+ ++H C+ s+ + s++ C ++
ENSP00000313983.7 797 HCCSACSmEKDIHKASkgrMMRCLRCPVAYHSgdACIAAG--SMLVS-SYIliCSNH 850
69****5344444433688************733566666..44433.333336666 PP
PHD.txt 13 gekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
g + +++C++C+ fH +C++++ +peg w C+ Ck+
ENSP00000313983.7 920 GGR-LLCCESCPASFHPECLSIE---MPEG-CWNCNDCKA 954
222.9******************...****.*******85 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
++C+ C d ge +v Cd+ C++++Hl C++l+ + p+g +w Cp ++
ENSP00000313983.7 1322 DYCFQC--GDGGE--LVMCDKkdCPKAYHLLCLNLT--QPPYG-KWECPWHQ 1366
699999..33332..9********************..999**.****9775 PP

Protein Sequence
MDFSFSFMQG IMGNTIQQPP QLIDSANIRQ EDAFDNNSDI AEDGGQTPYE ATLQQGFQYP 60
ATTEDLPPLT NGYPSSISVY ETQTKYQSYN QYPNGSANGF GAVRNFSPTD YYHSEIPNTR 120
PHEILEKPSP PQPPPPPSVP QTVIPKKTGS PEIKLKITKT IQNGRELFES SLCGDLLNEV 180
QASEHTKSKH ESRKEKRKKS NKHDSSRSEE RKSHKIPKLE PEEQNRPNER VDTVSEKPRE 240
EPVLKEEAPV QPILSSVPTT EVSTGVKFQV GDLVWSKVGT YPWWPCMVSS DPQLEVHTKI 300
NTRGAREYHV QFFSNQPERA WVHEKRVREY KGHKQYEELL AEATKQASNH SEKQKIRKPR 360
PQRERAQWDI GIAHAEKALK MTREERIEQY TFIYIDKQPE EALSQAKKSV ASKTEVKKTR 420
RPRSVLNTQP EQTNAGEVAS SLSSTEIRRH SQRRHTSAEE EEPPPVKIAW KTAAARKSLP 480
ASITMHKGSL DLQKCNMSPV VKIEQVFALQ NATGDGKFID QFVYSTKGIG NKTEISVRGQ 540
DRLIISTPNQ RNEKPTQSVS SPEATSGSTG SVEKKQQRRS IRTRSESEKS TEVVPKKKIK 600
KEQVETVPQA TVKTGLQKGA SEISDSCKPL KKRSRASTDV EMTSSAYRDT SDSDSRGLSD 660
LQVGFGKQVD SPSATADADV SDVQSMDSSL SRRGTGMSKK DTVCQICESS GDSLIPCEGE 720
CCKHFHLECL GLASLPDSKF ICMECKTGQH PCFSCKVSGK DVKRCSVGAC GKFYHEACVR 780
KFPTAIFESK GFRCPQHCCS ACSMEKDIHK ASKGRMMRCL RCPVAYHSGD ACIAAGSMLV 840
SSYILICSNH SKRSSNSSAV NVGFCFVCAR GLIVQDHSDP MFSSYAYKSH YLLNESNRAE 900
LMKLPMIPSS SASKKKCEKG GRLLCCESCP ASFHPECLSI EMPEGCWNCN DCKAGKKLHY 960
KQIVWVKLGN YRWWPAEICN PRSVPLNIQG LKHDLGDFPV FFFGSHDYYW VHQGRVFPYV 1020
EGDKSFAEGQ TSINKTFKKA LEEAAKRFQE LKAQRESKEA LEIEKNSRKP PPYKHIKANK 1080
VIGKVQIQVA DLSEIPRCNC KPADENPCGL ESECLNRMLQ YECHPQVCPA GDRCQNQCFT 1140
KRLYPDAEII KTERRGWGLR TKRSIKKGEF VNEYVGELID EEECRLRIKR AHENSVTNFY 1200
MLTVTKDRII DAGPKGNYSR FMNHSCNPNC ETQKWTVNGD VRVGLFALCD IPAGMELTFN 1260
YNLDCLGNGR TECHCGADNC SGFLGVRPKS ACASTNEEKA KNAKLKQKRR KIKTEPKQMH 1320
EDYCFQCGDG GELVMCDKKD CPKAYHLLCL NLTQPPYGKW ECPWHQCDEC SSAAVSFCEF 1380
CPHSFCKDHE KGALVPSALE GRLCCSEHDP MAPVSPEYWS KIKCKWESQD HGEEVKE 1437
Nucleotide Sequence
GGGGGCTTTG TGCGCGGCGG CGGCGGGAGA GGCGGCGGCG GCGGCCAGCA CGGAGGCGGA 60
GGCCGAGGGG GCTGTGCACA GGTCGCCGCG GAGAGGCGTG CGAATTCCGA GCCGAGCGCC 120
GAGGACCGTG CTACCCAGGC CGGGCTGCCA GCCGCAGGCT CCTCTCTGGC AGCAGCGGCG 180
GCGCGGCGAC CCCCGTCCCT CGGCCTCCCC TTCCCATCCC ACCTCCCGAG CCTTCCTCTT 240
CCCGCAGCAC GCCCGGCCCG GCCCGGCCGT GGCCCTCCTC AGTGCCGGCC GCCATGGCAG 300
AGGCGTCCGG CGCGGGGAAA ATCTAGCCCG GGGATTTCAT GCGGCCTAGC TCGGTTCCGC 360
CTCCTCCTCG CGCGGCCCCA GCGGCTGCCC GCACCCCAGC CCCACTCCGG GCCTCCGTGT 420
CTCTCCTGTG ATCGCACTGA CACGGCCGGG GGGTTAGAAT GGAACAAACT GAAGGCCCGA 480
TGAGAGAAAG GGAAAGTTAA GGATGCTGGA GCAGAACAAT GGATTTCTCT TTCTCTTTCA 540
TGCAAGGGAT CATGGGAAAC ACAATTCAGC AACCACCTCA ACTCATTGAC TCCGCCAACA 600
TCCGTCAGGA GGATGCCTTT GATAACAACA GTGACATTGC TGAAGATGGT GGCCAGACAC 660
CATATGAAGC TACTTTGCAG CAAGGCTTTC AGTACCCAGC TACAACAGAA GATCTTCCTC 720
CACTCACAAA TGGGTATCCA TCATCAATCA GTGTGTATGA AACTCAAACC AAATACCAGT 780
CATATAATCA GTATCCTAAT GGGTCAGCCA ATGGCTTTGG TGCAGTTAGA AACTTTAGCC 840
CCACTGACTA TTATCATTCA GAAATTCCAA ACACAAGACC ACATGAAATT CTGGAAAAAC 900
CTTCCCCTCC ACAGCCACCA CCTCCTCCTT CGGTACCACA AACTGTGATT CCAAAGAAGA 960
CTGGCTCACC TGAAATTAAA CTAAAAATAA CCAAAACTAT CCAGAATGGC AGGGAATTGT 1020
TTGAGTCTTC CCTTTGTGGA GACCTTTTAA ATGAAGTACA GGCAAGTGAG CACACGAAAT 1080
CAAAGCATGA AAGCAGAAAA GAAAAGAGGA AAAAAAGCAA CAAGCATGAC TCATCAAGAT 1140
CTGAAGAGCG CAAGTCACAC AAAATCCCCA AATTAGAACC AGAGGAACAA AATAGACCAA 1200
ATGAGAGGGT TGACACTGTA TCAGAAAAAC CAAGGGAAGA ACCAGTACTA AAAGAGGAAG 1260
CCCCAGTTCA GCCAATACTA TCTTCTGTTC CAACAACGGA AGTGTCCACT GGTGTTAAGT 1320
TTCAGGTTGG CGATCTTGTG TGGTCCAAGG TGGGAACCTA TCCTTGGTGG CCTTGTATGG 1380
TTTCAAGTGA TCCCCAGCTT GAGGTTCATA CTAAAATTAA CACAAGAGGT GCCCGAGAAT 1440
ATCATGTCCA GTTTTTTAGC AACCAGCCAG AGAGGGCGTG GGTTCATGAA AAACGGGTAC 1500
GAGAGTATAA AGGTCATAAA CAGTATGAAG AATTACTGGC TGAGGCAACC AAACAAGCCA 1560
GCAATCACTC TGAGAAACAA AAGATTCGGA AACCCCGACC TCAGAGAGAA CGTGCTCAGT 1620
GGGATATTGG CATTGCCCAT GCAGAGAAAG CATTGAAAAT GACTCGAGAA GAAAGAATAG 1680
AACAGTATAC TTTTATTTAC ATTGATAAAC AGCCTGAAGA GGCTTTATCC CAAGCAAAAA 1740
AGAGTGTTGC CTCCAAAACC GAAGTTAAAA AAACCCGACG ACCAAGATCT GTGCTGAATA 1800
CTCAGCCAGA ACAGACCAAT GCAGGGGAGG TGGCCTCCTC ACTCTCAAGT ACTGAAATTC 1860
GGAGACATAG CCAGAGGCGG CACACAAGTG CGGAAGAGGA AGAGCCACCG CCTGTTAAAA 1920
TAGCCTGGAA AACTGCGGCA GCAAGGAAAT CCTTACCAGC TTCCATTACG ATGCACAAAG 1980
GGAGCCTGGA TTTGCAGAAG TGTAACATGT CTCCAGTTGT GAAAATTGAA CAAGTGTTTG 2040
CTCTTCAGAA TGCTACAGGG GATGGGAAAT TTATCGATCA ATTTGTTTAT TCAACAAAGG 2100
GAATTGGTAA CAAAACAGAA ATAAGTGTCA GGGGGCAAGA CAGGCTTATA ATTTCTACAC 2160
CAAACCAGAG AAATGAAAAG CCAACGCAGA GTGTATCATC TCCTGAAGCA ACATCTGGTT 2220
CTACAGGCTC AGTAGAAAAG AAGCAACAGA GAAGATCAAT TAGAACTCGT TCTGAATCAG 2280
AGAAATCCAC TGAGGTTGTG CCAAAGAAGA AGATCAAAAA GGAGCAGGTT GAAACAGTTC 2340
CTCAGGCTAC AGTGAAGACT GGATTACAGA AAGGTGCCAG CGAGATTTCA GATTCCTGTA 2400
AACCTCTAAA GAAAAGGAGT CGCGCCTCAA CTGATGTAGA AATGACTAGT TCAGCATACA 2460
GAGACACATC TGACTCCGAT TCTAGAGGAC TGAGTGACCT GCAGGTAGGC TTTGGAAAGC 2520
AAGTAGATAG CCCTTCAGCT ACTGCAGATG CAGACGTTTC TGATGTGCAG TCCATGGATT 2580
CAAGTTTGTC GAGAAGAGGC ACTGGAATGA GTAAGAAGGA CACTGTATGT CAGATTTGTG 2640
AAAGCTCTGG TGACTCTCTG ATTCCTTGTG AGGGAGAGTG CTGCAAACAC TTTCACCTGG 2700
AGTGCCTGGG ATTGGCATCA CTTCCTGATA GCAAGTTCAT CTGCATGGAA TGTAAAACTG 2760
GGCAGCACCC ATGTTTTTCG TGTAAAGTGT CTGGTAAAGA TGTGAAGCGT TGTTCTGTTG 2820
GTGCTTGTGG GAAATTTTAT CATGAAGCCT GTGTCCGCAA ATTCCCCACT GCCATCTTTG 2880
AATCAAAAGG ATTCCGCTGT CCTCAGCACT GCTGCTCTGC CTGCTCTATG GAGAAAGATA 2940
TCCACAAAGC AAGTAAAGGC CGCATGATGA GATGTTTAAG ATGTCCAGTT GCCTATCACT 3000
CTGGAGATGC TTGCATTGCG GCCGGAAGCA TGTTAGTATC CTCCTACATT CTCATCTGTA 3060
GTAATCATTC CAAACGGAGC AGTAATTCTT CTGCTGTAAA TGTAGGCTTT TGTTTCGTTT 3120
GTGCCAGAGG GCTGATAGTT CAGGACCATT CAGACCCCAT GTTCAGTTCA TATGCCTATA 3180
AGTCCCACTA CCTACTGAAT GAATCAAATC GTGCTGAGTT GATGAAATTA CCTATGATTC 3240
CTTCTTCGTC AGCTTCCAAA AAGAAATGTG AGAAAGGTGG AAGATTGCTC TGCTGTGAAT 3300
CGTGCCCAGC TTCCTTCCAC CCGGAATGCC TAAGCATAGA AATGCCAGAA GGCTGCTGGA 3360
ATTGTAATGA CTGTAAAGCT GGCAAGAAAC TACATTACAA GCAGATTGTT TGGGTCAAAT 3420
TGGGAAATTA CAGATGGTGG CCAGCAGAGA TCTGCAACCC CAGGTCTGTG CCACTGAACA 3480
TCCAGGGCCT TAAACATGAC TTGGGGGACT TCCCTGTATT CTTCTTTGGT TCTCATGACT 3540
ACTACTGGGT ACACCAGGGC AGAGTGTTCC CTTATGTTGA AGGAGACAAA AGCTTTGCTG 3600
AAGGGCAGAC TAGTATTAAC AAGACCTTCA AAAAGGCACT GGAAGAAGCT GCAAAACGTT 3660
TCCAGGAATT GAAAGCACAA AGAGAAAGTA AAGAAGCCCT AGAGATTGAA AAAAACTCAA 3720
GAAAACCCCC TCCCTACAAA CACATCAAAG CTAACAAAGT AATAGGAAAG GTGCAGATCC 3780
AGGTTGCTGA CCTGTCAGAG ATTCCCCGCT GTAACTGCAA GCCAGCTGAT GAAAACCCTT 3840
GTGGCTTGGA ATCGGAGTGC CTGAACAGAA TGTTGCAGTA TGAATGCCAC CCGCAGGTGT 3900
GCCCAGCTGG AGATCGTTGT CAGAACCAGT GCTTTACAAA GAGACTATAC CCTGATGCAG 3960
AGATCATCAA AACGGAGCGG AGAGGCTGGG GCCTCAGGAC CAAAAGGAGC ATTAAGAAGG 4020
GTGAATTTGT AAATGAATAC GTCGGTGAAT TAATTGATGA AGAAGAATGC AGATTGCGAA 4080
TCAAGCGAGC CCACGAGAAC AGTGTAACTA ATTTTTATAT GTTAACTGTT ACCAAGGACC 4140
GTATAATTGA TGCCGGCCCA AAAGGAAATT ATTCTCGCTT CATGAACCAC AGTTGTAATC 4200
CCAACTGTGA AACACAAAAG TGGACAGTGA ATGGAGATGT TCGAGTGGGA CTATTTGCTC 4260
TCTGTGATAT TCCTGCAGGG ATGGAGTTAA CATTTAATTA TAACCTAGAT TGTCTGGGCA 4320
ACGGCAGAAC GGAGTGCCAC TGTGGAGCAG ATAACTGCAG TGGTTTTCTA GGAGTGCGGC 4380
CAAAGTCGGC ATGTGCGTCA ACAAATGAAG AGAAGGCAAA AAATGCTAAG TTAAAACAGA 4440
AGAGACGAAA GATCAAAACA GAACCAAAGC AGATGCATGA AGATTACTGT TTTCAATGTG 4500
GAGATGGTGG AGAGCTGGTC ATGTGTGACA AAAAAGACTG TCCCAAAGCA TACCACCTCC 4560
TATGCCTTAA CCTGACTCAG CCACCATATG GAAAGTGGGA GTGTCCGTGG CATCAGTGCG 4620
ATGAGTGCAG CAGTGCAGCT GTTTCCTTCT GTGAATTCTG TCCACATTCA TTTTGTAAAG 4680
ATCATGAAAA GGGGGCCCTG GTTCCCTCTG CACTGGAAGG CCGCCTCTGC TGCTCGGAAC 4740
ATGACCCCAT GGCTCCTGTG TCACCAGAAT ACTGGAGCAA GATAAAATGT AAATGGGAAT 4800
CACAAGATCA TGGAGAAGAA GTAAAAGAAT AAATGTGTGG TGTCCCCTCC TTTCTATTTA 4860
AGTGAAAAAA GCAAATAGAT CATGCATTTA AAAAGAAGAG ACTGCTACAG TGCATACAGC 4920
CTTTGCCATC GGAACTGCCT TATTAAAGCA AAAATGGGAA ACCAGTTCAT GCAGGCAGAA 4980
GCAGTTGGTG GTGTCTGGTT TTTGTTTGAT TTGGTTGGTT TGGGATTCTT TTGTGGAGGG 5040
TTAAATTCCC TTGGTCTTTT CTTGCCTTTT ATTGTGCTTC AGTGCCATTG CAGCTTGAAA 5100
AAGAAATGTT TTTGCTGTTA AAATAAGAAC AAAGAGAAAA GTAAGTTTTG TTAATGAGAT 5160
AAATTTAAAG TCTAAGATGT GTTCCTTGGT TGTATAAAGC AAAAGTAGCC ATCATTCCTT 5220
TATTTATTTT CATTTTTAGG AATTTCAAGA AGTGTAGTTC AATAGTCTAA TCAAGTGTGT 5280
GTGTGTTTTA AGTAGGAATC TGAGAAAGCC CTCTAGGAAA GGGTATGATA AGCTTTATAT 5340
ACCTCTTTAC TGAGCAGTAG GTAGGCTCAC TTCTCTTTCC CTTCAAAATG CTTTTCATAG 5400
GCTTAGAGAA GGGCTCTATG GAAGTATTAA ATCTGGCCTC TTGAAAACAA TGCCTCGGTG 5460
GATTTTACCT CTGTGGTCTT AAACAAGGTG GGCTTTGTAC TATGGTAACT TTACCTAGAA 5520
GGGTATATGG TGTGTTTCCT TGGTGGAAGG AAAAAAAGGA ATGGATGTGC TCACTGATTT 5580
TTAAAACGTT TTCCATTGAC ATGGATGGTT TGGGTTTTGG ATGAAGTGCG TTTCTGTGGA 5640
TATGAAGCAT AGTTGGAATG GTCACTCCTA TTTTCCTGAC ACTGATTAAG GCAGCTAAAG 5700
GGGGATGAAG GCATAAACCA AAATTTGGAT CATATAAGCA CCATTTTCTG TCCAATTTTA 5760
TTGTAGACCC TGGTCTTAAT TGGAGAGAAT TACAGGCATT TTATTTTTTG TCTCTGTTTT 5820
CCTCTCCTAC TCAAACATTT CTTTTGTCAT CTGACCATTC AGTCATGTTT CCTGGTAAAA 5880
CCACACAAGT GAGTTTGAAT TGTATTTGAA TAGATCTTTT TGGTTGAGTT CCTAATTTGG 5940
GGGGTGAAAG TAGTAGGGGC ATCCCCAGGT TACATGTAAA GAATTTCTAT TATTATCCAG 6000
TACTTAAGAT TGCTCCCATT GAGTAAGGGG AAGAACTTCT ACTTTTACAG GGTGCATAAG 6060
AGCCACACAA ACACACTTCA TCTTAGGAAG CACTTGGAGA AAACTCCCAA GAAAGTCAGT 6120
GTTAGCAAAA ATGGTTGATG GTGATGCTTG ACTCTTATAG ATGCGAGTAT ACCAAGATGT 6180
CTGCTCACCA AACTCACTTG CTTTTAATGC TTAGGAATAT GTATATTCCA GTATTCCTTG 6240
TATGAAAATA ATTAGTAATG TCAGACATTG TTGTAACACT GTACTAAGTA GAAGTCTAGG 6300
GTTAGCTTGA ATGAATGTTA ACTTTCTCTG TGAATTTTGA CTGCTTACAA TTGCTGGTAC 6360
CTGGTAGCAT TTATTTTCCC TCAAGTACCT AGTAACCTTA TGAAGGTAGG GAGGGTAGAT 6420
GCTCTACCAC ATTCCTTCAG CTGTATTGAG TGTTCCATTT GCTTATCAAT TTAGAGCTCT 6480
TCTATGGAAA GAGGGTGAAA ATCTGTTTTG TGCCTCTTTT TTTTGTATGT AAATGTGCAA 6540
TTATACTGGA TTTTCTCTTG TAAAAAACAC TACAATTCTT TTACTGAAGC TCCTAAACTG 6600
CCATTTGCCT GACTCCAGCA ATTAATTCTG CGGTGACTCA TTGGGCTTCC AGTACTTCTG 6660
TTGATTAGAT ATGGTCCCAA AAGAAAGAGC AAAATGGAAA ATGCGATCCA TGTCACCAAA 6720
TATAATTGCT ACAAGTAACA TGAAATACAG CTCACCAAAA CACTAAACTT TGCTCTTGAG 6780
CAATTATATA GTTTGATGTC CTTTTAAAAA ATAAGAAAAG CTAGGTTTTA TCATACTAAA 6840
ATTTTATTTT GTATATGATG TTTGGTTTTT TAAACTTACT AACATAAGGA TTTCCTTTAA 6900
TATTCCAAAT ACAGTTCTTG AAACAGTATC CAACATGAAA TCTTATCTCC TTGCTTTTAG 6960
ACTTAGTGCT ATTTACTATC TAAAGACTTT TCTATGGTAA GCCAGGGTAC ATCCTATATA 7020
CTACAGATAC CTTAGGTTTC AGTATTTTTA TGCAGCTTCT TCATTGTGTC ACATTTTGTT 7080
TTCATATCTG TAACAGAGCT CTGCTGAAAT TAAAAAAAAA TTATGATTAA TAGGATTGTT 7140
TCACCATTTC AGAAATCAGA GTAGAAAACA AAATAGCTGC CAGGTGCAGT GGCTCACACC 7200
TGTAATCCCA GCACTTTGGG AGGCCGAGGT GGGCAGATCA TGAGGTCAGG AGTTTGAGAC 7260
CAGCCTGACC AACATGGTGA AACCCTGTCT CTGCTAAAAA TACAAAAAGG AGCCGGGCAT 7320
GGTGGCACAT ACCTGTAGTC CCAGTTTCTC AGGAGGCAGA GGTTGCAGTG AGCCAAGATC 7380
ACATCACTGC ACTCCAGCCT GGGTGACAGA GCGAGACTCC ATCTCAAAAA AAAAAGGTAA 7440
CCATAGTTCT GAAGTCTTTG GCCATGAATG TTCTTTAATG GTCTAAAAAG CGGTTTTAGT 7500
CATGCTGGTG CCAAGGAGTT CTCCTCACAC CTCCATAGCG TCAATGTTGA CCTATTTGAT 7560
ATATTTTTAA GTTGCAGATT ATAGATGCAG AAAGAGAAAA TTGTCCTTTA TGTGTGCATG 7620
AGTATACATG CACTGTGCCT GTGATACACC AATGAAATAT TCCAAAAGAC ATTGCTCACT 7680
CATAGTCTGA AGTTGGTATT TGTTAACTTT TAAATCTATA TGTGGTCAGA GTGTAAGTGG 7740
CTGGTTTTTA TTCTTGTCCA ACAATGAAAT AACTTAAAGC AGCATGGGCT AATGGTGGTA 7800
TTTTTTTTAA CCACATGTTA ATTTGTTGAA GGAAGAAGTG AACATAAGAT TTTTGATAAT 7860
CAGCCTAAAG CTTCAGAAAC TGGATCAGTA AGAGCTAAAG ATGAAATCCT GGGAAACTTT 7920
CTCAGGCACT GTGCAGGGGA GGGAGGGGCT GGACCGCCAC TGCTGCGAGG GAAGAATCAG 7980
AAGAGCTGCC TGTGGCTGGC TGAGAAGGGC AAGGTGGATG GCTCTTCATT CCTGGAAGGC 8040
TGAAGCAGCC AGAGGCAGTG ATTCACCTCT TTAATGTCTA ATCAGGGAGA GATACAGGAG 8100
TGAGGTGGGT TTTATTTTCA TTTTTGTTTT AACAAGTTCC CACTCTTATC TTAATTCTCA 8160
AAAATGATCT GGCCTCATCT ATAGAACTGG CTGAGTTGGG GAATCATTTA AAATTTGGGA 8220
TTTTCTGTAC AAGTCAAGAA CATATGTGAA AGCTAAAAAA GTAATGGAGG CAGGGGGACT 8280
GCTCCCAGCT CCCTTCTGCC ATTTACACAA AATATTTGAA AACATGTCTG TAAATAACGT 8340
TGTGGAACTG AGATATTGAG TATCTAATCA TGTAAAGGAC CCTCAGATGG GACAGGCTAT 8400
TGGACAACTT TCCTGGTTCA TGAAGTGGAA TTTCCTTTTT GAGTAGTTTC TTGCATGGTG 8460
TATTCCAATA GGTCAAACCC AGACCTACTG TTTGCAAACA GGCCATCTGT AGAGAATACA 8520
ATCTCTGGAC TGCCCAGGAC TTGTTGCTTG TCTTTCATTT CATTTTAAAT ACAAAGCTCT 8580
TCCTTTAGCA TTCACCTTGA AGTCAAGGTG TTGGCTGCCA CATGCCATGT CCTCTTAACA 8640
AGATAAGAAA ACAGTGGCTA TGGAGCCCTC ACAGCTGGGA AGCTGACATT CTATACTTCC 8700
ATATGAGAAA AAGGACATTG GAAAGTTGCT TTATTTCTGG GCAGCGTAGA CTAAAGTTAC 8760
TTTTCACAAG GCTGATGCAT CCAGCCTCCC CCAGTTGTCC AAGTTGGTGC AGACATGAAT 8820
TGATGAATGG AAACACTTTC CTTTGGCCTA GGGCCACCTA GAATTGGTCT CTGGTTATTT 8880
GTAAAACAAA AAGTAGAACT TGCTTCCATT TTCAGATGGA CTAGGGAATA CTATAATTAG 8940
ACAGCTTTGC AATGAATTAA CAAATACTGA AAAGTTGGAA ATGTGTATAT ACTGAAATCT 9000
CACCAACTTT TACCTGGGGT GGGGTACCAG GAGAAAGGGA ACCCCCTTTC TCCTGGTAAA 9060
GGGTAAGGGG GGGGATAATG TTTACCACAG GTACGAAATA GTCACTTTAA CATTGAGACC 9120
TCTGCCTCAT TGAATTCAGG TTTTTTAAGT ACTTGAAACT CTTCAGATTC TCCTTATTTT 9180
AGTTTCTTTT TACATTTATG AAGTAGAAAG CATTGTTTTG TAAACTGTTT TGAAAATAAA 9240
TAGCCTAGTC TCTTATCCTC TTTAGCGTGG ATTAAAGGTG AAGTTCTGCA AATGGGAGAG 9300
TGTTCACAGT AGATAGCTCA GATTGATTGA ACACATTTGA GGAAGAGACT CCTGCATGAG 9360
ATACCAGCAT TTTTACAAAT ACTTTTTATG TACATTCTTT ATTTTGTCAT TTTGTCAACC 9420
CTCTCCCCAA GCACATCTTC TTTCCTTTTA CTATGTCTAT GTAGGGAAAA AAACAAAACA 9480
AAAAATTGCA CTTACGTTAC ACTCCCAAAA TGTGGGTAAT CCGTGTCTTT CAAAAAACAT 9540
TTCTGTTTTT TGTTTTGTTT TGGTCAGTCC ATTGCATAAG TGACAAGTTT GGGTGCTTGT 9600
GGCACGTATG TATGAAGCGG GAGGGGGATG AGAATTGCCT GTCCTTCAGT AGGCTGTAAA 9660
AGTAATTTAC ATGTAAGTAA AAAGGGAAAA TAGAATAGAT GCCAAAGTCA TTTATTCAGT 9720
CCTTAGTTTT CTTATGTGGC ATTACTGCAT CTGCTAGTTA GTGAGAAAGC ACCCTCAGCT 9780
TTTACTGCTC CCCTCCCTGC CTGCCAACAC ACTTGATGTG TGCAAACAGC CCTCAAGTAT 9840
CTGTCAGATG ACCTATATAA GGTATTGAAT AAGGTATTCT TGTCAGTTTA GAAATGGACT 9900
GGATAAAACT TACTTGGTTG TCATTATTTT ATCTCATTTG TCCTGTTACA TGCCCTATGT 9960
TAAGATAATT ATATTGCCAC TAATAATCAA GATGCTAAAT GAGTATTACA ACTGGCTAAT 10020
ATCATTTTTT ATATACAAGG GTATGTGTAT ATTTGGAATT GATATGAGAA ACTCATTTGT 10080
ACCCATTTGA GTGATATTGC ACAACAAACA CAGATACCTA CAGACTCCGT TTTCATTTTC 10140
TCGTGTTCTT TATGATAATG ATCTTTGTAG ATTGGTTATT TCTGTACTTT ATCTGTAATA 10200
AACTTTGTAG ATCCTGTGAA CCATTACTTT GCCTAAATCA CTTGAGACTT GAGTCTTTAA 10260
TAACAAAGCA TCAATATTCA CTAAAGTCAA TCTCTTTTGA GTTTCTGTGA CTTGGCTAGA 10320
AGCTCTTGAC ACTAAGGGAT TAGTGTTAAT TTTCCCTGGG GGTGTTCCAC TAGGGCATTA 10380
CTGTATAATG ACTTGATGTT GCCACATAGA CTTCAAGATA TATAATATTT TGAGGATTTT 10440
GTTGATTGGC CTATGTTTTA TTGCATAGTG TGAAACGTGT AAAGCTTGGT TAACCTGTAT 10500
ATAGATAGCT TATTGTTGAC TAGTTATAGT GTATTTAGGG TTGCCTGTAA TATTTAAGCT 10560
TCTTTACTGA TGTGTGTGCT GGTAGGAACA TATAATTTTT GTACATTATA TTTACTGAGA 10620
TGTTGCCTTT TTTATTTTAC AAATACTTTG GAATTCCAAT GTGTTTTTTG CTTCCGTGAG 10680
GATTAATTTG GAAAGGTTTT TAATGACATT CCACTGATTT CAGATTTTGC TTGAGATTGA 10740
CTTCAATAAA TTGTCCTGTA TGTTCCAAAT TAAATA 10777
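Comparing the two sequence blocks, the coding region appears to begin at the ATG at positions 519 to 521 of the transcript, because translating from there reproduces the start of the protein sequence (MDFSFSFMQG...). The Python sketch below illustrates that check on a short excerpt using the standard genetic code. The `translate` helper is written here for illustration only; it assumes unambiguous A/C/G/T input and raises `KeyError` on anything else.

```python
from itertools import product

# Standard genetic code, built in TCAG order for each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = dna.upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i : i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Positions 519-560 (1-based) of the nucleotide sequence in this record,
# i.e. the first 14 codons of the presumed coding region.
cds_start = "ATGGATTTCTCTTTCTCTTTCATGCAAGGGATCATGGGAAAC"
print(translate(cds_start))
```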
Sequence Source Ensembl
Keyword

KW-0002--3D-structure
KW-0007--Acetylation
KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0160--Chromosomal rearrangement
KW-0158--Chromosome
KW-0175--Coiled coil
KW-0181--Complete proteome
KW-1017--Isopeptide bond
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0621--Polymorphism
KW-0656--Proto-oncogene
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0832--Ubl conjugation
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR006560--AWS_dom
IPR003616--Post-SET_dom
IPR000313--PWWP_dom
IPR001214--SET_dom
IPR019786--Zinc_finger_PHD-type_CS
IPR011011--Znf_FYVE_PHD
IPR001965--Znf_PHD
IPR019787--Znf_PHD-finger
IPR013083--Znf_RING/FYVE/PHD

PROSITE

PS51215--AWS
PS50868--POST_SET
PS50812--PWWP
PS50280--SET
PS01359--ZF_PHD_1
PS50016--ZF_PHD_2

Pfam

PF00855--PWWP
PF00856--SET

Gene Ontology

GO:0005694--C:chromosome
GO:0005654--C:nucleoplasm
GO:0005634--C:nucleus
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0008270--F:zinc ion binding
GO:0034968--P:histone lysine methylation
GO:0016571--P:histone methylation
GO:0006355--P:regulation of transcription, DNA-templated
GO:0006351--P:transcription, DNA-templated
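The Gene Ontology entries follow an `ID--aspect:term` layout, where the one-letter aspect code is C (cellular component), F (molecular function) or P (biological process). A small Python sketch for splitting such lines, assuming that layout holds for every entry, is shown here. `GoAnnotation` and `parse_go_line` are hypothetical names.

```python
from typing import NamedTuple

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoAnnotation(NamedTuple):
    go_id: str
    aspect: str
    term: str


def parse_go_line(line: str) -> GoAnnotation:
    """Split 'GO:0005634--C:nucleus' into ID, aspect and term."""
    go_id, sep, rest = line.strip().partition("--")
    if not sep:
        raise ValueError(f"missing '--' separator: {line!r}")
    # Only the first ':' after the separator divides aspect from term.
    code, _, term = rest.partition(":")
    return GoAnnotation(go_id, ASPECTS[code], term)


print(parse_go_line("GO:0006355--P:regulation of transcription, DNA-templated"))
```

The Keyword, Interpro, PROSITE and Pfam lists use the same `--` separator without the aspect prefix, so a plain `partition("--")` should be enough for them.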

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0171 ENSPTRP00000045907.3 Pan troglodytes 100 0.0 2863
WERAM-Chs-0185 ENSCSAP00000011515.1 Chlorocebus sabaeus 98 0.0 2848
WERAM-Paa-0071 ENSPANP00000005316.1 Papio anubis 97 0.0 2838
WERAM-Poa-0173 ENSPPYP00000020764.2 Pongo abelii 99 0.0 2835
WERAM-Tas-0004 ENSTSYP00000000493.1 Tarsius syrichta 96 0.0 2782
WERAM-Ict-0015 ENSSTOP00000001032.2 Ictidomys tridecemlineatus 96 0.0 2776
WERAM-Mam-0007 ENSMMUP00000001453.2 Macaca mulatta 96 0.0 2775
WERAM-Mup-0106 ENSMPUP00000009598.1 Mustela putorius furo 96 0.0 2773
WERAM-Otg-0155 ENSOGAP00000013154.2 Otolemur garnettii 95 0.0 2766
WERAM-Sus-0116 ENSSSCP00000016766.2 Sus scrofa 96 0.0 2764
WERAM-Fec-0136 ENSFCAP00000012620.3 Felis catus 96 0.0 2759
WERAM-Caf-0073 ENSCAFP00000009124.3 Canis familiaris 96 0.0 2747
WERAM-Cap-0159 ENSCPOP00000013130.2 Cavia porcellus 94 0.0 2741
WERAM-Orc-0161 ENSOCUP00000014227.2 Oryctolagus cuniculus 95 0.0 2740
WERAM-Loa-0134 ENSLAFP00000012330.4 Loxodonta africana 95 0.0 2734
WERAM-Vip-0117 ENSVPAP00000010795.1 Vicugna pacos 95 0.0 2715
WERAM-Eqc-0202 ENSECAP00000021664.1 Equus caballus 94 0.0 2698
WERAM-Ova-0008 ENSOARP00000001509.1 Ovis aries 93 0.0 2688
WERAM-Tut-0052 ENSTTRP00000004261.1 Tursiops truncatus 94 0.0 2678
WERAM-Dan-0070 ENSDNOP00000007225.3 Dasypus novemcinctus 94 0.0 2673
WERAM-Gog-0135 ENSGGOP00000011612.2 Gorilla gorilla 94 0.0 2670
WERAM-Nol-0134 ENSNLEP00000014943.2 Nomascus leucogenys 98 0.0 2654
WERAM-Ora-0032 ENSOANP00000005155.1 Ornithorhynchus anatinus 91 0.0 2618
WERAM-Sah-0189 ENSSHAP00000020535.1 Sarcophilus harrisii 92 0.0 2610
WERAM-Mum-0196 ENSMUSP00000081040.5 Mus musculus 91 0.0 2609
WERAM-Mod-0087 ENSMODP00000013395.2 Monodelphis domestica 91 0.0 2593
WERAM-Soa-0071 ENSSARP00000007016.1 Sorex araneus 88 0.0 2565
WERAM-Pes-0094 ENSPSIP00000011454.1 Pelodiscus sinensis 87 0.0 2548
WERAM-Gaga-0035 ENSGALP00000005219.3 Gallus gallus 86 0.0 2466
WERAM-Meg-0020 ENSMGAP00000001588.2 Meleagris gallopavo 86 0.0 2453
WERAM-Tag-0070 ENSTGUP00000004993.1 Taeniopygia guttata 85 0.0 2448
WERAM-Ptv-0053 ENSPVAP00000005699.1 Pteropus vampyrus 94 0.0 2431
WERAM-Ran-0103 ENSRNOP00000060879.1 Rattus norvegicus 86 0.0 2425
WERAM-Anc-0184 ENSACAP00000017722.2 Anolis carolinensis 84 0.0 2359
WERAM-Fia-0085 ENSFALP00000006989.1 Ficedula albicollis 83 0.0 2343
WERAM-Prc-0161 ENSPCAP00000015649.1 Procavia capensis 87 0.0 2332
WERAM-Mae-0114 ENSMEUP00000010800.1 Macropus eugenii 87 0.0 2320
WERAM-Mim-0088 ENSMICP00000008330.1 Microcebus murinus 89 0.0 2150
WERAM-Anp-0119 ENSAPLP00000013578.1 Anas platyrhynchos 86 0.0 2137
WERAM-Xet-0100 ENSXETP00000054753.2 Xenopus tropicalis 68 0.0 1835
WERAM-Aim-0130 ENSAMEP00000011847.1 Ailuropoda melanoleuca 96 0.0 1809
WERAM-Ere-0029 ENSEEUP00000002285.1 Erinaceus europaeus 92 0.0 1709
WERAM-Leo-0155 ENSLOCP00000018570.1 Lepisosteus oculatus 66 0.0 1684
WERAM-Ten-0192 ENSTNIP00000019016.1 Tetraodon nigroviridis 60 0.0 1678
WERAM-Tar-0201 ENSTRUP00000042261.1 Takifugu rubripes 59 0.0 1673
WERAM-Xim-0184 ENSXMAP00000014849.1 Xiphophorus maculatus 60 0.0 1655
WERAM-Orla-0025 ENSORLP00000003502.1 Oryzias latipes 59 0.0 1648
WERAM-Dio-0067 ENSDORP00000006773.1 Dipodomys ordii 87 0.0 1636
WERAM-Gam-0146 ENSGMOP00000014567.1 Gadus morhua 59 0.0 1608
WERAM-Pof-0129 ENSPFOP00000011504.2 Poecilia formosa 59 0.0 1606
WERAM-Gaa-0058 ENSGACP00000007451.1 Gasterosteus aculeatus 59 0.0 1603
WERAM-Dar-0147 ENSDARP00000085548.5 Danio rerio 62 0.0 1548
WERAM-Asm-0218 ENSAMXP00000021158.1 Astyanax mexicanus 61 0.0 1528
WERAM-Lac-0115 ENSLACP00000014009.1 Latimeria chalumnae 77 0.0 1437
WERAM-Chh-0074 ENSCHOP00000008071.1 Choloepus hoffmanni 88 0.0 1203
WERAM-Orn-0179 ENSONIP00000018125.1 Oreochromis niloticus 67 0.0 1191
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 49 0.0 1133
WERAM-Myl-0217 ENSMLUP00000019505.1 Myotis lucifugus 94 0.0 1123
WERAM-Bot-0014 ENSBTAP00000002006.4 Bos taurus 93 0.0 1120
WERAM-Ocp-0003 ENSOPRP00000000331.2 Ochotona princeps 82 0.0 1103
WERAM-Caj-0014 ENSCJAP00000003436.2 Callithrix jacchus 47 0.0 1098
WERAM-Tub-0091 ENSTBEP00000010577.1 Tupaia belangeri 94 0.0 965
WERAM-Ect-0036 ENSETEP00000003241.1 Echinops telfairi 43 0.0 953
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 52 1e-175 615
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 33 1e-109 395
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 42 2e-49 196
WERAM-Php-0036 PP1S183_22V6.1 Physcomitrella patens 46 6e-49 194
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 42 2e-48 192
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 2e-48 192
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 2e-48 192
WERAM-Sem-0083 EFJ16223 Selaginella moellendorffii 42 4e-48 191
WERAM-Miv-0037 MVLG_05378T0 Microbotryum violaceum 45 1e-47 190
WERAM-Chr-0033 EDP02327 Chlamydomonas reinhardtii 41 1e-47 189
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 41 6e-47 187
WERAM-Pyt-0041 EFQ85104 Pyrenophora teres 41 1e-46 187
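Because species names span two or three words, the orthology rows are easier to parse from both ends: the first two fields are the identifiers, the last three are Identity, E-value and Score, and whatever remains in between is the species. A minimal Python sketch under that assumption follows. `Ortholog` and `parse_ortholog_row` are hypothetical names, Identity is presumed to be a percentage, and the approach only works when the protein ID contains no spaces.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int   # presumably percent identity
    evalue: float
    score: int


def parse_ortholog_row(row: str) -> Ortholog:
    fields = row.split()
    if len(fields) < 6:
        raise ValueError(f"too few fields: {row!r}")
    weram_id, protein_id = fields[0], fields[1]
    identity, evalue, score = fields[-3:]
    # Everything between the IDs and the numeric columns is the species name.
    species = " ".join(fields[2:-3])
    return Ortholog(weram_id, protein_id, species,
                    int(identity), float(evalue), int(score))


rows = [
    "WERAM-Mup-0106 ENSMPUP00000009598.1 Mustela putorius furo 96 0.0 2773",
    "WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 52 1e-175 615",
]
orthologs = [parse_ortholog_row(r) for r in rows]
for o in orthologs:
    print(o.species, o.identity, o.evalue, o.score)
```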
Created Date 25-Jun-2016