WERAM Information


Tag Content
WERAM ID WERAM-Hos-0077
Ensembl Protein ID ENSP00000372347.5
Uniprot Accession O96028; NSD2_HUMAN; A2A2T2; A2A2T3; A2A2T4; A7MCZ1; D3DVQ2; O96031; Q4VBY8; Q672J1; Q6IS00; Q86V01; Q9BZB4; Q9UI92; Q9UPR2
Genbank Protein ID NP_001035889.1; NP_015627.1; NP_579877.1; NP_579878.1; NP_579889.1; NP_579890.1; XP_005248058.1; XP_005248062.1; XP_006713977.1; XP_006713978.1; XP_011511859.1; XP_011511860.1; XP_011511861.1; XP_011511862.1
Protein Name Histone-lysine N-methyltransferase NSD2
Genbank Nucleotide ID NM_001042424.2; NM_007331.1; NM_133330.2; NM_133331.2; NM_133334.2; NM_133335.3; XM_005248001.3; XM_005248005.1; XM_006713914.2; XM_006713915.2; XM_011513557.1; XM_011513558.1; XM_011513559.1; XM_011513560.1
Gene Name NSD2; WHSC1; TRX5; MMSET; REIIBP; KIAA1090
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000109685.17 ENST00000382891.9 ENSP00000372347.5
ENSG00000109685.17 ENST00000312087.10 ENSP00000308780.6
ENSG00000109685.17 ENST00000353275.9 ENSP00000329167.5
ENSG00000109685.17 ENST00000382888.3 ENSP00000372344.3
ENSG00000109685.17 ENST00000382892.6 ENSP00000372348.2
ENSG00000109685.17 ENST00000382895.7 ENSP00000372351.3
ENSG00000109685.17 ENST00000398261.5 ENSP00000381311.1
ENSG00000109685.17 ENST00000420906.6 ENSP00000399251.2
ENSG00000109685.17 ENST00000514045.5 ENSP00000421681.1
ENSG00000109685.17 ENST00000509115.5 ENSP00000422878.1
ENSG00000109685.17 ENST00000508803.5 ENSP00000423972.1
ENSG00000109685.17 ENST00000514329.5 ENSP00000425094.1
ENSG00000109685.17 ENST00000503128.5 ENSP00000425761.1
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SET2 SET H3K27; H3K36; H4K20 K 26807165; 20951770; 25537518
Me_Reader PWWP PWWP1 H3K36me2 K 26912663
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET2 5.40e-52 177.2 1064 1180
Me_Reader PWWP 4.30e-45 154 222 941
HMT SET1 4.50e-29 103.2 1067 1180
Me_Reader PHD 7.30e-10 41.2 668 1283
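The Classification rows can be read programmatically to rank hits by E-value or to compute the reported span. The following is a minimal Python sketch and is not part of the WERAM record; `DomainHit` and `parse_hits` are hypothetical names introduced for illustration, and the rows are copied from the table above. Note that for PWWP and PHD the Start and End columns appear to span several separate hits (see the Domain Profile), so the span is not the length of a single domain.

```python
from typing import List, NamedTuple


class DomainHit(NamedTuple):
    """One row of the Classification table (hypothetical helper type)."""
    type: str
    family: str
    evalue: float
    score: float
    start: int
    end: int


# Rows copied from the Classification table of this record.
ROWS = """\
HMT SET2 5.40e-52 177.2 1064 1180
Me_Reader PWWP 4.30e-45 154 222 941
HMT SET1 4.50e-29 103.2 1067 1180
Me_Reader PHD 7.30e-10 41.2 668 1283
"""


def parse_hits(text: str) -> List[DomainHit]:
    """Parse whitespace-separated Classification rows into DomainHit tuples."""
    hits = []
    for line in text.splitlines():
        if not line.strip():
            continue
        kind, family, evalue, score, start, end = line.split()
        hits.append(DomainHit(kind, family, float(evalue), float(score),
                              int(start), int(end)))
    return hits


hits = parse_hits(ROWS)
# The smallest E-value marks the most significant hit.
best = min(hits, key=lambda h: h.evalue)
# Inclusive span covered by the best hit, in residues.
best_span = best.end - best.start + 1
print(best.family, best_span)
```

Sorting by E-value rather than by score is a choice made for this sketch; in this table both orderings give the same ranking.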
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Histone methyltransferase with histone H3 'Lys-27' (H3K27me) methyltransferase activity. Isoform 2 may act as a transcription regulator that binds DNA and suppresses IL5 transcription through HDAC recruitment.
Domain Profile
  HMT SET2

           SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncetqkw 91  
++++ikt+ kG+Gl+ak++i+k+ef++eYvGe+ide+e+ +R+k+++e+++++fY+l++dkd++iDa kGn++Rf+nhsC+Pncet kw
ENSP00000372347.5 1064 ETKIIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKHAHENDITHFYMLTIDKDRIIDAGPKGNYSRFMNHSCQPNCETLKW 1153
689*************************************************************************************** PP
SET2.txt 92 tvegelrvglfakkkikkgeeltfdYn 118
tv+g++rvglfa+++i++g+eltf+Yn
ENSP00000372347.5 1154 TVNGDTRVGLFAVCDIPAGTELTFNYN 1180
**************************8 PP

  Me_Reader PWWP

           PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl..ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdLVw+K++gYpwWP++v+ +pl ++++ k q++++++y+V+FFg+ +eraw+ +k+lv+++
ENSP00000372347.5 222 VGDLVWSKVSGYPWWPCMVSADPLLHSYTklKGQKKSARQYHVQFFGDAPERAWIFEKSLVAFE 285
69*************************9999******************************987 PP
PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++d++w+Kl++Y+wWPa+v++p++ + +++++++e ++++V+FFg +++++w ++ +++py+e
ENSP00000372347.5 880 FQDIIWVKLGNYRWWPAEVCHPKNVPPNIQKMKHEIGEFPVFFFG-SKDYYWTHQARVFPYME 941
58*******************************************.***************87 PP

  HMT SET1

           SET1.txt    5 vakskikglglvakkeiekeelviEYvGevirsevadkrek.eyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakvva 93  
+ k + kg+glvak++i+k+e+v EYvGe+i +e+ r k +e++ + y+ +d+d ++da kgn++rf+nhsc+pNce+ +
ENSP00000372347.5 1067 IIKTDGKGWGLVAKRDIRKGEFVNEYVGELIDEEECMARIKhAHENDITHFYMLTIDKD--RIIDAGPKGNYSRFMNHSCQPNCETLKWT 1154
5567789************************9998887777366666666*********..***************************** PP
SET1.txt 94 vdgekkiviyakraIekgeeltydYk 119
v+g+++++++a+ +I++g+elt++Y+
ENSP00000372347.5 1155 VNGDTRVGLFAVCDIPAGTELTFNYN 1180
*************************7 PP

  Me_Reader PHD

            PHD.txt   2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCk 51 
+C++C ++ g+ ++ C++ C +fHl C++l+ + peg ++C +C
ENSP00000372347.5 668 YVCQLC--EKPGS--LLLCEGpCCGAFHLACLGLS--RRPEG-RFTCSECA 711
58****..44443..8*******************..*****.*******7 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++++ + C C + +H Cvk ++ e++ ++Cp +
ENSP00000372347.5 716 SCFVCKESKTD---VKRCVVtqCGKFYHEACVKKYPLTVFESRGFRCPLH 762
7****655555...33565445**********998788888878****98 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHl..kCvkl 34
+C+ C+ ++ ++ m+ C C+ ++H C+
ENSP00000372347.5 763 SCVSCHASNPSNPRpskgkMMRCVRCPVAYHSgdACLAA 801
7****87777755566778************73346655 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ +++C++C+ +fH +C++++ +p+g sw+C+ C++
ENSP00000372347.5 833 WCFVC-SKGGS---LLCCESCPAAFHPDCLNIE---MPDG-SWFCNDCRA 874
7****.33333...9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
C+ C d+ + +v Cd C +++Hl+C++l + p g +w Cp +
ENSP00000372347.5 1242 CFRC-GDGGQ---LVLCDRkfCTKAYHLSCLGLG--KRPFG-KWECPWH 1283
8888.33333...9********************..***99.****977 PP

Protein Sequence
MEFSIKQSPL SVQSVVKCIK MKQAPEILGS ANGKTPSCEV NRECSVFLSK AQLSSSLQEG 60
VMQKFNGHDA LPFIPADKLK DLTSRVFNGE PGAHDAKLRF ESQEMKGIGT PPNTTPIKNG 120
SPEIKLKITK TYMNGKPLFE SSICGDSAAD VSQSEENGQK PENKARRNRK RSIKYDSLLE 180
QGLVEAALVS KISSPSDKKI PAKKESCPNT GRDKDHLLKY NVGDLVWSKV SGYPWWPCMV 240
SADPLLHSYT KLKGQKKSAR QYHVQFFGDA PERAWIFEKS LVAFEGEGQF EKLCQESAKQ 300
APTKAEKIKL LKPISGKLRA QWEMGIVQAE EAASMSVEER KAKFTFLYVG DQLHLNPQVA 360
KEAGIAAESL GEMAESSGVS EEAAENPKSV REECIPMKRR RRAKLCSSAE TLESHPDIGK 420
STPQKTAEAD PRRGVGSPPG RKKTTVSMPR SRKGDAASQF LVFCQKHRDE VVAEHPDASG 480
EEIEELLRSQ WSLLSEKQRA RYNTKFALVA PVQAEEDSGN VNGKKRNHTK RIQDPTEDAE 540
AEDTPRKRLR TDKHSLRKRD TITDKTARTS SYKAMEAASS LKSQAATKNL SDACKPLKKR 600
NRASTAASSA LGFSKSSSPS ASLTENEVSD SPGDEPSESP YESADETQTE VSVSSKKSER 660
GVTAKKEYVC QLCEKPGSLL LCEGPCCGAF HLACLGLSRR PEGRFTCSEC ASGIHSCFVC 720
KESKTDVKRC VVTQCGKFYH EACVKKYPLT VFESRGFRCP LHSCVSCHAS NPSNPRPSKG 780
KMMRCVRCPV AYHSGDACLA AGCSVIASNS IICTAHFTAR KGKRHHAHVN VSWCFVCSKG 840
GSLLCCESCP AAFHPDCLNI EMPDGSWFCN DCRAGKKLHF QDIIWVKLGN YRWWPAEVCH 900
PKNVPPNIQK MKHEIGEFPV FFFGSKDYYW THQARVFPYM EGDRGSRYQG VRGIGRVFKN 960
ALQEAEARFR EIKLQREARE TQESERKPPP YKHIKVNKPY GKVQIYTADI SEIPKCNCKP 1020
TDENPCGFDS ECLNRMLMFE CHPQVCPAGE FCQNQCFTKR QYPETKIIKT DGKGWGLVAK 1080
RDIRKGEFVN EYVGELIDEE ECMARIKHAH ENDITHFYML TIDKDRIIDA GPKGNYSRFM 1140
NHSCQPNCET LKWTVNGDTR VGLFAVCDIP AGTELTFNYN LDCLGNEKTV CRCGASNCSG 1200
FLGDRPKTST TLSSEEKGKK TKKKTRRRRA KGEGKRQSED ECFRCGDGGQ LVLCDRKFCT 1260
KAYHLSCLGL GKRPFGKWEC PWHHCDVCGK PSTSFCHLCP NSFCKEHQDG TAFSCTPDGR 1320
SYCCEHDLGA ASVRSTKTEK PPPEPGKPKG KRRRRRGWRR VTEGK 1365
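The sequence above is printed in blocks of ten residues with a running position counter at the end of each line, so it needs cleaning before it can be used as plain FASTA. The following is a minimal Python sketch under that assumption; `clean_numbered_sequence` is a hypothetical helper name, and only the first two lines of the record are used as sample input.

```python
def clean_numbered_sequence(text: str) -> str:
    """Join blocked sequence lines, dropping the trailing position counters.

    Each counter is checked against the number of residues read so far,
    so a truncated or duplicated line raises an error instead of passing silently.
    """
    chunks = []
    total = 0
    for line in text.splitlines():
        parts = line.split()
        if not parts:
            continue
        expected = None
        if parts[-1].isdigit():
            # The last token is the running counter, not sequence data.
            expected = int(parts[-1])
            parts = parts[:-1]
        chunk = "".join(parts)
        total += len(chunk)
        if expected is not None and expected != total:
            raise ValueError(f"counter {expected} does not match length {total}")
        chunks.append(chunk)
    return "".join(chunks)


# First two lines of the Protein Sequence block of this record.
SAMPLE = """\
MEFSIKQSPL SVQSVVKCIK MKQAPEILGS ANGKTPSCEV NRECSVFLSK AQLSSSLQEG 60
VMQKFNGHDA LPFIPADKLK DLTSRVFNGE PGAHDAKLRF ESQEMKGIGT PPNTTPIKNG 120
"""

protein_start = clean_numbered_sequence(SAMPLE)
print(len(protein_start))
```

Applied to the full block, the same check should end at the final counter, 1365, which is the protein length implied by the record.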
Nucleotide Sequence
TGGTCTTGAA CTCCTGACCT TGTGATCCGC TCGCCTCAGC CTCCCAAAGT GCTGGGATTA 60
CAGGCATGAG CCACCGTGCC TGTCCTAGAA CCACTGGTAA CTCTTAACTG TGTTCTAAGA 120
ACGGAAGCAT CTGGGCTGGA TGGAATTTAG CATCAAGCAG AGTCCCCTTT CTGTTCAGAG 180
TGTTGTAAAG TGCATAAAGA TGAAGCAGGC ACCAGAAATC CTCGGCAGTG CCAACGGGAA 240
GACTCCGAGC TGCGAGGTGA ACCGCGAGTG TTCTGTGTTC CTCAGCAAAG CCCAGCTCTC 300
CAGTAGCCTG CAGGAGGGGG TCATGCAGAA GTTTAACGGC CACGACGCCC TGCCCTTTAT 360
TCCAGCCGAC AAGCTGAAAG ATCTTACTTC CCGGGTGTTT AATGGAGAAC CCGGCGCACA 420
CGATGCCAAA CTGCGTTTTG AGTCCCAGGA AATGAAAGGG ATTGGGACAC CCCCTAACAC 480
TACCCCTATC AAAAATGGCT CTCCAGAAAT TAAGCTGAAA ATCACCAAAA CATACATGAA 540
TGGGAAGCCT CTCTTTGAAT CTTCCATTTG TGGTGACAGT GCTGCTGATG TGTCTCAGTC 600
AGAAGAAAAT GGACAAAAAC CAGAAAACAA GGCGAGAAGG AACAGGAAGA GGAGCATAAA 660
ATATGACTCC TTGCTGGAGC AGGGCCTTGT CGAAGCAGCT CTTGTGTCTA AGATCTCAAG 720
TCCTTCAGAT AAAAAGATTC CAGCTAAGAA AGAGTCTTGT CCAAACACTG GAAGAGACAA 780
AGACCACCTG TTGAAATACA ACGTTGGTGA TTTGGTGTGG TCCAAAGTGT CGGGTTACCC 840
TTGGTGGCCT TGCATGGTTT CTGCAGATCC ACTCCTTCAC AGCTATACCA AACTTAAAGG 900
TCAGAAAAAG AGTGCACGCC AGTATCACGT ACAGTTCTTT GGTGACGCCC CAGAAAGAGC 960
TTGGATATTT GAGAAGAGCC TCGTAGCTTT TGAAGGAGAA GGACAGTTTG AAAAATTATG 1020
CCAGGAAAGT GCCAAGCAGG CACCCACGAA AGCTGAGAAA ATTAAGCTAT TGAAACCAAT 1080
TTCAGGGAAA TTGAGGGCCC AGTGGGAAAT GGGCATTGTT CAAGCAGAAG AAGCTGCAAG 1140
CATGTCAGTG GAGGAGCGGA AAGCCAAGTT CACCTTTCTC TATGTGGGGG ACCAGCTTCA 1200
TCTCAACCCT CAAGTAGCCA AGGAGGCTGG CATTGCTGCA GAGTCTTTGG GAGAAATGGC 1260
AGAATCCTCA GGAGTCAGTG AAGAAGCTGC TGAAAACCCC AAGTCTGTGA GAGAAGAGTG 1320
CATTCCCATG AAGAGAAGGC GGAGGGCCAA ACTGTGTAGC TCTGCAGAGA CCCTGGAGAG 1380
TCACCCCGAC ATAGGGAAGA GTACTCCTCA AAAGACGGCA GAGGCTGACC CCAGAAGAGG 1440
AGTAGGGTCT CCTCCTGGGA GGAAGAAGAC CACAGTCTCC ATGCCACGAA GCAGGAAGGG 1500
AGATGCAGCA TCCCAGTTTT TGGTCTTCTG TCAAAAACAC AGGGATGAGG TGGTAGCTGA 1560
GCACCCAGAT GCTTCAGGTG AGGAGATTGA AGAGCTGCTC AGGTCACAGT GGAGTCTGCT 1620
GAGTGAGAAG CAGAGAGCAC GCTACAACAC CAAGTTTGCC CTGGTGGCCC CTGTCCAGGC 1680
TGAAGAAGAC TCTGGTAATG TAAATGGGAA AAAAAGAAAC CACACAAAGA GGATACAGGA 1740
CCCTACAGAA GATGCTGAAG CTGAGGACAC ACCCAGGAAA AGACTCAGGA CGGACAAGCA 1800
CAGTCTTCGG AAGAGAGACA CAATCACTGA CAAAACGGCC AGAACAAGCT CTTACAAGGC 1860
CATGGAGGCA GCCTCCTCGC TCAAGAGCCA GGCAGCAACG AAAAATCTGT CTGATGCATG 1920
TAAACCACTG AAGAAGCGAA ATCGGGCTTC CACGGCAGCA TCTTCAGCTC TTGGGTTTAG 1980
CAAAAGTTCA TCTCCTTCTG CATCCTTAAC TGAGAATGAG GTCTCGGACA GCCCGGGAGA 2040
CGAGCCCTCG GAGTCCCCAT ACGAAAGTGC AGACGAAACA CAAACTGAAG TATCTGTCTC 2100
ATCCAAAAAG TCTGAGCGAG GAGTGACTGC CAAAAAGGAG TATGTGTGCC AGCTGTGTGA 2160
GAAGCCGGGC AGCCTCCTGC TCTGTGAAGG ACCCTGCTGC GGAGCTTTCC ACCTCGCCTG 2220
CCTTGGGCTT TCCCGGAGGC CAGAAGGGAG GTTCACCTGC AGCGAGTGTG CCTCAGGGAT 2280
TCACTCATGT TTCGTGTGTA AAGAGAGCAA GACAGATGTT AAGCGCTGTG TGGTAACTCA 2340
GTGTGGAAAA TTTTACCATG AGGCTTGTGT GAAAAAATAC CCTCTGACTG TATTTGAGAG 2400
CCGAGGTTTC CGCTGCCCCC TCCACAGCTG TGTGAGCTGC CATGCTTCCA ACCCTTCAAA 2460
CCCAAGGCCG TCAAAAGGTA AAATGATGCG GTGTGTCCGC TGCCCCGTTG CCTATCACAG 2520
CGGGGATGCT TGTCTGGCAG CAGGATGCTC AGTGATCGCC TCCAACAGCA TCATCTGCAC 2580
TGCCCACTTC ACTGCTCGGA AGGGGAAGCG ACACCACGCC CACGTCAACG TGAGCTGGTG 2640
CTTCGTGTGC TCCAAAGGGG GGAGCCTTCT GTGCTGTGAG TCCTGCCCAG CGGCCTTCCA 2700
CCCTGACTGC CTGAACATCG AGATGCCTGA CGGCAGCTGG TTCTGCAATG ACTGCAGGGC 2760
TGGGAAGAAG CTGCACTTCC AGGATATCAT TTGGGTGAAA CTTGGGAACT ACAGATGGTG 2820
GCCGGCAGAA GTTTGCCATC CCAAAAATGT TCCCCCAAAT ATTCAGAAAA TGAAGCACGA 2880
GATTGGAGAA TTCCCTGTGT TTTTCTTTGG GTCTAAAGAT TATTACTGGA CGCATCAGGC 2940
GCGAGTGTTC CCGTACATGG AGGGGGACCG GGGCAGCCGC TACCAGGGGG TCAGAGGGAT 3000
CGGAAGAGTC TTCAAAAACG CACTGCAAGA AGCTGAAGCT CGTTTTCGTG AAATTAAGCT 3060
TCAGAGGGAA GCCCGAGAAA CACAGGAGAG CGAGCGCAAG CCCCCACCAT ACAAGCACAT 3120
CAAGGTGAAT AAGCCTTACG GGAAAGTCCA GATCTACACA GCGGATATTT CAGAAATCCC 3180
TAAGTGCAAC TGCAAGCCCA CAGATGAGAA TCCTTGTGGC TTTGATTCGG AGTGTCTGAA 3240
CAGGATGCTG ATGTTTGAGT GCCACCCGCA GGTGTGTCCC GCGGGCGAGT TCTGCCAGAA 3300
CCAGTGCTTC ACCAAGCGCC AGTACCCAGA GACCAAGATC ATCAAGACAG ATGGCAAAGG 3360
GTGGGGCCTG GTCGCCAAGA GGGACATCAG AAAGGGAGAA TTTGTTAACG AGTACGTTGG 3420
GGAGCTGATC GACGAGGAGG AGTGCATGGC GAGAATCAAG CACGCACACG AGAACGACAT 3480
CACCCACTTC TACATGCTCA CTATAGACAA GGACCGTATA ATAGACGCTG GCCCCAAAGG 3540
AAACTACTCT CGATTTATGA ATCACAGCTG CCAGCCCAAC TGTGAGACCC TCAAGTGGAC 3600
AGTGAATGGG GACACTCGTG TGGGCCTGTT TGCCGTCTGT GACATTCCTG CAGGGACGGA 3660
GCTGACTTTT AACTACAACC TCGATTGTCT GGGCAATGAA AAAACGGTCT GCCGGTGTGG 3720
AGCCTCCAAT TGCAGTGGAT TCCTCGGGGA TAGACCAAAG ACCTCGACGA CCCTTTCATC 3780
AGAGGAAAAG GGCAAAAAGA CCAAGAAGAA AACGAGGCGG CGCAGAGCAA AAGGGGAAGG 3840
GAAGAGGCAG TCAGAGGACG AGTGCTTCCG CTGCGGTGAT GGCGGGCAGC TGGTGCTGTG 3900
TGACCGCAAG TTCTGCACCA AGGCCTACCA CCTGTCCTGC CTGGGCCTTG GCAAGCGGCC 3960
CTTCGGGAAG TGGGAATGTC CTTGGCATCA TTGTGACGTG TGTGGCAAAC CTTCGACTTC 4020
ATTTTGCCAC CTCTGCCCCA ATTCGTTCTG TAAGGAGCAC CAGGACGGGA CAGCCTTCAG 4080
CTGCACCCCG GACGGGCGGT CCTACTGCTG TGAGCATGAC TTAGGGGCGG CATCGGTCAG 4140
AAGCACCAAG ACTGAGAAGC CCCCCCCAGA GCCAGGGAAG CCGAAGGGGA AGAGGCGGCG 4200
GCGGAGGGGC TGGCGGAGAG TCACAGAGGG CAAATAGCGC CAGGCGGCCG CTTGGCCGGA 4260
TCCAGGGGCG GTGCAGGGCG GCCGGCCCTG CCTGCGGGAG AGGGCGAGCA TGAACTGGCC 4320
CGGAGGACCC AGCTCGAGCC GCCAGGACAC AGACGTACAG GCCTCCTCGG GAGGGAGCGC 4380
CTCCCCACCA CTGAGCCATC CTCAGCAGCG TCCGCTGCGT CTGCACTGAT GACCGTCTGA 4440
GCCCAGCTCA GCGTTCCTGG ACAAACAGCC TCACTCCTCA GCGTTACCGC CACACTTGAA 4500
TTTCTCCGAA TGTCAAGGTT CCCTCCCACT CTATTTTTTT AGGTTAAAGT TAATTGGCAT 4560
ATGGAATGTT TTAATCTCCT CTGAAATGTG TAGCGTAGGC TTTTCCCAAG GGTCGCTAGA 4620
AACTCGTCTT CGCGTTGCCC CCTTTCTGGC TCTCAGCGCC GTCGCCACTC GGGAGAGGCT 4680
GGGTGAGGCC CGTGTGAGGA CTGACCCTGG ATTCCTCGAA ACTGCCATTG TGATCATTAC 4740
TCTGCTCTTT GGAAATGGCT GTATCATTTT TTTGTACTAA TGTGAATTGT TCCTCAGAAA 4800
CGCTTCTTTT CCATCCTAGT GAGAAGCTGG CCCTGCAGGT GGTGGCAGCA ATGGTGTTGT 4860
AAGATTTCCT CCCGTAGTTT TTTCTCCTCA TGGATTTGAA TGAAATGCCA ATAACACGTC 4920
CACTTTCAAC GTGTAGTTTA CGCGGAGCAC TTTCGAGGCC TGGCCGGGTT GGGCCTACTT 4980
CTCACCTGGG CCTATCTTCT GAACTCGCTA GGTTCTTATC AACATTTGGG GGATAACTTT 5040
GTATATTTTT TTCATTTGGC TTTTCTTTAC CAGTTTCTGA TTTTTATTCT CAATATATTT 5100
TTGCTAAACC TATTTCACAA ATCACCACCG ACTGAAGTGT GTGTTTACTG ATGCGGCCCT 5160
GAGCTCCATG GCGAAAGGAG TGACTTTGCA GGGCGTGAGA CCGCAGTCTG CTTAGAGCAC 5220
AGGAAGTGAC AACTTAGGGA GCCCCGTAGG GCGCTGCAGG CCCCGGGGAC CCCAGCACGT 5280
GGGTCTAAAG AGAGACGGAG TCTAGCTCTC CTGCCACCCA GAGTGGCTTC CATCTCAGCA 5340
CTCTGTGGGT CTGGTGATGG AAGATGCAGT CTCTGCTGAT CACATGTGCC CTCTGCCAGG 5400
GCACCTACTG AGAGGTGCGG TCCTGGGGGT GGAGGCCTGC CTGGCAGGTG TGCGTGCCTC 5460
GTACGTGTGT TATGGGCACT GGTCTAGGCC AGGTATGACA CCCACTCTCC TGTGAGATTT 5520
CACTTTAGTT TTTAAAAGGT CCAGTTCTAC AGAGTGAGAC CTATCTATCT GAGTACTACA 5580
TATGTTTTAA GACTTGGTTC TTTTTTTGAG GGATCCTTGA CCCTGGGAAG TCTGGAGCAC 5640
CCTGAGAAGG GGGCACCATG TGTGCCTTTG CCCACGTGTC CTGAGGGGCT GCTTGTCTGG 5700
GAGGGAGGGA GAGAACATTC AGCAGCAGGT GCTTTTTTAT GGCCTTTTCT TAAAATAACC 5760
TAAGGGGGAC ACATCCATCT TGCAGAGAAG TTTACAGAAC TCCCCTTGAA AACTGCTGCT 5820
GAGGCTCCTG TTAAATTTTC TGTGGCATCT TTTATGCCTT GGTAAAAACT GCAGTGTCTT 5880
TGGACCTGAG AGTGGCTACT CCGTGGTTTT GTGACCTGTA AGCGTGGGGT TCAGGGGTGT 5940
GTGGCCCTGC AGGGTCCCAC GCCTCCCTGA GCACTGACTG GAAGTTTCAC TGGCTGGTGG 6000
CTGTCCCTTC TCCCATCAGG GTCCCCAGCA AAGTTAACTA CACAGAGGAC CCAGGGGAAA 6060
CGAGCTGTGT AGCCACTGAC TTGCTCGCGC GGCCGTGGCC TCTGAGGGGC ACTCGCCGGT 6120
TAAGACAGGG TGGGAGTAGT GCTTTCCAGT TCAGACTCTA ACTTCTCCCA AAGTGTCCTA 6180
AGAAAATACT GGATCGGCTC ATAGATTTAT GCTCCTTATG ATGCCCTAAC TTGGAAGGTT 6240
GTTCTAGGGA CAGGCCGGGC AGTGTCCCCA CACACACCTT AGAGTCGAAG GCCCCAGGGC 6300
CCCGCTGTCA CTTGCCCAAA AGATCCCTTC CGGCAGGTAA GGGACTACCA ATGCTTACGT 6360
CAAAACAGCA GAATCGGCTT TGCAGTGCAC TTTGGGGAGC AGATATTAAC TTATTTTTGT 6420
GTTGGACAGT AGTGAAATCT TGTGATTTTT AATCGCTTTG ATAATACTTC CAAATTTTAT 6480
GATTTTTCTG AAGGAAATAA TGCAAACATT TTAAATATGT TTCTCCCCCT TTCCAAAAAC 6540
TGTTAAACTA ATGAGCAAGT AACACTAACT TTGAATGTCT CTACAATACC CGTTGATAAC 6600
TCAGTGGAGC CAGGCTTTGG GGTAGCGGCC CTGAGCTTGC AGGGTTTCTC GCCACTGGGG 6660
CTGACCACGC CCCCAGCTGT GACCGTGGGT GTGGCTGGCT CTCGGCCCTG CCCAGCTTTG 6720
TTCTGAGGAC GTGGTGACTT CCTGAACATC AGCTTCAATC CTCCATCATT AATGTGAAGC 6780
AAAACACAAA AACCGCCCCA ATCCCTCAGG ATTCCTTGGC ATCCGAAACC AGCATCTGCA 6840
CCTAAACCCA TACCCACCCG TGTGCGCCCA CAGGGGGATG TGTCCGAATG GGCAGCTTAA 6900
AATGTGGTCA CCTGTGGGGG AAACTCTTCA GGCACCTGAA GTGAGAACCC AGCTGTCCGT 6960
CCTCAGGCCG GCCTTTCTTC CGGCGACACC CGTCCATGGC TGGCTGGGTC CCCTTCGCAG 7020
TGTTTGTCTG TCTTGACATC TAAACCCCGG CGTGTGCAGT GCCCATCTTC CAGGACTACC 7080
TTATTTTCCA GAATTAAACC TGTTTTATAA TTCAAGTTAA TGCAAATGAC TGTCAGTTGC 7140
CAAATATCTT GATCCTATGA GTGTAGTTGA TGACTGTTTG TTAGTCAGTA GAGTAAAATG 7200
CTGTGTCCAC GGGGTGTCAC AGCCTCACCA TACCCTGTTG AGGTGTGAAA TGCCCCGTCA 7260
GAAATTAAAT ACAAACTTAA ATGTGCCTAT TGGTGTCTAA ACTTCATACA ATGTAAGGTC 7320
AGATTCCTTT TAGGAATACT GGGTGCTGTC ACCAGGTTTG ATAGTTAGAC TTAAAAACTT 7380
GAAATTCACT TTTTGGGGGG AGGGATATAC TGAAATAGAG AGTTGAGACT TGCCAGTTGG 7440
GGGAAAATAG CATTTAAAAT GGAAAGCTGT GTTTGGAAAA TTGTGTATGA GTATTTTTGT 7500
ATTAAAAACA TTTTAAAGGC TTTTTTCTTA ACTT 7535
Sequence Source Ensembl
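The nucleotide and protein sequences in this record can be cross-checked by translation. Going by the running counters, the first ATG of the listed transcript appears to start at position 140, and the codons that follow match the start of the protein sequence (MEFSIKQSPL). The following is a minimal Python sketch of that check using the standard genetic code; `translate` and `FRAGMENT` are hypothetical names, and the fragment is positions 131 to 170 copied from the nucleotide block above.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str, start: int = 0) -> str:
    """Translate complete codons from `start` until a stop codon or the end."""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Positions 131-170 of the Nucleotide Sequence block of this record.
FRAGMENT = "CTGGGCTGGATGGAATTTAGCATCAAGCAGAGTCCCCTTT"

orf_start = FRAGMENT.find("ATG")  # 0-based offset of the first ATG in the fragment
peptide = translate(FRAGMENT, orf_start)
print(orf_start, peptide)
```

Taking the first ATG as the start codon is an assumption that happens to hold for this fragment; it is not a general rule for locating a coding sequence.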
Keyword

KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0160--Chromosomal rearrangement
KW-0158--Chromosome
KW-0181--Complete proteome
KW-0963--Cytoplasm
KW-0238--DNA-binding
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0656--Proto-oncogene
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR006560--AWS_dom
IPR009071--HMG_box_dom
IPR003616--Post-SET_dom
IPR000313--PWWP_dom
IPR001214--SET_dom
IPR019786--Zinc_finger_PHD-type_CS
IPR011011--Znf_FYVE_PHD
IPR001965--Znf_PHD
IPR019787--Znf_PHD-finger
IPR001841--Znf_RING
IPR013083--Znf_RING/FYVE/PHD

PROSITE

PS51215--AWS
PS50118--HMG_BOX_2
PS50868--POST_SET
PS50812--PWWP
PS50280--SET
PS01359--ZF_PHD_1
PS50016--ZF_PHD_2

Pfam

PF00505--HMG_box
PF00855--PWWP
PF00856--SET

Gene Ontology

GO:0005694--C:chromosome
GO:0005737--C:cytoplasm
GO:0005654--C:nucleoplasm
GO:0005634--C:nucleus
GO:0003682--F:chromatin binding
GO:0042799--F:histone methyltransferase activity (H4-K20 specific)
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0043565--F:sequence-specific DNA binding
GO:0008270--F:zinc ion binding
GO:0009653--P:anatomical structure morphogenesis
GO:0003289--P:atrial septum primum morphogenesis
GO:0003290--P:atrial septum secundum morphogenesis
GO:0060348--P:bone development
GO:0006303--P:double-strand break repair via nonhomologous end joining
GO:0010452--P:histone H3-K36 methylation
GO:0003149--P:membranous septum morphogenesis
GO:0000122--P:negative regulation of transcription from RNA polymerase II promoter
GO:0048298--P:positive regulation of isotype switching to IgA isotypes
GO:2001032--P:regulation of double-strand break repair via nonhomologous end joining
GO:0070201--P:regulation of establishment of protein localization
GO:0006351--P:transcription, DNA-templated

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0130 ENSPTRP00000027281.4 Pan troglodytes 100 0.0 2569
WERAM-Gog-0155 ENSGGOP00000013809.2 Gorilla gorilla 100 0.0 2568
WERAM-Poa-0128 ENSPPYP00000016243.2 Pongo abelii 100 0.0 2563
WERAM-Chs-0189 ENSCSAP00000011746.1 Chlorocebus sabaeus 99 0.0 2536
WERAM-Caj-0014 ENSCJAP00000003436.2 Callithrix jacchus 97 0.0 2514
WERAM-Nol-0001 ENSNLEP00000000133.2 Nomascus leucogenys 100 0.0 2432
WERAM-Eqc-0077 ENSECAP00000009841.1 Equus caballus 93 0.0 2412
WERAM-Aim-0163 ENSAMEP00000014998.1 Ailuropoda melanoleuca 92 0.0 2398
WERAM-Mup-0124 ENSMPUP00000010947.1 Mustela putorius furo 92 0.0 2375
WERAM-Caf-0157 ENSCAFP00000022033.3 Canis familiaris 91 0.0 2374
WERAM-Mum-0202 ENSMUSP00000058940.7 Mus musculus 91 0.0 2347
WERAM-Cap-0089 ENSCPOP00000006781.2 Cavia porcellus 90 0.0 2329
WERAM-Loa-0087 ENSLAFP00000006847.4 Loxodonta africana 90 0.0 2329
WERAM-Prc-0065 ENSPCAP00000006170.1 Procavia capensis 90 0.0 2322
WERAM-Ran-0186 ENSRNOP00000021952.6 Rattus norvegicus 91 0.0 2294
WERAM-Orc-0045 ENSOCUP00000004548.2 Oryctolagus cuniculus 90 0.0 2267
WERAM-Bot-0082 ENSBTAP00000010497.4 Bos taurus 86 0.0 2248
WERAM-Sus-0061 ENSSSCP00000009255.2 Sus scrofa 86 0.0 2244
WERAM-Mim-0108 ENSMICP00000010336.1 Microcebus murinus 88 0.0 2232
WERAM-Mod-0031 ENSMODP00000005127.4 Monodelphis domestica 83 0.0 2180
WERAM-Sah-0066 ENSSHAP00000007775.1 Sarcophilus harrisii 84 0.0 2167
WERAM-Otg-0158 ENSOGAP00000013339.2 Otolemur garnettii 93 0.0 2150
WERAM-Dio-0131 ENSDORP00000012311.1 Dipodomys ordii 85 0.0 2116
WERAM-Anp-0029 ENSAPLP00000003282.1 Anas platyrhynchos 81 0.0 2108
WERAM-Gaga-0142 ENSGALP00000025281.4 Gallus gallus 79 0.0 2090
WERAM-Pes-0051 ENSPSIP00000007311.1 Pelodiscus sinensis 80 0.0 2086
WERAM-Tag-0131 ENSTGUP00000010895.1 Taeniopygia guttata 81 0.0 2085
WERAM-Fia-0121 ENSFALP00000010353.1 Ficedula albicollis 79 0.0 2082
WERAM-Meg-0126 ENSMGAP00000013772.1 Meleagris gallopavo 80 0.0 2076
WERAM-Ect-0036 ENSETEP00000003241.1 Echinops telfairi 82 0.0 2048
WERAM-Dan-0079 ENSDNOP00000007821.3 Dasypus novemcinctus 87 0.0 1998
WERAM-Ova-0167 ENSOARP00000016840.1 Ovis aries 81 0.0 1987
WERAM-Fec-0112 ENSFCAP00000009858.3 Felis catus 87 0.0 1935
WERAM-Ptv-0006 ENSPVAP00000000800.1 Pteropus vampyrus 79 0.0 1922
WERAM-Tut-0152 ENSTTRP00000013007.1 Tursiops truncatus 84 0.0 1828
WERAM-Soa-0010 ENSSARP00000001031.1 Sorex araneus 92 0.0 1807
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 67 0.0 1757
WERAM-Lac-0095 ENSLACP00000012148.1 Latimeria chalumnae 67 0.0 1744
WERAM-Tas-0123 ENSTSYP00000013256.1 Tarsius syrichta 97 0.0 1546
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 89 0.0 1511
WERAM-Paa-0173 ENSPANP00000016589.1 Papio anubis 100 0.0 1459
WERAM-Anc-0083 ENSACAP00000007827.3 Anolis carolinensis 75 0.0 1353
WERAM-Ora-0115 ENSOANP00000022567.2 Ornithorhynchus anatinus 79 0.0 1330
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 53 0.0 1321
WERAM-Pof-0045 ENSPFOP00000003632.2 Poecilia formosa 50 0.0 1219
WERAM-Tar-0051 ENSTRUP00000011596.1 Takifugu rubripes 50 0.0 1217
WERAM-Xim-0090 ENSXMAP00000008149.1 Xiphophorus maculatus 50 0.0 1208
WERAM-Gaa-0041 ENSGACP00000006025.1 Gasterosteus aculeatus 50 0.0 1204
WERAM-Ict-0015 ENSSTOP00000001032.2 Ictidomys tridecemlineatus 47 0.0 1097
WERAM-Myl-0135 ENSMLUP00000010807.2 Myotis lucifugus 80 0.0 1084
WERAM-Chh-0042 ENSCHOP00000004990.1 Choloepus hoffmanni 73 0.0 1075
WERAM-Vip-0117 ENSVPAP00000010795.1 Vicugna pacos 47 0.0 1073
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 72 0.0 1055
WERAM-Dar-0064 ENSDARP00000002944.8 Danio rerio 71 0.0 1049
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 71 0.0 1040
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 71 0.0 1026
WERAM-Mam-0007 ENSMMUP00000001453.2 Macaca mulatta 45 0.0 1024
WERAM-Gam-0039 ENSGMOP00000004113.1 Gadus morhua 71 0.0 1014
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 54 0.0 926
WERAM-Ere-0029 ENSEEUP00000002285.1 Erinaceus europaeus 51 0.0 869
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 63 0.0 835
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 70 0.0 821
WERAM-Mae-0091 ENSMEUP00000008973.1 Macropus eugenii 75 0.0 757
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 58 0.0 652
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 42 5e-117 420
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 5e-56 218
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 43 4e-48 191
WERAM-Orbr-0054 OB04G19910.1 Oryza brachyantha 44 7e-48 190
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 45 8e-48 190
WERAM-Sob-0087 Sb06g016720.1 Sorghum bicolor 45 8e-48 190
WERAM-Hov-0032 MLOC_34561.1 Hordeum vulgare 43 9e-48 190
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 45 1e-47 190
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 43 4e-47 188
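Because species names in the Orthology table contain a variable number of words, the rows are easier to parse from both ends than by fixed column count. The following is a minimal Python sketch, assuming that the WERAM ID and the protein ID contain no whitespace and that the last three fields are always Identity, E-value and Score; `parse_ortholog` is a hypothetical helper name and the sample rows are copied from the table above.

```python
from typing import Dict, Union

Field = Union[str, int, float]


def parse_ortholog(line: str) -> Dict[str, Field]:
    """Split one Orthology row into its six fields."""
    weram_id, rest = line.split(maxsplit=1)
    protein_id, rest = rest.split(maxsplit=1)
    # Whatever remains before the last three numeric fields is the species name.
    species, identity, evalue, score = rest.rsplit(maxsplit=3)
    return {
        "weram_id": weram_id,
        "protein_id": protein_id,
        "species": species,
        "identity": int(identity),
        "evalue": float(evalue),
        "score": int(score),
    }


# Four rows copied from the Orthology table of this record.
SAMPLE_ROWS = [
    "WERAM-Pat-0130 ENSPTRP00000027281.4 Pan troglodytes 100 0.0 2569",
    "WERAM-Mup-0124 ENSMPUP00000010947.1 Mustela putorius furo 92 0.0 2375",
    "WERAM-Mum-0202 ENSMUSP00000058940.7 Mus musculus 91 0.0 2347",
    "WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 42 5e-117 420",
]

orthologs = [parse_ortholog(row) for row in SAMPLE_ROWS]
# Keep only orthologs with at least 90% identity to the human protein.
close = [o["species"] for o in orthologs if o["identity"] >= 90]
print(close)
```

The 90% cutoff is arbitrary and only illustrates filtering; the record itself does not define an identity threshold.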
Created Date 25-Jun-2016