WERAM Information


Tag Content
WERAM ID WERAM-Hos-0096
Ensembl Protein ID ENSP00000374157.5
Uniprot Accession Q03164; KMT2A_HUMAN; E9PQG7; Q13743; Q13744; Q14845; Q16364; Q59FF2; Q6UBD1; Q9HBJ3; Q9UD94; Q9UMA3
Genbank Protein ID NP_001184033.1; NP_005924.2
Protein Name Histone-lysine N-methyltransferase 2A
Genbank Nucleotide ID NM_001197104.1; NM_005933.3
Gene Name KMT2A; HRX; TRX1; CXXC7; ALL-1; MLL; MLL1; HTRX1; MLL1A; WDSTS; MLL/GAS7; TET1-MLL
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000118058.20 ENST00000389506.9 ENSP00000374157.5
ENSG00000118058.20 ENST00000392873.3 ENSP00000376612.3
ENSG00000118058.20 ENST00000534358.5 ENSP00000436786.1
Details
Type Family Domain Substrates AA References (PMIDs)
Ac_Reader Bromodomain Bromodomain H3K9 K 22464331
HMT SET1 SET H3K4 K 25537518; 26807165; 26886794
Me_Reader PHD PHD1 H3K4me3 K 25537518; 26807165; 26886794
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET1 2.90e-49 168.4 3830 3945
Me_Reader PHD 2.50e-12 49.1 1433 1978
Ac_Reader Bromodomain 9.40e-05 24.7 1703 1738
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Histone methyltransferase that plays an essential role in early development and hematopoiesis. Catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) and acetylation of 'Lys-16' of histone H4 (H4K16ac). In the MLL1/MLL complex, it specifically mediates H3K4me, a specific tag for epigenetic transcriptional activation. Has weak methyltransferase activity by itself and requires other components of the MLL1/MLL complex to obtain full methyltransferase activity. Has no activity toward histone H3 phosphorylated on 'Thr-3', less activity toward H3 dimethylated on 'Arg-8' or 'Lys-9', and higher activity toward H3 acetylated on 'Lys-9'. Required for transcriptional activation of HOXA9. Promotes PPP1R15A-induced apoptosis. Plays a critical role in the control of circadian gene expression and is essential for the transcriptional activation mediated by the CLOCK-ARNTL/BMAL1 heterodimer. Establishes a permissive chromatin state for circadian transcription by mediating rhythmic methylation of 'Lys-4' of histone H3 (H3K4me); this histone modification directs the circadian acetylation at H3K9 and H3K14, allowing the recruitment of CLOCK-ARNTL/BMAL1 to chromatin (By similarity).
Domain Profile
  HMT SET1

           SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakv 91  
++ v++s+i+g+gl++k++i+++e+viEY+G+virs ++dkrek+y++k+ig+y+fr+d++ vvdat++gn+arfinhscepNc+++v
ENSP00000374157.5 3830 AVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYMFRIDDS--EVVDATMHGNAARFINHSCEPNCYSRV 3917
58899********************************************************..*************************** PP
SET1.txt 92 vavdgekkiviyakraIekgeeltydYk 119
+++dg+k+ivi+a+r+I +geeltydYk
ENSP00000374157.5 3918 INIDGQKHIVIFAMRKIYRGEELTYDYK 3945
***************************7 PP

  Me_Reader PHD

            PHD.txt    3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.kswyCpsCk 51  
+C++C +++ e+v+C+ C + fH C++ ++++l+++ ++w+C++Ck
ENSP00000374157.5 1433 VCFLCASSGHV--EFVYCQVCCEPFHKFCLEENERPLEDQlENWCCRRCK 1480
8****554444..59******************6666655778******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslp.egks.wyCpsCke 52
++C vCg++++ +k++++C++C++ +H +C++++ + p ++k+ w+C +C++
ENSP00000374157.5 1480 KFCHVCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPtKKKKvWICTKCVR 1532
59****988888888*******************88888444336******86 PP
PHD.txt 2 tiClvCgkddegeke...mvqCdeCddwfHlkCvklp......lsslpegkswyCpsCke 52
++C++C+k+++++++ m+qC +Cd+w+H kC +l+ ls+lpe+ +++C +C+e
ENSP00000374157.5 1567 NFCPLCDKCYDDDDYeskMMQCGKCDRWVHSKCENLSdemyeiLSNLPESVAYTCVNCTE 1626
789999877666555566*******************9**99999***9999******97 PP
PHD.txt 3 iClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+k+++ + +C + C +H C + ++k+ yC++++
ENSP00000374157.5 1932 RCEFCQKPGAT---VGCCLTsCTSNYHFMCSRAKNCVFLDDKKVYCQRHR 1978
599**766665...4566558*************5555577899**9997 PP

  Ac_Reader Bromodomain

          BROMO.txt   27 kePmdLstikerleegnYsspeefvkDvrlifnNak 62  
++P+dL+ +k+++++gnY+s++ef +D+ i++ a
ENSP00000374157.5 1703 QQPLDLEGVKRKMDQGNYTSVLEFSDDIVKIIQAAI 1738
69***************************9998775 PP

Protein Sequence
MAHSCRWRFP ARPGTTGGGG GGGRRGLGGA PRQRVPALLL PPGPPVGGGG PGAPPSPPAV 60
AAAAAAAGSS GAGVPGGAAA ASAASSSSAS SSSSSSSSAS SGPALLRVGP GFDAALQVSA 120
AIGTNLRRFR AVFGESGGGG GSGEDEQFLG FGSDEEVRVR SPTRSPSVKT SPRKPRGRPR 180
SGSDRNSAIL SDPSVFSPLN KSETKSGDKI KKKDSKSIEK KRGRPPTFPG VKIKITHGKD 240
ISELPKGNKE DSLKKIKRTP SATFQQATKI KKLRAGKLSP LKSKFKTGKL QIGRKGVQIV 300
RRRGRPPSTE RIKTPSGLLI NSELEKPQKV RKDKEGTPPL TKEDKTVVRQ SPRRIKPVRI 360
IPSSKRTDAT IAKQLLQRAK KGAQKKIEKE AAQLQGRKVK TQVKNIRQFI MPVVSAISSR 420
IIKTPRRFIE DEDYDPPIKI ARLESTPNSR FSAPSCGSSE KSSAASQHSS QMSSDSSRSS 480
SPSVDTSTDS QASEEIQVLP EERSDTPEVH PPLPISQSPE NESNDRRSRR YSVSERSFGS 540
RTTKKLSTLQ SAPQQQTSSS PPPPLLTPPP PLQPASSISD HTPWLMPPTI PLASPFLPAS 600
TAPMQGKRKS ILREPTFRWT SLKHSRSEPQ YFSSAKYAKE GLIRKPIFDN FRPPPLTPED 660
VGFASGFSAS GTAASARLFS PLHSGTRFDM HKRSPLLRAP RFTPSEAHSR IFESVTLPSN 720
RTSAGTSSSG VSNRKRKRKV FSPIRSEPRS PSHSMRTRSG RLSSSELSPL TPPSSVSSSL 780
SISVSPLATS ALNPTFTFPS HSLTQSGESA EKNQRPRKQT SAPAEPFSSS SPTPLFPWFT 840
PGSQTERGRN KDKAPEELSK DRDADKSVEK DKSRERDRER EKENKRESRK EKRKKGSEIQ 900
SSSALYPVGR VSKEKVVGED VATSSSAKKA TGRKKSSSHD SGTDITSVTL GDTTAVKTKI 960
LIKKGRGNLE KTNLDLGPTA PSLEKEKTLC LSTPSSSTVK HSTSSIGSML AQADKLPMTD 1020
KRVASLLKKA KAQLCKIEKS KSLKQTDQPK AQGQESDSSE TSVRGPRIKH VCRRAAVALG 1080
RKRAVFPDDM PTLSALPWEE REKILSSMGN DDKSSIAGSE DAEPLAPPIK PIKPVTRNKA 1140
PQEPPVKKGR RSRRCGQCPG CQVPEDCGVC TNCLDKPKFG GRNIKKQCCK MRKCQNLQWM 1200
PSKAYLQKQA KAVKKKEKKS KTSEKKDSKE SSVVKNVVDS SQKPTPSARE DPAPKKSSSE 1260
PPPRKPVEEK SEEGNVSAPG PESKQATTPA SRKSSKQVSQ PALVIPPQPP TTGPPRKEVP 1320
KTTPSEPKKK QPPPPESGPE QSKQKKVAPR PSIPVKQKPK EKEKPPPVNK QENAGTLNIL 1380
STLSNGNSSK QKIPADGVHR IRVDFKEDCE AENVWEMGGL GILTSVPITP RVVCFLCASS 1440
GHVEFVYCQV CCEPFHKFCL EENERPLEDQ LENWCCRRCK FCHVCGRQHQ ATKQLLECNK 1500
CRNSYHPECL GPNYPTKPTK KKKVWICTKC VRCKSCGSTT PGKGWDAQWS HDFSLCHDCA 1560
KLFAKGNFCP LCDKCYDDDD YESKMMQCGK CDRWVHSKCE NLSDEMYEIL SNLPESVAYT 1620
CVNCTERHPA EWRLALEKEL QISLKQVLTA LLNSRTTSHL LRYRQAAKPP DLNPETEESI 1680
PSRSSPEGPD PPVLTEVSKQ DDQQPLDLEG VKRKMDQGNY TSVLEFSDDI VKIIQAAINS 1740
DGGQPEIKKA NSMVKSFFIR QMERVFPWFS VKKSRFWEPN KVSSNSGMLP NAVLPPSLDH 1800
NYAQWQEREE NSHTEQPPLM KKIIPAPKPK GPGEPDSPTP LHPPTPPILS TDRSREDSPE 1860
LNPPPGIEDN RQCALCLTYG DDSANDAGRL LYIGQNEWTH VNCALWSAEV FEDDDGSLKN 1920
VHMAVIRGKQ LRCEFCQKPG ATVGCCLTSC TSNYHFMCSR AKNCVFLDDK KVYCQRHRDL 1980
IKGEVVPENG FEVFRRVFVD FEGISLRRKF LNGLEPENIH MMIGSMTIDC LGILNDLSDC 2040
EDKLFPIGYQ CSRVYWSTTD ARKRCVYTCK IVECRPPVVE PDINSTVEHD ENRTIAHSPT 2100
SFTESSSKES QNTAEIISPP SPDRPPHSQT SGSCYYHVIS KVPRIRTPSY SPTQRSPGCR 2160
PLPSAGSPTP TTHEIVTVGD PLLSSGLRSI GSRRHSTSSL SPQRSKLRIM SPMRTGNTYS 2220
RNNVSSVSTT GTATDLESSA KVVDHVLGPL NSSTSLGQNT STSSNLQRTV VTVGNKNSHL 2280
DGSSSSEMKQ SSASDLVSKS SSLKGEKTKV LSSKSSEGSA HNVAYPGIPK LAPQVHNTTS 2340
RELNVSKIGS FAEPSSVSFS SKEALSFPHL HLRGQRNDRD QHTDSTQSAN SSPDEDTEVK 2400
TLKLSGMSNR SSIINEHMGS SSRDRRQKGK KSCKETFKEK HSSKSFLEPG QVTTGEEGNL 2460
KPEFMDEVLT PEYMGQRPCN NVSSDKIGDK GLSMPGVPKA PPMQVEGSAK ELQAPRKRTV 2520
KVTLTPLKME NESQSKNALK ESSPASPLQI ESTSPTEPIS ASENPGDGPV AQPSPNNTSC 2580
QDSQSNNYQN LPVQDRNLML PDGPKPQEDG SFKRRYPRRS ARARSNMFFG LTPLYGVRSY 2640
GEEDIPFYSS STGKKRGKRS AEGQVDGADD LSTSDEDDLY YYNFTRTVIS SGGEERLASH 2700
NLFREEEQCD LPKISQLDGV DDGTESDTSV TATTRKSSQI PKRNGKENGT ENLKIDRPED 2760
AGEKEHVTKS SVGHKNEPKM DNCHSVSRVK TQGQDSLEAQ LSSLESSRRV HTSTPSDKNL 2820
LDTYNTELLK SDSDNNNSDD CGNILPSDIM DFVLKNTPSM QALGESPESS SSELLNLGEG 2880
LGLDSNREKD MGLFEVFSQQ LPTTEPVDSS VSSSISAEEQ FELPLELPSD LSVLTTRSPT 2940
VPSQNPSRLA VISDSGEKRV TITEKSVASS ESDPALLSPG VDPTPEGHMT PDHFIQGHMD 3000
ADHISSPPCG SVEQGHGNNQ DLTRNSSTPG LQVPVSPTVP IQNQKYVPNS TDSPGPSQIS 3060
NAAVQTTPPH LKPATEKLIV VNQNMQPLYV LQTLPNGVTQ KIQLTSSVSS TPSVMETNTS 3120
VLGPMGGGLT LTTGLNPSLP TSQSLFPSAS KGLLPMSHHQ HLHSFPAATQ SSFPPNISNP 3180
PSGLLIGVQP PPDPQLLVSE SSQRTDLSTT VATPSSGLKK RPISRLQTRK NKKLAPSSTP 3240
SNIAPSDVVS NMTLINFTPS QLPNHPSLLD LGSLNTSSHR TVPNIIKRSK SSIMYFEPAP 3300
LLPQSVGGTA ATAAGTSTIS QDTSHLTSGS VSGLASSSSV LNVVSMQTTT TPTSSASVPG 3360
HVTLTNPRLL GTPDIGSISN LLIKASQQSL GIQDQPVALP PSSGMFPQLG TSQTPSTAAI 3420
TAASSICVLP STQTTGITAA SPSGEADEHY QLQHVNQLLA SKTGIHSSQR DLDSASGPQV 3480
SNFTQTVDAP NSMGLEQNKA LSSAVQASPT SPGGSPSSPS SGQRSASPSV PGPTKPKPKT 3540
KRFQLPLDKG NGKKHKVSHL RTSSSEAHIP DQETTSLTSG TGTPGAEAEQ QDTASVEQSS 3600
QKECGQPAGQ VAVLPEVQVT QNPANEQESA EPKTVEEEES NFSSPLMLWL QQEQKRKESI 3660
TEKKPKKGLV FEISSDDGFQ ICAESIEDAW KSLTDKVQEA RSNARLKQLS FAGVNGLRML 3720
GILHDAVVFL IEQLSGAKHC RNYKFRFHKP EEANEPPLNP HGSARAEVHL RKSAFDMFNF 3780
LASKHRQPPE YNPNDEEEEE VQLKSARRAT SMDLPMPMRF RHLKKTSKEA VGVYRSPIHG 3840
RGLFCKRNID AGEMVIEYAG NVIRSIQTDK REKYYDSKGI GCYMFRIDDS EVVDATMHGN 3900
AARFINHSCE PNCYSRVINI DGQKHIVIFA MRKIYRGEEL TYDYKFPIED ASNKLPCNCG 3960
AKKCRKFLN 3969
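
The protein listing above groups residues in blocks of ten and ends each row with a running position counter. The following is a minimal Python sketch, assuming that layout, for recovering the contiguous sequence and cutting out a 1-based, inclusive Start/End range such as those given in the Classification table. The helper names `parse_blocked_sequence` and `slice_region` are hypothetical and are not part of WERAM.

```python
def parse_blocked_sequence(text: str) -> str:
    """Join 10-residue blocks into one string, dropping the row counters."""
    residues = []
    for line in text.splitlines():
        # Keep alphabetic tokens only; the counter ending each row is numeric.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return the 1-based, inclusive region [start, end] of the sequence."""
    if not (1 <= start <= end <= len(sequence)):
        raise ValueError("region lies outside the sequence")
    return sequence[start - 1:end]


# First two rows of the listing, copied from this record.
rows = """MAHSCRWRFP ARPGTTGGGG GGGRRGLGGA PRQRVPALLL PPGPPVGGGG PGAPPSPPAV 60
AAAAAAAGSS GAGVPGGAAA ASAASSSSAS SSSSSSSSAS SGPALLRVGP GFDAALQVSA 120"""

seq = parse_blocked_sequence(rows)
print(len(seq))                  # number of residues parsed
print(slice_region(seq, 1, 10))  # first block of the protein
```

Applied to the full listing, `slice_region(seq, 3830, 3945)` should return the region reported for the SET1 hit, assuming the Start and End columns are 1-based and inclusive as the alignment coordinates suggest.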
Nucleotide Sequence
ATGGCGCACA GCTGTCGGTG GCGCTTCCCC GCCCGACCCG GGACCACCGG GGGCGGCGGC 60
GGCGGGGGGC GCCGGGGCCT AGGGGGCGCC CCGCGGCAAC GCGTCCCGGC CCTGCTGCTT 120
CCCCCCGGGC CCCCGGTCGG CGGTGGCGGC CCCGGGGCGC CCCCCTCCCC CCCGGCTGTG 180
GCGGCCGCGG CGGCGGCGGC GGGAAGCAGC GGGGCTGGGG TTCCAGGGGG AGCGGCCGCC 240
GCCTCAGCAG CCTCCTCGTC GTCCGCCTCG TCTTCGTCTT CGTCATCGTC CTCAGCCTCT 300
TCAGGGCCGG CCCTGCTCCG GGTGGGCCCG GGCTTCGACG CGGCGCTGCA GGTCTCGGCC 360
GCCATCGGCA CCAACCTGCG CCGGTTCCGG GCCGTGTTTG GGGAGAGCGG CGGGGGAGGC 420
GGCAGCGGAG AGGATGAGCA ATTCTTAGGT TTTGGCTCAG ATGAAGAAGT CAGAGTGCGA 480
AGTCCCACAA GGTCTCCTTC AGTTAAAACT AGTCCTCGAA AACCTCGTGG GAGACCTAGA 540
AGTGGCTCTG ACCGAAATTC AGCTATCCTC TCAGATCCAT CTGTGTTTTC CCCTCTAAAT 600
AAATCAGAGA CCAAATCTGG AGATAAGATC AAGAAGAAAG ATTCTAAAAG TATAGAAAAG 660
AAGAGAGGAA GACCTCCCAC CTTCCCTGGA GTAAAAATCA AAATAACACA TGGAAAGGAC 720
ATTTCAGAGT TACCAAAGGG AAACAAAGAA GATAGCCTGA AAAAAATTAA AAGGACACCT 780
TCTGCTACGT TTCAGCAAGC CACAAAGATT AAAAAATTAA GAGCAGGTAA ACTCTCTCCT 840
CTCAAGTCTA AGTTTAAGAC AGGGAAGCTT CAAATAGGAA GGAAGGGGGT ACAAATTGTA 900
CGACGGAGAG GAAGGCCTCC ATCAACAGAA AGGATAAAGA CCCCTTCGGG TCTCCTCATT 960
AATTCTGAAC TGGAAAAGCC CCAGAAAGTC CGGAAAGACA AGGAAGGAAC ACCTCCACTT 1020
ACAAAAGAAG ATAAGACAGT TGTCAGACAA AGCCCTCGAA GGATTAAGCC AGTTAGGATT 1080
ATTCCTTCTT CAAAAAGGAC AGATGCAACC ATTGCTAAGC AACTCTTACA GAGGGCAAAA 1140
AAGGGGGCTC AAAAGAAAAT TGAAAAAGAA GCAGCTCAGC TGCAGGGAAG AAAGGTGAAG 1200
ACACAGGTCA AAAATATTCG ACAGTTCATC ATGCCTGTTG TCAGTGCTAT CTCCTCGCGG 1260
ATCATTAAGA CCCCTCGGCG GTTTATAGAG GATGAGGATT ATGACCCTCC AATTAAAATT 1320
GCCCGATTAG AGTCTACACC GAATAGTAGA TTCAGTGCCC CGTCCTGTGG ATCTTCTGAA 1380
AAATCAAGTG CAGCTTCTCA GCACTCCTCT CAAATGTCTT CAGACTCCTC TCGATCTAGT 1440
AGCCCCAGTG TTGATACCTC CACAGACTCT CAGGCTTCTG AGGAGATTCA GGTACTTCCT 1500
GAGGAGCGGA GCGATACCCC TGAAGTTCAT CCTCCACTGC CCATTTCCCA GTCCCCAGAA 1560
AATGAGAGTA ATGATAGGAG AAGCAGAAGG TATTCAGTGT CGGAGAGAAG TTTTGGATCT 1620
AGAACGACGA AAAAATTATC AACTCTACAA AGTGCCCCCC AGCAGCAGAC CTCCTCGTCT 1680
CCACCTCCAC CTCTGCTGAC TCCACCGCCA CCACTGCAGC CAGCCTCCAG TATCTCTGAC 1740
CACACACCTT GGCTTATGCC TCCAACAATC CCCTTAGCAT CACCATTTTT GCCTGCTTCC 1800
ACTGCTCCTA TGCAAGGGAA GCGAAAATCT ATTTTGCGAG AACCGACATT TAGGTGGACT 1860
TCTTTAAAGC ATTCTAGGTC AGAGCCACAA TACTTTTCCT CAGCAAAGTA TGCCAAAGAA 1920
GGTCTTATTC GCAAACCAAT ATTTGATAAT TTCCGACCCC CTCCACTAAC TCCCGAGGAC 1980
GTTGGCTTTG CATCTGGTTT TTCTGCATCT GGTACCGCTG CTTCAGCCCG ATTGTTTTCG 2040
CCACTCCATT CTGGAACAAG GTTTGATATG CACAAAAGGA GCCCTCTTCT GAGAGCTCCA 2100
AGATTTACTC CAAGTGAGGC TCACTCTAGA ATATTTGAGT CTGTAACCTT GCCTAGTAAT 2160
CGAACTTCTG CTGGAACATC TTCTTCAGGA GTATCCAATA GAAAAAGGAA AAGAAAAGTG 2220
TTTAGTCCTA TTCGATCTGA ACCAAGATCT CCTTCTCACT CCATGAGGAC AAGAAGTGGA 2280
AGGCTTAGTA GTTCTGAGCT CTCACCTCTC ACCCCCCCGT CTTCTGTCTC TTCCTCGTTA 2340
AGCATTTCTG TTAGTCCTCT TGCCACTAGT GCCTTAAACC CAACTTTTAC TTTTCCTTCT 2400
CATTCCCTGA CTCAGTCTGG GGAATCTGCA GAGAAAAATC AGAGACCAAG GAAGCAGACT 2460
AGTGCTCCGG CAGAGCCATT TTCATCAAGT AGTCCTACTC CTCTCTTCCC TTGGTTTACC 2520
CCAGGCTCTC AGACTGAAAG AGGGAGAAAT AAAGACAAGG CCCCCGAGGA GCTGTCCAAA 2580
GATCGAGATG CTGACAAGAG CGTGGAGAAG GACAAGAGTA GAGAGAGAGA CCGGGAGAGA 2640
GAAAAGGAGA ATAAGCGGGA GTCAAGGAAA GAGAAAAGGA AAAAGGGATC AGAAATTCAG 2700
AGTAGTTCTG CTTTGTATCC TGTGGGTAGG GTTTCCAAAG AGAAGGTTGT TGGTGAAGAT 2760
GTTGCCACTT CATCTTCTGC CAAAAAAGCA ACAGGGCGGA AGAAGTCTTC ATCACATGAT 2820
TCTGGGACTG ATATTACTTC TGTGACTCTT GGGGATACAA CAGCTGTCAA AACCAAAATA 2880
CTTATAAAGA AAGGGAGAGG AAATCTGGAA AAAACCAACT TGGACCTCGG CCCAACTGCC 2940
CCATCCCTGG AGAAGGAGAA AACCCTCTGC CTTTCCACTC CTTCATCTAG CACTGTTAAA 3000
CATTCCACTT CCTCCATAGG CTCCATGTTG GCTCAGGCAG ACAAGCTTCC AATGACTGAC 3060
AAGAGGGTTG CCAGCCTCCT AAAAAAGGCC AAAGCTCAGC TCTGCAAGAT TGAGAAGAGT 3120
AAGAGTCTTA AACAAACCGA CCAGCCCAAA GCACAGGGTC AAGAAAGTGA CTCATCAGAG 3180
ACCTCTGTGC GAGGACCCCG GATTAAACAT GTCTGCAGAA GAGCAGCTGT TGCCCTTGGC 3240
CGAAAACGAG CTGTGTTTCC TGATGACATG CCCACCCTGA GTGCCTTACC ATGGGAAGAA 3300
CGAGAAAAGA TTTTGTCTTC CATGGGGAAT GATGACAAGT CATCAATTGC TGGCTCAGAA 3360
GATGCTGAAC CTCTTGCTCC ACCCATCAAA CCAATTAAAC CTGTCACTAG AAACAAGGCA 3420
CCCCAGGAAC CTCCAGTAAA GAAAGGACGT CGATCGAGGC GGTGTGGGCA GTGTCCCGGC 3480
TGCCAGGTGC CTGAGGACTG TGGTGTTTGT ACTAATTGCT TAGATAAGCC CAAGTTTGGT 3540
GGTCGCAATA TAAAGAAGCA GTGCTGCAAG ATGAGAAAAT GTCAGAATCT ACAATGGATG 3600
CCTTCCAAAG CCTACCTGCA GAAGCAAGCT AAAGCTGTGA AAAAGAAAGA GAAAAAGTCT 3660
AAGACCAGTG AAAAGAAAGA CAGCAAAGAG AGCAGTGTTG TGAAGAACGT GGTGGACTCT 3720
AGTCAGAAAC CTACCCCATC AGCAAGAGAG GATCCTGCCC CAAAGAAAAG CAGTAGTGAG 3780
CCTCCTCCAC GAAAGCCCGT CGAGGAAAAG AGTGAAGAAG GGAATGTCTC GGCCCCTGGG 3840
CCTGAATCCA AACAGGCCAC CACTCCAGCT TCCAGGAAGT CAAGCAAGCA GGTCTCCCAG 3900
CCAGCACTGG TCATCCCGCC TCAGCCACCT ACTACAGGAC CGCCAAGAAA AGAAGTTCCC 3960
AAAACCACTC CTAGTGAGCC CAAGAAAAAG CAGCCTCCAC CACCAGAATC AGGTCCAGAG 4020
CAGAGCAAAC AGAAAAAAGT GGCTCCCCGC CCAAGTATCC CTGTAAAACA AAAACCAAAA 4080
GAAAAGGAAA AACCACCTCC GGTCAATAAG CAGGAGAATG CAGGCACTTT GAACATCCTC 4140
AGCACTCTCT CCAATGGCAA TAGTTCTAAG CAAAAAATTC CAGCAGATGG AGTCCACAGG 4200
ATCAGAGTGG ACTTTAAGGA GGATTGTGAA GCAGAAAATG TGTGGGAGAT GGGAGGCTTA 4260
GGAATCTTGA CTTCTGTTCC TATAACACCC AGGGTGGTTT GCTTTCTCTG TGCCAGTAGT 4320
GGGCATGTAG AGTTTGTGTA TTGCCAAGTC TGTTGTGAGC CCTTCCACAA GTTTTGTTTA 4380
GAGGAGAACG AGCGCCCTCT GGAGGACCAG CTGGAAAATT GGTGTTGTCG TCGTTGCAAA 4440
TTCTGTCACG TTTGTGGAAG GCAACATCAG GCTACAAAGC AGCTGCTGGA GTGTAATAAG 4500
TGCCGAAACA GCTATCACCC TGAGTGCCTG GGACCAAACT ACCCCACCAA ACCCACAAAG 4560
AAGAAGAAAG TCTGGATCTG TACCAAGTGT GTTCGCTGTA AGAGCTGTGG ATCCACAACT 4620
CCAGGCAAAG GGTGGGATGC ACAGTGGTCT CATGATTTCT CACTGTGTCA TGATTGCGCC 4680
AAGCTCTTTG CTAAAGGAAA CTTCTGCCCT CTCTGTGACA AATGTTATGA TGATGATGAC 4740
TATGAGAGTA AGATGATGCA ATGTGGAAAG TGTGATCGCT GGGTCCATTC CAAATGTGAG 4800
AATCTTTCAG ATGAGATGTA TGAGATTCTA TCTAATCTGC CAGAAAGTGT GGCCTACACT 4860
TGTGTGAACT GTACTGAGCG GCACCCTGCA GAGTGGCGAC TGGCCCTTGA AAAAGAGCTG 4920
CAGATTTCTC TGAAGCAAGT TCTGACAGCT TTGTTGAATT CTCGGACTAC CAGCCATTTG 4980
CTACGCTACC GGCAGGCTGC CAAGCCTCCA GACTTAAATC CCGAGACAGA GGAGAGTATA 5040
CCTTCCCGCA GCTCCCCCGA AGGACCTGAT CCACCAGTTC TTACTGAGGT CAGCAAACAG 5100
GATGATCAGC AGCCTTTAGA TCTAGAAGGA GTCAAGAGGA AGATGGACCA AGGGAATTAC 5160
ACATCTGTGT TGGAGTTCAG TGATGATATT GTGAAGATCA TTCAAGCAGC CATTAATTCA 5220
GATGGAGGAC AGCCAGAAAT TAAAAAAGCC AACAGCATGG TCAAGTCCTT CTTCATTCGG 5280
CAAATGGAAC GTGTTTTTCC ATGGTTCAGT GTCAAAAAGT CCAGGTTTTG GGAGCCAAAT 5340
AAAGTATCAA GCAACAGTGG GATGTTACCA AACGCAGTGC TTCCACCTTC ACTTGACCAT 5400
AATTATGCTC AGTGGCAGGA GCGAGAGGAA AACAGCCACA CTGAGCAGCC TCCTTTAATG 5460
AAGAAAATCA TTCCAGCTCC CAAACCCAAA GGTCCTGGAG AACCAGACTC ACCAACTCCT 5520
CTGCATCCTC CTACACCACC AATTTTGAGT ACTGATAGGA GTCGAGAAGA CAGTCCAGAG 5580
CTGAACCCAC CCCCAGGCAT AGAAGACAAT AGACAGTGTG CGTTATGTTT GACTTATGGT 5640
GATGACAGTG CTAATGATGC TGGTCGTTTA CTATATATTG GCCAAAATGA GTGGACACAT 5700
GTAAATTGTG CTTTGTGGTC AGCGGAAGTG TTTGAAGATG ATGACGGATC ACTAAAGAAT 5760
GTGCATATGG CTGTGATCAG GGGCAAGCAG CTGAGATGTG AATTCTGCCA AAAGCCAGGA 5820
GCCACCGTGG GTTGCTGTCT CACATCCTGC ACCAGCAACT ATCACTTCAT GTGTTCCCGA 5880
GCCAAGAACT GTGTCTTTCT GGATGATAAA AAAGTATATT GCCAACGACA TCGGGATTTG 5940
ATCAAAGGCG AAGTGGTTCC TGAGAATGGA TTTGAAGTTT TCAGAAGAGT GTTTGTGGAC 6000
TTTGAAGGAA TCAGCTTGAG AAGGAAGTTT CTCAATGGCT TGGAACCAGA AAATATCCAC 6060
ATGATGATTG GGTCTATGAC AATCGACTGC TTAGGAATTC TAAATGATCT CTCCGACTGT 6120
GAAGATAAGC TCTTTCCTAT TGGATATCAG TGTTCCAGGG TATACTGGAG CACCACAGAT 6180
GCTCGCAAGC GCTGTGTATA TACATGCAAG ATAGTGGAGT GCCGTCCTCC AGTCGTAGAG 6240
CCGGATATCA ACAGCACTGT TGAACATGAT GAAAACAGGA CCATTGCCCA TAGTCCAACA 6300
TCTTTTACAG AAAGTTCATC AAAAGAGAGT CAAAACACAG CTGAAATTAT AAGTCCTCCA 6360
TCACCAGACC GACCTCCTCA TTCACAAACC TCTGGCTCCT GTTATTATCA TGTCATCTCA 6420
AAGGTCCCCA GGATTCGAAC ACCCAGTTAT TCTCCAACAC AGAGATCCCC TGGCTGTCGA 6480
CCGTTGCCTT CTGCAGGAAG TCCTACCCCA ACCACTCATG AAATAGTCAC AGTAGGTGAT 6540
CCTTTACTCT CCTCTGGACT TCGAAGCATT GGCTCCAGGC GTCACAGTAC CTCTTCCTTA 6600
TCACCCCAGC GGTCCAAACT CCGGATAATG TCTCCAATGA GAACTGGGAA TACTTACTCT 6660
AGGAATAATG TTTCCTCAGT CTCCACCACC GGGACCGCTA CTGATCTTGA ATCAAGTGCC 6720
AAAGTAGTTG ATCATGTCTT AGGGCCACTG AATTCAAGTA CTAGTTTAGG GCAAAACACT 6780
TCCACCTCTT CAAATTTGCA AAGGACAGTG GTTACTGTAG GCAATAAAAA CAGTCACTTG 6840
GATGGATCTT CATCTTCAGA AATGAAGCAG TCCAGTGCTT CAGACTTGGT GTCCAAGAGC 6900
TCCTCTTTAA AGGGAGAGAA GACCAAAGTG CTGAGTTCCA AGAGCTCAGA GGGATCTGCA 6960
CATAATGTGG CTTACCCTGG AATTCCTAAA CTGGCCCCAC AGGTTCATAA CACAACATCT 7020
AGAGAACTGA ATGTTAGTAA AATCGGCTCC TTTGCTGAAC CCTCTTCAGT GTCGTTTTCT 7080
TCTAAAGAGG CCCTCTCCTT CCCACACCTC CATTTGAGAG GGCAAAGGAA TGATCGAGAC 7140
CAACACACAG ATTCTACCCA ATCAGCAAAC TCCTCTCCAG ATGAAGATAC TGAAGTCAAA 7200
ACCTTGAAGC TATCTGGAAT GAGCAACAGA TCATCCATTA TCAACGAACA TATGGGATCT 7260
AGTTCCAGAG ATAGGAGACA GAAAGGGAAA AAATCCTGTA AAGAAACTTT CAAAGAAAAG 7320
CATTCCAGTA AATCTTTTTT GGAACCTGGT CAGGTGACAA CTGGTGAGGA AGGAAACTTG 7380
AAGCCAGAGT TTATGGATGA GGTTTTGACT CCTGAGTATA TGGGCCAACG ACCATGTAAC 7440
AATGTTTCTT CTGATAAGAT TGGTGATAAA GGCCTTTCTA TGCCAGGAGT CCCCAAAGCT 7500
CCACCCATGC AAGTAGAAGG ATCTGCCAAG GAATTACAGG CACCACGGAA ACGCACAGTC 7560
AAAGTGACAC TGACACCTCT AAAAATGGAA AATGAGAGTC AATCCAAAAA TGCCCTGAAA 7620
GAAAGTAGTC CTGCTTCCCC TTTGCAAATA GAGTCAACAT CTCCCACAGA ACCAATTTCA 7680
GCCTCTGAAA ATCCAGGAGA TGGTCCAGTG GCCCAACCAA GCCCCAATAA TACCTCATGC 7740
CAGGATTCTC AAAGTAACAA CTATCAGAAT CTTCCAGTAC AGGACAGAAA CCTAATGCTT 7800
CCAGATGGCC CCAAACCTCA GGAGGATGGC TCTTTTAAAA GGAGGTATCC CCGTCGCAGT 7860
GCCCGTGCAC GTTCTAACAT GTTTTTTGGG CTTACCCCAC TCTATGGAGT AAGATCCTAT 7920
GGTGAAGAAG ACATTCCATT CTACAGCAGC TCAACTGGGA AGAAGCGAGG CAAGAGATCA 7980
GCTGAAGGAC AGGTGGATGG GGCCGATGAC TTAAGCACTT CAGATGAAGA CGACTTATAC 8040
TATTACAACT TCACTAGAAC AGTGATTTCT TCAGGTGGAG AGGAACGACT GGCATCCCAT 8100
AATTTATTTC GGGAGGAGGA ACAGTGTGAT CTTCCAAAAA TCTCACAGTT GGATGGTGTT 8160
GATGATGGGA CAGAGAGTGA TACTAGTGTC ACAGCCACAA CAAGGAAAAG CAGCCAGATT 8220
CCAAAAAGAA ATGGTAAAGA AAATGGAACA GAGAACTTAA AGATTGATAG ACCTGAAGAT 8280
GCTGGGGAGA AAGAACATGT CACTAAGAGT TCTGTTGGCC ACAAAAATGA GCCAAAGATG 8340
GATAACTGCC ATTCTGTAAG CAGAGTTAAA ACACAGGGAC AAGATTCCTT GGAAGCTCAG 8400
CTCAGCTCAT TGGAGTCAAG CCGCAGAGTC CACACAAGTA CCCCCTCCGA CAAAAATTTA 8460
CTGGACACCT ATAATACTGA GCTCCTGAAA TCAGATTCAG ACAATAACAA CAGTGATGAC 8520
TGTGGGAATA TCCTGCCTTC AGACATTATG GACTTTGTAC TAAAGAATAC TCCATCCATG 8580
CAGGCTTTGG GTGAGAGCCC AGAGTCATCT TCATCAGAAC TCCTGAATCT TGGTGAAGGA 8640
TTGGGTCTTG ACAGTAATCG TGAAAAAGAC ATGGGTCTTT TTGAAGTATT TTCTCAGCAG 8700
CTGCCTACAA CAGAACCTGT GGATAGTAGT GTCTCTTCCT CTATCTCAGC AGAGGAACAG 8760
TTTGAGTTGC CTCTAGAGCT ACCATCTGAT CTGTCTGTCT TGACCACCCG GAGTCCCACT 8820
GTCCCCAGCC AGAATCCCAG TAGACTAGCT GTTATCTCAG ACTCAGGGGA GAAGAGAGTA 8880
ACCATCACAG AAAAATCTGT AGCCTCCTCT GAAAGTGACC CAGCACTGCT GAGCCCAGGA 8940
GTAGATCCAA CTCCTGAAGG CCACATGACT CCTGATCATT TTATCCAAGG ACACATGGAT 9000
GCAGACCACA TCTCTAGCCC TCCTTGTGGT TCAGTAGAGC AAGGTCATGG CAACAATCAG 9060
GATTTAACTA GGAACAGTAG CACCCCTGGC CTTCAGGTAC CTGTTTCCCC AACTGTTCCC 9120
ATCCAGAACC AGAAGTATGT GCCCAATTCT ACTGATAGTC CTGGCCCGTC TCAGATTTCC 9180
AATGCAGCTG TCCAGACCAC TCCACCCCAC CTGAAGCCAG CCACTGAGAA ACTCATAGTT 9240
GTTAACCAGA ACATGCAGCC ACTTTATGTT CTCCAAACTC TTCCAAATGG AGTGACCCAA 9300
AAAATCCAAT TGACCTCTTC TGTTAGTTCT ACACCCAGTG TGATGGAGAC AAATACTTCA 9360
GTATTGGGAC CCATGGGAGG TGGTCTCACC CTTACCACAG GACTAAATCC AAGCTTGCCA 9420
ACTTCTCAAT CTTTGTTCCC TTCTGCTAGC AAAGGATTGC TACCCATGTC TCATCACCAG 9480
CACTTACATT CCTTCCCTGC AGCTACTCAA AGTAGTTTCC CACCAAACAT CAGCAATCCT 9540
CCTTCAGGCC TGCTTATTGG GGTTCAGCCT CCTCCGGATC CCCAACTTTT GGTTTCAGAA 9600
TCCAGCCAGA GGACAGACCT CAGTACCACA GTAGCCACTC CATCCTCTGG ACTCAAGAAA 9660
AGACCCATAT CTCGTCTACA GACCCGAAAG AATAAAAAAC TTGCTCCCTC TAGTACCCCT 9720
TCAAACATTG CCCCTTCTGA TGTGGTTTCT AATATGACAT TGATTAACTT CACACCCTCC 9780
CAGCTTCCTA ATCATCCAAG TCTGTTAGAT TTGGGGTCAC TTAATACTTC ATCTCACCGA 9840
ACTGTCCCCA ACATCATAAA AAGATCTAAA TCTAGCATCA TGTATTTTGA ACCGGCACCC 9900
CTGTTACCAC AGAGTGTGGG AGGAACTGCT GCCACAGCGG CAGGCACATC AACAATAAGC 9960
CAGGATACTA GCCACCTCAC ATCAGGGTCT GTGTCTGGCT TGGCATCCAG TTCCTCTGTC 10020
TTGAATGTTG TATCCATGCA AACTACCACA ACCCCTACAA GTAGTGCGTC AGTTCCAGGA 10080
CACGTCACCT TAACCAACCC AAGGTTGCTT GGTACCCCAG ATATTGGCTC AATAAGCAAT 10140
CTTTTAATCA AAGCTAGCCA GCAGAGCCTG GGGATTCAGG ACCAGCCTGT GGCTTTACCG 10200
CCAAGTTCAG GAATGTTTCC ACAACTGGGG ACATCACAGA CCCCCTCTAC TGCTGCAATA 10260
ACAGCGGCAT CTAGCATCTG TGTGCTCCCC TCCACTCAGA CTACGGGCAT AACAGCCGCT 10320
TCACCTTCTG GGGAAGCAGA CGAACACTAT CAGCTTCAGC ATGTGAACCA GCTCCTTGCC 10380
AGCAAAACTG GGATTCATTC TTCCCAGCGT GATCTTGATT CTGCTTCAGG GCCCCAGGTA 10440
TCCAACTTTA CCCAGACGGT AGACGCTCCT AATAGCATGG GACTGGAGCA GAACAAGGCT 10500
TTATCCTCAG CTGTGCAAGC CAGCCCCACC TCTCCTGGGG GTTCTCCATC CTCTCCATCT 10560
TCTGGACAGC GGTCAGCAAG CCCTTCAGTG CCGGGTCCCA CTAAACCCAA ACCAAAAACC 10620
AAACGGTTTC AGCTGCCTCT AGACAAAGGG AATGGCAAGA AGCACAAAGT TTCCCATTTG 10680
CGGACCAGTT CTTCTGAAGC ACACATTCCA GACCAAGAAA CGACATCCCT GACCTCAGGC 10740
ACAGGGACTC CAGGAGCAGA GGCTGAGCAG CAGGATACAG CTAGCGTGGA GCAGTCCTCC 10800
CAGAAGGAGT GTGGGCAACC TGCAGGGCAA GTCGCTGTTC TTCCGGAAGT TCAGGTGACC 10860
CAAAATCCAG CAAATGAACA AGAAAGTGCA GAACCTAAAA CAGTGGAAGA AGAGGAAAGT 10920
AATTTCAGCT CCCCACTGAT GCTTTGGCTT CAGCAAGAAC AAAAGCGGAA GGAAAGCATT 10980
ACTGAGAAAA AACCCAAGAA AGGACTTGTT TTTGAAATTT CCAGTGATGA TGGCTTTCAG 11040
ATCTGTGCAG AAAGTATTGA AGATGCCTGG AAGTCATTGA CAGATAAAGT CCAGGAAGCT 11100
CGATCAAATG CCCGCCTAAA GCAGCTCTCA TTTGCAGGTG TTAACGGTTT GAGGATGCTG 11160
GGGATTCTCC ATGATGCAGT TGTGTTCCTC ATTGAGCAGC TGTCTGGTGC CAAGCACTGT 11220
CGAAATTACA AATTCCGTTT CCACAAGCCA GAGGAGGCCA ATGAACCCCC CTTGAACCCT 11280
CACGGCTCAG CCAGGGCTGA AGTCCACCTC AGGAAGTCAG CATTTGACAT GTTTAACTTC 11340
CTGGCTTCTA AACATCGTCA GCCTCCTGAA TACAACCCCA ATGATGAAGA AGAGGAGGAG 11400
GTACAGCTGA AGTCAGCTCG GAGGGCAACT AGCATGGATC TGCCAATGCC CATGCGCTTC 11460
CGGCACTTAA AAAAGACTTC TAAGGAGGCA GTTGGTGTCT ACAGGTCTCC CATCCATGGC 11520
CGGGGTCTTT TCTGTAAGAG AAACATTGAT GCAGGTGAGA TGGTGATTGA GTATGCCGGC 11580
AACGTCATCC GCTCCATCCA GACTGACAAG CGGGAAAAGT ATTACGACAG CAAGGGCATT 11640
GGTTGCTATA TGTTCCGAAT TGATGACTCA GAGGTAGTGG ATGCCACCAT GCATGGAAAT 11700
GCTGCACGCT TCATCAATCA CTCGTGTGAG CCTAACTGCT ATTCTCGGGT CATCAATATT 11760
GATGGGCAGA AGCACATTGT CATCTTTGCC ATGCGTAAGA TCTACCGAGG AGAGGAACTC 11820
ACTTACGACT ATAAGTTCCC CATTGAGGAT GCCAGCAACA AGCTGCCCTG CAACTGTGGC 11880
GCCAAGAAAT GCCGGAAGTT CCTAAACTAA AGCTGCTCTT CTCCCCCAGT GTTGGAGTGC 11940
AAGGAGGCGG GGCCATCCAA AGCAACGCTG AAGGCCTTTT CCAGCAGCTG GGAGCTCCCG 12000
GATTGCGTGG CACAGCTGAG GGGCCTCTGT GATGGCTGAG CTCTCTTATG TCCTATACTC 12060
ACATCAGACA TGTGATCATA GTCCCAGAGA CAGAGTTGAG GTCTCGAAGA AAAGATCCAT 12120
GATCGGCTTT CTCCTGGGGC CCCTCCAATT GTTTACTGTT AGAAAGTGGG AATGGGGTCC 12180
CTAGCAGACT TGCCTGGAAG GAGCCTATTA TAGAGGGTTG GTTATGTTGG GAGATTGGGC 12240
CTGAATTTCT CCACAGAAAT AAGTTGCCAT CCTCAGGTTG GCCCTTTCCC AAGCACTGTA 12300
AGTGAGTGGG TCAGGCAAAG CCCCAAATGG AGGGTTGGTT AGATTCCTGA CAGTTTGCCA 12360
GCCAGGCCCC ACCTACAGCG TCTGTCGAAC AAACAGAGGT CTGGTGGTTT TCCCTACTAT 12420
CCTCCCACTC GAGAGTTCAC TTCTGGTTGG GAGACAGGAT TCCTAGCACC TCCGGTGTCA 12480
AAAGGCTGTC ATGGGGTTGT GCCAATTAAT TACCAAACAT TGAGCCTGCA GGCTTTGAGT 12540
GGGAGTGTTG CCCCCAGGAG CCTTATCTCA GCCAATTACC TTTCTTGACA GTAGGAGCGG 12600
CTTCCCTCTC CCATTCCCTC TTCACTCCCT TTTCTTCCTT TCCCCTGTCT TCATGCCACT 12660
GCTTTCCCAT GCTTCTTTCG GGTTGTAGGG GAGACTGACT GCCTGCTCAA GGACACTCCC 12720
TGCTGGGCAT AGGATGTGCC TGCAAAAAGT TCCCTGAGCC TGTAAGCACT CCAGGTGGGG 12780
AAGTGGACAG GAGCCATTGG TCATAACCAG ACAGAATTTG GAAACATTTT CATAAAGCTC 12840
CATGGAGAGT TTTAAAGAAA CATATGTAGC ATGATTTTGT AGGAGAGGAA AAAGATTATT 12900
TAAATAGGAT TTAAATCATG CAACAACGAG AGTATCACAG CCAGGATGAC CCTTGGGTCC 12960
CATTCCTAAG ACATGGTTAC TTTATTTTCC CCTTGTTAAG ACATAGGAAG ACTTAATTTT 13020
TAAACGGTCA GTGTCCAGTT GAAGGCAGAA CACTAATCAG ATTTCAAGGC CCACAACTTG 13080
GGGACTAGAC CACCTTATGT TGAGGGAACT CTGCCACCTG CGTGCAACCC ACAGCTAAAG 13140
TAAATTCAAT GACACTACTG CCCTGATTAC TCCTTAGGAT GTGGTCAAAA CAGCATCAAA 13200
TGTTTCTTCT CTTCCTTTCC CCAAGACAGA GTCCTGAACC TGTTAAATTA AGTCATTGGA 13260
TTTTACTCTG TTCTGTTTAC AGTTTACTAT TTAAGGTTTT ATAAATGTAA ATATATTTTG 13320
TATATTTTTC TATGAGAAGC ACTTCATAGG GAGAAGCACT TATGACAAGG CTATTTTTTA 13380
AACCGCGGTA TTATCCTAAT TTAAAAGAAG ATCGGTTTTT AATAATTTTT TATTTTCATA 13440
GGATGAAGTT AGAGAAAATA TTCAGCTGTA CACACAAAGT CTGGTTTTTC CTGCCCAACT 13500
TCCCCCTGGA AGGTGTACTT TTTGTTGTTT AATGTGTAGC TTGTTTGTGC CCTGTTGACA 13560
TAAATGTTTC CTGGGTTTGC TCTTTGACAA TAAATGGAGA AGGAAGGTCA CCCAACTCCA 13620
TTGGGCCACT CCCCTCCTTC CCCTATTGAA GCTCC 13656
Sequence Source Ensembl
Keyword

KW-0002--3D-structure
KW-0007--Acetylation
KW-0025--Alternative splicing
KW-0053--Apoptosis
KW-0090--Biological rhythms
KW-0103--Bromodomain
KW-0156--Chromatin regulator
KW-0160--Chromosomal rearrangement
KW-0181--Complete proteome
KW-0903--Direct protein sequencing
KW-0238--DNA-binding
KW-1017--Isopeptide bond
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0621--Polymorphism
KW-0656--Proto-oncogene
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0832--Ubl conjugation
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR001487--Bromodomain
IPR003889--FYrich_C
IPR003888--FYrich_N
IPR016569--MeTrfase_trithorax
IPR003616--Post-SET_dom
IPR001214--SET_dom
IPR002857--Znf_CXXC
IPR011011--Znf_FYVE_PHD
IPR001965--Znf_PHD
IPR019787--Znf_PHD-finger
IPR013083--Znf_RING/FYVE/PHD

PROSITE

PS50014--BROMODOMAIN_2
PS51543--FYRC
PS51542--FYRN
PS50868--POST_SET
PS50280--SET
PS51058--ZF_CXXC
PS01359--ZF_PHD_1
PS50016--ZF_PHD_2

Pfam

PF05965--FYRC
PF05964--FYRN
PF00628--PHD
PF00856--SET
PF02008--zf-CXXC

Gene Ontology

GO:0005737--C:cytoplasm
GO:0035097--C:histone methyltransferase complex
GO:0071339--C:MLL1 complex
GO:0005654--C:nucleoplasm
GO:0005634--C:nucleus
GO:0003680--F:AT DNA binding
GO:0003682--F:chromatin binding
GO:0001046--F:core promoter sequence-specific DNA binding
GO:0042800--F:histone methyltransferase activity (H3-K4 specific)
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0042802--F:identical protein binding
GO:0070577--F:lysine-acetylated histone binding
GO:0042803--F:protein homodimerization activity
GO:0003700--F:transcription factor activity, sequence-specific DNA binding
GO:0044212--F:transcription regulatory region DNA binding
GO:0045322--F:unmethylated CpG binding
GO:0008270--F:zinc ion binding
GO:0009952--P:anterior/posterior pattern specification
GO:0006915--P:apoptotic process
GO:0032922--P:circadian regulation of gene expression
GO:0060216--P:definitive hemopoiesis
GO:0006306--P:DNA methylation
GO:0035162--P:embryonic hemopoiesis
GO:0035640--P:exploration behavior
GO:0044648--P:histone H3-K4 dimethylation
GO:0051568--P:histone H3-K4 methylation
GO:0080182--P:histone H3-K4 trimethylation
GO:0043984--P:histone H4-K16 acetylation
GO:0048873--P:homeostasis of number of cells within a tissue
GO:0051899--P:membrane depolarization
GO:0008285--P:negative regulation of cell proliferation
GO:0018026--P:peptidyl-lysine monomethylation
GO:2001040--P:positive regulation of cellular response to drug
GO:0051571--P:positive regulation of histone H3-K4 methylation
GO:0045944--P:positive regulation of transcription from RNA polymerase II promoter
GO:0045893--P:positive regulation of transcription, DNA-templated
GO:0032411--P:positive regulation of transporter activity
GO:0009791--P:post-embryonic development
GO:0006461--P:protein complex assembly
GO:0071440--P:regulation of histone H3-K14 acetylation
GO:1901674--P:regulation of histone H3-K27 acetylation
GO:2000615--P:regulation of histone H3-K9 acetylation
GO:0048172--P:regulation of short-term neuronal synaptic plasticity
GO:0035864--P:response to potassium ion
GO:0048536--P:spleen development
GO:0006366--P:transcription from RNA polymerase II promoter
GO:0008542--P:visual learning

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0034 ENSPTRP00000040970.5 Pan troglodytes 100 0.0 6299
WERAM-Poa-0036 ENSPPYP00000004505.2 Pongo abelii 99 0.0 6254
WERAM-Nol-0074 ENSNLEP00000008984.1 Nomascus leucogenys 99 0.0 6215
WERAM-Mam-0038 ENSMMUP00000007251.2 Macaca mulatta 99 0.0 6208
WERAM-Eqc-0038 ENSECAP00000006426.1 Equus caballus 96 0.0 6073
WERAM-Otg-0095 ENSOGAP00000008374.2 Otolemur garnettii 96 0.0 6056
WERAM-Aim-0147 ENSAMEP00000013764.1 Ailuropoda melanoleuca 95 0.0 6049
WERAM-Loa-0136 ENSLAFP00000012406.4 Loxodonta africana 94 0.0 5975
WERAM-Bot-0163 ENSBTAP00000024084.5 Bos taurus 94 0.0 5940
WERAM-Ova-0104 ENSOARP00000010552.1 Ovis aries 94 0.0 5940
WERAM-Caf-0134 ENSCAFP00000018720.5 Canis familiaris 95 0.0 5919
WERAM-Tut-0049 ENSTTRP00000004041.1 Tursiops truncatus 93 0.0 5855
WERAM-Orc-0100 ENSOCUP00000008738.2 Oryctolagus cuniculus 93 0.0 5855
WERAM-Chs-0032 ENSCSAP00000015460.1 Chlorocebus sabaeus 99 0.0 5762
WERAM-Prc-0053 ENSPCAP00000005189.1 Procavia capensis 90 0.0 5581
WERAM-Myl-0155 ENSMLUP00000012825.2 Myotis lucifugus 86 0.0 5442
WERAM-Sah-0138 ENSSHAP00000014807.1 Sarcophilus harrisii 86 0.0 5410
WERAM-Dio-0068 ENSDORP00000006787.1 Dipodomys ordii 87 0.0 5050
WERAM-Paa-0065 ENSPANP00000009106.1 Papio anubis 99 0.0 5010
WERAM-Pes-0058 ENSPSIP00000007945.1 Pelodiscus sinensis 80 0.0 4882
WERAM-Ptv-0056 ENSPVAP00000005910.1 Pteropus vampyrus 89 0.0 4778
WERAM-Meg-0028 ENSMGAP00000002448.2 Meleagris gallopavo 77 0.0 4424
WERAM-Cap-0070 ENSCPOP00000005242.2 Cavia porcellus 93 0.0 4255
WERAM-Ocp-0107 ENSOPRP00000010334.2 Ochotona princeps 88 0.0 4249
WERAM-Mum-0008 ENSMUSP00000110337.1 Mus musculus 90 0.0 4136
WERAM-Gaga-0075 ENSGALP00000011008.4 Gallus gallus 79 0.0 4113
WERAM-Ran-0100 ENSRNOP00000020573.6 Rattus norvegicus 89 0.0 4106
WERAM-Ict-0127 ENSSTOP00000012923.2 Ictidomys tridecemlineatus 92 0.0 4063
WERAM-Mup-0053 ENSMPUP00000004604.1 Mustela putorius furo 96 0.0 4048
WERAM-Gog-0013 ENSGGOP00000000936.2 Gorilla gorilla 100 0.0 3994
WERAM-Mod-0116 ENSMODP00000016853.4 Monodelphis domestica 86 0.0 3991
WERAM-Anp-0040 ENSAPLP00000004456.1 Anas platyrhynchos 80 0.0 3903
WERAM-Chh-0055 ENSCHOP00000006343.1 Choloepus hoffmanni 91 0.0 3902
WERAM-Tag-0002 ENSTGUP00000000072.1 Taeniopygia guttata 79 0.0 3611
WERAM-Fia-0123 ENSFALP00000010386.1 Ficedula albicollis 78 0.0 3586
WERAM-Dan-0088 ENSDNOP00000008968.3 Dasypus novemcinctus 90 0.0 3266
WERAM-Soa-0133 ENSSARP00000013142.1 Sorex araneus 83 0.0 3197
WERAM-Tas-0109 ENSTSYP00000011196.1 Tarsius syrichta 90 0.0 3156
WERAM-Mae-0003 ENSMEUP00000000157.1 Macropus eugenii 82 0.0 2885
WERAM-Lac-0176 ENSLACP00000020625.1 Latimeria chalumnae 66 0.0 2871
WERAM-Anc-0073 ENSACAP00000006937.3 Anolis carolinensis 65 0.0 2857
WERAM-Tub-0079 ENSTBEP00000009516.1 Tupaia belangeri 88 0.0 2857
WERAM-Ect-0072 ENSETEP00000007880.1 Echinops telfairi 85 0.0 2803
WERAM-Vip-0054 ENSVPAP00000005197.1 Vicugna pacos 91 0.0 1984
WERAM-Ora-0035 ENSOANP00000005967.3 Ornithorhynchus anatinus 85 0.0 1384
WERAM-Leo-0025 ENSLOCP00000004893.1 Lepisosteus oculatus 77 0.0 1275
WERAM-Ere-0135 ENSEEUP00000014133.1 Erinaceus europaeus 83 0.0 1223
WERAM-Mim-0138 ENSMICP00000013960.1 Microcebus murinus 94 0.0 1211
WERAM-Xet-0067 ENSXETP00000022279.3 Xenopus tropicalis 75 0.0 1207
WERAM-Dar-0010 ENSDARP00000095298.3 Danio rerio 68 0.0 1149
WERAM-Orn-0070 ENSONIP00000007849.1 Oreochromis niloticus 70 0.0 1145
WERAM-Gaa-0091 ENSGACP00000011913.1 Gasterosteus aculeatus 72 0.0 1140
WERAM-Pof-0188 ENSPFOP00000015902.2 Poecilia formosa 68 0.0 1107
WERAM-Xim-0097 ENSXMAP00000008788.1 Xiphophorus maculatus 65 0.0 1080
WERAM-Ten-0223 ENSTNIP00000002397.1 Tetraodon nigroviridis 66 0.0 1080
WERAM-Gam-0119 ENSGMOP00000012588.1 Gadus morhua 67 0.0 1062
WERAM-Fec-0070 ENSFCAP00000005933.3 Felis catus 48 0.0 785
WERAM-Sus-0021 ENSSSCP00000003118.2 Sus scrofa 48 0.0 785
WERAM-Caj-0107 ENSCJAP00000018740.2 Callithrix jacchus 49 0.0 777
WERAM-Orla-0062 ENSORLP00000008124.1 Oryzias latipes 47 0.0 750
WERAM-Tar-0189 ENSTRUP00000038926.1 Takifugu rubripes 50 0.0 732
WERAM-Asm-0037 ENSAMXP00000004791.1 Astyanax mexicanus 46 0.0 728
WERAM-Pem-0014 ENSPMAP00000002218.1 Petromyzon marinus 59 0.0 689
WERAM-Cis-0045 ENSCSAVP00000009955.1 Ciona savignyi 37 5e-135 481
WERAM-Cii-0034 ENSCINP00000025384.2 Ciona intestinalis 54 1e-100 367
WERAM-Drm-0010 FBpp0082406 Drosophila melanogaster 47 3e-89 329
WERAM-Cae-0021 C26E6.9a Caenorhabditis elegans 54 3e-42 174
WERAM-Tum-0027 CAZ85029 Tuber melanosporum 53 3e-42 173
WERAM-Php-0006 PP1S101_4V6.1 Physcomitrella patens 54 4e-42 173
WERAM-Org-0116 ORGLA12G0159200.1 Oryza glaberrima 54 5e-42 172
WERAM-Ors-0112 OS12T0613200-02 Oryza sativa 54 5e-42 172
WERAM-Sei-0078 Si021071m Setaria italica 54 1e-41 172
WERAM-Orbr-0127 OB12G25360.1 Oryza brachyantha 53 1e-41 171
WERAM-Brd-0092 BRADI4G01790.1 Brachypodium distachyon 54 2e-41 171
WERAM-Asn-0015 CADANIAP00003254 Aspergillus nidulans 53 2e-41 170
WERAM-Ast-0003 CADATEAP00001100 Aspergillus terreus 52 3e-41 170
WERAM-Thc-0094 EOY15831 Theobroma cacao 55 3e-41 170
WERAM-Asni-0037 CADANGAP00014055 Aspergillus niger 52 3e-41 170
WERAM-Asc-0034 CADACLAP00008186 Aspergillus clavatus 52 3e-41 170
WERAM-Coi-0035 EAS31778 Coccidioides immitis 52 3e-41 170
WERAM-Aso-0006 CADAORAP00000676 Aspergillus oryzae 52 4e-41 169
WERAM-Prp-0018 EMJ21490 Prunus persica 55 4e-41 169
WERAM-Sol-0003 Solyc01g006880.2.1 Solanum lycopersicum 53 4e-41 169
WERAM-Asfu-0007 CADAFUAP00002016 Aspergillus fumigatus 52 8e-41 169
WERAM-Pug-0037 EHS63201 Puccinia graminis 52 9e-41 168
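
The rows of the Orthology table are whitespace-separated, but the species name spans two or three words (for example "Mustela putorius furo"), so a plain split does not yield fixed columns. A minimal Python sketch of one way to read a row is to take the three numeric columns from the right and treat what remains in the middle as the species name. The `Ortholog` type and `parse_ortholog_row` function are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    e_value: float
    score: int


def parse_ortholog_row(row: str) -> Ortholog:
    """Parse one Orthology row, taking the numeric columns from the right."""
    fields = row.split()
    if len(fields) < 6:
        raise ValueError(f"expected at least 6 fields, got {len(fields)}")
    identity, e_value, score = fields[-3:]
    species = " ".join(fields[2:-3])  # everything between the IDs and the numbers
    return Ortholog(fields[0], fields[1], species,
                    int(identity), float(e_value), int(score))


# One row with a three-word species name, copied from this record.
hit = parse_ortholog_row(
    "WERAM-Mup-0053 ENSMPUP00000004604.1 Mustela putorius furo 96 0.0 4048"
)
print(hit.species, hit.identity, hit.score)
```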
Created Date 25-Jun-2016