Tag |
Content |
WERAM ID |
WERAM-Hos-0096 |
Ensembl Protein ID |
ENSP00000374157.5 |
Uniprot Accession |
Q03164; KMT2A_HUMAN; E9PQG7; Q13743; Q13744; Q14845; Q16364; Q59FF2; Q6UBD1; Q9HBJ3; Q9UD94; Q9UMA3 |
Genbank Protein ID |
NP_001184033.1; NP_005924.2 |
Protein Name |
Histone-lysine N-methyltransferase 2A |
Genbank Nucleotide ID |
NM_001197104.1; NM_005933.3 |
Gene Name |
KMT2A;HRX;TRX1;CXXC7;ALL-1;MLL;MLL1;HTRX1;MLL1A;WDSTS;MLL/GAS7;TET1-MLL |
Ensembl Information |
|
Details |
|
Status |
Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | SET1 | 2.90e-49 | 168.4 | 3830 | 3945 |
Me_Reader | PHD | 2.50e-12 | 49.1 | 1433 | 1978 |
Ac_Reader | Bromodomain | 9.40e-05 | 24.7 | 1703 | 1738 |
|
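The classification rows can be checked with a few lines of code. The sketch below is illustrative only. It assumes that Start and End are 1-based, inclusive residue positions (which is consistent with the alignment blocks in the Domain Profile). The names `DomainHit`, `HITS`, and `passing`, and the 1e-5 cutoff used in the example, are hypothetical choices and not part of WERAM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DomainHit:
    category: str   # "Type" column
    family: str     # "Family" column
    e_value: float
    score: float
    start: int      # assumed 1-based, inclusive
    end: int        # assumed 1-based, inclusive

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count as residues.
        return self.end - self.start + 1


# Values copied from the Classification table of this entry.
HITS = [
    DomainHit("HMT", "SET1", 2.90e-49, 168.4, 3830, 3945),
    DomainHit("Me_Reader", "PHD", 2.50e-12, 49.1, 1433, 1978),
    DomainHit("Ac_Reader", "Bromodomain", 9.40e-05, 24.7, 1703, 1738),
]


def passing(hits, max_e_value):
    """Return the hits whose E-value is at or below the given cutoff."""
    return [h for h in hits if h.e_value <= max_e_value]


if __name__ == "__main__":
    for h in HITS:
        print(f"{h.family}: {h.length} residues, E-value {h.e_value:.2e}")
    # Hypothetical stricter cutoff; the bromodomain hit does not pass it.
    print([h.family for h in passing(HITS, 1e-5)])
```

With these numbers the bromodomain hit is by far the weakest of the three, so any downstream use that applies a stricter E-value threshold would keep only the SET1 and PHD hits.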
Organism |
Homo sapiens |
NCBI Taxa ID |
9606 |
Functional Description |
Histone methyltransferase that plays an essential role in early development and hematopoiesis. Catalytic subunit of the MLL1/MLL complex, a multiprotein complex that mediates both methylation of 'Lys-4' of histone H3 (H3K4me) and acetylation of 'Lys-16' of histone H4 (H4K16ac). In the MLL1/MLL complex, it specifically mediates H3K4me, a specific tag for epigenetic transcriptional activation. Has weak methyltransferase activity by itself and requires the other components of the MLL1/MLL complex to reach full methyltransferase activity. Has no activity toward histone H3 phosphorylated on 'Thr-3' and reduced activity toward H3 dimethylated on 'Arg-8' or 'Lys-9', but higher activity toward H3 acetylated on 'Lys-9'. Required for transcriptional activation of HOXA9. Promotes PPP1R15A-induced apoptosis. Plays a critical role in the control of circadian gene expression and is essential for the transcriptional activation mediated by the CLOCK-ARNTL/BMAL1 heterodimer. Establishes a permissive chromatin state for circadian transcription by mediating rhythmic methylation of 'Lys-4' of histone H3 (H3K4me); this histone modification directs the circadian acetylation at H3K9 and H3K14, allowing the recruitment of CLOCK-ARNTL/BMAL1 to chromatin (By similarity). |
|
Domain Profile |
HMT SET1
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakv 91
++ v++s+i+g+gl++k++i+++e+viEY+G+virs ++dkrek+y++k+ig+y+fr+d++ vvdat++gn+arfinhscepNc+++v
ENSP00000374157.5 3830 AVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYMFRIDDS--EVVDATMHGNAARFINHSCEPNCYSRV 3917
58899********************************************************..*************************** PP
SET1.txt 92 vavdgekkiviyakraIekgeeltydYk 119
+++dg+k+ivi+a+r+I +geeltydYk
ENSP00000374157.5 3918 INIDGQKHIVIFAMRKIYRGEELTYDYK 3945
***************************7 PP
Me_Reader PHD
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.kswyCpsCk 51
+C++C +++ e+v+C+ C + fH C++ ++++l+++ ++w+C++Ck
ENSP00000374157.5 1433 VCFLCASSGHV--EFVYCQVCCEPFHKFCLEENERPLEDQlENWCCRRCK 1480
8****554444..59******************6666655778******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslp.egks.wyCpsCke 52
++C vCg++++ +k++++C++C++ +H +C++++ + p ++k+ w+C +C++
ENSP00000374157.5 1480 KFCHVCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPtKKKKvWICTKCVR 1532
59****988888888*******************88888444336******86 PP
PHD.txt 2 tiClvCgkddegeke...mvqCdeCddwfHlkCvklp......lsslpegkswyCpsCke 52
++C++C+k+++++++ m+qC +Cd+w+H kC +l+ ls+lpe+ +++C +C+e
ENSP00000374157.5 1567 NFCPLCDKCYDDDDYeskMMQCGKCDRWVHSKCENLSdemyeiLSNLPESVAYTCVNCTE 1626
789999877666555566*******************9**99999***9999******97 PP
PHD.txt 3 iClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+k+++ + +C + C +H C + ++k+ yC++++
ENSP00000374157.5 1932 RCEFCQKPGAT---VGCCLTsCTSNYHFMCSRAKNCVFLDDKKVYCQRHR 1978
599**766665...4566558*************5555577899**9997 PP
Ac_Reader Bromodomain
BROMO.txt 27 kePmdLstikerleegnYsspeefvkDvrlifnNak 62
++P+dL+ +k+++++gnY+s++ef +D+ i++ a
ENSP00000374157.5 1703 QQPLDLEGVKRKMDQGNYTSVLEFSDDIVKIIQAAI 1738
69***************************9998775 PP
|
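The PHD profile reports four separate alignment segments, while the classification table lists a single PHD span of 1433 to 1978. The following sketch, offered as one possible way to relate the two, takes the segment coordinates from the PHD alignment blocks, computes the unaligned gaps between them, and checks where the bromodomain hit (1703 to 1738) falls. The function names `gaps` and `within` are hypothetical, and coordinates are assumed to be 1-based and inclusive.

```python
# Target coordinates of the four PHD alignment segments in the Domain Profile.
PHD_SEGMENTS = [(1433, 1480), (1480, 1532), (1567, 1626), (1932, 1978)]
# Target coordinates of the bromodomain alignment.
BROMODOMAIN = (1703, 1738)


def gaps(segments):
    """Return the uncovered (start, end) ranges between sorted segments."""
    ordered = sorted(segments)
    result = []
    covered_end = ordered[0][1]
    for start, end in ordered[1:]:
        # A gap exists only if at least one residue is left uncovered.
        if start > covered_end + 1:
            result.append((covered_end + 1, start - 1))
        covered_end = max(covered_end, end)
    return result


def within(inner, outer):
    """True if the inner range lies entirely inside the outer range."""
    return outer[0] <= inner[0] and inner[1] <= outer[1]


if __name__ == "__main__":
    envelope = (min(s for s, _ in PHD_SEGMENTS), max(e for _, e in PHD_SEGMENTS))
    print("PHD envelope:", envelope)
    for gap in gaps(PHD_SEGMENTS):
        print("gap:", gap, "contains bromodomain hit:", within(BROMODOMAIN, gap))
```

Under these assumptions the envelope of the four segments reproduces the Start and End values in the table, and the bromodomain hit sits in the stretch between the third and fourth PHD segments rather than overlapping any of them.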
Protein Sequence (Fasta) |
>ENSP00000374157.5|KMT2A;HRX;TRX1;CXXC7;ALL-1;MLL;MLL1;CXXC7;HTRX1;MLL1A;WDSTS;MLL/GAS7;TET1-MLL;ALL-1|Homo sapiens MAHSCRWRFPARPGTTGGGGGGGRRGLGGAPRQRVPALLLPPGPPVGGGGPGAPPSPPAVAAAAAAAGSSGAGVPGGAAAASAASSSSASSSSSSSSSASSGPALLRVGPGFDAALQVSAAIGTNLRRFRAVFGESGGGGGSGEDEQFLGFGSDEEVRVRSPTRSPSVKTSPRKPRGRPRSGSDRNSAILSDPSVFSPLNKSETKSGDKIKKKDSKSIEKKRGRPPTFPGVKIKITHGKDISELPKGNKEDSLKKIKRTPSATFQQATKIKKLRAGKLSPLKSKFKTGKLQIGRKGVQIVRRRGRPPSTERIKTPSGLLINSELEKPQKVRKDKEGTPPLTKEDKTVVRQSPRRIKPVRIIPSSKRTDATIAKQLLQRAKKGAQKKIEKEAAQLQGRKVKTQVKNIRQFIMPVVSAISSRIIKTPRRFIEDEDYDPPIKIARLESTPNSRFSAPSCGSSEKSSAASQHSSQMSSDSSRSSSPSVDTSTDSQASEEIQVLPEERSDTPEVHPPLPISQSPENESNDRRSRRYSVSERSFGSRTTKKLSTLQSAPQQQTSSSPPPPLLTPPPPLQPASSISDHTPWLMPPTIPLASPFLPASTAPMQGKRKSILREPTFRWTSLKHSRSEPQYFSSAKYAKEGLIRKPIFDNFRPPPLTPEDVGFASGFSASGTAASARLFSPLHSGTRFDMHKRSPLLRAPRFTPSEAHSRIFESVTLPSNRTSAGTSSSGVSNRKRKRKVFSPIRSEPRSPSHSMRTRSGRLSSSELSPLTPPSSVSSSLSISVSPLATSALNPTFTFPSHSLTQSGESAEKNQRPRKQTSAPAEPFSSSSPTPLFPWFTPGSQTERGRNKDKAPEELSKDRDADKSVEKDKSRERDREREKENKRESRKEKRKKGSEIQSSSALYPVGRVSKEKVVGEDVATSSSAKKATGRKKSSSHDSGTDITSVTLGDTTAVKTKILIKKGRGNLEKTNLDLGPTAPSLEKEKTLCLSTPSSSTVKHSTSSIGSMLAQADKLPMTDKRVASLLKKAKAQLCKIEKSKSLKQTDQPKAQGQESDSSETSVRGPRIKHVCRRAAVALGRKRAVFPDDMPTLSALPWEEREKILSSMGNDDKSSIAGSEDAEPLAPPIKPIKPVTRNKAPQEPPVKKGRRSRRCGQCPGCQVPEDCGVCTNCLDKPKFGGRNIKKQCCKMRKCQNLQWMPSKAYLQKQAKAVKKKEKKSKTSEKKDSKESSVVKNVVDSSQKPTPSAREDPAPKKSSSEPPPRKPVEEKSEEGNVSAPGPESKQATTPASRKSSKQVSQPALVIPPQPPTTGPPRKEVPKTTPSEPKKKQPPPPESGPEQSKQKKVAPRPSIPVKQKPKEKEKPPPVNKQENAGTLNILSTLSNGNSSKQKIPADGVHRIRVDFKEDCEAENVWEMGGLGILTSVPITPRVVCFLCASSGHVEFVYCQVCCEPFHKFCLEENERPLEDQLENWCCRRCKFCHVCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPTKKKKVWICTKCVRCKSCGSTTPGKGWDAQWSHDFSLCHDCAKLFAKGNFCPLCDKCYDDDDYESKMMQCGKCDRWVHSKCENLSDEMYEILSNLPESVAYTCVNCTERHPAEWRLALEKELQISLKQVLTALLNSRTTSHLLRYRQAAKPPDLNPETEESIPSRSSPEGPDPPVLTEVSKQDDQQPLDLEGVKRKMDQGNYTSVLEFSDDIVKIIQAAINSDGGQPEIKKANSMVKSFFIRQMERVFPWFSVKKSRFWEPNKVSSNSGMLPNAVLPPSLDHNYAQWQEREENSHTEQPPLMKKIIPAPKPKGPGEPDSPTPLHPPTPPILSTDRSREDSPELNPPPGIEDNRQCALCLTYGDDSA
NDAGRLLYIGQNEWTHVNCALWSAEVFEDDDGSLKNVHMAVIRGKQLRCEFCQKPGATVGCCLTSCTSNYHFMCSRAKNCVFLDDKKVYCQRHRDLIKGEVVPENGFEVFRRVFVDFEGISLRRKFLNGLEPENIHMMIGSMTIDCLGILNDLSDCEDKLFPIGYQCSRVYWSTTDARKRCVYTCKIVECRPPVVEPDINSTVEHDENRTIAHSPTSFTESSSKESQNTAEIISPPSPDRPPHSQTSGSCYYHVISKVPRIRTPSYSPTQRSPGCRPLPSAGSPTPTTHEIVTVGDPLLSSGLRSIGSRRHSTSSLSPQRSKLRIMSPMRTGNTYSRNNVSSVSTTGTATDLESSAKVVDHVLGPLNSSTSLGQNTSTSSNLQRTVVTVGNKNSHLDGSSSSEMKQSSASDLVSKSSSLKGEKTKVLSSKSSEGSAHNVAYPGIPKLAPQVHNTTSRELNVSKIGSFAEPSSVSFSSKEALSFPHLHLRGQRNDRDQHTDSTQSANSSPDEDTEVKTLKLSGMSNRSSIINEHMGSSSRDRRQKGKKSCKETFKEKHSSKSFLEPGQVTTGEEGNLKPEFMDEVLTPEYMGQRPCNNVSSDKIGDKGLSMPGVPKAPPMQVEGSAKELQAPRKRTVKVTLTPLKMENESQSKNALKESSPASPLQIESTSPTEPISASENPGDGPVAQPSPNNTSCQDSQSNNYQNLPVQDRNLMLPDGPKPQEDGSFKRRYPRRSARARSNMFFGLTPLYGVRSYGEEDIPFYSSSTGKKRGKRSAEGQVDGADDLSTSDEDDLYYYNFTRTVISSGGEERLASHNLFREEEQCDLPKISQLDGVDDGTESDTSVTATTRKSSQIPKRNGKENGTENLKIDRPEDAGEKEHVTKSSVGHKNEPKMDNCHSVSRVKTQGQDSLEAQLSSLESSRRVHTSTPSDKNLLDTYNTELLKSDSDNNNSDDCGNILPSDIMDFVLKNTPSMQALGESPESSSSELLNLGEGLGLDSNREKDMGLFEVFSQQLPTTEPVDSSVSSSISAEEQFELPLELPSDLSVLTTRSPTVPSQNPSRLAVISDSGEKRVTITEKSVASSESDPALLSPGVDPTPEGHMTPDHFIQGHMDADHISSPPCGSVEQGHGNNQDLTRNSSTPGLQVPVSPTVPIQNQKYVPNSTDSPGPSQISNAAVQTTPPHLKPATEKLIVVNQNMQPLYVLQTLPNGVTQKIQLTSSVSSTPSVMETNTSVLGPMGGGLTLTTGLNPSLPTSQSLFPSASKGLLPMSHHQHLHSFPAATQSSFPPNISNPPSGLLIGVQPPPDPQLLVSESSQRTDLSTTVATPSSGLKKRPISRLQTRKNKKLAPSSTPSNIAPSDVVSNMTLINFTPSQLPNHPSLLDLGSLNTSSHRTVPNIIKRSKSSIMYFEPAPLLPQSVGGTAATAAGTSTISQDTSHLTSGSVSGLASSSSVLNVVSMQTTTTPTSSASVPGHVTLTNPRLLGTPDIGSISNLLIKASQQSLGIQDQPVALPPSSGMFPQLGTSQTPSTAAITAASSICVLPSTQTTGITAASPSGEADEHYQLQHVNQLLASKTGIHSSQRDLDSASGPQVSNFTQTVDAPNSMGLEQNKALSSAVQASPTSPGGSPSSPSSGQRSASPSVPGPTKPKPKTKRFQLPLDKGNGKKHKVSHLRTSSSEAHIPDQETTSLTSGTGTPGAEAEQQDTASVEQSSQKECGQPAGQVAVLPEVQVTQNPANEQESAEPKTVEEEESNFSSPLMLWLQQEQKRKESITEKKPKKGLVFEISSDDGFQICAESIEDAWKSLTDKVQEARSNARLKQLSFAGVNGLRMLGILHDAVVFLIEQLSGAKHCRNYKFRFHKPEEANEPPLNPHGSARAEVHLRKSAFDMFNFLASKHRQPPEYNPNDEEEEEVQLKSARRATSMDLPMPMRFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYM
FRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGEELTYDYKFPIEDASNKLPCNCGAKKCRKFLN
|
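Domain coordinates such as 3830 to 3945 are only useful if they are applied to the sequence with the right indexing convention. The sketch below shows one way to slice with 1-based, inclusive positions. To stay short it works on a 30-residue excerpt of the protein sequence, which is taken to begin at residue 3821 because the SET1 alignment places the A of AVGVYRSPIHG at position 3830. The function name `subsequence` and its `first_position` parameter are hypothetical.

```python
# 30-residue excerpt of the protein sequence; assumed to cover 3821-3850.
EXCERPT_START = 3821
EXCERPT = "RHLKKTSKEAVGVYRSPIHGRGLFCKRNID"


def subsequence(seq, start, end, first_position=1):
    """Slice seq using 1-based inclusive coordinates.

    first_position is the residue number of seq[0]; it is 1 for a full-length
    sequence and larger when seq is only an excerpt.
    """
    last_position = first_position + len(seq) - 1
    if not (first_position <= start <= end <= last_position):
        raise ValueError(
            f"range {start}-{end} is outside {first_position}-{last_position}"
        )
    # Convert to Python's 0-based, end-exclusive slice.
    return seq[start - first_position : end - first_position + 1]


if __name__ == "__main__":
    # First eleven residues of the SET1 hit, which starts at position 3830.
    print(subsequence(EXCERPT, 3830, 3840, first_position=EXCERPT_START))
```

For the full-length sequence the same call would be `subsequence(full_sequence, 3830, 3945)`, where `full_sequence` stands for the FASTA record with its header line and line breaks removed.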
Nucleotide Sequence (Fasta) | ATGGCGCACA GCTGTCGGTG GCGCTTCCCC GCCCGACCCG GGACCACCGG GGGCGGCGGC 60 GGCGGGGGGC GCCGGGGCCT AGGGGGCGCC CCGCGGCAAC GCGTCCCGGC CCTGCTGCTT 120 CCCCCCGGGC CCCCGGTCGG CGGTGGCGGC CCCGGGGCGC CCCCCTCCCC CCCGGCTGTG 180 GCGGCCGCGG CGGCGGCGGC GGGAAGCAGC GGGGCTGGGG TTCCAGGGGG AGCGGCCGCC 240 GCCTCAGCAG CCTCCTCGTC GTCCGCCTCG TCTTCGTCTT CGTCATCGTC CTCAGCCTCT 300 TCAGGGCCGG CCCTGCTCCG GGTGGGCCCG GGCTTCGACG CGGCGCTGCA GGTCTCGGCC 360 GCCATCGGCA CCAACCTGCG CCGGTTCCGG GCCGTGTTTG GGGAGAGCGG CGGGGGAGGC 420 GGCAGCGGAG AGGATGAGCA ATTCTTAGGT TTTGGCTCAG ATGAAGAAGT CAGAGTGCGA 480 AGTCCCACAA GGTCTCCTTC AGTTAAAACT AGTCCTCGAA AACCTCGTGG GAGACCTAGA 540 AGTGGCTCTG ACCGAAATTC AGCTATCCTC TCAGATCCAT CTGTGTTTTC CCCTCTAAAT 600 AAATCAGAGA CCAAATCTGG AGATAAGATC AAGAAGAAAG ATTCTAAAAG TATAGAAAAG 660 AAGAGAGGAA GACCTCCCAC CTTCCCTGGA GTAAAAATCA AAATAACACA TGGAAAGGAC 720 ATTTCAGAGT TACCAAAGGG AAACAAAGAA GATAGCCTGA AAAAAATTAA AAGGACACCT 780 TCTGCTACGT TTCAGCAAGC CACAAAGATT AAAAAATTAA GAGCAGGTAA ACTCTCTCCT 840 CTCAAGTCTA AGTTTAAGAC AGGGAAGCTT CAAATAGGAA GGAAGGGGGT ACAAATTGTA 900 CGACGGAGAG GAAGGCCTCC ATCAACAGAA AGGATAAAGA CCCCTTCGGG TCTCCTCATT 960 AATTCTGAAC TGGAAAAGCC CCAGAAAGTC CGGAAAGACA AGGAAGGAAC ACCTCCACTT 1020 ACAAAAGAAG ATAAGACAGT TGTCAGACAA AGCCCTCGAA GGATTAAGCC AGTTAGGATT 1080 ATTCCTTCTT CAAAAAGGAC AGATGCAACC ATTGCTAAGC AACTCTTACA GAGGGCAAAA 1140 AAGGGGGCTC AAAAGAAAAT TGAAAAAGAA GCAGCTCAGC TGCAGGGAAG AAAGGTGAAG 1200 ACACAGGTCA AAAATATTCG ACAGTTCATC ATGCCTGTTG TCAGTGCTAT CTCCTCGCGG 1260 ATCATTAAGA CCCCTCGGCG GTTTATAGAG GATGAGGATT ATGACCCTCC AATTAAAATT 1320 GCCCGATTAG AGTCTACACC GAATAGTAGA TTCAGTGCCC CGTCCTGTGG ATCTTCTGAA 1380 AAATCAAGTG CAGCTTCTCA GCACTCCTCT CAAATGTCTT CAGACTCCTC TCGATCTAGT 1440 AGCCCCAGTG TTGATACCTC CACAGACTCT CAGGCTTCTG AGGAGATTCA GGTACTTCCT 1500 GAGGAGCGGA GCGATACCCC TGAAGTTCAT CCTCCACTGC CCATTTCCCA GTCCCCAGAA 1560 AATGAGAGTA ATGATAGGAG AAGCAGAAGG TATTCAGTGT CGGAGAGAAG TTTTGGATCT 1620 AGAACGACGA AAAAATTATC AACTCTACAA AGTGCCCCCC AGCAGCAGAC CTCCTCGTCT 
1680 CCACCTCCAC CTCTGCTGAC TCCACCGCCA CCACTGCAGC CAGCCTCCAG TATCTCTGAC 1740 CACACACCTT GGCTTATGCC TCCAACAATC CCCTTAGCAT CACCATTTTT GCCTGCTTCC 1800 ACTGCTCCTA TGCAAGGGAA GCGAAAATCT ATTTTGCGAG AACCGACATT TAGGTGGACT 1860 TCTTTAAAGC ATTCTAGGTC AGAGCCACAA TACTTTTCCT CAGCAAAGTA TGCCAAAGAA 1920 GGTCTTATTC GCAAACCAAT ATTTGATAAT TTCCGACCCC CTCCACTAAC TCCCGAGGAC 1980 GTTGGCTTTG CATCTGGTTT TTCTGCATCT GGTACCGCTG CTTCAGCCCG ATTGTTTTCG 2040 CCACTCCATT CTGGAACAAG GTTTGATATG CACAAAAGGA GCCCTCTTCT GAGAGCTCCA 2100 AGATTTACTC CAAGTGAGGC TCACTCTAGA ATATTTGAGT CTGTAACCTT GCCTAGTAAT 2160 CGAACTTCTG CTGGAACATC TTCTTCAGGA GTATCCAATA GAAAAAGGAA AAGAAAAGTG 2220 TTTAGTCCTA TTCGATCTGA ACCAAGATCT CCTTCTCACT CCATGAGGAC AAGAAGTGGA 2280 AGGCTTAGTA GTTCTGAGCT CTCACCTCTC ACCCCCCCGT CTTCTGTCTC TTCCTCGTTA 2340 AGCATTTCTG TTAGTCCTCT TGCCACTAGT GCCTTAAACC CAACTTTTAC TTTTCCTTCT 2400 CATTCCCTGA CTCAGTCTGG GGAATCTGCA GAGAAAAATC AGAGACCAAG GAAGCAGACT 2460 AGTGCTCCGG CAGAGCCATT TTCATCAAGT AGTCCTACTC CTCTCTTCCC TTGGTTTACC 2520 CCAGGCTCTC AGACTGAAAG AGGGAGAAAT AAAGACAAGG CCCCCGAGGA GCTGTCCAAA 2580 GATCGAGATG CTGACAAGAG CGTGGAGAAG GACAAGAGTA GAGAGAGAGA CCGGGAGAGA 2640 GAAAAGGAGA ATAAGCGGGA GTCAAGGAAA GAGAAAAGGA AAAAGGGATC AGAAATTCAG 2700 AGTAGTTCTG CTTTGTATCC TGTGGGTAGG GTTTCCAAAG AGAAGGTTGT TGGTGAAGAT 2760 GTTGCCACTT CATCTTCTGC CAAAAAAGCA ACAGGGCGGA AGAAGTCTTC ATCACATGAT 2820 TCTGGGACTG ATATTACTTC TGTGACTCTT GGGGATACAA CAGCTGTCAA AACCAAAATA 2880 CTTATAAAGA AAGGGAGAGG AAATCTGGAA AAAACCAACT TGGACCTCGG CCCAACTGCC 2940 CCATCCCTGG AGAAGGAGAA AACCCTCTGC CTTTCCACTC CTTCATCTAG CACTGTTAAA 3000 CATTCCACTT CCTCCATAGG CTCCATGTTG GCTCAGGCAG ACAAGCTTCC AATGACTGAC 3060 AAGAGGGTTG CCAGCCTCCT AAAAAAGGCC AAAGCTCAGC TCTGCAAGAT TGAGAAGAGT 3120 AAGAGTCTTA AACAAACCGA CCAGCCCAAA GCACAGGGTC AAGAAAGTGA CTCATCAGAG 3180 ACCTCTGTGC GAGGACCCCG GATTAAACAT GTCTGCAGAA GAGCAGCTGT TGCCCTTGGC 3240 CGAAAACGAG CTGTGTTTCC TGATGACATG CCCACCCTGA GTGCCTTACC ATGGGAAGAA 3300 CGAGAAAAGA TTTTGTCTTC CATGGGGAAT GATGACAAGT CATCAATTGC TGGCTCAGAA 3360 
GATGCTGAAC CTCTTGCTCC ACCCATCAAA CCAATTAAAC CTGTCACTAG AAACAAGGCA 3420 CCCCAGGAAC CTCCAGTAAA GAAAGGACGT CGATCGAGGC GGTGTGGGCA GTGTCCCGGC 3480 TGCCAGGTGC CTGAGGACTG TGGTGTTTGT ACTAATTGCT TAGATAAGCC CAAGTTTGGT 3540 GGTCGCAATA TAAAGAAGCA GTGCTGCAAG ATGAGAAAAT GTCAGAATCT ACAATGGATG 3600 CCTTCCAAAG CCTACCTGCA GAAGCAAGCT AAAGCTGTGA AAAAGAAAGA GAAAAAGTCT 3660 AAGACCAGTG AAAAGAAAGA CAGCAAAGAG AGCAGTGTTG TGAAGAACGT GGTGGACTCT 3720 AGTCAGAAAC CTACCCCATC AGCAAGAGAG GATCCTGCCC CAAAGAAAAG CAGTAGTGAG 3780 CCTCCTCCAC GAAAGCCCGT CGAGGAAAAG AGTGAAGAAG GGAATGTCTC GGCCCCTGGG 3840 CCTGAATCCA AACAGGCCAC CACTCCAGCT TCCAGGAAGT CAAGCAAGCA GGTCTCCCAG 3900 CCAGCACTGG TCATCCCGCC TCAGCCACCT ACTACAGGAC CGCCAAGAAA AGAAGTTCCC 3960 AAAACCACTC CTAGTGAGCC CAAGAAAAAG CAGCCTCCAC CACCAGAATC AGGTCCAGAG 4020 CAGAGCAAAC AGAAAAAAGT GGCTCCCCGC CCAAGTATCC CTGTAAAACA AAAACCAAAA 4080 GAAAAGGAAA AACCACCTCC GGTCAATAAG CAGGAGAATG CAGGCACTTT GAACATCCTC 4140 AGCACTCTCT CCAATGGCAA TAGTTCTAAG CAAAAAATTC CAGCAGATGG AGTCCACAGG 4200 ATCAGAGTGG ACTTTAAGGA GGATTGTGAA GCAGAAAATG TGTGGGAGAT GGGAGGCTTA 4260 GGAATCTTGA CTTCTGTTCC TATAACACCC AGGGTGGTTT GCTTTCTCTG TGCCAGTAGT 4320 GGGCATGTAG AGTTTGTGTA TTGCCAAGTC TGTTGTGAGC CCTTCCACAA GTTTTGTTTA 4380 GAGGAGAACG AGCGCCCTCT GGAGGACCAG CTGGAAAATT GGTGTTGTCG TCGTTGCAAA 4440 TTCTGTCACG TTTGTGGAAG GCAACATCAG GCTACAAAGC AGCTGCTGGA GTGTAATAAG 4500 TGCCGAAACA GCTATCACCC TGAGTGCCTG GGACCAAACT ACCCCACCAA ACCCACAAAG 4560 AAGAAGAAAG TCTGGATCTG TACCAAGTGT GTTCGCTGTA AGAGCTGTGG ATCCACAACT 4620 CCAGGCAAAG GGTGGGATGC ACAGTGGTCT CATGATTTCT CACTGTGTCA TGATTGCGCC 4680 AAGCTCTTTG CTAAAGGAAA CTTCTGCCCT CTCTGTGACA AATGTTATGA TGATGATGAC 4740 TATGAGAGTA AGATGATGCA ATGTGGAAAG TGTGATCGCT GGGTCCATTC CAAATGTGAG 4800 AATCTTTCAG ATGAGATGTA TGAGATTCTA TCTAATCTGC CAGAAAGTGT GGCCTACACT 4860 TGTGTGAACT GTACTGAGCG GCACCCTGCA GAGTGGCGAC TGGCCCTTGA AAAAGAGCTG 4920 CAGATTTCTC TGAAGCAAGT TCTGACAGCT TTGTTGAATT CTCGGACTAC CAGCCATTTG 4980 CTACGCTACC GGCAGGCTGC CAAGCCTCCA GACTTAAATC CCGAGACAGA GGAGAGTATA 5040 CCTTCCCGCA 
GCTCCCCCGA AGGACCTGAT CCACCAGTTC TTACTGAGGT CAGCAAACAG 5100 GATGATCAGC AGCCTTTAGA TCTAGAAGGA GTCAAGAGGA AGATGGACCA AGGGAATTAC 5160 ACATCTGTGT TGGAGTTCAG TGATGATATT GTGAAGATCA TTCAAGCAGC CATTAATTCA 5220 GATGGAGGAC AGCCAGAAAT TAAAAAAGCC AACAGCATGG TCAAGTCCTT CTTCATTCGG 5280 CAAATGGAAC GTGTTTTTCC ATGGTTCAGT GTCAAAAAGT CCAGGTTTTG GGAGCCAAAT 5340 AAAGTATCAA GCAACAGTGG GATGTTACCA AACGCAGTGC TTCCACCTTC ACTTGACCAT 5400 AATTATGCTC AGTGGCAGGA GCGAGAGGAA AACAGCCACA CTGAGCAGCC TCCTTTAATG 5460 AAGAAAATCA TTCCAGCTCC CAAACCCAAA GGTCCTGGAG AACCAGACTC ACCAACTCCT 5520 CTGCATCCTC CTACACCACC AATTTTGAGT ACTGATAGGA GTCGAGAAGA CAGTCCAGAG 5580 CTGAACCCAC CCCCAGGCAT AGAAGACAAT AGACAGTGTG CGTTATGTTT GACTTATGGT 5640 GATGACAGTG CTAATGATGC TGGTCGTTTA CTATATATTG GCCAAAATGA GTGGACACAT 5700 GTAAATTGTG CTTTGTGGTC AGCGGAAGTG TTTGAAGATG ATGACGGATC ACTAAAGAAT 5760 GTGCATATGG CTGTGATCAG GGGCAAGCAG CTGAGATGTG AATTCTGCCA AAAGCCAGGA 5820 GCCACCGTGG GTTGCTGTCT CACATCCTGC ACCAGCAACT ATCACTTCAT GTGTTCCCGA 5880 GCCAAGAACT GTGTCTTTCT GGATGATAAA AAAGTATATT GCCAACGACA TCGGGATTTG 5940 ATCAAAGGCG AAGTGGTTCC TGAGAATGGA TTTGAAGTTT TCAGAAGAGT GTTTGTGGAC 6000 TTTGAAGGAA TCAGCTTGAG AAGGAAGTTT CTCAATGGCT TGGAACCAGA AAATATCCAC 6060 ATGATGATTG GGTCTATGAC AATCGACTGC TTAGGAATTC TAAATGATCT CTCCGACTGT 6120 GAAGATAAGC TCTTTCCTAT TGGATATCAG TGTTCCAGGG TATACTGGAG CACCACAGAT 6180 GCTCGCAAGC GCTGTGTATA TACATGCAAG ATAGTGGAGT GCCGTCCTCC AGTCGTAGAG 6240 CCGGATATCA ACAGCACTGT TGAACATGAT GAAAACAGGA CCATTGCCCA TAGTCCAACA 6300 TCTTTTACAG AAAGTTCATC AAAAGAGAGT CAAAACACAG CTGAAATTAT AAGTCCTCCA 6360 TCACCAGACC GACCTCCTCA TTCACAAACC TCTGGCTCCT GTTATTATCA TGTCATCTCA 6420 AAGGTCCCCA GGATTCGAAC ACCCAGTTAT TCTCCAACAC AGAGATCCCC TGGCTGTCGA 6480 CCGTTGCCTT CTGCAGGAAG TCCTACCCCA ACCACTCATG AAATAGTCAC AGTAGGTGAT 6540 CCTTTACTCT CCTCTGGACT TCGAAGCATT GGCTCCAGGC GTCACAGTAC CTCTTCCTTA 6600 TCACCCCAGC GGTCCAAACT CCGGATAATG TCTCCAATGA GAACTGGGAA TACTTACTCT 6660 AGGAATAATG TTTCCTCAGT CTCCACCACC GGGACCGCTA CTGATCTTGA ATCAAGTGCC 6720 AAAGTAGTTG ATCATGTCTT 
AGGGCCACTG AATTCAAGTA CTAGTTTAGG GCAAAACACT 6780 TCCACCTCTT CAAATTTGCA AAGGACAGTG GTTACTGTAG GCAATAAAAA CAGTCACTTG 6840 GATGGATCTT CATCTTCAGA AATGAAGCAG TCCAGTGCTT CAGACTTGGT GTCCAAGAGC 6900 TCCTCTTTAA AGGGAGAGAA GACCAAAGTG CTGAGTTCCA AGAGCTCAGA GGGATCTGCA 6960 CATAATGTGG CTTACCCTGG AATTCCTAAA CTGGCCCCAC AGGTTCATAA CACAACATCT 7020 AGAGAACTGA ATGTTAGTAA AATCGGCTCC TTTGCTGAAC CCTCTTCAGT GTCGTTTTCT 7080 TCTAAAGAGG CCCTCTCCTT CCCACACCTC CATTTGAGAG GGCAAAGGAA TGATCGAGAC 7140 CAACACACAG ATTCTACCCA ATCAGCAAAC TCCTCTCCAG ATGAAGATAC TGAAGTCAAA 7200 ACCTTGAAGC TATCTGGAAT GAGCAACAGA TCATCCATTA TCAACGAACA TATGGGATCT 7260 AGTTCCAGAG ATAGGAGACA GAAAGGGAAA AAATCCTGTA AAGAAACTTT CAAAGAAAAG 7320 CATTCCAGTA AATCTTTTTT GGAACCTGGT CAGGTGACAA CTGGTGAGGA AGGAAACTTG 7380 AAGCCAGAGT TTATGGATGA GGTTTTGACT CCTGAGTATA TGGGCCAACG ACCATGTAAC 7440 AATGTTTCTT CTGATAAGAT TGGTGATAAA GGCCTTTCTA TGCCAGGAGT CCCCAAAGCT 7500 CCACCCATGC AAGTAGAAGG ATCTGCCAAG GAATTACAGG CACCACGGAA ACGCACAGTC 7560 AAAGTGACAC TGACACCTCT AAAAATGGAA AATGAGAGTC AATCCAAAAA TGCCCTGAAA 7620 GAAAGTAGTC CTGCTTCCCC TTTGCAAATA GAGTCAACAT CTCCCACAGA ACCAATTTCA 7680 GCCTCTGAAA ATCCAGGAGA TGGTCCAGTG GCCCAACCAA GCCCCAATAA TACCTCATGC 7740 CAGGATTCTC AAAGTAACAA CTATCAGAAT CTTCCAGTAC AGGACAGAAA CCTAATGCTT 7800 CCAGATGGCC CCAAACCTCA GGAGGATGGC TCTTTTAAAA GGAGGTATCC CCGTCGCAGT 7860 GCCCGTGCAC GTTCTAACAT GTTTTTTGGG CTTACCCCAC TCTATGGAGT AAGATCCTAT 7920 GGTGAAGAAG ACATTCCATT CTACAGCAGC TCAACTGGGA AGAAGCGAGG CAAGAGATCA 7980 GCTGAAGGAC AGGTGGATGG GGCCGATGAC TTAAGCACTT CAGATGAAGA CGACTTATAC 8040 TATTACAACT TCACTAGAAC AGTGATTTCT TCAGGTGGAG AGGAACGACT GGCATCCCAT 8100 AATTTATTTC GGGAGGAGGA ACAGTGTGAT CTTCCAAAAA TCTCACAGTT GGATGGTGTT 8160 GATGATGGGA CAGAGAGTGA TACTAGTGTC ACAGCCACAA CAAGGAAAAG CAGCCAGATT 8220 CCAAAAAGAA ATGGTAAAGA AAATGGAACA GAGAACTTAA AGATTGATAG ACCTGAAGAT 8280 GCTGGGGAGA AAGAACATGT CACTAAGAGT TCTGTTGGCC ACAAAAATGA GCCAAAGATG 8340 GATAACTGCC ATTCTGTAAG CAGAGTTAAA ACACAGGGAC AAGATTCCTT GGAAGCTCAG 8400 CTCAGCTCAT TGGAGTCAAG CCGCAGAGTC 
CACACAAGTA CCCCCTCCGA CAAAAATTTA 8460 CTGGACACCT ATAATACTGA GCTCCTGAAA TCAGATTCAG ACAATAACAA CAGTGATGAC 8520 TGTGGGAATA TCCTGCCTTC AGACATTATG GACTTTGTAC TAAAGAATAC TCCATCCATG 8580 CAGGCTTTGG GTGAGAGCCC AGAGTCATCT TCATCAGAAC TCCTGAATCT TGGTGAAGGA 8640 TTGGGTCTTG ACAGTAATCG TGAAAAAGAC ATGGGTCTTT TTGAAGTATT TTCTCAGCAG 8700 CTGCCTACAA CAGAACCTGT GGATAGTAGT GTCTCTTCCT CTATCTCAGC AGAGGAACAG 8760 TTTGAGTTGC CTCTAGAGCT ACCATCTGAT CTGTCTGTCT TGACCACCCG GAGTCCCACT 8820 GTCCCCAGCC AGAATCCCAG TAGACTAGCT GTTATCTCAG ACTCAGGGGA GAAGAGAGTA 8880 ACCATCACAG AAAAATCTGT AGCCTCCTCT GAAAGTGACC CAGCACTGCT GAGCCCAGGA 8940 GTAGATCCAA CTCCTGAAGG CCACATGACT CCTGATCATT TTATCCAAGG ACACATGGAT 9000 GCAGACCACA TCTCTAGCCC TCCTTGTGGT TCAGTAGAGC AAGGTCATGG CAACAATCAG 9060 GATTTAACTA GGAACAGTAG CACCCCTGGC CTTCAGGTAC CTGTTTCCCC AACTGTTCCC 9120 ATCCAGAACC AGAAGTATGT GCCCAATTCT ACTGATAGTC CTGGCCCGTC TCAGATTTCC 9180 AATGCAGCTG TCCAGACCAC TCCACCCCAC CTGAAGCCAG CCACTGAGAA ACTCATAGTT 9240 GTTAACCAGA ACATGCAGCC ACTTTATGTT CTCCAAACTC TTCCAAATGG AGTGACCCAA 9300 AAAATCCAAT TGACCTCTTC TGTTAGTTCT ACACCCAGTG TGATGGAGAC AAATACTTCA 9360 GTATTGGGAC CCATGGGAGG TGGTCTCACC CTTACCACAG GACTAAATCC AAGCTTGCCA 9420 ACTTCTCAAT CTTTGTTCCC TTCTGCTAGC AAAGGATTGC TACCCATGTC TCATCACCAG 9480 CACTTACATT CCTTCCCTGC AGCTACTCAA AGTAGTTTCC CACCAAACAT CAGCAATCCT 9540 CCTTCAGGCC TGCTTATTGG GGTTCAGCCT CCTCCGGATC CCCAACTTTT GGTTTCAGAA 9600 TCCAGCCAGA GGACAGACCT CAGTACCACA GTAGCCACTC CATCCTCTGG ACTCAAGAAA 9660 AGACCCATAT CTCGTCTACA GACCCGAAAG AATAAAAAAC TTGCTCCCTC TAGTACCCCT 9720 TCAAACATTG CCCCTTCTGA TGTGGTTTCT AATATGACAT TGATTAACTT CACACCCTCC 9780 CAGCTTCCTA ATCATCCAAG TCTGTTAGAT TTGGGGTCAC TTAATACTTC ATCTCACCGA 9840 ACTGTCCCCA ACATCATAAA AAGATCTAAA TCTAGCATCA TGTATTTTGA ACCGGCACCC 9900 CTGTTACCAC AGAGTGTGGG AGGAACTGCT GCCACAGCGG CAGGCACATC AACAATAAGC 9960 CAGGATACTA GCCACCTCAC ATCAGGGTCT GTGTCTGGCT TGGCATCCAG TTCCTCTGTC 10020 TTGAATGTTG TATCCATGCA AACTACCACA ACCCCTACAA GTAGTGCGTC AGTTCCAGGA 10080 CACGTCACCT TAACCAACCC AAGGTTGCTT 
GGTACCCCAG ATATTGGCTC AATAAGCAAT 10140 CTTTTAATCA AAGCTAGCCA GCAGAGCCTG GGGATTCAGG ACCAGCCTGT GGCTTTACCG 10200 CCAAGTTCAG GAATGTTTCC ACAACTGGGG ACATCACAGA CCCCCTCTAC TGCTGCAATA 10260 ACAGCGGCAT CTAGCATCTG TGTGCTCCCC TCCACTCAGA CTACGGGCAT AACAGCCGCT 10320 TCACCTTCTG GGGAAGCAGA CGAACACTAT CAGCTTCAGC ATGTGAACCA GCTCCTTGCC 10380 AGCAAAACTG GGATTCATTC TTCCCAGCGT GATCTTGATT CTGCTTCAGG GCCCCAGGTA 10440 TCCAACTTTA CCCAGACGGT AGACGCTCCT AATAGCATGG GACTGGAGCA GAACAAGGCT 10500 TTATCCTCAG CTGTGCAAGC CAGCCCCACC TCTCCTGGGG GTTCTCCATC CTCTCCATCT 10560 TCTGGACAGC GGTCAGCAAG CCCTTCAGTG CCGGGTCCCA CTAAACCCAA ACCAAAAACC 10620 AAACGGTTTC AGCTGCCTCT AGACAAAGGG AATGGCAAGA AGCACAAAGT TTCCCATTTG 10680 CGGACCAGTT CTTCTGAAGC ACACATTCCA GACCAAGAAA CGACATCCCT GACCTCAGGC 10740 ACAGGGACTC CAGGAGCAGA GGCTGAGCAG CAGGATACAG CTAGCGTGGA GCAGTCCTCC 10800 CAGAAGGAGT GTGGGCAACC TGCAGGGCAA GTCGCTGTTC TTCCGGAAGT TCAGGTGACC 10860 CAAAATCCAG CAAATGAACA AGAAAGTGCA GAACCTAAAA CAGTGGAAGA AGAGGAAAGT 10920 AATTTCAGCT CCCCACTGAT GCTTTGGCTT CAGCAAGAAC AAAAGCGGAA GGAAAGCATT 10980 ACTGAGAAAA AACCCAAGAA AGGACTTGTT TTTGAAATTT CCAGTGATGA TGGCTTTCAG 11040 ATCTGTGCAG AAAGTATTGA AGATGCCTGG AAGTCATTGA CAGATAAAGT CCAGGAAGCT 11100 CGATCAAATG CCCGCCTAAA GCAGCTCTCA TTTGCAGGTG TTAACGGTTT GAGGATGCTG 11160 GGGATTCTCC ATGATGCAGT TGTGTTCCTC ATTGAGCAGC TGTCTGGTGC CAAGCACTGT 11220 CGAAATTACA AATTCCGTTT CCACAAGCCA GAGGAGGCCA ATGAACCCCC CTTGAACCCT 11280 CACGGCTCAG CCAGGGCTGA AGTCCACCTC AGGAAGTCAG CATTTGACAT GTTTAACTTC 11340 CTGGCTTCTA AACATCGTCA GCCTCCTGAA TACAACCCCA ATGATGAAGA AGAGGAGGAG 11400 GTACAGCTGA AGTCAGCTCG GAGGGCAACT AGCATGGATC TGCCAATGCC CATGCGCTTC 11460 CGGCACTTAA AAAAGACTTC TAAGGAGGCA GTTGGTGTCT ACAGGTCTCC CATCCATGGC 11520 CGGGGTCTTT TCTGTAAGAG AAACATTGAT GCAGGTGAGA TGGTGATTGA GTATGCCGGC 11580 AACGTCATCC GCTCCATCCA GACTGACAAG CGGGAAAAGT ATTACGACAG CAAGGGCATT 11640 GGTTGCTATA TGTTCCGAAT TGATGACTCA GAGGTAGTGG ATGCCACCAT GCATGGAAAT 11700 GCTGCACGCT TCATCAATCA CTCGTGTGAG CCTAACTGCT ATTCTCGGGT CATCAATATT 11760 GATGGGCAGA 
AGCACATTGT CATCTTTGCC ATGCGTAAGA TCTACCGAGG AGAGGAACTC 11820 ACTTACGACT ATAAGTTCCC CATTGAGGAT GCCAGCAACA AGCTGCCCTG CAACTGTGGC 11880 GCCAAGAAAT GCCGGAAGTT CCTAAACTAA AGCTGCTCTT CTCCCCCAGT GTTGGAGTGC 11940 AAGGAGGCGG GGCCATCCAA AGCAACGCTG AAGGCCTTTT CCAGCAGCTG GGAGCTCCCG 12000 GATTGCGTGG CACAGCTGAG GGGCCTCTGT GATGGCTGAG CTCTCTTATG TCCTATACTC 12060 ACATCAGACA TGTGATCATA GTCCCAGAGA CAGAGTTGAG GTCTCGAAGA AAAGATCCAT 12120 GATCGGCTTT CTCCTGGGGC CCCTCCAATT GTTTACTGTT AGAAAGTGGG AATGGGGTCC 12180 CTAGCAGACT TGCCTGGAAG GAGCCTATTA TAGAGGGTTG GTTATGTTGG GAGATTGGGC 12240 CTGAATTTCT CCACAGAAAT AAGTTGCCAT CCTCAGGTTG GCCCTTTCCC AAGCACTGTA 12300 AGTGAGTGGG TCAGGCAAAG CCCCAAATGG AGGGTTGGTT AGATTCCTGA CAGTTTGCCA 12360 GCCAGGCCCC ACCTACAGCG TCTGTCGAAC AAACAGAGGT CTGGTGGTTT TCCCTACTAT 12420 CCTCCCACTC GAGAGTTCAC TTCTGGTTGG GAGACAGGAT TCCTAGCACC TCCGGTGTCA 12480 AAAGGCTGTC ATGGGGTTGT GCCAATTAAT TACCAAACAT TGAGCCTGCA GGCTTTGAGT 12540 GGGAGTGTTG CCCCCAGGAG CCTTATCTCA GCCAATTACC TTTCTTGACA GTAGGAGCGG 12600 CTTCCCTCTC CCATTCCCTC TTCACTCCCT TTTCTTCCTT TCCCCTGTCT TCATGCCACT 12660 GCTTTCCCAT GCTTCTTTCG GGTTGTAGGG GAGACTGACT GCCTGCTCAA GGACACTCCC 12720 TGCTGGGCAT AGGATGTGCC TGCAAAAAGT TCCCTGAGCC TGTAAGCACT CCAGGTGGGG 12780 AAGTGGACAG GAGCCATTGG TCATAACCAG ACAGAATTTG GAAACATTTT CATAAAGCTC 12840 CATGGAGAGT TTTAAAGAAA CATATGTAGC ATGATTTTGT AGGAGAGGAA AAAGATTATT 12900 TAAATAGGAT TTAAATCATG CAACAACGAG AGTATCACAG CCAGGATGAC CCTTGGGTCC 12960 CATTCCTAAG ACATGGTTAC TTTATTTTCC CCTTGTTAAG ACATAGGAAG ACTTAATTTT 13020 TAAACGGTCA GTGTCCAGTT GAAGGCAGAA CACTAATCAG ATTTCAAGGC CCACAACTTG 13080 GGGACTAGAC CACCTTATGT TGAGGGAACT CTGCCACCTG CGTGCAACCC ACAGCTAAAG 13140 TAAATTCAAT GACACTACTG CCCTGATTAC TCCTTAGGAT GTGGTCAAAA CAGCATCAAA 13200 TGTTTCTTCT CTTCCTTTCC CCAAGACAGA GTCCTGAACC TGTTAAATTA AGTCATTGGA 13260 TTTTACTCTG TTCTGTTTAC AGTTTACTAT TTAAGGTTTT ATAAATGTAA ATATATTTTG 13320 TATATTTTTC TATGAGAAGC ACTTCATAGG GAGAAGCACT TATGACAAGG CTATTTTTTA 13380 AACCGCGGTA TTATCCTAAT TTAAAAGAAG ATCGGTTTTT AATAATTTTT TATTTTCATA 
13440 GGATGAAGTT AGAGAAAATA TTCAGCTGTA CACACAAAGT CTGGTTTTTC CTGCCCAACT 13500 TCCCCCTGGA AGGTGTACTT TTTGTTGTTT AATGTGTAGC TTGTTTGTGC CCTGTTGACA 13560 TAAATGTTTC CTGGGTTTGC TCTTTGACAA TAAATGGAGA AGGAAGGTCA CCCAACTCCA 13620 TTGGGCCACT CCCCTCCTTC CCCTATTGAA GCTCC
13656
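The nucleotide record can be cross-checked against the protein sequence by translating it. The sketch below is a minimal example, assuming the standard genetic code and assuming that the coding sequence starts at nucleotide 1 of the record (the record begins with ATG). It translates the first 30 nucleotides and the 30 nucleotides ending at position 11910, both copied from the numbered listing above, and maps a protein position to its codon. The names `translate` and `codon_span` are hypothetical helpers.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate DNA in frame 1; '*' marks a stop codon."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def codon_span(residue):
    """1-based nucleotide range of the codon for a 1-based residue number,
    assuming the coding sequence starts at nucleotide 1."""
    return 3 * residue - 2, 3 * residue


# Nucleotides 1-30 and 11881-11910 of the numbered listing.
CDS_START = "ATGGCGCACAGCTGTCGGTGGCGCTTCCCC"
CDS_END = "GCCAAGAAATGCCGGAAGTTCCTAAACTAA"

if __name__ == "__main__":
    print(translate(CDS_START))   # compare with the protein N-terminus
    print(translate(CDS_END))     # compare with the protein C-terminus
    print(codon_span(3830))       # codon of the first SET1 residue
```

Under these assumptions the two translations agree with the first ten and last nine residues of the protein sequence, followed by a stop codon at nucleotides 11908 to 11910, so the remainder of the record would be 3' untranslated sequence.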
>ENSP00000374157.5|Bromodomain|Homo sapiens ATGGCGCACAGCTGTCGGTGGCGCTTCCCCGCCCGACCCGGGACCACCGGGGGCGGCGGCGGCGGGGGGCGCCGGGGCCTAGGGGGCGCCCCGCGGCAACGCGTCCCGGCCCTGCTGCTTCCCCCCGGGCCCCCGGTCGGCGGTGGCGGCCCCGGGGCGCCCCCCTCCCCCCCGGCTGTGGCGGCCGCGGCGGCGGCGGCGGGAAGCAGCGGGGCTGGGGTTCCAGGGGGAGCGGCCGCCGCCTCAGCAGCCTCCTCGTCGTCCGCCTCGTCTTCGTCTTCGTCATCGTCCTCAGCCTCTTCAGGGCCGGCCCTGCTCCGGGTGGGCCCGGGCTTCGACGCGGCGCTGCAGGTCTCGGCCGCCATCGGCACCAACCTGCGCCGGTTCCGGGCCGTGTTTGGGGAGAGCGGCGGGGGAGGCGGCAGCGGAGAGGATGAGCAATTCTTAGGTTTTGGCTCAGATGAAGAAGTCAGAGTGCGAAGTCCCACAAGGTCTCCTTCAGTTAAAACTAGTCCTCGAAAACCTCGTGGGAGACCTAGAAGTGGCTCTGACCGAAATTCAGCTATCCTCTCAGATCCATCTGTGTTTTCCCCTCTAAATAAATCAGAGACCAAATCTGGAGATAAGATCAAGAAGAAAGATTCTAAAAGTATAGAAAAGAAGAGAGGAAGACCTCCCACCTTCCCTGGAGTAAAAATCAAAATAACACATGGAAAGGACATTTCAGAGTTACCAAAGGGAAACAAAGAAGATAGCCTGAAAAAAATTAAAAGGACACCTTCTGCTACGTTTCAGCAAGCCACAAAGATTAAAAAATTAAGAGCAGGTAAACTCTCTCCTCTCAAGTCTAAGTTTAAGACAGGGAAGCTTCAAATAGGAAGGAAGGGGGTACAAATTGTACGACGGAGAGGAAGGCCTCCATCAACAGAAAGGATAAAGACCCCTTCGGGTCTCCTCATTAATTCTGAACTGGAAAAGCCCCAGAAAGTCCGGAAAGACAAGGAAGGAACACCTCCACTTACAAAAGAAGATAAGACAGTTGTCAGACAAAGCCCTCGAAGGATTAAGCCAGTTAGGATTATTCCTTCTTCAAAAAGGACAGATGCAACCATTGCTAAGCAACTCTTACAGAGGGCAAAAAAGGGGGCTCAAAAGAAAATTGAAAAAGAAGCAGCTCAGCTGCAGGGAAGAAAGGTGAAGACACAGGTCAAAAATATTCGACAGTTCATCATGCCTGTTGTCAGTGCTATCTCCTCGCGGATCATTAAGACCCCTCGGCGGTTTATAGAGGATGAGGATTATGACCCTCCAATTAAAATTGCCCGATTAGAGTCTACACCGAATAGTAGATTCAGTGCCCCGTCCTGTGGATCTTCTGAAAAATCAAGTGCAGCTTCTCAGCACTCCTCTCAAATGTCTTCAGACTCCTCTCGATCTAGTAGCCCCAGTGTTGATACCTCCACAGACTCTCAGGCTTCTGAGGAGATTCAGGTACTTCCTGAGGAGCGGAGCGATACCCCTGAAGTTCATCCTCCACTGCCCATTTCCCAGTCCCCAGAAAATGAGAGTAATGATAGGAGAAGCAGAAGGTATTCAGTGTCGGAGAGAAGTTTTGGATCTAGAACGACGAAAAAATTATCAACTCTACAAAGTGCCCCCCAGCAGCAGACCTCCTCGTCTCCACCTCCACCTCTGCTGACTCCACCGCCACCACTGCAGCCAGCCTCCAGTATCTCTGACCACACACCTTGGCTTATGCCTCCAACAATCCCCTTAGCATCACCATTTTTGCCTGCTTCCACTGCTCCTATGCAAGGGAAGCGAAAATCTATTTTGCGAGAACCGACATTTAGGTGGACTTCTTTAAAGCATTCTAGGTCAGAGCCACAATACTTTTCCTCAGCAAAGTATGCCAAAGAAGGTCTTATTCGCAAACCAATATTTGATAATTTCCGA
CCCCCTCCACTAACTCCCGAGGACGTTGGCTTTGCATCTGGTTTTTCTGCATCTGGTACCGCTGCTTCAGCCCGATTGTTTTCGCCACTCCATTCTGGAACAAGGTTTGATATGCACAAAAGGAGCCCTCTTCTGAGAGCTCCAAGATTTACTCCAAGTGAGGCTCACTCTAGAATATTTGAGTCTGTAACCTTGCCTAGTAATCGAACTTCTGCTGGAACATCTTCTTCAGGAGTATCCAATAGAAAAAGGAAAAGAAAAGTGTTTAGTCCTATTCGATCTGAACCAAGATCTCCTTCTCACTCCATGAGGACAAGAAGTGGAAGGCTTAGTAGTTCTGAGCTCTCACCTCTCACCCCCCCGTCTTCTGTCTCTTCCTCGTTAAGCATTTCTGTTAGTCCTCTTGCCACTAGTGCCTTAAACCCAACTTTTACTTTTCCTTCTCATTCCCTGACTCAGTCTGGGGAATCTGCAGAGAAAAATCAGAGACCAAGGAAGCAGACTAGTGCTCCGGCAGAGCCATTTTCATCAAGTAGTCCTACTCCTCTCTTCCCTTGGTTTACCCCAGGCTCTCAGACTGAAAGAGGGAGAAATAAAGACAAGGCCCCCGAGGAGCTGTCCAAAGATCGAGATGCTGACAAGAGCGTGGAGAAGGACAAGAGTAGAGAGAGAGACCGGGAGAGAGAAAAGGAGAATAAGCGGGAGTCAAGGAAAGAGAAAAGGAAAAAGGGATCAGAAATTCAGAGTAGTTCTGCTTTGTATCCTGTGGGTAGGGTTTCCAAAGAGAAGGTTGTTGGTGAAGATGTTGCCACTTCATCTTCTGCCAAAAAAGCAACAGGGCGGAAGAAGTCTTCATCACATGATTCTGGGACTGATATTACTTCTGTGACTCTTGGGGATACAACAGCTGTCAAAACCAAAATACTTATAAAGAAAGGGAGAGGAAATCTGGAAAAAACCAACTTGGACCTCGGCCCAACTGCCCCATCCCTGGAGAAGGAGAAAACCCTCTGCCTTTCCACTCCTTCATCTAGCACTGTTAAACATTCCACTTCCTCCATAGGCTCCATGTTGGCTCAGGCAGACAAGCTTCCAATGACTGACAAGAGGGTTGCCAGCCTCCTAAAAAAGGCCAAAGCTCAGCTCTGCAAGATTGAGAAGAGTAAGAGTCTTAAACAAACCGACCAGCCCAAAGCACAGGGTCAAGAAAGTGACTCATCAGAGACCTCTGTGCGAGGACCCCGGATTAAACATGTCTGCAGAAGAGCAGCTGTTGCCCTTGGCCGAAAACGAGCTGTGTTTCCTGATGACATGCCCACCCTGAGTGCCTTACCATGGGAAGAACGAGAAAAGATTTTGTCTTCCATGGGGAATGATGACAAGTCATCAATTGCTGGCTCAGAAGATGCTGAACCTCTTGCTCCACCCATCAAACCAATTAAACCTGTCACTAGAAACAAGGCACCCCAGGAACCTCCAGTAAAGAAAGGACGTCGATCGAGGCGGTGTGGGCAGTGTCCCGGCTGCCAGGTGCCTGAGGACTGTGGTGTTTGTACTAATTGCTTAGATAAGCCCAAGTTTGGTGGTCGCAATATAAAGAAGCAGTGCTGCAAGATGAGAAAATGTCAGAATCTACAATGGATGCCTTCCAAAGCCTACCTGCAGAAGCAAGCTAAAGCTGTGAAAAAGAAAGAGAAAAAGTCTAAGACCAGTGAAAAGAAAGACAGCAAAGAGAGCAGTGTTGTGAAGAACGTGGTGGACTCTAGTCAGAAACCTACCCCATCAGCAAGAGAGGATCCTGCCCCAAAGAAAAGCAGTAGTGAGCCTCCTCCACGAAAGCCCGTCGAGGAAAAGAGTGAAGAAGGGAATGTCTCGGCCCCTGGGCCTGAATCCAAACAGGCCACCACTCCAGCTTCCAGGAAGTCAAGCAAGCAGGTCTCCCAGCCAGCACTGGTCATCCCGCCTCAGCCACCTACTACAGGACCGCCAAGAAAAGAAGT
TCCCAAAACCACTCCTAGTGAGCCCAAGAAAAAGCAGCCTCCACCACCAGAATCAGGTCCAGAGCAGAGCAAACAGAAAAAAGTGGCTCCCCGCCCAAGTATCCCTGTAAAACAAAAACCAAAAGAAAAGGAAAAACCACCTCCGGTCAATAAGCAGGAGAATGCAGGCACTTTGAACATCCTCAGCACTCTCTCCAATGGCAATAGTTCTAAGCAAAAAATTCCAGCAGATGGAGTCCACAGGATCAGAGTGGACTTTAAGGAGGATTGTGAAGCAGAAAATGTGTGGGAGATGGGAGGCTTAGGAATCTTGACTTCTGTTCCTATAACACCCAGGGTGGTTTGCTTTCTCTGTGCCAGTAGTGGGCATGTAGAGTTTGTGTATTGCCAAGTCTGTTGTGAGCCCTTCCACAAGTTTTGTTTAGAGGAGAACGAGCGCCCTCTGGAGGACCAGCTGGAAAATTGGTGTTGTCGTCGTTGCAAATTCTGTCACGTTTGTGGAAGGCAACATCAGGCTACAAAGCAGCTGCTGGAGTGTAATAAGTGCCGAAACAGCTATCACCCTGAGTGCCTGGGACCAAACTACCCCACCAAACCCACAAAGAAGAAGAAAGTCTGGATCTGTACCAAGTGTGTTCGCTGTAAGAGCTGTGGATCCACAACTCCAGGCAAAGGGTGGGATGCACAGTGGTCTCATGATTTCTCACTGTGTCATGATTGCGCCAAGCTCTTTGCTAAAGGAAACTTCTGCCCTCTCTGTGACAAATGTTATGATGATGATGACTATGAGAGTAAGATGATGCAATGTGGAAAGTGTGATCGCTGGGTCCATTCCAAATGTGAGAATCTTTCAGATGAGATGTATGAGATTCTATCTAATCTGCCAGAAAGTGTGGCCTACACTTGTGTGAACTGTACTGAGCGGCACCCTGCAGAGTGGCGACTGGCCCTTGAAAAAGAGCTGCAGATTTCTCTGAAGCAAGTTCTGACAGCTTTGTTGAATTCTCGGACTACCAGCCATTTGCTACGCTACCGGCAGGCTGCCAAGCCTCCAGACTTAAATCCCGAGACAGAGGAGAGTATACCTTCCCGCAGCTCCCCCGAAGGACCTGATCCACCAGTTCTTACTGAGGTCAGCAAACAGGATGATCAGCAGCCTTTAGATCTAGAAGGAGTCAAGAGGAAGATGGACCAAGGGAATTACACATCTGTGTTGGAGTTCAGTGATGATATTGTGAAGATCATTCAAGCAGCCATTAATTCAGATGGAGGACAGCCAGAAATTAAAAAAGCCAACAGCATGGTCAAGTCCTTCTTCATTCGGCAAATGGAACGTGTTTTTCCATGGTTCAGTGTCAAAAAGTCCAGGTTTTGGGAGCCAAATAAAGTATCAAGCAACAGTGGGATGTTACCAAACGCAGTGCTTCCACCTTCACTTGACCATAATTATGCTCAGTGGCAGGAGCGAGAGGAAAACAGCCACACTGAGCAGCCTCCTTTAATGAAGAAAATCATTCCAGCTCCCAAACCCAAAGGTCCTGGAGAACCAGACTCACCAACTCCTCTGCATCCTCCTACACCACCAATTTTGAGTACTGATAGGAGTCGAGAAGACAGTCCAGAGCTGAACCCACCCCCAGGCATAGAAGACAATAGACAGTGTGCGTTATGTTTGACTTATGGTGATGACAGTGCTAATGATGCTGGTCGTTTACTATATATTGGCCAAAATGAGTGGACACATGTAAATTGTGCTTTGTGGTCAGCGGAAGTGTTTGAAGATGATGACGGATCACTAAAGAATGTGCATATGGCTGTGATCAGGGGCAAGCAGCTGAGATGTGAATTCTGCCAAAAGCCAGGAGCCACCGTGGGTTGCTGTCTCACATCCTGCACCAGCAACTATCACTTCATGTGTTCCCGAGCCAAGAACTGTGTCTTTCTGGATGATAAAAAAGTATATTGCCAACGACATCGGGATTTGATCAAAGGCGAAGTGG
TTCCTGAGAATGGATTTGAAGTTTTCAGAAGAGTGTTTGTGGACTTTGAAGGAATCAGCTTGAGAAGGAAGTTTCTCAATGGCTTGGAACCAGAAAATATCCACATGATGATTGGGTCTATGACAATCGACTGCTTAGGAATTCTAAATGATCTCTCCGACTGTGAAGATAAGCTCTTTCCTATTGGATATCAGTGTTCCAGGGTATACTGGAGCACCACAGATGCTCGCAAGCGCTGTGTATATACATGCAAGATAGTGGAGTGCCGTCCTCCAGTCGTAGAGCCGGATATCAACAGCACTGTTGAACATGATGAAAACAGGACCATTGCCCATAGTCCAACATCTTTTACAGAAAGTTCATCAAAAGAGAGTCAAAACACAGCTGAAATTATAAGTCCTCCATCACCAGACCGACCTCCTCATTCACAAACCTCTGGCTCCTGTTATTATCATGTCATCTCAAAGGTCCCCAGGATTCGAACACCCAGTTATTCTCCAACACAGAGATCCCCTGGCTGTCGACCGTTGCCTTCTGCAGGAAGTCCTACCCCAACCACTCATGAAATAGTCACAGTAGGTGATCCTTTACTCTCCTCTGGACTTCGAAGCATTGGCTCCAGGCGTCACAGTACCTCTTCCTTATCACCCCAGCGGTCCAAACTCCGGATAATGTCTCCAATGAGAACTGGGAATACTTACTCTAGGAATAATGTTTCCTCAGTCTCCACCACCGGGACCGCTACTGATCTTGAATCAAGTGCCAAAGTAGTTGATCATGTCTTAGGGCCACTGAATTCAAGTACTAGTTTAGGGCAAAACACTTCCACCTCTTCAAATTTGCAAAGGACAGTGGTTACTGTAGGCAATAAAAACAGTCACTTGGATGGATCTTCATCTTCAGAAATGAAGCAGTCCAGTGCTTCAGACTTGGTGTCCAAGAGCTCCTCTTTAAAGGGAGAGAAGACCAAAGTGCTGAGTTCCAAGAGCTCAGAGGGATCTGCACATAATGTGGCTTACCCTGGAATTCCTAAACTGGCCCCACAGGTTCATAACACAACATCTAGAGAACTGAATGTTAGTAAAATCGGCTCCTTTGCTGAACCCTCTTCAGTGTCGTTTTCTTCTAAAGAGGCCCTCTCCTTCCCACACCTCCATTTGAGAGGGCAAAGGAATGATCGAGACCAACACACAGATTCTACCCAATCAGCAAACTCCTCTCCAGATGAAGATACTGAAGTCAAAACCTTGAAGCTATCTGGAATGAGCAACAGATCATCCATTATCAACGAACATATGGGATCTAGTTCCAGAGATAGGAGACAGAAAGGGAAAAAATCCTGTAAAGAAACTTTCAAAGAAAAGCATTCCAGTAAATCTTTTTTGGAACCTGGTCAGGTGACAACTGGTGAGGAAGGAAACTTGAAGCCAGAGTTTATGGATGAGGTTTTGACTCCTGAGTATATGGGCCAACGACCATGTAACAATGTTTCTTCTGATAAGATTGGTGATAAAGGCCTTTCTATGCCAGGAGTCCCCAAAGCTCCACCCATGCAAGTAGAAGGATCTGCCAAGGAATTACAGGCACCACGGAAACGCACAGTCAAAGTGACACTGACACCTCTAAAAATGGAAAATGAGAGTCAATCCAAAAATGCCCTGAAAGAAAGTAGTCCTGCTTCCCCTTTGCAAATAGAGTCAACATCTCCCACAGAACCAATTTCAGCCTCTGAAAATCCAGGAGATGGTCCAGTGGCCCAACCAAGCCCCAATAATACCTCATGCCAGGATTCTCAAAGTAACAACTATCAGAATCTTCCAGTACAGGACAGAAACCTAATGCTTCCAGATGGCCCCAAACCTCAGGAGGATGGCTCTTTTAAAAGGAGGTATCCCCGTCGCAGTGCCCGTGCACGTTCTAACATGTTTTTTGGGCTTACCCCACTCTATGGAGTAAGATCCTATGGTGAAGAAGACATTCCATTCTACAGCAGCTCAACT
GGGAAGAAGCGAGGCAAGAGATCAGCTGAAGGACAGGTGGATGGGGCCGATGACTTAAGCACTTCAGATGAAGACGACTTATACTATTACAACTTCACTAGAACAGTGATTTCTTCAGGTGGAGAGGAACGACTGGCATCCCATAATTTATTTCGGGAGGAGGAACAGTGTGATCTTCCAAAAATCTCACAGTTGGATGGTGTTGATGATGGGACAGAGAGTGATACTAGTGTCACAGCCACAACAAGGAAAAGCAGCCAGATTCCAAAAAGAAATGGTAAAGAAAATGGAACAGAGAACTTAAAGATTGATAGACCTGAAGATGCTGGGGAGAAAGAACATGTCACTAAGAGTTCTGTTGGCCACAAAAATGAGCCAAAGATGGATAACTGCCATTCTGTAAGCAGAGTTAAAACACAGGGACAAGATTCCTTGGAAGCTCAGCTCAGCTCATTGGAGTCAAGCCGCAGAGTCCACACAAGTACCCCCTCCGACAAAAATTTACTGGACACCTATAATACTGAGCTCCTGAAATCAGATTCAGACAATAACAACAGTGATGACTGTGGGAATATCCTGCCTTCAGACATTATGGACTTTGTACTAAAGAATACTCCATCCATGCAGGCTTTGGGTGAGAGCCCAGAGTCATCTTCATCAGAACTCCTGAATCTTGGTGAAGGATTGGGTCTTGACAGTAATCGTGAAAAAGACATGGGTCTTTTTGAAGTATTTTCTCAGCAGCTGCCTACAACAGAACCTGTGGATAGTAGTGTCTCTTCCTCTATCTCAGCAGAGGAACAGTTTGAGTTGCCTCTAGAGCTACCATCTGATCTGTCTGTCTTGACCACCCGGAGTCCCACTGTCCCCAGCCAGAATCCCAGTAGACTAGCTGTTATCTCAGACTCAGGGGAGAAGAGAGTAACCATCACAGAAAAATCTGTAGCCTCCTCTGAAAGTGACCCAGCACTGCTGAGCCCAGGAGTAGATCCAACTCCTGAAGGCCACATGACTCCTGATCATTTTATCCAAGGACACATGGATGCAGACCACATCTCTAGCCCTCCTTGTGGTTCAGTAGAGCAAGGTCATGGCAACAATCAGGATTTAACTAGGAACAGTAGCACCCCTGGCCTTCAGGTACCTGTTTCCCCAACTGTTCCCATCCAGAACCAGAAGTATGTGCCCAATTCTACTGATAGTCCTGGCCCGTCTCAGATTTCCAATGCAGCTGTCCAGACCACTCCACCCCACCTGAAGCCAGCCACTGAGAAACTCATAGTTGTTAACCAGAACATGCAGCCACTTTATGTTCTCCAAACTCTTCCAAATGGAGTGACCCAAAAAATCCAATTGACCTCTTCTGTTAGTTCTACACCCAGTGTGATGGAGACAAATACTTCAGTATTGGGACCCATGGGAGGTGGTCTCACCCTTACCACAGGACTAAATCCAAGCTTGCCAACTTCTCAATCTTTGTTCCCTTCTGCTAGCAAAGGATTGCTACCCATGTCTCATCACCAGCACTTACATTCCTTCCCTGCAGCTACTCAAAGTAGTTTCCCACCAAACATCAGCAATCCTCCTTCAGGCCTGCTTATTGGGGTTCAGCCTCCTCCGGATCCCCAACTTTTGGTTTCAGAATCCAGCCAGAGGACAGACCTCAGTACCACAGTAGCCACTCCATCCTCTGGACTCAAGAAAAGACCCATATCTCGTCTACAGACCCGAAAGAATAAAAAACTTGCTCCCTCTAGTACCCCTTCAAACATTGCCCCTTCTGATGTGGTTTCTAATATGACATTGATTAACTTCACACCCTCCCAGCTTCCTAATCATCCAAGTCTGTTAGATTTGGGGTCACTTAATACTTCATCTCACCGAACTGTCCCCAACATCATAAAAAGATCTAAATCTAGCATCATGTATTTTGAACCGGCACCCCTGTTACCACAGAGTGTGGGAGGAACTGCTGCCACAGCGGCAGGCACATCAACAAT
AAGCCAGGATACTAGCCACCTCACATCAGGGTCTGTGTCTGGCTTGGCATCCAGTTCCTCTGTCTTGAATGTTGTATCCATGCAAACTACCACAACCCCTACAAGTAGTGCGTCAGTTCCAGGACACGTCACCTTAACCAACCCAAGGTTGCTTGGTACCCCAGATATTGGCTCAATAAGCAATCTTTTAATCAAAGCTAGCCAGCAGAGCCTGGGGATTCAGGACCAGCCTGTGGCTTTACCGCCAAGTTCAGGAATGTTTCCACAACTGGGGACATCACAGACCCCCTCTACTGCTGCAATAACAGCGGCATCTAGCATCTGTGTGCTCCCCTCCACTCAGACTACGGGCATAACAGCCGCTTCACCTTCTGGGGAAGCAGACGAACACTATCAGCTTCAGCATGTGAACCAGCTCCTTGCCAGCAAAACTGGGATTCATTCTTCCCAGCGTGATCTTGATTCTGCTTCAGGGCCCCAGGTATCCAACTTTACCCAGACGGTAGACGCTCCTAATAGCATGGGACTGGAGCAGAACAAGGCTTTATCCTCAGCTGTGCAAGCCAGCCCCACCTCTCCTGGGGGTTCTCCATCCTCTCCATCTTCTGGACAGCGGTCAGCAAGCCCTTCAGTGCCGGGTCCCACTAAACCCAAACCAAAAACCAAACGGTTTCAGCTGCCTCTAGACAAAGGGAATGGCAAGAAGCACAAAGTTTCCCATTTGCGGACCAGTTCTTCTGAAGCACACATTCCAGACCAAGAAACGACATCCCTGACCTCAGGCACAGGGACTCCAGGAGCAGAGGCTGAGCAGCAGGATACAGCTAGCGTGGAGCAGTCCTCCCAGAAGGAGTGTGGGCAACCTGCAGGGCAAGTCGCTGTTCTTCCGGAAGTTCAGGTGACCCAAAATCCAGCAAATGAACAAGAAAGTGCAGAACCTAAAACAGTGGAAGAAGAGGAAAGTAATTTCAGCTCCCCACTGATGCTTTGGCTTCAGCAAGAACAAAAGCGGAAGGAAAGCATTACTGAGAAAAAACCCAAGAAAGGACTTGTTTTTGAAATTTCCAGTGATGATGGCTTTCAGATCTGTGCAGAAAGTATTGAAGATGCCTGGAAGTCATTGACAGATAAAGTCCAGGAAGCTCGATCAAATGCCCGCCTAAAGCAGCTCTCATTTGCAGGTGTTAACGGTTTGAGGATGCTGGGGATTCTCCATGATGCAGTTGTGTTCCTCATTGAGCAGCTGTCTGGTGCCAAGCACTGTCGAAATTACAAATTCCGTTTCCACAAGCCAGAGGAGGCCAATGAACCCCCCTTGAACCCTCACGGCTCAGCCAGGGCTGAAGTCCACCTCAGGAAGTCAGCATTTGACATGTTTAACTTCCTGGCTTCTAAACATCGTCAGCCTCCTGAATACAACCCCAATGATGAAGAAGAGGAGGAGGTACAGCTGAAGTCAGCTCGGAGGGCAACTAGCATGGATCTGCCAATGCCCATGCGCTTCCGGCACTTAAAAAAGACTTCTAAGGAGGCAGTTGGTGTCTACAGGTCTCCCATCCATGGCCGGGGTCTTTTCTGTAAGAGAAACATTGATGCAGGTGAGATGGTGATTGAGTATGCCGGCAACGTCATCCGCTCCATCCAGACTGACAAGCGGGAAAAGTATTACGACAGCAAGGGCATTGGTTGCTATATGTTCCGAATTGATGACTCAGAGGTAGTGGATGCCACCATGCATGGAAATGCTGCACGCTTCATCAATCACTCGTGTGAGCCTAACTGCTATTCTCGGGTCATCAATATTGATGGGCAGAAGCACATTGTCATCTTTGCCATGCGTAAGATCTACCGAGGAGAGGAACTCACTTACGACTATAAGTTCCCCATTGAGGATGCCAGCAACAAGCTGCCCTGCAACTGTGGCGCCAAGAAATGCCGGAAGTTCCTAAACTAAAGCTGCTCTTCTCCCCCAGTGTTGGAGTGCAAGGAGGCGGGGCCAT
CCAAAGCAACGCTGAAGGCCTTTTCCAGCAGCTGGGAGCTCCCGGATTGCGTGGCACAGCTGAGGGGCCTCTGTGATGGCTGAGCTCTCTTATGTCCTATACTCACATCAGACATGTGATCATAGTCCCAGAGACAGAGTTGAGGTCTCGAAGAAAAGATCCATGATCGGCTTTCTCCTGGGGCCCCTCCAATTGTTTACTGTTAGAAAGTGGGAATGGGGTCCCTAGCAGACTTGCCTGGAAGGAGCCTATTATAGAGGGTTGGTTATGTTGGGAGATTGGGCCTGAATTTCTCCACAGAAATAAGTTGCCATCCTCAGGTTGGCCCTTTCCCAAGCACTGTAAGTGAGTGGGTCAGGCAAAGCCCCAAATGGAGGGTTGGTTAGATTCCTGACAGTTTGCCAGCCAGGCCCCACCTACAGCGTCTGTCGAACAAACAGAGGTCTGGTGGTTTTCCCTACTATCCTCCCACTCGAGAGTTCACTTCTGGTTGGGAGACAGGATTCCTAGCACCTCCGGTGTCAAAAGGCTGTCATGGGGTTGTGCCAATTAATTACCAAACATTGAGCCTGCAGGCTTTGAGTGGGAGTGTTGCCCCCAGGAGCCTTATCTCAGCCAATTACCTTTCTTGACAGTAGGAGCGGCTTCCCTCTCCCATTCCCTCTTCACTCCCTTTTCTTCCTTTCCCCTGTCTTCATGCCACTGCTTTCCCATGCTTCTTTCGGGTTGTAGGGGAGACTGACTGCCTGCTCAAGGACACTCCCTGCTGGGCATAGGATGTGCCTGCAAAAAGTTCCCTGAGCCTGTAAGCACTCCAGGTGGGGAAGTGGACAGGAGCCATTGGTCATAACCAGACAGAATTTGGAAACATTTTCATAAAGCTCCATGGAGAGTTTTAAAGAAACATATGTAGCATGATTTTGTAGGAGAGGAAAAAGATTATTTAAATAGGATTTAAATCATGCAACAACGAGAGTATCACAGCCAGGATGACCCTTGGGTCCCATTCCTAAGACATGGTTACTTTATTTTCCCCTTGTTAAGACATAGGAAGACTTAATTTTTAAACGGTCAGTGTCCAGTTGAAGGCAGAACACTAATCAGATTTCAAGGCCCACAACTTGGGGACTAGACCACCTTATGTTGAGGGAACTCTGCCACCTGCGTGCAACCCACAGCTAAAGTAAATTCAATGACACTACTGCCCTGATTACTCCTTAGGATGTGGTCAAAACAGCATCAAATGTTTCTTCTCTTCCTTTCCCCAAGACAGAGTCCTGAACCTGTTAAATTAAGTCATTGGATTTTACTCTGTTCTGTTTACAGTTTACTATTTAAGGTTTTATAAATGTAAATATATTTTGTATATTTTTCTATGAGAAGCACTTCATAGGGAGAAGCACTTATGACAAGGCTATTTTTTAAACCGCGGTATTATCCTAATTTAAAAGAAGATCGGTTTTTAATAATTTTTTATTTTCATAGGATGAAGTTAGAGAAAATATTCAGCTGTACACACAAAGTCTGGTTTTTCCTGCCCAACTTCCCCCTGGAAGGTGTACTTTTTGTTGTTTAATGTGTAGCTTGTTTGTGCCCTGTTGACATAAATGTTTCCTGGGTTTGCTCTTTGACAATAAATGGAGAAGGAAGGTCACCCAACTCCATTGGGCCACTCCCCTCCTTCCCCTATTGAAGCTCC
|
Sequence Source |
Ensembl |
Keyword |
KW-0002--3D-structure KW-0007--Acetylation KW-0025--Alternative splicing KW-0053--Apoptosis KW-0090--Biological rhythms KW-0103--Bromodomain KW-0156--Chromatin regulator KW-0160--Chromosomal rearrangement KW-0181--Complete proteome KW-0903--Direct protein sequencing KW-0238--DNA-binding KW-1017--Isopeptide bond KW-0479--Metal-binding KW-0489--Methyltransferase KW-0539--Nucleus KW-0597--Phosphoprotein KW-0621--Polymorphism KW-0656--Proto-oncogene KW-1185--Reference proteome KW-0677--Repeat KW-0949--S-adenosyl-L-methionine KW-0804--Transcription KW-0805--Transcription regulation KW-0808--Transferase KW-0832--Ubl conjugation KW-0862--Zinc KW-0863--Zinc-finger
|
Interpro |
IPR001487--Bromodomain IPR003889--FYrich_C IPR003888--FYrich_N IPR016569--MeTrfase_trithorax IPR003616--Post-SET_dom IPR001214--SET_dom IPR002857--Znf_CXXC IPR011011--Znf_FYVE_PHD IPR001965--Znf_PHD IPR019787--Znf_PHD-finger IPR013083--Znf_RING/FYVE/PHD
|
PROSITE |
PS50014--BROMODOMAIN_2 PS51543--FYRC PS51542--FYRN PS50868--POST_SET PS50280--SET PS51058--ZF_CXXC PS01359--ZF_PHD_1 PS50016--ZF_PHD_2
|
Pfam |
PF05965--FYRC PF05964--FYRN PF00628--PHD PF00856--SET PF02008--zf-CXXC
|
Gene Ontology |
GO:0005737--C:cytoplasm GO:0035097--C:histone methyltransferase complex GO:0071339--C:MLL1 complex GO:0005654--C:nucleoplasm GO:0005634--C:nucleus GO:0003680--F:AT DNA binding GO:0003682--F:chromatin binding GO:0001046--F:core promoter sequence-specific DNA binding GO:0042800--F:histone methyltransferase activity (H3-K4 specific) GO:0018024--F:histone-lysine N-methyltransferase activity GO:0042802--F:identical protein binding GO:0070577--F:lysine-acetylated histone binding GO:0042803--F:protein homodimerization activity GO:0003700--F:transcription factor activity, sequence-specific DNA binding GO:0044212--F:transcription regulatory region DNA binding GO:0045322--F:unmethylated CpG binding GO:0008270--F:zinc ion binding GO:0009952--P:anterior/posterior pattern specification GO:0006915--P:apoptotic process GO:0032922--P:circadian regulation of gene expression GO:0060216--P:definitive hemopoiesis GO:0006306--P:DNA methylation GO:0035162--P:embryonic hemopoiesis GO:0035640--P:exploration behavior GO:0044648--P:histone H3-K4 dimethylation GO:0051568--P:histone H3-K4 methylation GO:0080182--P:histone H3-K4 trimethylation GO:0043984--P:histone H4-K16 acetylation GO:0048873--P:homeostasis of number of cells within a tissue GO:0051899--P:membrane depolarization GO:0008285--P:negative regulation of cell proliferation GO:0018026--P:peptidyl-lysine monomethylation GO:2001040--P:positive regulation of cellular response to drug GO:0051571--P:positive regulation of histone H3-K4 methylation GO:0045944--P:positive regulation of transcription from RNA polymerase II promoter GO:0045893--P:positive regulation of transcription, DNA-templated GO:0032411--P:positive regulation of transporter activity GO:0009791--P:post-embryonic development GO:0006461--P:protein complex assembly GO:0071440--P:regulation of histone H3-K14 acetylation GO:1901674--P:regulation of histone H3-K27 acetylation GO:2000615--P:regulation of histone H3-K9 acetylation GO:0048172--P:regulation of short-term neuronal synaptic plasticity GO:0035864--P:response to potassium ion GO:0048536--P:spleen development GO:0006366--P:transcription from RNA polymerase II promoter GO:0008542--P:visual learning
|
Orthology |
|
Created Date |
25-Jun-2016 |