WERAM Information


Tag Content
WERAM ID WERAM-Mum-0152
Ensembl Protein ID ENSMUSP00000043874.7
Uniprot Accession Q8BRH4; KMT2C_MOUSE; Q5YLV9; Q8BK12; Q8C6M3; Q923H5; Q923H6
Genbank Protein ID
Protein Name Histone-lysine N-methyltransferase 2C
Genbank Nucleotide ID
Gene Name KMT2C;MLL3
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMUSG00000038056.15 ENSMUST00000045291.13 ENSMUSP00000043874.7
ENSMUSG00000038056.15 ENSMUST00000173174.1 ENSMUSP00000133304.1
ENSMUSG00000038056.15 ENSMUST00000174734.7 ENSMUSP00000133482.1
ENSMUSG00000038056.15 ENSMUST00000172556.7 ENSMUSP00000133941.1
ENSMUSG00000038056.15 ENSMUST00000173073.7 ENSMUSP00000134442.1
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SET1 SET H3K4 K 25925669
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET1 9.40e-45 153 4765 4880
Me_Reader PHD 3.00e-24 86.4 283 4500
Organism Mus musculus
NCBI Taxa ID 10090
Functional Description
(View)
Histone methyltransferase. Methylates 'Lys-4' of histone H3. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. Central component of the MLL2/3 complex, a coactivator complex of nuclear receptors, involved in transcriptional coactivation. KMT2C/MLL3 may be a catalytic subunit of this complex (By similarity).
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+++iek+++viEY+G++ir+eva+++ek ye++++gvy+fr+d+d +v+dat +g+ ar+inhsc+pNc+
ENSMUSP00000043874.7 4765 NVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDND--HVIDATLTGGPARYINHSCAPNCV 4849
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++++ +ki+i ++r+I+kgeel+ydYk
ENSMUSP00000043874.7 4850 AEVVTFERGHKIIISSNRRIQKGEELCYDYK 4880
******************************7 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde.CddwfHlkCvklp 35 
C +C++ ++ + +C+e C + +H+ C +
ENSMUSP00000043874.7 283 RCAFCKHLGAT---IKCCEEkCTQMYHYPCAAGA 313
6****433333...6688889*********8766 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC+ +++ + C +C + +H+ C++++ ++l+ w Cp+Ck
ENSMUSP00000043874.7 342 NCAVCDSPGDL-LDQFFCTTCGQHYHGMCLDIAVTPLKRA-GWQCPECK 388
5****544444.45999****************8888865.7******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C+ C++++e++k m+ Cd+Cd+ +H+ C+++ ++s+p + w C++C+
ENSMUSP00000043874.7 389 VCQNCKQSGEDSK-MLVCDTCDKGYHTFCLQPVMKSVPTN-GWKCKNCR 435
8****88888876.************************77.7******8 PP
PHD.txt 2 tiClvCgkddegeke..mvqCdeCddwfHlkCvklplsslpeg..kswyCpsCk 51
++C++Cgk+++ e + m+ C+ C++w+Hl+C k++ ++l ++ ++++C Ck
ENSMUSP00000043874.7 464 NLCPFCGKCYHPELQkdMLHCNMCKRWVHLECDKPTDQELDSQlkEDYICMYCK 517
68999998888865566*******************77777776567******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
++C+vCg ++g++ ++ C++C + +H +Cv+++ +++ +k w+C +C+
ENSMUSP00000043874.7 952 DMCVVCGSFGQGAEGrLLACSQCGQCYHPYCVSIKITKVVLSKGWRCLECT 1002
68****7544443334*******************999996658******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
t+C +Cgk+ + + ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSMUSP00000043874.7 1002 TVCEACGKATDPGR-LLLCDDCDISYHTYCLDPPLQTVPKG-GWKCKWCV 1049
68999976666655.9*************************.9**99996 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.ks..wyCpsCk 51
+C+vC +++ +e+ ++qC +Cd+w+H+ C +l+ ++ e+ ++ + C C+
ENSMUSP00000043874.7 1080 SCPVCCRNYREEDLILQCRQCDRWMHAVCQNLNTEEEVENvADigFDCSMCR 1131
7***9677777777*******************5444455422458999997 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C++C+k+++ + C C + +H +C + ++ ++k +Cp +k
ENSMUSP00000043874.7 4454 KCVFCHKTGATS----GCHRfrCTNIYHFTCATKAQCMFFKDKTMLCPMHK 4500
6****8888876....6**999**********9996666688789999987 PP

Protein Sequence
(Fasta)
MSSEEDRSAE QQQPPPAPPE EPGAPAPSPA AADKRPRGRP RKDGASPFQR ARKKPRSRGK 60
STVEDEDSMD GLETTETENI VETEIKEQSV EEDAETEVDS SKQPVSALQR SVSEESANSL 120
VSVGVEAKIS EQLCAFCYCG EKSSLGQGDL KQFRVTPGLT LPWKDQPSNK DIDDNSSGTC 180
EKIQNYAPRK QRGQRKERPP QQSAVSCVSV STQTACEDQA GKLWDELSLV GLPDAIDVQA 240
LFDSTGTCWA HHRCVEWSLG ICQMEEPLLV NVDKAVVSGS TERCAFCKHL GATIKCCEEK 300
CTQMYHYPCA AGAGTFQDFS HFFLLCPEHI DQAPERSKED ANCAVCDSPG DLLDQFFCTT 360
CGQHYHGMCL DIAVTPLKRA GWQCPECKVC QNCKQSGEDS KMLVCDTCDK GYHTFCLQPV 420
MKSVPTNGWK CKNCRICIEC GTRSSTQWHH NCLICDTCYQ QQDNLCPFCG KCYHPELQKD 480
MLHCNMCKRW VHLECDKPTD QELDSQLKED YICMYCKHLG AEIDPLHPGN EVEMPELPTD 540
YASGMEIEGT EDEVVFLEQT VNKDVSDHQC RPGIVPDAVQ VYTEEPQKSN PLESPDTVGL 600
ITSESSDNKM NPDLANEIAH EVDTEKTEML SKGRHVCEED QNEDRMEVTE NIEVLPHQTI 660
VPQEDLLLSE DSEVASKELS PPKSAPETAA PEALLSPHSE RSLSCKEPLL TERVQEEMEQ 720
KENSEFSTGC VDFEMTLAVD SCDKDSSCQG DKYVELPAEE ESTFSSATDL NKADVSSSST 780
LCSDLPSCDM LHGYPPAFNS AAGSIMPTTY ISVTPKIGMG KPAITKRKFS PGRPRSKQGA 840
WSNHNTVSPP SWAPDTSEGR EIFKPRQLSG SAIWSIKVGR GSGFPGKRRP RGAGLSGRGG 900
RGRSKLKSGI GAVVLPGVSA ADISSNKDEE ENSMHNTVVL FSSSDKFTLQ QDMCVVCGSF 960
GQGAEGRLLA CSQCGQCYHP YCVSIKITKV VLSKGWRCLE CTVCEACGKA TDPGRLLLCD 1020
DCDISYHTYC LDPPLQTVPK GGWKCKWCVW CRHCGATSAG LRCEWQNNYT QCAPCASLSS 1080
CPVCCRNYRE EDLILQCRQC DRWMHAVCQN LNTEEEVENV ADIGFDCSMC RPYMPVSNVP 1140
SSDCCDSSLV AQIVTKVKEL DPPKTYTQDG VCLTESGMSQ LQSLTVTAPR RKRTKPKLKL 1200
KIINQNSVAV LQTPPDIQSE HSRDGEMDDS REGELMDCDG KSESSPEREA GDDETKGIEG 1260
TDAIKKRKRK PYRPGIGGFM VRQRSRTGQG KAKRSVVRKD SSGSISEQLP SRDDGWREQL 1320
PDTLVDEPVS VAENTDKIKK RYRKRKNKLE ETFPAYLQEA FFGKDLLDTS RQNKLSVDNL 1380
SEDAAQLSFK TGFLDPSSDP LLSSSSTSAK PGTQGTADDP LADISEVLNT DDDILGIISD 1440
DLAKSVDHSD IGPTTADASS LPQPGVSQSS RPLTEEQLDG ILSPELDKMV TDGAILGKLY 1500
KIPELGGKDV EDLFTAVLSP ATTQPAPLPQ PPPPPQLLPM HNQDVFSRMP LMNGLIGPSP 1560
HLPHNSLPPG SGLGTFPAIA QSPYTDVRDK SPAFNAIASD PNSSWAPTTP SMEGENDTLS 1620
NAQRSTLKWE KEEALGEMAT VAPVLYTNIN FPNLKEEFPD WTTRVKQIAK LWRKASSQER 1680
APYVQKARDN RAALRINKVQ MSNDSMKRQQ QQDSIDPSSR IDSDLFKDPL KQRESEHEQE 1740
WKFRQQMRQK SKQQAKIEAT QKLEQVKNEQ QQQQQQQQQQ QQQQLASQHL LVAPGSDTPS 1800
SGAQSPLTPQ AGNGNVSPAQ TFHKDLFSKH LPGTPASTPS DGVFVKPQPP PPPSTPSRIP 1860
VQESLSQSQN SQPPSPQMFS PGSSHSRPPS PVDPYAKMVG TPRPPPGGHS FPRRNSVTPV 1920
ENCVPLSSVP RPIHMNETSA TRPSPARDLC ASSMTNSDPY AKPPDTPRPM MTDQFSKPFS 1980
LPRSPVISEQ STKGPLTTGT SDHFTKPSPR TDAFQRQRLP DPYAGPSLTP APLGNGPFKT 2040
PLHPPPSQDP YGSVSQTSRR LSVDPYERPA LTPRPVDNFS HSQSNDPYSH PPLTPHPAMT 2100
ESFTHASRAF PQPGTISRSA SQDPYSQPPG TPRPLIDSYS QTSGTARSNP DPYSQPPGTP 2160
RPNTIDPYSQ QPPTPRPSPQ TDMFVSSVAN QRHTDPYTHH LGPPRPGISV PYSQPPAVPR 2220
PRTSEGFTRP SSARPALMPN QDPFLQAAQN RVPGLPGPLI RPPDTCSQTP RPPGPGRIDT 2280
FTHASSSAVR DPYDQPPVTP RPHSESFGTS QVVHDLVDRP VPGSEGNFST SSNLPVSSQG 2340
QQFSSVSQLP GPVPTSGGTD TQNTVNMSQA DTEKLRQRQK LREIILQQQQ QKKIASRQEK 2400
GPQDTAVVPH PVPLPHWQPE SINQAFTRPP PPYPGSTRSP VIPPLGPRYA VFPKDQRGPY 2460
PPEVAGMGMR PHGFRFGFPG AGHGPMPSQD RFHVPQQIQG SGIPPHIRRP MSMEMPRPSN 2520
NPPLNNPVGL PQHFPPQGLP VQQHNILGQA FIELRHRAPD GRSRLPFAAS PSSVIESPSH 2580
PRHGNFLPRP DFPGPRHTDP IRQPSQCLSN QLPVHPNLEQ VPPSQQEQGH PAHQSSIVMR 2640
PLNHPLSGEF SEAPLSTSTP AETSPDNLEI AGQSSAGLEE KLDSDDPSVK ELDVKDLEGV 2700
EVKDLDDEDL ENLNLDTEDG KGDDLDTLDN LETNDPNLDD LLRSGEFDII AYTDPELDLG 2760
DKKSMFNEEL DLNVPIDDKL DNQCASVEPK TRDQGDKTMV LEDKDLPQRK SSVSSEIKTE 2820
ALSPYSKEEI QSEIKNHDDS RGDADTACSQ AASAQTNHSD RGKTALLTTD QDMLEKRCNQ 2880
ENAGPVVSAI QGSTPLPARD VMNSCDITGS TPVLSSLLSN EKCDDSDIRP SGSSPPSLPI 2940
SPSTHGSSLP PTLIVPPSPL LDNTVNSNVT VVPRINHAFS QGVPVNPGFI QGQSSVNHNL 3000
GTGKPTNQTV PLTNQSSTMS GPQQLMIPQT LAQQNRERPL LLEEQPLLLQ DLLDQERQEQ 3060
QQQRQMQAMI RQRSEPFFPN IDFDAITDPI MKAKMVALKG INKVMAQNSL GMPPMVMSRF 3120
PFMGPSVAGT QNNDGQTLVP QAVAQDGSIT HQISRPNPPN FGPGFVNDSQ RKQYEEWLQE 3180
TQQLLQMQQK YLEEQIGAHR KSKKALSAKQ RTAKKAGREF PEEDAEQLKH VTEQQSMVQK 3240
QLEQIRKQQK EHAELIEDYR IKQQQQQQQC ALAPPILMPG VQPQPPLVPG ATSLTMSQPN 3300
FPMVPQQLQH QQHTAVISGH TSPARMPSLP GWQSNSASAH LPLNPPRIQP PIAQLSLKTC 3360
TPAPGTVSSA NPQNGPPPRV EFDDNNPFSE SFQERERKER LREQQERQRV QLMQEVDRQR 3420
ALQQRMEMEQ HCLMGAELAN RTPVSQMPFY GSDRPCDFLQ PPRPLQQSPQ HQQQIGPVLQ 3480
QQNVQQGSVN SPPNQTFMQT NEQRQVGPPS FVPDSPSASG GSPNFHSVKP GHGNLPGSSF 3540
QQSPLRPPFT PILPGTSPVA NSNVPCGQDP AVTQGQNYSG SSQSLIQLYS DIIPEEKGKK 3600
KRTRKKKKDD DAESGKAPST PHSDCAAPLT PGLSETTSTP AVSSPSELPQ QRQQEPVEPV 3660
PVPTPNVSAG QPCIESENKL PNSEFIKETS NQQTHVNAEA DKPSVETPNK TEEIKLEKAE 3720
TQPSQEDTKV EEKTGNKIKD IVAGPVSSIQ CPSHPVGTPT TKGDTGNELL KHLLKNKKAS 3780
SLLTQKPEGT LSSDESSTKD GKLIEKQSPA EGLQTLGAQM QGGFGGGNSQ LPKTDGASEN 3840
KKQRSKRTQR TGEKAAPRSK KRKKDEEEKQ AMYSSSDSFT HLKQQNNLSN PPTPPASLPP 3900
TPPPMACQKM ANGFATTEEL AGKAGVLVSH EVARALGPKP FQLPFRPQDD LLARAIAQGP 3960
KTVDVPASLP TPPHNNHEEL RIQDHYGDRD TPDSFVPSSS PESVVGVEVN KYPDLSLVKE 4020
EPPEPVPSPI IPILPSISGK NSESRRNDIK TEPGTLFFTS PFGSSPNGPR SGLISVAITL 4080
HPTAAENISS VVAAFSDLLH VRIPNSYEVS NAPDVPPMGL VSSHRVNPSL EYRQHLLLRG 4140
PPPGSANPPR LATSYRLKQP NVPFPPTSNG LSGYKDSSHG PAEGASLRPQ WCCHCKVVIL 4200
GSGVRKSCKD LTFVNKGSRE NTKRMEKDIV FCSNNCFILY SSAAQAKNSD NKESLPSLPQ 4260
SPMKEPSKAF HQYSNNISTL DVHCLPQFQE KVSPPASPPI SFPPAFEAAK VESKPDELKV 4320
TVKLKPRLRT VPVGLEDCRP LNKKWRGMKW KKWSIHIVIP KGTFKPPCED EIDEFLKKLG 4380
TCLKPDPVPK DCRKCCFCHE EGDGLTDGPA RLLNLDLDLW VHLNCALWST EVYETQAGAL 4440
INVELALRRG LQMKCVFCHK TGATSGCHRF RCTNIYHFTC ATKAQCMFFK DKTMLCPMHK 4500
PKGIHEQQLS YFAVFRRVYV QRDEVRQIAS IVQRGERDHT FRVGSLIFHT IGQLLPQQMQ 4560
AFHSPKALFP VGYEASRLYW STRYANRRCR YLCSIEEKDG RPVFVIRIVE QGHEDLVLSD 4620
SSPKDVWDKI LEPVACVRKK SEMLQLFPAY LKGEDLFGLT VSAVARIAES LPGVEACENY 4680
TFRYGRNPLM ELPLAVNPTG CARSEPKMSA HVKRFVLRPH TLNSTSTSKS FQSTVTGELN 4740
APYSKQFVHS KSSQYRRMKT EWKSNVYLAR SRIQGLGLYA ARDIEKHTMV IEYIGTIIRN 4800
EVANRKEKLY ESQNRGVYMF RMDNDHVIDA TLTGGPARYI NHSCAPNCVA EVVTFERGHK 4860
IIISSNRRIQ KGEELCYDYK FDFEDDQHKI PCHCGAVNCR KWMN 4904
Nucleotide Sequence
(Fasta)
CCTGCTAGCT CCATGTTGCC GCCTCTCCCG GTACCTGCTG CTGCTGCTGC TGCTCCCGGG 60
GCTGCGGGAA ATGCGAGAGG CTGAGCCGGG GAGGAGCAAC TCGAGCAGCA GCAGCGGCGG 120
CGGCTGCGGC CGCTGCGGCG GGAGGAGCCC CCCAGGAGGA GGACGGGGAT CCATGTGTCT 180
TTCCTGGTGA CTAGGATGTC GTCGGAGGAG GACCGGAGCG CGGAGCAGCA GCAGCCGCCG 240
CCAGCACCCC CCGAGGAGCC CGGAGCCCCG GCCCCGAGCC CCGCAGCCGC AGACAAAAGA 300
CCTCGGGGCC GGCCTCGCAA AGATGGCGCT TCCCCTTTCC AGAGAGCCAG AAAGAAGCCT 360
CGAAGTAGAG GAAAATCTAC AGTGGAAGAT GAGGACAGCA TGGATGGACT GGAGACGACG 420
GAAACAGAAA ATATTGTGGA GACAGAAATC AAAGAACAAT CTGTGGAAGA GGATGCTGAA 480
ACAGAAGTGG ATAGCAGCAA ACAGCCAGTC TCAGCTCTTC AGCGGTCTGT GTCTGAGGAA 540
TCTGCAAACT CCCTGGTCTC TGTTGGCGTA GAAGCCAAAA TCAGTGAACA GCTCTGCGCT 600
TTTTGTTATT GTGGAGAGAA AAGTTCCTTA GGACAAGGAG ACTTGAAACA ATTCAGAGTA 660
ACACCTGGAC TTACCTTACC TTGGAAAGAT CAACCTTCTA ACAAGGACAT TGATGACAAC 720
AGTAGTGGGA CCTGTGAGAA AATACAGAAC TACGCTCCAC GAAAACAAAG AGGACAGAGA 780
AAAGAACGAC CTCCCCAGCA GAGTGCCGTT TCTTGTGTAA GTGTAAGCAC CCAGACAGCC 840
TGTGAGGACC AGGCGGGCAA GTTATGGGAT GAACTCAGTC TGGTTGGCCT TCCAGATGCC 900
ATTGATGTCC AAGCCTTATT TGATTCCACA GGCACTTGCT GGGCTCATCA TCGATGTGTA 960
GAGTGGTCAC TGGGAATATG CCAAATGGAA GAACCATTAT TAGTAAACGT GGACAAAGCT 1020
GTTGTCTCAG GGAGCACAGA ACGATGTGCA TTTTGCAAAC ACCTTGGAGC CACTATCAAA 1080
TGCTGTGAAG AGAAGTGTAC CCAGATGTAC CATTACCCAT GTGCTGCAGG GGCTGGCACC 1140
TTTCAGGACT TTAGTCACTT CTTCCTTCTC TGTCCAGAAC ACATCGACCA AGCTCCTGAA 1200
AGATCAAAGG AAGATGCAAA CTGTGCAGTG TGTGACAGCC CGGGAGATCT TTTAGATCAG 1260
TTCTTTTGTA CCACTTGTGG TCAGCACTAC CATGGGATGT GCCTGGATAT AGCTGTTACT 1320
CCACTAAAGC GTGCTGGTTG GCAGTGTCCT GAGTGCAAAG TGTGCCAGAA CTGCAAACAA 1380
TCGGGAGAAG ATAGCAAGAT GCTGGTATGT GATACATGTG ACAAAGGGTA CCATACTTTT 1440
TGTCTTCAAC CTGTTATGAA ATCAGTGCCA ACCAATGGCT GGAAATGCAA AAATTGCAGA 1500
ATCTGTATAG AATGTGGCAC ACGATCTAGT ACTCAGTGGC ACCACAACTG TTTGATATGT 1560
GACACTTGTT ATCAACAGCA GGATAATTTA TGTCCTTTTT GTGGAAAGTG TTACCATCCA 1620
GAGTTACAGA AAGATATGCT GCACTGTAAT ATGTGTAAGA GATGGGTTCA CTTAGAATGT 1680
GACAAACCAA CTGATCAGGA ATTGGATTCT CAGCTCAAAG AAGACTATAT CTGCATGTAT 1740
TGTAAACACT TAGGAGCTGA GATAGATCCC TTACATCCAG GGAATGAAGT GGAGATGCCT 1800
GAACTACCTA CAGATTATGC CAGTGGAATG GAAATTGAAG GTACTGAAGA TGAAGTGGTA 1860
TTTCTGGAGC AGACAGTAAA TAAAGATGTC AGTGATCACC AGTGCAGACC TGGAATTGTT 1920
CCAGATGCAG TTCAAGTGTA CACTGAAGAG CCCCAGAAAA GTAATCCACT AGAAAGCCCT 1980
GACACAGTTG GTCTCATTAC TTCTGAATCA TCTGACAATA AAATGAATCC TGATTTGGCA 2040
AATGAGATTG CTCATGAGGT TGATACTGAG AAAACGGAAA TGCTTTCTAA AGGGCGACAT 2100
GTTTGTGAGG AAGATCAAAA TGAAGACAGA ATGGAAGTGA CAGAAAACAT CGAAGTCCTT 2160
CCACACCAAA CCATTGTGCC ACAAGAGGAC CTGCTGCTGT CAGAGGACTC TGAGGTGGCA 2220
TCTAAAGAAC TAAGCCCTCC AAAATCAGCT CCTGAGACTG CTGCACCAGA AGCCTTACTG 2280
TCCCCACACA GCGAAAGGTC CTTATCTTGT AAGGAACCAT TGTTGACAGA AAGAGTACAA 2340
GAAGAAATGG AACAGAAGGA AAATTCTGAA TTCTCCACAG GATGTGTGGA TTTTGAAATG 2400
ACTCTTGCTG TTGACAGTTG TGACAAAGAT AGTTCATGCC AAGGAGACAA ATATGTAGAG 2460
TTACCAGCTG AGGAAGAATC AACATTCTCT TCAGCAACTG ACCTGAACAA GGCAGATGTG 2520
TCTTCTTCCT CCACACTTTG TTCAGACTTG CCTTCATGTG ACATGCTGCA CGGTTACCCT 2580
CCAGCTTTCA ACTCTGCTGC TGGAAGCATC ATGCCAACAA CATACATCTC AGTCACTCCA 2640
AAAATTGGCA TGGGTAAACC AGCTATTACC AAAAGAAAAT TTTCTCCTGG TAGACCCCGG 2700
TCCAAACAGG GGGCTTGGAG TAACCATAAC ACAGTGAGCC CGCCTTCCTG GGCCCCAGAC 2760
ACTTCAGAAG GCCGGGAAAT TTTTAAACCC AGGCAGCTTT CTGGCAGTGC CATTTGGAGC 2820
ATCAAAGTGG GCCGAGGGTC TGGATTTCCA GGAAAGCGGA GGCCTCGTGG TGCTGGGCTG 2880
TCAGGACGAG GTGGCAGAGG CAGGTCTAAA CTAAAGAGTG GAATTGGAGC TGTTGTATTA 2940
CCTGGGGTGT CTGCTGCAGA TATTTCATCA AATAAAGATG AAGAAGAAAA CTCTATGCAC 3000
AATACAGTTG TTTTGTTTTC TAGTAGTGAC AAGTTCACTT TGCAGCAGGA TATGTGTGTA 3060
GTTTGTGGCA GTTTCGGCCA AGGAGCAGAA GGAAGATTAC TTGCCTGTTC ACAGTGTGGT 3120
CAATGTTACC ATCCATACTG TGTCAGTATT AAGATCACTA AAGTGGTTCT TAGCAAAGGT 3180
TGGAGATGTC TTGAGTGCAC TGTGTGTGAG GCCTGTGGGA AGGCAACTGA CCCTGGAAGA 3240
CTCTTGCTAT GTGATGACTG TGACATAAGC TATCATACCT ACTGCCTAGA CCCTCCATTG 3300
CAGACAGTTC CCAAAGGAGG CTGGAAATGC AAATGGTGTG TTTGGTGCAG ACACTGTGGA 3360
GCAACATCTG CAGGTTTGAG ATGTGAATGG CAGAACAATT ACACACAGTG TGCTCCTTGT 3420
GCCAGCCTGT CTTCCTGCCC TGTCTGCTGT CGGAACTACA GAGAAGAGGA CCTCATTCTG 3480
CAGTGTAGAC AGTGTGATAG GTGGATGCAT GCAGTTTGTC AAAACTTAAA TACTGAGGAG 3540
GAAGTGGAAA ATGTAGCAGA CATTGGTTTT GATTGTAGCA TGTGCAGACC CTATATGCCT 3600
GTGTCTAATG TGCCTTCCTC AGACTGCTGC GACTCTTCTC TTGTGGCACA GATCGTCACA 3660
AAAGTGAAAG AGCTAGACCC CCCTAAGACC TACACCCAGG ATGGTGTGTG TCTGACCGAG 3720
TCAGGGATGA GTCAGCTACA GAGCCTCACA GTCACAGCTC CCAGGAGAAA ACGCACAAAA 3780
CCAAAACTGA AATTGAAGAT TATAAATCAG AATAGTGTGG CTGTCCTTCA GACCCCTCCA 3840
GACATCCAGT CAGAACATTC AAGAGATGGT GAAATGGATG ATAGTCGAGA AGGAGAACTT 3900
ATGGATTGTG ATGGAAAATC TGAATCTAGT CCTGAGCGGG AAGCTGGAGA TGATGAAACT 3960
AAGGGAATAG AAGGAACAGA TGCCATAAAA AAGAGGAAAA GAAAACCATA CAGACCAGGT 4020
ATTGGTGGAT TTATGGTACG ACAAAGAAGT CGAACAGGAC AAGGAAAAGC CAAAAGATCT 4080
GTGGTTAGAA AAGATTCTTC CGGCTCTATT TCTGAGCAGT TACCTAGCAG AGATGATGGC 4140
TGGAGGGAGC AGTTACCAGA TACCCTAGTT GATGAGCCTG TTTCTGTTGC TGAGAACACT 4200
GACAAAATAA AGAAGAGATA CCGAAAAAGG AAAAATAAGC TTGAAGAAAC CTTTCCTGCT 4260
TATTTACAAG AAGCATTTTT TGGAAAAGAT CTTCTAGATA CCAGCAGACA AAACAAGCTG 4320
AGTGTAGATA ATCTGTCAGA AGATGCAGCT CAGCTTTCAT TTAAAACAGG TTTCTTGGAT 4380
CCTTCCTCAG ATCCGCTACT GAGTTCATCT TCTACTTCAG CAAAACCTGG AACTCAGGGT 4440
ACTGCTGATG ACCCATTAGC TGATATTTCT GAAGTCTTAA ACACAGATGA TGACATTCTT 4500
GGAATAATCT CAGATGATCT TGCAAAATCA GTTGATCATT CAGATATCGG CCCCACTACT 4560
GCTGATGCCT CCTCATTGCC TCAGCCAGGT GTCAGTCAGA GTTCACGGCC ATTAACTGAA 4620
GAGCAGCTAG ATGGGATCCT CAGTCCTGAG CTAGACAAAA TGGTCACAGA TGGAGCAATT 4680
CTTGGAAAGT TATATAAAAT TCCAGAGCTT GGAGGAAAGG ATGTTGAAGA CTTGTTTACT 4740
GCTGTACTTA GCCCTGCTAC CACTCAGCCA GCTCCATTAC CACAGCCTCC CCCTCCACCA 4800
CAGCTATTGC CAATGCACAA TCAGGATGTT TTTTCCCGGA TGCCACTCAT GAATGGCCTT 4860
ATTGGACCCA GTCCTCACCT TCCACATAAT TCTTTGCCTC CTGGAAGTGG ATTGGGGACT 4920
TTCCCTGCTA TAGCACAATC CCCTTATACT GATGTCAGGG ATAAAAGTCC AGCATTTAAT 4980
GCAATTGCAA GTGATCCTAA CAGCTCCTGG GCACCAACAA CTCCAAGTAT GGAAGGAGAA 5040
AATGACACCC TGTCAAATGC ACAGAGGAGC ACATTGAAGT GGGAGAAAGA GGAGGCTCTT 5100
GGTGAGATGG CAACAGTAGC ACCAGTTCTC TACACAAATA TTAATTTCCC CAATTTAAAG 5160
GAAGAATTCC CAGATTGGAC TACTCGAGTG AAACAAATTG CCAAGTTGTG GAGAAAAGCC 5220
AGCTCTCAGG AAAGAGCACC ATATGTGCAA AAAGCCAGAG ATAACAGGGC TGCTTTACGC 5280
ATAAATAAAG TTCAGATGTC AAACGATTCT ATGAAGAGGC AACAACAGCA AGACAGCATC 5340
GATCCCAGCT CACGCATCGA TTCGGATCTT TTTAAAGATC CTTTAAAGCA GAGAGAATCA 5400
GAGCATGAAC AGGAATGGAA GTTTAGACAG CAAATGCGTC AGAAAAGTAA GCAACAAGCT 5460
AAAATTGAAG CCACACAGAA GCTGGAACAA GTGAAGAATG AGCAGCAGCA GCAGCAGCAG 5520
CAGCAGCAGC AACAGCAGCA GCAGCAGCTT GCTTCTCAGC ACCTTCTGGT AGCACCTGGT 5580
TCAGATACTC CAAGTAGTGG AGCACAGAGT CCCTTGACAC CTCAGGCTGG CAATGGGAAT 5640
GTGTCTCCTG CACAGACATT CCATAAAGAC CTATTCTCCA AACACCTGCC TGGTACCCCT 5700
GCTTCCACAC CTTCAGATGG TGTGTTTGTC AAACCACAAC CTCCACCCCC TCCTTCAACC 5760
CCATCCCGTA TACCTGTTCA GGAAAGTCTT TCTCAGTCCC AGAATTCTCA GCCACCTTCT 5820
CCACAGATGT TCTCACCTGG ATCATCCCAT TCCAGACCAC CATCTCCAGT GGATCCTTAT 5880
GCAAAAATGG TTGGTACTCC TAGACCACCT CCTGGTGGAC ATAGTTTTCC AAGAAGAAAT 5940
TCTGTTACAC CAGTGGAAAA CTGTGTGCCT TTATCTTCAG TACCTAGGCC CATTCATATG 6000
AATGAAACAT CAGCCACAAG GCCATCCCCA GCCAGAGACT TATGTGCTTC CTCCATGACA 6060
AACAGTGACC CCTATGCAAA GCCTCCAGAT ACACCTAGGC CCATGATGAC AGATCAGTTT 6120
TCAAAACCTT TTAGCCTGCC TAGGTCCCCT GTGATTTCAG AACAAAGTAC AAAAGGTCCT 6180
CTAACAACTG GAACCAGTGA TCACTTTACT AAACCATCTC CTAGAACAGA TGCCTTTCAA 6240
AGGCAACGGC TACCTGATCC CTATGCAGGA CCGTCATTGA CACCTGCCCC ATTGGGTAAT 6300
GGGCCTTTTA AGACCCCACT GCACCCTCCT CCATCTCAGG ATCCATATGG ATCTGTGTCA 6360
CAGACATCAA GACGACTCTC TGTTGACCCT TATGAAAGGC CTGCCTTGAC ACCAAGGCCA 6420
GTAGATAATT TTTCTCATAG TCAGTCAAAT GATCCATATA GCCACCCTCC CCTAACTCCA 6480
CACCCAGCAA TGACTGAATC TTTTACTCAT GCTTCAAGGG CTTTTCCTCA GCCTGGAACC 6540
ATATCAAGGT CAGCATCTCA GGACCCATAT TCACAACCCC CAGGAACTCC ACGCCCTCTT 6600
ATAGATTCTT ATTCCCAAAC CTCAGGAACA GCTCGATCCA ATCCAGATCC TTATTCCCAA 6660
CCTCCTGGTA CTCCCCGGCC TAACACTATT GATCCATATA GTCAGCAGCC ACCTACCCCC 6720
AGGCCTTCTC CACAGACAGA CATGTTTGTT TCATCTGTGG CAAATCAGAG ACACACTGAT 6780
CCATATACTC ATCATCTTGG GCCTCCAAGA CCTGGAATTT CTGTTCCCTA TTCTCAGCCA 6840
CCAGCAGTAC CAAGGCCAAG GACTTCAGAG GGTTTTACTA GGCCCTCCAG TGCAAGACCA 6900
GCCCTCATGC CAAACCAGGA TCCTTTTTTG CAAGCAGCAC AAAACCGAGT ACCAGGTTTA 6960
CCTGGCCCTT TGATAAGGCC ACCTGATACA TGCTCCCAGA CTCCCAGGCC ACCTGGGCCT 7020
GGCCGTATAG ACACATTCAC TCATGCTTCC TCATCTGCTG TTCGTGATCC ATATGATCAG 7080
CCTCCAGTGA CTCCCAGGCC TCATTCTGAG TCTTTCGGAA CTAGTCAAGT TGTTCACGAT 7140
CTTGTTGACC GTCCAGTTCC TGGGTCAGAG GGAAACTTTA GCACGTCTTC AAACCTTCCT 7200
GTAAGCTCCC AAGGGCAGCA GTTCTCCAGT GTCTCCCAGC TTCCTGGACC CGTGCCAACC 7260
TCAGGAGGAA CTGATACACA GAACACTGTA AACATGTCTC AAGCTGACAC AGAGAAACTG 7320
AGACAGCGGC AGAAACTGCG TGAAATCATT CTCCAGCAAC AACAGCAGAA GAAGATTGCT 7380
AGTCGCCAGG AGAAGGGGCC TCAGGATACA GCAGTAGTAC CTCACCCAGT GCCCCTTCCA 7440
CACTGGCAGC CAGAGAGCAT CAACCAGGCT TTCACTCGAC CTCCACCTCC CTATCCTGGG 7500
AGCACTCGAT CACCTGTTAT CCCTCCACTA GGACCTAGAT ATGCAGTTTT CCCCAAAGAT 7560
CAGCGTGGAC CCTATCCTCC AGAGGTTGCT GGTATGGGCA TGAGGCCTCA TGGATTCAGA 7620
TTTGGATTTC CAGGAGCTGG CCATGGTCCC ATGCCAAGCC AAGATCGCTT CCATGTGCCT 7680
CAGCAAATAC AAGGATCTGG AATTCCTCCA CACATAAGAA GACCAATGTC TATGGAAATG 7740
CCTAGACCTT CAAATAACCC ACCATTAAAT AATCCAGTTG GGCTTCCTCA GCATTTCCCA 7800
CCACAGGGTC TGCCAGTTCA GCAACATAAT ATACTAGGCC AGGCATTTAT TGAGTTGAGG 7860
CATAGAGCCC CTGATGGAAG GTCACGGTTG CCATTTGCTG CTTCTCCTAG CAGTGTTATA 7920
GAGTCACCTT CACATCCAAG GCATGGAAAT TTCCTTCCCA GACCTGATTT TCCTGGCCCT 7980
AGACACACAG ACCCCATAAG ACAGCCTTCC CAGTGTCTAT CCAATCAGCT ACCTGTGCAT 8040
CCAAATTTAG AGCAGGTCCC ACCTTCTCAG CAAGAGCAAG GTCATCCTGC TCACCAATCT 8100
TCTATTGTCA TGAGGCCCCT AAATCATCCT TTAAGTGGTG AATTTTCTGA GGCACCTTTG 8160
TCAACATCTA CCCCAGCTGA AACATCACCA GATAACTTAG AGATAGCTGG TCAGTCTTCT 8220
GCTGGCCTGG AAGAAAAACT AGACTCTGAT GATCCTTCTG TGAAAGAACT GGATGTGAAA 8280
GATCTTGAGG GGGTTGAAGT CAAAGATTTG GATGATGAAG ATCTAGAAAA TTTAAATTTA 8340
GACACAGAGG ATGGCAAGGG TGATGACCTG GACACTTTAG ACAATTTGGA AACTAATGAC 8400
CCTAACCTGG ATGACCTCCT AAGGTCAGGT GAATTTGATA TCATTGCATA CACAGATCCA 8460
GAACTTGACT TAGGGGATAA AAAAAGCATG TTCAATGAAG AATTAGACCT TAATGTTCCA 8520
ATTGATGATA AGCTAGATAA TCAGTGTGCA TCTGTTGAAC CAAAAACAAG GGATCAAGGA 8580
GACAAAACTA TGGTTCTTGA AGATAAGGAT TTGCCACAGA GAAAGTCCAG TGTTAGCAGT 8640
GAGATAAAGA CAGAAGCCCT GTCTCCCTAC TCTAAAGAAG AAATACAGAG TGAGATTAAG 8700
AACCATGATG ACAGTAGAGG TGATGCGGAT ACTGCGTGCT CACAGGCTGC TTCTGCTCAG 8760
ACCAATCACA GTGACAGAGG AAAGACTGCT CTATTGACTA CTGATCAAGA TATGCTTGAG 8820
AAAAGATGTA ACCAGGAGAA TGCTGGGCCT GTTGTCAGTG CCATTCAAGG GTCCACTCCT 8880
CTGCCTGCTC GGGATGTGAT GAACTCCTGT GACATAACAG GATCGACTCC AGTTCTCTCG 8940
AGTTTACTTT CTAATGAGAA GTGTGATGAT TCAGACATTA GGCCTTCAGG GTCCTCCCCA 9000
CCAAGTCTGC CCATCTCACC ATCCACTCAT GGGTCAAGTT TGCCTCCTAC TTTAATAGTA 9060
CCACCTAGCC CTCTTTTGGA TAATACCGTG AATTCTAATG TAACAGTGGT CCCTAGGATA 9120
AACCATGCTT TTTCTCAGGG TGTGCCAGTA AATCCAGGAT TCATTCAGGG CCAATCATCA 9180
GTGAACCATA ATTTAGGGAC AGGGAAACCT ACAAATCAAA CTGTGCCTCT CACGAATCAG 9240
TCCAGCACCA TGTCTGGACC GCAGCAGCTC ATGATTCCTC AAACATTAGC CCAGCAGAAC 9300
AGAGAGAGGC CCCTCCTTCT AGAGGAACAG CCTCTGCTTC TACAAGATCT TTTGGATCAA 9360
GAGAGGCAGG AGCAGCAACA GCAAAGACAA ATGCAAGCCA TGATTCGTCA GCGGTCAGAA 9420
CCATTCTTCC CTAACATTGA TTTTGATGCT ATTACAGATC CTATAATGAA AGCGAAAATG 9480
GTAGCCCTTA AAGGCATAAA TAAAGTGATG GCACAGAACA GTCTGGGCAT GCCACCAATG 9540
GTGATGAGCA GATTCCCCTT CATGGGCCCA TCAGTGGCTG GAACACAAAA CAATGACGGC 9600
CAGACCCTAG TGCCACAAGC TGTAGCTCAG GATGGCAGTA TAACACATCA GATTTCTAGG 9660
CCTAATCCTC CAAATTTTGG TCCAGGCTTT GTCAATGACT CTCAGCGTAA GCAGTATGAA 9720
GAATGGCTAC AGGAGACTCA GCAGCTGCTT CAGATGCAGC AGAAGTATCT CGAAGAACAA 9780
ATTGGTGCAC ACAGAAAATC TAAGAAGGCT CTTTCAGCTA AACAGCGCAC AGCCAAGAAG 9840
GCAGGGCGGG AGTTCCCAGA AGAGGACGCA GAGCAACTCA AGCATGTTAC TGAGCAGCAG 9900
AGCATGGTTC AGAAACAGCT TGAGCAGATT CGGAAACAAC AGAAAGAGCA TGCTGAGCTG 9960
ATTGAAGATT ATCGGATCAA ACAGCAGCAG CAGCAGCAAC AGTGTGCCCT AGCCCCTCCC 10020
ATCCTCATGC CAGGGGTTCA GCCCCAGCCA CCTCTAGTTC CAGGTGCCAC TTCACTTACC 10080
ATGAGCCAAC CCAACTTTCC CATGGTGCCA CAGCAGCTTC AGCACCAGCA GCACACAGCA 10140
GTCATCTCAG GGCATACCAG CCCTGCGAGA ATGCCCAGTT TACCTGGATG GCAATCTAAC 10200
AGTGCTTCTG CTCACCTCCC CCTTAATCCT CCTAGAATTC AGCCCCCAAT TGCCCAACTA 10260
TCTTTAAAAA CTTGTACACC AGCCCCAGGG ACAGTGTCAA GTGCAAATCC ACAGAATGGA 10320
CCACCACCTC GAGTGGAATT TGATGACAAC AATCCTTTCA GTGAAAGTTT TCAAGAGCGA 10380
GAGAGGAAGG AACGCTTACG AGAACAGCAG GAAAGACAGC GAGTTCAACT GATGCAAGAA 10440
GTAGACAGAC AGAGAGCTCT GCAGCAGCGG ATGGAGATGG AGCAGCATTG TCTGATGGGT 10500
GCTGAGCTAG CCAACAGGAC ACCTGTTTCC CAGATGCCAT TCTATGGTTC TGACAGACCT 10560
TGTGACTTTT TGCAACCTCC ACGACCCCTT CAGCAGTCTC CACAACACCA GCAACAAATA 10620
GGGCCAGTTT TACAGCAGCA GAATGTCCAA CAAGGGTCTG TTAACTCACC CCCAAACCAA 10680
ACTTTCATGC AAACCAATGA GCAAAGGCAG GTAGGGCCTC CCTCCTTTGT TCCAGATTCA 10740
CCATCTGCTT CTGGTGGGAG CCCAAACTTT CATTCTGTTA AGCCGGGACA TGGAAATCTT 10800
CCTGGGTCCA GCTTTCAGCA GTCTCCACTG AGGCCTCCAT TCACACCAAT TTTACCAGGA 10860
ACGTCTCCAG TAGCTAATAG CAATGTCCCT TGTGGCCAAG ACCCTGCTGT AACACAGGGA 10920
CAGAATTATT CAGGATCCAG CCAGTCTCTC ATTCAGTTAT ATTCTGACAT AATTCCAGAA 10980
GAAAAGGGGA AAAAGAAAAG AACAAGAAAA AAGAAAAAAG ATGATGATGC AGAATCCGGC 11040
AAAGCACCGT CAACTCCCCA CTCTGACTGC GCTGCTCCAC TAACCCCAGG CCTCTCAGAA 11100
ACTACCTCCA CTCCTGCGGT GAGCTCACCC AGTGAGCTCC CTCAGCAAAG ACAACAGGAG 11160
CCGGTGGAGC CAGTGCCTGT ACCCACTCCA AATGTGTCAG CAGGCCAGCC TTGCATAGAG 11220
TCAGAAAACA AACTTCCCAA CAGTGAATTC ATAAAAGAAA CTTCAAATCA ACAAACACAT 11280
GTGAATGCAG AGGCAGACAA GCCTTCCGTG GAAACCCCTA ACAAAACTGA AGAAATAAAG 11340
TTGGAAAAGG CTGAGACACA GCCAAGTCAG GAGGATACCA AAGTGGAAGA GAAAACTGGT 11400
AATAAGATCA AAGACATTGT AGCTGGTCCT GTCTCCTCAA TACAGTGTCC TTCCCATCCT 11460
GTCGGAACCC CTACTACCAA AGGAGATACA GGAAATGAGC TATTGAAGCA CTTGTTAAAA 11520
AATAAGAAGG CCTCTTCCCT TCTAACTCAG AAACCTGAGG GCACTTTATC TTCAGATGAA 11580
AGTTCTACAA AGGATGGTAA ACTGATTGAG AAGCAGAGTC CAGCAGAAGG ATTGCAAACT 11640
TTGGGGGCTC AAATGCAAGG TGGTTTTGGA GGTGGCAACA GCCAGTTGCC AAAAACAGAT 11700
GGAGCAAGTG AAAACAAGAA ACAGCGAAGC AAACGGACTC AAAGGACGGG GGAAAAAGCA 11760
GCACCTCGCT CAAAGAAAAG GAAGAAGGAT GAAGAGGAAA AGCAGGCTAT GTACTCCAGC 11820
TCTGACTCCT TCACCCACTT GAAACAGCAG AATAATTTAA GTAATCCTCC AACACCCCCT 11880
GCCTCTCTTC CTCCTACACC ACCTCCTATG GCTTGCCAGA AGATGGCAAA TGGTTTTGCA 11940
ACGACTGAAG AACTTGCTGG AAAAGCTGGC GTGTTGGTGA GCCATGAAGT TGCCAGAGCT 12000
TTAGGACCGA AGCCATTTCA GCTGCCTTTC AGACCTCAGG ATGACTTGCT GGCTCGAGCT 12060
ATTGCTCAAG GCCCGAAGAC TGTGGATGTT CCTGCCTCAC TTCCAACACC GCCTCATAAT 12120
AATCATGAAG AATTAAGGAT ACAGGACCAC TATGGTGACC GGGACACTCC CGATAGTTTC 12180
GTCCCCTCCT CTTCTCCTGA GAGTGTGGTT GGTGTGGAGG TGAACAAGTA CCCAGATCTG 12240
TCACTGGTGA AAGAGGAGCC TCCAGAACCT GTGCCGTCCC CCATCATCCC CATTCTTCCC 12300
AGCATATCCG GGAAAAATTC AGAATCAAGA AGAAATGACA TCAAAACTGA GCCAGGCACT 12360
TTATTTTTTA CTTCACCTTT TGGTTCATCC CCAAATGGTC CCAGATCGGG TCTTATATCT 12420
GTAGCGATCA CTCTGCATCC TACAGCTGCT GAGAACATTA GCAGCGTTGT TGCTGCGTTT 12480
TCTGACCTTC TTCACGTGAG AATTCCTAAC AGCTATGAGG TTAGTAATGC TCCAGATGTT 12540
CCACCCATGG GTTTGGTCAG TAGCCACAGA GTAAACCCAA GTTTGGAGTA TCGGCAGCAT 12600
TTGCTTCTTC GTGGGCCTCC ACCAGGATCT GCAAATCCTC CCAGATTAGC AACCTCTTAC 12660
CGGTTGAAGC AACCTAATGT ACCATTTCCT CCAACAAGCA ATGGTCTTTC TGGGTATAAA 12720
GACTCTAGTC ATGGTCCAGC AGAAGGTGCG TCGCTCCGAC CACAGTGGTG CTGCCACTGT 12780
AAGGTGGTTA TTCTTGGAAG TGGTGTGCGG AAGTCATGCA AGGATCTGAC CTTTGTGAAC 12840
AAGGGTTCCC GAGAGAACAC CAAAAGGATG GAAAAGGATA TTGTCTTTTG TAGTAATAAC 12900
TGCTTTATTC TTTATTCATC AGCTGCACAA GCAAAAAACT CAGACAACAA GGAATCCCTT 12960
CCGTCACTGC CACAGTCCCC TATGAAGGAG CCTTCCAAAG CATTTCACCA GTATAGCAAC 13020
AACATCTCCA CTTTGGATGT GCACTGTCTC CCTCAGTTCC AGGAAAAGGT TTCCCCTCCT 13080
GCATCACCTC CCATATCCTT CCCTCCAGCC TTTGAAGCAG CCAAAGTCGA GTCCAAGCCT 13140
GATGAGCTTA AGGTAACGGT CAAGTTAAAG CCTCGGCTGA GGACTGTTCC TGTTGGGCTT 13200
GAAGACTGTA GACCACTGAA TAAAAAGTGG AGAGGAATGA AGTGGAAGAA ATGGAGCATT 13260
CATATTGTCA TCCCCAAGGG GACCTTTAAG CCACCTTGTG AGGATGAAAT AGATGAGTTT 13320
CTAAAGAAAT TGGGCACTTG TCTTAAACCT GACCCTGTGC CCAAAGACTG TCGGAAGTGC 13380
TGCTTTTGTC ATGAGGAAGG GGACGGGCTG ACAGATGGGC CAGCACGGCT GCTCAACCTG 13440
GACCTGGACC TCTGGGTCCA CCTGAACTGT GCTCTGTGGT CTACAGAGGT TTATGAAACA 13500
CAGGCTGGTG CCTTAATAAA TGTGGAGCTA GCGCTGAGGA GAGGACTACA AATGAAGTGT 13560
GTCTTCTGTC ATAAGACAGG TGCCACCAGT GGATGTCACA GATTCCGATG TACCAACATT 13620
TATCATTTTA CTTGCGCCAC TAAAGCACAA TGCATGTTTT TTAAGGACAA AACAATGCTT 13680
TGCCCCATGC ACAAACCAAA GGGAATCCAC GAGCAACAGT TAAGTTACTT TGCAGTCTTC 13740
AGGAGGGTCT ATGTGCAACG AGATGAGGTG CGGCAGATTG CTAGTATCGT GCAGCGGGGA 13800
GAGCGGGACC ATACCTTTCG TGTTGGGAGC CTCATCTTCC ACACCATTGG CCAGCTGCTT 13860
CCACAACAGA TGCAAGCATT CCACTCTCCG AAAGCACTCT TCCCAGTGGG CTATGAAGCC 13920
AGCCGGTTAT ACTGGAGCAC CCGCTATGCC AACCGACGCT GCCGGTACCT GTGCTCCATT 13980
GAGGAGAAAG ATGGGCGGCC TGTGTTTGTC ATCAGGATTG TAGAGCAAGG CCATGAGGAC 14040
CTGGTCTTAA GTGATTCATC ACCTAAAGAT GTTTGGGATA AAATTTTGGA GCCTGTGGCT 14100
TGTGTGAGAA AAAAATCTGA AATGCTGCAG CTTTTCCCTG CGTATTTGAA AGGAGAAGAC 14160
CTGTTTGGCC TGACTGTCTC TGCAGTAGCA CGGATAGCTG AATCACTTCC TGGGGTTGAG 14220
GCATGTGAAA ATTATACCTT CCGATATGGC CGTAATCCTC TCATGGAGCT TCCCCTTGCC 14280
GTGAACCCCA CAGGTTGTGC CCGTTCTGAA CCTAAAATGA GCGCCCATGT CAAGAGGTTT 14340
GTGTTAAGGC CTCACACCTT GAATAGCACC AGCACTTCAA AGTCATTTCA GAGCACAGTC 14400
ACTGGAGAGC TGAATGCACC CTACAGTAAG CAGTTTGTCC ACTCCAAGTC ATCACAGTAC 14460
CGGAGAATGA AGACTGAATG GAAATCTAAT GTGTATCTGG CCCGATCTCG GATCCAGGGA 14520
CTGGGCCTGT ATGCTGCTAG AGACATTGAA AAACACACTA TGGTCATCGA GTACATTGGA 14580
ACAATTATTC GAAATGAGGT TGCAAACCGG AAGGAGAAGC TTTATGAGTC TCAGAACCGG 14640
GGTGTGTACA TGTTCCGCAT GGACAATGAC CACGTGATTG ACGCCACACT CACAGGAGGG 14700
CCTGCAAGAT ACATCAACCA CTCCTGTGCC CCTAACTGTG TGGCTGAAGT CGTGACCTTT 14760
GAGAGAGGAC ACAAGATTAT CATCAGCTCC AACCGGAGGA TACAGAAAGG AGAAGAGCTC 14820
TGCTATGATT ATAAGTTTGA CTTTGAAGAT GACCAGCACA AGATTCCGTG TCACTGTGGA 14880
GCGGTGAACT GCCGGAAGTG GATGAACTGA ATGCATTCCT TGCTATCTCA CCGGGTGGCT 14940
CGTCCCTAGG AAGAGGCGAT TCAACAC 14968
Sequence Source Ensembl
Keyword

KW-0007--Acetylation
KW-0010--Activator
KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0175--Coiled coil
KW-0181--Complete proteome
KW-0238--DNA-binding
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger
--

Interpro

IPR003889--FYrich_C
IPR003888--FYrich_N
IPR009071--HMG_box_dom
IPR000637--HMGI/Y_DNA-bd_CS
IPR003616--Post-SET_dom
IPR001214--SET_dom
IPR001594--Znf_DHHC_palmitoyltrfase
IPR011011--Znf_FYVE_PHD
IPR001965--Znf_PHD
IPR019787--Znf_PHD-finger
IPR001841--Znf_RING
IPR013083--Znf_RING/FYVE/PHD

PROSITE

PS51543--FYRC
PS51542--FYRN
PS00354--HMGI_Y
PS50868--POST_SET
PS50280--SET
PS50216--ZF_DHHC
PS01359--ZF_PHD_1
PS50016--ZF_PHD_2
PS50089--ZF_RING_2

Pfam

PF05965--FYRC
PF05964--FYRN
PF00628--PHD
PF00856--SET

Gene Ontology

GO:0035097--C:histone methyltransferase complex
GO:0044666--C:MLL3/4 complex
GO:0005654--C:nucleoplasm
GO:0005634--C:nucleus
GO:0003677--F:DNA binding
GO:0042800--F:histone methyltransferase activity (H3-K4 specific)
GO:0044822--F:poly(A) RNA binding
GO:0008270--F:zinc ion binding
GO:0061029--P:eyelid development in camera-type eye
GO:0051568--P:histone H3-K4 methylation
GO:0016571--P:histone methylation
GO:0035264--P:multicellular organism growth
GO:0048146--P:positive regulation of fibroblast proliferation
GO:0010468--P:regulation of gene expression
GO:0006355--P:regulation of transcription, DNA-templated
GO:0007338--P:single fertilization
GO:0006351--P:transcription, DNA-templated

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0018 ENSP00000347325.3 Homo sapiens 85 0.0 7113
WERAM-Ict-0124 ENSSTOP00000012342.2 Ictidomys tridecemlineatus 85 0.0 7105
WERAM-Pat-0168 ENSPTRP00000046674.3 Pan troglodytes 85 0.0 7059
WERAM-Aim-0154 ENSAMEP00000014067.1 Ailuropoda melanoleuca 83 0.0 6871
WERAM-Caf-0059 ENSCAFP00000007370.4 Canis familiaris 83 0.0 6859
WERAM-Paa-0005 ENSPANP00000006150.1 Papio anubis 83 0.0 6846
WERAM-Gog-0210 ENSGGOP00000027941.1 Gorilla gorilla 83 0.0 6802
WERAM-Cap-0018 ENSCPOP00000001682.2 Cavia porcellus 81 0.0 6765
WERAM-Fec-0096 ENSFCAP00000008002.3 Felis catus 81 0.0 6667
WERAM-Myl-0122 ENSMLUP00000010086.2 Myotis lucifugus 80 0.0 6621
WERAM-Tut-0198 ENSTTRP00000016174.1 Tursiops truncatus 80 0.0 6593
WERAM-Bot-0193 ENSBTAP00000028347.5 Bos taurus 80 0.0 6580
WERAM-Ova-0045 ENSOARP00000005594.1 Ovis aries 79 0.0 6433
WERAM-Mup-0098 ENSMPUP00000009152.1 Mustela putorius furo 81 0.0 6416
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 71 0.0 5719
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 71 0.0 5654
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 71 0.0 5449
WERAM-Tag-0008 ENSTGUP00000000641.1 Taeniopygia guttata 70 0.0 5331
WERAM-Chs-0076 ENSCSAP00000002864.1 Chlorocebus sabaeus 83 0.0 5242
WERAM-Nol-0046 ENSNLEP00000005663.2 Nomascus leucogenys 82 0.0 5135
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 84 0.0 5100
WERAM-Anc-0148 ENSACAP00000014142.2 Anolis carolinensis 65 0.0 5082
WERAM-Orc-0115 ENSOCUP00000009766.3 Oryctolagus cuniculus 80 0.0 5058
WERAM-Eqc-0189 ENSECAP00000020200.1 Equus caballus 82 0.0 4960
WERAM-Mam-0056 ENSMMUP00000009467.2 Macaca mulatta 78 0.0 4553
WERAM-Lac-0079 ENSLACP00000010253.1 Latimeria chalumnae 60 0.0 4491
WERAM-Dio-0019 ENSDORP00000002189.1 Dipodomys ordii 80 0.0 4461
WERAM-Poa-0169 ENSPPYP00000020408.2 Pongo abelii 84 0.0 4426
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 70 0.0 4274
WERAM-Sah-0130 ENSSHAP00000013860.1 Sarcophilus harrisii 71 0.0 4233
WERAM-Otg-0078 ENSOGAP00000005885.2 Otolemur garnettii 79 0.0 4160
WERAM-Caj-0209 ENSCJAP00000036628.3 Callithrix jacchus 83 0.0 3830
WERAM-Loa-0084 ENSLAFP00000006640.4 Loxodonta africana 80 0.0 2851
WERAM-Mim-0010 ENSMICP00000000977.1 Microcebus murinus 78 0.0 2495
WERAM-Ptv-0086 ENSPVAP00000007862.1 Pteropus vampyrus 74 0.0 2446
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 75 0.0 2365
WERAM-Asm-0011 ENSAMXP00000001840.1 Astyanax mexicanus 46 0.0 2206
WERAM-Sus-0158 ENSSSCP00000023447.1 Sus scrofa 82 0.0 2160
WERAM-Tar-0071 ENSTRUP00000014027.1 Takifugu rubripes 45 0.0 2090
WERAM-Leo-0127 ENSLOCP00000015481.1 Lepisosteus oculatus 52 0.0 2062
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 77 0.0 2061
WERAM-Mod-0040 ENSMODP00000005845.3 Monodelphis domestica 74 0.0 2047
WERAM-Fia-0086 ENSFALP00000007141.1 Ficedula albicollis 74 0.0 1996
WERAM-Vip-0013 ENSVPAP00000001498.1 Vicugna pacos 76 0.0 1738
WERAM-Orla-0181 ENSORLP00000020984.1 Oryzias latipes 47 0.0 1696
WERAM-Dar-0184 ENSDARP00000115827.2 Danio rerio 46 0.0 1688
WERAM-Xet-0065 ENSXETP00000021458.2 Xenopus tropicalis 63 0.0 1599
WERAM-Prc-0090 ENSPCAP00000008256.1 Procavia capensis 77 0.0 1542
WERAM-Ran-0259 ENSRNOP00000072878.1 Rattus norvegicus 91 0.0 1353
WERAM-Gaa-0138 ENSGACP00000017696.1 Gasterosteus aculeatus 48 0.0 1349
WERAM-Ocp-0124 ENSOPRP00000012666.1 Ochotona princeps 75 0.0 1303
WERAM-Pof-0064 ENSPFOP00000005925.2 Poecilia formosa 47 0.0 1295
WERAM-Xim-0205 ENSXMAP00000016470.1 Xiphophorus maculatus 46 0.0 1280
WERAM-Ten-0186 ENSTNIP00000018287.1 Tetraodon nigroviridis 44 0.0 1265
WERAM-Mae-0021 ENSMEUP00000001693.1 Macropus eugenii 65 0.0 1162
WERAM-Orn-0120 ENSONIP00000012272.1 Oreochromis niloticus 57 0.0 1007
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 73 0.0 910
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 72 0.0 882
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 73 0.0 825
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 75 0.0 757
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 61 0.0 696
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 0.0 664
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 53 8e-179 627
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 51 8e-164 577
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 43 1e-151 537
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 35 4e-98 359
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 3e-49 197
Created Date 25-Jun-2016