WERAM Information


Tag Content
WERAM ID WERAM-Poa-0036
Ensembl Protein ID ENSPPYP00000004505.2
Gene Name KMT2A
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPPYG00000003935.2 ENSPPYT00000004683.2 ENSPPYP00000004505.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 5.90e-50 168.4 3828 3943
Me_Reader PHD 5.20e-13 49.1 1431 1976
Ac_Reader Bromodomain 2.00e-05 24.7 1701 1736
Organism Pongo abelii
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++ v++s+i+g+gl++k++i+++e+viEY+G+virs ++dkrek+y++k+ig+y+fr+d++ vvdat++gn+arfinhscepNc+
ENSPPYP00000004505.2 3828 AVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYDSKGIGCYMFRIDDS--EVVDATMHGNAARFINHSCEPNCY 3912
58899********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
++v+++dg+k+ivi+a+r+I +geeltydYk
ENSPPYP00000004505.2 3913 SRVINIDGQKHIVIFAMRKIYRGEELTYDYK 3943
******************************7 PP

  Me_Reader PHD

               PHD.txt    3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.kswyCpsCk 51  
+C++C +++ e+v+C+ C + fH C++ ++++l+++ ++w+C++Ck
ENSPPYP00000004505.2 1431 VCFLCASSGHV--EFVYCQVCCEPFHKFCLEENERPLEDQlENWCCRRCK 1478
8****554444..59******************6666655778******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslp.egks.wyCpsCke 52
++C vCg++++ +k++++C++C++ +H +C++++ + p ++k+ w+C +C++
ENSPPYP00000004505.2 1478 KFCHVCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPtKKKKvWICTKCVR 1530
59****988888888*******************88888444336******86 PP
PHD.txt 2 tiClvCgkddegeke...mvqCdeCddwfHlkCvklp......lsslpegkswyCpsCke 52
++C++C+k+++++++ m+qC +Cd+w+H kC +l+ ls+lpe+ +++C +C+e
ENSPPYP00000004505.2 1565 NFCPLCDKCYDDDDYeskMMQCGKCDRWVHSKCENLSdemyeiLSNLPESVAYTCVNCTE 1624
789999877666555566*******************9**99999***9999******97 PP
PHD.txt 3 iClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+k+++ + +C + C +H C + ++k+ yC++++
ENSPPYP00000004505.2 1930 RCEFCQKPGAT---VGCCLTsCTSNYHFMCSRAKNCVFLDDKKVYCQRHR 1976
599**766665...4566558*************5555577899**9997 PP

  Ac_Reader Bromodomain

             BROMO.txt   27 kePmdLstikerleegnYsspeefvkDvrlifnNak 62  
++P+dL+ +k+++++gnY+s++ef +D+ i++ a
ENSPPYP00000004505.2 1701 QQPLDLEGVKRKMDQGNYTSVLEFSDDIVKIIQAAI 1736
69***************************9998775 PP

Protein Sequence
(Fasta)
MAHSCRWRFP ARPGTTGGGG GGGRRGLGGA PRQRVPALLL PPGPPVGGGG PGAPPSPPAV 60
AAAAAAAGSS GAGVPGGAAA ASAASSSSAS SSSSSSSSAS SGPALLRVGP GFDAALQVSA 120
AIGTNLRRFR AVFGESGGGG GSGEDEQFLG FGSDEEVRVR SPTRSPSVKT SPRKPRGRPR 180
SGSDRNSAIL SDPSVFSPLN KSETKSGDKI KKKDSKSIEK KRGRPPTFPG VKIKITHGKD 240
ISELPKGNKE DSLKKIKRTP SATFQQATKI KKLRAGKLSP LKSKFKTGKL QIGRKGVQIV 300
RRRGRPPSTE RIKTPSGLLI NSELEKPQKV RKDKEGTPPL TKEDKTVVRQ SPRRIKPVRI 360
IPSSKRTDAT IAKQLLQRAK KGAQKKIEKE AAQLQGRKVK TQVKNIRQFI MPVVSAISSR 420
IIKTPRRFIE DEDYDPPIKI ARLESTPNSR FSAPSCGSSE KSSAASQHSS QMSSDSSRSS 480
SPSVDTSTDS QASEEIQVLP EERSDTPEVH TPLPISQSPE NESNDRRSRR YSVSERSFGS 540
RTTKKLSTLQ SAPQQQTSSS PPPPLLTPPP PLQPASSISD HTPWLMPPTI PLASPFLPAS 600
TAPMQGKRKS ILREPTFRWT SLKHSRSEPQ YFSSAKYAKE GLIRKPIFDN FRPPPLTPED 660
VGFASGFSAS GTAASARLFS PLHSGTRFDM HKRSPLLRAP RFTPSEAHSR IFESVTLPSN 720
RTSAGTSGVS NRKRKRKVFS PIRSEPRSPS HSMRTRSGRL STSELSPLTP PSSVSSSLSI 780
SVSPLATSAL NPTFTFPSHS LTQSGESAEK NQRPRKQTSA PAEPFSSSSP TPLFPWFTPG 840
SQTERGRNKD KAPEELSKDR DADKSVEKDK SRERDREREK ENKRESRKEK RKKGSEIQSS 900
SALYPVGRVS KEKVVGEDVA TSSSAKKATG RKKSSSHDSG TDITSVTLGD TTAVKTKILI 960
KKGRGNLEKT NLDLGPTAPS LEKEKTLCLS TPSSSTVKHS TSSIGSMLAQ ADKLPMTDKR 1020
VASLLKKAKA QLCKIEKSKS LKQTDQPKAQ GQESDSSETS VRGPRIKHVC RRAAVALGRK 1080
RAVFPDDMPT LSALPWEERE KILSSMGNDD KSSIAGSEDA EPLAPPIKPI KPVTRNKAPQ 1140
EPPVKKGRRS RRCGQCPGCQ VPEDCGVCTN CLDKPKFGGR NIKKQCCKMR KCQNLQWMPS 1200
KAYLQKQAKA VKKKEKKSKT SEKKDSKESS VVKNVMDSSQ KPTPSAREDP APKKSSNEPP 1260
PRKPVEEKSE EGNVSAPGPE SKQATTPASR KSSKQVSQPA PVIPPQPPTT GPPRKEVPKT 1320
TPSEPKKKQP PPPESGPEQS KQKKVAPRPS IPVKQKPKEK EKPPPVNKQE NAGTLNILST 1380
LSNGNSSKQK IPADGVHRIR VDFKEDCEAE NVWEMGGLGI LTSVPITPRV VCFLCASSGH 1440
VEFVYCQVCC EPFHKFCLEE NERPLEDQLE NWCCRRCKFC HVCGRQHQAT KQLLECNKCR 1500
NSYHPECLGP NYPTKPTKKK KVWICTKCVR CKSCGSTTPG KGWDAQWSHD FSLCHDCAKL 1560
FAKGNFCPLC DKCYDDDDYE SKMMQCGKCD RWVHSKCENL SDEMYEILSN LPESVAYTCV 1620
NCTERHPAEW RLALEKELQI SLKQVLTALL NSRTTSHLLR YRQAAKPPDL NPETEESIPS 1680
RSSPEGPDPP VLTEVSKQDD QQPLDLEGVK RKMDQGNYTS VLEFSDDIVK IIQAAINSDG 1740
GQPEIKKANS MVKSFFIRQM ERVFPWFSVK KSRFWEPNKV SSNSGMLPNA VLPPSLDHNY 1800
AQWQEREENS HTEQPPLMKK IIPAPKPKGP GEPDSPTPLH PPTPPILSTD RSREDSPELN 1860
PPPGIEDNRQ CALCLTYGDD SANDAGRLLY IGQNEWTHVN CALWSAEVFE DDDGSLKNVH 1920
MAVIRGKQLR CEFCQKPGAT VGCCLTSCTS NYHFMCSRAK NCVFLDDKKV YCQRHRDLIK 1980
GEVVPENGFE VFRRVFVDFE GISLRRKFLN GLEPENIHMM IGSMTIDCLG ILNDLSDCED 2040
KLFPIGYQCS RVYWSTTDAR KRCVYTCKIV ECRPPVVEPD INSTVEHDEN RTIAHSPTSF 2100
TESSSKESQN TAEIISPPSP DRPPHSQTSG SCYYHVISKV PRIRTPSYSP TQRSPGCRPL 2160
PSAGSPTPTT HEIVTVGDPL LSSGLRSIGS RRHSTSSLSP QRSKLRIMSP MRTGNTYSRN 2220
NVSSVSTIGT ATNLESSAKA VDHVLGPLNS STSLGQNTST SSNLQRTVVT VGNKNSHLDG 2280
SSSSEMKQSS ASDLASKSSS LKGEKTKVLS SKSSEGSAHN VTYPGIPKLA PQVHNTTSRE 2340
LNVSKIGSFA EPSSVSFSSK EALSFPHLHL RGQRNDRDQH TDSTQSANPS PDEDTEVKTL 2400
KLSGMSNRSS IINEHTGSSS RDRRQKGKKS CKETFKEKHS SKSFLEPGQV TTGEEGNLKP 2460
EFMDEVLTPE YMGQRPCNNV SSDKIGDKGL SMPGVPKAPP MQVEGSAKEL QAPRKRTVKV 2520
TLTPLKMENE SQSKNALKES SPASPLQIES TSPTEPISAS ENPGDGPVAQ PSPNNTSCQD 2580
SQSNNYQNLP VQDRNLMLPD GPKPQEDGSF KRRYPRRSAR ARSNMFFGLT PLYGVRSYGE 2640
EDIPFYSSST GKKRGKRSAE GQVDGADDLS TSDEDDLYYY NFTRTVISSG GEERLASHNL 2700
FREEEQCDLP KISQLDGVDD GTESDTSVTA TTRKSSQIPK RNGKENGTEN LKIDRPEDAG 2760
EKEHVIKSSV GHKNEPKMDN CHSVSRVKTQ GQDSLEAQLS SLESSRRVHT STPSDKNLLD 2820
TYNTELLKSD SDNNNSDDCG NILPSDIMDF VLKNTPSMQA LGESPESSSS ELLNLGEGLG 2880
LDSNREKDMG LFEVFSQQLP TTEPVDSSVS SSISAEEQFE LPLELPSDLS VLTTRSPTVP 2940
SQNPSRLAVI SDSGEKRVTI TEKSVASSEG DPALLSPGVD TTPEGHMTPD HFIQGHMDAD 3000
HISSPPCGSV EQGHGNNQDL TRNSSTPGLQ VPVSPTVPIQ NQKYVPNSTD SPGPSQISNA 3060
AVQTTPPHLK PATEKLIVVN QNMQPLYVLQ TLPNGVTQKI QLTSSVSSTP SVMETNTSVL 3120
GPMGSGLTLT TGLNPSLPTS QSLFPSASKG LLPMSHHQHL HSFPAATQSS FPPNISNPPS 3180
GLLIGVQPPP DPQLLVSESS QRTDLSTTVA TPSSGLKKRP ISRLQTRKNK KLAPSSTPSN 3240
IAPADVVSNM TLINFTPSQL PNHPNLLDLG SLNTSSHRTV PNIIKRSKSS IMYFEPAPLL 3300
PQSVGGTAAT AAGTSTISQD TSHLTSGSVS GLASSSSVLN VVSMQTTTTP TSSASVPGHV 3360
TLTNPRLLGT PDIGSISNLL IKASQQSLGI QDQPVALPPS SGMFPQLGTS QTPSTAAMTA 3420
ASSICVLPST QTTGITAASP SGEADEHYQL QHVNQLLASK PGIHSSQHDL DSASGPQVSN 3480
FTQTVDAPNS VGLEQNKALS SAVQASSTSP GGSPSSPSSG QRSASPSVPG PTKPKPKTKR 3540
FQLPLDKGNG KKHKVSHLRT SSSEAHIPDQ ETTSLTSGTG TPGAEAEQQD TASVEQSSQK 3600
ECGQPAGQVA VLPEVQVTQN PANEQESTEP KTVEEEESNF SSPLMLWLQQ EQKRKESITE 3660
KKPKKGLVFE ISSDDGFQIC AESIEDAWKS LTDKVQEARS NARLKQLSFA GVNGLRMLGI 3720
LHDAVVFLIE QLSGAKHCRN YKFRFHKPEE ANEPPLNPHG SARAEVHLRK SAFDMFNFLA 3780
SKHRQPPEYN PNDEEEEEVQ LKSARRATSM DLPMPMRFRH LKKTSKEAVG VYRSPIHGRG 3840
LFCKRNIDAG EMVIEYAGNV IRSIQTDKRE KYYDSKGIGC YMFRIDDSEV VDATMHGNAA 3900
RFINHSCEPN CYSRVINIDG QKHIVIFAMR KIYRGEELTY DYKFPIEDAS NKLPCNCGAK 3960
KCRKFLN 3967
Nucleotide Sequence
(Fasta)
ATGGCGCACA GCTGTCGGTG GCGCTTCCCC GCCCGACCCG GGACCACCGG GGGCGGCGGC 60
GGCGGGGGGC GCCGGGGCCT AGGGGGCGCC CCGCGGCAAC GCGTCCCGGC CCTGCTGCTT 120
CCCCCCGGGC CCCCGGTCGG CGGTGGCGGC CCCGGGGCGC CCCCCTCCCC CCCGGCTGTG 180
GCGGCCGCGG CGGCGGCGGC GGGAAGCAGC GGGGCTGGGG TTCCAGGGGG AGCGGCCGCC 240
GCCTCAGCAG CCTCCTCGTC GTCCGCCTCG TCTTCGTCTT CGTCATCGTC CTCAGCCTCT 300
TCAGGGCCGG CCCTGCTCCG GGTGGGCCCG GGCTTCGACG CGGCGCTGCA GGTCTCGGCC 360
GCCATCGGCA CCAACCTGCG CCGGTTCCGG GCCGTGTTTG GGGAGAGCGG CGGGGGAGGC 420
GGCAGCGGAG AGGATGAGCA ATTCTTAGGT TTTGGCTCAG ATGAAGAAGT CAGAGTGCGA 480
AGTCCCACAA GGTCTCCTTC AGTTAAAACT AGTCCTCGAA AACCTCGTGG GAGACCTAGA 540
AGTGGCTCTG ACCGAAATTC AGCTATCCTC TCAGATCCAT CTGTGTTTTC CCCTCTAAAT 600
AAATCAGAGA CCAAATCTGG AGATAAGATC AAGAAGAAAG ATTCTAAAAG TATAGAAAAG 660
AAGAGAGGAA GACCTCCCAC CTTCCCTGGA GTAAAAATCA AAATAACACA TGGAAAGGAC 720
ATTTCAGAGT TACCAAAGGG AAACAAAGAA GATAGCCTGA AAAAAATTAA AAGGACACCG 780
TCTGCTACGT TTCAGCAAGC CACAAAGATT AAAAAATTAA GAGCAGGTAA ACTCTCTCCT 840
CTCAAGTCTA AGTTTAAGAC AGGGAAGCTT CAAATAGGAA GGAAGGGGGT ACAAATTGTA 900
CGACGGAGAG GAAGGCCTCC ATCAACAGAA AGGATAAAGA CCCCTTCGGG TCTCCTCATT 960
AATTCTGAAC TGGAAAAGCC CCAGAAAGTC CGGAAAGACA AAGAAGGAAC ACCTCCACTT 1020
ACAAAAGAAG ATAAGACAGT TGTCAGACAA AGCCCTCGAA GGATTAAGCC AGTTAGGATT 1080
ATTCCTTCTT CAAAAAGGAC AGATGCAACC ATTGCTAAGC AACTCTTACA GAGGGCAAAA 1140
AAGGGGGCTC AAAAGAAAAT TGAAAAAGAA GCAGCTCAGC TGCAGGGAAG AAAGGTGAAG 1200
ACACAGGTCA AAAATATTCG ACAGTTCATC ATGCCTGTTG TCAGTGCTAT CTCCTCGCGG 1260
ATCATTAAGA CCCCTCGGCG ATTTATAGAG GATGAGGATT ATGACCCTCC AATTAAAATT 1320
GCCCGATTAG AGTCTACACC GAATAGTAGA TTCAGTGCCC CGTCCTGTGG ATCTTCTGAA 1380
AAATCAAGTG CAGCTTCTCA GCACTCCTCT CAAATGTCTT CAGACTCCTC TCGATCTAGT 1440
AGCCCCAGTG TTGATACCTC CACAGACTCT CAGGCTTCTG AGGAGATTCA GGTACTTCCT 1500
GAGGAGCGGA GCGATACCCC TGAAGTTCAT ACTCCACTGC CCATTTCCCA GTCCCCAGAA 1560
AATGAGAGTA ATGATAGGAG AAGCAGAAGG TATTCAGTGT CGGAGAGAAG TTTTGGATCT 1620
AGAACGACGA AAAAATTATC AACTCTACAA AGTGCCCCCC AGCAGCAGAC CTCCTCGTCT 1680
CCACCCCCAC CTCTGCTGAC TCCCCCGCCA CCACTGCAGC CAGCCTCCAG TATCTCTGAC 1740
CACACACCTT GGCTTATGCC TCCAACAATC CCCTTAGCAT CACCATTTTT GCCTGCTTCC 1800
ACTGCTCCTA TGCAAGGGAA GCGAAAATCT ATTTTGCGAG AACCGACATT TAGGTGGACT 1860
TCTTTAAAGC ATTCTAGGTC AGAGCCACAA TACTTTTCCT CAGCAAAGTA TGCCAAAGAA 1920
GGTCTTATTC GCAAACCAAT ATTTGATAAT TTCCGACCCC CTCCGCTAAC TCCCGAGGAC 1980
GTTGGCTTTG CATCTGGTTT TTCTGCATCT GGTACCGCTG CTTCAGCCCG ATTGTTTTCG 2040
CCACTCCATT CTGGAACAAG GTTTGATATG CACAAAAGGA GCCCTCTTCT GAGAGCTCCA 2100
AGATTTACTC CAAGTGAGGC TCACTCTAGA ATATTTGAGT CTGTAACCTT GCCTAGTAAT 2160
CGAACTTCTG CTGGAACATC TGGAGTATCC AATAGAAAAA GGAAAAGAAA AGTGTTTAGT 2220
CCTATTCGAT CTGAACCAAG ATCTCCTTCT CACTCCATGA GGACAAGAAG TGGAAGGCTT 2280
AGTACTTCTG AGCTGTCACC TCTCACCCCC CCGTCTTCTG TCTCTTCCTC GTTAAGCATT 2340
TCTGTTAGTC CTCTTGCCAC TAGTGCCTTA AACCCAACTT TTACTTTTCC TTCTCATTCC 2400
CTGACTCAGT CTGGGGAATC TGCAGAGAAA AATCAGAGAC CAAGGAAGCA GACTAGTGCT 2460
CCAGCAGAGC CATTTTCATC AAGTAGTCCT ACTCCTCTCT TCCCTTGGTT TACCCCAGGC 2520
TCTCAGACTG AAAGAGGGAG AAATAAAGAC AAGGCCCCTG AGGAGCTGTC CAAAGATCGA 2580
GATGCTGACA AGAGCGTGGA GAAGGACAAG AGTAGAGAGA GAGACCGGGA GAGAGAAAAG 2640
GAGAATAAGC GGGAGTCAAG GAAAGAGAAA AGGAAAAAGG GATCAGAAAT TCAGAGTAGT 2700
TCTGCTTTGT ATCCTGTGGG TAGGGTTTCC AAAGAGAAGG TTGTTGGTGA AGATGTTGCC 2760
ACTTCATCTT CTGCCAAAAA AGCAACAGGG CGGAAGAAGT CTTCATCACA TGATTCTGGG 2820
ACTGATATTA CTTCTGTGAC TCTTGGGGAT ACAACAGCTG TCAAAACCAA AATACTTATA 2880
AAGAAAGGGA GAGGAAATCT GGAAAAAACC AACTTGGACC TCGGCCCAAC TGCCCCATCC 2940
CTGGAGAAGG AGAAAACCCT CTGCCTTTCC ACTCCTTCAT CTAGCACTGT TAAACATTCC 3000
ACTTCCTCCA TAGGCTCCAT GTTGGCTCAG GCAGACAAGC TTCCAATGAC TGACAAGAGG 3060
GTTGCCAGCC TCCTAAAAAA GGCCAAAGCT CAGCTCTGCA AGATTGAGAA GAGTAAGAGT 3120
CTTAAACAAA CTGACCAGCC CAAAGCACAG GGTCAAGAAA GTGACTCATC AGAGACCTCT 3180
GTGCGAGGAC CCCGGATTAA ACATGTCTGC AGAAGAGCAG CTGTTGCCCT TGGCCGAAAA 3240
CGAGCTGTGT TTCCTGATGA CATGCCCACC CTGAGTGCCT TACCATGGGA AGAACGAGAA 3300
AAGATTCTGT CTTCCATGGG GAATGATGAC AAGTCATCAA TTGCTGGCTC AGAAGATGCT 3360
GAACCTCTTG CTCCACCCAT CAAACCAATT AAACCTGTCA CTAGAAACAA GGCACCCCAG 3420
GAACCTCCAG TAAAGAAAGG ACGTCGATCG AGGCGGTGTG GGCAGTGTCC CGGCTGCCAG 3480
GTGCCTGAGG ACTGTGGTGT TTGTACTAAT TGCTTAGATA AGCCCAAGTT TGGTGGTCGC 3540
AATATAAAGA AGCAGTGCTG CAAGATGAGA AAATGTCAGA ATCTACAATG GATGCCTTCC 3600
AAAGCCTACC TGCAGAAGCA GGCTAAAGCT GTGAAAAAGA AAGAGAAGAA GTCTAAGACC 3660
AGTGAAAAGA AAGATAGCAA AGAGAGCAGT GTTGTGAAGA ACGTGATGGA CTCTAGTCAG 3720
AAACCTACCC CATCAGCAAG AGAGGATCCT GCCCCAAAGA AAAGCAGTAA TGAGCCTCCT 3780
CCACGAAAGC CCGTCGAGGA AAAGAGTGAA GAAGGGAATG TCTCGGCCCC TGGGCCTGAA 3840
TCCAAACAGG CCACCACTCC AGCTTCCAGG AAGTCAAGCA AGCAGGTCTC CCAGCCAGCA 3900
CCGGTCATCC CCCCTCAGCC ACCTACTACA GGACCGCCAA GAAAAGAAGT TCCCAAAACC 3960
ACTCCTAGTG AGCCCAAGAA AAAGCAGCCT CCACCACCAG AATCAGGTCC AGAGCAGAGC 4020
AAACAGAAAA AAGTGGCTCC CCGCCCAAGT ATCCCTGTAA AACAAAAACC AAAAGAAAAG 4080
GAAAAACCAC CTCCGGTCAA TAAGCAGGAG AATGCAGGCA CTTTGAACAT CCTCAGCACT 4140
CTCTCCAATG GCAATAGTTC TAAGCAAAAA ATTCCAGCAG ATGGAGTCCA CAGGATCAGA 4200
GTGGACTTTA AGGAGGATTG TGAAGCAGAA AATGTGTGGG AGATGGGAGG CTTAGGAATC 4260
TTGACTTCTG TTCCTATAAC ACCCAGGGTG GTTTGCTTTC TCTGTGCCAG TAGTGGGCAT 4320
GTAGAGTTTG TGTATTGCCA AGTCTGTTGT GAGCCCTTCC ACAAGTTTTG TTTAGAGGAG 4380
AACGAGCGCC CTCTGGAGGA CCAGCTGGAA AATTGGTGTT GTCGTCGCTG CAAATTCTGT 4440
CATGTTTGTG GAAGACAACA TCAGGCTACA AAGCAGCTGC TGGAGTGTAA TAAGTGCCGA 4500
AACAGCTATC ACCCTGAGTG CCTGGGACCA AACTACCCCA CCAAACCCAC AAAGAAGAAG 4560
AAAGTCTGGA TCTGTACCAA GTGTGTTCGC TGTAAGAGCT GTGGATCCAC AACTCCAGGC 4620
AAAGGGTGGG ATGCACAGTG GTCTCACGAT TTCTCACTGT GTCATGATTG CGCCAAGCTC 4680
TTTGCTAAAG GAAACTTCTG CCCTCTCTGT GACAAATGTT ATGATGATGA TGACTATGAG 4740
AGTAAGATGA TGCAATGTGG AAAGTGTGAT CGCTGGGTCC ATTCCAAATG TGAGAATCTT 4800
TCAGATGAGA TGTATGAGAT TCTATCTAAT CTGCCAGAAA GTGTGGCCTA CACTTGTGTG 4860
AACTGTACTG AGCGGCACCC TGCAGAGTGG CGACTGGCCC TTGAAAAAGA GCTGCAGATT 4920
TCTCTGAAGC AAGTTCTGAC AGCTTTGTTG AATTCTCGGA CTACCAGCCA TTTGCTACGC 4980
TACCGGCAGG CTGCCAAGCC TCCAGACTTA AATCCCGAGA CAGAGGAGAG TATACCTTCC 5040
CGCAGCTCCC CCGAAGGACC TGATCCACCA GTTCTTACTG AGGTCAGCAA ACAGGATGAT 5100
CAGCAGCCTT TAGATCTAGA AGGAGTCAAG AGGAAGATGG ACCAAGGGAA TTACACATCT 5160
GTGTTGGAGT TCAGTGATGA TATTGTGAAG ATCATTCAAG CAGCCATTAA TTCAGATGGA 5220
GGACAGCCAG AAATTAAAAA AGCCAACAGC ATGGTCAAGT CCTTCTTCAT TCGGCAAATG 5280
GAACGTGTTT TTCCATGGTT CAGTGTCAAA AAGTCCAGGT TTTGGGAGCC AAATAAAGTA 5340
TCAAGCAACA GTGGGATGTT ACCAAACGCA GTGCTTCCAC CTTCACTTGA CCATAATTAT 5400
GCTCAGTGGC AGGAGCGAGA GGAAAACAGC CACACTGAGC AGCCTCCTTT AATGAAGAAA 5460
ATCATTCCAG CTCCCAAACC CAAAGGGCCT GGAGAACCAG ACTCACCAAC TCCTCTGCAT 5520
CCTCCTACAC CACCAATTTT GAGTACTGAT AGGAGTCGAG AAGACAGTCC AGAGCTGAAC 5580
CCACCCCCAG GCATAGAAGA CAATAGACAG TGTGCGTTAT GTTTGACTTA TGGTGATGAC 5640
AGTGCTAATG ATGCTGGTCG TTTACTATAT ATTGGCCAAA ATGAGTGGAC ACATGTAAAT 5700
TGTGCTTTGT GGTCAGCGGA AGTGTTTGAA GATGATGATG GATCACTAAA GAATGTGCAT 5760
ATGGCTGTGA TCAGGGGCAA GCAGCTGAGA TGTGAATTCT GCCAAAAGCC AGGAGCCACC 5820
GTGGGTTGCT GTCTCACATC CTGCACCAGC AACTATCACT TCATGTGTTC CCGAGCCAAG 5880
AACTGTGTCT TTCTGGATGA TAAAAAAGTA TATTGCCAAC GACATCGGGA TTTGATCAAA 5940
GGCGAAGTGG TTCCTGAGAA TGGATTTGAA GTTTTCAGAA GAGTGTTTGT GGACTTTGAA 6000
GGAATCAGCT TGAGAAGGAA GTTTCTCAAT GGCTTGGAAC CAGAAAATAT CCACATGATG 6060
ATTGGGTCTA TGACAATCGA CTGCTTAGGA ATTCTAAATG ATCTCTCTGA CTGTGAAGAT 6120
AAGCTCTTTC CTATTGGATA TCAGTGTTCC AGGGTATACT GGAGCACCAC AGATGCTCGC 6180
AAGCGCTGCG TATATACATG CAAGATAGTG GAGTGCCGTC CTCCAGTCGT AGAGCCGGAT 6240
ATCAACAGCA CTGTTGAACA TGATGAAAAC AGGACCATTG CCCATAGTCC AACATCTTTC 6300
ACAGAAAGTT CATCAAAAGA GAGTCAAAAC ACAGCTGAAA TTATAAGTCC TCCGTCACCA 6360
GACCGACCTC CTCATTCACA AACCTCTGGC TCCTGTTATT ATCATGTCAT CTCAAAGGTC 6420
CCCAGGATTC GAACACCCAG TTATTCTCCA ACACAGAGAT CTCCTGGCTG TCGACCATTG 6480
CCTTCTGCAG GAAGTCCTAC CCCAACCACT CATGAAATAG TCACAGTAGG TGATCCTTTA 6540
CTCTCCTCTG GACTTCGAAG CATTGGCTCC AGGCGTCACA GTACCTCTTC CTTATCACCC 6600
CAGCGGTCCA AACTCCGGAT AATGTCTCCA ATGAGAACTG GGAATACTTA CTCTAGGAAT 6660
AATGTTTCCT CAGTCTCCAC CATTGGGACC GCTACTAATC TTGAATCAAG TGCCAAAGCA 6720
GTTGATCATG TCTTAGGGCC ACTGAATTCA AGTACTAGTT TAGGGCAAAA CACTTCCACC 6780
TCTTCAAATT TGCAAAGGAC AGTGGTTACT GTAGGCAATA AAAACAGTCA CTTGGATGGA 6840
TCTTCATCTT CAGAAATGAA GCAGTCCAGT GCTTCAGACT TGGCATCCAA GAGCTCTTCT 6900
TTAAAGGGAG AGAAGACCAA AGTGCTGAGT TCCAAGAGCT CAGAGGGATC TGCACATAAT 6960
GTGACTTACC CTGGAATTCC TAAACTGGCC CCACAGGTTC ATAACACAAC ATCTAGAGAA 7020
CTGAATGTTA GTAAAATCGG CTCCTTTGCT GAACCCTCTT CGGTGTCGTT TTCTTCTAAA 7080
GAGGCCCTCT CCTTCCCACA CCTCCATCTG AGAGGGCAAA GGAATGATCG AGACCAACAC 7140
ACAGATTCTA CCCAATCAGC AAACCCCTCT CCAGATGAAG ATACTGAAGT CAAAACCTTG 7200
AAGCTATCTG GAATGAGCAA CAGATCATCC ATTATCAATG AACATACGGG ATCTAGTTCC 7260
AGAGATAGGA GACAGAAAGG GAAAAAATCT TGTAAAGAAA CTTTCAAAGA AAAGCATTCC 7320
AGTAAATCTT TTTTGGAACC TGGTCAGGTG ACAACTGGTG AGGAAGGAAA CTTGAAGCCA 7380
GAGTTTATGG ATGAGGTTTT GACTCCTGAG TATATGGGCC AACGACCATG TAACAATGTT 7440
TCTTCTGATA AGATTGGTGA TAAAGGCCTT TCTATGCCAG GAGTCCCCAA AGCTCCACCC 7500
ATGCAAGTAG AAGGATCTGC CAAGGAATTA CAGGCACCAC GGAAACGCAC AGTCAAAGTG 7560
ACACTGACAC CTCTAAAAAT GGAAAATGAG AGTCAATCCA AAAACGCCCT GAAAGAAAGT 7620
AGTCCTGCTT CCCCTTTGCA AATAGAGTCA ACATCTCCCA CAGAACCAAT TTCAGCCTCT 7680
GAAAATCCAG GAGATGGTCC AGTGGCCCAA CCAAGCCCCA ATAATACCTC ATGCCAGGAT 7740
TCTCAAAGTA ACAACTATCA GAATCTTCCA GTACAGGACA GAAACCTAAT GCTTCCAGAT 7800
GGCCCCAAAC CTCAGGAGGA TGGCTCTTTT AAAAGGAGGT ATCCCCGTCG CAGTGCCCGT 7860
GCACGTTCTA ACATGTTCTT TGGGCTTACC CCACTCTATG GAGTAAGATC CTATGGTGAA 7920
GAAGACATTC CATTCTACAG CAGCTCAACT GGGAAGAAGC GAGGCAAGAG ATCAGCTGAA 7980
GGACAGGTGG ATGGGGCCGA TGACTTAAGC ACTTCAGATG AAGATGACTT ATACTATTAC 8040
AACTTCACTA GAACAGTGAT TTCTTCAGGT GGAGAGGAAC GACTGGCATC CCATAATTTA 8100
TTTCGGGAGG AGGAACAGTG TGATCTTCCA AAAATCTCAC AGTTGGATGG TGTTGATGAT 8160
GGGACAGAGA GTGATACTAG TGTCACAGCC ACAACAAGGA AGAGCAGCCA GATTCCAAAA 8220
AGAAATGGTA AAGAAAATGG AACAGAGAAC TTAAAGATTG ATCGACCTGA AGACGCTGGG 8280
GAGAAAGAAC ATGTCATTAA GAGTTCTGTT GGCCACAAAA ATGAGCCAAA GATGGATAAC 8340
TGCCATTCTG TAAGCAGAGT TAAAACACAG GGACAGGATT CCTTGGAAGC TCAGCTCAGC 8400
TCATTGGAGT CAAGCCGCAG AGTCCACACA AGTACCCCCT CTGACAAAAA TTTACTGGAC 8460
ACCTATAATA CTGAGCTCCT GAAATCAGAT TCAGACAATA ACAACAGTGA TGACTGTGGG 8520
AATATCCTGC CTTCAGACAT TATGGACTTT GTACTAAAGA ATACTCCATC CATGCAGGCT 8580
TTGGGTGAGA GCCCAGAGTC ATCTTCATCA GAACTCCTGA ATCTTGGTGA AGGATTGGGT 8640
CTTGACAGTA ATCGTGAAAA AGACATGGGT CTTTTTGAAG TATTTTCTCA GCAGCTGCCT 8700
ACAACAGAAC CTGTGGATAG TAGTGTCTCG TCCTCTATCT CAGCAGAGGA ACAGTTTGAG 8760
TTGCCTCTAG AGCTACCATC TGATCTGTCT GTCTTGACCA CCCGGAGTCC CACTGTCCCC 8820
AGCCAGAATC CCAGTAGACT AGCTGTTATC TCAGATTCAG GGGAGAAGAG AGTAACCATC 8880
ACAGAAAAAT CCGTAGCCTC CTCTGAAGGT GACCCAGCAC TGCTGAGCCC AGGAGTAGAT 8940
ACAACTCCTG AAGGTCACAT GACTCCTGAT CATTTTATCC AAGGACACAT GGATGCAGAC 9000
CACATCTCTA GCCCTCCTTG TGGTTCAGTA GAGCAAGGTC ATGGCAACAA TCAGGATTTA 9060
ACTAGAAACA GTAGCACCCC TGGCCTTCAG GTACCTGTTT CCCCAACTGT TCCTATCCAG 9120
AACCAGAAGT ATGTGCCCAA TTCTACTGAT AGTCCTGGCC CGTCTCAGAT TTCCAATGCA 9180
GCTGTCCAGA CCACTCCACC CCACCTGAAG CCAGCCACTG AGAAACTCAT AGTTGTTAAC 9240
CAGAACATGC AGCCACTTTA TGTTCTCCAA ACTCTTCCAA ATGGAGTGAC CCAAAAAATC 9300
CAATTGACCT CTTCCGTTAG TTCTACACCC AGTGTGATGG AGACAAATAC TTCAGTATTG 9360
GGGCCCATGG GAAGTGGTCT CACCCTCACC ACAGGACTAA ATCCAAGCTT GCCAACTTCT 9420
CAATCTTTGT TCCCTTCTGC TAGCAAAGGA TTGCTACCTA TGTCTCATCA CCAGCACTTA 9480
CATTCCTTCC CTGCAGCTAC TCAAAGTAGT TTCCCACCCA ACATCAGCAA TCCTCCTTCA 9540
GGCCTGCTTA TTGGGGTTCA GCCTCCTCCG GATCCCCAAC TTTTGGTTTC AGAATCCAGC 9600
CAGAGGACAG ACCTCAGTAC CACAGTAGCC ACTCCATCCT CTGGACTCAA GAAAAGACCC 9660
ATATCTCGTC TACAGACCCG AAAGAATAAA AAACTTGCTC CCTCTAGTAC CCCTTCAAAC 9720
ATTGCCCCTG CCGATGTGGT TTCTAATATG ACATTGATTA ACTTCACACC CTCCCAGCTT 9780
CCTAATCATC CCAATCTGTT AGATTTGGGG TCACTTAATA CTTCATCTCA CCGAACTGTC 9840
CCCAACATCA TAAAAAGATC TAAATCTAGC ATCATGTATT TTGAACCGGC ACCCCTGTTA 9900
CCACAGAGTG TGGGAGGAAC TGCTGCCACA GCGGCGGGCA CATCAACAAT AAGCCAGGAT 9960
ACTAGCCACC TCACATCAGG GTCTGTGTCT GGCTTGGCAT CCAGTTCCTC TGTCTTGAAT 10020
GTTGTATCCA TGCAAACTAC CACAACCCCT ACAAGTAGTG CATCAGTTCC AGGACACGTC 10080
ACCTTAACCA ACCCAAGGTT GCTTGGTACC CCAGATATTG GCTCAATAAG CAATCTTTTA 10140
ATCAAAGCTA GCCAGCAGAG CCTGGGGATT CAGGACCAGC CTGTGGCTTT ACCGCCAAGT 10200
TCAGGAATGT TTCCACAACT GGGGACATCA CAGACCCCCT CTACTGCTGC AATGACAGCG 10260
GCATCTAGCA TCTGTGTGCT CCCCTCCACT CAGACTACGG GCATAACAGC CGCTTCACCT 10320
TCTGGGGAAG CAGACGAACA CTATCAGCTT CAGCATGTGA ACCAGCTCCT TGCCAGCAAA 10380
CCTGGGATTC ATTCTTCCCA ACATGATCTT GATTCTGCTT CAGGGCCCCA GGTATCCAAC 10440
TTTACCCAGA CGGTAGACGC TCCTAATAGC GTGGGGCTGG AGCAGAACAA GGCTTTATCC 10500
TCAGCTGTGC AAGCCAGCTC CACCTCTCCT GGGGGTTCTC CATCCTCTCC ATCTTCTGGA 10560
CAGCGGTCAG CAAGCCCTTC AGTGCCGGGT CCCACTAAAC CCAAACCAAA AACCAAACGG 10620
TTTCAGCTGC CTCTAGACAA AGGGAATGGC AAGAAGCACA AAGTTTCCCA TTTGCGGACC 10680
AGTTCTTCTG AAGCACACAT TCCAGACCAA GAAACAACGT CCCTGACCTC AGGCACAGGG 10740
ACTCCAGGAG CAGAGGCCGA GCAGCAGGAT ACAGCTAGTG TGGAGCAGTC CTCCCAGAAG 10800
GAGTGTGGGC AACCTGCAGG GCAAGTGGCT GTTCTTCCGG AAGTTCAGGT GACCCAAAAT 10860
CCAGCAAATG AACAAGAAAG TACAGAACCT AAAACAGTGG AAGAAGAAGA AAGTAATTTC 10920
AGCTCCCCAC TGATGCTTTG GCTTCAGCAA GAACAAAAGC GGAAGGAAAG CATTACTGAG 10980
AAAAAACCCA AGAAAGGACT TGTTTTTGAA ATTTCCAGTG ATGATGGCTT TCAGATCTGT 11040
GCAGAAAGTA TTGAAGATGC CTGGAAGTCA TTGACAGATA AAGTCCAGGA AGCTCGATCA 11100
AATGCCCGCC TAAAGCAGCT CTCATTTGCA GGTGTTAACG GTTTGAGGAT GCTGGGGATT 11160
CTCCATGATG CAGTTGTGTT CCTCATTGAG CAGCTGTCTG GTGCCAAGCA CTGTCGAAAT 11220
TACAAATTCC GTTTCCACAA GCCAGAGGAG GCCAATGAAC CCCCCTTGAA CCCTCACGGC 11280
TCAGCCAGGG CTGAAGTCCA CCTCAGGAAG TCAGCATTTG ACATGTTTAA CTTCCTGGCT 11340
TCTAAACATC GTCAGCCTCC TGAATACAAC CCCAATGATG AAGAAGAGGA GGAGGTACAG 11400
CTGAAATCAG CTCGGAGGGC AACTAGCATG GATCTGCCAA TGCCCATGCG TTTCCGGCAC 11460
TTAAAAAAGA CTTCTAAGGA GGCAGTTGGT GTCTACAGGT CTCCCATCCA TGGCCGGGGT 11520
CTTTTCTGTA AGAGAAACAT TGATGCAGGT GAGATGGTGA TTGAGTATGC CGGCAACGTC 11580
ATCCGCTCCA TCCAGACTGA CAAGCGGGAA AAGTATTATG ACAGCAAGGG CATTGGTTGC 11640
TATATGTTCC GAATTGATGA CTCAGAGGTA GTGGATGCCA CCATGCATGG AAATGCTGCG 11700
CGCTTCATCA ATCACTCGTG TGAGCCTAAC TGCTATTCTC GGGTCATCAA TATTGATGGG 11760
CAGAAGCACA TTGTCATCTT TGCCATGCGT AAGATCTACC GAGGAGAGGA ACTAACTTAC 11820
GACTATAAGT TCCCCATTGA GGATGCCAGC AACAAGCTGC CCTGCAACTG TGGCGCCAAG 11880
AAATGCCGGA AGTTCCTAAA CTAAAGCTGC TCTTCTCCCC CAGTGTTGGA GTGCAAGGAC 11940
GCGGGGCCAT CCAAAGCAAC GCTGAAGGCC TTTTCCAGCA GCTGGGAGCT CCCGGATTGC 12000
GTGGGCACAG CTGAGGGGCC TCTGTGATGG CTGAGCTCTC TTATGTCCTA TACTCACATC 12060
AGACATGTGA TCATAGTCCC GGAGACAGAG TTGAGGTCTC AAAGAAAAGA TCCATGATCG 12120
GCTTTCTCCT GGGGCCCCTC GATTGTTTAC TGTTAGAAAG TGGGAATGGG GTCCCTAGCA 12180
GACTTGCCTG GAAGGAGCCT ATTATAGAGG GTTGGTTATG TTGGGAGATG GGGCCTGAAT 12240
TTCTCCACAG AAATAAGTTG CCATCCTCAA GTTGGCCCTT TCCCAAGCAC TATAAGTGAG 12300
TGGGTCAGGC AAAGCCCCAA ATGGAGGGTT GGTTGGATTC CTGACAGTTT GCCAGCCAGG 12360
CCCCACCTAC AGCGTGTGTC GAACAAACAG AGGTCTGGTG GTTTTCTCTA CTATCCTCCC 12420
ACTCGAAAGT TCACTGGTTG GGAGACAGGA TTCCTAGCAC CTCCGGTGTC AAAAGGCTGT 12480
CATGGGGTTG TGCCAATTAA TTACCAAACA TTGAGCCTGC AGGCTTTGAG TGGGAGCGTT 12540
GCCCCCAGGA GCCTTATCTC AGCCAATTAC CTTTCTTGAC AGTAGGAGCG GCTTCCCTCT 12600
CCCATTCCTT CTTCACTCCC TTTTCTTCCT TTCCCCTGTC TTCATCCCAC TGCTTTCCCA 12660
TGCTTCTTTC TGGGTTGTAG GGGAGACTGA CTGCCTGCTC AAGGACACTC CCTGCTGGGC 12720
ATAGGATGTG CCTGCAAAAA GTTCCCTGAG CCTGTAAGCA CTCCAGGTGG GGAAGTGGAC 12780
AGGAGCCATT GGTCATAACC AGACAGAATT TGGAAACATT TTCATAAAGC TCCACGGAGA 12840
GTTTTAAAGA AACATATGTA GCATGATTTT TTAGGAGAGG AAAAATTATT TAAATAGGAT 12900
TTAAATCATG CAACAACGAG AGTATCACAG CCAGGATGAC CCCTGGGTCC CATTCCTAAG 12960
ACATGGTTAC TTTATTTTCC CCTTGTTAAG ACATAGGAAG ACTTAATTTT TAAACGGTCA 13020
GTGTCCAGTT GAAGGCAGAA CACTAATCAG ATTTCAAGGC CCACAACTTG GGGACTAGAC 13080
CACCTTATGT TGAGGGAACT CTGCCACCTG CGTGCAACCC ACGGCTAAAG TAAATTCAAT 13140
GACACTACTG CCCTGATTAC TCCTTAGGAT GTGGTCAAAA CAGCATCAAA TGTTTCTTCT 13200
CTTCCTTTCC CCAAGACAGT GTCCTGAACC TGTTAAATTA AGTCATTGGA TTTTACTCTG 13260
TTCTGTTTAC AGTTTACTAT TTAAGGTTTT ATAAATGTAA ATATATTTTG TATATTTTTC 13320
TATGAGAAGC ACTTCATAGG GAGAAGCACT TATGACAAGG CTATTTTTTA AACCGCGGTA 13380
TTATCCTAAT TTAAAAGAAG ATCGGTTTTT AATAATTTTT TATTTTCATA GGATGAAGTT 13440
AGAGAAAATA TTCAGCTGTA CACACAAAGT CTGGTTTTTC CTGCTCAACT TCCCCCTGGA 13500
AGGTGTACTT TTTGTTGTTT AATGTGTAGC TTGTTTGTGC CCTGTTGATA TAAATGTTTC 13560
CTGGGTTTGC TCTTCGACAA TAAATGGAGA AGGAAGGTCA CCCAACTCCA TTGGGCCACT 13620
CCCCTCCTTC CCCTATTGAA GCTCCTCAAA AGGCTACAGT AGTATCTTGA TACAACAGAT 13680
TCTCTTCTTT CCCGCCTCTC TCCTTTCCGG CGCAACTTCC AGAGTGGTGG GAGATGGCAA 13740
TCTTTACATT TCCCTCATCT TTCTTACTTC AGAGTTAGCA AACAACAAGT TGAATGGCAA 13800
CTTGACATTT TTCCATCACC ATCTGCCTCA TAGGCCACTC TTTCCTTTCC TCTGCCCACC 13860
AAGTCCTCAT ATCTGCAGAG AACCCATTGA TCACCTTGTG CCCTCTTTTG GGGCAGCCCG 13920
TTGAAACTGA AGCACAGTCT GACCACTCAC GATAAAGCAG ATTTTTCTCT GCCTCTGCCA 13980
CAAGGTTTCA GAGTAGTGTA GTCCAAGTAG AGGGTGGGGC ACCCTTTTCT CGCCGCAAGA 14040
AGCCCATTCC TATGGAAGTC TAGCAAAGCA ATACGACTCA GCCCAGCACT CTCTGCCCCA 14100
GGACTCATGG CTCTGCTGTG CCTTCCATCC TGGGCTCCCT TCTCTCCCGT GACCTTAAGA 14160
ACTTTGTCTG GTGGCTTTGC TGGAACATTG TCACTGTTTT CACTGTCATG CAGGGAGCCC 14220
AGCACTGTGG CCAGGATGGC AGAGACTTCC TTGTCATCAT GGAGAAGTGC CAGCAGGGGA 14280
CTGGGGAAAG CACTCTACCC AGACCTCACC TCCCTTCCTC CTTTTGCCCA TGAACAAGAT 14340
GCAGTGGCCC TAGGGGTTCC ACTAGTGTCT GCTTTCCTTT ATTATTGCAC TGTGTGAGGT 14400
TTTTTTGTAA ATCCTTGTAT TCCTATTTTT TTTAAAGAAA AAAAAAAACT TAAGCTGCAT 14460
TTGTTACTGA AATGATTAAT GCACTGATGG GTCCTGAATT CACCTTGAGA AAGACCCAAA 14520
GGCCAGTCAG GGGGTGGGGG GAACTCAGCT AAATGGACCT AGTTACTGCC CTGCTAGGCC 14580
ATGCTGTACT GTGAGCCCCC TCCTCACTCT CTACCAACCC TAAACCCTGA GGACAGGGGA 14640
GGAACCCACA GCTTCCTTCT CCTGCCAGCT GCAGATGGTT TGCCTTGCCT TTCCACCCCC 14700
TAATTGTCAA CCACAAAAAT GAGAAATTCC TCTTCTAGCT CAGCCTTGAG TCCATTGCCA 14760
AATTTTCAGC ACACCTGCCA GCAACTTGGG GGGATAAGCG AAGGTTTCCC TACAAGAGGG 14820
AAAGAAGGCA AAAACGGCAC AGCTATCTCC AAACACATCT GAGTTCATTT CAAAAGTGAC 14880
CAAGGGAATC TCCGCACAAA AGTGCAGATT GAGGAATTGT GGTGGGTCAT TCCCAAGAAT 14940
CCCCCAAGGG GCATCCCAAA TCCCTAAGGA GTAACAGCTG CAAACCTGGT CAGTTCTCAG 15000
TGAGAGCCAG CTCACTTATA GCTTTGCTGC TAGAACCTGT TGTGGCTGCA TTTCCTGGTG 15060
GCCAGTGACA ACTGTGTAAC CAGAATAGCT GCATGGCGCT GACCCTTTGG CCGGAACTTG 15120
GTCTCTTGGC TCCCTCCTTG GCCACCCACC ACCTCTCGCA CAGCCCCTCT GTTTTTACAC 15180
CAATAACAAG AATTAAGGGG GAAGCCCTGG CAGCTATACG TTTTCAACCA GACTCCTTTG 15240
CCGGGACCCA GCCCGCCACC CTGCTCGCCT CCATCAAACC CCCGGCCAAT GCAGTGAGCA 15300
CCATGTAGCT CCCTTGATTT AAAAAAAAAA AAAAAAGGAA AAAAAAATAC AACACACACA 15360
AAAATAAAAA AAAATCTAAT GAATGTATCT TTCTAAAGGA CTGACGTTCA ATCAAATATC 15420
TGAAAATACT AAAGGTCAAA ACCTTGTCAG ATGTTAACTT TTAAGTTCGG TTTGGGATTT 15480
TTTTTTTTTT TTTAATAGAA ATCAAGTTGT TTTTGTTTTT AAGGAAAAGC GGGTCATTGC 15540
AAAGGGCTGG GTGTAATTTT ATGTTTCATT TCCTTCATTT TAAAGCAATA CAAGGTTATT 15600
GAGCAGATGG TTTTGTGCCG AATCATGAAT ACTAGTCAAG TCACACACTC TGGAAACTTG 15660
CAACTTTTTG TTTGTTTTGG TTTTCAAATA AATATAAATA TGATATATAT AGGAACTAAT 15720
ATAGTAATGC ACCATGTAAC AAAGCCTAGT TCAGTCCATG GCTTTTAATT CTCTTAACAC 15780
TATAGATAAG GATCGTGTTA CAGTTGCTAG TAGCGGCAGG AAGATGTCAG GCTCGCTTTC 15840
CTCTGATTCC CGAAATGGGG GGATCCTCTA ACCACAAAGG GATGGTAGAA CAGTCCATTC 15900
CTCGGATCAG AGAAAAATGC AGACATGGTG TCACCTGGAT TTTTTTCTGC CCATGAATGT 15960
TGCCA 15966
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0034 ENSPTRP00000040970.5 Pan troglodytes 99 0.0 6318
WERAM-Hos-0096 ENSP00000374157.5 Homo sapiens 99 0.0 6315
WERAM-Mam-0038 ENSMMUP00000007251.2 Macaca mulatta 99 0.0 6300
WERAM-Nol-0074 ENSNLEP00000008984.1 Nomascus leucogenys 99 0.0 6281
WERAM-Otg-0095 ENSOGAP00000008374.2 Otolemur garnettii 96 0.0 6134
WERAM-Aim-0147 ENSAMEP00000013764.1 Ailuropoda melanoleuca 95 0.0 6130
WERAM-Eqc-0038 ENSECAP00000006426.1 Equus caballus 96 0.0 6105
WERAM-Loa-0136 ENSLAFP00000012406.4 Loxodonta africana 94 0.0 6035
WERAM-Bot-0163 ENSBTAP00000024084.5 Bos taurus 94 0.0 6012
WERAM-Ova-0104 ENSOARP00000010552.1 Ovis aries 94 0.0 6005
WERAM-Caf-0134 ENSCAFP00000018720.5 Canis familiaris 95 0.0 5997
WERAM-Orc-0100 ENSOCUP00000008738.2 Oryctolagus cuniculus 93 0.0 5922
WERAM-Tut-0049 ENSTTRP00000004041.1 Tursiops truncatus 93 0.0 5915
WERAM-Chs-0032 ENSCSAP00000015460.1 Chlorocebus sabaeus 99 0.0 5844
WERAM-Mum-0008 ENSMUSP00000110337.1 Mus musculus 90 0.0 5695
WERAM-Ran-0100 ENSRNOP00000020573.6 Rattus norvegicus 90 0.0 5665
WERAM-Prc-0053 ENSPCAP00000005189.1 Procavia capensis 90 0.0 5660
WERAM-Myl-0155 ENSMLUP00000012825.2 Myotis lucifugus 86 0.0 5522
WERAM-Sah-0138 ENSSHAP00000014807.1 Sarcophilus harrisii 86 0.0 5471
WERAM-Dio-0068 ENSDORP00000006787.1 Dipodomys ordii 87 0.0 5137
WERAM-Paa-0065 ENSPANP00000009106.1 Papio anubis 99 0.0 5099
WERAM-Mod-0116 ENSMODP00000016853.4 Monodelphis domestica 87 0.0 5097
WERAM-Tag-0002 ENSTGUP00000000072.1 Taeniopygia guttata 79 0.0 4913
WERAM-Pes-0058 ENSPSIP00000007945.1 Pelodiscus sinensis 80 0.0 4913
WERAM-Fia-0123 ENSFALP00000010386.1 Ficedula albicollis 80 0.0 4874
WERAM-Ptv-0056 ENSPVAP00000005910.1 Pteropus vampyrus 89 0.0 4864
WERAM-Meg-0028 ENSMGAP00000002448.2 Meleagris gallopavo 77 0.0 4504
WERAM-Ocp-0107 ENSOPRP00000010334.2 Ochotona princeps 88 0.0 4337
WERAM-Cap-0070 ENSCPOP00000005242.2 Cavia porcellus 93 0.0 4281
WERAM-Gaga-0075 ENSGALP00000011008.4 Gallus gallus 79 0.0 4170
WERAM-Ict-0127 ENSSTOP00000012923.2 Ictidomys tridecemlineatus 92 0.0 4126
WERAM-Mup-0053 ENSMPUP00000004604.1 Mustela putorius furo 96 0.0 4114
WERAM-Gog-0013 ENSGGOP00000000936.2 Gorilla gorilla 99 0.0 3989
WERAM-Anp-0040 ENSAPLP00000004456.1 Anas platyrhynchos 80 0.0 3958
WERAM-Chh-0055 ENSCHOP00000006343.1 Choloepus hoffmanni 91 0.0 3925
WERAM-Anc-0073 ENSACAP00000006937.3 Anolis carolinensis 66 0.0 3888
WERAM-Dan-0088 ENSDNOP00000008968.3 Dasypus novemcinctus 91 0.0 3293
WERAM-Soa-0133 ENSSARP00000013142.1 Sorex araneus 83 0.0 3201
WERAM-Tas-0109 ENSTSYP00000011196.1 Tarsius syrichta 90 0.0 3165
WERAM-Mae-0003 ENSMEUP00000000157.1 Macropus eugenii 82 0.0 2898
WERAM-Lac-0176 ENSLACP00000020625.1 Latimeria chalumnae 66 0.0 2890
WERAM-Tub-0079 ENSTBEP00000009516.1 Tupaia belangeri 88 0.0 2882
WERAM-Ect-0072 ENSETEP00000007880.1 Echinops telfairi 85 0.0 2806
WERAM-Vip-0054 ENSVPAP00000005197.1 Vicugna pacos 91 0.0 2041
WERAM-Ora-0035 ENSOANP00000005967.3 Ornithorhynchus anatinus 85 0.0 1383
WERAM-Leo-0025 ENSLOCP00000004893.1 Lepisosteus oculatus 77 0.0 1275
WERAM-Mim-0138 ENSMICP00000013960.1 Microcebus murinus 93 0.0 1224
WERAM-Ere-0135 ENSEEUP00000014133.1 Erinaceus europaeus 83 0.0 1224
WERAM-Xet-0067 ENSXETP00000022279.3 Xenopus tropicalis 75 0.0 1206
WERAM-Dar-0010 ENSDARP00000095298.3 Danio rerio 68 0.0 1150
WERAM-Orn-0070 ENSONIP00000007849.1 Oreochromis niloticus 70 0.0 1145
WERAM-Gaa-0091 ENSGACP00000011913.1 Gasterosteus aculeatus 72 0.0 1140
WERAM-Pof-0188 ENSPFOP00000015902.2 Poecilia formosa 68 0.0 1107
WERAM-Ten-0223 ENSTNIP00000002397.1 Tetraodon nigroviridis 66 0.0 1079
WERAM-Xim-0097 ENSXMAP00000008788.1 Xiphophorus maculatus 65 0.0 1079
WERAM-Gam-0119 ENSGMOP00000012588.1 Gadus morhua 67 0.0 1062
WERAM-Fec-0070 ENSFCAP00000005933.3 Felis catus 48 0.0 785
WERAM-Sus-0021 ENSSSCP00000003118.2 Sus scrofa 48 0.0 785
WERAM-Caj-0107 ENSCJAP00000018740.2 Callithrix jacchus 49 0.0 776
WERAM-Orla-0062 ENSORLP00000008124.1 Oryzias latipes 47 0.0 751
WERAM-Tar-0189 ENSTRUP00000038926.1 Takifugu rubripes 50 0.0 733
WERAM-Asm-0037 ENSAMXP00000004791.1 Astyanax mexicanus 46 0.0 728
WERAM-Pem-0014 ENSPMAP00000002218.1 Petromyzon marinus 59 0.0 689
WERAM-Cis-0045 ENSCSAVP00000009955.1 Ciona savignyi 37 7e-135 481
WERAM-Cii-0034 ENSCINP00000025384.2 Ciona intestinalis 54 1e-100 367
WERAM-Drm-0010 FBpp0082406 Drosophila melanogaster 47 2e-89 330
WERAM-Cae-0021 C26E6.9a Caenorhabditis elegans 54 2e-42 174
WERAM-Tum-0027 CAZ85029 Tuber melanosporum 53 3e-42 173
WERAM-Php-0006 PP1S101_4V6.1 Physcomitrella patens 54 4e-42 173
WERAM-Ors-0112 OS12T0613200-02 Oryza sativa 54 5e-42 172
WERAM-Org-0116 ORGLA12G0159200.1 Oryza glaberrima 54 5e-42 172
WERAM-Sei-0078 Si021071m Setaria italica 54 9e-42 172
WERAM-Orbr-0127 OB12G25360.1 Oryza brachyantha 53 1e-41 171
WERAM-Brd-0092 BRADI4G01790.1 Brachypodium distachyon 54 2e-41 171
WERAM-Asn-0015 CADANIAP00003254 Aspergillus nidulans 53 2e-41 171
WERAM-Thc-0094 EOY15831 Theobroma cacao 55 3e-41 170
WERAM-Ast-0003 CADATEAP00001100 Aspergillus terreus 52 3e-41 170
WERAM-Asni-0037 CADANGAP00014055 Aspergillus niger 52 3e-41 170
WERAM-Coi-0035 EAS31778 Coccidioides immitis 52 3e-41 170
WERAM-Asc-0034 CADACLAP00008186 Aspergillus clavatus 52 3e-41 170
WERAM-Aso-0006 CADAORAP00000676 Aspergillus oryzae 52 4e-41 169
WERAM-Prp-0018 EMJ21490 Prunus persica 55 5e-41 169
WERAM-Sol-0003 Solyc01g006880.2.1 Solanum lycopersicum 53 5e-41 169
WERAM-Asfu-0007 CADAFUAP00002016 Aspergillus fumigatus 52 8e-41 169
Created Date 25-Jun-2016