WERAM Information


Tag Content
WERAM ID WERAM-Mim-0010
Ensembl Protein ID ENSMICP00000000977.1
Gene Name KMT2C
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMICG00000001060.1 ENSMICT00000001073.1 ENSMICP00000000977.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 5.80e-42 142.3 4709 4823
Me_Reader PHD 6.90e-13 48.3 230 4376
Organism Microcebus murinus
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+++iek+++viEY+G++ir+eva+++ek ye++++gvy+fr+d+d +v+dat +g+ ar+inhsc+ Nc+
ENSMICP00000000977.1 4709 NVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDND--HVIDATLTGGPARYINHSCA-NCV 4792
7999*********************************************************..*******************6.*** PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++++ +ki+i ++r+I+kgeel+ydYk
ENSMICP00000000977.1 4793 AEVVTFERGHKIIISSNRRIQKGEELCYDYK 4823
******************************7 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde.CddwfHlkCv 32 
C +C++ ++ + +C+e C + +H+ C
ENSMICP00000000977.1 230 RCAFCKHLGAT---IKCCEEkCTQMYHYPCA 257
6****433333...6688889*********7 PP
PHD.txt 11 degekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+e++k m+ Cd+Cd+ +H+ C+++ ++s+p + w C++C+
ENSMICP00000000977.1 343 GEDSK-MLVCDTCDKGYHTFCLQPVMKSVPTN-GWKCKNCR 381
44544.************************77.7******8 PP
PHD.txt 2 tiClvCgkddegeke..mvqCdeCddwfHlkCvklplsslpeg..kswyCpsCk 51
++C++Cgk+ + e + m+ C+ C++w+Hl+C k++ ++l ++ ++++C Ck
ENSMICP00000000977.1 410 NLCPFCGKCCHPELQkdMLHCNMCKRWVHLECDKPTDHELDSQlkEEYICMYCK 463
68999987776654455*******************77777666668******8 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
t+C +Cgk+ + + ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSMICP00000000977.1 952 TVCEACGKATDPGR-LLLCDDCDISYHTYCLDPPLQTVPKG-GWKCKWCV 999
68999976666655.9*************************.9**99996 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.ks..wyCpsCk 51
+C+vC +++ +++ ++qC +Cd+w+H+ C +l+ ++ e+ ++ + C C+
ENSMICP00000000977.1 1030 SCPVCYRNYREDDLILQCRQCDRWMHAVCQNLNNEEEVENvADigFDCSMCR 1081
7****777776677*******************5554444422458899997 PP
PHD.txt 3 iClvCgkddegeke....mvqCdeCddwfHlkCvk 33
+C +C+++++g + +++ d d w+Hl+C
ENSMICP00000000977.1 4344 CC-FCHEEGDGLTDgparLLNLDL-DLWVHLNCAL 4376
45.587777774445555666666.5599999975 PP

Protein Sequence
(Fasta)
PRSRGKTAVE DEDSMDGLET TETENIVETE IKEQSAEEDA EAEVDNSKQP VPALQRSVSE 60
ESANSLVSVG VEAKISEQLC AFCYCGEKSS LGQGDLKQFR VTPGFILPWR NQPSSKKDID 120
DNSNGTYEKI QNTAPRKQRG QRKERSPQQN IASCVSVSTQ TAADDQAGKL WDELSLVGLP 180
DAIDVQALFD PTGTCWAHHR CVEWSLGVCQ IEEQLLVNVD KAVVSGSTER CAFCKHLGAT 240
IKCCEEKCTQ MYHYPCAAGA GTFQFSHFXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 300
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX QSGEDSKMLV CDTCDKGYHT 360
FCLQPVMKSV PTNGWKCKNC RICVECGTRS SSQWHHNCLI CDSCYQQQDN LCPFCGKCCH 420
PELQKDMLHC NMCKRWVHLE CDKPTDHELD SQLKEEYICM YCKHLEAEMD PLQPGDEVEM 480
AELITDNNEM EVEGPEDQMV FLEQAVNKDV NGQESTPGIV PDEVEVHTEE PQKSNPPESL 540
DTDGLLTSES SSNKMNSELE NQTSHEVNSE KMEMSSKVMH TCDEDQNEDK MEVTENIEVI 600
THQIIVQQEE LQLEETKTVV SKEELRLPKL TIESVAVVPP ETFVSPHEES TSLCSKKQLL 660
IERIQEEMEK KEHSEFPAGF VDFEMTPAIE SCVKDGSCRG DKSMKLSSES ESSFSSSADI 720
SKANVSSSPT LSSDLPSHEM LHSYPSTLGS AAGNILPTTY ISVTPKIGMR KPAITKRKFS 780
PGRPRSKQGA WSAHNTVSPP SWSPDISEGR EIFKPRQLPG SAIWSIKVGR GSGFPGKRRL 840
RGAGLXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 900
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXITKV ILSKGWRCLE CTVCEACGKA 960
TDPGRLLLCD DCDISYHTYC LDPPLQTVPK GGWKCKWCVW CRHCGATSAG LRCEWQNNYT 1020
QCAPCASLSS CPVCYRNYRE DDLILQCRQC DRWMHAVCQN LNNEEEVENV ADIGFDCSMC 1080
RPYMPASNVP SSDCCESSLV AQIVTKVKEL XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1140
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXGELMDCDG KSESSPEREA 1200
VDDETKGVEG TDGVKKRKRK PYRPGIGGFM GRQRSRTGQG KTKRSVIRKD SSGSISEQLP 1260
SRDDXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXEA FFGKDLLDSS 1320
RRNKLSLDNL SEDTAQLSYK TNRSTGFLDP SLDPLLSASS APAEPGIQXX XXXXXXXXXX 1380
XXXXXXXXXX XXXXXXXXXX XXXXIPVADD PSSLPQQSVN QSLRPLSEEQ LDGILSPELD 1440
KMVTDGAILG KLYKIPELGG KDVEDLFTAV LSPAATQPPP LPQPNPQTQL LPLHSQDVFS 1500
RMPLMNGLIG PSPHLPHNSL PPGSGLGTFS SIAQSPYPDA XXXXXXXXXX XXXXXXXXXX 1560
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXWTTRVKQ 1620
IAKLWRKASS QERAPYVQKA RDNRAALRIN KVQMSNDSMK RQQQQDSIDP SSRIDSDLFK 1680
DPLKQRESEH EQEWKFRQQM RQKSKQQAKI EATQKLEQVK NEQQQQQQQQ QQFGSQHLLM 1740
QSGSDTPSSG IQSPLTPQPG NGNMSPAQSF HKDLFTKQLF ITSSDDVFVK PHAPPPPPTP 1800
SRIPVPESLS QSQTSQPPSP QMFSPGSSNS RPPSPMDPYA KMVGTPRPPP GGHSFPRRNS 1860
APMENCTPLS SVTRPIQMNE TTANRPSPVR DLCSSSTTNS DPYAKPPDTP RPVMTDQFPK 1920
PLGLPRSPIV SEQAAKGPVA AGTNDHFTKP SPRADVFQRQ RVSDTYARPL LTPAPLDSAP 1980
GPFKTPMQPP PSSQDPYGSV SQASRRLSVD PYERPALTPR PVDNFSHHSQ SNDPYSQPPL 2040
TPHPAMTESF AHPSRAFSQP GTISRPTSQD PYSQPPGTPR PVVDSYSQPS GPARSNTDPY 2100
SQPPGTPRPT SIDPYSQQPP TPRPSPQTDL FVTSVANPRH SDPYAHPPGT PRPGISVPYS 2160
QPPATPRPRI SEGFNRSSMT RPVLMPNQDP FLPAAQNRGP ALPGPLVRPP DLCSQTPRPP 2220
GPGLSDTFSR VSPSAARDPC DQPVVTPRSQ AGSFGATQVA HDIANQPGPG SEGSFGTLAG 2280
SPASSQGQQF SSVSQLSGPV PTSGATDTHN TVNMSQADTE KLRQRQKLRE IILQQQQHKK 2340
IAGRQEKGSQ DSAVVPHPGP LPHWQPESIN QAFTRPPPPY PGNIRSPVVP PLGPRYAVFP 2400
KDQHGPYPPD VAGVGMRPHG FRFGFPGGSH GVMSSQERFL GPPQQIQGSG VPPQLRRSIS 2460
VDMPRPLNNS QMNNPIGLPQ HFPPQSLPVQ QHNILGQAFI ELRHRAPDGR PRLPFNAPPG 2520
SVVEAPSHAR HGNFIPRPDF AGPRHTDPMR RPPQGLPNQL LVHPNLEQVP PSQQEQGHPV 2580
HSSSMVMRSL SHSLGGEFSE APLSTTTPAE TTPDNLQITS QSSDGLEEKL DSDDPSVKEL 2640
DVKDLEGVEV KDLDDDDLEN LNLDTEDGKG DELDTLGNLE TNDPNLDDLL RSGEFDIIAY 2700
TDPELDLGDK KNMFNEELDL NVPIDDKLDN QCVSVEPKKK EQEDKTVVLT DKHSPQKKST 2760
VSNEVKTEVL SPNSKVESKC EIEKSDESKD NVDTPCSQAS AHTDLNDGEK TCSQPCDPET 2820
LENRTNRETA GSSASVIQAS TQLPAQDVVN SCGISGSTPV LSSLLANEKA DNSDVRPLGS 2880
PPATLRASPS NQVSSLPPLM APPGHVLDNT MNSNVTGISR VNHAFSQGGQ VNPGFIQGQS 2940
TVNHSLGTGK PTNQTIPLTS QSGTSGMSGP QQLMIPQTLA QQNRERPLLL EEQPLLLQDL 3000
LDQERQEQQQ QRQMQAMIRQ RSEPFFPNMD FDAITDPIMK AKMVALKGIN KVMAQNNLGM 3060
PPMVMSXXXX XXQSLAGQNS EGHNPIPQAP QDGSITHQIS RPNPPNFGPG FVNDSHRKQY 3120
EEWLQETHTL LLMQQRYLER IGAHRKTKKA LSAKQRTAKK DGREFPEEDA EQLKHVTEQQ 3180
SMVQKQLEQI RKQQKEHAEL MEDYRIKQQQ QQQQCAMAPP AVMAGVQPQP PLVPGATPPT 3240
MSQPSFPVVP PQLQHQHQQH TAVLPSHSSP ARMPSLPGWQ PSSAPAHLPL NPPRIQPPVN 3300
QLPLKTCTPA PGAASNANPQ SGPPPRVEFD DNNPFSESFQ ERERKERLRE QQERQRIQLM 3360
QEVDRQRALQ QRMEMEQHGM VGSEISGRTP VSQIPFYNSE LPCDFMQPPR PIQSPQHQQQ 3420
MGQVLQQQGI QPGSVNSPST QTFMQTNERR QVGPPSFVPD SSSIPVGSPN FHPVKQGHGS 3480
LPGTSFQQSP VRPPFTPALP APPAAASSSL PCGPDPAVTH GQSYPGSSQS LIQLYSDIIP 3540
EEKGKKKRTR KKKKDDDAES TKAPSTPHSD ITAPLTPSIA ETTSTPAGST PSELPPQAEQ 3600
ESVEPAGQST PSVAAGQPCA GLESRLPNGD SSQETPNQQS YANAEVDKVS METPAKIEEI 3660
KLEKAETELS PGQEEPKLEE QTGSKVEENA DACPVSSAQS PPHSVGTSAA KGDSGNELLK 3720
HLLKNKKSSS LLSQKPEGSF CAEDDCTKDN KLVEKQNPAQ GLXXXXXXXX XXXXXXXXXX 3780
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXQNNLSNPP 3840
TPPASLPPTP PPMACQKMAN GFATTEELAG KAGVLVNHEV TKTLGPKPFH LPFRPQDDLL 3900
ARAIIAQGPK TVDVPASLPT PPHNNQEELR IQDHCGDRDT PESFVPSSSP ESVVGVEVSR 3960
YPDLSLVKEE PPEPVPSPII PILPSTAGKS SESKRDIKSE PGTLYFASPF GSSPNGPRSG 4020
LISVAITLHP AAAENISSVV AAFSDLLHVR IPNSYEVSNA PDVPSVGLVS SHRVNPGLEY 4080
RQHLLLRGPP PGSANPPKLA SPYRLKQPNV PFPPTSNGLS GYKDSSHGVT ESTALRPQWC 4140
CHCKVVILGS GVRKSFKDLT FGNKDFRENS RRVEKDIVFC SNNCFILYSS TAQAKNSESK 4200
ESTPSLPQSP MRETPSKAFH QYSNNISTLD VHCLPQLQEK ASPPASPPIA FPPAFEAAKV 4260
EAKPDELKVT VKLKPRLRTV HGGFEDCRPL NKKWRGMKWK KWSIHIVIPK GTFKPPCEDE 4320
IDEFLKKLGT SLKPDPVPRD YRKCCFCHEE GDGLTDGPAR LLNLDLDLWV HLNCALWSTE 4380
VYETQAGALI NVELALRRGL QMKCVFCHKT GATSCHRFRC TNIYHCAKAC MFXXXXXXXX 4440
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXL IFHTIGQLLP 4500
QQMQAFHSPK ALFPVGYEAS RLYWSSRYAN RRCRYLCSIE EKDGRPVFVI RIVEQGHEDL 4560
VLSDTSPKGV WDKILEPVAC VRKKSEMLQL FPAYLKGEDL FGLTVSAVAR IAESLPGVEA 4620
CENYTFRYGR NPLMELPLAV NPTGCARSEP KMSAHVKRFV LRPHTLNSTS TSKSFQSTVT 4680
GELNAPYSKQ FVHSKSSQYR KMKTEWKSNV YLARSRIQGL GLYAARDIEK HTMVIEYIGT 4740
IIRNEVANRK EKLYESQNRG VYMFRMDNDH VIDATLTGGP ARYINHSCAN CVAEVVTFER 4800
GHKIIISSNR RIQKGEELCY DYKFDFEDDQ HKIPCHCGAV NCRKWMN 4847
Nucleotide Sequence
(Fasta)
CCTCGAAGTA GAGGAAAAAC GGCGGTGGAA GATGAGGACA GCATGGATGG CCTGGAGACA 60
ACAGAAACAG AAAATATTGT GGAAACAGAA ATCAAAGAAC AGTCTGCAGA AGAGGATGCT 120
GAAGCAGAAG TGGATAACAG CAAACAGCCA GTCCCAGCTC TGCAACGATC TGTGTCTGAG 180
GAATCTGCAA ACTCCTTGGT CTCTGTTGGT GTAGAAGCCA AAATCAGTGA ACAGCTCTGC 240
GCTTTTTGTT ACTGTGGGGA AAAAAGTTCC TTAGGACAAG GAGATTTAAA ACAGTTCAGG 300
GTAACACCTG GATTTATCTT GCCATGGAGA AACCAGCCTT CTAGCAAGAA GGACATTGAT 360
GACAACAGCA ATGGAACCTA TGAGAAAATA CAAAACACAG CACCACGAAA ACAAAGAGGA 420
CAGAGAAAAG AACGATCGCC TCAGCAGAAT ATAGCATCTT GCGTAAGTGT AAGCACCCAA 480
ACAGCTGCAG ATGATCAGGC TGGTAAACTA TGGGATGAAC TCAGTCTGGT TGGCCTTCCA 540
GATGCCATTG ATGTCCAAGC CTTATTTGAT CCTACAGGCA CTTGTTGGGC TCATCACCGT 600
TGCGTGGAGT GGTCACTGGG AGTATGCCAG ATAGAAGAAC AATTGTTAGT AAACGTGGAC 660
AAAGCTGTTG TCTCAGGGAG CACAGAACGA TGTGCATTTT GTAAGCACCT TGGAGCCACT 720
ATCAAATGCT GTGAAGAGAA ATGTACCCAG ATGTACCATT ATCCTTGTGC TGCTGGCGCC 780
GGCACCTTTC AGTTCAGTCA CTTCNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA 1020
CAATCGGGAG AAGATAGCAA GATGCTAGTG TGTGATACGT GTGACAAAGG GTATCATACT 1080
TTTTGTCTTC AACCAGTTAT GAAATCAGTA CCAACCAATG GCTGGAAATG CAAAAATTGC 1140
AGAATATGTG TAGAGTGTGG CACACGGTCT AGTTCACAGT GGCATCACAA TTGCCTGATA 1200
TGTGACAGTT GTTACCAACA ACAGGATAAC TTATGTCCTT TCTGTGGGAA GTGCTGTCAT 1260
CCAGAATTGC AGAAAGACAT GCTTCATTGT AACATGTGCA AAAGATGGGT TCACTTAGAA 1320
TGTGACAAAC CAACAGATCA TGAACTGGAT TCTCAGCTCA AAGAAGAGTA TATCTGCATG 1380
TATTGTAAAC ACTTAGAAGC TGAGATGGAT CCATTACAGC CAGGTGATGA AGTGGAGATG 1440
GCTGAACTCA TTACAGATAA CAATGAGATG GAGGTTGAAG GACCTGAAGA CCAAATGGTA 1500
TTTTTGGAGC AAGCTGTTAA TAAAGATGTC AATGGTCAGG AGTCCACACC TGGAATTGTT 1560
CCAGATGAAG TTGAAGTCCA CACTGAAGAG CCACAGAAGA GTAATCCACC AGAAAGTCTT 1620
GACACAGATG GTCTTCTCAC TTCTGAATCA TCCTCAAATA AAATGAATTC TGAATTGGAA 1680
AATCAGACTT CTCATGAAGT TAATAGTGAA AAAATGGAAA TGTCTTCTAA AGTGATGCAC 1740
ACTTGTGATG AAGATCAAAA TGAAGACAAA ATGGAAGTGA CAGAAAACAT CGAAGTCATT 1800
ACACATCAGA TCATTGTGCA GCAAGAGGAG CTGCAGTTAG AGGAAACTAA AACAGTGGTA 1860
TCTAAAGAAG AATTGCGGCT TCCAAAATTA ACCATTGAGT CCGTTGCTGT TGTTCCACCA 1920
GAAACCTTCG TTTCCCCACA TGAGGAAAGT ACTTCATTAT GTTCTAAGAA ACAGTTGCTT 1980
ATAGAAAGGA TACAAGAGGA AATGGAAAAG AAAGAACATT CTGAATTTCC TGCTGGATTT 2040
GTGGACTTTG AAATGACTCC TGCAATTGAG AGTTGTGTGA AAGATGGTTC ATGTCGAGGA 2100
GACAAATCTA TGAAATTATC ATCTGAGTCA GAGTCATCAT TTTCATCATC AGCAGACATA 2160
AGCAAGGCAA ATGTGTCTTC CTCTCCAACA CTGTCCTCAG ACTTACCTTC ACATGAAATG 2220
CTGCACAGTT ACCCTTCAAC TCTTGGTTCT GCTGCTGGGA ACATCCTGCC AACAACATAT 2280
ATCTCAGTCA CTCCAAAAAT TGGCATGCGT AAACCAGCTA TTACCAAAAG AAAATTTTCT 2340
CCTGGTAGAC CACGGTCCAA ACAGGGGGCA TGGAGTGCCC ATAATACAGT GAGCCCACCT 2400
TCCTGGTCCC CAGACATTTC AGAAGGTCGG GAAATTTTTA AACCCAGGCA GCTTCCTGGC 2460
AGTGCCATTT GGAGCATCAA AGTGGGCCGG GGATCTGGGT TTCCAGGAAA GCGGAGACTC 2520
CGTGGTGCAG GACTGTNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2580
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2700
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2760
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNAT CACTAAAGTG 2820
ATTCTTAGCA AAGGTTGGAG GTGTCTTGAG TGCACAGTGT GTGAGGCCTG TGGGAAGGCC 2880
ACCGACCCGG GAAGACTCCT GCTGTGTGAT GACTGTGACA TAAGTTACCA CACCTATTGC 2940
CTAGATCCCC CGTTGCAGAC AGTTCCCAAA GGAGGCTGGA AGTGCAAATG GTGTGTTTGG 3000
TGCAGACACT GTGGAGCAAC ATCTGCAGGT CTAAGATGTG AATGGCAAAA CAATTACACG 3060
CAGTGTGCTC CTTGTGCAAG CTTATCTTCC TGTCCGGTCT GCTATCGAAA CTATAGAGAA 3120
GACGACCTTA TTCTGCAATG TAGACAATGT GATAGATGGA TGCATGCAGT TTGTCAAAAC 3180
TTAAACAATG AGGAAGAAGT GGAAAATGTG GCAGACATTG GTTTTGACTG TAGCATGTGC 3240
AGACCCTATA TGCCTGCGTC TAATGTGCCT TCCTCAGACT GTTGTGAATC TTCACTTGTA 3300
GCACAAATTG TCACAAAAGT AAAAGAGCTA GNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3540
NNNNAAGGAG AACTTATGGA TTGTGATGGA AAATCAGAAT CTAGTCCTGA GCGGGAAGCT 3600
GTGGATGATG AAACTAAGGG AGTAGAAGGA ACAGATGGTG TCAAAAAGAG AAAAAGGAAA 3660
CCATACAGAC CAGGTATTGG GGGATTTATG GGGCGGCAAA GAAGTCGAAC TGGGCAAGGG 3720
AAAACCAAAA GATCTGTGAT CAGGAAAGAT TCCTCAGGCT CTATTTCTGA GCAATTACCT 3780
AGCAGAGATG ATGNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3900
NNNNNNNNNN NNNNNNNNNN NNNNGAAGCT TTCTTTGGAA AAGATCTTCT AGATTCAAGT 3960
AGACGAAACA AGCTGAGTTT AGATAATCTA TCAGAAGATA CAGCTCAGCT TTCATATAAA 4020
ACAAATAGGA GCACAGGTTT CTTGGATCCT TCCTTAGATC CCCTACTTAG TGCATCTTCA 4080
GCTCCAGCAG AACCTGGAAT TCAAGNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4200
NNNNNNNNNN ATATTCCTGT TGCTGATGAT CCTTCCTCTT TGCCTCAACA AAGTGTCAAT 4260
CAGAGTTTAC GACCATTAAG TGAAGAGCAA CTAGATGGAA TCCTCAGTCC TGAACTAGAC 4320
AAAATGGTCA CAGATGGAGC AATTCTTGGA AAGTTATATA AAATTCCAGA GCTCGGAGGG 4380
AAGGATGTTG AAGATTTATT TACTGCTGTA CTTAGTCCTG CAGCCACTCA GCCACCTCCA 4440
TTACCACAGC CCAACCCCCA AACACAGCTG TTGCCATTAC ACAGTCAGGA TGTTTTTTCA 4500
CGGATGCCAC TCATGAATGG CCTTATTGGA CCCAGTCCTC ACCTCCCACA TAATTCTTTG 4560
CCTCCTGGCA GTGGACTAGG AACTTTCTCA TCAATAGCAC AATCCCCTTA TCCTGATGCC 4620
AGNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4680
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4740
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4800
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNATT GGACTACTAG AGTGAAACAG 4860
ATCGCTAAGT TGTGGAGAAA AGCAAGCTCT CAGGAAAGAG CACCATATGT GCAAAAAGCC 4920
AGAGATAACA GGGCTGCTTT ACGCATTAAT AAAGTGCAGA TGTCAAATGA CTCAATGAAA 4980
AGGCAGCAAC AGCAAGATAG CATCGATCCC AGCTCTCGTA TTGATTCAGA TCTTTTTAAA 5040
GATCCATTAA AGCAAAGAGA ATCGGAACAT GAACAGGAAT GGAAATTTAG ACAGCAAATG 5100
CGTCAGAAAA GTAAGCAGCA AGCTAAAATT GAAGCCACAC AGAAACTTGA ACAGGTGAAA 5160
AATGAGCAGC AGCAGCAGCA GCAACAGCAG CAACAGTTTG GTTCTCAGCA TCTTCTGATG 5220
CAGTCTGGTT CAGACACCCC AAGTAGTGGG ATACAGAGCC CCTTGACACC TCAGCCTGGG 5280
AATGGAAATA TGTCTCCTGC ACAGTCATTC CATAAAGACC TGTTTACAAA ACAACTATTT 5340
ATCACATCTT CAGATGATGT GTTTGTAAAA CCACACGCAC CACCTCCTCC TCCAACCCCA 5400
TCTCGGATTC CTGTTCCGGA GAGCCTTTCT CAGTCTCAGA CTTCTCAGCC GCCTTCACCA 5460
CAAATGTTTT CACCTGGATC CTCTAACTCC CGACCACCAT CTCCAATGGA TCCTTATGCA 5520
AAAATGGTCG GTACTCCTAG ACCACCTCCT GGGGGCCATA GTTTTCCCAG AAGAAACTCT 5580
GCACCGATGG AAAACTGTAC ACCTTTGTCA TCGGTAACTA GGCCCATTCA GATGAATGAG 5640
ACAACAGCAA ATAGGCCATC CCCAGTCAGA GATTTATGTT CTTCCTCCAC AACAAATAGT 5700
GACCCATATG CAAAGCCTCC AGACACACCT AGGCCTGTGA TGACAGATCA GTTTCCCAAA 5760
CCCTTGGGCC TACCCCGCTC TCCTATAGTT TCAGAACAAG CTGCAAAAGG CCCTGTAGCA 5820
GCTGGAACCA ATGATCACTT TACTAAACCA TCTCCCAGGG CAGATGTGTT TCAAAGACAA 5880
CGGGTATCTG ACACATATGC ACGACCCTTG CTGACACCTG CACCTCTTGA TAGTGCTCCT 5940
GGACCATTTA AGACTCCAAT GCAGCCACCT CCGTCCTCTC AGGACCCTTA TGGATCAGTG 6000
TCACAGGCAT CAAGGCGACT GTCTGTTGAC CCTTATGAAA GGCCTGCTTT GACACCAAGA 6060
CCTGTAGATA ATTTTTCTCA TCATAGTCAG TCAAATGATC CGTATAGTCA GCCTCCCCTA 6120
ACGCCACATC CAGCAATGAC TGAATCATTT GCCCATCCTT CAAGGGCTTT TTCCCAGCCT 6180
GGAACCATAT CAAGGCCAAC ATCTCAGGAT CCATATTCCC AACCCCCAGG AACTCCACGA 6240
CCTGTTGTAG ATTCTTATTC CCAACCCTCA GGACCAGCTC GATCCAATAC AGACCCTTAC 6300
TCCCAACCAC CTGGAACTCC CCGGCCTACC TCTATTGACC CATACAGTCA GCAGCCCCCA 6360
ACTCCACGAC CATCTCCACA AACTGACTTG TTTGTTACAT CTGTAGCTAA TCCTAGGCAC 6420
TCAGATCCAT ATGCTCATCC TCCTGGAACA CCAAGACCTG GAATTTCTGT TCCTTACTCT 6480
CAGCCACCAG CAACACCAAG GCCAAGGATT TCAGAGGGGT TTAATAGGTC CTCAATGACA 6540
AGACCGGTCC TCATGCCAAA TCAGGATCCT TTCCTGCCAG CAGCACAAAA CCGAGGACCA 6600
GCTTTACCTG GCCCGTTGGT AAGGCCACCT GATTTGTGTT CCCAGACACC TCGGCCACCA 6660
GGACCTGGTC TTTCAGACAC ATTTAGCCGT GTTTCCCCAT CTGCTGCTCG TGATCCCTGT 6720
GATCAGCCTG TGGTGACTCC AAGGTCTCAG GCTGGCTCTT TCGGAGCAAC TCAAGTTGCT 6780
CATGACATTG CTAATCAGCC AGGGCCTGGA TCGGAGGGGA GCTTTGGGAC GTTGGCAGGC 6840
TCTCCTGCGA GCTCTCAGGG CCAGCAGTTC TCTAGTGTCT CTCAGCTGTC TGGGCCTGTA 6900
CCCACTTCAG GAGCAACGGA TACACACAAT ACTGTAAATA TGTCTCAAGC AGATACAGAG 6960
AAATTGAGAC AGAGGCAGAA GTTACGTGAA ATCATTCTTC AGCAGCAACA GCATAAGAAG 7020
ATTGCAGGTC GACAGGAGAA GGGCTCGCAG GATTCAGCAG TAGTGCCTCA TCCAGGGCCC 7080
CTTCCACACT GGCAACCAGA GAGTATCAAC CAGGCTTTCA CTAGACCCCC ACCTCCCTAT 7140
CCTGGGAATA TTAGGTCTCC TGTTGTCCCT CCTTTAGGAC CGAGATATGC AGTCTTCCCA 7200
AAAGATCAAC ACGGACCCTA TCCTCCTGAT GTTGCTGGTG TGGGGATGAG ACCTCATGGA 7260
TTTAGATTTG GGTTTCCAGG AGGTAGTCAT GGTGTCATGT CAAGTCAGGA ACGCTTCCTT 7320
GGGCCTCCTC AACAAATACA AGGATCTGGA GTTCCTCCGC AGCTGAGAAG ATCAATATCT 7380
GTAGATATGC CGAGGCCTTT AAATAACTCA CAAATGAATA ACCCAATTGG GCTTCCTCAG 7440
CATTTTCCAC CGCAGAGTCT ACCAGTGCAG CAGCACAACA TACTGGGCCA AGCGTTTATT 7500
GAATTGAGGC ACAGGGCCCC TGATGGAAGG CCACGGCTGC CTTTCAATGC TCCTCCTGGC 7560
AGCGTTGTGG AGGCACCTTC TCATGCAAGA CATGGAAACT TCATTCCCCG GCCAGACTTT 7620
GCAGGCCCAA GACACACAGA CCCCATGAGA CGACCTCCCC AAGGCCTACC TAATCAGCTG 7680
CTTGTACATC CAAATTTGGA ACAAGTGCCA CCATCTCAGC AAGAGCAAGG TCATCCTGTC 7740
CATTCATCTT CTATGGTCAT GAGGTCTCTG AGTCACTCGT TAGGTGGAGA ATTTTCAGAG 7800
GCTCCTTTGT CAACAACTAC ACCAGCTGAA ACAACACCTG ATAATTTACA GATAACCAGC 7860
CAGTCTTCCG ATGGTCTGGA AGAAAAACTT GATTCCGATG ACCCTTCTGT GAAAGAACTG 7920
GATGTTAAAG ACCTTGAGGG GGTTGAAGTC AAAGACTTAG ATGATGATGA TCTTGAAAAT 7980
TTAAATTTAG ATACAGAGGA TGGCAAAGGA GATGAATTGG ATACCTTAGG TAATTTGGAA 8040
ACTAATGATC CCAACCTGGA TGACCTGTTA AGGTCAGGAG AGTTTGATAT CATTGCTTAT 8100
ACAGATCCAG AACTTGACCT GGGAGATAAA AAAAACATGT TTAATGAGGA ACTAGACCTT 8160
AATGTTCCAA TTGATGATAA GTTAGATAAT CAGTGTGTAT CTGTTGAACC AAAAAAAAAG 8220
GAGCAAGAAG ACAAAACAGT GGTTCTCACT GATAAACATT CACCACAGAA AAAGTCCACT 8280
GTTAGCAATG AGGTAAAAAC GGAAGTACTG TCTCCAAATT CTAAAGTGGA ATCCAAATGT 8340
GAAATTGAGA AAAGTGATGA GAGTAAAGAT AATGTTGACA CTCCCTGCTC ACAGGCTTCT 8400
GCTCACACGG ACCTAAATGA TGGAGAAAAG ACTTGTTCTC AGCCTTGTGA TCCAGAAACA 8460
CTCGAGAATA GAACTAACCG AGAAACTGCT GGTTCCAGTG CAAGTGTCAT TCAGGCATCC 8520
ACTCAGCTGC CTGCTCAAGA TGTAGTAAAC TCTTGTGGCA TAAGTGGATC GACTCCAGTT 8580
CTCTCAAGTT TACTTGCTAA TGAAAAAGCT GACAATTCAG ATGTTAGGCC ATTGGGATCA 8640
CCACCAGCAA CTCTGCGGGC CTCACCCTCC AATCAGGTGT CGAGTTTGCC TCCTTTAATG 8700
GCACCACCTG GCCATGTTTT GGATAATACC ATGAATTCTA ATGTAACAGG GATCTCTAGG 8760
GTAAACCATG CTTTTTCTCA GGGTGGGCAG GTAAATCCGG GATTCATTCA GGGTCAGTCA 8820
ACAGTTAACC ACAGTTTGGG GACAGGAAAA CCTACAAATC AAACCATACC TCTAACAAGT 8880
CAGTCCGGTA CCAGTGGCAT GTCTGGACCC CAGCAGCTAA TGATTCCTCA GACTTTGGCC 8940
CAGCAGAATA GAGAGAGGCC CCTCCTCCTA GAGGAACAGC CTCTGCTTCT ACAGGATCTT 9000
TTGGATCAAG AGAGGCAAGA ACAGCAGCAA CAAAGGCAGA TGCAAGCCAT GATTCGTCAG 9060
CGGTCAGAAC CATTCTTCCC TAATATGGAT TTTGATGCAA TTACAGATCC TATAATGAAA 9120
GCCAAAATGG TGGCCCTTAA AGGCATAAAT AAAGTGATGG CACAGAACAA TCTGGGCATG 9180
CCACCAATGG TGATGAGCAG NNNNNNNNNN NNNNNNCAGT CGTTGGCTGG ACAGAACAGT 9240
GAAGGCCATA ACCCTATACC ACAAGCCCCT CAGGATGGCA GTATAACACA TCAGATTTCT 9300
AGGCCTAATC CTCCAAATTT TGGTCCAGGC TTTGTCAACG ATTCACACCG CAAGCAGTAT 9360
GAAGAATGGC TTCAGGAGAC CCACACACTT CTCTTGATGC AGCAGAGATA TCTTGAGCGA 9420
ATTGGTGCGC ACAGAAAAAC TAAGAAGGCC CTGTCAGCTA AGCAACGAAC TGCCAAGAAA 9480
GATGGGCGTG AGTTTCCAGA AGAAGATGCT GAACAACTCA AACATGTTAC TGAACAGCAG 9540
AGCATGGTTC AGAAACAGCT AGAACAGATT CGTAAACAGC AGAAGGAGCA TGCAGAGCTG 9600
ATGGAAGATT ATCGAATCAA GCAGCAGCAG CAGCAGCAAC AGTGTGCCAT GGCCCCGCCT 9660
GCTGTGATGG CTGGTGTCCA GCCCCAGCCA CCCCTAGTTC CGGGTGCCAC TCCACCCACC 9720
ATGAGCCAGC CCAGCTTTCC CGTGGTGCCA CCGCAGCTTC AGCACCAGCA CCAGCAGCAC 9780
ACAGCAGTCC TTCCTAGCCA TAGCAGTCCT GCTAGAATGC CTAGTTTACC TGGATGGCAA 9840
CCCAGCAGTG CTCCTGCTCA TCTCCCCCTC AATCCTCCTA GGATTCAGCC TCCAGTTAAC 9900
CAGTTACCAC TCAAAACTTG CACACCAGCC CCAGGGGCAG CGTCAAATGC AAATCCACAG 9960
AGTGGACCAC CTCCACGGGT AGAGTTTGAT GACAACAACC CCTTTAGTGA AAGTTTTCAA 10020
GAACGGGAAC GTAAAGAACG TTTACGAGAG CAGCAGGAAA GACAACGAAT CCAGCTCATG 10080
CAAGAAGTAG ATAGACAAAG AGCTTTGCAG CAAAGGATGG AAATGGAACA GCATGGCATG 10140
GTGGGCTCAG AGATAAGTGG TAGGACGCCT GTGTCCCAGA TTCCATTCTA TAACTCTGAA 10200
CTACCTTGTG ATTTTATGCA ACCCCCAAGA CCTATTCAGT CTCCACAACA CCAGCAGCAA 10260
ATGGGGCAGG TTTTACAGCA GCAGGGTATA CAGCCAGGAT CTGTTAATTC ACCCTCCACC 10320
CAAACTTTCA TGCAAACCAA TGAGCGAAGG CAGGTAGGAC CTCCCTCATT TGTTCCTGAT 10380
TCATCATCAA TTCCTGTTGG AAGCCCAAAT TTCCATCCTG TTAAGCAGGG ACATGGGAGT 10440
CTTCCTGGAA CCAGCTTCCA GCAGTCTCCA GTGAGGCCTC CTTTTACACC TGCTTTGCCA 10500
GCACCACCTG CTGCAGCTAG TAGCAGTCTC CCGTGTGGCC CAGACCCTGC TGTAACCCAT 10560
GGGCAGAGTT ATCCAGGATC ATCCCAATCT CTCATTCAGT TGTATTCTGA TATAATTCCA 10620
GAAGAAAAAG GGAAAAAGAA AAGAACAAGA AAAAAGAAGA AAGATGATGA TGCAGAGTCC 10680
ACCAAGGCAC CGTCAACTCC CCATTCAGAT ATAACTGCTC CACTGACTCC AAGCATTGCA 10740
GAAACCACCT CCACTCCTGC AGGGAGCACA CCCAGCGAGC TTCCTCCACA AGCAGAGCAG 10800
GAATCCGTGG AGCCAGCTGG CCAGTCCACT CCCAGTGTGG CAGCAGGCCA GCCGTGTGCA 10860
GGATTAGAGA GCAGACTGCC CAACGGTGAT TCCTCACAAG AAACTCCAAA TCAACAGTCT 10920
TATGCAAATG CAGAGGTGGA CAAGGTCTCC ATGGAAACTC CTGCCAAAAT TGAAGAGATA 10980
AAACTGGAAA AGGCTGAGAC AGAGTTAAGC CCAGGCCAAG AGGAGCCTAA ATTGGAGGAA 11040
CAAACTGGTA GTAAGGTAGA AGAAAATGCC GACGCCTGTC CTGTCTCCTC AGCACAGAGT 11100
CCTCCCCATT CTGTCGGGAC CTCTGCTGCC AAAGGAGATT CAGGGAATGA ACTTCTGAAA 11160
CACTTGTTAA AAAATAAAAA ATCATCTTCC CTTTTAAGTC AAAAACCTGA AGGCAGTTTT 11220
TGTGCAGAGG ATGACTGTAC AAAGGATAAT AAACTAGTTG AGAAGCAAAA CCCAGCTCAA 11280
GGATTGNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11340
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11400
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11460
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNCAGA ATAATTTAAG TAATCCTCCA 11520
ACACCCCCTG CCTCTCTTCC TCCTACACCA CCTCCTATGG CTTGTCAGAA GATGGCAAAT 11580
GGTTTTGCAA CAACTGAAGA ACTTGCTGGA AAAGCTGGAG TGTTAGTGAA CCATGAAGTT 11640
ACTAAAACTC TGGGACCTAA ACCATTTCAC CTCCCGTTCA GACCACAAGA TGACTTGTTA 11700
GCCAGAGCTA TTATTGCTCA AGGCCCAAAG ACAGTTGATG TTCCAGCTTC ACTTCCAACA 11760
CCACCTCATA ACAATCAGGA AGAATTGAGG ATACAGGATC ACTGTGGTGA TCGAGATACT 11820
CCCGAAAGCT TTGTTCCCTC GTCCTCTCCT GAGAGTGTGG TTGGGGTGGA AGTGAGCCGG 11880
TATCCGGACC TGTCATTGGT CAAAGAGGAG CCTCCGGAGC CAGTGCCATC CCCCATCATT 11940
CCAATTCTTC CTAGCACTGC TGGGAAAAGT TCAGAATCAA AAAGGGACAT CAAAAGTGAG 12000
CCAGGCACTT TATATTTTGC ATCACCTTTC GGTTCATCCC CAAATGGTCC CAGATCAGGT 12060
CTCATATCTG TAGCAATTAC CCTGCATCCT GCAGCTGCTG AGAACATCAG CAGTGTTGTG 12120
GCTGCGTTTT CCGACCTTCT TCACGTTAGA ATTCCTAACA GCTATGAGGT TAGTAACGCT 12180
CCAGATGTTC CATCTGTGGG TTTGGTCAGT AGCCACAGAG TAAACCCAGG TTTGGAGTAT 12240
CGACAGCATT TACTTCTTCG TGGGCCTCCT CCAGGATCTG CAAACCCTCC CAAATTAGCG 12300
AGCCCGTACC GGCTGAAGCA GCCTAATGTA CCATTTCCTC CAACAAGCAA TGGTCTCTCT 12360
GGGTATAAGG ATTCCAGTCA TGGTGTCACA GAAAGTACAG CTCTCAGGCC GCAGTGGTGC 12420
TGTCACTGTA AAGTAGTTAT TCTTGGAAGC GGTGTGCGGA AATCATTCAA GGATCTGACC 12480
TTTGGAAACA AGGATTTCCG AGAAAATTCT AGGAGAGTGG AAAAGGACAT TGTTTTTTGT 12540
AGTAATAACT GCTTTATTCT TTATTCATCA ACTGCACAAG CAAAAAACTC AGAAAGCAAG 12600
GAGTCCACTC CCTCGTTGCC ACAGTCCCCT ATGAGAGAGA CGCCTTCCAA AGCGTTTCAT 12660
CAGTACAGTA ACAACATCTC CACTTTGGAT GTGCACTGTC TCCCACAGCT CCAGGAGAAG 12720
GCTTCTCCCC CCGCATCGCC GCCCATCGCC TTCCCTCCTG CTTTCGAAGC CGCCAAAGTG 12780
GAGGCCAAGC CAGATGAGCT TAAGGTAACG GTCAAGTTAA AGCCTCGGCT AAGAACTGTC 12840
CATGGTGGGT TTGAAGACTG TAGGCCGTTA AATAAAAAGT GGAGAGGAAT GAAATGGAAG 12900
AAATGGAGCA TTCATATTGT AATCCCCAAG GGGACATTTA AACCACCTTG TGAGGATGAA 12960
ATAGACGAAT TTCTAAAGAA ATTGGGCACT TCCCTTAAAC CTGATCCCGT GCCCAGAGAC 13020
TATCGGAAAT GTTGCTTTTG TCACGAAGAA GGTGATGGAT TGACAGACGG ACCAGCAAGG 13080
CTGCTCAACC TTGATTTGGA TCTGTGGGTC CACTTGAATT GTGCTCTGTG GTCCACGGAA 13140
GTCTATGAGA CTCAGGCTGG TGCCTTAATA AACGTGGAGC TAGCACTGAG GAGAGGCCTG 13200
CAGATGAAAT GTGTCTTCTG TCACAAGACA GGTGCCACCA GTTGTCACAG ATTTCGATGC 13260
ACCAACATTT ATCACTGCGC CAAAGCATGC ATGTTTTNNN NNNNNNNNNN NNNNNNNNNN 13320
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13380
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13440
NNNNNNNNNN NNNNNNNNNN NNNNNGCCTC ATCTTTCATA CAATCGGTCA GCTGCTTCCA 13500
CAACAGATGC AAGCATTTCA TTCTCCTAAA GCGCTCTTCC CCGTGGGCTA CGAGGCCAGC 13560
CGGCTGTACT GGAGCTCGCG CTATGCCAAC AGGCGCTGCC GCTACCTGTG CTCCATCGAG 13620
GAGAAGGACG GGCGCCCCGT GTTCGTTATC AGGATTGTGG AGCAAGGCCA TGAAGACCTC 13680
GTCTTAAGTG ACACCTCACC TAAAGGTGTT TGGGATAAAA TTTTGGAACC TGTAGCGTGT 13740
GTGAGAAAAA AATCTGAAAT GCTGCAGCTT TTTCCAGCAT ACTTAAAAGG AGAAGATCTG 13800
TTTGGCTTGA CGGTGTCTGC GGTGGCGCGG ATAGCTGAAT CGCTACCTGG GGTGGAGGCA 13860
TGCGAAAACT ATACCTTCCG ATATGGCCGA AATCCTCTCA TGGAACTGCC GCTTGCCGTT 13920
AACCCCACAG GTTGTGCCCG TTCTGAACCT AAAATGAGTG CCCATGTCAA GAGGTTTGTG 13980
TTAAGGCCTC ACACCTTGAA CAGCACCAGC ACCTCAAAGT CATTTCAGAG CACAGTCACT 14040
GGAGAACTGA ATGCACCTTA TAGTAAGCAG TTTGTCCACT CCAAGTCATC GCAGTATCGG 14100
AAAATGAAGA CTGAATGGAA ATCCAACGTG TATCTGGCCC GGTCTCGGAT TCAGGGGCTG 14160
GGCCTATATG CTGCTAGAGA CATTGAGAAG CACACCATGG TCATTGAGTA TATCGGAACT 14220
ATCATTCGAA ATGAAGTAGC GAACAGGAAG GAGAAACTTT ATGAGTCTCA GAACCGCGGC 14280
GTGTACATGT TCCGCATGGA CAACGACCAT GTGATTGACG CCACGCTCAC AGGAGGGCCT 14340
GCAAGGTACA TCAACCATTC GTGTGCAAAT TGTGTGGCTG AAGTGGTGAC TTTTGAGAGA 14400
GGACACAAAA TTATCATCAG CTCCAATCGG AGAATCCAGA AGGGAGAAGA GCTCTGCTAT 14460
GACTATAAGT TTGACTTTGA AGATGACCAG CACAAGATCC CGTGCCACTG TGGAGCTGTG 14520
AACTGCCGGA AGTGGATGAA CTGA 14545
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Chs-0076 ENSCSAP00000002864.1 Chlorocebus sabaeus 87 0.0 3039
WERAM-Nol-0046 ENSNLEP00000005663.2 Nomascus leucogenys 87 0.0 3015
WERAM-Pat-0168 ENSPTRP00000046674.3 Pan troglodytes 86 0.0 3009
WERAM-Hos-0018 ENSP00000347325.3 Homo sapiens 86 0.0 3004
WERAM-Gog-0210 ENSGGOP00000027941.1 Gorilla gorilla 85 0.0 2990
WERAM-Poa-0169 ENSPPYP00000020408.2 Pongo abelii 86 0.0 2980
WERAM-Caj-0209 ENSCJAP00000036628.3 Callithrix jacchus 86 0.0 2977
WERAM-Otg-0078 ENSOGAP00000005885.2 Otolemur garnettii 86 0.0 2962
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 86 0.0 2962
WERAM-Paa-0005 ENSPANP00000006150.1 Papio anubis 83 0.0 2899
WERAM-Aim-0154 ENSAMEP00000014067.1 Ailuropoda melanoleuca 83 0.0 2864
WERAM-Caf-0059 ENSCAFP00000007370.4 Canis familiaris 83 0.0 2856
WERAM-Orc-0115 ENSOCUP00000009766.3 Oryctolagus cuniculus 82 0.0 2823
WERAM-Eqc-0189 ENSECAP00000020200.1 Equus caballus 85 0.0 2746
WERAM-Ict-0124 ENSSTOP00000012342.2 Ictidomys tridecemlineatus 81 0.0 2745
WERAM-Mup-0098 ENSMPUP00000009152.1 Mustela putorius furo 79 0.0 2693
WERAM-Tut-0198 ENSTTRP00000016174.1 Tursiops truncatus 80 0.0 2649
WERAM-Cap-0018 ENSCPOP00000001682.2 Cavia porcellus 77 0.0 2644
WERAM-Mum-0152 ENSMUSP00000043874.7 Mus musculus 78 0.0 2591
WERAM-Bot-0193 ENSBTAP00000028347.5 Bos taurus 78 0.0 2590
WERAM-Fec-0096 ENSFCAP00000008002.3 Felis catus 79 0.0 2588
WERAM-Dio-0019 ENSDORP00000002189.1 Dipodomys ordii 76 0.0 2556
WERAM-Ova-0045 ENSOARP00000005594.1 Ovis aries 76 0.0 2517
WERAM-Myl-0122 ENSMLUP00000010086.2 Myotis lucifugus 77 0.0 2494
WERAM-Mam-0056 ENSMMUP00000009467.2 Macaca mulatta 80 0.0 2481
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 68 0.0 2188
WERAM-Sah-0130 ENSSHAP00000013860.1 Sarcophilus harrisii 69 0.0 2177
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 66 0.0 2067
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 65 0.0 2049
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 66 0.0 2019
WERAM-Mod-0039 ENSMODP00000005827.3 Monodelphis domestica 68 0.0 2015
WERAM-Loa-0084 ENSLAFP00000006640.4 Loxodonta africana 84 0.0 2014
WERAM-Tag-0008 ENSTGUP00000000641.1 Taeniopygia guttata 65 0.0 1996
WERAM-Vip-0013 ENSVPAP00000001498.1 Vicugna pacos 81 0.0 1971
WERAM-Ptv-0086 ENSPVAP00000007862.1 Pteropus vampyrus 75 0.0 1889
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 83 0.0 1754
WERAM-Anc-0148 ENSACAP00000014142.2 Anolis carolinensis 58 0.0 1732
WERAM-Sus-0158 ENSSSCP00000023447.1 Sus scrofa 86 0.0 1595
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 81 0.0 1543
WERAM-Lac-0079 ENSLACP00000010253.1 Latimeria chalumnae 52 0.0 1483
WERAM-Fia-0086 ENSFALP00000007141.1 Ficedula albicollis 80 0.0 1479
WERAM-Prc-0090 ENSPCAP00000008256.1 Procavia capensis 75 0.0 1353
WERAM-Mae-0021 ENSMEUP00000001693.1 Macropus eugenii 66 0.0 1342
WERAM-Xet-0065 ENSXETP00000021458.2 Xenopus tropicalis 71 0.0 1301
WERAM-Leo-0127 ENSLOCP00000015481.1 Lepisosteus oculatus 70 0.0 1300
WERAM-Asm-0011 ENSAMXP00000001840.1 Astyanax mexicanus 54 0.0 972
WERAM-Dar-0173 ENSDARP00000122949.2 Danio rerio 52 0.0 861
WERAM-Ocp-0124 ENSOPRP00000012666.1 Ochotona princeps 74 0.0 856
WERAM-Orla-0036 ENSORLP00000004819.1 Oryzias latipes 50 0.0 845
WERAM-Pof-0024 ENSPFOP00000002462.1 Poecilia formosa 50 0.0 827
WERAM-Xim-0055 ENSXMAP00000005029.1 Xiphophorus maculatus 49 0.0 816
WERAM-Tar-0071 ENSTRUP00000014027.1 Takifugu rubripes 41 0.0 803
WERAM-Ten-0186 ENSTNIP00000018287.1 Tetraodon nigroviridis 40 0.0 792
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 65 0.0 765
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 61 0.0 744
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 63 0.0 734
WERAM-Ran-0049 ENSRNOP00000010349.7 Rattus norvegicus 97 0.0 710
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 65 0.0 698
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 64 1e-173 610
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 76 8e-169 594
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 58 5e-130 465
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 60 3e-126 452
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 94 2e-97 357
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 51 6e-97 355
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 32 4e-79 296
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 68 5e-66 253
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 37 4e-48 193
Created Date 25-Jun-2016