WERAM Information


Tag Content
WERAM ID WERAM-Prc-0037
Ensembl Protein ID ENSPCAP00000003628.1
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPCAG00000003597.1 ENSPCAT00000003867.1 ENSPCAP00000003628.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 2.70e-45 153 2973 4889
Me_Reader PHD 2.70e-15 56 769 4513
Organism Procavia capensis
Domain Profile
  HMT SET1

              SET1.txt   17 akkeiekeelviEYvGevirsevadkrek 45  
k++ e+++l++EY+ + ++++++++++
ENSPCAP00000003628.1 2973 RKQQKEHTNLMAEYRNKQQQQQQQQQQQQ 3001
45556778899999999844444443333 PP
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSPCAP00000003628.1 4774 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 4858
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSPCAP00000003628.1 4859 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 4889
******************************6 PP

  Me_Reader PHD

               PHD.txt   2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50 
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSPCAP00000003628.1 769 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 818
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSPCAP00000003628.1 820 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 866
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l ++ e+ + + C sC+
ENSPCAP00000003628.1 897 TCPICHSPYVEEDLLIQCRHCERWMHAGCESLFTEEEVEQaadEGFDCVSCQ 948
7*****99999999*****************98444444445445******7 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSPCAP00000003628.1 4408 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4440
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSPCAP00000003628.1 4467 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRTKCMFFKDKTMLCPMHK 4513
599996666665....6*9999*********99986666677789999886 PP

Protein Sequence
(Fasta)
MSPPPEESPM SPPPEASCLF PPFEESPLSP PPEESPLSPP PEASRLFPPP EDSPMSPPPE 60
DSPMSPPPEV SHLLPLPEVS RLAPPPEESP LSPPPEESPT SPPPEASRLS PPPEDSPTSL 120
PPEDLPASPP PENLLTSLLL KESPLSPLPG EPCLCPRPEE SQFCPQPEEL QFCHQPEEQR 180
LSPQPKELCL SPQPEELHLS PIPEGPHLSP RLEELHLSPA PKEPCLSPQP DEPHRSSPQP 240
EEPRLSPRPE EPCLSPRPEE PHLFPKPEEP HLSPKPEESC LPSQPGEPPE EPGLCPVPEE 300
LPLLPPRGEP HLSPLLREPA LLEPGEPPLS PLPEGLPMSA SGEPSLSPQL MPPDPLPPPL 360
SPIITAGAPP ALSPLGDLEY SFGAKGDSDP ESPLAAPILE TPISPPPEAN CTXXXPFPSM 420
FCLQHLLPQS FFQCSPALPL SIPSPLSMEK TVVISDEAES HEMETEKGLE RECPALEPSP 480
TSPLPSPIGE LSCPAPSPAP VLDDFSGLGE DPAPLDGTDT PDLQLEAGQT TSSSVACVLK 540
SSPVLLDPEE LAPVTPVEVY GPECKQVGQG SPCEMQEEPR APVAPTPPTL IKSDIVNEIS 600
NLSQGDASAS FPGSEPLLGS PDPEGGGSLS MELGVSTDVS PARDEGSLRL CTDSLPETDD 660
SLLCEAGTIV SGGKAEGDKG RRRSSPARSR IKQGRSSSFP GRRRPRGGAH GGRGRGRARL 720
KSTASSIETL VVADIDSSPS KEEEEEDDDT MQNTVVLFSN TDKFVLMQDM CVVCGSFGRG 780
AEGHLLACSQ CSQCYHPYCV NSKITKVMLL KGWRCVECIV CEVCGQASDP SRLLLCDDCD 840
ISYHTYCLDP PLLTVPKGGW KCKWCVSCMQ CGAASPGFHC EWQNSYTHCG PCASLVTCPI 900
CHSPYVEEDL LIQCRHCERW MHAGCESLFT EEEVEQAADE GFDCVSCQPY VIKPMVPVAP 960
PELVPIKVKE PEPQYFRFEG VWLTEAGMAV LRNLTMSPLH KRRQRRGRLG LPGEAGLEGS 1020
EPVDALGPDD KKDGDLDTDE LTKGEGGVEH MECEIKLEGL ISPDVESGKE ETEESKKRKR 1080
KPYRPGIGFM VRXXXXXXXX XXXXXXXXXX XXXXXXXXXX MPADLPAEGS VEQSLADVDE 1140
KKKQQRRGRK KSKLEDMFPA YLQEAFFGKE LLDLSRKALF AVGVGRPSFG PGTPKTKADG 1200
GPERKDPPTL QKGDDGPEVA DEESRGLEGK ADTPGPEDRG IKASPVPSDP EKPGTPGEGV 1260
LSSDLDRIPT EELPKMESKD LQQLFKDVLG SEREQHLGCG TPGLDGSRTP LQRPFLQGGL 1320
PLGNLPSNSP MDSYPGLCQS PFLDSRERGG FFSPEPGEPD SPWTGSGGTT PSTPTTPTTE 1380
GEGDGLSYNQ RSLQRWEKDE ELGQLSTISP VLYANINFPN LKQDYPDWSS RCKQIMKLWR 1440
KVPAADKAPY LQKAKDNRAA HRINKVQKQA ESQINKQTKV GDIARKTDRP ALHLRIPPQP 1500
GALGSPPPAA APTIFIGSPP TPAGLSTSAD GFLKPPAGTV PGPDSPGELF LKLPPQVPAQ 1560
VPSQDPFGLA PAYALEPRFP TAPPAYPPYP SPTGAPTQPP TMGTSSRSGT GPPGEFHTTP 1620
PGTPRHQPST PDPFLKPRCP SLDNLAVPES PGVAGGKPSE PLLSPPPFGE SRKALEVKKE 1680
ELGAASPSYG PPNLGFVDSP SSGSHLGGLE LKAPDVFKAP LTPRASQVEP QSPGLGLRPQ 1740
EPPPAQALAP SPPSQPDIFR PGPYPDPYAQ PPLTPRPQPP PPESCCALPP RSLPSDPFSR 1800
VPASPQSQSS SQSPLTPRPL SAEAFCPSPV TPRFQSPDPY SRPPSRPQSR DPFAPLHKPP 1860
RPQPPEIAFK AGPLAHAPLG AGGFPAALPS GPVGELHAKV PSGQPSSFAR SPGTSAFVGT 1920
PSPIRFTFPQ AVGEPSLKPP APQPAPPQPH GINSHFGPSA TLGKPQSTNY SAATGSFHTS 1980
GSPLGPSSGS TGEGYGLSPL RPPSVLPPPT PDGSLPYLSH GASQRAGITS PVEKREDPGA 2040
GMGSSVAAPE LPGTQDQGMP SLSQSELEKQ RQRQRLRELL IRQQIQRTLR QEETAAAAXX 2100
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XDKSSLVGLP QSKLCGAVLG PGAFPSDERL 2160
SRPPPPATPS AMDVNSRQLV GGSQAFYQRT PYSGPLPLQQ QQQQQLWQQQ QQQQATAATS 2220
MRLAMSTRFP STSGPELGRQ ALGSPLAGIP TRLPGPGEPG PGPPGPAQFI ELRHNVQKGL 2280
APGGPPFPGQ GHPQRARFYA VSEESHRLAT EGLRGLAVSG LPPQKPSAPP ATDLNNGLHP 2340
TPHTKGSNLS TGLELVSQPP SSTELGRPPP LALEPGKLPC EDSELDDDFD AHKALEDDEE 2400
LAHLGLGVDV AKGDDELGTL ENLETNDPHL DDLLNGDEFD LLAYTDPELD TGDKKDIFNE 2460
HLRLVESANE KAEREALMQG VEPGPSGPEE RPTPASATAA DASEPRLAPV LPEVKTKVEE 2520
GGPHPSPCQF TITTPKVEPA SAATSLGLGL KPGQSVIGSR DMRVGSGPFS SSGHTGEKGP 2580
FGATGGPPAH LLAPNPLSGP GGSSLLEKFE LESGTLALPS GHTASGDELD KMESSLVASE 2640
LPLLIEDLLE HEKKELQKKQ QLSAQLQPAQ QQQQQHSLLS TPPSAQAMPL PHEGSSTLAG 2700
PQQQLALGLG GARQPGLAQP LMPNQPPAHA FQQRLAPSMT MVSSQGHLLS GQHGGQAGLV 2760
QQSSQPVLTQ KPMGNVPLSM CMKPQQLAMQ QQIANSFFPD TDLDKFAAED IIDPIAKAKM 2820
VALKGIKKVM AQGNIGVAPG MNRQQVSLLA QRLSGGPGND LQNHVAAGSA QERSASDPSQ 2880
PRPNPPTFAQ GVINEADQRQ YEEWLFHTQQ LLQMQLKVLE EQIGVHRKSR KALCAKQRTA 2940
KKAGREFPEA DAEKLKLVTE QQSKIQKQLD QVRKQQKEHT NLMAEYRNKQ QQQQQQQQQQ 3000
QQQHSAVLAL SPSQSPRLLT KLPGQLLPGH GLQPPQGPLG GQPGGLRLPP GGMALPGQPG 3060
GPFLNTALAQ QQQQQHSGGA GALAGPSGGF FPGNLALRGL GPDSRLLQER QLQLQQQRMQ 3120
LAQKLQQQQQ QQQQQQHLLG QVAIQQQQQQ GPGVQANQAL GPKPQGLLPP NSHQGVLVQQ 3180
LSPQQPQGTQ SMLGTTQVAV LQQQQHPGAL GPQAPHRQVL LTQSRVLSSP QLAQQGQGLM 3240
GHRLVTSQQQ QQQQQHQQQG SMAGLSHLQQ GLMPHSGQAK LNAPPMGSLQ QQQQLQQHQF 3300
QLQQQQQQLQ QQQQQQLQQQ QLQQQQQQLQ QQQQFQQQQI GLLNQSRTLM SPQQQQQHQQ 3360
QQQQQQHQQQ QQQQQQQQQM TLGPGIPAKP LQHFSSPGSL GSTLLLTGKE QGIVEAALPP 3420
EVTEGPSTHQ GGPLTVGSTP ESVTAEPGEV KPSVSGDSQL MLVQPQSQPQ PNSLQLQPPL 3480
RLPGQQQQQV NLLHTAGGGS HGQLGSGSSS EAHILSQPSI SLGEQPGPIT QNLLGPQHPL 3540
GLERPMQSNV GSQPSKPGHV PQSGQGLPGA GVMPTVGQLR AQLQGVLAKT PQLRHLSPQQ 3600
QQQLQALLMQ RQLQQSQAVR QTPPFQEPGT QPSPLQGLLG RQPQLGSFPG SQTGPLQELG 3660
AGPRPQGQPR LSTPQGALST GPALGPVHPT PPPSSPQEPK RPSPQLPSPS SQLPSEVQLT 3720
PSQPGTPKPQ GPPLELPSGR VSSAAAQLPD TFFGKGLGTW DSPDNLTEAQ KPDQSSLVPG 3780
HLEQVNGPVV PEPPHLSIKQ EPREEPCALG AQVVKREANG EPLGTPGTSN HLLLAGPRTE 3840
AGHLLLQKLL RAKNVQLTTG RGPEGLRAEI NGHIDSKLAG LEQKLQGTPS NKEDVAARKP 3900
TPKLKRVQKT SDRLVSSRKX XXXXXXXXXX XXXXXXXXXX XXXXXXTGAP VIVNSRKTIS 3960
RFVVFGILTP NTNLYRMIRA HGSGADPTGP VIYSPLPNNL SNPPTPPSSL PPTPPPSVQQ 4020
KMVNGVTPSE ELGEHPKDAA SAQETEGALR NASEVKNLDL LAALPTPPHN QTEDVRMESD 4080
EDSDSPDSIV PASSPESILG EEAPRFPQLG SCRWEQDDRA LSPVIPIIPR ASIPVFPDTK 4140
PYGALDLEVP GKLPVTTWEK GKGSEVSVML TVSAAAAKNL NGVMVAVAEL LSMKIPNSYE 4200
VLFAESPGRT GIEAKKGESE GPGGKEKGLG GKSPEAGPDW LKQFDAVLPG YTLKSQLESP 4260
APELPTQHSY TYNVSNLDVR QLSAPPPEEP SPPPSPLVPS PASPPAEPMV ELPAEPLAEP 4320
PVPSPLPLAS SPESARPKPR ARPPEEGEDS RPPRLKKWKG VRWKRLRLLL TIQKGGGRQE 4380
DEREVAEFME QLGTALRPDK VPRDMRRCCF CHEEGDGATD GPARLLNLDL DLWVHLNCAL 4440
WSTEVYETQG GALMNVEVAL HRGLLTKCSL CQRTGATSSC NRMRCPNVYH FACAIRTKCM 4500
FFKDKTMLCP MHKVKGPCEQ ELSSFAVFRR VYIERDEVKQ IASIIQRGER LHMFRVGGLV 4560
FHAIGQLLPH QMADFHSATA LYPVGYEATR IYWSLRTNNR RCCYRCSIGE NNGRPEFIIK 4620
VTEQGLEDLV FTDASPQAVW NRIIEPVAAM RKEADMLRLF PEYLKGEELF GLTVHAVLRI 4680
AESLPGVESC QNYLFRYGRH PLMELPLMIN PTGCARSEPK ILTHYKRPHT LNSTSMSKAY 4740
QSTFTGETNT PYSKQFVHSK SSQYRRLRTE WKNNVYLARS RIQGLGLYAA KDLEKHTMVI 4800
EYIGTIIRNE VANRREKIYE EQNRGIYMFR INNEHVIDAT LTGGPARYIN HSCAPNCVAE 4860
VVTFDKEDKI IIISSRRIPK GEELTYDYQF DFEDDQHKIP CHCGAWNCRK WMN 4913
Nucleotide Sequence
(Fasta)
ATGTCTCCTC CACCTGAAGA GTCACCTATG TCTCCACCAC CTGAGGCTTC TTGTCTGTTC 60
CCACCGTTTG AAGAGTCACC CTTATCCCCT CCACCTGAGG AGTCTCCTCT CTCCCCACCA 120
CCTGAAGCAT CACGCTTATT CCCACCACCT GAGGACTCTC CCATGTCCCC ACCACCTGAA 180
GACTCACCTA TGTCACCCCC ACCTGAAGTG TCACACCTGT TGCCCCTTCC TGAAGTATCA 240
CGTCTAGCTC CACCACCAGA AGAATCTCCC CTTTCCCCAC CACCTGAGGA GTCTCCCACT 300
TCTCCTCCAC CTGAGGCTTC GCGCCTGTCC CCACCACCTG AGGACTCACC TACATCACTG 360
CCACCTGAAG ACTTACCTGC TTCCCCACCA CCAGAGAACT TGCTCACGTC CCTGCTACTG 420
AAAGAGTCAC CCCTGTCGCC ACTGCCTGGG GAGCCATGTC TCTGCCCCCG ACCTGAGGAG 480
TCACAGTTCT GCCCTCAGCC TGAGGAGCTG CAATTCTGTC ACCAGCCTGA GGAGCAGCGC 540
CTGTCCCCTC AGCCTAAGGA GCTGTGCCTA TCACCTCAGC CTGAGGAACT GCACCTGTCT 600
CCCATACCTG AGGGGCCTCA CCTGTCCCCC AGACTTGAGG AACTACATCT GTCCCCAGCG 660
CCTAAGGAGC CATGTCTGTC ACCCCAGCCT GATGAGCCTC ACCGGTCATC CCCCCAGCCT 720
GAGGAGCCTC GCCTTTCCCC CCGGCCTGAG GAGCCTTGCC TTTCCCCCCG GCCTGAGGAG 780
CCACACTTAT TCCCCAAGCC TGAGGAGCCA CACTTGTCCC CCAAGCCTGA GGAGTCCTGC 840
CTGCCCTCCC AGCCAGGGGA ACCCCCTGAG GAGCCAGGCT TGTGTCCTGT ACCTGAGGAG 900
TTGCCCTTGT TACCACCACG TGGGGAGCCA CACCTCTCCC CTTTGCTTAG AGAGCCTGCC 960
CTGTTGGAGC CTGGAGAGCC ACCTTTGTCT CCTCTGCCTG AAGGTCTGCC CATGTCTGCA 1020
TCTGGGGAGC CATCCTTGTC ACCTCAACTA ATGCCACCAG ATCCTCTTCC TCCTCCACTC 1080
TCACCCATTA TCACAGCTGG GGCCCCACCA GCCCTGTCTC CTTTGGGGGA CTTAGAGTAC 1140
TCCTTTGGTG CCAAAGGGGA CAGTGACCCT GAGTCACCAT TGGCCGCCCC CATTCTAGAA 1200
ACACCCATTA GCCCTCCGCC AGAAGCAAAT TGCACTGANN NNNAGCCCTT CCCCTCAATG 1260
TTCTGTCTCC AGCATCTACT TCCCCAAAGC TTCTTCCAGT GTTCTCCTGC TCTGCCTCTG 1320
TCCATTCCCT CCCCACTGAG TATGGAGAAG ACAGTGGTGA TCTCTGATGA GGCTGAATCA 1380
CATGAGATGG AGACTGAAAA AGGCCTAGAA CGTGAGTGCC CAGCCTTGGA GCCCAGCCCT 1440
ACCAGTCCGC TCCCCTCTCC CATTGGAGAG CTTTCCTGTC CTGCCCCTAG CCCTGCCCCT 1500
GTGCTGGATG ACTTCTCTGG CCTTGGGGAA GACCCAGCCC CTCTTGATGG GACTGACACT 1560
CCTGATTTAC AGCTAGAAGC TGGACAGACC ACCAGCAGTT CAGTGGCTTG TGTACTTAAA 1620
AGTTCCCCTG TGCTCCTGGA CCCTGAGGAG CTGGCACCTG TGACCCCTGT GGAGGTCTAT 1680
GGCCCAGAAT GCAAACAGGT CGGGCAGGGC TCACCATGTG AGATGCAGGA GGAGCCACGT 1740
GCACCAGTGG CCCCCACCCC ACCTACTCTC ATCAAATCCG ACATCGTTAA TGAGATCTCA 1800
AACCTGAGCC AAGGTGATGC CAGTGCTAGT TTTCCTGGCT CAGAGCCCCT GTTGGGCTCT 1860
CCTGACCCTG AGGGGGGTGG CTCCCTGTCC ATGGAGCTGG GGGTATCTAC AGACGTTAGC 1920
CCAGCCCGAG ATGAAGGCTC TTTGCGACTC TGTACCGACT CGCTGCCAGA GACTGATGAC 1980
TCGCTGTTAT GTGAAGCTGG GACAATTGTC AGCGGAGGCA AAGCCGAGGG GGACAAGGGA 2040
AGGAGGCGCA GTTCCCCTGC CCGTTCCCGC ATTAAGCAGG GACGCAGCAG TAGTTTCCCA 2100
GGAAGGCGCC GGCCACGCGG AGGAGCACAT GGAGGACGTG GGAGAGGGCG GGCCCGGCTA 2160
AAATCAACTG CTTCTTCCAT TGAGACTCTG GTAGTTGCTG ATATCGATAG CTCTCCCAGC 2220
AAAGAGGAAG AAGAAGAAGA TGATGACACC ATGCAAAATA CTGTGGTTCT CTTCTCCAAC 2280
ACAGACAAAT TTGTCCTAAT GCAGGACATG TGTGTGGTAT GTGGCAGCTT TGGCCGAGGA 2340
GCAGAAGGCC ATCTCCTTGC CTGTTCCCAG TGTTCTCAGT GCTATCACCC TTACTGTGTT 2400
AACAGCAAGA TCACCAAGGT GATGCTGCTG AAAGGCTGGC GGTGTGTGGA GTGTATTGTG 2460
TGCGAGGTGT GTGGCCAGGC CTCCGACCCC TCACGCCTGC TTCTGTGTGA TGACTGTGAC 2520
ATTAGCTACC ACACATACTG CCTGGACCCC CCACTGCTCA CCGTACCCAA AGGTGGCTGG 2580
AAGTGCAAGT GGTGTGTCTC CTGTATGCAG TGTGGGGCCG CTTCCCCTGG CTTCCACTGT 2640
GAGTGGCAGA ATAGTTACAC ACACTGTGGG CCCTGTGCTA GCCTAGTGAC CTGCCCTATC 2700
TGTCATTCCC CATATGTGGA AGAGGACCTA CTCATCCAGT GCCGCCACTG TGAACGGTGG 2760
ATGCATGCTG GCTGCGAGAG CCTCTTCACA GAGGAAGAGG TGGAGCAGGC TGCGGATGAG 2820
GGCTTTGACT GTGTCTCTTG CCAGCCCTAT GTCATAAAAC CTATGGTGCC TGTTGCACCT 2880
CCGGAGTTGG TGCCTATCAA AGTGAAAGAG CCAGAGCCCC AGTACTTTCG CTTCGAGGGT 2940
GTGTGGCTGA CAGAAGCTGG CATGGCTGTG CTGCGTAACC TGACCATGTC TCCTCTTCAT 3000
AAGCGGCGCC AGCGGCGGGG TCGGCTCGGC CTCCCAGGCG AGGCAGGGCT GGAAGGTTCT 3060
GAGCCTGTAG ATGCCCTTGG CCCTGATGAC AAGAAGGATG GGGACCTGGA CACTGATGAA 3120
CTTACCAAGG GTGAAGGTGG TGTGGAGCAC ATGGAGTGTG AAATTAAACT GGAGGGCCTC 3180
ATCAGCCCTG ACGTGGAGTC TGGCAAGGAA GAGACCGAAG AAAGCAAAAA ACGCAAGCGC 3240
AAACCATATC GGCCTGGCAT TGGCTTCATG GTGCGACANN NNNNNNNNNN NNNNNNNNNN 3300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNTG 3360
ATGCCTGCTG ACCTGCCTGC AGAGGGCTCT GTGGAGCAGA GTTTAGCTGA TGTGGATGAG 3420
AAGAAGAAGC AGCAGCGGCG AGGGCGCAAG AAGAGCAAAC TAGAGGACAT GTTCCCTGCT 3480
TACTTGCAGG AAGCCTTCTT TGGAAAAGAG CTGCTGGACC TGAGCCGTAA AGCCCTTTTT 3540
GCAGTTGGGG TGGGCCGGCC AAGCTTTGGA CCAGGAACCC CAAAAACCAA GGCGGATGGA 3600
GGCCCAGAAA GGAAGGATCC CCCCACCTTG CAGAAAGGAG ATGATGGTCC AGAAGTTGCT 3660
GATGAAGAAT CACGAGGCCT CGAGGGCAAG GCTGATACAC CAGGACCTGA GGATAGGGGA 3720
ATAAAGGCAT CCCCAGTGCC CAGTGACCCT GAGAAGCCAG GTACCCCAGG TGAAGGGGTG 3780
CTTAGCTCTG ACTTAGACAG GATTCCCACA GAAGAATTGC CCAAGATGGA ATCCAAGGAC 3840
CTACAGCAGC TCTTCAAAGA TGTTCTGGGT TCTGAACGAG AACAACATCT GGGTTGTGGA 3900
ACCCCTGGGC TAGATGGCAG CCGTACACCA CTGCAGAGGC CTTTTCTCCA AGGTGGACTC 3960
CCTTTGGGCA ATCTCCCCTC CAACAGCCCA ATGGATTCCT ACCCTGGCCT CTGTCAGTCC 4020
CCATTCTTGG ATAGCAGGGA GCGCGGGGGC TTCTTCAGCC CGGAACCAGG TGAGCCAGAC 4080
AGTCCATGGA CGGGCTCAGG GGGCACCACA CCCTCCACCC CCACCACCCC AACCACAGAG 4140
GGTGAGGGCG ACGGGCTCTC CTACAACCAG CGGAGCCTTC AGCGCTGGGA AAAGGACGAG 4200
GAGTTGGGCC AGCTTTCTAC CATCTCACCT GTGCTCTATG CCAACATTAA CTTTCCCAAT 4260
CTCAAGCAAG ACTATCCAGA CTGGTCTAGC CGATGCAAAC AAATAATGAA GCTCTGGAGA 4320
AAGGTTCCAG CAGCTGACAA AGCCCCCTAC CTGCAAAAGG CCAAAGATAA CCGGGCAGCT 4380
CACCGCATCA ACAAGGTTCA GAAGCAGGCT GAGAGCCAAA TCAACAAGCA GACCAAGGTG 4440
GGCGACATAG CCCGTAAGAC TGACCGACCG GCCCTGCATC TCCGCATTCC CCCCCAGCCA 4500
GGGGCACTGG GCAGCCCACC CCCTGCTGCT GCCCCCACCA TTTTCATTGG CAGCCCCCCT 4560
ACCCCCGCTG GCTTGTCTAC CTCTGCGGAC GGGTTCTTGA AGCCGCCGGC AGGCACAGTA 4620
CCCGGCCCCG ACTCACCTGG TGAGCTCTTC CTCAAGCTCC CACCCCAGGT GCCTGCCCAA 4680
GTGCCTTCGC AGGATCCCTT TGGACTGGCC CCTGCCTATG CCCTGGAGCC CCGCTTTCCC 4740
ACAGCACCGC CTGCCTATCC CCCGTATCCT AGCCCTACTG GGGCCCCCAC ACAGCCTCCG 4800
ACGATGGGCA CCTCATCTCG TTCTGGGACT GGTCCACCAG GAGAATTTCA CACGACCCCA 4860
CCTGGCACTC CCCGACACCA GCCTTCCACA CCTGACCCCT TCCTCAAACC CCGCTGCCCT 4920
TCATTGGACA ACCTGGCTGT GCCTGAGAGC CCAGGGGTTG CAGGAGGCAA GCCTTCTGAG 4980
CCCCTGCTCT CTCCTCCGCC TTTTGGGGAG TCCCGGAAGG CATTAGAGGT AAAAAAGGAA 5040
GAGCTTGGGG CAGCGTCTCC TAGCTATGGG CCCCCAAACT TGGGTTTTGT TGACTCACCC 5100
TCTTCAGGCT CCCACCTAGG TGGCCTGGAA TTAAAGGCAC CTGATGTCTT CAAAGCCCCT 5160
CTGACCCCTC GGGCATCTCA GGTAGAGCCC CAGAGCCCAG GCTTGGGTCT ACGGCCCCAG 5220
GAGCCACCCC CTGCCCAGGC TTTGGCCCCT TCTCCCCCTA GCCAGCCTGA CATCTTTCGC 5280
CCTGGTCCTT ACCCTGACCC TTATGCCCAG CCCCCACTGA CTCCTCGGCC CCAGCCACCA 5340
CCTCCCGAGA GCTGCTGTGC CCTGCCCCCT CGCTCACTGC CCTCCGACCC TTTCTCCCGA 5400
GTTCCTGCTA GTCCTCAGTC TCAGTCCAGC TCCCAGTCCC CATTGACACC CCGTCCTTTG 5460
TCTGCTGAGG CTTTCTGCCC ATCTCCCGTA ACCCCTCGCT TCCAGTCTCC TGACCCTTAT 5520
TCTCGCCCAC CCTCACGCCC TCAGTCTCGT GATCCTTTTG CCCCGTTGCA TAAGCCACCC 5580
CGACCCCAGC CTCCTGAAAT TGCCTTCAAG GCTGGGCCTC TAGCCCATGC TCCACTAGGG 5640
GCTGGGGGCT TCCCAGCAGC CCTGCCCTCA GGGCCAGTAG GTGAACTCCA TGCCAAGGTC 5700
CCAAGTGGGC AGCCCTCCAG TTTTGCCCGG TCCCCTGGAA CCAGTGCATT TGTGGGTACC 5760
CCCTCTCCCA TACGTTTCAC TTTCCCTCAG GCAGTAGGAG AGCCTTCCTT AAAGCCACCT 5820
GCCCCTCAGC CTGCTCCCCC CCAACCCCAT GGGATCAATA GCCATTTTGG GCCCAGCGCT 5880
ACCTTGGGCA AGCCCCAAAG CACAAATTAC TCAGCAGCCA CGGGGAGCTT CCACACATCA 5940
GGCAGCCCCT TGGGGCCCAG CAGCGGATCC ACAGGAGAGG GCTATGGGTT GTCCCCACTA 6000
CGCCCTCCAT CAGTCCTGCC ACCACCTACA CCCGATGGAT CCCTCCCCTA CCTGTCCCAT 6060
GGAGCCTCAC AGCGGGCAGG GATCACCTCT CCAGTTGAGA AGCGAGAAGA TCCAGGGGCT 6120
GGAATGGGCA GCTCCGTAGC GGCACCTGAA CTCCCAGGTA CCCAGGATCA AGGCATGCCC 6180
AGCCTCAGTC AGTCAGAGCT GGAGAAGCAA CGACAGCGCC AGCGACTGCG GGAGTTGCTG 6240
ATTCGGCAGC AGATCCAGCG CACCCTGCGA CAGGAGGAAA CAGCTGCAGC AGCTGNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNGACAAGA GTAGCCTTGT GGGACTACCC 6420
CAAAGCAAGC TGTGTGGCGC TGTTCTGGGT CCAGGAGCTT TTCCCAGTGA CGAGCGACTC 6480
TCCCGGCCAC CTCCACCAGC CACCCCTTCC GCTATGGATG TGAACAGCCG ACAATTGGTA 6540
GGAGGCTCCC AAGCCTTCTA TCAGCGCACA CCCTATTCGG GGCCCCTGCC CTTACAACAG 6600
CAACAACAAC AGCAATTGTG GCAGCAGCAA CAGCAGCAGC AGGCAACAGC GGCAACCTCC 6660
ATGCGACTTG CAATGTCCAC TCGCTTTCCA TCAACTTCCG GGCCTGAACT CGGCCGCCAA 6720
GCCCTAGGTT CCCCCTTAGC GGGAATTCCC ACCCGCTTGC CTGGCCCTGG TGAGCCAGGG 6780
CCGGGTCCAC CTGGTCCTGC CCAGTTCATT GAGTTGAGGC ACAATGTACA GAAAGGACTA 6840
GCACCTGGGG GGCCTCCGTT CCCTGGTCAA GGGCACCCTC AGAGAGCCCG TTTCTATGCT 6900
GTAAGTGAGG AGTCACACCG ACTGGCCACT GAAGGACTTC GGGGCCTGGC GGTATCAGGG 6960
CTTCCTCCAC AGAAACCCTC AGCCCCACCA GCCACTGATT TGAACAACGG CCTCCATCCA 7020
ACACCCCACA CCAAGGGTTC TAACCTGTCT ACTGGCTTGG AACTGGTCAG CCAGCCCCCT 7080
TCAAGCACTG AACTTGGCCG CCCGCCTCCT CTGGCTTTGG AACCTGGAAA GCTACCCTGT 7140
GAGGATTCTG AGCTAGATGA TGACTTTGAT GCCCACAAGG CCCTAGAGGA TGATGAGGAG 7200
CTTGCTCACC TAGGCTTGGG TGTGGATGTG GCCAAAGGTG ATGATGAGCT GGGCACCCTG 7260
GAAAACCTAG AAACTAATGA CCCCCACCTA GATGACCTAC TCAACGGAGA TGAATTTGAC 7320
CTACTGGCTT ATACTGACCC TGAGCTGGAC ACTGGGGACA AGAAGGACAT TTTCAATGAG 7380
CATCTAAGGC TAGTAGAATC GGCTAATGAG AAAGCTGAAC GAGAGGCCCT TATGCAGGGA 7440
GTGGAACCAG GACCCTCAGG CCCTGAGGAG CGTCCTACCC CTGCCTCTGC CACTGCTGCT 7500
GATGCCTCTG AGCCCCGCCT GGCACCAGTA CTCCCTGAAG TGAAAACCAA GGTGGAAGAA 7560
GGTGGGCCCC ACCCTTCCCC TTGCCAGTTC ACCATTACCA CACCTAAGGT AGAGCCAGCA 7620
TCTGCCGCCA CTTCTCTGGG CCTGGGGCTG AAGCCAGGAC AGAGCGTGAT TGGTAGCCGG 7680
GACATGCGGG TGGGCTCAGG GCCATTTTCT AGTAGTGGGC ACACAGGTGA GAAGGGTCCT 7740
TTTGGGGCCA CAGGAGGACC ACCTGCTCAC CTGCTAGCCC CTAACCCGTT GAGTGGTCCT 7800
GGAGGGTCTT CCCTGCTAGA AAAGTTTGAG CTGGAAAGTG GAACCCTGGC CTTACCCAGT 7860
GGACACACAG CATCTGGGGA TGAACTAGAC AAGATGGAAA GCTCACTGGT GGCCAGTGAG 7920
TTGCCCCTGC TCATTGAGGA TCTTTTGGAG CATGAGAAGA AGGAACTGCA AAAGAAGCAG 7980
CAGCTCTCAG CACAGCTGCA GCCTGCCCAG CAGCAACAGC AGCAGCATTC CCTACTCTCT 8040
ACACCACCCT CTGCCCAGGC CATGCCTTTG CCACATGAGG GCTCCTCCAC TTTGGCTGGG 8100
CCCCAACAGC AGCTTGCCCT GGGTCTTGGA GGTGCCAGAC AGCCAGGCTT GGCCCAGCCG 8160
CTGATGCCCA ACCAGCCACC AGCTCATGCC TTCCAGCAGC GTCTAGCCCC ATCCATGACG 8220
ATGGTGTCCA GCCAAGGGCA TTTGCTAAGT GGACAGCATG GAGGGCAAGC AGGATTGGTA 8280
CAGCAGAGCT CACAACCAGT GTTGACACAG AAGCCCATGG GTAATGTGCC ACTTTCCATG 8340
TGCATGAAGC CCCAGCAGCT GGCGATGCAG CAGCAGATAG CTAACAGCTT CTTTCCGGAT 8400
ACAGACCTAG ACAAATTTGC TGCAGAAGAT ATTATTGATC CCATTGCAAA GGCTAAGATG 8460
GTGGCTTTGA AAGGCATCAA GAAAGTGATG GCTCAAGGCA ACATTGGAGT AGCACCTGGC 8520
ATGAACAGGC AGCAAGTGTC ACTGCTAGCC CAGAGACTCT CAGGTGGGCC TGGCAATGAC 8580
CTGCAGAACC ATGTGGCAGC TGGGAGTGCC CAGGAGCGGA GTGCCAGTGA CCCCTCCCAG 8640
CCTCGTCCCA ACCCACCCAC TTTTGCTCAG GGGGTGATCA ATGAGGCTGA CCAGCGGCAG 8700
TATGAGGAAT GGCTGTTTCA CACCCAGCAG CTCCTACAGA TGCAGCTAAA GGTGCTAGAG 8760
GAGCAGATTG GTGTGCACCG TAAGTCCCGG AAGGCTTTGT GTGCCAAGCA GCGCACTGCC 8820
AAAAAGGCTG GCCGTGAGTT CCCAGAGGCT GATGCTGAGA AGCTCAAGCT GGTCACGGAA 8880
CAGCAAAGCA AGATCCAGAA ACAGCTGGAT CAGGTCCGAA AACAACAGAA GGAGCACACA 8940
AACCTCATGG CAGAATATCG GAATAAACAG CAGCAGCAGC AGCAGCAGCA GCAACAGCAA 9000
CAGCAACAGC ATTCAGCTGT GTTGGCCCTC AGCCCTTCCC AGAGTCCCCG ACTGCTCACG 9060
AAGCTCCCTG GTCAGCTGCT CCCAGGCCAT GGGCTGCAGC CACCTCAGGG ACCTCTAGGT 9120
GGGCAACCTG GAGGTCTTCG CTTGCCTCCT GGTGGTATGG CACTACCTGG ACAACCAGGT 9180
GGCCCCTTCC TCAACACAGC CCTGGCCCAA CAGCAGCAAC AGCAACATTC TGGCGGGGCT 9240
GGGGCCCTGG CTGGTCCCTC TGGAGGCTTC TTCCCTGGCA ACCTTGCTCT TCGAGGCCTG 9300
GGACCTGATT CAAGACTCTT ACAGGAAAGA CAGCTGCAGT TACAGCAGCA ACGCATGCAG 9360
CTGGCCCAGA AACTGCAGCA GCAGCAACAG CAGCAGCAGC AGCAGCAGCA CCTTCTAGGA 9420
CAAGTGGCCA TCCAGCAACA GCAACAACAA GGTCCAGGAG TGCAGGCAAA CCAGGCCTTG 9480
GGTCCTAAAC CCCAGGGACT TCTTCCTCCC AACAGCCATC AGGGTGTCTT GGTCCAACAG 9540
CTGTCCCCTC AACAACCCCA GGGGACCCAG AGTATGCTGG GCACTACACA GGTGGCAGTG 9600
TTGCAGCAGC AGCAACACCC TGGAGCTTTG GGCCCTCAGG CACCTCACAG ACAGGTACTT 9660
CTGACCCAGT CTCGGGTGCT AAGTTCCCCC CAACTGGCAC AGCAGGGTCA GGGCCTGATG 9720
GGACACCGGT TAGTCACATC CCAGCAGCAG CAGCAGCAAC AACAGCACCA ACAGCAGGGA 9780
TCCATGGCAG GGCTCTCCCA TCTTCAACAA GGTCTGATGC CACACAGTGG GCAGGCCAAA 9840
CTGAATGCAC CGCCCATGGG TTCCTTGCAG CAGCAGCAAC AGCTTCAGCA GCACCAGTTT 9900
CAACTACAAC AACAGCAGCA GCAGCTTCAA CAGCAGCAGC AGCAGCAGCT ACAACAACAG 9960
CAATTACAGC AACAGCAACA GCAGCTTCAA CAGCAGCAGC AGTTTCAACA GCAGCAGATA 10020
GGCCTGTTGA ACCAGAGTCG AACTTTAATG TCTCCTCAGC AGCAGCAGCA GCATCAGCAG 10080
CAGCAGCAGC AGCAGCAGCA TCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCAGCAGATG 10140
ACACTTGGCC CTGGCATACC TGCCAAGCCT CTTCAACACT TTTCTAGTCC TGGATCGCTG 10200
GGCTCAACTC TTCTCCTGAC GGGCAAGGAA CAAGGCATTG TGGAGGCAGC TCTCCCTCCA 10260
GAGGTCACTG AGGGACCCTC AACACATCAG GGAGGCCCAC TAACAGTAGG GTCTACACCC 10320
GAATCAGTAA CTGCCGAACC AGGGGAAGTA AAGCCTTCAG TCTCTGGGGA CTCACAACTC 10380
ATGCTTGTCC AACCCCAGTC CCAGCCTCAG CCCAACTCCT TGCAGCTGCA GCCACCTCTA 10440
AGGCTCCCAG GACAACAGCA GCAGCAAGTG AACTTGCTCC ACACAGCAGG TGGGGGAAGC 10500
CATGGGCAGC TAGGCAGTGG ATCATCTTCG GAGGCCCACA TACTGTCACA ACCTTCTATT 10560
TCCTTAGGGG AGCAGCCTGG ACCCATTACC CAGAACCTTC TGGGTCCCCA GCATCCCCTT 10620
GGGCTAGAGC GGCCCATGCA GAGTAATGTA GGGTCACAAC CTTCCAAACC AGGACATGTC 10680
CCCCAGTCTG GGCAAGGCCT ACCTGGGGCT GGAGTCATGC CTACAGTAGG TCAGCTTCGA 10740
GCGCAGCTCC AAGGGGTCCT GGCCAAAACC CCACAGCTGC GACACTTGAG TCCTCAGCAG 10800
CAGCAGCAGC TACAGGCACT TCTCATGCAA CGGCAGCTGC AGCAGAGTCA GGCAGTACGG 10860
CAGACCCCAC CCTTCCAGGA GCCTGGGACC CAACCCTCTC CTCTCCAGGG CCTCCTGGGC 10920
CGCCAGCCCC AACTTGGGAG CTTCCCTGGA TCCCAGACAG GCCCACTCCA GGAGCTAGGG 10980
GCAGGGCCTC GACCTCAGGG CCAACCCCGG CTCTCTACCC CACAAGGAGC CTTATCGACA 11040
GGACCAGCCC TTGGCCCTGT CCATCCCACA CCTCCTCCAT CCAGCCCCCA AGAACCAAAG 11100
AGACCTTCCC CACAGTTACC TTCCCCCAGC TCTCAACTTC CCTCTGAGGT CCAGCTCACT 11160
CCTTCCCAGC CAGGGACTCC AAAGCCCCAG GGGCCACCCT TGGAGCTGCC TTCTGGGAGG 11220
GTTTCATCTG CTGCTGCCCA GCTTCCGGAT ACCTTCTTTG GCAAAGGGCT GGGAACTTGG 11280
GACTCCCCAG ACAACCTCAC AGAAGCCCAG AAGCCAGATC AGAGCAGCCT GGTACCTGGG 11340
CATCTGGAGC AGGTGAATGG CCCAGTGGTG CCTGAGCCAC CCCATCTCAG CATCAAGCAG 11400
GAGCCTCGGG AAGAGCCATG CGCTCTGGGG GCCCAGGTGG TAAAGAGGGA AGCCAATGGG 11460
GAGCCACTAG GGACACCAGG TACCAGCAAC CACCTGCTGC TGGCTGGCCC CCGCACAGAG 11520
GCTGGACATC TGCTCTTGCA GAAGCTTCTA CGGGCAAAGA ATGTGCAGCT CACCACTGGG 11580
CGGGGGCCTG AGGGGCTGCG AGCTGAGATC AACGGACACA TTGACAGCAA GCTGGCTGGG 11640
CTGGAGCAGA AACTACAGGG TACCCCCAGC AACAAGGAGG ATGTAGCAGC AAGGAAGCCA 11700
ACCCCGAAGC TCAAGCGGGT ACAGAAGACA AGCGACAGGT TGGTGAGCTC CCGAAAGANN 11760
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11820
NNNNNNNNNN NNNNNNNNAC GGGAGCCCCT GTCATCGTAA ATTCACGTAA GACAATCAGC 11880
CGATTTGTGG TGTTTGGCAT TTTAACTCCT AACACAAATC TGTACCGCAT GATAAGGGCC 11940
CATGGAAGTG GGGCAGATCC CACTGGCCCT GTCATTTATT CCCCACTTCC AAATAACCTG 12000
AGTAACCCGC CGACACCACC CTCGTCGCTG CCCCCCACCC CACCCCCATC GGTGCAGCAG 12060
AAGATGGTGA ATGGCGTCAC CCCATCTGAA GAGTTGGGGG AGCACCCCAA GGATGCCGCC 12120
TCTGCCCAAG AGACTGAAGG GGCACTGAGG AATGCTTCAG AGGTGAAGAA TCTAGACCTG 12180
CTGGCTGCCT TGCCTACACC ACCTCACAAT CAGACTGAGG ATGTCAGGAT GGAGAGTGAC 12240
GAGGACAGTG ATTCTCCTGA CAGTATTGTG CCAGCTTCAT CCCCTGAGAG CATCTTGGGG 12300
GAGGAAGCAC CTCGGTTCCC TCAGCTAGGC TCATGCCGGT GGGAGCAGGA TGACAGGGCC 12360
CTGTCTCCAG TCATCCCCAT CATTCCTCGG GCCAGCATTC CAGTCTTCCC AGATACCAAG 12420
CCTTATGGGG CCTTGGACCT GGAGGTCCCT GGAAAGCTGC CTGTCACAAC ATGGGAAAAG 12480
GGCAAAGGAA GTGAGGTGTC AGTCATGCTG ACAGTTTCTG CTGCTGCAGC CAAGAACCTG 12540
AATGGTGTGA TGGTGGCAGT AGCAGAGCTG CTAAGCATGA AGATTCCCAA CTCTTATGAA 12600
GTACTCTTTG CAGAGAGCCC TGGCCGCACA GGCATTGAGG CCAAGAAGGG GGAATCTGAG 12660
GGTCCTGGTG GAAAAGAAAA GGGCCTGGGA GGCAAGAGCC CAGAAGCTGG CCCTGATTGG 12720
CTGAAGCAGT TTGATGCAGT GTTGCCTGGC TATACCCTCA AGAGCCAGCT AGAGAGCCCT 12780
GCCCCGGAGC TACCCACCCA GCACAGCTAC ACCTATAACG TCTCGAATCT GGATGTGCGA 12840
CAGCTCTCGG CCCCGCCTCC TGAAGAGCCC TCCCCACCTC CCTCTCCCTT GGTACCCTCT 12900
CCCGCCAGTC CCCCTGCTGA ACCCATGGTT GAACTCCCAG CTGAACCCTT GGCTGAGCCA 12960
CCAGTCCCCT CACCTCTACC TCTAGCCTCA TCCCCTGAGT CAGCCAGGCC CAAGCCACGA 13020
GCCCGGCCCC CTGAAGAAGG GGAAGACTCC CGCCCCCCTC GCCTCAAGAA GTGGAAGGGG 13080
GTGCGATGGA AGCGACTGCG GCTGCTACTC ACTATCCAGA AGGGTGGTGG GCGGCAGGAG 13140
GATGAGCGGG AAGTGGCAGA GTTCATGGAG CAGCTTGGCA CAGCCCTGCG ACCTGACAAG 13200
GTGCCCCGAG ACATGCGGCG CTGCTGCTTT TGCCATGAGG AGGGTGATGG AGCCACTGAT 13260
GGGCCCGCCC GCCTGCTCAA CCTGGACCTG GACCTATGGG TGCATCTCAA CTGTGCCCTC 13320
TGGTCCACAG AGGTGTATGA GACCCAGGGT GGGGCGCTGA TGAATGTGGA GGTTGCTCTG 13380
CACCGGGGAC TGCTAACCAA GTGCTCCCTG TGTCAACGCA CTGGTGCCAC CAGCAGCTGC 13440
AATCGAATGC GTTGCCCCAA TGTCTACCAT TTTGCTTGCG CCATCCGCAC CAAGTGCATG 13500
TTCTTCAAGG ACAAGACTAT GCTGTGCCCA ATGCATAAGG TCAAGGGGCC CTGTGAGCAG 13560
GAGCTGAGTT CTTTTGCTGT CTTCCGACGG GTCTACATTG AGCGGGACGA AGTGAAGCAA 13620
ATTGCCAGCA TCATCCAGCG GGGAGAGCGG CTGCACATGT TTCGTGTGGG GGGCCTTGTG 13680
TTCCATGCCA TCGGACAGCT GCTTCCTCAC CAGATGGCTG ACTTCCACAG TGCCACTGCC 13740
CTCTATCCGG TGGGCTACGA GGCCACGCGC ATCTACTGGA GCCTCCGCAC CAACAACCGC 13800
CGCTGCTGCT ACCGGTGCTC CATTGGTGAG AACAACGGGC GGCCGGAGTT CATCATCAAA 13860
GTCACAGAGC AGGGCCTGGA GGACTTGGTC TTCACTGACG CCTCCCCCCA GGCTGTGTGG 13920
AATCGCATCA TTGAGCCTGT GGCTGCCATG AGAAAAGAGG CTGACATGCT GCGGCTCTTC 13980
CCTGAGTACC TGAAAGGTGA AGAGCTCTTT GGGCTGACAG TGCACGCTGT GCTTCGCATA 14040
GCTGAATCGT TGCCTGGGGT GGAGAGCTGT CAAAACTATT TATTTCGCTA TGGGCGTCAC 14100
CCCCTGATGG AGCTGCCACT CATGATCAAC CCCACTGGCT GTGCTCGATC GGAACCTAAA 14160
ATCCTCACAC ACTACAAACG GCCCCACACC CTGAACAGCA CCAGCATGTC CAAGGCATAT 14220
CAAAGCACCT TTACAGGCGA GACTAATACC CCATACAGCA AGCAGTTTGT GCACTCCAAA 14280
TCATCTCAGT ACCGGAGGCT ACGCACTGAG TGGAAGAACA ATGTGTATCT GGCTCGCTCC 14340
CGTATCCAGG GTCTGGGGCT CTATGCAGCC AAGGACCTAG AGAAGCACAC AATGGTCATC 14400
GAGTACATTG GCACCATCAT TCGCAATGAG GTGGCCAACC GGCGGGAGAA AATTTATGAA 14460
GAGCAGAATC GAGGTATCTA CATGTTCCGG ATAAACAATG AACATGTCAT TGATGCCACA 14520
TTGACCGGAG GCCCTGCCAG GTACATTAAC CATTCCTGTG CCCCTAACTG TGTGGCAGAA 14580
GTTGTGACAT TTGACAAGGA GGACAAAATC ATCATCATCT CCAGCCGGCG AATCCCCAAA 14640
GGAGAGGAGC TGACCTATGA CTATCAGTTT GACTTTGAGG ACGATCAGCA CAAGATCCCC 14700
TGCCACTGTG GAGCCTGGAA TTGTCGGAAA TGGATGAACT AA 14743
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 75 0.0 2912
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 90 0.0 2337
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 88 0.0 2312
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 88 0.0 2299
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 88 0.0 2291
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 88 0.0 2290
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 88 0.0 2287
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 89 0.0 2286
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 82 0.0 2281
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 88 0.0 2281
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 88 0.0 2279
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 88 0.0 2278
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 88 0.0 2271
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 88 0.0 2269
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 88 0.0 2268
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 88 0.0 2267
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 88 0.0 2267
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 88 0.0 2259
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 87 0.0 2249
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 74 0.0 2247
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 89 0.0 2183
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 87 0.0 2167
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 88 0.0 2075
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 88 0.0 2068
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 82 0.0 2031
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 85 0.0 2006
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 86 0.0 1749
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 81 0.0 1697
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 86 0.0 1533
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 86 0.0 1459
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 78 0.0 1372
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 89 0.0 1364
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 89 0.0 1300
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 64 0.0 1231
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 99 0.0 1173
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 75 0.0 1162
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 98 0.0 1156
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 93 0.0 1122
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 92 0.0 1114
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 89 0.0 1103
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 84 0.0 1031
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 86 0.0 1030
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 82 0.0 1022
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 81 0.0 1005
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 80 0.0 998
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 993
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 80 0.0 989
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 80 0.0 988
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 987
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 80 0.0 986
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 984
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 76 0.0 951
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 924
WERAM-Ora-0001 ENSOANP00000000271.1 Ornithorhynchus anatinus 86 0.0 909
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 73 0.0 906
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 75 0.0 895
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 75 0.0 890
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 850
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 73 0.0 731
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 56 0.0 647
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 88 3e-176 619
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 5e-173 608
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 4e-159 562
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 67 3e-99 363
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 8e-98 358
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 9e-93 341
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 2e-45 184
Created Date 25-Jun-2016