WERAM Information


Tag                  Content
WERAM ID             WERAM-Dan-0166
Ensembl Protein ID   ENSDNOP00000021359.1
Gene Name            KMT2D
Ensembl Information
  Ensembl Gene ID       Ensembl Transcript ID  Ensembl Protein ID
  ENSDNOG00000041118.1  ENSDNOT00000051525.1   ENSDNOP00000021359.1
Status               Unreviewed
Classification
  Type       Family  E-value   Score  Start  End
  HMT        SET1    4.90e-45  152.9  3354   5293
  Me_Reader  PHD     4.20e-26  91.3   171    4917
Organism             Dasypus novemcinctus
Domain Profile
  HMT SET1

              SET1.txt   16 vakkeiekeelviEYvGevirsevadkrekeyekkeig 53  
v k++ e+++l++EY+ + +++++++++++ ++++ +
ENSDNOP00000021359.1 3354 VRKQQKEHTNLMAEYRNKQQQQQQQQQQQQQQQQQQHS 3391
44556778899******996666666665554444444 PP
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSDNOP00000021359.1 5178 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5262
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSDNOP00000021359.1 5263 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5293
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + + s l+ + + +Cp++ e
ENSDNOP00000021359.1 171 RCSHCTRLGA----SIPCRSpgCPRLYHFPCATASGSFLSmKTLQLLCPQHSE 219
6999933333....599******************888885557899**9975 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC ++ ge + ++ C +C + +H+ C++ +l+ + w Cp+Ck
ENSDNOP00000021359.1 228 HCAVC--EGPGELCdLFFCTSCGHHYHGACLDTALTARKRA-GWQCPECK 274
6****..444545559******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSDNOP00000021359.1 275 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 321
8****99999987.*************************9.*******8 PP
PHD.txt 2 tiClvCgkddeg.ekemvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g e ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSDNOP00000021359.1 1330 DMCVVCGSFGRGvEGHLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1379
68****74444423349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSDNOP00000021359.1 1381 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1427
7999*99888887.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l ++ e+ + + C sC+
ENSDNOP00000021359.1 1458 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEEEVEQaadEGFDCISCQ 1509
7*****99999999*****************9844444444544599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSDNOP00000021359.1 4812 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4844
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSDNOP00000021359.1 4871 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRAKCMFFKDKTMLCPMHK 4917
599996666665....6*9999*********98886666677678888776 PP

Protein Sequence
MDSLKPPGED KDSEPAADGP AASEESGAAE PDLPDLHVGE VSVPGSGSAR LQEPPQDGSE 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPQCPVVPPG GDSGPKEAVL PSEDLSQIGF 120
PEGLTPAHLG EPGGPCWAHH WCAAWSAGVW GQEGPELCGV DKAIFSGISQ RCSHCTRLGA 180
SIPCRSPGCP RLYHFPCATA SGSFLSMKTL QLLCPQHSEG AAHLEEAHCA VCEGPGELCD 240
LFFCTSCGHH YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT 300
FCLKPPMEEL PAHSWKCKAC RVCRACGAGS AELHPNSEWF ENYSLCHPCH QAQRGQPVSS 360
VAEQHPPVCS RFSPPEPGAT PTDEPGALYF ACRGQPEGGD VTAMQSKEPG PLHCEAKPLG 420
RAEAQPEPQP EAPLSEEMPL LPPPEESPLS PPPEESPTSP PPEASRLSPP PPEESPLSPP 480
PESSPFSPLE ESPFSPPEES PPSPSPETPL SPPPKASSLS PPLEESPLSP PPEELPTSPP 540
PEASRLSPPP EESPMSPPPE ESPTSPPPEA SCLFPPFEES PLSPPPEESP LSPPPEALRL 600
SPPPEDSPMS PPPEDSPMSP PPEVSRLSPP PEESPLSPPP EESPTSPPPE ASRLSPPPED 660
SPTSPPRPEK PHLSPRPEEP CLFPAPEEPR LSPAPEEPHL SPAPAEPRLS PAAEEPRLSP 720
APEEPRLSPA LEEPHLSPAP EEPRLSPEPE EPHLSTAPEE PHLSPAPEEP LLSAAPEEPC 780
LSAAPEEPRL SPAPEEPRLS PVSEEPRLST APEEPCLSPA PKEPRMSPQP QESFEEPGLC 840
PTPEELPLVP PSGESPLSPL LGEPALSEPG EPPLSPLTEE LPLSPSGEPS LSPQLMPPDP 900
LPPPLSPIIT AVAPPALSPL GQLEYPFGAK GDSDPESPLA APILETPISP PPEANCTDPE 960
PVPPMILPPS PGSPMGLASP MLISLPPQSP LPSQCFPPAL RLSIPPLSPM EKAVEVSDEA 1020
ELHEMETEKV LEPECPALEP GPSSPLPSPM GELSCPAPSP APALDDFSGL GEDTALLDGT 1080
DIPGSQAEAG QTSGSLTSEL KGSPVLLDPE ELTPVTPMEV YGPECKQVGQ GSPCEEQEEP 1140
RAPVAPTPPT LIKSDIVNEI SNLSQGDASA SFPGSEPLLG SPDPEGGGSL SMELGVSTDV 1200
SPARDEGSLR LCTDSLPETD DSLLCEAGTV VSGGKADGDK GRRRSSPARS RVKQGRSSSF 1260
PGRRRPRGGA HGGRGRGRAR LKSTTSSIET LVVADIDSSP SKEEEEDDDD TMQNTVVLFS 1320
NTDKFVLMQD MCVVCGSFGR GVEGHLLACS QCSQCYHPYC VNSKITKVML LKGWRCVECI 1380
VCEVCGQASD PSRLLLCDDC DISYHTYCLD PPLLTVPKGG WKCKWCVSCM QCGAASPGFH 1440
CEWQNSYTHC GPCASLVTCP ICHAPYVEED LLIQCRHCER WMHAGCESLF TEEEVEQAAD 1500
EGFDCISCQP YVIKPAVVPV APPELVPMKV KEPEPQYFRF EGVWLTETGM AVLRNLTMSP 1560
LHKRRQRRGR LGLPGEAGLE GSEPSDALGP DDKKDGDLDT DELLKGEGGV EHMECEIKLE 1620
GPVSPDVEPG KEETEESKKR KRKPYRPGIG GFMVRQRKSH TRVKKGPAAQ AEVLSGDGQP 1680
DEVLPADLPA EGPVEQSLAD GDEKKKQQRR GRKKSKLEDM FPAYLQEAFF GKELLDLSRK 1740
ALFAVGVGRP SFASDDLVRP RAPSGLEAGS GSLPLWFRGA DGSLLAIAGP EDGGVKASPV 1800
PSDPEKPGTP GEGMLSSDLD RIPTEELPKM ESKDLQQLFK DVLGSEREQH LGCGTPGLDG 1860
SRTPLQRPFL QGGLPLGNLP SSSPMDSYPS LCQSPFLDSR ERGGFFSPEP GEPDSPWTGS 1920
GGTTPSTPTT PTTEGEGDGL SYNQRSLQRW EKDEELGQLS TISPVLYANI NFPNLKQDYP 1980
DWSSRCKQIM KLWRKVPAAD KAPYLQKAKD NRAAHRINKV QKQAESQINK QTKVGDLARK 2040
TDRPALHLRI PPQPGALGSP PPAAAPTIFI GSPTTPAGLS TSADGFLKPP AGTVPGPDSP 2100
GELFLKLPPQ VPAQVPSQDP SPDPFLKPRC PSLDNLAVPE SPGVGGGKTS EPLLSPPPFG 2160
EPRKALEVKK EELGAASPSY GPPNLGFVDS PSSGPHVGGL ELKAPDVFKA PLTPRASQVE 2220
PQSPGLGLRP QEPPAAQALA PSPPSHPDIF RPGPYPDPYT QPPVTPRPQP PAPEGCCALP 2280
PRSLPSDPFS RVPASPQSQS SSQSPLTPRP LSAEAFCPSP VTPRFQSPDP YSRPPSRPQS 2340
RDPFAPLHKP PRPQPPEVAF KAGPLAHTPL GAGGFPAALP SGPTGELHAK VPAGQPPNFA 2400
RSPGTGAFVG SPSSMRFTFP QAVGEPSLKP PQPGLPPPHG INSHFGPGPT LAKPQSTNYT 2460
VATGNFHPSG SPLGPSSGST GEGYGLSPLR PPSVLPPPVP DGSLPYLSHG ASQRAGITSP 2520
VDKREDPGAG MGSSLAAPEL PGTQDPGMSS LSQTELEKQR QRQRLRELLI RQQIQRNTLR 2580
QEKETAAAAA GAVGPPGSWA GEPSGPAFEQ LNRGQTPFPG SQDKSSLVGL PPNKLSGPGL 2640
GPGPFPGDDR LSRPPPSATP SSLDVNSRQL VGGSQAFYQR PPYPGPLPLQ PQPQQQLWQQ 2700
QQQQQAAAAT SMRLAMSTRF PSTPGPELGR QALGSPLAGI PTRMPGPGEP VPGPVGPAQF 2760
IELRHNVQKG LGPGGAPFPG QGPPQRPRFY PVSEDPHRLA PEGLRSLAVS GLPPQKPSVP 2820
LAPELNSSLH PTSHTKGPAL PTKDIFNEHL RLVESANEKA EREALLRGVE PGSLGPEERP 2880
PPAPEASEPR LAPVLPEVKP KVEESGRHPS PCQFAITTPK VEPAPATPSL GLGLKPGQSV 2940
IGNRDPRMGS GPFSGSGHTT EKGPFGATGG PPAHLLTPNP LGGPGGSSLL EKFDLEGGAL 3000
TLPSGHAPSG DELDKMESSL VASELPLLIE DLLEHEKKEL QKKQQLSAQL QPAQQQQQHS 3060
LLPSSGPAQT MPLPPEAASP GLAGPQQQLA LGLGGARQPG LAQPPAHALQ QRLAPSMAMM 3120
SNQGHMLSGQ HGGQAGLVPQ QGPQPVLAQK PMGSMPPSMC MKPPQLAMQQ QLANSFFPDT 3180
DLDKFAAEDI IDPIAKAKMV ALKGIKKVMA QGSIGVAPGM NRNSRQQVSL LAQRLSGGSG 3240
NDLQNHVTAG SGQERSAGDP SQSRPNPPTF AQGVINEADQ RQYEEWLFHT QQLLQMQLKV 3300
LEEQIGVHRK SRKALCAKQR TAKKAGREFP EADAEKLKLV TEQQSKIQKQ LDQVRKQQKE 3360
HTNLMAEYRN KQQQQQQQQQ QQQQQQQQQH SAVLALSPSQ SPRLLTKLPS QLLPGHGLQP 3420
PQGPPGGQAG GLRLPPGSMA LSGQPAGPFL NTALAQQQQQ QQHSGGAGAL AGPSGGFFPG 3480
NLALRGLGPD SRLLQERQLQ LQQQRMQLAQ KLQQQQQQHL LGQVAIQQQQ QQGSGVQANQ 3540
ALGPKPPGLL PPSSHQGLLV QQLSPQPPQG PQGMLGPAQV AVLQQQHQQH PGALGPQGPN 3600
RQVLLTQSRV LSSPQLAQQG QGLMGHRLVT AQQQQQQQQQ QQQGSMAGLS HLQQGLLPHS 3660
GQPKLSAQPM GTLQQQQFQQ QQQQQQLQQQ QQQFQQQQQQ LQQQQLQQQQ QLQQQQLQQQ 3720
QQQLQQQQQQ LQQQQQQQQQ FQQQQQQQQM GLLNQSRTLL SPQQQQQPQA TLGPGVPAKP 3780
LQHFSSPGAL GPTLLLTGKE QGIGETALPA EVTEGSSTHQ GGPLAIGTTP ESMAAEPGEG 3840
KPPLSGDSQL LLVQPQAQPQ AQPQPSSLQL QPPLRLPEQQ QQANVLHTAG GGSHGLLGSG 3900
SSSEASSVPH LLAPPSVSLG EHPGPMSQNL LGSQHPLALE RPMQSTAGPQ LPKAGPVPQS 3960
GQGLPGAGVV PTVGQLRAQL QGVLAKNPQL RHLSPQQQQQ LQALLMQRHL QQSQAVRHTP 4020
PYQEPGTQPS PLQGLLGRQP QLGAFPAPQP GPLQELGAGP RPQGPPRLSA PQGALSTGPV 4080
LGPVHPTPPP SSPQEPKRPS PQVPSPSSQL PSEVQLPPNQ PGTPKPQGLP SELPPGRVSP 4140
AAAQLVDTFF GKGLGPWGPP DNLAEAQKLE QSSLVAGHLE QVNGQPVPEP PHLSIKQEPR 4200
EEPCALGAPA VKREANGEPV GAPGTSNHLL LAGPRSEAGH LLLQKLLRAK SVQLSTGRGP 4260
EGLRTEINGH IDSKLAGLEQ KLQGTPSSKE DTAARKPLTP KPKRVQKASD RLVSSRKKLR 4320
KEDGVRAGEA LLKQLKQELS LLPLMEPTIT ANFSLFAPFG SGCPVNGQCQ LRGAFGNGAL 4380
PTGPDYYSQL LTKNNLSNPP TPPSSLPPTP PPSVQQKMVN GVTPSEELGE HPKDAASARE 4440
TEGALRDASE VKSLDLLAAL PTPPHNQTED VRMESDEDSD SPDSIVPASS PESILGEEAP 4500
RFPQLGSGRW EQDDRALSPV IPIIPRTSIP VFPDTKPYGA LDLEAPGKLP ASTWEKGKGS 4560
EVSVMLTVSA AAAKNLNGVM VAVAELLSMK IPNSYEVLFP ESPARAGIEP KKGEAEGPGG 4620
KEKGLGGKSP EAGPDWLKQF DAILPGYTLK SQLDILSLLK QESPAPEPPT QHSYTYNVSN 4680
LDVRQLSAPP PEEPSPPPSP MAPSPASPPT EPLGELPAEP SAEPPVPSPL PLASSPESAR 4740
PKPRARPPEE GEDSRPPRLK KWKGVRWKRL RLLLTIQKGS GRQEDEREVA EFMEQLGTAL 4800
RPDKVPRDMR RCCFCHEEGD GATDGPARLL NLDLDLWVHL NCALWSTEVY ETQGGALMNV 4860
EVALHRGLLT KCSLCQRTGA TSSCNRMRCP NVYHFACAIR AKCMFFKDKT MLCPMHKIKG 4920
PCEQELSSFA VFRRVYIERD EVKQIASIIQ RGERLHMFRV GGLVFHAIGQ LLPHQMADFH 4980
SATALYPVGY EATRIYWSLR TNNRRCCYRC SIGENSGRPE FVIKVMEQGL EDLVFTDASP 5040
QAVWNRIIEP VAAMRKEADM LRLFPEYLKG EELFGLTVHA VLRIAESLPG VESCQNYLFR 5100
YGRHPLMELP LMINPTGCAR SEPKILTHYK RPHTLNSTSM SKAYQSTFTG ETNTPYSKQF 5160
VHSKSSQYRR LRTEWKNNVY LARSRIQGLG LYAAKDLEKH TMVIEYIGTI IRNEVANRRE 5220
KIYEEQNRGI YMFRINNEHV IDATLTGGPA RYINHSCAPN CVAEVVTFDK EDKIIIISSR 5280
RIPKGEELTY DYQFDFEDDQ HKIPCHCGAW NCRKWMN 5317
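The protein sequence above is printed in groups of ten residues with a running position counter at the end of each line. The following is a minimal sketch, assuming that layout, of how the blocks could be joined back into one plain string and checked against the last counter. The function name `parse_numbered_blocks` is hypothetical and not part of WERAM or Ensembl.

```python
def parse_numbered_blocks(text):
    """Join space-separated sequence groups into one string.

    Assumes each line ends with a running position counter
    (for example '... LQEPPQDGSE 60'), as in the record above.
    Returns the joined sequence and the last counter seen.
    """
    pieces = []
    last_counter = None
    for line in text.strip().splitlines():
        tokens = line.split()
        if not tokens:
            continue
        # The trailing all-digit token is the position counter, not sequence.
        if tokens[-1].isdigit():
            last_counter = int(tokens[-1])
            tokens = tokens[:-1]
        pieces.append("".join(tokens))
    return "".join(pieces), last_counter


# First two lines of the protein record above.
example = """\
MDSLKPPGED KDSEPAADGP AASEESGAAE PDLPDLHVGE VSVPGSGSAR LQEPPQDGSE 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPQCPVVPPG GDSGPKEAVL PSEDLSQIGF 120
"""
sequence, counter = parse_numbered_blocks(example)
# If the layout assumption holds, the joined length equals the last counter.
assert len(sequence) == counter
```

Applied to the full block, the same length check should report 5317 residues if no group was lost in extraction.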
Nucleotide Sequence
ATGGATAGCC TGAAGCCGCC TGGTGAGGAT AAAGATTCAG AACCAGCAGC TGATGGACCC 60
GCAGCCTCCG AAGAGTCAGG TGCCGCTGAG CCAGACCTTC CCGACCTACA CGTGGGGGAG 120
GTCTCTGTCC CTGGTTCTGG CAGTGCCAGG CTTCAGGAGC CTCCCCAGGA CGGCAGTGAG 180
GGTCCAGTGC GACGCTGTGC TCTCTGTAAC TGCGGGGAGC CCAGTCTGCA TGGGCAGCGG 240
GAGCTACGGC GCTTTGAGTT GCCATTTGAC TGGCCCCAGT GTCCAGTGGT ACCCCCCGGA 300
GGGGATTCAG GACCTAAGGA GGCAGTACTG CCCAGTGAGG ACCTATCACA GATTGGTTTC 360
CCTGAGGGCT TGACACCTGC CCACCTAGGA GAACCTGGAG GGCCCTGCTG GGCTCACCAT 420
TGGTGTGCTG CGTGGTCGGC AGGCGTGTGG GGGCAGGAGG GCCCAGAACT ATGCGGTGTG 480
GACAAGGCCA TCTTCTCAGG GATCTCACAG CGCTGCTCTC ACTGCACCAG GCTCGGTGCC 540
TCCATCCCTT GCCGCTCGCC TGGATGTCCA CGGCTTTACC ACTTCCCCTG CGCGACTGCC 600
AGCGGTTCCT TCTTGTCCAT GAAAACACTG CAGCTGCTCT GCCCACAGCA CAGTGAGGGG 660
GCTGCACATC TGGAAGAGGC CCATTGTGCA GTATGTGAGG GGCCAGGGGA ATTGTGCGAC 720
CTGTTCTTCT GTACCAGCTG TGGGCATCAC TATCACGGGG CCTGTCTGGA CACTGCTCTG 780
ACGGCCCGCA AGCGTGCTGG CTGGCAGTGC CCTGAGTGCA AAGTGTGCCA AGCCTGCAGG 840
AAACCCGGGA ATGACTCTAA GATGTTGGTC TGTGAGACGT GTGACAAAGG ATACCATACT 900
TTCTGCCTAA AACCACCCAT GGAGGAACTG CCTGCTCATT CGTGGAAGTG CAAGGCCTGC 960
CGGGTATGCC GGGCCTGCGG GGCAGGCTCA GCAGAGCTAC ACCCCAACTC TGAGTGGTTT 1020
GAGAACTACT CACTCTGTCA CCCCTGTCAC CAGGCCCAGA GAGGCCAGCC TGTCAGTTCC 1080
GTTGCTGAGC AGCATCCCCC AGTCTGTAGC AGATTCTCAC CCCCAGAGCC CGGCGCTACC 1140
CCCACCGATG AGCCTGGCGC TCTGTACTTT GCATGCCGAG GGCAGCCAGA GGGTGGGGAC 1200
GTGACCGCTA TGCAATCCAA GGAACCGGGG CCCCTGCACT GTGAAGCCAA ACCACTAGGG 1260
AGAGCAGAGG CCCAACCTGA GCCCCAGCCG GAGGCCCCCC TGAGTGAGGA AATGCCACTG 1320
CTGCCCCCTC CTGAGGAGTC GCCCCTGTCC CCACCGCCTG AGGAATCACC CACATCCCCG 1380
CCTCCGGAGG CGTCGCGCCT GTCCCCACCA CCGCCGGAGG AGTCTCCCCT GTCTCCTCCA 1440
CCTGAGTCAT CACCGTTTTC TCCACTTGAG GAGTCACCCT TCTCTCCACC AGAGGAGTCG 1500
CCCCCATCCC CCTCACCCGA GACACCCCTC TCCCCGCCAC CTAAAGCGTC ATCCCTGTCC 1560
CCACCATTGG AGGAGTCTCC CCTGTCCCCG CCACCCGAGG AGCTGCCCAC GTCCCCACCG 1620
CCTGAAGCAT CTCGACTGTC CCCCCCGCCT GAAGAGTCTC CCATGTCCCC TCCTCCCGAA 1680
GAGTCACCCA CGTCTCCGCC ACCCGAGGCA TCGTGCCTCT TCCCACCATT TGAAGAGTCT 1740
CCCCTGTCCC CTCCACCTGA AGAGTCTCCC CTCTCCCCAC CGCCTGAGGC GTTGCGCCTG 1800
TCCCCACCAC CTGAGGACTC ACCCATGTCC CCGCCGCCTG AAGATTCGCC TATGTCCCCT 1860
CCACCTGAGG TGTCCCGCCT GTCTCCACCA CCTGAGGAAT CTCCTCTGTC CCCACCGCCT 1920
GAGGAGTCTC CCACATCCCC TCCACCTGAG GCATCGCGCC TGTCCCCGCC ACCTGAGGAC 1980
TCTCCCACGT CCCCACCCCG GCCTGAGAAG CCGCATCTGT CCCCCCGGCC TGAGGAGCCG 2040
TGCCTGTTCC CTGCGCCCGA GGAGCCGCGT CTGTCCCCTG CGCCCGAGGA GCCGCATCTG 2100
TCCCCTGCGC CTGCCGAGCC ACGCCTGTCC CCTGCAGCGG AGGAGCCACG CCTGTCCCCT 2160
GCGCCCGAGG AGCCGCGCCT GTCCCCTGCG CTCGAGGAGC CGCACCTGTC CCCTGCGCCC 2220
GAGGAGCCGC GCTTGTCCCC TGAGCCAGAG GAGCCGCATC TGTCCACTGC ACCCGAGGAG 2280
CCGCACCTGT CCCCTGCACC CGAGGAGCCG CTCCTGTCCG CTGCGCCTGA GGAGCCGTGC 2340
CTGTCCGCTG CGCCCGAGGA GCCGCGCCTG TCCCCTGCGC CCGAGGAGCC GCGCCTGTCC 2400
CCTGTGTCCG AGGAGCCGCG CCTGTCCACT GCTCCCGAGG AGCCGTGCCT GTCCCCTGCG 2460
CCCAAGGAGC CGCGCATGTC TCCCCAGCCT CAAGAGTCCT TTGAGGAACC AGGCTTGTGC 2520
CCCACACCTG AGGAGTTGCC CTTGGTTCCG CCATCTGGGG AGTCACCTCT GTCACCCCTG 2580
CTTGGAGAGC CGGCCCTGTC TGAGCCTGGG GAGCCACCTC TGTCTCCTCT GACTGAGGAG 2640
CTGCCCTTGT CCCCATCTGG CGAGCCATCC TTGTCGCCTC AGCTGATGCC ACCAGATCCT 2700
CTTCCTCCTC CGCTGTCACC TATTATCACA GCTGTGGCGC CGCCTGCCCT GTCTCCTTTG 2760
GGGCAGTTAG AGTACCCCTT TGGTGCCAAA GGGGACAGTG ACCCTGAGTC ACCGCTGGCT 2820
GCCCCCATCC TAGAAACACC CATCAGCCCT CCCCCTGAAG CTAACTGCAC TGACCCTGAG 2880
CCTGTGCCCC CTATGATCCT CCCCCCATCT CCAGGTTCCC CAATGGGACT GGCCTCTCCC 2940
ATGCTGATTT CTCTTCCCCC TCAATCTCCC CTTCCTTCCC AGTGCTTCCC TCCTGCTCTG 3000
CGCCTGTCCA TTCCCCCCTT GAGTCCCATG GAGAAGGCAG TGGAGGTCTC AGATGAAGCT 3060
GAGCTGCATG AGATGGAGAC TGAGAAGGTC CTGGAACCGG AGTGCCCAGC CTTGGAGCCC 3120
GGCCCCAGCA GTCCTCTTCC CTCCCCCATG GGGGAGCTTT CCTGCCCTGC CCCCAGCCCT 3180
GCCCCAGCCC TGGATGACTT CTCAGGCCTG GGGGAAGACA CAGCCCTTCT GGATGGGACT 3240
GACATTCCTG GTTCACAGGC AGAGGCTGGA CAGACCTCTG GCAGTTTGAC TAGTGAACTT 3300
AAGGGTTCCC CTGTGCTCCT GGACCCCGAG GAGCTGACCC CTGTGACCCC TATGGAGGTC 3360
TATGGCCCAG AATGCAAACA GGTCGGGCAG GGCTCCCCCT GTGAAGAGCA GGAGGAGCCA 3420
CGTGCCCCCG TGGCCCCCAC TCCACCCACT CTCATTAAAT CCGACATTGT TAATGAGATC 3480
TCCAATCTGA GCCAGGGCGA TGCCAGCGCC AGCTTTCCTG GCTCAGAGCC TCTGCTGGGC 3540
TCTCCCGACC CTGAGGGGGG CGGCTCCCTG TCCATGGAGC TGGGGGTATC GACAGACGTG 3600
AGCCCAGCCC GAGATGAGGG CTCCCTGCGG CTCTGTACTG ATTCGCTGCC CGAGACCGAT 3660
GACTCGCTGT TATGTGAAGC TGGGACAGTT GTCAGTGGAG GCAAAGCCGA CGGGGACAAG 3720
GGGAGGCGGC GGAGTTCCCC CGCTCGTTCC CGCGTCAAAC AGGGACGCAG TAGTAGTTTC 3780
CCAGGAAGAC GCAGGCCACG TGGAGGAGCA CACGGAGGAC GCGGGAGAGG ACGGGCCCGG 3840
CTAAAATCAA CTACGTCTTC CATTGAGACT CTGGTAGTTG CTGATATCGA TAGCTCTCCC 3900
AGCAAGGAGG AGGAAGAAGA TGATGATGAC ACCATGCAAA ACACTGTGGT TCTCTTCTCC 3960
AACACAGACA AATTTGTCCT AATGCAGGAC ATGTGCGTGG TATGTGGCAG CTTTGGCCGG 4020
GGGGTGGAGG GCCACCTCCT GGCCTGTTCC CAGTGCTCTC AGTGCTATCA TCCTTACTGT 4080
GTCAACAGCA AGATCACCAA GGTGATGCTG TTGAAGGGCT GGCGTTGCGT GGAGTGTATT 4140
GTGTGCGAGG TGTGTGGCCA GGCCTCTGAC CCCTCACGCC TGCTGCTCTG TGATGACTGT 4200
GACATTAGCT ACCACACGTA CTGCCTGGAC CCCCCACTGC TCACTGTGCC CAAGGGCGGC 4260
TGGAAGTGCA AGTGGTGTGT GTCCTGTATG CAGTGTGGGG CTGCCTCCCC TGGCTTCCAC 4320
TGCGAGTGGC AGAATAGTTA CACTCACTGC GGGCCCTGCG CCAGTCTGGT GACCTGCCCT 4380
ATCTGCCATG CCCCATACGT GGAGGAGGAC TTACTCATCC AGTGCCGCCA CTGTGAACGG 4440
TGGATGCATG CTGGCTGCGA GAGCCTCTTC ACGGAGGAGG AGGTGGAGCA GGCAGCCGAC 4500
GAGGGCTTTG ACTGCATCTC CTGCCAGCCC TACGTGATAA AACCTGCGGT AGTGCCTGTC 4560
GCGCCTCCAG AGTTGGTGCC TATGAAGGTG AAAGAGCCAG AGCCTCAGTA CTTCCGCTTC 4620
GAGGGTGTGT GGCTGACAGA AACTGGCATG GCCGTGCTGC GGAACCTGAC CATGTCACCC 4680
CTGCACAAGC GGCGCCAGCG GCGGGGACGG CTCGGCCTCC CAGGCGAGGC AGGGCTGGAA 4740
GGTTCAGAGC CCTCGGATGC CCTTGGCCCT GATGACAAGA AGGATGGGGA CCTGGACACT 4800
GATGAGCTGC TCAAGGGTGA AGGTGGTGTG GAGCACATGG AATGTGAAAT TAAACTGGAG 4860
GGCCCCGTCA GCCCTGACGT GGAGCCTGGC AAGGAAGAGA CCGAGGAAAG CAAAAAGCGC 4920
AAGCGCAAAC CCTACCGGCC TGGCATTGGT GGTTTCATGG TGCGACAGCG GAAATCCCAC 4980
ACACGTGTGA AAAAGGGGCC TGCTGCACAG GCGGAGGTGT TGAGTGGGGA TGGGCAGCCC 5040
GACGAGGTGC TGCCTGCTGA CCTGCCCGCA GAGGGCCCTG TGGAGCAGAG CTTGGCCGAT 5100
GGGGACGAGA AGAAGAAGCA GCAGCGGCGC GGGCGCAAGA AGAGCAAACT CGAGGATATG 5160
TTTCCTGCTT ACCTGCAGGA GGCCTTCTTT GGGAAGGAGC TGCTGGACCT GAGCCGTAAG 5220
GCCCTTTTTG CGGTTGGGGT GGGCCGACCG AGCTTTGCCT CTGACGATCT CGTCCGCCCC 5280
CGGGCACCGT CAGGGCTGGA GGCTGGAAGC GGTTCCCTGC CCCTCTGGTT CAGGGGGGCT 5340
GATGGGTCTC TCCTGGCAAT TGCAGGACCT GAGGATGGGG GCGTAAAGGC GTCCCCAGTG 5400
CCCAGTGACC CTGAGAAGCC AGGCACCCCG GGTGAAGGGA TGCTTAGCTC TGACTTAGAC 5460
AGGATCCCCA CAGAAGAACT GCCCAAGATG GAATCCAAGG ACCTGCAGCA GCTCTTCAAG 5520
GATGTTTTGG GTTCTGAGCG CGAGCAGCAC CTGGGATGTG GAACCCCTGG CCTGGATGGC 5580
AGCCGTACAC CCCTGCAGAG GCCCTTTCTC CAAGGTGGAC TCCCTTTGGG CAATCTCCCC 5640
TCCAGTAGCC CCATGGACTC CTACCCGAGC CTCTGCCAGT CCCCGTTCTT GGACAGCAGG 5700
GAGCGCGGGG GCTTCTTCAG CCCGGAACCC GGTGAGCCAG ACAGCCCCTG GACAGGCTCA 5760
GGGGGCACCA CGCCCTCCAC CCCCACCACG CCAACCACAG AGGGTGAGGG CGACGGGCTC 5820
TCCTATAACC AGCGGAGTCT TCAGCGCTGG GAGAAGGATG AGGAGTTGGG TCAGCTCTCT 5880
ACCATCTCGC CTGTGCTCTA TGCCAACATT AACTTCCCCA ATCTCAAGCA AGATTACCCA 5940
GACTGGTCTA GCCGCTGCAA ACAAATCATG AAGCTCTGGA GAAAGGTTCC AGCTGCTGAC 6000
AAAGCCCCCT ATCTGCAAAA GGCCAAAGAT AACCGGGCGG CTCACCGCAT CAACAAGGTG 6060
CAGAAGCAGG CTGAGAGCCA GATCAACAAG CAGACCAAGG TGGGCGACTT AGCCCGTAAG 6120
ACTGACCGAC CGGCCCTACA TCTCCGCATT CCCCCCCAGC CAGGGGCACT GGGCAGTCCG 6180
CCCCCCGCCG CTGCCCCCAC CATTTTCATT GGCAGCCCCA CCACCCCCGC CGGCTTGTCT 6240
ACCTCTGCGG ACGGGTTCCT GAAGCCACCA GCAGGCACGG TGCCCGGCCC CGACTCGCCC 6300
GGTGAGCTCT TCCTCAAGCT CCCACCCCAG GTGCCCGCCC AAGTGCCTTC GCAGGACCCC 6360
TCCCCTGACC CCTTTCTCAA ACCCCGCTGT CCCTCCCTGG ACAACCTGGC TGTGCCTGAG 6420
AGCCCAGGGG TAGGGGGAGG CAAGACTTCC GAGCCCCTGC TCTCACCCCC ACCTTTTGGG 6480
GAGCCCCGGA AAGCCCTAGA GGTGAAGAAG GAAGAGCTTG GGGCTGCCTC GCCTAGCTAC 6540
GGGCCCCCTA ACCTGGGCTT TGTTGACTCA CCCTCCTCAG GCCCCCACGT GGGTGGCCTG 6600
GAGTTAAAGG CACCTGATGT CTTCAAAGCC CCCCTGACCC CTCGGGCATC TCAGGTAGAG 6660
CCCCAAAGCC CGGGCTTGGG CCTGCGGCCC CAGGAGCCAC CCGCCGCCCA GGCTTTGGCC 6720
CCTTCTCCCC CCAGCCACCC TGACATCTTT CGCCCGGGCC CCTACCCTGA CCCCTATACG 6780
CAGCCCCCAG TCACGCCTCG GCCTCAGCCC CCAGCCCCTG AGGGTTGCTG TGCCCTGCCC 6840
CCTCGCTCCC TGCCCTCCGA CCCTTTTTCC CGAGTGCCCG CCAGCCCCCA GTCCCAGTCC 6900
AGCTCACAGT CCCCGCTGAC ACCCCGTCCT CTGTCTGCCG AGGCTTTCTG CCCGTCCCCA 6960
GTCACCCCTC GCTTCCAGTC CCCTGACCCG TATTCCCGCC CACCCTCACG CCCGCAGTCC 7020
CGGGACCCGT TTGCCCCATT GCATAAGCCA CCGCGACCCC AGCCCCCTGA AGTTGCCTTC 7080
AAGGCTGGGC CTCTAGCCCA CACTCCGCTG GGGGCTGGGG GCTTCCCAGC AGCTCTGCCC 7140
TCAGGGCCGA CAGGCGAGCT CCATGCCAAG GTCCCAGCTG GGCAGCCCCC CAATTTCGCC 7200
CGCTCTCCTG GAACAGGCGC ATTTGTGGGG AGCCCCTCTT CCATGCGTTT CACTTTCCCT 7260
CAGGCGGTCG GGGAGCCGTC CCTAAAGCCC CCTCAGCCTG GTCTCCCCCC ACCCCATGGG 7320
ATCAACAGCC ATTTTGGGCC TGGCCCTACC TTGGCCAAGC CCCAAAGCAC AAACTACACA 7380
GTAGCCACAG GGAACTTCCA CCCGTCGGGC AGCCCCCTGG GGCCCAGCAG TGGGTCCACA 7440
GGAGAGGGCT ACGGGCTGTC CCCGCTACGC CCCCCGTCTG TCCTGCCACC ACCTGTACCC 7500
GACGGGTCCC TTCCCTACCT GTCCCATGGA GCCTCGCAGC GGGCAGGCAT CACCTCTCCA 7560
GTCGATAAGC GAGAAGACCC CGGGGCTGGA ATGGGCAGTT CCCTGGCAGC ACCTGAACTC 7620
CCAGGTACCC AGGATCCAGG CATGTCCAGC CTCAGCCAGA CAGAGCTGGA GAAGCAGCGG 7680
CAGCGCCAGC GACTGCGGGA GCTGCTGATT CGACAGCAGA TCCAGCGTAA TACCCTGCGG 7740
CAGGAGAAGG AGACGGCCGC GGCAGCCGCG GGAGCCGTGG GGCCGCCGGG CAGCTGGGCT 7800
GGTGAGCCCA GCGGTCCTGC CTTTGAGCAG CTGAATCGAG GCCAGACCCC CTTCCCTGGG 7860
AGCCAGGACA AGAGCAGCCT CGTGGGACTG CCCCCAAACA AGCTAAGTGG CCCCGGCCTG 7920
GGGCCAGGGC CTTTCCCTGG CGACGATCGA CTCTCCCGGC CACCTCCATC AGCTACCCCT 7980
TCCTCTCTGG ACGTGAACAG CCGGCAACTG GTAGGCGGCT CTCAAGCCTT CTATCAGCGA 8040
CCACCCTATC CTGGGCCCCT GCCCTTACAG CCACAACCGC AGCAACAACT GTGGCAGCAG 8100
CAGCAGCAGC AGCAGGCAGC AGCAGCAACC TCCATGCGAC TGGCCATGTC CACGCGCTTT 8160
CCGTCAACTC CTGGGCCTGA ACTTGGCCGC CAAGCCCTCG GATCCCCCTT GGCAGGAATT 8220
CCCACCCGCA TGCCTGGCCC CGGTGAGCCG GTGCCTGGTC CGGTTGGTCC TGCGCAGTTC 8280
ATTGAGTTGC GGCACAATGT GCAGAAAGGA CTTGGACCTG GGGGGGCTCC GTTCCCTGGT 8340
CAGGGGCCCC CTCAGAGACC CCGTTTTTAC CCTGTAAGTG AGGACCCTCA CCGACTGGCC 8400
CCTGAAGGGC TTCGGAGCCT GGCGGTCTCA GGTCTTCCCC CACAGAAACC CTCAGTCCCT 8460
CTGGCCCCTG AACTGAACAG CAGCCTCCAT CCAACATCCC ACACCAAGGG CCCTGCTCTG 8520
CCCACTAAGG ACATCTTCAA CGAACATCTG AGGCTGGTGG AGTCGGCGAA TGAGAAGGCC 8580
GAGCGGGAGG CCCTGCTGCG GGGGGTGGAG CCGGGATCCT TGGGCCCCGA GGAGCGCCCT 8640
CCCCCTGCCC CTGAGGCCTC TGAGCCGCGC CTGGCACCAG TGCTCCCTGA GGTGAAGCCC 8700
AAGGTGGAGG AGAGTGGGCG CCACCCTTCC CCTTGCCAGT TCGCCATCAC CACCCCCAAG 8760
GTAGAGCCAG CCCCTGCCAC TCCTTCCCTG GGCCTGGGGC TGAAGCCAGG ACAGAGCGTA 8820
ATCGGCAATC GGGACCCCCG GATGGGCTCA GGACCCTTTT CTGGCAGTGG GCACACAACT 8880
GAGAAGGGTC CCTTTGGGGC CACGGGAGGA CCACCGGCTC ACCTGCTGAC TCCCAACCCC 8940
CTGGGTGGCC CAGGAGGGTC CTCCCTATTG GAAAAGTTTG ATCTAGAGGG AGGAGCCCTC 9000
ACCTTGCCCA GTGGACATGC ACCATCTGGG GATGAACTGG ACAAGATGGA GAGCTCACTG 9060
GTGGCCAGCG AATTGCCCCT ACTCATCGAG GACCTACTGG AACACGAGAA GAAGGAGCTG 9120
CAGAAGAAGC AACAGCTTTC AGCACAGCTG CAACCTGCCC AGCAGCAGCA GCAACACTCC 9180
CTACTGCCCT CCTCAGGCCC TGCACAGACC ATGCCTCTGC CACCTGAGGC TGCTTCTCCT 9240
GGCCTGGCCG GGCCGCAGCA GCAGCTCGCC CTGGGCCTCG GGGGAGCCCG GCAGCCAGGC 9300
TTGGCCCAGC CACCCGCTCA CGCCCTCCAG CAGCGCCTGG CACCATCGAT GGCCATGATG 9360
TCCAACCAAG GGCACATGCT AAGTGGGCAG CATGGGGGCC AGGCAGGCTT GGTGCCCCAG 9420
CAGGGCCCAC AGCCAGTGCT GGCACAGAAG CCCATGGGGA GCATGCCGCC CTCCATGTGC 9480
ATGAAGCCCC CGCAGCTGGC GATGCAGCAG CAGCTGGCCA ACAGCTTCTT CCCAGATACA 9540
GACCTGGATA AATTTGCTGC CGAAGATATT ATTGATCCCA TTGCTAAGGC CAAGATGGTA 9600
GCTTTGAAAG GCATCAAGAA AGTGATGGCT CAGGGCAGCA TTGGGGTGGC ACCCGGTATG 9660
AACAGGAACT CCAGGCAGCA AGTGTCCCTC CTAGCTCAGC GACTCTCTGG AGGGTCTGGC 9720
AATGACCTGC AAAACCATGT GACAGCTGGG AGTGGTCAGG AGCGAAGTGC CGGTGATCCC 9780
TCCCAGTCTC GTCCAAACCC ACCCACTTTT GCCCAGGGAG TGATCAATGA GGCGGACCAG 9840
CGGCAGTATG AGGAGTGGCT GTTCCATACC CAGCAGCTCC TACAGATGCA GCTGAAGGTG 9900
CTAGAGGAGC AGATTGGTGT GCATCGCAAG TCCCGGAAAG CTCTGTGTGC CAAGCAACGC 9960
ACCGCCAAAA AGGCCGGCCG GGAGTTCCCA GAGGCCGATG CTGAGAAGCT CAAGCTGGTT 10020
ACGGAGCAGC AGAGCAAGAT CCAGAAACAG CTGGATCAGG TCCGGAAACA GCAGAAGGAG 10080
CACACTAATC TCATGGCAGA GTATCGGAAT AAGCAGCAGC AACAGCAGCA GCAGCAACAG 10140
CAGCAGCAGC AGCAGCAGCA GCAGCAGCAC TCAGCCGTGC TGGCCCTCAG CCCTTCCCAG 10200
AGTCCCCGGC TGCTTACAAA GCTCCCTAGT CAGCTGCTCC CTGGCCATGG GCTGCAGCCT 10260
CCGCAGGGGC CCCCAGGTGG GCAAGCTGGA GGTCTTCGCC TGCCCCCTGG GAGCATGGCA 10320
CTCTCTGGAC AGCCTGCTGG TCCCTTCCTC AACACAGCCC TGGCCCAGCA GCAGCAGCAG 10380
CAGCAGCATT CTGGTGGGGC TGGGGCCCTG GCTGGCCCCT CAGGGGGCTT CTTCCCTGGC 10440
AACTTGGCCC TTCGGGGCCT GGGACCTGAT TCGAGGCTCT TACAGGAAAG ACAGCTGCAG 10500
CTCCAGCAGC AACGCATGCA GCTGGCTCAG AAACTGCAGC AGCAGCAGCA GCAGCACCTC 10560
CTAGGGCAGG TGGCAATCCA GCAGCAACAG CAGCAGGGCT CAGGAGTGCA GGCCAACCAG 10620
GCCCTGGGTC CCAAACCCCC GGGGCTTCTG CCTCCCAGCA GCCATCAGGG TCTCCTGGTC 10680
CAGCAGCTGT CCCCTCAACC ACCCCAGGGA CCCCAGGGCA TGTTGGGCCC TGCCCAGGTG 10740
GCAGTGTTGC AGCAGCAGCA CCAGCAGCAT CCTGGAGCTT TGGGCCCCCA GGGCCCTAAC 10800
AGACAAGTGC TCCTGACCCA GTCCCGGGTC CTGAGTTCCC CTCAGTTGGC ACAGCAGGGT 10860
CAGGGCCTTA TGGGACACCG GTTGGTCACA GCCCAGCAGC AGCAACAGCA ACAGCAGCAG 10920
CAACAACAGG GATCCATGGC AGGGCTCTCC CATCTTCAGC AGGGTCTGCT GCCACACAGC 10980
GGGCAGCCCA AATTGAGTGC TCAACCCATG GGGACCTTAC AACAGCAGCA GTTTCAGCAG 11040
CAGCAGCAAC AGCAGCAACT TCAGCAGCAG CAGCAGCAGT TTCAGCAACA ACAGCAGCAG 11100
CTTCAGCAGC AGCAACTTCA GCAGCAGCAG CAGCTTCAGC AGCAGCAGCT TCAACAGCAG 11160
CAGCAGCAGC TTCAACAACA GCAGCAGCAG CTGCAGCAAC AGCAGCAGCA GCAGCAGCAG 11220
TTTCAGCAGC AGCAGCAGCA GCAGCAGATG GGTCTCTTGA ATCAGAGTCG AACTTTACTG 11280
TCTCCTCAGC AGCAACAGCA GCCGCAGGCG ACGCTTGGCC CTGGCGTGCC AGCCAAGCCT 11340
CTTCAACACT TTTCTAGCCC TGGAGCTCTG GGCCCAACTC TTCTTTTGAC GGGCAAGGAA 11400
CAGGGCATTG GAGAGACAGC TCTTCCTGCA GAGGTCACTG AGGGGTCCTC AACACATCAG 11460
GGAGGGCCGT TAGCAATAGG GACTACGCCT GAATCGATGG CTGCTGAACC AGGGGAGGGA 11520
AAACCCCCAC TCTCTGGAGA CTCCCAGCTC CTGCTCGTGC AGCCCCAGGC CCAGCCCCAG 11580
GCCCAGCCTC AGCCCAGCTC CCTGCAGCTG CAGCCCCCTC TCCGGCTCCC AGAGCAGCAG 11640
CAGCAAGCTA ACGTGCTCCA CACAGCAGGC GGGGGCAGTC ACGGGCTGCT CGGCAGCGGA 11700
TCGTCTTCTG AGGCCTCGTC TGTGCCCCAC CTGCTGGCCC CACCCTCTGT TTCCTTAGGG 11760
GAGCATCCTG GACCCATGAG CCAGAACCTC TTGGGTTCCC AACATCCCCT TGCTCTAGAG 11820
CGGCCTATGC AAAGTACTGC AGGGCCACAG CTTCCCAAAG CAGGACCTGT CCCCCAGTCT 11880
GGGCAGGGCC TGCCTGGGGC TGGAGTCGTG CCTACAGTGG GTCAGCTTCG AGCACAGCTC 11940
CAGGGAGTCC TGGCCAAGAA CCCTCAGCTG CGGCACTTGA GCCCTCAGCA GCAGCAGCAG 12000
CTGCAGGCGC TCCTCATGCA GCGGCACCTG CAGCAGAGTC AGGCTGTCCG CCACACCCCA 12060
CCCTACCAGG AGCCCGGGAC CCAGCCCTCT CCCCTCCAGG GCCTCCTAGG CCGCCAACCC 12120
CAACTCGGGG CCTTCCCTGC ACCCCAGCCG GGCCCTCTCC AGGAGCTAGG GGCAGGACCT 12180
CGACCTCAGG GCCCACCCCG GCTCTCCGCC CCACAAGGAG CCTTATCGAC AGGACCAGTC 12240
CTTGGCCCTG TCCATCCTAC CCCTCCACCA TCCAGCCCCC AAGAGCCAAA GAGACCTTCC 12300
CCACAAGTAC CTTCCCCCAG CTCCCAGCTC CCCTCCGAGG TCCAACTCCC CCCCAACCAG 12360
CCAGGGACCC CAAAGCCCCA GGGGCTACCC TCTGAGCTGC CTCCTGGGAG GGTCTCACCT 12420
GCTGCTGCCC AGCTTGTGGA CACGTTCTTT GGCAAGGGGC TGGGACCTTG GGGCCCCCCA 12480
GACAACTTGG CAGAAGCCCA GAAGCTGGAA CAGAGCAGCT TGGTAGCTGG GCATCTGGAG 12540
CAGGTGAATG GGCAGCCAGT GCCTGAGCCG CCCCATCTCA GCATCAAGCA GGAGCCTCGG 12600
GAAGAGCCGT GTGCCCTGGG AGCCCCGGCG GTGAAGAGGG AGGCCAATGG GGAGCCAGTA 12660
GGGGCACCTG GTACCAGCAA CCACCTCCTG CTGGCAGGCC CCCGCTCAGA GGCTGGGCAC 12720
CTGCTCTTGC AGAAGCTTCT ACGGGCAAAG AGTGTGCAAC TTAGCACTGG GCGGGGGCCC 12780
GAGGGACTGC GAACTGAGAT CAACGGGCAC ATTGACAGCA AGCTGGCTGG CCTGGAGCAG 12840
AAACTACAGG GTACTCCCAG CAGCAAGGAG GATACAGCAG CAAGGAAGCC ATTGACCCCG 12900
AAGCCCAAGA GGGTACAGAA AGCAAGCGAC AGGTTGGTGA GCTCCCGAAA GAAGCTGCGG 12960
AAGGAGGACG GGGTCAGGGC CGGCGAGGCC TTGCTGAAAC AGCTGAAACA GGAGCTCTCC 13020
TTGCTGCCCC TGATGGAGCC TACCATCACT GCCAATTTCA GCCTCTTTGC CCCTTTCGGC 13080
AGTGGCTGCC CAGTCAACGG GCAGTGTCAG CTGAGGGGGG CCTTTGGAAA CGGGGCACTG 13140
CCTACTGGCC CTGACTACTA TTCCCAGCTG CTCACCAAGA ATAACCTGAG TAACCCGCCG 13200
ACACCACCCT CGTCGCTGCC CCCCACCCCA CCCCCATCGG TGCAGCAGAA GATGGTGAAT 13260
GGCGTCACCC CATCAGAAGA GCTGGGGGAG CACCCCAAGG ATGCCGCCTC TGCCCGGGAG 13320
ACGGAAGGTG CGCTGAGAGA TGCTTCAGAG GTGAAGAGCC TAGACCTGCT GGCTGCCTTG 13380
CCTACACCCC CTCACAATCA GACTGAGGAC GTCAGGATGG AGAGTGACGA GGACAGTGAT 13440
TCTCCTGACA GCATTGTGCC AGCTTCGTCC CCTGAGAGCA TCCTGGGGGA GGAGGCCCCT 13500
CGATTCCCTC AGCTGGGCTC GGGTCGGTGG GAGCAGGACG ACCGGGCCCT CTCCCCTGTC 13560
ATCCCCATCA TTCCTCGGAC CAGCATCCCA GTCTTCCCAG ATACCAAACC TTATGGGGCC 13620
CTGGACCTGG AGGCGCCCGG AAAGCTTCCT GCCAGCACTT GGGAAAAGGG CAAAGGAAGT 13680
GAGGTGTCGG TCATGCTGAC AGTCTCTGCT GCTGCAGCCA AGAACCTGAA CGGTGTGATG 13740
GTAGCAGTGG CAGAGCTGCT GAGCATGAAG ATCCCCAACT CCTATGAAGT GCTCTTCCCA 13800
GAGAGCCCCG CCCGGGCCGG CATTGAGCCC AAGAAGGGGG AAGCCGAAGG CCCTGGTGGG 13860
AAAGAGAAAG GGCTGGGAGG CAAGAGCCCG GAAGCTGGCC CCGACTGGCT GAAGCAGTTT 13920
GATGCCATAT TGCCCGGCTA TACCCTCAAG AGCCAGCTGG ACATCTTGAG TCTCCTGAAA 13980
CAGGAGAGCC CTGCCCCAGA GCCACCCACT CAGCACAGCT ACACCTACAA TGTCTCCAAC 14040
CTGGACGTGC GACAGCTCTC AGCCCCGCCT CCTGAGGAGC CCTCCCCGCC CCCCTCCCCC 14100
ATGGCACCCT CTCCTGCCAG TCCCCCTACT GAGCCCCTGG GTGAACTCCC AGCCGAACCC 14160
TCGGCGGAGC CACCAGTGCC CTCGCCTCTG CCACTGGCCT CATCCCCTGA GTCAGCCCGG 14220
CCTAAGCCCC GAGCCCGGCC CCCCGAAGAA GGTGAGGATT CTCGCCCCCC GCGCCTCAAG 14280
AAGTGGAAGG GGGTGCGCTG GAAGCGGCTC CGGTTGCTGC TGACCATCCA GAAGGGTAGT 14340
GGGCGGCAGG AGGATGAGCG GGAAGTGGCA GAGTTCATGG AGCAGCTTGG CACAGCCCTG 14400
CGGCCTGACA AGGTGCCTCG GGACATGCGG CGCTGCTGCT TCTGTCACGA GGAGGGCGAC 14460
GGGGCCACCG ACGGGCCCGC CCGCCTCCTG AACCTAGACC TGGACCTGTG GGTGCACCTC 14520
AACTGTGCCC TGTGGTCCAC CGAGGTGTAT GAGACCCAGG GCGGGGCCCT GATGAACGTG 14580
GAGGTGGCCC TGCACCGAGG ACTGCTGACC AAGTGCTCCC TGTGCCAGCG CACTGGTGCC 14640
ACCAGCAGCT GCAATCGCAT GCGTTGCCCC AATGTCTACC ACTTTGCCTG TGCCATCCGT 14700
GCTAAGTGCA TGTTCTTCAA GGACAAGACC ATGCTCTGTC CAATGCATAA GATCAAGGGG 14760
CCCTGTGAAC AGGAGCTGAG CTCTTTTGCT GTCTTCCGGC GGGTCTACAT TGAGCGGGAT 14820
GAGGTGAAGC AAATTGCCAG CATCATCCAG CGGGGCGAGC GGCTGCACAT GTTCCGGGTG 14880
GGGGGCCTCG TGTTCCACGC CATCGGACAG CTCCTCCCTC ATCAGATGGC CGACTTCCAC 14940
AGTGCCACCG CCCTCTATCC CGTGGGCTAC GAGGCCACGC GCATCTACTG GAGCCTCCGT 15000
ACCAACAACC GCCGCTGCTG CTACCGCTGC TCCATCGGTG AGAACAGCGG GCGGCCCGAG 15060
TTCGTAATCA AGGTCATGGA GCAGGGCCTG GAGGACTTGG TCTTCACTGA TGCCTCTCCG 15120
CAGGCCGTGT GGAACCGCAT CATCGAGCCT GTGGCTGCCA TGAGAAAGGA GGCTGACATG 15180
CTGCGGCTCT TCCCTGAGTA CCTGAAAGGC GAGGAACTCT TCGGCCTGAC GGTGCACGCC 15240
GTGCTCCGCA TAGCTGAATC ACTGCCTGGA GTGGAGAGCT GTCAAAACTA TTTATTCCGC 15300
TATGGGCGCC ACCCCCTGAT GGAGCTGCCA CTCATGATCA ACCCCACTGG CTGTGCCCGC 15360
TCAGAGCCCA AAATCCTCAC ACACTACAAA CGGCCCCACA CCCTGAACAG CACCAGCATG 15420
TCCAAGGCCT ATCAGAGCAC CTTCACAGGC GAGACCAACA CGCCGTACAG CAAGCAGTTT 15480
GTGCACTCCA AGTCGTCTCA GTACCGGCGG CTGCGCACTG AGTGGAAGAA CAACGTCTAC 15540
CTGGCTCGCT CCCGCATCCA GGGCCTGGGG CTCTACGCGG CCAAGGACCT CGAGAAGCAC 15600
ACCATGGTCA TCGAGTACAT CGGCACTATC ATTCGCAACG AGGTGGCCAA CCGGCGGGAG 15660
AAGATCTATG AGGAGCAGAA TCGCGGCATC TACATGTTCC GAATAAACAA TGAACATGTG 15720
ATTGATGCGA CGTTGACTGG AGGCCCTGCC AGGTACATTA ACCATTCCTG TGCCCCTAAC 15780
TGTGTGGCGG AAGTCGTGAC ATTCGATAAG GAGGATAAAA TCATCATCAT CTCCAGCCGG 15840
CGAATCCCTA AAGGAGAAGA GCTGACCTAT GACTATCAGT TTGACTTTGA GGACGATCAG 15900
CACAAGATCC CCTGCCACTG TGGAGCCTGG AATTGTCGGA AATGGATGAA CTGAGAAGCT 15960
TTGAGGCTAC CAGGCAGGGG AGTCCCCCCA CCCCCGACCT CTTCCCTGAA AGGGACGAGG 16020
GGGAAGAGAG GTAGCAGCCA GAGCCGGGAC CCAGGGCTGG GGCTGCCGGC CGACCGGAGC 16080
CCCTGGAACA GGAGGCTGGG CCAGTGGGCC TAGGCCAGGC CCACCCTGGG CACCAGGGAC 16140
AGCCCTCTTC CCCGCCACCG GCCCCCAGGC TGGCATCTCT GCCCCCAGCT CCAGGAGGGG 16200
CCAGACAGAA GCAGCCATGG GGCATCTCAG GTTTGAGGGG GCTATGGGCC GGGAACTACC 16260
CAGAAGCATC TGGGAGGCAG CAGGGAGAGG GGAGGAGGAT GTGTGGCCGG GCCTCACAGC 16320
CCTGCTGCTC CCACCCACCT CTCCGGCCCA ACGCGAGGCT GCACAGAGAC TTGACTAAGC 16380
TTGACAATCC CAAAGGCCGG GTCCTACACC TGGCCCTGCC TGCCGGGTCC TGCCCCCACC 16440
CCCACCCCCA CCCTCTTCCC TCGCAATCTG TCTCTGTCTC CCTCCTCTCC TCTGTGTTTC 16500
TGTCTCTCTA TGGGTTGTGT TTCCTTGTTT TCCACTCTGA CAAATGCAAC ATGAACGGGA 16560
AAGAGGCGCC TAGCTGCCCC AGAGGGCGAG CCGGGCGAGC CGGGCAAGGA GACCCCGCAC 16620
CCACACCTAC CTCATTTAAG TGTTGGATTT TTTGCTGTTT TGAAATGTGA GACCCTCTCT 16680
AAGCCCCCCA CTGCCCCAAC CCTCTCCCCC ACCTCACTGC CCTCTTCGAG TGGGTGGAAG 16740
GGGGGGTAGG AGGAGGAAGA AAACACAAAC AACAAAAAAA AATCCATCTT TGTTTTTAAT 16800
TATGGGCATG GGATGGTGGT TGAGGCTGAT GATGATGAAG ATTGGGGATG ACTGGCCCCT 16860
AGTTGCTCCA GGACTTCCTT CTCCATCTGG ACATGGGGGT GGGAGGGGTG TGCTAACCTA 16920
GGACCAGGAT ATCTCCCTCC TGTTTCCCAA CCCCATCATG AACTCATTTG CCCTCCAGCC 16980
CCTGGATGGG GTGAGTCGGG GGGTCGGGTG AGGGCTATCC CTGAGTGGCA TGCCCATACC 17040
CAGTGAGGCA GGGTGTGGCC CGGAGCTCCC ACTTTCCCTC AGTCACCAAA TTGCTGCTGG 17100
TCTGGTGGGA AGGGGTGGTG ACATGGGGGG TGGGGGAGCT TAGTGTCAGC GTGGGGAGGG 17160
TGGGGGGTAT TTATCTATTT ATACATGGGA TTGTACATAG TCTTGTGGGG CATGGGGGAG 17220
CCGGCTGGAG GTGAGAACCC TCCCCTCTCC CCCCACCCCC GGGGAGAGCA AATGTAAAAC 17280
TACTAATTTT TGTGCTTTAT ATATTCTATA TAAATATATC TATTTTCTTT TTACAAAACC 17340
AGTTTATAAA TGGTAGGGGG GTGTGGGGCG GACACATAGA GCTCCCCTTG TGGGGGGGCC 17400
CCCTCCATTA CCCATCCTAC CGCCCTTTTC CTCCCCCTCC CCCCCACTCC CACCCCCTGG 17460
CTGTGACTGC TGTAAGGTGG GGGTATAGAG GCTGGGCGAT TCCCACTCCC TGTTGTATAG 17520
TTGGACTATG GTATAACGCA CAAAAGTGAG CTGGTCCCAG GGGGAGCCAG AGAGTGATGG 17580
GTCCCCTGCC TCCCTCCTCC CTCCCCTTCC CACCCAACCT TGTGCTGCAG TTGAACCTCT 17640
TCCTGGGGGT GGGTGAAGGG AGGGGGTGGG TGAGGCCCCA GACCCCTCTC TGGTAGGGAG 17700
CCATGGGGAT GAAGATGAAG CTTATATGCA GTTTTCTCCT AGGGCCTGTG GGCAAAGCGC 17760
ATTTTGTAAT TACTATTTTC AAGAATCAAA TGTCTGGAGT GTAGGGGTGG GCTTGGTGGC 17820
GGTGGATGGG CGGCAGGCCT GCTGGAGGGG GAGCACGGTG GCTGTTGTGA TTTTAGGTTT 17880
GGTTTGGTTT TGTTTTTTGA ATTTGGAGGG GTGTGGATTG ATGGGGGTAG GGAGATTTTT 17940
TTTTTTTAAG CTGCTTCCTC AACTATTTCA AGCTGCAAAT GTTTAATAGA ATAACACCCC 18000
CCCACACACA CACACACACA CTCACACACA CACAGGAACC GCTGTAATTA AATCAGACAG 18060
TGGAGAGGAC TGGGCAGCTG CCCCCAAAGC CACAGCTGTT GGATGTTCCT TTTCCAAGGG 18120
CAAAAGGTCT AGGCACCTGA GAAGGGGAGA GATTGGCTTC TGTGAGTCAA GGCTCTGGTG 18180
GGCCTTGAGC CCTGGGATTG GGAAAAGGGG ATGGCGCAGA CTTTGTAAGC ATATGCTAGG 18240
TATCCGATAG TCCTGTAGAA TTTAGTGAAT AAACCTTATA CAGTTTTTAA TTTTTATATA 18300
AACTATAACT CAGACCCAAG CTACAGGTTG GAATTTTGGT TGTTGGTTTT TTCTTTTTTT 18360
TCTTTTTTTT TTTCTCTCTC TTTTTTTTTT TAAGTACCCC GCCTGTATAA TTGCATCAGA 18420
TTTCCCTCCT CCTCCCCCTC CATGTTTGTA TTTTGGGTTG GTTTACACTT GCACATATTC 18480
GATTTTCAGT TTTCCCCTTT ACGGTCTTCT CTCACCTCCA GGACCCTCCC CCTTTTTAAA 18540
AAATAAATCG CTGACAAAGT GTGAACCCCG TGAAGACTTT ATTTTGTGTT GTGTGTATCC 18600
TGTACAGCAA GGTTTGTCCT TCGTAACAAC GGATGAAATG ATTCCCTTTT TTAAAGCGCC 18660
CTCTCCCCCT CCACCCCAGC TCCCCTGTCC TTGGCATGTT TTATATCAGC GATCATTCTG 18720
AACTGTACAT AATTTATGTT GCGAGAGGCA AAGGGCAAGT TTTGGATTTT GCTTCTTCCA 18780
AGTTTGTTTT TAAACGACAA ATAAAAAAAG AACATTTTAA ATACACGCGG CTCCGTCCCG 18840
TCACCCTCGC CGTGAGGACC CGGGGGAGGG CGGGACCCGA GGCCAGGCCG GCG 18894
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity (%) E-value Score
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 85 0.0 2397
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 88 0.0 2347
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 87 0.0 2343
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 84 0.0 2338
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 87 0.0 2328
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 85 0.0 2310
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 87 0.0 2310
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 85 0.0 2308
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 83 0.0 2299
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 86 0.0 2279
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 86 0.0 2276
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 86 0.0 2258
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 86 0.0 2257
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 86 0.0 2254
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 85 0.0 2237
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 83 0.0 2189
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 81 0.0 2075
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 74 0.0 1923
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 86 0.0 1742
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 92 0.0 1683
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 92 0.0 1681
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 91 0.0 1676
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 91 0.0 1659
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 90 0.0 1657
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 86 0.0 1644
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 89 0.0 1597
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 89 0.0 1595
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 88 0.0 1550
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 87 0.0 1475
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 80 0.0 1337
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 89 0.0 1313
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 79 0.0 1274
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 66 0.0 1178
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 99 0.0 1171
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 99 0.0 1165
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 75 0.0 1152
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 93 0.0 1117
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 92 0.0 1110
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 89 0.0 1101
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 84 0.0 1029
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 86 0.0 1029
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 82 0.0 1021
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 82 0.0 1007
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 80 0.0 996
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 76 0.0 993
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 991
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 989
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 80 0.0 988
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 80 0.0 988
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 80 0.0 987
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 983
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 99 0.0 962
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 923
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 74 0.0 907
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 74 0.0 907
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 897
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 76 0.0 892
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 848
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 74 0.0 733
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 0.0 646
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 1e-172 607
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 100 4e-165 582
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 3e-158 559
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 6e-97 355
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 67 6e-95 349
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 2e-92 340
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 2e-45 184
Created Date 25-Jun-2016