WERAM Information


Tag Content
WERAM ID WERAM-Cap-0033
Ensembl Protein ID ENSCPOP00000002700.2
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSCPOG00000002973.3 ENSCPOT00000003012.2 ENSCPOP00000002700.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 1.70e-45 153.9 5125 5240
Me_Reader PHD 1.10e-26 92.7 171 4864
Organism Cavia porcellus
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
                            ++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++  +v+dat +g+ ar+inhsc+pNc+
  ENSCPOP00000002700.2 5125 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5209
                            7999*********************************************************..************************ PP

              SET1.txt   89 akvvavdgekkiviyakraIekgeeltydYk 119
                            a+vv++d+e ki+i+++r+I+kgeeltydY+
  ENSCPOP00000002700.2 5210 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5240
                            ******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + + s l+ + + +Cp++ e
ENSCPOP00000002700.2 171 RCSHCTRLGA----SIPCRSsgCPRLYHFPCATASGSFLSmKTLQLLCPEHSE 219
6999933333....599*9999*************888885557899**9975 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC+ + ge + m+ C +C + +H+ C++ +l+ + w Cp+Ck
ENSCPOP00000002700.2 228 RCAVCE--GPGELCnMFFCTSCGHHYHGACLDTALTARKRA-GWQCPECK 274
6****4..4444555*******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSCPOP00000002700.2 275 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 321
8****99999987.*************************9.*******8 PP
PHD.txt 3 iClvCgkddeg...ekemvq....CdeCddw 26
+C+vCg + + ++e+++ C C+++
ENSCPOP00000002700.2 322 VCRVCGAGSSElnpNSEWFEnyslCHRCHKA 352
6999963333222223366677778888876 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSCPOP00000002700.2 1103 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1152
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSCPOP00000002700.2 1154 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1200
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSCPOP00000002700.2 1231 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1282
7*****99999999*****************9933333444434599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSCPOP00000002700.2 4759 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4791
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C+ +H C ++ ++k +Cp +k
ENSCPOP00000002700.2 4818 KCSLCQRTGATS----SCNRmrCPSVYHFACAIRAKCMFFKDKTMLCPMHK 4864
599996666665....6*9999*********98886666677678888776 PP

Protein Sequence
(Fasta)
MDGQKQPGED KDSEPAADGP AASEDPGATE PDLPNSHIGE VSDLCHGSPR LQEPPRDGSG 60
GLVRRCALCN CGEPSLHGQR ELRRFELPFD WPRSPVVAPG GNAGPSEAVL PSEDPSQIGF 120
PEGLTPAHLG EPGGSCWAHH WCAAWSAGVW GQEGPELCGV DKAVFSGISQ RCSHCTRLGA 180
SIPCRSSGCP RLYHFPCATA SGSFLSMKTL QLLCPEHSEG AAHLEEARCA VCEGPGELCN 240
MFFCTSCGHH YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT 300
FCLKPPMEEL PAHSWKCKAC RVCRVCGAGS SELNPNSEWF ENYSLCHRCH KAQEGQPVSS 360
VAEQHPSVCS KFSPPESGDI PTDGPDALYV ACQGQPKSGH VTSMQPKELG PLQCEVKPLG 420
RAGAQLEPQS EAPLNEEMPL LPLPEESPLS PPPEESPTSP PPEASRLSPP PEESPASPPP 480
EASCLSLLPE ESPLSPPPEE SPLSPLPDSS PFSPLEESPL SPPEESPISP ALETPLSPPP 540
KTSPLSPSFE ESPLSPPPEE LPTSPPPEAS RISPPPEESP MSPPPEESPM SPPPEASRLF 600
PPFEESPLSP PPEESPLSPP PEASRLSPPP EDSPMSPPPE DSPMSPPPEI SCLSPLPEVS 660
HLSPPPEESP LSPPPLSPLG ELEYPFGGKG DSDPESPLAA PILETPISPP PEANCTDPEP 720
VPPMILPPSP SSPMGPASPI LMGPLPPQCS PLLQHSSPPP NSSPSHCSPP ALPLSVPSPL 780
SPMGKAVEVS EEAESHEMET EKGPEPECPA LEPSATSPLP SPVGDLSCPA PSPAPALDDF 840
SGLGEDIAPL DGTDAPGSQP EAGQTPGSSA SEFKGSPVLL DPEELAPVTP MEVYGPECKQ 900
AEQGSPCEEQ EEPSAPVVPI PPTLIKSDIV NEISNLSQGD ASASFPGSEP LLGSPDPEGG 960
GSLSMELGVS TDVSPVRDEG SLRLCTDSLP ETDDSLLCDA GTAISGGKAE GDKGRRRSSP 1020
ARSRIKQGRS SSFPGRRRPR GGAHGGRGRG RARLKSTTSS IETLVVADID SSPSKEEEEE 1080
DDDTMQNTVV LFSNTDKFVL MQDMCVVCGS FGRGAEGHLL ACSQCSQCYH PYCVNSKITK 1140
VMLLKGWRCV ECIVCEVCGQ ASDPSRLLLC DDCDISYHTY CLDPPLLTVP KGGWKCKWCV 1200
SCMQCGAASP GFHCEWQNSY THCGPCASLV TCPICHAPYV EEDLLIQCRH CERWMHAGCE 1260
SLFTEDDVEQ AADEGFDCVS CQPYVVKPVA PVAPPELVPM KVKEPEPQYF RFEGVWLTET 1320
GMAVLRNLTM SPLHKRRQRR GRLGLPGEVA LEGSEPSDAL GLDEKKDGEL DTDELLKGEG 1380
GVEHMECEIK LEGPASPDVE PGKEETEESK KRKRKPYRPG IGGFMVRQRK SHTRVKKGPA 1440
AQAEVLSGDG QPDEGETVMP TDLPAEGSGE QGLIDGDEKK KQQRRGRKKS KLEDMFPAYL 1500
QEAFFGKELL DLSRKALFAV GVGRPSFGLG TPKARGDGGS ERKELPVLQK GDDGPDVADE 1560
ESRGLEGMAD TPGPEDGGVK ASPVPSDSEK PGTPGEGMLS SDLDRIPTEE LPKMESKDLQ 1620
QLFKDVLGSE REQHLGCGTP GLDGSRTPLQ RPFVQGGLPL GSLPSNSPMD SYPGLCQSPF 1680
LDSRERGGFF SPEPGEPDSP WTGSGGTTPS TPTTPTTEGE GDGLSYNQRS LQRWEKDEEL 1740
GQLSTISPVL YANINFPNLK QDYPDWSSRC KQIMKLWRKV PATDKAPYLQ KAKDNRAAHR 1800
INKVQKQAES QINKQTKVGD VARKTDRPAL HLRIPPQPGA LGSPLPASAP TIFIGSPTTP 1860
AGLSTSADGF LKPPAGTVPG PDSPGELFLK LPPQVPAQVP SQDPFGLAPT YTLEPRFPAA 1920
PPTYPPYPSP TGAPAQPPML GASSRPGTGQ PGEFHSTPPG TPRHQPSTPD PFLKPRCPSL 1980
DNLAVPESPG VAGSKASEPL LSPSAFGETR KALEVKKEEL GASSPSYGPS NLGFVDSPSS 2040
GPHLGGLELK APDVFKAPLT PRASQVEPQS PGLGLRNQEP SPAQGLAASP PNHPDIFRPG 2100
PYPDPYAQPP LTPRPQPPPP LPESCCALPP RSLPSDPFSR VPASPQSQSS SQSPLTPRPL 2160
STEAFCPSPV TPRFQSPDPY NRPPSRPQSR DPFAPLHKPP RPQPPEVAFK AGPLAHTPLG 2220
AGGFPAALPS GPVGELHAKV PSGQPPNFAR SPGTGTFVGT SSPMRFTFPQ GVGEPSLKPP 2280
VPQPGLPPPH GINSHFGPAP TLGKPQSTNY AVTTGNFHPA GSPLGPSSGS TGEGYGLSPL 2340
RPTSVLPPPA PDGSLPFLSH GASQRTGITS PVEKREDPGA TMSSSLGAPE LPGTQDPGMS 2400
SLSQTELEKQ WQRQRLRELL IRQQIQRNTL RQEKETAAAA AGAVGHPGSW GTEPSSSTFE 2460
QLSRGQTPFP AAQDKSSLVG LPPSKLGGPV LGPGPGLFPT DDRLSRPPPP ATPSSMDVNS 2520
RQLVGGSQAF YQRVPYPGSL PLQQQQQQQQ QQQQAASATS MRLTMSTRFP STAGSELGRQ 2580
ALGSPLAGIP TRLPCPAEPV PGPSGPAQFI ELRHNVQKGL GPGGPSFPGQ GPPQRPRFFP 2640
VNEDTHRLAP EGLRGLAAPG LPPQKPPAPP APELNNSLHP TSHTKASALT AGLDLVTRPP 2700
STTELARPPP LALEAGKLHC EDPELDDDFD AHKALEDDEE LAHLGLGVDV AKGDDELGTL 2760
ENLETNDPHL DDLLNGDEFD LLAYTDPELD TGDKKDIFNE HLRLVESANE KAEREALLRG 2820
VEPGPLGPEE RPPPPTDVSE PRLASVLPEV KPKVEEGGRH PSPCQFTITT PKVEPGPASA 2880
SLGLGLKPGQ NVLGSRDTRM GTGPFSGSGH TAEKGSFGAT GGPPAHLLTP SPLSGTGGSS 2940
LLEKFELESG ALTLPSGHGA SGDELDKMES SLVASELPLL IEDLLEHEKK ELQKKQQLSA 3000
QLQPVQQQQQ QQPQPQQQHS LLSTPSSGQA MPLPHEGSSP NLAGPQQQLA LGLGGTRQPG 3060
LGQPLMPTQP PAHALQQRLA PTMAMVSNQG HMLSGQHGGQ AGLVPQQSPQ PVLAQKPMST 3120
MPPSMCMKPQ QLAMQQQLAN SFFPDTDLDK FAAEDIIDPI AKAKMVALKG IKKVMAQGSI 3180
GVAPGMNRQQ VSLLAQRLSG GSGGDLQNHV APGSSQERNA SDPSQPRPNP PTFAQGVINE 3240
ADQRQYEEWL FHTQQLLQMQ LKVLEEQIGV HRKSRKALCA KQRTAKKAGR EFPEADAEKL 3300
KLVTEQQSKI QKQLDQVRKQ QKEHTNLMAE YRNKQQQQQQ QQQQQQHSAV LALSPSQSPR 3360
LLTKLPGQLL PGHGLQPPQG PPGGQTGSLR LPPGGMALPG QPGGPFLNTA LAQQQQQQHS 3420
GGAGSLAGPS GGFFPGNLAL RSLGPDSRLL QERQLQLQQQ RMQLAQKLQQ QQQQQQQQQH 3480
HLGQVAVQQQ QQQGPSLQAN QALGPKPQGL VPPNSHQGLL VQQLSPQPPQ GPQGMLGPAQ 3540
VAVLQQHSGA LGPQGPHRQV LMTQSRVLSS PQLAQQGHGL MGHRLVTGQQ QQQSQQHQQQ 3600
GPMAGLSHLQ QGLLSHSGQA KLNAQPLGSL QQQQLQQQLQ QQQQQQQQLQ QQQQQQQQLQ 3660
QQQQLQQQQQ QQLHQQQLQQ LQQQQQQQLQ QQLLQQQQQQ QQQQQQQMCL LNQSRTLLSP 3720
QQQQQVTLGP GMPAKPLQHF SSPGTLGPTL LLTGKEQNTV ETALPSEVNE GPSTHQGGPL 3780
IIGPASESVA TESGEVKPSL SGDSQLLLVQ PQAQPQPNSL QLQPPVRLPG QPQQQVNLLH 3840
TAGVGSHGQL GSGSSSEGSS MPHLLTQPSV SLGEQPGPVT QNILGPQQPL GLERPVQNNA 3900
GPQPPKSGPV PQSGQGLSGV GITPTVGQLR VQLQGVLAKN PQLRHLSPQQ QQQLQALILQ 3960
RQLQQSQAVR QTPPYQEPGT QPSPLQGLLG CQPQPGGFPG TQTGPLQELG AGPRPQGPPR 4020
LSVPQGALST GPVVGPVHPT PPPSSPQEPK RPSSQLPSPS TQLTPTHPGT PKPQGPTLEL 4080
PPGRVSPAAA QLADTFFGKG LGPWDPSDNL VEAQKPEQCS LVPGHLEQVN GQVAPEPPQL 4140
SIKQEPREEP CALGAQAVKR EANGEPVGAS GTSNHLLLAG PRSEAGHLLL QKLLRAKNVQ 4200
LSAGRGPEGL RAEINGHIDS KLTGLEQKLQ GTPANKEDAA ARKPLTPKPK RVQKASDRLV 4260
SSRKKLRKED GVRANEALLK QLKQELSLLP LTEPTITANY SLFAPFGSSC PISGQSQLRG 4320
AFGSGALPSG PDYYSQLLTK NNLSNPPTPP SSLPPTPPPS VQQKMVNGVT PSEELGEHPK 4380
DPASAGDTEG TLRDASEVKS LDLLAALPTP PHNQTEDVRM ESDEDSDSPD SIVPASSPES 4440
ILGEEAPRFP QLGSGRGEQD DRALSPVIPI IPRASIPVFP DTKPYGVLDL EVPGKLPATA 4500
WEKGKGSEVS VMLTVSAAAA KNLNGVMVAV AELLRMKIPN SYEVLFPESP ARVGIEPKKG 4560
EAEGPGGKEK NISSKSSDSS PDWLKQFDAV LPGYTLKSQL DILSLLKQES PAPELPTQHS 4620
YTYNVSNLDV RQLSAPPPEE PSPPPSPLAP SPASPPADPL VELPVEPLAE PPVPSPLPLA 4680
SSPESTRPKP RARPPEEGED SRPPHLKKWK GVRWKRLRLL LTIQKGSGRQ EDEREVAEFM 4740
EQLGTALRPD KVPRDMRRCC FCHEEGDGAT DGPARLLNLD LDLWVHLNCA LWSTEVYETQ 4800
GGALMNVEVA LHRGLLTKCS LCQRTGATSS CNRMRCPSVY HFACAIRAKC MFFKDKTMLC 4860
PMHKIKGPCE QELSSFAVFR RVYIERDEVK QIASIIQRGE RLHMFRVGGL VFHAIGQLLP 4920
HQMADFHSAT ALYPVGYEAT RIYWSLRTNN RRCCYRCSIG ENNGRPEFVI KVMEQGLEDL 4980
VFTDASPQAV WNRIIEPVAA MRKEADMLRL FPEYLKGEEL FGLTVHAVLR IAESLPGVES 5040
CQNYLFRYGR HPLMELPLMI NPTGCARSEP KILTHYKRPH TLNSTSMSKA YQSTFTGETN 5100
TPYSKQFVHS KSSQYRRLRT EWKNNVYLAR SRIQGLGLYA AKDLEKHTMV IEYIGTIIRN 5160
EVANRREKIY EEQNRGIYMF RINNEHVIDA TLTGGPARYI NHSCAPNCVA EVVTFDKEDK 5220
IIIISSRRIP KGEELTYDYQ FDFEDDQHKI PCHCGAWNCR KWMN 5264
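
The protein sequence above is shown in blocks of 10 residues with a running residue count at the end of each line, so it cannot be pasted directly into most tools. The following is a minimal sketch, not part of WERAM itself, of how the blocked text might be flattened and how a region from the Classification table (here the SET1 hit, residues 5125 to 5240) might be cut out. The function names `parse_blocked_sequence` and `slice_region` are hypothetical, and the code assumes that a purely numeric final token on a line is the running count.

```python
def parse_blocked_sequence(lines):
    """Join grouped residue blocks; return (sequence, last running count seen)."""
    parts = []
    last_count = None
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue
        # Assumption: a purely numeric last token is the running residue count.
        if tokens[-1].isdigit():
            last_count = int(tokens.pop())
        parts.extend(tokens)
    return "".join(parts), last_count


def slice_region(sequence, start, end, offset=0):
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede `sequence` in the
    full-length protein, so a fragment can be sliced with full-length
    coordinates.
    """
    return sequence[start - 1 - offset:end - offset]


# Last three lines of the protein sequence in this record.
tail_lines = [
    "TPYSKQFVHS KSSQYRRLRT EWKNNVYLAR SRIQGLGLYA AKDLEKHTMV IEYIGTIIRN 5160",
    "EVANRREKIY EEQNRGIYMF RINNEHVIDA TLTGGPARYI NHSCAPNCVA EVVTFDKEDK 5220",
    "IIIISSRRIP KGEELTYDYQ FDFEDDQHKI PCHCGAWNCR KWMN 5264",
]
tail, total = parse_blocked_sequence(tail_lines)

# The fragment starts after the first (total - len(tail)) residues.
set_domain = slice_region(tail, 5125, 5240, offset=total - len(tail))
print(len(set_domain), set_domain[:10], set_domain[-10:])
```

Passing every sequence line of the record instead of `tail_lines` (with `offset=0`) should give the full-length protein, and the same slicing then applies to the PHD coordinates.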
Nucleotide Sequence
(Fasta)
ATGGACGGCC AGAAGCAGCC TGGTGAGGAT AAAGATTCAG AACCGGCAGC TGATGGACCT 60
GCAGCCTCTG AGGATCCAGG CGCTACTGAG CCAGACCTTC CCAATTCACA TATTGGGGAG 120
GTATCTGACC TCTGTCATGG GAGTCCTAGG CTTCAGGAGC CTCCCCGTGA CGGCAGTGGG 180
GGTCTGGTGC GGCGTTGTGC TCTCTGTAAC TGCGGAGAGC CCAGTCTGCA TGGGCAACGG 240
GAACTACGGC GCTTCGAGTT GCCATTTGAT TGGCCCCGGT CTCCAGTGGT GGCCCCTGGT 300
GGGAATGCAG GCCCCAGTGA AGCAGTGCTG CCCAGTGAGG ACCCATCACA GATCGGTTTC 360
CCTGAGGGCC TTACACCTGC CCATTTAGGA GAACCTGGAG GGTCCTGCTG GGCACATCAT 420
TGGTGTGCTG CGTGGTCAGC AGGTGTGTGG GGACAGGAGG GTCCAGAACT ATGTGGTGTG 480
GACAAGGCCG TCTTCTCAGG GATCTCACAG CGCTGCTCCC ACTGCACCAG GCTTGGTGCC 540
TCCATTCCTT GCCGCTCATC TGGATGTCCA CGGCTTTACC ATTTCCCCTG CGCAACGGCC 600
AGTGGTTCCT TCTTATCCAT GAAAACACTG CAGCTGCTAT GCCCAGAGCA CAGTGAGGGA 660
GCTGCGCATC TGGAGGAGGC TCGCTGTGCA GTATGTGAGG GGCCTGGGGA GTTGTGTAAC 720
ATGTTCTTCT GTACCAGCTG TGGGCATCAC TATCATGGGG CCTGCTTAGA CACTGCTCTG 780
ACTGCCCGCA AACGTGCTGG TTGGCAGTGC CCTGAATGCA AAGTGTGCCA AGCCTGCAGA 840
AAACCTGGGA ATGACTCTAA GATGCTGGTG TGTGAGACGT GTGACAAAGG GTACCATACC 900
TTCTGCTTAA AACCACCCAT GGAGGAACTG CCTGCTCATT CTTGGAAGTG TAAAGCTTGC 960
CGGGTGTGCC GGGTCTGTGG GGCAGGCTCA TCAGAGCTGA ATCCCAACTC TGAGTGGTTT 1020
GAGAACTACT CGCTCTGTCA CCGCTGCCAC AAAGCCCAGG AAGGTCAGCC TGTCAGTTCT 1080
GTTGCTGAGC AGCATCCCTC TGTCTGTAGC AAATTTTCAC CCCCAGAATC TGGTGATATC 1140
CCTACTGATG GGCCTGATGC TCTGTACGTT GCATGCCAAG GGCAGCCAAA GAGTGGGCAC 1200
GTGACCTCTA TGCAACCCAA GGAACTGGGG CCCCTGCAAT GTGAAGTCAA ACCACTAGGG 1260
AGAGCAGGGG CCCAACTTGA GCCTCAGTCG GAGGCCCCAC TAAATGAGGA GATGCCACTG 1320
CTACCCCTAC CTGAGGAGTC ACCCTTGTCC CCGCCACCTG AGGAATCACC CACATCCCCG 1380
CCACCTGAGG CCTCACGCCT ATCCCCACCA CCTGAGGAGT CACCTGCATC CCCACCGCCT 1440
GAGGCGTCAT GCTTGTCCCT GCTGCCTGAA GAGTCACCCC TCTCTCCACC ACCTGAGGAA 1500
TCTCCTCTTT CTCCCCTGCC TGACTCATCA CCTTTTTCTC CACTGGAGGA GTCGCCCCTC 1560
TCTCCCCCAG AGGAGTCACC TATATCTCCT GCACTTGAGA CACCCCTGTC CCCACCACCA 1620
AAAACATCAC CCCTATCTCC ATCATTTGAA GAGTCGCCCC TGTCTCCTCC ACCTGAGGAA 1680
TTGCCCACTT CTCCACCACC TGAAGCATCT CGCATATCTC CACCACCCGA GGAGTCACCC 1740
ATGTCCCCTC CACCTGAAGA ATCGCCTATG TCTCCACCAC CTGAGGCATC TCGTCTGTTC 1800
CCACCATTTG AAGAGTCTCC TCTGTCCCCT CCACCTGAGG AGTCTCCCCT CTCCCCACCT 1860
CCTGAGGCAT CACGCCTGTC CCCACCTCCT GAGGACTCGC CCATGTCCCC ACCGCCTGAA 1920
GACTCTCCTA TGTCTCCCCC ACCTGAGATA TCATGCTTGT CTCCCCTGCC TGAAGTGTCA 1980
CACCTGTCTC CACCCCCTGA GGAATCTCCC CTATCTCCAC CCCCTCTGTC TCCTTTGGGA 2040
GAGTTAGAGT ACCCGTTTGG TGGCAAAGGG GACAGTGACC CTGAGTCACC ATTGGCTGCC 2100
CCCATCCTAG AGACACCCAT CAGCCCTCCT CCAGAAGCTA ACTGCACTGA CCCTGAGCCT 2160
GTGCCTCCAA TGATCCTTCC CCCATCTCCA AGTTCCCCAA TGGGGCCTGC TTCTCCCATC 2220
CTGATGGGAC CCCTTCCTCC CCAATGTTCT CCACTCCTTC AGCATTCCTC ACCGCCCCCA 2280
AACTCCTCTC CTTCCCACTG TTCCCCTCCT GCTTTGCCAC TGTCAGTCCC CTCTCCATTG 2340
AGTCCCATGG GGAAGGCAGT GGAGGTCTCA GAAGAGGCTG AGTCACATGA GATGGAGACT 2400
GAGAAAGGCC CAGAACCTGA ATGCCCAGCC TTGGAACCTA GTGCCACCAG CCCTCTCCCC 2460
TCCCCTGTGG GGGATCTGTC CTGTCCTGCC CCTAGCCCTG CCCCAGCCCT GGATGACTTC 2520
TCTGGCTTGG GGGAGGACAT AGCCCCTCTT GATGGGACTG ATGCTCCTGG TTCACAGCCA 2580
GAGGCTGGAC AAACTCCTGG CAGTTCAGCT AGTGAATTTA AAGGTTCCCC TGTGCTCCTT 2640
GACCCAGAGG AGCTGGCCCC TGTGACTCCT ATGGAGGTCT ATGGCCCGGA ATGCAAGCAG 2700
GCAGAGCAGG GCTCACCTTG TGAAGAACAG GAGGAGCCCA GTGCACCAGT GGTCCCCATA 2760
CCACCCACAC TCATCAAATC CGACATTGTA AATGAGATCT CTAATCTGAG CCAGGGCGAT 2820
GCCAGTGCCA GTTTTCCTGG TTCAGAGCCC CTACTGGGCT CTCCTGACCC TGAGGGAGGC 2880
GGCTCTCTGT CCATGGAGTT GGGAGTATCT ACAGATGTGA GTCCAGTTCG AGATGAGGGC 2940
TCCCTGAGGC TCTGTACCGA CTCACTGCCA GAGACTGATG ACTCACTACT GTGTGATGCT 3000
GGGACAGCTA TCAGCGGAGG CAAAGCTGAG GGGGACAAGG GGCGAAGGCG CAGCTCCCCA 3060
GCCCGTTCCC GAATCAAACA GGGTCGCAGC AGTAGTTTCC CAGGAAGACG TCGGCCTCGT 3120
GGAGGAGCAC ATGGAGGACG TGGGAGGGGA CGGGCCCGGC TAAAATCAAC TACTTCTTCC 3180
ATTGAGACTC TGGTAGTGGC TGATATTGAT AGTTCTCCCA GTAAGGAAGA GGAAGAAGAA 3240
GATGATGACA CCATGCAAAA TACTGTGGTT CTCTTCTCCA ACACAGACAA ATTCGTTCTA 3300
ATGCAGGACA TGTGTGTGGT ATGTGGCAGC TTTGGTCGGG GAGCAGAGGG CCATCTCCTG 3360
GCTTGTTCAC AGTGCTCTCA GTGCTATCAT CCTTACTGTG TCAACAGCAA GATCACCAAA 3420
GTGATGCTGC TGAAGGGCTG GCGTTGTGTG GAGTGTATCG TGTGTGAGGT ATGCGGTCAG 3480
GCCTCCGACC CTTCACGCCT GCTTCTCTGT GATGACTGTG ATATTAGCTA TCACACATAC 3540
TGCCTGGATC CCCCACTACT CACCGTGCCC AAGGGTGGCT GGAAGTGCAA GTGGTGTGTG 3600
TCTTGTATGC AGTGTGGGGC TGCCTCCCCT GGCTTCCATT GTGAGTGGCA GAATAGTTAC 3660
ACCCACTGTG GGCCCTGTGC TAGCCTGGTG ACCTGTCCTA TCTGTCACGC TCCATATGTG 3720
GAAGAGGACC TGCTGATCCA GTGCCGCCAC TGTGAAAGAT GGATGCATGC TGGCTGTGAG 3780
AGCCTCTTTA CCGAGGATGA TGTGGAGCAG GCAGCGGATG AAGGCTTTGA CTGTGTTTCC 3840
TGCCAGCCTT ATGTGGTAAA GCCTGTGGCA CCAGTTGCCC CTCCAGAATT GGTGCCAATG 3900
AAGGTGAAAG AACCAGAGCC CCAGTATTTT CGCTTTGAGG GTGTATGGCT GACAGAAACT 3960
GGCATGGCTG TGCTGCGTAA CCTGACCATG TCACCACTGC ACAAACGGCG CCAGCGGCGA 4020
GGACGGTTGG GCCTCCCAGG CGAGGTGGCA CTGGAGGGTT CTGAGCCCTC AGATGCCCTT 4080
GGCCTTGATG AAAAGAAGGA TGGGGAGCTG GACACTGATG AGCTGCTCAA AGGAGAAGGT 4140
GGAGTGGAGC ACATGGAGTG TGAAATTAAA TTGGAGGGCC CTGCCAGCCC TGATGTGGAA 4200
CCTGGCAAAG AAGAGACTGA GGAAAGCAAA AAACGCAAGC GCAAACCTTA CCGGCCTGGT 4260
ATTGGTGGTT TCATGGTGCG ACAGCGGAAA TCCCATACAC GTGTGAAAAA GGGGCCTGCT 4320
GCACAGGCGG AGGTGTTGAG TGGGGATGGG CAGCCCGACG AGGGTGAGAC GGTGATGCCT 4380
ACTGACCTGC CTGCAGAGGG CTCCGGGGAG CAGGGCCTCA TAGATGGGGA TGAGAAGAAG 4440
AAGCAACAGC GGCGAGGGCG CAAGAAGAGC AAACTAGAGG ACATGTTCCC TGCTTACCTG 4500
CAGGAAGCCT TCTTTGGGAA GGAGCTGCTG GACCTAAGCC GGAAGGCCCT TTTTGCAGTT 4560
GGGGTGGGCC GACCAAGCTT TGGATTAGGA ACCCCAAAAG CCCGAGGAGA TGGAGGCTCA 4620
GAAAGGAAGG AGCTCCCTGT CTTGCAGAAA GGAGATGATG GTCCAGATGT TGCAGATGAA 4680
GAATCCCGTG GCCTGGAGGG CATGGCTGAT ACACCAGGAC CTGAGGATGG GGGTGTAAAG 4740
GCATCCCCAG TGCCCAGTGA CTCCGAGAAG CCAGGCACCC CAGGTGAAGG GATGCTTAGC 4800
TCTGACTTAG ACAGGATTCC CACAGAAGAA CTGCCCAAAA TGGAATCCAA GGATCTACAG 4860
CAGCTCTTCA AGGATGTTCT GGGTTCTGAA CGAGAGCAGC ATCTGGGCTG TGGAACCCCT 4920
GGCCTAGATG GCAGCCGCAC ACCGCTGCAG AGGCCCTTTG TCCAAGGTGG ACTCCCTTTG 4980
GGTAGTCTGC CTTCCAACAG CCCAATGGAC TCCTACCCGG GCCTCTGCCA GTCTCCGTTC 5040
TTGGATTCTA GGGAGCGCGG GGGCTTCTTT AGCCCAGAAC CCGGTGAGCC AGACAGCCCC 5100
TGGACAGGAT CAGGGGGCAC CACGCCCTCC ACACCCACTA CCCCCACCAC GGAGGGTGAG 5160
GGCGACGGGC TGTCCTATAA CCAGCGGAGC CTGCAGCGCT GGGAGAAGGA TGAGGAGTTG 5220
GGCCAGCTCT CTACCATCTC ACCTGTACTC TATGCCAACA TCAACTTTCC TAATCTCAAG 5280
CAAGATTACC CAGACTGGTC TAGTCGTTGT AAACAAATCA TGAAGCTCTG GAGGAAGGTT 5340
CCAGCTACTG ACAAAGCCCC CTACCTGCAA AAGGCCAAAG ATAACCGGGC AGCTCACCGC 5400
ATCAACAAGG TGCAGAAGCA GGCTGAGAGC CAGATCAACA AGCAGACCAA GGTGGGCGAT 5460
GTAGCCCGTA AGACTGATCG ACCGGCCCTC CATCTCCGCA TTCCCCCACA GCCAGGGGCA 5520
CTGGGCAGTC CGCTCCCTGC TTCGGCCCCC ACCATTTTCA TTGGCAGCCC CACCACCCCC 5580
GCCGGCTTGT CTACCTCTGC GGATGGGTTC CTGAAGCCAC CAGCGGGCAC AGTGCCCGGC 5640
CCTGACTCCC CTGGTGAGCT CTTCCTCAAG CTCCCACCCC AGGTGCCCGC CCAAGTGCCT 5700
TCACAGGACC CCTTTGGACT GGCCCCCACC TATACCCTGG AGCCCCGCTT CCCTGCAGCA 5760
CCACCCACCT ACCCTCCCTA TCCCAGTCCA ACTGGGGCTC CTGCACAGCC CCCAATGCTG 5820
GGTGCCTCAT CTCGTCCTGG GACTGGCCAG CCGGGCGAAT TCCACTCCAC TCCTCCTGGC 5880
ACCCCCCGAC ACCAGCCCTC CACGCCTGAC CCCTTCCTCA AGCCCCGCTG CCCTTCCCTG 5940
GACAACCTGG CTGTGCCTGA GAGCCCAGGA GTGGCAGGGA GCAAGGCTTC TGAGCCCCTG 6000
CTCTCACCAT CTGCTTTCGG GGAGACCCGG AAGGCCTTGG AGGTGAAGAA GGAGGAGCTT 6060
GGGGCATCTT CTCCTAGCTA TGGGCCCTCA AACCTAGGTT TTGTGGACTC ACCCTCCTCA 6120
GGCCCCCACC TGGGTGGCCT GGAGTTAAAG GCACCTGATG TCTTCAAAGC CCCTTTGACC 6180
CCTCGGGCAT CTCAGGTAGA GCCCCAAAGC CCAGGCCTGG GTCTTCGGAA CCAGGAACCA 6240
TCCCCTGCCC AGGGTTTGGC GGCTTCTCCT CCCAACCACC CAGATATCTT TCGCCCAGGC 6300
CCTTACCCTG ACCCCTATGC CCAACCCCCA CTGACCCCTC GGCCCCAACC CCCACCCCCG 6360
CTCCCTGAGA GCTGCTGTGC TCTGCCCCCT CGTTCACTGC CTTCCGACCC TTTCTCCCGA 6420
GTTCCAGCCA GTCCTCAGTC CCAGTCAAGC TCCCAGTCCC CTCTGACCCC CCGTCCTTTG 6480
TCAACTGAGG CTTTCTGCCC ATCCCCTGTT ACTCCTCGCT TCCAGTCCCC TGACCCCTAT 6540
AATCGCCCAC CCTCACGCCC TCAGTCCCGA GACCCATTTG CTCCATTGCA TAAGCCACCC 6600
CGACCCCAGC CCCCTGAAGT TGCCTTTAAG GCTGGGCCTC TAGCCCACAC TCCACTGGGA 6660
GCTGGGGGCT TCCCAGCAGC CCTGCCTTCA GGGCCAGTGG GTGAGCTCCA TGCCAAGGTC 6720
CCAAGTGGAC AGCCCCCCAA TTTTGCCCGC TCCCCTGGGA CTGGTACATT TGTGGGCACC 6780
TCCTCTCCCA TGCGTTTTAC TTTTCCCCAG GGTGTAGGAG AACCTTCTCT AAAGCCCCCT 6840
GTCCCTCAGC CTGGGCTTCC TCCACCCCAT GGGATCAACA GCCATTTTGG GCCCGCGCCC 6900
ACTTTGGGCA AGCCTCAAAG CACAAACTAC GCAGTAACTA CCGGGAACTT CCACCCAGCG 6960
GGCAGCCCCT TGGGGCCCAG CAGTGGGTCC ACAGGGGAGG GCTATGGGCT GTCCCCACTA 7020
CGCCCCACAT CTGTTCTGCC ACCGCCTGCA CCCGATGGAT CCCTCCCCTT CCTGTCCCAT 7080
GGAGCCTCAC AGAGGACAGG CATCACCTCT CCAGTTGAAA AGCGAGAAGA TCCAGGGGCT 7140
ACAATGAGTA GCTCCTTGGG GGCACCTGAA CTCCCAGGTA CCCAAGACCC AGGCATGTCC 7200
AGTCTCAGTC AGACAGAGCT GGAAAAGCAA TGGCAGCGCC AGCGACTCCG GGAGCTACTG 7260
ATTCGGCAGC AGATCCAGCG CAACACCTTG CGGCAGGAGA AGGAGACAGC TGCAGCAGCT 7320
GCAGGTGCAG TAGGGCACCC AGGAAGCTGG GGTACAGAAC CCAGCAGCTC TACCTTTGAA 7380
CAGCTGAGTA GAGGCCAGAC TCCATTTCCT GCGGCACAGG ACAAGAGCAG CCTGGTGGGA 7440
TTACCCCCAA GCAAGCTAGG TGGCCCTGTC TTGGGACCGG GACCGGGGCT CTTCCCTACT 7500
GATGACCGAC TCTCCCGGCC ACCTCCACCT GCCACACCTT CCTCTATGGA TGTGAACAGC 7560
CGGCAACTGG TAGGAGGCTC CCAGGCCTTC TATCAACGAG TGCCCTATCC TGGATCCCTG 7620
CCCTTACAGC AGCAGCAACA GCAGCAACAG CAGCAGCAGC AAGCAGCGTC AGCAACTTCC 7680
ATGCGACTCA CCATGTCTAC TCGTTTTCCT TCAACTGCTG GATCTGAACT TGGCCGCCAA 7740
GCACTAGGTT CCCCTTTGGC AGGAATTCCC ACCCGCCTGC CATGCCCTGC TGAGCCAGTG 7800
CCTGGTCCAT CGGGGCCTGC CCAGTTCATT GAGTTACGGC ACAACGTTCA GAAAGGACTG 7860
GGACCTGGGG GGCCTTCATT TCCTGGTCAG GGCCCCCCTC AGAGGCCCCG TTTTTTCCCT 7920
GTAAATGAAG ATACCCACCG ACTGGCTCCT GAAGGGCTTC GAGGCCTGGC AGCGCCAGGC 7980
CTTCCTCCAC AGAAACCCCC AGCCCCACCT GCTCCTGAAT TGAACAATAG TCTTCATCCT 8040
ACATCCCACA CCAAGGCTTC TGCACTGACA GCTGGCTTAG ATCTGGTCAC CCGGCCGCCC 8100
TCTACCACTG AGCTTGCTCG CCCTCCTCCC CTGGCCCTGG AAGCTGGAAA GTTACACTGT 8160
GAGGATCCTG AGCTAGATGA TGATTTTGAT GCCCACAAGG CCCTAGAAGA TGATGAAGAG 8220
CTTGCTCATC TAGGTCTGGG TGTGGATGTT GCCAAGGGTG ATGATGAGCT GGGCACCCTG 8280
GAAAACCTGG AGACCAATGA CCCTCACTTA GATGACTTGC TCAATGGAGA TGAGTTTGAC 8340
CTACTGGCGT ATACTGACCC TGAGCTGGAC ACTGGGGACA AGAAAGACAT CTTCAATGAG 8400
CACCTAAGGC TGGTGGAGTC AGCAAATGAG AAGGCTGAGC GGGAGGCCCT GCTTCGGGGA 8460
GTAGAGCCAG GACCCTTGGG CCCTGAGGAG CGCCCTCCTC CGCCTACTGA TGTCTCTGAG 8520
CCCCGCCTGG CATCTGTGCT CCCTGAGGTG AAGCCTAAGG TGGAGGAGGG TGGGCGCCAC 8580
CCTTCCCCTT GCCAATTCAC CATTACCACC CCCAAGGTAG AGCCAGGACC TGCCTCTGCT 8640
TCCCTTGGCC TGGGGCTAAA ACCAGGACAG AATGTGCTGG GCAGCCGGGA CACCCGGATG 8700
GGCACAGGGC CATTTTCTGG CAGTGGACAC ACAGCTGAGA AGGGCTCTTT TGGGGCTACA 8760
GGGGGACCAC CAGCTCACCT GCTTACCCCC AGCCCGCTGA GTGGCACAGG AGGATCCTCC 8820
CTGCTAGAGA AGTTTGAGCT AGAGAGTGGA GCCTTGACCT TGCCTAGTGG ACATGGAGCA 8880
TCTGGGGATG AACTGGACAA GATGGAAAGC TCTCTAGTAG CCAGTGAGTT GCCCCTGCTT 8940
ATTGAGGACC TGTTGGAACA TGAAAAGAAG GAGCTACAGA AGAAACAGCA GCTTTCAGCC 9000
CAGCTACAGC CTGTCCAACA GCAACAGCAG CAGCAGCCAC AGCCGCAACA GCAACATTCC 9060
TTACTATCCA CACCAAGCTC TGGCCAGGCA ATGCCTTTAC CACATGAGGG CTCTTCTCCC 9120
AATTTGGCTG GGCCTCAGCA GCAACTTGCC CTGGGTCTTG GAGGAACCCG ACAGCCAGGT 9180
TTGGGCCAAC CATTGATGCC CACCCAGCCA CCAGCTCATG CCCTCCAGCA GCGCCTGGCC 9240
CCAACCATGG CCATGGTGTC CAATCAAGGG CATATGCTAA GTGGACAGCA TGGGGGGCAG 9300
GCAGGCTTGG TGCCGCAGCA GAGTCCACAG CCAGTACTAG CACAGAAACC CATGAGCACC 9360
ATGCCACCTT CCATGTGTAT GAAGCCCCAG CAACTGGCAA TGCAGCAACA GCTGGCCAAC 9420
AGCTTCTTTC CAGATACAGA CCTGGACAAA TTTGCTGCAG AAGATATCAT TGATCCCATT 9480
GCAAAGGCCA AGATGGTAGC TTTGAAAGGT ATCAAGAAAG TGATGGCTCA GGGCAGCATT 9540
GGGGTTGCAC CTGGTATGAA CAGGCAGCAA GTGTCACTGC TAGCCCAGAG GCTCTCTGGG 9600
GGATCTGGGG GTGATCTACA GAACCATGTA GCTCCTGGGA GCAGCCAGGA GCGGAATGCC 9660
AGTGACCCCT CTCAGCCTCG CCCCAACCCA CCCACTTTTG CCCAGGGAGT TATCAATGAA 9720
GCAGACCAGC GGCAATATGA GGAGTGGCTC TTCCACACTC AGCAGCTCCT GCAGATGCAG 9780
TTGAAGGTGC TAGAGGAGCA GATTGGTGTG CATCGCAAGT CCCGGAAGGC TCTGTGTGCC 9840
AAGCAGCGCA CTGCCAAAAA GGCAGGCCGT GAGTTCCCAG AAGCTGATGC TGAGAAGCTC 9900
AAGCTGGTTA CAGAGCAGCA GAGCAAGATC CAGAAACAGT TGGATCAGGT TCGGAAACAG 9960
CAGAAGGAGC ATACTAATCT CATGGCAGAG TATCGAAACA AGCAGCAGCA GCAGCAGCAA 10020
CAGCAGCAGC AACAGCAACA CTCGGCTGTA TTGGCCCTCA GCCCTTCTCA GAGTCCCCGA 10080
CTACTCACCA AGCTCCCTGG GCAGCTGCTC CCTGGCCATG GGCTGCAGCC ACCACAGGGC 10140
CCCCCGGGTG GGCAGACTGG CAGTCTTCGC CTGCCTCCTG GGGGTATGGC ACTACCTGGA 10200
CAGCCTGGTG GCCCTTTTCT TAACACAGCC CTGGCCCAGC AACAGCAACA GCAGCATTCT 10260
GGTGGGGCTG GATCCTTGGC TGGCCCCTCA GGCGGCTTCT TCCCTGGCAA CCTTGCTCTC 10320
CGAAGCCTTG GACCTGATTC AAGGCTTTTA CAGGAAAGGC AGCTGCAGCT GCAGCAGCAA 10380
CGCATGCAGT TGGCCCAGAA ATTACAGCAG CAGCAGCAGC AGCAGCAGCA GCAACAGCAT 10440
CACCTAGGAC AGGTGGCAGT CCAGCAGCAA CAGCAACAAG GTCCCAGTTT ACAGGCAAAT 10500
CAGGCTCTGG GCCCCAAACC CCAGGGGCTT GTGCCTCCCA ACAGCCACCA AGGTCTTCTG 10560
GTCCAGCAAC TGTCCCCTCA GCCACCCCAG GGACCCCAGG GCATGCTGGG TCCTGCCCAG 10620
GTGGCAGTGT TGCAGCAGCA CTCTGGAGCT TTGGGGCCCC AGGGCCCTCA CAGACAGGTG 10680
CTTATGACCC AGTCCCGGGT GCTGAGTTCC CCCCAGCTGG CACAGCAGGG TCATGGCCTT 10740
ATGGGACATC GACTGGTCAC AGGCCAGCAA CAACAGCAGT CACAACAGCA CCAACAACAG 10800
GGACCCATGG CAGGGCTTTC CCATCTTCAG CAGGGCCTGT TATCACACAG TGGGCAGGCC 10860
AAACTGAATG CTCAGCCTCT GGGCTCATTA CAGCAGCAAC AGCTCCAGCA GCAGCTCCAG 10920
CAGCAGCAGC AGCAACAGCA GCAGCTCCAG CAGCAACAGC AGCAGCAGCA GCAGCTGCAA 10980
CAGCAGCAGC AGCTGCAACA GCAACAGCAG CAGCAGCTTC ATCAACAGCA GCTGCAGCAA 11040
TTACAGCAGC AGCAGCAACA GCAGCTACAA CAGCAGCTTC TACAGCAGCA GCAACAACAG 11100
CAACAGCAAC AGCAGCAGCA GATGTGCCTC TTGAACCAGA GTCGAACTTT GTTATCTCCT 11160
CAGCAACAAC AGCAGGTGAC ACTTGGCCCT GGCATGCCAG CCAAGCCTCT TCAACACTTT 11220
TCTAGCCCCG GAACCCTGGG CCCAACCCTC CTTCTGACGG GCAAGGAACA AAACACTGTA 11280
GAGACCGCTC TTCCTTCAGA GGTCAATGAG GGTCCCTCAA CACATCAGGG AGGGCCGTTA 11340
ATAATAGGGC CTGCATCTGA GTCAGTGGCT ACGGAATCAG GGGAGGTAAA ACCCTCCCTC 11400
TCTGGGGACT CACAACTTCT GCTTGTCCAA CCCCAGGCCC AGCCTCAACC CAACTCTTTA 11460
CAGCTGCAGC CACCTGTAAG ACTCCCAGGA CAACCACAGC AGCAAGTTAA TTTGCTCCAT 11520
ACAGCAGGTG TAGGAAGCCA TGGGCAACTA GGCAGCGGAT CATCTTCTGA GGGCTCATCT 11580
ATGCCCCACC TGCTGACCCA ACCCTCTGTT TCTTTAGGGG AGCAGCCTGG GCCCGTGACC 11640
CAGAACATTC TGGGCCCCCA ACAGCCCCTT GGGCTAGAGC GACCCGTGCA GAATAATGCA 11700
GGGCCACAAC CTCCCAAATC AGGACCTGTC CCCCAGTCTG GGCAAGGTCT GTCTGGGGTT 11760
GGAATCACGC CTACAGTGGG TCAGCTTCGA GTGCAGCTCC AAGGAGTTCT GGCCAAAAAC 11820
CCACAGCTGC GGCACTTGAG TCCTCAGCAG CAGCAGCAGC TACAGGCACT CATTTTGCAG 11880
CGACAGCTGC AGCAGAGTCA GGCAGTACGC CAGACTCCAC CCTACCAGGA GCCAGGGACT 11940
CAGCCCTCTC CCTTGCAGGG CCTCCTGGGC TGCCAGCCTC AACCTGGGGG CTTTCCTGGA 12000
ACCCAGACAG GCCCTCTCCA GGAGCTAGGG GCAGGGCCTC GACCTCAGGG CCCACCCCGG 12060
CTCTCTGTCC CACAAGGAGC CTTATCCACA GGACCAGTTG TTGGCCCTGT CCATCCCACA 12120
CCTCCGCCAT CCAGCCCCCA AGAGCCAAAG AGACCTTCTT CACAATTACC TTCCCCCAGT 12180
ACCCAGCTCA CCCCCACCCA TCCAGGCACC CCAAAGCCCC AGGGGCCAAC CTTGGAGCTG 12240
CCTCCTGGGA GGGTCTCACC TGCTGCTGCC CAGCTTGCGG ATACCTTCTT TGGCAAGGGG 12300
CTGGGACCTT GGGACCCCTC AGACAACCTA GTAGAAGCCC AGAAGCCAGA GCAGTGCAGC 12360
CTGGTGCCTG GGCATCTGGA ACAGGTGAAT GGACAGGTGG CACCTGAACC ACCCCAACTC 12420
AGCATCAAGC AGGAGCCTCG GGAAGAGCCA TGTGCCCTGG GAGCCCAGGC GGTGAAGAGG 12480
GAGGCCAATG GGGAACCAGT AGGGGCATCA GGTACCAGCA ATCACCTCCT GCTGGCAGGC 12540
CCCCGCTCAG AGGCTGGGCA TCTGCTCTTG CAGAAGCTTC TACGTGCAAA GAATGTGCAA 12600
CTCAGCGCTG GGCGGGGGCC TGAGGGGCTG CGAGCTGAGA TCAACGGGCA CATTGACAGC 12660
AAGCTGACTG GATTGGAGCA GAAACTACAG GGTACCCCCG CCAACAAAGA AGATGCAGCA 12720
GCAAGGAAGC CTTTGACACC AAAGCCCAAG CGGGTACAGA AGGCAAGCGA CAGGTTGGTG 12780
AGCTCCCGAA AGAAGCTGCG GAAGGAGGAC GGGGTCAGGG CCAACGAGGC CTTGCTGAAA 12840
CAGCTGAAAC AGGAGCTGTC CCTGCTGCCC TTGACGGAGC CTACCATCAC CGCCAACTAT 12900
AGCCTCTTTG CTCCTTTTGG TAGCAGCTGC CCAATCAGTG GGCAGAGCCA GCTGAGAGGG 12960
GCCTTTGGAA GTGGGGCACT GCCCAGTGGG CCTGACTACT ATTCCCAGCT GCTTACCAAG 13020
AATAACCTGA GTAACCCGCC GACACCACCC TCGTCGCTGC CCCCCACCCC ACCCCCATCG 13080
GTGCAGCAGA AGATGGTAAA TGGCGTCACT CCTTCCGAAG AGCTGGGGGA GCACCCCAAG 13140
GATCCTGCCT CTGCTGGGGA CACTGAAGGG ACACTGCGGG ATGCTTCAGA AGTGAAGAGT 13200
CTAGACCTAC TGGCTGCGTT GCCCACACCT CCTCACAATC AAACTGAGGA TGTCAGGATG 13260
GAGAGTGATG AGGACAGTGA TTCTCCTGAC AGTATTGTAC CAGCTTCATC CCCCGAGAGC 13320
ATCCTGGGAG AGGAGGCCCC CCGTTTCCCT CAGCTGGGTT CAGGCCGAGG GGAGCAGGAC 13380
GACCGGGCCC TCTCCCCTGT CATCCCTATC ATTCCTCGGG CCAGCATCCC AGTCTTCCCA 13440
GATACCAAAC CATATGGGGT CCTGGACCTG GAGGTCCCTG GAAAGCTGCC TGCTACAGCT 13500
TGGGAAAAGG GCAAAGGGAG TGAGGTGTCA GTAATGCTGA CAGTCTCTGC TGCTGCAGCC 13560
AAGAACCTGA ATGGTGTGAT GGTGGCAGTG GCAGAACTAT TACGCATGAA GATCCCCAAC 13620
TCCTATGAGG TGCTGTTCCC AGAAAGCCCT GCCCGTGTAG GCATTGAGCC TAAGAAGGGG 13680
GAAGCTGAGG GCCCTGGTGG GAAAGAAAAG AATATAAGCA GCAAGAGCTC AGACTCTAGC 13740
CCTGATTGGC TGAAGCAGTT TGATGCAGTG TTGCCTGGTT ATACTCTCAA GAGCCAGCTA 13800
GACATCTTGA GCCTTCTGAA ACAGGAGAGC CCCGCCCCAG AGCTGCCCAC CCAGCACAGC 13860
TATACCTACA ACGTCTCCAA TTTGGATGTG CGACAGCTCT CAGCCCCACC TCCGGAAGAA 13920
CCCTCCCCAC CCCCATCCCC CTTGGCACCC TCTCCTGCCA GCCCTCCTGC CGATCCCCTA 13980
GTTGAACTTC CTGTTGAACC CTTAGCTGAG CCACCAGTGC CCTCACCCCT GCCATTGGCC 14040
TCATCTCCTG AATCCACTCG TCCCAAGCCC CGTGCCCGGC CCCCTGAAGA AGGTGAAGAT 14100
TCCCGACCTC CTCACCTCAA AAAGTGGAAG GGGGTACGCT GGAAACGGCT ACGGCTGCTG 14160
CTGACCATCC AGAAGGGTAG TGGGCGGCAG GAAGATGAGC GGGAAGTAGC AGAATTTATG 14220
GAGCAGCTTG GCACAGCCTT GCGACCTGAC AAGGTGCCTC GAGATATGCG ACGCTGCTGC 14280
TTCTGTCACG AGGAGGGGGA TGGGGCCACT GATGGGCCTG CCCGCCTGTT GAACCTGGAC 14340
CTGGACCTGT GGGTACACCT CAACTGTGCC CTGTGGTCCA CAGAGGTGTA TGAGACCCAG 14400
GGCGGGGCAC TGATGAATGT GGAGGTTGCC CTGCACCGAG GACTGCTAAC CAAGTGCTCC 14460
TTGTGCCAGC GAACCGGTGC CACCAGCAGC TGCAATCGTA TGCGTTGCCC CAGTGTCTAC 14520
CATTTTGCCT GTGCCATCCG CGCTAAGTGC ATGTTCTTCA AGGATAAGAC CATGCTATGT 14580
CCAATGCATA AGATCAAGGG GCCCTGTGAG CAGGAGCTGA GTTCGTTTGC TGTCTTCCGA 14640
CGGGTCTACA TTGAGAGAGA TGAAGTAAAG CAAATCGCCA GCATCATCCA GCGGGGAGAA 14700
CGGCTGCACA TGTTTCGTGT AGGGGGCCTT GTGTTCCATG CCATTGGACA GCTGCTTCCT 14760
CACCAGATGG CTGACTTCCA CAGTGCCACT GCCCTCTATC CTGTGGGCTA TGAGGCCACA 14820
CGCATCTACT GGAGTCTCCG TACCAACAAC CGCCGCTGCT GCTACCGCTG CTCCATTGGA 14880
GAGAACAATG GGCGGCCGGA GTTTGTCATC AAAGTCATGG AGCAGGGCTT GGAGGACCTG 14940
GTTTTCACTG ATGCCTCTCC ACAGGCTGTG TGGAATCGCA TCATTGAGCC TGTGGCTGCC 15000
ATGAGAAAAG AGGCCGATAT GCTGCGACTC TTCCCTGAGT ACCTGAAGGG TGAGGAGCTC 15060
TTCGGCCTGA CGGTGCACGC TGTGCTTCGC ATAGCTGAAT CACTGCCTGG GGTGGAGAGC 15120
TGTCAAAACT ATTTATTCCG CTATGGGCGT CACCCCCTGA TGGAACTGCC ACTCATGATC 15180
AACCCCACTG GCTGTGCCCG ATCGGAGCCC AAAATTCTCA CACACTACAA ACGGCCCCAT 15240
ACCCTGAACA GCACCAGCAT GTCTAAGGCA TATCAGAGCA CCTTCACAGG CGAAACCAAC 15300
ACCCCATACA GCAAGCAGTT TGTGCACTCC AAGTCATCTC AGTACCGACG CCTGCGTACG 15360
GAGTGGAAGA ACAACGTGTA TCTGGCTCGC TCCCGTATCC AGGGTCTGGG GCTCTATGCA 15420
GCCAAGGACC TAGAAAAGCA CACAATGGTC ATCGAGTACA TTGGCACCAT CATTCGTAAT 15480
GAAGTGGCGA ACAGGCGGGA GAAAATCTAC GAGGAGCAGA ACCGAGGCAT TTACATGTTT 15540
CGAATAAACA ATGAACATGT TATTGATGCC ACATTGACTG GAGGTCCTGC CAGGTACATT 15600
AACCATTCCT GTGCCCCTAA CTGTGTGGCG GAAGTTGTGA CATTTGACAA GGAGGACAAA 15660
ATCATTATCA TCTCCAGCCG GCGAATCCCC AAAGGAGAGG AGCTGACCTA TGACTATCAG 15720
TTTGATTTTG AGGACGATCA GCACAAGATC CCCTGTCACT GTGGAGCCTG GAATTGTCGG 15780
AAATGGATGA AC 15792
Sequence Source Ensembl
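
As a quick consistency check between the nucleotide and protein sequences in this record, the coding sequence can be translated and compared with the protein. The sketch below is an illustration under stated assumptions rather than the method WERAM or Ensembl uses: it assumes the standard genetic code, and `translate` is a hypothetical helper that ignores whitespace and any incomplete trailing codon. Running counts at the ends of lines must be removed before calling it.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a coding sequence; unknown codons become 'X'."""
    cds = "".join(cds.split()).upper()   # drop block spacing
    usable = len(cds) - len(cds) % 3     # ignore an incomplete last codon
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3)
    )


# First two blocks and the final twelve nucleotides of this record.
print(translate("ATGGACGGCC AGAAGCAGCC"))
print(translate("AAATGGATGA AC"))
```

The two fragments translate to the first six and last four residues of the protein shown in this record, which suggests the nucleotide sequence is the coding sequence without a terminal stop codon.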
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 91 0.0 3843
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 90 0.0 3797
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 91 0.0 3785
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 91 0.0 3784
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 91 0.0 3768
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 86 0.0 3766
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 91 0.0 3755
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 90 0.0 3753
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 90 0.0 3752
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 91 0.0 3746
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 89 0.0 3739
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 91 0.0 3737
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 90 0.0 3732
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 89 0.0 3729
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 90 0.0 3716
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 89 0.0 3712
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 90 0.0 3706
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 90 0.0 3684
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 86 0.0 3672
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 89 0.0 3661
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 86 0.0 3631
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 90 0.0 3630
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 86 0.0 3548
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 79 0.0 3189
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 83 0.0 2365
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 89 0.0 2329
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 88 0.0 2227
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 92 0.0 2040
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 85 0.0 1818
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 83 0.0 1669
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 89 0.0 1643
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 90 0.0 1626
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 77 0.0 1567
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 93 0.0 1464
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 87 0.0 1439
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 53 0.0 1310
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 63 0.0 1301
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 66 0.0 1298
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 63 0.0 1293
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 63 0.0 1284
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 83 0.0 1251
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 98 0.0 1218
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 99 0.0 1154
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 88 0.0 1109
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 83 0.0 1033
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 81 0.0 1020
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 81 0.0 1006
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 989
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 988
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 79 0.0 987
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 79 0.0 986
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 979
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 922
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 73 0.0 910
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 73 0.0 909
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 895
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 76 0.0 891
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 847
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 74 0.0 733
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 85 0.0 689
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 0.0 645
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 6e-173 608
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 49 6e-158 558
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 1e-97 358
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 67 8e-95 348
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 2e-92 341
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 3e-45 184
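
The orthology rows above are whitespace-separated, but species names contain two or three words, so splitting on every space does not give fixed columns. A minimal sketch of one way to parse them follows. The `Ortholog` type and `parse_ortholog_row` function are hypothetical names, and the code assumes the first two tokens are the identifiers and the last three are identity, E-value and score.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    evalue: float
    score: int


def parse_ortholog_row(row):
    """Split one orthology row; the species name is whatever lies between
    the two leading identifiers and the three trailing numeric fields."""
    tokens = row.split()
    weram_id, protein_id = tokens[0], tokens[1]
    identity, evalue, score = tokens[-3:]
    species = " ".join(tokens[2:-3])
    return Ortholog(weram_id, protein_id, species,
                    int(identity), float(evalue), int(score))


rows = [
    "WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 91 0.0 3843",
    "WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 90 0.0 3716",
    "WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 6e-173 608",
]
orthologs = [parse_ortholog_row(r) for r in rows]
for o in sorted(orthologs, key=lambda o: o.score, reverse=True):
    print(o.species, o.identity, o.score)
```

Parsed this way, the rows can be filtered or sorted by identity or score without hard-coding the number of words in a species name.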
Created Date 25-Jun-2016