WERAM Information


Tag Content
WERAM ID WERAM-Gog-0092
Ensembl Protein ID ENSGGOP00000007801.2
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSGGOG00000007949.2 ENSGGOT00000008011.2 ENSGGOP00000007801.2
ENSGGOG00000007949.2 ENSGGOT00000022054.1 ENSGGOP00000024047.1
ENSGGOG00000007949.2 ENSGGOT00000031147.1 ENSGGOP00000027250.1
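
The identifiers above follow the usual Ensembl stable-ID pattern: `ENS`, a species prefix (`GGO` here), a feature letter, eleven digits, and a version after the dot. The following is a minimal sketch for splitting such an ID into its parts. The function name `parse_ensembl_id` and the returned field names are illustrative assumptions, not part of WERAM or Ensembl tooling, and the pattern covers only gene, transcript and protein IDs.

```python
import re

# Assumed layout: "ENS", optional species prefix (3-4 letters),
# feature letter (G = gene, T = transcript, P = protein),
# 11 digits, optional ".version" suffix.
ENSEMBL_ID = re.compile(r"^ENS([A-Z]{3,4})?([GTP])(\d{11})(?:\.(\d+))?$")

FEATURE_NAMES = {"G": "gene", "T": "transcript", "P": "protein"}


def parse_ensembl_id(text):
    """Split a (possibly versioned) Ensembl stable ID into its components."""
    match = ENSEMBL_ID.match(text)
    if match is None:
        raise ValueError(f"not a recognised Ensembl stable ID: {text!r}")
    species, feature, digits, version = match.groups()
    return {
        "stable_id": f"ENS{species or ''}{feature}{digits}",
        "species_prefix": species or "",
        "feature": FEATURE_NAMES[feature],
        "version": int(version) if version is not None else None,
    }


if __name__ == "__main__":
    print(parse_ensembl_id("ENSGGOP00000007801.2"))
```

Stripping the version this way can help when matching the three protein IDs listed for the gene against releases that carry different version suffixes.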
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 5.20e-45 152.8 3573 5510
Me_Reader PHD 2.90e-26 91.8 171 5134
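
Each Classification row carries six whitespace-separated fields in the order given by the header (Type, Family, E-value, Score, Start, End). A small sketch for turning a row into a typed record follows; the names `DomainHit` and `parse_hit` are hypothetical. Start and End appear to bracket all matched segments listed under Domain Profile, so the computed span is an envelope and not the length of a single domain.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    regulator_type: str  # e.g. "HMT" or "Me_Reader"
    family: str          # e.g. "SET1" or "PHD"
    e_value: float
    score: float
    start: int           # first residue covered by any matched segment
    end: int             # last residue covered by any matched segment

    @property
    def span(self):
        """Number of residues between start and end, inclusive."""
        return self.end - self.start + 1


def parse_hit(row):
    """Parse one Classification row into a DomainHit."""
    regulator_type, family, e_value, score, start, end = row.split()
    return DomainHit(regulator_type, family, float(e_value),
                     float(score), int(start), int(end))


if __name__ == "__main__":
    for row in ("HMT SET1 5.20e-45 152.8 3573 5510",
                "Me_Reader PHD 2.90e-26 91.8 171 5134"):
        hit = parse_hit(row)
        print(hit.family, hit.e_value, hit.span)
```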
Organism Gorilla gorilla
Domain Profile
  HMT SET1

              SET1.txt   17 akkeiekeelviEYvGevirsevadkreke 46  
k++ e+++l++EY+ + +++++++++++
ENSGGOP00000007801.2 3573 RKQQKEHTNLMAEYRNKQQQQQQQQQQQQQ 3602
455667788999**9999433333333332 PP
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSGGOP00000007801.2 5395 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5479
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSGGOP00000007801.2 5480 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5510
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + + s l+ + + +Cp++ e
ENSGGOP00000007801.2 171 RCSHCTRLGA----SIPCRSpgCPRLYHFPCATASGSFLSmKTLQLLCPEHSE 219
6999933333....599******************888885557899**9975 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC ++ ge + ++ C +C + +H+ C++ +l+ + w Cp+Ck
ENSGGOP00000007801.2 228 RCAVC--EGPGELCdLFFCTSCGHHYHGACLDTALTARKRA-GWQCPECK 274
6****..444545559******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSGGOP00000007801.2 275 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 321
8****99999987.*************************9.*******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSGGOP00000007801.2 1369 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1418
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSGGOP00000007801.2 1420 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1466
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSGGOP00000007801.2 1497 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1548
7*****99999999*****************9933333444434599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSGGOP00000007801.2 5029 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 5061
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSGGOP00000007801.2 5088 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRAKCMFFKDKTMLCPMHK 5134
599996666665....6*9999*********98886666677678888776 PP
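
The rows ending in PP look like the posterior-probability annotation that HMMER prints under each aligned segment. This is an inference from the layout, since the record does not name the search tool. Under that assumption, a digit d stands for roughly (10d ± 5)% confidence that the residue is correctly aligned, `*` stands for 95 to 100%, and `.` marks a column with no target residue. The sketch below decodes such a row; both function names are hypothetical.

```python
def pp_to_percent_range(symbol):
    """Map one PP symbol to an approximate (low, high) percentage range.

    Returns None for '.', which marks a column without a target residue.
    """
    if symbol == "*":
        return (95, 100)
    if symbol == ".":
        return None
    if symbol in "0123456789":
        centre = 10 * int(symbol)
        return (max(0, centre - 5), centre + 5)
    raise ValueError(f"unexpected PP symbol: {symbol!r}")


def min_lower_bound(pp_row):
    """Lowest lower bound over all residue columns of a PP row."""
    ranges = (pp_to_percent_range(symbol) for symbol in pp_row)
    return min(r[0] for r in ranges if r is not None)


if __name__ == "__main__":
    # PP row of the last SET1 segment in this record.
    print(min_lower_bound("******************************6"))
```

Read this way, the SET1 segment covering residues 5395 to 5510 is aligned with high confidence almost throughout, while the first segment (3573 to 3602) falls in a glutamine-rich stretch and carries much lower values.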

Protein Sequence
(Fasta)
MDSQKLAGED KDSEPAADGP AASEDPSATE SDLPNPHVGE VSVLSSGSPR LQETPQDCSG 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG GSPGPNEAVL PSEDLSQIGF 120
PEGLTPAHLG EPGGSCWAHH WCAAWSAGVW GQEGPELCGV DKAIFSGISQ RCSHCTRLGA 180
SIPCRSPGCP RLYHFPCATA SGSFLSMKTL QLLCPEHSEG AAHLEEARCA VCEGPGELCD 240
LFFCTSCGHH YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT 300
FCLKPPMEEL PAHSWKCKAC RVCRACGVGS AELNPNSEWF ENYSLCHRCH KAQGGQPIRS 360
VAEQHTPVCS RFSPPEPGDT PTDEPDALYV ACQGQPKGGH VTSMQPKEPG PLQCEAKPLG 420
RAGVQLEPQL EAPLNEEMPL LPPPEESPLS PPPEESPTSP PPEASRLSPP PEESPASPLP 480
EALHLSRPPE ESPLSPPPEE SPLSPPPESS PFSPLEESPF SPPEESPPSP ALETPLSPPP 540
EASPLSPPFE ESPLSPPPEE LPTSPPPEAS RLSPPPEESP MSPPPEESPM SPPPEASRLF 600
PPFEESPLSP PPEESPLSPP PEASRLSPPP EDSPMSPPPE ESSMSPPPEV SRLSPLPVVS 660
RLSPPPEESP LSPPPEESPT SPPPEASRLS PPPEDSPTSP PPEDSPASPP PEDSLMSLPL 720
EESPLSPLPE EPQLCPMSEG PHLSPRPEEP HLSPRPEEPH LSPQAEEPCL CAVPEEPHLS 780
PQAEGPHLSP QPEELHLSPQ TEELHLSPVP EEPCLSPQPE ESHLSPQSEE PCLSPRPEES 840
HLSPQLEQPP LSPRPEEPPE EPGQCPAPEE LPLFPPPGEP SLSPLLGEPA LSEPGEPPLS 900
PLPEELPLSP SGEPSLSPQL MPPDPLPPPL SPIITAAAPP ALSPLGELEY PFGAKGDSDP 960
ESPLAAPILE TPISPPPEAN CTDPEPVPPM ILPPSPGSPV GPASPILMEP LPPQCSPLLQ 1020
HSLVPQNSPP SQCSPPALPL SVPSPLSPIG KVVGVSDEAE LHEMETEKVS EPECPALEPS 1080
ATSPLPSPMG DLSCPAPSPA PALDDFSGLG EDTAPLDGID AQGSQPEPGQ TPGSLASELK 1140
GSPVLLDPEE LAPVTPMEVY PECKQTAGQG SPCEEQEEPR APVAPTPPTL IKSDIVNEIS 1200
NLSQGDASAS FPGSEPLLGS PDPEGGGSLS MELGVSTDVS PARDEGSLRL CTDSLPETDD 1260
SLLCDAGTAI SGGKAEGEKG RRRSSPARSR IKQGRSSSFP GRRRPRGGAH GGRGRGRARL 1320
KSTASSIETL VVADIDSSPS KEEEEEDDDT MQNTVVLFSN TDKFVLMQDM CVVCGSFGRG 1380
AEGHLLACSQ CSQCYHPYCV NSKITKVMLL KGWRCVECIV CEVCGQASDP SRLLLCDDCD 1440
ISYHTYCLDP PLLTVPKGGW KCKWCVSCMQ CGAASPGFHC EWQNSYTHCG PCASLVTCPI 1500
CHAPYVEEDL LIQCRHCERW MHAGCESLFT EDDVEQAADE GFDCVSCQPY VVKPVAPVAP 1560
PELVPMKVKE PEPQYFRFEG VWLTETGMAL LRNLTMSPLH KRRQRRGRLG LPGEAGLEGS 1620
EPSDALGPDD KKDGDLDTDE LLKGEGGVEH MECEIKLEGP VSPDVEPGKE ETEESKKRKR 1680
KPYRPGIGGF MVRQRKSHTR TKKGPAAQAE VLSGDGQPDE VIPADLPAEG AVEQSLAEGD 1740
EKKKQQRRGR KKSKLEDMFP AYLQEAFFGK ELLDLSRKAL FAVGVGRPSF GLGTPKAKGD 1800
GGSERKELPT SQKGDDGPDI ADEESRGLEG KADTPGPEDG GVKASPVPSD PEKPGTPGEG 1860
MLSSDLDRIS TEELPKMESK DLQQLFKDVL GSEREQHLGC GTPGLEGSRT PLQRPFLQGG 1920
LPLGNLPSSS PMDSYPGLCQ SPFLDSRERG GFFSPEPGEP DSPWTGSGGT TPSTPTTPTT 1980
EGEGDGLSYN QRSLQRWEKD EELGQLSTIS PVLYANINFP NLKQDYPDWS SRCKQIMKLW 2040
RKVPAADKAP YLQKAKDNRA AHRINKVQKQ AESQINKQTK VGDIARKTDR PALHLRIPPQ 2100
PGALGSPPPA AAPTIFIGSP TTPAGLSTSA DGFLKPPAGS VPGPDSPGEL FLKLPPQVPA 2160
QVPSQDPFGL APAYPLEPRF PTAPPTYPPY PSPTGAPAQP PMLGASSRSG AGQPGEFHTT 2220
PPGTPRHQPS TPDPFLKPRC PSLDNLAVPE SPGVGGGKAS EPLLSPPPFG ESRKALEVKK 2280
EELGASSPSY GPPNLGFVDS PSSGPHLGGL ELKTPDVFKA PLTPRASQVE PQSPGLGLRP 2340
QEPPPAQALA PSPPSHPDIF RPGSYPDPYA QPPLTPRPQP PPPESCCALP PRSLPSDPFS 2400
RVPASPQSQS SSQSPLTPRP LSAEAFCPSP VTPRFQSPDP YSRPPSRPQS RDPFAPLHKP 2460
PRPQPPEVAF KAGSLAHTSL GAGGFPAALP SGPAGELHAK VPSGQPPNFV RSPGTGAFVG 2520
TPSPMRFTFP QAVGEPSLKP PVPQPGLPPP HGINSHFGPG PTLGKPQSTN YTVATGNFHP 2580
SGSPLGPSSG STGESYGLSP LRPPSVLPPP APDGSLPYLS HGASQRSGIT SPVEKREDPG 2640
TGMGSSLATA ELPGTQDPGM SGLSQTELEK QRQRQRLREL LIRQQIQRNT LRQEKETAAA 2700
AAGAVGPPGS WGAEPSSPAF EQLSRGQTPF AGTQDKSSLV GLPPSKLSGP ILGPGSFPSD 2760
DRLSRPPPPA TPSSMDVNSR QLVGGSQAFY QRAPYPGSLP LQQQQQQLWQ QQQATAATSM 2820
RFAMSARFPS TPGPELGRQA LGSPLAGIST RLPGPGEPVP GPAGPAQFIE LRHNVQKGLG 2880
PGGTPFPGQG PPQRPRFYPV SEDPHRLAPE GLRGLAVSGL PPQKPSAPPA PELNNSLHPT 2940
PHTKGPTLPT GLELVNRPPS STELGRPTPL ALEAGKLPCE DPELDDDFDA HKALEDDEEL 3000
AHLGLGVDVA KGDDELGTLE NLETNDPHLD DLLNGDEFDL LAYTDPELDT GDKKDIFNEH 3060
LRLVESANEK AEREALLRGV EPGPLGPEER PPAAADASEP RLASVLPEVK PKVEEGGRHP 3120
SPCQFTIATP KVEPAPAANS LGLGLKPGQS MMGSRDTRMG TGPFSSSGHT AEKASFGATG 3180
GPPAHLLTPS PLSGPGGSSL LEKFELESGA LTLPGGPAAS GDELDKMESS LVASELPLLI 3240
EDLLEHEKKE LQKKQQLSAQ LQPAQQQQQQ QQQHSLLSAP GPAQAMSLPH EGSSPSLAGS 3300
QQQLSLGLAG ARQPGLPQPL MPTQPPAHAL QQRLAPSMAM VSNQGHMLSG QHGGQVGLVP 3360
QQSSQPVLSQ KPMGTMPPSM CMKPQQLAMQ QQLANSFFPD TDLDRFAAED IIDPIAKAKM 3420
VALKGIKKVM AQGSIGVAPG MNRQQVSLLA QRLSGGPGSD LQNHVAAGSS QERSAGDPSQ 3480
PRPNPPTFAQ GVINEADQRQ YEEWLFHTQQ LLQMQLKVLE EQIGVHRKSR KALCAKQRTA 3540
KKAGREFPEA DAEKLKLVTE QQSKIQKQLD QVRKQQKEHT NLMAEYRNKQ QQQQQQQQQQ 3600
QQQQHSAVLA LSPSQSPRLL TKLPGQLLPG HGLQPPQGPP GGQAGGLRLP PGGMALPGQP 3660
GGPFLNTALA QQQQQQHSGG AGSLAGPSGG FFPGNLALRS LGPDSRLLQE RQLQLQQQRM 3720
QLAQKLQQQQ QQQQQQQQQH LLGQVAVQQQ QQQGPGVQTN QALGPKPQGL LPPSSHQGLL 3780
VQQLSPQPPQ GPQGMLGPAQ VAVLQQQHPG ALGPQGPHRQ VLMTQSRVLS SPQLAQQGQG 3840
LMGHRLVTAQ QQQQQQQHQQ QGSMAGLSHL QQSLMSHSGQ PKLSAQPMGS LQQLQQQQQL 3900
QQQQQLQQQQ QQQLQQQQQL QQQQLQQQQQ QQQLQQQQQQ QLQQQQQQLQ QQQQQQQQQL 3960
QQQQQQQQQQ MGLLNQSRTL LSPQQQQQQQ VALGPGMPAK PLQHFSSPGA LGPTLLLTGK 4020
EQNTVDPAVS SEATEGPSTH QGGPLAIGTT PESMATEPGE VKPSLSGDSQ LLLVQPQPQP 4080
QPSSLQLQPP LRLPGQQQQQ VSLLHTAGGG SHGQLGSGSS SEASSVPHLL AQPSVSLGDQ 4140
PGPMTQNLLG PQQPMLERPM QNNTGPPQPP KPGPVLQSGQ GLPGVGIMPT VGQLRAQLQG 4200
VLAKNPQLRH LSPQQQQQLQ ALLMQRQLQQ SQAVRQTPPY QEPGTQTSPI QGPLGCQPQL 4260
GGFPGPQTGP LQELGAGPRP QGPPRLPAPP GALSTGPVLG PVHPTPPPSS PQEPKRPSQL 4320
PSPSSQLPTE AQLPPTHPGT PKPQGPTLEL PPGRVSPAAA QLADTLFSKG LGPWDPPDNL 4380
AETQKPEQSS LVPGHLDQVN GQVVPEASQL SIKQEPREEP CALGVQSVKR EANGEPIGAP 4440
GTSNHLLLAG PRSEAGHLLL QKLLRAKNVQ LSTGRGSEGL RAEINGHIDS KLAGLEQKLQ 4500
GTPSNKEDAA ARKPLTPKPK RVQKASDRLV SSRKKLRKED GVRASEALLK QLKQELSLLP 4560
LTEPAITANF SLFAPFGSGC PVNGQSQLRG AFGSGALPTG PDYYSQLLTK NNLSNPPTPP 4620
SSLPPTPPPS VQQKMVNGVT PSEELGEHPK DAASARDSER ALRDTSEVKS LDLLAALPTP 4680
PHNQTEDVRM ESDEDSDSPD SIVPASSPES ILGEEAPRFP HLGSGRWEQE DRALSPVIPL 4740
IPRASIPVFP DTKPYGALDL EVPGKLPATT WEKGKGSEVS VMLTVSAAAA KNLNGVMVAV 4800
AELLSMKIPN SYEVLFPESP ARAGTEPKKG EAEGPGGKEK GLGGKSPDTG PDWLKQFDAV 4860
LPGYTLKSQL DILSLLKQES PAPEPPTQHS YTYNVSNLDV RQLSAPPPEE PSPPPSPLAP 4920
SPASPPTEPL VELPAEPLAE PPVPSPLPLA SSPESARPKP RARPPEEGED SRPPRLKKWK 4980
GVRWKRLRLL LTIQKGSGRQ EDEREVAEFM EQLGTALRPD KVPRDMRRCC FCHEEGDGAT 5040
DGPARLLNLD LDLWVHLNCA LWSTEVYETQ GGALMNVEVA LHRGLLTKCS LCQRTGATSS 5100
CNRMRCPNVY HFACAIRAKC MFFKDKTMLC PMHKIKGPCE QELSSFAVFR RVYIERDEVK 5160
QIASIIQRGE RLHMFRVGGL VFHAIGQLLP HQMADFHSAT ALYPVGYEAT RIYWSLRTNN 5220
RRCCYRCSIG ENNGRPEFVI KVIEQGLEDL VFTDASPQAV WNRIIEPVAA MRKEADMLRL 5280
FPEYLKGEEL FGLTVHAVLR IAESLPGVES CQNYLFRYGR HPLMELPLMI NPTGCARSEP 5340
KILTHYKRPH TLNSTSMSKA YQSTFTGETN TPYSKQFVHS KSSQYRRLRT EWKNNVYLAR 5400
SRIQGLGLYA AKDLEKHTMV IEYIGTIIRN EVANRREKIY EEQNRGIYMF RINNEHVIDA 5460
TLTGGPARYI NHSCAPNCVA EVVTFDKEDK IIIISSRRIP KGEELTYDYQ FDFEDDQHKI 5520
PCHCGAWNCR KWMN 5534
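
The sequence rows above are blocks of ten residues followed by a running position count. A minimal sketch for rejoining such rows into one contiguous string, checking each count along the way, is shown here. The function name `join_numbered_rows` is an illustrative assumption, and the rows must be supplied from the first row onward so the running total starts at zero.

```python
def join_numbered_rows(rows):
    """Concatenate rows of the form 'BLOCK BLOCK ... COUNT'.

    Raises ValueError when a row's trailing count does not equal the
    number of residues read so far.
    """
    pieces = []
    total = 0
    for row in rows:
        *blocks, count = row.split()
        pieces.extend(blocks)
        total += sum(len(block) for block in blocks)
        if total != int(count):
            raise ValueError(f"row ends at {total} but is labelled {count}")
    return "".join(pieces)


if __name__ == "__main__":
    first_rows = [
        "MDSQKLAGED KDSEPAADGP AASEDPSATE SDLPNPHVGE VSVLSSGSPR LQETPQDCSG 60",
        "GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG GSPGPNEAVL PSEDLSQIGF 120",
    ]
    sequence = join_numbered_rows(first_rows)
    print(len(sequence), sequence[:20])
```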
Nucleotide Sequence
(Fasta)
ATGGACAGCC AGAAGCTGGC TGGTGAGGAT AAAGATTCAG AACCGGCAGC TGATGGACCT 60
GCAGCTTCTG AGGACCCAAG TGCCACTGAG TCAGACCTGC CCAACCCGCA TGTGGGAGAG 120
GTCTCTGTCC TTAGTTCTGG GAGTCCCAGG CTTCAGGAGA CTCCTCAGGA CTGCAGTGGG 180
GGTCCGGTGC GGCGTTGTGC TCTCTGTAAC TGCGGGGAGC CCAGTCTACA CGGGCAGCGG 240
GAGCTACGGC GCTTTGAGTT GCCATTTGAT TGGCCCCGGT GTCCAGTGGT GTCCCCTGGG 300
GGGAGCCCAG GGCCCAATGA GGCAGTGCTG CCCAGTGAGG ACCTATCACA GATTGGTTTC 360
CCTGAGGGCC TTACACCTGC CCACCTAGGA GAACCTGGAG GGTCCTGCTG GGCTCACCAT 420
TGGTGTGCTG CATGGTCGGC AGGCGTATGG GGGCAGGAGG GCCCAGAACT ATGTGGTGTG 480
GACAAGGCCA TCTTCTCAGG GATCTCACAG CGCTGCTCCC ACTGCACCAG GCTCGGTGCC 540
TCCATCCCTT GCCGCTCACC TGGATGTCCA CGGCTTTACC ACTTCCCCTG CGCGACTGCC 600
AGCGGTTCCT TCCTATCCAT GAAAACACTG CAGCTGCTAT GCCCAGAGCA CAGTGAGGGG 660
GCTGCACATC TGGAGGAGGC TCGCTGTGCA GTGTGTGAGG GGCCGGGGGA GTTGTGTGAC 720
CTGTTCTTCT GTACCAGCTG TGGGCATCAC TATCATGGGG CCTGCCTGGA CACTGCTCTG 780
ACTGCCCGCA AACGTGCTGG CTGGCAGTGC CCTGAATGCA AAGTGTGCCA AGCCTGCAGG 840
AAACCTGGGA ATGACTCTAA GATGTTGGTT TGTGAGACGT GTGACAAAGG ATACCATACT 900
TTCTGCCTAA AACCACCCAT GGAGGAACTG CCTGCTCACT CTTGGAAGTG CAAGGCGTGC 960
CGGGTGTGCC GGGCCTGTGG GGTGGGCTCA GCAGAACTGA ATCCCAACTC GGAGTGGTTT 1020
GAGAACTACT CTCTCTGTCA CCGCTGTCAC AAAGCCCAGG GAGGTCAGCC TATCCGCTCT 1080
GTTGCTGAGC AGCATACCCC GGTGTGTAGC AGATTTTCAC CCCCAGAGCC TGGCGATACC 1140
CCCACTGACG AGCCCGATGC TCTGTACGTT GCATGCCAAG GGCAGCCAAA GGGTGGGCAC 1200
GTGACCTCTA TGCAACCCAA GGAACCAGGG CCCCTGCAAT GTGAAGCCAA ACCACTAGGG 1260
AGAGCAGGGG TCCAACTTGA GCCCCAGTTG GAGGCCCCCC TAAATGAGGA GATGCCACTG 1320
CTGCCCCCAC CTGAGGAGTC ACCCCTGTCC CCACCACCTG AGGAATCACC CACGTCCCCA 1380
CCACCTGAGG CATCACGCCT GTCCCCACCA CCTGAGGAAT CGCCCGCATC CCCACTTCCT 1440
GAGGCATTGC ACCTGTCCCG GCCGCCGGAG GAATCGCCCC TCTCTCCGCC GCCTGAGGAG 1500
TCTCCTCTGT CTCCCCCACC TGAATCATCA CCTTTTTCTC CACTGGAGGA GTCGCCCTTC 1560
TCTCCACCGG AAGAGTCACC CCCATCTCCT GCACTTGAGA CGCCTCTATC CCCACCACCT 1620
GAAGCATCGC CCCTGTCCCC ACCATTTGAA GAATCTCCTT TGTCCCCGCC ACCTGAGGAA 1680
TTGCCCACTT CCCCGCCACC TGAAGCATCT CGCCTGTCTC CACCACCTGA GGAGTCACCC 1740
ATGTCCCCTC CACCTGAAGA GTCACCCATG TCTCCACCAC CAGAGGCATC TCGTCTGTTC 1800
CCACCATTTG AAGAGTCTCC TCTGTCCCCT CCACCTGAGG AGTCTCCCCT TTCCCCACCA 1860
CCTGAGGCAT CACGCCTGTC CCCACCACCT GAGGACTCGC CTATGTCCCC ACCACCTGAA 1920
GAATCATCTA TGTCCCCCCC ACCTGAGGTA TCGCGCCTAT CCCCCCTGCC TGTGGTGTCA 1980
CGCCTGTCTC CACCGCCTGA GGAATCTCCC TTGTCCCCAC CACCTGAGGA GTCTCCCACG 2040
TCCCCTCCAC CTGAGGCTTC ACGCCTCTCC CCACCACCTG AGGACTCCCC CACATCCCCG 2100
CCACCTGAGG ACTCACCTGC TTCCCCACCA CCAGAGGACT CGCTCATGTC CCTGCCACTG 2160
GAGGAGTCAC CCCTGTCGCC ACTACCTGAG GAGCCGCAAC TCTGCCCCAT GTCCGAGGGG 2220
CCGCACCTGT CACCCCGGCC TGAGGAGCCG CACCTGTCCC CCCGGCCTGA GGAGCCACAC 2280
CTATCTCCGC AGGCTGAGGA GCCATGCCTG TGCGCTGTGC CTGAGGAGCC ACACTTGTCC 2340
CCCCAGGCTG AGGGACCACA TCTGTCCCCT CAGCCTGAGG AATTGCACCT GTCCCCCCAG 2400
ACTGAGGAGC TGCACCTGTC TCCTGTGCCT GAGGAGCCAT GCTTATCCCC CCAACCTGAG 2460
GAATCACACC TGTCCCCCCA GTCTGAGGAA CCATGCCTGT CCCCCCGGCC TGAGGAATCG 2520
CATCTGTCCC CTCAGCTTGA GCAGCCACCC CTGTCCCCTC GGCCTGAAGA GCCCCCTGAG 2580
GAGCCAGGCC AATGCCCTGC ACCTGAGGAG CTGCCCTTGT TCCCTCCCCC TGGGGAACCA 2640
TCCTTATCTC CCTTGCTTGG AGAGCCAGCC CTGTCTGAGC CTGGGGAACC ACCTCTGTCC 2700
CCTCTGCCCG AGGAGCTGCC GTTGTCCCCA TCTGGGGAGC CATCCTTGTC GCCTCAGCTG 2760
ATGCCACCAG ATCCCCTTCC TCCTCCACTC TCACCCATTA TCACAGCTGC GGCCCCACCG 2820
GCCCTGTCTC CTTTGGGGGA GTTAGAGTAC CCCTTTGGTG CCAAAGGGGA CAGTGACCCT 2880
GAGTCACCGT TGGCTGCCCC CATCCTGGAG ACACCCATCA GCCCTCCACC AGAAGCTAAC 2940
TGCACTGACC CTGAGCCTGT CCCCCCTATG ATCCTTCCCC CATCTCCAGG CTCCCCAGTG 3000
GGGCCGGCTT CTCCCATCCT GATGGAGCCC CTTCCTCCTC AGTGTTCTCC ACTCCTTCAG 3060
CATTCCCTGG TTCCCCAAAA CTCCCCTCCT TCCCAGTGCT CTCCTCCTGC CCTACCACTG 3120
TCCGTTCCCT CCCCGTTGAG TCCCATAGGG AAGGTAGTGG GGGTCTCAGA TGAGGCTGAG 3180
CTGCACGAGA TGGAGACTGA GAAAGTTTCA GAACCTGAAT GCCCAGCCTT GGAACCCAGT 3240
GCCACCAGTC CTCTCCCTTC CCCAATGGGG GACCTTTCCT GCCCCGCCCC CAGCCCTGCC 3300
CCAGCCCTGG ATGACTTCTC TGGCCTAGGG GAAGACACAG CCCCTCTGGA TGGGATTGAT 3360
GCTCAGGGTT CACAGCCAGA GCCTGGACAG ACCCCTGGCA GTTTGGCTAG TGAACTTAAA 3420
GGCTCCCCTG TGCTCCTGGA CCCCGAGGAG CTGGCCCCTG TGACCCCTAT GGAGGTCTAC 3480
CCCGAATGCA AGCAGACAGC AGGGCAGGGC TCACCATGTG AAGAACAGGA AGAGCCACGT 3540
GCACCGGTGG CCCCCACACC ACCCACTCTC ATCAAATCCG ACATCGTTAA CGAGATCTCT 3600
AATCTGAGCC AGGGTGATGC CAGTGCCAGT TTTCCTGGCT CAGAGCCCCT CCTGGGCTCT 3660
CCAGACCCAG AGGGGGGTGG CTCCCTGTCC ATGGAGCTGG GGGTCTCTAC GGATGTTAGT 3720
CCAGCCCGAG ATGAGGGCTC CCTACGGCTC TGTACCGACT CACTGCCAGA GACTGATGAC 3780
TCACTATTGT GTGATGCTGG GACAGCTATC AGCGGAGGCA AAGCTGAGGG GGAGAAGGGG 3840
CGGCGGCGCA GCTCCCCAGC TCGTTCCCGC ATCAAACAGG GTCGCAGCAG CAGTTTCCCA 3900
GGAAGACGCC GGCCTCGTGG AGGAGCCCAT GGAGGACGTG GTAGAGGACG GGCCCGGCTA 3960
AAGTCAACTG CTTCTTCCAT TGAGACTCTG GTAGTTGCTG ACATTGATAG CTCTCCCAGT 4020
AAGGAGGAGG AGGAAGAAGA TGATGACACC ATGCAGAATA CCGTGGTTCT CTTCTCCAAC 4080
ACAGACAAAT TTGTCCTAAT GCAGGACATG TGTGTGGTAT GTGGCAGCTT TGGCCGGGGG 4140
GCAGAGGGCC ACCTCCTTGC CTGTTCGCAG TGCTCTCAGT GCTATCACCC TTACTGTGTC 4200
AACAGCAAGA TCACCAAGGT GATGCTGCTC AAGGGCTGGC GTTGTGTGGA GTGTATTGTG 4260
TGTGAGGTGT GTGGCCAGGC CTCCGACCCC TCACGCCTGC TGCTCTGTGA TGACTGTGAT 4320
ATTAGCTACC ACACATACTG CCTGGACCCC CCACTGCTCA CCGTCCCCAA GGGCGGCTGG 4380
AAGTGCAAGT GGTGTGTGTC CTGTATGCAG TGTGGGGCTG CTTCCCCTGG CTTCCACTGT 4440
GAATGGCAGA ATAGTTACAC ACACTGTGGG CCCTGTGCCA GCCTGGTGAC CTGCCCTATC 4500
TGTCATGCTC CTTATGTAGA AGAGGACCTA CTAATCCAGT GCCGCCACTG TGAACGGTGG 4560
ATGCATGCAG GCTGTGAGAG CCTCTTCACA GAGGACGATG TGGAGCAGGC AGCCGATGAA 4620
GGCTTTGACT GTGTCTCCTG CCAGCCCTAC GTGGTAAAGC CTGTGGCGCC TGTTGCACCT 4680
CCAGAGCTGG TGCCCATGAA GGTGAAAGAG CCAGAGCCCC AGTACTTTCG CTTCGAAGGT 4740
GTGTGGCTGA CAGAAACTGG CATGGCCTTG CTGCGTAACC TGACCATGTC ACCACTGCAC 4800
AAGCGGCGCC AACGGCGAGG ACGGCTTGGC CTCCCGGGCG AGGCAGGATT GGAGGGTTCT 4860
GAGCCCTCAG ATGCCCTTGG CCCTGATGAC AAGAAGGATG GGGACCTGGA CACCGATGAG 4920
CTGCTCAAGG GTGAAGGTGG TGTGGAGCAC ATGGAGTGCG AAATTAAACT GGAGGGCCCC 4980
GTCAGCCCTG ATGTGGAGCC TGGCAAAGAG GAGACCGAGG AAAGCAAAAA ACGCAAGCGT 5040
AAACCGTATC GGCCTGGCAT TGGTGGTTTC ATGGTGCGAC AGCGGAAATC CCACACACGC 5100
ACGAAAAAGG GGCCTGCTGC ACAGGCGGAG GTGTTGAGTG GGGATGGGCA GCCCGACGAG 5160
GTGATACCTG CTGACCTGCC TGCAGAGGGC GCCGTGGAGC AGAGCTTAGC TGAAGGGGAT 5220
GAGAAGAAGA AGCAACAGCG GCGAGGGCGC AAGAAGAGCA AACTGGAGGA CATGTTCCCT 5280
GCTTACCTGC AGGAAGCCTT CTTTGGGAAG GAGCTGCTGG ACCTGAGCCG TAAGGCCCTT 5340
TTTGCAGTTG GGGTGGGCCG GCCAAGCTTT GGACTAGGGA CCCCAAAAGC CAAGGGAGAT 5400
GGAGGCTCAG AAAGGAAGGA ACTCCCCACA TCGCAGAAAG GAGATGATGG TCCAGATATT 5460
GCAGATGAAG AATCCCGTGG CCTCGAGGGC AAAGCTGATA CACCAGGACC TGAGGATGGG 5520
GGCGTGAAGG CATCCCCAGT GCCCAGTGAC CCTGAGAAGC CAGGCACCCC AGGTGAAGGG 5580
ATGCTTAGCT CTGACTTAGA CAGGATTTCC ACAGAAGAAC TGCCCAAGAT GGAATCCAAG 5640
GACCTGCAGC AGCTCTTCAA GGATGTTCTG GGCTCTGAAC GAGAACAGCA TCTGGGTTGC 5700
GGAACCCCTG GCCTAGAAGG CAGCCGTACA CCACTGCAGA GGCCCTTTCT TCAAGGTGGA 5760
CTCCCTTTGG GCAATCTGCC CTCCAGCAGC CCAATGGACT CCTACCCAGG CCTCTGCCAG 5820
TCCCCGTTCC TGGATTCTAG GGAGCGCGGG GGCTTCTTTA GCCCGGAACC CGGTGAGCCC 5880
GACAGCCCCT GGACGGGCTC AGGTGGCACC ACGCCCTCCA CCCCCACAAC CCCCACCACG 5940
GAGGGTGAGG GCGACGGACT CTCCTATAAC CAGCGGAGTC TTCAGCGCTG GGAGAAGGAT 6000
GAGGAGTTGG GTCAGCTGTC CACCATCTCA CCCGTGCTCT ATGCCAACAT TAATTTTCCT 6060
AATCTCAAGC AAGATTACCC AGACTGGTCA AGCCGTTGCA AACAAATCAT GAAGCTCTGG 6120
AGAAAGGTTC CAGCAGCTGA CAAAGCCCCC TACCTGCAAA AGGCCAAAGA TAACCGGGCA 6180
GCTCACCGCA TCAACAAGGT GCAGAAGCAG GCTGAGAGCC AGATCAACAA GCAGACCAAG 6240
GTGGGCGACA TAGCCCGTAA GACTGACCGA CCGGCCCTAC ATCTCCGCAT TCCCCCGCAG 6300
CCAGGGGCAC TGGGCAGCCC GCCCCCCGCT GCTGCCCCCA CCATTTTCAT TGGCAGCCCC 6360
ACTACCCCCG CCGGCTTGTC TACCTCTGCG GACGGGTTCC TGAAGCCGCC GGCGGGCTCG 6420
GTGCCTGGCC CTGACTCGCC TGGTGAGCTC TTCCTCAAGC TCCCACCCCA GGTGCCCGCC 6480
CAAGTGCCTT CGCAGGACCC CTTTGGACTG GCCCCTGCCT ATCCCCTAGA GCCCCGCTTC 6540
CCCACGGCAC CACCTACCTA TCCCCCCTAT CCTAGTCCTA CGGGGGCCCC TGCGCAGCCC 6600
CCGATGCTGG GCGCCTCATC TCGTTCTGGG GCTGGCCAGC CAGGGGAATT CCACACTACC 6660
CCACCTGGCA CCCCCAGACA CCAGCCCTCC ACACCTGACC CATTCCTCAA ACCCCGCTGC 6720
CCCTCGCTGG ATAACTTGGC TGTGCCTGAG AGCCCTGGGG TAGGGGGAGG CAAAGCTTCC 6780
GAGCCCCTGC TCTCGCCCCC ACCTTTTGGG GAGTCCCGGA AGGCCCTAGA GGTGAAGAAG 6840
GAAGAGCTTG GGGCATCCTC TCCTAGCTAT GGGCCCCCAA ACCTGGGCTT TGTTGACTCA 6900
CCCTCCTCAG GCCCCCACCT GGGTGGCCTG GAGTTAAAGA CACCTGATGT CTTCAAAGCC 6960
CCCCTGACCC CTCGGGCATC TCAGGTAGAG CCCCAGAGCC CGGGCTTGGG CCTAAGGCCC 7020
CAGGAGCCAC CCCCTGCCCA GGCTTTGGCA CCTTCTCCTC CAAGTCACCC GGACATCTTT 7080
CGCCCTGGCT CCTACCCTGA CCCATATGCT CAGCCCCCAT TGACTCCTCG GCCCCAACCT 7140
CCGCCCCCTG AGAGCTGCTG TGCTCTGCCC CCTCGCTCAC TGCCCTCCGA CCCTTTCTCC 7200
CGAGTGCCTG CCAGTCCTCA GTCCCAGTCC AGCTCCCAGT CTCCACTGAC ACCCCGTCCT 7260
CTGTCTGCTG AAGCTTTTTG CCCATCCCCC GTTACCCCTC GCTTCCAGTC CCCTGACCCT 7320
TATTCTCGCC CACCCTCACG CCCTCAGTCC CGTGACCCAT TTGCCCCATT GCATAAGCCA 7380
CCCCGACCCC AGCCCCCTGA AGTTGCCTTT AAGGCTGGGT CTCTAGCCCA CACTTCGCTG 7440
GGGGCTGGGG GTTTCCCAGC AGCCCTGCCC TCGGGGCCAG CAGGTGAGCT CCATGCCAAG 7500
GTCCCAAGTG GGCAGCCCCC CAATTTTGTC CGGTCCCCTG GGACGGGTGC ATTTGTGGGC 7560
ACCCCCTCTC CCATGCGTTT CACTTTCCCT CAGGCAGTAG GGGAGCCTTC CCTAAAGCCC 7620
CCTGTCCCTC AGCCTGGTCT CCCACCACCC CATGGGATCA ACAGCCATTT TGGGCCCGGC 7680
CCCACCTTGG GCAAGCCTCA AAGCACAAAC TACACAGTAG CCACAGGGAA CTTCCACCCA 7740
TCGGGAAGCC CCCTGGGGCC CAGCAGCGGG TCCACAGGGG AGAGCTATGG GCTGTCCCCG 7800
CTACGCCCTC CGTCGGTTCT GCCACCACCT GCACCCGACG GATCCCTCCC CTACCTGTCC 7860
CATGGAGCCT CACAGCGATC AGGCATCACC TCTCCTGTCG AAAAGCGAGA AGACCCAGGG 7920
ACTGGAATGG GTAGCTCTTT GGCGACAGCT GAACTCCCAG GTACCCAGGA CCCAGGCATG 7980
TCTGGCCTTA GCCAAACAGA GCTGGAGAAG CAACGGCAGC GCCAGCGACT ACGAGAGCTG 8040
CTGATTCGGC AGCAGATCCA GCGCAACACC CTGCGGCAGG AGAAGGAAAC AGCTGCAGCA 8100
GCTGCAGGAG CAGTGGGGCC TCCAGGCAGC TGGGGTGCTG AGCCCAGCAG CCCTGCCTTT 8160
GAGCAGCTGA GTCGAGGCCA GACCCCCTTT GCTGGGACAC AGGACAAGAG CAGCCTTGTG 8220
GGGTTGCCCC CAAGCAAGCT GAGTGGCCCC ATCCTGGGGC CAGGGTCCTT CCCTAGCGAT 8280
GACCGACTCT CCCGGCCACC TCCACCAGCC ACGCCTTCCT CTATGGATGT GAACAGCCGG 8340
CAACTGGTAG GAGGCTCCCA AGCTTTCTAT CAGCGAGCAC CCTATCCTGG GTCCCTGCCC 8400
TTACAGCAGC AACAGCAACA ACTGTGGCAG CAACAACAGG CGACAGCAGC AACCTCCATG 8460
CGATTTGCCA TGTCAGCTCG CTTTCCATCA ACTCCTGGAC CTGAACTTGG CCGCCAAGCC 8520
CTAGGTTCCC CATTGGCGGG AATTTCCACC CGTCTGCCAG GCCCTGGTGA GCCAGTGCCT 8580
GGTCCGGCTG GTCCTGCCCA GTTCATTGAG CTGCGGCACA ATGTACAGAA AGGACTGGGA 8640
CCTGGGGGCA CTCCGTTTCC TGGTCAGGGC CCACCTCAGA GACCCCGTTT TTACCCTGTA 8700
AGTGAGGACC CCCACCGACT GGCTCCTGAA GGGCTTCGGG GCCTGGCGGT ATCAGGTCTT 8760
CCCCCACAGA AACCCTCAGC CCCACCGGCC CCTGAATTGA ACAACAGTCT TCATCCGACA 8820
CCCCACACCA AGGGTCCTAC CCTGCCAACT GGTTTGGAGC TGGTCAACCG GCCCCCATCG 8880
AGCACTGAGC TTGGCCGCCC CACTCCTCTG GCCCTGGAAG CTGGGAAGTT GCCCTGTGAG 8940
GATCCCGAGC TGGATGACGA TTTTGATGCC CACAAGGCCC TAGAGGATGA TGAAGAGCTT 9000
GCTCACCTGG GTCTGGGTGT GGATGTGGCC AAGGGTGATG ATGAACTTGG CACCTTAGAA 9060
AACCTGGAGA CCAATGACCC CCACTTGGAT GACCTGCTCA ATGGAGACGA GTTTGACCTG 9120
CTGGCATATA CTGATCCTGA GCTGGACACT GGGGACAAGA AGGATATCTT CAATGAGCAC 9180
CTGAGGCTGG TAGAATCGGC TAATGAGAAG GCTGAACGGG AGGCTCTGCT GCGGGGGGTG 9240
GAGCCAGGAC CCTTGGGCCC TGAGGAGCGC CCTCCCGCTG CTGCTGATGC CTCTGAACCC 9300
CGCCTGGCAT CTGTGCTCCC TGAGGTGAAG CCCAAGGTGG AGGAGGGTGG ACGCCACCCT 9360
TCTCCTTGCC AATTCACCAT TGCTACCCCC AAGGTAGAGC CCGCACCTGC TGCCAATTCA 9420
CTTGGCCTGG GGCTAAAGCC AGGACAGAGC ATGATGGGCA GCCGGGATAC CCGGATGGGC 9480
ACAGGGCCAT TTTCTAGCAG TGGGCACACA GCTGAGAAGG CCTCCTTTGG GGCCACAGGA 9540
GGACCACCAG CTCACCTGCT GACCCCCAGC CCACTGAGTG GCCCAGGAGG ATCCTCCCTG 9600
CTGGAAAAGT TTGAGCTCGA GAGTGGGGCT TTGACCTTGC CTGGTGGACC TGCAGCATCT 9660
GGGGATGAGC TAGACAAGAT GGAGAGCTCA CTGGTAGCCA GCGAGTTACC CCTGCTCATT 9720
GAGGACCTGT TGGAGCATGA GAAGAAGGAG CTGCAGAAGA AGCAGCAGCT TTCAGCACAG 9780
TTGCAGCCTG CCCAGCAGCA GCAGCAACAG CAGCAGCAGC ATTCCCTACT GTCTGCACCA 9840
GGCCCTGCCC AGGCCATGTC TTTGCCACAT GAGGGCTCTT CTCCCAGTTT GGCTGGGTCC 9900
CAACAGCAGC TTTCCCTGGG TCTTGCAGGT GCCCGACAGC CAGGCTTGCC CCAACCACTG 9960
ATGCCCACCC AGCCACCAGC TCATGCCCTC CAGCAACGCC TGGCTCCATC CATGGCTATG 10020
GTGTCCAATC AAGGGCATAT GCTAAGTGGG CAGCATGGAG GGCAGGTAGG CTTGGTACCC 10080
CAGCAGAGCT CACAGCCAGT GCTATCACAG AAGCCCATGG GCACCATGCC ACCTTCCATG 10140
TGCATGAAGC CGCAGCAACT GGCAATGCAG CAGCAGCTGG CAAACAGCTT CTTCCCAGAT 10200
ACAGACCTGG ACAGATTTGC TGCAGAAGAT ATCATTGATC CCATTGCAAA GGCCAAGATG 10260
GTGGCTTTGA AAGGCATCAA GAAAGTGATG GCTCAGGGCA GCATTGGGGT GGCACCTGGT 10320
ATGAACAGAC AGCAAGTGTC TCTGCTAGCC CAGAGGCTCT CGGGGGGACC TGGCAGTGAT 10380
CTGCAGAACC ATGTGGCAGC TGGGAGTAGC CAGGAGCGGA GTGCTGGTGA TCCCTCCCAG 10440
CCTCGTCCCA ACCCGCCCAC TTTTGCTCAG GGAGTGATCA ATGAAGCTGA CCAGCGGCAG 10500
TATGAGGAGT GGCTGTTCCA TACCCAGCAG CTCCTACAGA TGCAGCTGAA GGTGCTAGAG 10560
GAGCAGATTG GTGTACACCG CAAGTCCCGG AAGGCTCTGT GTGCCAAGCA GCGCACTGCC 10620
AAAAAAGCTG GCCGTGAGTT CCCAGAAGCT GATGCTGAGA AGCTCAAGCT GGTTACAGAG 10680
CAGCAGAGCA AGATCCAGAA ACAACTGGAT CAGGTCCGGA AACAGCAGAA GGAGCACACT 10740
AATCTCATGG CAGAATATCG GAACAAGCAG CAGCAACAAC AACAGCAGCA GCAGCAGCAG 10800
CAACAGCAAC AACACTCAGC TGTGCTGGCT CTCAGCCCTT CCCAGAGTCC CCGGCTGCTC 10860
ACCAAGCTCC CTGGTCAGCT GCTCCCTGGC CATGGGCTGC AGCCACCACA GGGGCCTCCG 10920
GGTGGGCAAG CCGGAGGTCT TCGCCTGCCC CCTGGGGGTA TGGCACTACC TGGACAGCCT 10980
GGTGGCCCCT TCCTTAATAC AGCTCTGGCC CAGCAGCAGC AACAGCAACA TTCTGGTGGG 11040
GCTGGATCCC TGGCTGGCCC TTCAGGGGGC TTCTTCCCTG GCAACCTTGC TCTTCGAAGC 11100
CTCGGACCTG ATTCAAGGCT TTTACAGGAA AGGCAGCTGC AGCTGCAGCA GCAGCGTATG 11160
CAGCTGGCCC AGAAACTGCA GCAGCAGCAG CAGCAGCAGC AGCAACAGCA ACAGCAGCAC 11220
CTTCTAGGAC AGGTGGCAGT CCAGCAGCAA CAGCAGCAGG GTCCTGGAGT ACAGACAAAC 11280
CAAGCTCTGG GTCCCAAGCC CCAGGGGCTT CTGCCTCCCA GCAGCCACCA AGGCCTCCTG 11340
GTCCAGCAGC TGTCCCCTCA ACCACCCCAG GGGCCCCAGG GCATGCTGGG CCCTGCCCAG 11400
GTGGCTGTGT TGCAGCAGCA GCACCCTGGA GCTTTGGGCC CCCAGGGCCC TCACAGACAG 11460
GTGCTTATGA CCCAGTCCCG GGTGCTCAGT TCCCCCCAGC TGGCACAGCA GGGTCAGGGC 11520
CTTATGGGAC ACAGGCTGGT CACAGCCCAG CAGCAGCAGC AGCAACAACA GCACCAACAG 11580
CAAGGGTCCA TGGCAGGGCT GTCCCATCTT CAGCAGAGTC TGATGTCACA CAGTGGGCAG 11640
CCCAAACTGA GCGCTCAGCC CATGGGCTCT TTACAGCAGC TTCAGCAGCA GCAGCAGCTG 11700
CAACAGCAAC AGCAACTTCA GCAGCAGCAG CAGCAGCAGC TACAACAGCA ACAGCAACTT 11760
CAGCAGCAAC AGCTTCAACA GCAGCAACAG CAGCAGCAGC TTCAACAACA GCAGCAGCAA 11820
CAGCTTCAAC AGCAGCAACA GCAGCTACAA CAGCAACAGC AACAGCAGCA GCAGCAGCTA 11880
CAACAGCAAC AGCAACAACA ACAGCAGCAG ATGGGCCTTT TAAACCAGAG TCGAACTTTA 11940
CTGTCTCCTC AGCAACAACA GCAGCAGCAA GTGGCACTTG GCCCTGGCAT GCCAGCAAAG 12000
CCTCTTCAAC ACTTTTCTAG CCCTGGAGCC CTGGGCCCAA CCCTCCTCCT GACGGGCAAG 12060
GAACAAAACA CCGTAGACCC AGCCGTTTCT TCAGAGGCCA CTGAGGGGCC CTCTACACAT 12120
CAGGGAGGGC CGTTAGCAAT AGGAACTACC CCTGAGTCAA TGGCCACTGA ACCAGGAGAG 12180
GTAAAGCCCT CACTCTCTGG GGACTCACAA CTCCTGCTTG TCCAGCCCCA GCCCCAGCCT 12240
CAGCCCAGCT CTCTGCAGCT GCAGCCACCT CTGAGGCTTC CAGGACAACA GCAGCAGCAA 12300
GTTAGCCTGC TCCACACAGC AGGTGGAGGA AGCCATGGGC AGCTAGGCAG TGGATCATCT 12360
TCTGAGGCCT CATCTGTGCC CCACCTGCTG GCTCAGCCCT CTGTTTCCTT AGGGGATCAG 12420
CCTGGGCCCA TGACCCAGAA CCTTCTGGGC CCCCAACAGC CCATGCTAGA GCGGCCCATG 12480
CAAAATAATA CAGGGCCACC ACAACCTCCC AAACCAGGAC CTGTCCTCCA GTCTGGGCAG 12540
GGTCTGCCTG GGGTTGGAAT CATGCCTACG GTGGGTCAGC TTCGAGCACA GCTCCAAGGA 12600
GTCCTGGCCA AAAACCCACA GCTGCGGCAC TTAAGTCCTC AGCAGCAGCA GCAGCTACAG 12660
GCACTCCTCA TGCAGCGGCA GCTGCAGCAG AGTCAGGCAG TACGCCAGAC CCCACCCTAC 12720
CAGGAGCCTG GGACCCAGAC CTCTCCCATC CAGGGCCCCC TGGGCTGCCA ACCTCAACTT 12780
GGGGGCTTCC CTGGACCACA GACAGGCCCC CTCCAGGAGC TAGGGGCAGG GCCTCGACCT 12840
CAGGGCCCAC CCCGGCTCCC TGCCCCACCA GGAGCCTTAT CTACAGGACC AGTCCTTGGC 12900
CCTGTCCATC CCACACCTCC ACCATCCAGC CCTCAAGAGC CAAAGAGACC TTCACAATTA 12960
CCTTCCCCCA GCTCCCAGCT TCCCACTGAG GCCCAGCTCC CTCCCACCCA TCCAGGGACC 13020
CCCAAACCCC AGGGGCCAAC CTTGGAGCTG CCTCCTGGGA GGGTCTCACC TGCTGCTGCC 13080
CAGCTTGCAG ATACCTTGTT TAGCAAGGGT CTGGGACCTT GGGATCCCCC AGACAACCTA 13140
GCAGAAACCC AGAAGCCAGA GCAGAGCAGC CTGGTACCTG GGCATCTGGA CCAGGTGAAT 13200
GGACAGGTGG TGCCTGAGGC ATCCCAACTC AGCATCAAGC AGGAACCTCG GGAAGAGCCA 13260
TGTGCCCTGG GAGTCCAGTC AGTGAAGAGG GAGGCCAATG GGGAGCCAAT AGGGGCACCA 13320
GGAACCAGCA ACCACCTCCT GCTGGCAGGC CCTCGCTCAG AAGCTGGGCA TCTGCTCTTG 13380
CAGAAGCTAC TCCGGGCAAA GAATGTGCAA CTCAGCACTG GGCGGGGGTC CGAGGGGCTG 13440
CGAGCTGAGA TCAACGGGCA CATTGACAGC AAGCTGGCTG GGCTGGAGCA GAAACTACAG 13500
GGTACCCCCA GCAACAAGGA GGATGCAGCA GCAAGGAAGC CTTTGACACC GAAGCCCAAG 13560
CGGGTACAGA AGGCAAGCGA CAGGTTGGTG AGCTCCCGAA AGAAGCTGCG GAAGGAGGAC 13620
GGGGTCAGGG CCAGCGAGGC CTTGCTGAAA CAGCTGAAAC AGGAGCTGTC CCTGCTGCCC 13680
CTAACGGAGC CTGCTATCAC CGCCAATTTT AGCCTCTTTG CTCCCTTTGG CAGTGGCTGC 13740
CCAGTCAATG GGCAGAGCCA GCTGAGGGGG GCCTTTGGAA GTGGGGCGCT GCCCACTGGC 13800
CCTGACTACT ATTCCCAGCT GCTTACCAAG AATAACCTGA GTAACCCGCC GACACCACCC 13860
TCGTCGCTGC CTCCCACCCC ACCCCCATCG GTGCAGCAGA AGATGGTGAA TGGCGTCACC 13920
CCATCTGAAG AGCTGGGGGA GCACCCCAAG GATGCTGCCT CTGCCCGGGA TAGTGAAAGG 13980
GCACTGAGGG ATACTTCAGA GGTGAAGAGT CTAGACCTGC TGGCTGCCTT GCCTACACCC 14040
CCTCACAATC AGACTGAGGA TGTCAGGATG GAGAGTGATG AGGATAGCGA TTCTCCTGAC 14100
AGCATTGTGC CAGCTTCATC CCCTGAGAGC ATCTTGGGGG AGGAGGCCCC TCGTTTCCCT 14160
CATCTGGGCT CAGGCCGGTG GGAGCAAGAG GACCGGGCCC TCTCCCCTGT CATCCCCCTC 14220
ATTCCTCGGG CCAGCATCCC AGTCTTCCCA GATACCAAAC CTTATGGGGC CCTTGACCTG 14280
GAGGTCCCTG GAAAGCTGCC TGCCACAACT TGGGAAAAGG GCAAAGGAAG TGAGGTGTCA 14340
GTCATGCTCA CAGTCTCTGC TGCTGCAGCC AAGAACCTGA ATGGCGTGAT GGTGGCAGTG 14400
GCAGAGCTGC TGAGCATGAA GATCCCCAAC TCCTATGAGG TGCTGTTCCC AGAGAGCCCT 14460
GCCCGGGCAG GCACTGAGCC TAAGAAGGGG GAAGCTGAGG GTCCTGGTGG GAAGGAAAAG 14520
GGTCTGGGAG GCAAGAGCCC AGACACTGGC CCTGATTGGC TGAAGCAGTT TGATGCAGTG 14580
TTGCCTGGCT ATACCCTGAA GAGCCAACTA GACATCTTGA GCCTCCTGAA ACAGGAGAGC 14640
CCCGCCCCAG AGCCACCCAC TCAGCACAGC TATACCTACA ATGTCTCCAA TCTGGATGTG 14700
CGACAGCTCT CAGCCCCACC TCCTGAAGAA CCCTCCCCGC CCCCTTCCCC CTTGGCACCT 14760
TCTCCTGCCA GTCCCCCTAC TGAGCCCTTG GTTGAACTTC CCGCCGAACC CTTGGCTGAG 14820
CCACCCGTCC CCTCACCTCT GCCACTGGCC TCATCCCCTG AATCAGCCCG ACCCAAGCCC 14880
CGTGCCCGGC CCCCTGAAGA AGGTGAAGAT TCCCGTCCTC CTCGCCTCAA GAAATGGAAA 14940
GGAGTGCGCT GGAAGCGGCT TCGGCTGCTG CTGACCATCC AGAAGGGCAG TGGGCGGCAG 15000
GAGGATGAGC GGGAAGTGGC AGAGTTTATG GAGCAGCTTG GCACAGCCTT GCGACCTGAC 15060
AAGGTACCGC GAGACATGCG TCGCTGCTGT TTCTGTCATG AGGAGGGTGA CGGGGCCACT 15120
GATGGGCCTG CCCGTCTGCT GAACCTGGAC CTGGACCTGT GGGTGCACCT CAACTGTGCC 15180
CTTTGGTCCA CGGAGGTGTA TGAGACCCAG GGCGGGGCAC TGATGAATGT GGAGGTTGCC 15240
CTGCACCGAG GACTGCTAAC CAAGTGCTCC CTGTGCCAGC GAACTGGTGC CACCAGCAGC 15300
TGCAATCGCA TGCGTTGCCC CAATGTCTAC CATTTTGCTT GTGCCATCCG TGCCAAGTGC 15360
ATGTTCTTCA AGGACAAGAC CATGCTGTGT CCAATGCATA AGATCAAGGG GCCCTGTGAG 15420
CAAGAGCTGA GCTCTTTTGC TGTCTTCCGG CGGGTCTACA TTGAGCGGGA CGAGGTGAAG 15480
CAAATCGCTA GCATCATTCA GCGGGGAGAA CGGCTGCACA TGTTCCGTGT GGGGGGCCTT 15540
GTGTTCCACG CCATCGGACA GCTGCTGCCT CACCAGATGG CTGACTTTCA TAGTGCCACT 15600
GCCCTCTATC CCGTGGGCTA CGAGGCCACG CGCATCTATT GGAGCCTCCG CACCAACAAT 15660
CGTCGCTGTT GCTATCGCTG TTCTATTGGT GAGAACAACG GGCGGCCGGA GTTTGTAATC 15720
AAAGTCATCG AGCAGGGCCT GGAGGACCTG GTCTTCACTG ACGCCTCTCC CCAGGCCGTG 15780
TGGAATCGCA TCATTGAGCC TGTGGCTGCC ATGAGAAAAG AGGCTGACAT GCTGCGACTC 15840
TTCCCTGAGT ATCTGAAGGG CGAGGAGCTC TTTGGGCTGA CGGTGCATGC CGTGCTTCGC 15900
ATAGCTGAAT CACTGCCCGG GGTGGAGAGC TGTCAAAACT ATTTATTCCG CTATGGGCGC 15960
CACCCCCTTA TGGAGCTGCC ACTCATGATC AACCCCACTG GCTGTGCCCG ATCAGAGCCT 16020
AAAATCCTCA CACACTACAA ACGGCCCCAT ACCCTGAACA GCACCAGCAT GTCTAAGGCA 16080
TATCAGAGCA CCTTCACAGG CGAGACCAAC ACCCCCTACA GCAAGCAGTT TGTGCACTCC 16140
AAGTCATCTC AGTACCGGCG GCTGCGCACC GAATGGAAGA ACAACGTGTA CCTGGCTCGC 16200
TCCCGTATCC AGGGCCTGGG GCTCTATGCA GCCAAGGACC TAGAAAAGCA CACAATGGTT 16260
ATCGAGTACA TTGGCACCAT CATTCGGAAC GAGGTGGCCA ACCGGCGGGA GAAAATCTAC 16320
GAAGAGCAGA ATCGAGGCAT CTACATGTTC CGAATAAACA ATGAACATGT GATTGATGCT 16380
ACGTTGACCG GCGGCCCTGC CAGGTACATT AACCATTCCT GTGCCCCTAA CTGTGTGGCT 16440
GAAGTCGTGA CATTTGACAA AGAGGACAAA ATCATCATCA TCTCCAGCCG GCGAATCCCC 16500
AAAGGAGAGG AGCTAACCTA TGACTATCAG TTTGATTTTG AGGACGATCA GCACAAGATC 16560
CCCTGCCACT GTGGAGCCTG GAATTGTCGG AAATGGATGA ACTAA 16605
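
The nucleotide sequence is the coding sequence for the protein above, so it can be cross-checked by translation: a protein of n residues needs 3 × (n + 1) nucleotides once the stop codon is included. The sketch below assumes the standard genetic code and uses hypothetical helper names (`translate`, `expected_cds_length`). The final row starts at offset 16560, a multiple of 3, so it can be translated on its own.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(n_residues):
    """Nucleotides needed to encode n_residues plus one stop codon."""
    return 3 * (n_residues + 1)


if __name__ == "__main__":
    last_row = "CCCTGCCACT GTGGAGCCTG GAATTGTCGG AAATGGATGA ACTAA"
    print(translate(last_row.replace(" ", "")))
    print(expected_cds_length(5534))
```

The translated final row matches the last residues of the protein sequence (`PCHCGAWNCR KWMN`) followed by the stop codon.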
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 100 0.0 3910
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 99 0.0 3910
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 99 0.0 3902
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 98 0.0 3895
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 98 0.0 3850
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 95 0.0 3830
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 98 0.0 3787
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 94 0.0 3707
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 94 0.0 3704
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 94 0.0 3699
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 95 0.0 3697
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 93 0.0 3667
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 93 0.0 3665
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 94 0.0 3649
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 93 0.0 3646
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 93 0.0 3642
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 92 0.0 3621
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 93 0.0 3620
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 92 0.0 3600
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 90 0.0 3563
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 89 0.0 3459
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 89 0.0 3448
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 89 0.0 3404
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 86 0.0 3377
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 80 0.0 3058
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 85 0.0 2309
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 92 0.0 2154
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 88 0.0 1986
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 90 0.0 1751
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 87 0.0 1701
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 90 0.0 1654
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 90 0.0 1547
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 82 0.0 1404
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 93 0.0 1402
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 66 0.0 1245
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 77 0.0 1221
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 100 0.0 1176
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 99 0.0 1157
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 93 0.0 1121
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 93 0.0 1113
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 89 0.0 1102
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 87 0.0 1031
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 84 0.0 1030
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 82 0.0 1021
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 82 0.0 1007
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 80 0.0 997
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 992
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 989
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 80 0.0 989
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 80 0.0 988
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 80 0.0 988
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 983
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 926
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 74 0.0 908
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 74 0.0 908
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 898
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 76 0.0 893
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 850
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 74 0.0 735
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 93 0.0 686
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 0.0 645
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 3e-172 605
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 2e-158 560
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 2e-97 357
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 67 9e-95 348
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 2e-92 340
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 4e-45 183
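
Species names in the Orthology rows contain two or three words (for example `Mustela putorius furo`), so splitting on whitespace alone misaligns the columns. One way around this, sketched below with a hypothetical `parse_ortholog` helper, is to take the two ID fields from the left and the three numeric fields from the right, and to treat whatever remains in between as the species name.

```python
def parse_ortholog(row):
    """Parse one Orthology row into a dict keyed by the table's column names."""
    fields = row.split()
    if len(fields) < 6:
        raise ValueError(f"too few fields in row: {row!r}")
    identity, e_value, score = fields[-3:]
    return {
        "weram_id": fields[0],
        "ensembl_protein_id": fields[1],
        "species": " ".join(fields[2:-3]),
        "identity": int(identity),
        "e_value": float(e_value),
        "score": int(score),
    }


if __name__ == "__main__":
    rows = [
        "WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 100 0.0 3910",
        "WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 93 0.0 3667",
        "WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 3e-172 605",
    ]
    for row in rows:
        record = parse_ortholog(row)
        print(record["species"], record["identity"], record["score"])
```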
Created Date 25-Jun-2016