WERAM Information


Tag Content
WERAM ID WERAM-Poa-0043
Ensembl Protein ID ENSPPYP00000005112.2
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPPYG00000004478.2 ENSPPYT00000005313.2 ENSPPYP00000005112.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 3.90e-45 152.9 3309 5259
Me_Reader PHD 2.40e-26 91.8 171 4883
Organism Pongo abelii
Domain Profile
  HMT SET1

              SET1.txt   16 vakkeiekeelviEYvGevirsevadkrekeyekke 51  
v k++ e+++l++EY+ + +++++++++++ ++++
ENSPPYP00000005112.2 3309 VRKQQKEHTNLMAEYRNKQQQQQQQQQQQQQ-QQQH 3343
44556778899*****999444444444333.3333 PP
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSPPYP00000005112.2 5144 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5228
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSPPYP00000005112.2 5229 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5259
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + + s l+ + + +Cp++ e
ENSPPYP00000005112.2 171 RCSHCTRLGA----SIPCRSpgCPRLYHFPCATASGSFLSmKTLQLLCPEHSE 219
6999933333....599******************888885557899**9975 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC ++ ge + ++ C +C + +H+ C++ +l+ + w Cp+Ck
ENSPPYP00000005112.2 228 RCAVC--EGPGELCdLFFCTSCGHHYHGACLDTALTARKRA-GWQCPECK 274
6****..444545559******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSPPYP00000005112.2 275 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 321
8****99999987.*************************9.*******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSPPYP00000005112.2 1103 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1152
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSPPYP00000005112.2 1154 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1200
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSPPYP00000005112.2 1231 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1282
7*****99999999*****************9933333444434599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSPPYP00000005112.2 4778 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4810
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSPPYP00000005112.2 4837 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRAKCMFFKDKTMLCPMHK 4883
599996666665....6*9999*********98886666677678888776 PP

Protein Sequence
(Fasta)
MDSQKLAGED KDSEPAADGP AASEDPSVTE SDLPNPHVGE VSVLSSGSPR LQESPQDCSG 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG GSPGPNEAVL PSEDLSQIGF 120
PEGLTPAHLG EPGGSCWAHH WCAAWSAGVW GQEGPELCGV DKAIFSGISQ RCSHCTRLGA 180
SIPCRSPGCP RLYHFPCATA SGSFLSMKTL QLLCPEHSEG AAHLEEARCA VCEGPGELCD 240
LFFCTSCGHH YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT 300
FCLKPPMEEL PAHSWKCKAC RVCRACGVGS AELNPNSEWF ENYSLCHRCH KAQGGQPISS 360
VAEQHTPVCS RFSPPEPGDT PTDEPDALYV ACQGQPKGGH VTSMQPKEPG PLQCEAKPLG 420
RAGVQLEPQL EAPLNEEMPL LPPPEESPLS PPPEESPTSP PPEASRLSPP PEESPASPLP 480
EALHLSRPPE ESPLSPPPEE SPLSPQPESS PFSPLEESPF SPPEESPPSP ALETPLSPPP 540
EASPLSPPFE ESPLSPPPEE LPTSPPPEAS RLSPPPEESP MSPPPEESPM SPPPEASRLF 600
PPFEESPLSP PPEESPLSPP PEASRLSPPP EDSPMSPPPE ESPMSPPPEV SRLSPLPVVS 660
RLSPPPEESP LSPPALSPLG ELEYPFGAKG DSDPESPLAA PILETPISPP PEANCTDPEP 720
VPPMILPPSP GSPMGPASPI LMEPLPPQCS PLLQHSLASQ NSPPSQCSPP ALPLSVPSPL 780
SPIGKVVGVS DEAELHEMET EKVSEPECPA LEPSATSPLP SPMGDLSCPA PSPAPALDDF 840
SGLGEDTAPL DGIDAPGSQP EAGQTPGSLA SELKGSPVLL DPEELAPVTP MEVYPECKQT 900
AGQGSPCEEQ EEPRAPVAPT PPTLIKSDIV NEISNLSQGD ASASFPGSEP LLGSPDPEGG 960
GSLSMELGVS TDVSPARDEG SLRLCTDSLP ETDDSLLCDA GTAISGGKAE GEKGRRRSSP 1020
ARSRIKQGRS SSFPGRRRPR GGAHGGRGRG RARLKSTASS IETLVVADID SSPSKEEEEE 1080
DDDTMQNTVV LFSNTDKFVL MQDMCVVCGS FGRGAEGHLL ACSQCSQCYH PYCVNSKITK 1140
VMLLKGWRCV ECIVCEVCGQ ASDPSRLLLC DDCDISYHTY CLDPPLLTVP KGGWKCKWCV 1200
SCMQCGAASP GFHCEWQNSY THCGPCASLV TCPICHAPYV EEDLLIQCRH CERWMHAGCE 1260
SLFTEDDVEQ AADEGFDCVS CQPYVVKPVA PIAPPELVPM KVKEPEPQYF RFEGVWLTET 1320
GMALLRNLTM SPLHKRRQRR GRLGLPGEAG LEGSEPSDAL GPDDKKDGDL DTDELLKGEG 1380
GVEHMECEIK LEGPVSPDVE PGKEETEESK KRKRKPYRPG IGGFMVRQRK SHTRTKKGPA 1440
AQAEVLSGDG QPDEGETVIP ADLPAEGAVE QSLAEGDEKK KQQRRGRKKS KLEDMFPAYL 1500
QEAFFGKELL DLSRKALFAV GVGRPSFGLG TPKAKGDGSS ERKELPTSQK GDDGPDIADE 1560
ESRGLEGKAD TPGPEDGGVK ASPVPSDPEK PGTPGEGMLS SDLDRISTEE LPKMESKDLQ 1620
QLFKDVLGSE REQHLGCGTP GLEGSRTPLQ RPFLQGGLPL GNLPSSSPMD SYPGLCQSPF 1680
LDSRERGGFF SPEPGEPDSP WTGSGGTTPS TPTTPTTEGE GDGLSYNQRS LQRWEKDEEL 1740
GQLSTISPVL YANINFPNLK QDYPDWSSRC KQIMKLWRKV PAADKAPYLQ KAKDNRAAHR 1800
INKVQKQAES QINKQTKVGD IARKTDRPAL HLRIPPQPGA LGSPPPAAAP TIFIGSPTTP 1860
AGLSTSADGF LKPPAGSVPG PDSPGELFLK LPPQVPAQVP SQDPFGLAPA YPLEPRFPTA 1920
PPTYPPYPSP TGAPVQPPML GTSSRPGTGQ PGEFHTTPPG TPRHQPSTPD PFLKPRCPSL 1980
DNLAVPESPG VGGGKASEPL LSPPPFGESR KALEVKKEEL GASSPSYGPP NLGFVDSPSS 2040
GPHLGGLELK TPDVFKAPLT PRASQVEPQS PGLGLRPQEP PPAQALAPSP PSHPDIFRPG 2100
SYPDPYAQPP LTPRPQPPPP ESCCALPPRS LPSDPFSRVP ASPQSQSSSQ SPLTPRPLSA 2160
EAFCPSPVTP RFQSPDPYSR PPSRPQSRDP FAPLHKPPRP QPPEVAFKAG SLAHTSLGAG 2220
GFPAALPSGP AGELHAKVPS GQPPNFVRSP GTGAFVGTPS PMRFTFPQAV GEPSLKPPVP 2280
QPGLPPPHGI NSHFGPGPTL GKPQSTNYTV ATGNFHPSGS PLGPSSGSTG ESYGLSPLRP 2340
PSVLPPPAPD GSLPYLSHGA SQRSGITSPV EKREDPGAGM GSSLATAELP GTQDPGMSGL 2400
SQTELEKQRQ RQRLRELLIR QQIQRNTLRQ EKETAAAAAG AVGPPGSWGA EPSSPAFEQL 2460
SRGQTPFAGT QDKSSLVGLP PSKLSGPILG PGSFPSDDRL SRPPPPATPS SMDVNSRQLV 2520
GGSQAFYQRA PYPGSLPLQQ QQQQLWQQQQ ATAATSMRFA MSARFPSTPG PELGRQALGS 2580
PLAGIPTRLP GPGEPVPGPA GPAQFIELRH NVQKGLGPGG TPFPGQGPPQ RPRFYPVSED 2640
PHRLAPEGLR GLAVSGLPPQ KPSAPPAPEL NNSLHPTPHT KGPTLPTGLE LVNRPPSSTE 2700
LGRPTPLALE AGKLPCEDPE LDDDFDAHKA LEDDEELAHL GLGVDVAKGD DELGTLENLE 2760
TNDPHLDDLL NGDEFDLLAY TDPELDTGDK KDIFNEHLRL VESANEKAER EALLRGVEPG 2820
PLGPEERPPP AADASEPRLA SVLPEVKPKV EEGGRHPSPC QFTIATPKVE PTPAANSLGL 2880
GLKPGQSMMG SRDTRMGTGP FSSSGHTAEK ASFGATGGPP AHLLTPSPLS GPGGSSLLEK 2940
FELESGALTL PGGPAASGDE LDKMESSLVA SELPLLIEDL LEHEKKELQK KQQLSAQLQP 3000
AQQQPQQQQQ HSLLSAPGPA QAMSLPHEGS SPSLAGSQQQ LSLGLAGARQ PGLPQPLMPT 3060
QPPAHALQQR LAPSMAMVSN QGHMLSGQHG GQAGLVPQQS SQPVLSQKPM GTMPPSMCMK 3120
PQQLAMQQQL ANSFFPDTDL DKFAAEDIID PIAKAKMVAL KGIKKVMAQG SIGVAPGMNR 3180
QQVSLLAQRL SGGPGSDLQN HVAAGSGQER SAGDPSQPRP NPPTFAQGVI NEADQRQYEE 3240
WLFHTQQLLQ MQLKVLEEQI GVHRKSRKAL CAKQRTAKKA GREFPEADAE KLKLVTEQQS 3300
KIQKQLDQVR KQQKEHTNLM AEYRNKQQQQ QQQQQQQQQQ QQHSAVLALS PSQSPRLLTK 3360
LPGQLLPGHG LQPPQGPPGG QAGGLRLPPG GMALPGQPGG PFLNTALAQQ QQQQHSGGAG 3420
SLAGPSGGFF PGNLALRSLG PDSRLLQERQ LQLQQQRMQL AQKLQQQQQQ QQQHLLGQVA 3480
VQQQQQQGPG VQTNQALGPK PQGLLPPGSH QGLLVQQLSP QPPQGPQGML GPTQVAVLQQ 3540
QHPGALGPQG PHRQVLMTQS RVLSSPQLAQ QGQGLMGHRL VTAQQQQQQQ QQHQQQGSMA 3600
GLSHLQQSLM SHSGQPKLSA QPMGSLQQLQ QQQQLQQQQQ LQQQQQQQLQ QQQQQQQLQQ 3660
QQQQLQQQQQ QQQQLQQQQQ QLQQQQQQQL QQQQQQLQQQ QQQLQQQQQQ QQFQQQQQQQ 3720
QMGLLNQSRT LLSPQQQQQQ QVALGPGMPA KPLQHFSSPG ALGPTLLLTG KEQNTVDPAV 3780
SSEATEGPST HQGGPLAIGT TPESMATEPG EVKPPLSGDS QLLLVQPQPQ PQPSSLQLQP 3840
PLRLPGQQQQ QVSLLHTAGG GSHGQLGTGS SSEASSMPHL LAQPSVSLGD QPGPMTQNLL 3900
GPQQPMLERP MQNNTGPQPP KPGPVLQSGQ GLPGVGIMPT VGQLRAQLQG VLAKNPQLRH 3960
LSPQQQQQLQ ALLMQRQLQQ SQAVRQTPPY QEPGTQTSPL QGLLGCQPQL GGFPGPQTGP 4020
LQELGAGPRP QGPPRLPAPP GALSTGPVLG PVHPTPPPSS PQEPKRPSQL PSPSSQLPTE 4080
AQLPPTHPGT PKPQGPTLEL PPGRVSPAAA QLADTLFSKG LGPWDPPDNL AETQKPEQSS 4140
LVPGHLDQVN GQVVPEASQL SIKQEPREEP CALGAQSVKR EANGEPIGVP GTSNHLLLAG 4200
PRSEAGHLLL QKLLRAKNVQ LSTGRGSEGL RAEINGHIDS KLAGLEQKLQ GTPSNKEDAA 4260
ARKPLTPKPK RVQKASDRLV SSRKKLRKED GVRASEALLK QLKQELSLLP LTEPAITANF 4320
SLFAPFGSGC PVNGQSQLRG AFGSGALPTG PDYYSQLLTK NNLSNPPTPP SSLPPTPPPS 4380
VQQKMVNGVT PSEELGEHPK DAASARDTER ALRDTSEVKS LDLLAALPTP PHNQTEDVRM 4440
ESDDSDSPDS IVPASSPESI LGEEAPRFPH LGSGQWEQED RALSPVIPLI PRASIPVFPD 4500
TKPYGALDLE VPGKLPATTW EKGKGSEVSV MLTVSAAAAK NLNGVMVAVA ELLSMKIPNS 4560
YEVLFPESPA RAGTEPKKGE AEGPGGKEKG LGGKSPDTGP DWLKQFDAVL PGYTLKSQLD 4620
ILSLLKQESP APEPPTQHSY TYNVSNLDVR QLSAPPPEEP SPPPSPLAPS PASPPTEPLV 4680
ELPAEPLAEP PVPSPLPLAS SPESARPKPR ARPPEEGEDS RPPRLKKWKG VRWKRLRLLL 4740
TIQKGSGRQE DEREVAEFME QLGTALRPDK VPRDMRRCCF CHEEGDGATD GPARLLNLDL 4800
DLWVHLNCAL WSTEVYETQG GALMNVEVAL HRGLLTKCSL CQRTGATSSC NRMRCPNVYH 4860
FACAIRAKCM FFKDKTMLCP MHKIKGPCEQ ELSSFAVFRR VYIERDEVKQ IASIIQRGER 4920
LHMFRVGGLV FHAIGQLLPH QMADFHSATA LYPVGYEATR IYWSLRTNNR RCCYRCSIGE 4980
NNGRPEFVIK VIEQGLEDLV FTDASPQAVW NRIIEPVAAM RKEADMLRLF PEYLKGEELF 5040
GLTVHAVLRI AESLPGVESC QNYLFRYGRH PLMELPLMIN PTGCARSEPK ILTHYKRPHT 5100
LNSTSMSKAY QSTFTGETNT PYSKQFVHSK SSQYRRLRTE WKNNVYLARS RIQGLGLYAA 5160
KDLEKHTMVI EYIGTIIRNE VANRREKIYE EQNRGIYMFR INNEHVIDAT LTGGPARYIN 5220
HSCAPNCVAE VVTFDKEDKI IIISSRRIPK GEELTYDYQF DFEDDQHKIP CHCGAWNCRK 5280
WMN 5283
Nucleotide Sequence
(Fasta)
ATGGACAGCC AGAAGCTGGC TGGTGAGGAT AAAGATTCAG AACCGGCAGC TGATGGACCT 60
GCAGCTTCTG AGGACCCAAG TGTCACTGAG TCAGATCTGC CCAACCCACA TGTGGGAGAG 120
GTCTCTGTCC TTAGTTCTGG GAGTCCCAGG CTTCAGGAGT CTCCTCAGGA CTGCAGTGGG 180
GGTCCAGTGC GGCGTTGTGC TCTCTGTAAC TGCGGGGAGC CCAGTCTACA TGGGCAGCGG 240
GAGCTACGGC GCTTTGAGTT GCCATTTGAT TGGCCCCGGT GTCCAGTGGT GTCCCCTGGG 300
GGGAGCCCAG GGCCCAATGA GGCAGTGCTG CCCAGTGAGG ACCTATCACA GATTGGTTTC 360
CCTGAGGGCC TTACACCTGC CCACCTAGGA GAACCTGGAG GGTCCTGCTG GGCTCACCAT 420
TGGTGTGCTG CATGGTCGGC AGGCGTATGG GGGCAGGAGG GCCCAGAACT ATGTGGTGTG 480
GACAAGGCCA TCTTCTCAGG GATCTCACAG CGCTGCTCCC ACTGCACCAG GCTCGGTGCC 540
TCCATCCCTT GCCGCTCACC TGGATGTCCA CGGCTTTACC ACTTCCCCTG CGCGACTGCC 600
AGCGGTTCCT TCCTATCCAT GAAAACACTG CAGCTGCTAT GCCCAGAGCA CAGTGAGGGG 660
GCTGCACATC TGGAGGAGGC TCGCTGTGCA GTGTGTGAGG GGCCAGGGGA GTTGTGTGAC 720
CTGTTCTTCT GTACCAGCTG TGGGCATCAC TATCACGGGG CCTGCCTGGA CACTGCTCTG 780
ACTGCCCGCA AACGTGCTGG CTGGCAGTGC CCTGAATGCA AAGTGTGCCA AGCCTGCAGG 840
AAACCTGGGA ATGACTCTAA GATGTTGGTT TGTGAGACGT GTGACAAAGG ATACCATACT 900
TTCTGCCTAA AACCACCCAT GGAGGAACTG CCTGCTCACT CTTGGAAGTG CAAGGCGTGC 960
CGGGTGTGCC GGGCCTGTGG GGTGGGCTCA GCAGAACTGA ATCCCAACTC GGAGTGGTTT 1020
GAGAACTACT CTCTCTGTCA CCGCTGTCAC AAAGCCCAGG GAGGTCAGCC TATCAGCTCC 1080
GTTGCTGAGC AGCATACCCC TGTGTGTAGC AGATTTTCAC CCCCAGAGCC TGGCGATACC 1140
CCCACTGACG AGCCCGATGC TCTGTACGTT GCATGCCAAG GGCAGCCAAA GGGTGGGCAC 1200
GTGACCTCTA TGCAACCCAA GGAACCAGGG CCCCTGCAAT GTGAAGCCAA ACCACTAGGG 1260
AGAGCAGGGG TCCAACTTGA GCCCCAGTTG GAGGCCCCCC TAAACGAGGA GATGCCACTG 1320
CTGCCCCCAC CTGAGGAGTC ACCCCTGTCC CCACCACCTG AGGAGTCACC CACATCCCCA 1380
CCACCTGAGG CATCACGCCT GTCCCCACCA CCTGAGGAAT CGCCCGCATC CCCACTTCCT 1440
GAGGCATTGC ACCTGTCCCG GCCGCCGGAG GAATCGCCCC TCTCTCCGCC GCCTGAGGAG 1500
TCTCCTCTGT CTCCCCAACC TGAATCATCA CCTTTTTCTC CACTGGAGGA GTCGCCCTTC 1560
TCTCCACCGG AAGAGTCACC CCCATCTCCT GCACTTGAGA CGCCTCTATC CCCACCACCT 1620
GAAGCATCGC CCCTGTCCCC CCCATTTGAG GAATCTCCTT TGTCCCCGCC ACCTGAGGAA 1680
TTGCCCACTT CCCCGCCACC TGAAGCATCT CGCCTGTCTC CACCGCCTGA GGAGTCACCC 1740
ATGTCCCCTC CACCTGAAGA GTCACCCATG TCTCCACCAC CGGAGGCATC TCGTCTGTTC 1800
CCACCATTTG AAGAGTCTCC TCTGTCCCCT CCACCTGAGG AGTCTCCCCT CTCCCCACCA 1860
CCTGAGGCAT CACGCCTGTC CCCACCACCT GAGGACTCGC CTATGTCCCC ACCACCTGAA 1920
GAATCACCTA TGTCCCCCCC ACCTGAGGTA TCGCGCCTAT CCCCCCTGCC TGTGGTGTCA 1980
CGCCTGTCTC CACCGCCTGA GGAATCTCCC TTGTCCCCAC CGGCCCTGTC TCCTTTGGGG 2040
GAGTTAGAGT ACCCCTTTGG TGCCAAAGGG GACAGTGACC CTGAGTCACC CTTGGCTGCC 2100
CCCATCCTGG AGACACCCAT CAGCCCTCCA CCAGAAGCTA ACTGCACTGA CCCTGAGCCT 2160
GTCCCGCCTA TGATCCTTCC CCCATCTCCA GGCTCCCCAA TGGGGCCGGC TTCTCCCATC 2220
CTGATGGAGC CCCTTCCTCC TCAGTGTTCG CCACTCCTTC AGCATTCCCT GGCTTCCCAA 2280
AACTCCCCTC CTTCCCAGTG CTCCCCTCCT GCCCTACCGC TGTCCGTTCC CTCCCCGTTA 2340
AGTCCCATAG GGAAGGTAGT GGGGGTCTCA GACGAGGCTG AGCTGCACGA GATGGAGACT 2400
GAGAAAGTTT CAGAACCCGA ATGCCCAGCC TTGGAACCCA GTGCCACCAG TCCTCTCCCT 2460
TCCCCAATGG GGGACCTTTC CTGCCCCGCC CCCAGCCCTG CCCCGGCCCT GGATGACTTC 2520
TCTGGCCTAG GGGAAGACAC AGCCCCTCTG GATGGGATTG ATGCTCCGGG TTCACAGCCA 2580
GAGGCTGGAC AGACCCCTGG CAGTTTAGCT AGTGAACTTA AAGGCTCCCC TGTGCTCCTG 2640
GACCCCGAGG AGCTGGCCCC CGTGACCCCT ATGGAGGTCT ACCCTGAATG CAAGCAGACA 2700
GCAGGGCAGG GCTCACCATG TGAAGAACAG GAAGAGCCAC GTGCACCGGT GGCCCCCACA 2760
CCACCCACTC TCATCAAATC CGACATCGTT AACGAGATCT CTAATCTGAG CCAGGGCGAT 2820
GCCAGTGCCA GTTTTCCTGG CTCAGAGCCC CTCCTGGGCT CTCCAGACCC GGAGGGGGGT 2880
GGCTCCCTGT CCATGGAGCT GGGGGTCTCT ACGGATGTTA GTCCAGCCCG AGATGAGGGC 2940
TCCCTACGGC TCTGTACCGA CTCACTGCCA GAGACTGATG ACTCACTATT GTGCGATGCT 3000
GGGACAGCTA TCAGCGGAGG CAAAGCTGAG GGGGAGAAGG GGCGGCGGCG CAGCTCCCCA 3060
GCCCGTTCCC GCATCAAACA GGGTCGCAGC AGCAGTTTCC CAGGAAGACG CCGGCCTCGT 3120
GGAGGAGCCC ATGGAGGACG TGGTAGAGGA CGGGCCCGGC TAAAGTCAAC TGCTTCTTCC 3180
ATTGAGACTC TGGTAGTTGC TGACATTGAT AGCTCTCCCA GTAAGGAGGA GGAGGAAGAA 3240
GATGACGACA CCATGCAGAA TACCGTGGTT CTCTTCTCCA ACACAGACAA ATTTGTCCTA 3300
ATGCAGGACA TGTGTGTGGT ATGTGGCAGC TTTGGCCGGG GGGCAGAGGG CCACCTCCTT 3360
GCCTGTTCGC AGTGCTCTCA GTGCTATCAC CCTTACTGTG TCAACAGCAA GATCACCAAG 3420
GTGATGCTGC TCAAGGGCTG GCGTTGTGTG GAGTGTATTG TGTGTGAGGT GTGTGGCCAG 3480
GCCTCCGACC CCTCACGCCT GCTGCTCTGT GATGACTGTG ATATTAGCTA CCACACATAC 3540
TGCCTGGACC CCCCACTGCT CACCGTCCCC AAGGGCGGCT GGAAGTGCAA GTGGTGTGTG 3600
TCGTGTATGC AGTGTGGGGC TGCTTCCCCT GGCTTCCACT GTGAATGGCA GAATAGTTAC 3660
ACACACTGTG GGCCCTGTGC CAGCCTGGTG ACCTGCCCTA TCTGTCATGC TCCTTATGTA 3720
GAAGAGGACC TACTAATCCA GTGCCGCCAC TGTGAACGGT GGATGCATGC TGGCTGTGAG 3780
AGCCTCTTCA CAGAGGACGA TGTGGAGCAG GCAGCCGATG AAGGCTTTGA CTGTGTCTCC 3840
TGCCAGCCCT ACGTGGTAAA GCCTGTGGCG CCTATTGCAC CTCCAGAGCT GGTGCCCATG 3900
AAGGTGAAAG AGCCAGAGCC CCAGTACTTT CGCTTCGAAG GTGTGTGGCT GACAGAAACT 3960
GGCATGGCCT TGCTGCGTAA CCTGACCATG TCACCACTGC ACAAGCGGCG CCAACGGCGA 4020
GGACGGCTTG GCCTCCCAGG CGAGGCAGGA TTGGAGGGTT CTGAGCCCTC AGATGCTCTT 4080
GGCCCTGATG ACAAGAAGGA TGGGGACCTG GACACCGATG AGCTGCTCAA GGGTGAAGGT 4140
GGTGTGGAGC ACATGGAGTG CGAAATTAAA CTGGAGGGCC CCGTCAGCCC TGATGTGGAG 4200
CCTGGCAAAG AGGAGACCGA GGAAAGCAAA AAACGCAAGC GTAAACCGTA TCGGCCTGGC 4260
ATTGGTGGTT TCATGGTGCG ACAGCGGAAA TCCCACACAC GCACGAAAAA GGGGCCTGCT 4320
GCACAGGCGG AGGTGTTGAG TGGGGATGGG CAGCCCGACG AGGGTGAGAC GGTGATACCT 4380
GCTGACCTGC CTGCAGAGGG CGCCGTGGAG CAGAGCTTAG CTGAAGGGGA TGAGAAGAAG 4440
AAGCAACAGC GGCGAGGGCG CAAGAAGAGC AAACTGGAGG ACATGTTCCC TGCTTACCTG 4500
CAGGAAGCCT TCTTTGGGAA GGAGCTGCTG GACCTGAGTC GTAAGGCCCT TTTTGCAGTT 4560
GGGGTGGGCC GGCCAAGCTT TGGACTAGGG ACCCCAAAAG CCAAGGGAGA TGGAAGCTCA 4620
GAAAGGAAGG AACTCCCCAC ATCGCAGAAA GGAGATGATG GTCCAGATAT TGCAGATGAA 4680
GAATCCCGTG GCCTCGAGGG CAAAGCTGAT ACACCAGGAC CTGAGGATGG GGGCGTGAAG 4740
GCATCCCCAG TGCCCAGTGA CCCTGAGAAG CCAGGCACCC CAGGTGAAGG GATGCTTAGC 4800
TCTGACTTAG ACAGGATTTC CACAGAAGAA CTGCCCAAGA TGGAATCCAA GGACCTGCAG 4860
CAGCTCTTCA AGGATGTTCT GGGCTCTGAA CGAGAACAGC ATCTGGGTTG TGGAACCCCT 4920
GGCCTAGAAG GCAGCCGTAC ACCACTGCAG AGGCCCTTTC TTCAAGGTGG ACTCCCTTTG 4980
GGCAATCTGC CCTCCAGCAG CCCAATGGAC TCCTACCCAG GCCTCTGCCA GTCCCCGTTC 5040
CTGGATTCTA GGGAGCGCGG GGGCTTCTTT AGCCCGGAAC CCGGTGAGCC CGACAGCCCC 5100
TGGACAGGCT CAGGTGGCAC CACGCCCTCC ACCCCCACAA CCCCCACCAC GGAGGGTGAG 5160
GGCGACGGAC TCTCCTATAA CCAGCGGAGT CTTCAGCGCT GGGAGAAGGA TGAGGAGTTG 5220
GGCCAGCTGT CCACCATCTC ACCTGTGCTC TATGCCAACA TTAATTTTCC TAATCTCAAG 5280
CAAGATTACC CAGACTGGTC AAGCCGTTGC AAACAAATCA TGAAGCTCTG GAGAAAGGTT 5340
CCAGCAGCTG ACAAAGCCCC CTACCTGCAA AAGGCCAAAG ATAACCGGGC AGCTCACCGC 5400
ATCAACAAGG TGCAGAAGCA GGCTGAGAGC CAGATCAACA AGCAGACCAA GGTGGGCGAC 5460
ATAGCCCGTA AGACTGACCG ACCGGCCCTA CATCTCCGCA TTCCCCCGCA GCCAGGGGCA 5520
CTGGGCAGCC CGCCCCCCGC TGCTGCCCCC ACCATTTTCA TTGGCAGCCC CACTACCCCC 5580
GCCGGCTTGT CTACCTCTGC GGACGGGTTC CTGAAGCCGC CGGCGGGCTC GGTGCCTGGC 5640
CCTGACTCGC CTGGTGAGCT CTTCCTCAAG CTCCCACCCC AGGTGCCCGC CCAAGTGCCT 5700
TCGCAGGACC CCTTTGGACT GGCCCCTGCC TATCCCCTGG AGCCCCGCTT CCCCACGGCA 5760
CCACCCACCT ATCCCCCCTA TCCTAGTCCT ACGGGGGCCC CTGTGCAGCC CCCAATGCTG 5820
GGCACCTCAT CTCGTCCTGG GACTGGCCAG CCAGGGGAAT TCCACACTAC CCCACCTGGC 5880
ACCCCCAGAC ACCAGCCCTC CACACCTGAC CCATTCCTCA AACCCCGCTG CCCCTCGCTG 5940
GATAACTTGG CTGTGCCTGA GAGCCCTGGG GTAGGGGGAG GCAAAGCTTC CGAGCCCCTG 6000
CTCTCGCCCC CACCTTTTGG GGAGTCCCGG AAGGCCCTAG AGGTGAAGAA GGAAGAGCTT 6060
GGGGCATCCT CTCCTAGCTA TGGGCCCCCA AACCTGGGCT TTGTTGACTC ACCCTCCTCA 6120
GGCCCCCACC TGGGTGGCCT GGAGTTAAAG ACACCTGATG TCTTCAAAGC CCCCCTGACC 6180
CCTCGGGCAT CTCAGGTAGA GCCCCAGAGC CCGGGCTTGG GCCTAAGGCC CCAGGAGCCA 6240
CCCCCTGCCC AGGCTTTGGC ACCTTCTCCT CCAAGTCACC CAGACATCTT TCGCCCTGGT 6300
TCCTACCCTG ACCCATATGC TCAGCCCCCA TTGACTCCTC GGCCCCAACC TCCGCCCCCT 6360
GAGAGCTGCT GTGCTCTGCC CCCTCGCTCA CTGCCCTCCG ACCCTTTCTC CCGAGTGCCT 6420
GCCAGTCCTC AGTCCCAGTC CAGCTCCCAG TCTCCACTGA CACCCCGTCC TCTGTCTGCT 6480
GAAGCTTTTT GCCCATCCCC AGTTACCCCT CGCTTCCAGT CCCCTGACCC TTATTCTCGC 6540
CCACCCTCAC GCCCTCAGTC CCGTGACCCA TTTGCCCCAT TGCATAAGCC ACCCCGACCC 6600
CAGCCCCCTG AAGTTGCCTT TAAGGCTGGG TCTCTAGCCC ACACTTCGCT GGGGGCTGGG 6660
GGTTTCCCAG CAGCCCTGCC CTCGGGGCCA GCAGGTGAGC TCCATGCCAA GGTCCCAAGT 6720
GGGCAGCCCC CCAATTTTGT CCGGTCCCCT GGGACGGGTG CATTTGTGGG CACCCCCTCT 6780
CCCATGCGTT TCACTTTCCC TCAGGCAGTA GGGGAGCCTT CCCTAAAGCC CCCGGTCCCT 6840
CAGCCTGGTC TCCCGCCACC CCATGGGATC AACAGCCATT TTGGGCCCGG CCCCACCTTG 6900
GGCAAGCCTC AAAGCACAAA CTACACAGTA GCCACAGGGA ACTTCCACCC ATCGGGCAGC 6960
CCCCTGGGGC CCAGCAGCGG GTCCACAGGG GAGAGCTATG GGCTGTCCCC GCTACGCCCT 7020
CCGTCGGTTC TGCCACCACC TGCACCCGAT GGATCCCTCC CCTACCTGTC CCATGGAGCC 7080
TCACAGCGAT CAGGCATCAC CTCTCCTGTC GAAAAGCGAG AAGACCCAGG GGCTGGAATG 7140
GGTAGCTCTT TGGCGACAGC TGAACTCCCA GGTACCCAGG ACCCAGGCAT GTCCGGCCTT 7200
AGCCAAACAG AGCTGGAGAA GCAACGGCAG CGCCAGCGAC TACGAGAGCT GCTGATTCGG 7260
CAGCAGATCC AGCGCAACAC CCTGCGGCAG GAGAAGGAAA CAGCTGCAGC AGCTGCAGGA 7320
GCAGTGGGGC CTCCAGGCAG CTGGGGTGCT GAGCCCAGCA GCCCTGCCTT TGAGCAGCTG 7380
AGTCGAGGCC AGACCCCCTT TGCTGGGACA CAGGACAAGA GCAGCCTTGT GGGGTTGCCC 7440
CCAAGCAAGC TGAGTGGCCC CATCCTGGGG CCAGGGTCCT TCCCTAGCGA TGACCGACTC 7500
TCCCGGCCAC CTCCACCAGC CACGCCTTCC TCTATGGATG TGAACAGCCG GCAACTGGTA 7560
GGAGGCTCCC AAGCTTTCTA TCAGCGAGCA CCCTATCCTG GGTCCCTGCC CTTACAGCAG 7620
CAACAGCAAC AACTGTGGCA GCAACAACAG GCAACAGCAG CAACCTCCAT GCGATTTGCC 7680
ATGTCAGCTC GCTTTCCATC AACTCCTGGA CCTGAACTTG GCCGCCAAGC CCTAGGTTCC 7740
CCGTTGGCGG GAATTCCCAC CCGTCTGCCA GGCCCTGGTG AGCCAGTGCC TGGTCCAGCT 7800
GGTCCTGCCC AGTTTATTGA GCTGCGGCAC AATGTACAGA AAGGACTGGG ACCTGGGGGC 7860
ACTCCGTTTC CTGGTCAGGG CCCACCTCAG AGACCCCGTT TTTACCCTGT AAGTGAGGAC 7920
CCCCACCGAC TGGCTCCTGA AGGGCTTCGG GGCCTGGCAG TATCAGGTCT TCCCCCACAG 7980
AAACCCTCAG CTCCACCGGC CCCTGAATTG AACAACAGTC TTCATCCGAC ACCCCACACC 8040
AAGGGTCCTA CCCTGCCAAC TGGTTTGGAG CTGGTCAACC GGCCCCCCTC GAGCACTGAG 8100
CTTGGCCGCC CCACTCCTCT GGCCCTGGAA GCTGGGAAGT TGCCCTGTGA GGATCCCGAG 8160
CTGGATGACG ATTTTGATGC CCACAAGGCC CTAGAGGACG ATGAAGAGCT TGCTCACCTG 8220
GGTCTGGGTG TGGATGTGGC CAAGGGTGAT GATGAACTTG GCACCTTAGA AAACCTGGAG 8280
ACCAATGACC CCCACTTGGA TGACCTGCTC AATGGAGACG AGTTTGACCT GCTGGCATAT 8340
ACTGATCCTG AGCTGGACAC TGGGGACAAG AAGGATATCT TCAATGAGCA CCTGAGGCTG 8400
GTAGAATCGG CTAATGAGAA GGCTGAACGG GAGGCCCTGC TGCGGGGGGT GGAGCCAGGA 8460
CCCTTGGGCC CTGAGGAGCG CCCTCCCCCT GCTGCTGATG CCTCTGAGCC CCGCCTGGCA 8520
TCTGTGCTCC CTGAGGTGAA GCCCAAGGTG GAGGAGGGTG GACGCCACCC TTCCCCTTGC 8580
CAGTTCACCA TTGCTACCCC CAAGGTAGAG CCCACACCTG CTGCCAATTC CCTTGGCCTG 8640
GGGCTAAAGC CGGGACAGAG CATGATGGGC AGCCGGGATA CCCGGATGGG CACAGGGCCA 8700
TTTTCTAGCA GTGGGCACAC AGCTGAGAAG GCCTCCTTTG GGGCCACAGG AGGACCACCA 8760
GCTCATCTGC TGACCCCCAG CCCACTGAGT GGCCCAGGAG GATCCTCCCT GCTGGAAAAG 8820
TTTGAGCTCG AGAGTGGAGC TTTGACCTTG CCTGGTGGAC CTGCAGCATC TGGGGATGAG 8880
CTAGACAAGA TGGAGAGCTC ACTGGTAGCC AGCGAGTTAC CCCTGCTCAT TGAGGACCTG 8940
TTGGAGCATG AGAAGAAGGA GCTGCAGAAG AAGCAGCAGC TTTCAGCACA GTTGCAGCCT 9000
GCCCAGCAGC AGCCACAACA GCAGCAGCAG CATTCCCTAC TGTCTGCACC AGGCCCTGCC 9060
CAGGCCATGT CTTTGCCACA TGAGGGCTCT TCTCCCAGTT TGGCTGGGTC CCAACAGCAG 9120
CTTTCCCTGG GTCTTGCAGG TGCCCGACAG CCAGGCTTGC CCCAACCACT GATGCCCACC 9180
CAGCCACCAG CTCATGCCCT CCAGCAACGC CTGGCTCCAT CCATGGCTAT GGTGTCCAAT 9240
CAAGGGCATA TGCTAAGTGG GCAGCATGGA GGGCAGGCAG GGTTGGTACC CCAGCAGAGC 9300
TCACAGCCAG TGCTATCACA GAAGCCCATG GGCACCATGC CACCGTCCAT GTGCATGAAG 9360
CCGCAGCAAC TGGCAATGCA GCAGCAGCTG GCAAACAGCT TCTTCCCAGA TACAGACCTG 9420
GACAAATTTG CTGCAGAAGA TATCATTGAT CCCATTGCAA AGGCCAAGAT GGTGGCTTTG 9480
AAAGGCATCA AGAAAGTGAT GGCTCAGGGC AGCATTGGGG TGGCACCTGG TATGAACAGA 9540
CAGCAAGTGT CTCTGCTAGC CCAGAGGCTC TCGGGGGGAC CTGGCAGTGA TCTGCAGAAC 9600
CATGTGGCAG CTGGGAGTGG CCAGGAGCGG AGTGCTGGTG ATCCCTCCCA GCCTCGTCCC 9660
AACCCGCCCA CTTTTGCTCA GGGAGTGATC AATGAAGCTG ACCAGCGGCA GTATGAGGAG 9720
TGGCTGTTCC ATACCCAGCA GCTCCTACAG ATGCAGCTGA AGGTGTTAGA GGAGCAGATT 9780
GGTGTACACC GCAAGTCCCG GAAGGCTCTG TGTGCCAAGC AGCGCACTGC CAAAAAAGCT 9840
GGCCGTGAGT TCCCAGAAGC TGATGCTGAG AAGCTCAAGC TGGTTACAGA GCAGCAGAGC 9900
AAGATCCAGA AACAACTAGA TCAGGTCCGG AAACAGCAGA AGGAGCACAC TAATCTCATG 9960
GCAGAATATC GGAACAAGCA GCAGCAACAG CAGCAGCAGC AGCAGCAGCA GCAGCAACAG 10020
CAACAACACT CAGCTGTGCT GGCTCTCAGC CCTTCCCAGA GTCCCCGGCT GCTCACCAAG 10080
CTCCCTGGTC AGCTGCTCCC TGGCCATGGG CTGCAGCCAC CACAGGGGCC TCCGGGTGGG 10140
CAAGCCGGAG GTCTTCGCCT GCCCCCTGGG GGTATGGCAC TACCTGGACA GCCTGGTGGC 10200
CCCTTCCTTA ATACAGCTCT GGCCCAACAG CAGCAACAGC AACATTCTGG TGGGGCTGGA 10260
TCCCTGGCTG GCCCCTCAGG GGGCTTCTTC CCTGGCAACC TTGCTCTTCG AAGCCTCGGA 10320
CCTGATTCAA GGCTTTTACA GGAAAGGCAG CTGCAGCTGC AGCAGCAACG TATGCAGCTG 10380
GCCCAGAAAC TGCAGCAGCA GCAGCAACAG CAGCAGCAGC ACCTTCTAGG ACAGGTGGCA 10440
GTCCAGCAGC AACAGCAGCA GGGTCCTGGA GTACAGACAA ACCAAGCTCT GGGTCCCAAG 10500
CCCCAGGGGC TTCTGCCTCC CGGCAGCCAC CAAGGCCTCC TGGTCCAGCA GCTGTCCCCT 10560
CAACCACCCC AGGGGCCCCA GGGCATGCTG GGCCCTACCC AGGTGGCTGT GTTGCAGCAG 10620
CAGCACCCTG GAGCTTTGGG CCCCCAGGGC CCTCACAGAC AGGTGCTTAT GACCCAGTCC 10680
CGGGTGCTCA GTTCCCCCCA GCTGGCACAG CAGGGTCAGG GCCTTATGGG ACACAGGCTG 10740
GTCACAGCCC AGCAGCAGCA GCAGCAGCAA CAACAGCACC AACAGCAAGG GTCCATGGCA 10800
GGGCTGTCCC ATCTTCAGCA GAGTCTGATG TCACACAGTG GGCAGCCCAA ACTGAGTGCT 10860
CAGCCCATGG GCTCTTTACA GCAGCTTCAG CAGCAGCAGC AGCTGCAACA GCAACAGCAA 10920
CTTCAGCAGC AGCAGCAGCA ACAGCTTCAA CAGCAGCAAC AGCAGCAGCA GCTTCAACAG 10980
CAGCAGCAGC AGCTTCAACA GCAGCAGCAA CAGCAGCAGC AACTTCAACA GCAACAACAG 11040
CAGCTACAAC AGCAACAGCA GCAGCAGCTT CAACAGCAGC AACAGCAGCT ACAACAGCAA 11100
CAGCAACAGC TACAACAGCA ACAACAACAG CAGCAGTTTC AACAGCAGCA GCAACAGCAG 11160
CAGATGGGCC TTTTAAACCA GAGTCGAACT TTACTGTCTC CTCAGCAACA ACAGCAGCAG 11220
CAAGTGGCAC TTGGCCCTGG CATGCCAGCA AAGCCTCTTC AACACTTTTC TAGCCCTGGA 11280
GCCCTGGGCC CAACCCTCCT CCTGACGGGC AAGGAACAAA ACACCGTAGA CCCAGCCGTT 11340
TCTTCAGAGG CCACTGAGGG GCCCTCTACA CATCAGGGAG GGCCGTTAGC AATAGGAACT 11400
ACCCCTGAGT CAATGGCCAC TGAACCAGGA GAGGTAAAGC CCCCACTCTC TGGGGACTCA 11460
CAACTCCTGC TTGTCCAACC CCAGCCCCAG CCTCAGCCCA GCTCTCTGCA GCTGCAGCCA 11520
CCTCTGAGGC TTCCAGGACA ACAGCAGCAG CAAGTTAGCC TGCTCCACAC AGCAGGTGGA 11580
GGAAGCCATG GGCAGCTAGG CACTGGATCA TCTTCTGAGG CCTCATCTAT GCCCCACCTG 11640
CTGGCTCAGC CCTCTGTTTC CTTAGGGGAT CAGCCTGGGC CCATGACCCA GAACCTTCTG 11700
GGCCCCCAGC AGCCCATGCT AGAGCGGCCC ATGCAAAATA ATACAGGGCC ACAACCTCCC 11760
AAACCAGGAC CTGTCCTCCA GTCTGGGCAG GGTCTGCCTG GGGTTGGAAT CATGCCTACG 11820
GTGGGTCAGC TTCGAGCACA GCTCCAAGGA GTCCTGGCCA AAAACCCACA GCTGCGGCAC 11880
TTAAGTCCTC AGCAGCAGCA GCAGCTACAG GCACTCCTCA TGCAGCGGCA GCTGCAGCAG 11940
AGTCAGGCAG TACGCCAGAC CCCACCCTAC CAGGAGCCTG GGACCCAGAC CTCTCCCCTC 12000
CAGGGCCTCC TGGGCTGCCA ACCTCAACTT GGGGGCTTCC CTGGACCACA GACAGGCCCC 12060
CTCCAGGAGC TAGGGGCAGG GCCTCGACCT CAGGGCCCAC CCCGGCTCCC TGCCCCACCA 12120
GGAGCCTTAT CTACAGGACC AGTCCTTGGC CCTGTCCATC CCACACCTCC ACCATCCAGC 12180
CCTCAAGAGC CAAAGAGACC TTCACAATTA CCTTCCCCCA GCTCCCAGCT TCCCACTGAG 12240
GCCCAGCTCC CTCCCACCCA TCCAGGGACC CCCAAACCCC AGGGGCCAAC CTTGGAGCTG 12300
CCTCCTGGGA GGGTCTCACC TGCTGCTGCC CAGCTTGCAG ATACCTTGTT TAGCAAGGGT 12360
CTGGGACCTT GGGATCCCCC AGACAACCTA GCAGAAACCC AGAAGCCAGA GCAGAGCAGC 12420
CTGGTACCTG GGCATCTGGA CCAGGTGAAT GGACAGGTGG TGCCTGAGGC ATCCCAACTC 12480
AGCATCAAGC AGGAACCTCG GGAAGAGCCA TGTGCCCTGG GAGCCCAGTC AGTGAAGAGG 12540
GAGGCCAATG GGGAGCCAAT AGGGGTACCA GGAACCAGCA ACCACCTCCT GCTGGCAGGC 12600
CCTCGCTCAG AAGCTGGGCA TCTGCTCTTG CAGAAGCTTC TCCGGGCAAA GAATGTGCAA 12660
CTCAGCACTG GGCGGGGGTC CGAGGGGCTG CGAGCTGAGA TCAACGGGCA CATTGACAGC 12720
AAGCTGGCTG GGCTGGAGCA GAAACTACAG GGTACCCCCA GCAACAAGGA GGATGCAGCA 12780
GCAAGGAAGC CTTTGACACC GAAGCCCAAG CGGGTACAGA AGGCAAGCGA CAGGTTGGTG 12840
AGCTCCCGAA AGAAGCTGCG GAAGGAGGAC GGGGTCAGGG CCAGCGAGGC CTTGCTGAAA 12900
CAGCTGAAAC AGGAGCTGTC CCTGCTGCCC CTAACGGAGC CTGCTATCAC CGCCAATTTT 12960
AGCCTCTTTG CCCCCTTTGG CAGTGGCTGC CCAGTTAATG GGCAGAGCCA GCTGAGGGGT 13020
GCCTTTGGAA GTGGGGCACT GCCCACTGGC CCTGACTACT ATTCCCAGCT GCTTACCAAG 13080
AATAACCTGA GTAACCCGCC GACACCACCC TCGTCGCTGC CCCCCACCCC ACCCCCATCG 13140
GTGCAGCAGA AGATGGTGAA TGGCGTCACC CCATCTGAAG AGCTGGGGGA GCACCCCAAG 13200
GATGCTGCCT CTGCCCGGGA TACTGAAAGG GCACTGAGGG ATACTTCAGA GGTGAAGAGT 13260
CTAGACCTGC TGGCTGCCTT GCCTACACCC CCTCACAATC AGACTGAGGA TGTCAGGATG 13320
GAGAGTGATG ATAGCGATTC TCCTGACAGC ATTGTGCCAG CTTCATCCCC TGAAAGCATC 13380
TTGGGGGAGG AGGCCCCTCG TTTCCCTCAT CTGGGTTCAG GCCAGTGGGA GCAAGAGGAC 13440
CGGGCCCTCT CCCCTGTCAT CCCCCTCATT CCTCGGGCCA GCATCCCAGT CTTCCCAGAT 13500
ACCAAACCTT ATGGGGCCCT TGACCTGGAG GTCCCTGGAA AGCTGCCTGC CACAACTTGG 13560
GAAAAGGGCA AAGGAAGTGA GGTGTCAGTC ATGCTCACAG TCTCTGCTGC TGCAGCCAAG 13620
AACCTGAATG GCGTGATGGT GGCAGTGGCG GAGCTGCTAA GCATGAAGAT CCCCAACTCC 13680
TATGAGGTGC TGTTCCCAGA GAGCCCCGCC CGGGCAGGCA CTGAGCCTAA GAAGGGGGAA 13740
GCTGAGGGTC CTGGTGGGAA GGAAAAGGGT CTGGGAGGCA AGAGCCCAGA CACTGGCCCT 13800
GATTGGCTGA AGCAGTTTGA TGCAGTATTG CCTGGCTATA CCCTGAAGAG CCAACTAGAC 13860
ATCTTGAGCC TCCTGAAACA GGAGAGCCCC GCCCCAGAGC CACCCACTCA GCACAGCTAT 13920
ACCTACAATG TCTCCAATCT GGATGTGCGA CAGCTCTCGG CCCCACCTCC TGAAGAACCC 13980
TCCCCGCCCC CTTCCCCCTT GGCACCTTCT CCTGCCAGTC CCCCTACTGA ACCCTTGGTT 14040
GAACTTCCCG CCGAACCCTT GGCTGAGCCA CCCGTCCCCT CACCTCTGCC ACTGGCCTCA 14100
TCCCCTGAAT CAGCCCGACC CAAGCCCCGT GCCCGGCCCC CTGAAGAAGG TGAAGATTCC 14160
CGTCCTCCTC GCCTCAAGAA ATGGAAAGGA GTGCGCTGGA AGCGGCTTCG GCTGCTGCTG 14220
ACCATCCAGA AGGGCAGTGG GCGGCAGGAG GATGAGCGGG AAGTGGCAGA GTTTATGGAG 14280
CAGCTTGGCA CAGCCTTGCG ACCTGACAAG GTACCGCGAG ACATGCGTCG CTGCTGTTTC 14340
TGTCATGAGG AGGGTGACGG GGCCACTGAT GGGCCTGCCC GCCTGCTGAA CCTGGACCTG 14400
GACCTGTGGG TGCACCTCAA CTGTGCTCTT TGGTCCACGG AGGTGTATGA GACCCAGGGC 14460
GGGGCACTGA TGAATGTGGA GGTTGCCCTG CACCGAGGAC TGCTAACCAA GTGCTCCCTG 14520
TGCCAGCGAA CTGGTGCCAC CAGCAGCTGC AATCGCATGC GTTGCCCCAA TGTCTACCAT 14580
TTTGCTTGTG CCATCCGTGC CAAGTGCATG TTCTTCAAGG ACAAGACCAT GCTGTGTCCA 14640
ATGCATAAGA TCAAGGGGCC CTGTGAGCAA GAGCTGAGCT CTTTTGCTGT CTTCCGGCGG 14700
GTCTACATTG AGCGGGACGA GGTGAAGCAA ATCGCCAGCA TCATTCAGCG GGGAGAACGG 14760
CTGCACATGT TCCGTGTAGG AGGCCTTGTA TTCCACGCCA TCGGACAGCT GCTGCCTCAC 14820
CAGATGGCTG ACTTTCATAG TGCCACTGCC CTCTATCCCG TGGGCTACGA GGCCACGCGC 14880
ATCTATTGGA GCCTCCGCAC CAACAATCGT CGCTGTTGCT ATCGCTGTTC TATTGGTGAG 14940
AACAACGGGC GGCCGGAGTT TGTAATCAAA GTCATCGAGC AGGGCCTGGA GGACCTGGTC 15000
TTCACTGACG CCTCTCCCCA GGCCGTGTGG AATCGCATCA TTGAGCCTGT GGCTGCCATG 15060
AGAAAAGAGG CTGACATGCT GCGACTCTTC CCTGAGTATC TGAAGGGCGA GGAGCTCTTT 15120
GGGCTGACGG TGCATGCCGT GCTTCGCATA GCTGAATCAC TGCCCGGGGT GGAGAGCTGT 15180
CAAAACTATT TATTCCGCTA TGGGCGCCAC CCCCTTATGG AGCTGCCACT CATGATCAAC 15240
CCCACTGGCT GTGCCCGATC AGAGCCTAAA ATCCTCACAC ACTACAAACG GCCCCATACC 15300
CTGAACAGCA CCAGCATGTC TAAGGCATAT CAGAGCACCT TCACAGGCGA GACCAACACC 15360
CCGTACAGCA AGCAGTTTGT GCACTCCAAG TCATCTCAGT ACCGGCGGCT GCGCACTGAA 15420
TGGAAGAACA ACGTGTACCT GGCTCGCTCC CGTATCCAGG GCCTGGGGCT CTATGCAGCC 15480
AAGGACCTAG AAAAGCACAC AATGGTTATC GAGTACATTG GCACCATCAT TCGGAACGAG 15540
GTGGCCAACC GGCGGGAGAA AATCTACGAA GAGCAGAATC GAGGCATCTA CATGTTCCGA 15600
ATAAACAATG AACATGTGAT TGATGCTACG TTGACCGGCG GCCCTGCCAG GTACATTAAC 15660
CATTCCTGTG CCCCTAACTG TGTGGCTGAA GTCGTGACAT TTGACAAAGA GGACAAAATC 15720
ATCATCATCT CCAGCCGGCG TATCCCCAAA GGAGAGGAGC TAACCTATGA CTATCAGTTT 15780
GATTTTGAGG ACGATCAGCA CAAGATCCCC TGCCACTGTG GAGCCTGGAA TTGTCGGAAA 15840
TGGATGAACT AAGAAGCTTT GAGGCTACCA GGCAGGGGAG TCCCCCTACC CACAACCTCT 15900
TCCCTGAAAG GGATGAGGGG GAAGAGAGGT AGCAGCCAGA GCCAGGACCC AGGGCTGGGG 15960
CTGCCGGCTG ACCGGAGCCC CTGGAGCAGG AGGCTGGGGC AGAGGGCCCT AGGCCAAGCC 16020
CACCCTGGGC ACCAGGGACA ACCCTCTTCC CCACCACCGG CCCTCAGGCT GGCATCTCTG 16080
CCCCCAGCTC CAGGAGGGGC CAGACAGAAG CAGCCATTGG GCATCTCAGG TTTGAGGGGG 16140
ATATGGGCCG GGAACTACCC AGAAGCATCT GGGAGGCAGC AGGGTGGGGG AAGAGGATGT 16200
GTGGCCGGGC CTCACAGCCC TGCTGCTCCC ACTGACCTCT CCGGCCCAAC TCACGGCTGC 16260
AAAGAGACTT GACTAAGCTT GACAATCCCA AAGGCCGGGT CCCACACCTG GCCCTGCCTG 16320
CCGGGTCCTG CCCCCACCCT CACCCCCATC CCCCTCCCTC TTGATCTGTC TCTGTTTCCC 16380
TCTTTCCTCT GTGTTTCTGT TCTCTCTATG GGTTGTGTTT CCTTGTTTTC CACTCTGACA 16440
ATGCAACATG AACGGGAAGA GGCGCCCAGC TGCCTAGGAG GTCAAGCTGG GCAAGCCGGG 16500
CAAGGAGACC CTGCACCCAT ACCTACCTCA TTTAAGTGTT GGATTTTTTG TTGTTTGGAA 16560
TTGTGAGACC CTCTCNNNNN NNNNNNNNNN NNNNNNNNNN N 16602
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 99 0.0 3958
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 99 0.0 3957
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 98 0.0 3954
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 98 0.0 3944
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 98 0.0 3929
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 95 0.0 3887
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 98 0.0 3841
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 94 0.0 3757
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 94 0.0 3757
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 93 0.0 3754
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 94 0.0 3746
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 93 0.0 3727
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 93 0.0 3727
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 94 0.0 3699
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 93 0.0 3692
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 93 0.0 3690
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 92 0.0 3685
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 92 0.0 3670
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 93 0.0 3640
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 90 0.0 3621
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 90 0.0 3467
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 89 0.0 3466
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 89 0.0 3464
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 86 0.0 3432
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 80 0.0 3100
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 85 0.0 2352
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 91 0.0 2195
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 88 0.0 2002
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 89 0.0 1784
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 87 0.0 1711
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 90 0.0 1663
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 91 0.0 1609
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 93 0.0 1449
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 81 0.0 1395
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 66 0.0 1269
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 76 0.0 1215
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 100 0.0 1177
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 99 0.0 1157
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 55 0.0 1140
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 93 0.0 1114
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 89 0.0 1102
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 87 0.0 1031
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 84 0.0 1030
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 82 0.0 1021
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 82 0.0 1008
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 80 0.0 997
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 992
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 990
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 80 0.0 988
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 80 0.0 988
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 80 0.0 986
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 983
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 924
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 74 0.0 909
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 74 0.0 908
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 898
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 76 0.0 894
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 850
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 74 0.0 736
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 93 0.0 687
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 0.0 646
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 3e-172 605
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 2e-158 560
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 1e-97 357
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 66 4e-97 356
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 2e-92 341
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 4e-45 183
Created Date 25-Jun-2016