WERAM Information


Tag Content
WERAM ID WERAM-Anp-0025
Ensembl Protein ID ENSAPLP00000003226.1
Gene Name KMT2C
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSAPLG00000003614.1 ENSAPLT00000003825.1 ENSAPLP00000003226.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 4.00e-45 152.5 4719 4834
Me_Reader PHD 4.70e-23 80.9 233 4454
Organism Anas platyrhynchos
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+++iek+++viEY+G++ir+eva+++ek ye++++gvy+fr+d+d +v+dat +g+ ar+inhsc+pNc+
ENSAPLP00000003226.1 4719 NVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRIDND--HVIDATLTGGPARYINHSCAPNCV 4803
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++++ +ki+i ++r+I+kgeel+ydYk
ENSAPLP00000003226.1 4804 AEVVTFERGHKIIISSSRRIQKGEELCYDYK 4834
******************************7 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde.CddwfHlkCvklplsslp.egkswyCpsC 50 
C C++ ++ + +C+e C + +H+ C + + ++ s +Cp +
ENSAPLP00000003226.1 233 RCAYCKHLGAT---IKCCEEkCTQMYHYPCAAGAGTFQDfSNLSLLCPDH 279
69999333333...6688889*********98773333323345666665 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC+ +++ + C +C + +H+ C++++ ++l+ w Cp Ck
ENSAPLP00000003226.1 292 NCAVCDSPGDL-LDQLFCTTCGQHYHGMCLDIQVTPLKRA-GWQCPDCK 338
5****544444.459******************8888865.7******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C+ C++++e++k m+ Cd+Cd+ +H+ C+++ + s+p + w C++C+
ENSAPLP00000003226.1 339 VCQNCKHSGEDNK-MLVCDTCDKGYHTFCLQPVMDSVPTN-GWKCKNCR 385
8****77777766.************************77.7******8 PP
PHD.txt 3 iClvCg...kddegekemvqCdeCddwfHlkCvklplsslpeg.kswyCpsCke 52
+C++C+ +d++++ m+ C C++w+H++C +p s+l+++ k+++C C++
ENSAPLP00000003226.1 416 SCPFCEklcLQDFQKD-MLHCHMCKRWIHIECDRFPGSELESQlKDYICTLCRQ 468
6888853444455544.*******************9999988788******97 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
++C+vCg ++g++ ++ C++C + +H +Cv+++ +++ +k w+C +C+
ENSAPLP00000003226.1 912 DMCVVCGSFGQGAEGrLLACSQCGQCYHPYCVSIKITKVVLSKGWRCLECT 962
68****7544443334*******************999996658******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
t+C +Cgk+ + + ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSAPLP00000003226.1 962 TVCEACGKATDPGR-LLLCDDCDISYHTYCLDPPLQTVPKG-GWKCKWCV 1009
68999976666655.9*************************.9**99996 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C +++ +e+ ++qC +Cd+w+H+ C +l+ ++ e+ + C C+
ENSAPLP00000003226.1 1040 TCPICYRTYRDEELIIQCRQCDRWMHAICQNLNTEEEVENiadMGFDCTICR 1091
7****888888888*******************5444455554559999997 PP
PHD.txt 3 iClvCgkddegeke....mvqCdeCddwfHlkCv 32
+C +C+++++g + +++ d d w+Hl+C
ENSAPLP00000003226.1 4349 CC-FCHEEGDGLTDgparLLNLDL-DLWVHLNCA 4380
45.587777774445555666666.559999997 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C++C+k ++ + C C + +H +C ++ ++k +Cp +k
ENSAPLP00000003226.1 4408 KCMFCHKMGATS----GCHRlrCTNIYHFTCAIKAQCMFFKDKTMLCPMHK 4454
699**8777775....699999*********98885666677778898887 PP

Protein Sequence
(Fasta)
RPRSRGKTAV EDEDSMDGLE AAETENTVET EVKEQSAEED AQAEVDSTKQ LTPVLQRSVS 60
EESASSTASV GVEAKTSEQL CAFCYCGERS SLGQGDLKQF SPTPGYVIPW KNQPLNKSEA 120
SDSTDGACEK TPRQNSVPRK QRGQKKQQQS RSSIASCTSV STQTASDDQV VKFWDELSLV 180
GLPDDIDVQA LFEPTGHCWA HHRCAEWSMG VCQTEEQLSV NVDKAVVSGS TERCAYCKHL 240
GATIKCCEEK CTQMYHYPCA AGAGTFQDFS NLSLLCPDHI DQAPERSKEE ANCAVCDSPG 300
DLLDQLFCTT CGQHYHGMCL DIQVTPLKRA GWQCPDCKVC QNCKHSGEDN KMLVCDTCDK 360
GYHTFCLQPV MDSVPTNGWK CKNCRVCAEC GTRTSCQWHH NCLVCDSCYQ QQDNLSCPFC 420
EKLCLQDFQK DMLHCHMCKR WIHIECDRFP GSELESQLKD YICTLCRQEG DQTQSLLSNG 480
KEIQRELASL DLSLANEWKS SVSEHVHQVR VATSDFEDNS CSERLVEAVR LAVEQPITGN 540
GSSQESAVGC VTDGVLPEEK TTELETEVPD EVNSAEMEMS PQKTPATGDS QTEKMMEVME 600
GGQAVIQQEL DKQVEETQLL QGSVEASEVS TVDSRSLSSV PETVTVTPEI QGNESKGDVC 660
IHVEHQLEII KVQKEVEKGE TPEKSETPAL LEVETSSANE GVANSSPVAD KPIKSPAETY 720
PSLPPAVNLT KTNVSSSPDV SLDLHSARDV PHSQPPALPS TAGSALPTTY ISVTPKIGMG 780
KPAITKRKFS PGRPRSRQGA WSTNNTVSPP SWSPDISEGR DIFKSRQLPG SAIWSIKVGR 840
GSGFPGKRRP RGAGLSGRGG RGRSKLKNGV GTVVIPGVTT MDISSNKDEE ENSMHNTVVL 900
FSSSDKFTLH QDMCVVCGSF GQGAEGRLLA CSQCGQCYHP YCVSIKITKV VLSKGWRCLE 960
CTVCEACGKA TDPGRLLLCD DCDISYHTYC LDPPLQTVPK GGWKCKWCVW CRHCGATSPG 1020
LRCEWQNNYT QCAPCASLST CPICYRTYRD EELIIQCRQC DRWMHAICQN LNTEEEVENI 1080
ADMGFDCTIC RPYIPPTNVP SLDCCDSSVG AQIISKSKEL DPPKTYTQDG VCLTESGMSQ 1140
LQSLTVTVPR RKRTKPKLKL KIINQNSVAV LQTPPDAQSE HSRDGEMDDS REGDLMDCDG 1200
KSDSSPEREA ADDDTKVAEG ADGIKKRKRK PYRPGIGGFM VRQRSRTGQG KTKRSLSRKD 1260
SSGSVSEQLP GRDEGWTEQL PDTPVEESTP ASESTEKIKK RYRKKKNKLE ETFPVYLQEA 1320
FFGKDLLDTS RQNKMSLDNL PEESLPLSCK TNLTTNFLDP SSDPLRSSTS TPAQAKLGPQ 1380
GSTDDPLADL SEVLNTDDDI LGILSDDLVK SGDHSAGLDI GPISDDPSSL PQPNVNQSSR 1440
PLSEEQLDGI LSPELDKMVT DGAILGKLYK IPELGGKDVE DLFTAVLSPA TTQPAPLPQP 1500
PPPPQLMSLH SQGESAFPRV PLMNGLIGPN PHLPHTALTP GGGLSTFSPI TQQPYTDTRD 1560
KNPAFNPMVS DPNSSWAPSA PPLEGEGDTM SNAQRSTLKW EKEEALGEMA TVAPVLYTNI 1620
NFPNLKEEFP DWATRVKQIA KLWRKASSQE RAPYVQKARD NRAALRINKV QMSNDSMKRQ 1680
QQQDSIDPSS RIDSDLFKDP LKQRESEHEQ EWKFRQQMRQ KSKQQAKIEA TQKLEQVKNG 1740
SQQHLSQSGS DTPSSGIQSP LTPQTGNGSM SPAQQTFHKD LFAKQQLPAT PTSASSDDVF 1800
LKPQAPPPTP TRIQHDSLSQ SQAPQTSSQQ MFSPGSASTR PPSPMDPYAK MVGTPRPAPM 1860
NQNIVRRGSI TPLDTCAPQS AMSRPIQVTE PSGSRPSPVR DSCSSSPSSN DPYAKPPDTP 1920
RPVMPPDQFS KPLGVPRSPI VMEQSGKVPL TAGSGDPFTK PAPRTDTFQR QRVTSADAYA 1980
RPPLTPTPTP VDGSPGPFKT PMRPPQSPQD PYTSMPTTPR RVPVDPYERP ALTPRPVDNF 2040
SHNQPNDPYS QPPLTPHPAI KESFAHPPRI VRQQSDSFPQ SGPISRPASQ DPYSQPPGTP 2100
RPVADPYAQP PGTPRPTTVD PYMQQPPTPR PAQPADLFAQ SPASQRHSDP YAQPPGTPRP 2160
VLNDPYSQQP GTPRPGIPES FNRPAMTRPG LIPNRDPFLQ TSQNRLQGTF VRPPDPCSQT 2220
PRPAGPAITD SFSHGPAAPA RDPYDQPPMT PRPQPESFGT GQLAHDAAEQ SRPGSEGNFS 2280
ASANSPMSSQ GQQFPKVSQV PGPAPTTGVT DTQNTMNMSP QADTEKLRQR QKLREIILQQ 2340
QQQKKNAVRQ EKALQEQAAT PHATPLQHWQ QDNLNPIFNR PPPPYPGNIR SSVVPPGGPR 2400
FPVYPKDQRG PFPGDMNAMG MRPHGLRFGF PGGGHGPPSG QERFLSPQQP MQRPGVPPQL 2460
RRSLSIDMSR PVNNPQINNT SGISQHFPPQ GIQVQQHNIL GQAFIELRHR APDGRPRLPF 2520
TPSATNVMDP SAQHQRHAGF IPRQEFPGPR PAEPLRHNSQ GIPNQLQMST GLEHMSQPQQ 2580
DQINPENPPA LVIRSLSHPP VADAFPGPSL PTPAPSDESA NLQIPTQPSD GLEEKLDPDD 2640
PAVKDLDVKD LDGVEVKDLD DEDLENLNLD TEDGKGDELD TLDNLETNDP HLDDLLRSGE 2700
FDIIAYTDPD LDVGDKKGMF NEELDLSVPI DDKLDIQGKA EEPKQKEQGD KTEAHPETLS 2760
PQKKSTTENE IKTEVLSPKT HEEKKNENEK SDGNAESTST QQAAETELKD GEKAPAQPAN 2820
PDFPDKATAV PSQETTVPSP DAQGSAPLPV QGAVSSCSIS GSTPVLSSLL ANESSDNAEA 2880
RTLGSPSQSL PATQLNQASG IPQTLMTPGG QILENTLNSN LAMVPRINHN FSQGSPNPGF 2940
IQGQSANHNF GTGQTSSQTV AVANRPGPSG ISGPQQIMLP QPLAQQQNRE RPLLLEEQPL 3000
LLQDLLDQER QEQQQQRQMQ AMIRQRSEPF FPNIDFDAIT DPIMKAKMVA LKGINKVMAQ 3060
SNMGMPPMVM NRFPFMGQPV PGAQTSEGQN HMQQAITQDG SLTPQISRPN PPNFGPGFVN 3120
DSQRKQYEEW LQETQQLLQM QQKYLEEQIG AHRKSKKALS AKQRTAKKAG REFPEEDAEQ 3180
LKHVTEQQSM VQKQLEQIRK QQKEHAELIE EYRICGMAPH AIMPGVQPQP PMVPGGTSPA 3240
MNQQNFPMVA QQLQHQQHAV VIPGQPNPTR MPNLSGWQAA NAPASHLAMN PARMQPPMTP 3300
LPITPGTPAP VPGPNATAQS GPPPRVEFDD NNPFSESFQE RERKERLREQ QERQRIQLMQ 3360
EVDRQRALQQ RMELEQHGMI GSELNNRASL SQIPFFNSDL PCDFIQTPRP LQQSPQHQQQ 3420
QMGQMLQQGP VNSPPATNFM QSSERRPVGP TTFGPDGSAV AGGAPNFHNV KQTHGNLPGA 3480
TFTQNQVRPP FAPSVPSSTV PSSTALSSGA PCGSDSSVPQ ATNFPGSSQS LIQLYSDIIP 3540
EEKGKKKRTR KKKKDDDAES IKAPSTPHSD ITAPSTPTIS DSTSTPTVNT PSELTRHQDE 3600
QESVELTGPS TSNAAESQTS PELESKLPGS SLPQKQPSVS METEKDKAET STSIQEVKLE 3660
KAETDQCSGQ AEPKTENQSS VKVEEDKVTS QPASSAQSPA QPASVPAAKG ESGNELLKHL 3720
LKNKKSSSLL NQKSENSCRT EDETAGDKKL TEKQNPAEGA QTPGNQMQGV FGCSNSQLQR 3780
TDVGHETKKQ RNKRTQRTGE KAAPRSKKRK KEEEEKQAIY PNADTFIQLK QQNNLSNPPT 3840
PPASLPPTPP PVACQKLVNG FATTEELAGK AGMLGGHDVT KALGPKQFQL PFRPQDDLLV 3900
RAMAQGPKTV DVPASLPTPP HNNQEELRVQ DHCEDRDTPD SFVPSSSPES VVGMEISRYP 3960
DLSVVKEENP EPVPSPIIPI LPSSTGKGSE AKRNYIKSEP GSGALPGSGF FASQLGPSQN 4020
GPKSGLISVA ITLHPTAAEN ISNVVAAFSN LLHVRIPNSY EVSNAPDVPS SMAAANSHRV 4080
NPSLEYRQHL LLQGPQAGSL GPARISGSYG LKQPNVPFPA SNNGIAGYKD HSQNIAESSA 4140
LRPRWCSHCK VVVLGSGVRK SFKDLPFHKQ DSQEGPDKMK DVVFCSNNCF VLYSAAVQAK 4200
NSENKESVPS LPQSPMKERP PKAFHQYSNN ISTLDVHCLP QLQEKASPPS SPPIMFPPAF 4260
EAAKVEAKPD ELKVTVKLKP RLKAIHSSLD DCRPPSKKWK GMKWKKWSIQ IVIPKGSFKP 4320
PCEEEIDEFL KKLGTTLKPD PVLKDYRKCC FCHEEGDGLT DGPARLLNLD LDLWVHLNCA 4380
LWSTEVYETQ AGALINVELA LRRGLQMKCM FCHKMGATSG CHRLRCTNIY HFTCAIKAQC 4440
MFFKDKTMLC PMHKPKGTHE QELSYFAVFR RVYVQRDEVR QIASIVQRGE RDHTFRVGSL 4500
IFHAVGQLLP QQMQAFHSSK ALFPVGYEAS RLYWSMRYAN RRCRYLCSIE EKDGLPLFVI 4560
KVVEQGHEDL VLTDTTPKGV WDKILEPVAS VRKESEMLQL FPGYLKGEDL FGLTVSAVAR 4620
IAESLPGVEA CENYTFRYGR NPLMELPLAI NPTGCARAEP KMSTHVKRFV LRPHTLNSTS 4680
TSKSFQSTVT GELNAPYSKQ FVHSKSSQYR KMKTEWKSNV YLARSRIQGL GLYAARDIEK 4740
HTMVIEYIGT IIRNEVANRK EKLYESQNRG VYMFRIDNDH VIDATLTGGP ARYINHSCAP 4800
NCVAEVVTFE RGHKIIISSS RRIQKGEELC YDYKFDFEDD QHKIPCHCGA VNCRKWMN 4858
Nucleotide Sequence
(Fasta)
AGGCCTCGGA GTAGAGGAAA AACTGCAGTG GAGGATGAAG ACAGTATGGA TGGTTTGGAG 60
GCAGCAGAAA CAGAAAACAC TGTGGAAACA GAAGTCAAAG AGCAGTCTGC AGAAGAAGAT 120
GCCCAAGCAG AAGTGGACAG CACCAAGCAG CTAACACCAG TCCTTCAGCG CTCAGTGTCT 180
GAGGAGTCTG CAAGCTCCAC CGCCTCTGTT GGTGTCGAAG CGAAAACCAG CGAACAACTC 240
TGCGCCTTTT GTTACTGTGG TGAACGAAGC TCATTAGGAC AAGGAGACTT AAAACAATTC 300
AGTCCAACTC CTGGGTATGT TATCCCATGG AAGAACCAGC CTTTAAACAA GAGCGAAGCC 360
AGCGACAGCA CCGATGGAGC TTGCGAGAAA ACTCCAAGGC AAAACTCAGT GCCGCGCAAA 420
CAGAGAGGAC AGAAGAAGCA GCAGCAATCT CGATCAAGTA TAGCATCGTG TACAAGTGTA 480
AGCACCCAAA CTGCTTCTGA CGATCAGGTT GTCAAATTCT GGGATGAACT CAGTTTAGTT 540
GGCTTACCAG ATGACATTGA TGTTCAAGCC TTATTTGAGC CAACAGGTCA TTGCTGGGCT 600
CATCACCGCT GTGCGGAGTG GTCAATGGGA GTCTGCCAGA CTGAAGAGCA GCTATCGGTG 660
AACGTGGACA AAGCTGTTGT CTCAGGGAGC ACAGAAAGAT GTGCATATTG TAAGCACCTT 720
GGAGCCACTA TCAAATGCTG TGAAGAGAAA TGTACCCAGA TGTACCATTA CCCCTGTGCT 780
GCTGGAGCAG GCACGTTTCA GGATTTCAGT AACTTGTCCC TTCTTTGTCC AGACCACATT 840
GATCAGGCTC CTGAAAGATC AAAGGAAGAA GCAAATTGTG CGGTGTGCGA CAGCCCTGGA 900
GACCTCTTAG ATCAACTTTT TTGTACTACC TGTGGCCAGC ACTACCATGG GATGTGCCTA 960
GACATACAAG TTACACCTTT AAAACGAGCA GGTTGGCAGT GTCCTGATTG CAAAGTGTGC 1020
CAGAACTGCA AACATTCTGG GGAAGACAAC AAGATGCTGG TGTGTGATAC ATGCGACAAA 1080
GGCTATCATA CTTTCTGTCT ACAACCAGTT ATGGACTCTG TACCAACAAA TGGTTGGAAA 1140
TGTAAAAACT GTAGAGTATG TGCTGAGTGT GGCACACGAA CCAGCTGTCA GTGGCACCAT 1200
AATTGTTTGG TGTGTGACAG TTGTTACCAA CAGCAAGACA ACTTATCCTG TCCTTTCTGT 1260
GAAAAACTGT GCCTCCAAGA CTTCCAGAAA GATATGTTGC ATTGTCACAT GTGCAAAAGG 1320
TGGATTCACA TAGAATGTGA CAGATTTCCA GGTAGCGAAT TGGAGTCTCA GCTGAAGGAC 1380
TACATCTGCA CTCTCTGTAG ACAAGAAGGG GATCAGACTC AGTCATTACT CAGCAATGGT 1440
AAGGAAATAC AGAGAGAACT TGCATCTCTT GATCTGAGTC TGGCAAATGA GTGGAAGAGC 1500
TCTGTCAGTG AACATGTGCA CCAGGTTAGA GTAGCAACTT CAGACTTTGA AGATAACTCC 1560
TGTTCTGAAA GATTAGTAGA AGCTGTTAGA CTCGCAGTAG AACAACCCAT TACTGGTAAT 1620
GGCTCCAGTC AAGAATCAGC TGTTGGATGC GTCACAGACG GCGTATTGCC AGAAGAAAAG 1680
ACCACAGAAC TGGAAACAGA AGTCCCTGAT GAAGTTAACT CTGCTGAAAT GGAAATGTCT 1740
CCCCAAAAGA CACCAGCAAC TGGTGACAGT CAGACCGAGA AAATGATGGA AGTGATGGAA 1800
GGTGGCCAAG CTGTAATCCA ACAGGAGTTA GACAAACAAG TGGAAGAGAC ACAGCTGCTT 1860
CAAGGTAGCG TTGAGGCATC AGAAGTATCC ACAGTTGACT CTCGGTCTCT GTCTTCAGTA 1920
CCTGAGACCG TAACTGTGAC GCCAGAAATA CAAGGGAATG AAAGCAAAGG AGATGTTTGT 1980
ATCCATGTAG AGCATCAACT GGAAATAATA AAAGTGCAAA AAGAGGTAGA AAAAGGAGAA 2040
ACCCCTGAAA AGTCAGAAAC ACCAGCTCTT CTGGAAGTAG AAACTTCTTC TGCAAATGAA 2100
GGTGTTGCAA ACAGTTCACC GGTTGCAGAC AAACCCATAA AATCACCAGC TGAGACTTAC 2160
CCCTCACTTC CCCCAGCAGT TAACCTAACC AAGACAAATG TGTCTTCCTC ACCGGATGTT 2220
TCCTTAGACT TGCATTCAGC ACGGGATGTA CCGCACAGCC AGCCTCCAGC GTTGCCTTCC 2280
ACTGCTGGAA GTGCTCTCCC AACGACTTAC ATATCAGTCA CTCCAAAAAT TGGCATGGGG 2340
AAACCAGCCA TCACCAAACG GAAGTTTTCT CCCGGAAGAC CCAGATCAAG ACAGGGGGCT 2400
TGGAGTACCA ATAATACAGT CAGCCCACCT TCCTGGTCCC CAGACATTTC AGAAGGTCGG 2460
GACATTTTTA AATCCAGACA GCTTCCTGGC AGTGCCATTT GGAGCATCAA AGTGGGCCGT 2520
GGATCAGGAT TCCCGGGGAA GCGGAGGCCT CGTGGTGCAG GTCTTTCGGG ACGAGGTGGC 2580
AGAGGTAGAT CGAAACTGAA AAATGGAGTT GGGACTGTAG TCATTCCAGG GGTCACAACT 2640
ATGGATATCT CATCTAACAA AGATGAGGAA GAAAACTCAA TGCATAATAC AGTTGTCCTC 2700
TTCTCTAGCA GCGACAAGTT CACTCTCCAT CAGGATATGT GCGTAGTTTG TGGGAGTTTT 2760
GGCCAAGGTG CTGAGGGCAG GTTGCTCGCC TGCTCTCAAT GCGGTCAGTG TTACCATCCA 2820
TACTGTGTCA GTATTAAGAT TACTAAAGTG GTGCTTAGTA AAGGCTGGAG GTGTTTGGAG 2880
TGTACGGTGT GTGAAGCCTG TGGGAAAGCT ACAGATCCAG GAAGGCTGCT GCTCTGTGAT 2940
GACTGCGACA TCAGCTACCA CACTTACTGC TTAGATCCGC CCCTGCAGAC CGTTCCAAAA 3000
GGAGGCTGGA AGTGCAAATG GTGTGTTTGG TGCAGACACT GTGGAGCAAC TTCGCCGGGT 3060
TTAAGATGCG AATGGCAAAA TAATTACACA CAGTGTGCTC CCTGTGCAAG TTTGTCTACC 3120
TGCCCCATCT GCTATCGCAC TTACAGGGAT GAAGAGCTTA TAATTCAGTG TAGACAATGT 3180
GATCGATGGA TGCATGCAAT CTGTCAGAAC TTAAACACAG AGGAGGAAGT GGAAAACATA 3240
GCTGATATGG GTTTTGACTG TACCATTTGT AGGCCATACA TACCACCAAC AAACGTGCCT 3300
TCTTTGGACT GTTGTGATTC ATCAGTTGGA GCTCAGATTA TTTCAAAATC AAAAGAACTA 3360
GACCCACCGA AGACATACAC GCAGGATGGA GTCTGCTTGA CAGAATCCGG CATGTCTCAG 3420
TTACAGAGCC TCACTGTCAC AGTACCTAGA AGAAAACGGA CGAAACCAAA GCTTAAACTG 3480
AAGATTATAA ATCAGAACAG TGTGGCTGTG CTTCAGACAC CCCCGGATGC CCAGTCAGAA 3540
CACTCAAGAG ATGGCGAGAT GGATGACAGT AGAGAGGGTG ATCTTATGGA CTGTGATGGA 3600
AAATCGGATT CCAGCCCAGA ACGGGAGGCT GCAGATGATG ATACCAAAGT GGCAGAGGGA 3660
GCTGATGGGA TTAAGAAAAG AAAGCGAAAA CCATACAGAC CTGGTATTGG TGGATTTATG 3720
GTACGTCAAA GAAGTCGTAC AGGGCAGGGG AAGACTAAAA GATCTCTCTC CAGGAAAGAT 3780
TCTTCTGGAT CTGTTTCTGA GCAGTTGCCT GGCAGAGATG AAGGTTGGAC AGAACAGTTG 3840
CCAGACACTC CAGTTGAGGA GTCTACCCCA GCTTCTGAAA GTACTGAGAA AATAAAAAAG 3900
CGTTACAGGA AAAAGAAGAA TAAACTTGAA GAAACCTTTC CTGTCTACTT GCAGGAAGCT 3960
TTCTTTGGGA AGGATCTACT AGATACCAGT AGACAAAACA AGATGAGCCT CGATAATTTG 4020
CCTGAAGAAT CACTTCCACT CTCATGTAAA ACAAATCTGA CCACCAATTT CCTGGATCCT 4080
TCATCAGACC CACTTCGTAG CTCAACCTCC ACTCCAGCTC AGGCAAAACT GGGACCTCAA 4140
GGTAGCACTG ACGATCCCTT GGCTGATCTT TCTGAGGTCT TAAACACCGA TGATGATATT 4200
CTTGGAATAC TTTCTGATGA TTTGGTGAAG TCTGGAGATC ATTCAGCTGG ATTGGATATT 4260
GGCCCCATCT CTGATGATCC TTCTTCTCTG CCTCAGCCAA ATGTCAACCA GAGTTCACGG 4320
CCATTGAGTG AAGAACAATT GGATGGAATC CTCAGTCCAG AACTAGACAA AATGGTCACA 4380
GATGGTGCTA TTCTTGGCAA ATTGTACAAA ATCCCAGAAC TAGGAGGAAA GGATGTTGAA 4440
GATCTGTTCA CGGCGGTATT AAGTCCAGCA ACCACACAGC CAGCACCTTT GCCGCAGCCT 4500
CCACCTCCAC CACAACTTAT GTCTTTGCAC AGCCAAGGAG AGAGTGCATT TCCAAGAGTA 4560
CCACTTATGA ATGGTCTGAT TGGCCCTAAC CCTCATCTCC CTCATACAGC TTTGACTCCT 4620
GGAGGTGGAT TGAGCACCTT CTCTCCCATA ACACAGCAAC CATATACTGA TACCAGGGAT 4680
AAGAATCCAG CATTCAATCC AATGGTAAGC GATCCAAACA GCTCGTGGGC ACCATCTGCT 4740
CCACCTTTGG AAGGTGAAGG TGATACGATG TCCAATGCTC AAAGAAGCAC TCTTAAGTGG 4800
GAAAAAGAGG AAGCATTGGG TGAAATGGCA ACAGTAGCAC CTGTTCTCTA TACAAATATC 4860
AACTTCCCTA ACCTAAAGGA AGAATTCCCA GACTGGGCTA CAAGAGTGAA GCAGATTGCT 4920
AAACTGTGGA GGAAGGCAAG CTCGCAAGAG AGGGCTCCAT ATGTGCAAAA AGCCAGAGAT 4980
AATAGAGCTG CTTTGCGCAT CAATAAAGTA CAGATGTCAA ATGACTCCAT GAAAAGGCAA 5040
CAGCAACAGG ATAGCATTGA TCCTAGCTCA CGTATCGACT CCGACCTCTT CAAAGATCCA 5100
TTAAAACAGA GGGAATCGGA GCATGAACAA GAATGGAAAT TCAGACAGCA AATGCGTCAA 5160
AAAAGTAAGC AGCAAGCCAA AATAGAAGCC ACGCAGAAGC TTGAACAAGT GAAAAATGGT 5220
TCACAACAGC ATCTGAGCCA GTCTGGCTCA GACACGCCTA GCAGTGGGAT CCAGAGCCCC 5280
TTGACACCGC AGACTGGCAA TGGCAGTATG TCCCCTGCGC AGCAAACATT CCACAAGGAT 5340
CTATTTGCAA AGCAGCAGCT ACCTGCTACA CCTACTTCAG CATCCTCCGA TGACGTGTTC 5400
CTAAAACCAC AGGCCCCACC CCCTACTCCA ACCCGAATCC AACACGATTC CCTGTCTCAG 5460
TCTCAGGCTC CCCAGACATC CTCCCAGCAG ATGTTCTCTC CAGGTTCCGC AAGCACAAGG 5520
CCTCCTTCTC CAATGGATCC ATACGCTAAG ATGGTGGGAA CACCTAGACC AGCGCCTATG 5580
AATCAAAACA TTGTTAGAAG GGGTAGCATC ACACCGTTAG ACACCTGTGC ACCACAGTCA 5640
GCCATGTCCA GGCCCATCCA GGTCACGGAA CCATCAGGAA GCAGGCCTTC GCCAGTCAGG 5700
GATTCATGTT CTTCATCTCC AAGCAGCAAT GATCCCTATG CAAAGCCACC AGACACACCC 5760
AGGCCTGTCA TGCCACCAGA TCAGTTCTCC AAACCCCTGG GGGTCCCGAG GTCACCCATA 5820
GTTATGGAGC AGTCAGGGAA AGTTCCTCTG ACAGCTGGAA GCGGTGATCC CTTCACTAAG 5880
CCAGCTCCCA GAACTGATAC CTTTCAGAGA CAGAGAGTAA CTTCTGCTGA TGCATATGCA 5940
CGGCCCCCGT TGACTCCTAC TCCTACTCCT GTTGATGGTA GCCCTGGACC CTTTAAAACT 6000
CCCATGCGCC CACCTCAGTC TCCACAGGAT CCTTACACCT CGATGCCAAC CACACCGAGG 6060
CGCGTTCCTG TTGATCCGTA TGAGCGGCCC GCTTTGACAC CGAGGCCAGT GGATAACTTC 6120
TCCCATAATC AGCCTAACGA TCCGTACAGC CAGCCCCCCC TTACTCCCCA TCCTGCAATA 6180
AAGGAGTCTT TCGCCCATCC GCCCCGGATA GTGCGCCAGC AGAGTGATTC TTTCCCTCAA 6240
TCCGGACCCA TTTCAAGGCC AGCTTCTCAG GACCCTTACT CCCAACCTCC AGGTACTCCT 6300
CGGCCAGTTG CCGATCCTTA TGCCCAACCT CCAGGAACTC CTCGGCCCAC CACAGTCGAT 6360
CCGTATATGC AGCAACCACC AACACCAAGA CCTGCACAGC CAGCAGATTT ATTTGCTCAG 6420
TCCCCAGCAA GTCAGAGACA TTCTGATCCA TACGCTCAAC CTCCTGGAAC GCCAAGGCCA 6480
GTTCTGAATG ATCCTTACTC TCAGCAACCA GGAACTCCAA GGCCAGGAAT ACCAGAGAGT 6540
TTTAACAGAC CTGCAATGAC AAGACCAGGA TTAATACCAA ACAGAGATCC TTTCCTGCAG 6600
ACATCGCAGA ACAGGTTGCA GGGCACTTTT GTCAGGCCGC CAGACCCATG TTCTCAAACT 6660
CCCAGACCAG CAGGACCTGC AATAACGGAT TCATTTAGCC ATGGTCCTGC TGCTCCAGCA 6720
CGTGACCCCT ATGATCAACC ACCCATGACT CCAAGACCTC AGCCAGAATC TTTTGGAACT 6780
GGTCAGCTAG CCCACGATGC TGCCGAACAG TCACGGCCTG GATCTGAGGG CAACTTCAGT 6840
GCATCTGCAA ATTCTCCAAT GAGTTCTCAA GGGCAGCAGT TTCCCAAAGT TTCACAAGTT 6900
CCTGGCCCAG CACCCACTAC AGGAGTAACA GATACACAGA ATACCATGAA TATGTCTCCT 6960
CAAGCAGATA CTGAAAAATT AAGACAGCGC CAGAAATTAC GTGAAATTAT TCTACAACAA 7020
CAGCAGCAGA AGAAGAATGC AGTTCGTCAG GAAAAAGCCT TACAGGAGCA AGCAGCTACT 7080
CCCCATGCAA CTCCTCTTCA GCATTGGCAG CAAGACAACT TAAATCCAAT TTTTAACCGT 7140
CCTCCTCCTC CGTATCCTGG GAATATTAGG TCCTCTGTTG TCCCTCCAGG CGGGCCAAGG 7200
TTCCCAGTAT ACCCAAAAGA TCAACGCGGA CCATTTCCTG GAGATATGAA TGCCATGGGG 7260
ATGAGACCAC ATGGTCTTAG ATTTGGGTTT CCAGGAGGTG GCCATGGTCC ACCATCAGGT 7320
CAAGAACGTT TCCTCAGCCC TCAGCAGCCG ATGCAACGCC CTGGAGTCCC ACCACAGCTG 7380
AGAAGATCTC TGTCTATAGA TATGTCTCGG CCAGTGAACA ATCCGCAAAT AAATAATACA 7440
TCTGGGATCT CGCAGCATTT TCCTCCGCAG GGAATTCAAG TTCAGCAGCA CAATATATTG 7500
GGTCAGGCAT TTATTGAACT ACGTCACAGA GCTCCTGATG GGAGGCCAAG GCTGCCTTTT 7560
ACTCCTTCTG CAACAAATGT TATGGATCCA TCAGCGCAAC ATCAGCGACA TGCAGGGTTT 7620
ATACCCAGGC AGGAATTTCC GGGCCCAAGA CCGGCAGAAC CACTGAGACA TAATTCTCAA 7680
GGTATACCTA ACCAGTTGCA GATGTCTACG GGTTTGGAGC ATATGTCACA GCCCCAGCAG 7740
GATCAAATCA ATCCTGAAAA CCCACCTGCA CTTGTAATAC GTTCTCTAAG CCATCCACCA 7800
GTGGCCGATG CTTTCCCAGG ACCATCTTTG CCTACACCTG CACCAAGTGA TGAATCGGCA 7860
AATTTACAGA TTCCTACCCA GCCAAGTGAT GGTCTAGAAG AAAAGCTTGA TCCTGATGAT 7920
CCTGCTGTAA AAGATTTGGA TGTGAAAGAC CTTGATGGGG TTGAAGTCAA AGATTTGGAT 7980
GATGAGGATC TGGAAAATTT GAATCTAGAC ACAGAGGATG GAAAAGGGGA TGAACTGGAC 8040
ACTTTAGATA ACTTGGAAAC CAATGATCCT CACCTTGATG ACCTTCTAAG ATCTGGGGAA 8100
TTTGATATAA TTGCATACAC AGACCCCGAC CTTGACGTGG GGGATAAGAA AGGAATGTTT 8160
AATGAAGAAC TAGATCTTAG TGTCCCCATT GATGACAAAC TAGATATCCA GGGCAAGGCA 8220
GAAGAACCAA AACAGAAGGA GCAAGGGGAT AAAACTGAAG CCCATCCTGA GACCCTGTCG 8280
CCACAGAAGA AATCTACTAC AGAAAATGAA ATTAAAACCG AAGTGCTTTC CCCAAAGACT 8340
CATGAGGAAA AGAAGAATGA AAATGAAAAA AGCGATGGAA ATGCTGAATC AACAAGTACT 8400
CAACAGGCTG CCGAAACGGA GTTGAAAGAT GGAGAAAAGG CTCCTGCACA GCCTGCTAAC 8460
CCAGACTTCC CTGACAAAGC TACTGCTGTT CCCAGTCAAG AAACAACTGT GCCTAGTCCA 8520
GATGCTCAGG GATCTGCTCC ATTGCCTGTT CAAGGAGCAG TAAGTTCCTG CAGTATTTCT 8580
GGGTCCACAC CAGTCCTCTC GAGTTTGCTA GCTAATGAAA GCTCAGATAA TGCTGAAGCA 8640
AGGACACTAG GGTCTCCCTC TCAATCTTTG CCAGCAACAC AACTGAACCA AGCATCAGGT 8700
ATTCCACAAA CACTAATGAC ACCTGGTGGA CAGATCCTGG AAAACACTTT AAATTCCAAT 8760
TTGGCCATGG TGCCACGAAT AAACCATAAT TTTTCTCAAG GGTCACCAAA CCCTGGATTT 8820
ATTCAGGGTC AGTCAGCAAA TCATAACTTT GGGACAGGAC AGACGTCTAG TCAAACTGTG 8880
GCTGTAGCAA ACCGTCCTGG TCCTAGTGGC ATATCTGGTC CTCAGCAGAT AATGCTCCCT 8940
CAACCATTAG CTCAACAACA AAACCGAGAG AGACCACTGC TGCTGGAGGA GCAGCCACTT 9000
CTTCTGCAGG ATCTTTTGGA TCAGGAAAGG CAGGAGCAGC AGCAGCAGCG GCAGATGCAA 9060
GCCATGATTC GCCAACGCTC GGAGCCATTT TTCCCCAATA TTGACTTCGA TGCAATTACA 9120
GATCCTATAA TGAAAGCAAA AATGGTAGCT CTGAAGGGGA TCAATAAAGT CATGGCACAG 9180
AGTAACATGG GGATGCCACC AATGGTTATG AACAGGTTTC CCTTTATGGG TCAGCCGGTG 9240
CCGGGGGCTC AGACCAGCGA AGGCCAGAAT CACATGCAGC AGGCCATTAC ACAGGATGGA 9300
AGTTTGACAC CTCAGATTTC TAGACCGAAT CCTCCCAATT TTGGTCCTGG TTTTGTCAAT 9360
GATTCACAAA GAAAGCAATA TGAGGAATGG CTGCAAGAAA CACAACAGCT TCTCCAAATG 9420
CAACAGAAGT ACCTTGAGGA GCAAATCGGA GCACACAGAA AATCAAAAAA AGCACTCTCA 9480
GCAAAACAGC GTACAGCCAA GAAAGCTGGC CGTGAGTTCC CAGAAGAAGA TGCAGAACAG 9540
CTTAAACATG TTACAGAGCA GCAAAGTATG GTCCAAAAAC AGCTAGAACA GATTCGAAAG 9600
CAGCAGAAGG AGCATGCAGA ACTAATTGAG GAATATCGAA TCTGTGGAAT GGCACCCCAC 9660
GCCATCATGC CAGGAGTCCA GCCTCAACCA CCGATGGTTC CCGGAGGAAC GTCACCAGCA 9720
ATGAATCAGC AAAACTTCCC CATGGTGGCA CAACAGCTCC AGCACCAGCA GCATGCAGTG 9780
GTGATCCCGG GGCAGCCCAA CCCAACCAGA ATGCCAAATT TATCGGGATG GCAAGCTGCA 9840
AATGCCCCTG CGAGTCATCT TGCCATGAAT CCAGCAAGGA TGCAGCCTCC AATGACGCCG 9900
TTGCCAATTA CTCCTGGTAC ACCAGCTCCC GTGCCTGGTC CAAATGCAAC TGCACAGTCA 9960
GGGCCACCAC CAAGGGTGGA GTTTGATGAC AACAACCCTT TCAGTGAAAG TTTTCAAGAG 10020
CGGGAGAGGA AGGAGCGTTT ACGAGAGCAG CAGGAACGGC AACGCATCCA GCTTATGCAG 10080
GAGGTAGATC GACAGAGGGC TTTGCAGCAG AGAATGGAAC TAGAACAGCA TGGTATGATA 10140
GGGTCTGAAC TAAATAACAG AGCATCCTTA TCGCAGATAC CTTTCTTTAA TTCTGATCTG 10200
CCTTGTGATT TCATCCAAAC ACCGCGCCCT CTTCAGCAGT CTCCGCAGCA CCAGCAGCAA 10260
CAAATGGGAC AAATGCTGCA GCAAGGTCCT GTGAACTCAC CTCCCGCCAC AAATTTCATG 10320
CAAAGCAGTG AGCGAAGGCC AGTGGGACCT ACAACTTTTG GACCCGATGG ATCTGCTGTT 10380
GCGGGTGGAG CCCCCAATTT CCATAATGTA AAACAAACTC ACGGGAATCT TCCTGGGGCC 10440
ACCTTCACGC AGAACCAAGT CAGGCCCCCA TTTGCTCCTT CTGTACCTTC ATCTACAGTA 10500
CCTTCATCTA CAGCACTCAG CAGTGGTGCC CCGTGTGGTT CGGACAGTAG TGTGCCCCAG 10560
GCAACAAACT TCCCTGGATC AAGCCAGTCT CTCATACAGC TGTATTCTGA CATAATTCCA 10620
GAAGAGAAAG GGAAAAAGAA AAGGACACGA AAAAAGAAAA AGGACGATGA CGCTGAGTCA 10680
ATAAAAGCCC CCTCGACTCC CCATTCAGAC ATTACTGCAC CGTCAACTCC AACCATTTCT 10740
GATTCTACCT CCACCCCCAC AGTTAATACC CCCAGCGAAC TTACACGTCA TCAGGATGAG 10800
CAAGAGTCGG TGGAGTTAAC AGGTCCATCA ACATCAAATG CAGCAGAGAG CCAGACCTCT 10860
CCAGAGCTGG AAAGTAAGCT CCCCGGCAGC AGCTTGCCAC AGAAACAGCC AAGCGTAAGT 10920
ATGGAGACTG AAAAGGATAA AGCAGAGACA TCTACCAGCA TTCAAGAAGT TAAACTAGAA 10980
AAGGCAGAAA CTGATCAATG CTCAGGTCAA GCTGAGCCTA AAACAGAAAA TCAAAGCAGT 11040
GTTAAGGTGG AAGAAGATAA GGTCACGTCA CAGCCTGCCT CTTCAGCTCA GAGTCCAGCC 11100
CAGCCAGCTA GCGTTCCAGC AGCAAAAGGG GAGTCAGGGA ATGAATTACT GAAACACCTG 11160
CTTAAAAACA AAAAATCCTC ATCTCTTCTA AATCAGAAAT CAGAGAACAG CTGTCGAACA 11220
GAAGACGAGA CTGCTGGGGA TAAGAAGTTA ACAGAGAAGC AGAATCCAGC AGAAGGAGCG 11280
CAAACTCCAG GAAACCAGAT GCAAGGTGTG TTTGGGTGTA GTAACAGCCA GCTTCAGAGA 11340
ACAGATGTGG GACATGAAAC CAAGAAGCAG AGAAATAAGC GAACTCAGAG GACGGGGGAG 11400
AAGGCAGCTC CTCGGTCCAA GAAAAGAAAA AAAGAGGAAG AAGAAAAACA AGCGATTTAC 11460
CCTAACGCTG ATACATTCAT CCAGCTCAAG CAACAGAACA ATCTGAGTAA TCCTCCAACA 11520
CCCCCAGCCT CTCTTCCTCC CACACCACCT CCTGTGGCAT GCCAGAAGTT GGTAAATGGC 11580
TTTGCAACCA CTGAAGAACT GGCTGGCAAG GCTGGTATGT TAGGAGGTCA TGATGTTACC 11640
AAAGCTCTGG GACCAAAGCA GTTCCAGTTA CCTTTCAGAC CACAAGACGA TTTACTAGTA 11700
AGAGCAATGG CTCAGGGCCC TAAAACTGTG GATGTTCCCG CTTCGCTTCC AACACCACCT 11760
CACAACAATC AGGAGGAGTT AAGGGTTCAA GATCACTGTG AGGACAGGGA CACTCCTGAC 11820
AGCTTCGTTC CTTCTTCCTC TCCTGAAAGT GTGGTGGGAA TGGAAATAAG TAGGTATCCC 11880
GACTTGTCAG TCGTCAAGGA GGAGAACCCA GAGCCTGTAC CATCTCCCAT CATTCCCATC 11940
CTACCTAGCA GTACTGGAAA AGGTTCAGAA GCCAAAAGGA ATTACATAAA GTCAGAACCT 12000
GGTTCTGGAG CATTACCTGG TTCTGGCTTC TTTGCCTCTC AGCTTGGTCC ATCCCAGAAT 12060
GGTCCTAAAT CTGGCCTTAT ATCTGTAGCA ATTACATTAC ATCCCACAGC TGCTGAGAAT 12120
ATTAGTAACG TTGTGGCGGC CTTCTCCAAC CTGCTGCACG TCCGAATTCC CAACAGTTAT 12180
GAGGTTAGTA ATGCTCCAGA CGTCCCATCC TCCATGGCAG CGGCCAACAG TCACAGGGTG 12240
AACCCATCTC TGGAGTACAG GCAGCACTTA CTGCTTCAGG GTCCTCAAGC AGGGTCCTTG 12300
GGACCTGCCA GGATCTCAGG GTCTTACGGA CTGAAGCAAC CCAACGTGCC ATTCCCTGCA 12360
AGCAACAATG GTATAGCTGG CTATAAGGAT CACAGTCAAA ACATTGCAGA AAGCTCAGCA 12420
CTGAGACCTC GATGGTGCTC TCATTGCAAA GTGGTGGTTC TTGGTAGTGG TGTGCGGAAG 12480
TCCTTCAAAG ACCTGCCTTT CCATAAACAG GATTCCCAAG AAGGGCCTGA TAAAATGAAG 12540
GACGTTGTAT TCTGTAGCAA CAACTGCTTT GTTCTCTATT CAGCAGCTGT GCAGGCAAAA 12600
AACTCCGAGA ACAAAGAGTC GGTTCCATCT TTGCCGCAGT CACCGATGAA GGAGAGGCCA 12660
CCCAAAGCAT TCCATCAGTA CAGCAACAAC ATCTCCACCT TGGATGTCCA TTGTCTGCCT 12720
CAGTTGCAGG AAAAAGCGTC TCCACCATCC TCGCCCCCGA TCATGTTCCC CCCTGCATTC 12780
GAAGCAGCCA AGGTAGAGGC GAAACCGGAC GAGCTTAAGG TAACAGTGAA ACTAAAACCT 12840
AGGTTAAAAG CAATACACAG CAGTCTTGAT GACTGTCGGC CTCCTAGTAA GAAATGGAAA 12900
GGAATGAAGT GGAAGAAGTG GAGCATTCAG ATTGTGATTC CTAAAGGATC ATTCAAACCT 12960
CCTTGTGAAG AAGAAATAGA TGAATTTCTC AAAAAATTGG GCACAACCCT TAAACCGGAT 13020
CCCGTGCTTA AAGACTACAG AAAATGTTGC TTCTGTCATG AGGAAGGTGA TGGATTAACT 13080
GATGGACCAG CAAGGCTTCT GAACCTGGAT TTAGACCTTT GGGTCCATTT GAACTGTGCT 13140
CTTTGGTCTA CAGAAGTCTA CGAGACACAA GCTGGTGCCT TAATAAACGT GGAACTAGCA 13200
CTGCGGAGAG GCTTGCAGAT GAAGTGCATG TTCTGTCACA AAATGGGTGC CACCAGCGGT 13260
TGTCACAGGT TAAGGTGCAC CAATATTTAT CACTTTACCT GTGCCATTAA AGCACAATGC 13320
ATGTTTTTTA AAGACAAGAC CATGCTTTGC CCCATGCACA AACCAAAGGG AACTCACGAG 13380
CAAGAACTCA GTTACTTTGC AGTCTTCAGG AGGGTCTACG TGCAGCGCGA CGAGGTGCGG 13440
CAGATCGCTA GCATCGTGCA GCGAGGAGAA CGCGACCACA CCTTCCGTGT GGGGAGCCTG 13500
ATCTTCCACG CCGTCGGTCA GCTGCTGCCG CAGCAGATGC AGGCATTTCA CTCCTCGAAA 13560
GCGCTCTTCC CCGTGGGCTA CGAGGCCAGC AGGCTGTACT GGAGCATGAG ATACGCAAAC 13620
AGGCGCTGTC GCTATCTCTG TTCGATTGAG GAGAAGGACG GGCTTCCTCT GTTTGTCATC 13680
AAGGTTGTGG AGCAAGGCCA CGAAGATCTG GTCCTCACGG ACACAACGCC AAAAGGTGTG 13740
TGGGATAAAA TCTTGGAGCC TGTTGCTTCT GTTAGAAAAG AGTCTGAAAT GCTCCAGCTG 13800
TTTCCTGGCT ACTTGAAGGG TGAAGACCTC TTTGGTTTGA CAGTCTCTGC GGTGGCAAGG 13860
ATTGCCGAAT CGCTTCCAGG GGTTGAGGCC TGTGAGAATT ACACTTTCCG CTATGGCCGA 13920
AACCCCTTAA TGGAACTTCC TCTTGCCATC AACCCCACGG GCTGTGCCCG TGCCGAGCCT 13980
AAAATGAGTA CCCATGTCAA GAGGTTTGTG TTAAGGCCTC ACACCTTGAA TAGCACCAGC 14040
ACATCGAAGT CATTTCAGAG CACAGTTACA GGAGAACTGA ATGCGCCTTA CAGTAAGCAG 14100
TTCGTCCATT CCAAGTCCTC TCAGTACCGA AAAATGAAAA CCGAATGGAA GTCCAACGTG 14160
TATCTAGCTC GCTCTCGCAT TCAGGGCTTG GGCTTGTATG CTGCTAGAGA CATTGAAAAG 14220
CACACTATGG TCATTGAATA TATCGGAACT ATTATCCGCA ATGAGGTGGC AAACAGGAAA 14280
GAGAAGCTTT ACGAATCTCA GAATCGCGGG GTGTACATGT TCCGCATTGA CAATGACCAC 14340
GTCATCGACG CTACCTTGAC GGGAGGTCCT GCAAGGTATA TCAACCATTC GTGTGCACCT 14400
AACTGCGTGG CTGAGGTGGT GACTTTTGAG AGAGGACACA AAATCATCAT TAGCTCCAGC 14460
AGGAGAATCC AGAAAGGGGA GGAGCTTTGC TATGACTATA AGTTTGATTT TGAAGATGAC 14520
CAGCACAAGA TTCCATGTCA CTGTGGAGCT GTAAACTGCC GAAAATGGAT GAACTAGAAT 14580
GCATTCCTTG CTGTCTTTTG AGGGTTGCTT GTCCCTAGGA AGAAGTGATT CACATTTTGA 14640
TTTTGTCGAC ATAAAATTGT TGCCTTTTTG AAGGGAAGAA AAAAATCACT TCTGGAAGTA 14700
CACATTTCTA CTCAAGTATT TAAAAAAGAA AAAAAAAAAA AAAAAAAAAA AGAAAAAAAA 14760
AAAAAGGAAA AACAAGCACC AGAAGCAAGC CGGCAAAATC TGAAGCTAAC TTTCCAGACA 14820
TGTGGAGGTT AAACTGATTT AACAGAATGG TCCAGCACTT TTGTTTTTGA GCTCACTAAG 14880
GAGAAACTGT TGAACTTGGG CAAAGAACTG ATGGCTGACC TGCGGAAGTG CAGGTGGGCA 14940
TATATGAATG TAAGACTGAA ATCACCAGCG AAAGGGAGAA GTCCAAAGTG CTGGCCACAG 15000
CCTTA 15006
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 92 0.0 7778
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 92 0.0 7569
WERAM-Tag-0008 ENSTGUP00000000641.1 Taeniopygia guttata 90 0.0 7198
WERAM-Hos-0018 ENSP00000347325.3 Homo sapiens 73 0.0 5843
WERAM-Pat-0168 ENSPTRP00000046674.3 Pan troglodytes 73 0.0 5839
WERAM-Aim-0154 ENSAMEP00000014067.1 Ailuropoda melanoleuca 73 0.0 5740
WERAM-Anc-0148 ENSACAP00000014142.2 Anolis carolinensis 72 0.0 5720
WERAM-Ict-0124 ENSSTOP00000012342.2 Ictidomys tridecemlineatus 72 0.0 5718
WERAM-Caf-0059 ENSCAFP00000007370.4 Canis familiaris 71 0.0 5710
WERAM-Gog-0210 ENSGGOP00000027941.1 Gorilla gorilla 72 0.0 5640
WERAM-Paa-0005 ENSPANP00000006150.1 Papio anubis 71 0.0 5628
WERAM-Mum-0152 ENSMUSP00000043874.7 Mus musculus 71 0.0 5611
WERAM-Myl-0122 ENSMLUP00000010086.2 Myotis lucifugus 71 0.0 5603
WERAM-Fec-0096 ENSFCAP00000008002.3 Felis catus 71 0.0 5595
WERAM-Bot-0193 ENSBTAP00000028347.5 Bos taurus 71 0.0 5568
WERAM-Cap-0018 ENSCPOP00000001682.2 Cavia porcellus 70 0.0 5558
WERAM-Ova-0045 ENSOARP00000005594.1 Ovis aries 70 0.0 5524
WERAM-Mup-0098 ENSMPUP00000009152.1 Mustela putorius furo 71 0.0 5445
WERAM-Tut-0198 ENSTTRP00000016174.1 Tursiops truncatus 70 0.0 5439
WERAM-Lac-0079 ENSLACP00000010253.1 Latimeria chalumnae 63 0.0 4945
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 73 0.0 4427
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 73 0.0 4304
WERAM-Sah-0130 ENSSHAP00000013860.1 Sarcophilus harrisii 70 0.0 4210
WERAM-Chs-0076 ENSCSAP00000002864.1 Chlorocebus sabaeus 69 0.0 4120
WERAM-Nol-0046 ENSNLEP00000005663.2 Nomascus leucogenys 69 0.0 4064
WERAM-Orc-0115 ENSOCUP00000009766.3 Oryctolagus cuniculus 68 0.0 4029
WERAM-Eqc-0189 ENSECAP00000020200.1 Equus caballus 70 0.0 3994
WERAM-Mam-0056 ENSMMUP00000009467.2 Macaca mulatta 67 0.0 3774
WERAM-Poa-0169 ENSPPYP00000020408.2 Pongo abelii 72 0.0 3628
WERAM-Dio-0019 ENSDORP00000002189.1 Dipodomys ordii 69 0.0 3601
WERAM-Otg-0078 ENSOGAP00000005885.2 Otolemur garnettii 69 0.0 3387
WERAM-Caj-0209 ENSCJAP00000036628.3 Callithrix jacchus 71 0.0 3180
WERAM-Leo-0127 ENSLOCP00000015481.1 Lepisosteus oculatus 52 0.0 2731
WERAM-Pof-0064 ENSPFOP00000005925.2 Poecilia formosa 48 0.0 2689
WERAM-Loa-0084 ENSLAFP00000006640.4 Loxodonta africana 71 0.0 2546
WERAM-Fia-0086 ENSFALP00000007141.1 Ficedula albicollis 89 0.0 2383
WERAM-Ptv-0086 ENSPVAP00000007862.1 Pteropus vampyrus 66 0.0 2126
WERAM-Mod-0040 ENSMODP00000005845.3 Monodelphis domestica 72 0.0 1961
WERAM-Sus-0158 ENSSSCP00000023447.1 Sus scrofa 74 0.0 1903
WERAM-Mim-0010 ENSMICP00000000977.1 Microcebus murinus 65 0.0 1890
WERAM-Dar-0184 ENSDARP00000115827.2 Danio rerio 48 0.0 1821
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 92 0.0 1820
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 66 0.0 1778
WERAM-Tar-0071 ENSTRUP00000014027.1 Takifugu rubripes 47 0.0 1678
WERAM-Xim-0205 ENSXMAP00000016470.1 Xiphophorus maculatus 46 0.0 1667
WERAM-Xet-0065 ENSXETP00000021458.2 Xenopus tropicalis 66 0.0 1623
WERAM-Gaa-0138 ENSGACP00000017696.1 Gasterosteus aculeatus 50 0.0 1459
WERAM-Orla-0181 ENSORLP00000020984.1 Oryzias latipes 56 0.0 1399
WERAM-Asm-0011 ENSAMXP00000001840.1 Astyanax mexicanus 53 0.0 1393
WERAM-Vip-0013 ENSVPAP00000001498.1 Vicugna pacos 65 0.0 1384
WERAM-Ten-0186 ENSTNIP00000018287.1 Tetraodon nigroviridis 54 0.0 1382
WERAM-Prc-0090 ENSPCAP00000008256.1 Procavia capensis 90 0.0 1330
WERAM-Gam-0038 ENSGMOP00000003815.1 Gadus morhua 54 0.0 1265
WERAM-Mae-0021 ENSMEUP00000001693.1 Macropus eugenii 64 0.0 1157
WERAM-Orn-0120 ENSONIP00000012272.1 Oreochromis niloticus 57 0.0 996
WERAM-Ocp-0124 ENSOPRP00000012666.1 Ochotona princeps 68 0.0 974
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 76 0.0 894
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 72 0.0 842
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 72 0.0 823
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 76 0.0 767
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 54 0.0 647
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 4e-179 628
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 54 5e-175 615
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 51 7e-153 541
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 42 1e-152 540
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 35 1e-95 350
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 37 2e-46 187
Created Date 25-Jun-2016