WERAM Information


Tag Content
WERAM ID WERAM-Pat-0168
Ensembl Protein ID ENSPTRP00000046674.3
Gene Name KMT2C
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPTRG00000019892.5 ENSPTRT00000043948.3 ENSPTRP00000046674.3
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 3.10e-45 153.1 4720 4835
Me_Reader PHD 2.80e-26 91.5 231 4455
Organism Pan troglodytes
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+++iek+++viEY+G++ir+eva+++ek ye++++gvy+fr+d+d +v+dat +g+ ar+inhsc+pNc+
ENSPTRP00000046674.3 4720 NVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDND--HVIDATLTGGPARYINHSCAPNCV 4804
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++++ +ki+i ++r+I+kgeel+ydYk
ENSPTRP00000046674.3 4805 AEVVTFERGHKIIISSNRRIQKGEELCYDYK 4835
******************************7 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde.CddwfHlkCvklp 35 
C +C++ ++ + +C+e C + +H+ C +
ENSPTRP00000046674.3 231 RCAFCKHLGAT---IKCCEEkCTQMYHYPCAAGA 261
6****433333...6688889*********8655 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC+ +++ + C +C + +H+ C++++ ++l+ w Cp+Ck
ENSPTRP00000046674.3 290 NCAVCDSPGDL-LDQFFCTTCGQHYHGMCLDIAVTPLKRA-GWQCPECK 336
5****544444.45999****************8888865.7******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C+ C++++e++k m+ Cd+Cd+ +H+ C+++ ++s+p + w C++C+
ENSPTRP00000046674.3 337 VCQNCKQSGEDSK-MLVCDTCDKGYHTFCLQPVMKSVPTN-GWKCKNCR 383
8****88888876.************************77.7******8 PP
PHD.txt 2 tiClvCgkddegeke..mvqCdeCddwfHlkCvklplsslpeg..kswyCpsCk 51
++C++Cgk ++ e + m+ C+ C++w+Hl+C k++ ++l + ++++C Ck
ENSPTRP00000046674.3 412 NLCPFCGKYYHPELQkdMLHCNMCKRWVHLECDKPTDHELDTQlkEEYICMYCK 465
78****87777755556*******************77777666668******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
++C+vCg ++g++ ++ C++C + +H +Cv+++ +++ +k w+C +C+
ENSPTRP00000046674.3 905 DMCVVCGSFGQGAEGrLLACSQCGQCYHPYCVSIKITKVVLSKGWRCLECT 955
68****7544443334*******************999996658******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
t+C +Cgk+ + + ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSPTRP00000046674.3 955 TVCEACGKATDPGR-LLLCDDCDISYHTYCLDPPLQTVPKG-GWKCKWCV 1002
68999976666655.9*************************.9**99996 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.ks..wyCpsCk 51
+C+vC +++ +e+ ++qC +Cd+w+H+ C +l+ ++ e+ ++ + C C+
ENSPTRP00000046674.3 1033 SCPVCYRNYREEDLILQCRQCDRWMHAVCQNLNTEEEVENvADigFDCSMCR 1084
7****777777777*******************5444455422458999997 PP
PHD.txt 3 iClvCgkddegeke....mvqCdeCddwfHlkCv 32
+C +C+++++g + +++ d d w+Hl+C
ENSPTRP00000046674.3 4350 CC-FCHEEGDGLTDgparLLNLDL-DLWVHLNCA 4381
45.587777774445555666666.559999997 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C++C+k+++ + C C + +H +C ++ ++k +Cp +k
ENSPTRP00000046674.3 4409 KCVFCHKTGATS----GCHRfrCTNIYHFTCAIKAQCMFFKDKTMLCPMHK 4455
6****8888876....6**999*********98885666677778898887 PP

Protein Sequence
(Fasta)
RPRSRGKTAV EDEDSMDGLE TTETETIVET EIKEQSAEED AEAEVDNSKQ LIPTLQRSVS 60
EESANSLVSV GVEAKISEQL CAFCYCGEKS SLGQGDLKQF RITPGFILPW RNQPSNKKDI 120
DDNSNGTYEK MQNSAPRKQR GQRKERSPQQ NIVSCVSVST QTASDDQAGK LWDELSLVGL 180
PDAIDIQALF DSTGTCWAHH RCVEWSLGVC QMEEPLLVNV DKAVVSGSTE RCAFCKHLGA 240
TIKCCEEKCT QMYHYPCAAG AGTFQDFSHI FLLCPEHIDQ APERSKEDAN CAVCDSPGDL 300
LDQFFCTTCG QHYHGMCLDI AVTPLKRAGW QCPECKVCQN CKQSGEDSKM LVCDTCDKGY 360
HTFCLQPVMK SVPTNGWKCK NCRICIECGT RSSSQWHHNC LICDNCYQQQ DNLCPFCGKY 420
YHPELQKDML HCNMCKRWVH LECDKPTDHE LDTQLKEEYI CMYCKHLGAE MDPLQPGEEV 480
EVAELTTDYN NEMEVEGPED QMVFSEQAVN KDVNGQESTP GIVPDAVQVH TEEQQKSHPS 540
ESLDTDTLLI AVSSQHTVNT ELEKQISNEV DSEDLKMSSE VKHICGEDQI EDKMEVTENI 600
EVVTHQITVQ QEQLQLLEEP KTVVSREESR PPKLVMESVT LPLETLVSPH EESTSLCPEE 660
QLVIERLQGE KEQKENSELS TGLMDSEMTP TIEGCVKDVS YQGSKSIKLS SETESSFSSS 720
ADISKADASS SPTPSSDLPS HDMLHNYPSA LSSSAGNIMP TTYISVTPKI GMGKPAITKR 780
KFSPGRPRSK QGAWSTHNTV SPPSWSPDIS EGREIFKPRQ LPGSAIWSIK VGRGSGFPGK 840
RRPRGAGLSG RGGRGRSKLK SGIGAVVLPG VSTADISSNK DDEENSMHNT VVLFSSSDKF 900
TLNQDMCVVC GSFGQGAEGR LLACSQCGQC YHPYCVSIKI TKVVLSKGWR CLECTVCEAC 960
GKATDPGRLL LCDDCDISYH TYCLDPPLQT VPKGGWKCKW CVWCRHCGAT SAGLRCEWQN 1020
NYTQCAPCAS LSSCPVCYRN YREEDLILQC RQCDRWMHAV CQNLNTEEEV ENVADIGFDC 1080
SMCRPYMPAS NVPSSDCCES SLVAQIVTKV KELDPPKTYT QDGVCLTESG MTQLQSLTVT 1140
VPRRKRSKPK LKLKIINQNS VAVLQTPPDI QSEHSRDGEM DDSREGELMD CDGKSESSPE 1200
REAVDDETKG VEGTDGVKKR KRKPYRPGIG GFMVRQRSRT GQGKTKRSVI RKESSGSISE 1260
QLPCRDDGWS EQLPDTLVDE SVSVTESTEK IKKRYRKRKN KLEETFPAYL QEAFFGKDLL 1320
DTSRQSKISL DNLSEDGAQL LYKTNMNTGF LDPSLDPLLS SSSAPTKSGT HGPADDPLAD 1380
ISEVLNTDDD ILGIISDDLA KSVDHSDIGP VTDDPSSLPQ PNVNQSSRPL SEEQLDGILS 1440
PELDKMVTDG AILGKLYKIP ELGGKDVEDL FTAVLSPANT QPTPLPQPPP PTQLLPIHNQ 1500
DVFSRMPLMN GLIGPSPHLP HNSLPPGSGL GTFSAIAQSS YPDARDKNSA FNPMASDPNN 1560
SWTSSAPTVE GENDTMSNAQ RSTLKWEKEE ALGEMATVAP VLYTNINFPN LKEEFPDWTT 1620
RVKQIAKLWR KASSQERAPY VQKARDNRAA LRINKVQMSN DSMKRQQQQD SIDPSSRIDS 1680
ELFKDPLKQR ESEHEQEWKF RQQMRQKSKQ QAKIEATQKL EQVKNEQQQQ QQQQQFGSQH 1740
LLVQSGSDTP SSGIQSPLTP QPGNGNMSPA QSFHKELFTK QPTSTPTSTS SDDVFVKPQA 1800
PPPPPAPSRI PIQDSLSQAQ TSQPPSPQVF SPGSSNSRPP SPMDPYAKMV GTPRPPPVGH 1860
SFSRRNSAAP VENCTPLSSV SRPLQMNETT ANRPSPVRDL CSSSTTNNDP YAKPPDTPRP 1920
VMTDQFPKSL GLSRSPVVSE QTAKGPVAAG TSDHFTKPSP RADVFQRQRI PDSYARPLLT 1980
PAPLDSGPGP FKTPMQPPPS SQDPYGSVSQ ASRRLSVDPY ERPALTPRPI DNFSHNQSND 2040
PYSQPPLTPH PAMNESFAHP SRAFSQPGTI SRPTSQDPYS QPPGTPRPVV DSYSQSSGTA 2100
RSNTDPYSQP PGTPRPTTVD PYSQQPQTPR PSTQTDLFVT PVTNQRHSDP YAHPPGTPRP 2160
GISVPYSQPP ATPRPRISEG FTRSSMTRPV LMPNQDPFLQ AAQNRGPALP GPLVRPPDTC 2220
SQTPRPPGPG LSDTFSRVSP SAARDPYDQS PMTPRSQSDS FGTSQTAHDV ADQPRPGSEG 2280
SFCASSNSPM HSQGQQFSGV SQLPGPVPTS GVTDTQNTVN MAQADTEKLR QRQKLREIIL 2340
QQQQQKKIAG RQEKGSQDSP AVPHPGPLQH WQPENVNQAF TRPPPPYPGN IRSPVAPPLG 2400
PRYAVFPKDQ RGPYPPDVAS MGMRPHGFRF GFPGGSHGTM PSQERFLVPP QQIQGSGVSP 2460
QLRRSVAVDM PRPLNNSQMN NPVGLPQHFS PQSLPVQQHN ILGQAYIELR HRAPDGRQRL 2520
PFSAPPGSVV EASSNLRHGN FIPRPDFPGP RHTDPMRRPP QGLPNQLPVH PDLEQVPPSQ 2580
QEQGHSVHSS SMVMRTLNHP LGGEFSEAPL STSVPSETTS DNLQITTQPS DGLEEKLDSD 2640
VPSVKELDVK DLEGVEVKDL DDEDLENLNL DTEDGKVVEL DTLDNLETND PNLDDLLRSG 2700
EFDIIAYTDP ELDMGDKKSM FNEELDLPID DKLDNQCVSV EPKKKEQENK TLVLSDKHSP 2760
QKKSTVTNEV KTEVLSPNSK VESKCETEKN DENKDNVDTP CSQASAHSDL NDGEKTSLHP 2820
CDPDLFEKRT NRETAGPGAN VIQASTQLPA QDVINSCGIT GSTPVLSSLL ANEKSDNSDI 2880
RPLGSPPPPT LPASPSNHVS SLPPLIAPPG RVLDNTMNSN VTVVSRVNHV FSQGVQVNPG 2940
FIPGQSTVNH NLGTGKPATQ TGPLTSQSGT SGMSGPQQLM IPQTLAQQNR ERPLLLEEQP 3000
LLLQDLLDQE RQEQQQQRQM QAMIRQRSEP FFPNIDFDAI TDPIMKAKMV ALKGINKVMA 3060
QNNLGMPPMV MSRFPFMGQA VTGTQNSEGQ NLGPQAIPQD GSITHQISRP NPPNFGPGFV 3120
NDSQRKQYEE WLQETQQLLQ MQQKYLEEQI GAHRKSKKAL SAKQRTAKKA GREFPEEDAE 3180
QLKHVTEQQS MVQKQLEQIR KQQKEHAELI EDYRIKQQQQ CAMAPPTMMP SVQPQPPLIP 3240
GATPPTMSQP AFPMVPQQLQ HQQHTTVISG HTSPVRMPSL PGWQPNSAPA HLPLNPPRIQ 3300
PPIAQLPIKT CTPAPGTVSN ANPQSGPPPR VEFDDNNPFS ESFQERERKE RLREQQERQR 3360
IQLMQEVDRQ RALQQRMEME QHGMVGSEIS SSRTSVSQIP FYSSDLPCDF MQPLGPLQQS 3420
PQHQQQMGQV LQQQNIQQGS INSPSTQTFM QTNERRQVGP PSFVPDSPSI PVGSPNFSSV 3480
KQGHGNLSGT SFQQSPVRPS FTPALPAAPP VANSSLPCGQ DSTMTHGHSY PGSTQSLIQL 3540
YSDIIPEEKG KKKRTRKKKR DDDAESTKAP STPHSDITAP PTPGISETTS TPAVSTPSEL 3600
PQQADPESVE PVGPSTPNMA AGQLCTELEN KLPNSDFSQA TPNQQTYANS EVDKLSMETP 3660
AKTEEIKLEK AETESCPGQE EPKLEEQNGS KVEGNAVACP VSSAQSPPHS AGAPAAKGDS 3720
GNELLKHLLK NKKSSSFLNQ KPEGSICSED DCTKDNKLVE KQNPAEGLQT LGAQMQGGFG 3780
CGNQLPKTDG GSETKKQRSK RTQRTGEKAA PRSKKRKKDE EEKQAMYSST DTFTHLKQQN 3840
NLSNPPTPPA SLPPTPPPMA CQKMANGFAT TEELAGKAGV LVSHEVTKTL GPKPFQLPFR 3900
PQDDLLARAL VQGPKTVDVP ASLPTPPHNN QEELRIQDHC GDRDTPDSFV PSSSPESVVG 3960
VEVSRYPDLS LVKEEPPEPV PSPIIPILPS TAGKSSESRR NDIKTEPGTL YFASPFGPSP 4020
NGPRSGLISV AITLHPTAAE NISSVVAAFS DLLHVRIPNS YEVSNAPDVP SMGLVSSHRI 4080
NPGLEYRQHL LLRGPPPGSA NPPRLVSSYR LKQPNVPFPP TSNGLSGYKD SSHGIAESAA 4140
LRPQWCCHCK VVILGSGVRK SFKDLTLLNK DSRESTKRVE KDIVFCSNNC FILYSSTAQV 4200
KNSENKGSIP SLPQSPMRET PSKAFHQYSN NISTLDVHCL PQLPEKASPP ASPPIAFPPA 4260
FEAAQVEAKP DELKVTVKLK PRLRAVHGGF EDCRPLNKKW RGMKWKKWSI HIVIPKGTFK 4320
PPCEDEIDEF LKKLGTSLKP DPVPKDYRKC CFCHEEGDGL TDGPARLLNL DLDLWVHLNC 4380
ALWSTEVYET QAGALINVEL ALRRGLQMKC VFCHKTGATS GCHRFRCTNI YHFTCAIKAQ 4440
CMFFKDKTML CPMHKPKGIH EQELSYFAVF RRVYVQRDEV RQIASIVQRG ERDHTFRVGS 4500
LIFHTIGQLL PQQMQAFHSP KALFPVGYEA SRLYWSTRYA NRRCRYLCSI EEKDGRPVFV 4560
IRIVEQGHED LVLSDISPKG VWDKILEPVA CVRKKSEMLQ LFPAYLKGED LFGLTVSAVA 4620
RIAESLPGVE ACESYTFRYG RNPLMELPLA VNPTGCARSE PKMSAHVKRF VLRPHTLNST 4680
STSKSFQSTV TGELNAPYSK QFVHSKSSQY RKMKTEWKSN VYLARSRIQG LGLYAARDIE 4740
KHTMVIEYIG TIIRNEVANR KEKLYESQNR GVYMFRMDND HVIDATLTGG PARYINHSCA 4800
PNCVAEVVTF ERGHKIIISS NRRIQKGEEL CYDYKFDFED DQHKIPCHCG AVNCRKWMN 4859
Nucleotide Sequence
(Fasta)
ATCGCCCTTC CCCTTTCCAC GAGGCCAGGT AGACCTCGAA GTAGGGGGAA AACTGCAGTG 60
GAAGATGAGG ACAGCATGGA TGGGCTGGAG ACAACAGAAA CAGAAACGAT TGTGGAAACA 120
GAAATCAAAG AACAATCTGC AGAAGAGGAT GCTGAAGCAG AAGTGGATAA CAGCAAACAG 180
CTAATTCCAA CTCTTCAGCG ATCTGTTTCT GAGGAATCGG CAAACTCCCT GGTCTCTGTT 240
GGTGTAGAAG CCAAAATCAG TGAACAGCTC TGCGCTTTTT GTTACTGTGG GGAAAAAAGT 300
TCCTTAGGAC AAGGAGACTT AAAACAATTC AGAATAACGC CTGGATTTAT CTTGCCATGG 360
AGAAACCAAC CTTCTAACAA GAAGGACATT GATGACAACA GCAATGGAAC CTATGAGAAA 420
ATGCAAAACT CAGCACCACG AAAACAAAGA GGACAGAGAA AAGAACGATC TCCTCAGCAG 480
AATATAGTAT CTTGTGTAAG TGTAAGCACC CAGACAGCTT CAGATGATCA AGCTGGTAAA 540
CTGTGGGATG AACTCAGTCT GGTTGGGCTT CCAGATGCCA TTGATATCCA AGCCTTATTT 600
GATTCTACAG GCACTTGTTG GGCTCATCAC CGTTGTGTGG AGTGGTCACT AGGAGTATGC 660
CAGATGGAAG AACCATTGTT AGTGAACGTG GACAAAGCTG TTGTCTCAGG GAGCACAGAA 720
CGATGTGCAT TTTGTAAGCA CCTTGGAGCC ACTATCAAAT GCTGTGAAGA GAAATGTACC 780
CAGATGTATC ATTATCCTTG TGCTGCAGGA GCCGGCACCT TTCAGGATTT CAGTCACATC 840
TTCCTGCTTT GTCCAGAACA CATTGACCAA GCTCCTGAAA GATCGAAGGA AGATGCAAAC 900
TGTGCAGTGT GCGACAGCCC GGGAGACCTC TTAGATCAGT TCTTTTGTAC TACTTGTGGT 960
CAGCACTATC ATGGAATGTG CCTGGATATA GCGGTTACTC CATTAAAACG TGCAGGTTGG 1020
CAATGTCCTG AGTGCAAAGT GTGCCAGAAC TGCAAACAAT CGGGAGAAGA TAGCAAGATG 1080
CTAGTGTGTG ATACGTGTGA CAAAGGGTAT CATACTTTTT GTCTTCAACC AGTTATGAAA 1140
TCAGTACCAA CCAATGGCTG GAAATGCAAA AATTGCAGAA TATGTATAGA GTGTGGCACA 1200
CGGTCTAGTT CTCAGTGGCA CCACAATTGC CTGATATGTG ACAATTGTTA CCAACAGCAG 1260
GATAACTTAT GTCCCTTCTG TGGGAAGTAT TATCATCCAG AATTGCAGAA AGACATGCTT 1320
CATTGTAATA TGTGCAAAAG GTGGGTTCAC CTAGAGTGTG ACAAACCAAC AGATCATGAA 1380
CTGGATACTC AGCTCAAAGA AGAGTATATC TGCATGTATT GTAAACACCT GGGAGCTGAG 1440
ATGGATCCTT TACAGCCAGG TGAGGAAGTG GAGGTAGCTG AGCTCACTAC AGATTATAAC 1500
AATGAAATGG AAGTTGAAGG CCCTGAAGAT CAAATGGTAT TCTCAGAGCA GGCAGTTAAT 1560
AAAGATGTCA ACGGTCAGGA GTCCACTCCT GGAATTGTTC CAGATGCGGT TCAAGTCCAC 1620
ACTGAAGAGC AACAGAAGAG TCATCCCTCA GAAAGTCTTG ACACAGATAC TCTTCTTATT 1680
GCTGTATCAT CCCAACATAC AGTGAATACT GAATTGGAAA AACAGATTTC TAATGAAGTT 1740
GATAGTGAAG ACCTGAAAAT GTCTTCTGAA GTGAAGCATA TTTGTGGCGA AGATCAAATT 1800
GAAGATAAAA TGGAAGTGAC AGAAAACATT GAAGTCGTTA CACACCAGAT CACTGTGCAG 1860
CAAGAACAAC TGCAGTTGTT AGAGGAACCT AAAACAGTGG TATCCAGAGA AGAATCAAGG 1920
CCTCCAAAAT TAGTCATGGA ATCTGTCACT CTTCCACTAG AAACCTTAGT GTCCCCACAT 1980
GAGGAAAGTA CTTCATTATG TCCTGAGGAA CAGTTGGTTA TAGAAAGGCT ACAAGGAGAA 2040
AAGGAACAGA AAGAAAATTC TGAACTTTCT ACTGGATTGA TGGACTCTGA AATGACTCCT 2100
ACAATTGAGG GTTGTGTGAA AGATGTTTCA TACCAAGGAA GCAAATCTAT AAAGTTATCA 2160
TCTGAGACAG AGTCATCATT TTCATCATCA GCAGACATAA GCAAGGCAGA TGCGTCTTCC 2220
TCCCCAACAC CTTCTTCAGA CTTGCCTTCG CATGACATGC TGCATAATTA CCCTTCAGCT 2280
CTTAGTTCCT CTGCTGGAAA CATCATGCCA ACAACTTACA TCTCAGTCAC TCCAAAAATT 2340
GGCATGGGTA AACCAGCTAT TACTAAGAGA AAATTTTCTC CTGGTAGACC TCGGTCCAAA 2400
CAGGGGGCTT GGAGTACCCA TAATACAGTG AGCCCACCTT CCTGGTCCCC AGACATTTCA 2460
GAAGGTCGGG AAATTTTTAA ACCCAGGCAG CTTCCTGGCA GTGCCATTTG GAGCATCAAA 2520
GTGGGCCGTG GGTCTGGATT TCCAGGAAAG CGGAGACCTC GAGGTGCAGG ACTGTCGGGG 2580
CGAGGTGGCC GAGGCAGGTC AAAGCTGAAA AGTGGAATCG GAGCTGTTGT ATTGCCTGGG 2640
GTGTCTACTG CAGATATTTC ATCAAATAAG GATGATGAAG AAAACTCTAT GCACAATACA 2700
GTTGTGTTGT TTTCTAGCAG TGACAAGTTC ACTTTGAATC AGGATATGTG TGTAGTTTGT 2760
GGCAGTTTTG GCCAAGGAGC AGAAGGAAGA TTACTTGCCT GTTCTCAGTG TGGTCAGTGT 2820
TACCATCCAT ACTGTGTCAG TATTAAGATC ACTAAAGTGG TTCTTAGCAA AGGTTGGAGG 2880
TGTCTTGAGT GCACTGTGTG TGAGGCCTGT GGGAAGGCAA CTGACCCAGG AAGACTCCTG 2940
CTGTGTGATG ATTGTGACAT AAGTTATCAC ACCTACTGCC TAGACCCTCC ATTGCAGACA 3000
GTTCCCAAAG GAGGCTGGAA GTGCAAATGG TGTGTTTGGT GCAGACACTG TGGAGCAACA 3060
TCTGCAGGTC TAAGATGTGA ATGGCAGAAC AATTACACAC AGTGCGCTCC TTGTGCAAGC 3120
TTATCTTCCT GTCCAGTCTG CTATCGAAAC TATAGAGAAG AAGATCTTAT TCTGCAATGT 3180
AGACAATGTG ATAGATGGAT GCATGCAGTT TGTCAGAACT TAAATACTGA GGAAGAAGTG 3240
GAAAATGTAG CAGACATTGG TTTTGATTGT AGCATGTGCA GACCCTATAT GCCTGCGTCT 3300
AATGTGCCTT CCTCAGACTG CTGTGAATCT TCACTTGTAG CACAAATTGT CACAAAAGTA 3360
AAAGAGCTAG ACCCACCCAA GACTTATACC CAGGATGGTG TGTGTTTGAC TGAATCAGGG 3420
ATGACTCAGT TACAGAGCCT CACAGTTACA GTTCCAAGAA GAAAACGGTC AAAACCAAAA 3480
TTGAAATTGA AGATTATAAA TCAGAATAGC GTGGCCGTCC TTCAGACCCC TCCAGACATC 3540
CAATCAGAGC ATTCAAGGGA TGGTGAAATG GATGATAGTC GAGAAGGAGA ACTTATGGAT 3600
TGTGATGGAA AATCAGAATC TAGTCCTGAG CGGGAAGCTG TGGATGATGA AACTAAGGGA 3660
GTGGAAGGAA CAGATGGTGT CAAAAAGAGA AAAAGGAAAC CATACAGACC AGGTATTGGT 3720
GGATTTATGG TGCGGCAAAG AAGTCGAACT GGGCAAGGGA AAACTAAAAG ATCTGTGATC 3780
AGAAAAGAGT CCTCAGGCTC CATTTCCGAG CAGTTACCTT GCAGAGATGA TGGCTGGAGT 3840
GAGCAGTTAC CAGATACTTT AGTTGATGAA TCTGTTTCTG TTACTGAAAG CACTGAAAAA 3900
ATAAAGAAGA GATACCGAAA AAGGAAAAAT AAGCTTGAAG AAACTTTCCC TGCCTATTTA 3960
CAAGAAGCTT TCTTTGGAAA AGATCTTCTA GATACAAGTA GACAAAGCAA GATAAGTTTA 4020
GATAATCTGT CAGAAGATGG AGCTCAGCTT TTATATAAAA CAAACATGAA CACAGGTTTC 4080
TTGGATCCTT CCTTAGATCC ACTACTTAGT TCATCCTCGG CTCCAACAAA ATCTGGAACT 4140
CACGGTCCTG CTGATGACCC ATTAGCTGAT ATTTCTGAAG TTTTAAACAC AGATGATGAC 4200
ATTCTTGGAA TAATTTCAGA TGATCTAGCA AAATCAGTTG ATCATTCAGA TATTGGTCCT 4260
GTCACTGATG ATCCTTCCTC TTTGCCTCAG CCAAATGTCA ATCAGAGTTC ACGACCATTA 4320
AGTGAAGAAC AGCTAGATGG GATCCTCAGT CCTGAACTAG ACAAAATGGT CACAGATGGA 4380
GCAATTCTTG GAAAATTATA TAAAATTCCA GAGCTCGGCG GAAAAGATGT TGAAGACTTA 4440
TTTACAGCTG TACTTAGTCC TGCGAACACT CAGCCAACTC CATTGCCACA GCCTCCCCCA 4500
CCAACACAGC TGTTGCCAAT ACACAATCAG GATGTTTTTT CACGGATGCC TCTCATGAAT 4560
GGCCTTATTG GACCCAGTCC TCATCTCCCA CATAATTCTT TGCCACCTGG AAGCGGACTG 4620
GGAACTTTCT CTGCAATTGC ACAATCCTCT TATCCTGATG CCAGGGATAA AAATTCAGCC 4680
TTTAATCCAA TGGCAAGTGA TCCTAACAAC TCTTGGACAT CATCAGCTCC CACTGTGGAA 4740
GGAGAAAATG ACACCATGTC GAATGCCCAG AGAAGCACGC TTAAGTGGGA GAAAGAGGAG 4800
GCTCTGGGTG AAATGGCAAC AGTTGCCCCA GTTCTCTACA CCAATATTAA TTTCCCCAAC 4860
TTAAAGGAAG AATTCCCTGA TTGGACTACT AGAGTGAAGC AAATTGCCAA ATTGTGGAGA 4920
AAAGCAAGCT CACAAGAAAG AGCACCATAT GTGCAAAAAG CCAGAGATAA CAGAGCTGCT 4980
TTACGCATTA ATAAAGTACA GATGTCAAAT GATTCCATGA AAAGGCAGCA ACAGCAAGAT 5040
AGCATTGATC CCAGCTCTCG TATTGATTCG GAGCTTTTTA AAGATCCTTT AAAGCAAAGA 5100
GAATCAGAAC ATGAACAGGA ATGGAAATTT AGACAGCAAA TGCGTCAGAA AAGTAAGCAG 5160
CAAGCTAAAA TTGAAGCCAC ACAGAAACTT GAACAGGTGA AAAATGAGCA GCAGCAGCAG 5220
CAACAACAAC AGCAATTTGG TTCTCAGCAT CTTCTGGTGC AGTCTGGTTC AGATACACCA 5280
AGTAGTGGGA TACAGAGTCC CTTGACACCT CAGCCTGGCA ATGGAAATAT GTCTCCTGCA 5340
CAGTCATTCC ATAAAGAACT GTTTACAAAA CAGCCAACCA GTACCCCTAC GTCTACATCT 5400
TCAGATGATG TGTTTGTAAA GCCACAAGCT CCACCTCCTC CTCCAGCCCC ATCCCGGATT 5460
CCCATCCAGG ATAGTCTTTC TCAGGCTCAG ACTTCTCAGC CACCCTCACC ACAAGTGTTT 5520
TCACCTGGGT CCTCTAACTC ACGACCACCA TCTCCAATGG ATCCATATGC AAAAATGGTT 5580
GGTACCCCTC GACCACCTCC TGTGGGCCAT AGTTTTTCCA GAAGAAATTC TGCTGCACCA 5640
GTGGAAAACT GTACACCTTT ATCATCGGTA TCTAGGCCCC TTCAAATGAA TGAGACAACA 5700
GCAAATAGGC CATCCCCTGT CAGAGATTTA TGTTCTTCTT CCACGACAAA TAATGACCCC 5760
TATGCAAAAC CTCCAGACAC ACCTAGGCCT GTGATGACAG ATCAATTTCC CAAATCCTTG 5820
GGCCTATCCC GGTCTCCTGT AGTTTCAGAA CAAACTGCAA AAGGCCCTGT AGCAGCTGGA 5880
ACCAGTGATC ACTTTACTAA ACCATCTCCT AGGGCAGATG TGTTTCAAAG ACAAAGGATA 5940
CCTGACTCAT ATGCACGACC CTTGTTGACA CCTGCACCTC TTGATAGTGG TCCTGGACCT 6000
TTTAAGACTC CAATGCAACC TCCTCCATCC TCTCAGGATC CTTATGGATC AGTGTCACAG 6060
GCATCAAGGC GATTATCTGT TGACCCTTAT GAAAGGCCTG CTTTGACACC AAGACCTATA 6120
GATAATTTTT CTCATAATCA GTCAAATGAT CCATATAGTC AGCCTCCCCT TACCCCACAT 6180
CCAGCAATGA ATGAATCTTT TGCCCATCCT TCAAGGGCTT TTTCCCAGCC TGGAACCATA 6240
TCAAGGCCAA CATCTCAGGA CCCATACTCC CAACCCCCAG GAACTCCACG ACCTGTTGTA 6300
GATTCTTATT CCCAATCTTC AGGAACAGCT AGGTCCAATA CAGACCCTTA CTCTCAACCT 6360
CCTGGAACTC CCCGGCCTAC TACTGTTGAC CCATATAGTC AGCAGCCCCA AACCCCAAGA 6420
CCATCTACAC AAACTGACTT GTTTGTTACA CCTGTAACAA ATCAGAGGCA TTCTGATCCA 6480
TATGCTCATC CTCCTGGAAC ACCAAGACCT GGAATTTCTG TCCCTTACTC TCAGCCACCA 6540
GCAACACCAA GGCCAAGGAT TTCAGAGGGT TTTACTAGGT CCTCAATGAC AAGACCAGTC 6600
CTCATGCCAA ATCAGGATCC TTTCCTGCAA GCAGCACAAA ACCGAGGACC AGCTTTACCT 6660
GGCCCGTTGG TAAGGCCACC TGATACATGT TCCCAGACAC CTAGGCCCCC TGGACCTGGT 6720
CTTTCAGACA CATTTAGCCG TGTTTCCCCA TCTGCTGCCC GTGATCCCTA TGATCAGTCT 6780
CCAATGACTC CAAGATCTCA GTCTGACTCT TTTGGAACAA GTCAAACTGC CCATGATGTT 6840
GCTGATCAGC CAAGGCCTGG ATCAGAGGGG AGCTTCTGTG CATCTTCAAA CTCTCCAATG 6900
CACTCCCAAG GCCAGCAGTT CTCTGGTGTC TCCCAACTTC CTGGACCTGT GCCAACTTCA 6960
GGAGTAACTG ATACACAGAA TACTGTAAAT ATGGCCCAAG CAGATACAGA GAAATTGAGA 7020
CAGCGGCAGA AGTTACGTGA AATCATTCTC CAGCAGCAAC AGCAGAAGAA GATTGCAGGT 7080
CGACAGGAGA AGGGGTCACA GGACTCACCC GCAGTGCCTC ATCCAGGGCC TCTTCAACAC 7140
TGGCAACCAG AGAATGTTAA CCAGGCTTTC ACCAGACCCC CACCTCCCTA TCCTGGGAAC 7200
ATTAGGTCTC CTGTTGCCCC TCCTTTAGGA CCTAGATATG CTGTTTTCCC AAAAGATCAG 7260
CGTGGACCCT ATCCTCCTGA TGTTGCTAGT ATGGGGATGA GACCTCATGG ATTTAGATTT 7320
GGATTTCCAG GAGGTAGTCA TGGTACCATG CCGAGTCAAG AGCGCTTCCT TGTGCCTCCT 7380
CAACAAATAC AGGGATCTGG AGTTTCTCCA CAGCTAAGAA GATCAGTAGC TGTAGATATG 7440
CCTAGGCCTT TAAATAACTC ACAAATGAAT AATCCAGTTG GACTTCCTCA GCATTTTTCA 7500
CCACAGAGCT TGCCAGTTCA GCAGCACAAC ATACTGGGCC AAGCATATAT TGAACTGAGA 7560
CATAGGGCTC CTGACGGAAG GCAACGGCTG CCTTTCAGTG CTCCACCTGG CAGCGTTGTA 7620
GAGGCATCTT CAAATCTGAG ACATGGAAAC TTCATTCCCC GGCCAGACTT TCCGGGCCCT 7680
AGACATACAG ACCCCATGCG ACGACCTCCC CAGGGTCTAC CTAATCAGCT ACCTGTGCAC 7740
CCAGATTTGG AACAAGTGCC ACCATCTCAA CAAGAGCAAG GTCATTCTGT CCATTCATCT 7800
TCTATGGTCA TGAGGACTCT GAACCATCCA CTAGGTGGTG AATTTTCAGA AGCTCCTTTG 7860
TCAACATCTG TACCATCTGA AACAACGTCT GATAATTTAC AGATAACCAC CCAACCTTCT 7920
GATGGTCTAG AGGAAAAACT TGATTCTGAT GTCCCTTCTG TGAAGGAACT GGATGTTAAA 7980
GACCTTGAGG GGGTTGAAGT CAAAGACTTA GATGATGAAG ATCTTGAAAA CTTAAATTTA 8040
GATACAGAGG ATGGCAAGGT AGTTGAATTG GATACTTTAG ATAATTTGGA AACTAATGAT 8100
CCCAACCTGG ATGACCTCTT AAGGTCAGGA GAGTTTGATA TCATTGCATA TACAGATCCA 8160
GAACTTGACA TGGGAGATAA GAAAAGCATG TTTAATGAGG AACTAGACCT TCCAATTGAT 8220
GATAAGTTAG ATAATCAGTG TGTATCTGTT GAACCAAAAA AAAAGGAACA AGAAAACAAA 8280
ACTCTGGTTC TCTCTGATAA ACATTCACCA CAGAAAAAAT CCACTGTTAC CAATGAGGTA 8340
AAAACGGAAG TACTGTCTCC AAATTCTAAG GTGGAATCCA AATGTGAAAC TGAAAAAAAT 8400
GATGAGAATA AAGATAATGT TGACACTCCT TGCTCACAGG CTTCTGCTCA CTCAGACCTA 8460
AATGATGGAG AAAAGACTTC TTTGCATCCT TGTGATCCAG ATCTATTTGA GAAAAGAACC 8520
AATCGAGAAA CTGCTGGCCC CGGTGCAAAT GTCATTCAAG CATCCACTCA ACTACCTGCT 8580
CAAGATGTAA TAAACTCTTG TGGCATAACT GGATCAACTC CAGTTCTCTC AAGTTTACTT 8640
GCCAATGAGA AATCTGATAA TTCAGACATT AGGCCATTGG GGTCTCCACC ACCACCAACT 8700
CTGCCGGCCT CACCATCCAA TCATGTGTCA AGTTTGCCTC CTTTAATAGC ACCGCCTGGC 8760
CGTGTTTTGG ATAATACCAT GAATTCTAAT GTAACAGTAG TCTCTAGGGT AAACCATGTT 8820
TTTTCTCAGG GTGTGCAGGT AAATCCAGGG TTCATTCCAG GTCAATCAAC AGTTAACCAC 8880
AATCTGGGGA CAGGAAAACC TGCAACTCAA ACTGGGCCTC TAACAAGTCA GTCTGGTACC 8940
AGTGGCATGT CTGGACCCCA ACAGCTAATG ATTCCTCAAA CATTAGCACA GCAGAATAGA 9000
GAGAGGCCCC TTCTTCTAGA AGAACAGCCT CTACTTCTAC AGGATCTTTT GGATCAAGAA 9060
AGGCAGGAAC AGCAGCAGCA AAGACAGATG CAAGCCATGA TTCGTCAGCG ATCAGAACCG 9120
TTCTTCCCTA ATATTGATTT TGATGCAATT ACAGATCCTA TAATGAAAGC CAAAATGGTG 9180
GCCCTTAAAG GCATAAATAA AGTGATGGCA CAAAACAATC TGGGCATGCC ACCAATGGTG 9240
ATGAGCAGGT TCCCTTTTAT GGGCCAGGCG GTAACTGGAA CACAGAACAG TGAAGGACAG 9300
AACCTTGGAC CACAGGCCAT TCCTCAGGAT GGCAGTATAA CACATCAGAT TTCTAGGCCT 9360
AATCCTCCAA ATTTTGGTCC AGGCTTTGTC AATGATTCAC AGCGTAAGCA GTATGAAGAG 9420
TGGCTCCAGG AGACCCAACA GCTGCTTCAA ATGCAGCAGA AGTATCTTGA AGAACAAATT 9480
GGTGCTCACA GAAAATCTAA GAAGGCCCTT TCAGCTAAAC AACGTACTGC CAAGAAAGCT 9540
GGGCGTGAAT TTCCAGAGGA AGATGCAGAA CAACTCAAGC ATGTTACTGA ACAGCAAAGC 9600
ATGGTTCAGA AACAGCTAGA ACAGATTCGT AAACAACAGA AAGAACATGC TGAATTGATT 9660
GAAGATTATC GGATCAAACA GCAGCAGCAA TGTGCAATGG CCCCACCTAC CATGATGCCC 9720
AGTGTCCAGC CCCAGCCACC CCTAATTCCA GGTGCCACTC CACCCACCAT GAGCCAACCC 9780
GCCTTTCCCA TGGTGCCACA GCAGCTTCAG CACCAGCAGC ACACAACAGT TATTTCTGGC 9840
CATACTAGCC CTGTTAGAAT GCCCAGTTTA CCTGGATGGC AACCCAACAG TGCTCCTGCC 9900
CACCTGCCCC TCAATCCTCC TAGAATTCAG CCCCCAATTG CCCAATTACC AATAAAAACT 9960
TGTACACCAG CCCCAGGGAC AGTCTCAAAT GCAAATCCAC AGAGTGGACC ACCACCTCGG 10020
GTAGAATTTG ATGACAACAA TCCCTTTAGT GAAAGTTTTC AAGAACGGGA ACGCAAGGAA 10080
CGTTTACGAG AACAGCAAGA GAGACAACGG ATCCAACTCA TGCAGGAGGT AGATAGACAA 10140
AGAGCTTTGC AGCAGAGGAT GGAAATGGAG CAGCATGGTA TGGTGGGCTC TGAGATAAGT 10200
AGTAGTAGGA CATCTGTGTC CCAGATTCCC TTCTACAGTT CCGACTTACC TTGTGATTTT 10260
ATGCAACCTC TAGGACCCCT TCAGCAGTCT CCACAACACC AACAGCAAAT GGGGCAGGTT 10320
TTACAGCAGC AGAATATACA ACAAGGATCA ATTAATTCAC CCTCCACCCA AACTTTCATG 10380
CAGACTAATG AGCGAAGGCA GGTAGGCCCT CCTTCATTTG TTCCTGATTC ACCATCAATC 10440
CCTGTTGGAA GCCCAAATTT TTCTTCTGTG AAGCAGGGAC ATGGAAATCT TTCTGGGACC 10500
AGCTTCCAGC AGTCCCCAGT GAGGCCTTCT TTTACACCTG CTTTACCAGC AGCACCTCCA 10560
GTAGCTAATA GCAGTCTCCC ATGTGGCCAA GATTCTACTA TGACCCATGG ACACAGTTAT 10620
CCGGGATCAA CCCAATCGCT CATTCAGTTG TATTCTGATA TAATCCCAGA GGAAAAAGGG 10680
AAAAAGAAAA GAACAAGAAA GAAGAAAAGA GATGATGATG CAGAATCCAC CAAGGCTCCA 10740
TCAACTCCCC ATTCAGATAT AACTGCCCCG CCGACTCCAG GCATCTCAGA AACTACCTCT 10800
ACTCCTGCAG TGAGCACACC CAGTGAGCTT CCTCAACAAG CCGACCCAGA GTCGGTGGAA 10860
CCAGTCGGCC CATCCACTCC CAATATGGCA GCAGGCCAGC TATGTACAGA ATTAGAGAAC 10920
AAACTGCCCA ATAGTGATTT CTCACAAGCA ACTCCAAATC AACAGACGTA TGCAAATTCA 10980
GAAGTAGATA AGCTCTCCAT GGAAACCCCT GCCAAAACAG AAGAAATAAA ACTGGAAAAG 11040
GCTGAGACAG AGTCCTGCCC AGGCCAAGAG GAGCCTAAAT TGGAGGAACA GAATGGTAGT 11100
AAGGTAGAAG GAAACGCTGT AGCCTGTCCT GTCTCCTCAG CACAGAGTCC TCCCCATTCT 11160
GCTGGGGCCC CTGCTGCCAA AGGAGACTCA GGGAATGAAC TTCTGAAACA CTTGTTGAAA 11220
AATAAAAAGT CATCTTCTTT TTTGAATCAA AAACCTGAGG GCAGTATTTG TTCAGAAGAT 11280
GACTGTACAA AGGATAATAA ACTAGTTGAG AAGCAGAACC CAGCTGAAGG ACTGCAAACT 11340
TTGGGGGCTC AAATGCAAGG TGGTTTTGGA TGTGGCAACC AGTTGCCAAA AACAGATGGA 11400
GGAAGTGAAA CCAAGAAACA GCGAAGCAAA CGGACTCAGA GGACGGGTGA GAAAGCAGCA 11460
CCTCGCTCAA AGAAAAGGAA AAAGGACGAA GAGGAGAAAC AAGCTATGTA CTCTAGCACT 11520
GACACGTTTA CCCACTTGAA ACAGCAGAAT AATTTAAGTA ATCCTCCAAC ACCCCCTGCC 11580
TCTCTTCCTC CTACACCACC TCCTATGGCT TGTCAGAAGA TGGCCAATGG TTTTGCAACA 11640
ACTGAAGAAC TTGCTGGAAA AGCCGGAGTG TTAGTGAGCC ATGAAGTTAC CAAAACTCTA 11700
GGACCTAAAC CATTTCAGCT GCCCTTCAGA CCCCAGGACG ACTTGTTGGC CCGAGCTCTT 11760
GTTCAGGGCC CCAAGACAGT TGATGTGCCA GCCTCCCTCC CAACACCACC TCATAACAAT 11820
CAGGAAGAAT TAAGGATACA GGATCACTGT GGTGATCGAG ATACTCCTGA CAGTTTTGTT 11880
CCCTCATCCT CTCCTGAGAG TGTGGTTGGG GTAGAAGTGA GCAGGTATCC AGATCTGTCA 11940
TTGGTCAAGG AGGAGCCTCC AGAACCGGTG CCGTCCCCCA TCATTCCAAT TCTTCCTAGC 12000
ACTGCTGGGA AAAGTTCAGA ATCAAGAAGG AATGACATCA AAACTGAGCC AGGCACTTTA 12060
TATTTTGCGT CACCTTTTGG TCCTTCCCCA AATGGTCCCA GATCAGGTCT TATATCTGTA 12120
GCAATTACTC TGCATCCTAC AGCTGCTGAG AACATTAGCA GTGTTGTGGC TGCATTTTCC 12180
GACCTTCTTC ACGTCCGAAT CCCTAACAGC TATGAGGTTA GCAATGCTCC AGATGTCCCA 12240
TCCATGGGTT TGGTCAGTAG CCACAGAATC AACCCGGGTT TGGAGTATCG ACAGCATTTA 12300
CTTCTCCGTG GGCCTCCGCC AGGATCTGCA AACCCTCCCA GATTAGTGAG CTCTTACCGG 12360
CTGAAGCAGC CTAATGTACC ATTTCCTCCA ACAAGCAATG GTCTTTCTGG ATATAAGGAT 12420
TCTAGTCATG GTATTGCAGA AAGCGCAGCA CTCAGACCAC AGTGGTGTTG TCATTGTAAA 12480
GTGGTTATTC TTGGAAGTGG TGTGCGGAAA TCTTTCAAAG ATCTGACCCT TTTGAACAAG 12540
GATTCCCGAG AAAGCACCAA GAGGGTAGAG AAGGACATTG TCTTCTGTAG TAATAACTGC 12600
TTTATTCTTT ATTCATCAAC TGCACAAGTG AAAAACTCAG AAAACAAGGG ATCCATTCCT 12660
TCATTGCCAC AATCACCTAT GAGAGAAACG CCTTCCAAAG CATTTCATCA GTACAGCAAC 12720
AACATCTCCA CTTTGGATGT GCACTGTCTC CCCCAGCTCC CAGAGAAAGC TTCTCCCCCT 12780
GCGTCACCGC CCATCGCCTT CCCTCCTGCT TTTGAAGCAG CCCAAGTCGA GGCCAAGCCA 12840
GATGAGCTGA AGGTGACAGT CAAGCTGAAG CCTCGGCTAA GAGCTGTCCA TGGTGGGTTT 12900
GAAGATTGCA GGCCACTCAA TAAAAAATGG AGAGGAATGA AATGGAAGAA GTGGAGCATT 12960
CATATTGTAA TCCCTAAGGG GACATTTAAA CCACCTTGTG AGGATGAAAT AGATGAATTT 13020
CTAAAGAAAT TGGGCACTTC CCTTAAACCT GATCCTGTGC CCAAAGACTA TCGGAAATGT 13080
TGCTTTTGTC ATGAAGAAGG TGATGGATTG ACAGATGGAC CAGCAAGGCT ACTCAACCTT 13140
GACTTGGATC TGTGGGTCCA CTTGAACTGC GCTCTGTGGT CCACGGAGGT CTATGAGACT 13200
CAGGCTGGTG CCTTAATAAA TGTGGAGCTA GCTCTGAGGA GAGGCCTACA AATGAAATGT 13260
GTCTTCTGTC ACAAGACAGG TGCCACTAGT GGATGCCACA GATTTCGATG CACCAACATT 13320
TATCACTTTA CTTGCGCCAT TAAAGCACAA TGCATGTTTT TTAAGGACAA AACTATGCTT 13380
TGCCCCATGC ACAAACCAAA GGGAATTCAT GAGCAAGAAT TAAGTTACTT TGCAGTCTTC 13440
AGGAGGGTCT ATGTTCAGCG CGATGAGGTG CGACAGATTG CTAGCATCGT GCAACGAGGA 13500
GAACGGGACC ATACCTTTCG CGTGGGTAGC CTCATCTTCC ACACAATTGG TCAGCTGCTT 13560
CCACAGCAGA TGCAAGCATT CCATTCTCCT AAAGCACTCT TCCCTGTGGG CTATGAAGCC 13620
AGCCGGCTGT ACTGGAGCAC GCGCTATGCC AATAGGCGCT GCCGCTACCT GTGCTCCATT 13680
GAGGAGAAGG ATGGGCGCCC AGTGTTTGTC ATCAGGATTG TGGAACAAGG CCATGAAGAC 13740
CTGGTTCTAA GTGACATCTC ACCTAAAGGT GTCTGGGATA AGATTTTGGA GCCTGTGGCA 13800
TGTGTGAGAA AAAAGTCTGA AATGCTCCAG CTTTTCCCAG CGTATTTAAA AGGAGAGGAT 13860
CTGTTTGGCC TGACCGTCTC TGCAGTGGCA CGCATAGCGG AATCACTACC TGGGGTTGAG 13920
GCATGTGAAA GTTATACCTT CCGATACGGC CGAAATCCTC TCATGGAACT TCCTCTTGCC 13980
GTTAACCCCA CAGGTTGTGC CCGTTCTGAA CCTAAAATGA GTGCCCATGT CAAGAGGTTT 14040
GTGTTAAGGC CTCACACCTT AAACAGCACC AGCACCTCAA AGTCATTTCA GAGCACAGTC 14100
ACTGGAGAAC TGAACGCACC TTATAGTAAA CAGTTCGTTC ACTCCAAGTC ATCGCAGTAC 14160
CGGAAGATGA AAACTGAATG GAAATCCAAT GTGTATCTGG CACGGTCTCG GATTCAGGGG 14220
CTGGGCCTGT ATGCTGCTAG AGACATTGAG AAGCACACCA TGGTCATCGA GTACATCGGG 14280
ACTATCATTC GAAACGAAGT AGCAAACAGG AAAGAGAAGC TTTATGAGTC TCAGAACCGT 14340
GGTGTGTACA TGTTCCGCAT GGATAACGAC CACGTGATTG ACGCGACACT CACAGGAGGG 14400
CCTGCAAGGT ATATCAACCA TTCGTGTGCA CCTAATTGTG TGGCTGAAGT GGTGACTTTT 14460
GAGAGAGGAC ACAAAATTAT CATCAGCTCC AATCGGAGAA TCCAGAAAGG AGAAGAGCTC 14520
TGCTATGACT ATAAGTTTGA CTTTGAAGAT GACCAGCACA AGATTCCCTG TCACTGTGGA 14580
GCTGTGAACT GCCGGAAGTG GATGAACTGA AATGCATTCC TTGCTAGCTC AGCGGGCGGC 14640
TTGTCCCTAG GAAGAGGCGA TTCAACACAC CATTGGAATT TTGCAGACAG AAAGAGATGT 14700
TTGTTTTCTG TTTTATGACT TTTTGAAAAA GCTTCTGGGA GTTCTGATTT CCTCAGTCCT 14760
TTAGGTTAAA GCAGCGCCAG GAGGAAGCTG ACAGAAGCAG CGTTCCTGAA GTGGCCGAGG 14820
TTAAACGGAA TCACAGAATG GTCCAGCACT TTTGGTTTTT TTTCTTTTCC TTTTCTTTCT 14880
TTTTTTGTTT GTTTTTTGTT TTTTGTTTTG TTTTTCCCTT GTGGGTGGGT TTCATTGTTT 14940
TGGTTTTCTA GTCTCACTAA GGAGAAACTT TTACTGGGGC AAAGAGCCGA TGGCTGCCCT 15000
GCCCCGGGCA GGGGCCTTCC TATGAATGTA AGACTGAAAT CACCAGCGAA GGGGACAGAG 15060
AGTGCTGGCC ACGGCCTTAT TAAAAAGGGG CAGGCCCTCT AACTTCAAAA TGTTTTTAAA 15120
TAAAGTAGAC ACCACTGAAC AAGGAATGTA CTGAAATGAC TTCCTTAGGG ATAGAGCTAA 15180
GGGATAATAA CTTGCACTAA ATACATTTAA ATACTTGATT CCATGAGTCA GTTTATTGTA 15240
GTTTTTGATT TCTGTAAAAT AAGAGAAACT TTTGTATTTA TTATTGAATA AGTGAATGAA 15300
GCTATTTTTA AATAAAGTTA GAAGAAAGCC AAGCTGCTGC TGTTACCTGC AGAACTAACA 15360
AACCCTGTTA CTTTGTACAG ATATGTAAAT ATTTTGAGAA AAAATACAGT ATAAAAATAG 15420
TTATTGACCA AATGCTACCA GGCTCTGCAG CAGCTCGGGG GCTTATAAAA TGTTCATAGG 15480
GATGTTACAA TATAATTTTG TGTTATAAAA TATGCCATTA TAATTATGTA ATAACCAAAA 15540
TTTCAACCTA GAGTGTTGGG GGTTTTTTGG AAACTGCAGT CTATTAGTAC TCAATGGTTT 15600
TATACACCTT ACTTCTGACA GAGCGGGGCG TATGCTACGA CTACAACTTT TATAGCTGTT 15660
TTGGTAATTT AAACTAATTT TTTCATATTA TATTGTTGCA TCCCTACTTC TTCAGTCAGG 15720
TTTTTTTGTG CTTACAATTT GTGATAACTG TGAATAACTG CTTAAAAATA CACCCAAATG 15780
GAGGCTGAAT TTTTTCTTCA GCAAAAGTAG TTTTGATTAG AACTTTGTTT CAGCCACAGA 15840
GAATCATGTA AACGTAATAG GATCATGTAG CAGAAACTTA AATCTAACCC TTTAGCCTTC 15900
CATTTAACAC AAAAATTTGA AAAAGTTAAA AAAAAAAAAA AAAGGAGATG TGATTATGCT 15960
TACAGCTGCA GGACTCTGGC AATAGGGTTT TTCGAAGATG TAATTTTAAA ATGTGTTTGT 16020
ATGAACTGTT TGTTTACATT TCTTTAATAA AAAAAACACT GTTTTGTGTT TGCTTGTAGA 16080
AACTTAATCA GCATTTTGAA CCAGGTTAGC TTTTTATTTT GTACTTAAAA TTCTGGTACT 16140
GACACTTCAC AGGCTAAGTA TAAAATGAAG TTTTGTGTGC ACAATTCAAG TGGACTGTAA 16200
ACTGTTGGTA TATTCAGTGA TGCAGTTCTG AACTTGTATA TGGCATGATG TATTTTTATC 16260
TTACAGAATA AATCAATTGT ATATATTTTT CTCTTGATAA ATAGCTGTAT GAAATTTGTT 16320
TCCTGAATAT TTTTCTTCTC TTGTACAATA TCCTGACATC CTACCAGTAT TTGTCCTACC 16380
GGGTTTTTGT TGTTTTCTGT TCTGTATAAT AGTATCTAAT GTTGGCAAAA ATTGAATTTT 16440
TTGAAGTATA CAGAGTGTTA TGGGTTTTGG AATTTGTGGA CACAGATTTA GAAGATCACC 16500
ATTTACAAAT AAAATATTTT ACATCTAT 16529
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0018 ENSP00000347325.3 Homo sapiens 99 0.0 8569
WERAM-Gog-0210 ENSGGOP00000027941.1 Gorilla gorilla 96 0.0 8241
WERAM-Paa-0005 ENSPANP00000006150.1 Papio anubis 95 0.0 8101
WERAM-Ict-0124 ENSSTOP00000012342.2 Ictidomys tridecemlineatus 87 0.0 7510
WERAM-Caf-0059 ENSCAFP00000007370.4 Canis familiaris 87 0.0 7509
WERAM-Aim-0154 ENSAMEP00000014067.1 Ailuropoda melanoleuca 88 0.0 7499
WERAM-Mum-0152 ENSMUSP00000043874.7 Mus musculus 85 0.0 7227
WERAM-Fec-0096 ENSFCAP00000008002.3 Felis catus 86 0.0 7219
WERAM-Cap-0018 ENSCPOP00000001682.2 Cavia porcellus 84 0.0 7174
WERAM-Tut-0198 ENSTTRP00000016174.1 Tursiops truncatus 85 0.0 7165
WERAM-Bot-0193 ENSBTAP00000028347.5 Bos taurus 84 0.0 7122
WERAM-Myl-0122 ENSMLUP00000010086.2 Myotis lucifugus 84 0.0 7092
WERAM-Ova-0045 ENSOARP00000005594.1 Ovis aries 82 0.0 6974
WERAM-Mup-0098 ENSMPUP00000009152.1 Mustela putorius furo 85 0.0 6919
WERAM-Chs-0076 ENSCSAP00000002864.1 Chlorocebus sabaeus 97 0.0 6434
WERAM-Nol-0046 ENSNLEP00000005663.2 Nomascus leucogenys 97 0.0 6417
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 72 0.0 6056
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 73 0.0 6039
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 72 0.0 5870
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 91 0.0 5779
WERAM-Tag-0008 ENSTGUP00000000641.1 Taeniopygia guttata 72 0.0 5691
WERAM-Orc-0115 ENSOCUP00000009766.3 Oryctolagus cuniculus 85 0.0 5618
WERAM-Mam-0056 ENSMMUP00000009467.2 Macaca mulatta 91 0.0 5601
WERAM-Poa-0169 ENSPPYP00000020408.2 Pongo abelii 98 0.0 5461
WERAM-Eqc-0189 ENSECAP00000020200.1 Equus caballus 88 0.0 5447
WERAM-Anc-0148 ENSACAP00000014142.2 Anolis carolinensis 67 0.0 5405
WERAM-Lac-0079 ENSLACP00000010253.1 Latimeria chalumnae 62 0.0 4839
WERAM-Otg-0078 ENSOGAP00000005885.2 Otolemur garnettii 86 0.0 4775
WERAM-Caj-0209 ENSCJAP00000036628.3 Callithrix jacchus 95 0.0 4759
WERAM-Dio-0019 ENSDORP00000002189.1 Dipodomys ordii 81 0.0 4714
WERAM-Sah-0130 ENSSHAP00000013860.1 Sarcophilus harrisii 74 0.0 4666
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 73 0.0 4621
WERAM-Loa-0084 ENSLAFP00000006640.4 Loxodonta africana 85 0.0 3204
WERAM-Mim-0010 ENSMICP00000000977.1 Microcebus murinus 86 0.0 3043
WERAM-Tar-0071 ENSTRUP00000014027.1 Takifugu rubripes 49 0.0 2761
WERAM-Ptv-0086 ENSPVAP00000007862.1 Pteropus vampyrus 78 0.0 2727
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 84 0.0 2404
WERAM-Sus-0158 ENSSSCP00000023447.1 Sus scrofa 85 0.0 2290
WERAM-Mod-0039 ENSMODP00000005827.3 Monodelphis domestica 70 0.0 2207
WERAM-Leo-0127 ENSLOCP00000015481.1 Lepisosteus oculatus 53 0.0 2181
WERAM-Fia-0086 ENSFALP00000007141.1 Ficedula albicollis 76 0.0 2066
WERAM-Vip-0013 ENSVPAP00000001498.1 Vicugna pacos 83 0.0 1989
WERAM-Dar-0184 ENSDARP00000115827.2 Danio rerio 47 0.0 1774
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 87 0.0 1747
WERAM-Ten-0186 ENSTNIP00000018287.1 Tetraodon nigroviridis 46 0.0 1721
WERAM-Xet-0065 ENSXETP00000021458.2 Xenopus tropicalis 64 0.0 1660
WERAM-Xim-0205 ENSXMAP00000016470.1 Xiphophorus maculatus 46 0.0 1654
WERAM-Prc-0090 ENSPCAP00000008256.1 Procavia capensis 78 0.0 1644
WERAM-Gaa-0138 ENSGACP00000017696.1 Gasterosteus aculeatus 55 0.0 1467
WERAM-Asm-0011 ENSAMXP00000001840.1 Astyanax mexicanus 53 0.0 1459
WERAM-Orla-0181 ENSORLP00000020984.1 Oryzias latipes 55 0.0 1441
WERAM-Ocp-0124 ENSOPRP00000012666.1 Ochotona princeps 83 0.0 1411
WERAM-Mae-0021 ENSMEUP00000001693.1 Macropus eugenii 68 0.0 1401
WERAM-Pof-0064 ENSPFOP00000005925.2 Poecilia formosa 48 0.0 1359
WERAM-Ran-0259 ENSRNOP00000072878.1 Rattus norvegicus 77 0.0 1094
WERAM-Orn-0120 ENSONIP00000012272.1 Oreochromis niloticus 57 0.0 1026
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 73 0.0 912
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 72 0.0 886
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 73 0.0 827
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 75 0.0 758
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 63 0.0 743
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 56 0.0 662
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 54 9e-180 630
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 51 2e-162 573
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 42 8e-151 534
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 35 3e-99 363
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 37 7e-50 199
Created Date 25-Jun-2016