WERAM Information


Tag Content
WERAM ID WERAM-Mim-0155
Ensembl Protein ID ENSMICP00000015981.1
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMICG00000017547.1 ENSMICT00000017547.1 ENSMICP00000015981.1
Status Unreviewed
Classification
Type Family E-value Score Start End
Me_Reader PHD 2.40e-25 88.2 165 4837
Organism Microcebus murinus
Domain Profile
  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + + s l+ + + +Cp++ e
ENSMICP00000015981.1 165 RCSHCTRLGA----SIPCRSpgCPRLYHFPCATASGSFLSmKTLQLLCPEHSE 213
6999933333....599******************888885557899**9975 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC ++ ge + ++ C +C + +H+ C++ +l+ + w Cp Ck
ENSMICP00000015981.1 222 RCTVC--EGPGELCdLFFCTSCGHHYHGACLDTALTARKRA-GWQCPDCK 268
6****..444545559******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSMICP00000015981.1 269 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 315
8****99999987.*************************9.*******8 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
+C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSMICP00000015981.1 1092 MCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1140
8****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSMICP00000015981.1 1142 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1188
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSMICP00000015981.1 1219 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1270
7*****99999999*****************9933333444434599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSMICP00000015981.1 4732 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4764
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSMICP00000015981.1 4791 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRAKCMFFKDKTMLCPMHK 4837
599996666665....6*9999*********98886666677678888776 PP

Protein Sequence
(Fasta)
MDSQKSPGED KDSEPAADGP TASEDPGATE PDLPNPHVGE VSVPSSGTPK LQEPPQDCSG 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG GNPGPNEAVL PSEDLSQIGF 120
PEGLTPAHLG EPGGSAHHWC AVSAVWEDPE LCGVDKAIFS GISQRCSHCT RLGASIPCRS 180
PGCPRLYHFP CATASGSFLS MKTLQLLCPE HSEGAAHLEE ARCTVCEGPG ELCDLFFCTS 240
CGHHYHGACL DTALTARKRA GWQCPDCKVC QACRKPGNDS KMLVCETCDK GYHTFCLKPP 300
MEELPAHSWK CKACRVCRAC GASSAEMNPN CERENYSLCH RCHKAQGAIS SAEQHPXXXX 360
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 420
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 480
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 540
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 600
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 660
XXPAAPPALS PLGELEYPFG AKGDSDPESP LAAPILETPI SPPPEANCTD PEPVPPMILP 720
PSPGSPMELA SPILMEPLPP RCSPLLQHSL APQNSPPALP LSLPSPLSPI GKAVEVSDEA 780
ETQEMETEKV PEPECPALEP SATSPLLSPM GDLSCPAPSP APALDDFSGL GEDTPPLDGI 840
DAPGSQPETG QTPGSLTSEP KGSPVLLDPE ELAPVTPMEV YGPECKQAAG QSSPCEEQEE 900
PCAPVAPIPP ILIKSDIVNE ISNLSQGDAS ASFPGSEPLL GSPDPEGGGS LSMELGVSTD 960
VSPVRDEGSL RLCTDSLPET DDSLLCDAGT AISGGKVEGD KGRRRSSPAR SRIKQXXXXX 1020
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1080
XXXXXXXXXX DMCVVCGSFG RGAEGHLLAC SQCSQCYHPY CVNSKITKVM LLKGWRCVEC 1140
IVCEVCGQAS DPSRLLLCDD CDISYHTYCL DPPLLTVPKG GWKCKWCVSC MQCGAASPGF 1200
HCEWQNSYTH CGPCASLVTC PICHAPYVEE DLLIQCRHCE RWMHAGCESL FTEDDVEQAA 1260
DEGFDCVSCQ PYVVKPVVPI APPELVPMKV KEPEPQYFRF EGVWLTETGM AVLRNLTMSP 1320
LHKRRQRRGR LGLPGEAGLE GSEPSDALGP DDKKDGDLDT DELLKAEGGV EHMECEIKLE 1380
GPVSPDVEPG KEETEESKKR KRKPYRPGIG GFMVRQRKSH TRVKKGPAAQ AEVLSGDGQP 1440
DEGETVMPAD LPAXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXE AFFGKELLDL 1500
SRKALIAVGV GRPSFGLGTP KPKGDGGSER KELPTLQKGD DGPDVADEES RGPESKADTP 1560
GPEDGGVRAS PVPSDSEKPG TXXXXXXXXX XXXXXXXXLP KMESKDLQQL FKDVLGSERE 1620
QHLGCGTPGL EGSRTPLQRP FLQGGLPLGN LPSSSPMDSY PGLCQSPFLD SRERGGFFSP 1680
EPGEPDSPWT GSGGTTPSTP TTPTTEGEGD GLSYNQRSLQ RWEKDEELGQ LSTISPVLYA 1740
NINFPNLKQD YPDWSSRCKQ IMKLWRKVPA ADKAPYLQKA KDNRAAHRIN KVQKQAESQI 1800
NKQTKVGDMA RKPDRPALHL RIPPQPGALG SPPPAAAPTI FIGSPTTPAG LSTSADGFLK 1860
PPAGTVPGPD SPGELFLKLP PQVPAQVPSQ DPFGLAPAYA LEPRFPTAPP TYAPYPTPTG 1920
GPTQPPMLGA SSRPGTGQPG EFHTTPPDTP RHQPSTPDPF LKPRCPSLDN LAVPESPGVG 1980
GSKASEPLLS PPPFGESRKA LEVKKEELGA SSPSYGPPNL GFVDSPSSGP HLGGLELKAP 2040
DVFKAPLTPR ASQVEPQSPG LGLRPQESPQ ALAPSPSHRN LRPGPYPDPX XXXXXXXXXX 2100
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2160
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2220
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2280
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXYL 2340
SHGAPQRSGI TSPVEKREDP GAGMGSSLAT PELSGTQDPG MSNLSQTELE KQRQRQRLRE 2400
LLIRQQIQRN TLRQEKETAA AAAGAVGPPG NWGAEPSSPA FEQLSRAQTP FVGTQDKSSL 2460
VGLPPSKMSG PLMGPGAFSS DDRLSRPPPP ATPSSMDVNS RQLVGGSQAF YQRAPYPGSL 2520
PLRQQQQLWQ QQQATAATSM RLAMSARFPS TPGPELSRQA LSSPLAGIPT RLPGPGEPVP 2580
GPAGPAQFIE LRHNVQKGLG PGGAPFPGQG LPPRPRFYPV SEDPHRLAPE GLRSLAVSGL 2640
PPQKPSAPPA PELNNSLHPT PHTKGPTLPT GLELVSRPPS STELGRPPPL ALEAGKLPCE 2700
DPELDDDFDA HKALEDDEEL AHLGLGVDVA KGDDELGTLE NLETNDPHLD DLLNGDEFDL 2760
LAYTDPELDT GDKKDIFNEH LRLVESANEK AEREALLRGV EPGPLGPEER PPPATDASES 2820
RLASVLPEVK PKVEEGGRHP SPCQFTITTP KVEPAPAATS LGLGLKPGQS MTGSRDSRMG 2880
TGPFSSSGHT AEKGPYGATG GPPAHLLTPS PLSGPGGSSL LEKFELESGA LTLPGGHTAS 2940
GDELDKMESS LVASELPLLI EDLLEHEKKE LQKKQQLSAQ LQPAQQQQQQ QQHSLLSTSG 3000
AAQAMPLPHE GSSPSLAGPQ QQLALGLGGA RQSGLAQPLM PTQPPGHALQ QRLTPSMAMV 3060
SNQGHMLSGQ HGGQAGLVPQ QNPQPVLSQK PMGTMPPSMC MKPQQLAMQQ QQLANSFFPD 3120
TDLDKFAAED IIDPIAKAKM VALKGIKKVM AQGSIGVAPG INRQQVSLLA QRLSGGAGSD 3180
LQNHVAAGSG QERSAGDPSQ PRPNPPTFAQ GVINEADQRQ YEEWLFHTQQ LLQMQLKVLE 3240
EQIGVHRKSR KALCAKQRTA KKAGREFPEA DAEKLKLVTE QQSKIQKQLD QVRKQQKEHT 3300
NLMAEYRNKQ QQQQQQQQQQ QQQQQQHSAV LALSPSQSPR LLTKLPGQLL PGHGLQPPQG 3360
PPGGQAGGLR LPPGSMALPG QPGGPFLNTA LAQQQQQQHS GGGGSLAGPS GGFFPGNLAL 3420
RSLGPDSRLL QERQLQLQQQ RMQLAQKLQQ QQQQQHLLGQ VAIQQQQQQG PGVQANQALG 3480
PKPQGLLPPG SHQGLLVQQL SPQPPQGPQG MLGPAQVAVL QQQHPGGLGP QGSHRQVLMT 3540
QSRVLSSPQL AQQGQGLMGH RLVTAQQQQQ QQQHQQQGSM AGLSHLQQGL IPHSGQPKLS 3600
AQPMGSLQQQ QLQQQQLQQQ QLQQQQQQQQ QLQQQQQLQQ QQQLQQQQQH QQQQQLQQRQ 3660
QQQQQQQLQQ QQQLQQQQFQ QQQHKQQMVL LDQSRILMSL QQQKQIKXXX XXXXXXXXXX 3720
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXKPSL 3780
SGDSQLLLVQ PQAQPQPNSL QLQPPLRLPG QQQQQVNLLH TAGGGSHGQL GSGSSSEASS 3840
MPHLLAQPSV SLGEQPGPMT QNLLGPQQPP GLERPMQNNI GPQPPKQGPV PQSGQGLPGV 3900
GVMPTVGQLR AQLQGVLAKN PQLRHLSPQQ QQQLQALLMQ RQLQQSQAVR QTPPYQEPGT 3960
QPSPLQGLLG CQPQIGGFPG SQTGPLQELG AGPRPQGPPR LPAPQGALST GPVLGPVHPT 4020
PPPSSPQEPK RPSQLPSPSS QLPTEAQLTP THPGTPKSQG PPLELPPGRV SPAAAQLADT 4080
LFGKGLGTWD PPDNLAEVQK PVQSSLVPGH LEQVVNGQAV PEPPQLSIKQ EPREEPCALG 4140
AQAVKREANG EPIGAPGTSN HLLLAGPRSE AGHLLLQKLL RAKNVQLSSG RGSEGLRAEI 4200
NGHIDSKLAG LEQKLQGTPS NKEDAAARKP LTPKPKRVQK ASDRLVSSRK KLRKEDGVRA 4260
SEALLKQLKQ ELSLLPLTEP AITANFSLFA PFGSGCPVSG QNQLRGAFGS GVLSTGPDYY 4320
SQLLTKNNLS NPPTPPSSLP PTPPPSVQQK MVNGVTPSEE LGEHPKDAAS ARDTDGALRD 4380
ASEVKSLDLL AALPTPPHNQ TEDVRMESDE DSDSPDSIVP ASSPESILGE EAPRFPHLGS 4440
GRWEQDDRAL SPVIPIIPRA SIPXXXXXXX XXXXXXXVPG KLPATSWERA NGSEVSVMLT 4500
VSAAAAKNLN GVMVAVAELL SMKIPNSYEV LFPESPARAG IEPKKGEAEG PDGKEKGLGG 4560
KSPDAGPDWL KQFDAVLPGY ESPAPEPPTQ HSYTYNVSNL DVRQLSAPPP EEPSPPPSPL 4620
APSPASPPAE PLVELPPAEP SAEPPIPSPL PLASSPESAR PKPRARPPEE GEDSRPPRLK 4680
KWKGVRWKRL RLLLTIQKGS GRQEDEREVA EFMEQLGTAL RPDKVPRDMR RCCFCHEEGD 4740
GATDGPARLL NLDLDLWVHL NCALWSTEVY ETQGGALMNV EVALHRGLLT KCSLCQRTGA 4800
TSSCNRMRCP NVYHFACAIR AKCMFFKDKT MLCPMHKIKG PCEQELSSFA VFRRVYIERD 4860
EVKQIASIIQ RGERLHMFRV GGLVFHAIGQ LLPHQMADFH SATALYPVGY EATRIYWSLR 4920
TNNRRCCYRC SIGENNGRPE FVIKVIEQGL EDLVFTDASP QAVWNRIIEP VAAMRKEADM 4980
LRLFPEYLKG EELFGLTVHA VLRIAESLPG VESCQNYLFR YGRHPLMELP LMINPTGCAR 5040
SEPKILTHYK RXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 5100
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXNRGI YMFRINNEHV 5160
IDATLTGGPA RYINHSCAPN CVAEVVTFDK EDKIIIISSR RIPKGEELTY DYQFDFEDDQ 5220
HKIPCHCGAW NCRKWMN 5237
Nucleotide Sequence
(Fasta)
ATGGACAGCC AGAAGTCGCC TGGTGAGGAT AAAGATTCAG AACCAGCAGC TGATGGACCC 60
ACAGCCTCTG AGGACCCAGG TGCCACTGAG CCAGACCTTC CCAACCCACA TGTGGGAGAG 120
GTCTCTGTCC CCAGTTCTGG GACTCCCAAG CTTCAGGAGC CTCCCCAGGA CTGCAGTGGG 180
GGTCCAGTGC GGCGTTGTGC TCTCTGTAAC TGCGGGGAGC CCAGTCTGCA TGGGCAGCGG 240
GAGCTACGGC GCTTTGAGTT GCCATTTGAT TGGCCCCGGT GTCCAGTGGT GTCCCCTGGG 300
GGGAACCCAG GGCCCAATGA GGCAGTGCTG CCCAGTGAGG ACCTATCACA GATTGGTTTC 360
CCTGAGGGCC TTACACCTGC CCACCTAGGA GAACCTGGAG GGTCCGCTCA CCATTGGTGT 420
GCTGTGTCAG CAGTATGGGA GGACCCAGAA CTATGTGGTG TGGACAAGGC CATCTTCTCA 480
GGGATCTCAC AGCGCTGTTC CCACTGCACC AGGCTCGGTG CCTCCATCCC TTGCCGCTCA 540
CCTGGATGTC CACGGCTTTA CCACTTCCCC TGCGCGACTG CCAGCGGTTC CTTCTTATCC 600
ATGAAAACAC TGCAGCTACT ATGCCCGGAG CACAGTGAGG GGGCTGCACA TCTGGAGGAG 660
GCTCGCTGTA CAGTATGTGA GGGGCCGGGG GAATTGTGCG ACCTGTTCTT CTGTACCAGC 720
TGTGGGCATC ACTATCACGG GGCCTGCCTG GACACTGCTC TGACTGCCCG AAAGCGTGCT 780
GGCTGGCAGT GCCCTGATTG CAAAGTGTGC CAAGCCTGCA GGAAACCTGG GAACGATTCT 840
AAGATGTTGG TCTGTGAAAC GTGTGACAAA GGATATCATA CCTTCTGCCT AAAACCACCC 900
ATGGAGGAAC TGCCTGCTCA CTCTTGGAAG TGCAAGGCAT GCCGGGTATG CCGGGCCTGT 960
GGGGCAAGCT CAGCAGAGAT GAATCCCAAC TGTGAGAGGG AGAACTACTC GCTCTGTCAC 1020
CGCTGTCACA AAGCCCAGGG AGCTATCAGT TCTGCGGAAC AGCATCCGNN NNNNNNNNNN 1080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1260
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1320
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1380
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1440
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1500
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1560
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1620
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1680
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1740
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1800
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1860
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1920
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1980
NNNNNTCCTG CGGCCCCGCC AGCCCTGTCT CCTTTGGGGG AGTTAGAGTA CCCCTTTGGT 2040
GCCAAAGGGG ACAGTGACCC TGAGTCACCA TTGGCTGCCC CCATTCTGGA GACACCCATC 2100
AGTCCTCCAC CAGAAGCTAA CTGCACTGAC CCCGAGCCTG TCCCCCCTAT GATCCTTCCC 2160
CCATCTCCAG GCTCCCCAAT GGAACTGGCT TCTCCCATCC TGATGGAGCC TCTTCCCCCA 2220
CGGTGTTCTC CACTCCTTCA GCATTCCCTG GCTCCTCAGA ACTCCCCTCC TGCTCTGCCC 2280
CTGTCCCTGC CCTCCCCCTT GAGTCCCATA GGGAAAGCAG TAGAAGTCTC AGATGAGGCT 2340
GAGACACAGG AGATGGAGAC TGAGAAAGTC CCAGAACCCG AGTGTCCAGC CTTAGAACCC 2400
AGTGCCACCA GTCCTCTCCT CTCCCCTATG GGGGACCTGT CCTGCCCTGC CCCCAGCCCT 2460
GCCCCAGCCC TGGATGACTT TTCTGGCTTG GGGGAAGACA CACCCCCCCT GGATGGGATT 2520
GATGCTCCTG GTTCACAGCC AGAGACTGGA CAGACTCCTG GCAGTTTGAC TAGTGAACCT 2580
AAAGGTTCCC CTGTGCTCCT GGACCCTGAG GAGCTGGCCC CTGTGACCCC TATGGAGGTC 2640
TATGGCCCCG AATGCAAGCA GGCAGCAGGG CAGAGCTCAC CCTGTGAAGA ACAGGAGGAG 2700
CCATGTGCAC CAGTGGCCCC CATACCACCC ATTCTCATCA AATCCGACAT TGTAAATGAG 2760
ATCTCTAATC TGAGCCAGGG CGATGCCAGT GCCAGTTTTC CTGGCTCAGA GCCCCTGCTG 2820
GGCTCTCCTG ACCCTGAGGG GGGCGGCTCC CTGTCCATGG AGCTAGGGGT CTCTACAGAT 2880
GTTAGTCCAG TCCGAGATGA GGGTTCCCTA CGGCTCTGTA CTGACTCGCT GCCAGAGACT 2940
GACGACTCAC TGTTGTGTGA TGCTGGGACA GCTATCAGCG GAGGCAAAGT CGAGGGGGAT 3000
AAGGGGAGGC GGCGCAGTTC CCCAGCCCGT TCCCGCATCA AACAGNNNNN NNNNNNNNNN 3060
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3120
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN GACATGTGTG TAGTATGTGG CAGCTTTGGC 3300
CGTGGGGCAG AGGGCCACCT TCTTGCCTGT TCGCAGTGCT CTCAGTGCTA CCACCCTTAC 3360
TGTGTCAACA GCAAGATCAC CAAGGTGATG CTGCTGAAAG GTTGGCGTTG TGTGGAGTGT 3420
ATCGTGTGTG AGGTGTGCGG TCAGGCCTCT GACCCCTCAC GCCTGCTGCT CTGTGATGAC 3480
TGCGACATTA GCTACCACAC ATACTGCCTG GACCCACCAC TGCTCACTGT GCCTAAGGGT 3540
GGCTGGAAGT GCAAGTGGTG TGTGTCTTGT ATGCAGTGTG GGGCCGCCTC CCCTGGCTTC 3600
CACTGTGAAT GGCAGAATAG TTACACACAC TGTGGACCCT GCGCCAGCCT GGTGACCTGT 3660
CCTATCTGTC ATGCTCCTTA TGTGGAAGAG GACCTATTAA TCCAGTGCCG CCACTGTGAA 3720
CGGTGGATGC ATGCAGGTTG TGAGAGCCTC TTCACAGAGG ATGATGTGGA GCAGGCAGCC 3780
GATGAGGGCT TTGACTGCGT CTCCTGCCAG CCCTACGTAG TAAAACCTGT GGTGCCCATT 3840
GCACCTCCAG AGTTGGTGCC TATGAAGGTG AAAGAGCCAG AGCCCCAGTA CTTTCGCTTT 3900
GAGGGTGTGT GGCTGACAGA AACTGGCATG GCTGTGCTGC GTAACTTGAC CATGTCACCA 3960
CTGCACAAGC GGCGCCAGCG GCGAGGACGG CTTGGCCTCC CAGGGGAGGC AGGGCTGGAG 4020
GGTTCTGAGC CCTCAGATGC CCTTGGCCCG GATGACAAGA AGGATGGGGA CCTGGACACC 4080
GATGAGCTGC TCAAGGCTGA AGGTGGTGTG GAGCACATGG AGTGTGAAAT TAAACTGGAG 4140
GGCCCTGTCA GCCCCGATGT GGAGCCTGGC AAAGAGGAGA CCGAGGAAAG CAAAAAACGC 4200
AAGCGCAAAC CCTATCGGCC CGGCATTGGT GGTTTCATGG TGCGACAGCG GAAATCCCAC 4260
ACGCGTGTGA AAAAGGGGCC TGCTGCACAG GCGGAGGTGT TGAGTGGGGA TGGGCAGCCC 4320
GACGAGGGTG AGACGGTGAT GCCTGCTGAT CTGCCTGCNN NNNNNNNNNN NNNNNNNNNN 4380
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4440
NNNNNNNNNN NNNNNNNNNN NNNNNNNGAA GCCTTTTTTG GGAAGGAGCT GCTGGACCTG 4500
AGCCGTAAGG CCCTTATTGC AGTTGGGGTG GGCCGGCCAA GCTTTGGACT AGGAACCCCA 4560
AAACCCAAGG GGGATGGAGG CTCAGAAAGG AAGGAGCTTC CTACCTTGCA GAAAGGAGAT 4620
GATGGTCCAG ATGTTGCAGA TGAAGAATCC CGTGGCCCTG AGAGCAAGGC TGACACACCA 4680
GGACCTGAGG ATGGGGGTGT AAGGGCGTCC CCAGTGCCCA GTGACTCTGA GAAGCCTGGT 4740
ACNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNAACTGCCC 4800
AAAATGGAAT CCAAGGACCT GCAGCAGCTA TTCAAGGATG TTCTGGGTTC CGAACGAGAG 4860
CAGCACCTGG GTTGTGGAAC CCCTGGCCTA GAAGGCAGCC GTACACCACT GCAGAGGCCC 4920
TTTCTTCAAG GTGGACTCCC TTTGGGTAAT CTCCCCTCCA GCAGCCCAAT GGACTCCTAC 4980
CCGGGCCTCT GCCAGTCCCC GTTCCTGGAT TCTAGGGAGC GCGGGGGCTT CTTTAGCCCG 5040
GAACCCGGTG AACCAGACAG CCCCTGGACA GGCTCAGGGG GCACCACGCC CTCCACCCCC 5100
ACCACCCCCA CCACAGAGGG TGAGGGCGAC GGGCTCTCCT ATAACCAGCG AAGTCTTCAG 5160
CGCTGGGAGA AGGATGAGGA GTTGGGCCAG CTCTCCACCA TCTCACCCGT ACTCTATGCC 5220
AACATTAACT TTCCTAATCT CAAGCAAGAT TACCCAGACT GGTCTAGCCG TTGCAAACAA 5280
ATCATGAAGC TGTGGAGAAA AGTTCCAGCT GCTGACAAAG CCCCCTACCT GCAAAAGGCC 5340
AAAGATAACC GGGCAGCTCA CCGCATCAAC AAGGTACAGA AGCAGGCTGA GAGCCAGATC 5400
AACAAGCAGA CCAAGGTGGG CGACATGGCC CGTAAGCCTG ACCGACCAGC CCTACATCTC 5460
CGCATTCCCC CCCAGCCAGG GGCACTGGGC AGTCCGCCCC CTGCTGCTGC CCCCACCATT 5520
TTCATTGGCA GCCCCACTAC CCCCGCCGGC TTGTCTACCT CTGCGGACGG GTTCCTGAAG 5580
CCGCCGGCGG GCACGGTGCC TGGCCCGGAC TCGCCTGGTG AGCTCTTCCT CAAGCTCCCA 5640
CCCCAGGTGC CCGCCCAAGT GCCTTCGCAG GACCCCTTTG GACTGGCCCC CGCCTACGCC 5700
CTGGAGCCCC GCTTCCCCAC GGCACCACCC ACTTACGCTC CCTATCCTAC TCCGACTGGG 5760
GGCCCCACGC AGCCCCCGAT GCTGGGCGCC TCATCTCGTC CTGGGACTGG CCAGCCAGGA 5820
GAATTCCACA CTACCCCACC TGACACGCCC CGACACCAAC CCTCCACACC TGACCCCTTC 5880
CTCAAACCCC GCTGCCCCTC ACTGGACAAC CTGGCTGTGC CTGAGAGCCC AGGCGTAGGG 5940
GGAAGCAAGG CTTCTGAGCC CCTGCTCTCA CCCCCACCTT TTGGTGAGTC CCGGAAGGCC 6000
CTAGAGGTGA AGAAGGAAGA GCTCGGGGCA TCCTCTCCTA GCTATGGGCC CCCAAACCTG 6060
GGTTTTGTTG ACTCACCCTC CTCAGGCCCC CACCTGGGTG GCCTGGAGTT GAAGGCACCT 6120
GATGTCTTCA AAGCTCCCTT GACCCCTCGG GCATCTCAGG TAGAGCCCCA GAGCCCGGGC 6180
TTGGGCCTTC GGCCTCAGGA GTCCCCTCAG GCTTTGGCCC CTTCTCCTAG CCACCGGAAT 6240
CTTCGCCCTG GCCCCTACCC TGACCCNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6780
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNCCTACCTG 7020
TCCCATGGAG CCCCACAGCG ATCAGGCATC ACCTCTCCTG TCGAAAAGCG AGAAGACCCG 7080
GGGGCTGGAA TGGGTAGCTC TTTGGCGACA CCTGAACTCT CAGGTACCCA GGACCCAGGC 7140
ATGTCCAACC TCAGCCAGAC AGAGCTGGAG AAGCAACGGC AGCGCCAGCG ACTACGGGAG 7200
CTGCTGATTC GGCAGCAGAT CCAGCGCAAC ACCCTGCGGC AGGAGAAGGA AACAGCTGCA 7260
GCAGCTGCAG GAGCAGTGGG GCCTCCAGGC AACTGGGGTG CTGAGCCCAG CAGCCCTGCC 7320
TTTGAGCAGC TGAGTCGAGC CCAGACCCCC TTCGTTGGGA CCCAGGACAA GAGCAGCCTT 7380
GTGGGGCTGC CCCCAAGCAA GATGAGTGGC CCCCTCATGG GGCCAGGGGC CTTCTCTAGT 7440
GATGACCGAC TCTCCCGGCC ACCTCCACCG GCCACACCTT CCTCCATGGA TGTGAACAGT 7500
CGGCAACTGG TAGGAGGCTC CCAAGCTTTC TATCAGCGAG CACCCTATCC TGGGTCCCTG 7560
CCCTTACGGC AACAGCAACA ACTGTGGCAG CAGCAACAGG CAACAGCAGC AACCTCCATG 7620
CGACTTGCCA TGTCTGCTCG CTTTCCATCA ACTCCTGGAC CTGAACTTAG CCGCCAAGCT 7680
CTAAGTTCCC CCTTGGCGGG AATTCCCACC CGCCTGCCAG GCCCTGGTGA GCCAGTGCCT 7740
GGTCCAGCTG GTCCTGCTCA GTTCATTGAG TTGCGGCACA ATGTACAGAA AGGACTAGGA 7800
CCTGGGGGGG CTCCATTTCC AGGTCAGGGC CTGCCTCCAA GACCCCGTTT TTATCCTGTA 7860
AGTGAAGACC CCCACCGACT GGCTCCTGAA GGACTTCGCA GCCTGGCAGT ATCAGGCCTT 7920
CCACCACAGA AACCCTCAGC CCCACCAGCC CCTGAATTGA ACAACAGCCT CCATCCGACT 7980
CCCCACACCA AGGGTCCTAC CCTGCCAACT GGCCTGGAGC TGGTCAGCCG GCCCCCCTCG 8040
AGCACTGAGC TTGGCCGCCC CCCTCCTCTG GCCTTGGAAG CTGGGAAGTT ACCCTGTGAG 8100
GATCCTGAGC TGGATGATGA CTTTGATGCC CACAAGGCCC TAGAGGATGA TGAAGAGCTT 8160
GCTCACCTGG GTCTGGGTGT AGATGTGGCC AAGGGTGATG ATGAGCTGGG CACTCTGGAA 8220
AACCTGGAGA CCAATGATCC CCACCTGGAT GACCTGCTCA ATGGAGATGA GTTTGACCTC 8280
CTGGCATATA CTGACCCTGA GCTGGACACT GGGGACAAGA AGGACATCTT CAATGAGCAC 8340
CTGAGGCTGG TGGAATCTGC TAATGAGAAG GCTGAGCGGG AGGCCCTGCT GAGGGGAGTG 8400
GAGCCAGGCC CCTTAGGCCC TGAGGAGCGC CCTCCCCCTG CCACTGATGC CTCTGAGTCC 8460
CGCCTGGCAT CTGTGCTCCC TGAGGTGAAG CCCAAGGTAG AGGAGGGTGG ACGCCATCCT 8520
TCCCCTTGCC AGTTCACCAT TACCACCCCC AAGGTAGAGC CAGCACCTGC TGCCACTTCC 8580
CTTGGCCTGG GGTTAAAACC AGGACAAAGC ATGACGGGCA GCCGGGACTC CCGGATGGGC 8640
ACAGGGCCAT TTTCCAGCAG TGGGCACACA GCTGAGAAGG GCCCCTATGG AGCTACAGGA 8700
GGACCACCAG CTCACCTGCT GACCCCCAGC CCGCTTAGTG GCCCAGGAGG ATCATCTTTG 8760
CTGGAAAAGT TTGAGCTGGA GAGTGGGGCA CTGACTTTGC CTGGTGGACA TACAGCATCT 8820
GGGGATGAGC TGGACAAGAT GGAGAGCTCA CTGGTGGCCA GTGAGTTGCC CCTGCTCATT 8880
GAGGACCTGT TGGAGCATGA GAAGAAGGAG CTGCAGAAGA AGCAGCAGCT TTCAGCACAG 8940
CTGCAGCCTG CCCAGCAGCA GCAGCAGCAG CAGCAGCATT CCCTATTGTC CACATCAGGT 9000
GCTGCCCAGG CCATGCCTTT GCCACATGAG GGCTCTTCTC CCAGTTTGGC TGGACCTCAA 9060
CAGCAGCTTG CCCTGGGTCT TGGAGGTGCC CGACAGTCAG GCTTGGCCCA ACCACTGATG 9120
CCTACCCAGC CACCAGGTCA TGCCCTCCAG CAGCGCCTGA CCCCATCCAT GGCCATGGTG 9180
TCCAATCAAG GGCATATGCT AAGTGGACAG CACGGGGGAC AGGCAGGCTT GGTACCCCAG 9240
CAGAACCCAC AGCCAGTGCT ATCACAGAAG CCCATGGGCA CCATGCCACC TTCCATGTGT 9300
ATGAAGCCCC AGCAACTGGC AATGCAGCAG CAGCAGCTGG CTAACAGCTT CTTCCCAGAT 9360
ACAGACCTGG ACAAATTTGC TGCAGAAGAT ATCATTGATC CCATTGCAAA GGCCAAGATG 9420
GTGGCTTTGA AAGGCATCAA GAAAGTGATG GCTCAGGGCA GCATTGGGGT GGCACCTGGT 9480
ATAAACAGGC AGCAAGTATC TCTGTTAGCC CAGAGGCTCT CTGGGGGAGC TGGCAGTGAT 9540
CTGCAGAACC ATGTGGCAGC TGGGAGTGGC CAGGAGCGGA GTGCTGGTGA CCCCTCCCAG 9600
CCTCGTCCCA ACCCGCCCAC TTTTGCCCAG GGAGTGATCA ATGAGGCTGA CCAGCGGCAA 9660
TATGAAGAGT GGCTGTTTCA TACCCAGCAG CTCCTACAGA TGCAGCTGAA GGTGCTAGAG 9720
GAGCAGATTG GTGTGCACCG CAAGTCCCGG AAGGCTCTGT GTGCCAAGCA GCGCACTGCC 9780
AAAAAGGCTG GTCGTGAGTT CCCAGAAGCT GATGCTGAGA AGCTTAAACT GGTTACAGAA 9840
CAGCAGAGCA AGATCCAGAA ACAGCTGGAT CAGGTCCGGA AACAGCAGAA GGAGCACACT 9900
AATCTCATGG CAGAATATCG GAATAAGCAG CAGCAACAGC AGCAGCAGCA GCAGCAGCAG 9960
CAACAACAGC AGCAACAGCA TTCAGCGGTG TTGGCTCTCA GCCCTTCTCA GAGTCCCCGG 10020
CTACTTACCA AGCTCCCTGG CCAGCTACTA CCTGGCCATG GGCTGCAACC ACCACAGGGA 10080
CCTCCGGGTG GACAAGCTGG AGGTCTTCGC CTGCCCCCTG GGAGTATGGC ACTACCTGGA 10140
CAGCCTGGTG GCCCCTTCCT CAACACAGCT CTGGCCCAGC AGCAACAACA GCAACATTCT 10200
GGTGGTGGTG GATCCCTAGC TGGCCCCTCA GGAGGGTTCT TCCCTGGCAA CCTTGCTCTT 10260
CGAAGCCTTG GACCTGATTC AAGGCTTTTA CAGGAAAGGC AGCTACAGCT GCAGCAGCAA 10320
CGTATGCAGC TGGCCCAGAA ACTGCAGCAA CAGCAGCAGC AGCAGCACCT CCTAGGACAG 10380
GTGGCAATCC AGCAGCAACA GCAGCAGGGT CCTGGGGTAC AGGCAAACCA GGCTCTGGGT 10440
CCCAAGCCCC AGGGGCTTCT GCCTCCTGGC AGTCATCAGG GCCTCCTAGT CCAGCAGTTG 10500
TCCCCCCAGC CGCCCCAGGG GCCCCAGGGC ATGCTGGGCC CTGCCCAGGT GGCTGTGTTG 10560
CAGCAGCAGC ACCCTGGAGG TTTGGGCCCC CAGGGCTCTC ACAGGCAGGT GCTTATGACC 10620
CAGTCCCGGG TACTGAGTTC CCCCCAGCTG GCACAGCAGG GTCAGGGCCT TATGGGACAT 10680
CGGCTGGTCA CAGCTCAGCA GCAGCAGCAG CAGCAACAAC ACCAACAGCA GGGGTCCATG 10740
GCAGGGCTGT CTCATCTGCA GCAAGGTCTG ATACCACACA GTGGGCAGCC AAAACTCAGT 10800
GCTCAGCCCA TGGGCTCTTT ACAGCAGCAA CAGCTTCAGC AGCAGCAGCT ACAGCAACAG 10860
CAGCTACAGC AGCAGCAGCA GCAACAGCAA CAGCTCCAAC AGCAGCAGCA GCTCCAACAG 10920
CAGCAGCAGC TCCAACAGCA GCAGCAGCAC CAACAGCAGC AACAGCTGCA ACAGAGGCAA 10980
CAGCAACAGC AGCAGCAGCA GCTTCAGCAG CAACAGCAGC TACAGCAGCA GCAATTTCAG 11040
CAGCAGCAGC ACAAGCAGCA GATGGTCCTC TTGGACCAGA GTCGAATTTT AATGTCTCTT 11100
CAACAACAAA AGCAGATAAA ANNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11160
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11220
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11280
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNAA ACCCTCACTC 11340
TCTGGGGACT CACAACTCCT GCTTGTCCAA CCCCAGGCCC AGCCTCAGCC CAATTCTCTG 11400
CAGCTTCAGC CACCTCTGAG GCTTCCAGGA CAGCAGCAAC AGCAAGTTAA CTTGCTCCAC 11460
ACAGCAGGTG GAGGAAGCCA TGGACAACTA GGCAGTGGAT CATCTTCTGA GGCTTCATCT 11520
ATGCCCCACC TGCTGGCCCA ACCCTCTGTT TCCCTAGGAG AGCAGCCTGG GCCCATGACG 11580
CAGAACCTTC TGGGCCCCCA GCAACCCCCT GGGCTAGAGC GGCCCATGCA AAATAATATA 11640
GGGCCACAAC CTCCTAAACA AGGACCTGTC CCCCAGTCTG GGCAGGGTCT GCCTGGGGTT 11700
GGAGTTATGC CTACAGTGGG TCAGCTTCGA GCACAGCTCC AAGGAGTTCT GGCCAAAAAC 11760
CCACAGCTGC GGCACTTGAG TCCTCAACAG CAACAGCAGC TACAGGCACT TCTCATGCAG 11820
CGGCAGCTGC AGCAAAGTCA GGCAGTACGC CAGACCCCAC CCTACCAGGA GCCTGGGACC 11880
CAGCCCTCTC CTCTCCAGGG ACTCCTGGGC TGCCAACCCC AAATTGGGGG CTTCCCTGGA 11940
TCCCAGACGG GCCCCCTCCA GGAGCTAGGG GCAGGACCTC GACCTCAGGG CCCACCCCGG 12000
CTCCCTGCCC CACAAGGAGC CTTATCCACA GGACCAGTCC TTGGCCCTGT CCATCCCACA 12060
CCTCCACCAT CCAGCCCCCA AGAGCCAAAG AGACCTTCAC AGTTACCTTC CCCCAGCTCT 12120
CAGCTCCCCA CTGAGGCCCA ACTCACCCCC ACCCATCCAG GCACCCCAAA GTCCCAGGGG 12180
CCACCCTTGG AGCTGCCTCC TGGGAGGGTC TCACCTGCTG CTGCCCAGCT TGCAGATACC 12240
TTGTTTGGCA AGGGGCTGGG AACTTGGGAC CCCCCAGACA ACCTAGCAGA AGTCCAGAAG 12300
CCAGTGCAGA GCAGCCTGGT ACCTGGGCAT CTGGAACAGG TGGTGAATGG ACAGGCGGTG 12360
CCTGAGCCAC CCCAACTAAG CATCAAGCAG GAACCTCGGG AAGAGCCATG TGCCCTGGGA 12420
GCCCAGGCAG TGAAGAGGGA GGCCAATGGG GAACCAATAG GGGCACCAGG TACCAGCAAC 12480
CACCTCCTGC TGGCAGGCCC CCGCTCAGAG GCTGGACATC TGCTCTTGCA GAAGCTTCTA 12540
CGGGCAAAGA ATGTGCAACT CAGCAGTGGG CGGGGGTCTG AGGGGCTTCG AGCTGAGATC 12600
AACGGACACA TTGACAGCAA ATTGGCTGGG CTGGAGCAGA AACTACAGGG TACCCCCAGC 12660
AACAAGGAGG ATGCAGCAGC AAGGAAACCT TTGACACCGA AGCCCAAGCG GGTACAGAAG 12720
GCAAGCGACA GGTTGGTGAG CTCCCGAAAG AAGCTGCGGA AGGAGGACGG GGTCAGGGCC 12780
AGCGAGGCCT TGCTGAAACA GCTGAAACAG GAGCTGTCCC TGCTGCCCCT AACGGAGCCT 12840
GCTATCACCG CCAATTTTAG CCTCTTTGCT CCCTTTGGCA GTGGTTGCCC AGTCAGTGGG 12900
CAGAACCAGC TGAGGGGGGC CTTTGGAAGT GGGGTGCTGT CCACTGGCCC TGACTACTAT 12960
TCCCAGCTGC TTACCAAGAA TAACCTGAGT AACCCGCCGA CACCACCCTC GTCGCTGCCT 13020
CCCACCCCAC CCCCATCGGT GCAGCAGAAG ATGGTAAATG GTGTCACTCC ATCTGAAGAG 13080
CTGGGGGAGC ACCCCAAGGA TGCCGCCTCT GCCCGGGATA CTGATGGGGC ACTAAGGGAT 13140
GCTTCAGAGG TGAAGAGTCT AGACCTGCTG GCTGCCTTGC CTACACCCCC TCACAATCAG 13200
ACTGAGGATG TCAGGATGGA GAGTGATGAG GATAGTGATT CTCCTGATAG CATTGTGCCA 13260
GCTTCATCCC CTGAGAGCAT CCTGGGGGAA GAGGCCCCTC GTTTCCCTCA TCTGGGCTCA 13320
GGCCGGTGGG AGCAGGATGA CCGAGCCCTC TCCCCCGTCA TCCCCATCAT TCCTCGGGCC 13380
AGCATCCCAG NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA GGTCCCTGGA 13440
AAGCTGCCTG CCACATCTTG GGAAAGGGCA AACGGAAGTG AGGTATCAGT CATGCTGACA 13500
GTCTCAGCTG CTGCAGCCAA GAACCTGAAT GGTGTGATGG TGGCAGTGGC AGAGCTGCTG 13560
AGCATGAAGA TCCCCAACTC TTATGAGGTG CTGTTCCCAG AGAGCCCCGC CCGGGCAGGC 13620
ATTGAGCCTA AGAAGGGGGA AGCTGAGGGC CCTGATGGGA AAGAAAAGGG TCTGGGAGGT 13680
AAGAGCCCAG ACGCTGGCCC TGATTGGCTG AAGCAGTTTG ATGCAGTGTT GCCTGGCTAT 13740
GAGAGCCCTG CCCCAGAGCC GCCCACCCAG CACAGCTACA CCTACAATGT CTCCAATCTG 13800
GACGTGCGAC AGCTCTCAGC CCCACCTCCC GAAGAACCCT CCCCACCCCC ATCCCCCTTG 13860
GCACCTTCTC CTGCCAGTCC CCCTGCTGAG CCCTTGGTTG AACTTCCTCC GGCTGAACCG 13920
TCAGCTGAGC CACCCATCCC CTCGCCTCTG CCTCTGGCCT CATCCCCTGA ATCAGCCCGG 13980
CCCAAACCCC GAGCCCGGCC TCCTGAAGAA GGTGAAGATT CCCGTCCCCC TCGCCTCAAA 14040
AAATGGAAGG GAGTACGCTG GAAGCGGCTG CGGCTGCTGC TGACCATCCA GAAGGGTAGT 14100
GGACGGCAGG AGGATGAGCG GGAAGTGGCA GAGTTCATGG AGCAGCTTGG CACAGCTTTG 14160
CGACCCGACA AGGTGCCTCG AGACATGCGG CGCTGCTGCT TCTGTCATGA GGAGGGTGAT 14220
GGGGCCACTG ACGGGCCTGC CCGCCTGCTG AACCTGGACC TGGATCTGTG GGTGCACCTC 14280
AACTGTGCCC TGTGGTCCAC AGAGGTGTAT GAGACCCAGG GTGGAGCACT GATGAATGTG 14340
GAGGTTGCCC TGCACCGAGG ACTGCTAACC AAGTGCTCCC TGTGCCAGCG AACTGGTGCC 14400
ACCAGCAGCT GCAATCGCAT GCGTTGCCCC AACGTCTACC ATTTTGCTTG TGCCATCCGC 14460
GCCAAGTGCA TGTTCTTCAA GGACAAGACC ATGCTTTGTC CAATGCATAA GATCAAGGGG 14520
CCCTGTGAGC AGGAGCTGAG CTCTTTCGCT GTTTTCCGGC GGGTTTACAT TGAGCGGGAC 14580
GAGGTGAAGC AAATCGCCAG CATCATTCAG CGGGGAGAAC GGCTACACAT GTTCCGAGTG 14640
GGGGGCCTTG TGTTTCATGC CATCGGACAG CTGCTGCCTC ACCAGATGGC TGACTTCCAC 14700
AGTGCCACTG CCCTCTATCC TGTGGGCTAC GAGGCCACAC GCATATACTG GAGCCTCCGT 14760
ACCAACAATC GTCGCTGCTG CTACCGCTGC TCTATTGGCG AGAACAACGG GCGGCCGGAG 14820
TTTGTAATCA AAGTCATCGA GCAGGGCCTG GAGGACCTGG TCTTCACCGA TGCCTCTCCC 14880
CAGGCTGTAT GGAATCGCAT CATTGAGCCT GTGGCTGCCA TGAGAAAAGA GGCTGACATG 14940
CTGCGGCTCT TCCCTGAGTA CCTGAAAGGC GAGGAGCTCT TCGGGTTGAC GGTGCATGCA 15000
GTGCTTCGCA TAGCTGAATC ACTGCCTGGG GTGGAGAGCT GTCAAAACTA TTTATTTCGC 15060
TATGGGCGCC ACCCTCTGAT GGAGCTGCCA CTCATGATCA ACCCCACTGG CTGTGCCCGA 15120
TCGGAGCCTA AAATCCTTAC ACACTACAAA CGNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15420
NNNNNNNNNN NNNNNNNNAA TCGAGGCATC TACATGTTCC GAATAAACAA TGAACATGTG 15480
ATTGATGCTA CGTTGACCGG AGGCCCTGCC AGGTACATTA ACCATTCCTG TGCCCCTAAC 15540
TGTGTGGCGG AAGTCGTGAC ATTTGACAAA GAGGACAAAA TCATCATCAT CTCCAGCCGG 15600
CGAATCCCCA AAGGAGAGGA GCTGACCTAT GACTATCAGT TTGATTTTGA GGACGATCAG 15660
CACAAGATCC CCTGCCACTG TGGAGCCTGG AATTGTCGGA AATGGATGAA CTAA 15715
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 91 0.0 1631
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 91 0.0 1629
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 91 0.0 1624
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 90 0.0 1623
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 91 0.0 1621
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 91 0.0 1620
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 91 0.0 1619
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 90 0.0 1617
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 91 0.0 1615
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 90 0.0 1615
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 90 0.0 1615
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 90 0.0 1614
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 90 0.0 1614
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 90 0.0 1613
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 90 0.0 1605
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 90 0.0 1604
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 90 0.0 1603
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 90 0.0 1597
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 88 0.0 1580
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 89 0.0 1575
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 89 0.0 1552
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 88 0.0 1518
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 87 0.0 1514
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 89 0.0 1496
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 86 0.0 1493
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 85 0.0 1457
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 82 0.0 1444
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 90 0.0 1378
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 88 0.0 1335
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 77 0.0 1293
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 87 0.0 1244
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 86 0.0 1221
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 86 0.0 1106
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 63 0.0 1058
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 82 0.0 968
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 52 0.0 920
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 76 0.0 808
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 100 0.0 795
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 100 0.0 782
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 46 0.0 753
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 90 0.0 742
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 55 0.0 724
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 84 0.0 715
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 55 0.0 714
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 50 0.0 697
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 55 0.0 694
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 48 0.0 679
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 53 0.0 676
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 43 0.0 660
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 82 0.0 658
WERAM-Ora-0001 ENSOANP00000000271.1 Ornithorhynchus anatinus 84 0.0 639
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 74 0.0 637
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 72 5e-175 615
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 71 6e-174 611
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 69 6e-164 578
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 72 8e-161 568
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 72 1e-160 567
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 72 3e-159 562
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 92 4e-158 559
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 65 2e-152 540
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 51 5e-105 382
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 47 3e-94 347
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 45 6e-94 345
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 73 9e-89 328
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 68 2e-86 321
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 6e-61 236
Created Date 25-Jun-2016