WERAM Information


Tag                  Content
WERAM ID             WERAM-Dio-0019
Ensembl Protein ID   ENSDORP00000002189.1
Gene Name            Kmt2c
Ensembl Information
  Ensembl Gene ID        Ensembl Transcript ID   Ensembl Protein ID
  ENSDORG00000002330.1   ENSDORT00000002331.1    ENSDORP00000002189.1
Status               Unreviewed
Classification
  Type        Family   E-value    Score   Start   End
  HMT         SET1     8.00e-41   138.6   4710    4819
  Me_Reader   PHD      8.40e-22   76.8    288     4446
Organism             Dipodomys ordii
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+++iek+++viEY+G++ir+eva+++ek ye++++gvy+fr+d+d +v+dat +g+ ar+inhsc+pNc+
ENSDORP00000002189.1 4710 NVYLARSRIQGLGLYAARDIEKHTMVIEYIGTIIRNEVANRKEKLYESQNRGVYMFRMDND--HVIDATLTGGPARYINHSCAPNCV 4794
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgee 113
a+vv++++ +ki+i ++r+I+kgee
ENSDORP00000002189.1 4795 AEVVTFERGHKIIISSNRRIQKGEE 4819
************************8 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51 
C vC+ +++ + C +C + +H+ C+++ ++l+ w Cp+Ck
ENSDORP00000002189.1 288 NCAVCDSPGDL-LDQFFCTTCGQHYHGMCLDIVVTPLKRA-GWQCPECK 334
5****544444.45999****************8888865.7******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C+ C++++e++k m+ Cd+Cd+ +H+ C+++ ++ +p + w C++C+
ENSDORP00000002189.1 335 VCQNCKQSGEDSK-MLVCDTCDKGYHTFCLQPVMKAVPTN-GWKCKNCR 381
8****88888876.************************77.7******8 PP
PHD.txt 2 tiClvCgkddegeke..mvqCdeCddwfHlkCvklplsslpeg..kswyCpsCk 51
++C++Cgk+ + e + ++ C+ C++w+Hl+C k + ++l ++ ++++C Ck
ENSDORP00000002189.1 410 NLCPLCGKCCHPELQkdLLHCNMCRRWVHLECDKQTDHELDSQlkEEYICLYCK 463
67999987776654455*******************77777666668******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
++C+vCg ++g++ ++ C++C + +H +Cv+++ +++ +k w+C +C+
ENSDORP00000002189.1 903 DMCVVCGSFGQGAEGrLLACSQCGQCYHPYCVSIKITKVVLSKGWRCLECT 953
68****7544443334*******************999996658******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
t+C +Cgk+ + + ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSDORP00000002189.1 953 TVCEACGKATDPGR-LLLCDDCDISYHTYCLDPPLQTVPKG-GWKCKWCV 1000
68999976666655.9*************************.9**99996 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.ks..wyCpsCk 51
+C+vC +++ +e+ ++qC +Cd+w+H+ C +l+ + e+ ++ + C C+
ENSDORP00000002189.1 1031 SCPVCCRNYREEDLILQCRQCDRWMHAVCQNLNTEDEVENvADigFDCSMCR 1082
7***9677777777*******************3333344322448899997 PP
PHD.txt 3 iClvCgkddegeke....mvqCdeCddwfHlkCv 32
+C +C+++++g + +++ d d w+Hl+C
ENSDORP00000002189.1 4341 CC-FCHEEGDGLTDgparLLNLDL-DLWVHLNCA 4372
45.587777774445555666666.559999997 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C++C+k+++ + C C + +H +C ++ ++k +Cp +k
ENSDORP00000002189.1 4400 KCVFCHKTGATS----GCHRfrCTNIYHFTCAIKAQCMFFKDKTMLCPMHK 4446
6****8888876....6**999*********98885666677778898887 PP
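
The Start and End values in the Classification table appear to be 1-based, inclusive residue positions: the SET1 alignment above runs from 4710 to 4819, which matches the table row. A minimal Python sketch for pulling a domain segment out of the protein sequence under that assumption follows. The function name `slice_domain` and its `first_position` parameter are hypothetical helpers introduced here for illustration; they are not part of WERAM or Ensembl.

```python
def slice_domain(sequence: str, start: int, end: int, first_position: int = 1) -> str:
    """Return residues start..end (1-based, inclusive) from a sequence.

    first_position is the coordinate of sequence[0]; it lets a partial
    sequence (for example, only the C-terminal lines) be sliced using
    full-length coordinates.
    """
    if start < first_position or end < start:
        raise ValueError("coordinates fall outside the supplied sequence")
    lo = start - first_position
    hi = end - first_position + 1
    if hi > len(sequence):
        raise ValueError("end coordinate is past the supplied sequence")
    return sequence[lo:hi]


# Residues 4681-4819 of ENSDORP00000002189.1, copied from the last three
# lines of the Protein Sequence block with spaces and counters removed.
c_terminal = (
    "TGELNAPYSKQFVHSKSSQYRRMKTEWKSNVYLARSRIQGLGLYAARDIEKHTMVIEYIG"
    "TIIRNEVANRKEKLYESQNRGVYMFRMDNDHVIDATLTGGPARYINHSCAPNCVAEVVTF"
    "ERGHKIIISSNRRIQKGEE"
)

# SET1 hit from the Classification table: Start 4710, End 4819.
set_domain = slice_domain(c_terminal, 4710, 4819, first_position=4681)
print(len(set_domain), set_domain[:10])
```

The extracted segment can be compared against the ungapped target line of the SET1 alignment as a sanity check on the coordinate convention.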

Protein Sequence
PRSRGKTTVE DEDSMDGLEA TETETIVETE VKEQSAEEDA EAEVENSKQP APALQRSVSE 60
ESANSLVSVG VEAKISEQLC AFCYCGEKSS LGQGDLKQFR VTPGLVLPWK NHPPTKDIDD 120
NSSGTCEKTQ NSAPRKQRGQ RKERSPQQNV VSYVSVSTQT ASDDQAGKLW DELSLVGLPD 180
AIDVQALFDP TGTCWAHHRC VEWSLGVCQM EEPSLVNVDK AVVSGSTEXX XXXXXXXXXX 240
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXKEDANCA VCDSPGDLLD 300
QFFCTTCGQH YHGMCLDIVV TPLKRAGWQC PECKVCQNCK QSGEDSKMLV CDTCDKGYHT 360
FCLQPVMKAV PTNGWKCKNC RICVECGTRS SSQWHHNCLV CDTCYQQQDN LCPLCGKCCH 420
PELQKDLLHC NMCRRWVHLE CDKQTDHELD SQLKEEYICL YCKHLGAEVD ALLPGDEVEM 480
AELNADSNTE MEVDRPEDQM AFLEPTIDKD INAKESIPGI VPDAAEVHTE QQQQRPCRPS 540
ESLGTDGLLI TESSLSKMNP DLENDVSPEI GGENTEMPSK VMTVCDEDQN GDKMEVTENM 600
EELTQEITVR QDDQHLLKEP TVVTAKEAAS PPTSTMESVL VPPEALASPS QESISLCSSD 660
QLLIDRVQAE MEQKENSKFP TGCMDCEMTP SIESCMKDGL CQGDGSINLP SEIESFSSVE 720
MSKTNTASSP TRSSDLPSHD RLQGYPSTLS SPAGNIMPTT YISVAPKIGM GKPAITKRKF 780
SPGRPRSXXG AWSTHNTVSP PSWSPDISEG REIFKPRQLP GSAIWSIKVG RGSGFPGKRR 840
PRGAGLSGRG GRGRSKLKSG LGAVVLPGVS AADISLNKDE EENSMHNTVV LFSSSDKFTL 900
HQDMCVVCGS FGQGAEGRLL ACSQCGQCYH PYCVSIKITK VVLSKGWRCL ECTVCEACGK 960
ATDPGRLLLC DDCDISYHTY CLDPPLQTVP KGGWKCKWCV WCRHCGATSA GLRCEWQNNY 1020
TQCAPCASLS SCPVCCRNYR EEDLILQCRQ CDRWMHAVCQ NLNTEDEVEN VADIGFDCSM 1080
CRPYMPASNV PSSDCCESSL VAQIVTKVKE LDPPKTYTQD GVCLTESGMT QLQSLTVTVP 1140
RRKRSKPKLK LKIINQNSVA VLQTPPDIQS EHSRDGEMDD SREGELMDCD GKSESSPERE 1200
AVDDETKGVE GADGVKKRKR KPYRPGIGGF MVRQRSRTGQ GKTKRSMIRK DSSGSISEQL 1260
PNRDDXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1320
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX TADDPLADIS 1380
EVLNTDDDIL GIISDDLAKS VDHSDIGPIA GDPSSLPQPS VTQSSRPLSE EQLDGILSPE 1440
LDKMVTDGAI LGKLYKIPEL GGKDVEDLFT AVLSPAASQP TPLPQPPPPP QLLSMHNQDV 1500
FSRMPLMNGL IGPGPHLPHN SLPPGSGLGT FPAVAQSPYA DARDKNPAYS AIGSDPNGSW 1560
APSVPAVECE NDTMSNAQRS TLKWEKEEAL AEMATVAPVL YTNINFPNLK EEFPDWTTRV 1620
KQIAKLWRKA SSQERAPYVQ KARDNRAALR INKVQMSNDS LKRQQQQDAI DPSVRMDADL 1680
FKDPLKQRES EHEQEWKFRQ QMRQKSKQQA KIEATQKLEQ VKNEQQQQQQ QQLGSQHLLV 1740
PSGSDTPSSG AQSPLTPQPG SGNMSPAQTF HKDLFSKQLH NTPAPTPSDD VFVKPQAPRA 1800
PVTPSRVPIQ ETLSQSPSQP PSPQMFSPGS SNPRPLSPMD PYAKMVGTPR PPPGGHSFPR 1860
KNSVTAVENC VPLSSVPRPI QMSETTTSRP SPARDLGSSS TPSSDPYAKP PDTPRPVMTD 1920
QFPKPLGLPR SPLISEQTAK GPLTAGTNDH FTKPPPRTDV FQRQRIPDPY VRPLLTPAPL 1980
DGGPGPFKAP LHPPPSSQDP YGSLSQASRR LSVDPYERPV LTPRPVDNFS HGQSNDPYSQ 2040
PPLTPHPAMS DSFTHSSRAF PQPGTVSRST SQDPYSQPPG TPRPVIDSYS QPSGTARSNP 2100
DPYSQPPGTP RPTTVDPYSQ QPPTPRPSPQ TDMFVAPTTN QRHSDPYAHP PGTPRPGISV 2160
SYSQPPATPR PRTSEGFTRS AGARPALVSN QEFLQAAXXX XXXXXXXXXX XXXXXXXXXX 2220
XXXXXXXXTF SRVSPSARDP YDQPPGTPRP QADSFGTSQV VHDVADQGGP GLEGSFSTSA 2280
NPAMGSQGQQ FPSAPQHPGP VPTSGGTDTQ NTVNMSQADT EKMRQRQKLR EIILQQQQQK 2340
KIASRQEKGP QDTTGMPHPV PLPHWQPESI NQAFTRPPPP YPGNVRSPVV PPLGPRYGQR 2400
GPYPPDVAGM GMRPHGFRFG FPGGSHGTMS SQDRFLVPPQ QMQGSGIPPH LRRSMSVEMS 2460
RPLSNSAMSN PAGLPQHFPP QGLPVQQHNI LGQAFIELRH RAPDGRPRLP FTASPGSMIE 2520
APHPRHGNFV PRPDFPGPRH PDAMRRPPQG LPPQLQAHPD LEPVPPSRQE QSHPVHPTSM 2580
VMRPLNHPLG SEFSEASMST SAPVETTPDN LHIASQSSDS LEEKLDSDDP SMKELDVKDL 2640
DSVEVKDLDD EDLENLNLDP EDGKGDELDT LGNLETNDPN LDDLLRSGEF DIIAYTDPEL 2700
DLGDKKSMFN EELDLNVPMD DKLDNPCVSL ETKKKEQEDK AVVLSDNHSP QKNSTATSEK 2760
IKRETGSPHC KEEARCDIEK GDESKDCVNT PCSQASAQPD LRDGEKTSLL PSGPDLLEKR 2820
TSGESAGSNT STVQGSSPLP ARDGTNTCDI MGSTPVLSRL LANEKTDNSD IRPLGSPPAL 2880
PVSPSSRVAS LPPALMPPPG PLLDNSMNSN VTMVSRANHA FSQGVQVNPG FIQAQSTVNH 2940
SVGTGKPTTQ NVPLTNPSST TGMSGPQQLM IPQTLTQQQN RERPLLLEEQ PLLLQDLLDQ 3000
ERQEQQQQRQ MQAMIRQRSE PFFPNIDFDA ITDPIMKAKM VALKGINKVM AQSNLGMPPM 3060
VMNRFPFMGP PVAGPQNTDG QSLVPQAVAQ DGSITHQISR PNPPNFGPGF VNDSQRKQYE 3120
EWLQETQQLL QMQQKYLEEQ IGAHRKSKKA LSAKQRTAKK AGREFPEEDA EQLKHVTEQQ 3180
SMVQKQLEQI RKQQKEHAEL IEDYRIKQQQ QQQQCALAPP ILMPGAPPPL VSGATPPTAS 3240
QPSFPMVPQQ LQRQQHTPVI SGHTSPARMP GLPGWQPAST PAHLPLNPPR IPPPITQLPI 3300
KTCTPAPGTV SNTNPQGGPP PRVEFDDNNP FSESFQERER KERLREQQER QRIQLMQEVD 3360
RQRALQQRME MEQHGLIGSE LGNRSSVSQM PFYPSDRPCD FMQPPRPLQQ SPQHQQQMGP 3420
ALQQSVQQGS VSSPPTQTFM QTSERRQVGP PSFVPDSPSI PGGSPNFHSA KQQGHGSVPG 3480
TSFQQSPLRS PFTPALPGTP PVANSSLPCG QDPATAHGQC YPGSTPSLIQ LYSDIIPEEK 3540
GKKKRARKKK KDDDAESTKA PSTPHSDITA PPTPGISERT STPSMSTPSE LPPQGEQEAP 3600
EPVGPSTPGT ATGQPCSQLE NKLPGSDFSQ GAPGHHTNEN SEVDKLSTET PATSEEIKLE 3660
KSETEPCLSQ EETKLEEQGD SKVEEDIAAD PGSSVHSPSH SAAAPAAKGD SGNELLKHLL 3720
KNKKSASLLS QRPEGTFCPE DSCPKENKLA EKQSPVEGLQ TLGAQMQSGF GCGNSQLPKS 3780
DGGNETKKQR SKRTQRTGER AAPRSKKRKK DEEEKHVMFS NSDSFTQLKQ QNNLSNPPTP 3840
PASLPPTPPL MACQKMANGF ATEELARKAG VLVSXXXXXX XXXXXXXXXX XXXXXXXXXX 3900
XXXXXXXXXX XXXXXXXXXX XXXXXXIQDH CGDRDTPDSF VPSSSPESVV GVEVSRYPDL 3960
SQVKEEPPEP VPSPIIPILP SITGKSSESR RNDIKTEPGT LFFTSPFGSS PNGPRSGLIS 4020
VAITLHPTAA ENISSVVAAF SDLLHVRIPN SYEVSNAPDV PSMGLVNSHR VNPGLEYRQQ 4080
LLLRGPPPGS ANPPRLASSY RLKPPNVPFP PTSNGLSGYK DSSHGVTESG VLRPQWCCHC 4140
KVVILGSGVR KSFKDLAFAN KDSRESIRRM EKDIVFCSNN CFILYSSTTQ AKNPESKEPI 4200
PSLPQSPLRE MPSKAFHQYS NNISTLDVHC LPKFQEKASP PPSPPIAFPP AFEAAQVEAK 4260
PDELKVTVKL KPRLRTVHGG LEDCRPLNKK WRGMKWKKWS IHIVIPKGTF KPPCEDEIDE 4320
FLRKLGTSLK PDPVPKDYRK CCFCHEEGDG LTDGPARLLN LDLDLWVHLN CALWSTEVYE 4380
TQAGALINVE LALRRGLQMK CVFCHKTGAT SGCHRFRCTN IYHFTCAIKA QCMFFKDKTM 4440
LCPMHKPKGI HEQELSYFAV FRRVYVQRDE VRQIASIVQR GERDHTFRVG SLIFHTIGQL 4500
LPQQMQAFHS PKALFPVGYE ASRLYWSTRY ANRRCRYLCS IEEKDGRPVV IRIVXXXXXX 4560
XXXXXXXXXX VWDKILEPVA CVRKKSEMLQ LFPAYLKGED LFGLTVSAVA RIAESLPGVE 4620
ACENYSFRYG RNPLMELPLA VNPTGCARSE PKMSAHVKRF VLRPHTLNST STSKSFQSTV 4680
TGELNAPYSK QFVHSKSSQY RRMKTEWKSN VYLARSRIQG LGLYAARDIE KHTMVIEYIG 4740
TIIRNEVANR KEKLYESQNR GVYMFRMDND HVIDATLTGG PARYINHSCA PNCVAEVVTF 4800
ERGHKIIISS NRRIQKGEE 4819
Nucleotide Sequence
CCTCGAAGTA GAGGAAAAAC TACAGTGGAA GATGAGGACA GCATGGATGG GCTGGAGGCG 60
ACAGAAACAG AAACTATCGT GGAAACAGAA GTCAAAGAAC AGTCTGCAGA AGAGGATGCT 120
GAAGCAGAGG TGGAAAACAG CAAGCAGCCA GCCCCAGCTC TCCAGCGATC TGTGTCTGAG 180
GAATCTGCTA ACTCCCTGGT CTCTGTTGGT GTGGAAGCTA AAATCAGTGA ACAGCTCTGC 240
GCTTTTTGTT ACTGTGGGGA GAAAAGTTCC TTAGGACAAG GGGACTTAAA ACAATTTAGA 300
GTCACGCCAG GACTTGTTTT ACCTTGGAAA AACCACCCTC CTACCAAGGA CATTGATGAC 360
AACAGCAGTG GGACCTGTGA GAAAACGCAA AATTCTGCTC CACGAAAGCA AAGAGGACAG 420
AGAAAAGAAC GATCTCCTCA GCAGAATGTA GTGTCTTACG TCAGTGTGAG TACCCAGACA 480
GCTTCAGATG ACCAGGCTGG TAAATTATGG GATGAGCTCA GTCTGGTTGG CCTTCCAGAT 540
GCCATTGATG TCCAAGCCTT ATTTGATCCT ACAGGCACTT GTTGGGCTCA TCACCGTTGT 600
GTGGAGTGGT CACTAGGAGT GTGCCAAATG GAAGAACCAT CATTAGTGAA TGTGGACAAA 660
GCTGTTGTCT CAGGGAGCAC AGAANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 780
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 840
NNNNNNNCAA AGGAAGATGC AAACTGTGCA GTGTGCGACA GCCCTGGAGA CCTCTTAGAT 900
CAGTTCTTTT GTACTACTTG TGGTCAGCAC TATCATGGAA TGTGCCTGGA CATAGTGGTC 960
ACTCCATTAA AACGTGCAGG TTGGCAGTGT CCTGAGTGCA AAGTGTGCCA GAACTGCAAA 1020
CAATCGGGAG AAGATAGCAA GATGCTAGTG TGTGACACAT GTGACAAAGG GTATCATACT 1080
TTTTGTCTTC AACCAGTTAT GAAAGCAGTA CCAACCAATG GCTGGAAATG CAAAAATTGC 1140
AGAATATGTG TAGAGTGTGG CACACGATCT AGTTCTCAGT GGCACCACAA TTGCTTGGTC 1200
TGTGACACTT GTTACCAACA GCAGGATAAC TTATGTCCTT TGTGTGGAAA GTGCTGCCAT 1260
CCAGAATTGC AGAAAGACTT GCTCCATTGT AATATGTGCA GGAGATGGGT TCACTTAGAA 1320
TGTGACAAAC AAACAGATCA TGAGCTGGAT TCTCAACTCA AAGAAGAGTA TATCTGCTTG 1380
TATTGTAAAC ACTTAGGAGC TGAGGTGGAT GCCTTACTGC CTGGTGATGA AGTAGAGATG 1440
GCCGAACTCA ATGCAGATTC TAACACTGAA ATGGAAGTTG ACAGACCTGA AGACCAAATG 1500
GCATTTTTGG AGCCAACAAT TGATAAAGAC ATCAATGCTA AGGAGTCCAT TCCTGGAATT 1560
GTTCCAGATG CAGCTGAAGT CCACACTGAG CAGCAGCAGC AGCGACCATG TCGTCCATCA 1620
GAAAGTCTTG GCACAGATGG TCTTCTCATT ACTGAATCAT CTCTCAGCAA AATGAATCCT 1680
GATTTGGAAA ATGATGTTTC CCCTGAGATT GGTGGTGAAA ACACGGAAAT GCCTTCTAAA 1740
GTGATGACAG TTTGTGATGA AGATCAGAAT GGAGACAAAA TGGAAGTGAC AGAAAATATG 1800
GAGGAGCTTA CACAGGAGAT CACTGTGCGG CAAGATGATC AGCATTTGTT AAAGGAACCT 1860
ACAGTGGTGA CAGCCAAAGA AGCAGCGAGC CCTCCCACAT CAACCATGGA ATCTGTCCTT 1920
GTTCCACCAG AAGCCTTAGC GTCCCCGAGT CAGGAGAGTA TTTCTTTATG TTCTAGTGAT 1980
CAGTTGCTTA TTGACAGAGT TCAAGCAGAA ATGGAACAGA AAGAAAATTC CAAATTTCCC 2040
ACCGGGTGTA TGGACTGTGA AATGACTCCT TCAATCGAGA GTTGCATGAA AGATGGCTTG 2100
TGCCAAGGGG ACGGATCTAT AAACTTACCT TCTGAGATTG AGTCATTTTC ATCAGTGGAG 2160
ATGAGCAAGA CAAACACTGC TTCCTCCCCA ACACGTTCTT CAGACTTGCC TTCACATGAT 2220
AGGCTGCAGG GTTACCCTTC GACGCTCAGC TCTCCTGCTG GAAATATCAT GCCAACAACT 2280
TACATCTCAG TGGCTCCAAA AATCGGCATG GGTAAACCAG CTATTACAAA AAGGAAATTC 2340
TCTCCTGGTA GACCCCGATC CAANNNNGGG GCCTGGAGTA CCCATAATAC AGTGAGCCCA 2400
CCTTCCTGGT CCCCAGACAT TTCAGAAGGT CGGGAAATTT TTAAACCCAG GCAGCTTCCT 2460
GGCAGTGCCA TTTGGAGCAT CAAAGTGGGT CGAGGGTCTG GATTCCCAGG GAAGCGGAGA 2520
CCTCGCGGAG CTGGGCTGTC GGGAAGAGGC GGCAGAGGGA GGTCCAAGCT AAAAAGTGGA 2580
CTTGGCGCTG TTGTATTACC TGGGGTGTCT GCTGCAGATA TTTCCTTGAA TAAGGATGAA 2640
GAAGAAAACT CTATGCACAA TACAGTGGTG TTGTTTTCTA GCAGTGACAA GTTCACTCTG 2700
CATCAGGATA TGTGTGTGGT TTGTGGCAGT TTTGGCCAAG GAGCAGAAGG GAGATTACTT 2760
GCTTGTTCTC AGTGTGGTCA GTGTTACCAT CCATACTGTG TCAGCATCAA GATCACTAAA 2820
GTGGTTCTTA GCAAAGGTTG GAGGTGTCTG GAGTGCACAG TGTGTGAGGC CTGTGGGAAG 2880
GCCACCGACC CCGGAAGACT CCTACTGTGC GATGACTGTG ACATAAGCTA CCACACCTAC 2940
TGCCTCGACC CTCCTCTGCA GACAGTTCCC AAAGGAGGCT GGAAGTGCAA ATGGTGTGTT 3000
TGGTGCAGGC ACTGTGGAGC GACGTCCGCG GGCCTGAGGT GCGAATGGCA GAACAACTAC 3060
ACCCAGTGTG CGCCGTGCGC CAGCCTGTCC TCCTGCCCGG TCTGCTGCCG AAACTACAGA 3120
GAAGAAGATC TCATTCTGCA GTGTAGGCAG TGTGATAGAT GGATGCATGC AGTTTGTCAA 3180
AACTTAAATA CTGAAGATGA AGTGGAAAAT GTAGCAGACA TTGGTTTTGA TTGTAGCATG 3240
TGCAGACCCT ATATGCCTGC GTCAAATGTG CCTTCCTCGG ACTGCTGTGA ATCTTCACTT 3300
GTAGCACAAA TTGTCACAAA AGTAAAAGAG CTAGATCCAC CTAAGACATA CACCCAGGAT 3360
GGTGTGTGCC TGACCGAGTC AGGGATGACT CAGTTGCAGA GCCTCACAGT AACGGTTCCG 3420
AGAAGAAAAC GATCAAAGCC AAAACTGAAA TTGAAGATTA TAAATCAGAA TAGTGTGGCT 3480
GTCCTTCAGA CCCCTCCAGA CATTCAGTCA GAGCACTCAA GAGATGGTGA AATGGATGAT 3540
AGTCGAGAAG GAGAACTTAT GGATTGTGAT GGGAAATCAG AATCTAGCCC TGAACGGGAA 3600
GCTGTGGATG ACGAAACTAA GGGAGTGGAA GGAGCTGATG GCGTGAAAAA GAGAAAGAGG 3660
AAACCATACA GACCAGGTAT TGGTGGGTTT ATGGTACGGC AAAGAAGTCG AACTGGGCAA 3720
GGAAAAACCA AAAGATCTAT GATCAGAAAA GATTCTTCAG GCTCTATATC TGAGCAGTTA 3780
CCTAACAGAG ATGATGNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNGT ACTGCTGATG ACCCATTAGC TGATATTTCT 4140
GAAGTCTTAA ACACAGATGA TGACATTCTT GGAATAATCT CAGATGACCT TGCAAAATCA 4200
GTTGATCATT CAGATATTGG TCCTATTGCT GGTGACCCTT CCTCCTTACC TCAGCCAAGT 4260
GTCACTCAGA GTTCACGACC TTTAAGTGAA GAGCAGCTAG ATGGGATCCT CAGTCCTGAG 4320
CTAGACAAAA TGGTCACAGA TGGAGCAATT CTTGGAAAGT TATATAAAAT TCCAGAGCTT 4380
GGAGGAAAGG ACGTTGAAGA CTTGTTTACT GCAGTGCTTA GTCCTGCGGC CAGTCAGCCG 4440
ACTCCACTGC CACAGCCCCC GCCTCCACCA CAGCTGTTGT CAATGCACAA TCAGGATGTT 4500
TTTTCACGGA TGCCACTCAT GAATGGCCTT ATTGGACCCG GTCCTCATCT CCCACATAAC 4560
TCTCTGCCTC CGGGCAGCGG ATTGGGGACT TTCCCAGCTG TGGCCCAGTC CCCTTACGCT 4620
GATGCCAGGG ATAAAAACCC AGCCTACAGT GCAATCGGAA GTGATCCTAA TGGCTCATGG 4680
GCTCCATCAG TCCCTGCTGT GGAATGCGAA AATGATACCA TGTCAAATGC CCAGAGGAGC 4740
ACGCTAAAGT GGGAGAAGGA GGAGGCCCTG GCTGAAATGG CAACCGTAGC TCCAGTTCTC 4800
TACACAAATA TTAATTTCCC CAATTTAAAG GAAGAATTCC CTGACTGGAC AACTCGAGTA 4860
AAGCAAATTG CTAAGTTGTG GAGAAAAGCA AGCTCTCAGG AAAGAGCACC ATATGTGCAA 4920
AAAGCCAGAG ATAACAGAGC TGCCTTACGT ATTAATAAAG TGCAGATGTC CAATGACTCC 4980
TTGAAAAGGC AGCAGCAGCA GGATGCCATC GACCCCAGTG TGCGCATGGA TGCAGACCTT 5040
TTTAAAGATC CATTAAAGCA AAGAGAATCA GAACATGAAC AGGAATGGAA ATTCAGACAG 5100
CAAATGCGTC AGAAAAGTAA GCAGCAAGCT AAGATCGAAG CCACACAAAA ACTGGAGCAG 5160
GTAAAGAATG AGCAGCAGCA GCAGCAACAG CAGCAGCTTG GTTCTCAGCA CCTTCTGGTT 5220
CCATCTGGTT CAGATACTCC AAGTAGTGGA GCACAGAGTC CTCTGACACC TCAGCCAGGC 5280
AGTGGAAATA TGTCTCCCGC ACAGACATTC CATAAAGACC TGTTTTCAAA GCAGCTACAC 5340
AACACCCCTG CTCCCACACC TTCAGATGAT GTGTTTGTTA AGCCACAAGC TCCACGGGCT 5400
CCTGTTACCC CCTCCCGGGT GCCCATTCAG GAGACTCTTT CTCAGTCTCC TTCTCAGCCA 5460
CCTTCTCCAC AAATGTTTTC ACCTGGATCT TCTAACCCCC GACCATTGTC TCCAATGGAT 5520
CCTTATGCAA AAATGGTTGG TACCCCTAGA CCACCTCCTG GGGGTCACAG TTTTCCCAGA 5580
AAAAACTCTG TCACAGCAGT GGAAAACTGT GTGCCTTTAT CATCTGTACC CAGGCCCATT 5640
CAGATGAGTG AGACAACAAC CAGTAGGCCA TCCCCGGCCA GGGACTTGGG TTCTTCCTCC 5700
ACACCAAGTA GCGACCCCTA TGCAAAGCCT CCGGATACGC CCAGACCCGT GATGACAGAT 5760
CAGTTTCCCA AACCTTTGGG CCTACCTAGG TCTCCTTTAA TTTCAGAGCA AACTGCAAAG 5820
GGTCCTCTAA CGGCCGGAAC CAATGATCAC TTTACTAAGC CACCTCCTCG GACAGATGTG 5880
TTTCAGAGAC AACGGATACC TGACCCATAT GTAAGGCCTT TATTGACTCC TGCACCACTC 5940
GATGGTGGTC CTGGGCCTTT TAAAGCTCCC CTACACCCTC CCCCCTCTTC TCAGGATCCC 6000
TACGGATCGC TGTCACAGGC ATCAAGACGA CTGTCAGTTG ACCCTTACGA AAGGCCCGTC 6060
TTGACACCAA GACCCGTGGA TAATTTTTCT CATGGTCAGT CAAATGATCC ATATAGTCAG 6120
CCTCCCCTCA CCCCACATCC AGCAATGAGT GACTCCTTTA CCCATTCTTC AAGGGCTTTT 6180
CCCCAGCCTG GAACTGTATC AAGATCAACA TCTCAGGACC CATACTCCCA GCCCCCAGGA 6240
ACTCCAAGAC CTGTCATAGA TTCTTATTCC CAGCCCTCAG GAACGGCTCG GTCCAATCCA 6300
GACCCCTATT CTCAACCTCC TGGAACTCCC CGGCCTACCA CTGTTGATCC ATACAGTCAG 6360
CAGCCCCCAA CGCCCAGACC ATCTCCACAG ACGGACATGT TTGTTGCACC CACCACGAAC 6420
CAGAGGCACT CTGATCCGTA CGCTCATCCT CCTGGGACAC CAAGACCTGG AATTTCCGTT 6480
TCCTACTCTC AGCCACCAGC AACACCACGG CCACGGACTT CAGAGGGGTT TACAAGGTCT 6540
GCAGGTGCAA GGCCAGCCCT TGTGTCAAAT CAGGAGTTCC TGCAAGCAGC ACANNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNNN NNNNNNNNNN NNNNACATTC AGCCGGGTGT CCCCGTCTGC TCGTGATCCG 6720
TATGACCAGC CTCCCGGGAC TCCTCGGCCT CAGGCTGACT CTTTCGGAAC CAGTCAAGTT 6780
GTTCACGACG TTGCCGACCA GGGAGGACCT GGCTTGGAGG GGAGCTTCAG CACGTCTGCA 6840
AACCCTGCCA TGGGTTCCCA AGGGCAGCAG TTCCCCAGCG CCCCCCAGCA TCCTGGACCT 6900
GTGCCAACCT CAGGAGGAAC TGACACACAG AACACTGTAA ATATGTCTCA AGCAGACACG 6960
GAGAAAATGA GACAGCGGCA GAAGCTACGT GAAATCATCC TCCAGCAGCA GCAGCAGAAG 7020
AAGATTGCTA GTCGCCAGGA AAAAGGGCCT CAAGATACAA CAGGAATGCC CCATCCAGTG 7080
CCCCTTCCAC ATTGGCAGCC CGAGAGTATC AACCAGGCTT TCACTCGACC TCCACCTCCC 7140
TACCCTGGGA ATGTGCGTTC TCCAGTTGTT CCCCCGCTAG GACCTAGATA TGGTCAGCGT 7200
GGACCCTATC CTCCTGATGT TGCTGGTATG GGGATGAGAC CTCATGGGTT CAGATTTGGA 7260
TTTCCTGGAG GTAGTCATGG AACCATGTCC AGTCAGGATC GTTTTCTTGT GCCTCCTCAG 7320
CAAATGCAAG GATCTGGAAT TCCCCCACAC CTAAGAAGAT CAATGTCCGT GGAAATGTCG 7380
AGACCTTTAA GTAACTCAGC AATGAGTAAT CCAGCTGGGC TTCCCCAGCA TTTCCCTCCC 7440
CAGGGCCTGC CGGTTCAGCA GCACAACATA CTAGGTCAAG CGTTCATTGA GCTGAGGCAT 7500
AGGGCCCCTG ATGGGAGGCC TCGGCTGCCC TTTACTGCTT CTCCCGGAAG CATGATAGAG 7560
GCACCTCATC CACGACATGG AAACTTCGTT CCACGGCCAG ACTTCCCAGG TCCCAGACAT 7620
CCAGATGCCA TGAGAAGACC TCCCCAAGGC TTACCACCTC AGCTGCAGGC ACATCCTGAT 7680
TTGGAACCAG TGCCGCCATC TCGACAAGAA CAAAGTCATC CTGTTCATCC CACTTCTATG 7740
GTTATGAGGC CTCTGAATCA TCCCTTGGGC AGTGAGTTTT CTGAAGCTTC CATGTCCACA 7800
TCTGCCCCAG TTGAAACAAC ACCTGATAAT TTGCACATAG CCAGCCAGTC TTCTGATAGT 7860
CTAGAAGAAA AACTAGACTC TGATGACCCT TCTATGAAAG AATTAGATGT TAAAGACCTT 7920
GACAGTGTTG AAGTCAAAGA CTTAGATGAT GAAGATCTTG AAAATTTAAA TTTGGACCCA 7980
GAAGACGGAA AAGGAGATGA ATTGGATACT TTAGGTAATT TGGAAACAAA TGATCCCAAC 8040
CTTGATGACC TCTTAAGGTC AGGAGAGTTT GATATCATTG CATACACAGA CCCAGAACTT 8100
GACTTGGGGG ATAAGAAGAG CATGTTTAAT GAGGAGCTAG ACCTTAATGT TCCGATGGAT 8160
GATAAGCTAG ATAATCCATG TGTATCTCTT GAAACAAAAA AAAAGGAGCA AGAAGACAAA 8220
GCTGTAGTTC TCTCTGATAA CCATTCACCA CAGAAAAATT CCACTGCCAC CAGTGAGAAG 8280
ATAAAGAGAG AAACTGGGTC TCCACATTGT AAAGAAGAAG CCAGATGTGA CATTGAGAAA 8340
GGGGATGAGA GTAAAGATTG TGTGAACACT CCATGCTCAC AGGCTTCTGC TCAGCCAGAC 8400
CTGCGTGATG GAGAAAAGAC TTCTTTGCTG CCTTCTGGTC CAGATTTGCT TGAGAAGAGA 8460
ACCAGTGGAG AAAGTGCTGG CTCCAACACC AGTACTGTGC AAGGCTCCTC ACCACTGCCC 8520
GCTCGGGATG GGACTAACAC CTGTGATATC ATGGGATCCA CTCCTGTCCT TTCACGTTTA 8580
CTTGCTAATG AAAAGACTGA CAATTCAGAC ATTAGGCCTC TGGGGTCTCC ACCAGCTTTG 8640
CCGGTTTCAC CATCTAGCCG TGTGGCAAGT TTACCTCCTG CCTTAATGCC ACCACCTGGC 8700
CCTCTCTTGG ATAATTCCAT GAATTCTAAT GTAACAATGG TCTCTAGGGC AAACCATGCT 8760
TTTTCTCAGG GTGTGCAAGT AAATCCAGGA TTCATTCAGG CTCAGTCAAC TGTTAACCAC 8820
AGTGTGGGGA CAGGAAAGCC TACAACTCAA AATGTGCCTC TCACAAATCC GTCCAGTACC 8880
ACTGGCATGT CTGGACCTCA ACAGCTAATG ATTCCTCAGA CATTAACCCA ACAGCAGAAT 8940
AGAGAGAGAC CTCTCCTTCT AGAGGAACAG CCTCTGCTTC TGCAGGATCT TTTGGATCAA 9000
GAGAGGCAAG AACAACAGCA ACAAAGACAA ATGCAAGCCA TGATTCGTCA GCGGTCAGAA 9060
CCATTCTTCC CTAACATTGA TTTTGATGCA ATTACAGATC CTATCATGAA AGCCAAAATG 9120
GTGGCCCTGA AAGGCATAAA TAAAGTGATG GCACAGAGCA ATCTGGGCAT GCCGCCCATG 9180
GTGATGAACA GATTCCCTTT CATGGGCCCT CCGGTGGCTG GCCCACAGAA CACTGACGGA 9240
CAGAGTCTGG TACCCCAGGC TGTTGCTCAG GATGGCAGTA TAACACATCA GATTTCTAGG 9300
CCTAATCCTC CAAATTTTGG TCCAGGCTTT GTCAATGACT CTCAGCGTAA GCAGTATGAA 9360
GAATGGCTTC AGGAGACCCA ACAGCTCCTT CAAATGCAGC AGAAGTACCT TGAAGAACAA 9420
ATTGGTGCAC ACAGAAAGTC TAAGAAAGCC CTTTCAGCAA AACAGCGCAC TGCCAAGAAG 9480
GCAGGGAGAG AATTTCCAGA AGAAGATGCA GAACAGCTCA AGCATGTCAC AGAGCAGCAG 9540
AGTATGGTTC AGAAACAGCT GGAGCAGATT CGTAAACAAC AGAAAGAACA CGCTGAGTTG 9600
ATTGAAGATT ATCGCATCAA GCAGCAGCAG CAGCAGCAGC AGTGTGCACT GGCCCCACCC 9660
ATCCTCATGC CTGGGGCCCC GCCCCCCCTC GTCTCTGGTG CCACTCCGCC CACCGCCAGC 9720
CAGCCCAGCT TTCCCATGGT GCCACAGCAG CTTCAGCGCC AGCAACACAC ACCGGTCATC 9780
TCTGGCCACA CCAGTCCTGC TAGAATGCCT GGTTTACCTG GTTGGCAGCC TGCCAGCACT 9840
CCTGCTCACC TCCCTCTCAA TCCTCCTAGG ATTCCACCCC CTATCACACA ATTACCAATA 9900
AAAACTTGTA CCCCAGCCCC AGGTACAGTG TCAAACACAA ATCCCCAGGG CGGACCACCG 9960
CCACGAGTAG AGTTTGATGA CAACAACCCC TTCAGCGAAA GTTTCCAGGA ACGGGAGAGG 10020
AAGGAACGCT TACGAGAACA GCAAGAAAGA CAGAGAATCC AGCTAATGCA AGAAGTAGAC 10080
CGGCAGAGAG CTCTGCAGCA GAGGATGGAA ATGGAGCAGC ACGGCCTGAT AGGCTCTGAG 10140
CTAGGAAACA GGTCCTCTGT GTCCCAGATG CCATTCTACC CTTCTGACCG ACCTTGTGAT 10200
TTTATGCAAC CCCCAAGACC CCTTCAGCAG TCTCCGCAAC ACCAGCAGCA GATGGGACCA 10260
GCTCTACAGC AGAGTGTCCA GCAGGGCTCT GTTAGTTCAC CCCCCACCCA AACTTTCATG 10320
CAGACCAGTG AGCGGAGGCA GGTAGGACCT CCATCATTTG TGCCTGACTC ACCATCCATT 10380
CCTGGTGGAA GCCCAAACTT CCATTCTGCT AAACAGCAGG GGCATGGAAG TGTTCCTGGG 10440
ACCAGCTTCC AGCAGTCGCC TTTGAGGTCT CCATTTACAC CTGCTTTGCC AGGAACACCT 10500
CCAGTAGCCA ATAGCAGCCT CCCATGTGGC CAGGACCCTG CTACAGCCCA TGGACAGTGT 10560
TACCCAGGAT CAACCCCGTC TCTCATTCAG CTGTACTCTG ATATAATCCC AGAAGAAAAA 10620
GGGAAAAAGA AAAGAGCAAG AAAAAAGAAA AAAGATGATG ATGCAGAATC CACCAAAGCC 10680
CCGTCCACTC CCCACTCAGA TATAACTGCT CCGCCGACCC CAGGCATCTC AGAGCGTACC 10740
TCCACTCCCT CCATGAGCAC ACCCAGTGAG CTCCCTCCAC AGGGAGAACA AGAGGCACCC 10800
GAGCCGGTCG GCCCATCGAC TCCTGGTACA GCAACAGGCC AGCCATGTTC CCAATTAGAA 10860
AACAAACTTC CTGGGAGTGA CTTCTCCCAG GGAGCTCCAG GCCACCACAC AAATGAAAAT 10920
TCAGAGGTGG ATAAACTCTC CACAGAAACT CCTGCCACAA GTGAAGAAAT AAAACTAGAG 10980
AAATCTGAGA CAGAGCCATG CCTGAGTCAA GAGGAGACCA AACTGGAGGA ACAAGGTGAC 11040
AGTAAGGTGG AGGAGGACAT TGCCGCTGAT CCTGGCTCCT CGGTCCACAG TCCTTCCCAT 11100
TCTGCTGCAG CCCCTGCAGC CAAAGGAGAC TCGGGGAATG AACTGCTGAA GCATTTGTTG 11160
AAAAACAAAA AGTCCGCTTC CCTTTTAAGT CAAAGACCTG AAGGCACTTT CTGCCCAGAA 11220
GACAGCTGCC CAAAGGAGAA TAAGCTGGCT GAGAAGCAGA GCCCAGTAGA AGGACTGCAA 11280
ACTTTGGGGG CTCAAATGCA AAGTGGTTTT GGATGTGGCA ACAGCCAGTT GCCAAAATCA 11340
GATGGAGGAA ATGAAACCAA GAAACAGCGA AGCAAACGGA CTCAGAGGAC TGGGGAGAGA 11400
GCAGCACCTC GCTCAAAGAA AAGGAAAAAG GACGAAGAGG AGAAACACGT GATGTTCTCC 11460
AACTCTGACT CCTTCACCCA ACTGAAACAG CAGAATAACT TAAGTAATCC TCCAACACCC 11520
CCTGCCTCTC TTCCTCCTAC ACCACCTCTT ATGGCTTGTC AGAAGATGGC AAATGGTTTT 11580
GCGACTGAAG AACTTGCAAG AAAAGCTGGA GTTCTGGTGA GCNNNNNNNN NNNNNNNNNN 11640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11700
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11760
NNNNNNNNNN NNNNNNNGAT ACAGGATCAC TGTGGTGATC GTGACACTCC AGACAGCTTT 11820
GTGCCCTCAT CCTCTCCGGA GAGTGTGGTT GGGGTGGAAG TGAGCAGGTA TCCAGATCTG 11880
TCTCAGGTCA AAGAGGAGCC TCCAGAGCCA GTGCCATCCC CCATCATCCC CATCCTCCCT 11940
AGCATCACTG GGAAAAGTTC AGAATCCAGA AGGAATGACA TCAAAACTGA GCCAGGCACT 12000
TTATTTTTTA CATCACCTTT TGGTTCATCC CCAAATGGTC CCAGATCAGG TCTTATATCT 12060
GTAGCAATTA CCCTGCATCC CACAGCTGCT GAGAACATTA GCAGTGTTGT GGCTGCGTTT 12120
TCTGACCTTC TCCACGTCCG AATTCCTAAC AGCTATGAGG TTAGTAATGC TCCAGATGTT 12180
CCCTCCATGG GTTTGGTCAA TAGCCACAGA GTAAACCCAG GTTTGGAGTA TCGACAGCAG 12240
TTACTTCTTC GTGGGCCTCC ACCAGGATCT GCAAATCCTC CCAGATTAGC CAGCTCTTAC 12300
CGGCTGAAGC CGCCTAATGT ACCATTTCCT CCAACAAGCA ATGGTCTCTC TGGATATAAG 12360
GATTCTAGCC ACGGCGTCAC TGAAAGCGGA GTGCTCCGGC CTCAGTGGTG CTGTCACTGC 12420
AAGGTGGTTA TTCTTGGAAG TGGGGTACGG AAGTCTTTCA AGGATCTAGC CTTTGCAAAC 12480
AAGGATTCCC GGGAGAGCAT TAGGAGAATG GAGAAGGACA TTGTCTTTTG TAGTAATAAC 12540
TGCTTTATTC TTTATTCATC AACCACACAA GCGAAAAACC CAGAGAGTAA GGAGCCCATC 12600
CCTTCGCTGC CACAGTCGCC TCTGAGAGAG ATGCCCTCCA AGGCATTCCA CCAGTACAGC 12660
AACAACATCT CCACTTTGGA CGTGCACTGT CTCCCCAAGT TCCAGGAGAA AGCATCCCCC 12720
CCTCCCTCGC CTCCCATTGC ATTCCCTCCT GCCTTTGAGG CAGCCCAAGT GGAGGCGAAG 12780
CCTGATGAGC TCAAGGTAAC GGTCAAGCTG AAGCCTCGGC TGAGGACTGT CCACGGTGGG 12840
CTTGAAGACT GTCGGCCACT AAATAAGAAG TGGAGAGGAA TGAAGTGGAA GAAATGGAGC 12900
ATTCATATTG TAATCCCTAA GGGGACATTC AAACCGCCGT GTGAGGATGA AATAGATGAG 12960
TTTCTAAGGA AACTGGGCAC TTCTCTTAAA CCTGACCCTG TGCCTAAAGA CTACCGAAAG 13020
TGTTGCTTTT GTCACGAAGA AGGTGATGGG TTGACAGATG GACCAGCTAG GCTGCTCAAC 13080
CTCGACCTGG ACCTGTGGGT CCACTTGAAT TGTGCTCTGT GGTCCACCGA GGTCTATGAG 13140
ACTCAGGCTG GTGCCTTAAT AAATGTAGAG CTAGCCCTGA GGAGAGGCCT GCAGATGAAA 13200
TGTGTCTTCT GTCATAAGAC GGGTGCCACC AGTGGATGTC ACAGATTCCG GTGCACCAAC 13260
ATTTATCACT TTACTTGCGC CATTAAAGCA CAATGCATGT TTTTTAAGGA CAAAACTATG 13320
CTTTGCCCCA TGCACAAACC AAAGGGAATC CATGAGCAAG AGCTAAGTTA CTTTGCCGTC 13380
TTCAGAAGGG TCTATGTGCA ACGGGATGAG GTACGGCAGA TCGCCAGCAT TGTGCAGCGA 13440
GGAGAACGGG ACCATACCTT CCGCGTGGGA AGCCTTATCT TCCACACGAT TGGCCAGCTG 13500
CTTCCACAGC AAATGCAGGC ATTCCACTCT CCTAAAGCAC TCTTTCCTGT GGGCTATGAA 13560
GCCAGTCGGT TGTACTGGAG CACTCGCTAT GCCAACAGAC GCTGCCGTTA CCTGTGTTCC 13620
ATTGAGGAGA AAGATGGACG CCCAGTGGTC ATCAGGATCG TGNNNNNNNN NNNNNNNNNN 13680
NNNNNNNNNN NNNNNNNNNN NNNNNNNNGT GTTTGGGATA AGATTTTAGA GCCTGTGGCA 13740
TGTGTGAGGA AGAAATCTGA AATGCTCCAG CTCTTCCCCG CGTATTTAAA AGGAGAAGAC 13800
CTGTTCGGCC TGACTGTCTC GGCGGTGGCG CGCATAGCTG AATCACTGCC CGGGGTTGAG 13860
GCGTGTGAGA ACTACAGCTT CCGGTACGGC CGGAATCCTC TCATGGAGCT TCCCCTCGCC 13920
GTGAACCCCA CAGGCTGTGC CCGCTCGGAA CCTAAAATGA GTGCCCATGT CAAGAGGTTT 13980
GTGTTAAGGC CTCACACCCT CAACAGCACC AGCACCTCCA AGTCCTTCCA GAGCACAGTG 14040
ACGGGCGAGC TGAACGCGCC CTACAGCAAG CAGTTCGTCC ACTCCAAGTC ATCTCAGTAC 14100
CGAAGAATGA AGACCGAGTG GAAATCCAAT GTGTATCTCG CCCGGTCTCG GATCCAGGGG 14160
CTGGGCCTGT ACGCCGCTAG AGACATCGAG AAGCACACCA TGGTGATCGA GTACATCGGA 14220
ACGATCATTC GCAATGAAGT GGCCAACAGG AAGGAGAAAC TCTATGAGTC CCAGAACCGC 14280
GGAGTGTACA TGTTCCGCAT GGACAACGAC CACGTGATCG ACGCCACACT CACAGGAGGG 14340
CCTGCAAGGT ATATCAACCA TTCCTGTGCC CCGAACTGTG TGGCTGAAGT GGTCACTTTT 14400
GAGAGAGGAC ACAAGATCAT CATCAGCTCC AACCGGAGAA TCCAGAAAGG AGAAGAG 14457
Sequence Source Ensembl
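
The protein and nucleotide sequences above are printed in groups of ten characters with a running position counter at the end of each line, so they need cleaning before most tools will accept them. The sketch below shows one way to do that in Python. The function name `clean_sequence_block` is a hypothetical helper, and the sketch assumes that every counter is a trailing run of digits separated from the sequence by whitespace.

```python
import re


def clean_sequence_block(block: str) -> str:
    """Join a numbered, space-grouped sequence block into one string."""
    pieces = []
    for line in block.splitlines():
        # Drop the trailing running position counter, if present.
        line = re.sub(r"\s+\d+\s*$", "", line.strip())
        # Remove the spaces that separate the groups of ten.
        pieces.append(line.replace(" ", ""))
    return "".join(pieces)


# Last two lines of the protein block above, used as a small sample.
sample = """TIIRNEVANR KEKLYESQNR GVYMFRMDND HVIDATLTGG PARYINHSCA PNCVAEVVTF 4800
ERGHKIIISS NRRIQKGEE 4819"""

tail = clean_sequence_block(sample)
print(len(tail), tail[-9:])
```

Applied to a whole block, the length of the cleaned string should equal the counter on the block's final line, which gives a quick consistency check on the copy.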
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 81 0.0 4656
WERAM-Ict-0124 ENSSTOP00000012342.2 Ictidomys tridecemlineatus 81 0.0 4611
WERAM-Hos-0018 ENSP00000347325.3 Homo sapiens 81 0.0 4593
WERAM-Pat-0168 ENSPTRP00000046674.3 Pan troglodytes 81 0.0 4590
WERAM-Gog-0210 ENSGGOP00000027941.1 Gorilla gorilla 81 0.0 4589
WERAM-Paa-0005 ENSPANP00000006150.1 Papio anubis 80 0.0 4491
WERAM-Mum-0152 ENSMUSP00000043874.7 Mus musculus 80 0.0 4487
WERAM-Aim-0154 ENSAMEP00000014067.1 Ailuropoda melanoleuca 80 0.0 4479
WERAM-Caf-0059 ENSCAFP00000007370.4 Canis familiaris 79 0.0 4474
WERAM-Cap-0018 ENSCPOP00000001682.2 Cavia porcellus 79 0.0 4408
WERAM-Mup-0098 ENSMPUP00000009152.1 Mustela putorius furo 79 0.0 4364
WERAM-Fec-0096 ENSFCAP00000008002.3 Felis catus 78 0.0 4307
WERAM-Bot-0193 ENSBTAP00000028347.5 Bos taurus 76 0.0 4244
WERAM-Myl-0122 ENSMLUP00000010086.2 Myotis lucifugus 76 0.0 4168
WERAM-Poa-0169 ENSPPYP00000020408.2 Pongo abelii 81 0.0 4160
WERAM-Tut-0198 ENSTTRP00000016174.1 Tursiops truncatus 76 0.0 4158
WERAM-Ova-0045 ENSOARP00000005594.1 Ovis aries 74 0.0 4138
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 68 0.0 3686
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 69 0.0 3670
WERAM-Tag-0008 ENSTGUP00000000641.1 Taeniopygia guttata 69 0.0 3615
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 68 0.0 3544
WERAM-Anc-0148 ENSACAP00000014142.2 Anolis carolinensis 63 0.0 3208
WERAM-Nol-0046 ENSNLEP00000005663.2 Nomascus leucogenys 79 0.0 2966
WERAM-Chs-0076 ENSCSAP00000002864.1 Chlorocebus sabaeus 79 0.0 2949
WERAM-Caj-0209 ENSCJAP00000036628.3 Callithrix jacchus 78 0.0 2925
WERAM-Orc-0115 ENSOCUP00000009766.3 Oryctolagus cuniculus 77 0.0 2917
WERAM-Otg-0078 ENSOGAP00000005885.2 Otolemur garnettii 76 0.0 2869
WERAM-Eqc-0189 ENSECAP00000020200.1 Equus caballus 80 0.0 2686
WERAM-Mim-0010 ENSMICP00000000977.1 Microcebus murinus 76 0.0 2428
WERAM-Mam-0056 ENSMMUP00000009467.2 Macaca mulatta 74 0.0 2359
WERAM-Sah-0130 ENSSHAP00000013860.1 Sarcophilus harrisii 67 0.0 2354
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 67 0.0 2290
WERAM-Loa-0207 ENSLAFP00000021432.1 Loxodonta africana 79 0.0 2100
WERAM-Sus-0158 ENSSSCP00000023447.1 Sus scrofa 77 0.0 1959
WERAM-Ptv-0086 ENSPVAP00000007862.1 Pteropus vampyrus 71 0.0 1944
WERAM-Lac-0079 ENSLACP00000010253.1 Latimeria chalumnae 60 0.0 1872
WERAM-Mod-0039 ENSMODP00000005827.3 Monodelphis domestica 66 0.0 1830
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 74 0.0 1828
WERAM-Fia-0086 ENSFALP00000007141.1 Ficedula albicollis 70 0.0 1788
WERAM-Vip-0013 ENSVPAP00000001498.1 Vicugna pacos 74 0.0 1578
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 80 0.0 1500
WERAM-Prc-0090 ENSPCAP00000008256.1 Procavia capensis 70 0.0 1404
WERAM-Xet-0065 ENSXETP00000021458.2 Xenopus tropicalis 76 0.0 1293
WERAM-Tar-0071 ENSTRUP00000014027.1 Takifugu rubripes 48 0.0 1293
WERAM-Leo-0127 ENSLOCP00000015481.1 Lepisosteus oculatus 74 0.0 1266
WERAM-Asm-0011 ENSAMXP00000001840.1 Astyanax mexicanus 50 0.0 1263
WERAM-Mae-0021 ENSMEUP00000001693.1 Macropus eugenii 63 0.0 1113
WERAM-Ocp-0124 ENSOPRP00000012666.1 Ochotona princeps 74 0.0 1111
WERAM-Gaa-0138 ENSGACP00000017696.1 Gasterosteus aculeatus 50 0.0 1060
WERAM-Dar-0184 ENSDARP00000115827.2 Danio rerio 59 0.0 1055
WERAM-Ten-0186 ENSTNIP00000018287.1 Tetraodon nigroviridis 51 0.0 1045
WERAM-Orla-0181 ENSORLP00000020984.1 Oryzias latipes 47 0.0 1041
WERAM-Xim-0205 ENSXMAP00000016470.1 Xiphophorus maculatus 59 0.0 972
WERAM-Pof-0064 ENSPFOP00000005925.2 Poecilia formosa 58 0.0 967
WERAM-Ran-0259 ENSRNOP00000072878.1 Rattus norvegicus 66 0.0 966
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 70 0.0 813
WERAM-Orn-0120 ENSONIP00000012272.1 Oreochromis niloticus 64 0.0 801
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 788
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 70 0.0 732
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 72 0.0 660
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 92 2e-173 610
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 54 7e-170 598
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 51 2e-157 556
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 49 5e-144 511
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 55 8e-110 398
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 33 9e-83 308
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 7e-37 155
Created Date 25-Jun-2016
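
Rows of the Orthology table are whitespace-separated, but species names contain spaces and vary in length (for example "Mustela putorius furo"), so a plain split on whitespace does not yield fixed columns. A minimal Python sketch that handles this takes the first two and last three fields as fixed and treats everything in between as the species name. The names `OrthologRow` and `parse_ortholog_row` are hypothetical, introduced only for this illustration.

```python
from typing import NamedTuple


class OrthologRow(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    e_value: float
    score: int


def parse_ortholog_row(line: str) -> OrthologRow:
    """Parse one Orthology row; the species name may span several words."""
    fields = line.split()
    if len(fields) < 6:
        raise ValueError(f"too few fields: {line!r}")
    # Fixed columns sit at both ends; the species name fills the middle.
    weram_id, protein_id = fields[0], fields[1]
    identity, e_value, score = fields[-3], fields[-2], fields[-1]
    species = " ".join(fields[2:-3])
    return OrthologRow(weram_id, protein_id, species,
                       int(identity), float(e_value), int(score))


row = parse_ortholog_row(
    "WERAM-Mup-0098 ENSMPUP00000009152.1 Mustela putorius furo 79 0.0 4364"
)
print(row.species, row.identity, row.score)
```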