WERAM Information


Tag Content
WERAM ID WERAM-Gam-0044
Ensembl Protein ID ENSGMOP00000004922.1
Gene Name kmt2d
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSGMOG00000004488.1 ENSGMOT00000005065.1 ENSGMOP00000004922.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 2.10e-44 150.5 4491 4606
Me_Reader PHD 5.30e-32 109.9 161 4230
Organism Gadus morhua
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+k++ek+++viEY+G+vir+eva++rek ye++++g+y+fr++++ v+dat +g+ ar+ nhsc+pNc+
ENSGMOP00000004922.1 4491 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTVIRNEVANRREKIYEEQNRGIYMFRINNE--QVIDATLTGGPARYANHSCAPNCV 4575
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSGMOP00000004922.1 4576 AEVVTFDREDKIIIISSRRIPKGEELTYDYQ 4606
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C C k+ + C C++ +H C + s l+ ++ + +C +++
ENSGMOP00000004922.1 161 HCEYC-KRLGA---TIRCHAegCSRFYHFPCSAASGSFLSmKKLALLCSEHMD 209
68888.44443...69**9999**********999777775556789999885 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC ++ + ++ C +C + +H+ C+++ ++++ w Cp+Ck
ENSGMOP00000004922.1 220 WCAVC-DSAGDLLDLLFCTGCGQHYHAACLEIGATPIQRT-GWQCPECK 266
7****.66666666*******************9999976.7******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C++++e++k m+ Cd Cd+ +H+ C+++++++lp++ w C++C+
ENSGMOP00000004922.1 267 VCQTCRNPGEDTK-MLVCDACDKGYHTFCLQPAMETLPSD-PWKCRRCR 313
8****99999977.************************99.9******8 PP
PHD.txt 4 ClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C vC+k+ + ++ C+ C++ +H +C + + g++++C Cke
ENSGMOP00000004922.1 348 CAVCSKAASPSVTLQSCSLCHRHVHSECALAA--AESTGDKYTCLLCKE 394
7777666666554555*************999..4444459******97 PP
PHD.txt 2 tiClvCg...kddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg k eg +++ C +C + +H +Cv+ + +++ k w+C +C
ENSGMOP00000004922.1 682 DMCVVCGsfgKGAEG--QLLACAQCAQCYHPYCVNSKITKMMLRKGWRCLEC 731
68****734333333..49******************888884447*****9 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCgk+++ ++ ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSGMOP00000004922.1 733 VCEVCGKASDPSR-LLLCDDCDVSYHTYCLDPPLHTVPKG-GWKCKWCV 779
7****99999988.**************************.9**99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklp.lsslpeg...kswyCpsCk 51
+C+vC+++ +e+ ++qC++Cd+w+H+ C +l + e+ + + C sC+
ENSGMOP00000004922.1 810 TCPVCRENFMEEELLLQCQHCDRWVHAVCESLYtE-DEVEQasdEGFACTSCT 861
7****776666667******************942.3224434345******7 PP
PHD.txt 3 iClvCgkddegeke....mvqCdeCddwfHlkCvk 33
+C +C+++++g++ +++ d d w+Hl+C
ENSGMOP00000004922.1 4125 CC-FCHEEGDGATDgparLLNIDV-DLWVHLNCAL 4157
55.598888887667776666555.6699999975 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
t+C C+k+++ + C+ C++ +H C ++ ++k +C ++k
ENSGMOP00000004922.1 4183 TLCAYCQKTGATS----SCNRlrCPNVYHFACAARARCMFFKDKTMLCTQHK 4230
68****7777776....6**99**********99986666677678888876 PP

Protein Sequence
(Fasta)
MDEQKSNCEE NDSEPTADDG APAKQSLKSG EESETIPLED DKSGATPAMS STLVESTRAC 60
ALCNCVERSL HGQRELRHFR PSSDPPTLEP TASSLPGPGN DDLSSIGFSD SSCLAALFDD 120
SGTGGCWVHH WCAVWSEGVK RLENDQLEDV DTVVISGTQR HCEYCKRLGA TIRCHAEGCS 180
RFYHFPCSAA SGSFLSMKKL ALLCSEHMDK AEELVGEEAW CAVCDSAGDL LDLLFCTGCG 240
QHYHAACLEI GATPIQRTGW QCPECKVCQT CRNPGEDTKM LVCDACDKGY HTFCLQPAME 300
TLPSDPWKCR RCRVCSECGV RGPALPGTQW FDNYAVCEDC QQQRSSKCAV CSKAASPSVT 360
LQSCSLCHRH VHSECALAAA ESTGDKYTCL LCKEAPPPLD PAETRAGEAS EEDNAVIKMS 420
TEPSDEPVVA KERTTTSQRG MVGAEAVEDE TPMELGMGSR PCLGSSPPPS APADTQGCPS 480
ADEADCRALP QSEEEEDEEE DEEEEEDDIK QEPVERQVKV ELLDEASNMS PGDGSSSGFL 540
GSPAEPDPQD FCLLPARRSR ADSLLTETDD SLPFEPHKCD GEKLRRRGSP GRSRIKQGRG 600
SGFPGKRRPR GGGGGGAGAG RGRGGRSRLK AMASCIDAFL LSMTEETGHN KEEGGEVEEE 660
DEAMQNTVVL FSNTDKFVLL QDMCVVCGSF GKGAEGQLLA CAQCAQCYHP YCVNSKITKM 720
MLRKGWRCLE CIVCEVCGKA SDPSRLLLCD DCDVSYHTYC LDPPLHTVPK GGWKCKWCVS 780
CVQCASHSPG FHCEWQNNYT HCGPCASLVT CPVCRENFME EELLLQCQHC DRWVHAVCES 840
LYTEDEVEQA SDEGFACTSC TPYVPKPVGK WLNTADVYTW GRGEPQFYRL EGVWLTESGM 900
SLLRSISMSP LHKRRQRRSR LGGLCGDGGD GLELKEGDGG DGDEGKGEPM DCETKMEPPG 960
TPERELGAEG SGEGLGYCDG VKGGAEETED GKKRKRKPYR PAGIGGFMVR QRKCHTRIKK 1020
GFFAQLAGET TLDGQPLERT IEEDNIMDPK PAEGEEQKKR RGRKKSKLED MFPAYLQEAF 1080
FGKTLIDMSK KAVVIPPVQR PGMCPSRPPE PALPGLIPPV QEGDQPVRVV ADVGGLFIWF 1140
PPLSSPARDD RGTLPLKREV VDTPLSQGEG ASLPQGMESQ DSEQFFRRVL GVSDGSTLEG 1200
MKPILEGSKG ELNRNALQQR PLLSGSLPSA GVMDTFPGLA QSPFFDMRDR GGLFSPDGGE 1260
ESPWATPSTP ATPSTPPTPT EVEGDGLSYN QRSLQRWEKD EELGELSTIS PVLYANTNFP 1320
TLKQDYPDWA SRCKQIMKIW RKVSAADKVP FLQKAKDNRA SQRINKAQKQ AESQVCRPIK 1380
TEPGRVKGER PSLHLQIPPP SGSASTPSQP SSAESPFPFP PIHGSSSAFF PDGPPKTPGS 1440
AEIRTDPFAK LPPQSPRAHS QPSTPFSQAG TSPSQANSSG YPPPGPHGPP QGRPAPGPFD 1500
MQPGTPGTPR RAQQVDPYFR SQLQQQQMQQ GSLESLGPPQ SPHSRAGGPG EPLFSPPHTP 1560
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1620
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1680
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1740
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1800
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1860
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXRQK LRDFLIKQQV 1920
KSNPAAGWHG GDLAQSHKAP PPYPQDRVAA AISGPQAAMA RKMPMAIGGM EEKLLCPQAL 1980
ETPGVLDPNA MRQPGPGNPQ AMYGRPPFPP QWQGQASGPR RFPPPGMDSM PPRHHLNPAV 2040
NMQAMQGMVN PRGMMAGPGE AMQPMGQGPP PQFIELRHNA QRLPLGPHFM LRGPQARPRL 2100
CPPQQDLAAP YVQQHTMQVD GGDSQMGLQQ GGLSMLMPTC PATQQGQHPQ QTIAHSSNHP 2160
GSQQQQQQPS NNTQQQPSTA HRASGPEDLP EPDLEGLSDA PGDGGVEDDD LALDLDPDKG 2220
DDDLGNLDNL ETNDPHLDDL LNSDEFDLLA YTDPELDQGD PKDVFSDQLR LVEAESEAPT 2280
ASGSADIKVE QKPKLEPGQV SDTSASSALH APPSETASTS KIKLEDRGLM PQHQDGQMVI 2340
KDEMGEAVSM LLGGTGASGK QTQPQAPSAS LSSVRLGGIS YPLPGQGDPL SFPPSTPHPD 2400
LGADPLGLPD VGGHTSPSVD MAKVESSLDG ELPLLIQDLL EHEKKEQQKQ QQLSSMHQAG 2460
MPSHMQGMPG QQPNPQAPPG ALMLQQQHHH RPPPQVMMGQ PGMGPRPMHA MQPQHQQQRF 2520
MGPGMAPPPH MAQQQAMMRL GQPGGMHPGM NHQPQRWAKP PMANNFFPNK DLDTFASDDN 2580
MDPIAKAKMV ALKGIKRVLA QDPLGVPPGI NRQQVSLLAQ RLASAPGADQ LGQAVPGSSK 2640
EGETSTPLQT RPNPPQFTQG IINDAEQQQY EEWLIHTQQL LQMQLKFLEE QIGAHRKSRK 2700
ALCAKQRTAK KAGREFAETD AEKLKLVTEE QSKIQKQLDQ VRKQQKEHTN LIAEYKSKQQ 2760
QHQQGSGLLK PGPSALAPPH MLSKMPGQMM MGQPPGMMPQ GQPFMAGAPP QNPGVLVPPP 2820
GPPGAPAGYF PQGPGMQGAD PRLLQERQLQ HRMQLAKVMP HPGQQPGMMP QAQPGIMGNQ 2880
LMAQQQPNIQ QGMPVDQANQ QGMVPVPQGM VGGQPVPQLP PNMVPMNQPP GMMSAQPGIM 2940
VTQPDGPSQQ QQRPQLMMGP QGMVVAPGHP GIRGPQAQLT LQQQNILAQR MISQQQMQQQ 3000
QQMAHRQQSQ GLINQPNQDQ RTSQPSTPQM CSSPSAGSIT PQQQGGTDNQ NPGLKERAML 3060
TPAPRTPLQQ SGPPTASPMV QQGSTGEQHI QNQRRHGLMV HQSALVNIKQ ERQQMDVSTS 3120
QQQQQHAVQN VPQQSQDPGT LQHVMGQNPG PMQPQPALMG HPSPQQQALM AQQQKQQAMM 3180
GMMRAQQPGM MVQRPGAPPG QIRMPNINIQ AIIAQNPQLR NLPPNQQIQH IHAMIAQRQQ 3240
QQQGQMMRMS MAQGQPGQMR PQMAPGQLHQ GDQRMPGALG QQPGMPSQIL QGMMVPGQPP 3300
QQVGQMMQQM SRGQMPMVRL PMDPSRMVRP MSPHQSLPSS PGDPQRHAMA QALGMCPPTP 3360
NHQQQQAHMV AAAGRMPGSP SQAGSPRGPS FTRMDSSPAT PGTPHSTHVP SPSQAEGGAG 3420
RGSPYNQVRA SPLRSPGAKS PHHYPGLKAE PHSSANDASQ TPSVPLNGPQ QEERLQQHLP 3480
QKASAGHGPQ PGSREGALCR MTLQNIKQEP GETQCDSGSP AGAHPGAIKR EELMNCGHPS 3540
GFINAENMGG DPGTLGRSET GQQLLQKLLR TKNLGAQRPS EGIHNEINGH INSKLAMLEQ 3600
KLQGTPRNME HSYLDLQSIT KKTPLAKAKR TNKPGGERGP NPRKKNKKED VGKSAEALMK 3660
QLKQGLSLLP LMEPSITASL DLFAPFGSSS ANGKAPLKGS FGNAVLDNIP DYYSQLLTKS 3720
NLSNPPTPPS SLPPTPPPSV QHKLPNGVTA GEELADTRKQ AETTEDTMDP VSQEVKSVDI 3780
LAALPTPPHN QNEDIRMESD DEDASESIVQ ASSPESALGD AMARFPCLRE PKEEETERAI 3840
SPIIPLIPRS AIPVFPEIKP FEATDSKAVS TSNNWDSSKN NEVSVTFMLS SAAAKNLNHM 3900
MVAMAQLLHI RMPGSYEVTF PPTPGTPGAA GPGNAPEQPG DGGTHDGPSV SQDDWLRQFD 3960
VTLPGCTLKK QVDVLALIKQ EFSQQQDTPA QHCYTTNVND LDVRHLPVIP VEESPPPSPS 4020
PPPPPSEPAP VGDGGKPPSL AAVHIKTEPE PEAVPAADSA ERPVGSEAAX XXXXXXXXXX 4080
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXRPDRLPR DKRKCCFCHE EGDGATDGPA 4140
RLLNIDVDLW VHLNCALWST EVYETQGGAL INVEVALRRG LRTLCAYCQK TGATSSCNRL 4200
RCPNVYHFAC AARARCMFFK DKTMLCTQHK LKGPSEEELG SFSVYRRVYI ERDEVKQIAS 4260
ILQRGDRFHL FRVGGLIFHS VGQLLPSQMA SFHSPTAIFP VGYEATRIYW STRVPNKRCR 4320
YRCRVNEQDS RPFFEVRVLE HGMEDLHYSD TTPEGIWDRV VQQVAKLRDE SAMLKLFADR 4380
VKGEEMYGLT IHAVMRITES LPGVENCQNY QFRYGRHPLM ELPLMINPSG CARSEAKVPT 4440
HCKRPHTLNS TSMSKAYQST FTGETNTPYS KQFVHSKSSQ YRRLKTEWKN NVYLARSRIQ 4500
GLGLYAAKDL EKHTMVIEYI GTVIRNEVAN RREKIYEEQN RGIYMFRINN EQVIDATLTG 4560
GPARYANHSC APNCVAEVVT FDREDKIIII SSRRIPKGEE LTYDYQFDFE DDQHKIPCHC 4620
GAWNCRKWMN
Nucleotide Sequence
(Fasta)
ATGGATGAGC AGAAATCAAA CTGCGAAGAG AATGACTCGG AACCAACAGC TGATGACGGT 60
GCCCCTGCAA AGCAGTCTCT CAAATCTGGG GAGGAGTCGG AGACGATTCC TCTTGAGGAT 120
GACAAGTCTG GCGCTACGCC CGCCATGTCC AGCACATTGG TGGAGAGTAC GAGGGCCTGT 180
GCACTCTGTA ACTGTGTAGA GCGCAGCCTT CATGGACAAC GGGAACTCCG ACACTTTAGG 240
CCATCTTCCG ATCCGCCGAC CCTGGAACCC ACCGCCTCTT CCCTCCCAGG GCCTGGGAAC 300
GATGACCTGT CTTCCATCGG ATTCTCCGAC TCCTCTTGCC TCGCAGCACT CTTTGACGAC 360
TCGGGAACAG GGGGATGTTG GGTCCATCAC TGGTGTGCAG TGTGGTCGGA GGGAGTGAAG 420
CGGTTGGAGA ATGACCAACT GGAGGATGTA GACACGGTTG TTATCTCAGG GACACAACGG 480
CATTGTGAAT ACTGTAAGCG GCTGGGCGCG ACGATACGAT GCCATGCTGA GGGCTGCTCC 540
CGGTTCTACC ACTTCCCCTG CTCGGCAGCC AGTGGCTCCT TTCTGTCCAT GAAGAAGCTT 600
GCGCTCCTGT GTTCAGAGCA CATGGACAAG GCAGAAGAGT TGGTGGGTGA GGAGGCTTGG 660
TGTGCAGTGT GCGACTCTGC AGGGGACCTT CTGGACCTGC TGTTCTGTAC CGGTTGTGGG 720
CAGCATTACC ACGCCGCCTG TCTTGAGATC GGAGCCACGC CCATCCAGCG GACGGGCTGG 780
CAGTGCCCGG AGTGTAAAGT GTGCCAGACT TGCAGAAATC CAGGGGAAGA CACAAAGATG 840
TTGGTCTGCG ATGCCTGTGA TAAGGGCTAC CACACCTTCT GCCTCCAGCC AGCCATGGAA 900
ACCCTCCCCT CTGACCCTTG GAAATGCAGG AGGTGCCGAG TGTGCTCAGA GTGTGGCGTC 960
CGTGGGCCAG CGCTCCCAGG CACCCAGTGG TTTGATAACT ACGCTGTGTG TGAGGACTGC 1020
CAGCAGCAGC GGAGCTCCAA ATGTGCTGTG TGCAGTAAAG CGGCCTCCCC ATCTGTCACC 1080
TTGCAGAGCT GCAGCCTGTG CCACAGGCAT GTGCACAGTG AATGTGCATT GGCGGCTGCA 1140
GAGTCTACAG GAGATAAGTA CACCTGTCTG CTCTGTAAGG AGGCTCCGCC TCCGCTGGAC 1200
CCAGCAGAGA CCCGGGCGGG AGAGGCCTCG GAGGAGGACA ACGCGGTAAT AAAGATGTCA 1260
ACCGAGCCGT CGGACGAACC AGTGGTCGCA AAGGAACGCA CAACCACGTC TCAGAGGGGT 1320
ATGGTTGGAG CGGAGGCTGT AGAGGACGAG ACGCCCATGG AGTTGGGTAT GGGAAGCCGT 1380
CCATGTCTGG GATCCAGCCC TCCTCCCTCT GCTCCAGCCG ACACCCAGGG CTGCCCCTCT 1440
GCCGACGAGG CAGACTGCAG AGCACTGCCT CAGTCAGAGG AGGAGGAGGA CGAGGAGGAG 1500
GACGAGGAGG AAGAGGAGGA TGACATCAAG CAGGAGCCGG TGGAGCGGCA GGTGAAGGTG 1560
GAGCTGCTGG ACGAGGCCTC CAACATGAGC CCTGGCGACG GGAGCAGCAG TGGCTTCCTG 1620
GGGTCCCCGG CGGAGCCCGA CCCCCAGGAC TTCTGCCTGC TGCCGGCGCG CCGCTCCCGC 1680
GCAGACTCCC TGCTCACCGA GACGGACGAC TCGCTGCCCT TTGAGCCGCA TAAATGTGAC 1740
GGCGAGAAGC TGCGGCGGAG AGGGTCCCCC GGGCGCTCGC GGATCAAACA GGGGCGGGGT 1800
AGCGGTTTCC CAGGGAAGCG GCGACCTCGT GGTGGTGGTG GTGGAGGAGC TGGAGCAGGG 1860
CGGGGGCGCG GTGGGAGGTC ACGCCTCAAA GCCATGGCCT CCTGCATTGA CGCCTTCCTG 1920
CTGAGCATGA CCGAAGAGAC TGGACACAAT AAGGAAGAGG GAGGCGAGGT AGAAGAGGAA 1980
GACGAGGCCA TGCAGAACAC CGTGGTGCTC TTCTCAAACA CTGATAAATT TGTCTTGCTC 2040
CAGGACATGT GTGTCGTGTG TGGCAGTTTT GGGAAGGGAG CCGAGGGACA GCTGTTGGCC 2100
TGTGCTCAGT GTGCACAGTG TTACCATCCA TACTGCGTAA ACAGCAAGAT CACCAAGATG 2160
ATGCTCCGTA AGGGCTGGCG CTGCTTGGAG TGCATCGTGT GCGAGGTGTG CGGGAAGGCC 2220
TCGGACCCGT CGCGCCTGCT GCTATGCGAC GACTGCGACG TCAGCTACCA CACCTACTGC 2280
CTGGACCCGC CCCTGCACAC GGTCCCCAAG GGCGGCTGGA AGTGTAAATG GTGTGTGAGC 2340
TGCGTGCAGT GCGCCTCCCA CTCCCCCGGC TTCCACTGCG AGTGGCAGAA CAACTACACC 2400
CACTGCGGCC CCTGTGCCAG CCTGGTCACC TGCCCCGTCT GCCGGGAGAA CTTCATGGAG 2460
GAGGAGCTCC TGCTGCAGTG TCAGCACTGC GACCGATGGG TCCATGCCGT GTGTGAGAGT 2520
CTGTACACGG AGGACGAGGT AGAGCAGGCC TCCGATGAGG GCTTTGCGTG CACTTCCTGC 2580
ACTCCCTACG TCCCCAAGCC CGTAGGTAAG TGGCTGAACA CCGCGGATGT GTATACATGG 2640
GGGAGGGGGG AGCCCCAGTT CTACCGGCTG GAGGGCGTGT GGCTGACGGA GTCGGGCATG 2700
TCCCTGCTGC GCAGCATCTC CATGTCCCCG TTGCACAAGA GGAGGCAGCG GCGCTCCCGC 2760
CTCGGAGGCC TCTGTGGGGA CGGCGGAGAC GGCCTGGAGC TCAAAGAGGG CGACGGCGGC 2820
GACGGCGACG AGGGCAAAGG GGAGCCGATG GACTGTGAGA CTAAGATGGA GCCACCGGGG 2880
ACCCCCGAGA GAGAGCTGGG AGCGGAGGGC AGCGGTGAGG GCCTGGGCTA CTGTGACGGA 2940
GTCAAGGGAG GTGCAGAGGA GACGGAGGAC GGCAAGAAAA GGAAGAGGAA ACCCTACCGA 3000
CCTGCAGGGA TCGGGGGCTT CATGGTGCGG CAGAGAAAAT GTCACACGAG GATCAAGAAA 3060
GGATTCTTCG CCCAGCTGGC TGGAGAGACC ACTTTAGACG GACAGCCATT GGAAAGAACT 3120
ATCGAAGAGG ACAACATCAT GGACCCCAAG CCAGCCGAGG GCGAGGAGCA GAAGAAACGC 3180
CGGGGGAGAA AGAAGAGCAA ACTGGAGGAC ATGTTCCCTG CATATCTCCA GGAGGCGTTC 3240
TTTGGCAAGA CACTCATTGA CATGAGTAAG AAGGCGGTGG TGATACCTCC GGTCCAGAGG 3300
CCAGGAATGT GCCCGTCTCG ACCCCCGGAG CCGGCCTTAC CCGGCCTCAT CCCTCCAGTC 3360
CAAGAGGGTG ATCAACCTGT CAGAGTTGTC GCTGACGTCG GTGGTCTTTT TATTTGGTTC 3420
CCCCCCCTCA GCAGCCCAGC ACGGGATGAC CGAGGCACTC TGCCTCTCAA GCGCGAGGTG 3480
GTGGACACAC CCCTGTCTCA GGGAGAGGGA GCAAGTCTTC CTCAAGGCAT GGAGAGCCAA 3540
GATTCGGAGC AGTTCTTCCG CAGGGTGCTG GGGGTGTCCG ATGGCTCCAC TCTGGAGGGC 3600
ATGAAGCCCA TCTTAGAGGG CAGTAAAGGA GAGCTCAACC GCAACGCGTT GCAACAGAGA 3660
CCTCTCCTCT CAGGTTCCCT GCCCTCAGCT GGAGTGATGG ACACCTTCCC TGGCCTTGCC 3720
CAGTCTCCAT TCTTTGACAT GCGGGACCGT GGGGGGCTCT TCAGTCCAGA CGGTGGAGAG 3780
GAAAGCCCCT GGGCCACGCC CTCGACTCCG GCCACCCCTT CCACGCCACC CACCCCCACC 3840
GAGGTAGAGG GCGACGGGCT CTCCTACAAC CAGCGCAGCC TCCAGCGCTG GGAGAAGGAC 3900
GAGGAGCTGG GGGAGCTGTC CACCATCTCC CCCGTGCTCT ACGCCAACAC CAACTTCCCC 3960
ACGCTCAAGC AGGACTACCC AGACTGGGCG AGTCGCTGCA AACAGATCAT GAAGATCTGG 4020
AGGAAGGTTT CTGCGGCTGA CAAGGTTCCA TTTCTGCAAA AGGCCAAAGA TAATCGAGCA 4080
TCTCAGCGCA TCAACAAGGC ACAGAAGCAG GCGGAGAGCC AGGTGTGCCG GCCAATCAAG 4140
ACAGAACCAG GACGAGTGAA GGGCGAGCGG CCAAGCCTCC ACCTCCAGAT CCCTCCCCCC 4200
TCGGGCTCCG CCTCCACCCC GTCCCAGCCC AGCTCCGCGG AGAGCCCCTT CCCTTTCCCT 4260
CCGATCCACG GCTCCTCTTC CGCCTTTTTC CCGGACGGGC CCCCGAAGAC GCCGGGCTCT 4320
GCCGAGATCC GGACGGATCC GTTTGCCAAG CTGCCTCCTC AGTCCCCGCG CGCCCACTCC 4380
CAGCCCTCCA CCCCCTTCTC CCAGGCCGGG ACCAGCCCCT CGCAGGCCAA CTCCTCAGGT 4440
TACCCTCCTC CTGGCCCGCA CGGTCCACCT CAGGGACGGC CGGCCCCCGG CCCCTTCGAC 4500
ATGCAACCAG GCACCCCTGG CACTCCCCGG CGGGCCCAGC AGGTAGATCC CTATTTCAGA 4560
TCGCAATTGC AGCAGCAGCA GATGCAACAA GGCTCTCTGG AGTCCTTAGG ACCTCCACAA 4620
AGTCCTCATT CAAGGGCTGG TGGACCCGGG GAGCCGCTCT TCTCTCCTCC GCACACCCCC 4680
CNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4740
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4800
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4860
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4920
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4980
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5040
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5100
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5160
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5220
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5280
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5340
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5400
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5460
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5520
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5580
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5700
NNNNNNNNNN NNNNNNNNNN NCGGCAGAAA CTGAGAGATT TCTTAATCAA GCAACAAGTC 5760
AAGAGCAACC CCGCCGCTGG CTGGCATGGA GGAGACCTGG CACAGTCCCA CAAGGCCCCA 5820
CCGCCTTATC CACAGGACCG TGTGGCTGCT GCTATATCCG GACCCCAGGC TGCTATGGCC 5880
AGGAAGATGC CAATGGCTAT TGGGGGCATG GAGGAAAAGC TCCTCTGCCC ACAAGCTCTG 5940
GAAACCCCTG GCGTTTTGGA TCCCAATGCA ATGAGACAAC CCGGACCCGG CAACCCTCAA 6000
GCCATGTATG GCCGTCCACC GTTCCCTCCT CAGTGGCAGG GTCAGGCGTC CGGCCCCCGG 6060
AGGTTCCCAC CGCCTGGCAT GGACTCCATG CCCCCCCGGC ATCACCTCAA CCCTGCCGTC 6120
AATATGCAGG CCATGCAGGG GATGGTGAAC CCCCGTGGCA TGATGGCCGG GCCCGGAGAG 6180
GCCATGCAGC CAATGGGGCA GGGACCCCCA CCGCAGTTCA TCGAGTTAAG GCACAACGCT 6240
CAGAGGTTGC CCCTCGGACC CCATTTCATG CTCAGAGGGC CACAGGCCAG ACCCCGTCTC 6300
TGCCCTCCCC AACAGGACCT CGCAGCGCCT TATGTGCAGC AACATACCAT GCAGGTCGAC 6360
GGCGGCGACT CACAGATGGG GCTGCAGCAG GGAGGACTAT CGATGTTGAT GCCCACTTGC 6420
CCTGCAACGC AACAAGGCCA GCATCCGCAA CAAACAATCG CTCACTCGTC CAACCATCCT 6480
GGTTCACAGC AACAGCAGCA ACAACCGAGC AACAACACAC AGCAGCAGCC CTCCACAGCC 6540
CATAGGGCCT CAGGTCCCGA GGACCTGCCT GAACCTGACC TGGAGGGCCT TTCGGACGCC 6600
CCTGGGGATG GCGGTGTGGA GGACGACGAC CTGGCCCTCG ACCTGGACCC TGACAAAGGA 6660
GATGACGACC TGGGTAACCT TGACAATCTA GAAACCAATG ACCCACACCT TGACGATCTG 6720
CTGAACAGTG ACGAGTTTGA TCTGCTGGCC TACACTGACC CCGAGCTAGA CCAAGGGGAC 6780
CCAAAAGACG TGTTCTCGGA CCAGCTCCGC TTGGTGGAGG CGGAGAGCGA GGCGCCTACT 6840
GCCTCTGGCT CCGCTGACAT CAAGGTGGAG CAGAAACCCA AGCTGGAGCC CGGACAGGTT 6900
TCAGATACGT CAGCCTCATC AGCTCTTCAC GCTCCCCCCT CAGAGACGGC CAGCACCTCC 6960
AAAATCAAAC TGGAGGACCG AGGCCTAATG CCACAGCATC AGGACGGACA GATGGTCATC 7020
AAAGACGAGA TGGGAGAGGC CGTATCCATG TTACTGGGTG GAACGGGTGC ATCAGGCAAG 7080
CAAACACAAC CGCAGGCTCC GTCTGCCTCT CTCAGCTCTG TTCGCTTAGG GGGAATATCC 7140
TACCCTCTCC CGGGTCAAGG CGATCCCCTC TCATTCCCTC CGTCCACCCC ACACCCAGAC 7200
CTGGGGGCGG ACCCACTGGG GCTGCCCGAT GTCGGGGGTC ACACCTCTCC TTCTGTAGAC 7260
ATGGCCAAGG TAGAGAGCTC TTTAGACGGG GAGTTGCCTC TTCTCATACA GGATCTACTG 7320
GAGCATGAGA AGAAGGAGCA GCAGAAGCAG CAACAACTCA GCTCCATGCA CCAGGCAGGC 7380
ATGCCCAGCC ACATGCAGGG CATGCCCGGT CAACAGCCCA ACCCGCAGGC CCCTCCTGGC 7440
GCCCTGATGC TGCAGCAGCA GCACCACCAC CGTCCGCCTC CCCAAGTCAT GATGGGCCAG 7500
CCGGGAATGG GTCCGCGGCC CATGCACGCC ATGCAGCCCC AGCACCAGCA GCAGAGGTTC 7560
ATGGGACCGG GAATGGCTCC TCCACCTCAC ATGGCTCAGC AGCAGGCCAT GATGAGGCTG 7620
GGTCAGCCTG GGGGCATGCA CCCCGGGATG AATCACCAAC CACAGAGATG GGCAAAGCCC 7680
CCCATGGCCA ATAACTTCTT CCCAAACAAA GATTTGGATA CATTTGCTTC AGATGACAAC 7740
ATGGATCCCA TTGCAAAAGC CAAGATGGTG GCGCTAAAGG GCATCAAGAG AGTGTTGGCA 7800
CAAGATCCAC TCGGTGTCCC ACCTGGGATC AACAGACAAC AAGTGTCTCT GCTGGCCCAG 7860
AGGTTGGCCT CTGCCCCAGG AGCCGATCAA CTTGGGCAGG CTGTCCCGGG ATCATCCAAG 7920
GAGGGAGAAA CCAGTACGCC TCTCCAGACA AGACCCAATC CCCCGCAGTT TACTCAAGGA 7980
ATAATCAATG ACGCCGAGCA GCAGCAGTAC GAGGAGTGGC TCATCCACAC CCAGCAGCTG 8040
CTCCAGATGC AGTTGAAGTT CTTGGAAGAA CAGATCGGCG CTCACCGGAA GTCCCGCAAG 8100
GCCCTCTGCG CCAAGCAGCG CACGGCCAAG AAGGCCGGGA GGGAGTTTGC CGAGACCGAC 8160
GCAGAGAAAC TCAAACTGGT GACCGAAGAA CAGAGCAAGA TCCAGAAGCA GCTGGACCAG 8220
GTCCGCAAGC AGCAGAAGGA GCACACTAAC CTCATCGCCG AGTACAAAAG CAAACAGCAG 8280
CAGCACCAGC AGGGCTCTGG CTTATTGAAG CCCGGCCCCT CTGCACTGGC CCCTCCTCAC 8340
ATGCTCTCCA AGATGCCTGG CCAGATGATG ATGGGCCAGC CGCCTGGCAT GATGCCCCAG 8400
GGGCAGCCCT TCATGGCCGG GGCCCCCCCC CAGAACCCCG GGGTCCTTGT CCCCCCCCCG 8460
GGTCCTCCGG GCGCCCCCGC GGGCTATTTC CCCCAGGGGC CCGGGATGCA GGGCGCCGAC 8520
CCTCGGCTCC TCCAAGAAAG ACAGCTGCAA CATCGCATGC AGCTGGCCAA AGTCATGCCC 8580
CATCCAGGGC AGCAGCCCGG CATGATGCCA CAGGCACAAC CTGGGATCAT GGGGAACCAG 8640
CTCATGGCTC AACAGCAGCC AAACATTCAA CAAGGGATGC CAGTGGACCA GGCCAACCAG 8700
CAGGGCATGG TGCCAGTTCC CCAGGGAATG GTTGGGGGCC AGCCAGTCCC CCAGCTGCCG 8760
CCGAACATGG TGCCCATGAA TCAGCCTCCG GGCATGATGT CAGCTCAGCC TGGGATCATG 8820
GTGACTCAGC CCGACGGCCC ATCTCAGCAG CAGCAGAGAC CTCAGTTAAT GATGGGTCCG 8880
CAAGGAATGG TTGTGGCTCC CGGTCACCCT GGCATCAGGG GTCCACAGGC CCAGCTTACT 8940
CTGCAACAGC AAAACATCTT AGCCCAGCGA ATGATATCTC AGCAGCAGAT GCAGCAGCAG 9000
CAACAGATGG CACACCGACA GCAGTCCCAA GGCCTCATCA ATCAACCCAA CCAAGACCAG 9060
AGGACCTCCC AACCTTCAAC GCCGCAAATG TGCTCCTCCC CTTCTGCAGG GAGCATCACT 9120
CCCCAGCAAC AGGGCGGTAC AGACAACCAG AACCCAGGCC TTAAAGAAAG AGCAATGCTC 9180
ACCCCAGCTC CCAGGACCCC ATTACAGCAG AGTGGGCCTC CGACTGCCAG TCCCATGGTT 9240
CAACAAGGAT CTACAGGGGA GCAACATATT CAGAACCAAC GCCGGCATGG CCTCATGGTC 9300
CACCAGTCTG CTCTGGTCAA CATCAAGCAG GAGCGGCAGC AGATGGATGT CTCAACTTCA 9360
CAGCAGCAAC AACAACATGC GGTTCAGAAT GTCCCGCAGC AGTCCCAAGA TCCCGGCACC 9420
CTGCAGCATG TCATGGGCCA GAACCCGGGT CCGATGCAGC CGCAGCCGGC CCTGATGGGT 9480
CATCCCAGCC CCCAGCAGCA GGCCCTCATG GCGCAGCAGC AGAAGCAACA GGCCATGATG 9540
GGCATGATGA GGGCACAGCA GCCGGGCATG ATGGTCCAGA GGCCGGGCGC GCCCCCGGGA 9600
CAGATCCGCA TGCCCAACAT CAACATCCAG GCCATCATCG CTCAGAATCC CCAGCTCCGC 9660
AACCTCCCCC CGAACCAGCA GATCCAACAC ATCCACGCCA TGATCGCCCA GCGGCAGCAG 9720
CAGCAGCAGG GCCAGATGAT GAGGATGTCC ATGGCCCAGG GCCAGCCGGG TCAGATGAGG 9780
CCTCAGATGG CGCCGGGCCA GCTCCACCAG GGGGACCAGC GGATGCCGGG TGCGTTGGGA 9840
CAGCAGCCCG GGATGCCCTC TCAGATCCTG CAGGGGATGA TGGTCCCGGG GCAGCCGCCG 9900
CAGCAAGTGG GGCAGATGAT GCAGCAGATG AGCCGGGGTC AGATGCCGAT GGTGCGGTTG 9960
CCGATGGACC CCAGTCGTAT GGTGAGGCCC ATGTCTCCGC ACCAATCCCT CCCCAGCTCC 10020
CCTGGGGACC CCCAGCGCCA CGCCATGGCT CAGGCGCTGG GCATGTGTCC GCCCACTCCC 10080
AACCATCAAC AGCAGCAAGC GCACATGGTC GCGGCGGCGG GCCGAATGCC GGGTTCCCCG 10140
TCCCAAGCGG GGTCTCCAAG AGGACCCTCC TTCACGAGGA TGGACAGCAG TCCCGCCACC 10200
CCCGGGACGC CGCACTCCAC ACACGTTCCT TCCCCGTCGC AAGCCGAGGG TGGCGCCGGC 10260
AGAGGGAGTC CGTACAACCA GGTCAGGGCG TCCCCGCTCC GGTCCCCCGG CGCCAAGAGC 10320
CCCCATCACT ACCCAGGGCT GAAGGCAGAG CCGCACTCCT CCGCTAATGA CGCGTCCCAG 10380
ACCCCCTCGG TGCCCCTCAA CGGGCCGCAG CAGGAGGAGC GTCTTCAGCA GCATCTCCCG 10440
CAAAAGGCCT CGGCGGGCCA CGGCCCTCAG CCCGGCTCCA GGGAGGGGGC CCTGTGCCGG 10500
ATGACTCTCC AGAACATCAA GCAGGAGCCC GGGGAGACGC AGTGTGACAG CGGCTCACCG 10560
GCGGGCGCCC ACCCCGGGGC GATAAAGAGG GAGGAGCTCA TGAACTGCGG CCACCCTTCT 10620
GGCTTCATCA ACGCAGAGAA CATGGGCGGG GACCCGGGCA CGCTGGGCCG ATCGGAAACG 10680
GGACAGCAGC TACTCCAGAA GCTCCTGAGG ACCAAGAACC TTGGCGCCCA GAGGCCCTCC 10740
GAGGGGATCC ACAACGAGAT CAACGGCCAC ATCAACAGCA AGCTGGCCAT GCTGGAGCAG 10800
AAGCTACAGG GAACCCCGCG CAACATGGAG CACTCGTATC TTGACCTGCA GTCCATCACT 10860
AAAAAGACTC CTCTGGCCAA GGCCAAACGT ACCAATAAGC CGGGCGGGGA GCGAGGGCCC 10920
AACCCTCGGA AGAAGAACAA GAAAGAGGAT GTTGGAAAGA GTGCTGAGGC CCTGATGAAA 10980
CAACTGAAAC AGGGCCTCTC TCTATTGCCT CTCATGGAGC CCTCCATTAC CGCCAGCCTT 11040
GACCTGTTCG CGCCCTTCGG CAGCAGCTCG GCCAATGGCA AAGCCCCCCT CAAAGGCTCT 11100
TTTGGAAACG CAGTGTTGGA CAACATCCCC GACTACTACT CACAGCTGCT CACCAAGAGT 11160
AACCTCAGCA ACCCCCCCAC GCCCCCGTCC TCCCTCCCCC CGACGCCCCC ACCATCGGTG 11220
CAGCACAAGC TGCCCAACGG GGTCACTGCG GGGGAGGAGC TGGCCGACAC TCGGAAGCAG 11280
GCCGAGACCA CCGAAGACAC TATGGATCCC GTCAGTCAAG AGGTGAAGAG TGTGGACATC 11340
CTGGCTGCTC TCCCAACTCC ACCGCACAAC CAGAACGAGG ACATCAGGAT GGAGAGTGAC 11400
GACGAGGACG CCTCGGAGAG CATCGTCCAG GCCTCGTCCC CGGAGAGCGC CCTGGGCGAC 11460
GCCATGGCCC GCTTCCCCTG TCTCCGGGAG CCCAAAGAGG AGGAGACGGA GCGGGCCATA 11520
TCCCCCATCA TCCCCCTCAT CCCGCGCAGC GCTATCCCAG TTTTCCCTGA GATCAAGCCA 11580
TTCGAGGCCA CCGACAGCAA GGCCGTCTCC ACGTCAAACA ACTGGGACAG CTCCAAAAAC 11640
AACGAGGTCT CCGTCACTTT CATGCTGTCC TCCGCCGCAG CCAAGAACCT GAACCACATG 11700
ATGGTGGCCA TGGCCCAGCT GCTTCACATC AGGATGCCCG GGTCCTACGA GGTGACCTTC 11760
CCTCCGACCC CGGGAACACC TGGAGCCGCC GGGCCAGGTA ATGCCCCTGA ACAGCCAGGC 11820
GACGGCGGTA CACACGACGG GCCCTCGGTG AGCCAGGACG ACTGGCTGAG GCAGTTTGAC 11880
GTGACGCTAC CAGGGTGCAC GCTCAAGAAG CAGGTGGACG TCCTCGCCCT CATCAAGCAG 11940
GAGTTCTCTC AGCAGCAGGA CACACCAGCG CAGCACTGCT ACACCACCAA CGTCAACGAC 12000
CTGGACGTGC GCCACCTGCC CGTCATCCCG GTGGAGGAGT CCCCGCCCCC TTCCCCCTCG 12060
CCTCCCCCGC CCCCGTCGGA GCCCGCCCCC GTAGGCGACG GTGGAAAGCC GCCGTCCCTC 12120
GCCGCGGTTC ACATCAAGAC GGAGCCGGAG CCGGAGGCCG TGCCCGCCGC CGACTCAGCT 12180
GAGAGGCCTG TCGGGTCTGA GGCCGCCNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 12240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 12300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNTGC GGCCCGACCG GCTGCCGCGG 12360
GACAAGCGCA AGTGCTGCTT CTGCCACGAG GAGGGCGACG GTGCCACCGA CGGCCCGGCG 12420
CGCCTCCTCA ACATCGACGT GGACCTCTGG GTGCACCTCA ACTGCGCGCT GTGGTCCACG 12480
GAGGTGTACG AGACGCAGGG CGGCGCCCTC ATCAACGTGG AGGTGGCGCT GCGCCGGGGC 12540
CTGCGCACGC TCTGCGCCTA CTGCCAGAAG ACGGGCGCCA CCAGCAGCTG CAACCGGCTG 12600
CGCTGCCCCA ACGTCTACCA CTTCGCCTGC GCCGCGCGCG CCCGCTGCAT GTTCTTCAAG 12660
GACAAGACCA TGCTGTGCAC GCAGCACAAG CTGAAGGGCC CCAGCGAGGA GGAGCTGGGC 12720
AGCTTCTCGG TGTACCGCCG CGTCTACATC GAGCGCGACG AGGTCAAGCA GATCGCCAGC 12780
ATCCTGCAGC GCGGCGACCG CTTCCACCTG TTCCGCGTGG GCGGCCTCAT CTTCCACTCG 12840
GTGGGCCAGC TGCTGCCCTC CCAGATGGCC AGCTTCCACT CGCCCACCGC CATCTTCCCC 12900
GTGGGCTACG AGGCCACGCG CATCTACTGG AGCACGCGCG TGCCCAACAA GCGCTGCCGC 12960
TACCGCTGCC GCGTCAACGA GCAGGACAGC CGGCCCTTCT TCGAGGTGCG CGTGCTGGAG 13020
CACGGCATGG AGGACCTGCA CTACAGCGAC ACCACGCCCG AGGGCATCTG GGACCGAGTG 13080
GTTCAGCAGG TGGCCAAGCT GCGGGACGAG TCGGCCATGC TGAAGCTGTT CGCCGACCGG 13140
GTGAAGGGAG AGGAGATGTA CGGCCTCACC ATCCACGCTG TCATGCGCAT CACTGAGTCG 13200
CTGCCCGGAG TGGAGAACTG CCAGAACTAC CAGTTCCGCT ACGGCCGCCA CCCTCTCATG 13260
GAGCTCCCTC TCATGATCAA CCCGAGTGGC TGCGCCCGCT CAGAAGCCAA GGTCCCCACC 13320
CACTGCAAGA GGCCGCACAC ACTCAACAGC ACCAGCATGT CCAAGGCCTA CCAGAGCACC 13380
TTCACAGGCG AGACCAACAC GCCCTACAGC AAGCAGTTCG TCCACTCCAA GTCGTCCCAG 13440
TACCGCCGGC TGAAGACAGA GTGGAAGAAC AACGTGTACC TGGCCCGCTC GCGCATCCAG 13500
GGCCTGGGGC TGTACGCCGC CAAGGACCTT GAGAAGCACA CCATGGTCAT CGAGTACATC 13560
GGCACGGTGA TCCGCAACGA GGTGGCCAAC CGGCGGGAGA AGATCTACGA GGAGCAGAAC 13620
CGTGGCATCT ACATGTTCCG CATCAACAAC GAGCAAGTGA TCGACGCCAC GCTGACCGGC 13680
GGACCGGCAC GCTACGCAAA TCACTCGTGT GCACCCAACT GCGTGGCAGA GGTTGTGACC 13740
TTTGACCGAG AGGACAAGAT CATCATCATC TCCAGTCGCC GGATCCCCAA AGGAGAAGAG 13800
TTGACCTACG ACTATCAGTT CGACTTCGAA GACGATCAGC ACAAGATCCC CTGCCATTGC 13860
GGAGCCTGGA ATTGCCGAAA GTGGATGAAT TGA 13894
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 74 0.0 2029
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 69 0.0 1872
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 69 0.0 1835
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 75 0.0 1312
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 69 0.0 1244
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 68 0.0 1195
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 67 0.0 1147
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 63 0.0 1118
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 91 0.0 1003
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 91 0.0 984
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 82 0.0 915
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 82 0.0 914
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 83 0.0 914
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 82 0.0 912
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 82 0.0 912
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 82 0.0 912
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 82 0.0 912
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 82 0.0 912
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 82 0.0 911
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 82 0.0 911
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 82 0.0 911
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 82 0.0 911
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 83 0.0 911
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 82 0.0 911
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 82 0.0 911
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 82 0.0 910
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 82 0.0 910
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 82 0.0 910
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 82 0.0 910
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 82 0.0 909
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 82 0.0 909
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 82 0.0 908
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 82 0.0 907
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 82 0.0 903
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 82 0.0 900
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 82 0.0 900
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 82 0.0 898
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 82 0.0 898
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 82 0.0 897
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 82 0.0 892
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 81 0.0 887
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 81 0.0 868
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 80 0.0 833
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 74 0.0 820
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 74 0.0 820
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 73 0.0 819
WERAM-Mod-0039 ENSMODP00000005827.3 Monodelphis domestica 73 0.0 819
WERAM-Nol-0046 ENSNLEP00000005663.2 Nomascus leucogenys 73 0.0 818
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 73 0.0 817
WERAM-Pat-0168 ENSPTRP00000046674.3 Pan troglodytes 73 0.0 815
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 72 0.0 808
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 80 0.0 794
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 54 0.0 784
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 54 0.0 767
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 763
WERAM-Dio-0019 ENSDORP00000002189.1 Dipodomys ordii 70 0.0 721
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 70 0.0 716
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 53 0.0 695
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 71 0.0 662
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 56 1e-168 593
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 50 2e-163 576
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 88 9e-149 527
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 8e-141 501
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 71 1e-96 354
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 68 1e-79 298
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 36 3e-60 233
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 34 3e-44 180
Created Date 25-Jun-2016