WERAM Information


Tag Content
WERAM ID WERAM-Ere-0080
Ensembl Protein ID ENSEEUP00000007438.1
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSEEUG00000007930.1 ENSEEUT00000008162.1 ENSEEUP00000007438.1
Status Unreviewed
Classification
Type Family E-value Score Start End
Me_Reader PHD 4.90e-19 67.8 283 1265
Organism Erinaceus europaeus
Domain Profile
  Me_Reader PHD

PHD.txt 12 egekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
++ ++m+ C++Cd+ +H+ C+++p+++lp+ sw C+ C+
ENSEEUP00000007438.1 283 ND-SKMLVCETCDKGYHTFCLQPPMEELPAH-SWKCKACR 320
44.45*************************9.*******8 PP

PHD.txt 3 iClvCgkddegeke......mvq....CdeCddw 26
+C++C + g+ e +++ C C+++
ENSEEUP00000007438.1 321 VCRAC---GGGSAElnpdseWFEnyslCHRCHKA 351
45555...44433222334466667779989876 PP

PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ ++C++ +H + v+ + +++ k w+C +C
ENSEEUP00000007438.1 1086 DMCVVCGSFGRGAEGhLLPGSKCSQCYHPYRVNSKITKVMLLKGWRCVEC 1135
68****8555554443777888*************888884446****** PP

PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSEEUP00000007438.1 1137 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1183
7****99999987.**************************.***99997 PP

PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSEEUP00000007438.1 1214 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1265
7*****99999999*****************9933333444434599*9997 PP

Protein Sequence
(Fasta)
MDSQKPPGDK DPEQGXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 60
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 120
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 180
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 240
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXK PGNDSKMLVC ETCDKGYHTF 300
CLQPPMEELP AHSWKCKACR VCRACGGGSA ELNPDSEWFE NYSLCHRCHK AQGSQPISSI 360
AEQPPPVCHR FSPPEPGDTP TDEPDALYVA CQGQPKGGHV TSMQPKEPGP LQCEAKPLGR 420
AGAQLEPQLD APIDEEMPLL PPPEESPLSP PPEESPTSPP PEASRLSPLP EESPLSPPPE 480
ESPLSPPPES SPFSPLEASP FSPPXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 540
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 600
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 660
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 720
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 780
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 840
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 900
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX GVSTDVSPAR 960
DEGSLRLCTD SLPETDDSLL CDAGAALGGG KAEGDKGRRR SSPARSRIKQ GRSSSFPGRR 1020
RPRGGAHGGR GRGRARLKST ASSVETLVVA DIDSSPSKEE EDDDDDTMQN TVVLFSNTDK 1080
FVLMQDMCVV CGSFGRGAEG HLLPGSKCSQ CYHPYRVNSK ITKVMLLKGW RCVECIVCEV 1140
CGQASDPSRL LLCDDCDISY HTYCLDPPLL TVPKGGWKCK WCVSCMQCGA ASPGFHCEWQ 1200
NSYTHCGPCA SLVTCPICHA PYVEEDLLIQ CRHCERWMHA GCESLFTEDD VEQAADEGFD 1260
CVSCQPYVVK PAAPVAPPES VPVKVKEPEP QYFRFEGVWL TETGMAVLRN LTMSPLHKRR 1320
QRRGRPGLPG EAGLEGSEPP DALGSDDKKD GDLDADDLLK GEGGVEHMEC EIKLEGPASP 1380
DVEAGKEETE ESKKRKRKPY RPGIGGFMVR QRKPHTRVKK RLAAQAEVLS GDGQPDEGET 1440
VIATDLPAES SMDPGLADGD EKKKQQRRGR KKSKLEDMFP AYLQEAFFGK ELLDLSRKAL 1500
FAVGVGRPSF GLGAPKAKGD GGSDRKELPS LHKGDDGPDV ADEESRGPEG KADTPGPEDG 1560
GVKASPVPSD PEKPGTPGEG MLSSDLDRIP TEELPKMESK DLQQLFKDVL GSEREQHLGC 1620
GTPGLDGSRT PLQRPFLQGG PLGSLPSSSP MGIPYPGLCQ YTLLDSRERG GFFSPEPGEP 1680
DSPWTGSGGT TPSTPTTPTT EGEGDGLSYN QRSLQRWEKD EELGQLSTIS PVLYANINFP 1740
NLKQDYPDWS SRCKQIMKLW RKVPAADKAP YLQKAKDNRA AHRISKVQKQ AESQINKQTK 1800
VGDMARKTDR PALHLRIPSQ PGALGSPPPA AAPTIFIGSP TAPAAGLSTS ADGFLKPPAG 1860
TVPGPDSPGE LFLKLPQVPX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1920
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1980
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2040
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2100
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2160
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2220
XXXGQPPNFA RSPGTGAFVG TPPPMRFTFP QAVGEPSLKP PVPQPSLPPP HGINSHFGPG 2280
PTLGKPQSTD YTVATGNFHP SGSPLGPSSG STGEGYGLSP LRPPSVLPPA PDGALPYLSH 2340
GASQRAGITS PVDKREDPGA GMGSSLAAPE LPGSQDPSMS NLSQTELEKQ RQRQRLRELL 2400
IRQQIQRNNL RQEKETAAAA AGAVGPPGGW GAEASGPAFE QLSRGQTPFS GTQDKSSLVG 2460
LPPNKLGGPI LGPGTFPTDD RLSRPPPPAT PSSMDVSSRX XXXXXXXXXX XXXXXXXXXX 2520
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXLGRQTL SSPLAGIPPR LPGPGEPVPG 2580
PAGPAQFIEL RHNVQKGLGP GGAPFPGQGP PQRPRFYPLS EDSHRLAAEG LRGLAVPGLP 2640
PQKPSAPPPA PELNNSLHPA PQAKGPTVPA GLELVSRPPS GTELGRPPPL ALEAGKLPCE 2700
DPELDDDFDA HKALEDDEEL AHLGLGVDVA KGDDELGTLE NLETNDPHLD DLLNGDEFDL 2760
LAYTDPELDT GDKKDIFNEH LRLVESANEK AEREALLRGV EPGPLGPEER PPPAADAAEP 2820
RLTSVLPEVK PKVEEGGRHP SPCQFSITAP KAEPAAATPS LGLGLKPAQS GVGNRDSRMG 2880
PGPFPSSGQT AEKGPFGTTA GPPAHLLTSS PLSGPGGSTL LEKFELESGP LTLPGGHAAS 2940
GDELDKMESS LVASELPLLI EDLLEHEKKE LQKKQQLSAQ LQPAQQPPQP QPQPPPLLSA 3000
PGPAQAMPLP HENSAPGLAG PQQQLALGLG GPRQSSLTQP LMPTQPPAHA HQQRLAPSMA 3060
MVSNQGHMMS GQHGGQAGLV PQQGPQPVLA QKPTGTMPHS MCMKPQPLAM PQQLANSFFP 3120
DTXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXQQVSLL AQRLSGGPGS 3180
DLQNHVAARS GQDRSTGDPS QPRPNPPTFA QGVINEADQQ QYEEWLFHTQ QLLQMQLKVL 3240
EEQIGVHRKS RKALCAKQRT AKKAGREFPE ADAEKLKLVT EQQSKIQKQL DQVRKQQKEH 3300
TNLMAEYRNK QQQQQQQQQQ QQQQHSAVLA LSPSQSPRLL TKLPGQLLPG HGLQPPQGPP 3360
GGQPGGLRLP PGSMALPGQP GGPFLNTALA QQQQQQHSGG PGPLAGPSGG FFPGNLALRG 3420
LGPESRLLQE RQLQLQQQRM QLAQKLQQQQ QQQQQHHLLG QVAIQQQQQQ GPGVQVNQAL 3480
GSKPQGLLPP SSHQGLLVQQ LSPQPPQGPQ GMLGPAQVAV LQQQQQQHPG TLGPQGPHRQ 3540
VLLTPSRVLS SPQLAQQGQG LMGHRLVASQ QHQQQGSMAG LSHLQQGLIT HSGQPKLNTQ 3600
PMGSLQQQQL QQQQQQQQQQ QLQQQQQQQQ QQQQQQQLQQ QQLQQQQQQQ QLQQQQQQQL 3660
QQQQQQLQQQ QQLQQQQFQQ QQQQQQMGLL NQGRTLLSPQ QQQQQQVTLG PGMPAKPLQH 3720
FSNPVALGST LLLMGKDQSI VETPLPPEVT EGPATLQGGP LAVGPIPESV ATEPGEVKPS 3780
LSGDSQLVLV QPQAQAQPNS VQLQPPLRLP GQQQQQVLLH TAAVGNHGQT GSGSXXXXXX 3840
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 3900
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 3960
XXXXXGLLGR QPQLGGFPGP QTGPLQELGA GLRSQGPPRL PAPQGALSTG PVLGPVHPTL 4020
PPSSPQEPKR PSSQLPSPNS QLPSDTQLTP NQPGTPKPQG PPLEMPPGRV SPAAAQLADT 4080
FFGKGLGPWD PPDNLVEAQK PDQSSMVPGH LEQVNGQVVP EAPSLSIKQE PREEPCALGA 4140
QAVKREANGE PVGAPGTSNH LLLAGPRSEA GHLLLQKLLR AKNVQLNTGR GPEGLRAEIN 4200
GHIDSKLAGL EQKLQGPHSN KEDTAARKPL TPKPKRVQKA SDRLVSSRKK LRKEDGVRAS 4260
EALLKQLKQE LSLLPLTEPT VTANFSLFAP FGSGSPISGQ CQLRGAFGSG ALSTGPDYYS 4320
QLLTKNNLSN PPTPPSSLPP TPPPSVQQKM VNGVTASEEL GEHPKDATSA RETEGTLRDA 4380
SEVKSLDLLA ALPTPPHNQT EDVRMESDED SDSPDSIVPA SSXXXXXXXX XXXXXXXXXX 4440
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4500
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4560
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4620
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4680
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4740
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4800
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4860
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4920
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 4980
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 5040
XXXXXXXXXX XXXXXXXXXX XXXPHTLNST SMKYQRTFTG ETSTPYSKQ 5089
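The listing above groups residues in blocks of ten and ends each line with a running position. A minimal sketch of how it could be joined back into one contiguous string, and how much of it is the placeholder residue X, is given below. It assumes the layout shown here (letter blocks separated by spaces, one trailing counter per line); the names `join_numbered_sequence` and `masked_fraction` are hypothetical helpers, not part of WERAM or Ensembl.

```python
import re

def join_numbered_sequence(text):
    """Join a grouped, position-numbered listing into one sequence string.

    Only alphabetic tokens are kept, so the spaces between blocks and the
    trailing position counter on each line are dropped.
    """
    return "".join(re.findall(r"[A-Za-z]+", text)).upper()

def masked_fraction(seq, mask="X"):
    """Return the fraction of positions equal to the placeholder symbol."""
    return seq.count(mask) / len(seq) if seq else 0.0

# First and last lines of the protein listing above.
first = "MDSQKPPGDK DPEQGXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 60"
last = "XXXXXXXXXX XXXXXXXXXX XXXPHTLNST SMKYQRTFTG ETSTPYSKQ 5089"

print(len(join_numbered_sequence(first)))              # 60
print(len(join_numbered_sequence(last)))               # 49
print(masked_fraction(join_numbered_sequence(first)))  # 0.75
```

Passing the whole listing as one multi-line string works the same way, since the counters are numeric and are skipped by the pattern.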
Nucleotide Sequence
(Fasta)
ATGGACAGCC AGAAGCCGCC TGGGGATAAG GATCCGGAAC AGGGGGNNNN NNNNNNNNNN 60
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 120
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 720
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 780
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNGAAG 840
CCTGGGAATG ACTCTAAAAT GTTGGTCTGT GAGACATGTG ACAAAGGATA CCACACCTTC 900
TGCCTGCAAC CACCTATGGA GGAACTTCCT GCTCATTCGT GGAAGTGCAA GGCATGCCGT 960
GTATGCCGAG CCTGTGGGGG GGGCTCAGCA GAGCTGAATC CTGACTCAGA GTGGTTTGAA 1020
AACTACTCGC TCTGTCATCG TTGTCACAAA GCCCAGGGAA GCCAGCCCAT CAGCTCTATT 1080
GCTGAGCAAC CTCCCCCAGT TTGCCACAGA TTCTCACCCC CAGAGCCTGG CGATACCCCC 1140
ACTGATGAGC CCGATGCTCT GTACGTTGCA TGCCAAGGGC AGCCAAAGGG TGGGCACGTG 1200
ACCTCTATGC AACCCAAGGA ACCGGGGCCC CTGCAATGTG AAGCCAAACC ACTAGGGAGA 1260
GCAGGGGCCC AACTTGAGCC CCAGTTGGAT GCCCCCATAG ATGAGGAGAT GCCACTGCTG 1320
CCCCCACCTG AGGAGTCACC CCTGTCGCCA CCACCTGAGG AATCACCTAC ATCCCCACCG 1380
CCTGAGGCTT CTCGTCTCTC CCCGCTGCCT GAGGAATCAC CCCTCTCTCC ACCTCCTGAG 1440
GAGTCTCCTC TGTCTCCCCC ACCCGAGTCG TCACCTTTTT CCCCGCTGGA GGCGTCACCC 1500
TTCTCTCCTC CAGANNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1560
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1620
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1680
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1740
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1800
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1860
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1920
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1980
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2040
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2100
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2160
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2220
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2280
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2340
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2400
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2460
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2520
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2580
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2700
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2760
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2820
NNNNNNNNNN NNNNNNNNNN NNNNNNNNTG GGGGTCTCTA CAGACGTTAG TCCAGCCCGA 2880
GATGAGGGCT CCCTGCGACT CTGTACCGAC TCTCTGCCAG AGACTGATGA CTCCCTATTG 2940
TGTGATGCCG GGGCAGCTCT TGGCGGAGGC AAAGCCGAAG GGGACAAAGG GAGGCGGCGC 3000
AGTTCCCCAG CTCGATCCCG CATCAAGCAG GGTCGCAGCA GTAGTTTTCC AGGAAGACGC 3060
AGACCACGTG GAGGGGCACA CGGAGGACGC GGGAGAGGAC GGGCCCGGCT AAAATCAACT 3120
GCATCTTCTG TTGAGACTCT GGTAGTTGCT GATATCGATA GCTCTCCCAG CAAGGAGGAG 3180
GAAGATGATG ATGATGACAC CATGCAAAAT ACCGTGGTCC TCTTCTCTAA TACAGACAAG 3240
TTTGTCCTAA TGCAGGACAT GTGTGTGGTA TGTGGCAGCT TTGGCCGGGG GGCAGAGGGC 3300
CACCTCCTGC CTGGTTCCAA GTGCTCTCAG TGCTACCACC CTTACCGTGT CAACAGCAAG 3360
ATCACCAAGG TGATGCTGCT GAAGGGCTGG CGCTGTGTAG AGTGCATTGT GTGCGAGGTG 3420
TGCGGCCAGG CCTCGGACCC CTCGCGCCTG CTGCTCTGCG ATGACTGTGA CATCAGCTAC 3480
CACACATACT GCCTGGACCC TCCACTGCTC ACTGTGCCCA AGGGTGGCTG GAAGTGCAAG 3540
TGGTGTGTGT CTTGTATGCA GTGTGGGGCT GCCTCCCCCG GTTTCCACTG TGAATGGCAG 3600
AACAGTTACA CTCATTGTGG GCCCTGTGCC AGCCTGGTGA CCTGCCCCAT CTGCCATGCC 3660
CCATACGTGG AGGAGGACCT GCTCATCCAG TGCCGCCACT GTGAACGGTG GATGCACGCC 3720
GGCTGTGAGA GCCTCTTCAC AGAGGATGAT GTGGAACAGG CGGCCGATGA GGGCTTCGAC 3780
TGTGTCTCCT GCCAGCCTTA CGTGGTAAAG CCTGCAGCAC CTGTTGCACC TCCAGAGTCG 3840
GTGCCCGTGA AGGTCAAAGA GCCCGAGCCT CAGTACTTTC GCTTCGAGGG TGTGTGGCTG 3900
ACAGAGACTG GCATGGCAGT GCTTCGGAAC CTGACCATGT CACCCCTGCA CAAGCGGCGC 3960
CAGCGGCGAG GACGGCCGGG TCTCCCTGGA GAGGCAGGGC TGGAAGGTTC TGAGCCCCCA 4020
GATGCCCTTG GCTCTGATGA CAAGAAGGAT GGGGACCTGG ATGCTGACGA TCTGCTCAAG 4080
GGTGAAGGAG GTGTAGAGCA CATGGAGTGT GAAATAAAAC TGGAGGGCCC CGCCAGCCCC 4140
GATGTGGAAG CTGGCAAGGA GGAGACCGAG GAAAGCAAGA AACGCAAACG CAAACCTTAC 4200
AGGCCTGGCA TTGGTGGTTT CATGGTACGA CAGCGGAAAC CTCACACACG TGTAAAAAAG 4260
AGGCTTGCTG CACAGGCGGA GGTGTTGAGT GGGGATGGGC AGCCCGACGA GGGTGAGACG 4320
GTGATAGCTA CTGACCTGCC CGCAGAGAGC TCCATGGACC CAGGCTTAGC GGATGGGGAT 4380
GAGAAGAAGA AACAGCAGCG GCGAGGCCGG AAGAAGAGCA AGCTTGAGGA CATGTTCCCT 4440
GCCTACCTGC AGGAGGCCTT TTTCGGGAAG GAGCTGCTGG ACCTGAGCCG GAAAGCCCTT 4500
TTTGCAGTCG GGGTGGGCAG ACCGAGCTTT GGACTAGGAG CCCCCAAAGC CAAGGGGGAT 4560
GGAGGCTCAG ATAGGAAGGA GCTGCCCTCC TTACACAAAG GAGATGATGG TCCGGATGTT 4620
GCCGATGAAG AGTCGCGTGG CCCTGAGGGC AAGGCAGACA CACCAGGACC TGAGGATGGG 4680
GGCGTGAAGG CATCCCCAGT GCCCAGTGAC CCTGAGAAGC CAGGCACCCC AGGTGAAGGG 4740
ATGCTTAGCT CTGACTTAGA CAGGATTCCC ACAGAAGAAC TGCCCAAGAT GGAATCCAAG 4800
GACCTACAGC AGCTCTTCAA GGATGTTCTG GGTTCTGAAC GAGAGCAGCA TCTGGGTTGT 4860
GGTACCCCCG GTCTAGATGG CAGCCGCACA CCGCTGCAGA GGCCCTTTCT TCAAGGCGGG 4920
CCTTTGGGCA GTCTCCCATC CAGCAGCCCA ATGGGCATCC CATACCCGGG CCTTTGCCAG 4980
TACACGCTTC TGGATTCAAG GGAGCGCGGG GGCTTCTTTA GCCCGGAACC CGGTGAGCCA 5040
GACAGCCCCT GGACAGGCTC AGGGGGCACC ACCCCCTCCA CCCCCACCAC CCCCACCACC 5100
GAGGGTGAGG GCGACGGGCT CTCTTACAAC CAGCGGAGTC TGCAGCGCTG GGAGAAGGAC 5160
GAAGAGCTGG GCCAGCTCTC CACCATCTCA CCTGTGCTGT ATGCCAACAT TAACTTTCCC 5220
AATCTCAAGC AAGATTACCC AGACTGGTCT AGCCGTTGCA AACAAATCAT GAAACTCTGG 5280
AGAAAGGTTC CAGCTGCTGA CAAAGCTCCC TACCTGCAAA AGGCCAAAGA TAACCGGGCG 5340
GCTCACCGAA TCAGCAAGGT GCAAAAGCAG GCTGAGAGCC AGATCAACAA GCAGACCAAG 5400
GTGGGGGACA TGGCCCGGAA GACTGACCGA CCGGCCCTTC ATCTTCGCAT TCCCTCCCAG 5460
CCAGGGGCAC TGGGCAGTCC GCCCCCTGCT GCCGCCCCCA CCATTTTCAT TGGCAGCCCC 5520
ACTGCCCCTG CCGCCGGCTT GTCTACCTCT GCGGATGGGT TCCTGAAGCC GCCGGCGGGC 5580
ACAGTGCCTG GCCCCGACTC GCCCGGTGAG CTCTTCCTCA AGCTGCCACA GGTGCCCNNN 5640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5700
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5760
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5820
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5880
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5940
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6000
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6060
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6120
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6660
NNNNNNNNTG GGCAGCCCCC CAATTTTGCC CGGTCCCCTG GTACTGGTGC ATTTGTGGGC 6720
ACCCCCCCTC CTATGCGCTT TACCTTCCCT CAGGCTGTAG GGGAACCTTC CCTAAAGCCC 6780
CCCGTCCCTC AGCCCAGTCT CCCTCCACCC CATGGGATCA ACAGCCATTT TGGGCCTGGC 6840
CCTACCTTGG GCAAGCCTCA AAGCACAGAT TACACAGTAG CCACAGGGAA CTTCCACCCA 6900
TCGGGCAGCC CCCTGGGACC CAGCAGTGGA TCCACAGGAG AGGGCTATGG GCTGTCCCCA 6960
CTACGTCCCC CGTCGGTCCT CCCACCTGCA CCCGATGGAG CCCTTCCTTA CCTGTCCCAT 7020
GGAGCCTCAC AGCGGGCAGG CATCACCTCC CCAGTCGATA AGCGAGAAGA CCCAGGGGCT 7080
GGAATGGGCA GCTCTTTGGC AGCACCTGAA CTGCCAGGTA GCCAAGACCC CAGTATGTCC 7140
AACCTGAGCC AGACAGAGCT GGAGAAGCAG CGCCAGCGCC AGCGGCTCCG GGAGCTGCTG 7200
ATTCGGCAGC AGATCCAGCG CAACAACCTG CGGCAAGAGA AGGAAACAGC GGCGGCAGCT 7260
GCAGGAGCAG TGGGGCCTCC AGGCGGCTGG GGTGCTGAGG CCAGCGGCCC TGCCTTTGAG 7320
CAACTGAGTC GAGGCCAGAC CCCCTTTTCT GGCACCCAGG ACAAGAGTAG TCTTGTGGGG 7380
CTGCCCCCAA ACAAGCTGGG TGGCCCTATC CTGGGGCCTG GGACTTTCCC CACGGATGAC 7440
AGGCTCTCTC GGCCACCTCC ACCAGCCACC CCTTCCTCTA TGGATGTGAG CAGTCGNNNN 7500
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7560
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7620
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNCTTGGTCG CCAGACCCTA 7680
AGTTCCCCCT TGGCTGGAAT TCCTCCACGC CTGCCTGGCC CTGGAGAGCC AGTGCCCGGC 7740
CCGGCCGGTC CTGCTCAGTT CATCGAGCTA CGGCACAATG TACAGAAAGG ACTAGGGCCT 7800
GGAGGGGCTC CCTTTCCCGG TCAGGGCCCC CCTCAAAGAC CCCGCTTTTA CCCTCTAAGT 7860
GAGGATTCTC ACCGACTGGC TGCAGAAGGA CTTCGAGGCC TAGCGGTACC AGGCCTTCCC 7920
CCACAGAAAC CATCAGCCCC ACCACCAGCT CCTGAACTGA ACAACAGCCT CCACCCAGCG 7980
CCCCAGGCCA AGGGCCCCAC AGTGCCCGCT GGCTTGGAAC TGGTCAGCCG GCCTCCCTCG 8040
GGCACTGAGC TTGGTCGCCC TCCTCCACTG GCCCTGGAAG CTGGGAAGTT ACCCTGTGAG 8100
GATCCCGAGC TGGACGATGA CTTTGATGCT CACAAAGCCC TAGAGGATGA TGAGGAACTC 8160
GCTCACCTGG GCCTGGGTGT GGACGTGGCC AAGGGGGACG ACGAGCTGGG CACCCTGGAA 8220
AACCTGGAGA CCAACGACCC CCACCTGGAT GACCTGCTCA ATGGGGATGA GTTTGACTTG 8280
CTGGCCTACA CTGACCCTGA GCTGGACACC GGGGACAAGA AGGACATCTT CAATGAGCAC 8340
CTGCGGCTGG TGGAGTCAGC CAATGAAAAG GCTGAGCGTG AGGCCCTGCT GCGGGGAGTG 8400
GAGCCAGGAC CCTTGGGTCC CGAGGAGCGC CCTCCCCCTG CTGCTGATGC CGCTGAGCCC 8460
CGCCTGACAT CAGTGCTCCC TGAAGTGAAG CCCAAGGTGG AAGAGGGCGG GCGCCACCCT 8520
TCCCCCTGCC AGTTTTCCAT CACGGCCCCC AAGGCAGAGC CGGCAGCTGC TACGCCTTCC 8580
CTGGGCCTGG GACTGAAGCC CGCACAGAGC GGAGTGGGCA ACCGGGACAG CCGAATGGGC 8640
CCGGGACCCT TTCCCAGCAG TGGGCAAACA GCTGAGAAGG GCCCCTTTGG GACCACAGCA 8700
GGCCCCCCAG CTCACCTGCT CACCTCCAGC CCACTAAGTG GTCCAGGTGG GTCTACCCTG 8760
CTGGAAAAGT TTGAGCTAGA GAGTGGGCCC CTGACCTTAC CCGGTGGACA TGCAGCCTCT 8820
GGGGACGAAC TGGATAAGAT GGAGAGCTCA CTGGTAGCCA GTGAGTTACC CCTGCTCATC 8880
GAGGACCTGC TGGAGCACGA GAAAAAGGAG CTGCAGAAGA AGCAACAGCT TTCAGCACAG 8940
CTGCAGCCTG CCCAACAGCC GCCGCAGCCG CAGCCGCAGC CACCGCCCCT GCTGTCTGCA 9000
CCAGGTCCTG CCCAGGCCAT GCCTTTGCCA CACGAGAACT CCGCTCCTGG CCTGGCTGGG 9060
CCCCAGCAGC AGCTTGCTCT GGGACTTGGA GGTCCTCGAC AATCTAGCTT GACCCAGCCA 9120
CTGATGCCCA CCCAGCCACC CGCCCATGCC CACCAGCAGC GCTTGGCTCC ATCCATGGCC 9180
ATGGTGTCCA ACCAAGGGCA CATGATGAGT GGGCAGCATG GGGGACAGGC AGGCTTGGTG 9240
CCCCAGCAGG GCCCACAGCC AGTGCTGGCA CAGAAGCCCA CAGGTACCAT GCCACATTCC 9300
ATGTGCATGA AGCCCCAGCC ATTGGCGATG CCGCAGCAGC TGGCCAATAG TTTCTTCCCG 9360
GATACAGNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 9480
NNNNNNNNNN NGCAGCAAGT GTCCCTCCTA GCCCAGAGGC TCTCGGGGGG ACCTGGAAGT 9540
GATCTGCAAA ACCATGTGGC AGCTCGGAGT GGGCAGGATC GGAGCACTGG AGACCCGTCT 9600
CAACCTCGTC CCAATCCACC CACCTTTGCC CAGGGAGTGA TCAATGAGGC TGACCAGCAG 9660
CAGTATGAGG AGTGGTTGTT CCACACCCAG CAGCTCCTGC AGATGCAACT GAAGGTGCTA 9720
GAGGAGCAGA TTGGTGTGCA CCGCAAGTCC CGGAAGGCAC TATGTGCCAA GCAGCGCACT 9780
GCCAAAAAGG CTGGCCGAGA GTTCCCAGAG GCTGATGCTG AGAAACTCAA GCTTGTTACC 9840
GAACAACAGA GCAAGATCCA GAAGCAGCTG GATCAGGTCC GGAAGCAGCA GAAGGAACAC 9900
ACTAATCTTA TGGCAGAATA CCGGAATAAG CAGCAGCAGC AGCAGCAACA GCAACAACAG 9960
CAGCAACAGC AGCACTCTGC AGTACTGGCC CTTAGCCCTT CCCAGAGCCC CCGGCTGCTC 10020
ACTAAGCTCC CTGGCCAGCT TCTCCCTGGC CATGGCTTGC AGCCACCACA AGGGCCCCCC 10080
GGTGGCCAAC CTGGAGGTCT TCGCCTGCCT CCAGGCAGCA TGGCACTCCC TGGACAGCCC 10140
GGTGGCCCAT TCCTTAACAC AGCCCTGGCC CAACAGCAAC AACAGCAACA TTCTGGTGGG 10200
CCTGGACCCC TAGCAGGCCC TTCAGGGGGC TTTTTCCCTG GCAACCTTGC TCTTCGAGGC 10260
CTGGGACCTG AATCGAGGCT TTTACAGGAA AGGCAGCTGC AGCTGCAACA GCAACGTATG 10320
CAACTGGCCC AGAAACTACA GCAGCAGCAG CAGCAGCAGC AGCAGCACCA CCTCCTTGGA 10380
CAGGTGGCAA TCCAGCAGCA ACAGCAGCAG GGCCCAGGGG TACAGGTGAA CCAGGCTCTG 10440
GGTTCTAAGC CCCAGGGGCT TCTGCCTCCC AGCAGCCATC AAGGTCTCTT AGTCCAGCAG 10500
CTGTCCCCCC AACCACCTCA GGGGCCCCAG GGGATGTTGG GCCCTGCTCA GGTGGCAGTG 10560
TTGCAGCAGC AGCAGCAGCA GCACCCTGGA ACTTTGGGCC CACAGGGCCC TCACAGACAG 10620
GTGCTTCTGA CCCCATCCCG GGTGCTGAGT TCTCCCCAGC TGGCACAGCA AGGTCAGGGC 10680
CTGATGGGAC ACCGGCTGGT TGCATCCCAG CAGCACCAAC AACAGGGATC TATGGCAGGA 10740
CTTTCACATC TTCAACAGGG TCTAATTACA CACAGTGGGC AGCCCAAACT GAACACCCAA 10800
CCCATGGGCT CCTTACAGCA GCAGCAGCTT CAGCAGCAGC AGCAGCAGCA GCAACAACAG 10860
CAGCTTCAGC AACAGCAGCA GCAACAACAA CAACAGCAAC AACAGCAGCA ACTTCAGCAG 10920
CAGCAACTTC AACAGCAGCA ACAGCAGCAG CAGCTTCAAC AGCAGCAGCA GCAGCAGCTT 10980
CAGCAGCAGC AGCAGCAACT TCAACAGCAA CAACAACTGC AACAGCAGCA GTTTCAGCAG 11040
CAGCAGCAGC AACAGCAGAT GGGCCTCTTG AACCAGGGTC GAACTTTATT ATCTCCACAG 11100
CAGCAGCAGC AGCAGCAGGT GACACTTGGC CCTGGCATGC CAGCTAAGCC TCTTCAACAC 11160
TTCTCTAACC CTGTAGCCCT GGGCTCAACC CTTCTGCTAA TGGGCAAGGA TCAAAGCATT 11220
GTAGAAACAC CCCTTCCACC AGAGGTCACT GAGGGACCCG CTACTCTTCA GGGAGGGCCA 11280
TTAGCAGTAG GGCCTATACC TGAGTCAGTG GCCACTGAAC CAGGGGAGGT AAAACCGTCA 11340
CTTTCTGGGG ATTCACAGCT CGTGCTTGTC CAACCCCAGG CCCAGGCTCA GCCCAACTCT 11400
GTGCAACTGC AGCCACCATT GAGGCTCCCA GGACAGCAGC AGCAGCAAGT TTTGCTTCAC 11460
ACAGCAGCTG TAGGAAACCA TGGGCAAACT GGCAGCGGAT CNNNNNNNNN NNNNNNNNNN 11520
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11580
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11700
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11760
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11820
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 11880
NNNNNNNNNN NNNNGGGCCT CCTAGGCCGC CAACCCCAAC TTGGAGGCTT TCCTGGACCC 11940
CAGACAGGCC CCCTCCAAGA GCTAGGGGCA GGGCTTCGAT CTCAGGGCCC ACCCCGGCTC 12000
CCTGCCCCAC AAGGAGCCTT ATCCACAGGA CCAGTCCTTG GCCCTGTCCA TCCCACACTT 12060
CCACCATCCA GCCCCCAAGA GCCAAAGAGA CCTTCCTCAC AATTACCTTC CCCAAACTCA 12120
CAGCTCCCCT CTGACACCCA GCTCACCCCC AACCAGCCAG GGACCCCCAA ACCCCAGGGG 12180
CCGCCCTTGG AGATGCCTCC TGGGCGGGTC TCCCCTGCTG CTGCCCAGCT TGCAGATACC 12240
TTTTTTGGCA AAGGGCTGGG ACCTTGGGAC CCCCCCGATA ACCTAGTAGA AGCCCAGAAG 12300
CCAGACCAGA GCAGCATGGT ACCTGGACAT CTGGAGCAGG TGAATGGCCA GGTGGTGCCT 12360
GAAGCACCCA GCCTCAGCAT CAAGCAGGAG CCTCGGGAAG AACCGTGTGC CCTTGGAGCC 12420
CAGGCAGTGA AGAGGGAAGC CAATGGAGAG CCAGTAGGGG CACCGGGTAC CAGCAACCAC 12480
CTCTTGCTGG CAGGCCCCCG CTCAGAGGCT GGGCATCTGC TTTTGCAGAA GCTACTACGG 12540
GCAAAGAATG TACAGCTTAA CACTGGGCGG GGGCCTGAGG GCCTGCGCGC TGAGATCAAT 12600
GGGCATATTG ACAGCAAGTT GGCTGGGCTG GAGCAGAAAC TGCAGGGTCC TCACAGTAAC 12660
AAGGAGGACA CAGCAGCAAG GAAGCCTTTG ACACCGAAGC CCAAGCGGGT GCAAAAGGCA 12720
AGCGACAGGT TGGTGAGCTC GAGAAAGAAG TTGCGGAAGG AGGATGGGGT CAGGGCCAGC 12780
GAGGCACTGC TGAAACAGCT GAAACAGGAG CTGTCCCTGC TGCCCCTGAC GGAGCCTACC 12840
GTCACCGCCA ACTTCAGCCT CTTTGCTCCC TTTGGCAGTG GCAGCCCAAT CAGTGGGCAG 12900
TGCCAGCTGA GGGGGGCCTT TGGAAGTGGT GCCTTGTCCA CTGGCCCTGA CTACTATTCC 12960
CAGCTGCTTA CCAAGAATAA CCTGAGTAAC CCGCCGACAC CACCCTCGTC GCTGCCCCCC 13020
ACCCCACCCC CATCGGTGCA GCAGAAGATG GTCAATGGTG TCACTGCATC TGAGGAACTG 13080
GGGGAGCACC CCAAGGATGC CACCTCTGCC CGGGAGACTG AAGGGACACT GAGGGATGCT 13140
TCCGAGGTGA AGAGTCTAGA CCTGCTGGCC GCATTGCCTA CACCCCCCCA CAATCAAACT 13200
GAAGATGTCA GGATGGAGAG TGATGAGGAC AGTGATTCTC CTGACAGCAT TGTGCCAGCT 13260
TCGTCCCNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13320
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13380
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13440
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13500
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13560
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13620
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13680
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13740
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13800
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13860
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13920
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 13980
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14040
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14100
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14160
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14220
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14280
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14340
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14400
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14460
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14520
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14580
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14640
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14700
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14760
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14820
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14880
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 14940
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15000
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15060
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15120
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15180
NNNNNNNNGC CGCACACCCT GAACAGCACC AGCATGAAGT ACCAGAGGAC CTTCACAGGC 15240
GAGACCAGCA CCCCGTACAG CAAGCAG 15268
Sequence Source Ensembl
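The runs of N in the nucleotide listing line up with the runs of X in the protein listing. The sketch below shows one way to check that correspondence by translating the coding sequence codon by codon. It assumes the standard genetic code (NCBI translation table 1) and maps any codon containing a non-ACGT symbol to X; both are choices made for this illustration and are not stated by the record. The function name `translate` is hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a coding sequence in frame 1.

    Whitespace is removed first. Codons not found in the table (for example
    any codon containing N) are reported as X. A trailing partial codon is
    ignored.
    """
    cds = "".join(cds.split()).upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, usable, 3))

# First line of the nucleotide listing above, without its position counter.
head = "ATGGACAGCC AGAAGCCGCC TGGGGATAAG GATCCGGAAC AGGGGGNNNN"
print(translate(head))  # MDSQKPPGDKDPEQGX
```

The output matches the first residues of the protein listing, including the point where the unknown region begins.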
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 93 0.0 1358
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 92 0.0 1347
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 93 0.0 1345
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 92 0.0 1344
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 93 0.0 1343
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 93 0.0 1340
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 93 0.0 1339
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 92 0.0 1339
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 93 0.0 1338
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 93 0.0 1338
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 93 0.0 1337
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 93 0.0 1335
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 93 0.0 1335
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 93 0.0 1335
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 92 0.0 1335
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 93 0.0 1334
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 93 0.0 1331
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 92 0.0 1331
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 92 0.0 1317
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 92 0.0 1316
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 92 0.0 1312
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 91 0.0 1288
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 90 0.0 1285
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 92 0.0 1258
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 88 0.0 1255
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 88 0.0 1248
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 87 0.0 1226
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 85 0.0 1211
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 88 0.0 1190
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 88 0.0 1185
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 87 0.0 1172
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 84 0.0 1080
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 68 0.0 960
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 81 0.0 895
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 91 0.0 846
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 57 0.0 830
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 56 0.0 741
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 55 0.0 739
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 56 0.0 739
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 77 0.0 736
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 54 0.0 720
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 54 0.0 714
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 55 0.0 712
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 52 0.0 709
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 54 0.0 706
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 55 0.0 705
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 63 8e-153 541
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 41 3e-144 513
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 40 9e-144 511
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 40 4e-143 509
WERAM-Caj-0209 ENSCJAP00000036628.3 Callithrix jacchus 40 3e-141 503
WERAM-Tag-0008 ENSTGUP00000000641.1 Taeniopygia guttata 40 3e-141 502
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 39 7e-141 501
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 77 9e-126 451
WERAM-Eqc-0189 ENSECAP00000020200.1 Equus caballus 49 8e-110 398
WERAM-Ten-0167 ENSTNIP00000016534.1 Tetraodon nigroviridis 48 3e-105 383
WERAM-Pem-0018 ENSPMAP00000002590.1 Petromyzon marinus 46 2e-99 363
WERAM-Lac-0079 ENSLACP00000010253.1 Latimeria chalumnae 62 3e-99 363
WERAM-Dar-0184 ENSDARP00000115827.2 Danio rerio 59 9e-95 348
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 72 2e-93 343
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 56 1e-66 255
WERAM-Drm-0094 FBpp0072043 Drosophila melanogaster 43 1e-49 198
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 42 2e-48 194
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 62 6e-40 166
WERAM-Cis-0045 ENSCSAVP00000009955.1 Ciona savignyi 36 6e-23 110
WERAM-Mua-0133 GSMUA_Achr8P31480_001 Musa acuminata 30 1e-21 105
WERAM-Php-0038 PP1S184_45V6.1 Physcomitrella patens 33 1e-21 105
WERAM-Sei-0081 Si021380m Setaria italica 30 7e-21 102
WERAM-Brr-0201 Bra040045.1-P Brassica rapa 31 7e-21 102
WERAM-Tra-0182 Traes_5AS_00A90AD82.1 Triticum aestivum 30 9e-21 102
WERAM-Met-0094 KEH29501 Medicago truncatula 31 1e-20 102
WERAM-Bro-0020 Bo1g145070.1 Brassica oleracea 31 1e-20 102
WERAM-Brd-0094 BRADI4G06137.1 Brachypodium distachyon 30 2e-20 102
WERAM-Amt-0090 ERN00866 Amborella trichopoda 30 2e-20 101
WERAM-Zem-0055 GRMZM2G081350_P01 Zea mays 31 4e-20 100
WERAM-Glm-0107 GLYMA09G36290.1 Glycine max 32 4e-20 100
WERAM-Art-0067 AT3G08020.1 Arabidopsis thaliana 30 8e-20 99.8
WERAM-Viv-0082 VIT_13s0064g00630.t01 Vitis vinifera 30 8e-20 99.4
WERAM-Arl-0072 fgenesh2_kg.3__844__AT3G08020.1 Arabidopsis lyrata 31 1e-19 99.0
WERAM-Pot-0104 POPTR_0009s06440.1 Populus trichocarpa 28 1e-19 99.0
WERAM-Prp-0032 EMJ21805 Prunus persica 30 2e-19 98.2
WERAM-Thc-0017 EOX96661 Theobroma cacao 29 2e-19 98.2
WERAM-Orbr-0125 OB12G21490.1 Oryza brachyantha 29 4e-19 97.4
WERAM-Orl-0077 KN539619.1_FGP013 Oryza longistaminata 29 5e-19 97.1
WERAM-Orni-0122 ONIVA12G14190.1 Oryza nivara 29 5e-19 96.7
WERAM-Orgl-0120 OGLUM12G15650.1 Oryza glumaepatula 29 5e-19 96.7
WERAM-Orr-0123 ORUFI12G15660.1 Oryza rufipogon 29 6e-19 96.7
WERAM-Tub-0079 ENSTBEP00000009516.1 Tupaia belangeri 29 6e-19 96.7
WERAM-Sol-0011 Solyc01g087410.2.1 Solanum lycopersicum 35 6e-19 96.7
WERAM-Org-0115 ORGLA12G0122900.1 Oryza glaberrima 29 7e-19 96.3
WERAM-Ors-0110 OS12T0527800-01 Oryza sativa 29 7e-19 96.3
WERAM-Orp-0113 OPUNC12G12830.1 Oryza punctata 29 8e-19 96.3
WERAM-Hov-0071 MLOC_58575.1 Hordeum vulgare 30 2e-18 94.7
WERAM-Crn-0024 AAW44449 Cryptococcus neoformans 37 2e-18 94.7
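Because species names in the orthology table span two or three words, rows are easier to split from both ends than by fixed column count. The sketch below illustrates that under stated assumptions: identifiers contain no internal spaces, the last three fields are always Identity, E-value and Score, and Identity is a percentage. `OrthologRow` and `parse_ortholog_row` are hypothetical names introduced for this example.

```python
from typing import NamedTuple

class OrthologRow(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: float  # assumed to be percent identity
    evalue: float
    score: float

def parse_ortholog_row(line):
    """Parse one whitespace-separated row of the orthology table."""
    fields = line.split()
    if len(fields) < 6:
        raise ValueError(f"expected at least 6 fields, got {len(fields)}")
    weram_id, protein_id = fields[0], fields[1]
    identity, evalue, score = fields[-3:]
    # Everything between the two IDs and the three numbers is the species name.
    species = " ".join(fields[2:-3])
    return OrthologRow(weram_id, protein_id, species,
                       float(identity), float(evalue), float(score))

row = parse_ortholog_row(
    "WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 93 0.0 1340"
)
print(row.species)  # Mustela putorius furo
print(row.score)    # 1340.0
```

A row whose identifier was split by stray spaces would be parsed with part of the identifier in the species field, so such rows need repairing before parsing.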
Created Date 25-Jun-2016