WERAM Information


Tag Content
WERAM ID WERAM-Ocp-0133
Ensembl Protein ID ENSOPRP00000014011.2
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSOPRG00000015259.2 ENSOPRT00000015338.2 ENSOPRP00000014011.2
Status Unreviewed
Classification
Type Family E-value Score Start End
Me_Reader PHD 1.60e-27 95.1 171 4725
Organism Ochotona princeps
Domain Profile
  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + s l+ + + +Cp++ e
ENSOPRP00000014011.2 171 RCSHCTRLGA----SIPCRSpgCPRLYHFPCAATSGSFLSmKTLQLLCPEHSE 219
6999933333....599******************888875557899*99975 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC+ ++e ++ ++ C +C + +H+ C++ +l+ + w Cp+Ck
ENSOPRP00000014011.2 228 RCAVCEGPGELRD-LLFCTSCGHHYHGACLDTALTARKRA-GWQCPECK 274
6****66666654.*******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSOPRP00000014011.2 275 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 321
8****99999987.*************************9.*******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSOPRP00000014011.2 1016 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1065
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSOPRP00000014011.2 1067 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1113
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSOPRP00000014011.2 1144 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1195
7*****99999999*****************9933333444434599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSOPRP00000014011.2 4620 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4652
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSOPRP00000014011.2 4679 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRAKCMFFKDKTMLCPMHK 4725
599996666665....6*9999*********98886666677778888886 PP

Protein Sequence
(Fasta)
MDSQKPPGED KDSEPAXXXX XXXXXXXXXX XXXXXXXXXX XXVPSPGSAR LQEPRRDCSG 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG GSPGPREAVL SSEDLSQIGF 120
PEGLTPAHLG EPGGFCWAHH WCAAWSAGVW GQEGPELCGV DKAIFSGISQ RCSHCTRLGA 180
SIPCRSPGCP RLYHFPCAAT SGSFLSMKTL QLLCPEHSEG AAHLEEARCA VCEGPGELRD 240
LLFCTSCGHH YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT 300
FCLKPPMEEL PAHSWKCKAC RVCRACGAGS AELSPHCEWF ENYSLCHRCH EAQGGQPVSS 360
AAGQHPPVCS RFSPPEPGGD TPTDEPDALY VACQGQPKGG HVTSMQPKEP GPLQCEAKPL 420
GRAGAQLEPR LEPPEEEMPL LPLPEESPLS PPPEESPTSP PEASRLSPPP EESPPEELPT 480
SPPPEASRLS PPPEESPMSP PPEESPMSPP PEASRLFPPF EESPLSPPPE ESPLSPPPEA 540
SRLSPPPEDS PMSPPPEDSP MSPPPEDSPM SPPPEVSRLC PPPEESPLSP PALSPLGELT 600
YPFGAKGDSD PESLAAPILE TPISPPPEAH CTDPEPVPPM ILPPSPGSPL GPASPILMEP 660
LPPPCSPLLQ HSLPPPSSPP SQCSPLALPL SLPSPLSPVG KAEPLSDEPE LHQMETEKVP 720
EPECPALEPS VTSPLPSPME ELSCPAPSPA PALENFPGLG EDMAPLDGAA AAHAQPAAGE 780
APGSELKGCP ELLDPEELAP VTPMEVYGPE CKQPGQGSPC EEQEEPRATV APIPPTLIKS 840
DIVNEISNLS QGDASASFPG SEPLLGSPDP EGGGSLSMEL GVSTDVSPAR DEGSLRLCTD 900
SLPETDDSLL CDTGTAVSGG KAEGDKGRRR SSPARSRIKQ GRSSSFPGRR RPRGGAHGGR 960
GRGRARLKST TSSVETLVVA DIDGSPSKEE EEDDDDTMQN TVVLFSNTDK FVLMQDMCVV 1020
CGSFGRGAEG HLLACSQCSQ CYHPYCVNSK ITKVMLLKGW RCVECIVCEV CGQASDPSRL 1080
LLCDDCDISY HTYCLDPPLL TVPKGGWKCK WCVSCMQCGA ASPGFHCEWQ NSYTHCGPCA 1140
SLVTCPICHA PYVEEDLLIQ CRHCERWMHA GCESLFTEDD VEQAADEGFD CVSCQPYVVK 1200
PVAPVAPPEL VPVKAKEPEP QYFRFEGVWL TETGMAVLRN LTMSPLHKRR QRRGRLGLPG 1260
EVGLEGSEPS DALGPDDKKD GDLDAEELLK GEGGVEHMEC EIKLEGPTSP DAEPGKEETE 1320
ESKKRKRKPY RPGIGGFMVR QRKSHTRVKK GPAAQSEVLS GDGQPDEGET VMPVDLPAEG 1380
SGXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXDLSRKG LFAVGGRPGF 1440
GLGPPKGKGD GGSDRKEPSA LHKGDDGPDV ADDESHGPEG KADTPGPEDG GVKASPVPSD 1500
PEKPSTPGEG MLSSDLDRIP TEELPKMESK DLQQLFKDVL GSEREQHLGC GTPGLEGNRT 1560
PLQRPIVHGG LPLGSLPSSS PLDSYPGLCQ SPFLDSRERG GFFSPEPGEP DSPWTGSGGT 1620
TPSTPTTPTT EGEGDGLSYN QRSLQRWEKD EELGQLSTIS PVLYANINFP NLKQDYPDWS 1680
SRCKQIMKLW RKVPAADKAP YLQKAKDNRA AHRINKVQKQ AESQINRQPK VGDTARKTER 1740
PALHLRIPPQ QGALGSPPPA AAPTVFLGSP TPPAGMSTSA DGFLKPPAGT VPGPDSPGEL 1800
FPKLPPQVPA QVPSQDPFGL APAYALEPRF PVAPPTYPTY PNVAGAPAQS PVPGASTRPG 1860
PGLPGEFHVT PPGTPRHQPS TPDPFLKPRC PSLDNLAVPE SPGVGGGKAS EPLLSPPPLG 1920
EARKALEVKK EELGASSPSY GPPNLGFVDP SSSGPHLGGL ELKAPDVFKA PLTPRASQVE 1980
PQSPGLGLRP QEPPPAQALA PSPPNHPDIF RPSPYPDPYA QPPLTPRPQP PAPESCCALP 2040
PRSLPSDPFS RVPASPQSQS SSQSPLTPRP LSAEAFCPSP VTPRFQSPDP YSRPPSRPQS 2100
RDPFAPLHKP PRPQPSEVAF KAGSLAHTPL GAGGFPAALP SGPAAELHAK VPSGQPPNFA 2160
RSPGTGAFVS TPSPMRFTFP QAAGEPPLKP PVPQPGLPPP HGINSHFGPG PTLGKPQSTN 2220
YAVATGNFHP SGSPLGPGSG STVEGYGLSP LRPTSVLPPP APDGSLPYLS HGASQRASIT 2280
SPVEKREEPG AGMSSSLVAP ELPGTQDPGM SSLSQTELEK QRQRQRLREL LIRQQIQRNT 2340
LRQEKETAAA AVGAVGPPGS WGAEPSSPAF EQLSRGQAPF AGTQDKGSLV GLPPGKLGGS 2400
VLGPGPFPSD DRLSRPPPPA TPSSVDVNGR QLVGGSQAFY QRAPYPGSLP LQQQQLWQQQ 2460
QQQQQQQQAT SAASMRLAMS TRFPSTPGPE LSRQALGSPL PGIPTRLPGP GEPVPGPAGP 2520
AQFIELRHNV QKGLGPGGAP FPGQGPPQRP RFYPVTEDPH RLAPEGLRGL VLSGLPSQKP 2580
SAPPAPELSN NLHAPPLTKA STLPAGLELV SRPPSSTELS RPPPLALETG KLPCEDPELD 2640
DDFDAHKALE DDEELAHLGL GVDVAKGDDE LGTLENLETN DPHLDDLLNG DEFDLLAYTD 2700
PELDTGDKKD IFNEHLRLVE SANEKAEREA LLRGVEPGPS VSEERPPPVA DASEPRLAEV 2760
KPKVEEGGRH PSPCQFTINT PKVEPAPATT SLGLGLKPGQ NMMGSRETRM GTGPFSSGGH 2820
TAEKGPFGTT GGPPAHLLAP SPLSGSAGSS LLEKFELESG PLNLPGGPAA SGDELDKMES 2880
SLVASDLPLL IEDLLEHEKK ELQQRQQLSA QLQPAQQQQQ QQQQQLILSA TGPAQAMALP 2940
HEGSSPSLSG PQQQLALGIG GARQPGLGQP LMPTQPPAHA LQQRLAPSMA MMSNQGHMLS 3000
GQHGGQAGLV PPQNPQPVLS QKPMGTMPPS MCMKPQQLAV QQQLANSFFP DTDLDKFAAE 3060
DIIDPIAKAK MVALKGIKKV MAQGSIGVAP GMNRQQVSLL AQRLSGGPGS DLQNHVVPGS 3120
GQERNAGDPS QPRPNPPTFA QGVINEADQR QYEEWLFHTQ QLLQMQLKVL EEQIGVHRKS 3180
RKALCAKQRT AKKAGREFPE ADAEKLKLVT EQQSKIQKQL DQVRKQQKEH TNLMAEYRNK 3240
QQQQQQQQQQ HSAVLTLSPS QSPRLLTKLP GQLLPGHGLQ PLQGPPGGQA GGLRMPPGAM 3300
ALPGQPGGPF LNSSMAQQQH SGGAGSLTGP SGGFFPGSLT LRGLAPDSRL VQERQLQLQQ 3360
QRMQLAQKLQ QQQQQQQHLL GQVAIQQQQQ QGSGVQANQA LGPKPQGLLP PSNHQGLLVQ 3420
QLSPQPPQGP QGMLGPAQVA VLQQQQQHPG ALGPQGPHRQ VLMTQSRVLT SPQLAQQGQG 3480
LMRQRLVTAQ QQQQQQQQHS QQQQQGSMPG LSHIQQGLMS HSGQPTLNGQ SMSSLQPQQQ 3540
LQQQQLQQQL QQQQQQQLQQ QQFQQQQQQM GLLNQSRTLL SPQQQQPQQQ QQQQVTLGPS 3600
MPAKPLQHFS SPGALGPTLL LPGKEQNIVE TALPSEVAEG ASAHQGGGPL GVGTTPEPMA 3660
AEPGEVKPSL SGDSQLLLVQ PQAQPQPQPQ PGSLQLQPPL RLPGQQQQQQ VNLLHSAGMG 3720
SHGQLGSGSS EASAVPQLLV QPSVSVGDQP GPVTQNLLGP QPSLLEQPLQ NNTGPQLPKP 3780
GPAPQAGQGL PGVGVMPAVG QLRAQLQGVL AKNPQLRHLS PQQQQQLQAL LVQRQLQQSQ 3840
AVRQAPPFQE PGTQPSPLQG LLGCQPQPGG FPGPQTGPPQ ELGAGPRPQG PPRPPVPQGA 3900
SPAGPALGPV HPTPPPSSPQ EPKRPSSQLP SPNAQLPPTH PGTPKPLGRV SPASAQHTDT 3960
FFGKGLGPWD PPDNLAETQK PEQSNLVAGH LEQVNGQVVP EPPQLSIKQE PREESCALGA 4020
PAVKREANGE PVGTAGTSNH LLLAGPRSEA GHLLLQKLLR AKNVQLNAGR GPEGLRTEIN 4080
GHIDSKLAGL EQKLQSTPIN KEDVAARKPL TSKPKRVQKA GDRLVSSRKK LRKEDGLRAS 4140
EALLKQLKQE LSLLPLTEPT ITANFSLLAP FGSGCPVSGQ NQLRGAFGSG TLTTGPDYYS 4200
QLLTKNNLSN PPTPPSSLPP TPPPSVQQKM VNGVTASEEL GENPKDATSA GDTEGTLRDA 4260
SEVKSLDLLA ALPTPPHNQT EDVRMESDED SDSPDSIVPA SSPESILGEE APRYPQLGSG 4320
RWEQDDRALS PVIPIIPRAS IPVFPDAKPY VALDLDVSGK LPAVAWEKGQ GSEVSVMLTV 4380
SAAAAKNLNG VMVAVAELLS MKIPNSYEVL FPESPARIGM VPKKGDAEGA VGKEKGVGDK 4440
NPDAGPEWLK QFDAVLPGYT LKSQLDILSL LKQXXXXXXX XXXXXXXXXX XXXXXXXLSA 4500
PPEPSPPPSL APSPASPPAE PLVELPSESA EPPVPSPLPL ASSPESARPK PRARPPEEGE 4560
DSHPPRLKKW KGVRWKRLRL LLTIQKTSGH QEDEREVAEF MEQLGTALRP DKVPRDMRRC 4620
CFCHEEGDGA TDGPARLLNL DLDLWVHLNC ALWSTEVYET QGGALMNVEV ALHRGLLTKC 4680
SLCQRTGATS SCNRMRCPNV YHFACAIRAK CMFFKDKTML CPMHKVKGPC EQELSSFAVF 4740
RRVYIERDEV KQIASIIQRG ERLHMFRVGG LVFHAIGQLL PHQMADFHSA TALYPVGYEA 4800
TRIYWSLRTN NRRCCYRCSI SENNGRPEFI IKVMEQGLED LIFTDASPQA VWNRIIEPVA 4860
AMRKEADMLL FPEYLKELFG LTVHVLRIAE SLPGVRCQYL SRWPHPLMLP MIPTGCARSE 4920
PKILSHYKRP HTLNSTSMSK AYQSTFTGET HTPYSKQFVH SKSSQYRRLR TEWKNNVYLA 4980
RSRIQGLGLY AAKDLEKHTM VIEYIGTIIR NEVANRXXXX XXXXXXXXXX XXXXXXXXXX 5040
XXXXXXXXXX XXXXXXXXXX XEVVTFDKED KIIIISSRRI KGEELTYDYQ FDFEDDQHKI 5100
PCHCGAWNCR KWMN 5114
Nucleotide Sequence
(Fasta)
ATGGACAGCC AGAAGCCGCC TGGTGAGGAT AAAGATTCAG AACCAGCAGN NNNNNNNNNN 60
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 120
NNNNCTGTCC CCAGTCCTGG GAGTGCCCGG CTTCAGGAGC CTCGCAGGGA CTGCAGTGGG 180
GGTCCAGTGC GGCGCTGTGC TCTCTGTAAC TGCGGGGAGC CTAGTCTGCA TGGGCAGCGC 240
GAGCTACGGC GCTTTGAGTT GCCATTTGAC TGGCCCCGGT GTCCAGTGGT GTCCCCTGGG 300
GGGAGCCCAG GGCCCAGAGA GGCAGTGCTG TCCAGTGAGG ACCTATCACA GATTGGTTTC 360
CCTGAGGGCC TTACACCTGC CCACCTGGGA GAACCTGGAG GGTTCTGCTG GGCTCACCAT 420
TGGTGTGCTG CGTGGTCGGC AGGCGTCTGG GGGCAGGAGG GCCCAGAACT ATGTGGTGTG 480
GACAAGGCCA TCTTCTCAGG GATCTCACAG CGCTGCTCCC ACTGCACCAG GCTCGGTGCC 540
TCCATCCCTT GCCGCTCGCC TGGATGTCCA CGGCTTTACC ACTTCCCCTG TGCCGCTACC 600
AGTGGTTCCT TCTTGTCCAT GAAGACACTG CAGCTGCTCT GCCCAGAGCA CAGTGAGGGG 660
GCTGCACATC TGGAGGAGGC TCGCTGTGCT GTCTGTGAGG GGCCAGGGGA GTTGCGTGAC 720
CTGTTGTTCT GTACCAGCTG CGGGCATCAC TACCATGGGG CCTGCCTCGA CACTGCTCTG 780
ACGGCCCGCA AGCGTGCAGG CTGGCAGTGC CCTGAATGCA AAGTGTGCCA AGCCTGCAGG 840
AAACCTGGGA ATGACTCTAA GATGTTGGTC TGTGAGACGT GTGACAAGGG ATACCATACT 900
TTCTGCCTGA AACCACCCAT GGAGGAGCTG CCTGCTCATT CTTGGAAATG CAAGGCATGC 960
CGAGTATGCC GCGCTTGTGG GGCAGGCTCA GCAGAGCTGA GTCCCCACTG TGAGTGGTTT 1020
GAGAACTACT CACTCTGTCA CCGCTGTCAC GAAGCCCAGG GCGGCCAGCC AGTCAGTTCT 1080
GCGGCTGGGC AGCACCCCCC TGTCTGTAGC AGATTTTCAC CCCCAGAGCC TGGTGGTGAT 1140
ACCCCTACTG ATGAGCCTGA TGCTCTGTAC GTTGCATGCC AAGGGCAGCC AAAGGGTGGG 1200
CACGTGACCT CTATGCAACC CAAGGAACCA GGGCCCCTGC AATGTGAAGC CAAACCACTA 1260
GGGAGAGCAG GGGCCCAGCT TGAGCCCCGA TTGGAGCCCC CCGAAGAGGA GATGCCACTG 1320
CTGCCCCTCC CTGAAGAGTC ACCCCTCTCC CCACCGCCCG AGGAATCGCC CACATCGCCA 1380
CCTGAAGCGT CACGTCTGTC CCCGCCACCT GAAGAGTCGC CNCCTGAGGA GTTGCCCACT 1440
TCCCCACCCC CTGAAGCATC GCGCCTGTCT CCACCACCTG AGGAGTCGCC CATGTCACCT 1500
CCACCAGAGG AGTCGCCCAT GTCTCCACCC CCTGAGGCAT CTCGTCTGTT CCCGCCATTT 1560
GAAGAGTCTC CCCTGTCCCC TCCCCCTGAG GAGTCTCCCC TGTCCCCACC ACCTGAGGCC 1620
TCACGCCTGT CCCCACCACC TGAGGACTCA CCCATGTCCC CACCACCTGA GGACTCACCC 1680
ATGTCCCCAC CGCCTGAAGA CTCACCTATG TCTCCTCCGC CTGAGGTGTC GCGCCTGTGC 1740
CCACCACCTG AGGAATCCCC CTTATCCCCA CCAGCCCTGT CTCCTTTGGG GGAGTTAACA 1800
TACCCCTTTG GTGCCAAAGG GGACAGTGAC CCTGAGTCGT TGGCTGCTCC CATTCTTGAG 1860
ACCCCCATTA GCCCTCCTCC CGAAGCTCAC TGCACTGACC CTGAGCCCGT GCCCCCAATG 1920
ATCCTGCCCC CGTCCCCAGG CTCCCCGCTG GGCCCGGCAT CTCCTATCCT GATGGAACCC 1980
CTGCCCCCTC CATGTTCTCC GCTCCTCCAG CATTCCCTGC CTCCCCCGAG TTCTCCTCCT 2040
TCCCAGTGCT CGCCTCTGGC TCTGCCGCTG TCTCTGCCTT CCCCGTTGAG TCCTGTAGGA 2100
AAGGCAGAGC CCCTTTCAGA TGAGCCTGAG CTGCACCAGA TGGAGACTGA GAAGGTCCCA 2160
GAGCCCGAGT GCCCGGCCTT GGAACCCAGT GTCACCAGTC CTCTTCCCTC CCCGATGGAG 2220
GAGCTGTCCT GCCCTGCCCC CAGCCCTGCA CCAGCCCTGG AGAACTTCCC TGGCCTGGGG 2280
GAGGACATGG CCCCTCTGGA TGGGGCTGCT GCTGCTCATG CACAGCCAGC GGCGGGCGAG 2340
GCCCCTGGCA GTGAATTGAA AGGTTGCCCT GAGCTCCTGG ACCCTGAGGA GCTGGCCCCT 2400
GTGACCCCTA TGGAGGTCTA TGGCCCAGAA TGCAAGCAGC CAGGGCAGGG CTCGCCCTGT 2460
GAAGAGCAGG AGGAGCCACG TGCAACAGTG GCCCCCATAC CACCTACTCT CATCAAATCT 2520
GATATCGTTA ATGAAATTTC TAATCTGAGC CAGGGTGATG CCAGTGCCAG TTTTCCTGGC 2580
TCAGAGCCCC TGTTGGGCTC TCCTGACCCT GAAGGGGGTG GCTCCCTGTC CATGGAGCTA 2640
GGGGTATCTA CAGATGTGAG TCCAGCCCGA GATGAAGGCT CCCTGAGGCT CTGTACCGAC 2700
TCACTGCCAG AGACCGATGA CTCACTCTTG TGTGATACTG GGACAGCCGT CAGCGGAGGC 2760
AAAGCTGAGG GGGACAAGGG GAGGCGGCGT AGTTCTCCAG CCCGTTCTCG CATCAAACAG 2820
GGTCGCAGCA GCAGTTTCCC AGGACGACGG CGGCCTCGAG GAGGAGCCCA CGGAGGACGA 2880
GGGAGAGGAC GGGCCCGGCT AAAATCTACT ACGTCTTCCG TTGAGACTCT GGTAGTTGCT 2940
GATATCGATG GCTCCCCCAG CAAGGAGGAG GAGGAGGACG ACGACGACAC CATGCAGAAC 3000
ACTGTGGTCC TCTTCTCCAA CACGGACAAA TTTGTCCTAA TGCAGGACAT GTGTGTGGTG 3060
TGTGGCAGCT TCGGCCGAGG TGCAGAGGGC CACCTCCTGG CCTGTTCGCA GTGTTCTCAG 3120
TGCTACCATC CCTACTGTGT CAACAGCAAG ATTACCAAGG TGATGCTCTT GAAGGGCTGG 3180
CGCTGCGTGG AGTGCATCGT GTGTGAGGTG TGCGGCCAGG CCTCTGACCC CTCTCGCCTG 3240
CTGCTGTGTG ACGACTGTGA CATCAGCTAT CACACCTACT GCTTGGACCC TCCACTGCTC 3300
ACAGTGCCCA AGGGCGGCTG GAAGTGCAAG TGGTGCGTGT CCTGTATGCA ATGTGGGGCC 3360
GCTTCCCCTG GCTTCCACTG TGAGTGGCAA AATAGTTACA CACACTGTGG GCCCTGTGCC 3420
AGCTTGGTGA CCTGCCCCAT CTGCCATGCT CCCTATGTGG AAGAGGACCT GCTGATCCAG 3480
TGCCGACACT GTGAACGGTG GATGCATGCT GGCTGTGAGA GCCTCTTCAC AGAGGATGAT 3540
GTGGAGCAGG CAGCCGATGA AGGCTTTGAC TGTGTCTCCT GCCAGCCCTA TGTGGTGAAG 3600
CCTGTGGCAC CTGTTGCGCC TCCGGAGTTG GTGCCTGTGA AAGCAAAGGA GCCAGAGCCC 3660
CAGTACTTTC GTTTTGAAGG CGTGTGGCTG ACAGAAACTG GCATGGCTGT GCTGCGTAAC 3720
CTGACCATGT CACCTCTGCA CAAGCGGCGC CAGCGGCGAG GACGGCTTGG GCTCCCAGGC 3780
GAGGTGGGGC TGGAGGGTTC TGAACCTTCA GATGCCCTTG GCCCCGATGA CAAGAAGGAT 3840
GGGGACCTGG ATGCCGAAGA GCTGCTCAAG GGTGAAGGTG GTGTGGAGCA CATGGAATGT 3900
GAAATTAAAT TGGAGGGTCC CACCAGTCCA GATGCGGAAC CTGGCAAAGA GGAGACTGAG 3960
GAAAGCAAAA AACGCAAGCG CAAGCCTTAC CGACCTGGCA TTGGTGGCTT CATGGTGCGG 4020
CAGAGGAAAT CCCACACTCG TGTGAAGAAG GGGCCTGCTG CCCAGTCGGA GGTGTTGAGT 4080
GGGGATGGGC AGCCTGACGA GGGTGAGACA GTGATGCCTG TGGACCTGCC TGCAGAGGGC 4140
TCCGGGANNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4260
NNNNNNNNNN NGGACCTGAG CCGAAAAGGC CTTTTTGCAG TTGGGGGCCG TCCAGGCTTT 4320
GGATTAGGAC CCCCCAAAGG CAAGGGTGAT GGAGGTTCAG ATAGGAAGGA ACCTTCCGCC 4380
TTACACAAAG GGGATGATGG TCCAGATGTT GCAGATGATG AGTCCCATGG TCCTGAGGGC 4440
AAGGCTGATA CACCAGGCCC TGAGGATGGG GGTGTAAAGG CATCCCCAGT GCCCAGTGAC 4500
CCCGAGAAGC CAAGCACTCC AGGCGAAGGG ATGCTTAGCT CCGACTTAGA CCGGATTCCC 4560
ACGGAAGAAC TGCCCAAGAT GGAATCCAAG GACCTGCAGC AGCTCTTCAA GGATGTCCTG 4620
GGTTCAGAAC GAGAACAACA TCTGGGCTGT GGAACCCCCG GCCTGGAAGG CAACCGCACG 4680
CCGCTGCAGA GGCCCATTGT CCACGGCGGA CTCCCTTTGG GCAGTCTCCC TTCCAGCAGC 4740
CCATTGGACT CCTACCCAGG CCTCTGCCAG TCCCCATTCC TGGATTCCAG GGAGCGCGGG 4800
GGCTTTTTCA GCCCAGAACC CGGTGAGCCA GACAGCCCCT GGACAGGCTC AGGGGGCACC 4860
ACGCCCTCCA CCCCCACAAC CCCCACCACG GAGGGTGAGG GCGACGGGCT GTCCTATAAC 4920
CAGCGGAGTC TGCAGCGCTG GGAGAAGGAT GAGGAGTTGG GCCAGCTCTC CACCATCTCC 4980
CCTGTGCTGT ATGCCAACAT TAACTTTCCC AACCTCAAGC AAGATTACCC AGACTGGTCC 5040
AGCCGTTGCA AACAAATCAT GAAGCTCTGG AGAAAGGTCC CAGCCGCTGA CAAAGCCCCC 5100
TACCTGCAAA AGGCCAAAGA TAACCGGGCA GCTCACCGCA TCAACAAGGT GCAGAAGCAG 5160
GCTGAGAGCC AGATCAACAG GCAGCCCAAG GTGGGCGACA CAGCCCGGAA GACTGAGCGA 5220
CCGGCCCTGC ATCTTCGCAT TCCCCCTCAG CAGGGGGCAC TGGGCAGCCC ACCCCCTGCT 5280
GCCGCCCCCA CCGTTTTCCT TGGCAGCCCC ACTCCCCCCG CCGGCATGTC TACCTCTGCG 5340
GACGGGTTCC TGAAGCCGCC AGCAGGCACC GTGCCTGGCC CCGACTCGCC CGGTGAGCTC 5400
TTCCCCAAGC TCCCGCCCCA GGTGCCCGCC CAAGTGCCTT CGCAGGACCC CTTTGGACTG 5460
GCCCCTGCCT ACGCCCTGGA GCCCCGCTTC CCCGTGGCAC CACCCACCTA CCCCACTTAC 5520
CCTAATGTAG CTGGGGCCCC TGCGCAGTCC CCAGTGCCAG GTGCCTCCAC TCGTCCTGGG 5580
CCTGGCCTGC CAGGGGAATT TCACGTTACC CCACCTGGCA CCCCCCGGCA CCAGCCCTCT 5640
ACACCCGACC CCTTCCTCAA ACCCCGCTGC CCCTCGTTGG ACAACCTGGC TGTGCCTGAG 5700
AGCCCTGGGG TAGGGGGAGG CAAGGCTTCC GAGCCCCTGC TCTCACCCCC ACCACTTGGG 5760
GAGGCCCGTA AGGCCCTGGA GGTGAAGAAG GAAGAGCTTG GGGCATCCTC CCCCAGCTAT 5820
GGGCCCCCAA ATCTGGGTTT TGTTGACCCA TCCTCCTCAG GCCCCCACCT TGGTGGGCTG 5880
GAGTTAAAGG CACCTGATGT CTTCAAAGCC CCGCTGACCC CTCGGGCATC TCAGGTAGAA 5940
CCCCAGAGCC CGGGCTTGGG TCTACGGCCC CAGGAGCCGC CCCCTGCCCA GGCTTTGGCT 6000
CCTTCTCCTC CAAACCACCC AGACATCTTT CGTCCCAGCC CTTACCCTGA CCCCTATGCC 6060
CAGCCCCCAC TGACCCCTCG GCCCCAGCCC CCTGCCCCTG AGAGCTGTTG CGCCCTGCCC 6120
CCTCGCTCAC TGCCCTCTGA CCCTTTCTCC CGAGTACCTG CCAGTCCACA GTCCCAGTCC 6180
AGCTCCCAGT CACCATTGAC GCCCCGTCCT CTGTCTGCTG AGGCTTTTTG CCCGTCTCCT 6240
GTCACCCCTC GCTTCCAGTC TCCTGACCCT TACTCCCGCC CACCCTCACG CCCTCAGTCC 6300
CGTGACCCAT TTGCCCCATT GCATAAGCCA CCCCGTCCCC AGCCCTCTGA AGTTGCCTTT 6360
AAGGCTGGGT CTCTAGCCCA CACTCCCCTG GGGGCTGGGG GCTTCCCAGC AGCCCTGCCC 6420
TCGGGGCCAG CCGCTGAGCT CCATGCCAAG GTCCCAAGTG GGCAGCCCCC CAATTTTGCC 6480
CGTTCCCCAG GGACAGGCGC ATTTGTGAGC ACCCCATCTC CCATGCGTTT CACTTTCCCT 6540
CAGGCAGCCG GGGAGCCTCC CCTAAAGCCC CCTGTCCCTC AGCCTGGTCT CCCTCCACCC 6600
CATGGGATCA ACAGCCATTT TGGGCCTGGC CCCACTTTGG GCAAGCCTCA AAGCACAAAC 6660
TACGCAGTAG CTACGGGGAA CTTTCACCCA TCGGGCAGCC CCCTGGGGCC TGGCAGCGGG 6720
TCCACAGTGG AGGGCTATGG GCTGTCCCCG CTACGCCCCA CGTCAGTTCT GCCACCACCT 6780
GCACCCGACG GATCCCTCCC CTACCTGTCC CACGGAGCCT CACAGCGGGC AAGCATCACC 6840
TCTCCAGTCG AAAAGCGAGA AGAGCCAGGG GCTGGAATGA GTAGCTCTTT GGTGGCACCT 6900
GAACTCCCAG GTACCCAAGA CCCTGGCATG TCCAGCCTTA GCCAGACAGA GCTGGAGAAG 6960
CAGCGACAGC GCCAGCGACT CCGTGAGCTG CTGATCCGCC AGCAGATCCA GCGTAACACC 7020
CTGCGGCAGG AGAAGGAAAC AGCTGCGGCA GCTGTAGGAG CAGTGGGGCC CCCAGGCAGC 7080
TGGGGTGCAG AACCCAGCAG CCCTGCCTTT GAGCAGCTGA GCCGAGGCCA GGCGCCCTTT 7140
GCTGGGACTC AGGATAAGGG CAGTCTCGTG GGATTGCCCC CAGGCAAGCT GGGTGGCTCT 7200
GTCCTGGGAC CAGGGCCCTT CCCCAGTGAT GATCGACTAT CTCGGCCACC TCCACCAGCC 7260
ACGCCTTCCT CTGTGGACGT GAACGGCCGG CAATTGGTAG GAGGCTCCCA GGCCTTCTAT 7320
CAGCGAGCGC CCTATCCTGG CTCCCTGCCC TTGCAGCAGC AACAGCTGTG GCAGCAGCAG 7380
CAGCAGCAGC AGCAGCAACA GCAGGCGACG TCAGCTGCTT CTATGCGACT TGCCATGTCG 7440
ACTCGCTTTC CATCAACTCC TGGCCCTGAG CTTAGCCGCC AAGCCCTGGG GTCCCCGTTG 7500
CCGGGAATTC CCACTCGCTT GCCAGGCCCT GGTGAGCCAG TGCCTGGCCC AGCTGGTCCT 7560
GCTCAGTTCA TTGAGTTGAG GCACAATGTA CAGAAAGGAC TAGGACCTGG AGGGGCTCCA 7620
TTTCCTGGTC AAGGGCCCCC TCAGAGACCT CGGTTTTACC CTGTGACTGA AGACCCTCAT 7680
CGACTGGCTC CCGAAGGGCT GCGGGGCCTG GTGCTGTCAG GCCTTCCTTC ACAGAAACCA 7740
TCGGCTCCAC CGGCCCCTGA GCTGAGCAAC AACCTCCATG CCCCGCCCCT CACCAAGGCT 7800
TCCACCCTGC CTGCTGGCCT GGAGCTGGTC AGCCGGCCAC CCTCCAGCAC TGAGCTCAGC 7860
CGGCCACCGC CTCTGGCTCT GGAGACAGGG AAGCTACCCT GTGAGGACCC TGAGCTGGAT 7920
GATGACTTTG ATGCCCACAA GGCCCTGGAG GACGATGAAG AACTTGCTCA CCTGGGGCTG 7980
GGTGTGGATG TGGCCAAGGG CGATGATGAG CTGGGAACCC TGGAAAACCT GGAAACCAAC 8040
GACCCCCACC TGGATGACCT GCTCAATGGA GATGAGTTTG ATCTTCTGGC CTACACTGAC 8100
CCCGAGCTGG ACACTGGGGA CAAGAAGGAC ATCTTCAACG AGCACCTGAG GCTGGTGGAG 8160
TCGGCTAATG AGAAGGCCGA GCGGGAGGCC CTGCTGCGGG GAGTGGAGCC AGGACCCTCG 8220
GTCTCTGAGG AGCGCCCACC CCCTGTTGCT GATGCCTCTG AGCCCCGCCT GGCAGAGGTG 8280
AAGCCCAAGG TGGAGGAGGG TGGACGTCAC CCTTCCCCTT GCCAGTTCAC CATCAACACC 8340
CCAAAGGTAG AGCCAGCACC TGCCACCACT TCCCTTGGCC TGGGCCTGAA GCCAGGACAA 8400
AACATGATGG GCAGCCGAGA GACCCGGATG GGCACAGGCC CATTTTCCAG CGGTGGGCAC 8460
ACAGCAGAGA AGGGCCCCTT TGGGACCACG GGGGGACCAC CAGCTCATCT GCTAGCCCCC 8520
AGCCCACTAA GCGGCTCAGC GGGATCTTCC CTGCTGGAAA AGTTTGAACT TGAGAGTGGG 8580
CCCCTGAACT TGCCTGGTGG CCCTGCAGCA TCTGGGGACG AGCTGGACAA GATGGAGAGC 8640
TCCCTGGTAG CCAGTGATCT ACCCCTGCTC ATCGAGGACC TGTTGGAACA TGAGAAGAAG 8700
GAGCTGCAGC AGCGGCAGCA GCTCTCAGCA CAGCTGCAGC CAGCTCAGCA GCAGCAGCAA 8760
CAGCAGCAGC AGCAGCTGAT ACTGTCTGCA ACAGGCCCTG CCCAGGCCAT GGCTCTGCCA 8820
CATGAAGGCT CTTCTCCAAG TTTGTCTGGA CCTCAACAGC AGCTTGCCTT GGGTATTGGA 8880
GGGGCCCGGC AGCCAGGCTT GGGCCAACCA CTGATGCCCA CCCAGCCACC GGCTCATGCC 8940
CTTCAGCAGC GCTTGGCCCC ATCCATGGCC ATGATGTCCA ACCAAGGGCA TATGCTAAGT 9000
GGGCAGCATG GGGGACAGGC AGGCTTGGTG CCCCCGCAGA ACCCACAGCC AGTGTTATCA 9060
CAGAAGCCCA TGGGTACCAT GCCACCTTCC ATGTGCATGA AGCCTCAGCA GTTGGCAGTG 9120
CAACAGCAGC TGGCTAATAG CTTCTTCCCA GACACAGACC TGGATAAATT TGCTGCTGAA 9180
GATATCATTG ATCCTATTGC AAAGGCCAAG ATGGTGGCTT TGAAAGGCAT TAAGAAAGTG 9240
ATGGCGCAGG GCAGCATTGG GGTGGCACCT GGTATGAACA GGCAGCAAGT GTCCCTACTA 9300
GCCCAGAGGC TCTCAGGGGG ACCTGGCAGT GATCTGCAAA ACCACGTGGT ACCCGGGAGT 9360
GGACAGGAGC GGAATGCCGG GGATCCCTCC CAGCCTCGTC CCAACCCACC CACTTTTGCA 9420
CAGGGAGTGA TCAATGAGGC CGACCAGCGG CAGTATGAAG AGTGGCTGTT CCACACCCAG 9480
CAACTCCTGC AGATGCAGCT GAAGGTGCTT GAGGAGCAGA TTGGCGTACA CCGCAAGTCC 9540
CGGAAGGCCC TGTGCGCCAA GCAGCGAACT GCCAAAAAGG CTGGCCGTGA GTTCCCGGAA 9600
GCTGATGCTG AAAAGCTCAA GTTGGTTACA GAGCAGCAGA GCAAGATCCA GAAACAGCTG 9660
GATCAGGTCC GTAAACAGCA GAAAGAACAC ACTAATCTCA TGGCAGAGTA TCGGAATAAA 9720
CAGCAGCAGC AGCAGCAGCA ACAGCAACAA CATTCGGCCG TACTGACTCT TAGTCCCTCC 9780
CAGAGTCCCC GGCTACTCAC GAAGCTCCCG GGTCAGCTAC TCCCTGGCCA TGGGCTGCAG 9840
CCACTACAGG GACCCCCTGG GGGGCAAGCT GGAGGTCTTC GCATGCCCCC TGGGGCTATG 9900
GCGCTACCTG GACAACCTGG TGGTCCCTTC CTCAACTCAT CCATGGCGCA GCAGCAGCAT 9960
TCTGGTGGGG CCGGATCCCT GACTGGCCCG TCAGGGGGCT TTTTCCCCGG CAGCCTTACT 10020
CTTCGAGGCC TTGCACCTGA CTCAAGACTT GTACAGGAGA GGCAACTGCA GCTGCAACAG 10080
CAGCGCATGC AGCTTGCCCA GAAACTGCAA CAGCAGCAGC AGCAGCAGCA GCACCTCCTG 10140
GGACAGGTGG CAATCCAGCA ACAACAGCAG CAGGGTTCTG GTGTGCAAGC AAACCAAGCT 10200
TTGGGTCCCA AGCCCCAGGG ACTTCTGCCC CCCAGCAACC ACCAAGGTCT CCTGGTCCAG 10260
CAGCTGTCCC CTCAGCCACC GCAGGGGCCT CAGGGCATGC TGGGCCCTGC CCAGGTGGCA 10320
GTGCTGCAGC AGCAGCAGCA GCATCCTGGA GCTTTGGGTC CACAGGGCCC CCACAGACAG 10380
GTGCTTATGA CCCAGAGTCG GGTGCTGACG TCCCCCCAGC TGGCACAGCA GGGTCAGGGC 10440
CTTATGAGAC AACGCTTGGT CACAGCCCAG CAGCAGCAGC AGCAGCAGCA GCAGCACTCC 10500
CAGCAGCAGC AGCAGGGCTC CATGCCAGGG CTTTCCCATA TTCAGCAGGG GCTGATGTCT 10560
CACAGTGGGC AACCTACACT GAATGGCCAG TCCATGAGTT CCTTGCAGCC ACAGCAGCAA 10620
CTTCAGCAGC AGCAGCTACA GCAACAGCTT CAGCAACAGC AGCAACAGCA GCTACAGCAA 10680
CAGCAATTTC AGCAGCAGCA GCAGCAGATG GGCCTTTTGA ACCAGAGTCG AACTTTACTG 10740
TCTCCTCAGC AGCAGCAGCC ACAGCAGCAG CAGCAGCAGC AGGTGACCCT TGGCCCCAGC 10800
ATGCCAGCCA AGCCTCTTCA ACACTTCTCT AGCCCCGGAG CCCTGGGCCC AACCCTCCTT 10860
TTGCCGGGCA AGGAACAAAA CATTGTAGAG ACGGCTCTTC CTTCAGAGGT CGCTGAGGGG 10920
GCCTCAGCCC ATCAGGGAGG AGGGCCTCTA GGAGTAGGGA CTACACCTGA GCCCATGGCT 10980
GCTGAACCAG GGGAGGTAAA ACCTTCGCTC TCTGGGGACT CACAACTCCT GCTTGTCCAA 11040
CCCCAAGCCC AGCCTCAGCC TCAGCCCCAG CCCGGCTCTC TGCAGCTACA GCCACCTCTG 11100
AGGCTCCCAG GACAACAGCA GCAACAGCAA GTTAACTTGC TTCACTCAGC AGGCATGGGA 11160
AGCCATGGGC AGCTAGGCAG CGGATCGTCT GAGGCCTCAG CTGTGCCCCA GCTCCTGGTC 11220
CAACCCTCTG TTTCCGTAGG GGACCAGCCC GGACCCGTGA CCCAGAACCT TCTTGGCCCC 11280
CAGCCTTCCC TGCTAGAACA GCCCCTGCAG AATAATACAG GGCCACAGCT TCCTAAACCA 11340
GGACCTGCTC CCCAGGCTGG GCAGGGTCTG CCTGGGGTTG GAGTCATGCC TGCAGTGGGT 11400
CAGCTTCGAG CACAACTTCA AGGAGTCCTG GCCAAAAACC CGCAGCTGCG GCACTTGAGT 11460
CCTCAGCAGC AACAGCAGCT ACAGGCCCTT CTTGTGCAGC GGCAGCTGCA GCAGAGTCAG 11520
GCCGTACGCC AGGCTCCACC TTTCCAGGAG CCTGGGACAC AACCCTCTCC TCTGCAGGGC 11580
CTGCTGGGCT GCCAGCCACA ACCTGGGGGT TTCCCTGGGC CCCAGACAGG CCCTCCCCAG 11640
GAGCTGGGGG CAGGGCCTCG ACCTCAGGGC CCACCCCGGC CCCCTGTCCC ACAAGGAGCC 11700
TCACCCGCAG GACCAGCACT TGGGCCTGTC CACCCCACGC CTCCGCCATC CAGCCCCCAA 11760
GAGCCAAAGA GACCTTCATC ACAGTTACCT TCCCCCAATG CTCAGCTTCC CCCCACCCAT 11820
CCAGGGACTC CAAAGCCTCT TGGGAGGGTC TCACCTGCTT CTGCCCAGCA CACAGATACC 11880
TTCTTTGGCA AGGGGCTGGG ACCTTGGGAC CCCCCAGACA ACCTAGCAGA AACCCAGAAG 11940
CCAGAGCAGA GCAACCTGGT AGCTGGGCAC CTGGAGCAAG TGAATGGACA GGTGGTGCCT 12000
GAACCACCCC AACTCAGCAT CAAGCAGGAG CCTCGGGAAG AGTCATGTGC CCTGGGAGCC 12060
CCAGCGGTGA AGAGGGAGGC CAATGGGGAG CCAGTAGGGA CAGCAGGTAC CAGCAACCAC 12120
CTCCTGCTGG CAGGCCCCCG CTCAGAGGCT GGGCATCTGC TCTTGCAGAA GCTTCTACGA 12180
GCAAAGAATG TGCAACTCAA CGCTGGGCGG GGGCCTGAGG GGCTGCGAAC TGAGATCAAC 12240
GGGCACATTG ACAGCAAGCT GGCTGGGCTG GAGCAGAAAC TGCAGAGTAC CCCCATCAAC 12300
AAGGAGGATG TGGCAGCAAG GAAACCTTTG ACGTCAAAGC CCAAGCGGGT GCAGAAGGCA 12360
GGCGACAGGT TGGTGAGCTC CAGAAAAAAG CTCCGGAAGG AGGACGGGCT GAGGGCCAGC 12420
GAAGCCTTGC TGAAGCAACT GAAGCAGGAG CTGTCTCTGC TGCCCCTGAC GGAGCCTACC 12480
ATCACCGCCA ACTTTAGCCT CCTGGCTCCC TTTGGCAGTG GCTGCCCAGT CAGTGGGCAA 12540
AACCAGCTAA GAGGGGCCTT TGGAAGTGGG ACGCTAACCA CTGGCCCTGA CTACTATTCT 12600
CAGCTGCTGA CCAAGAATAA CCTGAGTAAC CCGCCGACAC CACCCTCGTC GCTGCCCCCC 12660
ACCCCACCCC CATCGGTGCA GCAGAAGATG GTGAATGGTG TCACTGCGTC TGAGGAACTG 12720
GGGGAGAACC CCAAGGATGC CACGTCTGCC GGGGACACAG AAGGCACGCT GCGGGATGCT 12780
TCCGAGGTGA AGAGTCTTGA CCTGCTGGCT GCCTTGCCTA CGCCCCCTCA CAACCAGACC 12840
GAGGATGTCA GGATGGAGAG CGATGAGGAC AGTGACTCTC CCGACAGCAT CGTGCCAGCT 12900
TCATCCCCCG AGAGCATCCT GGGAGAGGAG GCCCCTCGTT ACCCTCAGCT GGGGTCTGGC 12960
CGGTGGGAAC AGGATGATCG GGCCCTGTCC CCTGTCATAC CCATCATCCC TCGGGCCAGT 13020
ATCCCAGTCT TCCCAGATGC CAAGCCGTAT GTGGCCCTTG ACCTGGATGT CTCAGGAAAG 13080
CTGCCTGCTG TAGCTTGGGA GAAAGGCCAA GGAAGTGAGG TGTCTGTCAT GCTGACAGTG 13140
TCTGCTGCCG CAGCCAAGAA CCTGAATGGG GTGATGGTGG CAGTGGCAGA GCTGCTAAGC 13200
ATGAAGATTC CCAACTCCTA TGAGGTGCTG TTCCCAGAGA GCCCCGCTCG GATTGGCATG 13260
GTGCCCAAAA AGGGAGATGC TGAAGGTGCT GTAGGGAAAG AGAAGGGTGT GGGAGACAAG 13320
AACCCAGATG CTGGCCCCGA GTGGCTCAAG CAGTTTGATG CTGTGTTGCC CGGCTACACG 13380
CTTAAAAGCC AGCTGGACAT CTTGAGCCTC CTGAAGCAGN NNNNNNNNNN NNNNNNNNNN 13440
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA GCTCTCGGCC 13500
CCGCCCGAGC CCTCCCCACC CCCCTCCTTG GCACCCTCTC CTGCCAGCCC CCCTGCTGAA 13560
CCCCTGGTTG AACTTCCATC TGAATCAGCT GAGCCACCTG TCCCCTCCCC TCTGCCGCTG 13620
GCCTCATCCC CTGAATCCGC CCGGCCCAAA CCCCGAGCCC GGCCCCCGGA GGAAGGTGAA 13680
GATTCCCATC CCCCTCGCCT CAAGAAGTGG AAGGGAGTGC GCTGGAAGCG ACTGCGGCTG 13740
CTGCTGACCA TCCAGAAGAC CAGCGGGCAT CAGGAAGACG AGCGGGAAGT TGCAGAGTTC 13800
ATGGAGCAGC TGGGCACAGC CTTGCGACCT GACAAGGTGC CTCGGGATAT GCGGCGCTGC 13860
TGCTTCTGTC ACGAGGAGGG TGATGGGGCC ACGGATGGGC CAGCCCGCCT GTTGAACTTG 13920
GACCTGGACC TCTGGGTGCA CCTGAACTGT GCCCTGTGGT CCACGGAGGT ATATGAGACC 13980
CAGGGTGGGG CGCTGATGAA TGTGGAAGTT GCCCTCCACC GAGGCCTGCT CACCAAGTGC 14040
TCCCTGTGCC AGCGAACCGG TGCCACCAGC AGCTGCAACC GCATGCGTTG CCCCAATGTC 14100
TACCATTTTG CCTGTGCCAT TCGCGCCAAG TGCATGTTCT TCAAGGACAA GACCATGCTG 14160
TGTCCAATGC ATAAGGTCAA GGGCCCCTGT GAGCAGGAGC TGAGCTCGTT TGCGGTCTTC 14220
CGCCGGGTCT ACATCGAGCG GGACGAGGTG AAGCAGATAG CCAGCATCAT CCAGCGGGGG 14280
GAGCGGCTGC ACATGTTCCG CGTCGGGGGC CTCGTGTTCC ATGCCATTGG GCAGTTGCTC 14340
CCACACCAGA TGGCCGATTT CCACAGTGCC ACTGCCCTCT ACCCAGTGGG CTATGAGGCC 14400
ACGCGCATCT ATTGGAGCCT TCGCACCAAC AACCGGCGCT GCTGCTACCG CTGTTCCATC 14460
AGTGAGAACA ATGGGCGGCC TGAGTTCATA ATCAAAGTCA TGGAGCAAGG CCTGGAGGAT 14520
CTCATCTTCA CTGACGCCTC TCCCCAGGCC GTTTGGAACC GCATCATCGA GCCTGTGGCT 14580
GCCATGAGGA AGGAGGCCGA CATGCTGCTC TTCCCTGAGT ACCTGAAGGA GCTCTTCGGT 14640
TTGACAGTGC ACGTGCTGCG CATAGCTGAA TCACTGCCTG GGGTGAGATG TCAATATTTA 14700
TCTCGTTGGC CCCACCCCTT GATGCTCCCT ATGATCCCCA CTGGCTGTGC CCGATCCGAG 14760
CCCAAGATCC TTTCACACTA CAAACGGCCC CACACCCTGA ACAGCACCAG CATGTCTAAG 14820
GCGTATCAGA GCACCTTCAC AGGCGAGACC CACACCCCGT ACAGCAAGCA GTTTGTGCAC 14880
TCCAAGTCAT CCCAGTACCG ACGGCTGCGC ACCGAGTGGA AGAACAACGT GTATCTGGCT 14940
CGCTCCCGGA TCCAGGGCCT GGGGCTCTAT GCAGCCAAGG ACCTGGAGAA GCACACGATG 15000
GTCATCGAGT ACATCGGCAC CATCATTCGC AACGAGGTGG CCAACCGGCN NNNNNNNNNN 15060
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15120
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 15180
NNNGAGGTCG TGACGTTTGA CAAGGAGGAC AAGATCATCA TCATCTCCAG CCGGCGAATC 15240
AAGGGAGAGG AGCTGACCTA TGACTATCAG TTTGATTTTG AGGACGATCA GCACAAGATC 15300
CCCTGCCACT GTGGAGCCTG GAATTGTCGG AAATGGATGA ACTGA 15346
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 86 0.0 4618
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 86 0.0 4385
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 81 0.0 3645
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 85 0.0 3608
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 87 0.0 3517
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 87 0.0 3505
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 87 0.0 3487
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 89 0.0 3486
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 85 0.0 3482
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 88 0.0 3474
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 88 0.0 3472
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 88 0.0 3467
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 89 0.0 3464
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 88 0.0 3459
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 88 0.0 3454
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 88 0.0 3453
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 88 0.0 3421
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 86 0.0 3400
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 88 0.0 3398
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 88 0.0 3359
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 88 0.0 3357
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 84 0.0 3321
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 84 0.0 3307
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 83 0.0 3180
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 77 0.0 2936
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 81 0.0 2113
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 83 0.0 1869
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 81 0.0 1825
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 83 0.0 1816
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 86 0.0 1697
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 80 0.0 1496
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 83 0.0 1452
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 87 0.0 1245
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 80 0.0 1224
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 74 0.0 1018
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 87 0.0 969
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 87 0.0 944
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 81 0.0 918
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 81 0.0 914
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 81 0.0 909
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 77 0.0 892
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 73 0.0 830
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 76 0.0 830
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 71 0.0 816
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 72 0.0 804
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 70 0.0 798
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 70 0.0 797
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 70 0.0 792
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 69 0.0 790
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 69 0.0 787
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 69 0.0 787
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 69 0.0 785
WERAM-Ora-0001 ENSOANP00000000271.1 Ornithorhynchus anatinus 81 0.0 734
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 70 0.0 731
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 63 0.0 719
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 90 0.0 716
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 65 0.0 708
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 65 0.0 703
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 61 0.0 673
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 71 0.0 649
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 49 2e-135 483
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 43 6e-126 452
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 41 5e-109 395
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 73 1e-94 348
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 68 3e-85 317
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 43 2e-50 201
Created Date 25-Jun-2016