WERAM Information


Tag Content
WERAM ID WERAM-Bot-0131
Ensembl Protein ID ENSBTAP00000019193.5
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSBTAG00000014429.5 ENSBTAT00000019193.5 ENSBTAP00000019193.5
ENSBTAG00000014429.5 ENSBTAT00000063707.1 ENSBTAP00000054949.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 4.10e-45 152.8 5309 5424
Me_Reader PHD 4.20e-26 91 171 5048
Organism Bos taurus
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSBTAP00000019193.5 5309 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5393
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSBTAP00000019193.5 5394 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5424
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslp.egkswyCpsCke 52 
C +C + ++ + C + C++ +H C + + s l+ + + +Cp++ e
ENSBTAP00000019193.5 171 RCSHCTRLGA----SIPCRSpgCSRLYHFPCATASGSFLSmKTLQLLCPEHSE 219
6999933333....599******************888885557899**9975 PP
PHD.txt 3 iClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C vC ++ ge + ++ C +C + +H+ C++ +l+ + w Cp+Ck
ENSBTAP00000019193.5 228 RCAVC--EGPGELCdLFFCTSCGHHYHGACLDTALTARKRA-GWQCPECK 274
6****..444545559******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C+k+++++k m+ C++Cd+ +H+ C+k+p+++lp+ sw C+ C+
ENSBTAP00000019193.5 275 VCQACRKPGNDSK-MLVCETCDKGYHTFCLKPPMEELPAH-SWKCKACR 321
8****99999987.*************************9.*******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSBTAP00000019193.5 1343 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1392
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSBTAP00000019193.5 1394 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1440
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l + e+ + + C sC+
ENSBTAP00000019193.5 1471 TCPICHAPYVEEDLLIQCRHCERWMHAGCESLFTEDDVEQaadEGFDCVSCQ 1522
7*****99999999*****************9933333444434599*9997 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSBTAP00000019193.5 4943 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4975
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C++ +H C ++ ++k +Cp +k
ENSBTAP00000019193.5 5002 KCSLCQRTGATS----SCNRmrCPNVYHFACAIRAKCMFFKDKTMLCPMHK 5048
599996666665....6*9999*********98886666677678888776 PP

Protein Sequence
(Fasta)
MDSPKPPGED KDSEPAADGP AASEESGATE PDLPKPHVGE VSVPSSGGPR LQEPPQDCSG 60
GPVRRCALCN CGEPSLHGQR ELRRFELPFD WPRCPVVSPG GNPGPSEGAL PSEDLSQIGF 120
PEGLTPAHLG EPGGSCWAHH WCAAWSAGVW GQEGPELCGV DKAIFSGISQ RCSHCTRLGA 180
SIPCRSPGCS RLYHFPCATA SGSFLSMKTL QLLCPEHSEG ATHLEEARCA VCEGPGELCD 240
LFFCTSCGHH YHGACLDTAL TARKRAGWQC PECKVCQACR KPGNDSKMLV CETCDKGYHT 300
FCLKPPMEEL PAHSWKCKAC RVCRACGAGS AELNPNSEWF ENYSLCHRCH KAPGGQPVSS 360
LAEQHPPVCS RFSPLEPGAT PTDEPNSLYV ACQGQPKGGH VTSMQPKEPG PLQCEAKPLG 420
REGAQLEPQL EAPLNEEMPL LPPPEESPLS PPPEDSPTSP PPEASRLSPP PEDSPLSPPP 480
EESPLSPPPE SPPFSSPEDS PPHPPLDTPL PPPPEASPLS PPLEESPLSP PPEELPTSPP 540
PEASRLSPPP EESPMSPPPE ESPMSPPPEA SCLFPPFEES PLSPPPEESP LSPPPEASRL 600
SPPPEDSPMS PPPEDLPMSP PPEVSRLSPP PEESPLSPPP EESPTSPPPE ASRLSPPPED 660
SPTSPPPEDP PASPPPEDLL VSLPLEESPL LPLPEELRLC PRPEEPHLSP QPEKPRLSPA 720
PQEPRLSPAS QEPRLSPAPQ EPCLSPAPEE PRLSPAPQQP CLSPAPEEPR LSPAPQQPHL 780
SPVPPEEPCL SPRPEEPRLS PRPEEPRLSP RPEEPRLTPR PEEPHLSPRP EEPIEEPSLC 840
LASEELPLLL PPREPPLSPV LGEPALSEPG EPPLSPLPEE LPLSPSGEPS LSPQLMPPDP 900
LPPPLSPIIT TVAPPALSPL GELEYPFDAK GDSDPESPLA APILETPISP PPEANCTDPE 960
PVPPMILPPS PGSPMGPASP ILMEPLPPRC SPLLQHSLPP SNSPPSQCSP ALPLLVPSPL 1020
SPMDKAVEVS EEAEPQKMET EKAPEPECPA LEPSPTSPLP SPLGNLSCPA PSPAPALDDF 1080
CGLGEDTAPL DGTDTPGSQP EAGQTPGSLA SELKGSPVLL DSEELAPVTP MEVYGPECKQ 1140
AGQGSPCEEQ EEPRAPVAPT PPILIKSDIV NEISNLSQGD ASASFPGSEP LLGSPDPEGG 1200
GSLSMELGVS TDVSPARDEG SLRLCTDSLP ETDDSLLCDA GTAIGGGKAE GDKGRRRSSP 1260
ARSRIKQGRS SSFPGRRRPR GGAHGGRGRG RARLKSTTSS IETLVVADID SSPSKEEEDD 1320
DDDTMQNTVV LFSNTDKFVL MQDMCVVCGS FGRGAEGHLL ACSQCSQCYH PYCVNSKITK 1380
VMLLKGWRCV ECIVCEVCGQ ASDPSRLLLC DDCDISYHTY CLDPPLLTVP KGGWKCKWCV 1440
SCMQCGAASP GFHCEWQNSY THCGPCASLV TCPICHAPYV EEDLLIQCRH CERWMHAGCE 1500
SLFTEDDVEQ AADEGFDCVS CQPYVVKPAA PVAPPELVPM KVKEPEPQYF RFEGVWLTET 1560
GMAVLRNLTM SPLHKRRQRR GRPGLPGEAG LEGAEPSEVL GPDDKKDGDL DTDELLKAEG 1620
SVEHMECEIK LEGPVSPDGE PGKEETEESK KRKRKPYRPG IGGFMVRQRK SHTRVKKGPA 1680
AQAEVLSGDG QPDEVLPADL PAEGSVDQGL ADGDEKKKQQ RRGRKKNKLE DMFPAYLQEA 1740
FFGKELLDLS RKALFAVGVG RPSFGLGTPK AKGDGGSERK ELPTSQKGDD GPDVADEESR 1800
GPEGKADTPA IAGPEDGGIK ASPVPSDPEK PGTPGEGMLS SDLDRIPTEE LPKMESKDLQ 1860
QLFKDVLGSE REQHLGCGTP GLDGSRTPLQ RPFLQGGLPL GNLPSNSPMD SYPGLCQSPF 1920
LDSRERGGFF SPEPGEPDSP WTGSGGTTPS TPTTPTTEGE GDGLSYNQRS LQRWEKDEEL 1980
GQLSTISPVL YANINFPSLK QDYPDWSSRC KQIMKLWRKV PAADKAPYLQ KAKDNRAAHR 2040
INKVQKQAES QINKQTKVGD MARKTDRPAL HLRIPPQPGA LGSPPPAAAP TIFIGSPTPP 2100
AGLSTSADGF LKPPAGTVPG PDSPGELFLK LPPQVPAQVP SQDPFGLASA YALEPRFPTA 2160
PPTYPPYPSP TGAPAPPPTL GASSRPGTGQ PGEFHTTPPG TPRHQPSTPD PFLKPRCPSL 2220
DNLAVPESPG GGGSKAAEPL LSPLPFGESR KALEVKKEEL GGSSPSYGPP NLGFVDSPSS 2280
GPHLGGLELK APDVFKAPLT PRASQVEPQS PGLGLRPQEP PPAQALAPSP PSHSDIFRPG 2340
PYPDPYAQPP LTPRPQPPPP ESCCALPPRS LPSDPFSRVP ASPQSQSSSQ SPLTPRPLSA 2400
EAFCPSPVTP RFQSPDPYSR PPSRPQSRDP FAPLHKPPRP QPSEVAFKAG PLAHTPLGAG 2460
GFPAALPSGP AGELHAKVPS VQPPNFARSP GTSAFVGAPS PMRFTFPQAV GEPPLKPPVP 2520
QPGLPPPHGI NSHFGPGPTM GKPQSTNYAV AAGNFRPSGS PLGPSSGPAG EGYGPSPLRP 2580
PSVLPQPTPD GPLPCLPHGA TQRAGITSPV EKREDPGAGT GSSLAAPELS GTQDPGMSGL 2640
SQTELEKQRQ RQRLRELLIR QQMQRNTLRQ EKETAAAAGA VGPPGNWGAE PSSPAFEQLG 2700
RGQTPFAGTQ DKSSLVGLPP SKLGGPILGP GAFPSDDRLS RPPPPATPSS MDVNSRQLVG 2760
GSQAFYQRAP YPGSLPLQQQ QQQQLWQQQQ QATAAASMRL TMSARFPSTP GPELGRQTLG 2820
SPLAGISNRL PGPGEPVPGP AGPAQFIELR HNVQKGLGPG GPPFPGQGPP QRPRFYPVTE 2880
DSHRLAPEGL RGLAVSGLPP QKPSAPPAPE LNNSLHPTPH TKGPNLPTGL ELVSRPPSST 2940
ELGRPPPLTL EAGKLPCEDP ELDDDFDAHK ALEDDEELAH LGLGVDVAKG DDELGTLENL 3000
ETNDPHLDDL LNGDEFDLLA YTDPELDTGD KKDIFNEHLR LVESANEKAE REALLRGVEP 3060
GPFGPEERPP PAADASEPRL ASGLPEVKPK VEEGGRHPSP CQFTITTPKA ELAPVTTSLG 3120
LGVKPGQSVT GSRDTRMGTG PFSSSGHTAE KVPFGTTGGP PAHLLTPSPL SGPGGSSLLE 3180
KFELESGALT LPGGHASGDE LDKMESSLVA SELPLLIEDL LEHEKKELQK KQQLSAQLQP 3240
VQQQQPQQHS LLSTPGPGQA VSLPHEGSSP SLAGPQQQLA LGLGGSRQPG LAQPLMPNQP 3300
PAHALQQRLA PSMAMVSNQG HMLSGQHGGQ AGLVPQQNPQ PVLAQKPMGT VPPSMCMKPQ 3360
QLAMQQQLAN SFFPDTDLDK FAAEDIIDPI AKAKMVALKG IKKVMAQGSI GVAPGMNRQQ 3420
VSLLAQRLSG GPGSDLQNHV AAGSGQERGA GDSSQPRPNP PTFAQGVINE ADQRQYEEWL 3480
FHTQQLLQMQ LKVLEEQIGV HRKSRKALCA KQRTAKKAGR EFPEADAEKL KLVTEQQSKI 3540
QKQLDQVRKQ QKEHTNLMAE YRNKQQQQQQ QQQQQQQQHS AVLALSPSQS PRLLTKLPGQ 3600
LLPGHGLQPP QGPPGGQAAG LRLTPGGMAL PGQPGGPFLN TTLAQQQQQQ HSGGAGALAG 3660
PSGGFFPGNL ALRGLGPDSR LLQERQLQLQ QQRMQLAQKL QQQQQQHLLG QVAIQQQQQQ 3720
GPGVQANQAL GPKPQGLLPP SSHQGLLVQQ LSPQPPPGPQ GMLGPAQVAV LQQQQPHPGA 3780
LGPQGPHRQV LLTQPRVLSS PQMAQQGQGL MGHRLVTSQQ QQQQQHQQQG SMAGLSHLQQ 3840
GLMPHSGQPK VGAQPMAALQ QQQLQQQQQQ QQQQQQQLQQ QQQIGLLSQS RALLSPQQQQ 3900
QQQQMTLGPG MPAKPLQHFS SPGALGPTLI LTGKEQSIVE TALPSEASEG SSTHQGGPLP 3960
MGTVPESVAP EPGEVKPSLS GDSQLLLVQP QAQPQPNSLQ LQPPLRLPGQ QQPPVNLLHT 4020
AGAGSHGQPG SGSSEASSVP HLLAQSSVSL GEQPGSVTQN LLSSQQPLGL ERPMQNNIGP 4080
QPAKPGSVPQ SGQSLPGAGV MPTVGQLRAQ LQGVLAKNPQ LRHLSPQQQQ QLHALLMQRQ 4140
LQQSQAARQA PPYQEPGTQP SPLQGLLGRQ PQLGGFPGSQ TGPLQELGAG LRPQGPPRLP 4200
APQGALSTGP VLGPVHPTPP PSSPQEPKRP SSQLPSPSSQ LPSEAQLPTT QPGTPKPQGP 4260
PLELPPGRVS PAAAQLADTF FGKGLGPWDP PDHLAEAQKL EQSSLVPGHL DQVNGQVVPE 4320
PPHLNIKQEP REEPCALGSQ AVKREANGEP VGAPGTSNHL LLAGPRSEAG HLLLQKLLRA 4380
KNVQLSTGRG PEGLRAEING HIDSKLAGLE QKPQGTPSTK EDTAARKPLT PKPKRVQKAS 4440
DRLVSSRKKL RKEDGVRASE ALLKQLKQEL SLLPLTEPTI TANFSLFAPF GSGCPISGQC 4500
QLRGAFGSGV LPTGPDYYSQ LLTKNNLSNP PTPPSSLPPT PPPSVQQKMV NGVTPSEELG 4560
EHPKDAACAR DTEGVLRDAS EVKSLDLLAA LPTPPHNQTE DVRMESDEDS DSPDSIVPAS 4620
SPESILGEEA PRFPQLGSGR WEQDDRALSP VIPIIPRASI PVFPESKPYG VLDLETTRKL 4680
PAPTWEKGKG SEVSVMLTVS AAAAKNLNGM MVAVAELLSM KIPNSYEVLF PESPARAGIE 4740
PKKGEAEGPG GKEKSLGGKS PEAGPDWLKQ FDAVLPGYTL KSQLDILSLL KQESPAPEPP 4800
AQHSYTYNVS NLDVRQLSAP PPEEPSPPPS PLAPSPASPP AEPLVELPAE PSAEPPIPSP 4860
LPLASSPEST RPKPRARPPE EGEDSRPPRL KKWKGVRWKR LRLLLTIQKG GVRQEDEREV 4920
AEFMEQLGTA LRPDKVPRDM RRCCFCHEEG DGATDGPARL LNLDLDLWVH LNCALWSTEV 4980
YETQGGALMN VEVALHRGLL TKCSLCQRTG ATSSCNRMRC PNVYHFACAI RAKCMFFKDK 5040
TMLCPMHKIK GPCEQELSSF AVFRRVYIER DEVKQIASII QRGERLHMFR VGGLVFHAIG 5100
QLLPHQMADF HSATALYPVG YEATRIYWSL RTNNRRCCYR CSIGENNGRP EFVIKVMEQG 5160
LEDMVFTDAS PQAVWNRIIE PVAAMRKEAD MLRLFPEYLK GEELFGLTVH AVLRIAESLP 5220
GVESCQNYLF RYGRHPLMEL PLMINPTGCA RSEPKILTHY KRPHTLNSTS MSKAYQSTFT 5280
GETNTPYSKQ FVHSKSSQYR RLRTEWKNNV YLARSRIQGL GLYAAKDLEK HTMVIEYIGT 5340
IIRNEVANRR EKIYEEQNRG IYMFRINNEH VIDATLTGGP ARYINHSCAP NCVAEVVTFD 5400
KEDKIIIISS RRIPKGEELT YDYQFDFEDD QHKIPCHCGA WNCRKWMN 5448
Nucleotide Sequence
(Fasta)
ATGGACAGCC CGAAGCCGCC TGGTGAGGAT AAAGATTCGG AACCAGCAGC TGATGGACCT 60
GCAGCCTCCG AGGAGTCAGG TGCCACTGAG CCAGACCTTC CCAAACCACA TGTTGGGGAG 120
GTCTCTGTGC CCAGTTCTGG GGGTCCCCGG CTTCAGGAGC CTCCCCAGGA CTGCAGCGGG 180
GGTCCGGTGC GGCGTTGTGC TCTCTGTAAC TGCGGAGAGC CCAGTCTACA TGGGCAGCGG 240
GAGCTACGGC GCTTTGAGTT GCCATTTGAC TGGCCCCGGT GTCCCGTGGT GTCCCCTGGG 300
GGGAACCCAG GGCCCAGTGA GGGAGCGCTG CCCAGTGAGG ACCTATCACA GATTGGTTTC 360
CCTGAGGGCC TGACACCTGC CCACCTGGGA GAACCTGGAG GGTCCTGCTG GGCTCACCAC 420
TGGTGCGCTG CATGGTCGGC AGGCGTATGG GGGCAGGAGG GCCCAGAACT ATGTGGTGTG 480
GACAAGGCCA TCTTCTCAGG GATCTCACAG CGCTGCTCCC ACTGCACCAG ACTTGGTGCC 540
TCCATCCCTT GCCGCTCGCC TGGATGTTCA CGTCTTTACC ACTTCCCCTG TGCAACTGCC 600
AGCGGTTCCT TCTTATCCAT GAAAACACTG CAGCTGCTAT GCCCAGAGCA CAGTGAGGGG 660
GCCACACATC TGGAGGAGGC TCGCTGTGCA GTATGTGAGG GGCCAGGGGA ATTGTGTGAC 720
TTGTTCTTCT GTACCAGCTG TGGGCATCAC TATCACGGGG CCTGTCTGGA CACTGCTCTG 780
ACTGCCCGCA AGCGTGCTGG CTGGCAGTGC CCTGAATGCA AAGTGTGCCA AGCTTGCAGG 840
AAACCTGGGA ATGACTCTAA GATGTTGGTC TGTGAGACGT GTGACAAAGG ATACCATACC 900
TTCTGCCTGA AACCACCAAT GGAAGAACTG CCTGCTCACT CTTGGAAGTG CAAGGCATGC 960
CGGGTGTGCC GGGCCTGTGG GGCAGGCTCA GCTGAGCTGA ATCCCAACTC TGAGTGGTTT 1020
GAGAACTATT CGCTCTGTCA CCGCTGTCAC AAAGCCCCGG GAGGACAGCC TGTCAGTTCT 1080
CTTGCTGAGC AGCACCCCCC GGTCTGTAGC AGATTCTCAC CCCTAGAGCC TGGCGCTACC 1140
CCCACTGATG AGCCCAATAG TCTGTATGTT GCGTGCCAAG GGCAGCCAAA GGGTGGACAC 1200
GTGACCTCTA TGCAACCCAA GGAACCGGGG CCCCTGCAAT GTGAAGCCAA ACCACTAGGG 1260
AGAGAAGGGG CCCAACTTGA GCCCCAGTTG GAGGCCCCCC TAAATGAGGA GATGCCACTG 1320
CTGCCCCCAC CTGAGGAGTC GCCCCTGTCC CCACCACCTG AGGACTCACC CACGTCCCCG 1380
CCGCCTGAGG CATCGCGCCT GTCCCCACCG CCTGAGGACT CACCCCTCTC TCCACCGCCT 1440
GAGGAGTCTC CTCTGTCTCC CCCACCTGAG TCACCACCCT TCTCTTCCCC AGAGGACTCC 1500
CCCCCACATC CCCCACTCGA TACACCCTTA CCCCCACCAC CTGAAGCGTC ACCCCTGTCC 1560
CCACCACTGG AGGAGTCTCC TCTGTCCCCT CCCCCTGAAG AATTGCCTAC TTCCCCACCA 1620
CCGGAAGCAT CTCGCCTGTC TCCACCACCG GAAGAGTCAC CCATGTCTCC GCCACCTGAA 1680
GAGTCACCTA TGTCTCCGCC ACCTGAGGCC TCTTGTCTGT TCCCACCATT TGAGGAGTCG 1740
CCCCTGTCCC CTCCACCCGA GGAGTCTCCT CTCTCCCCGC CACCTGAGGC GTCACGCCTA 1800
TCCCCACCGC CTGAGGACTC ACCCATGTCC CCACCACCTG AAGACTTGCC TATGTCTCCC 1860
CCGCCTGAGG TGTCACGCCT GTCCCCCCCG CCTGAGGAGT CTCCCCTGTC ACCGCCACCT 1920
GAGGAGTCTC CCACATCCCC TCCACCTGAG GCTTCACGCC TGTCCCCTCC ACCTGAGGAC 1980
TCCCCTACAT CTCCGCCACC TGAAGACCCG CCTGCTTCCC CACCTCCGGA AGACTTGCTC 2040
GTGTCCCTGC CGCTGGAGGA GTCACCGCTG TTGCCACTGC CTGAGGAACT ACGACTTTGC 2100
CCCCGGCCTG AGGAGCCACA CCTGTCCCCT CAACCTGAGA AACCACGCCT GTCTCCTGCA 2160
CCCCAGGAGC CGCGTCTGTC CCCCGCATCC CAGGAGCCGC GTCTGTCCCC TGCACCCCAG 2220
GAGCCGTGCC TGTCCCCTGC ACCTGAGGAG CCACGCCTGT CCCCCGCACC CCAGCAGCCG 2280
TGCCTGTCCC CTGCACCCGA GGAGCCGCGT CTGTCCCCCG CACCCCAGCA GCCGCACCTG 2340
TCCCCGGTGC CCCCTGAGGA ACCATGCCTG TCCCCCCGGC CTGAGGAACC GCGCCTGTCC 2400
CCTCGGCCTG AGGAACCGCG CCTGTCCCCT CGGCCTGAGG AACCGCGCCT GACCCCCCGG 2460
CCTGAGGAAC CACACCTGTC CCCTCGACCT GAAGAGCCCA TTGAGGAGCC AAGCCTGTGT 2520
CTTGCATCTG AGGAATTGCC CTTACTCCTC CCACCCAGGG AGCCACCCTT ATCCCCTGTG 2580
CTTGGAGAGC CGGCCCTGTC TGAGCCTGGG GAACCACCTC TGTCCCCTCT GCCTGAGGAG 2640
CTGCCATTGT CCCCATCTGG GGAGCCATCT TTGTCACCTC AGCTGATGCC ACCAGATCCT 2700
CTTCCTCCTC CACTTTCACC CATCATCACA ACTGTGGCCC CACCAGCCCT GTCTCCGTTG 2760
GGAGAGTTGG AGTACCCCTT TGATGCTAAA GGGGACAGTG ACCCTGAGTC ACCACTGGCT 2820
GCCCCCATCC TGGAGACACC TATCAGCCCT CCACCAGAAG CTAACTGCAC TGACCCTGAG 2880
CCTGTACCCC CTATGATCCT TCCCCCATCC CCAGGCTCCC CCATGGGACC GGCCTCTCCC 2940
ATCCTGATGG AGCCCCTCCC CCCTCGTTGT TCCCCTCTCC TCCAGCATTC CCTGCCTCCC 3000
TCAAACTCAC CTCCTTCCCA GTGCTCTCCT GCCCTGCCGC TGTTGGTTCC TTCCCCGCTG 3060
AGTCCCATGG ACAAGGCAGT GGAGGTCTCA GAAGAGGCTG AGCCACAGAA GATGGAGACT 3120
GAGAAAGCCC CAGAACCTGA GTGCCCAGCC TTGGAACCCA GCCCTACTAG TCCTCTTCCA 3180
TCTCCTCTGG GGAACCTTTC CTGCCCTGCA CCCAGCCCTG CCCCAGCCCT GGATGACTTC 3240
TGTGGCCTGG GGGAAGATAC AGCCCCTCTG GATGGGACTG ACACTCCTGG TTCACAGCCA 3300
GAGGCTGGAC AGACCCCTGG CAGTTTGGCT AGTGAACTTA AGGGTTCTCC TGTGCTCCTG 3360
GACTCTGAGG AGCTGGCCCC TGTGACCCCT ATGGAGGTCT ATGGCCCAGA ATGCAAGCAG 3420
GCAGGGCAGG GCTCACCCTG TGAAGAGCAA GAGGAGCCAC GTGCACCAGT GGCCCCAACC 3480
CCACCTATTC TCATCAAATC CGACATCGTT AATGAGATCT CCAATCTGAG CCAGGGCGAT 3540
GCCAGTGCCA GTTTTCCTGG CTCAGAGCCC CTGCTAGGCT CTCCAGATCC CGAGGGGGGT 3600
GGCTCCCTGT CCATGGAGTT GGGGGTATCT ACAGACGTTA GTCCAGCCCG AGATGAGGGC 3660
TCCCTGCGGC TCTGTACCGA CTCGCTTCCA GAGACTGATG ACTCGCTATT GTGTGATGCT 3720
GGGACAGCTA TTGGAGGAGG CAAAGCCGAG GGGGACAAGG GGAGGCGGCG CAGCTCCCCA 3780
GCTCGTTCCC GCATCAAACA GGGTCGCAGC AGTAGTTTCC CAGGGAGACG CCGGCCACGT 3840
GGAGGAGCAC ATGGAGGACG TGGGAGAGGA CGGGCCCGGC TAAAATCAAC TACTTCTTCC 3900
ATTGAGACTC TGGTAGTTGC TGATATCGAT AGCTCTCCCA GCAAGGAGGA AGAAGACGAT 3960
GATGATGATA CCATGCAAAA TACTGTGGTT CTCTTCTCCA ACACAGACAA ATTTGTGCTA 4020
ATGCAGGACA TGTGTGTGGT GTGTGGCAGC TTTGGCCGGG GGGCAGAGGG CCACCTCCTT 4080
GCCTGTTCCC AGTGCTCTCA GTGCTATCAC CCTTACTGTG TCAACAGCAA GATCACCAAG 4140
GTGATGCTGC TGAAGGGCTG GCGTTGTGTG GAGTGCATCG TGTGCGAGGT GTGTGGCCAG 4200
GCCTCAGACC CCTCCCGCCT CCTGCTCTGT GATGACTGTG ACATTAGCTA CCACACGTAC 4260
TGCCTGGACC CCCCACTGCT CACCGTGCCC AAGGGTGGCT GGAAGTGCAA GTGGTGTGTG 4320
TCTTGTATGC AGTGTGGGGC CGCCTCCCCT GGCTTCCACT GTGAATGGCA GAATAGTTAC 4380
ACACACTGCG GGCCCTGTGC CAGCCTGGTG ACCTGCCCTA TCTGTCACGC CCCCTACGTG 4440
GAGGAGGACC TGCTGATCCA GTGTCGCCAC TGTGAACGGT GGATGCATGC TGGCTGTGAG 4500
AGCCTCTTCA CAGAGGATGA TGTGGAGCAG GCAGCTGATG AGGGCTTCGA CTGTGTCTCC 4560
TGCCAGCCTT ACGTGGTCAA GCCTGCTGCA CCTGTCGCAC CTCCAGAGTT GGTACCTATG 4620
AAGGTGAAAG AGCCTGAGCC CCAGTACTTT CGCTTTGAGG GCGTGTGGCT GACAGAAACG 4680
GGCATGGCGG TGCTGCGTAA CCTGACCATG TCGCCTCTGC ACAAGCGGCG TCAGAGGCGT 4740
GGACGGCCCG GCCTCCCAGG CGAGGCGGGG CTGGAGGGGG CTGAGCCTTC AGAGGTCCTG 4800
GGCCCTGATG ACAAGAAGGA TGGTGACCTG GACACGGATG AGCTGCTCAA GGCTGAAGGT 4860
AGTGTGGAGC ACATGGAGTG CGAAATTAAA CTGGAGGGTC CTGTCAGCCC TGATGGAGAG 4920
CCTGGCAAAG AGGAGACCGA GGAAAGCAAA AAACGCAAAC GCAAACCCTA TCGGCCTGGC 4980
ATCGGTGGTT TCATGGTGCG ACAGCGGAAA TCCCACACAC GTGTGAAAAA AGGGCCTGCT 5040
GCACAGGCGG AGGTGTTGAG TGGGGATGGG CAGCCCGACG AGGTGTTGCC TGCCGACCTG 5100
CCTGCCGAGG GCTCTGTGGA CCAGGGCTTA GCCGATGGGG ATGAAAAGAA GAAGCAGCAG 5160
CGGCGAGGGC GCAAGAAGAA CAAACTGGAG GACATGTTCC CTGCTTACCT GCAGGAAGCC 5220
TTCTTTGGGA AGGAGCTGCT GGACCTGAGC CGGAAGGCCC TGTTTGCAGT TGGCGTGGGC 5280
CGGCCAAGCT TTGGATTGGG AACCCCCAAA GCCAAGGGCG ATGGAGGCTC AGAGAGGAAG 5340
GAGCTCCCGA CCTCACAGAA AGGAGATGAT GGTCCGGATG TTGCAGATGA AGAATCCCGT 5400
GGCCCCGAGG GCAAGGCTGA TACACCAGCA ATTGCAGGAC CTGAGGATGG TGGCATAAAG 5460
GCATCCCCGG TGCCCAGTGA CCCTGAGAAG CCAGGCACCC CAGGTGAAGG GATGCTTAGC 5520
TCTGACTTAG ACAGGATTCC CACAGAAGAA CTGCCTAAGA TGGAATCCAA GGACCTACAG 5580
CAGCTTTTCA AGGACGTTCT GGGTTCCGAA CGAGAGCAGC ATCTGGGTTG CGGAACCCCT 5640
GGCCTGGACG GCAGCCGGAC ACCGCTGCAG AGGCCCTTTC TTCAAGGTGG ACTCCCTTTG 5700
GGCAATCTCC CCTCCAACAG CCCAATGGAC TCTTACCCGG GCCTCTGCCA GTCCCCATTC 5760
CTGGATTCTA GGGAGCGCGG GGGCTTCTTT AGCCCGGAAC CCGGTGAGCC AGACAGCCCC 5820
TGGACAGGCT CGGGGGGCAC CACGCCCTCC ACCCCCACCA CCCCCACCAC GGAGGGTGAG 5880
GGCGACGGGC TCTCCTATAA TCAGCGGAGT CTTCAGCGCT GGGAGAAGGA CGAGGAGTTG 5940
GGCCAGCTCT CCACCATCTC ACCTGTGCTG TACGCCAATA TTAACTTCCC CAGTCTCAAG 6000
CAGGATTACC CAGATTGGTC AAGCCGCTGC AAACAAATCA TGAAGCTGTG GAGAAAAGTT 6060
CCAGCTGCTG ATAAAGCCCC CTACCTGCAA AAGGCCAAAG ATAACCGGGC AGCTCACCGC 6120
ATCAACAAGG TGCAGAAGCA GGCTGAGAGC CAGATCAACA AGCAGACCAA GGTGGGCGAC 6180
ATGGCCCGCA AGACTGACCG ACCGGCCCTA CATCTCCGCA TTCCCCCCCA GCCAGGGGCC 6240
CTGGGCAGTC CACCTCCTGC TGCTGCCCCC ACCATTTTCA TTGGCAGCCC CACTCCCCCC 6300
GCCGGCTTGT CTACCTCTGC GGACGGGTTC CTGAAGCCGC CGGCGGGCAC GGTGCCCGGC 6360
CCCGACTCGC CTGGTGAGCT CTTCCTCAAG CTCCCGCCCC AGGTGCCCGC CCAAGTGCCT 6420
TCGCAGGACC CCTTTGGACT GGCCTCTGCC TATGCTCTGG AGCCCCGCTT CCCCACAGCA 6480
CCACCCACCT ACCCTCCCTA TCCTAGTCCG ACTGGGGCCC CTGCACCGCC CCCGACGCTG 6540
GGCGCCTCAT CTCGTCCTGG GACTGGCCAG CCAGGGGAGT TCCATACTAC CCCACCTGGC 6600
ACCCCCCGAC ACCAGCCCTC CACGCCTGAC CCCTTCCTCA AACCCCGCTG CCCCTCCCTG 6660
GACAACCTGG CTGTGCCTGA GAGCCCAGGA GGAGGGGGAA GCAAGGCTGC TGAGCCTCTG 6720
CTGTCGCCCC TGCCTTTCGG GGAGTCTCGG AAGGCCCTGG AGGTGAAGAA GGAAGAGCTT 6780
GGGGGATCCT CTCCGAGCTA TGGGCCCCCA AACCTGGGCT TTGTTGACTC TCCCTCCTCA 6840
GGCCCCCACC TGGGTGGCCT GGAGTTAAAG GCACCTGATG TCTTCAAAGC CCCCCTGACC 6900
CCTCGGGCAT CTCAGGTAGA GCCCCAGAGC CCGGGCTTGG GCCTAAGGCC CCAGGAGCCA 6960
CCCCCTGCCC AGGCTTTGGC CCCTTCTCCT CCCAGCCACT CAGACATCTT TCGCCCTGGT 7020
CCCTACCCTG ACCCCTACGC CCAGCCCCCG CTGACGCCTC GGCCCCAACC CCCACCACCT 7080
GAGAGCTGCT GTGCCCTGCC TCCCCGCTCA CTGCCCTCTG ACCCTTTCTC CCGAGTGCCC 7140
GCCAGTCCCC AGTCCCAGTC CAGCTCCCAG TCCCCATTGA CACCCCGTCC TCTGTCTGCT 7200
GAGGCTTTCT GCCCATCCCC TGTTACCCCT CGCTTCCAAT CCCCTGACCC TTATTCCCGT 7260
CCACCCTCGC GCCCTCAGTC CCGGGATCCA TTTGCCCCAT TGCATAAGCC CCCCCGCCCC 7320
CAGCCCTCTG AAGTTGCCTT CAAGGCTGGG CCTCTAGCCC ACACTCCGCT GGGGGCTGGG 7380
GGTTTCCCAG CAGCCCTGCC CTCAGGGCCA GCAGGTGAGC TCCATGCCAA GGTCCCAAGT 7440
GTGCAACCCC CGAATTTTGC CCGGTCCCCT GGGACCAGTG CATTTGTGGG CGCCCCTTCT 7500
CCCATGCGTT TCACTTTCCC TCAGGCGGTC GGGGAGCCTC CCCTAAAGCC GCCTGTCCCT 7560
CAGCCTGGTC TCCCTCCACC CCATGGGATC AACAGCCATT TTGGGCCTGG CCCTACCATG 7620
GGCAAGCCTC AAAGCACAAA CTACGCAGTA GCCGCAGGGA ACTTCCGCCC ATCGGGCAGC 7680
CCCCTGGGGC CCAGCAGCGG GCCCGCAGGA GAGGGCTACG GGCCGTCCCC ACTGCGCCCC 7740
CCGTCAGTCC TGCCCCAACC CACACCCGAT GGGCCCCTCC CCTGCCTGCC CCATGGGGCC 7800
ACACAGCGGG CGGGCATCAC CTCTCCCGTT GAGAAGCGAG AAGATCCAGG GGCTGGCACG 7860
GGCAGCTCTT TGGCGGCACC TGAGCTTTCA GGTACCCAGG ACCCAGGCAT GTCTGGCCTC 7920
AGTCAGACAG AACTAGAGAA GCAGCGACAG CGCCAGCGAC TACGGGAGCT ATTAATTCGG 7980
CAGCAGATGC AGCGCAACAC CCTTCGGCAG GAGAAGGAGA CTGCGGCAGC TGCCGGAGCG 8040
GTGGGACCCC CAGGCAACTG GGGTGCTGAG CCTAGTAGCC CCGCATTTGA GCAGCTGGGT 8100
CGAGGCCAGA CCCCCTTTGC TGGGACCCAG GACAAGAGCA GCCTTGTGGG ACTGCCCCCA 8160
AGCAAGCTGG GTGGCCCCAT CCTGGGGCCA GGGGCTTTCC CCAGTGATGA CCGACTCTCC 8220
CGGCCGCCTC CACCAGCCAC CCCTTCCTCT ATGGATGTTA ACAGCCGGCA ATTGGTGGGG 8280
GGCTCCCAAG CCTTCTATCA GCGAGCACCG TATCCTGGGT CCCTGCCCTT ACAGCAGCAG 8340
CAGCAGCAGC AACTGTGGCA GCAGCAGCAA CAGGCAACAG CAGCAGCTTC CATGCGACTT 8400
ACCATGTCTG CGCGCTTTCC ATCAACTCCT GGGCCTGAAC TTGGCCGCCA AACCCTAGGT 8460
TCCCCTTTGG CTGGAATTTC CAACCGCCTG CCTGGCCCTG GTGAACCAGT GCCTGGCCCA 8520
GCTGGTCCTG CCCAGTTCAT TGAGTTGCGG CACAATGTAC AGAAAGGACT CGGACCGGGG 8580
GGGCCTCCAT TTCCCGGTCA GGGACCTCCA CAGAGACCCC GTTTTTACCC TGTAACTGAG 8640
GATTCCCACC GACTGGCCCC TGAAGGGCTT CGTGGCCTAG CAGTCTCAGG CCTTCCCCCA 8700
CAGAAACCTT CAGCCCCACC AGCTCCTGAA CTGAACAACA GCCTCCATCC AACGCCCCAC 8760
ACCAAGGGTC CCAACCTGCC CACTGGCTTG GAGCTAGTCA GCCGGCCCCC CTCCAGTACT 8820
GAGCTTGGCC GCCCCCCTCC TCTGACCCTG GAAGCTGGAA AACTACCTTG TGAGGACCCT 8880
GAGCTGGATG ATGACTTTGA CGCCCACAAG GCCTTGGAGG ACGACGAGGA GCTGGCTCAC 8940
CTGGGCCTGG GCGTGGATGT GGCCAAGGGA GATGACGAGC TGGGCACTCT GGAGAACCTG 9000
GAGACCAATG ATCCCCACCT CGATGACCTG CTCAATGGGG ATGAGTTTGA CTTGCTGGCC 9060
TATACTGACC CTGAGCTGGA CACGGGGGAC AAGAAGGACA TCTTCAATGA GCATCTGAGG 9120
CTGGTGGAGT CGGCCAATGA AAAGGCTGAA CGAGAGGCTC TGTTGCGAGG GGTGGAGCCA 9180
GGACCCTTCG GCCCCGAGGA GCGCCCTCCC CCGGCCGCTG ATGCCTCTGA GCCCCGTCTG 9240
GCGTCAGGGC TTCCCGAGGT GAAGCCCAAG GTGGAGGAGG GTGGGCGCCA CCCTTCCCCT 9300
TGCCAGTTTA CCATTACCAC CCCCAAGGCA GAGCTGGCAC CTGTCACCAC TTCCCTAGGC 9360
CTGGGGGTGA AGCCAGGTCA GAGTGTGACG GGCAGCCGGG ACACTCGAAT GGGCACAGGA 9420
CCTTTTTCTA GCAGTGGGCA CACAGCTGAG AAGGTCCCCT TTGGGACCAC AGGAGGACCA 9480
CCAGCTCACC TGCTGACGCC CAGCCCACTA AGTGGCCCGG GAGGGTCCTC CCTACTGGAA 9540
AAGTTTGAGC TAGAGAGTGG GGCCCTGACT TTGCCTGGTG GACATGCATC TGGGGATGAG 9600
CTGGACAAGA TGGAGAGCTC ACTGGTAGCC AGTGAGTTAC CCCTGCTCAT TGAGGACCTG 9660
TTGGAACATG AGAAGAAAGA GCTGCAAAAG AAGCAGCAGC TTTCAGCGCA GCTGCAGCCT 9720
GTGCAGCAGC AGCAGCCCCA GCAGCATTCC CTGCTGTCCA CCCCAGGCCC TGGCCAGGCT 9780
GTGTCTTTGC CCCATGAGGG CTCTTCTCCC AGTTTGGCTG GCCCTCAACA GCAGCTTGCC 9840
CTGGGACTTG GAGGCTCCCG ACAGCCAGGC TTGGCCCAAC CATTGATGCC CAACCAGCCA 9900
CCAGCTCATG CCCTCCAGCA GCGCCTGGCC CCATCCATGG CCATGGTATC CAACCAAGGG 9960
CATATGCTAA GTGGGCAGCA TGGGGGACAG GCAGGCTTGG TGCCCCAGCA GAACCCACAG 10020
CCGGTGCTGG CACAGAAGCC AATGGGTACC GTGCCACCGT CCATGTGCAT GAAACCACAG 10080
CAGCTGGCAA TGCAGCAGCA GTTGGCTAAC AGCTTTTTCC CTGATACAGA CCTGGACAAA 10140
TTTGCTGCAG AAGATATCAT TGATCCAATT GCAAAGGCCA AGATGGTGGC TTTGAAAGGC 10200
ATCAAGAAGG TGATGGCTCA GGGCAGCATT GGAGTGGCAC CTGGTATGAA CAGGCAGCAA 10260
GTGTCCCTGC TAGCTCAGAG GCTCTCAGGG GGGCCCGGCA GTGATCTGCA GAACCATGTG 10320
GCAGCTGGGA GTGGCCAGGA GCGGGGTGCC GGTGACTCCT CCCAGCCTCG TCCCAATCCA 10380
CCCACTTTTG CCCAGGGAGT AATCAATGAG GCTGACCAGC GGCAGTATGA GGAGTGGCTG 10440
TTCCATACCC AGCAGCTCCT ACAAATGCAA CTGAAGGTGC TAGAGGAGCA GATAGGGGTG 10500
CACCGCAAGT CCCGGAAAGC CCTGTGTGCC AAGCAGCGCA CTGCCAAGAA GGCTGGCCGG 10560
GAGTTCCCCG AGGCTGATGC TGAGAAGCTG AAGCTGGTTA CAGAACAACA GAGCAAGATC 10620
CAGAAACAGC TGGATCAGGT CCGGAAGCAG CAGAAGGAGC ACACTAACCT AATGGCAGAA 10680
TATCGGAATA AGCAGCAGCA GCAGCAACAG CAGCAGCAGC AGCAACAGCA GCAGCACTCA 10740
GCCGTACTTG CCCTTAGCCC TTCCCAGAGT CCCCGGCTAC TCACCAAGCT TCCTGGTCAG 10800
CTGCTCCCAG GCCATGGGCT GCAGCCACCT CAAGGACCCC CTGGGGGCCA AGCTGCAGGC 10860
CTTCGCCTGA CCCCTGGGGG CATGGCCCTA CCTGGACAGC CTGGTGGCCC CTTTCTCAAC 10920
ACCACCCTGG CCCAACAGCA GCAACAGCAA CATTCTGGAG GAGCTGGGGC CTTGGCTGGC 10980
CCCTCAGGGG GCTTTTTCCC TGGCAACCTT GCTCTTCGAG GCCTGGGACC TGACTCGAGG 11040
CTTTTACAGG AAAGGCAGCT GCAGCTCCAA CAGCAGCGCA TGCAGCTGGC CCAGAAACTG 11100
CAACAGCAGC AGCAGCAGCA TCTCCTAGGA CAGGTGGCAA TCCAGCAGCA ACAGCAGCAG 11160
GGCCCGGGAG TACAGGCAAA CCAGGCTCTG GGTCCCAAGC CCCAGGGCCT TCTGCCTCCC 11220
AGCAGCCACC AGGGCCTCTT GGTCCAGCAG CTGTCCCCGC AACCACCCCC GGGACCCCAG 11280
GGCATGCTGG GCCCTGCCCA GGTTGCAGTG TTGCAGCAGC AGCAACCACA CCCTGGAGCT 11340
TTGGGCCCCC AGGGCCCTCA CAGACAGGTG CTCCTGACCC AGCCACGGGT GCTAAGTTCC 11400
CCCCAGATGG CACAGCAGGG TCAGGGCCTT ATGGGACACC GGCTGGTCAC ATCCCAGCAG 11460
CAGCAGCAGC AACAGCACCA ACAGCAAGGA TCCATGGCTG GGCTTTCCCA TCTTCAACAG 11520
GGTCTGATGC CACACAGTGG GCAGCCCAAA GTGGGCGCTC AGCCCATGGC TGCCTTGCAG 11580
CAGCAACAGT TGCAACAGCA GCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GCTTCAACAG 11640
CAGCAGCAGA TAGGCCTCTT GAGCCAGAGT CGAGCTTTAC TGTCTCCTCA ACAGCAACAG 11700
CAGCAGCAAC AGATGACACT TGGCCCTGGC ATGCCAGCCA AGCCTCTGCA ACACTTTTCT 11760
AGCCCCGGAG CCCTGGGCCC AACCCTTATC CTAACGGGCA AGGAACAAAG CATTGTAGAG 11820
ACAGCTCTTC CTTCAGAGGC CAGTGAGGGG TCCTCCACAC ATCAGGGAGG GCCCTTACCA 11880
ATGGGGACTG TACCAGAGTC CGTGGCCCCT GAACCAGGAG AGGTGAAGCC CTCACTGTCT 11940
GGGGACTCAC AACTCCTTCT TGTCCAGCCC CAGGCCCAGC CTCAGCCCAA CTCTCTGCAG 12000
CTGCAGCCAC CTCTGAGGCT CCCAGGACAA CAGCAGCCGC CGGTTAACTT GCTCCACACA 12060
GCAGGCGCAG GAAGCCATGG GCAACCAGGC AGTGGATCAT CTGAGGCCTC GTCTGTGCCC 12120
CACCTACTGG CCCAATCCTC TGTTTCCTTA GGGGAACAGC CTGGATCCGT GACCCAGAAC 12180
CTTCTGAGCT CCCAACAGCC CCTTGGACTA GAGCGGCCCA TGCAAAATAA CATAGGGCCA 12240
CAGCCTGCCA AGCCGGGATC TGTCCCGCAG TCTGGGCAGA GCCTGCCAGG GGCTGGGGTC 12300
ATGCCTACAG TGGGTCAGCT TCGGGCACAG CTCCAAGGAG TCCTGGCCAA AAACCCACAG 12360
CTGCGGCATT TGAGTCCTCA GCAGCAGCAG CAGCTCCATG CACTTCTCAT GCAGCGGCAG 12420
CTACAGCAGA GTCAGGCAGC ACGCCAGGCC CCACCATACC AGGAGCCTGG GACCCAGCCC 12480
TCTCCCCTCC AAGGCCTCCT GGGCCGCCAG CCCCAACTTG GGGGCTTCCC TGGATCCCAG 12540
ACAGGCCCTC TTCAGGAGCT AGGGGCCGGG CTTCGACCTC AGGGCCCACC CCGACTCCCC 12600
GCCCCACAAG GAGCCTTATC CACAGGACCA GTTCTTGGCC CTGTCCATCC CACACCTCCA 12660
CCATCCAGCC CCCAAGAGCC AAAGAGACCT TCCTCCCAAT TACCTTCCCC TAGCTCCCAG 12720
CTCCCCTCAG AGGCCCAGCT CCCTACCACC CAGCCAGGAA CCCCCAAGCC CCAGGGGCCA 12780
CCCTTGGAGC TTCCTCCTGG GAGGGTCTCA CCTGCTGCTG CCCAGCTTGC GGATACCTTC 12840
TTTGGCAAGG GACTGGGACC TTGGGACCCC CCAGACCACC TAGCGGAAGC CCAGAAGCTG 12900
GAGCAAAGCA GCCTGGTACC TGGGCATCTG GACCAGGTGA ATGGGCAGGT GGTACCTGAG 12960
CCACCGCATC TCAACATCAA GCAGGAGCCT CGGGAAGAGC CGTGTGCCCT GGGGTCCCAG 13020
GCGGTGAAGA GGGAGGCCAA CGGGGAGCCT GTGGGGGCAC CAGGTACCAG CAACCACCTC 13080
CTGCTGGCAG GGCCCCGCTC AGAAGCTGGG CATCTGCTCT TGCAGAAGCT TCTACGAGCA 13140
AAGAATGTGC AACTCAGCAC TGGGCGGGGG CCTGAGGGGC TGCGAGCTGA GATCAACGGG 13200
CACATTGACA GCAAGCTTGC TGGGCTGGAG CAGAAACCAC AGGGTACCCC CAGCACCAAG 13260
GAGGATACAG CAGCAAGGAA GCCTTTGACA CCGAAGCCCA AGCGGGTACA GAAGGCAAGC 13320
GACAGGTTGG TGAGCTCCCG AAAGAAGCTG CGGAAGGAGG ACGGGGTCAG GGCCAGCGAG 13380
GCCTTGCTGA AACAGCTGAA ACAGGAGCTG TCCCTGTTGC CCCTCACGGA GCCTACCATC 13440
ACCGCCAATT TTAGCCTCTT TGCTCCCTTT GGCAGCGGCT GCCCGATCAG TGGGCAGTGC 13500
CAGCTGAGGG GGGCCTTTGG AAGTGGGGTG CTGCCCACGG GCCCTGACTA CTATTCCCAG 13560
CTGCTTACCA AGAATAACCT GAGTAACCCG CCGACACCAC CCTCGTCGCT GCCCCCCACC 13620
CCACCCCCAT CGGTGCAGCA GAAGATGGTT AATGGCGTCA CTCCATCCGA AGAGCTGGGG 13680
GAGCACCCCA AGGATGCCGC CTGTGCCCGG GATACTGAAG GGGTGCTGAG GGATGCTTCA 13740
GAAGTGAAAA GTCTAGACCT GCTGGCCGCT TTGCCTACCC CCCCTCACAA TCAGACTGAG 13800
GATGTCAGGA TGGAGAGTGA TGAGGACAGC GATTCTCCTG ACAGCATCGT GCCAGCTTCA 13860
TCCCCTGAGA GCATCCTGGG GGAGGAGGCT CCCCGTTTCC CTCAGCTGGG CTCAGGCCGG 13920
TGGGAGCAGG ATGACCGGGC TCTCTCCCCC GTCATCCCTA TCATTCCTCG GGCCAGCATT 13980
CCAGTCTTCC CAGAGAGCAA ACCTTATGGA GTCTTGGATC TGGAGACCAC CAGGAAGCTG 14040
CCTGCCCCAA CTTGGGAAAA GGGCAAAGGA AGCGAGGTGT CAGTCATGCT GACGGTTTCT 14100
GCTGCTGCCG CCAAGAACCT GAATGGCATG ATGGTGGCAG TGGCAGAGCT GCTGAGCATG 14160
AAGATCCCCA ATTCCTATGA GGTGCTGTTC CCAGAGAGCC CTGCCCGGGC TGGCATCGAG 14220
CCTAAGAAGG GGGAGGCTGA GGGCCCTGGT GGGAAAGAAA AGAGTCTGGG AGGCAAGAGC 14280
CCAGAGGCTG GCCCTGATTG GCTGAAGCAG TTTGATGCTG TGTTGCCTGG CTATACACTC 14340
AAGAGTCAGT TAGACATCTT GAGTCTCCTC AAACAGGAGA GCCCTGCCCC AGAGCCCCCC 14400
GCCCAGCACA GCTACACCTA CAACGTCTCC AACCTGGATG TGCGACAGCT CTCGGCCCCA 14460
CCTCCTGAAG AACCCTCCCC ACCTCCTTCC CCTCTGGCAC CCTCTCCTGC CAGCCCCCCT 14520
GCTGAACCCT TGGTTGAACT TCCAGCTGAA CCCTCAGCTG AGCCACCCAT CCCCTCGCCT 14580
CTGCCACTGG CCTCATCCCC TGAATCTACC CGGCCCAAGC CCCGAGCCCG GCCTCCTGAA 14640
GAAGGTGAAG ATTCCCGGCC CCCTCGCCTC AAGAAGTGGA AGGGGGTGCG CTGGAAGCGG 14700
CTCCGGCTGC TGCTGACTAT CCAGAAGGGT GGTGTGCGGC AGGAGGATGA GCGGGAAGTG 14760
GCTGAGTTCA TGGAACAGCT CGGCACAGCC TTGCGACCTG ACAAGGTGCC TCGAGACATG 14820
CGGCGCTGCT GCTTCTGTCA TGAGGAGGGG GATGGGGCCA CGGATGGGCC TGCCCGCCTG 14880
CTGAACCTGG ACTTGGACTT GTGGGTGCAT CTCAACTGTG CCCTGTGGTC CACAGAGGTA 14940
TATGAGACCC AGGGCGGGGC GCTGATGAAC GTGGAGGTTG CCCTGCACCG AGGCCTGCTC 15000
ACCAAGTGCT CCCTGTGCCA GCGCACTGGT GCCACCAGCA GCTGCAATCG AATGCGTTGC 15060
CCCAACGTCT ACCATTTTGC CTGCGCCATC CGCGCCAAGT GCATGTTCTT CAAGGACAAG 15120
ACCATGCTAT GTCCAATGCA TAAGATCAAG GGGCCCTGTG AGCAGGAGCT GAGCTCTTTT 15180
GCTGTCTTCC GGAGGGTCTA CATTGAGCGG GATGAGGTGA AGCAGATTGC CAGCATCATC 15240
CAGCGGGGAG AGCGGCTGCA CATGTTCCGG GTGGGGGGCC TTGTGTTCCA CGCCATCGGA 15300
CAGCTGCTGC CCCACCAGAT GGCCGACTTC CACAGTGCCA CTGCCCTCTA TCCAGTGGGC 15360
TATGAGGCCA CGCGCATCTA CTGGAGCCTC CGCACTAACA ACCGCCGCTG CTGCTACCGC 15420
TGCTCCATCG GCGAGAACAA TGGGCGGCCG GAGTTCGTGA TCAAAGTCAT GGAGCAGGGC 15480
CTGGAGGACA TGGTCTTCAC GGACGCCTCT CCGCAGGCCG TTTGGAATCG CATCATTGAG 15540
CCTGTGGCAG CCATGAGGAA AGAGGCCGAC ATGCTGCGTC TCTTCCCTGA GTACCTGAAA 15600
GGCGAGGAGC TCTTTGGGCT GACAGTGCAT GCCGTGCTGC GCATAGCTGA ATCACTGCCT 15660
GGAGTGGAAA GCTGTCAAAA TTATTTATTC CGCTATGGAC GCCACCCCCT GATGGAGCTA 15720
CCGCTCATGA TCAACCCCAC TGGCTGTGCT CGGTCAGAGC CTAAAATCCT CACACACTAC 15780
AAACGGCCCC ACACTCTGAA CAGCACCAGC ATGTCCAAGG CATATCAGAG CACCTTCACA 15840
GGAGAGACCA ACACCCCGTA CAGCAAGCAG TTTGTGCACT CCAAGTCATC TCAGTACCGG 15900
CGGCTGCGCA CTGAGTGGAA GAACAATGTC TATCTGGCTC GCTCCCGTAT CCAGGGCCTG 15960
GGTCTCTATG CAGCCAAGGA CCTAGAAAAG CACACAATGG TCATTGAGTA CATTGGCACC 16020
ATCATTCGCA ATGAGGTGGC CAACCGGCGG GAGAAAATCT ATGAGGAGCA GAATCGAGGC 16080
ATCTACATGT TTCGAATAAA CAATGAACAT GTCATTGATG CTACGTTGAC CGGAGGCCCC 16140
GCCAGGTACA TTAACCATTC TTGTGCCCCT AACTGTGTGG CGGAAGTTGT GACATTTGAC 16200
AAGGAGGACA AAATCATCAT CATCTCCAGC CGGCGAATCC CCAAAGGAGA GGAGCTGACA 16260
TATGACTATC AGTTTGATTT TGAGGACGAT CAGCACAAGA TCCCCTGCCA CTGTGGAGCC 16320
TGGAATTGTC GGAAATGGAT GAACTAA 16348
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 93 0.0 6098
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 88 0.0 5973
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 85 0.0 3879
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 93 0.0 3827
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 92 0.0 3784
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 92 0.0 3784
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 89 0.0 3715
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 93 0.0 3710
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 93 0.0 3697
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 93 0.0 3658
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 93 0.0 3649
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 93 0.0 3649
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 90 0.0 3648
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 93 0.0 3637
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 92 0.0 3637
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 85 0.0 3633
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 92 0.0 3623
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 91 0.0 3602
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 93 0.0 3593
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 92 0.0 3563
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 90 0.0 3539
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 86 0.0 3355
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 86 0.0 3347
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 87 0.0 3311
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 81 0.0 2588
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 92 0.0 2286
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 92 0.0 2144
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 99 0.0 2133
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 86 0.0 1926
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 85 0.0 1808
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 86 0.0 1611
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 77 0.0 1574
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 90 0.0 1531
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 90 0.0 1513
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 94 0.0 1380
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 85 0.0 1376
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 64 0.0 1319
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 52 0.0 1315
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 63 0.0 1298
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 65 0.0 1248
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 82 0.0 1189
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 98 0.0 1164
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 88 0.0 1115
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 83 0.0 1039
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 81 0.0 1025
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 81 0.0 1006
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 993
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 79 0.0 992
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 79 0.0 990
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 79 0.0 988
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 988
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 984
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 925
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 73 0.0 912
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 73 0.0 910
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 897
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 75 0.0 891
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 848
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 73 0.0 733
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 89 0.0 716
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 0.0 646
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 8e-173 607
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 2e-158 559
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 2e-97 357
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 73 8e-95 348
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 1e-92 341
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 3e-45 184
Created Date 25-Jun-2016