WERAM Information


Tag Content
WERAM ID WERAM-Fia-0003
Ensembl Protein ID ENSFALP00000000206.1
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSFALG00000000182.1 ENSFALT00000000208.1 ENSFALP00000000206.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 1.20e-45 154.1 5280 5395
Me_Reader PHD 9.60e-31 105.4 233 5019
Organism Ficedula albicollis
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88  
++++a+s+i+glgl+a+k+iek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSFALP00000000206.1 5280 NVYLARSRIQGLGLYAAKDIEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5364
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSFALP00000000206.1 5365 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5395
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51 
C +C+ ++e ++ +v C +C + fH+ C++++l++ + + w Cp+Ck
ENSFALP00000000206.1 233 RCSACDGPGELRD-LVLCTSCGQHFHGACLGISLTPRKRS-GWQCPECK 279
699**55555544.*******************9999977.8******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
+C++C+++++++ m+ C+ Cd+ +H++C++++ + lp g sw C++C
ENSFALP00000000206.1 280 VCQTCRQRGQDSA-MLVCEACDKGYHTSCMEPATQGLPTG-SWKCKNC 325
8****88888876.**************************.******* PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSFALP00000000206.1 1250 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1299
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCgk+++ ++ ++ Cd Cd +H++C+++pl+++p+g w C+ C+
ENSFALP00000000206.1 1301 VCEVCGKASDPSR-LLLCDDCDISYHTYCLDPPLQTVPKG-GWKCKWCV 1347
7****99999988.**************************.*****997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C++++ +++ ++qC +Cd+w H+ C +l ++ e+ + + C C+
ENSFALP00000000206.1 1378 VCPFCREKYVEDDLLIQCRHCDRWLHAACDSLFTEEEVEQaadEGFDCSACQ 1429
8****999999889***************99984443344443459999997 PP
PHD.txt 3 iClvCgkddegeke....mvqCdeCddwfHlkCvk 33
+C +C+++++g++ +++ d d w+Hl+C
ENSFALP00000000206.1 4914 CC-FCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4946
55.598888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+k+++ + C+ C+ +H C ++ ++k +Cp +k
ENSFALP00000000206.1 4973 KCSLCQKTGATN----SCNRirCPSVYHFACAIRAKCMFFKDKTMLCPLHK 5019
5999*7777776....5**999*********98886666677778999886 PP

Protein Sequence
(Fasta)
MDEEKVPSEE KDAAAPADGA MASEEMGSAE GDPLKPPEGA SAGPEPESRG MEGPPEAGSP 60
LGRRCALCNC GDWSLHGQRE LQRCEPVPDW PERLVGHEPP DGPGQPPPGP SQLPPEPELV 120
GDGLAQIGFS EGVTPAQLFE PTGHCWAHHW CAAWAAGAEP EVAGVARAVF SGISQPPPPP 180
PPPPPPPPPP PPPRSRLYHF PCAAASGCFQ SMKTLRLLCP EHLAEAVDME DARCSACDGP 240
GELRDLVLCT SCGQHFHGAC LGISLTPRKR SGWQCPECKV CQTCRQRGQD SAMLVCEACD 300
KGYHTSCMEP ATQGLPTGSW KCKNCWVCSD CGRHPSGLDS SCHWSPWSAV CGDCQQRRAT 360
ADVPTEPQHS PQPDPPAQIE PTDAPVPPPE LEPVGGDPEE ASPDPKEAPP GSAEPPPGEL 420
ASVEVLPSEG PPEDLPSAEL PPDQPPPDTQ PSDIQPPDVQ PPEEPLALML PPEEPPSEEL 480
PLETPSLAEL PPDRPSPEEL PPSEPPHNEL PLDVPPLAEL PPEKPAFEKS PSHELPPEVL 540
PIDKSPPDEL PPSESPPPTS RSSLSSTLRS CPPTSRPLRN YPPRSPPSRS YPPRSCPSRN 600
YPPRSPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP 660
PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPL 720
PLEELPPSEP PPIELLPEEP PPKALPPEKL SPEEPPVVEL PSEELVPSEL HPEEDLKPPG 780
VGVLHQGLLS ALALPDPRIP IFMCWCPVLG YVGTWGRVLP LNPRLEGTGL LPEVTVAEEA 840
PALPPPPTVA EEAPALAPEV LPEEEMELEP EPSLPPEMTP PPSAPPALRA ISPARTPAPS 900
PDAKEDEVLE PPTPALPEPP GSPGTAMDTD PPGSPPPPLG SPAVTLAPAE PPEDEEMEMA 960
GGEGESPASP PGDAPHGSPA PELGPFPSLP EEEEEEEEEA PRLEREGTSG SSPPLSEEDA 1020
LEEARAGGDP EVKGSPSGSP LLLEPEELGP ETPMEVCTTN EKLLGHGDEG CPEPPALRPR 1080
PDILNEISNL SQGDTSSSFP GSEPLLGSPD PEGGGSLSPE LGPASADASL QKEDAGSLPL 1140
GAETDDSLLF EPAAKGDGDK SRRRSSPGRS RVKQGRSSSF PGRRRPRGGS HGGRGRGRAR 1200
LKSTTSSVDT LALADVESSP SKEEDEDDDD TMQNTVVLFS NTDKFVLMQD MCVVCGSFGR 1260
GAEGHLLACS QCSQCYHPYC VNSKITKVML LKGWRCVECI VCEVCGKASD PSRLLLCDDC 1320
DISYHTYCLD PPLQTVPKGG WKCKWCVCCV QCGAANPGFH CEWQNNYTHC APCASLVVCP 1380
FCREKYVEDD LLIQCRHCDR WLHAACDSLF TEEEVEQAAD EGFDCSACQP YVVKPAVPPP 1440
SAEMIKAKDP EPQYFRFEGV WLTETGMAVL RNLTLSPLHK RRQRKGRPGT LNGDGGLEGG 1500
DPLGPEDKKD GDLDTDELLK AEVGVEHMEC EIKLEAPASP DRDVGADGDS GKGLEDPEEC 1560
KKRKRKPYRP GIGGFMVRQR KSHTRLKKVS AAPPDVGREG LSAEGHPEEG APGDVPAEAG 1620
LDPGSGEGDE KKKRRGRKKS KLEDMFPPYL QEAFFGKALM DLSRKALLAA GGVGDGAARP 1680
SLGQGAPRPK GDLSLTGALQ GGTLDRRETP THPRGDDSTD GSAAAPDDDG KDDAKAEDLG 1740
ADEPKDSPDR GDTEKPATPG EGALSSDLDK IPTEELPKME SKDVQQLFKD VLGSAERGEQ 1800
PLNCVAAGME AGQDPSRAQR PFLQGSVSLG SLSGSVSLDS YSGVCQSPFL DNRERSGFFS 1860
PDHCEPESPW ASSSAATTPS TPTTPTTEGE GDGLSYNQRS LQRWEKDEEL GELSTISPVL 1920
YANMNFPNLK QDYPDWSSRC KQIMKLWRKV PATDKAPYLQ KAKDNRAAHR INKVQKQAES 1980
QINKQTKGEG LRKPERPSLH LRIPVPSGAQ PVYISSPPAA GEGFLKPPAG AGGGPESPSE 2040
LFLKLPPQSP AQVPSHDPYG AASAFSLEPR FPSPLGQSPT PFLPTPPGTP RHQPGTPDPF 2100
LKPRCPSLDN LSVPGSPGAR PPEALLSPLP FGEQKKGLEV KKEDGGSLGV CSPGYTSAMG 2160
YGDSPGGPHL SAAELKAADV FKAPMTPRVS QVEPQSPGLG HRPPPPPPPP PPPPPPPPPP 2220
PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP 2280
PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP PPPPPPPPPP 2340
PPPPRGGFGG APAGDPHAKA AAGPQPPFAR SPGASVFAGS QPPMRFTFPP AVSEPLKGSP 2400
SHQPHGINSH YGSGKPQSSA YASSPSFHQA SSPLGPGTAA HDSYSLSPLR PPSVLPPQPP 2460
APPQQQDPSG AFLPRAALGL TADKREEVAA GLSAPPNRDL AELPGGQDGA LGTMSQSELE 2520
KQRQVRPLLR SSCIRLGLGE TESASGCGSC SSAAASSSPA AWAPEAGGQS FELSRGMAPY 2580
QPAQDKALLG TLATAAGAGK LPGSVMVQAA FAQDERLSRP PPAATPAMLD ISGRAAVGPP 2640
QAFYPRGVFP SPSPAAPCHL PLRPPGHWDP HGQPGPPPPP PPPPATGTPM GSLGTRLPTP 2700
AEAVPPSPAA LGAPFIELRH NVQKGPGGVV GSPFAPQAPR PRFFLPGEVP APLPPASPQG 2760
AEMGNSHQQP HAKAAPPLPA PTLELQQHIP PRPLSSGATL RPEGPKDGPA APDPVPAPGK 2820
SHLSLEGGRL PCEVPVEPDL DDDFGSHKDL EDDDDLANLS LDPDVAKGDD DLDNLDSLET 2880
ADPHLDDLLN GDEFDLLAYT DPELDTGDKK DIFNEHLRLV ESANEKAERE AQLKTEPPAP 2940
TLKLPDPGAQ GVDAKAGSLP ADVKPKLEDG CLKTSPCQFT TSGPERLPVP VNASLGLSVK 3000
PGQTLLSSST GPTRLSLTQF SNSGHAGVPV LEKDPYTGPG RGLATAQLGG QSNPLLEKFE 3060
LDSGALGLGS ARHSPADDLD KMESSLVASE LPLLIEDLLE HEKKELQKKQ QMSAHLPRGT 3120
PHLPAPGHPM LHTGASGQPP EGQPPRLGTP QTPLQLGLVA RPPLLPPQPP RLNAPQQGPA 3180
LAPHMGMGSG QPHVLSSPHG VQVALGQPQQ SQQQVLVQKP MAGVQPPTLG LKPPQLVMQQ 3240
QLANSFFPDT DLDKFAAEDI MDPIAKAKMV ALKGIKKVMA QGSIGVAPGM NRQQVSLLAQ 3300
RLSGGPAVSE MQNHLLAGSG QERSGTDPTQ ARPNPPTFAQ GVINEADQRQ YEEWLFHTQQ 3360
LLQMQLKVLE EQIGAHRKSR KALCAKQRTA KKAGREFPEA DAEKLKLVTE QQSKIQKQLD 3420
QVRKQQKEHT NLMAEYRNKQ QQHQHQASAV MALSPSQSPR LMSKLPGQLL SAHGVQQPGG 3480
ALVGAQGLGQ QPGQPGGLRL PQGGVSMAVQ QGLSFMGQQP VGNAPGPGSS GAFFSGNPAL 3540
RGLAADSRLM QERQLQRMQL AQKLQQQNML GQVSLQQQPG VMGQTSMQQP GVMGQVSLQQ 3600
PGVMGQAAMQ QQGVMGQTSM QQPGVMGQAA MQQSGVMGQA SMQQSGVMGQ TSLQQSGVMA 3660
QASMQQQGVM NQTSMQQPGV MGQASLQQPG VMAQASMQQP GVMGQTSMQQ QGVMGQASMQ 3720
QPGVIGQSSM QPQSIMAQTS MQQPGVMGQA SMQQAGVLAQ TSLQQPGMMG QASMQQPGVL 3780
GQAPVQSTAV GQPGLLGQQQ QQPPAPQPGT GAQPMVPKQP GLLGSQQSLL VQQLSPQQQP 3840
GHGAGGQAPS VLHLSQGMQP APAVLAAKDQ PTGMDGSALH AEGGESGGQL HGEQQGALNP 3900
PGLGGPEPPA PKHQAELGQG QQLLLSSPQP AVMRAAPGQA GALLPPPLQS PGAVHPELSQ 3960
QHGDTAGVAE QPLGCGTGPA QAMVPAPRPP EQPGKGVAGP LPGQPQPQPL RLAPPPPPPP 4020
PPPPPPPPPP PPPPPPPPTK SGRRVPGPVM GQIRAQLQGV LAKNPQLRHL TPQQQQQLQA 4080
LLVQRHQQSL LQQNQALRTP GPFPGQAPDY SLPATRQPPR PMGSLFQPRP GAPAEAQASS 4140
ATEPGRVLPG QPPQQPLAEL VQAALAARGP QPGLIRLPTP PAPSPLGCAP QHLASPEQSP 4200
QEPKKTSPGV PALAPSPAAP TEPGLTPTPS AALPDPAHGP WDPDAPGADP ADPAETHING 4260
AAPEGAPAGG EAPPVLVKQE PKEEAAAAAQ CEVDGGETAK PEANGDPGDS QANSLLAPGG 4320
RSEAGHLLLQ KLLRAKNVQL SAQSPGELNG HVENRSTGPE PRPQSLLLGR EDPSIARKPA 4380
PTKPKRVQKG NERIPASRKK LRKDEGLRPG EALMKQLKQE LSLLPLTEPT IMANFSLFAP 4440
FGSSPINGKS QLRGAFGSAV LDSVPDYYSQ LLTKNNLSNP PTPPSSLPPT PPPSVQQKMV 4500
NGVTAPEELG EQPKDADPAR EPHGQKDAPA VEVKSLDLLA ALPTPPHNQT EDVRMESDDE 4560
SDSPDSIVPA SSPESVLGEE LLRFPLLSEA KPELEERVLS PIIPIIPRAS IPVFPDTKPY 4620
EVAEPFGAPP GKVGAAGPGA PWEKGKSSEV SVMLTVSAAA AKNLNGVMVA MAELLSMKIP 4680
SSYEVLFPDG PVRAAVVEAK RVETDMAGML GGKEKALAGK VPDSSSEWLK QFDAVLPGYS 4740
LKGELDLLTL LRQESPAPEK TLHHCYVNNV SNLDVRQLSV LPQEPSPPLS PSVPSPSSPA 4800
EPTRVPDPEA SREAAPAPLS PLPPAPSPAP QEEGATAALS PPRFKPRSRP PEDGDEARPR 4860
LKKWKGVRWK RLRFLVTIQK GGAKRDGDKE IAEFIDKLGT TLRPEKVPQD LRKCCFCHEE 4920
GDGATDGPAR LLNLDLDLWV HLNCALWSTE VYETQGGALI NVEVALHRGL LTKCSLCQKT 4980
GATNSCNRIR CPSVYHFACA IRAKCMFFKD KTMLCPLHKL KGPCEQELSS FTVFRRVYIE 5040
RDEVKQIASI IQRGERLHMF RVGGLVFHAI GQLLPHQMAD FHSVTALYPV GYEATRIYWS 5100
LRTNNRRCCY RCTICENNGR PEFVVQVIEQ GLEDLVFSDS SPQAVWNRII EPVAMMRKEA 5160
DMLRLFPEYL KGEELFGLTV HAVLRIAESL PGVESCQNYL FRYGRHPLME LPLMINPTGC 5220
ARSEPKILTH YKRPHTLNST SMSKAYQSTF TGETNTPYSK QFVHSKSSQY RRLKTEWKNN 5280
VYLARSRIQG LGLYAAKDIE KHTMVIEYIG TIIRNEVANR REKIYEEQNR GIYMFRINNE 5340
HVIDATLTGG PARYINHSCA PNCVAEVVTF DKEDKIIIIS SRRIPKGEEL TYDYQFDFED 5400
DQHKIPCHCG AWNCRKWMN 5419
Nucleotide Sequence
(Fasta)
ATGGACGAAG AGAAGGTGCC CAGCGAGGAG AAGGACGCAG CCGCGCCAGC CGATGGGGCG 60
ATGGCCTCGG AGGAGATGGG CAGCGCCGAG GGGGACCCCC TGAAACCCCC CGAAGGAGCC 120
TCTGCGGGGC CAGAGCCGGA GAGCAGGGGC ATGGAGGGAC CTCCCGAGGC TGGCAGCCCG 180
CTGGGCCGCC GCTGTGCCCT CTGCAACTGT GGGGACTGGA GCCTACACGG GCAGCGGGAG 240
CTGCAGCGCT GCGAGCCGGT GCCCGACTGG CCCGAGCGGC TGGTGGGCCA CGAGCCCCCT 300
GATGGGCCTG GCCAGCCTCC CCCAGGGCCC AGCCAGCTCC CCCCAGAGCC AGAGCTGGTG 360
GGGGACGGCC TGGCACAGAT TGGCTTCTCC GAGGGGGTGA CCCCAGCGCA GCTCTTCGAG 420
CCGACAGGGC ACTGCTGGGC GCACCACTGG TGTGCGGCCT GGGCGGCTGG TGCAGAGCCG 480
GAGGTGGCCG GCGTGGCCAG AGCTGTTTTC TCAGGGATCT CACAGCCCCC CCCCCCCCCC 540
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC GCTCCCGCCT CTACCACTTC 600
CCCTGCGCCG CCGCCAGCGG CTGCTTCCAG TCCATGAAGA CCCTGCGGCT GCTCTGCCCC 660
GAGCACCTGG CTGAGGCTGT GGATATGGAG GACGCGCGGT GCTCGGCGTG CGACGGGCCG 720
GGCGAGCTGC GGGACCTGGT GTTGTGCACC AGCTGCGGGC AGCACTTCCA CGGGGCCTGC 780
CTGGGCATCT CCCTGACGCC TCGCAAGCGC TCGGGCTGGC AGTGCCCCGA GTGCAAAGTG 840
TGCCAAACCT GCAGGCAGCG CGGCCAGGAC TCTGCCATGC TGGTGTGCGA GGCGTGTGAC 900
AAGGGCTACC ACACCTCCTG CATGGAGCCA GCCACCCAGG GCCTCCCCAC CGGCTCCTGG 960
AAGTGCAAGA ACTGCTGGGT CTGCTCAGAC TGTGGCCGGC ACCCCTCGGG GCTCGACTCC 1020
AGCTGCCACT GGTCCCCGTG GTCAGCGGTG TGTGGGGACT GCCAGCAGCG CCGTGCCACG 1080
GCCGATGTCC CCACGGAGCC ACAGCACTCA CCACAGCCTG ACCCGCCCGC CCAGATCGAG 1140
CCGACGGATG CGCCCGTGCC CCCGCCTGAG CTGGAGCCCG TGGGGGGTGA CCCCGAAGAG 1200
GCATCCCCAG ACCCCAAGGA GGCTCCCCCA GGCAGTGCTG AGCCTCCTCC TGGCGAGCTG 1260
GCCTCTGTGG AGGTGCTCCC CAGTGAGGGA CCCCCTGAAG ACCTGCCCTC TGCTGAGCTG 1320
CCCCCCGATC AGCCGCCCCC CGACACGCAG CCCTCTGACA TTCAGCCTCC TGATGTGCAG 1380
CCCCCCGAGG AGCCCCTTGC TCTCATGCTG CCCCCCGAGG AGCCTCCCTC AGAGGAGCTG 1440
CCCCTCGAGA CGCCATCCCT TGCTGAGCTG CCCCCTGACA GACCCTCCCC TGAGGAGTTG 1500
CCCCCCAGTG AACCACCCCA CAACGAGCTG CCCCTCGACG TCCCTCCCCT TGCCGAGCTG 1560
CCCCCTGAGA AACCAGCGTT TGAAAAGTCA CCCTCCCATG AGCTGCCCCC TGAAGTATTG 1620
CCCATTGACA AATCACCCCC TGACGAGCTG CCCCCCAGTG AGTCACCTCC CCCCACGAGC 1680
CGCAGCTCGC TGAGCTCCAC CTTGAGGAGC TGCCCCCCAA CGAGCCGCCC CTTGAGGAAC 1740
TACCCCCCAA GGAGCCCCCC CTCAAGGAGC TACCCCCCAA GGAGTTGCCC CTCGAGGAAC 1800
TACCCCCCGA GGAGCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 1860
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 1920
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 1980
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 2040
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 2100
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCGCTG 2160
CCCCTTGAAG AGCTGCCCCC CAGTGAACCA CCCCCCATTG AGCTGCTCCC CGAGGAACCT 2220
CCTCCCAAAG CGCTGCCCCC CGAGAAGCTG AGCCCTGAAG AGCCCCCCGT TGTCGAGTTA 2280
CCCTCTGAAG AGCTGGTCCC CAGTGAGCTG CACCCCGAGG AGGACCTGAA ACCACCAGGT 2340
GTGGGGGTCC TGCACCAGGG GCTTCTCTCA GCACTGGCTT TGCCTGATCC CCGTATTCCT 2400
ATTTTCATGT GCTGGTGCCC TGTTTTGGGG TATGTGGGGA CCTGGGGGAG GGTCTTGCCT 2460
CTCAATCCCA GGCTGGAAGG CACAGGTCTC CTCCCTGAGG TGACAGTGGC AGAGGAAGCT 2520
CCAGCCCTGC CCCCCCCCCC GACAGTGGCA GAGGAAGCTC CAGCCCTGGC TCCAGAGGTG 2580
CTCCCCGAGG AGGAGATGGA GCTGGAGCCA GAGCCCTCGC TGCCCCCTGA GATGACACCC 2640
CCACCCTCAG CTCCCCCAGC TCTCCGTGCC ATCTCACCTG CACGGACTCC AGCCCCCTCC 2700
CCTGATGCTA AGGAGGACGA GGTGCTCGAG CCCCCCACCC CAGCCCTGCC CGAGCCCCCT 2760
GGCAGCCCTG GCACCGCCAT GGACACCGAC CCCCCTGGCT CCCCACCGCC ACCCCTGGGC 2820
TCCCCTGCTG TCACCCTGGC CCCCGCTGAG CCGCCTGAGG ATGAGGAGAT GGAGATGGCC 2880
GGTGGCGAGG GGGAGTCCCC AGCCAGCCCC CCTGGAGATG CCCCCCATGG CAGCCCAGCC 2940
CCTGAGCTGG GGCCCTTCCC GAGCCTGCCT GAGGAGGAGG AAGAGGAGGA AGAAGAGGCG 3000
CCGAGGCTGG AGAGGGAGGG CACCAGTGGA AGCTCCCCGC CGCTCAGTGA GGAGGACGCG 3060
CTGGAAGAAG CCCGGGCTGG GGGGGACCCT GAAGTCAAGG GGTCCCCTTC GGGGTCACCC 3120
CTGCTCCTGG AACCGGAGGA GCTGGGCCCT GAGACCCCCA TGGAGGTGTG CACTACCAAC 3180
GAGAAGCTGC TGGGGCACGG GGACGAGGGG TGCCCCGAGC CCCCCGCGCT GCGCCCGCGG 3240
CCCGACATCC TCAACGAGAT CTCCAACCTG AGCCAGGGGG ACACGAGCAG CAGCTTCCCC 3300
GGCTCGGAGC CGCTGCTGGG GTCCCCGGAT CCTGAGGGTG GGGGGTCGCT GTCCCCTGAG 3360
CTGGGCCCGG CCTCGGCTGA TGCCAGCCTG CAGAAGGAGG ACGCTGGGTC GCTGCCCCTG 3420
GGTGCTGAGA CCGATGACTC GCTGCTCTTC GAGCCAGCAG CCAAAGGTGA CGGGGACAAG 3480
AGCCGGCGCC GCAGCTCTCC CGGGCGCTCC CGTGTCAAGC AGGGTCGCAG CAGCAGCTTC 3540
CCTGGGCGGA GGCGCCCCCG AGGTGGGTCC CACGGGGGCC GAGGCCGGGG CCGAGCGCGC 3600
CTGAAGTCGA CCACCTCGTC TGTTGACACC TTAGCACTCG CTGACGTGGA GAGCTCGCCC 3660
AGCAAGGAGG AGGACGAGGA CGATGATGAC ACAATGCAGA ACACGGTTGT CCTCTTCTCC 3720
AACACTGACA AGTTTGTGCT CATGCAGGAC ATGTGCGTGG TGTGCGGCAG CTTCGGGCGT 3780
GGGGCCGAGG GGCACCTCCT CGCCTGCTCC CAGTGCTCCC AGTGCTACCA CCCTTACTGT 3840
GTCAACAGCA AGATCACCAA GGTGATGCTG CTGAAGGGCT GGCGCTGCGT GGAGTGCATC 3900
GTGTGCGAGG TGTGCGGCAA AGCCTCCGAC CCCTCGCGCC TGCTGCTCTG CGACGACTGC 3960
GACATCAGCT ACCACACCTA CTGCCTGGAC CCGCCGCTGC AGACCGTGCC CAAGGGCGGC 4020
TGGAAGTGCA AGTGGTGCGT GTGCTGTGTG CAGTGCGGGG CTGCAAACCC CGGCTTCCAC 4080
TGCGAGTGGC AGAACAACTA CACGCACTGC GCGCCCTGCG CCAGCCTCGT CGTCTGCCCC 4140
TTCTGCCGCG AGAAGTACGT GGAGGACGAC CTGCTCATCC AGTGCCGGCA CTGCGATCGG 4200
TGGCTGCACG CGGCGTGTGA CAGCCTCTTC ACTGAGGAGG AGGTGGAGCA GGCTGCGGAT 4260
GAAGGCTTCG ACTGCAGCGC CTGCCAGCCC TACGTGGTCA AACCTGCAGT GCCCCCGCCT 4320
TCTGCAGAGA TGATCAAAGC CAAGGATCCA GAGCCCCAGT ATTTCCGCTT CGAGGGCGTG 4380
TGGCTGACGG AGACGGGGAT GGCCGTGCTG CGCAACCTGA CCCTGTCCCC CCTGCACAAG 4440
CGGCGGCAGC GCAAGGGCCG CCCGGGCACC CTCAACGGGG ATGGGGGGCT GGAGGGGGGC 4500
GACCCCCTGG GCCCTGAGGA CAAGAAGGAC GGTGACCTGG ACACCGATGA GCTGCTCAAA 4560
GCTGAGGTGG GTGTGGAGCA CATGGAGTGT GAGATCAAGC TGGAGGCTCC AGCCAGCCCT 4620
GACCGTGATG TTGGAGCTGA TGGAGACTCG GGGAAGGGGC TGGAGGACCC TGAGGAGTGC 4680
AAGAAGAGGA AGCGCAAACC TTACCGGCCT GGCATTGGCG GGTTCATGGT GCGGCAGCGC 4740
AAATCCCACA CGCGGCTCAA GAAGGTGTCG GCGGCGCCGC CGGACGTGGG GCGCGAGGGG 4800
CTGAGCGCCG AGGGACACCC TGAGGAGGGG GCTCCAGGTG ATGTCCCAGC CGAGGCTGGC 4860
CTGGATCCAG GCTCAGGAGA GGGGGACGAG AAGAAAAAGC GCCGGGGACG GAAGAAGAGC 4920
AAGTTGGAGG ACATGTTCCC CCCGTACCTG CAGGAAGCGT TTTTTGGGAA GGCACTGATG 4980
GACCTGAGCC GGAAGGCGCT GCTGGCAGCA GGCGGGGTGG GGGACGGGGC TGCCCGCCCC 5040
TCCTTGGGCC AGGGTGCCCC AAGGCCCAAG GGGGACCTCA GCCTCACTGG GGCACTGCAG 5100
GGGGGCACCT TGGACAGGAG GGAGACCCCC ACCCACCCCC GAGGGGACGA CAGCACTGAC 5160
GGCAGCGCCG CCGCTCCCGA TGATGATGGC AAGGACGACG CGAAGGCCGA GGATCTGGGG 5220
GCTGACGAGC CGAAGGATTC CCCGGACCGT GGGGACACCG AAAAACCAGC AACCCCGGGC 5280
GAGGGAGCGC TAAGCTCTGA CCTCGACAAG ATCCCCACGG AAGAGCTGCC CAAGATGGAG 5340
AGCAAAGACG TGCAGCAGCT CTTCAAGGAC GTGCTGGGCT CGGCGGAGCG CGGGGAGCAG 5400
CCCCTCAACT GCGTGGCCGC GGGGATGGAG GCCGGGCAGG ACCCCAGCCG AGCACAGCGC 5460
CCGTTCCTGC AAGGGAGCGT TTCCCTGGGC TCACTCTCCG GATCCGTCTC CTTGGACTCG 5520
TACTCGGGGG TCTGCCAGTC CCCATTCCTC GATAACAGGG AGCGGAGTGG CTTCTTCAGC 5580
CCAGACCACT GCGAGCCCGA GAGCCCCTGG GCCAGCAGCT CAGCTGCCAC CACCCCCTCG 5640
ACCCCCACGA CGCCCACGAC GGAGGGCGAG GGTGACGGGC TGTCCTACAA CCAGCGCAGT 5700
TTGCAGCGCT GGGAGAAGGA CGAGGAGCTG GGTGAGCTCT CCACCATCTC CCCCGTCCTC 5760
TACGCCAACA TGAACTTCCC CAATCTCAAG CAGGATTATC CAGACTGGTC GAGCCGCTGC 5820
AAACAGATCA TGAAGCTCTG GAGGAAAGTC CCTGCCACGG ACAAAGCCCC GTATTTGCAA 5880
AAGGCCAAAG ATAACCGGGC GGCTCATCGC ATCAACAAGG TGCAGAAGCA GGCCGAGAGC 5940
CAGATCAACA AGCAGACCAA AGGGGAGGGA CTGCGCAAGC CTGAGAGGCC CTCGCTGCAC 6000
CTGCGGATCC CGGTGCCGTC AGGGGCACAG CCCGTCTACA TCAGCAGCCC CCCCGCCGCC 6060
GGAGAGGGCT TCCTGAAGCC CCCAGCTGGT GCTGGAGGGG GACCAGAGTC TCCCAGCGAG 6120
CTCTTCCTCA AGCTCCCGCC CCAGTCCCCC GCCCAAGTGC CTTCGCACGA TCCCTACGGC 6180
GCGGCCTCCG CCTTCTCGCT GGAGCCTCGC TTCCCCTCGC CCCTGGGCCA GAGCCCCACC 6240
CCTTTCCTGC CCACCCCACC TGGCACCCCC CGGCACCAAC CTGGCACCCC TGACCCCTTC 6300
CTGAAGCCCC GGTGCCCTTC CTTGGACAAC CTCTCGGTGC CAGGGAGTCC TGGGGCGCGG 6360
CCCCCTGAGG CCCTGCTCTC TCCCCTGCCT TTTGGGGAGC AGAAGAAGGG TCTGGAGGTG 6420
AAGAAGGAGG ATGGGGGGAG CCTGGGGGTC TGCTCGCCCG GCTACACTTC TGCCATGGGC 6480
TACGGGGACT CTCCCGGTGG CCCCCACCTC TCGGCTGCGG AGCTGAAGGC GGCCGACGTC 6540
TTCAAAGCCC CCATGACCCC ACGGGTCTCT CAGGTAGAGC CGCAGAGCCC GGGGCTGGGG 6600
CACCGGCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 6660
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 6720
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 6780
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 6840
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 6900
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 6960
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC 7020
CCCCCCCCCC CCCGCGGAGG CTTTGGGGGC GCCCCGGCCG GCGACCCCCA CGCCAAGGCA 7080
GCGGCCGGGC CACAGCCGCC CTTTGCCCGC TCGCCCGGTG CCAGCGTGTT CGCAGGGAGC 7140
CAGCCCCCCA TGCGCTTCAC CTTCCCCCCG GCCGTCTCGG AGCCGCTCAA GGGCTCCCCG 7200
TCCCACCAGC CCCACGGGAT CAACAGCCAT TACGGCTCTG GGAAGCCCCA GAGCTCGGCC 7260
TACGCCTCGT CCCCCAGCTT CCACCAGGCC AGCAGCCCGC TGGGCCCTGG CACTGCCGCC 7320
CACGACTCCT ACAGCCTGTC GCCCCTGCGG CCCCCGTCGG TGCTGCCGCC GCAGCCCCCG 7380
GCCCCTCCGC AGCAGCAGGA CCCTTCTGGA GCCTTCCTGC CCCGCGCTGC GCTGGGACTG 7440
ACAGCTGACA AGAGGGAGGA GGTGGCTGCA GGGCTCTCGG CACCCCCAAA CCGGGACCTG 7500
GCAGAGCTGC CCGGTGGCCA GGACGGGGCT CTGGGCACCA TGAGCCAGTC AGAGCTGGAG 7560
AAGCAGCGGC AGGTGAGGCC TCTCCTGAGA TCCAGCTGTA TCCGTTTGGG TTTGGGGGAG 7620
ACTGAAAGCG CCAGCGGCTG CGGGAGCTGC TCATCTGCCG CCGCCTCCTC CTCGCCGGCT 7680
GCCTGGGCCC CTGAGGCCGG CGGGCAGAGC TTTGAGCTGA GCCGGGGCAT GGCCCCCTAC 7740
CAGCCAGCAC AGGACAAGGC TCTGCTCGGG ACTCTGGCCA CAGCAGCCGG TGCTGGGAAG 7800
CTGCCCGGCT CCGTCATGGT GCAGGCAGCG TTTGCACAGG ACGAGCGGCT GTCCCGGCCC 7860
CCGCCGGCGG CCACGCCAGC CATGCTGGAC ATCAGCGGGC GGGCGGCAGT GGGGCCCCCC 7920
CAGGCCTTCT ACCCCCGTGG GGTCTTCCCA TCGCCATCCC CTGCGGCTCC CTGTCACCTC 7980
CCGCTTCGCC CCCCCGGCCA CTGGGACCCC CATGGGCAGC CCGGGCCCCC CCCCCCCCCC 8040
CCCCCCCCCC CGGCCACTGG GACCCCCATG GGCAGCCTGG GCACTCGCCT GCCCACCCCT 8100
GCCGAGGCCG TGCCCCCCTC ACCCGCTGCC CTGGGCGCTC CCTTCATCGA GCTGCGACAC 8160
AACGTGCAGA AGGGACCCGG GGGTGTCGTG GGGTCTCCCT TTGCCCCCCA GGCACCCCGG 8220
CCTCGCTTCT TCCTGCCCGG GGAGGTTCCA GCACCGCTGC CACCTGCATC CCCGCAGGGA 8280
GCCGAGATGG GCAACAGCCA CCAGCAGCCC CACGCCAAGG CGGCCCCGCC GCTGCCAGCC 8340
CCCACCCTGG AGCTGCAGCA GCACATCCCC CCGCGCCCGC TGTCCTCCGG TGCCACCCTC 8400
CGCCCCGAGG GTCCCAAGGA TGGCCCTGCT GCTCCAGACC CTGTGCCTGC CCCAGGCAAG 8460
AGCCACCTGA GCCTGGAGGG GGGGCGGCTG CCCTGCGAGG TGCCCGTGGA GCCTGATTTG 8520
GATGATGACT TCGGCTCCCA CAAGGACTTG GAAGACGATG ATGATTTGGC CAACCTTAGC 8580
CTGGATCCAG ATGTGGCCAA AGGGGACGAT GACTTGGATA ATTTGGACAG CCTGGAGACA 8640
GCAGACCCCC ACCTTGATGA CCTGCTGAAT GGTGACGAGT TCGACCTGCT GGCCTACACT 8700
GACCCCGAGC TGGACACGGG TGACAAGAAG GACATCTTCA ACGAGCACCT GCGCCTGGTC 8760
GAGTCGGCCA ACGAGAAGGC AGAGCGTGAG GCCCAGCTCA AGACAGAACC GCCAGCGCCC 8820
ACCTTGAAGC TCCCCGACCC TGGGGCACAG GGTGTGGACG CAAAGGCTGG CTCGCTGCCC 8880
GCTGATGTCA AGCCCAAGCT GGAGGACGGC TGCCTGAAGA CCTCGCCCTG CCAGTTCACC 8940
ACCTCTGGTC CCGAGCGTCT CCCGGTGCCA GTCAATGCTT CCCTGGGCCT GAGCGTCAAA 9000
CCTGGCCAGA CCCTCCTGAG CTCCAGCACC GGCCCCACTC GCCTCAGCCT GACTCAGTTC 9060
TCCAACAGTG GCCACGCTGG CGTCCCCGTG CTGGAGAAAG ACCCCTACAC CGGCCCTGGC 9120
CGTGGGCTGG CAACCGCCCA GCTGGGTGGC CAGAGCAACC CCTTGCTGGA GAAGTTTGAG 9180
CTGGACAGTG GAGCCTTGGG GCTGGGCAGT GCCCGGCACT CGCCAGCAGA CGACTTGGAC 9240
AAGATGGAGA GCTCGTTGGT GGCCAGCGAG CTGCCGCTGC TCATCGAGGA CCTCCTGGAG 9300
CACGAGAAGA AGGAGCTGCA GAAGAAGCAG CAGATGTCAG CCCACTTGCC CCGTGGCACC 9360
CCCCACCTCC CAGCCCCGGG GCACCCCATG CTGCACACAG GGGCATCTGG GCAGCCTCCT 9420
GAGGGGCAGC CACCCCGTCT GGGCACCCCA CAGACACCCC TCCAGCTGGG GCTGGTGGCC 9480
CGGCCACCGC TGCTGCCGCC ACAGCCCCCC CGGCTCAACG CACCCCAGCA GGGCCCAGCT 9540
CTGGCCCCCC ACATGGGCAT GGGCTCTGGG CAGCCCCATG TGCTGTCATC CCCCCACGGG 9600
GTGCAGGTGG CCCTGGGGCA GCCGCAGCAG AGCCAGCAGC AGGTGCTGGT GCAGAAGCCA 9660
ATGGCCGGGG TACAGCCCCC CACCCTGGGC CTGAAGCCCC CTCAGCTCGT CATGCAGCAG 9720
CAGCTGGCCA ACAGCTTCTT CCCAGACACA GACCTGGACA AGTTTGCGGC CGAGGACATC 9780
ATGGACCCCA TTGCTAAGGC CAAGATGGTG GCACTGAAGG GCATCAAGAA GGTGATGGCT 9840
CAGGGCAGCA TTGGGGTGGC CCCAGGCATG AACAGGCAAC AGGTTTCCCT TCTGGCCCAG 9900
CGGCTCTCGG GGGGCCCGGC TGTCTCTGAG ATGCAGAACC ACTTACTGGC AGGGAGTGGG 9960
CAGGAGCGGA GCGGCACAGA CCCGACCCAG GCTCGCCCCA ACCCACCCAC CTTTGCACAG 10020
GGCGTCATCA ACGAGGCTGA CCAGCGGCAG TACGAGGAGT GGCTGTTCCA CACGCAGCAG 10080
CTGCTGCAGA TGCAGCTGAA GGTGCTGGAG GAGCAGATCG GGGCGCACCG CAAGTCGCGC 10140
AAGGCGCTGT GCGCCAAGCA GCGCACGGCC AAGAAAGCCG GGCGCGAGTT CCCCGAGGCC 10200
GACGCCGAGA AGCTGAAGCT GGTGACGGAG CAGCAGAGCA AGATCCAGAA GCAGCTGGAC 10260
CAGGTGCGGA AGCAGCAGAA GGAGCACACC AACCTCATGG CTGAGTACCG CAACAAGCAG 10320
CAGCAGCACC AGCACCAGGC CTCAGCCGTG ATGGCCCTGA GCCCCTCGCA GAGCCCCCGC 10380
CTCATGTCCA AGCTCCCAGG ACAGCTCCTG TCAGCACACG GCGTGCAGCA GCCCGGGGGA 10440
GCCCTGGTGG GAGCACAGGG CCTGGGGCAG CAGCCAGGAC AGCCTGGGGG GCTGCGCCTG 10500
CCCCAGGGCG GTGTGTCCAT GGCTGTGCAG CAGGGCCTGT CCTTCATGGG GCAGCAGCCC 10560
GTGGGGAATG CCCCAGGACC TGGCTCCTCT GGCGCTTTCT TCAGCGGGAA CCCCGCGCTG 10620
CGTGGGCTGG CCGCTGACAG CCGCCTGATG CAGGAGCGGC AGCTCCAGAG GATGCAGCTG 10680
GCACAGAAAC TGCAGCAGCA GAACATGCTG GGACAGGTGT CACTGCAGCA GCAGCCAGGT 10740
GTGATGGGAC AGACGTCAAT GCAGCAGCCA GGTGTGATGG GACAGGTGTC GCTGCAGCAG 10800
CCAGGTGTGA TGGGACAGGC GGCCATGCAG CAACAGGGCG TGATGGGACA AACATCAATG 10860
CAGCAGCCGG GTGTGATGGG ACAGGCAGCC ATGCAGCAAT CAGGTGTGAT GGGACAGGCA 10920
TCCATGCAGC AATCAGGTGT GATGGGACAG ACGTCACTGC AGCAATCAGG TGTCATGGCA 10980
CAAGCATCGA TGCAGCAACA GGGCGTGATG AATCAAACGT CCATGCAGCA GCCAGGTGTG 11040
ATGGGACAGG CATCGCTGCA GCAGCCAGGT GTGATGGCAC AGGCATCCAT GCAGCAGCCA 11100
GGTGTGATGG GACAGACGTC CATGCAGCAA CAGGGTGTGA TGGGACAGGC GTCCATGCAG 11160
CAGCCGGGTG TGATAGGACA GAGCTCGATG CAGCCACAGA GCATCATGGC ACAGACGTCC 11220
ATGCAGCAGC CAGGTGTGAT GGGACAAGCT TCAATGCAAC AGGCAGGTGT GCTGGCCCAG 11280
ACATCACTCC AGCAGCCGGG CATGATGGGC CAGGCATCCA TGCAGCAGCC AGGTGTGCTG 11340
GGCCAGGCCC CCGTGCAGTC CACAGCCGTG GGGCAGCCCG GCCTGCTGGG GCAGCAGCAG 11400
CAGCAGCCCC CAGCCCCACA GCCCGGCACA GGGGCTCAGC CCATGGTGCC GAAGCAGCCG 11460
GGGCTGCTGG GCAGCCAGCA GAGCCTCCTG GTGCAGCAGC TCTCACCCCA GCAGCAGCCA 11520
GGCCACGGGG CGGGGGGCCA GGCCCCCTCT GTGCTGCACC TGAGTCAGGG AATGCAGCCG 11580
GCCCCGGCGG TCCTGGCTGC CAAGGACCAG CCCACGGGGA TGGACGGCAG CGCCCTGCAC 11640
GCTGAGGGCG GTGAGAGTGG GGGGCAGCTG CACGGGGAGC AGCAGGGAGC CCTCAACCCC 11700
CCGGGCCTGG GGGGCCCGGA GCCCCCGGCC CCCAAGCACC AGGCTGAGCT GGGGCAGGGC 11760
CAGCAGCTCC TCTTGAGCTC CCCGCAGCCG GCTGTCATGC GGGCAGCCCC CGGGCAGGCA 11820
GGTGCCCTAC TCCCCCCGCC GCTGCAGAGC CCTGGGGCTG TTCACCCCGA GCTGTCCCAG 11880
CAGCATGGGG ACACGGCTGG GGTGGCAGAG CAGCCCCTGG GCTGTGGGAC GGGCCCAGCA 11940
CAGGCCATGG TGCCAGCACC GCGGCCCCCA GAGCAGCCGG GCAAGGGGGT GGCAGGGCCC 12000
CTGCCAGGGC AGCCACAGCC ACAGCCCCTG CGGCTGGCCC CCCCCCCCCC CCCCCCCCCC 12060
CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCCCCCCC CCCCACAAAG 12120
TCGGGCCGGC GTGTGCCAGG GCCAGTGATG GGCCAGATCC GGGCGCAGCT CCAGGGTGTC 12180
CTGGCCAAGA ACCCGCAGCT GCGGCATCTG ACGCCGCAGC AGCAGCAGCA GCTCCAGGCC 12240
CTCCTGGTGC AGCGGCACCA GCAGAGCCTG CTGCAGCAGA ACCAGGCACT ACGGACCCCT 12300
GGCCCCTTCC CCGGGCAGGC CCCGGACTAC AGCTTGCCTG CGACCCGCCA GCCCCCTCGT 12360
CCCATGGGGT CCCTTTTCCA GCCCCGGCCA GGCGCCCCAG CAGAGGCACA GGCCAGCTCT 12420
GCCACCGAGC CGGGGCGGGT GCTGCCAGGG CAGCCCCCCC AGCAGCCCCT GGCTGAGCTG 12480
GTTCAGGCAG CTCTGGCTGC CCGTGGCCCC CAGCCCGGCC TCATCCGCCT CCCCACGCCA 12540
CCAGCCCCCT CACCCCTGGG CTGTGCCCCC CAGCACCTGG CCAGCCCTGA GCAGAGCCCC 12600
CAAGAGCCAA AGAAGACGTC ACCGGGAGTC CCGGCCTTGG CCCCCAGCCC TGCGGCCCCC 12660
ACAGAACCAG GGCTGACTCC CACGCCAAGC GCTGCCCTTC CCGACCCAGC ACACGGCCCC 12720
TGGGACCCTG ATGCGCCTGG GGCGGACCCA GCTGACCCTG CTGAGACCCA CATCAATGGG 12780
GCTGCGCCAG AGGGGGCCCC TGCAGGAGGC GAAGCCCCCC CGGTGTTGGT GAAGCAGGAG 12840
CCGAAGGAAG AGGCGGCGGC AGCGGCACAG TGTGAGGTGG ATGGAGGCGA GACAGCGAAA 12900
CCAGAGGCCA ATGGGGACCC TGGGGACAGC CAAGCCAACA GCCTGCTGGC CCCAGGTGGG 12960
CGCTCTGAGG CTGGGCACCT GCTGCTGCAG AAGCTACTGC GGGCCAAGAA CGTGCAGCTG 13020
TCAGCGCAGA GCCCGGGCGA GCTCAATGGG CACGTGGAGA ACCGGAGCAC TGGGCCGGAG 13080
CCGCGGCCGC AGTCGCTGCT GCTGGGCCGG GAGGACCCCT CCATCGCCAG GAAGCCGGCA 13140
CCCACCAAGC CTAAGCGGGT GCAGAAAGGC AACGAGCGGA TCCCAGCGTC CCGCAAGAAG 13200
CTGCGGAAGG ATGAGGGGCT GCGTCCCGGC GAGGCCCTCA TGAAGCAGCT GAAGCAGGAG 13260
CTGTCGCTGC TGCCGCTGAC GGAGCCGACC ATCATGGCCA ACTTCAGCCT CTTCGCGCCG 13320
TTCGGCAGCA GCCCCATCAA CGGGAAGAGC CAGCTGCGGG GCGCCTTCGG CAGCGCCGTG 13380
CTCGACAGCG TCCCCGACTA CTACTCCCAG CTGCTCACCA AGAACAACCT CAGCAACCCA 13440
CCCACGCCAC CCTCCTCGCT GCCCCCCACA CCCCCTCCCT CCGTGCAGCA GAAGATGGTG 13500
AATGGTGTCA CGGCCCCTGA AGAGCTGGGG GAGCAGCCCA AGGATGCTGA CCCAGCCCGC 13560
GAGCCCCACG GCCAGAAAGA TGCACCAGCT GTGGAGGTGA AGAGCCTGGA CCTGCTGGCA 13620
GCGCTGCCCA CCCCTCCGCA CAACCAGACC GAGGACGTCA GGATGGAGAG TGACGACGAG 13680
AGCGACTCCC CCGACAGCAT CGTCCCTGCT TCATCCCCCG AGAGTGTCCT GGGCGAGGAG 13740
CTGCTCCGCT TCCCCCTCCT CAGCGAGGCC AAGCCAGAGC TGGAGGAGCG TGTGCTGTCA 13800
CCCATCATCC CCATCATCCC CCGGGCCAGT ATCCCAGTGT TCCCCGACAC AAAACCCTAC 13860
GAGGTCGCAG AGCCCTTTGG GGCTCCACCA GGGAAGGTGG GGGCAGCAGG CCCTGGAGCC 13920
CCATGGGAGA AGGGCAAGAG CAGCGAGGTC TCCGTCATGC TGACGGTGTC TGCAGCAGCT 13980
GCCAAGAACC TCAACGGGGT GATGGTGGCC ATGGCCGAGC TGCTGAGCAT GAAGATCCCC 14040
AGCTCCTACG AGGTGTTGTT CCCCGACGGC CCTGTGCGAG CAGCTGTGGT TGAGGCCAAG 14100
AGGGTGGAGA CTGATATGGC TGGAATGCTC GGAGGGAAGG AGAAGGCGCT GGCTGGGAAG 14160
GTGCCGGACA GCAGCTCCGA GTGGCTGAAG CAGTTCGATG CCGTGCTGCC AGGGTACAGC 14220
CTCAAGGGCG AGCTGGACCT CCTGACGCTG CTCAGACAGG AGAGCCCTGC TCCTGAGAAG 14280
ACCCTTCACC ACTGCTACGT CAACAACGTC TCCAACCTGG ACGTGCGGCA GCTCTCTGTC 14340
CTGCCCCAGG AGCCCTCGCC CCCGCTGTCC CCCTCCGTTC CCTCCCCGTC CAGCCCCGCT 14400
GAGCCCACCA GGGTCCCTGA CCCCGAGGCG TCCCGCGAGG CAGCCCCAGC CCCATTGTCA 14460
CCGCTGCCAC CCGCCCCCTC GCCAGCCCCC CAGGAGGAAG GGGCCACGGC CGCCCTCTCG 14520
CCGCCCCGGT TCAAGCCGCG CTCGCGGCCC CCGGAGGACG GGGACGAGGC GAGGCCGCGG 14580
CTGAAGAAGT GGAAAGGGGT GCGCTGGAAA CGGCTGCGCT TCCTCGTCAC CATCCAGAAA 14640
GGGGGGGCCA AGCGCGACGG CGACAAGGAG ATCGCCGAGT TCATCGACAA GCTGGGCACC 14700
ACGCTGCGCC CCGAGAAGGT GCCGCAGGAC CTGCGCAAGT GCTGCTTCTG CCACGAGGAG 14760
GGCGACGGGG CCACGGACGG GCCGGCCCGC CTGCTCAACC TCGACCTTGA CCTCTGGGTC 14820
CACCTGAACT GCGCCCTGTG GTCCACGGAG GTGTACGAGA CTCAGGGAGG GGCTCTGATC 14880
AACGTGGAGG TGGCCCTGCA CCGCGGGCTG CTCACCAAGT GCTCCCTGTG CCAGAAAACC 14940
GGCGCCACCA ACAGCTGCAA CCGCATCCGC TGCCCCAGCG TGTACCACTT CGCCTGCGCC 15000
ATCCGCGCCA AGTGCATGTT CTTCAAGGAC AAGACCATGC TGTGCCCCCT GCACAAGCTG 15060
AAGGGGCCCT GCGAGCAGGA GCTCAGCAGC TTCACCGTGT TCCGCCGCGT CTACATCGAG 15120
CGGGACGAGG TGAAGCAGAT CGCCAGCATC ATCCAGCGCG GGGAGCGGCT CCACATGTTC 15180
CGCGTGGGCG GCTTGGTCTT CCACGCCATC GGGCAGCTGC TGCCGCACCA GATGGCCGAT 15240
TTCCACAGCG TCACCGCCCT CTACCCCGTG GGCTACGAGG CCACGCGCAT CTACTGGAGC 15300
CTGCGGACCA ACAACCGCCG CTGCTGCTAC CGCTGCACCA TCTGCGAGAA CAACGGGCGC 15360
CCCGAGTTCG TCGTGCAGGT CATCGAGCAG GGCCTGGAGG ATCTTGTGTT CTCCGACTCC 15420
TCGCCACAAG CTGTGTGGAA CCGGATCATC GAGCCCGTGG CGATGATGAG GAAGGAGGCC 15480
GACATGCTGC GGCTCTTCCC CGAGTACCTG AAGGGCGAGG AGCTCTTCGG CCTCACGGTG 15540
CACGCGGTGC TGCGCATCGC TGAGTCGCTG CCGGGTGTTG AGAGCTGCCA GAACTACCTG 15600
TTCCGCTACG GGCGCCACCC TCTGATGGAG CTGCCGTTGA TGATCAATCC CACCGGCTGC 15660
GCCCGCTCCG AGCCCAAGAT CCTCACCCAC TACAAGCGGC CCCACACCCT GAACAGCACA 15720
AGCATGTCCA AGGCCTATCA AAGCACATTC ACGGGCGAGA CCAACACGCC CTACAGCAAG 15780
CAGTTTGTGC ACTCCAAGTC CTCCCAGTAC CGGCGCCTGA AGACGGAGTG GAAGAACAAC 15840
GTGTACCTGG CACGGTCCCG CATCCAGGGG CTGGGGCTTT ATGCAGCCAA GGACATCGAG 15900
AAGCACACGA TGGTCATCGA GTACATCGGC ACCATCATCC GCAACGAGGT GGCCAACAGG 15960
CGGGAGAAGA TCTATGAGGA GCAGAACCGT GGCATTTACA TGTTCCGCAT CAACAACGAG 16020
CACGTCATTG ACGCCACGCT GACGGGGGGC CCGGCCAGGT ACATCAACCA CTCGTGTGCC 16080
CCGAACTGTG TGGCCGAAGT CGTGACCTTC GACAAGGAGG ACAAGATCAT CATCATCTCC 16140
AGCCGGCGCA TCCCCAAGGG GGAGGAGCTC ACCTACGACT ACCAGTTTGA CTTCGAGGAC 16200
GATCAGCACA AGATCCCCTG CCACTGTGGA GCTTGGAATT GCAGGAAGTG GATGAACTAA 16260
CTGGGAAAAA GGGGCTGGGG GCTGCCATCC CCGCCTCAGA GACCTGGCCG AGCCCCCCAG 16320
GGCTGCAGGA GGTGCCAGGG GCGGCCGGGC ACTGAGGGAG AGCAGCGGCT GCAGCGCTGC 16380
CGGCCCTGCC CGAGCGGGAC GGACGGACAG ACAGATGAAC TGGCAACT 16429
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 97 0.0 1893
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 67 0.0 1198
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 67 0.0 1196
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 67 0.0 1187
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 67 0.0 1185
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 66 0.0 1182
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 68 0.0 1181
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 67 0.0 1179
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 67 0.0 1179
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 67 0.0 1177
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 67 0.0 1176
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 66 0.0 1176
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 66 0.0 1172
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 67 0.0 1172
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 66 0.0 1171
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 66 0.0 1170
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 66 0.0 1169
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 67 0.0 1168
WERAM-Sah-0035 ENSSHAP00000004216.1 Sarcophilus harrisii 66 0.0 1164
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 66 0.0 1162
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 66 0.0 1159
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 66 0.0 1156
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 65 0.0 1156
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 66 0.0 1143
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 67 0.0 1134
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 96 0.0 1125
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 65 0.0 1119
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 92 0.0 1101
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 67 0.0 1099
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 92 0.0 1093
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 91 0.0 1090
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 92 0.0 1082
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 93 0.0 1078
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 93 0.0 1059
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 88 0.0 1051
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 84 0.0 1008
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 82 0.0 1000
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 64 0.0 981
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 81 0.0 979
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 80 0.0 974
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 80 0.0 971
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 80 0.0 968
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 80 0.0 965
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 80 0.0 965
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 80 0.0 962
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 961
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 65 0.0 933
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 68 0.0 928
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 95 0.0 906
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 82 0.0 901
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 74 0.0 890
WERAM-Ora-0065 ENSOANP00000009850.3 Ornithorhynchus anatinus 74 0.0 889
WERAM-Tas-0126 ENSTSYP00000013377.1 Tarsius syrichta 74 0.0 887
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 80 0.0 884
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 872
WERAM-Mod-0039 ENSMODP00000005827.3 Monodelphis domestica 75 0.0 867
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 76 0.0 866
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 824
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 74 0.0 708
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 56 4e-180 632
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 53 3e-166 586
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 99 7e-165 581
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 49 3e-154 546
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 2e-93 344
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 36 8e-90 332
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 75 2e-88 327
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 36 1e-45 185
Created Date 25-Jun-2016