WERAM Information


Tag Content
WERAM ID WERAM-Sah-0035
Ensembl Protein ID ENSSHAP00000004216.1
Gene Name KMT2D
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSSHAG00000003715.1 ENSSHAT00000004258.1 ENSSHAP00000004216.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET1 3.90e-45 152.9 3210 5040
Me_Reader PHD 1.20e-25 89.6 170 4664
Organism Sarcophilus harrisii
Domain Profile
  HMT SET1

              SET1.txt   16 vakkeiekeelviEYvGevirsevadkrekeyekke 51  
v k++ e+++l++EY+ + +++++++++++ ++++
ENSSHAP00000004216.1 3210 VRKQQKEHTNLMAEYRNKQQQQQQQQQQQQQQQQQQ 3245
44556778899*****99966666555555444433 PP
SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
++++a+s+i+glgl+a+k++ek+++viEY+G++ir+eva++rek ye++++g+y+fr++++ +v+dat +g+ ar+inhsc+pNc+
ENSSHAP00000004216.1 4925 NVYLARSRIQGLGLYAAKDLEKHTMVIEYIGTIIRNEVANRREKIYEEQNRGIYMFRINNE--HVIDATLTGGPARYINHSCAPNCV 5009
7999*********************************************************..************************ PP
SET1.txt 89 akvvavdgekkiviyakraIekgeeltydYk 119
a+vv++d+e ki+i+++r+I+kgeeltydY+
ENSSHAP00000004216.1 5010 AEVVTFDKEDKIIIISSRRIPKGEELTYDYQ 5040
******************************6 PP

  Me_Reader PHD

               PHD.txt   3 iClvCgkddegekemvqCde..CddwfHlkCvklpl 36 
C +C + ++ + C + C++ +H C +
ENSSHAP00000004216.1 170 RCSHCTRLGA----SIPCRSpgCPRLYHFPCAAASG 201
6999933333....599**9***********98873 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
C+vC+ +e + ++ C +C + +H+ C++ +l+ + w Cp Ck
ENSSHAP00000004216.1 227 HCVVCDGLGE-LRDLLFCTSCGQHYHGACLDTALTARKRA-GWQCPDCK 273
6****43333.344*******************8888855.6******7 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C++C++++e++ m+ C+ Cd+ +H+ C+k++ +slp + sw C++C+
ENSSHAP00000004216.1 274 VCQTCRQPGEDSM-MLVCEACDKGYHTFCLKPAIQSLPPD-SWKCKTCR 320
8****99999975.************************99.9******8 PP
PHD.txt 2 tiClvCgkddegeke.mvqCdeCddwfHlkCvklplsslpegkswyCpsC 50
++C+vCg + g++ ++ C++C++ +H +Cv+ + +++ k w+C +C
ENSSHAP00000004216.1 1000 DMCVVCGSFGRGAEGhLLACSQCSQCYHPYCVNSKITKVMLLKGWRCVEC 1049
68****75444433349******************888884446****** PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51
+C vCg++++ ++ ++ Cd Cd +H++C+++pl ++p+g w C+ C+
ENSSHAP00000004216.1 1051 VCEVCGQASDPSR-LLLCDDCDISYHTYCLDPPLLTVPKG-GWKCKWCV 1097
7****99999987.**************************.***99997 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg...kswyCpsCk 51
+C++C+ ++ +e+ ++qC +C++w+H+ C +l ++ e+ + + C sC+
ENSSHAP00000004216.1 1128 TCPACRAPYVEEDLLIQCRHCERWMHAGCESLFTEEEVEQaadEGFDCASCQ 1179
7*****99999999*****************98444444445445******7 PP
PHD.txt 4 ClvCgkddegeke....mvqCdeCddwfHlkCvk 33
C++C+++++g++ +++ d d w+Hl+C
ENSSHAP00000004216.1 4559 CCFCHEEGDGATDgparLLNLDL-DLWVHLNCAL 4591
66698888887667777777777.5599999975 PP
PHD.txt 3 iClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C +C+++++ + C+ C+ +H C ++ ++k +Cp +k
ENSSHAP00000004216.1 4618 KCSLCQRTGATN----SCNRlrCPSVYHFACAIRAKCMFFKDKTMLCPLHK 4664
599996666665....6**999*********98886666677778999886 PP

Protein Sequence
(Fasta)
MDDPKPSGED KDSEPAADGP EASEESGPTE PDPPNPPLGD ASVHSPGSSK SQDPPQDCSK 60
GPLRRCAFCN CGEWSLHGQR ELRCFKLPPD WPQGSARPPE GPGLGEAVPL NDDLSQIGFP 120
EGSTPAHLGK PGELCWAHHW CAAWSVGGQG QDGLELHGVD KAIFSGISQR CSHCTRLGAS 180
IPCRSPGCPR LYHFPCAAAS GSFQSMKTLQ LLCLEHGEEA EHLEDAHCVV CDGLGELRDL 240
LFCTSCGQHY HGACLDTALT ARKRAGWQCP DCKVCQTCRQ PGEDSMMLVC EACDKGYHTF 300
CLKPAIQSLP PDSWKCKTCR VCRACGACPA ELDPNCQWYE NYSLCERCQR QQNPQGGRVG 360
SSETEQSLHV CSKYSQSELG MASVGAPGDF DTACQEQLEG EKEACRKSEE PGPLHCEAKP 420
LGKGVLQINF ASVNSCQLHR TLGTLVRLHS LRQGVGPGCP CLMLSVAYRI WLLMSAGQAG 480
TQDESLPEVL SPPEVAPLDE EMPLEPLLLP EEPLLSPQLP PTGKELGKHP SSGIQSVPQR 540
DLLATVQACT NLKAQCEVQN TVPLKSSFFL PFPDPPPPPL SPATAVAPPS LSPLGEQEEL 600
PDAKGDSEPG TPPEVPILDT PISPPPEASC FEPEPIAPID LTPSPASPEE LGELNSPIPL 660
EPPPPQCSPL PLAPSPLGFM EKAPKVLDET ESQEMETEPE CPALEPSGIS PPACPTGDLS 720
CPAPSPAPSP APALDVFSNL VEDTICLDET EATGPVPEAG QTSGSSGSDP KGSPLLLDPE 780
ELAPVTPMEV YGSDGKQAGQ SSPCEDPEEP VAPPVHTPPT LIKSDIVNEI SNLSQGDASA 840
SFPGSEPLLG SPDPEGGGSL SMELGVSTDV SPARDEGSFR LCTDSLPETD DSLLCDSGAT 900
GAGGKAEGDK GRRRSSPARS RIKQGRSSSF PGRRRPRGGA HGGRGRGRAR LKSTTSSVET 960
LVVADIDSSP SKEEEEEDDD TMQNTVVLFS NTDKFVLMQD MCVVCGSFGR GAEGHLLACS 1020
QCSQCYHPYC VNSKITKVML LKGWRCVECI VCEVCGQASD PSRLLLCDDC DISYHTYCLD 1080
PPLLTVPKGG WKCKWCVSCM QCGAVSPGFH CEWQNSYTHC GPCASLVTCP ACRAPYVEED 1140
LLIQCRHCER WMHAGCESLF TEEEVEQAAD EGFDCASCQP YVVKPVVPIA PPELVPVKVK 1200
EPEPQYFRFE GVWLTETGMA VLRNLSMSPL HKRRQRRGRL GLLGEGGLEG PEPSDPGGPD 1260
DKKDGDLEAE ELLKSEGVGV EHMECEIKLE APASPDGEPG KEETDEGKKR KRKPYRPGIG 1320
GFMVRQRKSH ARLKKGPAVL VEVLSGEGQP DEVVTVDPPT EGAVEQSTAE GDEKKKRRGR 1380
KKSKLEDMFP AYLQEAFFGK ALLDLSRKAL LVAGDGRPGF VPGTLRGKGD PGPDRKESSA 1440
SQKGDDGPDI ADEESRGPEG NSETPGPEEG GIKTSPVPSD SEKPGTPGEG MLSSDLDKIP 1500
TEELPKMESK DLQQLFKDVL GSEREQQIGC GTPNLDGSHT PMQQRPFLQG GLPLGNLPTS 1560
SPMDSYPGLC QSPFLDNRER GGFFSPEPGE PDSPWTGSGG TTPSTPTTPT TEGEGDGLSY 1620
NQRSLQRWEK DEELGQLSTI SPVLYANINF PNLKQDYPDW ASRCKQIMKL WRKVPATDKA 1680
PYLQKAKDNR AAHRINKVQK QAESQISKQT KVGDIARKTD RPTLHLRIPP QPGMLGSPPP 1740
ASAPTVFIGS PPTPAGLSTS SDGFLKPPAG TVPGPDSPGD LFLKLPPQVP AQVPLQDPFG 1800
LAPSYTLESR FPTAPPAYPP YPGSAVPPAK PQMLGAPPRP GAGQPGEFQS TPPGTPRHQP 1860
STPDPFLKPR CPSLDNLAIP ESPGVGGAKA SEQLISPPPF GEPPRKALEV KKEELGAASP 1920
GYGPPSLGFG DSPSSGPHLG SLELKAPDVF KAPLTPRASQ VEPQSPGIGL RPQEPPSAHA 1980
VAPSPPSHAD LYRPTPSGSY PETYSQPPLT PRPQPPPPES CCPLPPRSLP SDPFTRVPAS 2040
PQSQSSSQSP LTPRPLSTEA FCQSPVTPCF QSPDPYSRPP SRPQSRDPFA PLHKPPRSQP 2100
PEVAFKAGPL AHTPLGVGGF PVALPSGHPN EHHTKSSGGQ PPSFVRSPGA NVFVGNPSAM 2160
RFTFPQAVGE PPLKPPASQP SHPPPHGINS HYGPGPTVGK PQSTNYTTAT GSFHPGGSPL 2220
GPGTGPPGEG YGLSPMRPPS VLPQPAPDGP LAYLPHGASQ RGSITSPIDK REDPVSGMAT 2280
SLVGPELQGP QDPSMANLSQ TELEKQRQRQ RLRELLIRQQ IQRNTLRQEK ETAAATSAVG 2340
PGNWGTESSN PAFEQLNRGP TSYPGTQDKS GLTGLPPNKL SGSMLGPSPF PAEERLSRPP 2400
PPPTPSSIDM NGRQLVGGTQ AFFPRGPFPG PLPQQQQQQL WQQQQQQQAT VASMRLSMPA 2460
RFPQPPGPEL GRTILGPPLS GLSNRLPGPG EPLPGPAGPA QFIELRHNVQ KGPGPGSAPY 2520
HLQGPPQRPR FFPVAEDSHR LAPESLRPLA VPALPPQKPS APPPPELSNS LHSLPHAKGP 2580
SLPAGLELVG RSPSSTETAR PPLVLESAKL PCEDPELDDD FDAHKALEDD EELAHLGIGV 2640
DVAKGDDELG TLGNLETNDP HLDDLLNGDE FDLLAYTDPE LDTGDKKDIF NEHLRLVESA 2700
NEKAEREALL RGTEPGLSGL EERPAPTTDG PDSHLMPGAS EVKPKLEEGG HQASPCQFTV 2760
ATPKVDPGPP GTLGLGLKPG QSLLGPRDNR LSMGPFPNNV HTVEKSPFGG TAGPPAQLLT 2820
PNPLGGPGGA SLLEKFDLES GGLTLPGGPP PSGDELDKME SSLVASELPL LIEDLLEHEK 2880
KELQKKQQLS AQLQQPPPPP PPPPPPPQQQ QQHPLLATSA TAQSMPLPQE GPSPGLAVTP 2940
QQLALGLGAR QSSLVSTQAM MTTQQPSHAL QQRLVPPMAM VPNQGHMLSG GQASLVPQQN 3000
PQPVLAQKPM GAVPPSMCMK PQQLAMQQQL ANSFFPDTDL DKFAAEDIID PIAKAKMVAL 3060
KGIKKVMAQG SIGVPPGMNR QQVSLLAQRL SGAPGNTDLQ NHVAAGSGQE RSNSDPSQPR 3120
PNPPTFAHGV INEADQRQYE EWLFHTQQLL QMQLKVLEEQ IGVHRKSRKA LCAKQRTAKK 3180
AGREFPEADA EKLKLVTEQQ SKIQKQLDQV RKQQKEHTNL MAEYRNKQQQ QQQQQQQQQQ 3240
QQQQQQHSAV LALSPSQSPR LLAKLPGQLL PGHGLQPPQG PLGGQAGSLR LPPGGVTLTG 3300
QPSGPFLNPA LGQQQQQQQH QAGAGTLAGP SAGFFPGNLA LRGLGADARL LQERQLQLQQ 3360
QRMQLSQKLQ QQQQQQHLLG PKPQGLLPPG SHQGLLVQQL SPQPTPGPQG MLGPAQVAVL 3420
QQQQQQVHPG GLGPQGPHRQ VLLAQPRMLG SPQLAQQAQG LMAHRLVMAQ QQQQQQQQQQ 3480
QQQQQQQQQQ QQQQALAGLS HLQQGLMPPS GQPKLGTQSM GTQQQQQLGL LNQGRTLLPK 3540
PLQHFPSHGA LGPTLLLSLP GKEQAGAETV LVPEGTEAPS VHLGGPLALG APTETLPPEP 3600
VEVKPSIPGD SHLLLSQPQP HTQMNSLQLQ APLRLPGQQQ QQQQQQQVSL LHPAGSTSHG 3660
QMTGGQPPEV TSMSHLLTQP LSSVGERPSG LDRPLKGSPR PPPPKPSPLP HPGQGLPGHS 3720
GMPTVGQLRA QLQGVLAKNP QLRHLSPQQQ QQLQALLVQR QLQQGQVLRQ TVPYLEPGTQ 3780
SSSLQGLLGR QPQPGGFLGA PSGPLQELGA VPRPQGPSRL STSQGAISSG STLGFAQPPP 3840
LPSSPQEPKR PSPRLLPPSP QLSAEVQLTP TQIENPKPQE SSLELPSGDI PPPSSAASQL 3900
PDTFFAKGPG PWETPDNLAE AHKTEQNNMG HGEVQQVNGQ EMPEPPCLMI KQEPREEPCA 3960
MGAQLIKRET NGEPMGTPGT SNHLLLASSR SEAGHLLLQK LLRAKNVQLG TGHGPEGLRA 4020
EINGHIDSKL AGLEQKLQGT PVSKEDVAAK KPLTPKPKRV QKAGDRIASS RKKLRKEDGV 4080
KANEALLKQL KQELSLLPLS EPNITNNFSL FAPFGSGCLI NGRNQLRGAF GSGALPTGPD 4140
YYSQLLTKNN LSNPPTPPSS LPPTPPPSVQ QKLVNGVTPS EDLGEQQKDA APARESEGGA 4200
RGWGEVGSLD LLAALPTPPH NQTEDVRMES DEESDSPDSI VPASSPESIL GEEAPRFPQL 4260
VSGQQEQEDR ALSPVIPIIP RASIPVFPDA KPYGTLEVEP TGKLSATTWE KGKGSEVSVM 4320
LTVSAAAAKN LNGVMVAVAE LLSMKIPSSY EVLFPESPAR SAGAEPKKGE VEGPGGKEKS 4380
LGGKPTESGS DWLKQFDAVL PGYTLKSQLD ILSLLKQESP VPELPTQHCY IHNVSNLDVR 4440
QLSAPPPEEP SPPPSPSAPS PTSPPAEPSA PSPPPLVPSP PMPEAARPKP RARPPEEGED 4500
SRPPRLKKWK GLRWKRLRLL LTIQKGSGRR EGEREVAEFM EQLGTALRPD KVPRDLRRCC 4560
FCHEEGDGAT DGPARLLNLD LDLWVHLNCA LWSTEVYETQ GGALMNVEGA LHRGLLTKCS 4620
LCQRTGATNS CNRLRCPSVY HFACAIRAKC MFFKDKTMLC PLHKLKGPCE QELSSFAVFR 4680
RVYIERDEVK QIASIIQRGE RLHMFRVGGL VFHAIGQLLP HQMADFHSAT ALYPVGYEAT 4740
RIYWSLRTNN RRCCYRCSII ENNGRPEFII KVMEQGLEDL VFTDASPQAV WNRIIEPVAA 4800
MRKEADMLRL FPEYLKGEEL FGLTVHAVLR IAESLPGVES CHNYLFRYGR HPLMELPLMI 4860
NPTGCARSEP KILTHYKRPH TLNSTSMSKA YQSTFTGETN TPYSKQFVHS KSSQYRRLRT 4920
EWKNNVYLAR SRIQGLGLYA AKDLEKHTMV IEYIGTIIRN EVANRREKIY EEQNRGIYMF 4980
RINNEHVIDA TLTGGPARYI NHSCAPNCVA EVVTFDKEDK IIIISSRRIP KGEELTYDYQ 5040
FDFEDDQHKI PCHCGAWNCR KWMN 5064
Nucleotide Sequence
(Fasta)
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 60
NNNNNNNTTT TGGGACCGAG GGCAGAGGAC CAACTCCCAG CCTTGGTGTT TTCATCCACT 120
TTAAATTTTT TTCGAAGTGG GGGGGGGCCG CTCCGACCTG GATTTACCGT TCTTGGCCTC 180
TCTTTGGCCC CCCCCGTGTG GGGGGCGCTT GTGATCGTCC TGGCGGGTGG AGGTCGGGGA 240
GCGGCCGGAG TTGTGGCCAT GTTCTCGGGT GAAGATTTCT GGATCGCCGT GTGAAGAGGT 300
CTCCCCGGGA GGGCACTGCC CAGTCGGAGA GAGGAATGGA CGACCCGAAG CCCTCTGGTG 360
AAGATAAAGA TTCAGAGCCG GCAGCTGATG GGCCTGAAGC CTCCGAAGAA TCAGGCCCCA 420
CTGAGCCAGA CCCCCCCAAC CCACCTTTGG GGGATGCCTC TGTCCACAGC CCAGGGAGTT 480
CCAAATCCCA GGACCCTCCC CAGGATTGCA GCAAGGGCCC ACTTCGGCGC TGTGCCTTCT 540
GTAACTGTGG GGAATGGAGT CTCCATGGGC AGCGGGAGCT TCGGTGCTTT AAACTTCCAC 600
CAGACTGGCC CCAGGGTTCA GCAAGGCCAC CTGAGGGGCC TGGACTTGGA GAGGCAGTGC 660
CACTCAATGA TGATTTGTCC CAGATTGGCT TCCCCGAGGG ATCGACACCT GCCCATTTGG 720
GAAAGCCTGG TGAACTCTGC TGGGCTCATC ACTGGTGTGC AGCATGGTCA GTGGGAGGAC 780
AGGGCCAAGA TGGGCTGGAG TTGCACGGAG TGGACAAGGC CATCTTCTCA GGGATCTCTC 840
AGCGCTGCTC TCATTGCACC AGGCTGGGAG CCTCCATTCC CTGCCGATCG CCTGGATGCC 900
CACGGCTCTA CCACTTCCCC TGTGCTGCTG CCAGTGGTTC CTTCCAGTCT ATGAAGACCT 960
TACAGCTGTT GTGTTTGGAG CATGGCGAGG AGGCTGAACA TCTGGAGGAT GCCCATTGTG 1020
TGGTATGTGA TGGGCTGGGT GAACTTCGAG ACCTTCTCTT CTGTACCAGC TGTGGGCAGC 1080
ACTATCATGG GGCCTGTCTG GACACTGCTC TGACTGCACG AAAACGTGCC GGCTGGCAAT 1140
GCCCGGATTG CAAAGTGTGC CAAACCTGCA GGCAGCCTGG GGAGGATTCT ATGATGCTAG 1200
TCTGTGAGGC GTGTGATAAG GGCTACCACA CCTTCTGCCT GAAGCCAGCT ATTCAGAGCC 1260
TGCCCCCTGA TTCATGGAAG TGCAAGACGT GCCGGGTGTG CCGAGCCTGT GGGGCCTGCC 1320
CAGCAGAGCT GGATCCCAAC TGCCAGTGGT ATGAGAATTA CTCATTGTGT GAGCGGTGCC 1380
AGCGGCAGCA GAACCCCCAA GGTGGAAGGG TGGGAAGCTC TGAGACCGAG CAGAGTCTCC 1440
ATGTCTGCAG CAAATATTCT CAGTCAGAGC TTGGCATGGC CTCCGTTGGT GCACCTGGCG 1500
ATTTTGATAC TGCATGCCAG GAACAGTTGG AGGGGGAGAA GGAAGCCTGC AGGAAGTCTG 1560
AGGAACCAGG GCCACTGCAT TGTGAAGCCA AACCACTAGG TAAAGGGGTC CTCCAGATAA 1620
ACTTTGCTAG TGTTAATAGC TGCCAGCTCC ACAGGACTTT GGGGACACTG GTTCGACTTC 1680
ACTCCCTTAG ACAGGGAGTA GGTCCTGGTT GCCCCTGTCT TATGCTTTCT GTAGCATATA 1740
GAATTTGGCT TCTTATGTCT GCAGGGCAAG CAGGGACCCA AGATGAGTCA CTACCTGAGG 1800
TCCTTTCCCC ACCTGAGGTT GCCCCCCTTG ATGAAGAGAT GCCACTGGAG CCCCTGTTAC 1860
TACCTGAGGA GCCACTCCTG TCACCTCAGC TACCACCAAC AGGTAAGGAA CTGGGAAAGC 1920
ATCCTTCTTC CGGGATACAA TCTGTCCCCC AGAGAGATTT GTTAGCTACT GTACAGGCTT 1980
GTACAAACCT CAAAGCACAG TGTGAAGTAC AGAATACTGT ACCTCTAAAA TCTTCCTTTT 2040
TCCTCCCCTT TCCAGATCCT CCTCCTCCTC CACTCTCTCC TGCCACTGCT GTAGCCCCAC 2100
CATCCTTGTC TCCCTTGGGG GAGCAAGAGG AGCTCCCTGA TGCCAAAGGG GACAGTGAGC 2160
CAGGAACTCC TCCAGAGGTC CCTATCCTAG ACACACCCAT CAGCCCCCCT CCAGAAGCCA 2220
GCTGCTTTGA ACCAGAACCC ATAGCTCCTA TAGACTTGAC CCCTTCTCCA GCCTCCCCCG 2280
AAGAACTGGG AGAACTGAAT TCTCCCATCC CTCTAGAGCC CCCTCCTCCA CAGTGTTCCC 2340
CTTTACCATT AGCCCCTTCT CCCCTGGGCT TCATGGAGAA GGCACCAAAA GTCCTAGATG 2400
AGACTGAGTC TCAGGAGATG GAGACAGAGC CAGAGTGCCC AGCTTTGGAG CCTAGTGGGA 2460
TCAGCCCCCC AGCCTGTCCC ACAGGAGACC TCTCCTGTCC TGCCCCTAGC CCTGCCCCCA 2520
GCCCTGCCCC GGCCCTGGAT GTCTTTTCTA ATCTGGTTGA GGACACAATC TGTCTTGATG 2580
AGACTGAGGC AACTGGTCCG GTGCCAGAGG CTGGACAGAC CTCTGGCAGT TCAGGCAGTG 2640
ATCCTAAAGG ATCCCCCTTG CTCCTTGATC CTGAGGAGCT GGCCCCTGTG ACCCCTATGG 2700
AGGTCTATGG TTCCGATGGT AAGCAGGCTG GGCAGAGCTC ACCTTGTGAA GATCCTGAGG 2760
AGCCAGTTGC CCCACCAGTG CATACCCCTC CCACTCTCAT CAAGTCTGAC ATTGTCAATG 2820
AGATCTCCAA CCTGAGCCAA GGGGACGCCA GCGCCAGCTT CCCGGGCTCC GAGCCTCTGT 2880
TGGGCTCCCC AGACCCCGAG GGGGGTGGAT CACTCTCCAT GGAGCTCGGT GTGTCCACTG 2940
ACGTCAGCCC TGCGCGGGAT GAGGGCTCCT TTCGGCTCTG CACCGATTCA CTGCCTGAGA 3000
CTGATGACTC CTTACTTTGT GATTCTGGAG CAACTGGAGC TGGTGGCAAG GCTGAGGGCG 3060
ATAAGGGGAG ACGGCGAAGC TCCCCCGCCC GTTCCCGCAT CAAACAGGGC CGTAGCAGTA 3120
GCTTCCCAGG AAGACGTCGA CCTCGGGGGG GTGCCCATGG AGGAAGGGGG AGAGGACGTG 3180
CCCGGCTGAA GTCGACCACT TCCTCTGTGG AGACTCTGGT AGTCGCGGAC ATTGATAGTT 3240
CTCCCAGTAA GGAGGAAGAA GAGGAAGATG ATGATACCAT GCAGAATACT GTGGTTCTCT 3300
TTTCCAACAC CGACAAATTT GTCTTGATGC AGGATATGTG CGTAGTGTGT GGTAGCTTTG 3360
GTCGAGGGGC AGAGGGACAC CTCTTGGCCT GCTCTCAGTG TTCACAGTGC TACCATCCTT 3420
ACTGTGTCAA CAGCAAGATT ACTAAAGTGA TGCTCCTGAA AGGCTGGAGA TGTGTGGAAT 3480
GCATCGTGTG TGAGGTGTGT GGCCAGGCCT CTGACCCATC TCGCCTTTTG CTCTGCGATG 3540
ACTGTGACAT CAGCTACCAC ACTTACTGTC TGGATCCCCC TCTGCTTACA GTGCCCAAGG 3600
GTGGCTGGAA GTGCAAATGG TGTGTGAGTT GCATGCAGTG TGGGGCTGTT TCACCGGGCT 3660
TCCACTGTGA GTGGCAGAAC AGTTACACCC ACTGTGGGCC CTGTGCCAGC CTGGTGACCT 3720
GCCCAGCATG TCGTGCTCCT TATGTAGAAG AAGATTTGTT AATCCAGTGC CGTCATTGTG 3780
AGCGGTGGAT GCATGCTGGT TGTGAGAGCT TATTCACAGA GGAAGAGGTA GAACAAGCAG 3840
CAGATGAAGG CTTTGATTGT GCCTCTTGTC AGCCCTATGT GGTGAAGCCT GTGGTGCCCA 3900
TTGCGCCACC AGAGCTCGTG CCTGTGAAGG TGAAGGAGCC AGAGCCCCAG TACTTCCGCT 3960
TTGAGGGAGT GTGGCTGACG GAGACCGGCA TGGCTGTGCT TCGAAATCTG TCCATGTCCC 4020
CCCTCCACAA GCGGCGTCAG AGGCGTGGGC GGCTTGGCCT CCTAGGTGAG GGTGGGCTAG 4080
AAGGGCCGGA GCCCTCAGAC CCCGGTGGCC CCGACGACAA GAAAGATGGG GACCTGGAAG 4140
CTGAGGAGTT GCTCAAGAGC GAAGGAGTTG GTGTGGAGCA CATGGAGTGT GAGATCAAGC 4200
TGGAGGCCCC TGCCAGTCCT GATGGGGAGC CTGGCAAGGA AGAAACTGAT GAGGGCAAAA 4260
AGCGCAAGCG AAAACCTTAC AGGCCTGGCA TTGGGGGCTT CATGGTTCGA CAGCGCAAAT 4320
CTCATGCCCG TCTGAAGAAA GGGCCCGCTG TGTTGGTGGA AGTGTTGAGT GGGGAGGGCC 4380
AGCCTGATGA GGTTGTGACA GTTGACCCCC CTACAGAGGG TGCAGTGGAG CAGAGTACAG 4440
CGGAAGGTGA TGAGAAGAAG AAGCGTCGGG GTCGGAAAAA GAGCAAACTG GAGGACATGT 4500
TTCCTGCTTA CCTGCAGGAA GCCTTCTTTG GGAAGGCACT GTTGGACCTT AGCCGAAAGG 4560
CATTGCTGGT GGCTGGGGAT GGACGACCAG GCTTTGTGCC AGGCACTCTC AGGGGTAAGG 4620
GGGACCCAGG CCCTGATAGA AAAGAGTCTT CTGCCTCACA GAAAGGGGAT GATGGACCAG 4680
ACATTGCAGA TGAAGAGTCC AGAGGCCCTG AGGGCAATTC AGAGACTCCA GGCCCTGAAG 4740
AGGGAGGCAT CAAGACATCT CCTGTACCCA GTGACTCAGA GAAACCAGGG ACCCCAGGCG 4800
AGGGAATGCT CAGCTCTGAC TTGGACAAGA TCCCCACAGA AGAGCTGCCC AAGATGGAAT 4860
CAAAGGACCT TCAACAGTTA TTCAAGGATG TCCTAGGCTC AGAGCGGGAG CAGCAGATTG 4920
GCTGTGGGAC TCCTAACTTG GATGGGAGTC ACACCCCAAT GCAGCAGAGG CCCTTCCTAC 4980
AAGGTGGGCT TCCTTTGGGC AATCTCCCTA CCAGCAGCCC AATGGACTCC TACCCAGGCC 5040
TTTGCCAGTC TCCTTTCCTT GATAATAGGG AGCGCGGGGG CTTCTTCAGC CCAGAGCCTG 5100
GTGAGCCTGA CAGCCCCTGG ACGGGCTCAG GGGGCACCAC ACCCTCCACA CCCACCACCC 5160
CAACCACGGA GGGTGAAGGC GATGGGCTCT CCTACAACCA GCGGAGCCTC CAGCGCTGGG 5220
AAAAGGATGA GGAGCTGGGC CAGCTGTCTA CAATCTCTCC TGTGCTCTAC GCCAATATTA 5280
ATTTTCCCAA CCTCAAGCAG GATTATCCAG ACTGGGCCAG TCGCTGCAAA CAGATCATGA 5340
AGCTGTGGAG AAAAGTCCCG GCCACTGACA AAGCCCCCTA CCTGCAAAAG GCCAAAGATA 5400
ACCGGGCGGC TCACCGCATC AACAAGGTGC AGAAGCAGGC GGAAAGTCAG ATCAGTAAGC 5460
AGACCAAGGT GGGCGACATA GCCCGCAAAA CCGATCGGCC CACCCTGCAT CTACGCATTC 5520
CCCCTCAGCC AGGGATGCTG GGCAGTCCCC CTCCAGCTTC TGCTCCGACT GTCTTCATTG 5580
GCAGTCCCCC TACCCCAGCT GGCTTGTCTA CCTCCTCGGA TGGGTTCCTG AAGCCGCCAG 5640
CAGGCACAGT GCCTGGCCCT GACTCACCTG GTGATCTCTT CCTCAAGCTC CCACCCCAGG 5700
TGCCCGCCCA AGTGCCTTTG CAGGACCCTT TTGGACTGGC CCCCAGCTAT ACTCTGGAGT 5760
CTCGCTTCCC CACAGCACCA CCAGCCTACC CCCCCTATCC TGGGTCAGCA GTGCCCCCTG 5820
CAAAGCCTCA GATGCTGGGT GCTCCCCCTC GGCCTGGGGC TGGCCAGCCT GGAGAGTTCC 5880
AATCCACCCC GCCTGGGACC CCAAGGCACC AGCCTTCCAC ACCTGATCCT TTTCTCAAGC 5940
CCCGCTGCCC TTCACTAGAC AACTTGGCAA TTCCAGAGAG TCCAGGGGTG GGTGGAGCCA 6000
AGGCATCTGA GCAACTGATA TCCCCTCCAC CCTTTGGAGA GCCGCCTCGA AAGGCCCTAG 6060
AAGTAAAGAA GGAGGAGCTA GGAGCAGCCT CTCCTGGTTA TGGCCCCCCC AGCCTTGGTT 6120
TTGGAGACTC GCCTTCCTCT GGGCCCCACT TAGGCAGCCT GGAACTAAAG GCACCAGATG 6180
TCTTCAAAGC CCCCCTGACC CCTCGGGCAT CTCAGGTAGA GCCTCAGAGC CCAGGCATAG 6240
GCTTGAGACC CCAGGAGCCT CCCTCTGCCC ATGCTGTGGC CCCCTCACCC CCTAGCCATG 6300
CAGACCTATA TCGTCCAACT CCTTCTGGTT CCTATCCTGA GACCTATTCG CAGCCCCCAC 6360
TGACGCCCCG GCCTCAGCCT CCACCTCCTG AGAGTTGCTG TCCCCTGCCG CCTCGCTCCT 6420
TGCCCTCCGA TCCCTTCACT CGAGTACCTG CGAGCCCCCA GTCTCAGTCT AGCTCTCAGT 6480
CGCCCCTGAC ACCCCGTCCG TTGTCCACTG AAGCCTTCTG CCAGTCGCCG GTCACCCCTT 6540
GCTTCCAGTC ACCTGACCCC TACTCTCGCC CACCCTCACG GCCCCAGTCC CGGGACCCTT 6600
TTGCCCCACT ACATAAGCCT CCTCGGTCCC AGCCCCCTGA AGTTGCCTTC AAGGCTGGGC 6660
CCCTGGCCCA TACTCCACTA GGAGTAGGGG GCTTTCCGGT AGCCCTGCCC TCTGGGCATC 6720
CTAATGAGCA TCACACTAAG AGTTCCGGCG GACAGCCTCC ATCTTTTGTC CGCTCTCCTG 6780
GGGCAAATGT GTTTGTGGGC AATCCTTCTG CCATGCGCTT CACTTTCCCC CAGGCAGTGG 6840
GTGAGCCACC CTTGAAGCCC CCTGCCTCTC AACCCAGTCA CCCTCCACCC CATGGGATCA 6900
ACAGCCATTA TGGGCCAGGA CCCACTGTGG GCAAGCCCCA AAGCACAAAC TACACAACAG 6960
CTACAGGCAG TTTCCACCCA GGAGGCAGCC CCCTGGGGCC AGGCACCGGG CCACCAGGTG 7020
AGGGCTATGG TCTGTCTCCG ATGCGTCCAC CATCGGTTTT ACCACAACCA GCACCAGATG 7080
GTCCTCTCGC TTACCTGCCC CATGGTGCTT CACAGCGGGG GAGCATCACT TCCCCAATCG 7140
ATAAGCGAGA AGATCCAGTG TCTGGGATGG CCACCTCCCT GGTGGGTCCT GAGCTACAAG 7200
GTCCCCAGGA CCCCAGCATG GCCAATCTGA GCCAGACAGA GTTGGAGAAG CAGCGGCAGC 7260
GTCAGCGTTT GAGAGAGCTA TTGATCCGAC AGCAGATTCA ACGTAACACC CTTCGGCAAG 7320
AGAAGGAAAC AGCAGCAGCC ACGAGTGCTG TAGGGCCAGG AAACTGGGGC ACCGAGTCCA 7380
GCAACCCTGC TTTTGAGCAA CTGAATCGAG GCCCCACCTC CTACCCTGGG ACACAGGACA 7440
AAAGTGGCCT TACAGGATTG CCCCCTAACA AACTGAGTGG CTCGATGCTG GGACCTAGCC 7500
CCTTCCCTGC AGAGGAGAGA CTCTCTCGGC CCCCACCACC TCCCACCCCT TCTTCTATTG 7560
ACATGAATGG CCGGCAACTG GTTGGGGGCA CCCAAGCCTT CTTTCCTCGT GGGCCTTTCC 7620
CTGGACCTCT GCCTCAGCAG CAGCAGCAGC AGCTGTGGCA GCAGCAGCAG CAGCAGCAGG 7680
CTACTGTGGC TTCCATGCGG CTTTCCATGC CTGCAAGGTT CCCTCAACCA CCTGGACCTG 7740
AACTTGGCCG GACAATCTTA GGCCCTCCTT TGAGTGGGCT TTCCAACCGG CTGCCCGGAC 7800
CAGGGGAGCC ACTTCCTGGT CCAGCAGGTC CTGCTCAATT CATTGAGCTT CGTCACAATG 7860
TGCAGAAGGG TCCAGGGCCT GGCAGTGCCC CTTACCATCT TCAGGGCCCT CCTCAGAGAC 7920
CCAGATTCTT CCCAGTAGCT GAGGACTCCC ACCGCCTGGC TCCAGAGAGC CTTCGGCCCC 7980
TGGCGGTACC CGCCCTGCCT CCACAGAAGC CTTCGGCTCC TCCACCGCCT GAGCTGAGTA 8040
ACAGTCTCCA CTCCCTTCCC CATGCCAAGG GTCCCAGCCT TCCTGCTGGC TTGGAGCTGG 8100
TTGGTCGATC TCCTTCTAGC ACAGAGACAG CACGTCCACC CCTGGTTCTA GAATCTGCAA 8160
AACTGCCCTG TGAGGACCCC GAGCTGGACG ATGACTTTGA TGCCCATAAA GCACTTGAAG 8220
ATGATGAGGA GCTGGCCCAC CTGGGGATCG GAGTGGATGT GGCCAAGGGG GATGATGAGC 8280
TCGGCACCCT GGGGAACTTG GAGACCAATG ACCCCCATCT GGATGACTTG CTAAATGGGG 8340
ATGAATTTGA TCTCTTGGCC TACACAGACC CTGAGCTGGA CACTGGTGAT AAGAAGGACA 8400
TATTTAATGA ACACTTGCGA CTAGTAGAAT CTGCTAATGA GAAGGCTGAA CGTGAGGCAC 8460
TTCTGCGGGG CACAGAACCT GGCTTATCTG GCCTTGAGGA GCGCCCAGCC CCTACCACTG 8520
ATGGCCCTGA CTCCCATCTC ATGCCTGGGG CCAGTGAAGT AAAGCCTAAA TTAGAGGAGG 8580
GTGGACATCA GGCTTCTCCT TGCCAGTTTA CTGTGGCCAC CCCCAAGGTG GACCCGGGGC 8640
CTCCTGGCAC CCTGGGACTG GGGCTAAAGC CTGGACAGAG CTTGTTGGGA CCACGGGATA 8700
ACAGGCTGAG CATGGGTCCC TTCCCTAACA ATGTGCATAC AGTGGAAAAG AGTCCCTTTG 8760
GAGGCACAGC AGGTCCCCCA GCTCAGCTAC TGACCCCGAA CCCATTGGGT GGCCCAGGAG 8820
GGGCATCCTT GCTGGAGAAG TTTGACCTAG AGAGTGGGGG CCTAACCCTG CCTGGTGGGC 8880
CTCCACCATC TGGAGATGAG CTAGACAAGA TGGAGAGCTC CTTGGTGGCC AGTGAACTGC 8940
CTTTGCTCAT TGAGGACCTT CTGGAACATG AGAAGAAAGA ACTCCAAAAG AAACAACAGC 9000
TCTCTGCTCA GCTCCAGCAG CCACCTCCGC CACCGCCACC ACCGCCGCCA CCACCGCAGC 9060
AGCAGCAGCA GCACCCCTTG CTGGCCACCT CTGCCACTGC TCAGTCCATG CCTCTGCCCC 9120
AAGAAGGTCC ATCCCCTGGC CTGGCTGTTA CCCCACAACA GCTTGCACTA GGGCTTGGGG 9180
CCCGGCAGTC CAGCTTAGTG TCCACCCAGG CAATGATGAC CACTCAGCAA CCATCTCATG 9240
CCCTGCAACA GCGCCTGGTG CCTCCCATGG CCATGGTGCC CAACCAAGGC CATATGCTCA 9300
GTGGAGGGCA GGCGAGCTTG GTGCCTCAGC AGAATCCACA GCCAGTGCTG GCTCAGAAAC 9360
CTATGGGGGC AGTACCCCCA TCGATGTGCA TGAAGCCACA GCAGCTGGCA ATGCAGCAGC 9420
AGCTGGCCAA CAGCTTCTTC CCTGACACAG ACTTGGACAA GTTTGCTGCG GAAGATATCA 9480
TCGATCCTAT TGCCAAGGCT AAGATGGTGG CCCTAAAGGG TATCAAGAAA GTGATGGCAC 9540
AAGGCAGCAT TGGGGTGCCA CCAGGCATGA ACAGGCAACA GGTGTCCCTC TTGGCCCAGC 9600
GGCTCTCAGG AGCTCCTGGC AACACTGACC TACAGAATCA TGTGGCAGCT GGGAGTGGGC 9660
AGGAAAGGAG CAACAGTGAC CCCTCCCAGC CTCGACCTAA TCCACCCACT TTTGCCCATG 9720
GTGTGATCAA TGAAGCTGAC CAGCGGCAGT ATGAGGAATG GCTATTTCAC ACCCAGCAAC 9780
TGCTACAGAT GCAGCTCAAA GTGCTAGAGG AGCAGATTGG GGTACACCGG AAGTCCCGAA 9840
AGGCCCTGTG CGCCAAACAG CGCACCGCTA AAAAGGCTGG CCGAGAATTC CCAGAGGCCG 9900
ATGCGGAGAA GCTCAAGTTG GTCACTGAGC AGCAGAGCAA AATCCAGAAG CAGCTGGACC 9960
AGGTTCGGAA ACAGCAGAAG GAGCACACAA ACCTCATGGC AGAGTATCGG AATAAGCAGC 10020
AGCAGCAGCA GCAGCAGCAA CAACAGCAGC AGCAGCAGCA GCAGCAGCAG CAGCATTCTG 10080
CAGTGCTGGC CTTGAGCCCC TCTCAGAGCC CTAGGCTACT GGCCAAGCTC CCCGGCCAGC 10140
TACTCCCAGG CCATGGGTTA CAGCCTCCAC AGGGGCCCCT GGGTGGCCAG GCAGGCAGCC 10200
TTCGCCTGCC CCCTGGAGGG GTTACACTTA CTGGCCAGCC CAGTGGCCCG TTCCTCAATC 10260
CAGCCCTGGG CCAGCAGCAG CAGCAGCAGC AACATCAAGC TGGGGCAGGC ACTCTAGCAG 10320
GCCCCTCAGC TGGCTTCTTC CCTGGGAATC TTGCTCTCCG AGGCCTGGGT GCTGATGCAA 10380
GGCTCTTACA GGAGCGGCAA CTCCAGCTGC AGCAGCAGAG GATGCAGCTG TCTCAGAAGC 10440
TACAGCAGCA GCAGCAGCAA CAGCACCTTT TGGGACCAAA GCCCCAGGGA CTCCTGCCTC 10500
CTGGCAGTCA TCAGGGCCTC TTGGTCCAGC AGCTGTCCCC CCAGCCAACC CCAGGGCCCC 10560
AAGGCATGCT GGGCCCTGCC CAGGTAGCAG TGCTGCAGCA ACAGCAGCAG CAGGTACACC 10620
CTGGGGGGCT GGGTCCCCAA GGTCCCCACA GACAAGTACT CCTTGCCCAA CCTCGAATGC 10680
TGGGGTCTCC TCAACTGGCC CAGCAAGCAC AAGGCCTAAT GGCCCACCGA CTTGTTATGG 10740
CCCAGCAACA ACAGCAACAA CAGCAGCAGC AGCAGCAGCA GCAACAGCAG CAGCAGCAGC 10800
AGCAACAGCA GCAGCAAGCC CTGGCTGGTC TCTCCCATCT CCAGCAAGGA TTGATGCCTC 10860
CTAGTGGGCA GCCCAAGTTA GGCACCCAGT CTATGGGGAC ACAACAGCAG CAGCAGCTGG 10920
GGCTCCTGAA CCAGGGTCGA ACTTTATTGC CTAAACCTCT CCAGCACTTT CCCAGCCATG 10980
GAGCCTTAGG CCCAACCCTT TTGCTGAGCT TACCAGGCAA GGAGCAAGCT GGGGCAGAGA 11040
CAGTGCTAGT ACCTGAGGGC ACTGAAGCCC CCTCAGTGCA CCTAGGAGGA CCGTTGGCAT 11100
TGGGTGCCCC AACAGAAACC CTGCCCCCAG AGCCAGTGGA GGTGAAGCCC TCTATCCCTG 11160
GGGACTCTCA CCTCCTCCTC TCCCAACCCC AACCTCATAC TCAGATGAAC TCCCTGCAGT 11220
TACAGGCACC TCTGAGACTC CCAGGGCAGC AGCAGCAGCA GCAGCAGCAA CAGCAAGTCA 11280
GCTTACTCCA TCCAGCAGGC AGTACAAGTC ATGGGCAAAT GACTGGTGGG CAACCCCCAG 11340
AAGTTACTTC CATGTCCCAC CTGCTGACCC AGCCCCTTAG TTCTGTAGGG GAGCGGCCAT 11400
CTGGGCTGGA TCGACCCTTG AAAGGGAGCC CAAGGCCACC GCCTCCCAAG CCAAGCCCTC 11460
TACCCCACCC TGGGCAGGGC CTACCTGGGC ATTCAGGAAT GCCCACAGTG GGGCAGCTTC 11520
GGGCCCAGCT TCAAGGTGTC CTGGCTAAGA ACCCACAGCT TCGGCACCTG AGCCCTCAAC 11580
AGCAACAGCA GCTGCAGGCC CTTCTTGTTC AGAGGCAACT GCAGCAAGGC CAGGTGCTAA 11640
GACAAACAGT GCCCTATCTG GAGCCTGGGA CCCAGTCTTC TTCCTTGCAA GGCCTCCTAG 11700
GTCGCCAGCC CCAACCTGGG GGCTTCCTGG GAGCTCCATC AGGGCCTCTC CAAGAGCTAG 11760
GGGCAGTGCC CCGACCTCAG GGCCCTTCCC GACTCTCCAC CTCACAAGGA GCCATATCTT 11820
CAGGGTCAAC CCTTGGCTTT GCACAGCCCC CTCCTTTGCC ATCCAGCCCT CAAGAGCCAA 11880
AGAGACCTTC CCCCCGATTG CTGCCCCCTA GCCCTCAGCT TTCTGCTGAG GTTCAGCTCA 11940
CTCCTACCCA GATTGAGAAC CCAAAGCCCC AAGAATCTTC CTTGGAGTTA CCTTCAGGGG 12000
ACATTCCACC CCCCTCATCT GCTGCCTCTC AGCTTCCAGA CACTTTCTTT GCCAAGGGAC 12060
CGGGACCTTG GGAAACCCCG GACAACCTGG CTGAGGCCCA TAAGACTGAG CAGAACAACA 12120
TGGGACATGG AGAGGTGCAG CAGGTGAATG GACAGGAAAT GCCGGAGCCA CCCTGCCTCA 12180
TGATCAAGCA GGAGCCTCGA GAGGAGCCTT GTGCCATGGG GGCCCAGTTG ATAAAACGGG 12240
AGACTAATGG AGAGCCTATG GGCACACCAG GCACCAGCAA CCACCTCTTA CTGGCAAGCA 12300
GCCGCTCAGA GGCTGGGCAC CTGCTCTTGC AGAAGCTGCT TCGGGCAAAG AATGTACAGC 12360
TTGGCACAGG GCATGGACCC GAGGGGCTCC GAGCAGAGAT CAATGGGCAC ATTGACAGCA 12420
AGCTGGCTGG TTTGGAGCAG AAGCTTCAGG GCACTCCAGT CAGCAAGGAG GATGTTGCAG 12480
CCAAGAAACC ACTCACCCCC AAACCCAAGA GGGTACAGAA GGCAGGGGAC AGGATAGCAA 12540
GCTCCCGAAA GAAGCTCCGG AAGGAAGATG GGGTGAAAGC CAATGAGGCC CTGTTGAAAC 12600
AACTAAAGCA GGAACTGTCC TTGCTGCCAC TTTCGGAACC CAACATCACC AACAACTTCA 12660
GCCTTTTTGC CCCCTTTGGC AGTGGCTGCC TCATCAATGG GCGGAACCAG CTGAGGGGGG 12720
CATTTGGGAG CGGGGCACTA CCCACTGGCC CTGACTACTA TTCCCAGCTT CTTACAAAGA 12780
ATAACCTGAG TAACCCGCCT ACGCCACCTT CCTCACTGCC ACCTACCCCA CCCCCATCAG 12840
TACAGCAGAA ATTGGTGAAT GGGGTCACTC CTTCTGAAGA CCTTGGGGAA CAACAAAAAG 12900
ATGCAGCCCC TGCCCGAGAA TCTGAAGGGG GAGCAAGGGG GTGGGGGGAG GTGGGGAGTT 12960
TGGACCTGCT GGCAGCACTT CCCACACCTC CTCACAACCA GACTGAGGAT GTCAGGATGG 13020
AGAGTGATGA GGAGAGTGAC TCCCCTGACA GCATCGTTCC AGCTTCTTCA CCTGAGAGTA 13080
TTCTAGGGGA GGAGGCACCT AGATTTCCCC AACTGGTATC AGGCCAGCAG GAGCAGGAGG 13140
ATAGGGCCCT CTCCCCAGTC ATTCCCATCA TCCCCCGGGC CAGCATCCCA GTCTTCCCAG 13200
ATGCCAAGCC CTATGGGACT CTGGAAGTAG AACCAACAGG AAAGCTTTCT GCCACCACCT 13260
GGGAAAAGGG CAAAGGGAGT GAAGTGTCCG TCATGCTGAC TGTCTCTGCT GCAGCAGCCA 13320
AGAATCTGAA TGGGGTCATG GTGGCTGTGG CAGAGCTGTT GAGCATGAAG ATTCCCAGCT 13380
CCTATGAGGT GCTGTTCCCA GAGAGCCCTG CCCGCTCAGC GGGTGCTGAG CCCAAGAAGG 13440
GGGAAGTGGA AGGGCCTGGT GGTAAAGAGA AAAGCCTTGG GGGAAAGCCC ACGGAGAGTG 13500
GCAGTGACTG GCTGAAACAG TTTGATGCAG TGCTGCCAGG CTACACCCTC AAGAGCCAGC 13560
TGGACATCTT GAGCCTTCTC AAGCAGGAAA GTCCTGTCCC AGAGCTGCCC ACCCAACACT 13620
GCTACATCCA CAACGTCTCC AACCTGGATG TCCGTCAGCT CTCTGCCCCG CCCCCTGAAG 13680
AGCCTTCCCC ACCCCCTTCA CCCTCAGCTC CTTCCCCCAC CAGCCCTCCA GCCGAACCTT 13740
CGGCTCCGTC ACCTCCACCC CTGGTTCCCT CACCTCCGAT GCCTGAGGCG GCTCGCCCCA 13800
AGCCCAGAGC ACGGCCCCCT GAAGAGGGTG AGGATTCTCG ACCTCCCCGG CTAAAGAAAT 13860
GGAAAGGGCT TCGCTGGAAA CGGTTACGCC TGCTGCTGAC GATCCAGAAG GGCAGTGGGC 13920
GACGGGAGGG TGAACGGGAG GTAGCTGAAT TCATGGAGCA ATTGGGCACA GCCCTCCGAC 13980
CTGACAAAGT GCCCCGTGAC CTTCGGCGCT GCTGCTTCTG CCATGAAGAG GGTGATGGCG 14040
CTACCGACGG GCCTGCCCGG CTCCTCAACC TGGATCTAGA CTTGTGGGTT CATCTCAACT 14100
GTGCCCTGTG GTCCACGGAG GTCTATGAAA CTCAGGGGGG GGCCCTGATG AATGTTGAGG 14160
GGGCCCTGCA CCGGGGGCTT CTGACCAAGT GCTCACTGTG CCAGCGTACG GGAGCCACCA 14220
ACAGCTGTAA CAGACTGCGC TGTCCCAGTG TCTACCACTT TGCCTGTGCC ATCCGGGCCA 14280
AGTGCATGTT CTTCAAGGAT AAGACTATGC TGTGCCCCCT GCACAAGCTG AAAGGGCCCT 14340
GTGAGCAGGA GCTGAGCTCT TTTGCCGTCT TTCGCCGTGT CTACATCGAG CGGGATGAGG 14400
TAAAGCAGAT AGCCAGCATC ATCCAACGTG GTGAGCGGCT CCACATGTTC CGGGTAGGGG 14460
GCCTGGTGTT CCACGCCATC GGGCAGCTTC TGCCCCACCA GATGGCCGAC TTCCACAGTG 14520
CTACAGCCCT CTACCCTGTT GGCTACGAGG CCACTCGCAT CTACTGGAGC TTACGCACCA 14580
ACAACCGCCG ATGCTGTTAC CGCTGCTCCA TCATTGAGAA CAATGGGCGG CCCGAGTTCA 14640
TCATCAAGGT CATGGAACAG GGCTTGGAGG ACCTGGTCTT CACTGATGCA TCCCCTCAGG 14700
CTGTGTGGAA TCGTATCATT GAGCCAGTGG CTGCCATGCG CAAGGAAGCC GACATGCTTC 14760
GGTTATTCCC GGAGTATCTT AAGGGAGAGG AGCTCTTTGG GTTGACAGTA CATGCTGTCC 14820
TACGGATTGC AGAATCGCTG CCTGGGGTGG AAAGCTGTCA CAACTACTTG TTTCGATATG 14880
GACGGCACCC ACTGATGGAA TTGCCACTTA TGATCAACCC TACTGGCTGT GCCCGCTCTG 14940
AGCCCAAAAT TCTCACTCAC TACAAACGGC CTCACACCCT CAACAGCACC AGCATGTCCA 15000
AGGCTTACCA GAGCACCTTC ACAGGTGAGA CCAACACCCC GTACAGCAAA CAGTTTGTGC 15060
ACTCCAAGTC ATCCCAGTAT CGGCGGCTGC GCACTGAGTG GAAGAACAAC GTCTACCTGG 15120
CTCGATCTCG AATTCAGGGC CTGGGGCTGT ATGCTGCCAA GGACCTGGAG AAGCACACAA 15180
TGGTCATCGA GTACATTGGC ACCATCATCC GCAATGAGGT GGCCAACCGT CGTGAGAAGA 15240
TCTACGAGGA GCAGAATCGA GGTATCTATA TGTTTCGGAT AAACAATGAG CATGTGATTG 15300
ATGCCACACT GACTGGAGGG CCTGCCAGGT ACATAAACCA TTCATGTGCC CCCAACTGTG 15360
TGGCAGAGGT TGTGACCTTT GATAAAGAGG ATAAGATCAT TATCATCTCT AGCCGGCGCA 15420
TCCCCAAAGG AGAGGAGCTG ACCTATGACT ATCAGTTTGA CTTTGAGGAT GATCAGCACA 15480
AAATCCCCTG CCATTGTGGA GCCTGGAACT GCCGGAAGTG GATGAACTAA 15531
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Orc-0058 ENSOCUP00000005384.2 Oryctolagus cuniculus 75 0.0 3982
WERAM-Paa-0006 ENSPANP00000012073.1 Papio anubis 75 0.0 3948
WERAM-Caf-0097 ENSCAFP00000012833.4 Canis familiaris 80 0.0 3173
WERAM-Aim-0081 ENSAMEP00000007326.1 Ailuropoda melanoleuca 80 0.0 3156
WERAM-Otg-0138 ENSOGAP00000012066.2 Otolemur garnettii 80 0.0 3154
WERAM-Mup-0167 ENSMPUP00000014520.1 Mustela putorius furo 80 0.0 3143
WERAM-Fec-0002 ENSFCAP00000000087.3 Felis catus 79 0.0 3141
WERAM-Pat-0041 ENSPTRP00000041051.3 Pan troglodytes 82 0.0 3121
WERAM-Gog-0092 ENSGGOP00000007801.2 Gorilla gorilla 82 0.0 3121
WERAM-Poa-0043 ENSPPYP00000005112.2 Pongo abelii 82 0.0 3119
WERAM-Mam-0144 ENSMMUP00000020643.2 Macaca mulatta 82 0.0 3118
WERAM-Nol-0193 ENSNLEP00000021682.1 Nomascus leucogenys 82 0.0 3118
WERAM-Hos-0186 ENSP00000301067.7 Homo sapiens 82 0.0 3114
WERAM-Sus-0005 ENSSSCP00000000202.2 Sus scrofa 79 0.0 3105
WERAM-Chs-0070 ENSCSAP00000002475.1 Chlorocebus sabaeus 82 0.0 3093
WERAM-Myl-0127 ENSMLUP00000010258.2 Myotis lucifugus 81 0.0 3093
WERAM-Bot-0131 ENSBTAP00000019193.5 Bos taurus 81 0.0 3090
WERAM-Ict-0043 ENSSTOP00000003541.2 Ictidomys tridecemlineatus 81 0.0 3081
WERAM-Ptv-0040 ENSPVAP00000004387.1 Pteropus vampyrus 81 0.0 3080
WERAM-Cap-0033 ENSCPOP00000002700.2 Cavia porcellus 80 0.0 3078
WERAM-Loa-0099 ENSLAFP00000008377.4 Loxodonta africana 79 0.0 3073
WERAM-Ran-0263 ENSRNOP00000069442.1 Rattus norvegicus 79 0.0 2963
WERAM-Ocp-0133 ENSOPRP00000014011.2 Ochotona princeps 78 0.0 2953
WERAM-Prc-0037 ENSPCAP00000003628.1 Procavia capensis 76 0.0 2904
WERAM-Mum-0185 ENSMUSP00000135941.2 Mus musculus 82 0.0 1994
WERAM-Tut-0057 ENSTTRP00000004493.1 Tursiops truncatus 82 0.0 1740
WERAM-Mae-0064 ENSMEUP00000005938.1 Macropus eugenii 96 0.0 1645
WERAM-Ect-0018 ENSETEP00000001138.1 Echinops telfairi 76 0.0 1626
WERAM-Dio-0062 ENSDORP00000006099.1 Dipodomys ordii 75 0.0 1536
WERAM-Dan-0166 ENSDNOP00000021359.1 Dasypus novemcinctus 81 0.0 1502
WERAM-Ova-0206 ENSOARP00000020382.1 Ovis aries 82 0.0 1458
WERAM-Mod-0206 ENSMODP00000040832.1 Monodelphis domestica 93 0.0 1447
WERAM-Tas-0007 ENSTSYP00000001044.1 Tarsius syrichta 74 0.0 1402
WERAM-Mim-0155 ENSMICP00000015981.1 Microcebus murinus 83 0.0 1348
WERAM-Ere-0080 ENSEEUP00000007438.1 Erinaceus europaeus 86 0.0 1276
WERAM-Fia-0003 ENSFALP00000000206.1 Ficedula albicollis 66 0.0 1201
WERAM-Caj-0221 ENSCJAP00000038788.2 Callithrix jacchus 97 0.0 1108
WERAM-Eqc-0140 ENSECAP00000015455.1 Equus caballus 97 0.0 1098
WERAM-Anc-0164 ENSACAP00000015233.2 Anolis carolinensis 94 0.0 1083
WERAM-Tag-0186 ENSTGUP00000016261.1 Taeniopygia guttata 94 0.0 1077
WERAM-Lac-0190 ENSLACP00000021616.1 Latimeria chalumnae 89 0.0 1057
WERAM-Leo-0073 ENSLOCP00000009304.1 Lepisosteus oculatus 85 0.0 994
WERAM-Xet-0077 ENSXETP00000024426.3 Xenopus tropicalis 88 0.0 990
WERAM-Dar-0080 ENSDARP00000053862.6 Danio rerio 83 0.0 984
WERAM-Asm-0140 ENSAMXP00000013406.1 Astyanax mexicanus 83 0.0 966
WERAM-Orn-0146 ENSONIP00000015272.1 Oreochromis niloticus 81 0.0 960
WERAM-Orla-0074 ENSORLP00000009504.1 Oryzias latipes 81 0.0 955
WERAM-Tar-0144 ENSTRUP00000031120.1 Takifugu rubripes 81 0.0 953
WERAM-Ten-0184 ENSTNIP00000018122.1 Tetraodon nigroviridis 81 0.0 952
WERAM-Xim-0135 ENSXMAP00000011152.1 Xiphophorus maculatus 81 0.0 948
WERAM-Pof-0076 ENSPFOP00000007738.1 Poecilia formosa 80 0.0 947
WERAM-Gaa-0092 ENSGACP00000011950.1 Gasterosteus aculeatus 80 0.0 944
WERAM-Gam-0044 ENSGMOP00000004922.1 Gadus morhua 81 0.0 916
WERAM-Ora-0001 ENSOANP00000000271.1 Ornithorhynchus anatinus 93 0.0 906
WERAM-Gaga-0065 ENSGALP00000010110.4 Gallus gallus 76 0.0 889
WERAM-Anp-0025 ENSAPLP00000003226.1 Anas platyrhynchos 76 0.0 885
WERAM-Pes-0170 ENSPSIP00000020005.1 Pelodiscus sinensis 75 0.0 882
WERAM-Pem-0016 ENSPMAP00000002311.1 Petromyzon marinus 70 0.0 815
WERAM-Meg-0048 ENSMGAP00000004625.2 Meleagris gallopavo 74 0.0 727
WERAM-Cii-0007 ENSCINP00000003156.3 Ciona intestinalis 57 3e-176 619
WERAM-Drm-0022 FBpp0070347 Drosophila melanogaster 52 4e-172 605
WERAM-Vip-0002 ENSVPAP00000000152.1 Vicugna pacos 100 1e-164 580
WERAM-Cis-0001 ENSCSAVP00000000096.1 Ciona savignyi 50 1e-147 523
WERAM-Soa-0040 ENSSARP00000003919.1 Sorex araneus 70 4e-99 362
WERAM-Cae-0045 T12D8.1 Caenorhabditis elegans 37 6e-97 355
WERAM-Chh-0008 ENSCHOP00000000593.1 Choloepus hoffmanni 61 3e-92 340
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 35 1e-44 181
Created Date 25-Jun-2016