WERAM Information


Tag Content
WERAM ID WERAM-Mod-0116
Ensembl Protein ID ENSMODP00000016853.4
Gene Name KMT2A
Ensembl Information
Ensembl Gene ID        Ensembl Transcript ID  Ensembl Protein ID
ENSMODG00000013474.4   ENSMODT00000017166.4   ENSMODP00000016853.4
Status Unreviewed
Classification
Type       Family  E-value   Score  Start  End
HMT        SET1    8.70e-50  167.9  3432   3547
Me_Reader  PHD     6.20e-13  48.9   1032   1577
Organism Monodelphis domestica
Domain Profile
  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88
                            ++ v++s+i+g+gl++k++i+++e+viEY+G+virs ++dkrek+ye+k+ig+y+fr+d++   vvdat++gn+arfinhscepNc+
  ENSMODP00000016853.4 3432 AVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYESKGIGCYMFRIDDS--EVVDATMHGNAARFINHSCEPNCY 3516
                            58899********************************************************..************************ PP

              SET1.txt   89 akvvavdgekkiviyakraIekgeeltydYk 119
                            ++v+++dg+k+ivi+a+r+I +geeltydYk
  ENSMODP00000016853.4 3517 SRVINIDGQKHIVIFAMRKIYRGEELTYDYK 3547
                            ******************************7 PP

  Me_Reader PHD

               PHD.txt    3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.kswyCpsCk 51  
+C++C +++ e+v+C+ C + fH C++ ++++l+++ ++w+C++Ck
ENSMODP00000016853.4 1032 VCFLCASSGHV--EFVYCQVCCEPFHKFCLEENERPLEDQlENWCCRRCK 1079
8****554444..59******************6666655778******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslp.egks.wyCpsCke 52
++C vCg++++ +k++++C++C++ +H +C++++ + p ++k+ w+C +C++
ENSMODP00000016853.4 1079 KFCHVCGRQHQATKQLLECNKCRNSYHPECLGPNYPTKPtKKKKvWICTKCVR 1131
59****988888888*******************88888444336******86 PP
PHD.txt 2 tiClvCgkddegeke...mvqCdeCddwfHlkCvklp......lsslpegkswyCpsCk 51
++C++C+k+++++++ m+qC +Cd+w+H kC +l+ ls+lpe+ +++C +C+
ENSMODP00000016853.4 1166 NFCPLCDKCYDDDDYeskMMQCGKCDRWVHSKCENLSdemyeiLSNLPESVAYTCVNCT 1224
789999877666555566*******************9**99999***9999******7 PP
PHD.txt 24 ddwfHlkCv 32
++w H++C
ENSMODP00000016853.4 1495 NEWTHVNCA 1503
479999997 PP
PHD.txt 3 iClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCk 51
C C+k+++ + +C + C +H C + ++k+ yC++++
ENSMODP00000016853.4 1531 RCEYCQKPGAT---VGCCLTsCTSNYHFMCSRAKNCVFLDDKKVYCQRHR 1577
59999666665...4466558*************5555577899**9997 PP
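The Start and End values in the Classification table appear to be 1-based, inclusive residue positions on the protein (the SET1 alignment above runs from 3432 to 3547). The following is a minimal Python sketch of extracting a domain by such coordinates. The function name `slice_domain` and its `offset` parameter are hypothetical, introduced only for illustration; `tail` holds the last 151 residues (3421 to 3571) of the protein sequence listed in this record.

```python
def slice_domain(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    `offset` is the 1-based position of seq[0] in the full protein, so a
    fragment of the sequence can be sliced with full-length coordinates.
    """
    if start < offset or end < start or end - offset >= len(seq):
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[start - offset:end - offset + 1]


# Residues 3421-3571 of ENSMODP00000016853.4, copied from this record.
tail = (
    "RFRHLKKTSKEAVGVYRSPIHGRGLFCKRNIDAGEMVIEYAGNVIRSIQTDKREKYYESK"
    "GIGCYMFRIDDSEVVDATMHGNAARFINHSCEPNCYSRVINIDGQKHIVIFAMRKIYRGE"
    "ELTYDYKFPIEDASNKLPCNCGAKKCRKFLN"
)

# SET1 domain coordinates from the Classification table.
set1 = slice_domain(tail, 3432, 3547, offset=3421)
print(len(set1), set1[:11], set1[-7:])
```

The slice begins with AVGVYRSPIHG and ends with ELTYDYK, which matches the first and last residues of the target rows in the SET1 alignment.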

Protein Sequence
MPVVSAISSR IIKTPRRFIE DEDYDPPIKI ARLESTPNSR FSATSCGSSE KSSAASQHSS 60
QMSSDSSRSS SPSVDTSTDS QASEEMQALS EERSNTPDVH APIPISQSPE NDNSDRRSRR 120
FSISERSFGS RASKKLSALQ SAPQQQTSSS PPPPLLTPPP PLQPASSIPD HTPWLMPPTI 180
PLASPFLPAS AAPIQEKRKS ILREPTFRWT SLKHSRSEPQ YFSSAKYAKE GLIRKPIFDN 240
FRPPPLTPED VGFASGFSAS GSTAPAQLFS ALHSGTRFDM HKRSPLLRAP RFTPSEAHSR 300
IFESVTLPSN RTSAGTSSTG VSNRKRKRRV FSPIRSEPRS PSHSMRTRSG RLSTSELSPL 360
TPPSSVSSSL SISVSPLATS ALNPTFAFPS HSLTQSGESS EKNQRPRKQT STPAEPFSSS 420
SPTPLFPWFT HTSQTERGRN KDRATEELSK ERDVDKSMEK DKNRERDRER EKENKRESRK 480
EKRKKGSEIQ SSSALFPIGR VSKEKVVEDV ATSSSAKKAA GRKKSSSLDP GTDIATVALG 540
DTTAVKTKIL VKKGKGSLDK NNLDLAPAAP SLEKEKALCL SASSSSTVKH STSSIGSMLA 600
QADKLPVADK RVASLLRKAK AQLYKIEKSK TLKQADQPKA QGQESDSSET SVRGPRIKHV 660
CRRAAVALGR KRAVFPDDMP TLSALPWEER EKILSSMGND DKSSIAGSED AEPLAPPIKP 720
IKPVTRNKAP QEPPVKKGRR SRRCGQCPGC QVPEDCGVCT NCLDKPKFGG RNIKKQCCKM 780
RKCQNLQWMP SKAYLQKQAK AVKKKEKKSK TSEKKESNVV KNLVDPSQKT TPAREDHAPK 840
KSSEPPPRKP VEEKNEDGNV SAPGADSKQS STSARKTGKQ ASQPVQVIPP QPPSTASLKK 900
EAPKTTPNEP KKKQQPPPPP PPPPPPPPPP PPPPPESVPE QSKPKKVAPR PSIPVKQKPK 960
EKEKPPPVSK QENGTLNLLN TLSNGNSSKQ KLPADGVHRI RVDFKEDCEV ENVWEMGGLG 1020
ILTSVPITPR VVCFLCASSG HVEFVYCQVC CEPFHKFCLE ENERPLEDQL ENWCCRRCKF 1080
CHVCGRQHQA TKQLLECNKC RNSYHPECLG PNYPTKPTKK KKVWICTKCV RCKSCGSTTP 1140
GKGWDAQWSH DFSLCHDCAK LFAKGNFCPL CDKCYDDDDY ESKMMQCGKC DRWVHSKCEN 1200
LSDEMYEILS NLPESVAYTC VNCTVRHPAE WRLALEKELQ ISLKQVLTAL LNSRTTSHLL 1260
RYRQAAKPPD LNPETEESIP SRSSPEGPDP PILTEVSKQE EQLPLDLEGV KRKMDQGSYT 1320
SVLEFSDDIV KIIQAAINSD GGQPEVKKAN SMVKSFFIRQ MERVFPWFSV KKSRFWEPNK 1380
VTSNSGMLPN AVLPPSLDHN YAQWQEREEN SHTEQPPLMK KIIPAPKPKG PGEPDSPIPL 1440
HPPTPPILNT DRSREDSPEL NPPPGVEDNR QCALCLTYGD DNANDAGRLL YIGQNEWTHV 1500
NCALWSAEVF EDDDGSLKNV HMAVIRGKQL RCEYCQKPGA TVGCCLTSCT SNYHFMCSRA 1560
KNCVFLDDKK VYCQRHRDLI KGEVVPENGF EVLRRVFVDF EGISLRRKFL SGLEPENIHM 1620
MIGSMTIDCL GILNDLSDCE DKLFPIGYQC SRVYWSTTDA RKRCVYTCKI VECRPPVVEP 1680
DINSTVEHDE NRTIAHSPTS LTEIPPRDIL NTAEMINPPS PDRPPHSHTS SSCFYHVISK 1740
APRIRMPSYS PTQRSPGCRP LPSAGSPTPT THEIVTVGDP LLSSGLRSIG SRRHSTSSLS 1800
PQRSKFRIMS PMRAGNSYSH HSVSSISGIG VSTDHDSSIK TIDHFLGSLN PSTPNTLGQN 1860
TSSSSQRTLV TVGTKATNVD GPPPSEMKHI SVADLSSKSS SLKGEKSKML NSKGSEGSTH 1920
ILAYPKPVPQ AHNTMSGEVN VSKMGTFVEP SSVSFSSKEA LSFPPLPLRG QKKERDQHTN 1980
SSQPENPSPG EDTETKALKT PGMNSRSIAN EQITSSSRDR RQKGKKSGKE SFKEKHSIKS 2040
FLDPGQVMAG EEGSLKPEFV NQILTSEHIS QRSCNNISSE KSGDKILPIS GGIKAPSVQL 2100
EGPAKESQTS RKRTVKVTLT PLKMESESPS KNTLKEIIPG SPSQGMESAT LAESSSTSES 2160
PGDGSVAQPS PNDPSSQESQ SNTYSNLPVQ DRNLMLQDGT KPQEDSSYKR RYPRRSARAR 2220
SNMFFGLTPL YGVRSYGEED IPFFSSSSGK KRGKRSAEGQ VDGADDLSTS DEDDLYYYNF 2280
TRTVISSSGE ERLGSHNLFR EEEQCELPKI SQLDGVDDGT ESDTSVTTTA RKVSQLPKRN 2340
GKENGTENLK LDRPEDSGEK EHVIKSSSGH KTNEPKIDNC HSVSRVKTQG QDSLEAQLSS 2400
LESGRRVHTS TPSDKNLLDT YNTELLKSDS DNNNSDDCGN ILPSDIMDFV LKNTPSMQAL 2460
GESPESSSSE LLTLGEGLGL DSNRGKDMGL FEVFSQQLPT AEPVDSSVSS SISAEEQFEL 2520
PLELPSDLSV LTTRSPTVPS QNPNRLAVIS DTGEKRVTIT EKSVASTESD SALLSPGVDP 2580
TPEGHMTPDH FIQGHMDTDH IASPPCGSVE QGHGNNQDLT RNSNTPSLQV PVSPTVPLQN 2640
QKYVPSSTDS PGPSQISNAA VQTTPPHMKP ATEKLLVVNQ NMQPLYVLQT LPNGVTQKIQ 2700
LTPSVSSAPN VMETNTSVLG PMGSGLTLAT GLNPSLPTSQ SLFPSASKGL LPMTHHQHLH 2760
SFPAATQSSF PPNINNPTSS LLIGVQPPPD PQLLVSETNQ RTDLNTTATN PPPGLKKRPI 2820
SRLHSRKNKK LAPSSTSSSI APSDMVSNMT LINFTPSQLS NHPNLLDLGT LGNTTSHRTV 2880
PNIIKRSKSG IMYFEQAPLL PQSVGGAASS AVGASTIGPD TSHLTAGPVS GLASGSSVLN 2940
VVSMQATTAP TTGGSVPGHV LGQGSVTLTS PRLLGAPDIG SISNLLIKAS QQSLGLQEQP 3000
ITLPPGSGMF PQLGTSQTPS TAAMTAASSI CVLPSTQTVG MTVAPSSNEP EGSYQLQHMT 3060
QLLASKSGIL PSQLDITSAS GNQLSSFPQL VDVPNTGLEQ NKTSSSVMHA SSASPGGSPS 3120
SGQQSASSSV LGPTKSRPKV KRIQLPLDKG NGKKHKVSHM RTSSSEAHIP DPEANSTSLT 3180
SVTGTPGSKS DVQDTTNMDQ SSQKDCGQSI RQMTAIPEEP PTQNSTNEQD SSEPKVTEEE 3240
ESNFSSPLMF WLQQEQKRKE SIGEKKPKKG LVFEISSDDG FQICAESIED AWKSLTDKVQ 3300
EARSNARLKQ LSFAGVNGLK MLGILHDAVV FLIEQLSGAK HCRNYKFRFH KPEEANEPPL 3360
NPHGSARAEV HLRKSAFDMF NFLASKHRQP PEYNPNDEEE EEVQLKSARR ATSMDLPMPM 3420
RFRHLKKTSK EAVGVYRSPI HGRGLFCKRN IDAGEMVIEY AGNVIRSIQT DKREKYYESK 3480
GIGCYMFRID DSEVVDATMH GNAARFINHS CEPNCYSRVI NIDGQKHIVI FAMRKIYRGE 3540
ELTYDYKFPI EDASNKLPCN CGAKKCRKFL N 3571
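The sequence above is laid out in groups of ten residues with a running count at the end of each line, which most tools will not accept directly. A minimal Python sketch of turning such a block into a plain FASTA record follows; the function names `parse_numbered_block` and `to_fasta`, and the header text, are hypothetical choices for illustration. Only the first two lines of the protein sequence are used as sample input.

```python
def parse_numbered_block(text):
    """Join space-separated residue groups, dropping each line's trailing count."""
    parts = []
    for line in text.strip().splitlines():
        tokens = line.split()
        # The last token on each line is the running residue count, not sequence.
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        parts.extend(tokens)
    return "".join(parts)


def to_fasta(header, seq, width=60):
    """Format a sequence as a FASTA record wrapped at `width` characters."""
    body = [seq[i:i + width] for i in range(0, len(seq), width)]
    return "\n".join([">" + header] + body)


block = """
MPVVSAISSR IIKTPRRFIE DEDYDPPIKI ARLESTPNSR FSATSCGSSE KSSAASQHSS 60
QMSSDSSRSS SPSVDTSTDS QASEEMQALS EERSNTPDVH APIPISQSPE NDNSDRRSRR 120
"""

seq = parse_numbered_block(block)
record = to_fasta("ENSMODP00000016853.4 KMT2A (first 120 residues)", seq)
print(record)
```

Applied to the whole block, the parsed length can be compared with the final running count (3571 for the protein) as a quick integrity check.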
Nucleotide Sequence
CTAACAACAC AGATCCCATG TAGTTGGAGA ACCAAAGGCC TCATACATGA CAAAAAGACT 60
GAACCGTTCA GGTTACTTGC ATGGAGTTGG TGCTTAAATG ATGAGCAGTT CTTAGGTTTT 120
GGCTCAGATG AAGAAGTCAA AGTGCGAAGT CCCACAAGGT CTCCTTCAGT TAAATCTAGT 180
CCTCGAAAAC CTCGTGGGAG ACCCAGAAGC ATTTCTGACC GAAATTCTAC TATCCTGTCA 240
GATTCTTCAC CTGTGTTTTC CCCTCTAAAC AAACCAGAAA CTAAATCTGG AGAGAAAATA 300
AAGAAGAAAG ATTCTAAAAG TGGAGAAAAG AGAAGAGGAA GACCTCCAAC CCTTAGCAGT 360
GTCAAATTCA AATTATCACA AGGAAAGGAC ATATCGGACT TATCAAAGGG GAACAAAGAA 420
GATACCTTAA AAAAAATTAA AAGGACACCG TCTGCTACAT TTCAGCAAGC AGCAAAAATA 480
AAAAAGTTGA GAGCTGGCAA ACTCTCCCCT CTCAAGTCTA AGTTTAAAGC AGGAAAGCTT 540
CAAATTGGGA GAAAAGGGGT ACAGATTGTC CGACGGAGAG GAAGGCCTCC ATCAGCAGAA 600
AGGATAAAGA CAGATTCAGT ACCGCTCATT AGCTCTCAGC TGGAAAAGCC CCAAAGGGTC 660
CGGAAAGAGA AGGATGGTAC ACCACCACTC ACAAAAGAAG AAAAGACGGC TGTCAGACAG 720
AGCCCTCGAA GGATTAAGCC TGTTAGGATT ATTCCTTCCA CCAAAAGGAC AGATGCAACA 780
ATTGCTAAGC AACTCTTGCA GAGGGCAAAA AAAGGAGCTC AAAAAAAGAT TGAGAAAGAA 840
GCAGCTCAGC TGCAAGGAAG AAAGGTGAAA ACACAGGTCA AAAATATCCG ACAGTTCATC 900
ATGCCCGTTG TCAGCGCTAT CTCCTCACGG ATCATTAAAA CCCCTCGGCG GTTTATTGAG 960
GATGAAGATT ATGACCCTCC AATTAAAATT GCTCGACTAG AGTCTACCCC AAACAGTAGA 1020
TTCAGTGCCA CATCCTGTGG GTCCTCTGAA AAATCAAGTG CAGCTTCTCA ACACTCTTCT 1080
CAGATGTCTT CAGACTCTTC TCGATCTAGC AGTCCTAGTG TTGATACATC CACAGATTCT 1140
CAGGCCTCTG AGGAGATGCA GGCACTTTCT GAAGAGCGGA GCAATACTCC AGATGTGCAT 1200
GCTCCAATAC CTATTTCTCA ATCCCCAGAA AATGATAACA GTGATAGGAG GAGTAGAAGG 1260
TTTTCAATCT CAGAAAGAAG CTTTGGGTCT AGAGCTTCTA AGAAATTATC AGCCTTGCAA 1320
AGTGCCCCCC AGCAACAAAC CTCTTCCTCT CCACCTCCAC CATTACTCAC TCCACCACCA 1380
CCACTACAAC CCGCTTCCAG TATTCCTGAC CACACACCTT GGCTTATGCC TCCAACAATC 1440
CCCTTAGCAT CACCTTTTTT GCCTGCCTCT GCTGCTCCCA TACAAGAGAA ACGAAAGTCT 1500
ATTTTACGAG AACCAACATT CAGGTGGACC TCTCTAAAGC ATTCTCGGTC AGAGCCACAG 1560
TACTTTTCCT CAGCGAAGTA TGCCAAAGAA GGTCTTATCC GAAAACCCAT ATTTGATAAC 1620
TTTCGACCCC CTCCACTGAC TCCTGAGGAT GTTGGTTTTG CTTCTGGTTT TTCAGCATCT 1680
GGTTCCACTG CTCCAGCCCA GCTATTTTCA GCACTCCATT CTGGAACTAG GTTTGATATG 1740
CACAAAAGAA GTCCTCTTCT AAGAGCTCCC AGATTCACTC CAAGTGAGGC CCACTCCAGA 1800
ATCTTTGAGT CAGTAACTTT GCCTAGTAAT CGAACATCTG CTGGAACTTC ATCTACTGGG 1860
GTATCTAATA GAAAAAGGAA AAGAAGGGTG TTCAGCCCTA TCAGATCTGA ACCAAGATCT 1920
CCTTCTCATT CCATGAGGAC AAGAAGTGGA AGGCTTAGTA CTTCTGAGCT ATCACCTCTC 1980
ACCCCACCAT CTTCTGTCTC TTCCTCATTA AGCATTTCTG TTAGCCCTCT TGCCACTAGT 2040
GCCTTAAACC CAACTTTTGC TTTTCCTTCT CATTCCCTGA CACAGTCTGG GGAATCTTCG 2100
GAGAAAAATC AGAGACCAAG GAAGCAGACT AGCACTCCAG CAGAGCCATT CTCATCCAGT 2160
AGTCCTACTC CTCTCTTCCC CTGGTTCACG CATACCTCTC AGACAGAAAG AGGGAGAAAC 2220
AAAGACAGGG CCACTGAGGA ACTGTCCAAA GAGCGAGATG TTGACAAGAG TATGGAGAAG 2280
GACAAAAACA GAGAGAGAGA CCGGGAGAGG GAAAAAGAAA ACAAGCGGGA ATCAAGGAAA 2340
GAAAAAAGGA AAAAGGGGTC AGAAATTCAG AGTAGCTCTG CTTTGTTTCC CATAGGTAGA 2400
GTTTCCAAAG AGAAGGTTGT TGAAGATGTT GCCACTTCAT CTTCCGCCAA AAAAGCTGCA 2460
GGGCGGAAGA AATCCTCATC ACTTGATCCT GGGACAGACA TTGCCACTGT AGCTCTTGGG 2520
GATACAACAG CTGTCAAAAC CAAAATACTT GTTAAGAAAG GGAAAGGGAG CCTTGATAAA 2580
AACAATTTGG ACCTTGCCCC TGCTGCACCA TCCCTGGAGA AGGAGAAAGC ACTCTGTCTT 2640
TCTGCTTCTT CATCAAGCAC TGTTAAACAT TCCACTTCCT CCATTGGCTC CATGTTGGCT 2700
CAGGCAGACA AACTTCCGGT GGCTGACAAA AGAGTAGCCA GCCTTTTAAG AAAGGCCAAA 2760
GCCCAGCTTT ATAAGATTGA GAAAAGCAAG ACTCTCAAGC AAGCTGACCA GCCCAAAGCA 2820
CAGGGTCAAG AAAGTGATTC ATCAGAGACT TCTGTTCGAG GACCCCGGAT AAAGCATGTT 2880
TGTAGAAGGG CTGCTGTTGC CCTTGGCCGA AAGCGAGCAG TATTTCCTGA TGACATGCCC 2940
ACCCTGAGTG CCTTACCATG GGAAGAACGA GAAAAGATTT TGTCTTCCAT GGGAAATGAT 3000
GACAAGTCAT CAATTGCCGG CTCAGAAGAT GCAGAACCTC TTGCTCCACC CATTAAGCCA 3060
ATTAAACCTG TCACCAGGAA CAAGGCACCT CAAGAACCTC CAGTGAAGAA AGGACGACGA 3120
TCCAGGAGAT GTGGGCAGTG CCCGGGCTGT CAGGTGCCTG AAGACTGTGG TGTCTGTACT 3180
AATTGTTTAG ATAAACCCAA ATTTGGTGGC CGCAACATAA AGAAGCAATG CTGCAAGATG 3240
AGGAAATGCC AGAACTTGCA ATGGATGCCT TCTAAAGCCT ACCTGCAGAA GCAAGCTAAA 3300
GCTGTGAAAA AGAAAGAGAA GAAGTCCAAG ACCAGTGAAA AGAAAGAGAG CAATGTTGTA 3360
AAAAATCTTG TGGACCCTAG CCAGAAAACA ACACCAGCAA GAGAGGATCA TGCCCCAAAG 3420
AAAAGCAGTG AACCTCCCCC CCGAAAGCCT GTGGAAGAAA AGAATGAGGA TGGAAATGTG 3480
TCTGCTCCAG GGGCTGATTC TAAACAATCT AGTACTTCTG CTAGGAAGAC TGGCAAACAG 3540
GCCTCCCAGC CAGTGCAGGT CATTCCTCCA CAGCCACCTA GCACAGCATC ACTAAAAAAG 3600
GAAGCTCCCA AGACCACTCC TAATGAGCCC AAGAAAAAGC AGCAGCCACC CCCACCACCC 3660
CCACCACCTC CTCCACCACC TCCACCACCA CCTCCACCAC CTCCAGAATC AGTACCAGAG 3720
CAAAGCAAGC CGAAAAAAGT AGCTCCTCGC CCAAGTATTC CAGTGAAACA GAAACCAAAA 3780
GAAAAGGAAA AACCACCTCC AGTCAGTAAG CAAGAGAATG GCACTCTGAA TCTACTCAAC 3840
ACCCTCTCCA ATGGAAATAG TTCCAAGCAA AAGCTGCCAG CAGATGGAGT CCACAGAATC 3900
AGGGTAGACT TTAAGGAAGA CTGTGAAGTG GAAAATGTGT GGGAAATGGG TGGTTTGGGC 3960
ATCCTGACCT CAGTTCCTAT AACACCCAGA GTTGTTTGCT TTCTCTGTGC CAGCAGTGGA 4020
CATGTGGAGT TTGTATATTG CCAAGTCTGT TGTGAACCCT TCCATAAATT CTGTTTGGAG 4080
GAGAATGAAC GCCCTCTGGA GGACCAGTTG GAAAATTGGT GTTGTCGTCG CTGCAAGTTT 4140
TGTCATGTGT GTGGAAGACA ACACCAGGCC ACAAAGCAGC TGCTGGAGTG TAATAAGTGC 4200
CGAAACAGCT ATCACCCTGA GTGCCTGGGA CCAAACTACC CAACCAAACC CACCAAGAAA 4260
AAGAAAGTTT GGATCTGTAC CAAGTGTGTT CGATGCAAGA GCTGTGGATC TACAACTCCA 4320
GGCAAAGGAT GGGATGCACA GTGGTCTCAC GATTTTTCAC TGTGCCATGA TTGTGCCAAA 4380
CTCTTTGCTA AAGGAAACTT CTGCCCACTT TGTGACAAGT GTTATGATGA TGATGACTAT 4440
GAAAGCAAGA TGATGCAGTG TGGGAAATGT GATCGATGGG TTCACTCCAA GTGCGAAAAT 4500
CTTTCAGATG AAATGTATGA GATCCTCTCC AACCTGCCTG AGAGTGTGGC CTATACCTGT 4560
GTGAACTGTA CAGTGCGGCA TCCTGCTGAG TGGAGATTGG CTCTAGAAAA AGAGCTTCAG 4620
ATTTCTCTGA AACAGGTTTT GACAGCATTG TTGAATTCTC GAACAACTAG TCATTTACTC 4680
CGTTACAGAC AGGCTGCCAA GCCCCCTGAT TTAAATCCAG AGACAGAGGA AAGTATACCT 4740
TCTCGAAGCT CTCCAGAAGG ACCTGATCCC CCCATTCTGA CTGAGGTTAG CAAGCAGGAA 4800
GAACAGCTGC CTTTGGACCT GGAAGGAGTA AAGAGGAAAA TGGACCAAGG AAGTTACACC 4860
TCTGTGTTGG AATTCAGCGA TGATATTGTG AAGATTATTC AAGCAGCCAT CAATTCAGAT 4920
GGAGGGCAAC CAGAGGTTAA AAAAGCTAAT AGTATGGTCA AGTCCTTCTT TATTCGGCAA 4980
ATGGAACGTG TTTTTCCATG GTTCAGTGTC AAAAAGTCCA GGTTTTGGGA ACCAAATAAA 5040
GTTACAAGCA ACAGTGGAAT GTTACCAAAT GCAGTGCTAC CACCTTCACT TGACCATAAC 5100
TATGCTCAGT GGCAAGAGAG AGAAGAGAAC AGCCACACTG AGCAACCTCC TTTGATGAAG 5160
AAAATCATTC CAGCTCCTAA ACCCAAAGGC CCTGGAGAAC CAGATTCACC AATTCCTCTC 5220
CATCCTCCTA CACCACCAAT CTTAAATACT GACAGGAGTC GGGAAGACAG TCCTGAGCTG 5280
AACCCACCCC CAGGTGTAGA AGATAATCGA CAGTGTGCAT TATGCTTGAC GTATGGTGAT 5340
GATAACGCTA ATGATGCTGG CCGCTTGCTC TACATTGGTC AAAATGAGTG GACCCATGTG 5400
AATTGTGCTT TGTGGTCAGC AGAAGTGTTT GAAGATGATG ATGGGTCCCT AAAGAATGTG 5460
CACATGGCTG TGATCAGAGG CAAGCAATTG AGATGTGAGT ACTGCCAAAA GCCAGGGGCC 5520
ACTGTCGGTT GCTGCCTCAC ATCTTGCACT AGCAACTATC ACTTCATGTG TTCCCGAGCC 5580
AAGAATTGTG TCTTTCTGGA TGATAAAAAA GTTTATTGCC AGCGACATCG TGATTTGATA 5640
AAAGGAGAGG TGGTTCCTGA AAATGGATTT GAAGTTCTTA GAAGAGTATT TGTAGACTTT 5700
GAAGGCATCA GCTTGAGAAG GAAATTTCTT AGTGGCCTAG AACCAGAAAA TATCCACATG 5760
ATGATTGGCT CAATGACGAT TGATTGCTTA GGAATTCTAA ATGATCTTTC TGACTGTGAA 5820
GATAAGCTAT TTCCTATTGG ATATCAATGT TCCAGAGTGT ACTGGAGCAC CACAGATGCT 5880
CGAAAGCGGT GTGTGTATAC GTGCAAGATA GTAGAATGCC GTCCTCCAGT TGTAGAGCCA 5940
GATATCAACA GTACTGTTGA ACATGATGAG AATAGGACCA TTGCTCATAG TCCTACCTCT 6000
CTGACTGAAA TTCCACCAAG AGATATTCTC AATACAGCTG AAATGATAAA TCCTCCATCC 6060
CCAGACCGAC CTCCTCATTC ACATACCTCC AGCTCTTGTT TCTATCATGT CATCTCAAAA 6120
GCCCCTAGAA TCCGAATGCC CAGCTATTCT CCAACACAGA GATCCCCTGG TTGTCGTCCA 6180
TTACCTTCTG CAGGGAGCCC TACCCCAACT ACCCATGAGA TAGTCACAGT GGGAGACCCT 6240
TTACTCTCTT CTGGGCTTCG AAGTATTGGT TCCAGACGTC ACAGCACTTC CTCTCTGTCA 6300
CCCCAGCGGT CTAAATTCCG GATAATGTCC CCAATGAGAG CTGGGAATTC TTATTCCCAT 6360
CACAGTGTTT CCTCTATCTC TGGCATTGGA GTCTCCACTG ATCATGATTC AAGTATAAAA 6420
ACAATTGACC ATTTCCTAGG GTCATTGAAT CCAAGCACCC CAAATACTTT AGGGCAAAAC 6480
ACTTCTTCAA GTTCTCAAAG GACATTGGTT ACAGTGGGGA CTAAAGCAAC TAATGTAGAT 6540
GGACCTCCAC CTTCAGAGAT GAAACATATT AGTGTTGCAG ATTTGTCAAG TAAAAGCTCC 6600
TCCTTGAAAG GAGAGAAAAG TAAAATGTTG AATTCCAAGG GCTCAGAGGG ATCAACACAT 6660
ATCTTGGCTT ATCCTAAACC AGTCCCTCAG GCTCATAATA CGATGTCTGG AGAAGTAAAT 6720
GTCAGTAAAA TGGGCACTTT TGTTGAACCT TCTTCCGTGT CATTTTCTTC CAAAGAGGCC 6780
CTTTCCTTCC CTCCACTCCC TTTGAGGGGG CAAAAGAAGG AAAGAGACCA ACATACAAAC 6840
TCTTCCCAAC CAGAAAACCC TTCTCCAGGT GAAGACACTG AAACTAAAGC ATTGAAGACC 6900
CCAGGTATGA ACAGTAGATC TATTGCAAAT GAACAAATCA CATCTAGTTC TAGGGATAGA 6960
AGACAGAAAG GAAAAAAATC TGGGAAAGAA TCTTTCAAAG AGAAACATTC CATCAAATCT 7020
TTTTTAGATC CTGGTCAGGT GATGGCTGGT GAGGAAGGAA GCCTAAAACC AGAGTTTGTC 7080
AATCAGATTT TGACATCTGA GCATATAAGC CAGCGATCTT GTAATAATAT TTCTTCTGAA 7140
AAGAGTGGAG ATAAGATCCT TCCTATTTCA GGGGGTATCA AAGCTCCATC TGTACAACTG 7200
GAGGGACCAG CCAAGGAATC ACAGACATCT CGGAAACGCA CAGTTAAAGT AACCTTGACA 7260
CCATTGAAAA TGGAAAGTGA GAGTCCATCC AAAAATACAT TGAAAGAAAT TATTCCTGGT 7320
TCCCCATCCC AAGGAATGGA ATCAGCAACT TTAGCAGAAT CATCTTCAAC TTCAGAAAGC 7380
CCAGGAGATG GCTCAGTGGC ACAACCCAGT CCGAACGATC CTTCATCCCA AGAATCTCAG 7440
AGTAATACCT ATTCAAATCT TCCTGTCCAG GATAGAAATT TGATGCTCCA AGATGGCACC 7500
AAGCCTCAAG AAGACAGTTC CTACAAGAGG AGATATCCTC GCCGTAGTGC CCGAGCCCGC 7560
TCTAATATGT TTTTTGGGCT CACTCCTCTT TATGGAGTGA GGTCCTATGG GGAGGAAGAT 7620
ATTCCATTCT TTAGCAGTTC TTCAGGCAAG AAACGAGGAA AGAGATCTGC TGAAGGGCAG 7680
GTTGATGGAG CTGATGACTT AAGCACCTCA GATGAAGATG ATTTGTATTA CTACAACTTC 7740
ACCAGAACAG TGATTTCTTC AAGTGGGGAG GAGCGATTAG GATCTCATAA TTTATTTCGG 7800
GAGGAAGAGC AGTGTGAACT CCCTAAAATT TCACAGCTAG ATGGTGTAGA TGATGGGACA 7860
GAGAGTGATA CTAGTGTTAC AACCACAGCA AGGAAAGTCA GTCAGCTTCC CAAAAGAAAT 7920
GGAAAAGAAA ATGGGACAGA GAACTTAAAG CTTGATCGAC CTGAAGATTC TGGGGAAAAG 7980
GAACATGTCA TCAAGAGTTC ATCTGGTCAT AAAACTAATG AGCCAAAGAT AGATAATTGC 8040
CATTCTGTAA GCAGGGTGAA AACACAAGGA CAAGATTCCC TGGAGGCCCA GCTCAGTTCA 8100
TTGGAATCTG GCCGCAGGGT CCACACAAGT ACTCCTTCAG ATAAAAACTT ACTAGATACT 8160
TATAACACTG AACTTCTGAA ATCTGATTCT GACAACAATA ATAGTGATGA CTGTGGGAAC 8220
ATCTTACCCT CAGATATCAT GGACTTTGTG TTAAAGAATA CTCCATCCAT GCAGGCTTTA 8280
GGAGAAAGTC CAGAGTCATC TTCATCTGAA CTCCTGACTC TTGGGGAAGG GTTAGGTCTT 8340
GACAGCAACC GTGGAAAAGA TATGGGTCTC TTTGAAGTAT TTTCTCAGCA GCTACCAACA 8400
GCAGAGCCTG TGGATAGTAG TGTGTCATCC TCTATCTCAG CAGAGGAACA ATTTGAGTTG 8460
CCTTTGGAGC TCCCGTCTGA TCTCTCTGTC CTAACTACCC GAAGTCCCAC TGTCCCCAGC 8520
CAGAACCCAA ACAGACTGGC AGTAATCTCT GACACTGGGG AGAAGAGAGT GACAATCACA 8580
GAAAAATCTG TGGCCTCAAC TGAAAGTGAC TCAGCACTGT TGAGTCCAGG AGTAGATCCA 8640
ACCCCAGAAG GCCACATGAC CCCAGATCAT TTCATTCAAG GGCATATGGA TACAGACCAT 8700
ATTGCTAGCC CTCCTTGTGG TTCAGTAGAA CAAGGACATG GCAATAACCA GGACTTAACT 8760
AGAAATAGCA ACACCCCTAG CCTTCAGGTT CCTGTTTCTC CTACTGTTCC CCTTCAGAAT 8820
CAGAAATATG TTCCCAGTTC TACAGACAGC CCTGGCCCAT CTCAGATTTC AAATGCAGCT 8880
GTCCAGACAA CTCCACCTCA CATGAAACCA GCCACTGAAA AACTCCTTGT TGTCAATCAG 8940
AATATGCAAC CACTTTATGT TCTCCAAACT CTTCCAAATG GAGTGACTCA AAAAATACAA 9000
CTGACTCCCT CAGTTAGTTC TGCCCCCAAT GTGATGGAAA CCAACACTTC AGTGCTTGGG 9060
CCCATGGGTA GTGGTCTCAC TTTAGCCACA GGACTAAATC CAAGTTTGCC AACGTCTCAG 9120
TCTTTGTTCC CCTCTGCTAG CAAAGGATTG CTCCCCATGA CTCATCACCA GCACTTACAT 9180
TCCTTTCCTG CAGCTACTCA AAGTAGTTTC CCACCCAACA TCAATAATCC TACTTCAAGC 9240
CTGTTAATTG GTGTTCAGCC TCCTCCAGAT CCCCAACTTT TGGTTTCAGA AACTAATCAG 9300
AGGACAGACC TCAATACCAC TGCAACCAAT CCTCCCCCTG GGCTAAAGAA AAGGCCTATA 9360
TCTCGCCTAC ATTCACGAAA GAATAAGAAA CTTGCTCCTT CAAGCACTTC ATCATCTATT 9420
GCCCCTTCTG ATATGGTTTC TAACATGACT CTGATTAATT TTACACCCTC CCAGCTTTCA 9480
AATCATCCAA ATCTCTTAGA TTTGGGAACA CTTGGAAATA CCACCTCCCA CCGAACAGTC 9540
CCCAACATCA TCAAAAGGTC CAAGTCTGGT ATTATGTACT TTGAACAGGC ACCCCTGTTA 9600
CCACAAAGTG TGGGAGGAGC TGCTTCTTCA GCAGTTGGGG CATCAACAAT AGGCCCAGAT 9660
ACCAGCCACC TCACAGCAGG ACCTGTATCT GGTTTGGCAT CAGGTTCCTC TGTTCTTAAT 9720
GTTGTATCCA TGCAAGCCAC AACAGCCCCT ACCACTGGTG GGTCAGTTCC AGGGCATGTT 9780
TTGGGGCAAG GCTCAGTCAC ATTAACCAGC CCAAGGTTGC TTGGTGCCCC AGACATTGGC 9840
TCAATAAGCA ACCTCTTAAT CAAAGCCAGC CAACAGAGCC TGGGACTTCA GGAACAGCCC 9900
ATCACTTTGC CACCAGGATC AGGAATGTTT CCACAGCTGG GAACATCACA GACCCCCTCC 9960
ACTGCTGCAA TGACAGCTGC ATCCAGTATT TGTGTGCTTC CCTCGACCCA GACTGTGGGC 10020
ATGACAGTTG CCCCTTCATC TAATGAACCA GAAGGATCCT ATCAACTTCA GCACATGACC 10080
CAACTCCTTG CCAGTAAGTC TGGGATTCTC CCCTCTCAGT TGGATATTAC TTCAGCTTCT 10140
GGGAACCAAT TGTCAAGCTT TCCCCAGCTG GTTGATGTTC CCAACACAGG ACTAGAGCAG 10200
AACAAGACTT CATCCTCAGT TATGCATGCT AGCTCAGCCT CTCCTGGTGG TTCCCCATCT 10260
TCTGGTCAGC AGTCAGCAAG CAGTTCAGTG CTAGGCCCCA CTAAATCTAG GCCAAAAGTC 10320
AAACGGATTC AGCTGCCTTT GGACAAGGGG AATGGAAAGA AGCACAAGGT TTCCCACATG 10380
CGGACCAGTT CTTCTGAAGC ACACATTCCA GATCCAGAAG CCAACTCTAC ATCCCTGACC 10440
TCAGTGACAG GAACTCCAGG ATCAAAGTCA GATGTTCAGG ATACAACTAA CATGGATCAA 10500
TCATCACAGA AGGATTGTGG ACAGTCTATA AGGCAAATGA CTGCAATTCC AGAAGAACCA 10560
CCAACACAGA ATTCAACAAA TGAGCAAGAC AGTTCAGAAC CCAAAGTTAC TGAAGAAGAA 10620
GAAAGCAACT TCAGTTCTCC ACTGATGTTT TGGTTACAGC AAGAACAGAA GAGGAAGGAA 10680
AGCATTGGAG AGAAAAAGCC AAAGAAAGGG CTAGTTTTTG AAATATCAAG TGATGATGGT 10740
TTTCAAATTT GTGCAGAAAG CATTGAAGAT GCATGGAAAT CATTAACAGA TAAAGTTCAG 10800
GAAGCTCGGT CCAATGCTCG TCTAAAGCAG CTTTCATTTG CAGGTGTGAA TGGCTTGAAG 10860
ATGCTAGGGA TTCTTCATGA TGCAGTTGTG TTCCTAATTG AGCAGCTTTC TGGGGCCAAG 10920
CATTGTCGGA ATTATAAGTT TCGGTTCCAC AAACCAGAGG AGGCCAATGA ACCACCCCTT 10980
AATCCTCATG GTTCAGCCAG AGCTGAAGTC CACCTGAGGA AGTCAGCATT TGACATGTTT 11040
AACTTCCTGG CTTCTAAACA TAGACAGCCT CCTGAATATA ACCCCAATGA TGAGGAAGAA 11100
GAAGAAGTAC AGCTGAAATC AGCTCGAAGG GCGACTAGCA TGGATTTGCC CATGCCCATG 11160
CGTTTCCGGC ACTTAAAGAA GACTTCCAAG GAGGCAGTCG GGGTCTATAG GTCTCCCATC 11220
CATGGACGGG GTCTCTTCTG TAAGAGAAAC ATCGATGCAG GTGAGATGGT GATTGAGTAT 11280
GCAGGCAACG TCATCCGTTC CATACAGACT GACAAGCGAG AGAAGTATTA TGAGAGCAAG 11340
GGCATTGGCT GCTACATGTT CCGAATTGAT GACTCAGAAG TAGTGGATGC CACCATGCAT 11400
GGAAACGCTG CACGATTCAT CAACCATTCA TGCGAGCCCA ACTGCTATTC TCGGGTCATC 11460
AACATTGATG GGCAGAAGCA CATTGTCATA TTTGCAATGC GTAAGATCTA CCGTGGAGAA 11520
GAACTCACTT ATGACTATAA GTTCCCCATT GAGGATGCCA GCAATAAGCT GCCCTGCAAT 11580
TGTGGCGCTA AGAAATGCCG AAAGTTCTTA AACTAAAGTT GCTCTTTTTC TCATTCCCCC 11640
ATTACGGGAG TTGCAAGACC CAGGGTCATC CAAAGCAAAA CTGAAGGCTT TTTCTAGCAG 11700
CCAAGGGTTC CAGGATCAGA TGGGCTCAGT TGAGGGACCT CCATTATGTC TGAGCTCCCT 11760
TCCCTTGTGC CCCATCTTCA CATCATGCAG TGTGATCATA ATCCTGGTGA GAGGTGACGT 11820
CATGAAGAAA AGATTGGTGA TGGACTTTCT TCCTGGCACC TCTGGATTTC GCACACCAAT 11880
ATGAGGAGGG GAAGGGGTCC CCTAACAGAT TTGCCTGGAG AGAGCCTGTA AAGGGTTATA 11940
TCAGGAGAAT AGGGATTTTC CTGAGCTTCT CCCTAGGGGA AGTTGGTAAC TTGCTGCCAG 12000
CCTCTGCTTG GCCCATTTCC TAAGCACTGT TAGTGAGGTG GCCAGAGTGG CCAAGCAGGG 12060
TCAAGAGGAT AATATAGGTC AGGCACCTCA GAGTTGGCTG GCCAGGCCCA GCCTACATCC 12120
GTCTCTTGAT TGAAAGAAAA TATTTCATGG TTTCTCCTTA GTGCCCTCCC ACCTAAGACT 12180
TCATTTCTGA TTGGGAGGCA GGATTCCTAG CGCCTTTGTT GACCAAAAGG GCTGTTTGGG 12240
GTTGTGCCAA TGAATTACCA AACACTTGAG CCTGCCTATT GGCTTTGTAG TGGGAGTGTT 12300
ACCCCTGTAA GCCTTATCTC AGCCAGTTAC TTTTCTTGAC AGTAGGAGCA GCTTCCTTCC 12360
CTTTCCTTCT CCCCATTTTA CTTTTCTTTC TTTTCTCTTG GCTTCCAGCC CACCACTTTT 12420
CCATTCTTTC TGGGTGGTAG GTGAAATTGG CTCCCTATTT AAGGATCTGC ACCCCACTGG 12480
GCACAGTTTG TGCCTACAAA ACATCCTTTG AGCTAGTTAG CACTCCAGAA GGGGACATGG 12540
ACACAAGCCA CTAAAAATAG ATAGGTTTAG GAGGAGTCTT TACCAGCCCA AGCTCCAGTG 12600
GAAATGTTTT TTTTTAAAAA ATGCAGCATG GTGGTCTTTG AAGAGGGAAA ATTGTTTAAA 12660
TAGGATTTGA ATCATGCATC TTCCTGAGAA TTTCATATCC AAGTTGATGA TATCCACCCC 12720
CTTCTACCTC CCACCCCTTC TCCATTACTA GGATACAGTT ACTTTGATTT ATTCTTTGCT 12780
AAGACATAGG AAGGCCAAAT TCCTAAATGG TCAATCCCCA CTAAAAGCTA AAGGCAGAAC 12840
ACTAAGGTTA CAAGACCCTG AGGGAGTATG TTACCATGTG TTGAGGGAGC TCTACCAGAT 12900
GTGCAAAATC TATTGTCAGA GCAGATGAAA ATGACACTAC TGCCTAGGTT CCTTCTCTAG 12960
GACATGGTCA AAGCAGCATC AGACTAACTT CTCTCTCCTT CCCTTCTTTC CTCCCTCCAT 13020
TTCCCCTTCC CCAAGACAGT GTCCTGAATC TGTTAAATTA AGTCATTGGA TTTTACTCTG 13080
TTCTGTTTAC AGTTTAATAT TTAAGGTTTT ATAAATGTAA ATATTTTTTG TATATTTTTC 13140
TATGAGAAGC ACTTCATAGG GAGAAGCACT TATGACGAGG CTATTTTTTA AACCGTGGTA 13200
TTATCCTAAT TTAAAAGAAG ATCTGTTTTT AATAATTTTT TATTTTCATA GGATGAAGTT 13260
AGAGAAAATA TTCAGCTGTA CACGCAAAGT CTGATTTTTT TTTCTGTCCA ACTTCCCCCG 13320
CTCCCAACCC CTGCCCTCAG AAGGTATATT TTTGTTGTTG TTTAACGTGT AAGCTTGTTC 13380
ATGCCTTGTT GACTTAGGTA ACTGTTTCCT GGGTTGCTCC TAGATTATAG AGAAGGAAGG 13440
TCACCAAAAT ACAAGGTTCT CCCTTCTCTC CCTCCTCTAC AAAGCTCCCA GCAGGCTACA 13500
TTACTATGTT GGTACAATTC TTCTTTCATC CCCTTTCTAA AACACATTTC TTCTCTGATT 13560
CTAGCCCCAG GGCAATGGGA AGAGGCAGAT TTTTGTACTT CCCAAGTCTC TTCTACTTCA 13620
GAGGTAGAAA AATAAGAGCT TGGATGGCAG GCAACTTGAC ATTTTTCCAT CTCCATCTGC 13680
CTCATTTGTT GCCTCCATTT ATCTCTGCTT CCAGTATCCC CACAAAATCC TTATATCTGC 13740
AAAAAAGATC TTACTGACCC CCTAGACATA CTTCCAGGGT GGCCCAATGA AACTAAAGCA 13800
CAGTCTTAGA TCATTCACAG AGAAACAGGT TTTCTCTCTG TTTTGGTTAC TGGGCTTCAA 13860
AACAGTTCAG TCCAGCAGGA GATCAATAAT GGCTCTCAGT GGGACACTCT TCCTGGCTAC 13920
AAGAAGCCCA CCCTGTGGAA ACTTAACAAA GCAATATCAC TCAGCCCAGC GTACTCCACC 13980
CCAGGACCCT TGGTTCAGCC TGCGCCCTCC GACCTGGGCT CCCTTATCTC CTGTGACCTT 14040
AAAAACTTTG TCTGGTGGAT CTGCTGAAAC ACCATCACTT TCGCCAGCAC TGTGGACAGG 14100
ATGGCAAAGG ATACCTTGCG AGCATGGAGG GGGGAGCTAA CAGTGGGCTG AGCACAGTCC 14160
ACCCAGATCT CACCTCCCCT CTTTCTTCAG TCTTTGAGAA GATATGGTGA CTCAAAGTGT 14220
TCCACTAGTA TCTGATTTCT TTCTTATTGC ACTGTGTGAG GAGGTTTTTT TGTAAATCTT 14280
TTTTGTCCTT ATTTCAAAAC AAAACAACAA AAAAAAACCT TAGGCTGAAT TTATTACCGA 14340
AATGATTGAT GCACTGATGG GTCGGGGATT CACTCTTAAG AAAGAGACCC AAAGGCCAGT 14400
CAAGGATGGG GAGAACTCAG ATGAATAGAC CTGGTCACTG CCCTATTATT AGGCAGTGCT 14460
CTATTATAAA AGATCTCTTT CTCATTCCTT TCCCACCATT CCCTGGGTAT TAGGAGAAAA 14520
GTTACATCCT AACCCCAGCC TGCTTAAGAA AGCACCCCTG TGAGTGACTG CAGCTCATTC 14580
TTGAGCCCAT TGCCAAAATT CAACAGCTTG AGAATGAGAG GAGGGTCACT ACCAGAAGTA 14640
GGAGAGGAAG AGATGAGTAA TAAATGAATG GAAAATGCCT CTGAGTTCAT CTTGAAAAGT 14700
ATCCAGAGGA TTTTAGAACA GAAATTCATT CAGACTGCGC AAATAACATT GGGTCATTCA 14760
TTAGAATCTT TCAAGAAGCT TCCTTCAGAA ATTCCCAAAT CCCTGAGGAG TAATAATTAT 14820
GTCTCTGGTT AGTTCTCCAA GAGAGCCAAT TCACTCAAAT AGCTTTGCTG CTAGAACCTG 14880
TTGTGGCTGC ATGAACTGGT GGCCATTGCA GCCAGTGAAA ACTATTTCAT AACCAGAGCA 14940
GCTGCATGGC ACTGACACTT TGGCCAGGCC TTCTTGGTCC CTTTGCTTCT CCGTAACCAT 15000
CTACCACCAC ATCTCCAGCA CATCCCTTCC TGCTTTCAGT ACATTAATGA GTTTGGAGGA 15060
GAGAAGCCTT GGCAGCTACT ATTAATTTTT CAACCAGACT CGTTGACATT TGCAGGATCC 15120
AGTCCGCCAC CCTGCTTACC TCTGTCCCAT CCTGGCCAAT TCAGTGAGCA CTGTGCAGCT 15180
CCCTTAATTA AAAAAATAGA TTTTAAAAAT GAATGTATCT TTTTAAAGGA CTTCTGTTCA 15240
ATCACAAATA TCTGAAAATA CTAAAGGTCA AAACCTTGTT GGGTGTTTGA ATTTATTTGG 15300
GGGTGGGGGG AGGGAAAGGG AGATTTGTAG CAAACTTTTT TTCTCAAAGG AAAAGCGGGT 15360
CATTATAAAG GGCTGGGTGT AAACTGTTTC ATTTCCTTCA TTTCAAAGCA ATACAAGGTT 15420
ATTAAGCAGA TGGTTTTGTG CTGAATCATG AATGCCAGTC AAGTCTAGAC TACTTACTGG 15480
TTTCACACAC TCTGAAAACT TGCAAGATTT TTTTTGAATT TTCAAATAAC TATAAATATG 15540
ATATATATAG GAACTAATAT AGTAATGCAC CATGTAACAA AGCCTAGTTC AGTATCAGTC 15600
CATGGCTTTT AATTCTCTTA ACACTATAGA TAAAGATTGT GTTACAGTTG CTAGCAGAGG 15660
CAGGAAAATG TCAGTCTGGT GTTCCTCTGG GTCCCCAAAG GAAAAAAAAA ACAAAAACAA 15720
ACTGCAAGAG TATCATCTTA ATAGGGATTG CATCAACCCA GCCCACATCG GGTTGGAAAG 15780
AATTGCACAA ACCATGTTAC CTGGAGCTTC TCTGCTGAGG ATTTTTCTGC CCATGAATGT 15840
TACCAGTCAG TACCTGTACT TCTTGTTTCT CTATTTTTGG TTATGAATGT TGGGATTACC 15900
ACCTGCATTT AGGGGGAAAA ATTGTGTTCT GTGCTTTCCT GGTATCTTGT TCTGAGGTAC 15960
TCTAGTTTTG TCTATCAACC AAGAAAATAT ACTTGTGGTG TTTCTTTTAA TTGAACTTTC 16020
AACAGTCTCT TTAGTAAATA CAGGTAGTTG AATAATTGTT TCAAGAACTG AACAGATGAC 16080
AAACTTCTGT TCCAGAAATA AGACATTTCT TAACTTTATC ATGTATAACA GATCTTTTTG 16140
TTTTTATTTT TTCCTTGTGT TCTTCCAAAC TTCTGGTTTA GGGGGGGGGG GGAAGAAAAG 16200
AAAAAAAAAA GGAATGTGTC TAAAGTCCAT CAGTGTTAAC TCCCTGTGAC AGGGATGAAG 16260
GAAAATACTT TAATAGTTAA AAAAAATAAT AATGCTGAAA GCTCTCTACG AAAGACTGAA 16320
TGTAAAAGTA AAAAGTGTAT ATAGTTGTAA AAAAAAATGG AGTTTTTAAA CATGTTTATT 16380
TTCTATGCAC TTTTTTTTAT TTAAGTGATA GTTTAATTAA TAAACATGTC AAGTTTATTG 16440
CTGCAGACAG TTGGACTTTA TTTCCTTTGG TAAAGGGATG TTGGGGAGGG GGTGGTAAAT 16500
GACCTCAGGT CAAACTGCTT GTCATTTTTC CTTTTTCAGA AATGGAAATA ACTAAAAATA 16560
TTCCCTA 16568
Sequence Source Ensembl
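In the nucleotide sequence as listed, the coding region appears to start at position 901 (the line ending at 960 begins with ATG CCC GTT ..., matching MPVV...) and to end with a TAA stop codon at 11614-11616, which is consistent with 3571 codons. A minimal Python sketch for checking this with the standard genetic code is given below; `translate` and `CODON_TABLE` are hypothetical names, and the two test strings are copied from the lines ending at 960 and 11640.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna, to_stop=True):
    """Translate DNA in frame 0; unknown codons become 'X'."""
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3].upper(), "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGCCCGTTGTCAGCGCTATCTCCTCACGG"  # nucleotides 901-930
cds_end = "AAGTTCTTAAACTAA"                   # nucleotides 11602-11616

print(translate(cds_start))
print(translate(cds_end, to_stop=False))
```

The first call reproduces the opening ten residues of the protein sequence, and the second shows the final residues followed by the stop codon.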
Orthology
WERAM ID Ensembl Protein ID Species Identity (%) E-value Score
WERAM-Sah-0138 ENSSHAP00000014807.1 Sarcophilus harrisii 94 0.0 5588
WERAM-Eqc-0038 ENSECAP00000006426.1 Equus caballus 87 0.0 4096
WERAM-Ova-0104 ENSOARP00000010552.1 Ovis aries 86 0.0 4081
WERAM-Loa-0136 ENSLAFP00000012406.4 Loxodonta africana 87 0.0 4080
WERAM-Aim-0147 ENSAMEP00000013764.1 Ailuropoda melanoleuca 86 0.0 4072
WERAM-Bot-0163 ENSBTAP00000024084.5 Bos taurus 86 0.0 4069
WERAM-Caf-0134 ENSCAFP00000018720.5 Canis familiaris 86 0.0 4062
WERAM-Otg-0095 ENSOGAP00000008374.2 Otolemur garnettii 86 0.0 4060
WERAM-Mam-0038 ENSMMUP00000007251.2 Macaca mulatta 87 0.0 4060
WERAM-Chs-0032 ENSCSAP00000015460.1 Chlorocebus sabaeus 87 0.0 4054
WERAM-Paa-0065 ENSPANP00000009106.1 Papio anubis 87 0.0 4041
WERAM-Nol-0074 ENSNLEP00000008984.1 Nomascus leucogenys 86 0.0 4033
WERAM-Poa-0036 ENSPPYP00000004505.2 Pongo abelii 86 0.0 4030
WERAM-Hos-0096 ENSP00000374157.5 Homo sapiens 86 0.0 4016
WERAM-Pat-0034 ENSPTRP00000040970.5 Pan troglodytes 86 0.0 4008
WERAM-Cap-0070 ENSCPOP00000005242.2 Cavia porcellus 86 0.0 3984
WERAM-Tut-0049 ENSTTRP00000004041.1 Tursiops truncatus 85 0.0 3952
WERAM-Orc-0100 ENSOCUP00000008738.2 Oryctolagus cuniculus 85 0.0 3929
WERAM-Mum-0008 ENSMUSP00000110337.1 Mus musculus 83 0.0 3880
WERAM-Myl-0155 ENSMLUP00000012825.2 Myotis lucifugus 82 0.0 3862
WERAM-Prc-0053 ENSPCAP00000005189.1 Procavia capensis 83 0.0 3856
WERAM-Ran-0100 ENSRNOP00000020573.6 Rattus norvegicus 84 0.0 3855
WERAM-Tag-0002 ENSTGUP00000000072.1 Taeniopygia guttata 82 0.0 3783
WERAM-Gaga-0075 ENSGALP00000011008.4 Gallus gallus 82 0.0 3779
WERAM-Anp-0040 ENSAPLP00000004456.1 Anas platyrhynchos 83 0.0 3764
WERAM-Ptv-0056 ENSPVAP00000005910.1 Pteropus vampyrus 81 0.0 3745
WERAM-Pes-0058 ENSPSIP00000007945.1 Pelodiscus sinensis 82 0.0 3733
WERAM-Fia-0123 ENSFALP00000010386.1 Ficedula albicollis 80 0.0 3719
WERAM-Dio-0068 ENSDORP00000006787.1 Dipodomys ordii 80 0.0 3615
WERAM-Ocp-0107 ENSOPRP00000010334.2 Ochotona princeps 80 0.0 3585
WERAM-Ict-0127 ENSSTOP00000012923.2 Ictidomys tridecemlineatus 84 0.0 3499
WERAM-Meg-0028 ENSMGAP00000002448.2 Meleagris gallopavo 77 0.0 3493
WERAM-Gog-0013 ENSGGOP00000000936.2 Gorilla gorilla 85 0.0 3455
WERAM-Mae-0003 ENSMEUP00000000157.1 Macropus eugenii 91 0.0 3407
WERAM-Dan-0088 ENSDNOP00000008968.3 Dasypus novemcinctus 81 0.0 3051
WERAM-Soa-0133 ENSSARP00000013142.1 Sorex araneus 77 0.0 2982
WERAM-Lac-0176 ENSLACP00000020625.1 Latimeria chalumnae 68 0.0 2977
WERAM-Anc-0073 ENSACAP00000006937.3 Anolis carolinensis 66 0.0 2916
WERAM-Tas-0109 ENSTSYP00000011196.1 Tarsius syrichta 79 0.0 2855
WERAM-Chh-0055 ENSCHOP00000006343.1 Choloepus hoffmanni 87 0.0 2729
WERAM-Ect-0072 ENSETEP00000007880.1 Echinops telfairi 78 0.0 2617
WERAM-Mup-0053 ENSMPUP00000004604.1 Mustela putorius furo 87 0.0 2600
WERAM-Tub-0079 ENSTBEP00000009516.1 Tupaia belangeri 80 0.0 2571
WERAM-Vip-0054 ENSVPAP00000005197.1 Vicugna pacos 83 0.0 1811
WERAM-Ora-0035 ENSOANP00000005967.3 Ornithorhynchus anatinus 84 0.0 1390
WERAM-Leo-0025 ENSLOCP00000004893.1 Lepisosteus oculatus 77 0.0 1286
WERAM-Xet-0067 ENSXETP00000022279.3 Xenopus tropicalis 76 0.0 1222
WERAM-Ere-0135 ENSEEUP00000014133.1 Erinaceus europaeus 79 0.0 1174
WERAM-Orn-0070 ENSONIP00000007849.1 Oreochromis niloticus 71 0.0 1167
WERAM-Dar-0010 ENSDARP00000095298.3 Danio rerio 69 0.0 1160
WERAM-Gaa-0091 ENSGACP00000011913.1 Gasterosteus aculeatus 71 0.0 1159
WERAM-Pof-0188 ENSPFOP00000015902.2 Poecilia formosa 68 0.0 1122
WERAM-Ten-0223 ENSTNIP00000002397.1 Tetraodon nigroviridis 67 0.0 1095
WERAM-Xim-0097 ENSXMAP00000008788.1 Xiphophorus maculatus 65 0.0 1093
WERAM-Gam-0119 ENSGMOP00000012588.1 Gadus morhua 68 0.0 1080
WERAM-Mim-0138 ENSMICP00000013960.1 Microcebus murinus 79 0.0 960
WERAM-Orla-0075 ENSORLP00000009606.2 Oryzias latipes 45 0.0 880
WERAM-Tar-0190 ENSTRUP00000039006.1 Takifugu rubripes 45 0.0 842
WERAM-Fec-0070 ENSFCAP00000005933.3 Felis catus 50 0.0 804
WERAM-Sus-0021 ENSSSCP00000003118.2 Sus scrofa 49 0.0 802
WERAM-Caj-0107 ENSCJAP00000018740.2 Callithrix jacchus 50 0.0 797
WERAM-Asm-0037 ENSAMXP00000004791.1 Astyanax mexicanus 46 0.0 766
WERAM-Pem-0014 ENSPMAP00000002218.1 Petromyzon marinus 60 0.0 721
WERAM-Cis-0045 ENSCSAVP00000009955.1 Ciona savignyi 37 2e-140 499
WERAM-Cii-0034 ENSCINP00000025384.2 Ciona intestinalis 54 6e-101 368
WERAM-Drm-0010 FBpp0082406 Drosophila melanogaster 47 1e-88 327
WERAM-Cae-0021 C26E6.9a Caenorhabditis elegans 55 1e-42 175
WERAM-Ors-0112 OS12T0613200-02 Oryza sativa 55 1e-42 174
WERAM-Org-0116 ORGLA12G0159200.1 Oryza glaberrima 55 1e-42 174
WERAM-Php-0006 PP1S101_4V6.1 Physcomitrella patens 54 2e-42 174
WERAM-Tum-0027 CAZ85029 Tuber melanosporum 53 2e-42 174
WERAM-Sei-0078 Si021071m Setaria italica 55 3e-42 173
WERAM-Orbr-0127 OB12G25360.1 Oryza brachyantha 54 4e-42 173
WERAM-Brd-0092 BRADI4G01790.1 Brachypodium distachyon 55 5e-42 172
WERAM-Thc-0094 EOY15831 Theobroma cacao 56 8e-42 172
WERAM-Sol-0003 Solyc01g006880.2.1 Solanum lycopersicum 54 1e-41 171
WERAM-Prp-0018 EMJ21490 Prunus persica 55 2e-41 171
WERAM-Asn-0015 CADANIAP00003254 Aspergillus nidulans 53 2e-41 171
WERAM-Ast-0003 CADATEAP00001100 Aspergillus terreus 52 2e-41 170
WERAM-Asc-0034 CADACLAP00008186 Aspergillus clavatus 52 2e-41 170
WERAM-Asni-0037 CADANGAP00014055 Aspergillus niger 52 2e-41 170
WERAM-Aso-0006 CADAORAP00000676 Aspergillus oryzae 52 3e-41 170
WERAM-Coi-0035 EAS31778 Coccidioides immitis 52 3e-41 170
WERAM-Pot-0072 POPTR_0005s28130.1 Populus trichocarpa 55 3e-41 170
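Rows of the Orthology table are whitespace-separated, but species names span two or three words (for example Mustela putorius furo), so a fixed column split does not work. A minimal Python sketch that reads each row from both ends is shown below. The `OrthologRow` type and `parse_row` function are hypothetical names, and treating Identity as an integer percentage is an assumption based on the values listed.

```python
from typing import NamedTuple


class OrthologRow(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int      # assumed to be percent identity
    e_value: float
    score: int


def parse_row(line):
    """Parse one orthology row; the species name may contain spaces."""
    tokens = line.split()
    if len(tokens) < 6:
        raise ValueError("expected at least 6 fields: " + line)
    weram_id, protein_id = tokens[0], tokens[1]
    identity, e_value, score = tokens[-3:]
    species = " ".join(tokens[2:-3])
    return OrthologRow(
        weram_id, protein_id, species, int(identity), float(e_value), int(score)
    )


rows = [
    parse_row("WERAM-Mup-0053 ENSMPUP00000004604.1 Mustela putorius furo 87 0.0 2600"),
    parse_row("WERAM-Cis-0045 ENSCSAVP00000009955.1 Ciona savignyi 37 2e-140 499"),
]
for row in rows:
    print(row.species, row.identity, row.e_value, row.score)
```

Taking the two identifiers from the left and the three numeric fields from the right leaves whatever remains in the middle as the species name.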
Created Date 25-Jun-2016