WERAM Information


Tag Content
WERAM ID WERAM-Paa-0116
Ensembl Protein ID ENSPANP00000000470.1
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPANG00000016763.1 ENSPANT00000003649.1 ENSPANP00000000470.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.60e-52 176.7 1944 2060
Me_Reader PWWP 6.00e-33 112.9 323 1818
HMT SET1 2.10e-29 102.1 1944 2060
Me_Reader PHD 3.50e-12 46.4 1545 1751
Organism Papio anubis
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSPANP00000000470.1 1944 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2030
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSPANP00000000470.1 2031 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2060
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSPANP00000000470.1 323 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387
69********************98766665466888999******************99998775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSPANP00000000470.1 1758 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1818
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSPANP00000000470.1 1944 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2028
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSPANP00000000470.1 2029 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2060
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSPANP00000000470.1 1545 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1589
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSPANP00000000470.1 1592 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1639
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSPANP00000000470.1 1640 ICITCHAANPANVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1693
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSPANP00000000470.1 1710 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1751
7****..44443..9******************...****.*******85 PP

Protein Sequence
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA 60
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG 120
PTALAMKQEL SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI 180
EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ 240
RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFC QAKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP 360
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420
WEASVGLAEQ YDVPKGSKNR KCIPGSIKLD SEEDMLFEDC TNDPESEHDL LLNGCLKSLA 480
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHIQFEA HKDERRGKIP ENLGLNFISG 540
DISDTQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNGDAL LGLPEGALIS 600
KCSREKNKPQ RSLVCGSKVK LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIEHSSESDN 660
SVLEITDAFD RTENMLSMQK NEKIKYSRFA ATNTRVKAKQ KPLISNSHTD HLMGCTKSAE 720
PGTETSQVNL SDLKASTLVH KPRSDFTSDD LSPKFNMSSS ISSENSLIKG GAVNQALLHS 780
KSKQPKFRSI KCKHKENPVM VEPPVTNDEY SLKCCSSDTK GSPLASISKS GKVDGLKLLN 840
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLGEDVSDSG TSKPSKPLLF SSPSQNHIPI 900
EPDYKFSTLL MMLKDMHDSK TKEQRLMTAQ NLVSYRSPGR GDCSTNSPVG VSKVLVSGGS 960
THNSEKKGDG TQNSANLSPS GGDSALSGEL SASLPGLVSD KRDLPASGKS RSNCVTRRNC 1020
GRSKPSSKLR DAFSAQVVKN TVNRKALKTE RKRKLNQLSS VTLDAALQGD REHGGSLRGG 1080
AEDPSKEEPL QIMGHLPSED GDHFSDVHFD NKVKQSDPGK ISEKGPSFEN GKGPELDSVM 1140
NSENDELNGV NQVVPKKRWQ RLNQRRTKPR KRMNRFKEKE NSECAFGALL PSDPVQEGRD 1200
EFPEHRTSSA SILEEPLTDQ KHADCLDSVG PRLNVCDKSS ASIGDMEKEP GIPSLTPQAE 1260
LPEPAVRSEK KRLRKPSKWL LEYTEEYDQI FAPKKKQKKV QEQVHKVSSR CEEESLLARG 1320
RSSAQNKQVD ENSLISTKEE PPVLEREAPF LEGPLAQSEL GGGHAELPQL TLSVPVAPEV 1380
SPRPALESEE LLVKTPGNYE SKRQRKPTKK LLESNDLDPG FMPKKGDLGL SKKCYEAGHL 1440
ENGITESCAT SYSKDFGGVI GTSKIFDRPR KRKRQRHAAA KMQCKKVKND DSSKEIPSLE 1500
GELMPHRTAA SPKETVEEGV EHDSGMPASK KMQGERGGGA ALKENVCQNC EKLGELLLCE 1560
AQCCGAFHLE CLGLTEMPRG KFICNECRTG IHTCFVCKQS GEDVKRCLLP LCGKFYHEEC 1620
VQKYPPTVMQ NKGFRCSLHI CITCHAANPA NVSASKGRLM RCVRCPVAYH ANDFCLAAGS 1680
KILASNSIIC PNHFTPRRGC RNHEHVNVSW CFVCSEGGSL LCCDSCPAAF HRECLNIDIP 1740
EGNWYCNDCK AGKKPHYREI VWVKVGRYRW WPAEICHPRA VPSNIDKMRH DVGEFPVLFF 1800
GSNDYLWTHQ ARVFPYMEGD VSSKDKMGKG VDGTYKKALQ EAAARFEELK AQKELRQLQE 1860
DRKNDKKPPP YKHIKVNRPI GRVQIFTADL SEIPRCNCKA TDENPCGIDS ECINRMLLYE 1920
CHPTVCPAGG RCQNQCFSKR QYPEVEIFRT LQRGWGLRTK TDIKKGEFVN EYVGELIDEE 1980
ECRARIRYAQ EHDITNFYML TLDKDRIIDA GPKGNYARFM NHCCQPNCET QKWSVNGDTR 2040
VGLFALSDIK AGTELTFNYN LECLGNGKTV CKCGAPNCSG FLGVRPKNQP IATEEKSKKK 2100
KLEFWGKRWL VSGISSLNQG QCEGRSSSTC SDCSIAMAHV IYISFFLNFC FTSVFSGKWE 2160
CPWHQCDICG KEAASFCEMC PSSFCKQHRE GMLFISKLDG RLSCTEHDPC GPNPLEPGEI 2220
REYVPPPVPL PPGPSTHLAE QSTGMAAQAP KMSDKPPADT NQTLSLSKKA LAGTCQRPLL 2280
PERPLERTDS RSQPLDKVRD LAGSGTKSQS LVSSQRPLDR QPAVAGPRPQ LSDKPSPVTS 2340
PSSSPSVRSQ PLERPLGTAD PRLDKSIGAA SPRPQSLEKT PVPTGLRLPP PDRLLITSSP 2400
KPQTSDRPPD KPHASLSQRL PPPEKVLSAV VQTLVAKEKA LRPVDQNTQS KNRAALVMDL 2460
IDLTPRQKER AASPHEVTPQ ADEKMPVLES SSWPASKGLG HMPRAVEKGS VSDPLQTSGK 2520
VAAHSEDPWQ AVKSFTQARL LSQPPAKAFL YEPTTQASGR APAGTEQTPG PLSQVPGLVK 2580
QAKQMVGGQQ LPALAARSGQ SFRSLGKAPA SLPTEEKKLV TTEQSPWALG KASSRAGLWP 2640
IVAGQTLAQS CWSPGSTQTL AQTCWSLGRG QDPKPEQNTL PALNQAPSTH KCAESEQK 2698
Nucleotide Sequence
ATGGATCAGA CCTGTGAACT ACCCAGAAGA AATTGTCTGC TGCCCTTTTC CAATCCAGTG 60
AATTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CAATTTTTCT 120
GAGCCACTTA ATGGGTGTAC TATGCAGTTA TCGACTGTCA GTGGAACATC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTGTTACATT CCACTGCGGA GACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAGTATTT AAATGGGTCT GCTGATGGAT CAGAATCCTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGACGCCA ATTGTTTGCA CTTCTCTGAG TCCTGGTGGT 360
CCTACAGCAC TTGCTATGAA ACAGGAACTC TCTTGTAATA ACTCCCCTGA ACTCCAGGTA 420
AAAGTAACAA AGACTATCAA GAATGGCTTT CTGCACTTTG AGAATTTTAC TTGTGTGGAC 480
GATGCAGATG TAGATTCTGA AATGGACCCA GAACAGCCAG TCACAGAGGA TGAGAGTATA 540
GAGGAGATCT TTGAGGAAAC TCAGACCAAT GCCACCTGCA ATTATGAGAC TAAATCAGAG 600
AATGGTGTAA AAGTGGCCAT GGGAAGTGAA CAAGACAGCA CACCAGAGAG TAGACACGGT 660
GCAGTCAAAT CGCCATTCTT GCCATTAGCT CCTCAGACTG AAACACAGAA AAATAAGCAA 720
AGAAATGAAG TGGACGGCAG CAATGAAAAA GCAGCCCTTC TCCCAGCCCC CTTTTCACTA 780
GGAGATACAA ACATTACAAT AGAAGAGCAA TTAAACTCAA TAAATTTATC TTTTCAGGAT 840
GATCCAGATT CCAGTACCAG TACATTAGGA AACATGCTAG AATTACCTGG AACTTCATCA 900
TCATCTACTT CACAGGAATT GCCATTTTGT CAAGCTAAGA AAAAGTCTAC GCCACTGAAG 960
TATGAAGTTG GAGATCTCAT CTGGGCAAAA TTCAAGAGAC GCCCATGGTG GCCCTGCAGG 1020
ATTTGTTCTG ATCCGTTGAT TAATACACAT TCAAAAATGA AAGTTTCCAA CCGGAGGCCC 1080
TATCGGCAGT ACTACGTGGA GGCTTTTGGA GATCCTTCTG AGAGAGCCTG GGTGGCTGGA 1140
AAAGCAATCG TCATGTTTGA AGGCAGGCAT CAATTCGAAG AGCTACCTGT CCTTAGGAGA 1200
AGAGGGAAAC AGAAAGAAAA AGGATATAGG CATAAGGTTC CTCAGAAAAT TTTGAGTAAA 1260
TGGGAAGCCA GTGTTGGACT TGCAGAACAG TATGATGTTC CCAAGGGGTC AAAGAACCGA 1320
AAATGTATTC CTGGTTCAAT CAAGTTGGAC AGTGAGGAAG ATATGCTGTT CGAGGACTGC 1380
ACAAATGATC CTGAGTCAGA ACATGACCTG TTGCTTAATG GCTGCTTGAA ATCACTGGCT 1440
TTTGATTCTG AACATTCTGC AGATGAGAAG GAAAAGCCTT GTGCTAAATC TCGAGCCAGA 1500
AAGAGCTCTG ATAATCCAAA AAGGACTAGT GTGAAAAAGG GCCACATACA ATTTGAAGCA 1560
CATAAAGATG AACGGAGGGG AAAGATTCCA GAGAACCTTG GCCTAAACTT TATCTCTGGG 1620
GATATATCTG ACACGCAGGC CTCTAATGAA CTTTCCAGGA TAGCAAATAG CCTCACAGGG 1680
TCCAACACTG CCCCAGGAAG TTTTCTGTTT TCTTCCTGTG GAAAAAACAC TGCAAAGAAA 1740
GAATTTGAGA CTTCAAATGG TGACGCTTTA TTGGGCTTAC CTGAGGGTGC TTTGATCTCC 1800
AAGTGTTCTC GAGAGAAGAA TAAACCCCAA CGAAGTTTGG TGTGTGGTTC AAAAGTGAAG 1860
CTCTGCTATA TTGGAGCAGG TGATGAGGAA AAGCGAAGTG ATTCCATTAG TATCTGTACC 1920
ACTTCTGATG ATGGAAGCAG TGACCTGGAT CCCATAGAAC ACAGCTCAGA GTCTGATAAC 1980
AGTGTCCTTG AAATTACAGA TGCTTTTGAT AGAACAGAGA ACATGTTATC TATGCAGAAA 2040
AATGAAAAGA TAAAGTATTC TAGGTTTGCT GCCACAAACA CTAGGGTAAA AGCAAAACAG 2100
AAGCCTCTCA TTAGTAACTC ACATACAGAC CACTTAATGG GTTGTACTAA GAGTGCAGAA 2160
CCTGGAACTG AGACGTCTCA GGTTAATCTC TCTGACCTGA AGGCATCTAC TCTTGTTCAC 2220
AAACCCCGAT CAGATTTTAC AAGTGACGAT CTCTCTCCAA AATTCAACAT GTCATCAAGC 2280
ATATCCAGTG AGAACTCACT AATAAAGGGT GGGGCAGTAA ATCAAGCTCT ATTACATTCG 2340
AAAAGCAAAC AGCCCAAGTT CCGAAGTATA AAGTGCAAAC ACAAAGAAAA TCCAGTTATG 2400
GTAGAACCCC CAGTTACAAA TGATGAGTAC AGTTTGAAAT GCTGCTCTTC TGATACCAAA 2460
GGCTCTCCTT TGGCCAGCAT TTCTAAAAGT GGGAAAGTGG ATGGTCTAAA ACTACTGAAC 2520
AATATGCATG AGAAAACCAG GGATTCAAGT GACATAGAAA CTGCAGTGGT GAAACATGTT 2580
TTATCCGAGT TGAAGGAACT CTCTTACAGA TCCTTAGGTG AGGATGTCAG TGACTCTGGA 2640
ACATCAAAGC CATCAAAACC ATTACTTTTC TCTTCTCCTA GTCAGAATCA CATACCTATT 2700
GAACCAGACT ACAAATTCAG TACATTGCTA ATGATGTTGA AAGATATGCA TGATAGTAAG 2760
ACAAAGGAGC AGCGGTTGAT GACTGCTCAA AACCTGGTCT CTTACCGGAG TCCTGGTCGT 2820
GGGGACTGTT CTACTAATAG TCCTGTAGGA GTCTCTAAGG TTTTGGTTTC AGGAGGCTCC 2880
ACACACAATT CAGAGAAAAA GGGAGATGGC ACTCAGAACT CCGCCAATCT TAGCCCTAGT 2940
GGAGGTGACT CTGCATTGTC TGGGGAATTG TCTGCTTCGC TACCTGGCTT AGTGTCCGAC 3000
AAAAGAGATC TCCCTGCTTC TGGTAAAAGT CGTTCAAACT GTGTTACTAG GCGCAACTGT 3060
GGAAGATCAA AGCCTTCATC CAAATTGCGA GATGCTTTTT CAGCCCAAGT GGTAAAGAAC 3120
ACAGTGAATC GTAAAGCATT AAAGACCGAG CGCAAAAGAA AACTGAATCA GCTTTCAAGT 3180
GTGACTCTTG ATGCTGCACT GCAGGGAGAC CGAGAACATG GAGGTTCATT GAGAGGTGGG 3240
GCAGAAGATC CTAGTAAAGA GGAACCCCTT CAGATAATGG GCCACTTACC AAGTGAAGAT 3300
GGTGATCATT TTTCTGATGT GCATTTTGAT AACAAGGTTA AGCAATCTGA TCCTGGTAAA 3360
ATTTCTGAAA AAGGACCCTC TTTTGAAAAC GGAAAAGGCC CAGAGCTGGA CTCTGTAATG 3420
AACAGTGAGA ATGATGAACT CAATGGTGTA AATCAAGTGG TGCCTAAAAA GCGGTGGCAG 3480
CGTTTAAACC AAAGGCGCAC TAAACCTCGT AAGCGCATGA ACAGATTTAA AGAGAAAGAA 3540
AACTCTGAGT GTGCCTTTGG GGCCTTACTT CCTAGTGACC CTGTGCAGGA GGGGCGGGAT 3600
GAGTTTCCAG AGCATAGAAC TTCTTCAGCA AGCATACTTG AGGAACCACT GACAGATCAA 3660
AAGCATGCTG ATTGCTTAGA TTCAGTTGGG CCACGGTTAA ATGTTTGTGA TAAATCCAGT 3720
GCCAGCATTG GTGACATGGA AAAGGAGCCA GGAATTCCCA GTTTGACACC ACAGGCTGAG 3780
CTCCCTGAAC CAGCTGTACG GTCAGAGAAG AAACGCCTTA GGAAGCCAAG CAAGTGGCTT 3840
TTGGAATATA CAGAAGAATA TGATCAGATA TTTGCTCCTA AGAAAAAACA AAAGAAGGTA 3900
CAGGAGCAGG TGCACAAGGT AAGTTCCCGC TGTGAAGAGG AAAGCCTTCT AGCCCGAGGT 3960
CGATCTAGTG CTCAGAACAA GCAGGTGGAC GAGAATTCTT TGATTTCAAC CAAAGAAGAG 4020
CCTCCAGTTC TTGAAAGGGA GGCTCCGTTT TTGGAGGGCC CCTTGGCTCA GTCAGAACTT 4080
GGAGGTGGAC ATGCTGAGTT GCCGCAGCTG ACCTTGTCTG TGCCTGTGGC TCCGGAAGTC 4140
TCTCCACGGC CTGCCCTTGA GTCTGAGGAA TTGCTAGTTA AAACGCCAGG AAATTATGAA 4200
AGTAAACGTC AAAGAAAACC AACTAAGAAA CTTCTTGAAT CCAATGATTT AGACCCTGGA 4260
TTTATGCCCA AGAAGGGGGA CCTTGGCCTT TCTAAAAAGT GCTATGAAGC TGGTCACCTG 4320
GAGAATGGCA TAACTGAATC TTGTGCCACA TCTTATTCAA AAGATTTTGG TGGAGTCATA 4380
GGCACTTCCA AGATATTTGA CAGACCAAGG AAGCGAAAAC GACAGAGGCA TGCTGCAGCC 4440
AAGATGCAGT GTAAAAAAGT GAAAAATGAT GACTCGTCAA AAGAGATTCC AAGCTTAGAG 4500
GGAGAACTAA TGCCTCACAG GACGGCCGCA AGCCCCAAGG AGACTGTTGA GGAAGGTGTG 4560
GAACACGATT CTGGGATGCC TGCCTCTAAA AAAATGCAGG GTGAACGCGG TGGAGGAGCT 4620
GCACTCAAGG AGAATGTTTG TCAGAATTGT GAAAAATTGG GTGAGCTGCT GTTATGTGAG 4680
GCTCAGTGCT GCGGGGCTTT CCACCTGGAG TGCCTTGGAT TGACTGAGAT GCCAAGAGGA 4740
AAATTTATCT GCAATGAATG TCGCACAGGA ATCCATACCT GTTTTGTATG TAAGCAGAGT 4800
GGGGAAGATG TTAAAAGGTG CCTTCTACCC TTGTGTGGAA AGTTTTACCA TGAAGAGTGT 4860
GTCCAGAAGT ACCCACCCAC TGTTATGCAG AACAAGGGCT TCCGGTGCTC CCTCCACATC 4920
TGTATAACCT GTCATGCTGC TAATCCAGCC AATGTTTCTG CATCTAAAGG TCGATTGATG 4980
CGCTGTGTCC GCTGTCCTGT GGCATACCAC GCCAATGACT TTTGCCTGGC TGCTGGGTCA 5040
AAGATCCTTG CATCTAATAG TATCATCTGC CCTAATCACT TTACCCCTAG GCGAGGCTGC 5100
CGAAATCATG AGCATGTTAA TGTTAGCTGG TGCTTTGTGT GCTCAGAAGG AGGCAGCCTT 5160
CTGTGCTGTG ATTCTTGCCC TGCTGCTTTT CATCGTGAAT GCCTGAACAT TGATATCCCT 5220
GAAGGAAACT GGTATTGCAA TGACTGTAAG GCAGGCAAAA AGCCACACTA CAGGGAGATT 5280
GTCTGGGTAA AAGTTGGACG ATACAGGTGG TGGCCAGCTG AGATCTGCCA TCCTCGAGCT 5340
GTTCCTTCCA ATATTGATAA GATGAGACAT GATGTGGGAG AGTTCCCTGT CCTCTTTTTT 5400
GGATCTAATG ACTACTTGTG GACTCACCAG GCCCGAGTCT TCCCTTATAT GGAGGGTGAT 5460
GTGAGCAGCA AGGATAAGAT GGGCAAAGGA GTGGATGGGA CATATAAAAA AGCTCTTCAG 5520
GAAGCTGCAG CAAGGTTTGA GGAATTAAAG GCCCAAAAAG AGCTAAGACA GCTGCAGGAA 5580
GACCGAAAGA ATGACAAGAA ACCACCACCT TATAAACATA TAAAGGTGAA CCGTCCTATT 5640
GGCAGGGTAC AGATCTTCAC TGCAGACTTA TCTGAAATAC CCCGTTGCAA CTGTAAAGCT 5700
ACTGATGAGA ACCCCTGTGG GATAGACTCT GAATGCATCA ACCGCATGCT GCTCTATGAG 5760
TGCCACCCCA CAGTATGTCC TGCCGGAGGG CGCTGTCAAA ACCAGTGCTT TTCCAAGCGC 5820
CAATATCCAG AGGTTGAAAT TTTCCGCACG TTACAGCGGG GTTGGGGTCT ACGGACAAAA 5880
ACAGATATTA AAAAGGGTGA ATTTGTGAAT GAGTATGTGG GTGAGCTTAT AGATGAAGAA 5940
GAATGCAGAG CTCGAATTCG CTATGCTCAA GAACACGATA TCACTAATTT CTATATGCTC 6000
ACCCTAGACA AAGACCGAAT CATTGATGCT GGTCCCAAAG GAAACTATGC TCGGTTCATG 6060
AATCATTGCT GCCAGCCCAA CTGTGAAACA CAGAAGTGGT CTGTGAACGG AGATACCCGT 6120
GTAGGCCTTT TTGCGCTAAG TGACATTAAA GCAGGCACTG AACTTACCTT CAACTACAAC 6180
CTAGAATGTC TTGGGAATGG AAAGACTGTT TGCAAATGTG GAGCCCCGAA CTGCAGTGGC 6240
TTCTTGGGTG TAAGGCCAAA GAATCAACCC ATTGCCACGG AAGAAAAGTC AAAGAAAAAG 6300
AAGCTGGAAT TCTGGGGCAA GAGGTGGCTG GTGAGTGGCA TAAGCTCTCT GAACCAGGGG 6360
CAGTGTGAAG GAAGGTCATC ATCCACATGT TCGGACTGTA GCATAGCCAT GGCCCATGTG 6420
ATATATATCT CTTTTTTCCT AAACTTTTGT TTTACTTCTG TGTTTTCAGG GAAATGGGAA 6480
TGTCCGTGGC ATCAGTGTGA CATCTGTGGG AAGGAAGCAG CCTCCTTCTG TGAGATGTGC 6540
CCCAGCTCCT TTTGTAAGCA GCATCGAGAA GGGATGCTTT TCATTTCCAA ACTGGATGGG 6600
CGTCTGTCTT GTACTGAACA TGACCCCTGT GGGCCCAATC CTCTGGAACC TGGGGAGATC 6660
CGTGAGTATG TGCCTCCCCC AGTACCGCTG CCTCCAGGGC CAAGCACTCA CCTGGCAGAG 6720
CAATCAACAG GAATGGCTGC TCAGGCACCC AAAATGTCAG ATAAACCTCC TGCTGACACC 6780
AACCAGACGC TGTCGCTCTC CAAAAAAGCT CTGGCAGGGA CTTGTCAGAG GCCACTGCTA 6840
CCTGAAAGAC CTCTTGAGAG AACTGACTCC AGGTCCCAAC CTTTAGATAA GGTCAGAGAC 6900
CTTGCTGGGT CAGGGACCAA ATCCCAATCC TTGGTTTCCA GCCAGAGGCC ACTGGACAGG 6960
CAACCAGCAG TGGCAGGACC AAGACCCCAG CTAAGCGACA AACCCTCTCC AGTGACCAGC 7020
CCAAGCTCCT CACCCTCAGT CAGGTCCCAA CCACTGGAAA GACCTCTGGG GACGGCTGAC 7080
CCAAGGCTGG ATAAATCCAT AGGTGCTGCC AGCCCAAGGC CCCAGTCACT GGAGAAAACC 7140
CCAGTTCCCA CTGGCCTGAG ACTTCCGCCG CCAGACAGAC TGCTCATTAC CAGCAGTCCC 7200
AAACCCCAGA CTTCAGACAG GCCCCCTGAC AAACCCCATG CCTCTTTGTC CCAGAGACTC 7260
CCGCCTCCTG AGAAAGTACT GTCAGCTGTG GTCCAGACCC TTGTAGCTAA AGAAAAAGCA 7320
CTGAGGCCTG TGGACCAGAA TACTCAGTCA AAAAATAGAG CTGCTTTGGT GATGGATCTC 7380
ATAGACCTAA CTCCTCGCCA GAAGGAGCGG GCAGCTTCAC CTCATGAGGT CACACCACAG 7440
GCTGATGAGA AGATGCCAGT GTTGGAGTCG AGTTCATGGC CTGCCAGCAA AGGTCTGGGG 7500
CATATGCCGA GAGCTGTTGA GAAAGGCAGT GTGTCAGATC CTCTTCAGAC ATCTGGGAAA 7560
GTAGCAGCCC ATTCAGAGGA CCCCTGGCAA GCTGTTAAAT CATTCACCCA GGCCAGACTT 7620
CTTTCTCAGC CTCCTGCCAA GGCTTTTTTA TATGAGCCAA CAACTCAGGC CTCAGGAAGA 7680
GCTCCTGCAG GGACTGAGCA GACCCCAGGG CCTCTTAGCC AAGTCCCGGG CCTGGTGAAG 7740
CAGGCGAAGC AGATGGTCGG AGGCCAGCAA CTACCTGCAC TTGCCGCCAG GAGTGGGCAG 7800
TCCTTTAGGT CTCTCGGGAA GGCCCCAGCC TCCCTCCCCA CTGAAGAAAA GAAGTTGGTA 7860
ACCACAGAGC AAAGTCCCTG GGCCCTGGGA AAAGCCTCCT CACGGGCAGG GCTCTGGCCC 7920
ATAGTGGCTG GACAGACACT GGCACAGTCT TGCTGGTCTC CTGGGAGCAC ACAGACATTG 7980
GCACAGACTT GCTGGTCTCT TGGAAGAGGG CAAGACCCCA AACCAGAGCA AAATACACTT 8040
CCAGCTCTTA ACCAGGCTCC TTCCACTCAC AAGTGTGCAG AATCAGAACA GAAGTAA 8097
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 98 0.0 5133
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 96 0.0 5035
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 96 0.0 5020
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 90 0.0 4681
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 89 0.0 4672
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 89 0.0 4662
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 89 0.0 4626
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 88 0.0 4606
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 88 0.0 4585
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 97 0.0 4578
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 87 0.0 4520
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 95 0.0 4469
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 95 0.0 4459
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 85 0.0 4430
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 94 0.0 4305
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 81 0.0 4210
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 88 0.0 4092
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 88 0.0 4073
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 87 0.0 4060
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 88 0.0 4059
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 93 0.0 4049
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 87 0.0 3925
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 87 0.0 3904
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 88 0.0 3737
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 81 0.0 3702
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 84 0.0 3685
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 68 0.0 3429
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 67 0.0 2998
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 77 0.0 2649
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 86 0.0 2634
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 87 0.0 2565
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 81 0.0 2247
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 55 0.0 2189
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 55 0.0 2183
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 55 0.0 2165
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 55 0.0 2027
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 57 0.0 1994
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 79 0.0 1812
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 49 0.0 1702
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 89 0.0 1608
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 68 0.0 1540
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 54 0.0 1402
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 82 0.0 1331
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 75 0.0 1302
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 77 0.0 1293
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 84 0.0 1210
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 59 0.0 1154
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 71 0.0 1124
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 71 0.0 1123
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 81 0.0 1071
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 71 0.0 1056
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 82 0.0 1017
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 73 0.0 998
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 59 0.0 989
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 71 0.0 966
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 63 0.0 934
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 60 0.0 918
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 83 0.0 901
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 59 0.0 888
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 60 0.0 881
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 66 0.0 879
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 61 0.0 874
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 58 0.0 798
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 54 5e-177 620
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 4e-122 438
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 37 5e-56 218
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 39 2e-50 200
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 40 3e-50 199
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 40 4e-50 199
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 40 1e-49 197
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 5e-49 195
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 7e-49 195
Created Date 25-Jun-2016
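
The Protein Sequence and Nucleotide Sequence blocks in this record are printed in groups of ten characters, with a running position counter at the end of each line. The following is a minimal sketch, not part of WERAM itself, of how such lines could be joined back into a contiguous sequence and checked for consistency. The function names `join_blocks` and `cds_matches_protein` are hypothetical, and the check assumes the coding sequence carries exactly one stop codon after the last residue.

```python
def join_blocks(lines):
    """Concatenate sequence blocks, dropping the trailing position counter on each line."""
    parts = []
    for line in lines:
        tokens = line.split()
        # Assumption: a purely numeric last token is the running count, not sequence.
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        parts.extend(tokens)
    return "".join(parts)


def cds_matches_protein(cds, protein):
    """True when the CDS has three bases per residue plus one stop codon (assumed)."""
    return len(cds) == 3 * (len(protein) + 1)


# Demo on the first line of each sequence block from this record.
protein_head = join_blocks(
    ["MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA 60"]
)
cds_head = join_blocks(
    ["ATGGATCAGA CCTGTGAACT ACCCAGAAGA AATTGTCTGC TGCCCTTTTC CAATCCAGTG 60"]
)
print(len(protein_head), len(cds_head))
```

Passing every line of a block to `join_blocks`, rather than only the first, would give the full sequences, which `cds_matches_protein` can then compare.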