WERAM Information


Tag Content
WERAM ID WERAM-Pat-0143
Ensembl Protein ID ENSPTRP00000044805.4
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPTRG00000017575.5 ENSPTRT00000042784.4 ENSPTRP00000044805.4
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.50e-52 176.7 1944 2060
Me_Reader PWWP 5.80e-33 112.8 323 1818
HMT SET1 2.00e-29 102.1 1944 2060
Me_Reader PHD 1.50e-19 69.9 1545 2164
Organism Pan troglodytes
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSPTRP00000044805.4 1944 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2030
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSPTRP00000044805.4 2031 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2060
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSPTRP00000044805.4 323 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387
69********************98766665466888999******************99998775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSPTRP00000044805.4 1758 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1818
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSPTRP00000044805.4 1944 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2028
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSPTRP00000044805.4 2029 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2060
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSPTRP00000044805.4 1545 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1589
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSPTRP00000044805.4 1592 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1639
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSPTRP00000044805.4 1640 ICITCHAANPANVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1693
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSPTRP00000044805.4 1710 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1751
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSPTRP00000044805.4 2122 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2164
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STVSGTSQNA 60
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQTP IVCTSLSPGG 120
PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI 180
EEIFEETQTN ATCNYETKSE NGVKVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ 240
RNEVDGSNEK AALLPAPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP 360
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420
WEASVGLAEQ YDVPKGSKNR KCIPGSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA 480
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHIQFEA HKDERRGKIP ENLGLNFISG 540
DISDTQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNGDSL LGLPEGALIS 600
KCSREKNKPQ RSLVCGSKVK LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIEHSSESDN 660
SVLEIPDAFD RTENMLSMQK NEKIKYSRFA ATNTRVKAKQ KPLISNSHTD HLMGCTKSAE 720
PGTETSQVNL SDLKASTLVH KPQSDFTNDA LSPKFNMSSS ISSENSLIKG GAANQALLHS 780
KSKQPKFRSI KCKHKENPVM VEPPVINEEC SLKCCSSDTK GSPLASISKS GKVDGLKLLN 840
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLGEDVSDSG TSKPSKPLLF SSASSQNHIP 900
IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLVSYRSPG RGDCSTNSPV GVSKVLVSGG 960
STHNSEKKGD GTQNSANPSP SGGDSALSGE LSASLPGLVS DKRDLPASGK SRSDCVTRRN 1020
CGRSKPSSKL RDAFSAQMVK NTVNRKALKT ERKRKLNQLP SVTLDAVLQG DREHGGSLRG 1080
GAEDPSKEDP LQIMGHLTSE DGDHFSDVHF DSKVKQSDPG KISEKGLSFE NGKGPELDSV 1140
MNSENDELNG VNQVVPKKRW QRLNQRRTKP RKRMNRFKEK ENSECAFRVL LPSDPVQEGR 1200
DEFPEHRTPP SASILEEPLT EQNHADCLDS VGPRLNVCDK SSASIGDMEK EPGIPSLTPQ 1260
AELPEPAVRS EKKRLRKPSK WLLEYTEEYD QIFAPKKKQK KVQEQVHKVS SRCEEESLLA 1320
RGRSSAQNKQ VDENSLISTK EEPPVLEREA PFLEGPLAQS ELGGGHAELP QLTLSVPVAP 1380
EVSPRPALES EELLVKTPGN YESKRQRKPT KKLLESNDLD PGFMPKKGDL GLSKKCYEAG 1440
HLENGITESC ATSYSKDFGG GTTKIFDKPR KRKRQRHAAA KMQCKKVKND DSSKEIPGSE 1500
GELMPHRTAT SPKETVEEGV EHDPGMPASK KMQGERGGGA ALKENVCQNC EKLGELLLCE 1560
AQCCGAFHLE CLGLTEMPRG KFICNECRTG IHTCFVCKQS GEDVKRCLLP LCGKFYHEEC 1620
VQKYPPTVMQ NKGFRCSLHI CITCHAANPA NVSASKGRLM RCVRCPVAYH ANDFCLAAGS 1680
KILASNSIIC PNHFTPRRGC RNHEHVNVSW CFVCSEGGSL LCCDSCPAAF HRECLNIDIP 1740
EGNWYCNDCK AGKKPHYREI VWVKVGRYRW WPAEICHPRA VPSNIDKMRH DVGEFPVLFF 1800
GSNDYLWTHQ ARVFPYMEGD VSSKDKMGKG VDGTYKKALQ EAAARFEELK AQKELRQLQE 1860
DRKNDKKPPP YKHIKVNRPI GRVQIFTADL SEIPRCNCKA TDENPCGIDS ECINRMLLYE 1920
CHPTVCPAGG RCQNQCFSKR QYPEVEIFRT LQRGWGLRTK TDIKKGEFVN EYVGELIDEE 1980
ECRARIRYAQ EHDITNFYML TLDKDRIIDA GPKGNYARFM NHCCQPNCET QKWSVNGDTR 2040
VGLFALSDIK AGTELTFNYN LECLGNGKTV CKCGAPNCSG FLGVRPKNQP IATEEKSKKF 2100
KKKQQGKRRT QGEITKERED ECFSCGDAGQ LVSCKKPGCP KVYHADCLNL TKRPAGKWEC 2160
PWHQCDICGK EAASFCEMCP SSFCKQHREG MLFISKLDGR LSCTEHDPCG PNPLEPGEIR 2220
EYVPPPVPLP PGPSTHLAEQ STGMAAQAPK MSDKPPADTN QTLSLSKKAL AGTCQRPLLP 2280
ERPLERTDSR PQPLDKVRDL AGSGTKSQSL VSSQRPLDRP PAVAGPRPQL SDKPSPVTSP 2340
SSSPSVRSQP LERPLGTADP RLDKSIGAAS PRPQSLEKTP VPTGLRLPPP DRLLITSSPK 2400
PQTSDRPTDK PHASLSQRLP PPEKVLSAVV QTLVAKEKAL RPVDQNTQSK NRAALVMDLI 2460
DLTPRQKERA ASPHEVTPQA DEKMPVLESS SWPASKGLGH MPRAVEKGCV SDPLQTSGKA 2520
AAPSEDPWQA VKSLTQARLL SQPPAKAFLY EPTTQASGRA SAGAEQTPGP LSQSLGLVKQ 2580
AKQMVGGQQL PALAAKSGQS FRSLGKAPAS LPTEEKKLVT TEQSPWALGK ASSRAGLWPI 2640
VAGQTLAQSC WSAGSTQTLA QTCWSLGRGQ DPKPEQNTLP ALNQAPSSHK CAESEQK 2697
Nucleotide Sequence
(Fasta)
ATGGATCAGA CCTGTGAACT ACCCAGAAGA AATTGTCTGC TGCCCTTTTC CAATCCAGTG 60
AATTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CAATTTTTCT 120
GAGCCACTTA ATGGGTGTAC TATGCAGTTA TCGACTGTCA GTGGAACATC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTGTTACATT CCACTGCGGA GACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAGTATTT AAATGGGTCT GCTGATGGAT CAGAATCCTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGACGCCA ATTGTTTGCA CTTCCTTGAG TCCTGGTGGT 360
CCTACAGCAC TTGCTATGAA ACAGGAACCC TCTTGTAATA ACTCCCCTGA ACTCCAGGTA 420
AAAGTAACAA AGACTATCAA GAATGGCTTT CTGCACTTTG AGAATTTTAC TTGTGTGGAC 480
GATGCAGATG TAGATTCTGA AATGGACCCA GAACAGCCAG TCACAGAGGA TGAGAGTATA 540
GAGGAGATCT TTGAGGAAAC TCAGACCAAT GCCACCTGCA ATTATGAGAC TAAATCAGAG 600
AATGGTGTAA AAGTGGCCAT GGGAAGTGAA CAAGACAGCA CACCAGAGAG TAGACACGGT 660
GCAGTCAAAT CGCCATTCTT GCCATTAGCT CCTCAGACTG AAACACAGAA AAATAAGCAA 720
AGAAATGAAG TGGACGGCAG CAATGAAAAA GCAGCCCTTC TCCCAGCCCC CTTTTCACTA 780
GGAGACACAA ACATTACAAT AGAAGAGCAA TTAAACTCAA TAAATTTATC TTTTCAGGAT 840
GATCCAGATT CCAGTACCAG TACATTAGGA AACATGCTAG AATTACCTGG AACTTCATCA 900
TCATCTACTT CACAGGAATT GCCATTTTGT CAACCTAAGA AAAAGTCTAC GCCACTGAAG 960
TATGAAGTTG GAGATCTCAT CTGGGCAAAA TTCAAGAGAC GCCCATGGTG GCCCTGCAGG 1020
ATTTGTTCTG ATCCGTTGAT TAACACACAT TCAAAAATGA AAGTTTCCAA CCGGAGGCCC 1080
TATCGGCAGT ACTACGTGGA GGCTTTTGGA GATCCTTCTG AGAGAGCCTG GGTGGCTGGA 1140
AAAGCAATCG TCATGTTTGA AGGCAGACAT CAATTCGAAG AGCTACCTGT CCTTAGGAGA 1200
AGAGGGAAAC AGAAAGAAAA AGGATATAGG CATAAGGTTC CTCAGAAAAT TTTGAGTAAA 1260
TGGGAAGCCA GTGTTGGACT TGCAGAACAG TATGATGTTC CCAAGGGGTC AAAGAACCGA 1320
AAATGTATTC CTGGTTCAAT CAAGTTGGAC AGTGAAGAAG ATATGCCATT TGAAGACTGC 1380
ACAAATGATC CTGAGTCAGA ACATGATCTG TTGCTTAATG GCTGTTTGAA ATCACTGGCT 1440
TTTGATTCTG AACATTCTGC AGATGAGAAG GAAAAGCCTT GTGCTAAATC TCGAGCCAGA 1500
AAGAGCTCTG ATAATCCAAA AAGGACTAGT GTGAAAAAGG GCCACATACA ATTTGAAGCA 1560
CATAAAGATG AACGGAGGGG AAAGATTCCA GAGAACCTTG GCCTAAACTT TATCTCTGGG 1620
GATATATCTG ATACGCAGGC CTCTAATGAA CTTTCCAGGA TAGCAAATAG CCTCACAGGG 1680
TCCAACACTG CCCCAGGAAG TTTTCTGTTT TCTTCCTGTG GAAAAAACAC TGCAAAGAAA 1740
GAATTTGAGA CTTCAAATGG TGACTCTTTA TTGGGCTTGC CTGAGGGTGC TTTGATCTCA 1800
AAGTGTTCTC GAGAGAAGAA TAAACCCCAA CGAAGCCTGG TGTGTGGTTC AAAAGTGAAG 1860
CTCTGCTATA TTGGAGCAGG TGATGAGGAA AAGCGAAGTG ATTCCATTAG TATCTGTACC 1920
ACTTCTGATG ATGGAAGCAG TGACCTGGAT CCCATAGAAC ACAGCTCAGA GTCTGATAAC 1980
AGTGTCCTTG AAATTCCAGA TGCTTTCGAT AGAACAGAGA ACATGTTATC TATGCAGAAA 2040
AATGAAAAGA TAAAGTATTC TAGGTTTGCT GCCACAAACA CTAGGGTAAA AGCAAAACAG 2100
AAGCCTCTCA TTAGTAACTC ACATACAGAC CACTTAATGG GTTGTACTAA GAGTGCAGAG 2160
CCTGGAACTG AGACGTCTCA GGTTAATCTC TCTGATCTGA AGGCATCTAC TCTTGTTCAC 2220
AAACCCCAGT CAGATTTTAC AAATGATGCT CTCTCTCCAA AATTCAACAT GTCATCAAGC 2280
ATATCCAGTG AGAACTCGTT AATAAAGGGT GGGGCAGCAA ATCAAGCTCT ATTACATTCG 2340
AAAAGCAAAC AGCCCAAGTT CCGAAGTATA AAGTGCAAAC ACAAAGAAAA TCCAGTTATG 2400
GTAGAACCCC CAGTTATAAA TGAGGAGTGC AGTTTGAAAT GCTGCTCTTC TGATACCAAA 2460
GGCTCTCCTT TGGCCAGCAT TTCTAAAAGT GGGAAAGTGG ATGGTCTAAA ACTACTGAAC 2520
AATATGCATG AGAAAACCAG GGATTCAAGT GACATAGAAA CAGCAGTGGT GAAACATGTT 2580
TTATCCGAGT TGAAGGAACT CTCTTACAGA TCCTTAGGTG AGGATGTCAG TGACTCTGGA 2640
ACATCAAAGC CATCAAAACC ATTACTTTTC TCTTCTGCTT CTAGTCAGAA TCACATACCT 2700
ATTGAACCAG ACTACAAATT CAGTACATTG CTAATGATGT TGAAAGATAT GCATGATAGT 2760
AAGACGAAGG AGCAGCGGTT GATGACTGCT CAAAACCTGG TCTCTTACCG GAGTCCTGGT 2820
CGTGGGGACT GTTCTACTAA TAGTCCTGTA GGAGTCTCTA AGGTTTTGGT TTCAGGAGGC 2880
TCCACACACA ATTCAGAGAA AAAGGGAGAT GGCACTCAGA ACTCCGCCAA TCCTAGCCCT 2940
AGTGGGGGTG ACTCTGCATT ATCTGGCGAG TTGTCTGCTT CCCTACCTGG CTTAGTGTCC 3000
GACAAGAGAG ACCTCCCTGC TTCTGGTAAA AGTCGTTCAG ACTGTGTTAC TAGGCGCAAC 3060
TGTGGACGAT CAAAGCCTTC ATCCAAATTG CGAGATGCTT TTTCAGCCCA AATGGTAAAG 3120
AACACAGTGA ACCGTAAAGC CTTAAAGACC GAGCGCAAAA GAAAACTGAA TCAGCTTCCA 3180
AGTGTGACTC TTGATGCTGT ACTGCAGGGA GACCGAGAAC ATGGAGGTTC ATTGAGAGGT 3240
GGGGCAGAAG ATCCTAGTAA AGAGGATCCC CTTCAGATAA TGGGCCACTT AACAAGTGAA 3300
GATGGTGACC ATTTTTCTGA TGTGCATTTC GATAGCAAGG TTAAGCAATC TGATCCTGGT 3360
AAAATTTCTG AAAAAGGACT CTCTTTTGAA AACGGAAAAG GCCCAGAGCT GGACTCTGTA 3420
ATGAACAGTG AGAATGATGA ACTCAATGGT GTAAATCAAG TGGTGCCTAA AAAGCGGTGG 3480
CAGCGTTTAA ACCAAAGGCG CACTAAACCT CGTAAGCGCA TGAACAGATT TAAAGAGAAA 3540
GAAAACTCTG AGTGTGCCTT TAGGGTCTTA CTTCCTAGTG ACCCTGTGCA GGAGGGGCGG 3600
GATGAGTTTC CAGAGCATAG AACTCCTCCT TCAGCAAGCA TACTTGAGGA ACCACTGACA 3660
GAGCAAAATC ATGCTGACTG CTTAGATTCA GTTGGGCCAC GGTTAAATGT TTGTGATAAA 3720
TCCAGTGCCA GCATTGGTGA CATGGAAAAG GAGCCAGGAA TTCCCAGTTT GACACCACAG 3780
GCTGAGCTCC CTGAACCAGC TGTGCGGTCA GAGAAGAAAC GCCTTAGGAA GCCAAGCAAG 3840
TGGCTTTTGG AATATACAGA AGAATATGAT CAGATATTTG CTCCTAAGAA AAAACAAAAG 3900
AAGGTACAGG AGCAGGTGCA CAAGGTAAGT TCCCGCTGTG AAGAGGAAAG CCTTCTAGCC 3960
CGAGGTCGAT CTAGTGCTCA GAACAAGCAG GTGGACGAGA ATTCTTTGAT TTCAACCAAA 4020
GAAGAGCCTC CAGTTCTTGA AAGGGAGGCT CCGTTTTTGG AGGGCCCCTT GGCTCAGTCA 4080
GAACTTGGAG GTGGACATGC TGAGTTGCCG CAGCTGACCT TGTCTGTGCC TGTGGCTCCG 4140
GAAGTCTCTC CACGGCCTGC CCTTGAGTCT GAGGAATTGC TAGTTAAAAC GCCAGGAAAT 4200
TATGAAAGTA AACGTCAAAG AAAACCAACT AAGAAACTTC TTGAATCCAA TGATTTAGAC 4260
CCTGGATTTA TGCCCAAGAA GGGGGACCTT GGCCTTTCTA AAAAGTGCTA TGAAGCTGGT 4320
CACCTGGAGA ATGGCATAAC TGAATCTTGT GCCACATCTT ATTCAAAAGA TTTTGGTGGA 4380
GGCACTACCA AGATATTTGA CAAGCCAAGG AAGCGAAAAC GACAGAGGCA TGCTGCAGCC 4440
AAGATGCAGT GTAAAAAAGT GAAAAATGAT GACTCGTCAA AAGAGATTCC AGGCTCAGAG 4500
GGAGAACTAA TGCCTCACAG GACGGCCACA AGCCCCAAGG AGACTGTTGA GGAAGGTGTA 4560
GAACACGATC CCGGGATGCC TGCCTCTAAA AAAATGCAGG GTGAACGCGG TGGAGGAGCT 4620
GCACTCAAGG AGAATGTCTG TCAGAATTGT GAAAAATTGG GTGAGCTGCT GTTATGTGAG 4680
GCTCAGTGCT GTGGGGCTTT CCACCTGGAG TGCCTTGGAT TGACTGAGAT GCCAAGAGGA 4740
AAATTTATCT GCAATGAATG TCGCACAGGA ATCCATACCT GTTTTGTATG TAAGCAGAGT 4800
GGGGAAGATG TTAAAAGGTG CCTTCTACCC TTGTGTGGAA AGTTTTACCA TGAAGAGTGT 4860
GTCCAGAAGT ACCCACCCAC TGTTATGCAG AACAAGGGCT TCCGGTGCTC CCTCCACATC 4920
TGTATAACCT GTCATGCTGC TAATCCAGCC AATGTTTCTG CATCTAAAGG TCGGTTGATG 4980
CGCTGTGTCC GCTGTCCTGT GGCATACCAC GCCAATGACT TTTGCCTGGC TGCTGGGTCA 5040
AAGATCCTTG CATCTAATAG TATCATCTGC CCTAATCACT TTACCCCTAG GCGGGGCTGC 5100
CGAAATCATG AGCATGTTAA TGTTAGCTGG TGCTTTGTGT GCTCAGAAGG AGGCAGCCTT 5160
CTGTGCTGTG ATTCTTGCCC TGCTGCTTTT CATCGTGAAT GCCTGAACAT TGATATCCCT 5220
GAAGGAAACT GGTATTGCAA TGACTGTAAG GCAGGCAAAA AGCCACACTA CAGGGAGATT 5280
GTCTGGGTAA AAGTTGGACG ATACAGGTGG TGGCCAGCTG AGATCTGCCA TCCTCGAGCT 5340
GTTCCTTCCA ACATTGATAA GATGAGACAT GATGTGGGAG AGTTCCCAGT CCTCTTTTTT 5400
GGATCTAATG ACTATTTGTG GACTCACCAG GCCCGAGTCT TCCCTTACAT GGAGGGTGAC 5460
GTGAGCAGCA AGGATAAGAT GGGCAAAGGA GTGGATGGGA CATATAAAAA AGCTCTTCAG 5520
GAAGCTGCAG CAAGGTTTGA GGAATTAAAG GCCCAAAAAG AGCTAAGACA GCTGCAGGAA 5580
GACCGAAAGA ATGACAAGAA GCCACCACCT TATAAACATA TAAAGGTAAA CCGTCCTATT 5640
GGCAGGGTAC AGATCTTCAC TGCAGACTTA TCTGAAATAC CCCGTTGCAA CTGTAAAGCT 5700
ACTGATGAGA ACCCCTGTGG GATAGACTCT GAATGCATCA ACCGCATGCT GCTCTATGAG 5760
TGCCACCCCA CAGTGTGTCC TGCCGGAGGG CGCTGTCAAA ACCAGTGCTT TTCCAAGCGC 5820
CAATATCCAG AGGTTGAAAT TTTCCGCACA TTACAGCGGG GTTGGGGTCT ACGGACAAAA 5880
ACTGATATTA AAAAGGGTGA ATTTGTGAAT GAGTATGTGG GTGAGCTTAT AGATGAAGAA 5940
GAATGCAGAG CTCGAATTCG CTATGCTCAA GAACATGATA TCACTAATTT CTATATGCTC 6000
ACCCTAGACA AAGACCGAAT CATTGATGCT GGTCCCAAAG GAAACTATGC TCGGTTCATG 6060
AATCACTGCT GCCAGCCCAA CTGTGAAACA CAGAAGTGGT CTGTGAATGG AGATACCCGT 6120
GTAGGCCTTT TTGCACTAAG TGACATTAAA GCAGGCACTG AACTTACCTT CAACTACAAC 6180
CTAGAATGTC TTGGGAATGG AAAGACTGTT TGCAAATGTG GAGCCCCGAA CTGCAGTGGC 6240
TTCTTGGGTG TAAGGCCAAA GAACCAACCC ATTGCCACGG AAGAAAAGTC AAAGAAATTC 6300
AAGAAGAAGC AACAGGGAAA GCGCAGGACC CAGGGTGAAA TCACAAAGGA GCGAGAAGAT 6360
GAGTGTTTTA GTTGTGGGGA TGCTGGCCAG CTCGTCTCCT GCAAGAAACC AGGCTGCCCA 6420
AAAGTTTACC ATGCAGACTG TCTCAATCTG ACCAAGCGAC CAGCAGGGAA ATGGGAATGT 6480
CCGTGGCATC AGTGTGACAT CTGCGGGAAG GAAGCAGCCT CCTTCTGTGA GATGTGCCCC 6540
AGCTCCTTTT GTAAGCAGCA TCGAGAAGGG ATGCTTTTCA TTTCCAAACT GGATGGGCGT 6600
CTGTCTTGTA CTGAGCATGA CCCCTGTGGG CCCAATCCTC TGGAACCTGG GGAGATCCGT 6660
GAGTATGTGC CTCCCCCAGT ACCGCTGCCT CCAGGGCCAA GCACTCACCT GGCAGAGCAA 6720
TCAACAGGAA TGGCTGCTCA GGCACCCAAA ATGTCAGATA AACCTCCTGC TGACACCAAC 6780
CAGACGCTGT CGCTCTCCAA AAAAGCTCTG GCAGGGACTT GTCAGAGGCC ACTGCTACCT 6840
GAAAGACCTC TTGAGAGAAC TGACTCCAGG CCCCAGCCTT TAGATAAGGT CAGAGACCTC 6900
GCTGGGTCAG GGACCAAATC CCAATCCTTG GTTTCCAGCC AGAGGCCACT GGACAGGCCA 6960
CCAGCAGTGG CAGGACCAAG ACCCCAGCTA AGCGACAAAC CCTCTCCAGT GACCAGCCCA 7020
AGCTCCTCAC CCTCAGTCAG GTCCCAACCA CTGGAAAGAC CTCTGGGGAC GGCTGACCCA 7080
AGGCTGGATA AATCCATAGG TGCTGCCAGC CCAAGGCCCC AGTCACTGGA GAAAACCCCA 7140
GTTCCCACTG GCCTGAGACT TCCGCCGCCA GACAGACTGC TCATTACTAG CAGTCCCAAA 7200
CCCCAGACTT CAGACAGGCC TACTGACAAA CCCCATGCCT CTTTGTCCCA GAGACTCCCA 7260
CCTCCTGAGA AAGTACTATC AGCTGTGGTC CAGACCCTTG TAGCTAAAGA AAAAGCACTG 7320
AGGCCTGTGG ACCAGAATAC TCAGTCAAAA AATAGAGCTG CTTTGGTCAT GGATCTCATA 7380
GACCTAACTC CTCGCCAGAA GGAGCGGGCA GCTTCACCTC ATGAGGTCAC ACCACAGGCT 7440
GATGAGAAGA TGCCAGTGTT GGAGTCAAGT TCATGGCCTG CCAGCAAAGG TCTGGGGCAT 7500
ATGCCGAGAG CTGTTGAGAA AGGCTGTGTG TCAGATCCTC TTCAGACATC TGGGAAAGCA 7560
GCAGCCCCTT CAGAGGACCC CTGGCAAGCT GTTAAATCAC TCACCCAGGC CAGACTTCTT 7620
TCTCAGCCTC CTGCCAAGGC CTTTTTATAT GAGCCAACAA CTCAGGCCTC AGGAAGAGCT 7680
TCTGCAGGGG CTGAGCAGAC CCCAGGGCCT CTTAGCCAAT CCCTGGGCCT GGTGAAGCAG 7740
GCGAAGCAGA TGGTCGGAGG CCAGCAACTA CCTGCACTTG CCGCCAAGAG TGGGCAATCT 7800
TTTAGGTCTC TCGGGAAGGC CCCAGCCTCC CTCCCCACTG AAGAAAAGAA GTTGGTAACC 7860
ACAGAGCAAA GTCCCTGGGC CCTGGGAAAA GCCTCATCAC GGGCAGGGCT CTGGCCCATA 7920
GTGGCTGGAC AGACACTGGC ACAGTCTTGC TGGTCTGCTG GGAGCACACA GACATTGGCA 7980
CAGACTTGCT GGTCTCTTGG AAGAGGGCAA GACCCCAAAC CAGAGCAAAA TACACTTCCA 8040
GCTCTTAACC AGGCTCCTTC CAGTCACAAG TGTGCAGAAT CAGAACAGAA GTAGTACCAA 8100
TCAATGTCAC ATGAACAAAC AAGCTGCCCC CAGGGTACCA TTTGGGGAGG GGAAATCTTT 8160
TCTTTCTTTC CCCCTTAAAA AAAAAACACA TCTGCCCCGA ACACTTTCCC ACTGTTATTC 8220
TTTCCTCATA TCCCAACACT CAGAACTCTT GTGACATTAG CCAGTGGGGG CTTATGGTTG 8280
TGTGAACCAT GTATGAAAAT CCAGTGGGCC CCAACCAAGG AGACAGACAG ACTTGGGTCT 8340
CTTTCCCCCA ACTTTTCCAC ATGGTCATCG TGAAATAAAA AGTCCACTCT GGAGTC 8397
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 100 0.0 5233
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 98 0.0 5154
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 96 0.0 5046
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 92 0.0 4824
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 92 0.0 4822
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 91 0.0 4814
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 91 0.0 4776
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 91 0.0 4752
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 91 0.0 4739
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 90 0.0 4671
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 99 0.0 4657
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 98 0.0 4646
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 98 0.0 4608
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 88 0.0 4586
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 98 0.0 4522
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 84 0.0 4331
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 90 0.0 4232
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 90 0.0 4204
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 91 0.0 4202
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 96 0.0 4192
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 91 0.0 4178
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 90 0.0 4058
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 90 0.0 4058
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 91 0.0 3884
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 88 0.0 3837
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 83 0.0 3819
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3542
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3107
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 89 0.0 2727
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 77 0.0 2684
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 88 0.0 2622
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 85 0.0 2335
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 58 0.0 2295
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 57 0.0 2279
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 58 0.0 2269
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 57 0.0 2137
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 60 0.0 2099
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 80 0.0 1867
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 51 0.0 1790
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 91 0.0 1658
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 71 0.0 1628
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 57 0.0 1490
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 88 0.0 1425
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 82 0.0 1364
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 76 0.0 1347
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1284
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1243
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 76 0.0 1202
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 75 0.0 1202
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1137
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 81 0.0 1077
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 78 0.0 1077
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 64 0.0 1069
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 83 0.0 1054
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 77 0.0 1052
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 65 0.0 998
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 973
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 961
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 63 0.0 951
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 61 0.0 939
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 60 0.0 932
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 58 0.0 914
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 84 0.0 903
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 650
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 1e-121 436
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 1e-55 217
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 3e-50 199
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 7e-50 198
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 9e-50 197
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 42 3e-49 196
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 4e-49 196
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 7e-49 195
Created Date 25-Jun-2016