WERAM Information


Tag Content
WERAM ID WERAM-Caf-0170
Ensembl Protein ID ENSCAFP00000024244.3
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSCAFG00000016473.4 ENSCAFT00000026110.3 ENSCAFP00000024244.3
ENSCAFG00000016473.4 ENSCAFT00000045272.2 ENSCAFP00000037320.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.90e-52 176.7 1944 2060
Me_Reader PWWP 1.10e-32 112.2 323 1818
HMT SET1 2.50e-29 102.1 1944 2060
Me_Reader PHD 1.90e-19 69.9 1545 2164
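The Classification rows above are whitespace-separated, with six fields per hit. A minimal Python sketch for reading them into records follows; the `DomainHit` class and `parse_hit` function are illustrative names and are not part of WERAM. For PWWP and PHD, the Start and End values appear to span from the first to the last individual match listed in the Domain Profile, so the length computed here is a domain length only for single-match families such as SET2.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One Classification row (hypothetical container, not a WERAM type)."""
    type: str      # functional class, e.g. "HMT" or "Me_Reader"
    family: str    # domain family, e.g. "SET2"
    evalue: float
    score: float
    start: int     # first residue covered (1-based)
    end: int       # last residue covered (inclusive)


def parse_hit(line: str) -> DomainHit:
    """Split a row into its six fields and convert the numeric ones."""
    type_, family, evalue, score, start, end = line.split()
    return DomainHit(type_, family, float(evalue), float(score), int(start), int(end))


rows = """\
HMT SET2 1.90e-52 176.7 1944 2060
Me_Reader PWWP 1.10e-32 112.2 323 1818
HMT SET1 2.50e-29 102.1 1944 2060
Me_Reader PHD 1.90e-19 69.9 1545 2164
"""

hits = [parse_hit(line) for line in rows.splitlines()]

# The most significant hit is the one with the smallest E-value.
best = min(hits, key=lambda h: h.evalue)
print(best.family, best.end - best.start + 1)  # SET2 117
```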
Organism Canis familiaris
Domain Profile
  HMT SET2

SET2.txt 2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSCAFP00000024244.3 1944 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2030
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSCAFP00000024244.3 2031 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2060
*****************************8 PP

  Me_Reader PWWP

PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSCAFP00000024244.3 323 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387
69********************98766665466888999******************99998775 PP
PWWP.txt 12 YpwWPalvisppleakklktqeaeenk 38
Y +Pa +++ ++k+l t+++ +++
ENSCAFP00000024244.3 686 YTRYPATNTKVKAKQKSLITNSHTDHL 712
899****99999999999888888775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSCAFP00000024244.3 1758 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1818
589*****************************************.***************87 PP

  HMT SET1

SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSCAFP00000024244.3 1944 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2028
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSCAFP00000024244.3 2029 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2060
*******************************7 PP

  Me_Reader PHD

PHD.txt 2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSCAFP00000024244.3 1545 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1589
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSCAFP00000024244.3 1592 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1639
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSCAFP00000024244.3 1640 ICITCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1693
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSCAFP00000024244.3 1710 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1751
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSCAFP00000024244.3 2122 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2164
9999..33442..9********************..*****.*****886 PP

Protein Sequence
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STASGTSQNA 60
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQSP VVCTSLSPGG 120
PTALAMKQEP SCNNSPELQV KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI 180
EEIFEETQTN AICNYEPKSE NGVDVAMGNE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ 240
RNEVDGSNEK AALLPAPFAL GDTNVTIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP 360
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420
WEASVGLAEQ YDVPKGSKNR RCVSSSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA 480
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS MKKGHMQFEA HKEERRGKIP ENLGLSFISG 540
DVSDKQASNE LSRIANSLTG SNTAPGSFLF SSCGKNTAKK EFETSNCDSL LGLSEGALIS 600
KHSEEKKKLQ RGLMCSSKVQ LCYIGAGDEE KRSDSISICT TSDDGSSDLD PIDNSSESDN 660
SVLEITDAFD RTENMLSVQK NEKVKYTRYP ATNTKVKAKQ KSLITNSHTD HLMDCAKTVE 720
PGTETSQVNL SDLKVSAVVR KPQSDFRNDS FSPKFNTPSS ISSENSLIKS GATNQALLHA 780
KSKQPKIRSI KCKHKENPVV VEPPVTNEDC SLKCCSSDTK GSPLASISKS GKVDGLKLLS 840
NMHEKTRDSN DIETAVVKHV LSELKELSYR SLSEDVSDSG TSKPSKPLLF SSASGQNHIP 900
IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLVSYRSPG LGDCSSSSPV SASKVLVSGS 960
STHSSEKNGD GTQKSVRPSP SGGDSALSGE LSVPVPGLVS DRRDVPASSK SHSDCVTRRN 1020
CGRSKPSSKL RDSFAAQMGR NTLNRKALKT ERKRKLSRLP AVTLEAALQG DRVSGDSENG 1080
SSRGGLEDSG KEEPLQLMGH LTSEDSAHFS SVHFDNKVNH PDPDKIPEKG PSFENRKGPE 1140
LDNEMNSEND EPSGVNQAVP KKRWQRLNQR RTKPRKRTNR FREKENSEGA FGVLLPSDPV 1200
KKGDEFPEHR PPTSTNVIED TLADPNHTSC LDSIGPRLNV CDKSSASVEE MEKEPGIPSL 1260
TPQPELPEPA VRSEKKRLRK PSKWLLEYTE EYDQIFAPKK KQKKVQEQVH KVSSRCEEES 1320
LLARCRSSAQ NKQVDENSLI STKEEPPVLE REAPFLEGPL AQSELGGGHA ELPQLTLSVP 1380
VAPEVSPRPI LESEELLVKT PGNYESKRQR KPTKKLLESN DLDPGFMPKK GDLGLSKKCY 1440
EAGHLENDSE SRAASREYGG GAAKIFDKPR KRKRQRHATA KVHCKKMKND DSSKETPGSE 1500
GELMTHRTAA SPKETVEESV ENDHGMPASK KLQGERGGGA ALKENVCQNC EKLGELLLCE 1560
AQCCGAFHLE CLGLTEMPRG KFICNECRTG IHTCFVCKQS GEDVKRCLLP LCGKFYHEEC 1620
VQKYPPTVMQ NKGFRCSLHI CITCHAANPA SVSASKGRLM RCVRCPVAYH ANDFCLAAGS 1680
KILASNSIIC PNHFTPRRGC RNHEHVNVSW CFVCSEGGSL LCCDSCPAAF HRECLNIDIP 1740
EGNWYCNDCK AGKKPHYREI VWVKVGRYRW WPAEICHPRA VPSNIDKMRH DVGEFPVLFF 1800
GSNDYLWTHQ ARVFPYMEGD VSSKDKMGKG VDGTYKKALQ EAAARFEELK AQKELRQLQE 1860
DRKNDKKPPP YKHIKVNRPI GRVQIFTADL SEIPRCNCKA TDDNPCGIDS ECINRMLLYE 1920
CHPTVCPAGG RCQNQCFTKR QYPEVEIFRT LQRGWGLRTK TDIKKGEFVN EYVGELIDEE 1980
ECRARIRYAQ EHDITNFYML TLDKDRIIDA GPKGNYARFM NHCCQPNCET QKWSVNGDTR 2040
VGLFALSDIK AGTELTFNYN LECLGNGKTV CKCGAPNCSG FLGVRPKNQP IATEEKSKKF 2100
KKKQQGKRRT QGEITKERED ECFSCGDAGQ LVSCKKPGCP KVYHADCLNL TKRPAGKWEC 2160
PWHQCDICGK EAASFCEMCP SSFCKQHREG MLFISKLDGR LSCTEHDPCG PNPLEPGEIR 2220
EYVPPPVPLP PGPGTHLAEH SSGVAAQGPK MLDKLPADTN QTLSLSKKAL AGTCQRPLLP 2280
ERPPDRTDSR PQPVDRVRDL AGSGTKPQSL VSSQKPLDRP PAVAGPRPLL SDKPSPVTGI 2340
SSSPSVRSQS LERPLGTADP RLDKSIGAAS PRPQSLEKTP GPPGLRLPPP DRLLVTSSPK 2400
PQTSDRPPDK SHASLSQRLP PPDKVLSAVV QTLVAKEKAL RPVDQNTQSK NRAALVMDLI 2460
DLTPHQKERA ASPHEVTPQV DEKMPVLESS SWTASKGLGQ MPRAVERGSM SDPVLQPPGK 2520
TAVPSEHPWQ AVKSLTQARL LSQPSAKAFL YEPTTQASGR APAGVEQIPG PPSQAPGLVK 2580
QVKQMTGGQQ LPGLAAKSGQ SFRPLGKAPS TLCTEEKKLA TAEQSPWALG KSSPGPGLWP 2640
MVAGQTISPS CWSSGNTQTL AQTCWSLGRG QDPKPEQNTL PALNQAPSSH KCAESEQK 2698
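The sequence above is printed in blocks of ten residues, with a running position counter at the end of each line. A small Python sketch, assuming that layout, shows one way to recover the contiguous sequence and to check the counters; `join_blocks` and `counters_match` are hypothetical helper names, and only the first two lines of the record are used here.

```python
def join_blocks(lines):
    """Concatenate the residue blocks, dropping spaces and the trailing counter."""
    residues = []
    for line in lines:
        # Residue blocks are purely alphabetic; the counter is the only numeric token.
        residues.extend(tok for tok in line.split() if tok.isalpha())
    return "".join(residues)


def counters_match(lines):
    """Return True if every trailing counter equals the cumulative residue count."""
    total = 0
    for line in lines:
        *blocks, counter = line.split()
        total += sum(len(block) for block in blocks)
        if total != int(counter):
            return False
    return True


first_two_lines = [
    "MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STASGTSQNA 60",
    "YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQSP VVCTSLSPGG 120",
]

seq = join_blocks(first_two_lines)
counters_ok = counters_match(first_two_lines)
print(len(seq), counters_ok)  # 120 True
```

The same helpers should apply to the nucleotide sequence, since it uses the same block layout.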
Nucleotide Sequence
ATGGATCAGA CCTGTGAACT ACCTAGAAGA AATTGTCTGC TGCCCTTTTC CAATCCAGTG 60
AATTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CAATTTTTCT 120
GAGCCACTTA ATGGGTGTAC TATGCAGTTA TCGACTGCCA GTGGAACATC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTGTTACATT CCACTGCGGA GACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAGTATTT AAATGGGTCT GCTGATGGAT CAGAATCCTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGTCGCCA GTTGTTTGCA CTTCCTTGAG TCCTGGTGGT 360
CCAACAGCAC TTGCTATGAA ACAGGAACCC TCTTGTAATA ACTCCCCCGA ACTCCAGGTA 420
AAAGTAACAA AGACTATCAA GAATGGCTTT CTGCACTTTG AGAATTTTAC TTGTGTGGAC 480
GATGCAGATG TAGATTCTGA AATGGACCCA GAACAGCCAG TCACAGAGGA TGAGAGTATA 540
GAGGAGATCT TTGAGGAAAC TCAGACCAAT GCCATCTGCA ATTATGAGCC TAAATCAGAG 600
AATGGTGTAG ACGTGGCCAT GGGAAATGAA CAAGACAGCA CACCAGAGAG TAGACATGGT 660
GCAGTCAAAT CGCCATTCTT GCCATTAGCT CCTCAAACTG AAACACAGAA AAATAAGCAA 720
AGAAATGAAG TGGACGGCAG CAATGAAAAA GCAGCCCTTC TCCCAGCCCC CTTTGCACTA 780
GGAGATACAA ACGTTACCAT AGAAGAGCAA TTAAACTCAA TAAATTTATC TTTTCAGGAT 840
GATCCAGACT CCAGTACCAG TACATTAGGA AACATGCTAG AATTACCTGG AACTTCATCA 900
TCATCTACTT CACAGGAATT GCCATTTTGT CAACCCAAGA AAAAGTCTAC GCCACTGAAG 960
TATGAAGTTG GAGATCTCAT CTGGGCAAAA TTCAAGAGAC GCCCATGGTG GCCCTGCAGG 1020
ATTTGTTCTG ATCCGTTGAT TAATACACAT TCAAAAATGA AAGTTTCAAA CCGGAGGCCC 1080
TATCGACAAT ACTATGTGGA GGCTTTTGGG GACCCTTCAG AGAGAGCCTG GGTGGCTGGA 1140
AAAGCAATCG TCATGTTCGA AGGCAGACAT CAATTTGAAG AGCTACCTGT TCTTAGGAGA 1200
AGAGGGAAGC AGAAAGAAAA AGGATATAGG CACAAGGTTC CTCAGAAAAT TTTGAGTAAA 1260
TGGGAAGCCA GTGTTGGTCT TGCTGAACAG TATGATGTTC CCAAAGGGTC GAAGAACCGA 1320
AGATGTGTCA GCAGTTCAAT CAAGTTGGAC AGTGAGGAGG ATATGCCATT TGAGGATTGT 1380
ACAAATGATC CTGAATCAGA ACATGACCTA TTACTTAATG GCTGCTTGAA ATCTCTGGCT 1440
TTTGACTCTG AACATTCTGC AGATGAAAAG GAAAAGCCTT GTGCTAAGTC TCGAGCCAGA 1500
AAGAGTTCTG ATAATCCAAA AAGGACTAGT ATGAAAAAGG GCCACATGCA ATTTGAAGCA 1560
CATAAGGAAG AACGAAGGGG AAAGATTCCA GAGAACCTTG GCCTAAGCTT TATTTCTGGG 1620
GATGTATCTG ATAAGCAGGC CTCTAATGAA CTTTCCAGGA TAGCAAATAG CCTCACAGGG 1680
TCCAACACTG CCCCAGGAAG TTTCCTATTT TCTTCTTGTG GAAAAAACAC AGCAAAGAAG 1740
GAATTTGAGA CTTCAAATTG TGACTCTTTA CTGGGCTTGT CTGAGGGTGC CTTGATCTCT 1800
AAACATTCTG AGGAAAAGAA GAAACTCCAA CGAGGTTTGA TGTGTAGTTC AAAAGTACAG 1860
CTCTGTTATA TTGGAGCAGG TGATGAAGAA AAGCGAAGTG ATTCCATTAG TATCTGTACC 1920
ACCTCTGATG ATGGAAGCAG TGATCTGGAT CCCATAGATA ATAGTTCAGA GTCTGATAAC 1980
AGTGTCCTTG AAATTACAGA TGCTTTTGAT AGAACAGAGA ACATGTTATC TGTTCAGAAA 2040
AATGAAAAGG TAAAGTATAC TAGGTATCCT GCCACAAACA CTAAGGTAAA AGCAAAGCAG 2100
AAGTCCCTCA TTACTAACTC ACATACAGAC CACTTAATGG ATTGTGCTAA GACAGTAGAG 2160
CCTGGAACTG AAACATCTCA GGTTAATCTC TCTGATCTTA AAGTATCTGC TGTTGTTCGC 2220
AAACCCCAGT CAGATTTTAG AAATGATAGT TTCTCTCCAA AATTCAACAC ACCATCAAGC 2280
ATTTCCAGTG AGAACTCACT AATAAAAAGT GGGGCTACAA ATCAAGCTCT GTTACATGCA 2340
AAAAGCAAAC AGCCCAAGAT CCGAAGTATA AAGTGCAAAC ATAAAGAAAA TCCCGTTGTA 2400
GTGGAACCTC CAGTTACAAA TGAGGACTGC AGTTTGAAAT GCTGCTCTTC TGATACCAAA 2460
GGCTCTCCTT TGGCCAGCAT ATCCAAAAGT GGGAAAGTGG ATGGGCTAAA ACTACTGAGC 2520
AACATGCATG AGAAAACCAG GGATTCAAAT GACATAGAAA CAGCAGTGGT GAAACATGTT 2580
CTGTCAGAGT TGAAGGAACT CTCTTATAGA TCCTTAAGTG AAGATGTGAG TGACTCGGGA 2640
ACATCAAAGC CGTCAAAACC ATTACTTTTT TCTTCTGCCT CTGGTCAGAA TCATATACCT 2700
ATTGAACCAG ACTATAAATT CAGTACATTG CTAATGATGT TGAAAGATAT GCATGATAGT 2760
AAGACCAAGG AGCAACGGTT AATGACTGCT CAAAATTTGG TCTCGTATCG GAGTCCTGGT 2820
CTTGGAGACT GTTCTTCGAG TAGTCCTGTA AGTGCATCTA AGGTCTTGGT TTCAGGCAGC 2880
TCCACTCACA GTTCAGAAAA AAATGGAGAT GGCACTCAGA AATCAGTACG TCCTAGCCCT 2940
AGTGGGGGTG ACTCTGCACT GTCTGGAGAA TTATCTGTAC CTGTACCTGG CTTAGTGTCT 3000
GACAGAAGAG ATGTCCCTGC TTCTAGTAAA AGTCATTCAG ACTGTGTTAC TAGGCGCAAC 3060
TGTGGGCGTT CAAAACCATC ATCTAAATTG CGAGATAGTT TTGCAGCCCA GATGGGAAGG 3120
AACACATTGA ACCGTAAAGC CTTAAAGACA GAGCGCAAAA GAAAACTCAG CCGACTTCCA 3180
GCTGTGACTC TTGAGGCTGC CCTGCAGGGA GACAGAGTAA GTGGAGATTC AGAGAATGGT 3240
TCCTCAAGAG GTGGGCTAGA AGACTCTGGT AAAGAAGAAC CCCTTCAGTT AATGGGCCAT 3300
TTAACAAGTG AAGACAGTGC CCATTTTTCC AGTGTTCATT TTGATAACAA AGTCAACCAC 3360
CCTGACCCTG ATAAAATTCC AGAGAAAGGT CCCTCTTTTG AAAACAGAAA AGGCCCAGAG 3420
TTGGATAATG AAATGAACAG TGAGAATGAT GAACCCAGTG GTGTAAATCA AGCTGTGCCT 3480
AAAAAGCGGT GGCAGCGTTT AAACCAAAGG CGCACCAAAC CTCGTAAGCG CACTAACAGA 3540
TTTAGGGAGA AAGAAAACTC TGAGGGTGCC TTTGGGGTTT TGCTTCCATC TGACCCTGTA 3600
AAGAAGGGGG ATGAGTTCCC AGAGCATAGA CCCCCTACTT CAACAAATGT AATAGAAGAT 3660
ACACTGGCTG ATCCAAATCA TACCAGCTGC TTAGATTCAA TTGGACCACG GTTGAATGTT 3720
TGTGATAAAT CCAGTGCCAG CGTTGAAGAG ATGGAAAAAG AGCCGGGAAT TCCCAGTTTG 3780
ACACCCCAAC CTGAGCTCCC TGAACCAGCT GTGCGGTCAG AGAAGAAACG CCTTAGGAAG 3840
CCAAGCAAGT GGCTTCTGGA ATATACAGAA GAATATGATC AGATATTTGC TCCTAAGAAA 3900
AAACAAAAGA AGGTACAGGA ACAGGTGCAC AAGGTAAGTT CCCGCTGTGA AGAGGAAAGC 3960
CTTTTAGCCC GATGTCGATC TAGTGCTCAG AACAAGCAGG TGGATGAGAA TTCTTTGATT 4020
TCAACCAAAG AAGAGCCTCC AGTTCTTGAA AGGGAGGCTC CATTTTTGGA AGGGCCCTTG 4080
GCTCAGTCAG AACTTGGAGG TGGACATGCT GAGTTACCAC AGCTGACCTT GTCTGTGCCT 4140
GTGGCTCCCG AAGTGTCTCC ACGGCCTATC CTTGAGTCTG AGGAGTTACT AGTTAAAACA 4200
CCAGGAAATT ATGAAAGTAA ACGTCAAAGA AAACCAACTA AGAAACTTCT TGAATCCAAT 4260
GATTTAGACC CGGGATTTAT GCCCAAGAAA GGGGATCTTG GTCTTTCTAA AAAGTGTTAT 4320
GAAGCTGGTC ATTTGGAGAA TGACAGTGAA TCACGTGCTG CATCTAGGGA GTACGGTGGA 4380
GGTGCTGCCA AGATATTTGA TAAACCAAGA AAACGAAAAC GACAGAGGCA TGCTACAGCC 4440
AAGGTGCATT GTAAAAAAAT GAAAAATGAC GACTCTTCAA AAGAAACTCC AGGCTCAGAG 4500
GGAGAACTGA TGACACACCG AACGGCTGCA AGCCCCAAGG AGACTGTTGA GGAGAGTGTA 4560
GAGAACGATC ATGGGATGCC GGCATCTAAA AAGCTGCAGG GTGAACGAGG CGGAGGAGCT 4620
GCACTCAAGG AGAATGTTTG TCAGAACTGT GAGAAACTGG GTGAGCTGCT GTTATGTGAG 4680
GCTCAGTGCT GTGGGGCTTT CCACCTGGAG TGCCTTGGAT TAACTGAAAT GCCAAGAGGA 4740
AAATTTATCT GCAATGAATG TCGCACAGGA ATCCATACCT GTTTTGTTTG TAAGCAGAGT 4800
GGGGAAGATG TTAAAAGGTG CCTTTTGCCC TTGTGTGGAA AGTTTTATCA TGAAGAGTGT 4860
GTCCAGAAGT ATCCACCCAC TGTCATGCAG AACAAGGGCT TCCGGTGCTC CCTCCACATC 4920
TGTATAACCT GCCATGCTGC TAATCCAGCC AGTGTTTCTG CATCTAAAGG TCGCCTGATG 4980
CGCTGTGTCC GCTGCCCTGT GGCATACCAT GCCAATGACT TTTGCCTGGC TGCCGGGTCC 5040
AAGATTCTTG CATCTAATAG TATCATCTGC CCTAATCACT TTACACCTAG GCGGGGCTGC 5100
CGAAACCATG AGCATGTTAA TGTTAGCTGG TGTTTTGTAT GCTCAGAAGG AGGCAGCCTT 5160
CTATGCTGTG ATTCCTGCCC TGCAGCTTTT CATCGTGAAT GCCTGAACAT TGATATCCCT 5220
GAAGGAAACT GGTATTGCAA TGATTGTAAG GCAGGCAAAA AGCCGCATTA CAGGGAAATT 5280
GTTTGGGTAA AAGTTGGGCG ATACAGGTGG TGGCCAGCTG AGATCTGCCA TCCTCGAGCT 5340
GTACCTTCCA ATATTGATAA GATGAGACAT GATGTGGGCG AGTTCCCTGT GCTTTTCTTT 5400
GGATCTAATG ACTATCTGTG GACGCATCAG GCCCGAGTCT TTCCCTATAT GGAGGGGGAT 5460
GTGAGCAGCA AGGATAAGAT GGGCAAAGGA GTAGATGGGA CATATAAAAA AGCTCTTCAG 5520
GAAGCTGCAG CAAGGTTTGA GGAGTTAAAG GCCCAAAAAG AGCTAAGACA GCTGCAGGAA 5580
GACCGAAAGA ATGACAAGAA ACCTCCGCCT TACAAACATA TAAAGGTGAA CCGTCCTATT 5640
GGCAGGGTAC AGATCTTCAC TGCAGACTTG TCTGAAATTC CCCGTTGCAA CTGTAAAGCT 5700
ACTGATGATA ATCCTTGTGG GATAGACTCT GAGTGCATCA ATCGCATGCT ACTATATGAG 5760
TGCCACCCCA CAGTATGTCC TGCTGGAGGA CGCTGCCAAA ACCAGTGTTT CACTAAGCGC 5820
CAGTATCCAG AGGTTGAAAT TTTCCGCACG TTACAGAGGG GTTGGGGTCT TCGGACTAAA 5880
ACAGATATTA AAAAGGGTGA ATTTGTGAAT GAGTATGTGG GTGAGCTAAT AGATGAAGAA 5940
GAGTGCAGAG CTCGAATCCG TTATGCCCAG GAACATGATA TCACTAATTT CTATATGCTT 6000
ACCCTAGACA AAGACCGGAT CATTGATGCT GGTCCCAAAG GAAACTATGC TCGGTTTATG 6060
AATCATTGCT GCCAGCCCAA CTGTGAAACA CAGAAATGGT CTGTGAATGG AGATACCCGT 6120
GTTGGCCTTT TTGCCCTGAG TGACATTAAA GCAGGCACTG AACTTACCTT CAACTACAAC 6180
CTAGAATGTC TTGGGAATGG AAAGACTGTT TGCAAGTGTG GAGCCCCAAA CTGCAGCGGG 6240
TTTTTGGGTG TAAGGCCAAA GAATCAGCCG ATTGCCACAG AAGAAAAGTC AAAGAAATTC 6300
AAGAAGAAGC AACAGGGGAA GCGCAGGACC CAGGGTGAAA TCACAAAGGA GAGAGAGGAT 6360
GAGTGTTTCA GCTGTGGGGA TGCTGGCCAG CTCGTCTCCT GTAAGAAGCC AGGCTGCCCA 6420
AAAGTTTACC ACGCTGACTG TCTAAATCTA ACCAAGCGAC CAGCAGGGAA ATGGGAGTGT 6480
CCTTGGCATC AGTGTGACAT CTGTGGGAAG GAAGCAGCCT CCTTCTGTGA GATGTGTCCC 6540
AGCTCCTTTT GTAAACAGCA TCGGGAAGGG ATGCTCTTCA TCTCCAAACT GGATGGGCGT 6600
TTGTCTTGTA CTGAGCATGA CCCCTGTGGG CCCAACCCTT TGGAACCTGG GGAGATCCGT 6660
GAGTATGTGC CTCCCCCAGT ACCGCTGCCT CCAGGCCCAG GCACTCACCT GGCAGAGCAT 6720
TCATCAGGAG TGGCTGCTCA GGGGCCCAAG ATGTTGGATA AGCTGCCTGC TGACACCAAC 6780
CAGACACTGT CACTGTCCAA AAAAGCTCTG GCAGGAACTT GTCAGAGGCC ACTGCTGCCT 6840
GAAAGACCTC CTGATAGAAC TGACTCCAGG CCCCAGCCTG TAGATAGGGT CAGGGACCTT 6900
GCTGGGTCGG GGACCAAACC CCAATCCTTG GTATCCAGCC AGAAGCCATT GGACAGGCCA 6960
CCTGCAGTGG CAGGACCAAG ACCCCTACTA TCTGACAAAC CCTCTCCAGT GACCGGTATA 7020
AGCTCCTCAC CCTCAGTCAG GTCTCAATCA CTGGAAAGAC CTCTGGGGAC AGCTGACCCA 7080
AGGCTGGATA AATCCATAGG TGCTGCCAGC CCAAGGCCCC AGTCACTGGA GAAAACCCCA 7140
GGCCCTCCTG GCCTGAGACT TCCACCGCCA GACAGACTGC TTGTCACCAG CAGTCCCAAA 7200
CCCCAGACTT CAGACAGGCC CCCAGACAAA TCCCATGCTT CTTTGTCCCA GAGACTCCCA 7260
CCTCCTGACA AAGTACTTTC AGCTGTGGTC CAGACCCTGG TAGCTAAAGA AAAAGCACTG 7320
AGGCCCGTGG ACCAGAATAC TCAGTCAAAA AATAGAGCTG CTTTGGTGAT GGATCTCATA 7380
GACCTAACTC CTCACCAGAA AGAGCGGGCT GCTTCACCTC ATGAGGTCAC ACCACAGGTT 7440
GATGAGAAGA TGCCAGTGTT GGAGTCAAGC TCATGGACTG CCAGTAAAGG TCTGGGGCAG 7500
ATGCCACGAG CTGTAGAGAG AGGCAGTATG TCAGACCCTG TCCTTCAGCC ACCAGGGAAA 7560
ACAGCCGTCC CTTCGGAGCA CCCCTGGCAA GCGGTTAAAT CACTCACCCA GGCCAGACTT 7620
CTCTCTCAGC CTTCTGCCAA GGCTTTTTTA TATGAGCCAA CAACTCAGGC CTCAGGAAGA 7680
GCACCTGCAG GGGTGGAGCA GATACCAGGG CCTCCCAGCC AAGCACCAGG CCTGGTGAAG 7740
CAGGTGAAGC AGATGACCGG AGGCCAGCAA CTACCTGGAC TTGCTGCCAA GAGTGGGCAG 7800
TCCTTCAGGC CTCTTGGGAA GGCCCCATCC ACCCTCTGTA CTGAAGAGAA GAAGTTGGCA 7860
ACTGCAGAGC AGAGTCCCTG GGCCCTGGGA AAGTCCTCAC CAGGGCCAGG GCTCTGGCCC 7920
ATGGTGGCTG GACAGACAAT ATCGCCGTCT TGCTGGTCCT CTGGGAACAC ACAGACATTG 7980
GCACAGACTT GCTGGTCTCT TGGAAGAGGG CAAGACCCTA AACCAGAGCA AAATACACTT 8040
CCAGCTCTTA ACCAGGCTCC TTCCAGTCAC AAGTGTGCAG AGTCAGAACA GAAGTAA 8097
Sequence Source Ensembl
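As a consistency check between the two sequences, the final line of the nucleotide sequence can be translated and compared with the end of the protein sequence. The sketch below assumes the standard genetic code and that the line starts in frame, which follows from the preceding counter (8040) being a multiple of three. The position counter is left out of the input string, and `translate` is an illustrative helper rather than a WERAM or Ensembl function.

```python
# Standard genetic code, with codons ordered by base in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds):
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds) - 2, 3))


# Final line of the nucleotide sequence, without its position counter.
last_line = "CCAGCTCTTA ACCAGGCTCC TTCCAGTCAC AAGTGTGCAG AGTCAGAACA GAAGTAA"
tail = "".join(last_line.split())
tail_protein = translate(tail)
print(len(tail), tail_protein)  # 57 PALNQAPSSHKCAESEQK*
```

The translated tail matches the last 18 residues of the protein sequence (PALNQAPSSH KCAESEQK), followed by the stop codon.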
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 96 0.0 5084
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 96 0.0 5077
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 96 0.0 5031
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 93 0.0 4903
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 91 0.0 4811
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 91 0.0 4772
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 91 0.0 4767
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 90 0.0 4751
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 91 0.0 4741
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 89 0.0 4638
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 88 0.0 4635
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 92 0.0 4355
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 92 0.0 4323
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 83 0.0 4304
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 92 0.0 4254
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 90 0.0 4217
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 90 0.0 4217
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 90 0.0 4209
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 89 0.0 4159
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 90 0.0 4076
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 89 0.0 4055
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 88 0.0 4018
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 90 0.0 3965
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 90 0.0 3892
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 89 0.0 3817
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 83 0.0 3813
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3595
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3141
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 77 0.0 2729
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 90 0.0 2703
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 88 0.0 2701
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 84 0.0 2349
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 57 0.0 2287
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 57 0.0 2283
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 58 0.0 2280
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 57 0.0 2165
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 59 0.0 2092
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 81 0.0 1865
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 93 0.0 1670
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 70 0.0 1611
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 59 0.0 1501
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 57 0.0 1483
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 77 0.0 1422
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 85 0.0 1390
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 81 0.0 1332
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1280
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1249
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 76 0.0 1205
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1204
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1139
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 82 0.0 1094
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 64 0.0 1077
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 78 0.0 1076
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 76 0.0 1050
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 83 0.0 1029
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 66 0.0 996
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 969
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 958
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 63 0.0 952
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 59 0.0 940
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 62 0.0 933
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 916
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 83 0.0 912
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 649
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 4e-121 435
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 31 4e-55 215
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 5e-51 202
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 1e-50 200
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 2e-50 199
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 42 5e-50 198
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 42 2e-49 197
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 8e-49 194
WERAM-Viv-0116 VIT_18s0072g00220.t01 Vitis vinifera 47 6e-48 192
WERAM-Sol-0089 Solyc07g008460.2.1 Solanum lycopersicum 41 8e-48 191
WERAM-Sot-0073 PGSC0003DMT400059166 Solanum tuberosum 42 9e-48 191
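Because species names in the Orthology rows contain spaces (two or three words), splitting a row on whitespace alone does not give a fixed number of columns. One workable approach, sketched here in Python under the assumption that the two identifiers contain no spaces, is to take two fields from the left and three from the right, leaving the species name in the middle. `parse_ortholog` is a hypothetical helper name.

```python
def parse_ortholog(line):
    """Parse one Orthology row into a dict, keeping multi-word species names intact."""
    # Two space-free identifiers on the left ...
    weram_id, protein_id, rest = line.split(maxsplit=2)
    # ... and three numeric columns on the right; what remains is the species.
    species, identity, evalue, score = rest.rsplit(maxsplit=3)
    return {
        "weram_id": weram_id,
        "protein_id": protein_id,
        "species": species,
        "identity": int(identity),   # percent identity
        "evalue": float(evalue),
        "score": int(score),
    }


rec = parse_ortholog(
    "WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 96 0.0 5031"
)
fly = parse_ortholog(
    "WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 4e-121 435"
)
print(rec["species"], rec["identity"], fly["evalue"])
```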
Created Date 25-Jun-2016