WERAM Information


Tag Content
WERAM ID WERAM-Dan-0110
Ensembl Protein ID ENSDNOP00000013609.3
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSDNOG00000017547.3 ENSDNOT00000017546.3 ENSDNOP00000013609.3
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 2.10e-52 176.6 1953 2069
Me_Reader PWWP 1.60e-32 111.8 320 1827
HMT SET1 2.50e-29 102.2 1953 2069
Me_Reader PHD 3.80e-13 49.8 1601 2173
Organism Dasypus novemcinctus
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSDNOP00000013609.3 1953 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2039
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSDNOP00000013609.3 2040 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2069
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSDNOP00000013609.3 320 VGDLIWAKFKRRPWWPCKICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 384
69********************98766665466888999******************99998775 PP
PWWP.txt 12 YpwWPalvisppleakklktqeaeenk 38
Y+ +Pa +++ ++k + t+++ +++
ENSDNOP00000013609.3 683 YSRYPATNTRVKAKQKTFITNSHTDHL 709
999***999999999999888887775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSDNOP00000013609.3 1767 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1827
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSDNOP00000013609.3 1953 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2037
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSDNOP00000013609.3 2038 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2069
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50  
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSDNOP00000013609.3 1601 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVIQNKGFRCSLH 1648
58****777777...55899889**********977555555547999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSDNOP00000013609.3 1649 ICITCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1702
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSDNOP00000013609.3 1719 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1760
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSDNOP00000013609.3 2131 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2173
9999..33442..9********************..*****.*****886 PP

Protein Sequence
MDQTCELPRR NCLLPFSNPV NLDASEDKDS PFGNGQSNFS EPLNGCTMQL STASGTSQNA 60
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQGP EKSDSRAQSP IVCTSLSPGG 120
PTALAMKQEP SCNNSPELQV KVKTVKNGFL HIENFTCVDD ADVVSEMDPE QPVTEDESIE 180
EIFEETQTNA TCNYEPKSEN GVEVAMGKEQ DSTPESRHGA VKSPFLPLAS QTETQKNKER 240
NEVDGNNEKA LLPAPFSLGD TNITIEEQLN SINLSFQDDP DSSTTLGNML ELPGTSSSST 300
SQELPFCQPK KKSTPLKYEV GDLIWAKFKR RPWWPCKICS DPLINTHSKM KVSNRRPYRQ 360
YYVEAFGDPS ERAWVAGKAI VMFEGRHQFE ELPVLRRRGK QKEKGYRHKV PQKILSKWEA 420
SVGLAEQYDV PKGSKNRKCV RSSIKLDSEE DIPFEDCTND PESEHDLLLN GCLKSLAFDS 480
EHSADEKEKP CAKSRVRKSS DHPKRTSVKK GHIQFETHKE ERRGKIPENL GINFISGDVS 540
EKQASSELSR IANSLTGSNT TPGSFLYSSC GKNSAKKEFE TSSCDSLLGL PESALISKSS 600
GEKKKPQRGL VCSSKVQLCY IGAGDDEKRS DSISICTTSD DGSSDLDHID HSSESDNSVL 660
EITDAFDRTE NMLSMQKNEN IKYSRYPATN TRVKAKQKTF ITNSHTDHLL DCTKTAEPGT 720
ETSQVSLSDL KVSTVVRRPP LDFRNDGLSP QFTMPSSMSS NICSENSLIK CGTTNQALLH 780
SKSKQSKIRS IKCKHKENPV VVEAPITSED CSLKCCSSDI KGSPLASISK SGKVDGLKLL 840
SNMHEKTRDS SDIETAVVKH VLSELKELSY RSLSEDVSDS GTSKPSKPLL FTSASGQNHI 900
PIEPDYKFST LLMMLKDMHD SKTKEQRLMT AQNLVSYRSP GLGDCSTSNS IGASKILVSG 960
GSVQNSEKNV DGTQDSAHPI PSGGDSVPSG ESSASLPGLV SDRRDLSATG KSRSNCVARR 1020
NCGRSKPSAK LLDGFSAQMG KSTVNRKALK TERKRKLNQL PELTPEAALQ ADRESGGTMS 1080
GSSRDVGEEP GKEEPLPLIG HLTSEDCAHF SDGHVDSKVK QSDPANIPEK GASFEDRKGP 1140
ELASEMNSEN DEPNSVNQVV PKKRWQRLNQ RRTKPRKRTN RFREKENSEG AFGVLLPGDP 1200
VQKGDEFPEH RPPTSTNVLE DAVTDPNRTG CLDSVGPRLN VCDKSNASIE EMEKEPGIPN 1260
LTPQSELPEP VVRSEKKRLR KPSKWLLEYT EEYDQIFAPK KKQKKVQEQV HKVSSRCEEE 1320
SLLARCRSSA QNKHVDENSL MSAKEEPPVL EREAPFLEGP LAQSDLGSAH AELPQLTLSV 1380
PVPPEVSPRP ILESEELQVK TSGNYESKRQ RKPTKKLLES NDLDPGFMPK KGDLGLSKKC 1440
YEAGHLENDI AESCAGSHSK DFGEGTTKIF DKPRKRKRQR HAVAKVQSKK VRNDDPSKET 1500
PSSEGELLTH RTATSPKEAI EEGVEHDHGI PVSKKLQGER GGGAALKENV CQVARTIGLR 1560
HGPALCVALC MSQLVFHQEA PGIESWTSSM LSFSLCFLGI HTCFVCKQSG EDVKRCLLPL 1620
CGKFYHEECV QKYPPTVIQN KGFRCSLHIC ITCHAANPAS VSASKGRLMR CVRCPVAYHA 1680
NDFCLAAGSK ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH 1740
RECLNIDIPE GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD 1800
VGEFPVLFFG SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA 1860
QKELRQLQED RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE 1920
CINRMLLYEC HPTVCPAGGR CQNQCFTKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE 1980
YVGELIDEEE CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ 2040
KWSVNGDTRV GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNHPI 2100
ATEEKSKKFK KKQQGKRRTQ GEITKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT 2160
KRPAGKWECP WHQCDICGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP 2220
NPLEPGEIRE YVPPPVPLPP GPSTHVAEQS TGVAAQGPKM SDNSPADTSQ MTLLSKKALT 2280
GTCQRPQPPE RPLERTDSRP QPLDRVRDLA GSGTKPQSLV PTQKPLDRPA IVPGPRSQLS 2340
DKPSPVTGPS SSPSVRSQPL ERPLGMSGSR LDKSLGAASP RPQPLEKTPV PTGLRLPPPD 2400
RLLVTSGPKT QISDKPPDKC HASLSQRLPP PEKVLSAVVQ TLVAKEKALR PVDQNTQSKN 2460
RAALVMDLID LTPRQKERAA SPHEITPQAD EKMPVLESSS WPASKGLGQI PRAVERGSVS 2520
DPVLQSPGRA VASLEHPWQA VKSLAQARLL SQPPTKAFLY EPATQASGRA PVGGEHTPGP 2580
PSQAAGLVKQ MAGGQQLPGL AAKVTALSGQ PFRPLGNAPA TLPTEEKKLT TTEQSPWVLG 2640
KASPGPGLWP MVAGQTLTQS CWSSGNTQTL AQTCWSLGRG QDPKPEQNTV PALNQAPSSH 2700
KCAESEQK 2708
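The sequence display above groups residues in blocks of ten and ends each line with a running residue count. The sketch below shows one way such a display might be collapsed into a contiguous string while checking each count. It assumes that every line follows this layout and that counting starts at the first line supplied. The function name `join_grouped_sequence` is hypothetical and is not part of WERAM or Ensembl.

```python
import re


def join_grouped_sequence(lines):
    """Join whitespace-separated residue groups into one string.

    Assumes each non-empty line ends with a cumulative residue count,
    as in the record display; raises ValueError if a count disagrees.
    """
    pieces = []
    total = 0
    for line in lines:
        tokens = line.split()
        if not tokens:
            continue  # skip blank lines
        # Everything before the last token is sequence; the last token is the count.
        *groups, count = tokens
        chunk = "".join(groups)
        if not re.fullmatch(r"[A-Za-z*]+", chunk):
            raise ValueError(f"unexpected characters in line: {line!r}")
        total += len(chunk)
        if total != int(count):
            raise ValueError(
                f"running count {count} does not match {total} residues read"
            )
        pieces.append(chunk)
    return "".join(pieces)


# First two lines of the protein sequence from this record.
example = [
    "MDQTCELPRR NCLLPFSNPV NLDASEDKDS PFGNGQSNFS EPLNGCTMQL STASGTSQNA 60",
    "YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQGP EKSDSRAQSP IVCTSLSPGG 120",
]
sequence = join_grouped_sequence(example)
print(len(sequence))
```

The same function should work on the nucleotide display, since it uses the same grouping. The count check is a cheap way to notice lines lost or truncated during copying.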
Nucleotide Sequence
GTTGATGCCG GCCCAGGATG GATCAGACCT GTGAACTACC CAGAAGAAAT TGCCTGCTGC 60
CCTTTTCTAA TCCAGTGAAT TTAGATGCCT CTGAAGACAA GGACAGCCCT TTCGGTAATG 120
GTCAATCCAA TTTTTCTGAG CCACTTAATG GGTGTACTAT GCAGTTATCG ACTGCCAGTG 180
GAACATCCCA AAATGCTTAT GGACAAGATT CTCCATCTTG TTACATTCCA TTGCGGAGAC 240
TACAGGATTT GGCCTCCATG ATCAATGTAG AATATTTAAA TGGGTCTGCT GATGGCTCAG 300
AATCCTTTCA AGGCCCTGAA AAAAGTGATT CAAGAGCTCA GTCGCCAATT GTTTGCACTT 360
CCTTGAGTCC TGGTGGTCCA ACAGCACTTG CTATGAAACA GGAGCCCTCT TGTAATAACT 420
CCCCCGAACT CCAGGTAAAA GTAAAGACTG TCAAGAATGG CTTTCTGCAC ATTGAGAATT 480
TTACTTGTGT GGACGATGCA GATGTAGTTT CTGAAATGGA CCCAGAACAG CCAGTCACAG 540
AGGATGAGAG TATAGAGGAG ATCTTTGAGG AAACTCAGAC CAATGCCACC TGCAATTATG 600
AGCCTAAATC AGAGAATGGT GTAGAAGTGG CCATGGGAAA GGAACAAGAC AGCACACCAG 660
AGAGTAGACA CGGTGCAGTC AAATCGCCAT TCTTGCCTTT AGCTTCTCAA ACTGAAACAC 720
AGAAAAATAA GGAAAGAAAT GAAGTGGACG GCAACAATGA AAAAGCCCTT CTCCCAGCCC 780
CTTTTTCACT AGGAGATACA AACATTACCA TAGAAGAGCA ATTAAACTCA ATAAATTTAT 840
CTTTTCAGGA TGATCCAGAC TCCAGTACAA CATTAGGAAA CATGCTAGAA TTACCTGGAA 900
CTTCATCATC ATCTACTTCA CAGGAATTGC CATTTTGTCA ACCCAAGAAA AAATCTACGC 960
CGCTGAAGTA TGAAGTTGGA GATCTCATCT GGGCTAAATT CAAGAGACGC CCATGGTGGC 1020
CCTGCAAGAT TTGTTCTGAT CCATTGATTA ATACACACTC AAAAATGAAA GTTTCCAACC 1080
GGAGGCCCTA TCGACAGTAC TATGTGGAGG CTTTTGGAGA CCCTTCTGAA AGAGCCTGGG 1140
TGGCTGGAAA AGCAATCGTT ATGTTTGAAG GCAGACATCA GTTTGAAGAG CTACCTGTCC 1200
TTAGGAGAAG AGGAAAGCAG AAAGAAAAAG GCTACAGACA TAAGGTTCCT CAGAAAATTT 1260
TGAGTAAATG GGAAGCCAGT GTTGGTCTTG CTGAACAATA TGATGTTCCC AAAGGGTCAA 1320
AGAACCGAAA ATGTGTCAGA AGTTCAATCA AGTTGGACAG TGAGGAGGAT ATACCATTTG 1380
AGGACTGTAC CAATGATCCT GAATCAGAAC ATGATCTGTT ACTTAATGGT TGTTTGAAAT 1440
CTCTGGCTTT TGACTCTGAA CATTCTGCAG ATGAGAAGGA AAAGCCTTGT GCTAAGTCTC 1500
GAGTCAGAAA GAGCTCTGAT CATCCAAAAA GGACTAGTGT GAAAAAAGGC CACATTCAGT 1560
TTGAAACACA TAAGGAAGAA CGGAGGGGAA AGATTCCGGA GAACCTTGGC ATAAACTTTA 1620
TTTCTGGGGA TGTATCTGAA AAGCAGGCCT CTAGTGAACT TTCCAGAATA GCAAACAGCC 1680
TCACAGGGTC CAACACTACT CCAGGAAGTT TTTTGTATTC TTCTTGTGGA AAAAACTCTG 1740
CAAAGAAAGA ATTTGAGACT TCAAGTTGTG ACTCTTTATT GGGCTTGCCT GAGAGTGCCT 1800
TGATATCTAA AAGTTCTGGG GAGAAGAAGA AACCCCAACG AGGTCTGGTT TGCAGTTCAA 1860
AGGTACAGCT CTGCTATATT GGAGCAGGCG ATGATGAAAA GCGAAGTGAT TCCATCAGTA 1920
TCTGCACCAC TTCCGATGAT GGAAGCAGTG ATCTGGATCA TATAGATCAC AGCTCAGAGT 1980
CTGACAATAG TGTCCTTGAA ATTACGGATG CTTTTGATAG AACAGAGAAC ATGTTATCCA 2040
TGCAGAAAAA CGAAAATATC AAGTATTCTA GGTATCCTGC AACAAACACT AGGGTAAAAG 2100
CAAAGCAGAA GACCTTCATT ACTAACTCAC ATACAGACCA CTTATTAGAT TGTACTAAGA 2160
CAGCAGAGCC TGGAACTGAG ACATCTCAGG TTAGCCTCTC TGATCTTAAG GTGTCCACTG 2220
TTGTTCGCAG ACCCCCATTG GATTTTAGAA ACGATGGTCT CTCTCCACAG TTCACTATGC 2280
CATCAAGCAT GTCATCAAAC ATTTGTAGTG AGAACTCATT AATAAAGTGT GGGACAACGA 2340
ATCAAGCACT GTTACATTCA AAAAGCAAAC AGTCCAAGAT ACGAAGTATA AAGTGCAAAC 2400
ATAAAGAAAA TCCAGTTGTA GTAGAAGCCC CAATTACAAG TGAGGACTGC AGTTTGAAAT 2460
GCTGCTCTTC TGATATCAAA GGCTCTCCTT TGGCCAGCAT TTCCAAAAGT GGGAAAGTGG 2520
ATGGGCTTAA ACTACTGAGC AACATGCATG AGAAAACTAG GGATTCAAGT GACATAGAAA 2580
CAGCAGTGGT AAAACATGTT CTGTCAGAGT TGAAAGAACT CTCTTATAGA TCCTTAAGTG 2640
AGGATGTCAG TGACTCTGGA ACATCAAAGC CATCAAAACC ATTACTTTTT ACTTCTGCCT 2700
CTGGTCAGAA TCATATACCT ATTGAACCAG ACTACAAATT TAGCACCTTG TTAATGATGT 2760
TGAAAGATAT GCACGATAGT AAGACCAAGG AGCAACGATT GATGACTGCT CAAAACTTGG 2820
TCTCCTATCG GAGTCCTGGT CTTGGGGACT GTTCCACCAG TAATTCTATA GGGGCTTCTA 2880
AGATCTTGGT TTCAGGAGGC TCTGTCCAAA ATTCAGAAAA AAATGTAGAT GGTACTCAAG 2940
ACTCAGCCCA TCCTATCCCT AGTGGGGGTG ACTCTGTACC ATCTGGGGAG TCATCTGCCT 3000
CCTTACCTGG CTTGGTGTCA GACAGAAGAG ACCTCAGTGC TACTGGCAAA AGTCGTTCAA 3060
ACTGTGTTGC TAGACGCAAC TGTGGGCGAT CAAAACCATC AGCCAAATTG CTAGATGGCT 3120
TTTCAGCCCA GATGGGGAAG AGCACAGTGA ATCGTAAAGC CTTAAAAACA GAGCGCAAAA 3180
GAAAACTGAA CCAGCTTCCA GAATTGACTC CTGAGGCTGC ACTGCAGGCT GACAGAGAAA 3240
GTGGAGGTAC AATGAGTGGC TCTTCGAGGG ATGTGGGAGA AGAGCCTGGG AAAGAAGAAC 3300
CCCTTCCATT AATAGGCCAT TTAACAAGTG AAGACTGTGC CCATTTTTCT GATGGTCATG 3360
TTGATAGCAA GGTCAAACAG TCTGACCCTG CTAACATTCC TGAAAAAGGC GCCTCTTTTG 3420
AAGACAGAAA AGGCCCAGAG CTGGCCTCTG AAATGAATAG TGAGAATGAT GAACCTAATA 3480
GTGTAAATCA AGTGGTGCCT AAAAAGCGGT GGCAGCGTTT AAACCAAAGG CGCACTAAAC 3540
CTCGTAAGCG CACTAACAGA TTTAGAGAGA AAGAAAACTC TGAGGGTGCC TTTGGGGTCT 3600
TGCTTCCTGG TGACCCTGTG CAGAAGGGGG ATGAGTTCCC TGAGCATAGA CCTCCAACTT 3660
CAACAAATGT ACTAGAAGAT GCGGTGACAG ATCCAAATCG TACTGGCTGC TTAGATTCAG 3720
TTGGGCCACG GTTGAATGTT TGTGATAAAT CCAATGCCAG CATTGAGGAG ATGGAAAAGG 3780
AGCCAGGAAT TCCCAATTTG ACTCCCCAGT CCGAGCTCCC GGAACCAGTT GTGCGGTCAG 3840
AGAAGAAACG CCTTAGGAAG CCAAGCAAGT GGTTGCTAGA ATATACAGAA GAATATGATC 3900
AGATATTTGC TCCTAAGAAG AAACAAAAGA AAGTGCAGGA GCAGGTACAC AAGGTAAGTT 3960
CCCGCTGTGA AGAGGAAAGC CTTCTAGCCC GGTGTCGGTC CAGTGCTCAG AACAAGCATG 4020
TAGACGAGAA TTCTTTGATG TCAGCCAAAG AAGAGCCTCC TGTTCTTGAG AGGGAGGCTC 4080
CATTTCTGGA AGGACCCTTG GCTCAGTCAG ACCTTGGAAG TGCACATGCT GAATTGCCAC 4140
AGCTGACCTT ATCTGTGCCT GTGCCTCCGG AAGTCTCCCC ACGGCCTATC CTTGAGTCTG 4200
AGGAATTACA AGTTAAAACG TCAGGAAATT ATGAAAGTAA GCGTCAGAGA AAGCCAACTA 4260
AGAAACTTCT TGAATCCAAT GATTTAGATC CTGGATTTAT GCCCAAGAAG GGAGATCTGG 4320
GCCTTTCTAA AAAGTGTTAT GAAGCTGGTC ACTTGGAGAA TGACATTGCT GAATCATGTG 4380
CTGGTTCTCA TTCTAAAGAC TTTGGGGAAG GCACTACCAA GATATTTGAT AAACCAAGAA 4440
AGCGAAAACG ACAGAGACAT GCTGTGGCCA AGGTGCAATC TAAGAAAGTG AGAAATGATG 4500
ACCCATCGAA GGAGACTCCA AGTTCAGAGG GAGAACTGCT GACACACAGA ACGGCTACAA 4560
GCCCCAAGGA GGCTATTGAG GAGGGTGTGG AGCATGATCA TGGGATACCT GTATCTAAAA 4620
AACTGCAGGG TGAACGAGGT GGAGGAGCTG CACTCAAGGA GAATGTTTGC CAGGTAGCTA 4680
GGACTATAGG ACTACGGCAC GGACCAGCTC TCTGTGTGGC ACTCTGCATG AGCCAGCTTG 4740
TCTTTCACCA GGAGGCCCCA GGAATCGAAT CCTGGACCTC CTCTATGTTA TCATTTTCTC 4800
TCTGTTTCTT AGGAATCCAT ACCTGTTTTG TATGTAAGCA GAGTGGGGAA GATGTTAAAA 4860
GATGCCTTCT GCCTCTGTGC GGAAAGTTTT ACCACGAAGA ATGTGTCCAG AAGTACCCAC 4920
CCACTGTCAT ACAGAACAAG GGTTTCCGGT GCTCCCTCCA CATCTGTATA ACCTGCCATG 4980
CTGCTAATCC AGCCAGTGTT TCTGCATCTA AAGGTCGTCT GATGCGCTGT GTCCGCTGCC 5040
CTGTGGCATA CCATGCCAAT GACTTTTGCT TAGCTGCTGG GTCAAAGATT CTTGCGTCTA 5100
ACAGTATCAT CTGCCCTAAT CACTTTACCC CTCGGCGGGG CTGTCGAAAT CATGAGCATG 5160
TTAATGTTAG CTGGTGTTTT GTGTGCTCAG AAGGAGGCAG CCTTCTTTGC TGTGATTCTT 5220
GCCCTGCTGC TTTTCATCGT GAATGCCTGA ACATTGATAT CCCTGAAGGA AACTGGTATT 5280
GTAATGATTG TAAGGCAGGC AAAAAGCCAC ACTACAGGGA GATTGTGTGG GTAAAAGTAG 5340
GACGATACAG GTGGTGGCCA GCTGAGATCT GCCATCCTCG AGCTGTACCA TCCAACATCG 5400
ATAAGATGAG ACATGATGTG GGCGAGTTCC CTGTACTTTT CTTTGGATCT AATGACTACC 5460
TGTGGACCCA CCAGGCCAGA GTCTTCCCCT ACATGGAGGG GGATGTTAGC AGCAAGGATA 5520
AGATGGGCAA AGGTGTGGAT GGAACATATA AAAAAGCTCT TCAGGAAGCT GCAGCAAGGT 5580
TTGAGGAGTT AAAGGCCCAA AAAGAGCTAA GACAGCTTCA GGAAGACCGA AAGAATGACA 5640
AGAAACCACC TCCTTATAAA CATATAAAGG TGAACCGTCC CATTGGTAGG GTCCAGATCT 5700
TCACTGCAGA TTTGTCTGAA ATTCCCCGTT GCAACTGTAA AGCTACTGAT GAGAACCCCT 5760
GCGGGATAGA TTCTGAGTGC ATCAACCGCA TGCTGCTCTA TGAGTGCCAT CCCACTGTAT 5820
GTCCTGCCGG AGGGCGCTGC CAAAACCAGT GCTTCACCAA ACGCCAGTAT CCAGAGGTTG 5880
AAATTTTTCG CACATTACAG AGGGGCTGGG GTCTCCGGAC AAAAACAGAT ATTAAAAAGG 5940
GAGAATTTGT GAATGAGTAT GTGGGTGAGT TAATAGATGA AGAAGAGTGC AGAGCTCGAA 6000
TCCGTTATGC CCAAGAACAT GATATCACTA ATTTCTACAT GCTCACCCTA GACAAAGACC 6060
GGATCATTGA TGCTGGTCCC AAAGGAAACT ATGCTCGGTT CATGAATCAT TGCTGCCAGC 6120
CCAACTGTGA AACACAGAAG TGGTCTGTGA ATGGTGATAC CCGTGTTGGT CTTTTTGCCC 6180
TGAGTGACAT CAAAGCAGGC ACTGAACTTA CCTTCAACTA CAACCTAGAA TGTCTTGGGA 6240
ATGGAAAGAC TGTTTGCAAA TGTGGAGCCC CAAACTGCAG TGGCTTCCTG GGTGTAAGGC 6300
CAAAGAACCA TCCCATTGCC ACAGAAGAAA AGTCGAAGAA ATTCAAGAAG AAGCAGCAGG 6360
GAAAGCGCAG GACCCAGGGT GAAATCACAA AGGAGCGAGA GGATGAATGT TTCAGCTGTG 6420
GAGATGCCGG GCAGCTCGTC TCCTGCAAGA AGCCAGGCTG CCCAAAAGTC TACCATGCAG 6480
ACTGTCTCAA TCTAACCAAG CGGCCAGCAG GAAAGTGGGA GTGTCCTTGG CATCAGTGTG 6540
ACATCTGTGG GAAGGAAGCA GCCTCTTTCT GCGAGATGTG CCCCAGTTCC TTCTGTAAGC 6600
AGCATCGGGA AGGAATGCTT TTCATCTCGA AACTGGACGG GCGTCTGTCT TGTACTGAGC 6660
ATGATCCCTG TGGGCCCAAC CCTCTGGAAC CTGGGGAGAT CCGTGAGTAT GTGCCTCCCC 6720
CTGTACCGCT GCCTCCAGGC CCAAGCACAC ACGTGGCAGA GCAATCAACG GGAGTTGCTG 6780
CTCAGGGGCC CAAGATGTCG GATAACTCGC CTGCAGATAC CAGCCAGATG ACGTTGCTGT 6840
CCAAAAAAGC TCTGACAGGC ACTTGTCAGA GACCACAGCC TCCTGAAAGA CCTCTTGAGA 6900
GAACTGACTC CAGGCCCCAA CCTTTAGATA GGGTCAGGGA CCTTGCTGGG TCAGGGACCA 6960
AACCCCAGTC CTTGGTACCC ACCCAGAAGC CATTGGACAG GCCTGCTATT GTGCCAGGGC 7020
CAAGATCTCA GCTCTCTGAC AAGCCCTCTC CAGTCACCGG CCCAAGTTCC TCACCTTCAG 7080
TCAGGTCCCA ACCACTGGAA AGACCTCTGG GGATGTCTGG CTCAAGGCTG GATAAATCTC 7140
TAGGTGCTGC CAGCCCAAGG CCTCAGCCAT TGGAGAAAAC CCCAGTCCCC ACTGGCCTGA 7200
GACTTCCGCC GCCAGACAGA CTACTAGTCA CCAGCGGTCC CAAAACTCAA ATTTCAGACA 7260
AGCCCCCAGA CAAATGTCAT GCCTCTTTGT CCCAGAGACT CCCACCTCCT GAGAAAGTAC 7320
TGTCAGCTGT GGTTCAAACC CTTGTGGCTA AAGAAAAGGC ACTGAGGCCT GTGGACCAGA 7380
ATACTCAGTC AAAAAACAGA GCTGCTTTGG TTATGGATCT CATAGACCTA ACTCCTCGCC 7440
AGAAGGAGCG GGCAGCTTCT CCTCATGAGA TCACACCACA GGCTGATGAG AAGATGCCAG 7500
TGTTGGAGTC AAGCTCGTGG CCTGCCAGCA AAGGCCTGGG GCAGATACCC CGGGCTGTTG 7560
AGAGAGGCAG TGTGTCAGAT CCTGTCCTCC AGTCACCTGG GAGAGCAGTG GCCTCTTTGG 7620
AGCACCCCTG GCAAGCTGTT AAGTCACTTG CCCAGGCCAG ACTTCTATCT CAGCCCCCTA 7680
CCAAGGCTTT TTTATATGAG CCAGCAACTC AGGCCTCAGG AAGAGCACCT GTAGGGGGTG 7740
AGCATACCCC AGGGCCTCCC AGCCAGGCAG CAGGCCTGGT GAAGCAGATG GCTGGAGGCC 7800
AGCAACTACC TGGACTTGCT GCCAAAGTAA CAGCACTGAG TGGGCAGCCC TTCAGGCCTC 7860
TTGGGAATGC CCCAGCCACC CTCCCTACTG AGGAGAAGAA GTTGACAACC ACAGAGCAGA 7920
GTCCCTGGGT CCTGGGAAAG GCCTCACCGG GTCCAGGACT TTGGCCCATG GTGGCTGGAC 7980
AGACACTGAC ACAGTCTTGC TGGTCCTCTG GGAACACACA GACATTGGCA CAGACTTGCT 8040
GGTCTCTTGG AAGGGGGCAA GACCCTAAAC CAGAGCAAAA TACAGTCCCA GCTCTTAACC 8100
AGGCTCCTTC CAGTCACAAG TGTGCAGAGT CAGAACAGAA ATAA 8145
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 89 0.0 4600
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 89 0.0 4593
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 88 0.0 4578
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 88 0.0 4549
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 88 0.0 4541
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 88 0.0 4526
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 88 0.0 4511
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 87 0.0 4495
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 87 0.0 4487
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 86 0.0 4430
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 85 0.0 4382
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 80 0.0 4058
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 87 0.0 4034
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 88 0.0 4031
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 87 0.0 4002
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 86 0.0 3991
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 87 0.0 3985
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 87 0.0 3968
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 88 0.0 3949
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 85 0.0 3919
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 85 0.0 3845
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 85 0.0 3799
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 86 0.0 3703
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 84 0.0 3663
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 79 0.0 3580
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 85 0.0 3571
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 69 0.0 3440
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 67 0.0 3027
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 75 0.0 2581
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 84 0.0 2518
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 84 0.0 2479
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 55 0.0 2200
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 55 0.0 2195
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 56 0.0 2189
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 80 0.0 2137
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 55 0.0 2060
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 58 0.0 1998
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 80 0.0 1824
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 50 0.0 1701
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 90 0.0 1601
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 78 0.0 1531
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 79 0.0 1454
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 55 0.0 1414
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 80 0.0 1315
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 83 0.0 1304
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 86 0.0 1207
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 61 0.0 1183
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 73 0.0 1138
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 74 0.0 1135
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 78 0.0 1074
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 73 0.0 1072
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 65 0.0 1067
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 83 0.0 1061
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 76 0.0 1049
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 80 0.0 1047
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 957
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 64 0.0 940
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 62 0.0 919
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 61 0.0 914
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 56 0.0 890
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 81 0.0 884
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 61 0.0 883
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 54 0.0 869
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 650
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 42 2e-120 432
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 2e-50 200
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 5e-50 198
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 37 8e-50 198
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 8e-50 198
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 42 2e-49 196
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 4e-49 196
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 7e-49 195
Created Date 25-Jun-2016