WERAM Information


Tag Content
WERAM ID WERAM-Bot-0197
Ensembl Protein ID ENSBTAP00000034104.4
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSBTAG00000025426.4 ENSBTAT00000034204.4 ENSBTAP00000034104.4
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.70e-52 176.7 1946 2062
Me_Reader PWWP 6.40e-33 112.8 323 1820
HMT SET1 2.10e-29 102.1 1946 2062
Me_Reader PHD 3.60e-19 68.8 1547 2166
Organism Bos taurus
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSBTAP00000034104.4 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2032
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSBTAP00000034104.4 2033 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSBTAP00000034104.4 323 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387
69********************98766665466888999******************99998775 PP
PWWP.txt 12 YpwWPalvisppleakklktqeaeenk 38
Y+ +Pa +++ ++k+l t+++ +++
ENSBTAP00000034104.4 686 YSRYPATNTKVKAKQKSLITNSHTDHL 712
999***********9999888887775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSBTAP00000034104.4 1760 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1820
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSBTAP00000034104.4 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2030
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSBTAP00000034104.4 2031 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSBTAP00000034104.4 1547 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1591
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSBTAP00000034104.4 1594 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1641
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC +C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSBTAP00000034104.4 1642 ICTTCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1695
8****86666644455677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSBTAP00000034104.4 1712 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1753
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSBTAP00000034104.4 2124 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2166
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MDQTCELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STASGTSQNA 60
YGQDSPSCYI PLRKLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQSP VVCTSLNPGG 120
PTALAMKQEP SCNNSPELQV KVTKTVKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI 180
EEIFEETQTN ATCNYEPKSE NGVDVAMGDE QDSTPDSRHG AGKPPFLPLA PQTETQRNKQ 240
RSDVDGSHEK AALLPAPLSL GDTNVTIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFC QPKKKATPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP 360
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420
WEASVGLAEQ YDIPKGSKNR KCVTSSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA 480
FDSEHSADEK EKPCAKSRAR KSSDNPKRTS VKKGHMQFET HKEERRGKIP ENLGLNFISG 540
DVSDKQASNE LSRIANSLTG PSTAPGSFLF SSCAKNTAKK EFETSNCDSL LGLSEGALIS 600
KRSGEKKKFQ RGLMCSSKVQ LCYIGAGDEE KRSDSISICT TSDDGSSDLD PVDHSSESDN 660
SVLEITDAFD RSENLLPVQK NEKVKYSRYP ATNTKVKAKQ KSLITNSHTD HLINCTKTTE 720
PGTETSQINL SDLKVSTLVR KPQPDFRNDG FSPKFNTSSS ISSENSIIKG GAKNQALLHS 780
KSKQPKIRSI KCKHKENPVV AEPPVANEDC SLKCCSSDNK GSPLASISKS GKVDGLKLLS 840
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLSEDVSDSG TSKPSKPLLF SSASGQNHIP 900
IEPDYKFSTL LMMLKDMHDS KTKEQRIMTA QNLVSYRSPG LGDCSTSSPV SAPKVLVSGG 960
SNHSSEKSGD GTQDPVHPGP GGGDSALSGE LSTSLPGLGS DKRDLPASGK NRSNCVTRRN 1020
CGRSKPSKFR DGFSAQMGKN TVNRKALKTE RRRKLNELPA VTLEAALQGD RESRGSEKSS 1080
SRGEAEDPGK EPTLQLMGHL TSEDGAHFSS VSFDNKVNQS DPEKIPEKGP SFEIRKVPEL 1140
DSEMNSENDE PSSINEAVPK KRWQRLNQRR TKPRKRTNRF KEKENSEGAF GVLLSADPVK 1200
KEDEFPEQRP PASTNKLEDA LTDPNHANHL DSAGPRLNVC DKSNASNEEM EKEPGIPSLT 1260
PQPELPEPAV RSEKKRLRKP SKWLLEYTEE YDQIFAPKKK QKKTQEQVHK VSSRCEEESL 1320
LARCRSSAQN KQVDENSLIS TKEEPPVLER EAPFLEGPLA QSELGGGHAE LPQLTLSVPV 1380
APEVSPRPAL ESEELLVKPP GNYESKRQRK PTKKLLESND LDPGFMPKKG DLGLTKKCYE 1440
AGHLENDINE SCAAPRSKEF GGGTTKLFDK PRKRKRQRHA TAKLHCKKVK NDISSKETPN 1500
SEGELMTHRT AASPKETVEE GVENDHGMPA SKKLQGERGG GAALKENVCQ NCEKLGELLL 1560
CEAQCCGAFH LECLGLTEMP RGKFICNECR TGIHTCFVCK QSGEDVKRCL LPLCGKFYHE 1620
ECVQKYPPTV MQNKGFRCSL HICTTCHAAN PASVSASKGR LMRCVRCPVA YHANDFCLAA 1680
GSKILASNSI ICPNHFTPRR GCRNHEHVNV SWCFVCSEGG SLLCCDSCPA AFHRECLNID 1740
IPEGNWYCND CKAGKKPHYR EIVWVKVGRY RWWPAEICHP RAVPSNIDKM RHDVGEFPVL 1800
FFGSNDYLWT HQARVFPYME GDVSSKDKMG KGVDGTYKKA LQEAAARFEE LKAQKELRQL 1860
QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA DLSEIPRCNC KATDENPCGI DSECINRMLL 1920
YECHPTVCPA GGRCQNQCFT KRQYPEVEIF RTLQRGWGLR TKTDIKKGEF VNEYVGELID 1980
EEECRARIRY AQEHDITNFY MLTLDKDRII DAGPKGNYAR FMNHCCQPNC ETQKWSVNGD 2040
TRVGLFALSD IKAGTELTFN YNLECLGNGK TVCKCGAPNC SGFLGVRPKN QPIATEEKSK 2100
KFKKKQQGKR RTQGEVTKER EDECFSCGDA GQLVSCKKPG CPKVYHADCL NLTKRPAGKW 2160
ECPWHQCDIC GKEAASFCEM CPSSFCKQHR EGMLFISKLD GRLSCTEHDP CGPNPLEPGE 2220
IREYVPPPVP LPPGPSTHLA EQSSGVAAQG PKMSDKPSAD TNPSLSLSKK ALAGTCQRPP 2280
LPERPPDRTD SRPQPVDRVR DLAGSGTKPQ SLVSSQKPLD RPPAVAGPRP QLSDKPSPVT 2340
GPSSSPSVRS QPLERPLGTA DPRLDKSIGA ASPRPQSLEK TPVPTGLRLP PPEKLLVTSG 2400
PKPQTSDRPP DKSHVSLSQR LPPPDKVLSA VVQTLVAKEK ALRPVDQNTQ SKNRAALVMD 2460
LIDLTPRQKD RAGSPHELTP QADEKMPVLE SSSWPASKGL GQIPRAVERG SVSDAVLQPL 2520
GKAAATSEHS WQAVKSLTQA RLLSQPPAKA FLYEPATQAS GRAPAGAEQT PGPPSQAPGL 2580
VKQVKQMAGG QQLPGLAAKS GQSFRPLGKS SLSTEEKKLA PTEQSPWALG KASPGPGLWP 2640
MVAGQTLAQS CWSSGSTQTL AQTCWSLGRG KDPKPEQSTL PALNQAPSSH KCAESEQK 2698
Nucleotide Sequence
(Fasta)
ATGGATCAGA CCTGTGAACT ACCTAGAAGA AATTGTCTGC TGCCCTTTTC CAATCCAGTG 60
AATTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CAATTTTTCT 120
GAGCCACTTA ATGGGTGTAC TATGCAGTTA TCGACTGCCA GTGGAACATC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTGTTACATT CCACTGCGGA AACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAGTATTT AAATGGGTCT GCTGATGGAT CAGAATCCTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGTCGCCA GTTGTTTGCA CTTCCTTGAA TCCTGGTGGT 360
CCAACAGCAC TTGCTATGAA ACAGGAACCC TCTTGTAATA ACTCCCCTGA ACTCCAGGTA 420
AAAGTAACAA AGACTGTCAA GAATGGCTTT CTGCACTTTG AGAATTTTAC TTGTGTGGAC 480
GATGCAGATG TAGATTCTGA AATGGACCCA GAACAGCCAG TCACAGAGGA TGAGAGTATA 540
GAGGAGATCT TTGAGGAAAC TCAAACCAAT GCCACCTGCA ATTATGAGCC TAAATCAGAG 600
AATGGTGTAG ACGTGGCCAT GGGAGATGAA CAAGACAGCA CACCAGACAG TAGACACGGT 660
GCAGGCAAAC CGCCATTCTT GCCATTAGCT CCTCAGACCG AAACGCAGAG AAATAAGCAA 720
AGAAGTGACG TGGACGGCAG CCATGAAAAA GCAGCCCTTC TCCCAGCCCC CCTTTCGCTA 780
GGAGATACAA ACGTTACCAT AGAAGAGCAA TTAAACTCAA TAAATTTATC TTTTCAGGAT 840
GATCCAGACT CCAGTACCAG TACATTAGGA AACATGCTAG AATTACCTGG AACTTCATCA 900
TCATCTACTT CACAGGAATT GCCATTTTGT CAACCCAAGA AAAAGGCTAC GCCGCTGAAG 960
TATGAAGTTG GAGATCTCAT CTGGGCAAAA TTCAAGAGAC GCCCATGGTG GCCCTGCAGG 1020
ATTTGTTCTG ATCCGTTGAT TAATACACAC TCAAAAATGA AAGTTTCTAA CCGCAGGCCC 1080
TATCGACAGT ACTACGTGGA GGCTTTTGGA GATCCTTCTG AGAGAGCCTG GGTGGCTGGA 1140
AAAGCAATCG TTATGTTCGA AGGCAGACAT CAATTTGAAG AGCTACCTGT CCTTAGGAGA 1200
AGAGGGAAGC AGAAAGAGAA GGGATATAGG CATAAGGTTC CTCAGAAGAT TTTGAGTAAA 1260
TGGGAAGCCA GTGTTGGTCT TGCTGAGCAG TATGATATTC CCAAAGGGTC GAAGAACCGA 1320
AAGTGTGTCA CCAGTTCAAT CAAGTTGGAC AGTGAGGAGG ATATGCCATT TGAGGACTGT 1380
ACAAATGATC CTGAATCAGA ACATGACCTG TTGCTTAATG GCTGCTTGAA ATCTCTGGCT 1440
TTTGACTCTG AACATTCTGC AGATGAGAAG GAAAAACCTT GTGCTAAGTC TCGAGCCAGA 1500
AAGAGCTCTG ATAATCCAAA AAGGACTAGT GTGAAAAAGG GCCATATGCA ATTTGAAACA 1560
CATAAGGAAG AACGGAGGGG AAAGATTCCA GAAAACCTTG GCTTAAACTT TATTTCTGGG 1620
GATGTGTCTG ATAAGCAGGC CTCGAATGAA CTTTCCCGGA TAGCGAACAG CCTCACAGGG 1680
CCCAGCACTG CTCCAGGAAG TTTCCTGTTT TCTTCTTGTG CAAAAAACAC TGCAAAGAAA 1740
GAATTTGAGA CTTCAAATTG TGACTCTTTA CTGGGCTTGT CTGAGGGTGC CTTGATCTCT 1800
AAACGTTCTG GGGAGAAGAA GAAATTCCAG CGAGGTCTGA TGTGTAGTTC GAAAGTACAG 1860
CTCTGCTATA TTGGAGCAGG TGATGAAGAA AAACGAAGTG ATTCCATTAG TATTTGTACC 1920
ACTTCTGATG ATGGAAGCAG TGATTTGGAT CCTGTAGATC ATAGTTCAGA GTCTGATAAC 1980
AGTGTCCTTG AAATTACAGA TGCTTTTGAT AGATCAGAGA ACTTGTTACC TGTGCAGAAA 2040
AATGAAAAGG TAAAGTATTC TAGGTATCCT GCCACAAACA CTAAGGTAAA AGCAAAGCAG 2100
AAGTCTCTGA TTACTAACTC ACACACGGAC CACCTAATAA ATTGTACCAA GACAACAGAG 2160
CCTGGAACTG AGACGTCTCA GATTAATCTC TCTGATCTTA AAGTGTCAAC TCTTGTCCGA 2220
AAACCCCAAC CAGATTTTAG AAACGATGGA TTTTCTCCAA AATTCAACAC ATCATCAAGT 2280
ATTTCCAGTG AGAACTCAAT AATAAAAGGT GGGGCTAAAA ATCAAGCTCT GTTACATTCA 2340
AAAAGCAAAC AGCCCAAAAT TCGAAGTATC AAGTGCAAAC ATAAAGAAAA CCCAGTTGTA 2400
GCAGAACCTC CAGTTGCAAA TGAGGACTGC AGTTTGAAAT GCTGCTCTTC TGATAACAAA 2460
GGCTCTCCTT TGGCCAGCAT TTCTAAAAGC GGGAAAGTGG ATGGACTGAA ACTACTGAGC 2520
AACATGCATG AGAAAACCAG GGATTCGAGT GACATAGAAA CAGCAGTGGT GAAACACGTT 2580
CTGTCAGAGT TGAAGGAGCT CTCTTACAGA TCCTTAAGTG AAGATGTCAG TGACTCCGGA 2640
ACGTCAAAGC CATCGAAACC ATTACTTTTT TCTTCTGCCT CTGGTCAGAA TCACATACCT 2700
ATTGAACCAG ACTACAAATT CAGTACATTA CTAATGATGT TGAAAGATAT GCATGATAGT 2760
AAGACCAAGG AGCAACGAAT AATGACAGCT CAGAACTTGG TCTCTTATCG GAGTCCTGGT 2820
CTTGGGGACT GTTCTACCAG CAGTCCTGTG TCAGCTCCTA AGGTCTTGGT TTCAGGAGGC 2880
TCCAATCACA GTTCAGAAAA AAGTGGAGAT GGCACTCAGG ATCCAGTCCA CCCTGGCCCT 2940
GGTGGGGGTG ACTCTGCACT GTCTGGGGAG CTGTCTACTT CCCTGCCTGG CTTGGGGTCT 3000
GATAAAAGAG ACCTCCCTGC TTCTGGCAAA AATCGTTCAA ACTGCGTTAC TAGGCGCAAC 3060
TGTGGTCGAT CAAAGCCATC CAAATTTCGA GATGGTTTTT CAGCCCAGAT GGGAAAGAAC 3120
ACAGTGAACC GTAAAGCCTT AAAAACAGAA CGCAGAAGAA AACTGAATGA GCTTCCAGCT 3180
GTGACTCTTG AGGCTGCACT ACAGGGAGAC AGAGAGAGTA GGGGTTCAGA GAAGAGCTCC 3240
TCCAGAGGTG AAGCAGAAGA CCCTGGTAAA GAACCAACCC TTCAATTAAT GGGCCATTTA 3300
ACAAGTGAAG ATGGTGCCCA TTTTTCCAGT GTTAGTTTTG ATAATAAAGT CAACCAGTCT 3360
GACCCTGAAA AAATTCCTGA AAAAGGCCCC TCTTTTGAAA TCAGGAAAGT CCCAGAGCTG 3420
GACTCTGAAA TGAACAGTGA GAATGATGAA CCCAGTAGTA TAAATGAAGC AGTGCCTAAA 3480
AAGCGATGGC AACGTTTAAA CCAAAGGCGC ACTAAACCTC GTAAACGCAC TAACAGATTT 3540
AAGGAAAAAG AAAACTCTGA GGGTGCCTTT GGGGTCTTGC TTTCTGCTGA CCCTGTAAAG 3600
AAGGAAGATG AGTTCCCAGA GCAGAGACCT CCTGCTTCGA CAAACAAACT AGAGGATGCA 3660
CTGACAGATC CAAATCATGC CAACCACTTA GATTCAGCTG GGCCGCGGTT GAATGTTTGT 3720
GATAAATCCA ATGCTAGCAA TGAGGAGATG GAAAAGGAGC CAGGAATTCC CAGTTTGACT 3780
CCTCAACCTG AGCTCCCTGA ACCAGCTGTG CGATCAGAGA AGAAACGCCT TAGGAAGCCA 3840
AGCAAGTGGC TTCTGGAATA TACAGAAGAA TATGATCAGA TATTTGCTCC TAAGAAAAAA 3900
CAAAAGAAGA CACAGGAACA GGTGCACAAG GTAAGTTCCC GCTGTGAAGA GGAAAGCCTT 3960
TTAGCCCGAT GTCGATCTAG TGCTCAGAAC AAACAGGTGG ATGAGAATTC TTTGATTTCA 4020
ACCAAAGAAG AGCCTCCAGT TCTTGAAAGG GAGGCTCCAT TTTTGGAAGG GCCCTTGGCT 4080
CAGTCAGAAC TTGGAGGTGG ACATGCTGAG TTGCCACAGC TGACCTTATC TGTACCTGTG 4140
GCTCCGGAAG TCTCTCCACG GCCTGCCCTT GAGTCTGAGG AATTGCTAGT TAAACCACCA 4200
GGAAATTATG AAAGTAAGCG TCAGAGAAAA CCAACTAAGA AACTTCTTGA ATCCAATGAT 4260
TTAGACCCTG GATTTATGCC CAAGAAAGGG GATCTTGGCC TTACTAAAAA GTGTTATGAA 4320
GCTGGTCACT TGGAGAATGA CATTAATGAA TCATGTGCTG CACCTCGTTC TAAAGAGTTT 4380
GGTGGAGGCA CCACCAAGCT GTTTGATAAA CCAAGGAAGC GAAAACGACA GAGGCATGCT 4440
ACAGCCAAGT TGCATTGTAA AAAAGTGAAA AATGACATCT CATCAAAAGA AACTCCAAAC 4500
TCTGAGGGAG AACTGATGAC ACACAGGACA GCTGCAAGCC CCAAGGAGAC TGTTGAGGAG 4560
GGTGTAGAAA ACGACCATGG AATGCCTGCA TCTAAAAAGC TGCAGGGGGA ACGAGGAGGT 4620
GGAGCCGCAC TCAAGGAGAA TGTCTGTCAG AACTGTGAGA AACTGGGTGA GCTGCTATTA 4680
TGTGAGGCTC AGTGCTGTGG GGCTTTCCAC TTGGAGTGCC TTGGATTAAC TGAAATGCCC 4740
AGAGGAAAGT TTATCTGCAA TGAATGTCGC ACAGGAATAC ATACCTGTTT TGTATGCAAG 4800
CAGAGTGGGG AAGATGTGAA AAGGTGCCTT CTGCCCTTAT GTGGAAAGTT TTACCATGAA 4860
GAGTGCGTCC AGAAGTACCC ACCCACCGTG ATGCAGAACA AGGGCTTCCG GTGTTCCCTC 4920
CACATATGTA CTACCTGCCA TGCTGCCAAT CCAGCCAGTG TTTCTGCGTC TAAAGGTCGA 4980
CTGATGCGCT GTGTCCGCTG CCCAGTGGCA TACCATGCCA ATGACTTTTG CCTGGCTGCT 5040
GGGTCAAAGA TCCTTGCATC CAATAGCATC ATCTGCCCTA ATCACTTTAC CCCTAGGCGT 5100
GGCTGCCGAA ATCATGAGCA TGTTAATGTT AGCTGGTGTT TTGTGTGCTC TGAAGGAGGC 5160
AGCCTTCTGT GTTGTGATTC TTGCCCTGCT GCTTTTCATC GTGAATGCCT GAACATTGAT 5220
ATCCCTGAAG GAAACTGGTA TTGCAATGAC TGTAAGGCAG GCAAAAAGCC ACATTACAGA 5280
GAAATTGTCT GGGTAAAAGT TGGAAGATAC AGGTGGTGGC CAGCTGAGAT CTGCCATCCT 5340
CGAGCTGTAC CTTCCAATAT TGACAAGATG AGACATGATG TAGGCGAGTT CCCTGTGCTC 5400
TTCTTTGGGT CTAATGACTA TCTGTGGACT CACCAGGCCC GAGTCTTTCC TTACATGGAG 5460
GGGGATGTGA GCAGCAAGGA TAAGATGGGC AAAGGAGTCG ACGGGACATA CAAAAAAGCT 5520
CTTCAGGAAG CTGCAGCAAG GTTTGAGGAG TTGAAGGCCC AAAAAGAGCT AAGACAGCTG 5580
CAGGAAGATC GAAAGAATGA CAAGAAGCCA CCACCTTACA AACATATAAA GGTGAACCGT 5640
CCTATTGGCA GGGTACAGAT CTTCACTGCA GACTTGTCTG AGATCCCCCG TTGCAACTGT 5700
AAAGCCACGG ATGAGAACCC CTGCGGCATA GACTCTGAGT GCATCAACCG CATGCTGTTG 5760
TATGAGTGCC ACCCTACCGT GTGCCCTGCT GGAGGTCGCT GCCAGAACCA GTGCTTTACC 5820
AAGCGCCAGT ACCCAGAGGT GGAAATTTTT CGCACATTAC AGAGGGGCTG GGGTCTCCGA 5880
ACAAAAACAG ATATTAAAAA GGGTGAATTT GTGAATGAAT ATGTGGGTGA GCTAATAGAT 5940
GAAGAAGAGT GCAGAGCTCG AATCCGTTAT GCCCAAGAAC ATGATATCAC TAATTTTTAT 6000
ATGCTAACTC TAGACAAAGA CCGGATTATT GATGCTGGCC CCAAAGGAAA CTATGCTCGA 6060
TTCATGAATC ATTGCTGCCA GCCTAACTGT GAAACACAGA AGTGGTCTGT GAATGGAGAC 6120
ACCCGGGTTG GCCTTTTTGC CCTGAGTGAC ATTAAAGCAG GCACTGAACT TACCTTCAAC 6180
TACAATCTAG AATGTCTTGG GAATGGAAAG ACCGTTTGCA AATGTGGAGC CCCAAACTGC 6240
AGTGGCTTTT TGGGTGTAAG GCCAAAGAAT CAACCCATTG CCACAGAGGA AAAGTCAAAG 6300
AAATTCAAGA AGAAGCAACA GGGGAAGCGC AGAACCCAGG GTGAAGTCAC AAAGGAGCGA 6360
GAGGATGAAT GTTTCAGCTG TGGGGATGCT GGCCAGCTCG TCTCTTGTAA GAAGCCAGGC 6420
TGCCCCAAAG TTTACCATGC AGACTGTCTC AATCTAACCA AGCGACCAGC AGGGAAATGG 6480
GAGTGTCCTT GGCATCAGTG TGACATATGC GGAAAGGAAG CAGCCTCCTT CTGTGAGATG 6540
TGTCCCAGCT CCTTCTGCAA GCAGCATCGG GAAGGGATGC TCTTCATCTC CAAACTGGAT 6600
GGGCGCCTGT CTTGTACTGA GCATGATCCC TGTGGGCCCA ACCCTCTGGA ACCGGGGGAG 6660
ATCCGTGAGT ATGTACCTCC TCCAGTACCA CTGCCTCCAG GCCCAAGCAC TCACCTGGCA 6720
GAGCAATCAT CAGGAGTGGC TGCTCAAGGG CCCAAGATGT CGGACAAGCC ATCCGCTGAC 6780
ACCAACCCGT CGCTGTCGCT GTCCAAAAAA GCTCTGGCAG GGACTTGTCA GAGGCCCCCG 6840
CTGCCTGAAA GGCCTCCTGA CAGAACTGAC TCCAGGCCCC AGCCTGTAGA TAGGGTCAGG 6900
GACCTTGCTG GGTCAGGGAC CAAACCCCAA TCATTGGTAT CCAGCCAGAA GCCATTGGAC 6960
AGGCCACCTG CAGTGGCAGG ACCAAGACCC CAATTATCTG ACAAACCCTC TCCCGTGACC 7020
GGCCCAAGCT CCTCACCCTC AGTCAGGTCT CAGCCACTGG AAAGACCTCT GGGGACAGCT 7080
GACCCAAGGC TGGATAAATC CATAGGTGCT GCCAGCCCAA GGCCCCAGTC ACTGGAGAAA 7140
ACCCCTGTCC CTACGGGCCT GAGACTTCCA CCGCCAGAGA AACTGCTAGT CACCAGCGGT 7200
CCCAAACCCC AGACTTCAGA CAGACCCCCT GACAAATCCC ATGTCTCTTT GTCCCAGAGA 7260
CTTCCACCTC CTGACAAAGT ACTGTCAGCT GTGGTCCAGA CCCTGGTAGC TAAAGAAAAA 7320
GCACTGAGGC CCGTGGACCA GAATACTCAG TCAAAAAATA GAGCTGCTTT GGTGATGGAT 7380
CTCATAGACC TAACTCCTCG CCAGAAGGAT CGGGCAGGTT CACCCCATGA GCTCACACCA 7440
CAGGCTGATG AGAAGATGCC AGTGTTGGAG TCGAGCTCAT GGCCTGCCAG CAAAGGTCTA 7500
GGACAGATAC CACGAGCTGT TGAGAGAGGC AGTGTGTCAG ATGCTGTCCT TCAGCCACTG 7560
GGCAAAGCAG CGGCCACTTC AGAACACTCC TGGCAAGCTG TTAAATCACT CACCCAGGCC 7620
AGACTTCTTT CTCAGCCCCC TGCCAAGGCT TTTTTATATG AGCCAGCAAC TCAGGCCTCA 7680
GGAAGAGCAC CTGCAGGGGC TGAACAGACC CCAGGGCCTC CCAGTCAAGC GCCAGGCCTG 7740
GTGAAGCAGG TGAAGCAGAT GGCTGGGGGC CAGCAACTAC CTGGACTTGC TGCCAAGAGT 7800
GGGCAGTCCT TCAGGCCTCT TGGGAAGTCG TCCCTTTCCA CTGAAGAGAA GAAGCTGGCA 7860
CCCACAGAGC AGAGTCCCTG GGCCCTGGGC AAGGCCTCGC CAGGGCCAGG GCTCTGGCCC 7920
ATGGTGGCTG GACAGACACT GGCACAGTCT TGCTGGTCCT CCGGGAGCAC ACAGACACTG 7980
GCACAGACTT GCTGGTCTCT TGGAAGAGGG AAAGACCCTA AACCAGAGCA AAGTACACTT 8040
CCAGCTCTTA ACCAGGCTCC TTCCAGTCAC AAGTGTGCAG AGTCAGAACA GAAGTAA 8098
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 94 0.0 4993
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 94 0.0 4983
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 93 0.0 4967
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 93 0.0 4906
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 91 0.0 4830
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 91 0.0 4804
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 91 0.0 4798
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 90 0.0 4779
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 90 0.0 4747
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 88 0.0 4674
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 98 0.0 4669
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 88 0.0 4653
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 93 0.0 4378
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 94 0.0 4378
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 83 0.0 4335
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 90 0.0 4277
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 90 0.0 4272
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 90 0.0 4262
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 89 0.0 4224
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 90 0.0 4155
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 89 0.0 4108
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 91 0.0 4067
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 88 0.0 4035
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 89 0.0 3932
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 89 0.0 3856
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 82 0.0 3826
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3601
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3191
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 78 0.0 2753
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 88 0.0 2749
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 89 0.0 2714
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 84 0.0 2359
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 57 0.0 2325
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 57 0.0 2322
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 58 0.0 2315
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 56 0.0 2191
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 59 0.0 2098
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 80 0.0 1890
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 51 0.0 1807
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 94 0.0 1706
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 76 0.0 1642
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 56 0.0 1500
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 77 0.0 1422
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 86 0.0 1397
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 81 0.0 1357
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1281
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1251
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 76 0.0 1205
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1204
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1140
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 84 0.0 1088
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 82 0.0 1081
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 79 0.0 1078
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 65 0.0 1069
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 77 0.0 1050
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 62 0.0 996
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 972
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 957
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 63 0.0 952
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 63 0.0 940
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 63 0.0 932
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 84 0.0 929
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 916
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 57 0.0 650
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 1e-121 437
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 9e-56 218
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 2e-50 200
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 5e-50 199
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 6e-50 198
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 42 2e-49 197
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 3e-49 196
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 7e-49 195
Created Date 25-Jun-2016