WERAM Information


Tag Content
WERAM ID WERAM-Orc-0022
Ensembl Protein ID ENSOCUP00000002408.2
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSOCUG00000002775.2 ENSOCUT00000002776.2 ENSOCUP00000002408.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.60e-52 176.6 1946 2062
Me_Reader PWWP 2.90e-32 110.6 323 1820
HMT SET1 1.80e-29 102.2 1946 2062
Me_Reader PHD 2.40e-19 69.3 1547 2166
Organism Oryctolagus cuniculus
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSOCUP00000002408.2 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2032
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSOCUP00000002408.2 2033 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSOCUP00000002408.2 323 VGDLIWAKFKRRPWWPCKICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387
69********************98766665466888999******************99998775 PP
PWWP.txt 12 YpwWPalvisppleakklktqeaeenk 38
Y+ +Pa ++++ ++k+l t+++ +++
ENSOCUP00000002408.2 686 YSRYPATNSRVKPKQKSLITNSHTDHL 712
999****99999999999888887775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSOCUP00000002408.2 1760 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1820
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSOCUP00000002408.2 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2030
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSOCUP00000002408.2 2031 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSOCUP00000002408.2 1547 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1591
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSOCUP00000002408.2 1594 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1641
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSOCUP00000002408.2 1642 ICITCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1695
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSOCUP00000002408.2 1712 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1753
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSOCUP00000002408.2 2124 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2166
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MDQTCELPRR NCLLPLCNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STASGTSPSA 60
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQSP VVCASLRPGG 120
PTALAMKQEP CCNNSPELQV KVTKTIKNGL VHFENCTCVD DADVESEMDP EQPVTEDECI 180
EEIFEETQTN ATCNYEPKSE NGAKVAMGSE QDSTPESRHG AVKSPFLLLA PQTETQNNKQ 240
RSEVDGSNEK AALLPTPFSL GDANGTLEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCK ICSDPLINTH SKMKVSNRRP 360
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420
WEASVGLAEQ YDVPKGSKNR NCVSTSIKLD SEEDMPFEDC TNDPDSEHDL LLNGCLKSLA 480
FDSEHSADEK EKSCAKSRAR KNCDNPKRTS VKKGHMQFEA HKEERRGKIS ENLGLNFISG 540
DVSDKQASNE LSRIANSLTG SNTAPGSFLF SSCGQNTAKK EFETSNCDSL LGLSEGTLIS 600
KCSGEKKKPQ RGLVCSSKVQ LCYIGTGDEE KRSDSISICT TSDDGSSDLD PVEHSSESDN 660
SILEITDTFD RTENILSMQK NEKIKYSRYP ATNSRVKPKQ KSLITNSHTD HLMNCTKTTE 720
LGTEMSQVNL SDLTVSTLVH KPQSDFKNDS LAPKFNTPSA ISSENSLVTG GATNQTLLHS 780
KSKPPKFRSI KCKHKENPLI VEPSVPNEDC SLKCCSSDTK GSPLASISKS GKMDGLKLLS 840
NMHEKTRDSS DIETAVVKHV LSELKELSYR SLSEDVSDSG TSKPSKPLLF SSASNQNHIP 900
IEPDYKFSTL LMMLKDMHDS KTKEQRLMTA QNLISYRSPS RGDCSTSSPV GASKILVSGS 960
FTHNSEKSGD VTQDSARPSP SGGDSAPSVE LSASLPGLVS DKRDLSVSVK SRSNCVTRRN 1020
CGRSKPSKLR DAFSTQMGKN TVNRKALKTE RKRKPSQLPA VTLEVPLQGD KESGSSVSGS 1080
SRDGAEDSGK ESSQQTGHLT SEDAIQFSDV HFDNKVKQSD PDKIPEKEPT FENRKDPELN 1140
SEMNSENDEP NGVNQVVPKK RWQRLNQRRT KPRKRTNRSR EKENSEDAFG VLLPGDPVQK 1200
GRDEFPEHRT PPPTNIVEDS VTDPNHAGCL DSVGPQLNVC DKSAASNEDM EKEPGIPSLT 1260
PQAELPEPVV RSEKKRLRKP SKWLLEYTEE YDQIFAPKKK QKKVQEQVHK VSSRCEEEGL 1320
LARCQSSAQN KQVDENSLIS TKEEPPVLER EAPFLEGPLA QSELGGGNAE LPQLTLSVPV 1380
APEVSPRPAL ESEELLVKTP GNYESKRQRK PTKKLLESND LDPGFMPKKG DLGLTKKCYE 1440
AGCLENGITE SCAVSRSKEF GGGTTKIFDK PRKRKRQRHV AAKVQCKKVK NDDSSKGMPG 1500
SEGELMAHRT TASPKEAVEE GVEHDPGMPA SKKMQGERGG GAALKENVCQ NCEKLGELLL 1560
CEAQCCGAFH LECLGLTEMP RGKFICNECR TGIHTCFVCK QSGEDVKRCL LPLCGKFYHE 1620
ECVQKYPPTV MQNKGFRCSL HICITCHAAN PASVSASKGR LMRCVRCPVA YHANDFCLAA 1680
GSKILASNSI ICPNHFTPRR GCRNHEHVNV SWCFVCSEGG SLLCCDSCPA AFHRECLNID 1740
IPEGNWYCND CKAGKKPHYR EIVWVKVGRY RWWPAEICHP RAVPSNIDKM RHDVGEFPVL 1800
FFGSNDYLWT HQARVFPYME GDVSSKDKMG KGVDGTYKKA LQEAAARFEE LKAQKELRQL 1860
QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA DLSEIPRCNC KATDENPCGI DSECINRMLL 1920
YECHPTVCPA GGRCQNQCFT KRQYPEVEIF RTLQRGWGLR TKTDIKKGEF VNEYVGELID 1980
EEECRARIRY AQEHDITNFY MLTLDKDRII DAGPKGNYAR FMNHCCQPNC ETQKWSVNGD 2040
TRVGLFALSD IKAGTELTFN YNLECLGNGK TVCKCGAPNC SGFLGVRPKN QPIATEEKSK 2100
KFKKKQPGKR RSQGEITKER EDECFSCGDA GQLVSCKKPG CPKVYHADCL NLTKRPAGKW 2160
ECPWHQCDVC GKEAASFCEM CPSSFCKQHR EGMLFISKLD GRLSCTEHDP CGPNPLEPGE 2220
IREYVPPPVP LPPGPGAHLA EQSSGTAAQG PKMSDKPPAD TNQTLPLSKK ALAGTCQRPL 2280
LPERPLERTD SRPQLLDRVR DLAGSGTKSQ PLASSQRPLD RSPPVAGPRP QLSDKPSPVT 2340
GPGSSPSVRP QPLERPLGTT DPRLDKSIGA VSPRPQSLEK TPVPTGLRLL PPDRLLVTSS 2400
PKPQTSERPP DKSHAPLSQR LPPPEKVLSA VVQTLVAKEK ALRPVDQNTQ SKNRAALVMD 2460
LIDLTPRQKE RAASPHEVTP QADEKVPVLE SSSWAASKGL GHMPRVVEKG SMSEPLLQPP 2520
GKTAAPAEHP WQAVKSLTQA RLLSQPPAKA FLYEPATQAS GRAPAGAEQT PGPPSQAPGL 2580
VKQVKQMAGS QQLPGLAAKT GQSFRSLVKT PASLSTEEKK LATPEQSSWA LGKTSAGAGL 2640
WPMVAGQTLM QSCWPAGSTQ TLAQTCWSLG RGQDPKPEQN TLPALNQAPS NHKCAESEQK 2700
Nucleotide Sequence
(Fasta)
ATGGATCAGA CTTGTGAACT ACCTAGAAGA AATTGTCTGC TGCCCCTTTG CAACCCAGTG 60
AACTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CAATTTTTCT 120
GAGCCACTTA ATGGGTGTAC TATGCAGTTG TCGACTGCCA GTGGAACATC CCCAAGTGCT 180
TATGGACAAG ATTCTCCATC TTGTTACATT CCACTGCGGA GACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAGTATTT AAATGGGTCT GCAGATGGTT CAGAATCCTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGTCGCCA GTGGTTTGCG CTTCCTTGAG GCCTGGTGGT 360
CCAACAGCAC TTGCTATGAA ACAGGAGCCC TGTTGTAATA ACTCCCCTGA ACTCCAGGTA 420
AAAGTTACAA AGACTATCAA GAATGGCTTG GTGCACTTTG AGAATTGTAC TTGTGTGGAC 480
GATGCAGATG TAGAATCTGA AATGGACCCA GAACAGCCAG TCACAGAGGA TGAGTGTATA 540
GAGGAGATCT TTGAGGAAAC TCAGACCAAT GCCACCTGCA ATTATGAGCC TAAATCAGAG 600
AATGGTGCAA AAGTGGCCAT GGGAAGTGAA CAAGACAGCA CACCAGAGAG TAGACACGGT 660
GCAGTCAAAT CGCCATTCTT GCTATTAGCT CCTCAGACCG AAACACAGAA CAATAAGCAA 720
AGAAGTGAAG TGGACGGCAG CAATGAAAAA GCAGCCCTTC TCCCAACCCC TTTTTCACTA 780
GGAGATGCAA ACGGTACCCT AGAAGAGCAA TTAAACTCGA TAAATTTATC TTTTCAGGAT 840
GATCCAGACT CCAGTACCAG TACATTAGGA AACATGCTAG AGTTACCTGG AACTTCATCA 900
TCATCTACTT CACAGGAATT GCCATTTTGT CAACCCAAGA AAAAGTCGAC ACCACTGAAG 960
TATGAAGTTG GAGATCTCAT CTGGGCAAAG TTCAAGAGAC GCCCATGGTG GCCCTGCAAG 1020
ATTTGTTCTG ATCCATTGAT TAACACACAC TCAAAAATGA AAGTTTCCAA CCGGAGGCCC 1080
TATCGGCAAT ACTATGTTGA GGCTTTTGGA GACCCTTCTG AAAGAGCCTG GGTAGCTGGA 1140
AAAGCAATCG TCATGTTTGA AGGCAGACAT CAGTTTGAAG AGCTACCTGT CCTTAGGAGA 1200
AGAGGGAAGC AGAAAGAAAA AGGATATAGG CATAAGGTTC CTCAGAAAAT TTTGAGTAAA 1260
TGGGAAGCCA GTGTTGGTCT TGCTGAACAG TATGATGTTC CCAAAGGATC AAAGAATCGC 1320
AACTGTGTCA GTACTTCCAT CAAGTTGGAC AGTGAGGAGG ACATGCCATT TGAGGACTGT 1380
ACAAATGATC CTGATTCAGA ACACGACCTA TTACTTAATG GCTGCTTGAA ATCTCTGGCT 1440
TTTGATTCTG AACATTCTGC AGACGAGAAG GAAAAGTCTT GTGCTAAATC TCGAGCCAGA 1500
AAGAACTGTG ATAATCCAAA AAGGACTAGT GTGAAAAAGG GCCACATGCA GTTTGAAGCA 1560
CATAAGGAAG AACGGAGGGG AAAGATTTCT GAGAACCTTG GCCTAAACTT TATCTCTGGG 1620
GATGTGTCTG ATAAACAGGC ATCTAATGAA CTTTCCAGGA TAGCAAACAG CCTCACAGGG 1680
TCCAATACTG CCCCAGGAAG TTTCCTATTT TCTTCTTGTG GACAAAACAC TGCAAAGAAA 1740
GAATTTGAGA CTTCAAATTG TGACTCTTTA TTGGGCTTGT CTGAGGGTAC CTTGATTTCT 1800
AAATGTTCTG GAGAGAAAAA GAAGCCCCAG CGAGGTCTGG TGTGTAGTTC AAAAGTGCAG 1860
CTTTGCTATA TTGGAACAGG TGATGAGGAA AAGCGAAGTG ATTCCATTAG TATCTGCACC 1920
ACTTCTGATG ATGGAAGCAG TGATCTAGAT CCTGTAGAAC ACAGCTCAGA ATCAGATAAC 1980
AGTATCCTTG AAATTACAGA TACTTTTGAT AGAACAGAGA ACATATTATC CATGCAGAAA 2040
AATGAAAAGA TAAAGTATTC TAGGTATCCT GCCACAAACT CCAGAGTAAA ACCAAAACAG 2100
AAATCCCTCA TTACTAACTC ACATACAGAC CACTTAATGA ATTGTACTAA GACAACAGAG 2160
CTGGGAACTG AGATGTCTCA GGTTAATCTC TCTGACCTTA CGGTGTCCAC TCTTGTCCAC 2220
AAACCCCAAT CAGATTTTAA AAATGATAGT CTTGCTCCCA AATTCAACAC ACCATCAGCC 2280
ATTTCCAGTG AGAACTCATT AGTAACGGGT GGGGCTACAA ATCAGACTCT GTTACATTCG 2340
AAAAGCAAAC CACCCAAGTT CCGAAGTATA AAGTGCAAGC ATAAAGAAAA TCCACTTATT 2400
GTAGAACCAT CAGTTCCAAA TGAGGACTGC AGTTTGAAAT GCTGCTCTTC AGATACTAAA 2460
GGCTCTCCTT TGGCCAGCAT TTCAAAAAGT GGGAAAATGG ACGGGCTGAA ACTACTGAGC 2520
AACATGCATG AGAAAACCAG AGATTCAAGT GACATAGAAA CAGCCGTGGT GAAGCATGTT 2580
CTGTCAGAGC TGAAGGAACT CTCTTACAGA TCCTTAAGTG AGGATGTCAG TGACTCTGGT 2640
ACGTCAAAGC CATCAAAACC GTTACTTTTT TCTTCTGCTT CTAATCAGAA TCATATACCT 2700
ATTGAGCCAG ACTACAAATT CAGTACATTG CTAATGATGT TAAAAGACAT GCATGATAGT 2760
AAGACAAAGG AGCAGCGATT GATGACTGCT CAAAACTTGA TCTCTTATCG GAGTCCTAGT 2820
CGGGGGGATT GTTCTACCAG TAGTCCTGTA GGGGCTTCCA AGATTTTGGT TTCGGGAAGC 2880
TTCACCCATA ATTCAGAAAA AAGTGGAGAT GTCACTCAGG ACTCAGCCCG TCCTAGCCCT 2940
AGCGGGGGTG ACTCTGCACC GTCTGTGGAG TTGTCTGCCT CTTTACCTGG GTTAGTGTCT 3000
GACAAGAGAG ACCTCTCTGT TTCTGTTAAA AGTCGTTCAA ACTGTGTGAC TAGGCGCAAC 3060
TGCGGGCGAT CTAAGCCATC CAAATTGCGT GATGCTTTTT CAACCCAGAT GGGAAAGAAC 3120
ACCGTGAACC GCAAGGCCTT AAAGACAGAG CGCAAGAGAA AACCGAGCCA GCTTCCTGCT 3180
GTGACTCTAG AGGTTCCACT GCAGGGAGAC AAAGAGAGTG GAAGCTCAGT GAGTGGCTCA 3240
TCAAGAGATG GGGCAGAAGA TTCTGGTAAA GAATCCAGTC AACAAACGGG CCATTTAACG 3300
AGTGAAGATG CTATCCAATT TTCTGATGTT CATTTTGATA ACAAGGTTAA GCAGTCTGAC 3360
CCTGATAAAA TTCCCGAAAA AGAACCTACT TTTGAAAACA GGAAAGATCC TGAATTGAAC 3420
TCTGAAATGA ACAGTGAGAA TGATGAACCC AATGGTGTAA ATCAAGTGGT GCCTAAAAAG 3480
CGCTGGCAGC GTTTAAACCA AAGGCGCACT AAACCTCGTA AGCGCACTAA CAGATCTAGG 3540
GAGAAAGAAA ACTCTGAGGA TGCCTTTGGA GTCTTGCTTC CTGGTGACCC TGTGCAGAAG 3600
GGGCGGGATG AGTTCCCAGA GCATAGAACT CCTCCTCCTA CAAACATAGT AGAGGACTCA 3660
GTGACAGATC CAAATCATGC TGGCTGCTTA GATTCAGTTG GGCCACAGTT GAATGTTTGT 3720
GATAAATCTG CCGCCAGCAA TGAGGACATG GAAAAGGAAC CAGGAATCCC CAGTTTGACA 3780
CCACAAGCTG AGCTTCCCGA ACCAGTTGTG CGGTCAGAGA AGAAACGCCT TAGGAAGCCA 3840
AGCAAGTGGC TTCTGGAATA TACAGAAGAA TATGATCAGA TATTTGCTCC TAAGAAGAAA 3900
CAAAAGAAAG TACAGGAGCA GGTGCACAAG GTAAGTTCCC GCTGTGAAGA GGAAGGCCTT 3960
CTAGCCCGAT GTCAATCTAG TGCCCAGAAC AAGCAGGTGG ACGAGAATTC TTTGATTTCA 4020
ACCAAAGAAG AGCCTCCAGT TCTTGAAAGG GAGGCTCCAT TTTTGGAGGG GCCCTTGGCC 4080
CAGTCAGAAC TTGGAGGTGG AAATGCTGAG TTGCCACAGC TTACCTTGTC TGTGCCTGTG 4140
GCTCCGGAAG TCTCTCCACG ACCTGCCCTT GAGTCTGAGG AATTGCTTGT TAAAACACCA 4200
GGAAATTATG AAAGTAAGCG TCAACGAAAG CCAACGAAGA AGCTTCTTGA ATCCAATGAT 4260
TTAGACCCTG GATTTATGCC CAAGAAGGGT GACCTTGGCC TTACTAAAAA GTGTTATGAA 4320
GCTGGTTGCT TGGAGAATGG GATTACTGAA TCATGTGCTG TATCTCGTTC AAAAGAGTTT 4380
GGCGGAGGCA CTACCAAGAT TTTTGATAAA CCAAGGAAAC GAAAAAGACA GAGGCATGTT 4440
GCAGCTAAGG TGCAGTGTAA AAAAGTGAAA AATGACGACT CATCAAAAGG GATGCCAGGG 4500
TCAGAGGGAG AACTCATGGC TCACAGGACG ACTGCAAGTC CTAAGGAGGC TGTTGAGGAG 4560
GGCGTAGAAC ACGACCCTGG GATGCCCGCG TCTAAAAAAA TGCAGGGTGA ACGAGGTGGC 4620
GGAGCTGCAC TCAAGGAGAA TGTTTGTCAG AACTGTGAGA AACTGGGTGA GCTGCTCTTG 4680
TGTGAGGCTC AGTGCTGTGG GGCTTTCCAC CTGGAGTGCC TTGGGTTGAC TGAGATGCCC 4740
AGAGGAAAAT TCATCTGCAA TGAATGTCGA ACAGGAATCC ATACCTGTTT TGTATGTAAA 4800
CAGAGTGGGG AAGATGTTAA AAGGTGCCTT CTGCCCTTAT GTGGAAAGTT CTACCATGAA 4860
GAGTGTGTGC AGAAGTACCC ACCAACAGTC ATGCAGAACA AGGGCTTCCG GTGCTCCCTC 4920
CACATCTGTA TAACCTGCCA TGCTGCTAAT CCAGCCAGTG TTTCTGCGTC TAAAGGACGT 4980
CTGATGCGCT GTGTCCGCTG TCCGGTGGCA TACCATGCCA ATGACTTCTG CCTGGCTGCC 5040
GGGTCAAAGA TCCTTGCATC TAATAGTATC ATCTGTCCTA ATCACTTTAC CCCTAGACGG 5100
GGCTGTCGAA ACCATGAGCA TGTGAATGTT AGCTGGTGTT TCGTGTGCTC AGAAGGCGGC 5160
AGCCTTTTGT GTTGTGATTC TTGCCCTGCT GCTTTTCATC GTGAATGCCT GAACATTGAT 5220
ATCCCTGAAG GAAACTGGTA CTGCAATGAC TGCAAGGCAG GAAAAAAGCC ACACTACAGG 5280
GAGATTGTTT GGGTAAAAGT TGGACGATAT AGGTGGTGGC CAGCAGAGAT CTGCCATCCT 5340
CGAGCGGTTC CTTCCAACAT TGATAAGATG AGACACGATG TGGGCGAGTT CCCTGTCCTC 5400
TTCTTTGGGT CTAATGACTA TCTGTGGACC CATCAGGCCC GAGTCTTCCC ATACATGGAA 5460
GGGGACGTGA GCAGCAAGGA CAAGATGGGC AAAGGCGTGG ATGGGACTTA TAAAAAAGCT 5520
CTTCAAGAAG CTGCAGCAAG ATTTGAGGAA TTAAAGGCCC AGAAGGAGCT AAGACAACTG 5580
CAAGAAGACC GAAAGAATGA CAAGAAACCA CCACCTTATA AACATATAAA GGTGAACCGT 5640
CCTATAGGCA GGGTACAGAT CTTCACTGCA GACTTATCTG AAATTCCCCG TTGCAACTGT 5700
AAAGCTACAG ACGAGAACCC TTGTGGCATA GACTCTGAGT GCATCAACCG CATGCTGCTG 5760
TATGAGTGCC ACCCCACAGT GTGTCCTGCT GGCGGGCGCT GCCAAAACCA GTGCTTCACC 5820
AAGCGCCAGT ATCCAGAGGT TGAAATTTTC CGCACCTTAC AGAGGGGTTG GGGTCTGCGG 5880
ACAAAAACAG ATATTAAGAA GGGTGAATTT GTGAATGAAT ATGTAGGTGA GCTAATAGAC 5940
GAAGAAGAAT GCAGAGCTCG AATCCGTTAT GCCCAGGAAC ATGATATCAC CAATTTCTAT 6000
ATGCTAACAC TAGACAAAGA CCGAATCATT GATGCTGGTC CCAAAGGAAA TTATGCTCGG 6060
TTCATGAATC ATTGCTGTCA GCCCAACTGT GAAACCCAGA AGTGGTCTGT GAATGGAGAT 6120
ACCCGCGTTG GCCTTTTTGC CCTGAGTGAC ATTAAAGCAG GCACTGAGCT TACCTTCAAC 6180
TATAACCTAG AATGTCTTGG GAATGGAAAA ACTGTCTGCA AATGTGGAGC CCCAAACTGC 6240
AGTGGCTTCC TGGGTGTAAG GCCAAAGAAT CAACCCATTG CCACAGAAGA AAAATCAAAG 6300
AAGTTCAAGA AGAAGCAGCC GGGAAAGCGC AGGAGCCAGG GCGAGATCAC AAAGGAGCGA 6360
GAAGATGAGT GTTTCAGCTG TGGGGACGCC GGCCAGCTCG TCTCCTGCAA GAAGCCGGGC 6420
TGCCCGAAAG TTTACCACGC AGACTGTCTG AATCTGACCA AGCGGCCAGC AGGGAAGTGG 6480
GAGTGTCCGT GGCATCAGTG TGATGTGTGT GGGAAGGAAG CAGCCTCCTT CTGTGAGATG 6540
TGCCCCAGTT CCTTCTGCAA GCAGCATCGG GAAGGCATGC TGTTCATCTC CAAACTGGAT 6600
GGGCGTCTAT CTTGTACTGA GCACGACCCC TGTGGGCCCA ACCCTCTGGA ACCCGGGGAG 6660
ATCCGTGAGT ATGTGCCTCC CCCGGTACCA TTGCCTCCAG GGCCGGGCGC TCACCTGGCA 6720
GAACAGTCAT CAGGAACAGC TGCTCAGGGG CCCAAGATGT CAGATAAGCC ACCTGCCGAC 6780
ACCAACCAGA CGCTGCCACT CTCCAAGAAA GCTCTGGCAG GGACTTGTCA GAGGCCATTG 6840
CTGCCTGAAA GACCTCTTGA AAGAACTGAC TCCAGGCCCC AGCTTTTAGA TAGGGTCAGA 6900
GACCTTGCTG GGTCAGGGAC CAAATCCCAA CCCTTGGCAT CCAGCCAGAG GCCACTGGAC 6960
AGGTCACCTC CAGTGGCAGG ACCAAGACCC CAGCTATCTG ACAAGCCCTC TCCAGTGACT 7020
GGCCCTGGCT CCTCACCCTC AGTCAGGCCC CAACCACTGG AAAGACCTCT GGGGACAACT 7080
GACCCACGGC TGGATAAATC TATAGGTGCT GTCAGCCCAC GGCCCCAGTC ACTGGAGAAA 7140
ACCCCAGTTC CCACTGGCTT ACGACTTCTG CCGCCAGACA GACTGCTAGT TACCAGCAGT 7200
CCCAAACCTC AGACTTCAGA GCGGCCCCCA GACAAATCCC ATGCCCCTTT GTCCCAGAGA 7260
CTCCCACCTC CTGAGAAAGT ACTATCAGCT GTGGTCCAGA CCTTGGTAGC TAAAGAAAAA 7320
GCACTGAGGC CTGTGGACCA GAATACTCAG TCAAAAAATA GAGCTGCTTT GGTGATGGAT 7380
CTCATAGACC TAACCCCTCG CCAGAAGGAG CGGGCAGCAT CTCCTCATGA GGTCACACCA 7440
CAGGCTGATG AGAAGGTGCC GGTGTTGGAA TCGAGCTCAT GGGCAGCCAG CAAAGGCCTG 7500
GGGCATATGC CACGAGTGGT TGAGAAAGGC AGCATGTCAG AACCTCTTCT CCAGCCACCT 7560
GGAAAGACCG CAGCCCCTGC AGAGCACCCC TGGCAAGCTG TTAAATCACT CACCCAGGCC 7620
AGACTTCTTT CTCAGCCGCC TGCCAAGGCT TTTTTATATG AGCCAGCAAC TCAGGCCTCA 7680
GGAAGAGCTC CTGCAGGGGC TGAGCAGACC CCAGGACCTC CCAGCCAAGC ACCAGGCCTG 7740
GTGAAGCAGG TGAAGCAGAT GGCCGGAAGC CAGCAACTAC CTGGACTTGC TGCCAAAACT 7800
GGGCAGTCCT TCAGGTCTCT TGTGAAGACC CCAGCCTCCC TCTCTACTGA AGAGAAGAAG 7860
TTGGCAACTC CAGAGCAGAG TTCCTGGGCC TTGGGGAAAA CCTCAGCAGG GGCAGGGCTG 7920
TGGCCCATGG TGGCTGGACA GACACTGATG CAGTCCTGCT GGCCTGCGGG GAGCACACAG 7980
ACATTGGCAC AGACTTGCTG GTCTCTTGGA AGAGGGCAAG ACCCCAAACC AGAGCAAAAT 8040
ACACTTCCAG CTCTAAACCA AGCCCCTTCC AATCACAAAT GTGCAGAGTC AGAACAGAAG 8100
TAA 8104
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 91 0.0 4837
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 90 0.0 4821
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 91 0.0 4817
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 90 0.0 4810
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 91 0.0 4802
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 90 0.0 4786
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 90 0.0 4783
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 90 0.0 4768
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 88 0.0 4690
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 89 0.0 4663
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 86 0.0 4569
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 83 0.0 4343
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 90 0.0 4333
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 90 0.0 4327
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 90 0.0 4314
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 90 0.0 4295
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 89 0.0 4284
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 90 0.0 4276
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 89 0.0 4226
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 89 0.0 4169
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 89 0.0 4110
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 88 0.0 4086
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 90 0.0 4011
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 87 0.0 3922
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 89 0.0 3892
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 83 0.0 3883
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3588
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3201
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 89 0.0 2786
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 76 0.0 2706
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 86 0.0 2598
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 87 0.0 2461
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 58 0.0 2328
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 57 0.0 2322
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 57 0.0 2293
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 58 0.0 2164
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 59 0.0 2109
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 79 0.0 1820
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 50 0.0 1803
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 82 0.0 1630
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 89 0.0 1602
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 55 0.0 1508
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 86 0.0 1424
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 82 0.0 1407
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 76 0.0 1397
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1300
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1253
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 77 0.0 1212
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 77 0.0 1209
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1147
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 65 0.0 1104
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 83 0.0 1093
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 79 0.0 1085
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 80 0.0 1066
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 76 0.0 1054
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 67 0.0 1004
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 977
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 968
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 64 0.0 960
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 59 0.0 946
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 61 0.0 938
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 83 0.0 926
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 921
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 659
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 1e-121 436
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 1e-55 217
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 40 1e-50 201
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 40 3e-50 199
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 40 3e-50 199
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 41 1e-49 197
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 3e-49 196
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 6e-49 195
Created Date 25-Jun-2016