WERAM Information


Tag Content
WERAM ID WERAM-Myl-0080
Ensembl Protein ID ENSMLUP00000006392.2
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMLUG00000006970.2 ENSMLUT00000006991.2 ENSMLUP00000006392.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.60e-52 176.6 1946 2062
Me_Reader PWWP 5.00e-33 113.1 323 1820
HMT SET1 2.00e-29 102.1 1946 2062
Me_Reader PHD 1.70e-19 69.8 1547 2166
Organism Myotis lucifugus
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSMLUP00000006392.2 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2032
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSMLUP00000006392.2 2033 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSMLUP00000006392.2 323 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 387
69********************98766665466888999******************99998775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSMLUP00000006392.2 1760 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1820
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSMLUP00000006392.2 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2030
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSMLUP00000006392.2 2031 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSMLUP00000006392.2 1547 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1591
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSMLUP00000006392.2 1594 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1641
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSMLUP00000006392.2 1642 ICITCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1695
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSMLUP00000006392.2 1712 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1753
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSMLUP00000006392.2 2124 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2166
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MDQTSELPRR NCLLPFSNPV NLDAPEDKDS PFGNGQSTFS EPLNECTMQL STASGTSQNA 60
YGQDSPSCYI PLRKLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQSP IVCTSLSPGG 120
PTALAMKQEP SCNNSPELQV KVTKTVKNGF LHFENFTCVD DADVDSEMDP EQPVTEDESI 180
EEIFEETQTN ATCNYEPKSE NGVDVAMGNE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ 240
RNEVDGSNEK ASLLPAPFSL GDTNVTIEEQ LNSVNLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFC QPKKKSTPLK YEVGDLIWAK FKRRPWWPCR ICSDPLINTH SKMKVSNRRP 360
YRQYYVEAFG DPSERAWVAG KAIVMFEGRH QFEELPVLRR RGKQKEKGYR HKVPQKILSK 420
WEASVGLAEQ YDVPKGSKNR KCVSSSIKLD SEEDMPFEDC TNDPESEHDL LLNGCLKSLA 480
FDSEHSADEK EKPCAKSRVR KSTDNPKRTC VKKGHMQFET NKEERRGKIP ENLGINFISG 540
DVSDKQASNE LSRIANSLTG ASTAPGSFLF SSCAKNPAKK EFETSNCDSL LGLSEGALIS 600
KRSGEKKKLQ RGLVCSSKVQ LCYIKAGDEE KRSDSISICT TSDDGSSDLD PIDHSSESDN 660
SVLEITDAFD RTENMLSLQK NEKVKYSRYP ATNTKVKAKQ KSLITSSLTD HLTDCTKTAE 720
SGTETSQVNL SDFKVSTLVR KPQMDFRNDG FSTKFNTPSS ICSENSLIKG GATNQTLLHS 780
KSKQPKIRSI KCKHKENPVV EPPVTNEDCS LKCCSSDTKG SPLASISKSG KGDGLKLLSN 840
MHEKTRDSND IETAVVKHVL SELKELSYRS LSEDVSESGT SKPSKPLLFS SASSQNHIPI 900
EPDYKFSTLL MMLKDMHDSK TKEQRLMTAQ NLVSYRSPGL GDCSTSSPLG TSKVLVSGAS 960
NHNSEKNGDG TQDPVYPSPS GGDSALSGEL SASLPGLVSD RRDLPASGKS RSNCVTRRNC 1020
GRSKPSKLQD AFSGQMGKNT ANHRALKTER KRKLNQVPAV IPEAALPGDR QNAASENGSL 1080
GGGAEDPAKE EPLQLMGHLT SEDSAHFSDV HFDNKVNQSE PDKTEKGSIF ENRKGPELDS 1140
EMNSENDEPS GVNQVVPKKR WQRLNQRRTK PRKRTNRFRE KENSEGSFGV SLSADSVKKG 1200
GEFPEHRPSP STNILEDVLT DPNHTGHLDS VGPLLNVCDK SGANTEEMEK EPGIPTLTPQ 1260
PELPEPAVRS EKKRLRKPSK WLLEYTEEYD QIFAPKKKQK KVQEQVHKVS SRCEEENLLA 1320
RSRSNAQNKQ VDENSLISTK EEPPVLEREA PFLEGPLSHS ELGGGHAELP QLTLSVPVAL 1380
EVSPRPPLKS EELLVKTPGN YESKRQRKPT KKLLESNDLD PGFMPKKGDI GLSKKCYDAG 1440
HLENNIAESC GASHSKKFGG VIGTTKIFDK PRRRKRQRHA IAKMHCKRVK NEDSSRETPG 1500
SEGELMTHRM AASLKEAVEE GIENDHGMPA SKKLQGERGG GAALKENVCQ NCEKLGELLL 1560
CEAQCCGAFH LECLGLTEMP RGKFICNECR TGIHTCFVCK QSGEDVKRCL LPLCGKFYHE 1620
ECVQKYPPTV MQNKGFRCSL HICITCHAAN PASVSASKGR LMRCVRCPVA YHANDFCLAA 1680
GSKILASNSI ICPNHFTPRR GCRNHEHVNV SWCFVCSEGG SLLCCDSCPA AFHRECLNID 1740
IPEGNWYCND CKAGKKPHYR EIVWVKVGRY RWWPAEICHP RAVPSNIDKM RHDVGEFPVL 1800
FFGSNDYLWT HQARVFPYME GDVSSKDKMG KGVDGTYKKA LQEAAARFEE LKAQKELRQL 1860
QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA DLSEIPRCNC KATDDNPCGI DSECINRMLL 1920
YECHPTVCPA GGRCQNQCFT KRQYPEVEIF RTLQRGWGLR TKTDIKKGEF VNEYVGELID 1980
EEECRARIRY AQEHDITNFY MLTLDKDRII DAGPKGNYAR FMNHCCQPNC ETQKWSVNGD 2040
TRVGLFALSD IKAGTELTFN YNLECLGNGK TVCKCGAPNC SGFLGVRPKN QPIATEEKSK 2100
KFKKKQQGKR RSQGEITKER EDECFSCGDA GQLVSCKKPG CPKVYHADCL NLTKRPAGKW 2160
ECPWHQCDIC GKEAASFCEM CPSSFCKQHR EGMLFISKLD GRLSCTEHDP CGPNPLEPGE 2220
IREYVPPPVP LTPGPNTHVA EQQSSGVAAQ GPKMLDKPPA EANQTLSLSK KTLAGTCQRP 2280
PPPERPPDRT DSRPQPVERV RTLAGSGAKP QSLVSSQKSL DRPSIVGGPR TQLSDKPSPV 2340
TGPSSSPSVR SQPLERPLGA ADPRLDKSIG AASPRPQSLE KTPVPTVLRL PPSDRLLVTS 2400
SPKPQTSDRS PDKSHTSLTQ RFPPPDKVLS AVVQTLVAKE KALRPVDQNT QSKNRAALVM 2460
DLIDLTSRQK ERAASPHEAT PQADEKMPVL ESSSWLASKG LGQMSRAVER GSVSEPVLQP 2520
PGKAVAPLEH PWQAVKSLTQ ARLLSPSPAK AFLYESATQA AGRAPAGAEQ IPGPPSQALG 2580
LVKQVKQMAG GHQVPGLVAK SGQSFRPLGK APSSLSTEEK KLATTEQSPW ALGKTSPGPG 2640
LWPMLAGHTL ASSCWSSGST QTLAQTCWSL GRGQDPKPEQ NTFPTLNQAP SSHKYAESEQ 2700
K 2701
Nucleotide Sequence
(Fasta)
ATGGATCAGA CCTCTGAACT ACCCAGAAGA AATTGTCTGC TGCCTTTTTC CAATCCAGTC 60
AATTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CACTTTTTCT 120
GAACCACTTA ATGAGTGTAC TATGCAGTTA TCGACTGCCA GTGGAACATC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTGTTACATT CCACTGCGGA AACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAGTATTT AAATGGGTCT GCTGATGGAT CAGAATCCTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGTCGCCA ATTGTTTGCA CTTCCTTGAG TCCTGGTGGT 360
CCAACAGCAC TTGCTATGAA ACAGGAACCC TCTTGTAATA ACTCCCCTGA ACTCCAGGTA 420
AAAGTAACAA AGACTGTCAA GAATGGCTTT CTGCACTTTG AGAATTTTAC TTGTGTGGAC 480
GATGCAGATG TAGATTCTGA AATGGACCCA GAACAGCCAG TCACAGAGGA TGAGAGTATA 540
GAGGAGATCT TTGAGGAAAC TCAGACCAAT GCCACCTGCA ATTATGAGCC TAAATCAGAG 600
AATGGTGTAG ACGTGGCCAT GGGAAATGAA CAAGACAGCA CACCAGAGAG TAGACATGGT 660
GCAGTCAAAT CGCCATTCTT GCCATTAGCT CCTCAAACTG AAACACAGAA AAATAAGCAA 720
AGAAATGAAG TGGACGGCAG CAATGAAAAA GCATCCCTTC TCCCAGCCCC CTTTTCACTA 780
GGAGATACAA ACGTTACCAT AGAAGAGCAA TTAAACTCAG TAAATTTATC TTTTCAGGAT 840
GATCCAGACT CCAGTACCAG TACATTAGGA AACATGCTAG AATTACCTGG AACTTCATCA 900
TCATCTACTT CCCAGGAATT ACCATTTTGT CAACCCAAGA AAAAGTCAAC ACCATTGAAG 960
TATGAAGTTG GAGATCTCAT CTGGGCAAAA TTCAAGAGAC GCCCATGGTG GCCCTGCAGG 1020
ATTTGTTCTG ATCCGTTGAT TAACACACAC TCAAAAATGA AAGTTTCGAA CCGGAGACCT 1080
TATCGACAGT ACTATGTGGA GGCTTTTGGA GACCCTTCTG AGAGAGCCTG GGTGGCTGGA 1140
AAAGCAATTG TCATGTTCGA AGGCAGACAT CAGTTTGAAG AGCTACCTGT CCTTAGGAGA 1200
AGAGGGAAGC AAAAAGAAAA AGGATATAGG CATAAGGTTC CTCAGAAAAT TTTGAGTAAA 1260
TGGGAAGCCA GTGTTGGTCT TGCTGAACAG TATGATGTTC CCAAAGGGTC AAAGAACCGA 1320
AAATGTGTCA GCAGTTCAAT CAAATTGGAC AGTGAAGAGG ATATGCCATT TGAGGACTGT 1380
ACAAATGATC CTGAGTCAGA ACATGACCTG TTGCTTAATG GCTGCTTGAA ATCTCTGGCT 1440
TTTGACTCTG AACATTCTGC AGATGAGAAG GAGAAGCCTT GTGCTAAGTC TCGAGTCAGA 1500
AAGAGCACTG ATAATCCAAA AAGGACTTGT GTGAAAAAGG GCCACATGCA GTTTGAAACA 1560
AATAAGGAAG AACGGAGGGG AAAGATTCCA GAGAACCTTG GCATAAACTT TATTTCTGGG 1620
GATGTATCTG ACAAGCAGGC CTCTAATGAA CTTTCCAGGA TAGCAAACAG CCTCACAGGG 1680
GCCAGCACTG CTCCAGGAAG TTTTCTGTTT TCTTCTTGTG CAAAAAACCC TGCAAAGAAA 1740
GAATTTGAGA CTTCAAATTG TGACTCTTTA TTGGGCTTGT CTGAGGGTGC CTTGATTTCT 1800
AAACGTTCTG GGGAGAAGAA GAAACTTCAA CGAGGTCTGG TGTGTAGTTC AAAGGTACAG 1860
CTCTGCTATA TTAAAGCAGG TGATGAAGAA AAACGAAGTG ATTCCATTAG TATTTGTACC 1920
ACTTCTGATG ATGGAAGCAG TGATCTGGAT CCCATAGATC ACAGTTCAGA GTCTGATAAC 1980
AGTGTCCTTG AAATTACAGA TGCTTTTGAT AGAACAGAGA ATATGTTGTC CTTGCAGAAA 2040
AATGAAAAGG TAAAGTATTC CAGGTATCCT GCCACAAACA CTAAGGTAAA AGCAAAGCAG 2100
AAGTCCCTCA TTACTAGCTC ACTTACAGAC CACTTAACAG ATTGTACTAA GACAGCAGAG 2160
TCTGGAACTG AGACATCTCA GGTTAACCTC TCTGATTTTA AAGTGTCCAC TCTTGTTCGA 2220
AAACCCCAAA TGGATTTTAG AAATGATGGT TTCTCTACAA AATTCAACAC CCCATCAAGC 2280
ATTTGCAGTG AGAACTCACT AATAAAGGGT GGGGCTACAA ATCAAACTCT GTTACACTCG 2340
AAAAGCAAAC AGCCCAAGAT CCGAAGTATA AAGTGCAAAC ATAAAGAAAA TCCAGTTGTA 2400
GAACCTCCAG TTACAAATGA AGACTGCAGT TTGAAATGCT GTTCTTCTGA TACCAAAGGT 2460
TCTCCTTTGG CCAGCATTTC CAAAAGTGGG AAAGGGGATG GGCTAAAACT ACTGAGCAAC 2520
ATGCATGAGA AAACCAGAGA TTCGAATGAC ATAGAAACAG CAGTGGTGAA ACACGTTCTG 2580
TCTGAGTTGA AGGAACTCTC TTATAGATCC TTAAGTGAAG ATGTCAGTGA ATCTGGAACA 2640
TCAAAGCCAT CAAAACCATT ACTTTTTTCT TCTGCCTCTA GTCAGAATCA TATACCTATT 2700
GAACCAGACT ATAAATTCAG TACCTTGCTA ATGATGTTGA AAGATATGCA TGATAGTAAG 2760
ACCAAGGAGC AACGGTTAAT GACTGCTCAA AACTTGGTCT CCTATCGGAG CCCTGGTCTT 2820
GGAGACTGTT CTACCAGTAG TCCTCTAGGG ACTTCTAAGG TCTTAGTTTC AGGAGCCTCC 2880
AATCATAATT CAGAAAAAAA TGGAGATGGC ACTCAGGATC CAGTCTATCC TAGCCCCAGT 2940
GGGGGTGACT CTGCACTGTC TGGGGAGTTG TCTGCCTCCC TACCTGGCTT AGTGTCTGAC 3000
AGAAGAGACC TCCCTGCTTC TGGCAAGAGT CGTTCAAACT GTGTTACTAG ACGCAACTGT 3060
GGGCGATCAA AGCCATCCAA ATTGCAAGAT GCTTTTTCAG GTCAGATGGG AAAGAACACA 3120
GCGAACCATA GAGCCTTAAA GACAGAGCGC AAAAGAAAAT TGAACCAGGT TCCAGCTGTG 3180
ATTCCAGAGG CTGCACTGCC AGGAGACAGA CAGAATGCAG CCTCAGAGAA TGGCTCCTTG 3240
GGAGGTGGAG CAGAAGACCC TGCTAAAGAA GAACCCCTTC AATTAATGGG CCATTTAACA 3300
AGTGAAGACA GTGCCCATTT TTCTGATGTT CATTTTGACA ACAAAGTCAA CCAGTCTGAG 3360
CCTGATAAAA CTGAAAAAGG CTCCATCTTT GAAAACAGAA AAGGTCCAGA GCTGGACTCT 3420
GAAATGAACA GTGAAAATGA TGAACCCAGT GGTGTTAATC AAGTAGTACC TAAAAAGCGG 3480
TGGCAGCGTT TAAACCAAAG GCGCACTAAA CCTCGTAAGC GCACTAACCG ATTTAGGGAG 3540
AAAGAAAACT CTGAGGGTTC CTTTGGGGTC TCGCTTTCTG CTGACTCTGT AAAGAAGGGG 3600
GGTGAGTTCC CAGAGCATAG ACCTTCTCCT TCAACAAACA TACTGGAGGA TGTGCTGACA 3660
GATCCAAATC ACACCGGCCA CTTAGATTCA GTTGGGCCAC TTCTGAATGT TTGTGATAAA 3720
TCCGGTGCCA ACACTGAGGA GATGGAAAAG GAGCCAGGAA TTCCCACTTT GACTCCCCAG 3780
CCTGAGCTTC CTGAACCAGC TGTGCGGTCA GAGAAGAAGA GACTTAGGAA GCCAAGCAAG 3840
TGGCTTCTGG AATATACAGA AGAATATGAT CAGATATTTG CTCCTAAGAA AAAACAAAAG 3900
AAGGTACAGG AACAGGTACA CAAGGTAAGT TCCCGCTGTG AAGAAGAAAA CCTTTTAGCT 3960
CGAAGTCGAT CTAATGCTCA GAACAAGCAG GTGGATGAGA ATTCATTGAT TTCAACCAAA 4020
GAAGAGCCTC CAGTTCTTGA AAGGGAGGCT CCGTTTTTGG AAGGGCCCTT GTCTCACTCA 4080
GAACTTGGAG GTGGACATGC TGAGTTGCCA CAGTTGACTT TGTCTGTGCC TGTGGCTCTG 4140
GAAGTCTCTC CACGGCCTCC CCTTAAATCT GAGGAATTGC TAGTTAAAAC ACCAGGAAAT 4200
TATGAAAGTA AGCGGCAGAG AAAACCAACT AAGAAACTTC TTGAATCAAA TGATTTAGAC 4260
CCTGGATTTA TGCCCAAGAA AGGGGATATT GGCCTTTCTA AAAAGTGTTA TGACGCTGGT 4320
CACTTGGAGA ATAACATTGC TGAATCATGT GGTGCATCTC ATTCTAAAAA GTTTGGTGGA 4380
GTCATAGGCA CTACCAAGAT ATTTGATAAA CCAAGAAGGC GAAAACGACA AAGGCATGCT 4440
ATAGCTAAGA TGCATTGTAA AAGAGTGAAA AATGAAGACT CATCAAGAGA AACTCCAGGC 4500
TCAGAGGGGG AACTGATGAC ACACAGGATG GCTGCAAGTC TCAAGGAGGC TGTTGAAGAG 4560
GGCATAGAAA ACGACCATGG GATGCCTGCA TCTAAAAAAC TGCAGGGTGA ACGAGGTGGA 4620
GGAGCTGCCC TCAAGGAGAA TGTTTGTCAG AACTGTGAGA AACTGGGTGA GCTGCTGTTA 4680
TGTGAGGCTC AGTGCTGTGG GGCGTTCCAC CTGGAGTGCC TTGGATTAAC TGAGATGCCC 4740
AGAGGAAAAT TTATCTGCAA TGAATGTCGC ACAGGAATCC ATACCTGTTT TGTATGTAAA 4800
CAGAGTGGGG AAGATGTTAA AAGGTGCCTT CTGCCCTTGT GTGGAAAGTT TTATCATGAA 4860
GAGTGTGTCC AGAAGTACCC ACCCACTGTC ATGCAGAACA AGGGCTTTCG GTGCTCCCTC 4920
CACATCTGTA TAACCTGCCA TGCTGCTAAT CCAGCCAGTG TTTCTGCATC TAAAGGTCGT 4980
CTGATGCGAT GTGTCCGCTG CCCTGTGGCA TACCATGCCA ATGACTTTTG TCTGGCTGCT 5040
GGGTCAAAAA TACTTGCTTC TAATAGTATC ATCTGCCCAA ATCACTTTAC ACCCAGGCGA 5100
GGCTGCCGAA ATCATGAGCA TGTTAACGTT AGCTGGTGTT TTGTATGCTC GGAAGGAGGC 5160
AGCCTTCTGT GCTGTGATTC TTGCCCTGCT GCTTTTCATC GGGAATGCCT GAACATTGAT 5220
ATCCCTGAAG GAAACTGGTA TTGCAATGAC TGTAAGGCAG GCAAAAAACC ACATTACAGG 5280
GAAATTGTCT GGGTAAAGGT TGGACGATAC AGGTGGTGGC CAGCTGAGAT CTGCCATCCT 5340
CGAGCTGTAC CTTCAAACAT TGATAAGATG AGACATGATG TGGGAGAGTT CCCTGTGCTC 5400
TTCTTTGGGT CTAATGACTA TCTCTGGACT CACCAGGCCA GAGTCTTTCC CTACATGGAG 5460
GGGGATGTAA GCAGCAAAGA TAAGATGGGC AAAGGAGTGG ATGGGACATA TAAAAAAGCT 5520
CTTCAGGAAG CTGCAGCAAG GTTTGAGGAA TTAAAGGCCC AAAAAGAACT AAGACAGCTG 5580
CAGGAAGACC GAAAGAACGA TAAGAAACCA CCGCCTTATA AACATATAAA GGTGAACCGT 5640
CCTATTGGCC GGGTACAGAT CTTCACTGCA GACTTGTCTG AAATTCCCCG TTGCAACTGT 5700
AAAGCTACTG ATGATAACCC CTGTGGGATA GACTCTGAGT GCATCAACCG CATGCTGCTT 5760
TATGAGTGCC ACCCCACAGT ATGTCCTGCC GGAGGACGCT GCCAAAACCA GTGCTTTACT 5820
AAGCGCCAGT ACCCAGAGGT GGAGATTTTT CGCACATTAC AGAGGGGCTG GGGTCTTCGA 5880
ACAAAAACCG ATATTAAAAA GGGTGAATTT GTGAATGAGT ATGTGGGTGA GCTAATAGAT 5940
GAAGAAGAAT GCAGAGCTCG AATCCGTTAT GCCCAAGAAC ACGATATCAC TAATTTCTAT 6000
ATGCTCACCC TTGACAAAGA TCGGATCATT GATGCTGGTC CCAAAGGAAA CTATGCTCGG 6060
TTCATGAATC ACTGCTGCCA GCCCAACTGT GAAACACAGA AGTGGTCTGT GAATGGAGAC 6120
ACCCGTGTTG GCCTTTTTGC CCTGAGTGAC ATTAAAGCAG GCACTGAACT TACCTTCAAC 6180
TACAACCTAG AATGTCTTGG GAATGGAAAG ACTGTTTGCA AATGTGGAGC CCCAAACTGC 6240
AGCGGCTTTT TGGGTGTAAG GCCAAAGAAT CAACCCATTG CCACAGAAGA AAAGTCAAAG 6300
AAATTCAAGA AGAAGCAACA GGGGAAGCGA AGGAGCCAGG GTGAAATCAC AAAGGAGCGA 6360
GAGGATGAGT GTTTCAGCTG TGGGGATGCT GGCCAGCTCG TCTCCTGTAA GAAGCCAGGC 6420
TGCCCAAAAG TTTACCATGC AGACTGTCTC AATCTAACCA AGCGACCAGC AGGGAAATGG 6480
GAGTGTCCTT GGCATCAGTG TGACATCTGT GGGAAGGAAG CAGCCTCCTT CTGCGAGATG 6540
TGTCCTAGCT CCTTTTGCAA GCAGCATCGG GAAGGGATGC TCTTCATCTC CAAACTGGAT 6600
GGACGTCTGT CTTGTACTGA GCACGACCCC TGTGGGCCCA ACCCTTTGGA ACCTGGGGAG 6660
ATCCGTGAGT ATGTGCCTCC CCCGGTACCA CTGACTCCAG GCCCAAACAC TCACGTGGCA 6720
GAGCAACAAT CATCAGGAGT AGCTGCTCAG GGGCCCAAGA TGTTGGACAA GCCACCTGCT 6780
GAAGCCAACC AGACGTTGTC ACTGTCCAAA AAAACTCTGG CAGGGACTTG TCAGAGGCCA 6840
CCGCCGCCTG AAAGACCTCC TGACAGAACT GACTCCAGGC CCCAGCCTGT AGAAAGGGTC 6900
AGGACCCTTG CTGGGTCAGG GGCCAAACCC CAGTCCTTGG TATCCAGCCA GAAGTCATTG 6960
GACAGGCCAT CTATAGTGGG AGGACCAAGA ACCCAACTAT CTGACAAACC CTCTCCAGTG 7020
ACTGGCCCAA GCTCTTCACC CTCAGTCAGG TCCCAACCAC TGGAAAGACC TCTAGGGGCA 7080
GCTGACCCAA GGCTGGATAA ATCCATAGGT GCTGCCAGCC CAAGGCCCCA GTCACTGGAG 7140
AAAACCCCAG TCCCTACTGT CCTGAGACTC CCACCATCAG ACAGACTGCT CGTCACCAGC 7200
AGCCCCAAAC CCCAGACTTC AGACAGGTCT CCAGACAAAT CCCATACTTC TTTGACCCAG 7260
AGATTCCCAC CTCCTGACAA AGTGCTATCA GCTGTGGTCC AGACACTGGT AGCTAAAGAA 7320
AAAGCACTGA GGCCTGTGGA CCAGAATACT CAGTCAAAAA ACAGAGCTGC TTTGGTGATG 7380
GATCTTATAG ACCTAACTTC TCGCCAGAAA GAACGGGCAG CTTCTCCTCA TGAGGCCACA 7440
CCACAAGCTG ATGAGAAGAT GCCAGTGTTG GAGTCCAGCT CATGGCTTGC CAGCAAAGGT 7500
CTGGGGCAGA TGTCTCGAGC TGTGGAGAGA GGCAGTGTGT CAGAGCCTGT CCTTCAGCCT 7560
CCTGGAAAAG CCGTGGCCCC CTTGGAGCAC CCCTGGCAAG CTGTTAAATC ACTCACCCAA 7620
GCCAGACTTC TTTCTCCATC CCCTGCCAAG GCTTTTTTAT ATGAGTCAGC AACTCAGGCT 7680
GCAGGAAGAG CACCTGCAGG AGCTGAGCAG ATCCCAGGGC CTCCCAGCCA AGCACTGGGC 7740
CTGGTGAAGC AGGTGAAACA GATGGCTGGA GGCCATCAAG TACCTGGACT TGTTGCCAAG 7800
AGTGGGCAGT CCTTCAGGCC TCTTGGGAAG GCCCCATCCT CCCTCTCCAC TGAAGAGAAG 7860
AAGTTGGCAA CCACAGAGCA GAGTCCCTGG GCCTTAGGAA AGACCTCACC AGGGCCAGGG 7920
CTCTGGCCCA TGTTGGCTGG ACACACACTG GCATCATCTT GCTGGTCCTC CGGGAGCACA 7980
CAGACATTGG CACAGACTTG CTGGTCTCTT GGAAGAGGGC AAGACCCTAA ACCAGAGCAA 8040
AACACATTTC CAACTCTTAA CCAGGCGCCT TCCAGTCACA AGTATGCAGA GTCAGAACAG 8100
AAATAA 8107
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 92 0.0 4858
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 92 0.0 4846
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 92 0.0 4831
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 92 0.0 4790
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 91 0.0 4775
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 90 0.0 4677
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 89 0.0 4653
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 89 0.0 4632
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 88 0.0 4610
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 87 0.0 4553
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 87 0.0 4543
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 90 0.0 4237
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 91 0.0 4231
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 82 0.0 4220
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 90 0.0 4183
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 88 0.0 4131
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 88 0.0 4125
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 88 0.0 4114
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 87 0.0 4090
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 88 0.0 4012
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 87 0.0 3967
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 87 0.0 3923
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 88 0.0 3840
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 88 0.0 3837
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 88 0.0 3755
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 82 0.0 3734
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3532
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3127
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 90 0.0 2757
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 77 0.0 2732
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 87 0.0 2625
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 57 0.0 2283
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 58 0.0 2281
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 56 0.0 2273
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 82 0.0 2245
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 57 0.0 2161
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 60 0.0 2097
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 80 0.0 1867
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 65 0.0 1742
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 93 0.0 1699
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 58 0.0 1503
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 54 0.0 1481
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 75 0.0 1373
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 80 0.0 1311
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 83 0.0 1303
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1281
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1248
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 62 0.0 1222
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1205
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1140
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 84 0.0 1089
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 82 0.0 1079
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 78 0.0 1078
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 76 0.0 1051
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 64 0.0 1043
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 67 0.0 997
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 972
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 959
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 64 0.0 953
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 55 0.0 941
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 64 0.0 933
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 82 0.0 924
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 917
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 651
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 3e-121 435
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 2e-55 217
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 9e-51 201
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 2e-50 200
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 3e-50 199
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 42 8e-50 198
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 42 2e-49 196
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 1e-48 194
WERAM-Viv-0116 VIT_18s0072g00220.t01 Vitis vinifera 47 7e-48 191
WERAM-Sol-0089 Solyc07g008460.2.1 Solanum lycopersicum 41 8e-48 191
WERAM-Sot-0073 PGSC0003DMT400059166 Solanum tuberosum 42 1e-47 191
Created Date 25-Jun-2016