WERAM Information


Tag Content
WERAM ID WERAM-Cap-0095
Ensembl Protein ID ENSCPOP00000007137.2
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSCPOG00000007928.3 ENSCPOT00000008003.2 ENSCPOP00000007137.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.20e-52 177 1758 1874
HMT SET1 1.50e-29 102.5 1758 1874
Me_Reader PWWP 1.30e-25 89.2 325 1632
Me_Reader PHD 3.30e-19 68.8 1363 1979
Organism Cavia porcellus
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSCPOP00000007137.2 1758 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 1844
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSCPOP00000007137.2 1845 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 1874
*****************************8 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSCPOP00000007137.2 1758 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 1842
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSCPOP00000007137.2 1843 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1874
*******************************7 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSCPOP00000007137.2 325 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVANRRPYREYYVEAFGDPSERAWVAGKAIVMFE 389
69********************98766665466789999******************99998775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++V ++++ w ++ +++py+e
ENSCPOP00000007137.2 1576 REIVWVKVGRYRWWPAEICHPRAIPSNIDKMRHDVGEFPV-----SNDYLWTHQARVFPYME 1632
589*************************777777777777.....799************87 PP

  Me_Reader PHD

               PHD.txt    6 vCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
vC+++ e+ e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSCPOP00000007137.2 1363 VCQQNCEKVGELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1407
465555554559****99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSCPOP00000007137.2 1410 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1457
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSCPOP00000007137.2 1458 ICITCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1511
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSCPOP00000007137.2 1528 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1569
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSCPOP00000007137.2 1937 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 1979
9999..33442..9********************..*****.*****886 PP

Protein Sequence
MDQTCELSRR NCLLRFSNPV NLDAPEDKDS PFGNGQSNFS EPLNGCTMQL STASGTPQNA 60
YGQDSPSCYI PLRRLQDLAS MINVEYLNGS ADGSESFQDP EKSDSRAQSP IVCTSLSPGG 120
PTALAMKQEP SCNNSPELQL KVTKTIKNGF LHFENFTCVD DADVDSEMDP EQPVKEDESI 180
EEIFEDIQTN ATCKNESKSE NDVEVAMGSE QDSTPESRHG AVKSPFLPLA PQTETQKNKQ 240
RDEVDGSNEK AALLPVPFSL GDTNITIEEQ LNSINLSFQD DPDSSTSTLG NMLELPGTSS 300
SSTSQELPFL KCQPKKKSAP LKYEVGDLIW AKFKRRPWWP CRICSDPLIN THSKMKVANR 360
RPYREYYVEA FGDPSERAWV AGKAIVMFEG RHQFEELPVL RKRGKQKEKG YRHKVPQKIL 420
SKWEASVGLA EQYDVPKGSK NRKCVTSSIK LDSEEDMPFE DCTNDPESEH DLLLNGCLKS 480
LAFDSEHSAD EKEKPSAKSR VRKSSDNLKR TSVKKGLIQF ETHKEERRGK TPENLNLNFI 540
SGDVSDNQAS NELSRIANSL SGSNTAPSSF LFSSCGKNTE KKEFETSSCD ALLGLSEGAL 600
ISKRSVEKKR GLVCNSKVQL CYVGVGDEEK RSDSISICTT SDDGSSDLDP IEHSSESDNS 660
VLEVTDTFAR TENVLPVQKN EKMKYSRYPD TNTRVKAKTK PFIANSHTEH LLDRKIAEPE 720
SETSQVNLSD LKVSTFVQKS HSDFRNDTLS PKFNTPSISS ENSLIKSGAT HQVLLNSKSK 780
QPKFRNTKCK HKENPVMVES PVINEDCSLK CCSSDPKGSP LTSISKSGKV DGLKLLNNMH 840
EKTRDSSDIE TAVVKHVLSE LKELSYRSLN DDVSDSGTSK PSKPLLFSST SSQNHIPIEP 900
DYKFSTLLMM LKDMHDSKTK ERLMTGQNLV SFRSPAREIV LPVPTFENSK GPMLDSEMNS 960
ENDESGVNQV VPKKRWQRVN QRRAKTGKRT NRFREKENSE SVFGILPPGD HVEKGLSDFS 1020
ECRAPPSTDI LEDSLIAPNH VSHLDSVGPQ LNICGKSSTS MEHMEKEPGI PNLTPQAELP 1080
EPAVRSEKKR LRKPSKWLLE YTEEYDQIFA PKKKHKKVQE QAHKVSSRCE EESRLSRCQS 1140
SSQNKQVDEN SLISTKEEPP VLEREAPFLE GPLAHAELGV GHAELPQLTL SVPMAPEVSP 1200
RPALESEDLI VKTPGNYESK RQRKPTKKLL ESNDLDPGFM PKKGDIGFSK KCYEGGHLEN 1260
GITESCTPSH LKEFGVIGTT KIFDKPRKRK RQRHITAKVQ CKKLKNDASA KETPGSEGEL 1320
MTHRTAASPK ENIEEGIEHD ASMSSSKKMQ GERGGGAALK ENVCQQNCEK VGELLLCEAQ 1380
CCGAFHLECL GLTEMPRGKF ICNECRTGIH TCFVCKQSGE DVKRCLLPLC GKFYHEECVQ 1440
KYPPTVMQNK GFRCSLHICI TCHAANPASV SASKGRLMRC VRCPVAYHAN DFCLAAGSKI 1500
LASNSIICPN HFTPRRGCRN HEHVNVSWCF VCSEGGSLLC CDSCPAAFHR ECLNIDIPEG 1560
NWYCNDCKAG KKPHYREIVW VKVGRYRWWP AEICHPRAIP SNIDKMRHDV GEFPVSNDYL 1620
WTHQARVFPY MEGDVSSKDK MGKGVDGTYK KALQEAAARF EELKAQKELR QLQEDRKNDK 1680
KPPPYKHIKV NRPIGRVQIF TADLSEIPRC NCKATDENPC GIDSECINRM LLYECHPTVC 1740
PAGGRCQNQC FSKRQYPEVE IFRTLQRGWG LRTKTDIKKG EFVNEYVGEL IDEEECRARI 1800
RYAQEHDITN FYMLTLDKDR IIDAGPKGNY ARFMNHCCQP NCETQKWSVN GDTRVGLFAL 1860
SDIKAGTELT FNYNLECLGN GKTVCKCGAP NCSGFLGVRP KKKQPIATEE KSKKFKKKQQ 1920
GKRRSQGEIT KEREDECFSC GDAGQLVSCK KPGCPKVYHA DCLNLTKRPA GKWECPWHQC 1980
DICGKEAASF CEMCPSSFCK QHREGMLFIS KLDGRLSCTE HDPCGPNPLE PGEIREYVPP 2040
SVPLPPSPGT HLAEQSSGMA AQGPKMSDKP PADASQTLPV SKKALTGTCQ RPLLPERPLE 2100
RSDSSSQLLD RVRDLAGSGT KSQSLLSNQR PLDRPPAVEG PRPQLSDKPS PVTSPSSSPS 2160
MRSQPLERPL GMADPRLDKS IGAASSRPQS LEKTLAPSGL RLPPPERLLI TSSPKPQASD 2220
RPTDKSHPSL SQRLPPPEKV LSAVVQTLVA TEKALRPVDQ NTQSKNRAAL VMDLIDLTPR 2280
QKEQATSHEV TLQADEKMSV LESSSWPAGK GLGHVSRALE KGSMSDPLFQ PPGKNTVHSE 2340
HTWQAVKSLT QARLFPQPPA KAFLYESATQ ASGRAPGGAE QTPGPPSQPP GSVKQVKQMA 2400
GGQHLPGLTA KSGQSFRSLG KTPTSLPTEE KKLPTTEQSH WSLGKASPGT GLWPIVAGQA 2460
LAQSCWSAGS TQTLAQTCWS LGRGQDPKPE QNTLPTLNQA PSSHKCAESE QK 2512
Nucleotide Sequence
ATGGATCAGA CCTGTGAACT ATCTAGAAGA AATTGTCTGC TGCGCTTTTC CAATCCAGTG 60
AATTTAGATG CCCCTGAAGA CAAGGACAGC CCTTTCGGTA ATGGTCAATC CAATTTTTCT 120
GAGCCACTTA ATGGGTGTAC TATGCAGTTA TCGACTGCTA GTGGAACACC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTGTTATATT CCACTGCGGA GACTACAGGA TTTGGCCTCC 240
ATGATCAATG TAGAATATTT AAATGGGTCT GCTGATGGAT CAGAGTCTTT TCAAGACCCT 300
GAAAAAAGTG ATTCAAGAGC TCAGTCGCCA ATTGTTTGCA CTTCCTTGAG TCCTGGTGGT 360
CCAACAGCAC TTGCTATGAA ACAGGAACCC TCTTGTAATA ACTCCCCTGA ACTCCAGTTA 420
AAAGTAACAA AGACTATCAA GAATGGCTTT CTGCACTTTG AGAATTTTAC TTGTGTGGAC 480
GATGCAGATG TAGATTCTGA AATGGACCCA GAACAGCCAG TCAAAGAGGA TGAGAGTATA 540
GAGGAGATCT TTGAGGACAT TCAGACCAAT GCCACCTGCA AAAATGAGTC CAAATCAGAG 600
AATGATGTAG AAGTGGCCAT GGGAAGTGAA CAAGACAGCA CACCAGAGAG TAGACATGGT 660
GCAGTCAAAT CGCCATTCTT GCCATTAGCT CCTCAAACTG AAACACAGAA AAATAAGCAA 720
AGAGATGAAG TGGACGGCAG CAATGAAAAA GCAGCCCTTC TCCCAGTCCC CTTTTCACTA 780
GGAGATACAA ACATTACCAT AGAAGAGCAA TTAAATTCAA TAAATTTATC TTTTCAGGAT 840
GATCCAGACT CCAGTACCAG TACATTAGGA AACATGCTAG AATTACCTGG AACTTCATCA 900
TCATCTACTT CACAGGAATT GCCATTTCTA AAGTGTCAAC CCAAGAAAAA GTCTGCGCCA 960
CTGAAATACG AAGTTGGAGA TCTCATTTGG GCAAAATTCA AGAGACGCCC ATGGTGGCCC 1020
TGCAGGATTT GCTCTGATCC ATTGATTAAC ACGCATTCAA AAATGAAAGT TGCCAACCGG 1080
AGACCCTATC GTGAGTACTA TGTGGAGGCT TTTGGAGACC CTTCTGAAAG AGCCTGGGTG 1140
GCTGGAAAAG CAATCGTCAT GTTTGAAGGC AGGCATCAAT TTGAAGAATT ACCTGTCCTT 1200
AGGAAAAGAG GAAAGCAGAA AGAAAAAGGT TATAGACATA AGGTGCCTCA GAAAATTTTG 1260
AGTAAATGGG AAGCCAGTGT TGGTCTTGCT GAACAATATG ATGTTCCCAA AGGGTCAAAG 1320
AACAGGAAAT GTGTCACCAG TTCAATCAAG TTGGACAGCG AAGAGGATAT GCCATTTGAG 1380
GACTGCACAA ATGATCCTGA ATCAGAACAT GACCTGTTGC TTAATGGCTG CTTGAAATCT 1440
CTGGCTTTTG ACTCTGAACA TTCTGCAGAT GAGAAGGAAA AACCCTCTGC TAAGTCTCGA 1500
GTCAGAAAAA GCTCTGATAA TCTAAAAAGG ACTAGTGTAA AAAAGGGCCT TATACAGTTT 1560
GAAACACATA AAGAGGAACG AAGGGGAAAG ACTCCTGAGA ACCTTAACCT AAATTTTATC 1620
TCTGGGGACG TATCTGATAA TCAGGCTTCT AATGAACTTT CTAGAATAGC AAATAGCCTC 1680
AGTGGGTCTA ACACTGCCCC AAGCAGTTTT CTTTTTTCTT CATGTGGAAA AAACACGGAA 1740
AAGAAAGAAT TTGAGACTTC AAGTTGTGAT GCGTTACTAG GATTATCTGA GGGTGCCTTG 1800
ATCTCTAAAC GTTCTGTAGA GAAAAAGCGA GGTCTGGTTT GTAATTCAAA AGTACAGCTT 1860
TGCTATGTTG GAGTTGGTGA TGAAGAAAAG CGAAGTGATT CTATTAGTAT CTGTACCACT 1920
TCTGATGATG GAAGCAGTGA TCTGGATCCC ATAGAACACA GCTCAGAATC TGATAACAGT 1980
GTCCTTGAGG TAACAGATAC TTTTGCTAGA ACAGAAAACG TGTTACCTGT GCAGAAAAAT 2040
GAAAAGATGA AGTATTCTAG GTATCCTGAC ACAAACACTA GAGTAAAAGC CAAAACGAAG 2100
CCCTTCATTG CTAACTCACA TACAGAACAC TTACTGGATC GTAAGATAGC AGAGCCTGAA 2160
AGTGAAACAT CTCAGGTTAA TCTCTCTGAT CTTAAGGTGT CTACTTTTGT TCAGAAATCC 2220
CATTCAGATT TTAGAAATGA TACTCTGTCT CCAAAATTCA ATACACCCAG CATTTCGAGT 2280
GAGAACTCAC TAATAAAAAG CGGGGCTACA CATCAAGTTC TATTAAATTC AAAAAGCAAA 2340
CAGCCCAAGT TCCGAAATAC AAAGTGCAAA CATAAAGAAA ACCCAGTTAT GGTAGAATCA 2400
CCAGTTATAA ATGAGGACTG CAGTTTGAAA TGCTGCTCTT CTGATCCTAA AGGTTCTCCT 2460
TTGACCAGCA TTTCTAAAAG TGGGAAAGTG GATGGGCTGA AACTACTAAA CAACATGCAT 2520
GAGAAAACCA GGGATTCAAG TGACATAGAA ACTGCAGTGG TGAAACATGT TCTGTCAGAA 2580
TTGAAAGAAC TTTCTTACAG ATCCTTAAAT GATGATGTCA GTGACTCTGG AACATCAAAG 2640
CCATCAAAAC CATTACTGTT TTCTTCTACT TCTAGTCAGA ATCATATACC TATTGAACCA 2700
GACTACAAAT TCAGTACACT GCTAATGATG CTAAAAGATA TGCATGATAG TAAGACCAAG 2760
GAGAGATTGA TGACAGGTCA AAATCTGGTC TCTTTTCGAA GTCCTGCCCG TGAGATTGTT 2820
CTACCAGTAC CTACTTTTGA GAATAGTAAA GGCCCTATGC TGGACTCTGA AATGAACAGT 2880
GAGAACGATG AATCTGGTGT AAATCAAGTA GTGCCTAAAA AGCGGTGGCA GCGTGTAAAC 2940
CAAAGGCGTG CTAAAACTGG AAAGCGCACT AATAGGTTTA GGGAGAAGGA GAATTCTGAG 3000
AGTGTTTTTG GGATCTTGCC TCCTGGTGAC CATGTAGAGA AAGGGCTCAG TGATTTCTCA 3060
GAGTGTAGAG CTCCTCCTTC TACAGATATA CTGGAGGATT CACTAATAGC TCCTAATCAT 3120
GTCAGTCACT TAGATTCAGT TGGGCCACAG TTGAATATTT GTGGTAAATC CAGTACCAGC 3180
ATGGAGCACA TGGAAAAAGA GCCAGGAATT CCCAACTTGA CACCACAGGC TGAGCTCCCT 3240
GAGCCAGCTG TGCGGTCAGA GAAGAAACGC CTTAGGAAAC CAAGCAAGTG GCTTCTGGAA 3300
TATACAGAAG AATATGATCA GATATTTGCG CCGAAGAAAA AGCACAAGAA GGTACAGGAA 3360
CAAGCACATA AGGTAAGTTC CCGCTGTGAA GAGGAAAGCC GTCTGTCCCG CTGTCAATCT 3420
AGTTCTCAGA ACAAGCAGGT GGATGAGAAT TCTTTGATTT CAACCAAAGA AGAGCCTCCA 3480
GTTCTTGAAA GGGAGGCTCC ATTTTTGGAG GGGCCCTTGG CTCATGCAGA ACTTGGAGTT 3540
GGCCATGCTG AGTTGCCACA GCTAACCTTG TCTGTGCCTA TGGCTCCAGA AGTCTCTCCA 3600
AGACCTGCCC TTGAGTCTGA AGATTTGATT GTTAAAACAC CAGGAAACTA TGAAAGTAAA 3660
CGCCAAAGGA AACCAACTAA GAAACTTCTT GAATCCAATG ATTTAGACCC TGGATTTATG 3720
CCCAAGAAGG GGGACATTGG CTTTTCTAAA AAGTGTTATG AAGGTGGTCA TTTGGAGAAT 3780
GGCATTACCG AATCATGTAC TCCATCTCAT TTGAAGGAAT TTGGTGTCAT AGGTACAACC 3840
AAGATATTTG ACAAGCCAAG GAAGCGAAAA CGACAGAGAC ATATTACAGC TAAGGTGCAG 3900
TGTAAAAAAC TGAAAAATGA CGCCTCGGCT AAAGAGACTC CAGGTTCAGA GGGAGAACTG 3960
ATGACTCACA GGACAGCTGC AAGCCCCAAG GAGAATATTG AGGAGGGTAT AGAACACGAT 4020
GCTTCGATGT CTTCGTCTAA AAAAATGCAA GGTGAACGAG GTGGAGGAGC TGCACTCAAG 4080
GAGAATGTTT GTCAGCAGAA CTGTGAGAAA GTGGGTGAGC TGCTCTTATG TGAGGCTCAG 4140
TGCTGTGGGG CTTTCCACCT GGAGTGCCTT GGGTTAACTG AGATGCCAAG AGGAAAATTT 4200
ATCTGCAATG AGTGCCGCAC AGGGATCCAT ACCTGTTTTG TATGTAAGCA GAGTGGGGAA 4260
GATGTTAAAA GGTGCCTTCT ACCCTTGTGT GGAAAGTTTT ACCATGAAGA ATGTGTCCAG 4320
AAGTACCCAC CAACTGTCAT GCAGAATAAG GGCTTCCGGT GTTCCCTCCA CATCTGTATA 4380
ACCTGCCATG CTGCTAATCC AGCCAGTGTT TCAGCATCTA AAGGTCGTCT GATGCGCTGT 4440
GTCCGTTGTC CTGTGGCATA TCATGCAAAT GACTTTTGTC TAGCTGCAGG GTCAAAGATC 4500
CTTGCATCTA ATAGTATCAT CTGTCCTAAT CACTTTACCC CTAGACGGGG CTGCCGAAAT 4560
CATGAGCATG TCAATGTCAG CTGGTGTTTT GTGTGCTCAG AAGGAGGCAG CCTTCTGTGC 4620
TGTGATTCTT GTCCTGCTGC TTTTCATCGT GAATGCCTGA ATATTGATAT CCCTGAAGGA 4680
AATTGGTATT GCAATGACTG TAAGGCAGGC AAAAAGCCAC ACTACAGAGA AATTGTATGG 4740
GTAAAAGTTG GACGATACAG GTGGTGGCCA GCAGAGATCT GCCATCCTCG AGCTATTCCT 4800
TCCAACATTG ATAAAATGAG ACATGATGTA GGCGAGTTCC CTGTATCTAA TGACTATCTA 4860
TGGACTCATC AGGCTCGAGT GTTTCCTTAT ATGGAGGGGG ATGTGAGCAG TAAGGATAAG 4920
ATGGGCAAAG GTGTGGATGG AACATATAAA AAAGCTCTTC AGGAAGCTGC AGCAAGATTT 4980
GAGGAGTTAA AGGCACAGAA AGAGCTAAGA CAGCTGCAGG AAGATAGAAA AAATGACAAG 5040
AAGCCACCAC CTTATAAACA TATAAAGGTG AATCGTCCTA TTGGCCGGGT GCAGATCTTC 5100
ACTGCAGACT TATCAGAAAT CCCCCGCTGC AACTGTAAAG CTACTGATGA AAACCCTTGC 5160
GGAATAGACT CTGAGTGCAT CAACCGCATG CTGCTGTATG AGTGTCACCC CACAGTGTGT 5220
CCTGCTGGAG GCCGCTGCCA GAACCAGTGT TTCAGCAAGC GCCAGTACCC AGAGGTTGAA 5280
ATTTTCCGCA CATTACAGAG GGGCTGGGGT CTACGGACAA AAACAGATAT TAAAAAGGGT 5340
GAGTTTGTGA ATGAGTATGT GGGTGAACTA ATAGATGAAG AAGAATGCAG AGCTCGAATC 5400
CGTTATGCCC AAGAACATGA TATCACTAAT TTCTATATGC TCACTCTAGA TAAAGATCGA 5460
ATCATTGATG CTGGTCCCAA AGGGAACTAT GCTCGCTTTA TGAATCATTG CTGCCAGCCC 5520
AACTGTGAAA CACAGAAGTG GTCTGTGAAT GGAGATACCC GTGTTGGCCT TTTTGCCCTG 5580
AGTGACATTA AAGCAGGTAC TGAACTTACC TTCAATTACA ACCTAGAATG TCTTGGGAAT 5640
GGAAAAACTG TTTGCAAATG TGGAGCTCCA AACTGCAGTG GCTTCTTGGG TGTCAGGCCA 5700
AAGAAAAAAC AACCCATTGC CACAGAAGAA AAGTCAAAAA AATTCAAGAA GAAGCAGCAG 5760
GGGAAGCGCA GAAGTCAGGG TGAGATCACA AAAGAACGAG AGGATGAGTG TTTCAGCTGT 5820
GGGGATGCTG GCCAGTTGGT CTCCTGCAAG AAGCCGGGCT GCCCCAAAGT TTACCATGCG 5880
GACTGTCTCA ATCTGACCAA GCGGCCAGCA GGGAAGTGGG AGTGTCCTTG GCACCAGTGT 5940
GACATCTGTG GGAAGGAAGC AGCCTCCTTC TGTGAGATGT GCCCTAGCTC CTTCTGCAAG 6000
CAACATCGGG AAGGGATGCT CTTCATTTCC AAACTTGATG GGCGTCTCTC TTGTACTGAG 6060
CATGATCCCT GTGGGCCCAA CCCTCTGGAA CCTGGGGAGA TTCGTGAGTA TGTGCCTCCC 6120
TCAGTACCGC TGCCTCCAAG CCCTGGTACT CACCTGGCAG AACAATCATC AGGAATGGCT 6180
GCTCAGGGGC CCAAGATGTC AGATAAGCCA CCTGCTGATG CCAGCCAGAC GTTGCCAGTC 6240
TCCAAAAAAG CTCTGACAGG AACTTGTCAG AGGCCACTGC TACCTGAAAG ACCTCTGGAG 6300
AGAAGTGACT CCAGCTCCCA GCTTTTAGAT AGGGTCAGAG ACCTTGCTGG GTCAGGGACC 6360
AAATCACAAT CGTTGTTATC AAACCAGAGG CCACTGGACA GGCCACCTGC AGTGGAAGGA 6420
CCAAGACCAC AGCTGTCTGA CAAACCCTCT CCAGTGACCA GCCCAAGCTC CTCACCCTCA 6480
ATGAGGTCCC AACCACTGGA AAGACCTCTG GGGATGGCTG ACCCAAGGCT GGATAAATCT 6540
ATAGGTGCTG CCAGCTCAAG GCCCCAGTCA CTGGAGAAAA CCCTAGCCCC CAGTGGTCTG 6600
AGACTTCCAC CACCAGAAAG ACTGCTAATC ACCAGCAGTC CCAAACCCCA GGCTTCTGAC 6660
AGGCCCACAG ACAAATCCCA CCCTTCTTTG TCTCAGAGAC TTCCACCTCC TGAGAAAGTA 6720
CTGTCAGCAG TGGTGCAGAC TCTGGTGGCT ACAGAAAAAG CCCTGAGGCC TGTGGACCAG 6780
AATACTCAGT CAAAAAACAG AGCTGCTTTG GTGATGGATC TCATAGACTT AACTCCTCGC 6840
CAGAAAGAAC AGGCTACTTC TCACGAGGTT ACACTGCAGG CTGATGAGAA GATGTCAGTG 6900
TTGGAGTCGA GCTCATGGCC TGCTGGCAAA GGTCTGGGGC ATGTGTCACG AGCTCTGGAG 6960
AAAGGCAGCA TGTCAGACCC CCTTTTCCAA CCACCTGGGA AAAACACAGT CCATTCAGAG 7020
CACACCTGGC AAGCTGTTAA ATCACTCACC CAGGCCAGAC TTTTTCCTCA GCCACCTGCC 7080
AAGGCTTTTT TATATGAGTC CGCTACTCAG GCTTCAGGAA GAGCTCCTGG AGGAGCTGAG 7140
CAGACTCCAG GTCCTCCCAG CCAACCTCCA GGATCGGTGA AGCAGGTAAA ACAGATGGCT 7200
GGAGGCCAGC ATCTACCTGG ACTTACTGCC AAGAGTGGCC AATCCTTTAG GTCTCTTGGC 7260
AAGACTCCAA CCTCCCTCCC CACTGAAGAA AAGAAGTTGC CAACCACAGA GCAGAGTCAC 7320
TGGTCCCTGG GAAAAGCCTC ACCAGGGACA GGGCTCTGGC CCATAGTGGC TGGACAGGCA 7380
CTGGCACAGT CTTGCTGGTC TGCTGGGAGT ACACAGACAT TGGCACAGAC TTGCTGGTCT 7440
CTTGGAAGAG GGCAAGACCC CAAACCAGAG CAAAATACAC TTCCAACTCT TAATCAGGCT 7500
CCTTCCAGTC ACAAGTGTGC AGAGTCAGAA CAGAAATAA
Sequence Source Ensembl
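The two sequence blocks above are laid out in groups of ten letters with a running position counter at the end of each full line. A minimal Python sketch for stripping that layout, and for checking that the coding sequence length agrees with the protein length, might look like this. The function name `strip_layout` is hypothetical, and only the final line of each block is reproduced here, with the preceding counters (2460 and 7500) taken from the record.

```python
def strip_layout(block: str) -> str:
    """Join the 10-letter groups and drop the trailing position counter."""
    parts = []
    for line in block.splitlines():
        tokens = line.split()
        # A trailing all-digit token is the cumulative position counter.
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]
        parts.extend(tokens)
    return "".join(parts)


# Final lines of the protein and nucleotide blocks, copied from this record.
last_protein_line = "LAQSCWSAGS TQTLAQTCWS LGRGQDPKPE QNTLPTLNQA PSSHKCAESE QK 2512"
last_nucleotide_line = "CCTTCCAGTC ACAAGTGTGC AGAGTCAGAA CAGAAATAA"

# Counters on the preceding lines are 2460 (protein) and 7500 (nucleotide).
protein_len = 2460 + len(strip_layout(last_protein_line))
cds_len = 7500 + len(strip_layout(last_nucleotide_line))

# A complete coding sequence has three bases per residue plus one stop codon.
consistent = cds_len == 3 * (protein_len + 1)
```

Here the protein has 2512 residues and the nucleotide sequence has 7539 bases, which equals 3 × (2512 + 1); the sequence ends in TAA, a stop codon in the standard genetic code.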
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 89 0.0 2770
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 89 0.0 2734
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 89 0.0 2723
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 88 0.0 2720
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 89 0.0 2720
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 89 0.0 2719
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 89 0.0 2709
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 89 0.0 2709
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 89 0.0 2707
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 88 0.0 2706
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 88 0.0 2705
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 88 0.0 2692
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 88 0.0 2685
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 87 0.0 2680
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 88 0.0 2680
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 87 0.0 2678
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 88 0.0 2674
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 88 0.0 2667
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 87 0.0 2644
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 86 0.0 2631
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 87 0.0 2610
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 85 0.0 2607
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 84 0.0 2576
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 84 0.0 2565
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 89 0.0 2443
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 83 0.0 2345
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 89 0.0 2286
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 83 0.0 2247
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 71 0.0 2122
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 70 0.0 2071
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 89 0.0 1800
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 63 0.0 1689
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 64 0.0 1687
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 64 0.0 1674
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 70 0.0 1632
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 72 0.0 1587
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 87 0.0 1587
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 86 0.0 1565
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 71 0.0 1559
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 80 0.0 1531
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 59 0.0 1494
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 66 0.0 1396
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 82 0.0 1352
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 84 0.0 1338
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 89 0.0 1263
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1230
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1187
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 76 0.0 1186
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 76 0.0 1120
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 78 0.0 1066
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 82 0.0 1046
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 63 0.0 1030
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 75 0.0 1030
WERAM-Tub-0031 ENSTBEP00000004056.1 Tupaia belangeri 78 0.0 1009
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 65 0.0 973
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 59 0.0 947
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 943
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 64 0.0 933
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 63 0.0 919
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 60 0.0 912
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 905
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 78 0.0 891
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 80 0.0 867
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 637
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 36 1e-116 420
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 1e-54 214
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 7e-50 198
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 2e-49 196
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 3e-49 196
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 4e-49 195
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 5e-49 195
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 42 7e-49 194
Created Date 25-Jun-2016
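
Because species names in the Orthology table span two or three words, its rows cannot be split on a fixed number of columns. One possible approach, sketched below in Python, is to take the two identifiers from the left and the three numeric columns from the right, and treat whatever remains as the species name. The names `Ortholog` and `parse_row` are hypothetical, the sample rows are copied from the table, and the sketch assumes that identifiers contain no spaces. The record does not state the unit of the Identity column, so it is kept as a plain integer.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    evalue: float
    score: float


def parse_row(line: str) -> Ortholog:
    """Parse one Orthology row: 2 IDs on the left, 3 numbers on the right."""
    tokens = line.split()
    weram_id, protein_id = tokens[0], tokens[1]
    identity, evalue, score = tokens[-3:]
    species = " ".join(tokens[2:-3])  # everything in between
    return Ortholog(
        weram_id, protein_id, species, int(identity), float(evalue), float(score)
    )


# Three rows copied from the Orthology table of this record.
SAMPLE = [
    "WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 89 0.0 2723",
    "WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 88 0.0 2705",
    "WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 36 1e-116 420",
]

rows = [parse_row(line) for line in SAMPLE]
# Keep only the closer orthologs, using an arbitrary illustrative cutoff.
close = [r.species for r in rows if r.identity >= 80]
```

With these three rows, `close` contains Homo sapiens and Mustela putorius furo, and the three-word species name is kept intact.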