WERAM Information


Tag Content
WERAM ID WERAM-Ict-0147
Ensembl Protein ID ENSSTOP00000023043.1
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSSTOG00000020045.1 ENSSTOT00000025930.1 ENSSTOP00000023043.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.20e-52 177 1653 1769
Me_Reader PWWP 8.00e-33 112.4 32 1527
HMT SET1 1.60e-29 102.4 1653 1769
Me_Reader PHD 1.30e-19 70.1 1254 1873
Organism Ictidomys tridecemlineatus
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSSTOP00000023043.1 1653 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 1739
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSSTOP00000023043.1 1740 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 1769
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt  1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSSTOP00000023043.1 32 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVANRRPYREYYVEAFGDPSERAWVAGKAIVMFE 96
69********************98766665466789999******************99998775 PP
PWWP.txt 11 gYpwWPalvisppleakklktqeaee 36
Y+ +Pa +++ ++k+l +++ +
ENSSTOP00000023043.1 394 KYSRYPATNTRVKAKQKSL-INSHTD 418
499****999999999998.555554 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSSTOP00000023043.1 1467 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1527
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSSTOP00000023043.1 1653 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 1737
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSSTOP00000023043.1 1738 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1769
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSSTOP00000023043.1 1254 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1298
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSSTOP00000023043.1 1301 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1348
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSSTOP00000023043.1 1349 ICITCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1402
8****86666644456677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSSTOP00000023043.1 1419 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1460
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSSTOP00000023043.1 1831 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 1873
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MLELPGTSSS STSQELPFCQ PKKKSTPLKY EVGDLIWAKF KRRPWWPCRI CSDPLINTHS 60
KMKVANRRPY REYYVEAFGD PSERAWVAGK AIVMFEGRHQ FEELPVLRKR GKQKEKGYRH 120
KVPQKILSKW EASVGLAEQY DVPKGSKNRK CVTSSGKLDS EEDMPFEDCT NDPESEHDLL 180
LNGCLKSLAF DSEHSADEKE KPCAKSRARK SSDNPKRTSV KKSIMQFEAH KEERRGKIPE 240
NLGLNFISGD VSDKQASNEL SRIANSLTGS STAPGSFLFS SCGKNSAKKE FETSNCDSLL 300
GLSEGALISK HSGEKKKLQR GLVCSSKVQL CYIGAGDEEK RSDSISICTT SDDGSSDLDP 360
VEHSSESDNS VLEITDTFDR RENLLSMQKN EKMKYSRYPA TNTRVKAKQK SLINSHTDHL 420
VDCTKTVDPG TETSQVNLTD IKVSTFVHKT QSEFRNDGLS PKFNTPSSIS SENTLIKGGT 480
TNQALLHLKS KQPKFRSIKC KHKENPVIVE PQGTNEDCGL KCCSSDTKGS PLASISKTGK 540
VDGLKLLNNM HEKTRDSSDI ETAVVKHVLS ELKELSYRSL GDDVSDSGTS KPSKPLLFSS 600
ASSQNHIPIE PDYKFSTLLM MLKDMHDSKT KEQRLMTAQN LVSYRSPVRG DCSTTSSPVA 660
ASKVLVSGGS THNSEKNGEG TQDSAHPTSS GDSVLSGELS ASLPGIVSDR GDLPASGKSR 720
PNCVTRRNCA RSKPSSKFRD VFSAQVAKNT VNRKALKTER KRKLNQNRLP AVTLETALQG 780
DKESGCSVNG PSKGGVEDLG KEETLQLTGH LKSEDAQFSD VHFDKQSEPD KILEKGSPFE 840
NRKGPELDSE MNSENDELNG VNQVVPKKRW QRLNQRRTKP GKRTNRFKEK ENSEDAFGVL 900
LPSDPVQKGR DDFPENRTPT STNILEDSLT DPNHASHLDS VGPPLNVCDK SSASMEDMEK 960
EPVIPSLTPQ AELPEPAVRS EKKRLRKPSK WLLEYTEEYD QIFAPKKKKK IQEQVHKVSS 1020
RCEEENLVAR CRTSAQNKQV DENSLISTKE EPPVLEREAP FLEGPLAQSE LGGGHAELPQ 1080
LTLSVPVAPE VSLRPALESE ELLVKTPGNY ESKRQRKPTK KLLESNDLDP GFMPKKGDLG 1140
FSKKCYDAGH FQNGISNSCA ASHLKEFVGG TTKIFDKPRK RKRQRHVTTK VQCKKVKNDD 1200
SSKGIPNSEG ELMTHRTDAT PKETVEEGVE HDSGMSASKK MQGERGGGAA LKENVCQNCE 1260
KLGELLLCEA QCCGAFHLEC LGLTEMPRGK FICNECRTGI HTCFVCKQSG EDVKRCLLPL 1320
CGKFYHEECV QKYPPTVMQN KGFRCSLHIC ITCHAANPAS VSASKGRLMR CVRCPVAYHA 1380
NDFCLAAGSK ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH 1440
RECLNIDIPE GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD 1500
VGEFPVLFFG SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA 1560
QKELRQLQED RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE 1620
CINRMLLYEC HPTVCPAGGR CQNQCFSKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE 1680
YVGELIDEEE CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ 1740
KWSVNGDTRV GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNQPI 1800
ATEEKSKKFK KKQQGKRRSQ GEITKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT 1860
KRPAGKWECP WHQCDVCGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP 1920
NPLEPGEIRE YVPPPVPMPP SPSSRLAEQS SEMAAQGPKM SDKPPADAHQ TLPLSKKALA 1980
GTCQRPVLSE RPLERTDSSS QLLDRVRDLA GSGTKSQSLV SSQRPLDRPP AVEGPRPLLS 2040
DKPSPVTSPS SSPSVRSQPL ERPLGTADPR LDKSIGAASP RPHSLEKAPA PTGLRLPPTD 2100
RLLVTSSSPK PPTFDRPPDK SHASLSQRLP PPEKVLSAVV QTLVAKEKAL RPVDQNTQSK 2160
NRAALVMDLI DLTPRQKERA TSPHEITPQA EEKVPALESS SWSASKGLGH MSRALDKGSV 2220
SDPLLQPSGK TALPSEHPWQ AVKSLTQARL LSQPPAKAFL YEPATQASGR VPAGAEQTPG 2280
PPSQAPGLVK QMAGGQQLPG FAAKSGQSFR SLGKAPASLS TEEKKLATTE QSPWSLGKTS 2340
SGAGLWPIVA GQTLAQSCWS AGSTQTLAQT CWSLGRGQDP KPEQNTLPAL NQAPSNHKCA 2400
ESEQK 2405
Nucleotide Sequence
(Fasta)
GTTGGGAGGT GCTGCCGCAG CTGCGGGAAG GAGCGCGGCC CGGGCAGGCG GTGTCGGCGT 60
CGGCAGCAGC CATGTTTGTC AAGCTGTAGC AGCTGCTGCT ATCCTGATTT GGCTTCACCG 120
GCCGCCTCGG TTTCTCTTTT TGCGGCGTCC TGGCCTATGG GCCCTGCAGC TGTGGACAGG 180
GCAGGCCTGC AGTCAGGAAG GCGAACTCCG GGGTGCACCC GCCTCGGCCG ACTCGCCCGC 240
GGCCTGTGCA CTGCCGCTGC AAAGGCTCCG GCGCCGGCTA GGCGCAGGGT GCAGCGCTAT 300
TGTGACCGCT GCGCCCGAGC GAGCCAGGAA GGAGGAGGGG TACCTTTTTG TGCAGGGTCC 360
TGAAGCCCCT CTGGAATCCC ATAGCCCCCT CCCTTCAAGA GATCCGGCCG CTGGACCCCG 420
GACGGGGGAG GACGAGGACA ATTCCTGTTT TGAGATTTTT AGGAATTTTG ATCGCCAACG 480
GGATTTTAAT TTTAGTTTAA CCCAAGTGTC CACCAGTCTG CAGTGCAGGA AAAAGGGACG 540
GGTTGTTTCC ATGTGGCAGG AGTTGCCCAG CTTCTAAAAG ATGTAGTTTG CTGAGGCTCA 600
CTGAGGCCAT TTTTCCACCT TCAGCCAGGA GAACTTTTTA CACCCTTGGA AGTACAGCAG 660
AAAAGCATAG AGGCCACTAG GCCTTGAGAA ATGGCTGCCA TTTTGAAGAG AAGAGTCAGA 720
TGGCCTTATT AACTCGGATT AATTACTGTG TTTTTGGATT CCAGGTTGAT GCTGGCCCAG 780
GATGGATCAG ACCTGTGAAC TATCTAGAAG AAATTGTCTG CTGCCCTTTT CCAATCCAGT 840
GAATTTAGAT GCCCCTGCAG ACAAGGACAG CCCTTTCGGT AATGGTCAAT CCAATTTTTC 900
TGAGCCACTT AATGGGTGTA CTATGCAGTT ATCGGCTGCC AGTGGAACAT CCCAAAATGC 960
TTATGGACAA GATTCTCCAT CTTGTTACAT TCCACTGCGG AGACTACAGG ATTTGGCCTC 1020
CATGATCAAT GTAGAATATT TAAATGGGTC TGCTGATGGA TCAGAATCCT TTCAAGACCC 1080
TGAGAAAAGT GATTCAAGAG CTCAGTCGCC AATTGTTTGC ACTTCCTTGA GTCCTGGTGG 1140
TCCAACAGCA CTTGCTATGA AACAGGAACC CTCTTGTAAT AACTCCCCTG AACTCCAGTT 1200
AAAAGTAACA AAGACGATCA AGAATGGCTT TCTGCACTTT GAGAATTTTA CTTGTGTGGA 1260
CGATGTAGAT TCTGAAATGG ACCCAGAACA GCCAGTCACA GAGGATGAGA GTATAGAGGA 1320
GATCTTTGAG GAAACTCAGA CCAATGCCAC CTGCAATTAT GAGCCTAAAT CAGAGAATGG 1380
TGTAGAAATG GCCATGGGAA GTGAACAAGA CAGCACAACA GAGAGTAGAC ACGGTGCAGT 1440
CAAATCGCCA TTCTTGCCAT TAGCTCCTCA AACTGAATCG CAGAAAAATA AGCAAAGAAG 1500
TGAAGTGGAC GGCAGCAATG AAAAAACAGC CCTTCTCCCA GCCCCCTTTT CACTAGGAGA 1560
TACAAAAGTT ACCATAGAAG AGCAATTAAA CTCAATAAAT TTATCTTTTC AGGATGATCC 1620
AGACTCCAGC ACCAGTACAT TAGGAAACAT GCTAGAATTA CCTGGAACTT CATCATCATC 1680
TACTTCACAG GAATTGCCAT TTTGTCAACC CAAGAAAAAG TCTACACCAC TGAAGTATGA 1740
AGTTGGAGAT CTCATTTGGG CAAAATTCAA GAGACGCCCA TGGTGGCCCT GCAGGATTTG 1800
TTCTGATCCA TTGATTAACA CACACTCAAA AATGAAAGTT GCCAATCGGA GACCATATCG 1860
GGAATATTAC GTGGAGGCTT TTGGAGACCC TTCTGAAAGA GCCTGGGTGG CTGGAAAAGC 1920
AATCGTCATG TTTGAAGGCA GGCATCAATT TGAAGAACTA CCTGTCCTTA GGAAAAGAGG 1980
AAAGCAGAAA GAAAAAGGAT ATAGGCATAA GGTTCCTCAG AAAATTTTGA GTAAATGGGA 2040
AGCCAGCGTT GGTCTTGCTG AACAATATGA TGTTCCTAAA GGGTCAAAGA ACCGAAAATG 2100
TGTCACTAGT TCAGGCAAGT TGGACAGTGA GGAGGATATG CCATTTGAGG ACTGTACAAA 2160
TGATCCTGAA TCAGAACATG ATCTGTTGCT TAATGGCTGC TTGAAATCTC TGGCTTTTGA 2220
TTCTGAGCAT TCTGCAGATG AGAAGGAAAA GCCCTGTGCT AAGTCTCGAG CCAGAAAGAG 2280
CTCTGATAAT CCAAAAAGGA CTAGTGTGAA AAAAAGCATC ATGCAATTTG AAGCACATAA 2340
GGAAGAACGG AGGGGTAAGA TTCCAGAGAA CCTTGGCCTA AACTTTATCT CTGGGGATGT 2400
ATCTGATAAG CAGGCCTCTA ATGAACTTTC AAGGATAGCA AACAGCCTCA CAGGGTCCAG 2460
CACTGCCCCA GGAAGTTTCC TGTTTTCTTC GTGTGGCAAA AACAGTGCAA AGAAAGAATT 2520
TGAGACTTCA AATTGTGATT CTTTATTGGG CTTGTCTGAG GGTGCCTTGA TCTCTAAACA 2580
TTCTGGGGAG AAAAAGAAAC TTCAGCGAGG TCTGGTGTGT AGTTCAAAAG TACAGCTCTG 2640
TTATATTGGA GCTGGTGATG AGGAAAAGCG AAGTGATTCC ATCAGTATCT GTACCACTTC 2700
TGATGATGGA AGTAGTGATC TGGACCCTGT AGAACACAGC TCAGAGTCTG ATAACAGTGT 2760
CCTTGAAATT ACAGATACGT TTGATAGAAG AGAGAACCTG TTATCCATGC AGAAAAATGA 2820
AAAGATGAAG TATTCTAGGT ATCCTGCCAC GAACACTAGG GTGAAAGCAA AACAGAAATC 2880
TCTGATTAAC TCACATACAG ACCACTTAGT AGATTGTACA AAGACAGTAG ATCCTGGAAC 2940
TGAGACATCT CAGGTTAATC TCACTGATAT TAAAGTATCC ACTTTCGTCC ACAAAACCCA 3000
ATCAGAATTT AGAAATGATG GTCTTTCTCC AAAATTTAAC ACACCATCAA GCATTTCCAG 3060
TGAAAACACA CTGATAAAGG GTGGTACTAC AAATCAAGCT CTGTTACATT TGAAAAGCAA 3120
ACAGCCCAAG TTCCGAAGTA TAAAGTGTAA ACATAAAGAA AATCCAGTTA TAGTAGAACC 3180
CCAAGGTACA AATGAGGACT GTGGTTTGAA ATGCTGCTCT TCTGATACCA AAGGCTCTCC 3240
TTTGGCCAGC ATTTCTAAAA CTGGAAAGGT GGATGGGCTA AAACTACTGA ACAACATGCA 3300
TGAGAAAACC AGGGATTCAA GTGACATAGA AACAGCAGTA GTTAAACATG TTCTATCTGA 3360
ATTGAAGGAA CTCTCTTACA GATCTTTAGG TGATGATGTC AGTGACTCTG GAACATCAAA 3420
GCCATCAAAA CCGCTACTTT TTTCTTCTGC TTCTAGTCAG AATCATATAC CTATTGAACC 3480
AGACTACAAA TTTAGTACAT TGCTAATGAT GTTGAAAGAT ATGCATGATA GTAAGACCAA 3540
GGAGCAAAGG TTGATGACTG CTCAAAATTT GGTCTCTTAT CGAAGTCCTG TTCGTGGCGA 3600
CTGTTCTACT ACCAGTAGTC CTGTGGCAGC ATCTAAGGTT TTGGTTTCAG GAGGCTCCAC 3660
CCACAATTCA GAAAAAAATG GAGAGGGCAC TCAGGACTCA GCCCATCCTA CCTCTAGTGG 3720
TGACTCTGTG CTCTCTGGGG AATTGTCTGC CTCCTTACCT GGCATAGTGT CTGACAGAGG 3780
AGACCTTCCT GCTTCTGGCA AAAGTCGTCC CAACTGTGTT ACCAGGCGCA ACTGTGCCCG 3840
ATCAAAACCA TCCTCCAAAT TTCGAGATGT TTTTTCAGCC CAGGTGGCAA AGAATACAGT 3900
GAACCGGAAA GCCTTAAAGA CAGAGCGAAA AAGAAAACTG AACCAGAACC GACTTCCAGC 3960
TGTGACTCTG GAGACTGCAC TGCAGGGAGA CAAAGAGAGT GGATGCTCTG TGAATGGCCC 4020
ATCCAAGGGT GGGGTAGAAG ATCTTGGTAA AGAAGAAACT CTTCAATTAA CAGGACATTT 4080
AAAAAGTGAA GATGCTCAGT TTTCTGATGT ACATTTTGAT AAACAGTCTG AACCTGATAA 4140
AATTCTTGAA AAGGGTTCCC CCTTTGAGAA CAGAAAAGGC CCAGAGCTGG ACTCTGAAAT 4200
GAATAGTGAG AATGATGAAC TAAATGGGGT AAATCAAGTG GTGCCTAAAA AGCGGTGGCA 4260
GCGTTTAAAC CAAAGGCGCA CAAAACCTGG AAAGCGCACT AACAGGTTTA AGGAGAAAGA 4320
GAACTCTGAA GATGCCTTTG GGGTCTTGCT TCCTAGTGAC CCTGTTCAGA AGGGTCGGGA 4380
TGATTTCCCA GAGAATAGAA CTCCAACTTC TACAAACATA CTAGAGGACT CACTGACAGA 4440
TCCAAATCAT GCTAGCCACT TAGATTCAGT TGGTCCACCC TTGAATGTTT GTGATAAATC 4500
CAGTGCAAGC ATGGAAGATA TGGAAAAGGA GCCAGTAATT CCCAGTTTGA CACCCCAGGC 4560
TGAGCTCCCT GAACCAGCTG TGCGATCAGA GAAGAAACGC CTCAGGAAGC CCAGCAAGTG 4620
GCTTCTGGAA TATACAGAAG AATATGATCA GATATTTGCT CCTAAGAAAA AAAAGAAGAT 4680
ACAGGAACAG GTACACAAGG TAAGTTCCCG CTGTGAAGAG GAAAACCTTG TAGCCCGATG 4740
TCGAACTAGT GCTCAGAACA AGCAGGTGGA TGAGAATTCT TTGATTTCAA CCAAAGAAGA 4800
GCCTCCAGTT CTTGAAAGGG AGGCTCCATT TTTGGAGGGT CCCTTGGCTC AGTCAGAACT 4860
TGGAGGTGGA CATGCTGAGT TGCCACAGCT AACCTTGTCT GTTCCTGTGG CTCCAGAAGT 4920
CTCCCTACGA CCTGCCCTTG AGTCTGAGGA ATTGCTTGTT AAAACACCAG GAAATTATGA 4980
AAGTAAGCGT CAAAGAAAAC CAACAAAGAA ACTTCTTGAA TCCAATGATT TAGACCCTGG 5040
ATTTATGCCT AAGAAGGGAG ACCTTGGCTT TTCTAAAAAG TGTTATGATG CTGGTCACTT 5100
CCAGAATGGC ATTTCTAATT CATGTGCTGC ATCTCATTTA AAAGAGTTTG TTGGAGGCAC 5160
TACCAAGATA TTTGATAAGC CAAGGAAGCG AAAACGACAG AGGCATGTTA CAACTAAAGT 5220
GCAGTGTAAA AAAGTGAAAA ATGATGACTC ATCAAAAGGA ATTCCAAACT CAGAGGGAGA 5280
ACTGATGACT CACAGGACGG ATGCAACCCC CAAGGAGACT GTTGAGGAGG GTGTAGAACA 5340
TGACTCTGGA ATGTCTGCAT CTAAAAAAAT GCAGGGTGAA CGAGGTGGAG GAGCTGCCCT 5400
CAAGGAGAAT GTTTGTCAGA ACTGTGAGAA ACTTGGTGAG CTGCTCTTAT GTGAGGCTCA 5460
GTGCTGTGGG GCTTTCCATC TGGAGTGCCT TGGCTTAACT GAGATGCCAA GAGGAAAATT 5520
TATCTGCAAT GAATGTCGCA CAGGAATCCA TACCTGTTTT GTATGTAAGC AAAGTGGGGA 5580
AGATGTTAAA AGGTGCCTTT TGCCTTTGTG TGGAAAATTT TACCATGAAG AATGTGTCCA 5640
GAAATACCCA CCAACTGTCA TGCAGAACAA GGGCTTTCGG TGCTCCCTCC ACATCTGTAT 5700
AACCTGCCAT GCTGCTAATC CAGCCAGTGT TTCTGCATCT AAAGGTCGTC TGATGCGCTG 5760
TGTCCGATGC CCTGTGGCAT ACCATGCTAA TGATTTTTGT CTGGCTGCTG GGTCAAAGAT 5820
CCTTGCATCT AATAGTATCA TCTGCCCTAA TCACTTTACC CCTAGGCGGG GCTGTCGAAA 5880
TCATGAGCAT GTTAATGTCA GCTGGTGTTT TGTATGCTCA GAAGGAGGCA GCCTTCTGTG 5940
CTGTGATTCT TGCCCTGCTG CTTTTCATCG TGAATGCCTG AACATTGATA TCCCTGAAGG 6000
AAACTGGTAT TGCAATGACT GTAAGGCAGG CAAAAAGCCA CACTACAGGG AGATCGTGTG 6060
GGTAAAAGTT GGACGATACA GGTGGTGGCC AGCTGAGATC TGCCATCCTC GAGCTGTTCC 6120
TTCCAATATT GATAAGATGA GACATGATGT GGGAGAGTTC CCCGTGCTCT TCTTTGGGTC 6180
CAATGACTAT CTCTGGACTC ATCAGGCCCG AGTCTTTCCT TACATGGAGG GAGATGTGAG 6240
CAGCAAGGAT AAGATGGGCA AAGGAGTGGA TGGAACATAT AAAAAAGCTC TTCAGGAAGC 6300
TGCAGCAAGG TTTGAAGAGT TAAAAGCCCA GAAAGAGCTA AGACAGCTGC AGGAAGATCG 6360
AAAGAATGAC AAAAAACCAC CACCTTATAA ACACATAAAG GTTAACCGTC CTATTGGCAG 6420
GGTACAGATC TTCACTGCGG ACTTGTCGGA AATTCCCCGT TGCAACTGTA AAGCTACTGA 6480
TGAGAACCCC TGTGGGATAG ACTCTGAGTG CATCAACCGC ATGCTACTTT ATGAGTGTCA 6540
CCCTACAGTA TGTCCTGCTG GAGGGCGTTG TCAAAATCAG TGCTTCAGCA AGCGCCAGTA 6600
TCCAGAGGTT GAAATTTTCC GCACATTACA GAGGGGCTGG GGACTTCGGA CAAAAACAGA 6660
TATTAAAAAG GGTGAATTTG TGAATGAATA TGTGGGTGAA CTAATAGATG AAGAAGAGTG 6720
CAGAGCTCGA ATACGTTATG CCCAAGAACA TGATATCACT AATTTTTATA TGCTTACCCT 6780
AGACAAAGAC CGAATCATTG ATGCTGGTCC CAAAGGAAAC TATGCTCGCT TCATGAATCA 6840
TTGCTGCCAG CCTAACTGTG AAACACAGAA GTGGTCTGTG AATGGTGATA CCCGTGTTGG 6900
ACTTTTTGCC CTGAGTGACA TTAAAGCAGG CACTGAACTT ACCTTCAATT ACAACTTAGA 6960
ATGTCTTGGG AACGGAAAGA CTGTTTGCAA ATGTGGAGCC CCAAACTGCA GTGGCTTCTT 7020
GGGTGTCAGA CCAAAGAATC AACCTATTGC TACAGAAGAA AAATCAAAGA AATTCAAGAA 7080
GAAGCAACAG GGGAAGCGCA GGAGCCAGGG TGAAATCACT AAGGAACGAG AAGATGAATG 7140
TTTCAGCTGT GGGGATGCTG GTCAGCTCGT CTCCTGCAAG AAACCAGGCT GCCCAAAAGT 7200
TTACCATGCA GACTGTCTTA ATCTGACCAA GAGACCAGCA GGGAAGTGGG AGTGCCCTTG 7260
GCACCAATGT GACGTCTGTG GGAAAGAAGC AGCCTCCTTC TGTGAGATGT GCCCCAGCTC 7320
TTTTTGCAAG CAGCATCGGG AAGGGATGCT CTTCATTTCC AAACTTGATG GGCGTCTGTC 7380
TTGTACTGAA CATGACCCCT GTGGGCCCAA CCCTCTGGAA CCTGGGGAGA TCCGTGAGTA 7440
TGTGCCTCCC CCTGTGCCAA TGCCTCCAAG CCCCAGCTCA CGCCTGGCAG AGCAATCATC 7500
AGAAATGGCT GCTCAGGGGC CCAAGATGTC AGATAAGCCA CCGGCTGATG CCCATCAGAC 7560
GCTGCCACTC TCCAAAAAAG CTTTGGCAGG GACTTGTCAG AGGCCAGTGC TATCTGAAAG 7620
ACCTCTGGAA AGAACTGACT CCAGCTCCCA GCTTTTAGAT AGGGTCAGAG ACCTTGCTGG 7680
ATCAGGGACC AAATCCCAGT CCTTGGTATC CAGCCAGAGA CCACTGGACA GACCACCAGC 7740
AGTGGAAGGA CCAAGACCTC TGCTATCTGA CAAACCCTCT CCAGTGACCA GCCCAAGCTC 7800
CTCACCCTCA GTCAGGTCCC AACCACTGGA AAGACCTCTG GGGACAGCTG ACCCAAGGCT 7860
GGATAAATCC ATAGGTGCTG CCAGCCCAAG GCCCCATTCA CTGGAGAAAG CCCCAGCCCC 7920
AACTGGCCTG AGACTTCCAC CGACAGACAG ACTGCTAGTC ACCAGTAGTA GTCCTAAACC 7980
CCCAACTTTC GACAGGCCCC CAGACAAATC CCATGCCTCT TTGTCCCAGA GACTTCCACC 8040
TCCTGAGAAA GTACTATCAG CTGTGGTGCA GACCCTGGTA GCTAAAGAAA AAGCACTGAG 8100
GCCCGTGGAC CAGAATACTC AGTCAAAAAA CAGAGCTGCT TTAGTGATGG ATCTCATAGA 8160
CTTAACTCCT CGCCAGAAGG AGCGGGCTAC TTCTCCTCAT GAGATCACAC CACAGGCTGA 8220
GGAGAAGGTG CCTGCATTGG AGTCCAGCTC GTGGTCTGCC AGCAAAGGTC TGGGGCATAT 8280
GTCTCGAGCT CTGGATAAAG GCAGTGTGTC AGATCCTCTT CTCCAACCAT CTGGGAAAAC 8340
AGCACTCCCT TCAGAGCACC CCTGGCAAGC TGTTAAATCA CTCACCCAGG CCAGACTTCT 8400
TTCTCAGCCT CCTGCCAAGG CTTTTTTATA TGAGCCAGCA ACTCAGGCCT CAGGAAGAGT 8460
TCCTGCAGGG GCTGAGCAGA CCCCAGGGCC TCCCAGTCAA GCACCAGGCC TGGTGAAGCA 8520
GATGGCCGGA GGTCAGCAAC TACCTGGATT TGCTGCCAAG AGTGGGCAGT CCTTCAGGTC 8580
TCTTGGGAAG GCCCCAGCCT CCCTCTCCAC TGAAGAGAAG AAGTTGGCAA CCACAGAACA 8640
GAGTCCTTGG TCCCTGGGAA AAACTTCATC AGGGGCAGGG CTCTGGCCCA TAGTGGCTGG 8700
ACAGACGTTG GCGCAGTCTT GCTGGTCTGC TGGAAGCACA CAGACATTGG CACAGACTTG 8760
CTGGTCCCTT GGAAGAGGGC AAGACCCCAA ACCAGAGCAA AATACACTTC CAGCTCTTAA 8820
TCAGGCTCCT TCCAATCACA AGTGTGCAGA GTCAGAACAG AAATAA 8867
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 90 0.0 4294
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 89 0.0 4283
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 89 0.0 4275
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 90 0.0 4264
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 90 0.0 4256
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 90 0.0 4248
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 90 0.0 4246
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 90 0.0 4244
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 90 0.0 4240
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 90 0.0 4239
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 89 0.0 4227
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 89 0.0 4219
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 89 0.0 4211
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 89 0.0 4194
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 89 0.0 4155
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 88 0.0 4146
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 88 0.0 4139
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 88 0.0 4100
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 89 0.0 4081
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 88 0.0 4043
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 85 0.0 4038
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 83 0.0 3916
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 89 0.0 3902
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 83 0.0 3897
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 86 0.0 3821
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 89 0.0 3758
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 69 0.0 3147
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 68 0.0 3099
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 90 0.0 2789
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 76 0.0 2672
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 84 0.0 2374
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 57 0.0 2308
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 57 0.0 2293
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 57 0.0 2268
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 61 0.0 2163
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 86 0.0 2147
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 59 0.0 2098
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 51 0.0 1798
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 81 0.0 1664
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 74 0.0 1618
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 57 0.0 1497
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 85 0.0 1378
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 83 0.0 1375
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 75 0.0 1373
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 86 0.0 1290
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1283
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1248
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 77 0.0 1206
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1204
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1139
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 79 0.0 1079
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 82 0.0 1070
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 64 0.0 1056
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 76 0.0 1047
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 67 0.0 997
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 973
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 65 0.0 962
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 957
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 64 0.0 952
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 64 0.0 940
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 64 0.0 932
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 915
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 82 0.0 913
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 650
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 2e-121 436
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 1e-55 217
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 3e-50 199
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 8e-50 197
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 9e-50 197
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 42 3e-49 196
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 4e-49 195
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 7e-49 194
Created Date 25-Jun-2016