WERAM Information


Tag Content
WERAM ID WERAM-Prc-0109
Ensembl Protein ID ENSPCAP00000009738.1
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPCAG00000010330.1 ENSPCAT00000010436.1 ENSPCAP00000009738.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.40e-52 176.4 1945 2061
HMT SET1 1.70e-29 102 1945 2061
Me_Reader PWWP 7.10e-20 70.6 321 1819
Me_Reader PHD 5.40e-11 42.2 1546 2165
Organism Procavia capensis
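The Classification rows above are whitespace-delimited, so they can be loaded into typed records for filtering or sorting. The following is a minimal sketch, not part of WERAM itself: the `DomainHit` class and `parse_hits` function are hypothetical names introduced here for illustration. Judging from the Domain Profile section, the Start/End columns appear to give the outer bounds of all matched segments for a family (for example, PWWP spans 321 to 1819 although its two aligned segments are 321-349 and 1771-1819).

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    """One row of the Classification table (hypothetical container)."""
    type: str
    family: str
    evalue: float
    score: float
    start: int
    end: int


# Rows copied from the Classification table of this record.
ROWS = """\
HMT SET2 1.40e-52 176.4 1945 2061
HMT SET1 1.70e-29 102 1945 2061
Me_Reader PWWP 7.10e-20 70.6 321 1819
Me_Reader PHD 5.40e-11 42.2 1546 2165
"""


def parse_hits(text):
    """Split each non-empty row into its six columns and convert the types."""
    hits = []
    for line in text.splitlines():
        if not line.strip():
            continue
        kind, family, evalue, score, start, end = line.split()
        hits.append(
            DomainHit(kind, family, float(evalue), float(score), int(start), int(end))
        )
    return hits


hits = parse_hits(ROWS)
# The hit with the smallest E-value is the most significant one.
best = min(hits, key=lambda h: h.evalue)
print(best.family, best.start, best.end)
```

Sorting by E-value rather than by score keeps the comparison valid across families whose profile lengths differ.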
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSPCAP00000009738.1 1945 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2031
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSPCAP00000009738.1 2032 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2061
*****************************8 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSPCAP00000009738.1 1945 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2029
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSPCAP00000009738.1 2030 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2061
*******************************7 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl 29 
+gdL+waK k+ pwWP++++s+pl ++
ENSPCAP00000009738.1 321 VGDLIWAKFKRRPWWPCRICSDPLINTHS 349
69********************9876665 PP
PWWP.txt 14 wWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSPCAP00000009738.1 1771 WWPAEICHPRTVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1819
********************************.***************87 PP

  Me_Reader PHD

               PHD.txt    6 vCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
vC+ +e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSPCAP00000009738.1 1546 VCQXXXXXXSELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1590
576666666679****99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSPCAP00000009738.1 1593 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1640
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSPCAP00000009738.1 1641 ICITCHAANPASVTaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1694
8****87777754466778************965599988.55555558999998 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSPCAP00000009738.1 2123 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2165
9999..33442..9********************..*****.*****886 PP
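Each target line in the alignments above carries a start coordinate, the aligned residues, and an end coordinate. A quick sanity check is that the number of residues, ignoring gap characters, equals end - start + 1. The sketch below assumes the four-field layout seen in this record; `parse_target_line`, `ungapped_length` and `coordinates_consistent` are hypothetical helper names.

```python
def parse_target_line(line):
    """Split a target line into (name, start, aligned residues, end)."""
    name, start, aligned, end = line.split()
    return name, int(start), aligned, int(end)


def ungapped_length(aligned):
    """Count residues, skipping the gap characters '-' and '.'."""
    return sum(1 for ch in aligned if ch not in "-.")


def coordinates_consistent(line):
    """True when start + residue count - 1 equals the stated end."""
    _, start, aligned, end = parse_target_line(line)
    return start + ungapped_length(aligned) - 1 == end


# Two target lines copied from the SET2 and PHD alignments of this record.
SET2_LINE = "ENSPCAP00000009738.1 2032 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2061"
PHD_LINE = (
    "ENSPCAP00000009738.1 2123 "
    "CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2165"
)

for line in (SET2_LINE, PHD_LINE):
    print(line.split()[1], coordinates_consistent(line))
```

Lowercase residues in a target line are insertions relative to the profile, so they still count toward the sequence coordinates.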

Protein Sequence
(Fasta)
MDQTCELPRR NCLPSFSNPV NLDAPDDKDS PFGNGQSSFS EPINGCTVQL PTVSGTSQNA 60
YGQDSPSYIP LRRLQDLASM INVEYLNGSA DGSESFQDPE KSDSRAQSPV CTSLSPGGPT 120
ALPMKQKPSC NNSPELQVKV TKTVKNGFLH FENFTCVDDA DVDSEMDPEQ PVTEDDSIEE 180
IFEETQTNAT CNYEPKSENR VEVAMGNEQD STSESRHGAV KSPFLPLAPQ TETQKNKQRN 240
EVDGSSEKAA LLPAPFSLGD TNITIEEQLN SINLSFQDDP DSSTSTLGNM LELPGTSSSS 300
TSQELPFCQP KKKSTPLKYE VGDLIWAKFK RRPWWPCRIC SDPLINTHSK MKXXXXXXXX 360
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX VPQKILSKWE 420
ASVGLAEQYD VPRGSKNQKC VRSSVKMDSE EDMPFEDCTN DPESEHDLLL NGCLSSLAFD 480
SEHSADEKEK PCVKSRVRKT PDNPKRTSVK KSHTQFEAHK DERRGKIPEN LGLNFISGDV 540
SDKQASHELS RIASSLTGSN TAPRSFLFSS CGKNTADKEF ETSHCDSLLG LPEGALISKH 600
SGEKIKPQRG LVCNSKVQLC YIGAGDEEKR SDSISICTTS DDGSSDLDPV DHSSESDNSV 660
LEITDAFERT ENMLSMQQNE KIRYSRFSAT NTRVRAKQKS LITNSHTDHL MGCTKTTETG 720
TETSPVNLSD LKASTLVCKS SSDFRNDSVP PKFSTSSSIS SESSLIKGGI TDQALLHSKS 780
RQPRIRSIKC KHKENPAVVE PPTTNEDCSL KCCSSDTKGS PLASVSKSGK MDGLKLLSNV 840
HEKTRDSSDI ETAVVKHVLS ELKELSCRSL SEDVNDSGTS KPSKPILFTS ASGQNHIPIE 900
PDYKFSTLLM MLKDMHDSKT KEQRLMTAQN LVSYRSPGLG DCSTNNSVGS SKVLISGGST 960
YNSEKSGGDS QDSVHPSPSG SDSALSGELS TSLPGLVSAR RDLPASGRSR SNCVTRRNCG 1020
QSKSSKLQDG FSSQLGKNTV NRKALKTERK RKVNRLPGVT LEAALQGGRE SGGSVSVSSR 1080
SGREDPGKEE LLQLKGHLTS ENCAQFSEVH FDNKVKQSDL DKIPKKACSC EKRKSPELDS 1140
EMNSENGEHN GVYQVVPKKR WQCLNQKRTK PRKRTNRFKE KENSEGAFGV LLPGDPVQKG 1200
SDFPEHRPPT STNVLEDALT EPNCAGHLDS AGSRLNGCDK TSTNTEDMEK EPGIPSLTPQ 1260
SELLEPXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1320
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXHAELP QLTLSVPVAP 1380
EISPRPDRES EELLVKPSGN YESKRQRKPT KKLLESNDLD PGFMPKKGDQ GLPKKXXXXX 1440
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1500
GELMTHRTAG SPKVAVEDGV EHDHGMPASK RMQSERGGGA ALKENVCQXX XXXXSELLLC 1560
EAQCCGAFHL ECLGLTEMPR GKFICNECRT GIHTCFVCKQ SGEDVKRCLL PLCGKFYHEE 1620
CVQKYPPTVM QNKGFRCSLH ICITCHAANP ASVTASKGRL MRCVRCPVAY HANDFCLAAG 1680
SKILASNSII CPNHFTPRRG CRNHEHVNVS WCFVCSEXXX XXXXXXXXXX XXXXXXXXXX 1740
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX WWPAEICHPR TVPSNIDKMR HDVGEFPVLF 1800
FGSNDYLWTH QARVFPYMEG DVSSKDKMGK GVDGTYKKAL QEAAARFEEL KAQKELRQLQ 1860
EDRKNDKKPP PYKHIKVNRP IGRVQIFTAD LSEIPRCNCK ATDENPCGID SECINRMLLY 1920
ECHPTVCPAG GRCQNQCFTK RQYPEVEIFR TLQRGWGLRT KTDIKKGEFV NEYVGELIDE 1980
EECRARIRYA QEHDITNFYM LTLDKDRIID AGPKGNYARF MNHCCQPNCE TQKWSVNGDT 2040
RVGLFALSDI KAGTELTFNY NLECLGNGKT VCKCGAPNCS GFLGVRPKNQ PIATEEKSKK 2100
FKKKQQGKRR TQGEVTKERE DECFSCGDAG QLVSCKKPGC PKVYHADCLN LTKRPAGKWE 2160
CPWHQCDICG KEAASFCEMC PSSFCKQHRE GMLFISKLDG RLSCTEHDPC GPNPLEPGEI 2220
REYVPPQVPL PPGSSPHPAE QSSGMAAQGP KMSEKPHADT SQTVSLSIKA LPGTCQRPPL 2280
PERSLERTDC RPQPLDRVRD LAGPGTKSQP CGSSQRQLDR PPAIAGPRPQ LSDKSSPVTS 2340
SCSSPSVRSQ ALERPLGMAG SRLDKSIGAA SPRHQPLEKA PVPPGLRLPP PDRLLITSSP 2400
KPQTSDRPKD KSHVSLSQRL PPPXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2460
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 2520
XXXXXXXXXQ AVKSLTQARL LSQPLTKFLY EPATQASGRS AEAEQTPGFP SQAPGLGKQT 2580
TGGQQLPRLS AKGTTLSGQS FRSLGNAPAS LPTEEKKLTT TEQSPWVLGK ASLGPGLWPM 2640
VAGQTLTPPC WSTGSTQTLA QTCWSLGRGQ DPKPEQNTVP AINQTSSSHK CAESEQK 2697
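The protein sequence above is printed in blocks of ten residues with a running position at the end of each line, and runs of X mark masked or unresolved residues. A minimal sketch for recovering the plain sequence and locating the masked runs follows. The function names and the `offset` parameter are assumptions made for illustration, and only the last three lines of the record are used as sample input.

```python
import re


def clean_blocked_sequence(text):
    """Remove whitespace and the running position numbers."""
    return re.sub(r"[\s\d]", "", text)


def masked_runs(seq, offset=1):
    """Return 1-based (start, end) pairs for each run of X.

    `offset` is the sequence position of the first character of `seq`.
    """
    return [
        (m.start() + offset, m.end() + offset - 1)
        for m in re.finditer(r"X+", seq)
    ]


# Last three lines of the protein sequence; the first residue is position 2521.
EXCERPT = """\
XXXXXXXXXQ AVKSLTQARL LSQPLTKFLY EPATQASGRS AEAEQTPGFP SQAPGLGKQT 2580
TGGQQLPRLS AKGTTLSGQS FRSLGNAPAS LPTEEKKLTT TEQSPWVLGK ASLGPGLWPM 2640
VAGQTLTPPC WSTGSTQTLA QTCWSLGRGQ DPKPEQNTVP AINQTSSSHK CAESEQK 2697
"""

seq = clean_blocked_sequence(EXCERPT)
print(len(seq), masked_runs(seq, offset=2521))
```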
Nucleotide Sequence
(Fasta)
ATGGATCAGA CCTGTGAACT ACCCAGAAGA AATTGTCTGC CGTCCTTTTC CAATCCAGTG 60
AATTTAGATG CCCCAGATGA CAAGGACAGC CCATTCGGTA ATGGTCAATC CAGTTTTTCT 120
GAGCCCATTA ATGGGTGTAC TGTGCAGTTA CCGACTGTCA GTGGAACATC CCAAAATGCT 180
TATGGACAAG ATTCTCCATC TTACATTCCA CTGCGGAGAC TACAGGATTT GGCCTCCATG 240
ATCAATGTAG AATATTTAAA TGGGTCTGCT GATGGATCAG AATCCTTTCA AGACCCTGAA 300
AAAAGTGATT CAAGAGCTCA GTCGCCAGTT TGCACTTCCT TGAGTCCTGG TGGTCCAACA 360
GCACTTCCTA TGAAACAGAA ACCCTCTTGT AATAACTCCC CTGAACTCCA GGTAAAAGTA 420
ACAAAGACTG TCAAGAATGG CTTTCTGCAC TTTGAAAATT TTACTTGTGT GGACGATGCA 480
GATGTAGATT CTGAAATGGA CCCAGAACAG CCAGTCACAG AGGATGACAG TATAGAGGAG 540
ATCTTTGAGG AAACTCAGAC CAATGCCACC TGCAATTATG AGCCTAAATC AGAGAATCGT 600
GTAGAAGTGG CCATGGGAAA TGAACAAGAC AGCACATCAG AGAGTAGACA CGGTGCAGTC 660
AAATCGCCAT TCTTGCCATT AGCTCCTCAA ACTGAAACAC AGAAAAATAA GCAAAGAAAT 720
GAAGTGGACG GCAGCAGTGA AAAAGCAGCC CTTCTCCCAG CCCCCTTTTC ACTAGGAGAT 780
ACAAACATTA CTATAGAAGA GCAATTAAAC TCAATAAATT TATCTTTTCA GGATGATCCA 840
GACTCCAGTA CCAGTACATT AGGAAACATG CTAGAATTAC CTGGAACTTC ATCATCATCT 900
ACTTCACAGG AGTTGCCATT TTGTCAACCC AAGAAGAAGT CTACACCACT GAAGTATGAA 960
GTTGGTGATC TCATCTGGGC AAAATTCAAG AGACGCCCAT GGTGGCCCTG CAGGATTTGT 1020
TCTGATCCAT TGATTAATAC ACACTCAAAA ATGAAAGNNN NNNNNNNNNN NNNNNNNNNN 1080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1200
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN GTTCCTCAGA AGATTTTGAG TAAATGGGAA 1260
GCCAGTGTTG GTCTTGCTGA ACAGTATGAT GTTCCTAGAG GGTCAAAGAA CCAAAAATGT 1320
GTCAGAAGTT CAGTCAAGAT GGACAGTGAG GAGGATATGC CATTTGAGGA CTGTACAAAT 1380
GACCCTGAAT CAGAACATGA TCTGTTGCTT AACGGCTGTT TGAGTTCTCT GGCTTTTGAC 1440
TCTGAGCATT CTGCAGATGA GAAAGAAAAG CCATGTGTTA AGTCTCGAGT CAGAAAGACC 1500
CCTGATAATC CAAAACGGAC TAGTGTGAAA AAGAGCCACA CGCAGTTTGA AGCACATAAG 1560
GATGAACGGA GAGGAAAGAT TCCAGAGAAC CTTGGCCTAA ACTTTATTTC TGGGGATGTT 1620
TCTGATAAGC AGGCCTCTCA TGAACTTTCC AGGATAGCCA GCAGCCTCAC AGGGTCCAAC 1680
ACTGCCCCAA GGAGTTTCCT GTTTTCTTCT TGTGGAAAAA ATACTGCAGA TAAAGAATTT 1740
GAGACTTCAC ATTGTGACTC TTTACTTGGC TTGCCTGAGG GTGCCTTGAT CTCTAAACAT 1800
TCTGGCGAGA AGATAAAACC CCAAAGAGGT CTGGTTTGCA ATTCAAAGGT GCAGCTCTGC 1860
TATATTGGAG CAGGTGATGA GGAAAAACGC AGTGATTCCA TTAGCATCTG TACCACTTCT 1920
GATGATGGAA GCAGTGATCT GGATCCTGTA GATCACAGCT CAGAGTCTGA TAACAGTGTC 1980
CTTGAAATTA CAGATGCTTT TGAGAGAACA GAGAACATGT TATCCATGCA ACAAAATGAA 2040
AAGATCAGAT ATTCTCGATT TTCTGCCACA AACACTAGGG TAAGAGCAAA GCAGAAATCC 2100
CTCATTACCA ACTCACATAC AGACCACTTA ATGGGTTGTA CTAAGACAAC AGAGACTGGA 2160
ACTGAGACAT CTCCAGTTAA TCTCTCTGAT CTTAAGGCGT CTACCCTTGT TTGCAAATCT 2220
TCCTCAGATT TTAGAAATGA CAGTGTCCCT CCAAAATTCA GCACATCATC AAGTATTTCC 2280
AGTGAGAGTT CACTAATAAA AGGTGGGATT ACAGATCAAG CTCTGTTACA TTCAAAAAGC 2340
AGACAGCCCA GGATCCGAAG TATAAAGTGC AAACACAAAG AAAATCCGGC AGTTGTAGAA 2400
CCCCCAACTA CAAATGAGGA CTGCAGTTTG AAATGCTGCT CTTCTGATAC CAAAGGCTCT 2460
CCTTTGGCCA GTGTTTCCAA AAGTGGAAAA ATGGATGGGC TAAAACTACT GAGTAATGTG 2520
CATGAGAAAA CCAGGGATTC TAGTGACATA GAAACAGCAG TGGTGAAACA CGTTCTGTCA 2580
GAGTTGAAGG AACTCTCCTG TAGATCCTTA AGTGAGGATG TCAATGATTC TGGAACATCA 2640
AAGCCATCAA AACCAATACT TTTTACTTCT GCTTCTGGCC AGAATCATAT ACCCATTGAA 2700
CCAGACTACA AATTTAGCAC ATTGCTAATG ATGTTGAAAG ATATGCACGA TAGTAAGACC 2760
AAAGAGCAAC GGTTGATGAC TGCTCAAAAC TTGGTCTCCT ATCGGAGTCC TGGTCTGGGG 2820
GATTGTTCTA CTAATAATTC TGTAGGGTCT TCTAAGGTCT TGATTTCAGG AGGCTCCACT 2880
TACAATTCAG AAAAAAGTGG AGGTGACAGT CAAGACTCAG TCCATCCCAG CCCTAGTGGG 2940
AGTGACTCTG CACTGTCTGG GGAGTTGTCT ACCTCCTTAC CTGGCTTAGT GTCAGCCAGA 3000
AGGGACCTTC CTGCTTCTGG GAGAAGTCGT TCAAACTGTG TTACTAGGCG CAACTGTGGA 3060
CAGTCAAAGT CATCCAAGTT GCAAGATGGT TTTTCATCCC AGTTGGGAAA GAACACAGTG 3120
AACCGGAAAG CCTTAAAGAC AGAACGCAAA AGGAAGGTGA ACCGGCTTCC AGGTGTAACT 3180
CTTGAGGCTG CACTGCAAGG AGGCAGAGAA AGTGGAGGCT CAGTGAGTGT TTCTTCAAGG 3240
AGTGGAAGAG AAGACCCTGG TAAAGAAGAA CTTCTTCAGT TAAAGGGCCA TTTAACAAGT 3300
GAAAACTGTG CTCAGTTTTC TGAAGTTCAT TTTGATAACA AGGTCAAACA ATCTGACCTT 3360
GATAAAATTC CCAAGAAAGC CTGCTCTTGT GAAAAAAGAA AAAGTCCAGA GCTGGACTCT 3420
GAAATGAACA GTGAGAATGG TGAACACAAT GGTGTATATC AAGTAGTGCC TAAAAAACGG 3480
TGGCAGTGTT TAAACCAAAA GCGCACTAAA CCACGTAAGC GCACTAACAG ATTTAAGGAG 3540
AAAGAAAACT CTGAGGGTGC CTTTGGGGTC TTGCTTCCTG GTGACCCTGT ACAGAAGGGG 3600
AGTGACTTCC CAGAACATAG ACCACCTACT TCTACAAATG TACTGGAAGA TGCACTGACA 3660
GAGCCAAATT GTGCAGGCCA CTTAGATTCA GCTGGGTCAC GGTTGAATGG TTGTGATAAG 3720
ACCAGTACCA ACACTGAAGA TATGGAAAAG GAGCCAGGAA TTCCCAGTTT GACACCCCAG 3780
TCAGAGCTCC TGGAACCAGN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3840
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4080
NNNNNNNNNN NNNGACATGC TGAGTTACCA CAGCTGACTT TGTCTGTGCC TGTAGCTCCG 4140
GAAATCTCTC CACGGCCTGA CCGTGAGTCT GAAGAATTGC TAGTTAAACC ATCAGGAAAC 4200
TATGAAAGTA AGCGCCAAAG GAAACCAACT AAGAAACTTC TTGAATCCAA TGATTTAGAC 4260
CCTGGATTTA TGCCCAAGAA GGGGGATCAG GGCCTACCTA AAAAGNNNNN NNNNNNNNNN 4320
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4380
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4440
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4500
GGAGAACTGA TGACACATAG AACGGCAGGA AGCCCCAAGG TGGCTGTTGA GGATGGTGTA 4560
GAACATGACC ATGGGATGCC TGCATCTAAA AGAATGCAAA GTGAACGTGG TGGAGGAGCA 4620
GCACTCAAGG AGAATGTTTG TCAGNNNNNN NNNNNNNNNN TCAGTGAGCT CCTGTTATGT 4680
GAGGCTCAGT GCTGTGGAGC TTTCCACCTG GAGTGCCTTG GGTTAACTGA GATGCCAAGA 4740
GGAAAATTTA TCTGCAATGA GTGTCGCACA GGAATCCATA CCTGTTTTGT ATGTAAACAG 4800
AGTGGGGAAG ATGTTAAAAG GTGCCTTTTG CCCTTATGTG GAAAATTTTA CCATGAAGAG 4860
TGTGTCCAGA AGTACCCACC CACTGTCATG CAAAACAAGG GTTTCCGGTG CTCCCTTCAT 4920
ATCTGCATAA CCTGCCATGC TGCTAATCCA GCCAGTGTTA CTGCATCTAA AGGTCGCCTA 4980
ATGCGCTGTG TTCGCTGCCC TGTGGCATAC CACGCCAATG ACTTTTGCCT GGCTGCTGGG 5040
TCAAAGATCC TTGCATCTAA CAGTATCATC TGCCCTAATC ATTTTACGCC TAGGCGGGGT 5100
TGTCGAAATC ACGAACATGT GAATGTTAGC TGGTGTTTTG TGTGCTCAGA AGNNNNNNNN 5160
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5220
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 5280
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNG TGGTGGCCAG CTGAAATCTG CCATCCTCGC 5340
ACTGTACCTT CCAACATCGA TAAGATGAGA CATGATGTGG GCGAATTCCC TGTACTCTTC 5400
TTTGGGTCTA ATGACTATCT GTGGACTCAC CAGGCCAGAG TCTTCCCCTA CATGGAAGGA 5460
GATGTTAGCA GCAAGGATAA GATGGGCAAA GGAGTGGATG GGACATACAA AAAAGCCCTT 5520
CAGGAAGCTG CAGCAAGATT TGAGGAGTTA AAGGCCCAAA AAGAGCTAAG ACAGCTACAG 5580
GAAGACCGTA AGAATGACAA GAAGCCACCG CCGTATAAAC ATATAAAGGT GAATCGCCCT 5640
ATTGGCAGGG TTCAGATCTT CACTGCAGAC TTGTCTGAAA TTCCCCGTTG CAACTGTAAA 5700
GCTACTGATG AGAACCCTTG TGGGATAGAT TCCGAGTGCA TCAACCGCAT GCTGCTCTAT 5760
GAGTGCCATC CTACAGTATG TCCTGCCGGG GGGCGCTGCC AAAACCAGTG CTTCACCAAG 5820
CGCCAGTACC CAGAAGTTGA AATTTTTCGC ACATTGCAGA GGGGTTGGGG CCTTCGAACA 5880
AAAACAGATA TAAAAAAGGG TGAATTTGTG AATGAGTATG TGGGTGAGCT AATAGATGAA 5940
GAAGAATGCA GAGCTCGAAT TCGTTATGCC CAGGAACATG ATATCACTAA TTTCTATATG 6000
CTCACACTAG ACAAAGACCG GATCATTGAT GCTGGTCCCA AAGGAAACTA TGCTCGGTTC 6060
ATGAATCACT GCTGCCAGCC CAACTGTGAG ACACAGAAGT GGTCTGTGAA TGGAGATACC 6120
CGTGTTGGCC TTTTTGCCCT AAGTGACATC AAAGCAGGTA CAGAACTTAC CTTCAACTAC 6180
AACCTGGAAT GTCTTGGGAA TGGAAAAACT GTTTGTAAAT GTGGAGCCCC GAACTGCAGT 6240
GGCTTCCTGG GGGTGAGGCC AAAGAATCAG CCCATTGCCA CAGAAGAAAA GTCCAAGAAA 6300
TTCAAGAAGA AACAACAGGG CAAGCGCAGG ACCCAGGGCG AAGTCACAAA GGAGCGAGAG 6360
GATGAGTGCT TCAGCTGTGG AGATGCTGGG CAGCTTGTCT CTTGTAAAAA GCCAGGCTGC 6420
CCAAAAGTCT ACCATGCAGA CTGTCTCAAT CTAACCAAGC GCCCAGCAGG AAAATGGGAG 6480
TGTCCTTGGC ACCAGTGTGA CATCTGTGGG AAGGAAGCCG CCTCCTTCTG TGAGATGTGC 6540
CCCAGCTCCT TTTGCAAGCA GCATCGGGAA GGAATGCTCT TCATCTCCAA ACTGGATGGG 6600
CGTCTGTCTT GTACTGAGCA TGACCCCTGT GGGCCAAACC CTCTGGAGCC TGGGGAGATT 6660
CGTGAGTATG TGCCTCCCCA AGTACCACTG CCTCCAGGCT CAAGCCCTCA CCCAGCAGAG 6720
CAGTCATCGG GAATGGCTGC TCAGGGGCCA AAGATGTCGG AAAAGCCACA TGCTGACACC 6780
AGCCAGACAG TGTCGCTGTC CATAAAAGCT CTCCCAGGAA CTTGTCAGAG GCCACCACTG 6840
CCTGAAAGAT CTCTTGAGAG AACTGACTGT AGGCCCCAGC CTTTAGATCG GGTCAGGGAC 6900
CTTGCTGGGC CAGGAACCAA ATCCCAACCT TGTGGATCCA GCCAGAGGCA ATTAGACAGG 6960
CCTCCTGCAA TTGCAGGACC AAGACCCCAG CTCTCTGACA AATCTTCTCC AGTGACCAGC 7020
TCATGTTCTT CACCTTCAGT TAGGTCCCAG GCACTGGAAA GACCACTGGG GATGGCTGGC 7080
TCAAGGCTGG ATAAATCCAT AGGTGCTGCC AGTCCAAGGC ATCAGCCACT GGAGAAAGCC 7140
CCAGTACCCC CTGGCCTGAG ACTTCCACCC CCAGACAGAC TACTAATTAC CAGTAGTCCC 7200
AAGCCCCAGA CTTCAGATAG GCCTAAAGAC AAATCCCATG TCTCTTTGTC CCAGAGACTC 7260
CCACCTCCNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7320
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7380
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7440
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7500
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 7560
NNNNNNNNNN NNNNNNNNNN NNNNNNNCAA GCTGTTAAAT CACTCACCCA GGCCAGACTT 7620
CTTTCTCAGC CCCTGACCAA GTTTTTATAT GAGCCAGCAA CTCAGGCCTC AGGAAGATCA 7680
GCAGAGGCTG AGCAGACCCC GGGGTTTCCC AGCCAAGCCC CAGGCCTTGG GAAGCAGACA 7740
ACTGGAGGCC AGCAATTACC TAGACTTTCT GCCAAAGGGA CAACACTGAG TGGGCAGTCC 7800
TTCAGGTCTC TTGGGAATGC CCCAGCCTCC CTTCCCACTG AGGAAAAGAA GTTGACCACA 7860
ACAGAGCAGA GTCCCTGGGT ACTGGGAAAG GCCTCCCTGG GGCCAGGACT CTGGCCCATG 7920
GTGGCTGGAC AGACACTGAC GCCACCATGC TGGTCCACTG GGAGCACACA GACATTGGCA 7980
CAGACTTGCT GGTCTCTTGG ACGAGGGCAA GACCCTAAAC CAGAGCAAAA TACAGTTCCA 8040
GCTATTAACC AGACTTCTTC CAGTCACAAG TGTGCAGAAT CGGAACAGAA ATAA 8094
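The nucleotide sequence above is the coding sequence of the protein, so a 2697-residue protein plus a stop codon corresponds to 3 × 2698 = 8094 nucleotides. Codons that contain N line up with the X residues in the protein. The sketch below translates with the standard genetic code, which is an assumption about how the record was produced, and maps any codon containing an ambiguous base to X. `clean`, `translate` and `expected_cds_length` are hypothetical helper names.

```python
import re
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(text):
    """Remove whitespace and the running position numbers."""
    return re.sub(r"[\s\d]", "", text)


def translate(dna):
    """Translate complete codons; codons with non-ACGT bases become X."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(
        CODON_TABLE.get(dna[i:i + 3], "X") for i in range(0, usable, 3)
    )


def expected_cds_length(protein_length):
    """Nucleotides needed for the protein plus one stop codon."""
    return 3 * (protein_length + 1)


# Final line of the nucleotide sequence, without its position number.
LAST_LINE = "GCTATTAACC AGACTTCTTC CAGTCACAAG TGTGCAGAAT CGGAACAGAA ATAA"

print(translate(clean(LAST_LINE)))
print(expected_cds_length(2697))
```

The translated tail matches the last residues of the protein sequence (AINQTSSSHKCAESEQK) followed by the stop codon.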
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 81 0.0 1949
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 81 0.0 1946
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 80 0.0 1930
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 81 0.0 1928
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 80 0.0 1913
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 80 0.0 1910
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 80 0.0 1900
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 80 0.0 1899
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 80 0.0 1886
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 80 0.0 1879
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 80 0.0 1875
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 79 0.0 1855
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 78 0.0 1848
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 84 0.0 1703
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 82 0.0 1666
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 83 0.0 1659
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 82 0.0 1658
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 82 0.0 1652
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 82 0.0 1647
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 82 0.0 1645
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 81 0.0 1637
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 81 0.0 1631
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 72 0.0 1620
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 88 0.0 1560
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 87 0.0 1550
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 86 0.0 1535
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 83 0.0 1503
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 82 0.0 1457
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 88 0.0 1450
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 76 0.0 1382
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 76 0.0 1332
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 74 0.0 1332
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 68 0.0 1311
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 73 0.0 1306
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 73 0.0 1305
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 84 0.0 1274
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 78 0.0 1270
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 80 0.0 1238
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 83 0.0 1140
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 76 0.0 1135
WERAM-Dar-0128 ENSDARP00000078549.4 Danio rerio 71 0.0 1078
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 72 0.0 1064
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 69 0.0 1060
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 84 0.0 1018
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 70 0.0 1003
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 67 0.0 993
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 82 0.0 953
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 72 0.0 952
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 70 0.0 932
WERAM-Pes-0051 ENSPSIP00000007311.1 Pelodiscus sinensis 62 0.0 895
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 60 0.0 866
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 75 0.0 864
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 84 0.0 860
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 58 0.0 837
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 59 0.0 833
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 64 0.0 829
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 57 0.0 828
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 57 0.0 814
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 77 0.0 713
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 63 0.0 709
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 66 0.0 638
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 77 0.0 634
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 65 2e-175 615
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 51 1e-154 546
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 43 5e-78 291
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 3e-50 199
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 8e-50 198
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 1e-49 197
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 42 3e-49 196
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 4e-49 196
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 8e-49 194
Created Date 25-Jun-2016
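The Orthology rows above cannot be split on a fixed number of columns because species names have two or three words (for example, Mustela putorius furo). One workable approach, sketched below, takes the first two tokens as identifiers, the last three as the numeric columns, and everything in between as the species. `parse_ortholog` is a hypothetical helper, and treating Identity as a percentage is an assumption, since the record does not state the unit.

```python
def parse_ortholog(line):
    """Parse one Orthology row by reading fixed columns from both ends."""
    tokens = line.split()
    identity, evalue, score = tokens[-3:]
    return {
        "weram_id": tokens[0],
        "protein_id": tokens[1],
        "species": " ".join(tokens[2:-3]),  # two or three words
        "identity": int(identity),          # assumed to be a percentage
        "evalue": float(evalue),
        "score": int(score),
    }


# Three rows copied from the Orthology table of this record.
ROWS = [
    "WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 80 0.0 1930",
    "WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 80 0.0 1899",
    "WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 43 5e-78 291",
]

orthologs = [parse_ortholog(row) for row in ROWS]
for entry in orthologs:
    print(entry["species"], entry["identity"], entry["score"])
```

This only works when the identifier columns contain no internal spaces, so a row whose protein ID was broken by extraction would need repair first.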