WERAM Information


Tag                   Content
WERAM ID              WERAM-Sah-0142
Ensembl Protein ID    ENSSHAP00000014900.1
Gene Name             NSD1
Ensembl Information
  Ensembl Gene ID         Ensembl Transcript ID   Ensembl Protein ID
  ENSSHAG00000012712.1    ENSSHAT00000015025.1    ENSSHAP00000014900.1
Status                Unreviewed
Classification
  Type        Family   E-value    Score   Start   End
  HMT         SET2     1.70e-52   176.7   1946    2062
  HMT         SET1     2.10e-29   102.2   1946    2062
  Me_Reader   PWWP     1.90e-27   95.3    329     1820
  Me_Reader   PHD      5.00e-19   68.4    1547    2166
Organism              Sarcophilus harrisii
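The classification rows can be read programmatically. The following is a minimal Python sketch, assuming each row has the six whitespace-separated columns shown (Type, Family, E-value, Score, Start, End). The `DomainHit` record and the `parse_hits` helper are hypothetical names for illustration and are not part of WERAM.

```python
from typing import NamedTuple


class DomainHit(NamedTuple):
    kind: str      # "Type" column, e.g. HMT or Me_Reader
    family: str    # "Family" column, e.g. SET2
    evalue: float
    score: float
    start: int
    end: int


# Rows copied from the Classification table of this record.
ROWS = """\
HMT SET2 1.70e-52 176.7 1946 2062
HMT SET1 2.10e-29 102.2 1946 2062
Me_Reader PWWP 1.90e-27 95.3 329 1820
Me_Reader PHD 5.00e-19 68.4 1547 2166
"""


def parse_hits(text):
    """Parse whitespace-separated classification rows into DomainHit records."""
    hits = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) != 6:
            continue  # skip headers or malformed rows
        kind, family, evalue, score, start, end = fields
        hits.append(DomainHit(kind, family, float(evalue), float(score),
                              int(start), int(end)))
    return hits


hits = parse_hits(ROWS)
best = min(hits, key=lambda h: h.evalue)  # most significant hit
best_span = best.end - best.start + 1
print(best.family, best_span)
```

Note that Start and End for PWWP and PHD appear to cover several separate hits listed in the Domain Profile (for example PWWP at 329-392 and 1761-1820), so `end - start + 1` is a single-domain length only for the SET rows.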
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSSHAP00000014900.1 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2032
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSSHAP00000014900.1 2033 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*****************************8 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSSHAP00000014900.1 1946 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 2030
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSSHAP00000014900.1 2031 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 2062
*******************************7 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpy 61 
+g+LVwaK ++ pwWP +++ +pl ++ +++ ++y V+ +g+ e awv k++v +
ENSSHAP00000014900.1 329 VGQLVWAKFNRRPWWPSRICCDPLMNTHSkmkVSSQRPYRQYYVEALGDPSEKAWVAGKSIVLF 392
589******************998877776566778899******************9999866 PP
PWWP.txt 3 dLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
+ Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSSHAP00000014900.1 1761 EVVWVKVGRYRWWPAEICHPRAIPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1820
78*****************************************.***************87 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C+k++e ++ C+ C +fHl+C++l+ ++p+g +++C++C++
ENSSHAP00000014900.1 1547 NVCQNCEKVGE----LLLCEAqCCGAFHLECIGLT--EMPKG-KFICKECRT 1591
78999955544....899**99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
+C+vC++++e+ + C C + +H C++ ++ ++k ++C ++
ENSSHAP00000014900.1 1594 HTCFVCKTSGED---VKRCLLplCGKFYHEACIQKYPPTVLQNKGFRCSLHM 1642
58****666666...55899889**********977666666557***9886 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
+C++C+ ++ + ++ C C+ ++H++ C+ + l ++ s +Cp++
ENSSHAP00000014900.1 1642 MCITCHAANPASLSaskgrLMRCVRCPVAYHANdfCLAAGSKVLASN-SIICPNH 1695
7****88888755567788************9655999984444455.9999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSSHAP00000014900.1 1712 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1753
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSSHAP00000014900.1 2124 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 2166
9999..33442..9********************..*****.*****886 PP
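The lines ending in `PP` resemble the posterior-probability annotation of HMMER-style alignment output. Assuming that convention applies here (a digit d stands for roughly d×10% ± 5%, `0` for 0-5%, and `*` for 95-100%), the sketch below flags alignment columns whose assumed lower bound falls under a threshold. The helper names `pp_floor` and `weak_columns` are hypothetical, and the example string is illustrative rather than a full line from this record.

```python
def pp_floor(symbol):
    """Return the assumed lower bound (in percent) for one PP character."""
    if symbol == "*":
        return 95
    if symbol.isdigit():
        digit = int(symbol)
        return 0 if digit == 0 else digit * 10 - 5
    return None  # e.g. '.' in gap columns carries no probability


def weak_columns(pp, threshold=85):
    """Return 0-based column indices whose lower bound is below threshold."""
    weak = []
    for index, symbol in enumerate(pp):
        floor = pp_floor(symbol)
        if floor is not None and floor < threshold:
            weak.append(index)
    return weak


# Illustrative string shaped like the ends of the SET2 PP lines above.
example = "699***8"
print(weak_columns(example))
```

Under this reading, the lower-confidence columns sit mostly at the ends of each hit, which is typical for domain boundaries.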

Protein Sequence
(Fasta)
MDQTCELPRR NCLQPFSNRV DLDAPEEKDT PFGNGQSNIS EPLNGCAIQL SAAINSGTSK 60
ITYGQDPSSC YIPLRRLQDL ASMINVEYLN GSTDGSESLQ DPEKRDSRSG SPNVCTSLSP 120
SDPRALAMKQ EPTCNDSPEF KVKITKTVKN GILHFDNFVC MEDADTDIES EMDTEQPGTE 180
DDSIDETFEK TRQDLTKTNA TCNYQPQSEN GIEVLMGIEQ DSTTESCEIK SPVLPLAPNE 240
MQKNNQRTAM DSRIEKPELH PGPCSLGDTN ITLEEQLNSI NLSFQDDPDS STSTLGNMIE 300
LPGTSSSSTS QELQFCQPKK KPAPLKYGVG QLVWAKFNRR PWWPSRICCD PLMNTHSKMK 360
VSSQRPYRQY YVEALGDPSE KAWVAGKSIV LFEGRHQFEE LPVLRRRGKQ KEKGYRHKVP 420
KKILSKWEAS VGLAEQRDIP KDFKVRKRIR SSTKLDSEED MPFEDCTNDP ESEHERVLNG 480
CLKSLAFDSE TSSDEKVKPC TKSRVRKRST SLKRTSIKKS SMPFEAHNEE RREKMTETLG 540
LNYVSGDVSD KQASTELCRI ASSLTAASNT TTTNYLFSSR GKTTMKKEFE TSNCDSLLGL 600
PESVLISSSS VERKKPLRSM VHSAKSPHLC YISAGDEEKR SDSASLWTTS DDESSDSDLD 660
PVDQSSESDE STLEISGSFD KDESELPLEK NEKLQYSRYH TKNTRVKARK KSIKTSSHLD 720
HLLDCTKTPD PGSEESQGAD SDSKMPIIAR NDNLSNKYTL PQNISHENAF VKSGLSNQPL 780
LQSKHNKQPK ARSIKCKHKE SPTAVDSSVL EEDFSLKCCS NDSKGSSVDC VVKSGKLDSL 840
KLLSNMHEKP RDSAEIETAV VKHVLSELKE LSYRSLNDEI SDSGSPKINK PLLFPSSSGQ 900
SRLPIEPNYK FSTLLMMLKD MHDSKTKEQQ LMTTQNVVSY RNPGLGDCSS SSAAGGSKSL 960
VSGSLTHKAE KNGDCIQDSI CQNPGGSESS LIREPSGSLT GLLTKKRRLT ASGKRRANCI 1020
TRRNCARSKS SKLQDDFLAQ REKDIIESKD LKSEKKKKLN QLPSMTIETA LSECKDVPDS 1080
VNKSLSSRPA DSKRKQSLQL MEHLTGEDCA HFSDVHFDSM VKKSDPDKNP EKCPSFENGE 1140
VLELDSEMNS ESSLSDESND INHVAPKKRW QRFNQSRAKT SKRISRSRRE KNSKSAFGGL 1200
HPCDFLLEGN EGPENRPSSP SKVLEEEIKQ NCEGRLDVVV PQLNVCDKPN SDGLHRESGL 1260
SNFPPQPELS TQDARSEKKR LRRPSRWLLE YTEKYDQIFA PKKKQKKVQE QSKVSSPREN 1320
EAQTTRCQSS TDLQNKPVEE SSSTKEDPPV LEREAPFLER PIGQSELGGG NAELPELTLS 1380
VPITPEVSSW SSLEPEESLL KPRNYESKRQ RKPTKKLLES NDLDPGFMPK KGDLTLLRKC 1440
YNTGPLENDI SGSCTTLTSM DFGEGANKVF EKQRKRKRQR HVPAHCKKVR NEDSSREAQN 1500
SEGEPVTSGT TSSPKDAIEE SVEHDHGMPV SKKIQAERGG GAALKENVCQ NCEKVGELLL 1560
CEAQCCGAFH LECIGLTEMP KGKFICKECR TGIHTCFVCK TSGEDVKRCL LPLCGKFYHE 1620
ACIQKYPPTV LQNKGFRCSL HMCITCHAAN PASLSASKGR LMRCVRCPVA YHANDFCLAA 1680
GSKVLASNSI ICPNHFAPRR GCRNHEHVNV SWCFVCSEGG SLLCCDSCPA AFHRECLNID 1740
IPEGNWYCND CKAGKKPHYR EVVWVKVGRY RWWPAEICHP RAIPSNIDKM RHDVGEFPVL 1800
FFGSNDYLWT HQARVFPYME GDVSSKDKMG KGVDGTYKKA LQEAAVRFEE LKAQKELRQL 1860
QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA DLSEIPRCNC KASDDNPCGI DSECINRMLL 1920
YECHPTVCPA GGRCQNQCFS KRQYPEVEIF RTLQRGWGLR TKTDIKKGEF VNEYVGELID 1980
EEECRARIRY AQEHDITNFY MLTLDKDRII DAGPKGNYAR FMNHCCQPNC ETQKWSVNGD 2040
TRVGLFALSD IKAGTELTFN YNLECLGNGK TVCKCGAPNC SGFLGVRPKN HPNPTEEKSK 2100
KLKRKQQVKR RSQGEITKER EDECFSCGDA GQLVSCKKPG CPKVYHADCL NLTKRPAGKW 2160
ECPWHQCDVC GKEAASFCEM CPSSFCKQHR EGMLFISKLD GRLSCTEHDP CGPNPLEPGE 2220
IREYVPPPVP LTSGANTQLA EQPSEIPAQR PLMVDKAPGA MGQRLQLSEK TLVGTCQRPQ 2280
LSDKPLVMPD STPQSPDKIP GVSGSRPQPL ELGQRPADTS LVVTSPKPQL SDKPPLSGSK 2340
SPPSVRSQLL DRPLVTSPRP QSLDKSLGSL SSKSQQLDRP VVTAGPRLQA SDKSPVTNGP 2400
KPQISDKPPT SDRPLAPLGQ RLPSSEKVLS AVVQNLVSNE KALRPVDQNT RPKDRATMVI 2460
ELSPRQKERA TSPHEITPQP TEKLPVLEQS PWPVNKALGQ MPRAAEKVHP SEPVLQASGR 2520
ASAPVEHTWQ AGKTLIQARL VPRPPAKGYL FEQAPRATGR APVSMEQSSG SFGKALASGE 2580
PMAGSPQPPG LATKATPLMV QSPRPPVKSP DLVPPAEKRS AVTEHPPWAL GKSPAGPTPW 2640
PAGADQPLAQ TCRSPGSPQT LAQTCRPLDK GLIKGPDPKP EQSAVPALNQ TPSSHEPAES 2700
KEK 2703
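The sequence above is printed in blocks of ten residues with a running position counter at the end of each line. A minimal Python sketch for collapsing that layout into a plain string, checking each counter along the way, could look like this. The `join_blocks` helper is a hypothetical name, and only the first two lines of the record are used as sample input.

```python
def join_blocks(lines):
    """Join 10-residue blocks into one sequence, validating line counters."""
    blocks = []
    for line in lines:
        parts = line.split()
        if not parts:
            continue  # ignore blank lines
        *residue_blocks, counter = parts
        blocks.extend(residue_blocks)
        total = sum(len(block) for block in blocks)
        if total != int(counter):
            raise ValueError(f"counter {counter} does not match length {total}")
    return "".join(blocks)


sample = [
    "MDQTCELPRR NCLQPFSNRV DLDAPEEKDT PFGNGQSNIS EPLNGCAIQL SAAINSGTSK 60",
    "ITYGQDPSSC YIPLRRLQDL ASMINVEYLN GSTDGSESLQ DPEKRDSRSG SPNVCTSLSP 120",
]
sequence = join_blocks(sample)
print(len(sequence), sequence[:10])
```

The same function should work on the nucleotide listing, since it uses the same block-and-counter layout.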
Nucleotide Sequence
(Fasta)
ATGGATCAGA CCTGTGAACT GCCCAGAAGA AATTGTCTGC AGCCCTTTTC TAACAGAGTG 60
GATTTAGATG CCCCTGAAGA AAAGGACACC CCTTTCGGTA ATGGTCAATC CAATATTTCT 120
GAGCCTCTTA ATGGGTGTGC TATACAATTA TCGGCTGCCA TTAATAGTGG AACATCCAAA 180
ATTACTTATG GACAAGATCC ATCATCTTGT TACATTCCAC TGCGGCGACT ACAGGATTTG 240
GCCTCCATGA TCAATGTAGA ATATTTAAAT GGGTCTACTG ATGGATCAGA ATCCCTTCAA 300
GACCCTGAAA AGAGAGATTC AAGATCTGGG TCACCAAATG TTTGCACTTC CTTGAGTCCT 360
AGTGATCCAA GAGCACTTGC TATGAAACAG GAACCCACTT GTAATGACTC CCCTGAGTTC 420
AAGGTAAAAA TAACAAAGAC TGTCAAGAAT GGGATTCTAC ACTTTGACAA TTTTGTTTGT 480
ATGGAAGATG CAGATACAGA TATAGAGTCT GAAATGGACA CAGAGCAGCC AGGCACAGAG 540
GATGACAGTA TAGACGAAAC CTTTGAAAAA ACAAGGCAAG ACCTAACTAA GACAAATGCT 600
ACCTGCAATT ACCAGCCCCA ATCAGAGAAT GGTATAGAAG TGCTCATGGG AATTGAACAA 660
GACAGCACAA CAGAAAGTTG TGAAATAAAG TCACCAGTTT TACCATTAGC TCCAAACGAA 720
ATGCAGAAAA ATAATCAAAG AACTGCCATG GACAGCAGAA TTGAAAAGCC AGAGCTTCAC 780
CCAGGCCCCT GTTCACTAGG AGATACAAAT ATTACATTAG AAGAACAATT AAACTCCATA 840
AATTTATCTT TTCAGGATGA TCCAGACTCC AGTACCAGTA CGTTAGGAAA CATGATAGAA 900
TTACCTGGAA CATCATCATC ATCTACTTCA CAAGAATTAC AATTTTGTCA ACCCAAGAAA 960
AAGCCTGCTC CCTTGAAATA CGGAGTTGGA CAACTTGTCT GGGCAAAATT CAACAGACGT 1020
CCATGGTGGC CCAGCAGAAT TTGTTGCGAT CCATTAATGA ACACTCACTC AAAAATGAAA 1080
GTTTCCAGCC AAAGGCCTTA TCGCCAATAC TACGTGGAAG CTTTAGGAGA TCCTTCTGAA 1140
AAAGCGTGGG TGGCAGGAAA ATCGATTGTC CTATTTGAAG GAAGACATCA GTTTGAAGAG 1200
CTTCCTGTAT TGAGGAGAAG AGGCAAACAG AAAGAAAAAG GCTATAGACA TAAGGTTCCT 1260
AAGAAAATCT TGAGTAAGTG GGAAGCCAGC GTTGGACTTG CTGAGCAGCG AGATATTCCT 1320
AAGGATTTTA AGGTCAGAAA GCGTATCAGA AGTTCAACGA AACTAGACAG CGAAGAAGAT 1380
ATGCCTTTTG AGGACTGTAC AAACGATCCT GAATCGGAGC ATGAACGAGT ACTGAATGGC 1440
TGCCTGAAAT CACTAGCCTT TGACTCTGAA ACCTCATCAG ATGAAAAGGT GAAGCCGTGC 1500
ACTAAATCTC GAGTCAGAAA GAGATCCACA AGCCTCAAAA GGACTAGTAT TAAAAAAAGC 1560
AGCATGCCAT TTGAAGCTCA TAATGAAGAA AGAAGGGAAA AGATGACAGA GACTCTTGGC 1620
CTGAATTATG TTTCTGGAGA TGTGTCCGAT AAGCAGGCTT CTACTGAACT ATGCAGAATA 1680
GCAAGCAGTC TTACAGCAGC ATCCAACACC ACCACCACAA ATTACTTGTT TTCTTCACGT 1740
GGGAAAACCA CTATGAAAAA AGAGTTTGAG ACTTCAAATT GTGATTCTCT GCTTGGTTTG 1800
CCTGAGAGTG TTTTGATTTC TAGCAGTTCT GTGGAGAGAA AGAAACCTCT CCGCAGTATG 1860
GTTCATAGTG CAAAGTCACC CCATCTGTGC TATATCAGTG CTGGTGATGA GGAAAAGAGA 1920
AGTGATTCAG CCAGTTTATG GACCACCTCT GATGATGAGA GCAGTGACAG TGATCTGGAC 1980
CCAGTTGACC AAAGTTCTGA GTCTGATGAA AGTACTCTAG AAATAAGTGG ATCTTTTGAT 2040
AAAGATGAAA GTGAGTTACC TTTGGAGAAA AATGAGAAGC TTCAATATTC CAGGTATCAC 2100
ACAAAGAATA CTAGGGTAAA AGCAAGAAAG AAGTCTATAA AAACTAGTTC ACACCTAGAC 2160
CACTTGTTAG ATTGTACTAA AACTCCTGAC CCTGGGAGTG AAGAATCTCA AGGTGCTGAT 2220
TCTGATTCCA AAATGCCCAT CATCGCTCGA AACGACAATC TTTCCAACAA ATACACCCTG 2280
CCACAAAACA TATCCCATGA GAATGCTTTC GTAAAGAGTG GGCTTAGTAA TCAACCTCTG 2340
TTACAATCAA AACATAACAA ACAGCCTAAA GCCAGAAGTA TAAAATGCAA ACACAAAGAG 2400
AGCCCAACTG CAGTAGATTC CTCAGTTTTA GAGGAGGATT TCAGCTTAAA GTGCTGTTCT 2460
AATGATTCGA AAGGGTCTTC TGTAGACTGT GTTGTCAAAA GTGGAAAGCT TGATAGTCTA 2520
AAACTACTGA GCAACATGCA TGAGAAACCC AGAGATTCTG CGGAAATTGA AACCGCTGTT 2580
GTAAAACATG TTTTGTCAGA GTTGAAAGAA CTGTCTTATA GATCCCTAAA TGATGAAATA 2640
AGTGACTCTG GTTCACCTAA AATAAACAAA CCTTTACTTT TTCCTTCTTC ATCTGGTCAA 2700
AGTCGTTTGC CAATTGAGCC AAATTACAAA TTCAGCACCT TATTGATGAT GTTGAAAGAT 2760
ATGCATGATA GTAAGACTAA GGAACAGCAG CTAATGACCA CTCAAAATGT GGTCTCCTAT 2820
CGTAATCCTG GCCTTGGAGA CTGTTCCAGC AGCAGTGCTG CAGGTGGCTC AAAGTCCCTA 2880
GTTTCAGGAA GCCTGACTCA TAAGGCTGAA AAAAATGGAG ATTGTATTCA GGACTCAATC 2940
TGTCAAAACC CTGGTGGGAG TGAGTCTTCT CTCATCCGAG AGCCATCTGG ATCTTTAACT 3000
GGATTACTCA CTAAGAAAAG AAGGCTCACT GCTTCTGGCA AACGTCGAGC AAACTGTATT 3060
ACTCGGAGAA ATTGCGCGAG ATCCAAGTCA TCCAAACTGC AGGATGACTT TTTGGCACAA 3120
AGAGAAAAGG ACATTATAGA AAGCAAAGAC TTGAAATCAG AAAAGAAAAA GAAACTAAAC 3180
CAACTACCCA GTATGACCAT TGAGACTGCA CTGAGTGAAT GCAAAGATGT CCCAGATTCA 3240
GTAAATAAGT CTCTAAGCAG CAGGCCTGCA GATTCGAAAA GAAAGCAGTC TCTTCAATTA 3300
ATGGAACATT TAACAGGTGA AGACTGTGCA CATTTTTCTG ATGTCCATTT TGATTCCATG 3360
GTTAAAAAAT CTGACCCTGA TAAAAATCCT GAAAAGTGCC CCTCTTTTGA AAACGGGGAA 3420
GTCCTAGAGC TGGACTCTGA AATGAACAGT GAAAGCTCCC TGAGTGATGA ATCTAATGAT 3480
ATAAACCATG TGGCACCCAA AAAGCGGTGG CAGCGTTTTA ACCAAAGCAG AGCTAAAACC 3540
AGTAAGCGCA TCAGTAGATC TAGGAGGGAA AAGAACTCAA AGAGTGCCTT TGGGGGCCTA 3600
CACCCTTGTG ACTTTCTGCT GGAGGGGAAT GAGGGCCCAG AGAATAGGCC TTCCAGCCCT 3660
TCCAAGGTGC TAGAGGAAGA AATTAAACAG AATTGTGAGG GTCGCTTAGA TGTAGTTGTG 3720
CCACAGTTGA ATGTGTGTGA TAAGCCAAAC AGTGATGGGC TGCACAGAGA ATCAGGGCTT 3780
TCCAATTTTC CTCCACAGCC TGAGCTCTCC ACACAAGATG CACGCTCAGA GAAGAAACGT 3840
CTTAGGAGGC CAAGCAGATG GCTTCTGGAA TATACAGAAA AATATGACCA GATATTTGCT 3900
CCCAAGAAAA AACAAAAGAA GGTACAGGAA CAGTCTAAGG TAAGTTCCCC AAGAGAAAAT 3960
GAAGCCCAGA CTACACGGTG TCAATCTAGC ACTGATCTCC AGAATAAACC AGTGGAAGAG 4020
AGCTCTTCAA CCAAAGAAGA TCCTCCAGTC CTTGAGAGAG AGGCACCATT TTTGGAAAGA 4080
CCAATAGGTC AGTCTGAACT TGGAGGTGGA AATGCAGAGT TGCCAGAACT GACCTTGTCT 4140
GTGCCAATCA CACCTGAAGT TTCTTCTTGG TCTTCACTGG AACCAGAGGA GTCATTATTG 4200
AAACCAAGAA ATTATGAAAG TAAGCGTCAG AGGAAGCCAA CAAAAAAGCT CCTTGAATCC 4260
AATGATTTAG ATCCTGGATT TATGCCCAAG AAAGGGGATT TGACACTCCT TAGAAAGTGT 4320
TATAATACTG GTCCCTTGGA GAATGATATT TCTGGATCAT GTACAACGCT CACATCTATG 4380
GATTTTGGTG AAGGTGCAAA CAAAGTTTTT GAAAAGCAAA GGAAGCGAAA GAGACAGAGG 4440
CATGTACCAG CACATTGTAA GAAAGTGAGA AATGAAGACT CTTCACGGGA GGCTCAAAAC 4500
TCTGAGGGAG AACCAGTTAC TTCTGGGACT ACCTCAAGCC CCAAAGATGC CATAGAAGAG 4560
AGTGTGGAAC ATGATCATGG GATGCCTGTA TCTAAAAAAA TACAGGCTGA ACGTGGTGGA 4620
GGAGCAGCTC TCAAGGAAAA TGTTTGTCAG AACTGTGAGA AAGTGGGTGA GTTGTTGCTA 4680
TGTGAGGCTC AATGTTGTGG TGCCTTCCAC CTGGAATGTA TTGGCCTGAC AGAGATGCCA 4740
AAAGGCAAAT TTATCTGCAA GGAATGTCGA ACAGGAATTC ATACCTGCTT TGTGTGTAAA 4800
ACCAGTGGAG AAGATGTCAA AAGGTGTTTG CTACCTCTTT GTGGAAAGTT TTACCATGAA 4860
GCCTGTATAC AGAAATATCC GCCAACTGTC CTACAGAACA AGGGTTTTCG GTGCTCCCTT 4920
CACATGTGTA TAACCTGTCA TGCTGCTAAT CCAGCTAGTC TTTCTGCATC TAAAGGTCGC 4980
CTAATGCGCT GTGTCCGGTG TCCTGTGGCT TACCATGCCA ATGACTTCTG TCTGGCTGCA 5040
GGATCAAAGG TCCTTGCATC AAATAGCATC ATCTGCCCTA ATCATTTTGC TCCTCGGAGG 5100
GGTTGCAGAA ATCATGAACA TGTTAATGTT AGCTGGTGTT TTGTATGCTC AGAAGGGGGC 5160
AGTCTTTTGT GCTGTGACTC TTGCCCTGCT GCATTTCATC GAGAATGTCT GAATATTGAT 5220
ATCCCAGAGG GCAACTGGTA TTGCAATGAC TGTAAGGCAG GCAAAAAGCC ACACTATCGA 5280
GAAGTTGTCT GGGTAAAAGT TGGCCGATAT AGGTGGTGGC CAGCTGAAAT TTGTCATCCT 5340
CGAGCTATTC CTTCCAATAT TGATAAGATG AGACATGATG TTGGTGAATT TCCAGTTTTA 5400
TTTTTTGGCT CTAATGATTA CTTATGGACC CATCAGGCTC GTGTCTTTCC ATATATGGAA 5460
GGAGATGTCA GTAGCAAGGA CAAAATGGGG AAAGGTGTTG ATGGGACATA TAAAAAAGCT 5520
CTTCAGGAAG CTGCTGTCCG TTTTGAGGAA TTAAAAGCCC AGAAAGAACT AAGACAGCTT 5580
CAGGAAGACA GAAAAAATGA TAAGAAACCA CCACCTTACA AACATATCAA GGTGAACCGT 5640
CCAATAGGCA GGGTTCAGAT CTTTACTGCA GACCTGTCAG AGATTCCCCG ATGTAACTGT 5700
AAAGCTTCAG ATGACAACCC CTGTGGCATC GATTCAGAAT GCATTAACCG AATGCTCCTG 5760
TATGAGTGCC ATCCCACTGT CTGTCCTGCG GGCGGACGCT GCCAGAACCA ATGCTTCTCA 5820
AAACGCCAGT ATCCTGAAGT TGAAATCTTT CGTACCTTAC AGCGAGGCTG GGGCTTACGG 5880
ACAAAAACAG ATATTAAAAA GGGTGAATTT GTAAATGAGT ATGTGGGAGA GCTGATAGAT 5940
GAAGAGGAAT GTAGAGCTCG AATCCGTTAT GCCCAAGAAC ACGACATCAC CAATTTCTAC 6000
ATGCTCACTT TGGATAAGGA CCGGATTATT GATGCTGGTC CCAAAGGAAA TTATGCCCGG 6060
TTTATGAATC ACTGTTGCCA ACCCAACTGT GAAACTCAGA AGTGGTCTGT GAATGGAGAT 6120
ACCCGAGTTG GGCTTTTTGC CCTAAGTGAC ATCAAAGCAG GCACTGAACT CACCTTTAAC 6180
TACAATCTAG AGTGTCTTGG GAATGGAAAA ACTGTTTGCA AATGTGGAGC ACCAAATTGT 6240
AGCGGTTTCT TGGGTGTTCG GCCAAAGAAT CATCCCAATC CCACAGAAGA GAAATCCAAA 6300
AAACTAAAAA GAAAACAACA GGTAAAGCGT CGATCCCAGG GTGAGATCAC AAAAGAAAGA 6360
GAAGATGAAT GTTTCAGCTG TGGAGATGCA GGGCAGCTTG TATCCTGTAA GAAACCAGGT 6420
TGTCCTAAAG TCTATCATGC TGACTGCCTC AACTTGACAA AAAGGCCTGC AGGGAAATGG 6480
GAATGTCCAT GGCACCAATG TGATGTTTGT GGAAAAGAAG CTGCATCTTT TTGTGAGATG 6540
TGTCCTAGTT CCTTTTGTAA GCAACATCGA GAGGGAATGC TGTTCATCTC TAAACTGGAT 6600
GGTCGCCTGT CTTGTACAGA GCATGATCCA TGTGGCCCCA ACCCTCTGGA GCCTGGGGAG 6660
ATCCGTGAGT ACGTGCCTCC CCCTGTGCCG CTGACTTCAG GTGCAAACAC TCAGCTAGCA 6720
GAGCAGCCTT CTGAGATTCC TGCTCAAAGA CCCCTGATGG TGGACAAAGC CCCTGGGGCA 6780
ATGGGCCAAA GGCTCCAGCT GTCAGAGAAA ACATTAGTAG GAACATGTCA GAGGCCACAG 6840
TTGTCTGATA AACCACTTGT GATGCCTGAC TCCACGCCCC AGTCACCAGA TAAGATCCCT 6900
GGAGTATCAG GGTCAAGACC TCAGCCCTTA GAATTAGGCC AGAGGCCAGC AGACACATCT 6960
CTTGTAGTGA CAAGCCCCAA ACCTCAACTG TCTGATAAAC CTCCATTGTC AGGTTCCAAA 7020
TCTCCACCAT CAGTTAGATC CCAACTCTTG GACAGACCTC TAGTAACCAG CCCAAGGCCT 7080
CAATCTCTGG ATAAGTCCTT GGGTTCTCTC AGCTCAAAGT CTCAGCAATT AGATAGGCCT 7140
GTAGTTACTG CTGGACCAAG ACTCCAGGCA TCAGACAAGT CCCCAGTTAC AAATGGCCCA 7200
AAGCCCCAGA TCTCAGACAA GCCCCCAACC TCAGACAGGC CCCTTGCCCC TTTGGGCCAG 7260
AGACTCCCAT CTTCAGAAAA AGTGCTGTCA GCTGTGGTCC AGAACCTTGT ATCTAATGAA 7320
AAAGCACTAA GGCCTGTGGA CCAAAATACT CGGCCAAAAG ATCGAGCTAC TATGGTTATT 7380
GAACTGAGTC CTCGTCAAAA GGAACGAGCA ACTTCACCTC ATGAAATTAC ACCCCAGCCC 7440
ACTGAAAAAT TACCAGTGCT GGAGCAGAGT CCCTGGCCTG TTAACAAAGC ACTGGGACAA 7500
ATGCCTCGGG CAGCTGAGAA AGTTCACCCT TCTGAACCAG TCCTCCAAGC ATCTGGAAGA 7560
GCTTCAGCCC CTGTGGAACA CACCTGGCAG GCTGGCAAAA CACTCATACA AGCCAGACTC 7620
GTCCCGCGGC CCCCTGCCAA GGGTTACTTG TTTGAACAGG CTCCTCGGGC CACAGGACGT 7680
GCACCTGTGT CAATGGAGCA GAGCTCTGGG TCCTTCGGCA AAGCCTTAGC CTCAGGAGAA 7740
CCCATGGCTG GATCCCCTCA ACCCCCAGGG CTTGCCACAA AAGCAACACC ATTGATGGTG 7800
CAGTCCCCTA GGCCCCCTGT CAAATCACCA GACCTTGTTC CCCCAGCAGA GAAACGGTCA 7860
GCAGTGACAG AGCACCCCCC CTGGGCCCTG GGGAAATCCC CAGCAGGCCC CACTCCCTGG 7920
CCTGCAGGGG CAGACCAGCC ACTGGCACAG ACTTGTCGGT CACCTGGGAG TCCACAGACA 7980
TTGGCACAGA CTTGTCGGCC CCTTGACAAA GGGCTAATCA AAGGGCCAGA CCCTAAGCCT 8040
GAGCAAAGTG CAGTACCAGC TCTTAACCAG ACCCCTTCCA GCCATGAGCC TGCAGAATCG 8100
AAAGAGAAGT GA 8112
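As a consistency check between the two listings, the coding sequence can be translated and compared with the protein. The sketch below assumes the standard genetic code (NCBI translation table 1) and uses only the first three and the last two blocks of the nucleotide listing; `translate` is a hypothetical helper name. A 2703-residue protein plus one stop codon corresponds to 3 × 2704 = 8112 nucleotides.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna):
    """Translate a coding sequence; spaces between blocks are ignored."""
    dna = dna.replace(" ", "").upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


first = translate("ATGGATCAGA CCTGTGAACT GCCCAGAAGA")  # start of the CDS
last = translate("AAAGAGAAGT GA")                      # end of the CDS
expected_cds_length = 3 * (2703 + 1)                   # residues plus stop
print(first, last, expected_cds_length)
```

The translated ends match the start (`MDQTCELPRR`) and end (`KEK`) of the protein sequence, with `*` marking the stop codon.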
Sequence Source Ensembl
Orthology
WERAM ID         Ensembl Protein ID                Species                     Identity (%)  E-value  Score
WERAM-Mod-0041   ENSMODP00000005995.3              Monodelphis domestica       91            0.0      4122
WERAM-Fec-0017   ENSFCAP00000001421.3              Felis catus                 70            0.0      3532
WERAM-Aim-0211   ENSAMEP00000019234.1              Ailuropoda melanoleuca      70            0.0      3520
WERAM-Mup-0139   ENSMPUP00000012178.1              Mustela putorius furo       70            0.0      3511
WERAM-Caf-0170   ENSCAFP00000024244.3              Canis familiaris            70            0.0      3504
WERAM-Bot-0197   ENSBTAP00000034104.4              Bos taurus                  70            0.0      3502
WERAM-Pat-0143   ENSPTRP00000044805.4              Pan troglodytes             70            0.0      3479
WERAM-Myl-0080   ENSMLUP00000006392.2              Myotis lucifugus            70            0.0      3477
WERAM-Orc-0022   ENSOCUP00000002408.2              Oryctolagus cuniculus       70            0.0      3476
WERAM-Hos-0181   ENSP00000395929.2                 Homo sapiens                70            0.0      3467
WERAM-Mam-0074   ENSMMUP00000011851.2              Macaca mulatta              69            0.0      3465
WERAM-Dan-0110   ENSDNOP00000013609.3              Dasypus novemcinctus        69            0.0      3431
WERAM-Paa-0116   ENSPANP00000000470.1              Papio anubis                68            0.0      3353
WERAM-Mum-0037   ENSMUSP00000097089.2              Mus musculus                66            0.0      3231
WERAM-Ova-0029   ENSOARP00000004108.1              Ovis aries                  70            0.0      3112
WERAM-Poa-0143   ENSPPYP00000018000.2              Pongo abelii                69            0.0      3073
WERAM-Nol-0141   ENSNLEP00000015392.2              Nomascus leucogenys         69            0.0      3071
WERAM-Chs-0105   ENSCSAP00000005185.1              Chlorocebus sabaeus         69            0.0      3070
WERAM-Eqc-0074   ENSECAP00000009434.1              Equus caballus              69            0.0      3047
WERAM-Ict-0147   ENSSTOP00000023043.1              Ictidomys tridecemlineatus  69            0.0      3038
WERAM-Sus-0103   ENSSSCP00000014933.2              Sus scrofa                  69            0.0      3033
WERAM-Loa-0062   ENSLAFP00000004572.4              Loxodonta africana          69            0.0      2999
WERAM-Caj-0079   ENSCJAP00000013142.2              Callithrix jacchus          71            0.0      2935
WERAM-Otg-0116   ENSOGAP00000010251.2              Otolemur garnettii          68            0.0      2908
WERAM-Gog-0133   ENSGGOP00000011464.2              Gorilla gorilla             68            0.0      2900
WERAM-Tas-0005   ENSTSYP00000000565.1              Tarsius syrichta            71            0.0      2860
WERAM-Ran-0107   ENSRNOP00000057648.2              Rattus norvegicus           67            0.0      2856
WERAM-Tut-0110   ENSTTRP00000009069.1              Tursiops truncatus          68            0.0      2838
WERAM-Meg-0043   ENSMGAP00000004050.2              Meleagris gallopavo         58            0.0      2312
WERAM-Anp-0015   ENSAPLP00000002635.1              Anas platyrhynchos          59            0.0      2298
WERAM-Gaga-0030  ENSGALP00000004690.4              Gallus gallus               58            0.0      2285
WERAM-Fia-0106   ENSFALP00000009080.1              Ficedula albicollis         60            0.0      2180
WERAM-Soa-0066   ENSSARP00000006577.1              Sorex araneus               64            0.0      2110
WERAM-Cap-0095   ENSCPOP00000007137.2              Cavia porcellus             71            0.0      2090
WERAM-Pes-0121   ENSPSIP00000014467.1              Pelodiscus sinensis         60            0.0      2062
WERAM-Ocp-0021   ENSOPRP00000001916.1              Ochotona princeps           69            0.0      1872
WERAM-Ptv-0030   ENSPVAP00000003619.1              Pteropus vampyrus           63            0.0      1726
WERAM-Ora-0074   ENSOANP00000011168.1              Ornithorhynchus anatinus    81            0.0      1598
WERAM-Anc-0055   ENSACAP00000005495.2              Anolis carolinensis         59            0.0      1510
WERAM-Mae-0019   ENSMEUP00000001571.1              Macropus eugenii            85            0.0      1484
WERAM-Prc-0109   ENSPCAP00000009738.1              Procavia capensis           76            0.0      1403
WERAM-Lac-0101   ENSLACP00000012395.1              Latimeria chalumnae         66            0.0      1397
WERAM-Tag-0006   ENSTGUP00000000444.1              Taeniopygia guttata         90            0.0      1284
WERAM-Dar-0128   ENSDARP00000078549.4              Danio rerio                 73            0.0      1231
WERAM-Vip-0024   ENSVPAP00000002299.1              Vicugna pacos               68            0.0      1217
WERAM-Pof-0001   ENSPFOP00000000016.1              Poecilia formosa            68            0.0      1212
WERAM-Dio-0082   ENSDORP00000008270.1              Dipodomys ordii             71            0.0      1201
WERAM-Tar-0055   ENSTRUP00000012244.1              Takifugu rubripes           78            0.0      1199
WERAM-Xim-0228   ENSXMAP00000018920.1              Xiphophorus maculatus       78            0.0      1144
WERAM-Gaa-0222   ENSGACP00000027499.1              Gasterosteus aculeatus      79            0.0      1083
WERAM-Orla-0054  ENSORLP00000006946.1              Oryzias latipes             76            0.0      1038
WERAM-Mim-0043   ENSMICP00000003979.1              Microcebus murinus          65            0.0      1008
WERAM-Leo-0066   ENSLOCP00000008456.1              Lepisosteus oculatus        66            0.0      998
WERAM-Xet-0165   ENSXETP00000005100.3              Xenopus tropicalis          64            0.0      979
WERAM-Tub-0107   ENSTBEP00000012130.1              Tupaia belangeri            64            0.0      962
WERAM-Gam-0141   ENSGMOP00000014242.1              Gadus morhua                72            0.0      960
WERAM-Asm-0131   ENSAMXP00000012840.1              Astyanax mexicanus          63            0.0      950
WERAM-Orn-0148   ENSONIP00000015592.1              Oreochromis niloticus       64            0.0      942
WERAM-Ten-0149   ENSTNIP00000015371.1              Tetraodon nigroviridis      64            0.0      935
WERAM-Chh-0111   ENSCHOP00000012034.1              Choloepus hoffmanni         57            0.0      920
WERAM-Pem-0013   ENSPMAP00000002102.1              Petromyzon marinus          57            0.0      906
WERAM-Ere-0029   ENSEEUP00000002285.1              Erinaceus europaeus         57            0.0      854
WERAM-Ect-0036   ENSETEP00000003241.1              Echinops telfairi           58            0.0      853
WERAM-Cii-0053   ENSCINP00000025830.2              Ciona intestinalis          56            0.0      649
WERAM-Drm-0072   FBpp0084636                       Drosophila melanogaster     36            4e-119   428
WERAM-Cis-0083   ENSCSAVP00000018342.1             Ciona savignyi              32            5e-53    209
WERAM-Art-0032   AT1G77300.1                       Arabidopsis thaliana        42            3e-51    203
WERAM-Bro-0120   Bo6g121240.1                      Brassica oleracea           42            2e-50    200
WERAM-Brr-0078   Bra015678.1-P                     Brassica rapa               42            2e-50    200
WERAM-Pot-0069   POPTR_0005s21720.1                Populus trichocarpa         41            5e-50    198
WERAM-Arl-0052   fgenesh2_kg.2__2024__AT1G77300.1  Arabidopsis lyrata          43            5e-50    198
WERAM-Glm-0058   GLYMA06G12391.1                   Glycine max                 43            6e-49    195
WERAM-Viv-0116   VIT_18s0072g00220.t01             Vitis vinifera              49            5e-48    192
WERAM-Sol-0089   Solyc07g008460.2.1                Solanum lycopersicum        41            5e-48    192
WERAM-Sot-0073   PGSC0003DMT400059166              Solanum tuberosum           41            7e-48    191
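Because species names contain spaces, an orthology row cannot be split naively on whitespace. One workable approach, sketched below in Python, is to peel the three numeric columns off the right and the two identifiers off the left, leaving the species name in the middle. The `Ortholog` record and `parse_ortholog` helper are hypothetical names, the identity column is assumed to be a percentage, and the sketch assumes protein identifiers contain no spaces.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int   # assumed to be percent identity
    evalue: float
    score: int


def parse_ortholog(row):
    """Split one orthology row, keeping multi-word species names intact."""
    head, identity, evalue, score = row.rsplit(None, 3)
    weram_id, protein_id, species = head.split(None, 2)
    return Ortholog(weram_id, protein_id, species,
                    int(identity), float(evalue), int(score))


# Sample rows copied from the Orthology table of this record.
rows = [
    "WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 91 0.0 4122",
    "WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 70 0.0 3511",
    "WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 36 4e-119 428",
]
orthologs = [parse_ortholog(row) for row in rows]
close = [o.species for o in orthologs if o.identity >= 70]
print(close)
```

Splitting from both ends also tolerates rows padded with extra spaces for column alignment.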
Created Date 25-Jun-2016