WERAM Information


Tag Content
WERAM ID WERAM-Mod-0041
Ensembl Protein ID ENSMODP00000005995.3
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMODG00000004870.3 ENSMODT00000006122.3 ENSMODP00000005995.3
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.20e-49 167.5 1649 1770
HMT SET1 1.60e-28 99.3 1649 1770
Me_Reader PWWP 2.30e-27 95 54 1523
Me_Reader PHD 6.00e-16 58.5 1272 1874
Organism Monodelphis domestica
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldk.....deviDatkkGnlaRfinhsCe 83  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldk d++iDa kGn+aRf+nh+C+
ENSMODP00000005995.3 1649 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKastenDRIIDAGPKGNYARFMNHCCQ 1735
699********************************************************8533333789****************** PP
SET2.txt 84 PncetqkwtvegelrvglfakkkikkgeeltfdYn 118
Pncetqkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSMODP00000005995.3 1736 PNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1770
**********************************8 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedae...vvvdatkkgniarfinhsce 84  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+ ++ ++da kgn+arf+nh+c+
ENSMODP00000005995.3 1649 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKASTendRIIDAGPKGNYARFMNHCCQ 1735
6777788889************************99999988887777778789**99998765667******************** PP
SET1.txt 85 pNceakvvavdgekkiviyakraIekgeeltydYk 119
pNce++ +v+g+++++++a +I++g+elt++Y+
ENSMODP00000005995.3 1736 PNCETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1770
**********************************7 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpy 61 
+g+LVwaK ++ pwWP +++ +pl ++ +++ ++y V+ +g+ e awv k++v +
ENSMODP00000005995.3 54 VGQLVWAKFNRRPWWPSRICCDPLINTHSkmkVSSQRPYRQYYVEALGDPSEKAWVAGKSIVLF 117
589******************998766665466778999******************9999866 PP
PWWP.txt 3 dLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
+ Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSMODP00000005995.3 1464 EVVWVKVGRYRWWPAEICHPRAIPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1523
78*****************************************.***************87 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C+k++e ++ C+ C +fHl+C++l+ ++p+g +++C++C++
ENSMODP00000005995.3 1272 NVCQNCEKVGE----LLLCEAqCCGAFHLECIGLT--EMPKG-KFICKECRT 1316
78999955544....899**99*************..*****.*******96 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
+C++C+ ++ + ++ C C+ ++H++ C+ + l ++ s +Cp++
ENSMODP00000005995.3 1345 MCITCHAANPASLSaskgrLMRCVRCPVAYHANdfCLAAGSKVLASN-SIICPNH 1398
68888766666544566678889999999998555888884444444.8899887 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSMODP00000005995.3 1415 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1456
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSMODP00000005995.3 1832 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 1874
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MPLKKRTPLS DDPESSTSTL GNMIELPGTS SSSTSQELQF CQPKKKPAPL KYGVGQLVWA 60
KFNRRPWWPS RICCDPLINT HSKMKVSSQR PYRQYYVEAL GDPSEKAWVA GKSIVLFEGR 120
HQFEELPVLR RRGKQKEKGY RHKVPKKILS KWEASVGLAE QRDVPKDFKV RKRIRTSAKL 180
DSEEDMPFED CTNDPESEHE RVLNGCLKSL AFDSETSSDE KVKPCTKSRV RKRSTSLKRT 240
SIKKSSSAFE SHKEERREKI TETLGLNYIS SDVSDKQAST ELCRIASSLS AASNTTTNYL 300
FSSCGKTTTK KEFETSNCDS LLGLSESVLI SSSSVEIKKP LRSMVHSAKP SRLCYISAGD 360
EEKRSDSASL WTTSDEESSD SDLDLIDQSS ESDGSALEIS GSFEKNESES PMEKNEKVQY 420
SRYRTKNSRV KARKRSRKTS SHLDHLLDCT KAPEPGSEES QGADSDSKMP IIARNDNLST 480
KYTMPQSISH ENSFVKSGLT NQPLLQSKHN KQPKARSIKC KHKESPTAEQ PSVLDEDFSL 540
KCCSSDSKGS PLDCVVNSGK LDSLKLLSNM HEKPRDPAEI ETVVVKHVLS ELKELSYRSL 600
NDDLNDSGSS KINKPLLFPS SSGPTCLPIE PNYKFSTLLM MLKDMHDSKT KEQQLMTTQN 660
VVSYRSPGLG DCSSSSAAGG SKSLVSGGLT HKTEKNGNCI QDSVCQNPGG SESSLTQESS 720
GSLTGLLSKK RRLTASSRRR ANSITRRNCA RSKSSKLQDG FLAQMGKDTI ESKDLKSEKK 780
RKLNQPPNMT IETALGECKE APTSVNKSLS SGPADSKRKQ SLQLMEHLTG EDCAHFSDVH 840
FDPMIKKPET DKNPEKCPSF ENGEVLELDS EMNSESSLSD ESNDVNHVAP KKRWQRFNQS 900
RAKTSKRISR SSREKNSKSA FGGLHPCDFL LEGNEGPEDR PSSPSKVLEE EKKQKCEGHL 960
DIVVPQLNVC DKPNSDGLPR ESGLSNFPPQ PELNTQVAHS EKKRLRKPSK WLLEYTEKYD 1020
QIFAPKKKQK KVQEQLPKVS SLKENEVQTT QCQPSTDLQN KPVEESSSTK EDPPVLEREA 1080
PFLEGPIGQS ELGGGNAELP ELTLSVPITR EVSSWSSLDS EEPSLKPRNY ESKRQRKPTK 1140
KLLESNDLDP GFMPKKGDLT LLRKCYNTGH LENDISGSCT TLTSLEFGEG TNKVFEKQRK 1200
RKRQRHVAAH CKKVRNEDSS REAQNSEGEP ITPGTTASPK DSIEESVEHD HGMPVSKKMQ 1260
AERGGGAALK ENVCQNCEKV GELLLCEAQC CGAFHLECIG LTEMPKGKFI CKECRTGIHT 1320
CFVCKDGEDV KRCLLPLCFR CSLHMCITCH AANPASLSAS KGRLMRCVRC PVAYHANDFC 1380
LAAGSKVLAS NSIICPNHFA PRRGCRNHEH VNVSWCFVCS EGGSLLCCDS CPAAFHRECL 1440
NIDIPEGNWY CNDCKAGKKP HYREVVWVKV GRYRWWPAEI CHPRAIPSNI DKMRHDVGEF 1500
PVLFFGSNDY LWTHQARVFP YMEGDVSSKD KMGKGVDGTY KKALQEAAVR FEELKAQKEL 1560
RQLQEDRKND KKPPPYKHIK VNRPIGRVQI FTADLSEIPR CNCKASDENP CGIDSECINR 1620
MLLYECHPTV CPAGGRCQNQ CFSKRQYPEV EIFRTLQRGW GLRTKTDIKK GEFVNEYVGE 1680
LIDEEECRAR IRYAQEHDIT NFYMLTLDKA STENDRIIDA GPKGNYARFM NHCCQPNCET 1740
QKWSVNGDTR VGLFALSDIK AGTELTFNYN LECLGNGKTV CKCGAPNCSG FLGVRPKNHP 1800
NPTEEKSKKL KRRQQVKRRS QGEITKERED ECFSCGDAGQ LVSCKKPGCP KVYHADCLNL 1860
TKRPAGKWEC PWHQCDVCGK EAASFCEMCP SSFCKQHREG MLFISKLDGR LSCTEHDPCG 1920
PNPLEPGEIR EYVPPPVPLT SGANTHLAEQ PSEIPAQRPL MVDKAPGAMG QRLQLSEKTL 1980
VGTCQRPQLS DKPLVMPDST PQSPDKSPGV SGPRPQPLEV GQRPADTSLV VTSPRPQLSD 2040
KSSQALSGSK SPPSVRSSVA AGLRSQLLDR PLALAGPKPQ SLDKSLSTIS SKSQQPDRPV 2100
VTTGPRLQPS DKSPITNGPK PQTSDKPLVP LGQRLPPSEK VLSAVVQNLV SNEKALRPVD 2160
QNTRPKDRAT MVLELSPRQK ERAASPHEIT PQPTEKLPVL EQSPWPVNKA LGQMPRAAEK 2220
VHPSEAVLQA SGRASAPVEH TWQAGKTLIQ ARLIPRPPAK GYLFEQAPRA AGRVPVSMEQ 2280
SSGSFGKALA SGEPMAGSPQ SPGLATKATP LMVQSPRPPV KSPDLGPPAE KRLAMTEHPP 2340
WALGKSPAGP TPWPAGKSLA QTCRSPGSPQ TLAQTCRPLG KGLNKGPDPK PEQSAVPALN 2400
QTPSNHEPAE SKQK 2414
Nucleotide Sequence
(Fasta)
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN GAGTGGGGGG CGCTACAGGC TGCTAAGGCA 60
GAACCATGTG TGGCCGCAGC GGGACTTGGA GCGGGAGCAA AGGCGGCAGA GCCCAGTGGT 120
GTGTAAGGGG TGGCTGGGAC AATCCTGGCT TCGTCGGCCC TCACCGTGCG CTTCCGACGG 180
AGACACCTAC GGATCCAGCT CCCACCCCTT CCACTCCCCC TCCCCCCCCA AGCGGGCCCT 240
TGCTGTCACA GGACAGGGCC GCGGGCGCAG TGGGCCCGGC GCTGGAGGGC CAGGCACGAG 300
GAGGGTGGTG GACCGGGAGT GGGGGCGGTG GTATGTGTAG TGGGCCCCCC CTTAGTGAAA 360
CTAAGGTAAG GAGCTGATTG GGCGGCGCCG TTGCGAACCG GCCTTCACCC GCACTGAACC 420
CCAAGAGTGA GGAGGAGGGA GCGCGCGCTG GGAGACGAAG CCCCGGGGGC GGGAGAGGGG 480
AAGCACGCGC ACACGCGCAC GCGCGCACTG GGCCCCGGCG CGCGCCGTGA AGGGTGGAGG 540
AGGGTGGAGG AGGGTGGTGG TGGTGGTGTG GTGGTGGTGG CGGCAGGGGG GAGTGAGTGA 600
GGAGGGAGGA GGAGGGGGAG GAGGAGGAGG AGGAGGAGAA ACAGGCCGGG GCAGGTCGGG 660
TGCTTGCTAG CTGCCTTCAG CCTAGGAGGA GAGAAGGGGG AAGGGGGGCG CAGCGAGCGG 720
CTTTCTCCCC ATTTACTGCT TACCCCCTAC CCCCTACCCA GCGTAATGAG GTCAGCCCCC 780
CTCCCCTGCA GCCATACCGG GGAGGGGGCG CCACGAGCAC TGAGCAGGGG AGGGCTGGCC 840
CGGGCCGGGC CTGGCGGGGA AGATGGTGGC GGCCGTGTGA GGTTGATGTT GGCCCAGAAT 900
GGATCAGACC TGTGAACTGC CCAGAAGAAA TTGTCTGCTG CCCTTTTCCA ATACAGTGGA 960
TTTAGATGCC CCTGAAGAAA AGGACACCCC TTTCGGATGA TCCAGAATCC AGTACCAGTA 1020
CATTAGGAAA CATGATAGAA TTACCTGGAA CATCATCATC ATCTACTTCA CAAGAATTAC 1080
AATTTTGTCA ACCCAAGAAA AAACCTGCTC CCTTGAAATA CGGAGTTGGA CAACTTGTCT 1140
GGGCAAAATT CAACAGACGT CCATGGTGGC CCAGCAGAAT TTGTTGTGAT CCATTGATTA 1200
ACACTCACTC AAAAATGAAA GTTTCCAGCC AGAGGCCTTA TCGGCAGTAC TATGTGGAAG 1260
CCTTAGGAGA CCCTTCTGAA AAAGCCTGGG TTGCAGGAAA ATCGATTGTC CTATTTGAAG 1320
GAAGACATCA ATTTGAAGAA CTTCCTGTAC TGAGGAGAAG AGGCAAACAG AAAGAAAAAG 1380
GCTATAGACA TAAGGTTCCT AAGAAAATCT TGAGTAAGTG GGAAGCCAGT GTTGGACTTG 1440
CTGAGCAGAG GGATGTTCCT AAGGATTTTA AAGTCAGAAA GCGTATCAGA ACGTCAGCGA 1500
AGCTAGACAG TGAGGAAGAC ATGCCCTTTG AGGACTGTAC AAATGATCCT GAATCTGAAC 1560
ATGAACGAGT ACTAAATGGC TGCCTGAAAT CACTAGCCTT TGACTCTGAA ACCTCTTCAG 1620
ATGAGAAAGT GAAACCGTGC ACCAAATCTC GAGTCAGAAA GAGATCCACA AGCCTCAAAC 1680
GGACTAGTAT TAAAAAAAGT AGCTCAGCAT TTGAATCTCA TAAGGAAGAA AGAAGGGAAA 1740
AGATTACAGA GACTCTTGGC CTGAACTATA TTTCTAGTGA TGTGTCTGAT AAGCAAGCTT 1800
CTACTGAACT GTGCAGAATA GCAAGCAGTC TTTCAGCAGC ATCCAACACC ACAACAAACT 1860
ACTTGTTTTC TTCATGTGGG AAGACCACTA CGAAAAAAGA GTTTGAGACT TCAAATTGTG 1920
ATTCTCTGCT TGGTTTGTCA GAGAGTGTTT TGATTTCTAG CAGTTCTGTG GAAATAAAGA 1980
AACCTCTCCG CAGCATGGTT CACAGTGCAA AGCCATCCCG ATTGTGCTAT ATCAGCGCTG 2040
GTGATGAGGA AAAGAGAAGT GATTCAGCCA GTTTATGGAC CACCTCTGAT GAAGAGAGCA 2100
GTGACAGTGA TCTGGACCTC ATTGACCAAA GTTCAGAGTC TGATGGAAGT GCTCTGGAAA 2160
TCAGTGGCTC TTTTGAGAAA AATGAAAGTG AGTCGCCTAT GGAGAAAAAT GAAAAGGTTC 2220
AGTATTCCAG GTATCGCACA AAGAACAGTA GGGTAAAAGC AAGAAAGAGG TCTAGAAAAA 2280
CTAGTTCGCA CCTAGACCAC TTGTTAGATT GTACTAAAGC TCCTGAACCT GGGAGTGAAG 2340
AATCTCAAGG TGCTGATTCT GATTCCAAAA TGCCCATCAT TGCCAGAAAT GACAATCTTT 2400
CCACCAAATA TACCATGCCA CAAAGCATTT CCCATGAGAA TTCTTTTGTA AAGAGTGGGC 2460
TCACTAATCA ACCTCTGTTA CAATCAAAAC ATAACAAACA GCCCAAAGCC AGAAGTATAA 2520
AATGCAAACA CAAAGAGAGC CCTACTGCAG AACAACCGTC AGTTTTAGAT GAAGATTTCA 2580
GCTTAAAGTG CTGCTCTTCT GATTCCAAAG GGTCTCCTTT AGACTGTGTT GTCAATAGTG 2640
GGAAGCTTGA TAGTCTAAAA CTACTGAGCA ACATGCATGA GAAACCCAGA GATCCTGCAG 2700
AAATTGAAAC AGTTGTTGTA AAACATGTCC TGTCTGAGTT GAAAGAACTG TCATATAGAT 2760
CCTTAAATGA TGACTTAAAT GACTCTGGTT CATCCAAAAT AAACAAACCT TTACTTTTCC 2820
CTTCTTCCTC TGGTCCAACC TGCTTGCCGA TTGAGCCAAA TTACAAATTC AGCACCTTAT 2880
TGATGATGTT GAAAGATATG CATGATAGTA AGACGAAGGA ACAACAGCTA ATGACTACTC 2940
AAAATGTAGT TTCCTATCGT AGTCCTGGTC TTGGGGACTG TTCCAGCAGC AGTGCTGCAG 3000
GTGGCTCAAA GTCCCTGGTT TCAGGAGGCT TGACTCATAA GACAGAAAAA AATGGGAATT 3060
GTATTCAGGA CTCAGTCTGT CAAAACCCTG GTGGGAGTGA GTCTTCTCTC ACCCAGGAGT 3120
CATCTGGATC TTTAACTGGA TTGCTATCTA AAAAAAGAAG ACTCACTGCT TCTAGCAGAC 3180
GGCGAGCAAA TTCTATCACC CGGCGAAATT GTGCAAGATC CAAGTCATCC AAACTGCAGG 3240
ATGGCTTTTT GGCCCAAATG GGAAAAGACA CTATAGAGAG CAAAGATTTA AAATCAGAAA 3300
AGAAAAGGAA GCTAAACCAG CCACCCAATA TGACCATTGA GACTGCACTG GGTGAATGTA 3360
AAGAAGCTCC AACTTCAGTA AATAAGTCTC TAAGCAGCGG GCCTGCAGAT TCAAAAAGAA 3420
AGCAATCTCT TCAGTTAATG GAACATTTAA CAGGTGAAGA CTGTGCACAT TTTTCTGATG 3480
TCCATTTTGA TCCTATGATT AAAAAACCTG AAACTGATAA AAATCCTGAA AAGTGCCCCT 3540
CTTTTGAAAA TGGGGAAGTC CTAGAGCTGG ACTCTGAAAT GAACAGTGAA AGCTCCTTGA 3600
GTGATGAATC TAATGATGTA AACCATGTGG CACCAAAAAA GCGGTGGCAA CGTTTTAACC 3660
AAAGCAGAGC TAAAACCAGT AAGCGCATCA GTAGATCTAG TAGAGAAAAG AACTCAAAGA 3720
GTGCCTTTGG GGGCCTGCAC CCTTGTGACT TTCTGCTGGA GGGGAATGAG GGCCCAGAGG 3780
ATAGGCCTTC CAGCCCTTCT AAGGTGCTAG AGGAAGAAAA GAAACAGAAG TGTGAGGGTC 3840
ACCTAGACAT AGTTGTGCCA CAGTTGAATG TGTGTGATAA GCCAAACAGT GATGGGCTGC 3900
CCAGGGAATC AGGGCTTTCC AATTTTCCTC CACAGCCTGA GCTCAACACA CAAGTTGCGC 3960
ACTCAGAGAA GAAACGTCTT AGGAAACCAA GCAAGTGGCT TCTGGAATAT ACAGAAAAAT 4020
ATGACCAGAT ATTTGCCCCC AAGAAAAAAC AAAAGAAGGT ACAGGAACAG TTGCCAAAGG 4080
TAAGTTCCTT AAAAGAAAAT GAAGTTCAGA CTACACAGTG TCAACCTAGC ACTGATCTCC 4140
AGAATAAGCC AGTGGAAGAG AGCTCTTCAA CCAAAGAGGA TCCGCCTGTT CTTGAGAGAG 4200
AGGCTCCATT TTTGGAAGGA CCAATAGGTC AGTCTGAACT TGGAGGTGGA AATGCAGAAT 4260
TGCCAGAACT GACCTTGTCT GTGCCAATCA CACGTGAAGT TTCTTCTTGG TCTTCACTGG 4320
ATTCAGAAGA GCCTTCATTG AAACCAAGAA ATTATGAAAG TAAGCGTCAG AGGAAGCCTA 4380
CAAAAAAGCT CCTTGAATCC AATGATTTGG ACCCTGGATT TATGCCCAAG AAAGGGGATT 4440
TGACACTCCT TAGAAAGTGT TATAATACTG GTCACTTGGA GAATGATATT TCTGGATCAT 4500
GTACAACGCT TACATCTTTG GAGTTTGGTG AAGGTACAAA CAAGGTCTTT GAAAAACAAA 4560
GGAAGCGAAA GAGGCAAAGG CATGTTGCAG CACATTGTAA GAAAGTGAGA AATGAAGATT 4620
CTTCACGGGA GGCTCAAAAT TCTGAGGGAG AACCAATTAC TCCTGGTACT ACTGCAAGCC 4680
CGAAAGATTC CATAGAAGAG AGTGTGGAAC ATGATCATGG GATGCCTGTA TCAAAAAAGA 4740
TGCAGGCTGA ACGTGGTGGA GGAGCAGCTC TCAAGGAAAA TGTTTGCCAG AATTGTGAAA 4800
AAGTGGGTGA GTTGTTGCTG TGTGAGGCTC AGTGTTGTGG TGCTTTCCAC CTGGAGTGTA 4860
TTGGCCTGAC AGAGATGCCA AAAGGCAAAT TTATCTGCAA GGAATGTCGA ACAGGAATTC 4920
ACACCTGCTT TGTGTGTAAA GATGGGGAAG ATGTCAAAAG GTGTTTGCTA CCTCTTTGTT 4980
TTCGGTGCTC TCTTCACATG TGTATAACCT GTCATGCTGC TAATCCAGCC AGTCTTTCCG 5040
CATCAAAAGG TCGCCTAATG CGTTGTGTCC GGTGTCCTGT GGCTTACCAT GCAAATGACT 5100
TCTGTCTGGC TGCAGGGTCA AAGGTTCTTG CATCCAATAG CATCATCTGT CCTAATCATT 5160
TTGCCCCTCG GAGAGGTTGC AGGAATCATG AACATGTTAA TGTTAGCTGG TGTTTTGTAT 5220
GCTCAGAAGG GGGCAGTCTT TTGTGCTGTG ACTCTTGCCC TGCTGCATTT CATAGAGAAT 5280
GTCTGAATAT TGATATCCCT GAGGGCAACT GGTATTGCAA TGACTGTAAG GCAGGCAAAA 5340
AACCACACTA CCGAGAAGTT GTCTGGGTAA AAGTTGGCCG ATACAGGTGG TGGCCAGCTG 5400
AAATTTGTCA TCCTCGAGCT ATTCCTTCCA ATATTGATAA GATGAGACAT GATGTGGGTG 5460
AATTTCCTGT TTTATTTTTT GGTTCTAATG ATTATTTATG GACCCATCAG GCTCGAGTCT 5520
TTCCCTACAT GGAGGGAGAT GTCAGTAGTA AGGACAAGAT GGGCAAAGGA GTTGATGGGA 5580
CATATAAAAA AGCTCTTCAA GAAGCTGCTG TCCGTTTTGA AGAATTAAAG GCCCAGAAGG 5640
AGCTGAGACA GCTCCAGGAA GACAGAAAAA ATGACAAGAA ACCACCACCC TACAAGCATA 5700
TCAAGGTAAA TCGTCCAATA GGCAGAGTTC AGATCTTTAC TGCAGACCTG TCAGAAATTC 5760
CCCGTTGTAA CTGCAAAGCT TCAGATGAAA ACCCTTGTGG CATTGACTCA GAGTGCATTA 5820
ACCGCATGCT CTTGTATGAG TGCCATCCCA CTGTCTGTCC TGCTGGTGGA CGCTGCCAGA 5880
ACCAATGCTT CTCAAAACGC CAGTATCCTG AGGTTGAAAT CTTTCGTACA TTACAGCGAG 5940
GCTGGGGCTT ACGGACAAAG ACGGATATTA AAAAGGGTGA ATTTGTAAAT GAGTATGTGG 6000
GAGAACTAAT AGATGAAGAG GAGTGTAGGG CCCGAATCCG TTATGCCCAA GAACATGACA 6060
TCACCAATTT CTACATGCTC ACTTTGGATA AGGCAAGTAC TGAAAATGAC CGGATTATTG 6120
ATGCTGGCCC CAAAGGAAAT TATGCTCGGT TTATGAATCA CTGCTGCCAG CCCAACTGTG 6180
AAACTCAGAA GTGGTCTGTG AATGGAGATA CACGGGTTGG GCTTTTTGCC CTAAGCGACA 6240
TCAAAGCGGG CACTGAACTC ACCTTCAACT ACAATCTAGA GTGTCTAGGG AATGGAAAAA 6300
CTGTTTGCAA ATGTGGAGCA CCAAACTGCA GTGGTTTCTT GGGTGTTCGG CCAAAGAATC 6360
ATCCTAATCC CACAGAAGAG AAATCCAAGA AACTAAAAAG AAGACAACAA GTAAAGCGCC 6420
GATCCCAGGG TGAGATTACA AAGGAGCGAG AGGATGAATG TTTCAGCTGT GGGGATGCAG 6480
GGCAACTCGT ATCCTGCAAG AAACCAGGTT GCCCCAAAGT CTATCATGCT GACTGCCTCA 6540
ACTTGACAAA AAGACCTGCA GGGAAGTGGG AGTGTCCATG GCACCAGTGT GACGTTTGTG 6600
GAAAAGAAGC TGCATCTTTT TGTGAGATGT GCCCTAGTTC CTTCTGCAAA CAACACCGAG 6660
AGGGAATGCT GTTCATCTCT AAATTGGATG GTCGCCTGTC TTGCACAGAG CATGATCCAT 6720
GTGGCCCCAA CCCTCTGGAG CCTGGGGAGA TCCGTGAGTA CGTGCCTCCC CCTGTGCCGC 6780
TGACTTCAGG TGCAAACACT CACCTAGCAG AGCAGCCTTC TGAGATTCCT GCTCAAAGAC 6840
CCCTGATGGT GGACAAAGCC CCTGGGGCAA TGGGCCAAAG GCTCCAGCTA TCAGAGAAAA 6900
CATTAGTAGG AACATGTCAG AGGCCACAGT TGTCTGATAA ACCACTTGTG ATGCCTGACT 6960
CCACGCCCCA GTCACCAGAT AAGAGCCCTG GAGTATCAGG TCCAAGACCT CAGCCCTTAG 7020
AAGTAGGCCA GAGGCCAGCA GACACATCTC TTGTAGTGAC CAGCCCCAGA CCTCAACTGT 7080
CTGACAAATC TTCTCAAGCA CTGTCAGGCT CCAAATCCCC ACCATCAGTC AGGTCCTCAG 7140
TAGCAGCTGG CCTCAGGTCC CAACTGTTGG ACAGACCTCT AGCTTTAGCA GGCCCAAAGC 7200
CTCAATCTTT GGATAAGTCC TTGAGTACTA TCAGCTCAAA GTCTCAGCAA CCAGATAGGC 7260
CTGTAGTTAC TACTGGACCA AGACTCCAGC CATCAGACAA GTCCCCAATC ACAAATGGTC 7320
CAAAGCCCCA GACCTCAGAC AAGCCCCTTG TCCCTCTGGG CCAGAGACTC CCACCTTCTG 7380
AAAAAGTGCT GTCAGCTGTG GTCCAGAACC TTGTATCTAA TGAAAAAGCA CTAAGGCCTG 7440
TGGACCAAAA TACTCGGCCA AAAGATCGAG CTACTATGGT TCTTGAACTG AGTCCTCGTC 7500
AAAAGGAACG AGCAGCTTCA CCTCATGAAA TTACACCCCA GCCCACTGAA AAATTACCAG 7560
TGCTGGAGCA GAGTCCCTGG CCTGTTAACA AAGCACTGGG ACAAATGCCT CGGGCCGCTG 7620
AGAAAGTTCA CCCTTCCGAA GCAGTCCTCC AAGCATCTGG AAGAGCTTCA GCCCCTGTGG 7680
AGCACACCTG GCAGGCTGGC AAAACACTCA TACAAGCCAG ACTGATCCCT CGGCCCCCTG 7740
CCAAGGGTTA CTTGTTTGAA CAGGCTCCTC GGGCCGCAGG ACGTGTACCT GTGTCAATGG 7800
AGCAGAGCTC TGGGTCATTC GGCAAAGCCC TAGCCTCAGG AGAACCCATG GCTGGATCCC 7860
CACAATCCCC AGGGCTTGCC ACAAAAGCAA CACCATTGAT GGTGCAGTCC CCTAGGCCCC 7920
CTGTCAAATC ACCAGACCTT GGTCCCCCAG CAGAGAAACG GTTAGCAATG ACAGAGCACC 7980
CCCCCTGGGC CCTGGGGAAA TCCCCAGCAG GGCCCACTCC CTGGCCTGCA GGCAAATCAT 8040
TGGCACAGAC TTGTCGGTCA CCTGGGAGTC CACAGACATT GGCACAGACT TGTCGGCCCC 8100
TTGGCAAAGG GCTAAACAAA GGGCCAGACC CTAAGCCTGA GCAAAGTGCA GTACCAGCTC 8160
TTAACCAGAC CCCTTCCAAC CATGAGCCTG CAGAGTCAAA ACAGAAGTGA 8211
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 91 0.0 4137
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 69 0.0 3083
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 69 0.0 3079
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 69 0.0 3074
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 69 0.0 3068
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 68 0.0 3059
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 69 0.0 3055
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 68 0.0 3051
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 69 0.0 3045
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 69 0.0 3037
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 69 0.0 3036
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 69 0.0 3030
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 69 0.0 3029
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 69 0.0 3024
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 68 0.0 3023
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 68 0.0 2999
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 68 0.0 2996
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 67 0.0 2987
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 69 0.0 2982
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 69 0.0 2982
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 67 0.0 2916
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 70 0.0 2893
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 67 0.0 2878
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 67 0.0 2843
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 65 0.0 2802
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 65 0.0 2793
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 67 0.0 2784
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 70 0.0 2764
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 58 0.0 2216
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 58 0.0 2212
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 57 0.0 2196
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 59 0.0 2096
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 70 0.0 2083
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 62 0.0 1991
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 60 0.0 1989
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 67 0.0 1832
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 75 0.0 1569
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 79 0.0 1558
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 82 0.0 1480
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 58 0.0 1461
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 56 0.0 1392
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 74 0.0 1375
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 87 0.0 1217
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 55 0.0 1196
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 75 0.0 1167
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 70 0.0 1160
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 76 0.0 1149
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 76 0.0 1102
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 70 0.0 1082
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 77 0.0 1040
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 65 0.0 1032
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 73 0.0 998
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 61 0.0 946
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 62 0.0 926
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 69 0.0 918
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 61 0.0 907
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 60 0.0 906
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 62 0.0 896
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 62 0.0 882
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 55 0.0 865
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 56 0.0 860
WERAM-Ect-0036 ENSETEP00000003241.1 Echinops telfairi 56 0.0 801
WERAM-Ere-0029 ENSEEUP00000002285.1 Erinaceus europaeus 55 0.0 800
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 56 0.0 643
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 42 5e-112 404
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 37 5e-49 195
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 8e-49 194
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 2e-48 193
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 4e-48 192
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 5e-48 192
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 42 1e-47 190
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 41 1e-47 190
Created Date 25-Jun-2016