WERAM Information


Tag Content
WERAM ID WERAM-Ova-0029
Ensembl Protein ID ENSOARP00000004108.1
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSOARG00000003834.1 ENSOART00000004180.1 ENSOARP00000004108.1
ENSOARG00000003834.1 ENSOART00000004183.1 ENSOARP00000004111.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.60e-52 176.8 1677 1793
Me_Reader PWWP 5.80e-33 113 54 1551
HMT SET1 2.00e-29 102.3 1677 1793
Me_Reader PHD 2.00e-19 69.7 1278 1897
Organism Ovis aries
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSOARP00000004108.1 1677 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 1763
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSOARP00000004108.1 1764 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 1793
*****************************8 PP

  Me_Reader PWWP

              PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62 
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSOARP00000004108.1 54 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 118
69********************98766665466888999******************99998775 PP
PWWP.txt 12 YpwWPalvisppleakklktqeaeenk 38
Y+ +Pa +++ ++k+l t+++ +++
ENSOARP00000004108.1 417 YSRYPAANTKVKAKQKSLITNSHTDHL 443
999*************99888888776 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSOARP00000004108.1 1491 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1551
589*****************************************.***************87 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSOARP00000004108.1 1677 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 1761
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSOARP00000004108.1 1762 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1793
*******************************7 PP

  Me_Reader PHD

               PHD.txt    2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSOARP00000004108.1 1278 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1322
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSOARP00000004108.1 1325 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1372
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC +C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSOARP00000004108.1 1373 ICTTCHAANPASVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1426
8****86666644455677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSOARP00000004108.1 1443 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1484
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSOARP00000004108.1 1855 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 1897
9999..33442..9********************..*****.*****886 PP

Protein Sequence
(Fasta)
MPLKTRTALS DDPDSSTSTL GNMLELPGTS SSSTSQELPF CQPKKKATPL KYEVGDLIWA 60
KFKRRPWWPC RICSDPLINT HSKMKVSNRR PYRQYYVEAF GDPSERAWVA GKAIVMFEGR 120
HQFEELPVLR RRGKQKEKGY RHKVPQKILS KWEASVGLAE QYDIPKGSKN RKCVTSSIKL 180
DSEEDMPFED CTNDPESEHD LLLNGCLKSL AFDSEHSADE KEKPCAKSRA RKSSDNPKRT 240
SVKKGHMQFE THKEERRGKI PENLGLNFIS GDVSDKQASN ELSRIANSLT GPGTAPGSFL 300
FSSCAKNTAK KEFETSNCDS LLGLSEGALI SKRSGEKKKF QRGLVCSSKV QLCYIGAGDE 360
EKRSDSISIC TTSDDGSSDL DPVDHSSESD NSVLEITDAF DRSENLLPVQ KNEKVKYSRY 420
PAANTKVKAK QKSLITNSHT DHLMNCTKTA EPGTETSQIN LSDLKVSSLV RKPQPDFRND 480
GFSPKFSTSS SISSENSIIK GGAKNQALLH SKSKQPKIRS IKCKHKEHPV VAEPPVANED 540
CSLKCCSSDN KASPLASISK SGKVDGLKLL SNMHEKTRDS SDIETAVVKH VLSELKELSY 600
RSLGEDVSDS GTSKPSKPLL FSSASGQNHI PIEPDYKFST LLMMLKDMHD SKTKEQRIMT 660
AQNLVSYRSP GLGDCSTSSP VSAPKVLVSG GSNHSSEKIG DGTQDPVHPG PGGGNSALSG 720
ELSTSLPGLG SDKRDLPASG KNRSNCVTRR NCGRSKPSKF RDGFSAQLGK NTVNRKALKT 780
ERRRKLNELP AVTLEAALQG DRESRGSENS SSRGEAEDPG KEPPLQLMGH LTSKDGAHFS 840
SVSFDNKVNQ SDPEKISEKG PSFEIRKVPE LDSEMNSEND EPNSVNEAVP KKRWQRLNQR 900
RTKPRKRTNR LKEKENSEGA FGVLLPADPV KKEDEFPEQR PPASTNKLEP ALTDPNRADH 960
LDSAGPRLNV CDKSKASNEE MEKEPGIPSL TPQPELPEPA VRSEKKRLRK PSKWLLEYTE 1020
EYDQIFAPKK KQKKTQEQVH KVSSRCEEES LLARCRSSAQ NKQVDENSLI STKEEPPVLE 1080
REAPFLEGPL AQSELGGGHA ELPQLTLSVP VAPEVSPRPA LESEELLVKP PGNYESKRQR 1140
KPTKKLLESN DLDPGFMPKK GDLGLTKKCY EAGHLENDIN ESCVAPRSKE FGGGTAKLFD 1200
RPRKRKRQRH ATAKVHCKKV RNDTSSKETP NSEGELMTHR TAASPKDTME EGVENDHGMP 1260
ASKKLQGERG GGAALKENVC QNCEKLGELL LCEAQCCGAF HLECLGLTEM PRGKFICNEC 1320
RTGIHTCFVC KQSGEDVKRC LLPLCGKFYH EECVQKYPPT VMQNKGFRCS LHICTTCHAA 1380
NPASVSASKG RLMRCVRCPV AYHANDFCLA AGSKILASNS IICPNHFTPR RGCRNHEHVN 1440
VSWCFVCSEG GSLLCCDSCP AAFHRECLNI DIPEGNWYCN DCKAGKKPHY REIVWVKVGR 1500
YRWWPAEICH PRAVPSNIDK MRHDVGEFPV LFFGSNDYLW THQARVFPYM EGDVSSKDKM 1560
GKGVDGTYKK ALQEAAARFE ELKAQKELRQ LQEDRKNDKK PPPYKHIKVN RPIGRVQIFT 1620
ADLSEIPRCN CKATDENPCG IDSECINRML LYECHPTVCP AGGRCQNQCF TKRQYPEVEI 1680
FRTLQRGWGL RTKTDIKKGE FVNEYVGELI DEEECRARIR YAQEHDITNF YMLTLDKDRI 1740
IDAGPKGNYA RFMNHCCQPN CETQKWSVNG DTRVGLFALS DIKAGTELTF NYNLECLGNG 1800
KTVCKCGAPN CSGFLGVRPK NQPIATEEKS KKFKKKQQGK RRTQGEVTKE REDECFSCGD 1860
AGQLVSCKKP GCPKVYHADC LNLTKRPAGK WECPWHQCDI CGKEAASFCE MCPSSFCKQH 1920
REGMLFISKL DGRLSCTEHD PCGPNPLEPG EIREYVPPPV PLPPGPSTHL AEQSSGVAAQ 1980
GPKMSDKPSA DTNQSLSLSK KALAGTCQRP PLPERPPDRT DSRPQPVDKV RDLAGSGTKP 2040
QSLVSSQKPL DRPTAVAGPR PQLSDKPSPV TGPSSSPSVR SQPLERPLGT ADPRLDKSIG 2100
AASPRPQSLE KTPVPTGLRL PPPEKLLVTS GPKPQTSDRP PDKSHVSLSQ RLPPPDKVLS 2160
AVVQTLVAKE KALRPVDQNT QSKNRAALVM DLIDLTPRQK ERAGSPHELT PQADEKMPVL 2220
ESSSWPASKG LGQIPRAVER GSVSDAVLQP LGKAAAPSEH SWQAVKSLTQ ARLLSQPPAK 2280
AFLYEPATQA SGRAPAGAEQ TPGPPSQVPG LVKQVKQMAG GQQLPGLTAK SGQSFRPLGK 2340
PSLSTEEKKL ATTEQSPWAL GKASPGPGLW PMVAGQTLAQ SCWSSGSTQT LAQTCWSLGR 2400
GQDPKPEQST LPALNQAPSS HKCAESEQK 2429
Nucleotide Sequence
(Fasta)
GTTGATGCTG GCCCAGGATG GATCAGACCT GTAAACTACC TAGAAGAAAT TGTCTGCTGC 60
CCTTTTCCAA TCCAGTGAAT TTAGATGCCC CTGAAGACAA GGACAGCCCT TTCGGATGAT 120
CCAGACTCCA GTACCAGTAC ATTAGGAAAC ATGCTAGAAT TACCTGGAAC TTCATCATCA 180
TCTACTTCAC AGGAATTGCC ATTTTGTCAA CCCAAGAAAA AGGCTACGCC ACTGAAGTAT 240
GAAGTTGGAG ATCTCATCTG GGCAAAATTC AAGAGACGCC CATGGTGGCC CTGCAGGATT 300
TGTTCTGATC CGTTGATTAA TACACACTCA AAAATGAAAG TTTCTAACCG CAGGCCCTAT 360
CGTCAGTACT ACGTGGAGGC TTTTGGAGAT CCTTCTGAAA GAGCCTGGGT GGCTGGAAAA 420
GCAATTGTTA TGTTCGAAGG CAGACATCAG TTTGAAGAGC TACCTGTTCT TAGGAGAAGA 480
GGGAAGCAGA AAGAGAAGGG ATATAGGCAT AAGGTTCCTC AGAAGATTTT GAGTAAATGG 540
GAAGCCAGTG TTGGTCTTGC TGAGCAGTAT GATATTCCCA AAGGGTCGAA GAACCGAAAG 600
TGTGTCACCA GTTCAATCAA GTTGGACAGC GAGGAGGATA TGCCATTTGA GGACTGTACA 660
AATGATCCTG AGTCAGAACA TGACCTGTTG CTTAATGGCT GCTTGAAATC TCTGGCTTTT 720
GACTCTGAAC ATTCTGCAGA TGAGAAGGAA AAACCATGTG CTAAGTCTCG AGCCAGAAAG 780
AGCTCTGATA ATCCAAAAAG GACTAGTGTG AAAAAGGGCC ATATGCAATT TGAAACACAT 840
AAGGAAGAAC GGAGGGGAAA GATTCCAGAA AACCTTGGCT TAAACTTTAT TTCTGGGGAT 900
GTGTCTGATA AGCAGGCCTC GAATGAACTT TCCCGGATAG CGAACAGCCT CACAGGGCCC 960
GGCACTGCCC CAGGAAGTTT CCTGTTTTCT TCTTGTGCAA AAAACACTGC AAAGAAAGAA 1020
TTTGAGACTT CAAATTGTGA CTCTTTACTG GGCTTGTCTG AGGGTGCCTT GATCTCTAAA 1080
CGTTCTGGGG AGAAGAAGAA ATTCCAGCGA GGTCTGGTGT GTAGTTCGAA AGTACAACTC 1140
TGCTATATTG GAGCAGGTGA TGAGGAAAAA CGAAGTGATT CCATTAGTAT TTGTACCACT 1200
TCTGATGATG GAAGCAGTGA TCTGGATCCT GTAGATCATA GTTCAGAGTC TGATAACAGT 1260
GTCCTTGAAA TTACAGATGC TTTTGATAGA TCAGAGAACC TGTTACCTGT GCAAAAAAAT 1320
GAAAAGGTAA AGTATTCTAG GTATCCTGCC GCAAACACTA AGGTAAAAGC AAAGCAGAAG 1380
TCTCTGATTA CTAACTCACA TACGGACCAC CTAATGAATT GTACCAAGAC AGCAGAGCCT 1440
GGAACTGAGA CATCTCAGAT TAATCTCTCT GATCTTAAAG TGTCAAGTCT TGTCCGAAAA 1500
CCCCAACCAG ATTTTAGAAA TGATGGATTT TCTCCAAAAT TCAGCACATC ATCAAGCATT 1560
TCCAGTGAGA ACTCAATAAT AAAAGGTGGG GCTAAAAATC AAGCTCTGTT ACATTCAAAA 1620
AGCAAACAGC CCAAAATTCG AAGTATCAAG TGCAAACATA AAGAACACCC GGTTGTAGCA 1680
GAACCTCCAG TTGCAAATGA GGACTGCAGT TTAAAATGCT GCTCTTCTGA TAACAAAGCC 1740
TCTCCTTTGG CCAGCATTTC TAAAAGCGGG AAAGTGGATG GACTGAAACT ACTGAGCAAC 1800
ATGCATGAGA AAACCAGGGA TTCGAGTGAC ATAGAAACAG CAGTGGTGAA ACACGTTCTG 1860
TCAGAGTTGA AGGAACTCTC TTACAGATCC TTAGGTGAAG ATGTCAGTGA CTCTGGAACG 1920
TCAAAGCCAT CGAAACCATT ACTTTTTTCT TCTGCCTCTG GTCAGAACCA TATACCTATT 1980
GAACCAGACT ACAAATTCAG TACATTACTA ATGATGTTGA AAGATATGCA TGATAGTAAG 2040
ACCAAGGAGC AACGAATAAT GACAGCTCAG AACTTGGTCT CTTATCGGAG TCCTGGTCTT 2100
GGGGACTGTT CTACCAGTAG TCCTGTGTCA GCTCCTAAGG TCTTGGTTTC AGGAGGCTCC 2160
AATCACAGTT CAGAAAAAAT TGGAGATGGC ACTCAGGATC CAGTCCACCC TGGCCCTGGT 2220
GGGGGCAACT CTGCACTGTC TGGGGAGCTG TCTACTTCCC TGCCTGGCTT GGGGTCTGAT 2280
AAAAGAGACC TCCCTGCTTC TGGCAAAAAT CGTTCAAACT GCGTTACTAG GCGCAACTGT 2340
GGTCGATCAA AGCCATCCAA ATTTCGAGAT GGTTTTTCAG CCCAGTTGGG AAAGAACACA 2400
GTGAACCGTA AAGCCTTAAA AACAGAACGC AGAAGAAAAC TGAACGAGCT TCCAGCTGTG 2460
ACTCTTGAGG CTGCACTACA GGGAGACAGA GAGAGTAGGG GTTCAGAGAA TAGCTCCTCC 2520
AGAGGTGAGG CAGAAGACCC TGGTAAGGAA CCACCCCTTC AATTAATGGG CCATTTAACA 2580
AGTAAAGATG GTGCCCATTT TTCCAGTGTT AGTTTTGATA ATAAAGTCAA CCAGTCTGAC 2640
CCTGAAAAAA TTTCTGAAAA AGGCCCCTCT TTTGAAATCA GAAAAGTCCC AGAGCTGGAC 2700
TCTGAAATGA ACAGTGAGAA TGATGAACCT AATAGTGTAA ACGAAGCAGT GCCTAAAAAG 2760
CGATGGCAGC GTTTAAACCA AAGGCGCACT AAACCTCGTA AACGCACTAA CAGACTTAAG 2820
GAGAAAGAAA ACTCTGAGGG TGCCTTTGGG GTCTTGCTTC CTGCTGACCC TGTAAAGAAG 2880
GAAGACGAGT TCCCAGAGCA GAGACCTCCT GCTTCGACAA ACAAACTAGA GCCTGCACTG 2940
ACAGATCCAA ATCGTGCCGA CCACTTAGAT TCAGCTGGGC CACGGTTGAA TGTTTGTGAT 3000
AAATCGAAAG CCAGCAATGA GGAGATGGAA AAGGAGCCAG GAATTCCCAG TTTGACTCCT 3060
CAACCTGAGC TCCCCGAACC AGCTGTGCGA TCAGAGAAGA AACGCCTTAG GAAGCCAAGC 3120
AAGTGGCTTC TAGAATATAC AGAAGAATAT GATCAGATAT TTGCTCCTAA GAAAAAACAA 3180
AAGAAGACAC AGGAACAGGT GCACAAGGTA AGTTCCCGCT GTGAAGAGGA AAGCCTTTTA 3240
GCCCGATGTC GATCTAGTGC TCAGAACAAA CAGGTGGATG AGAATTCTTT GATTTCAACC 3300
AAAGAAGAGC CTCCAGTTCT TGAAAGGGAG GCTCCATTTT TGGAAGGGCC CTTGGCTCAG 3360
TCAGAACTTG GAGGTGGACA TGCTGAGTTG CCACAGCTGA CCTTATCTGT GCCTGTGGCT 3420
CCGGAAGTCT CTCCACGGCC TGCCCTTGAG TCTGAGGAAT TGCTAGTTAA ACCACCAGGA 3480
AATTACGAAA GTAAGCGTCA GAGAAAACCA ACTAAGAAAC TTCTTGAATC CAATGATTTA 3540
GACCCTGGAT TTATGCCTAA GAAAGGGGAT CTTGGCCTTA CTAAAAAGTG TTATGAAGCT 3600
GGTCACTTGG AGAATGACAT TAATGAATCG TGTGTTGCAC CTCGTTCTAA AGAGTTTGGT 3660
GGAGGCACTG CCAAGCTGTT TGATAGACCA AGGAAGCGAA AACGACAGAG GCATGCTACA 3720
GCCAAGGTGC ATTGTAAAAA AGTGAGAAAT GACACCTCAT CAAAAGAAAC TCCAAACTCT 3780
GAGGGAGAAC TGATGACACA CAGGACAGCT GCAAGCCCCA AGGACACTAT GGAGGAGGGT 3840
GTAGAAAACG ACCATGGAAT GCCTGCATCT AAAAAACTGC AGGGGGAGCG AGGAGGTGGA 3900
GCCGCACTCA AGGAGAATGT CTGTCAGAAC TGTGAGAAAC TGGGTGAGCT GCTATTATGT 3960
GAGGCTCAGT GCTGTGGGGC TTTCCACTTG GAGTGCCTTG GATTAACTGA AATGCCCAGA 4020
GGAAAGTTTA TCTGCAATGA ATGTCGCACA GGAATACATA CCTGTTTTGT ATGCAAGCAG 4080
AGTGGGGAAG ATGTGAAAAG GTGCCTTCTG CCCTTATGTG GAAAGTTTTA CCATGAAGAG 4140
TGCGTCCAGA AGTACCCACC CACCGTGATG CAAAACAAGG GCTTCCGGTG TTCCCTCCAC 4200
ATATGTACTA CCTGTCACGC TGCCAATCCA GCCAGTGTTT CTGCGTCTAA AGGTCGACTG 4260
ATGCGCTGTG TCCGCTGCCC AGTGGCATAC CATGCCAATG ACTTTTGCCT GGCTGCTGGG 4320
TCAAAGATCC TTGCATCCAA TAGCATCATC TGCCCTAATC ACTTTACCCC GAGGCGTGGC 4380
TGCCGAAATC ATGAGCATGT TAATGTTAGT TGGTGTTTTG TGTGCTCTGA AGGAGGCAGC 4440
CTTCTGTGTT GTGATTCTTG CCCTGCTGCT TTTCATCGTG AATGCCTGAA CATTGATATC 4500
CCTGAAGGAA ACTGGTATTG CAATGACTGT AAGGCAGGCA AAAAGCCACA TTACAGAGAA 4560
ATTGTCTGGG TAAAAGTTGG AAGATACAGG TGGTGGCCAG CTGAGATCTG CCATCCTCGA 4620
GCTGTACCTT CCAATATTGA CAAGATGAGA CATGATGTAG GCGAGTTCCC TGTGCTCTTC 4680
TTTGGGTCTA ATGACTATCT GTGGACTCAC CAGGCCCGAG TCTTTCCCTA CATGGAGGGG 4740
GATGTGAGCA GCAAGGATAA GATGGGCAAA GGAGTCGACG GGACATATAA AAAAGCTCTT 4800
CAGGAAGCTG CAGCAAGGTT TGAGGAGTTG AAGGCCCAAA AAGAGCTAAG ACAGCTCCAG 4860
GAAGATCGAA AGAATGATAA GAAGCCACCA CCATACAAAC ATATAAAGGT GAACCGCCCT 4920
ATTGGCAGGG TGCAGATCTT CACTGCAGAC TTGTCTGAGA TCCCCCGCTG CAACTGTAAA 4980
GCCACAGACG AGAACCCCTG CGGCATAGAC TCCGAGTGCA TCAACCGCAT GCTGCTGTAC 5040
GAGTGCCACC CCACAGTGTG CCCTGCGGGA GGCCGCTGCC AGAACCAGTG CTTTACCAAG 5100
CGCCAGTACC CAGAGGTGGA AATTTTCCGC ACCTTACAGA GGGGCTGGGG TCTCCGAACA 5160
AAAACAGATA TTAAAAAGGG TGAATTTGTG AATGAATATG TGGGTGAGCT AATAGATGAA 5220
GAAGAGTGCA GAGCTCGAAT CCGTTACGCC CAAGAACATG ATATCACTAA TTTTTATATG 5280
CTCACTCTAG ACAAAGACCG GATTATTGAT GCTGGCCCCA AAGGAAACTA TGCTCGATTC 5340
ATGAATCATT GCTGCCAGCC TAACTGTGAA ACACAGAAGT GGTCTGTCAA TGGAGACACC 5400
CGGGTTGGCC TTTTTGCCCT GAGTGACATT AAAGCAGGCA CTGAACTTAC CTTCAACTAC 5460
AATCTAGAAT GTCTTGGGAA TGGAAAGACC GTTTGCAAAT GTGGAGCCCC AAACTGCAGT 5520
GGCTTTTTGG GTGTAAGGCC AAAGAATCAA CCCATTGCTA CAGAGGAAAA GTCAAAGAAA 5580
TTCAAGAAGA AGCAACAGGG GAAGCGCAGA ACCCAGGGTG AAGTCACAAA GGAGCGAGAG 5640
GATGAATGTT TCAGCTGTGG GGATGCTGGC CAGCTCGTCT CTTGTAAGAA GCCAGGCTGC 5700
CCCAAAGTTT ACCATGCAGA TTGTCTCAAT CTAACCAAGC GACCAGCAGG GAAATGGGAG 5760
TGTCCTTGGC ACCAGTGTGA CATATGCGGA AAGGAAGCAG CCTCCTTCTG TGAGATGTGT 5820
CCCAGCTCCT TCTGCAAGCA GCATCGGGAA GGGATGCTCT TCATCTCCAA ACTGGATGGG 5880
CGTCTGTCTT GTACTGAGCA TGATCCCTGT GGGCCCAACC CTCTGGAACC GGGGGAGATC 5940
CGTGAGTATG TACCTCCTCC AGTACCACTG CCTCCAGGCC CAAGCACTCA CCTGGCAGAG 6000
CAATCATCAG GAGTGGCTGC TCAAGGGCCC AAGATGTCGG ACAAGCCATC TGCTGACACC 6060
AACCAGTCGC TGTCGCTGTC CAAAAAAGCT CTGGCAGGGA CTTGTCAGAG GCCCCCGCTG 6120
CCTGAAAGGC CTCCTGACAG AACTGACTCC AGGCCCCAGC CTGTAGATAA GGTCAGGGAC 6180
CTTGCTGGGT CAGGGACCAA ACCCCAATCA TTGGTATCCA GCCAGAAGCC ATTGGACAGG 6240
CCAACTGCAG TGGCAGGACC AAGACCCCAA CTATCTGACA AACCCTCTCC AGTGACCGGC 6300
CCAAGCTCCT CACCCTCAGT CAGGTCTCAG CCACTGGAAA GACCTCTGGG GACAGCTGAT 6360
CCAAGGCTGG ATAAATCCAT AGGTGCTGCC AGCCCAAGGC CCCAGTCACT GGAGAAAACC 6420
CCTGTCCCTA CTGGCCTGAG ACTTCCACCG CCAGAGAAAC TGCTAGTCAC CAGCGGTCCC 6480
AAACCCCAGA CTTCAGACAG ACCCCCTGAC AAATCCCATG TCTCTTTGTC CCAGAGACTT 6540
CCACCTCCTG ACAAAGTACT GTCAGCTGTG GTCCAGACCC TGGTAGCTAA AGAAAAAGCA 6600
CTGAGGCCCG TGGACCAGAA TACTCAGTCA AAAAATAGAG CTGCTTTGGT GATGGATCTC 6660
ATAGACCTAA CTCCTCGCCA GAAGGAACGG GCAGGTTCAC CCCATGAGCT CACACCACAG 6720
GCTGATGAGA AGATGCCAGT GTTGGAGTCA AGCTCATGGC CTGCCAGCAA AGGTCTAGGA 6780
CAGATACCAC GAGCTGTTGA GAGAGGCAGT GTGTCAGATG CTGTCCTTCA GCCACTGGGC 6840
AAAGCAGCGG CCCCTTCAGA ACACTCCTGG CAAGCTGTTA AATCACTCAC CCAGGCCAGA 6900
CTTCTTTCTC AGCCCCCTGC CAAGGCTTTT TTATATGAGC CAGCAACTCA GGCCTCAGGA 6960
AGAGCACCTG CAGGGGCTGA ACAGACCCCA GGGCCTCCCA GTCAAGTGCC AGGCCTGGTG 7020
AAGCAGGTGA AGCAGATGGC TGGGGGCCAG CAACTACCTG GACTCACTGC CAAGAGTGGG 7080
CAGTCCTTCA GGCCTCTTGG GAAGCCCTCC CTCTCCACGG AAGAGAAGAA GCTGGCAACC 7140
ACAGAGCAGA GTCCCTGGGC CCTGGGCAAG GCCTCGCCAG GGCCAGGGCT CTGGCCCATG 7200
GTGGCTGGAC AGACACTGGC ACAGTCTTGC TGGTCCTCCG GGAGCACACA GACACTGGCA 7260
CAGACTTGCT GGTCTCTTGG AAGAGGGCAA GACCCTAAAC CAGAGCAAAG TACACTTCCA 7320
GCTCTTAACC AGGCTCCTTC CAGTCACAAG TGTGCAGAGT CAGAACAGAA GTAA 7375
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 98 0.0 4648
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 93 0.0 4429
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 93 0.0 4418
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 93 0.0 4408
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 92 0.0 4350
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 93 0.0 4347
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 92 0.0 4337
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 90 0.0 4267
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 90 0.0 4266
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 90 0.0 4264
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 90 0.0 4260
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 90 0.0 4257
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 90 0.0 4255
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 90 0.0 4244
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 89 0.0 4215
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 89 0.0 4196
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 88 0.0 4137
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 87 0.0 4133
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 90 0.0 4131
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 89 0.0 4093
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 91 0.0 4039
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 88 0.0 4018
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 89 0.0 3912
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 82 0.0 3827
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 82 0.0 3809
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 89 0.0 3739
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3207
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3177
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 88 0.0 2737
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 77 0.0 2677
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 84 0.0 2355
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 57 0.0 2312
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 57 0.0 2311
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 58 0.0 2302
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 88 0.0 2202
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 57 0.0 2177
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 59 0.0 2096
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 51 0.0 1808
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 82 0.0 1704
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 76 0.0 1637
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 56 0.0 1494
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 76 0.0 1406
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 86 0.0 1396
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 81 0.0 1352
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 89 0.0 1351
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1279
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1246
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 62 0.0 1216
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1201
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1137
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 84 0.0 1077
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 65 0.0 1076
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 79 0.0 1075
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 77 0.0 1049
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 62 0.0 994
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 970
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 65 0.0 961
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 957
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 63 0.0 951
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 54 0.0 932
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 63 0.0 930
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 84 0.0 929
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 915
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 57 0.0 649
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 2e-121 435
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 7e-56 218
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 3e-50 199
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 8e-50 198
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 9e-50 197
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 42 3e-49 196
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 4e-49 195
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 7e-49 194
Created Date 25-Jun-2016