WERAM Information


Tag Content
WERAM ID WERAM-Ocp-0021
Ensembl Protein ID ENSOPRP00000001916.1
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSOPRG00000002027.1 ENSOPRT00000002079.1 ENSOPRP00000001916.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.20e-46 157.3 1916 2023
Me_Reader PHD 7.20e-19 67.4 1527 2136
Me_Reader PWWP 4.60e-18 64.8 1730 1790
Organism Ochotona princeps
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSOPRP00000001916.1 1916 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 2002
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikk 109
qkw+v+g++rvglfa ++ik+
ENSOPRP00000001916.1 2003 QKWSVNGDTRVGLFALSDIKA 2023
******************986 PP

  Me_Reader PHD

               PHD.txt   16 emvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52  
e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSOPRP00000001916.1 1527 ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1561
2799**99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C ++
ENSOPRP00000001916.1 1564 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLHM 1612
58****777777...55899889**********988777776658***9886 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
+C++C+ ++ + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSOPRP00000001916.1 1612 MCITCHAANPTSVSaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1665
7****86666644455677************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSOPRP00000001916.1 1682 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1723
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSOPRP00000001916.1 2094 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--RRPAG-KWECPWHQ 2136
9999..33442..9********************..*****.*****886 PP

  Me_Reader PWWP

              PWWP.txt    2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63  
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSOPRP00000001916.1 1730 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1790
589*****************************************.***************87 PP

Protein Sequence
(Fasta)
PRQDCDLPRR NYIFFFSPPN LVVLEDKDRP PRNCQYNFSL PPPTMYLTKS GPHQNAYGQD 60
SLSCYIPLRR LQDLASMINV EYLNGSADGS ESFQDPEKSD SRAQSPLRPG GAAALAVKQE 120
PRANNSPELQ GEVTETTKSG FPHFENFTGV DDADVDSEMD PEQPAPEAER IAETQSNATC 180
TAEPKSEHAV KVAVESERDG TAESSPGAVN SPFLPLAPQT ETQNNKQRSE VDGSSEKALL 240
PAPLALGDAD LPLEEQLNSI NLSFQDDPDS SSTSTLENML ELPGTSSSAT SQELPFXXXX 300
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 360
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXV PQKILSKWEA SVGLAEQYDV 420
PKGSKTRKCA SSSIKLDSEE DMPFEDCTHD PESEHGLLLN GCLESLDFDS EQSADEKEKL 480
CARKNSDNPK RTSVKKGHLQ FEAHKEVRRA KISENLGLNF ISGDASDTQA SNELSRIANS 540
LTGSSSTPAT FLFSSCGKNT AKKEFEASNC DSLLGLSEGA LISKRSGEKK KPQRGPVCGS 600
KVQLCYIGAS EEEKRSDSIS ICTTSDDGSS DLDAGEQSSE SDSGVLELAD TFDRTENMLS 660
MQKNEKITYS RYSTTNSRVK AKPKPLLTNS HTDHLINCTK TMELGTEIPP VNLSDLTVST 720
LVHKPQSDFK NDGLTPKFNT PSTISSENSL VTAGGTTNQA LLHLKSKPPK FRSIKFKART 780
PVSVEPPVPN EDCSLKCCSS DTKGSALASG PKSGKMDGLK LLSNMHEKTR DSSDIESAVV 840
KHVLSELKEL SYRSLGEDVS DSGTSKPSKP LLFSSSNQNH MPLEPDYKFS TLLMMLKDMH 900
DSKTKEQRLM TAQNLVSYRS PSRGDCSTST PAGASKVLTP GSSMHKSEKS GDGAPDTARP 960
SPGGVESPPS APLPVLAPEG RDPPASVKHR ANCVTRRHCG RSKPSKLRDA FSAQMRKDIV 1020
NRKALKIERK RKPTRLLVDA XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1080
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1140
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1200
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX VRSEKKRLRK PSKWLLEYTE 1260
EYDQIFAPKK KQKKAQEVHK VNSRCEDESL LARCQSSTQN KQVDENSLLS TKEEPPVLER 1320
EAPFLEGPLA QSELGGGNTE LPQLTLSVPV APEVSARAAL EPEELLVKTP GNYESQRQRK 1380
PTKKLLESND LNPGFMPKKG DLGFSKKCYE AGYLENGIPD SCAASHSKEL GSGTTKIFDK 1440
PRKRKRQRHI GAKVQCKKVK NADSSKGVPG SEXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1500
XXXXXXXXXX XXXXXXXXXX NCEKVGELLL CEAQCCGAFH LECLGLTEMP RGKFICNECR 1560
TGIHTCFVCK QSGEDVKRCL LPLCGKFYHE ECVQKYPPTV MQNKGFRCSL HMCITCHAAN 1620
PTSVSASKGR LMRCVRCPVA YHANDFCLAA GSKILASNSI ICPNHFTPRR GCRNHEHVNV 1680
SWCFVCSEGG SLLCCDSCPA AFHRECLNID IPEGNWYCND CKAGKKPHYR EIVWVKVGRY 1740
RWWPAEICHP RAVPSNIDKM RHDVGEFPVL FFGSNDYLWT HQARVFPYME GDVSSKDKMG 1800
KGVDGTYKKA LQEAAARFEE LKAQKELRQL QEDRKNDKKP PPYKHIKVNR PIGRVQIFTA 1860
DLSEIPRCNC KATDENPCGI DSECINRMLL YECHPTVCPA GGRCQNQCFT KRQYPEVEIF 1920
RTLQRGWGLR TKTDIKKGEF VNEYVGELID EEECRARIRY AQEHDITNFY MLTLDKDRII 1980
DAGPKGNYAR FMNHCCQPNC ETQKWSVNGD TRVGLFALSD IKAXXXXXXX XXXXXXXXXX 2040
XXXXXXXXXX XXXXXXXXXN QPIATEEKSK KFKKKQHGKR RSQGEITKER EDECFSCGDA 2100
GQLVSCKKPG CPKVYHADCL NLTRRPAGKW ECPWHQCDIC GKEAASFCEM CPSSFCKQHR 2160
EGMLFISKLD GRLSCTEHDP CGPNPLEPGE IREYVPPPVP LPAGPGAPLA EQSSGAAAQA 2220
PKVSDKASAE TNQTPLSKKA LAGTCQRPLL SEKPLERTDS RPLLLDRVRE LAGSSTKSQS 2280
LASSQKPLAR SPSVAGPRLQ LSDKPSLATG PGSSPSVRPQ PLERPLGTTD PRLDKSIGAA 2340
SPRPQSLEKT PVPSGLRLLP PDRLLVSSSP KAQTADRPPD KPHIPLSQRL PPTEKVLSAV 2400
VQTLVAKEKA LRPVDQNTQS KNRAALVMDL IDLTPRQKDR AASPPDGTLQ ADEKVPVLES 2460
SSWAAGKGLG HVLRVVEKGS VSEPLLQPPG KTVAPSEHPW QAVKSLTQAR LLSQPPAKTF 2520
LYEPATQASG RAPAGAEQTP GPPSQVPGLV KQVKQLAGSQ QPAGLAVKSG QSFRSLGKAP 2580
ASLPTEEKKL TTTEQSSWAL GKASSGAGLW PLVAGQTLVQ SCWSAGSAQT LAQTCWSLGR 2640
GQEPKSEQTT LAALSQAPSG HKCAEAEQK 2669
Nucleotide Sequence
(Fasta)
CCGCGTCAGG ACTGTGACCT CCCTAGAAGA AATTATATTT TTTTTTTTTC TCCTCCCAAT 60
CTAGTTGTTC TTGAAGACAA GGACAGACCC CCCCGCAATT GTCAATACAA TTTTTCTCTC 120
CCCCCCCCCA CTATGTATTT GACAAAAAGT GGCCCACACC AAAATGCTTA TGGACAAGAT 180
TCTCTATCTT GTTACATTCC ACTGCGGAGA CTACAGGATT TGGCCTCCAT GATCAATGTA 240
GAATATTTAA ATGGGTCTGC TGATGGTTCA GAATCCTTCC AAGACCCTGA GAAAAGTGAT 300
TCAAGAGCTC AGTCGCCCCT GAGGCCTGGC GGTGCAGCAG CACTTGCTGT GAAACAGGAA 360
CCCCGTGCTA ATAACTCCCC TGAACTCCAG GGAGAAGTTA CAGAGACTAC CAAGAGTGGC 420
TTTCCGCACT TTGAGAATTT TACTGGTGTG GACGATGCAG ATGTAGATTC TGAAATGGAC 480
CCAGAACAGC CAGCCCCAGA GGCTGAAAGG ATAGCGGAGA CTCAGAGCAA TGCCACCTGC 540
ACTGCTGAGC CTAAATCAGA GCATGCTGTA AAAGTGGCCG TGGAGAGTGA ACGAGACGGC 600
ACAGCCGAGA GTAGCCCCGG TGCAGTCAAT TCGCCATTCT TGCCATTAGC TCCTCAGACC 660
GAGACGCAGA ACAATAAGCA AAGGAGTGAA GTGGACGGCA GCAGTGAAAA AGCCCTTCTC 720
CCAGCCCCTC TTGCTCTAGG AGATGCAGAC CTTCCCCTAG AAGAGCAATT AAACTCAATA 780
AATTTATCTT TTCAGGATGA TCCAGACTCC TCCAGTACCA GTACATTAGA AAACATGCTA 840
GAATTACCTG GAACTTCATC ATCAGCCACT TCACAGGAAT TGCCATTTNN NNNNNNNNNN 900
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 960
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1020
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1080
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1140
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNGTT 1200
CCTCAGAAAA TTTTGAGTAA ATGGGAAGCC AGTGTTGGTC TTGCCGAACA GTATGATGTT 1260
CCCAAAGGGT CAAAGACTCG AAAATGTGCC AGCAGCTCGA TCAAGTTGGA CAGTGAGGAG 1320
GATATGCCGT TCGAGGACTG CACACATGAT CCTGAATCGG AGCACGGCTT GTTGCTTAAT 1380
GGCTGCTTGG AGTCTCTGGA TTTTGATTCT GAACAATCTG CAGACGAGAA GGAAAAGCTT 1440
TGTGCCCGAA AGAACTCTGA TAACCCCAAA AGGACTAGTG TCAAAAAGGG CCACCTGCAG 1500
TTTGAGGCAC ATAAGGAAGT GCGGAGGGCA AAGATCTCCG AGAACCTTGG CCTGAACTTC 1560
ATCTCTGGGG ATGCATCTGA TACGCAAGCA TCTAATGAAC TTTCCAGGAT AGCAAACAGC 1620
CTCACAGGGT CCAGTTCTAC CCCAGCAACT TTTCTGTTTT CTTCTTGTGG AAAAAACACT 1680
GCAAAGAAAG AATTTGAGGC TTCCAATTGT GACTCTTTAT TGGGCTTGTC GGAGGGTGCC 1740
TTGATTTCTA AACGTTCTGG GGAGAAAAAG AAGCCCCAGC GAGGGCCTGT GTGCGGTTCT 1800
AAAGTGCAGC TTTGCTATAT TGGAGCGAGT GAGGAGGAGA AACGCAGTGA TTCCATTAGT 1860
ATCTGCACCA CTTCTGATGA TGGAAGCAGC GACCTCGATG CTGGGGAACA GAGCTCTGAG 1920
TCAGATAGTG GTGTCCTTGA ACTTGCAGAT ACTTTTGATA GAACAGAGAA CATGTTATCC 1980
ATGCAGAAAA ATGAGAAGAT AACGTATTCT AGGTATTCTA CCACAAACTC TAGGGTAAAA 2040
GCGAAACCGA AGCCCCTCCT CACTAACTCC CATACTGACC ACTTAATAAA CTGCACCAAG 2100
ACAATGGAGC TGGGAACTGA GATACCTCCA GTTAATCTCT CTGATCTTAC GGTGTCCACT 2160
CTTGTCCACA AACCCCAATC AGACTTTAAA AATGATGGTC TCACTCCAAA ATTCAACACC 2220
CCGTCAACCA TATCCAGTGA GAACTCACTA GTGACGGCGG GTGGGACTAC CAATCAAGCT 2280
CTCTTACATT TGAAAAGTAA ACCCCCCAAG TTCCGAAGTA TAAAGTTCAA AGCCAGAACT 2340
CCAGTGAGTG TAGAACCGCC AGTTCCAAAT GAGGACTGCA GTTTGAAATG CTGCTCTTCG 2400
GATACCAAAG GCTCTGCTCT GGCCAGCGGC CCTAAGAGTG GGAAGATGGA CGGGCTGAAA 2460
CTACTGAGCA ACATGCACGA GAAGACCAGG GATTCGAGCG ACATAGAGAG CGCGGTGGTG 2520
AAGCACGTTC TGTCTGAGCT GAAGGAGCTC TCTTACAGAT CCTTAGGTGA GGATGTCAGT 2580
GACTCTGGGA CTTCAAAGCC ATCCAAACCC TTACTTTTTT CTTCTTCTAA TCAGAATCAC 2640
ATGCCTCTTG AACCAGACTA CAAATTCAGC ACATTGCTAA TGATGCTGAA AGATATGCAT 2700
GATAGCAAGA CCAAGGAGCA GCGATTGATG ACAGCTCAAA ACTTGGTCTC TTATCGGAGT 2760
CCTAGTCGGG GGGATTGTTC CACCAGTACT CCCGCAGGGG CTTCCAAGGT GCTGACTCCA 2820
GGGAGCTCCA TGCATAAGTC AGAAAAAAGT GGAGATGGCG CTCCAGACAC TGCCCGTCCT 2880
AGCCCAGGTG GGGTGGAGTC ACCGCCGTCT GCTCCCCTAC CTGTGTTAGC ACCTGAGGGA 2940
AGAGACCCCC CTGCTTCTGT TAAACACCGT GCAAACTGTG TGACGAGGCG CCACTGTGGG 3000
CGATCCAAGC CATCCAAGTT GCGTGATGCT TTTTCAGCCC AAATGAGAAA AGACATCGTG 3060
AACCGTAAGG CCTTAAAGAT AGAGCGCAAA AGAAAGCCCA CCCGCCTTCT TGTGGATGCT 3120
GNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3180
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3240
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3300
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3360
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3420
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3480
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3540
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3600
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3660
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNTT 3720
GTGCGGTCTG AGAAGAAACG CCTTAGGAAG CCAAGCAAGT GGCTTCTGGA ATATACAGAA 3780
GAGTATGATC AGATATTTGC TCCGAAGAAG AAGCAAAAGA AAGCACAGGA GGTGCACAAG 3840
GTAAATTCCC GCTGTGAAGA CGAAAGCCTT CTAGCCCGAT GTCAATCTAG TACCCAGAAC 3900
AAGCAGGTGG ATGAGAATTC TTTACTTTCA ACCAAAGAAG AGCCTCCAGT TCTTGAAAGG 3960
GAGGCTCCCT TTTTGGAGGG GCCCTTGGCT CAGTCAGAGC TTGGAGGTGG AAATACTGAG 4020
TTGCCGCAGC TAACACTGTC TGTGCCTGTG GCTCCGGAAG TCTCTGCACG AGCTGCCCTT 4080
GAGCCTGAGG AACTGCTTGT TAAAACACCA GGAAATTATG AAAGTCAGAG ACAACGGAAG 4140
CCAACTAAGA AGCTTCTTGA ATCCAATGAT TTAAACCCTG GATTTATGCC TAAGAAGGGT 4200
GACCTTGGCT TTTCTAAAAA GTGTTATGAA GCTGGCTATT TGGAGAATGG GATTCCTGAT 4260
TCATGTGCTG CGTCTCATTC AAAAGAGTTG GGCAGTGGCA CTACCAAAAT TTTTGATAAA 4320
CCAAGAAAGC GAAAGCGACA GAGGCATATT GGAGCTAAGG TGCAGTGTAA AAAAGTGAAA 4380
AATGCCGACT CATCCAAAGG GGTGCCAGGC TCAGAGNNNN NNNNNNNNNN NNNNNNNNNN 4440
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4500
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 4560
AATTGTGAGA AGGTGGGTGA GCTGCTGTTA TGTGAGGCTC AGTGCTGCGG GGCTTTCCAC 4620
CTCGAGTGCC TTGGGTTAAC GGAGATGCCC AGAGGAAAAT TTATCTGCAA CGAATGTCGC 4680
ACAGGAATAC ATACCTGTTT TGTATGTAAG CAGAGCGGGG AAGATGTTAA AAGATGCCTG 4740
TTGCCTTTAT GTGGGAAGTT CTACCATGAA GAGTGTGTGC AGAAGTACCC ACCCACGGTC 4800
ATGCAGAACA AGGGCTTCCG GTGCTCCCTC CACATGTGCA TCACCTGCCA TGCTGCCAAC 4860
CCCACGAGTG TCTCTGCATC TAAAGGTCGA CTGATGCGTT GTGTGCGCTG TCCGGTGGCA 4920
TACCACGCCA ATGACTTCTG CCTCGCTGCC GGGTCCAAGA TCCTCGCATC TAACAGCATC 4980
ATCTGCCCGA ATCACTTCAC CCCTAGACGC GGCTGTCGGA ATCACGAGCA TGTTAATGTT 5040
AGCTGGTGTT TTGTGTGCTC AGAAGGAGGC AGCCTGCTGT GCTGTGATTC TTGCCCTGCT 5100
GCTTTTCATC GTGAATGCCT GAACATTGAT ATCCCTGAAG GAAACTGGTA TTGCAATGAC 5160
TGCAAGGCGG GAAAGAAGCC ACACTACAGG GAGATTGTCT GGGTGAAGGT CGGACGATAC 5220
AGGTGGTGGC CAGCAGAGAT CTGCCATCCG CGAGCTGTTC CCTCCAACAT TGATAAGATG 5280
AGACATGATG TGGGCGAGTT CCCTGTCCTC TTCTTTGGGT CAAATGACTA TCTGTGGACC 5340
CACCAGGCCC GAGTGTTCCC CTATATGGAA GGAGACGTGA GCAGCAAGGA CAAGATGGGC 5400
AAAGGAGTGG ATGGAACGTA TAAAAAAGCT CTTCAGGAAG CTGCAGCAAG GTTTGAAGAG 5460
CTAAAGGCAC AGAAAGAGCT ACGACAGCTG CAGGAAGACC GAAAGAATGA TAAGAAGCCC 5520
CCACCTTATA AGCATATAAA GGTGAACCGC CCTATAGGCA GGGTGCAGAT CTTCACCGCG 5580
GACTTGTCCG AAATCCCCCG TTGCAACTGT AAGGCCACTG ACGAGAACCC TTGTGGCATC 5640
GACTCGGAGT GCATCAACCG CATGCTGCTG TATGAGTGCC ACCCCACGGT GTGCCCTGCT 5700
GGAGGGCGCT GTCAGAACCA GTGCTTCACC AAGCGCCAGT ACCCCGAGGT GGAGATTTTC 5760
CGCACCTTGC AGAGAGGTTG GGGCCTGCGG ACAAAAACAG ATATTAAGAA GGGAGAGTTT 5820
GTGAATGAGT ATGTGGGTGA GCTAATAGAC GAAGAAGAAT GCAGAGCTCG AATCCGTTAT 5880
GCCCAAGAAC ATGATATCAC CAATTTCTAT ATGCTGACCC TAGATAAAGA CCGAATCATC 5940
GATGCTGGCC CCAAAGGAAA CTATGCGCGG TTCATGAATC ATTGCTGCCA GCCAAACTGT 6000
GAGACTCAGA AATGGTCTGT GAATGGCGAT ACCCGTGTTG GTCTTTTTGC CCTGAGTGAC 6060
ATTAAAGCAG NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 6120
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNAAT 6180
CAGCCCATTG CCACGGAAGA AAAGTCTAAG AAGTTCAAGA AGAAGCAACA TGGAAAGCGC 6240
CGGAGCCAGG GCGAGATCAC GAAGGAGCGA GAGGATGAGT GCTTCAGCTG CGGGGACGCT 6300
GGTCAGCTCG TGTCCTGCAA AAAGCCTGGC TGCCCGAAAG TCTACCATGC AGACTGTCTG 6360
AACCTGACCC GGAGGCCAGC AGGGAAGTGG GAGTGTCCAT GGCATCAGTG TGACATCTGT 6420
GGGAAGGAAG CAGCCTCCTT CTGTGAGATG TGCCCCAGCT CCTTCTGCAA GCAGCACCGG 6480
GAAGGCATGC TCTTCATTTC CAAACTGGAC GGGCGTCTCT CTTGTACTGA GCATGACCCT 6540
TGTGGGCCCA ACCCTCTGGA GCCTGGGGAG ATCCGTGAGT ATGTGCCTCC CCCGGTCCCG 6600
CTGCCTGCCG GCCCGGGGGC TCCCCTGGCA GAACAGTCAT CAGGGGCAGC TGCTCAGGCG 6660
CCCAAGGTAT CAGATAAGGC ATCTGCTGAA ACCAACCAGA CGCCACTCTC TAAGAAAGCT 6720
CTGGCAGGGA CTTGTCAGAG GCCACTGCTG TCTGAAAAAC CTCTTGAAAG AACTGACTCC 6780
AGGCCCCTGC TTTTAGATAG GGTCAGAGAA CTGGCTGGAT CAAGCACCAA ATCCCAGTCC 6840
TTGGCATCCA GCCAAAAGCC ATTGGCCAGG TCACCTTCAG TGGCAGGACC AAGACTGCAG 6900
CTATCTGACA AACCTTCTCT GGCAACTGGC CCAGGCTCCT CACCCTCTGT CAGGCCCCAG 6960
CCACTGGAGA GACCTTTGGG AACAACTGAC CCACGTCTGG ATAAATCTAT AGGTGCTGCC 7020
AGCCCAAGGC CTCAGTCACT AGAGAAAACC CCAGTTCCTT CCGGCCTGAG ACTTCTGCCA 7080
CCAGACAGGC TGCTAGTGAG CAGCAGTCCC AAAGCCCAGA CTGCAGACCG ACCCCCGGAC 7140
AAGCCCCATA TCCCTCTATC CCAGAGACTC CCGCCTACAG AGAAAGTCCT GTCCGCTGTG 7200
GTCCAGACTT TGGTGGCTAA AGAGAAAGCA CTAAGGCCTG TGGACCAGAA TACTCAGTCA 7260
AAAAACAGAG CTGCTTTAGT GATGGATCTC ATAGACCTGA CCCCTCGCCA GAAGGACCGG 7320
GCAGCATCTC CTCCTGATGG CACTCTGCAG GCTGATGAGA AAGTGCCAGT GTTGGAATCA 7380
AGCTCATGGG CTGCCGGCAA AGGCCTGGGG CATGTGCTGC GAGTGGTTGA AAAAGGCAGC 7440
GTGTCAGAAC CTCTTCTCCA ACCACCTGGA AAGACGGTGG CCCCTTCAGA GCACCCCTGG 7500
CAAGCTGTTA AATCACTCAC CCAGGCTAGG CTTCTTTCTC AGCCTCCTGC CAAGACTTTT 7560
TTATATGAGC CAGCAACTCA GGCCTCAGGA AGAGCTCCTG CAGGGGCTGA GCAGACCCCA 7620
GGACCTCCCA GCCAAGTACC AGGCCTGGTG AAGCAGGTGA AGCAGTTGGC CGGAAGCCAG 7680
CAACCAGCTG GACTTGCTGT CAAAAGTGGG CAGTCTTTCA GGTCTCTTGG GAAGGCCCCC 7740
GCCTCCCTCC CTACTGAAGA GAAGAAGTTG ACAACTACAG AGCAGAGCTC CTGGGCCTTG 7800
GGAAAAGCCT CCTCAGGGGC AGGGCTGTGG CCGCTGGTGG CTGGACAGAC GCTGGTACAG 7860
TCTTGCTGGT CTGCTGGGAG TGCACAGACA TTGGCACAAA CCTGCTGGTC TCTTGGAAGA 7920
GGGCAAGAGC CCAAATCAGA GCAAACTACA CTTGCAGCTC TAAGCCAGGC TCCTTCCGGT 7980
CACAAGTGTG CAGAGGCGGA ACAGAAGTGA 8011
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 87 0.0 2448
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 84 0.0 2382
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 84 0.0 2377
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 84 0.0 2377
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 85 0.0 2376
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 84 0.0 2376
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 84 0.0 2374
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 85 0.0 2374
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 85 0.0 2371
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 85 0.0 2371
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 84 0.0 2368
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 85 0.0 2366
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 85 0.0 2360
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 84 0.0 2357
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 83 0.0 2341
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 84 0.0 2330
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 84 0.0 2320
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 82 0.0 2319
WERAM-Loa-0062 ENSLAFP00000004572.4 Loxodonta africana 82 0.0 2312
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 83 0.0 2288
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 82 0.0 2259
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 83 0.0 2252
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 80 0.0 2244
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 79 0.0 2225
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 79 0.0 2212
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 86 0.0 2151
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 84 0.0 2085
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 84 0.0 1950
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 1930
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 68 0.0 1842
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 85 0.0 1718
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 61 0.0 1461
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 60 0.0 1459
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 74 0.0 1455
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 61 0.0 1445
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 61 0.0 1414
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 81 0.0 1392
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 90 0.0 1372
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 70 0.0 1368
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 66 0.0 1296
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 66 0.0 1262
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 81 0.0 1206
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 84 0.0 1203
WERAM-Dar-0128 ENSDARP00000078549.4 Danio rerio 73 0.0 1120
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 72 0.0 1095
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 73 0.0 1092
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 67 0.0 1057
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 72 0.0 1043
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 73 0.0 993
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 71 0.0 968
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 83 0.0 918
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 62 0.0 901
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 59 0.0 883
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 66 0.0 880
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 58 0.0 863
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 59 0.0 852
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 91 0.0 834
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 74 0.0 806
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 70 0.0 780
WERAM-Ere-0029 ENSEEUP00000002285.1 Erinaceus europaeus 54 0.0 774
WERAM-Ten-0192 ENSTNIP00000019016.1 Tetraodon nigroviridis 50 0.0 747
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 59 0.0 721
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 51 3e-166 585
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 80 4e-135 481
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 40 7e-112 404
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 40 1e-54 214
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 2e-40 167
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 42 5e-40 165
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 42 5e-40 165
WERAM-Arl-0052 fgenesh2_kg.2__ 2024__ AT1G77300.1 Arabidopsis lyrata 42 1e-39 164
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 2e-39 163
WERAM-Met-0069 KEH35350 Medicago truncatula 41 6e-38 159
WERAM-Orbr-0026 OB02G27880.1 Oryza brachyantha 38 6e-38 159
WERAM-Chr-0033 EDP02327 Chlamydomonas reinhardtii 40 7e-38 158
WERAM-Prp-0004 EMJ23127 Prunus persica 45 7e-38 158
WERAM-Viv-0111 VIT_18s0001g01700.t01 Vitis vinifera 45 1e-37 157
Created Date 25-Jun-2016