WERAM Information


Tag Content
WERAM ID WERAM-Loa-0062
Ensembl Protein ID ENSLAFP00000004572.4
Gene Name NSD1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSLAFG00000005450.4 ENSLAFT00000005453.4 ENSLAFP00000004572.4
ENSLAFG00000005450.4 ENSLAFT00000034875.1 ENSLAFP00000022032.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT SET2 1.60e-52 177 1593 1709
HMT SET1 2.10e-29 102.4 1593 1709
Me_Reader PWWP 9.30e-21 74.1 4 1467
Me_Reader PHD 9.90e-20 70.8 1194 1813
Organism Loxodonta africana
Domain Profile
  HMT SET2

SET2.txt 2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSLAFP00000004572.4 1593 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 1679
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSLAFP00000004572.4 1680 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 1709
*****************************8 PP

  HMT SET1

SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSLAFP00000004572.4 1593 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 1677
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSLAFP00000004572.4 1678 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1709
*******************************7 PP

  Me_Reader PWWP

PWWP.txt 33 eaeenkylVlFFgnkherawvkrkklvpys 62
++ ++y V+ Fg+ erawv k +v ++
ENSLAFP00000004572.4 4 RRPYRQYYVEAFGDPSERAWVAGKAIVMFE 33
567799****************99998775 PP
PWWP.txt 2 gdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
++Vw+K+++Y+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSLAFP00000004572.4 1407 REIVWVKVGRYRWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1467
589*****************************************.***************87 PP

  Me_Reader PHD

PHD.txt 2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCke 52
++C+ C ++ g e++ C+ C +fHl+C++l+ ++p g +++C++C++
ENSLAFP00000004572.4 1194 NVCQNC--EKLG--ELLLCEAqCCGAFHLECLGLT--EMPRG-KFICNECRT 1238
678888..3333..289***99*************..*****.*******96 PP
PHD.txt 2 tiClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsC 50
+C+vC++++e+ + C C + +H +Cv+ ++ ++k ++C +
ENSLAFP00000004572.4 1241 HTCFVCKQSGED---VKRCLLplCGKFYHEECVQKYPPTVMQNKGFRCSLH 1288
58****777777...55899889**********988777776657999876 PP
PHD.txt 3 iClvCgkddegeke.....mvqCdeCddwfHlk..CvklplsslpegkswyCpsC 50
iC++C+ ++ + + ++ C C+ ++H++ C+ s++ +++s +Cp++
ENSLAFP00000004572.4 1289 ICITCHAANPASVTaskgrLMRCVRCPVAYHANdfCLAAG-SKILASNSIICPNH 1342
8****87777754466778************965599988.55555558999998 PP
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52
C+vC ++ g+ +++Cd+C+ +fH +C+++ +peg +wyC+ Ck+
ENSLAFP00000004572.4 1359 WCFVC--SEGGS--LLCCDSCPAAFHRECLNID---IPEG-NWYCNDCKA 1400
7****..44443..9******************...****.*******85 PP
PHD.txt 4 ClvCgkddegekemvqCde..CddwfHlkCvklplsslpegkswyCpsCk 51
C+ C d+g+ +v C++ C++ +H++C++l+ + p+g +w Cp ++
ENSLAFP00000004572.4 1771 CFSC--GDAGQ--LVSCKKpgCPKVYHADCLNLT--KRPAG-KWECPWHQ 1813
9999..33442..9********************..*****.*****886 PP

Protein Sequence
VSNRRPYRQY YVEAFGDPSE RAWVAGKAIV MFEGRHQFEE LPVLRRRGKQ KEKGYRHKVP 60
QKILSKWEAS VGLAEQYDVP KGPKNRKCVR SSIKLDSEED MPFEDCTNDP ESEHDLLLNG 120
CLNSLAFDSE HSADEKEKPC AKSRVRKSSD NPKRTSVKKS HMQFEAHKEE RRGKIPENLG 180
LNFISGDVSD KQASNELSRI ANSLTGSNTA PRSFLFSSCG KNTAKKEFES SNCDSLLGLP 240
EGALISKRSG EKKKPQRGLV CSSKVQLCYI GAGDEEKRSD SISICTTSDD GSSDLDPVDQ 300
SSESDNSVLE ITDAFDRTEN MLSMQKNEKI KHSRFPATNT RVKAKQKSLI TNSHTDHLMD 360
CTKTAEPGTE TSQVNLSDLK ASTLVCKPSS DFRNDSLPPK FNTSPSISSE NSLIKGANTN 420
QALLHSKSKQ PKIRSIKCKH KENPVVVEPP ATNEDCSLKC CSSDTKGSPL ASISKSGKVD 480
GLKLLSNMHE KTRDSSDIET AVVKHVLSEL KELSYRSLSE DVSDSGTSKP SKPLLFTSAS 540
GQNHIPIEPD YKFSTLLMML KDMHDSKTKE QRLMTAQNLV SYRSPGPGDC STSSPAGVSR 600
ILVSGGSNHN SEKNGDGTQD SAHPSPSGGD AVLSGELSTS LPGLVSDRRD LPASGKSRSN 660
CVTRRNCGRS KLSSRLQDGF SAQLGKNTVN RKALKTERKR KLNRIPGVTL QAALQGDREN 720
GASVSVSSRG GGEDHGKEEP LQLKGHLTSE DCDHFSDVHF GNKVKQSDLD KIPEKAPSFE 780
NRKGPELDSE MNSENDGHNG VHQVVPKKRW QRLNQRRTKP RKRTNRFREK ENSEGAFGVL 840
LPGDPVQKGG DFSEHRPPTS TNVLEDALTD PNCAGRLDSA GPRLNVCDKT SASTEEMEKE 900
PGIPSLTPQS ELPEPAVRSE KKRLRKPSKW LLEYTEEYDQ IFAPKKKQKR VQEHTHKVSS 960
RCEEESLLAR CRPSAQNKQV DENSLISTKE EPPVLEREAP FLEGPLAQSE LGGGHAELPQ 1020
LTLSVPVAPE ISPRPALESE ELLVKPSGNY EGKRQRKPTK KLLESNDLDP GFMPKKGDLG 1080
YSKKCYETGH LENDITESCG ATHSKQFGEG TTKMFDKPRK RKRQRHATAK VQCKKVKNDD 1140
RSKETASSEG ELMTHRTAAS PKEAIEEGVE HDHGMPVSKR MQGERGGGAA LKENVCQNCE 1200
KLGELLLCEA QCCGAFHLEC LGLTEMPRGK FICNECRTGI HTCFVCKQSG EDVKRCLLPL 1260
CGKFYHEECV QKYPPTVMQN KGFRCSLHIC ITCHAANPAS VTASKGRLMR CVRCPVAYHA 1320
NDFCLAAGSK ILASNSIICP NHFTPRRGCR NHEHVNVSWC FVCSEGGSLL CCDSCPAAFH 1380
RECLNIDIPE GNWYCNDCKA GKKPHYREIV WVKVGRYRWW PAEICHPRAV PSNIDKMRHD 1440
VGEFPVLFFG SNDYLWTHQA RVFPYMEGDV SSKDKMGKGV DGTYKKALQE AAARFEELKA 1500
QKELRQLQED RKNDKKPPPY KHIKVNRPIG RVQIFTADLS EIPRCNCKAT DENPCGIDSE 1560
CINRMLLYEC HPTVCPAGGR CQNQCFTKRQ YPEVEIFRTL QRGWGLRTKT DIKKGEFVNE 1620
YVGELIDEEE CRARIRYAQE HDITNFYMLT LDKDRIIDAG PKGNYARFMN HCCQPNCETQ 1680
KWSVNGDTRV GLFALSDIKA GTELTFNYNL ECLGNGKTVC KCGAPNCSGF LGVRPKNQPI 1740
ATEEKSKKFK KKQQGKRRTQ GEVTKEREDE CFSCGDAGQL VSCKKPGCPK VYHADCLNLT 1800
KRPAGKWECP WHQCDICGKE AASFCEMCPS SFCKQHREGM LFISKLDGRL SCTEHDPCGP 1860
NPLEPGEIRE YVPPPVPLPP GPSSHPVEQS SGLAAQGPKM SEKPPADTNQ TLSLSKKALA 1920
GTCQRPLLPE RPLERTDCRP QPLDRVRDLA GPGTKPQSLG SSQRPLDRPP AMAGPRPQLS 1980
DKPSPVTSPS SSPSVRSQPL ERPLGMAGSR LDKSIGAASP RHQPVEKAPI PTGLRLPPPD 2040
RLLITSGPRP QTSDRPPDKS HASLSQRLPP PEKVLSAVVQ TLVAKEKALR PVDQNTQSKN 2100
RAALVMDLID LTPRQKERGP SPQEGTPQAD EKMPVLESSS WPASKCLGQM PRASERGSVS 2160
DPVIPPPGKA AVPSEHPWQA VKSLTQARLL SQPPAKGFLY EPATQASGRA PAEAEQTPGL 2220
PSQAPGLVKQ MAGGQQLPGL AAKGTTLSGQ SFRPLGNAPA SHPAEEKKLA TTEQSPWVLG 2280
KASLGPGLWP MVAGQTLMPP CWSSGSTQTL AQTCWSLGRG QDPKSEQNTV PALNQAPSCH 2340
KGAESEQK 2348
Nucleotide Sequence
GTTTCCAATC GGAGGCCCTA TCGGCAGTAC TACGTGGAGG CTTTTGGAGA CCCTTCTGAG 60
AGAGCCTGGG TGGCTGGAAA AGCAATCGTC ATGTTTGAAG GCAGACATCA ATTCGAAGAG 120
CTACCTGTCC TTAGGAGAAG AGGGAAGCAG AAAGAAAAAG GATATAGGCA TAAGGTTCCT 180
CAGAAAATTT TGAGTAAATG GGAAGCCAGT GTTGGTCTTG CTGAACAGTA TGATGTTCCC 240
AAAGGGCCAA AGAACCGAAA ATGTGTCAGA AGTTCGATCA AGTTGGACAG TGAGGAGGAT 300
ATGCCTTTTG AGGACTGTAC AAATGACCCT GAATCAGAAC ATGATCTTTT GCTTAATGGC 360
TGTTTGAATT CTCTGGCTTT TGACTCTGAA CATTCTGCAG ATGAGAAGGA AAAGCCATGT 420
GCTAAGTCTC GAGTCAGAAA GAGCTCTGAT AATCCAAAAA GGACTAGTGT GAAAAAGAGC 480
CACATGCAAT TTGAAGCACA TAAGGAAGAA CGGAGGGGAA AGATTCCAGA GAACCTTGGC 540
CTAAACTTTA TTTCTGGGGA TGTATCTGAT AAGCAGGCCT CTAATGAACT TTCCAGGATA 600
GCAAACAGCC TCACAGGGTC CAACACTGCC CCAAGAAGTT TCCTGTTTTC TTCTTGTGGA 660
AAAAACACTG CAAAGAAAGA ATTTGAGAGT TCAAATTGTG ACTCTTTACT GGGCTTGCCT 720
GAGGGTGCCT TGATCTCTAA ACGTTCTGGG GAAAAGAAGA AACCCCAAAG AGGTCTGGTT 780
TGCAGTTCAA AGGTGCAGCT CTGCTATATT GGAGCGGGTG ATGAGGAAAA GCGAAGTGAT 840
TCCATTAGCA TCTGTACCAC TTCTGATGAT GGAAGCAGTG ATCTGGATCC TGTAGATCAG 900
AGCTCAGAGT CTGATAACAG TGTCCTTGAA ATTACGGATG CTTTTGATAG AACTGAGAAC 960
ATGTTATCCA TGCAGAAAAA TGAAAAGATA AAGCATTCTC GGTTTCCTGC CACAAACACT 1020
AGGGTAAAAG CAAAGCAAAA GTCCCTCATT ACCAACTCAC ATACAGACCA CTTAATGGAT 1080
TGTACTAAGA CGGCAGAGCC TGGAACTGAG ACATCTCAGG TTAATCTCTC TGATCTTAAG 1140
GCATCCACCC TTGTCTGTAA ACCCTCCTCG GACTTTAGAA ATGACAGTCT CCCTCCAAAA 1200
TTCAACACAT CACCAAGCAT TTCCAGTGAG AACTCACTAA TAAAGGGTGC AAATACAAAT 1260
CAAGCTCTGT TACATTCAAA AAGTAAACAG CCCAAGATCC GAAGTATAAA ATGCAAACAC 1320
AAAGAAAATC CAGTTGTGGT AGAACCCCCA GCTACAAATG AGGACTGTAG TTTGAAATGC 1380
TGCTCTTCTG ATACCAAAGG CTCTCCTTTG GCCAGCATTT CCAAAAGTGG GAAAGTGGAT 1440
GGGCTGAAAC TACTGAGCAA CATGCATGAG AAAACCAGGG ATTCGAGTGA CATAGAAACA 1500
GCAGTGGTGA AACACGTTCT GTCTGAGTTG AAGGAACTCT CCTATAGATC CTTAAGTGAG 1560
GATGTCAGTG ATTCTGGAAC ATCAAAGCCA TCAAAACCAT TACTTTTTAC TTCTGCCTCT 1620
GGCCAGAATC ATATACCGAT TGAACCAGAC TACAAATTCA GCACATTACT AATGATGTTG 1680
AAAGATATGC ATGATAGTAA GACCAAGGAG CAACGGTTGA TGACTGCTCA AAACCTGGTC 1740
TCCTATCGGA GTCCTGGTCC TGGGGATTGT TCTACTAGTA GTCCTGCAGG GGTTTCTAGG 1800
ATCTTGGTTT CAGGAGGCTC CAACCACAAT TCAGAAAAAA ATGGAGATGG CACTCAGGAC 1860
TCAGCCCATC CCAGCCCTAG TGGGGGCGAC GCTGTACTGT CTGGGGAGTT GTCTACCTCC 1920
TTACCTGGCT TAGTGTCTGA CAGAAGAGAT CTTCCTGCTT CTGGGAAAAG TCGTTCAAAC 1980
TGTGTTACTA GGCGCAACTG TGGGCGATCA AAGCTGTCAT CCAGATTGCA AGATGGTTTT 2040
TCAGCCCAGT TGGGAAAGAA CACAGTGAAC CGGAAGGCCT TAAAAACAGA GCGCAAAAGA 2100
AAATTGAACC GGATTCCAGG TGTGACTCTT CAGGCTGCGC TGCAAGGAGA CAGAGAAAAT 2160
GGAGCCTCAG TGAGTGTCTC TTCAAGGGGT GGGGGAGAAG ACCATGGTAA AGAAGAGCCC 2220
CTTCAATTAA AGGGCCATTT AACAAGTGAA GATTGTGACC ATTTTTCTGA TGTTCATTTT 2280
GGTAACAAGG TCAAACAGTC TGACCTTGAT AAAATTCCTG AGAAAGCCCC CTCTTTTGAA 2340
AACAGAAAAG GCCCAGAACT GGACTCCGAA ATGAACAGTG AGAATGATGG ACACAATGGT 2400
GTACATCAAG TGGTGCCTAA AAAGCGGTGG CAGCGTTTAA ACCAAAGGCG CACTAAACCT 2460
CGTAAGCGCA CTAACAGATT TAGGGAGAAA GAAAACTCTG AGGGTGCCTT TGGGGTCTTG 2520
CTTCCTGGTG ACCCTGTGCA GAAGGGGGGT GACTTCTCTG AGCATAGACC TCCTACTTCG 2580
ACAAACGTAC TGGAAGATGC ACTGACAGAT CCGAATTGTG CAGGCCGCTT AGATTCAGCT 2640
GGGCCACGGT TGAATGTTTG TGATAAGACC AGTGCTAGCA CTGAGGAGAT GGAAAAGGAA 2700
CCAGGAATTC CCAGTTTGAC CCCCCAGTCT GAGCTCCCGG AACCAGCTGT GCGGTCAGAG 2760
AAGAAACGCC TTAGGAAGCC AAGCAAGTGG CTTCTGGAAT ATACTGAAGA ATATGATCAG 2820
ATATTTGCTC CTAAGAAAAA ACAAAAGAGA GTTCAGGAAC ACACGCACAA GGTAAGTTCC 2880
CGCTGTGAAG AGGAAAGCCT TCTAGCCCGA TGTCGACCTA GTGCTCAAAA CAAACAAGTG 2940
GATGAGAATT CTTTGATTTC AACCAAAGAA GAGCCTCCAG TTCTTGAAAG AGAGGCTCCG 3000
TTTTTGGAAG GGCCCTTGGC TCAGTCGGAA CTTGGAGGTG GACATGCTGA GTTGCCACAG 3060
CTGACCTTGT CTGTGCCTGT GGCTCCGGAA ATCTCTCCAC GGCCTGCCCT TGAGTCTGAG 3120
GAATTGCTAG TTAAACCATC AGGAAACTAT GAAGGTAAGC GTCAGAGAAA ACCAACCAAG 3180
AAACTTCTTG AATCCAATGA TTTAGACCCT GGATTTATGC CCAAGAAGGG GGATCTGGGC 3240
TATTCTAAAA AGTGTTATGA AACTGGCCAC TTGGAGAATG ACATTACTGA ATCATGTGGC 3300
GCAACTCATT CTAAACAGTT TGGTGAAGGT ACTACCAAGA TGTTTGATAA ACCAAGGAAG 3360
CGAAAACGTC AGAGGCACGC TACAGCCAAG GTGCAGTGTA AAAAAGTGAA AAATGATGAC 3420
CGATCAAAGG AGACTGCAAG CTCAGAGGGA GAACTGATGA CACACAGGAC GGCGGCAAGC 3480
CCCAAAGAGG CCATTGAGGA GGGCGTAGAG CATGACCATG GGATGCCTGT GTCTAAAAGA 3540
ATGCAAGGCG AACGCGGTGG AGGAGCTGCA CTCAAGGAGA ATGTTTGTCA GAACTGTGAG 3600
AAACTGGGCG AGCTGCTGTT GTGTGAGGCT CAGTGCTGTG GGGCGTTCCA CCTGGAGTGC 3660
CTTGGGTTAA CTGAGATGCC AAGAGGAAAA TTTATCTGCA ATGAATGTCG CACAGGAATC 3720
CATACCTGTT TTGTATGTAA ACAGAGTGGG GAAGATGTTA AAAGGTGCCT TCTGCCCTTG 3780
TGTGGAAAAT TTTATCATGA AGAGTGTGTC CAGAAATACC CACCCACTGT CATGCAAAAC 3840
AAGGGCTTCC GGTGCTCCCT TCACATCTGT ATAACCTGCC ATGCTGCTAA TCCAGCCAGT 3900
GTTACTGCAT CTAAAGGTCG CCTGATGCGC TGTGTTCGCT GCCCTGTGGC ATACCATGCC 3960
AATGACTTTT GCTTGGCTGC TGGGTCAAAG ATCCTTGCAT CTAATAGTAT CATCTGCCCT 4020
AATCACTTTA CCCCTAGGCG GGGTTGTCGA AATCATGAGC ATGTTAATGT TAGCTGGTGT 4080
TTTGTGTGCT CAGAAGGAGG CAGCCTTCTG TGCTGTGATT CTTGCCCTGC TGCTTTTCAT 4140
CGTGAATGCC TGAACATTGA TATCCCTGAA GGAAACTGGT ATTGCAATGA CTGTAAAGCA 4200
GGCAAAAAGC CACACTACAG GGAGATTGTC TGGGTAAAAG TTGGGCGATA CAGGTGGTGG 4260
CCAGCTGAGA TCTGCCATCC TCGCGCTGTA CCTTCCAACA TCGATAAGAT GAGACATGAT 4320
GTGGGCGAGT TCCCTGTACT CTTCTTTGGG TCTAATGACT ATCTGTGGAC CCACCAGGCC 4380
AGAGTCTTCC CCTACATGGA AGGGGATGTT AGCAGCAAGG ATAAGATGGG CAAAGGAGTG 4440
GATGGGACGT ATAAAAAAGC TCTTCAGGAA GCTGCAGCAA GGTTTGAGGA GTTAAAGGCC 4500
CAAAAAGAGC TGAGACAGCT GCAGGAAGAC CGTAAGAATG ACAAGAAGCC ACCACCTTAT 4560
AAACATATAA AGGTGAACCG TCCTATTGGC AGGGTCCAGA TCTTTACCGC AGACTTGTCT 4620
GAAATTCCCC GTTGCAACTG TAAGGCTACT GATGAGAACC CCTGTGGGAT AGACTCCGAA 4680
TGCATCAACC GCATGCTGCT CTATGAGTGC CATCCTACAG TATGTCCTGC CGGGGGACGC 4740
TGCCAAAACC AGTGCTTCAC CAAGCGCCAG TATCCAGAGG TTGAAATTTT CCGCACGTTG 4800
CAGAGGGGCT GGGGCCTCCG AACAAAAACA GATATTAAAA AGGGTGAATT TGTGAATGAG 4860
TATGTGGGTG AGCTAATAGA TGAAGAAGAG TGCAGAGCTC GAATTCGTTA TGCCCAAGAA 4920
CACGATATCA CTAATTTCTA TATGCTCACC CTAGACAAAG ACCGGATCAT TGATGCTGGT 4980
CCCAAAGGAA ACTATGCCCG GTTCATGAAT CATTGCTGCC AGCCCAACTG TGAAACACAG 5040
AAGTGGTCTG TGAATGGAGA CACCCGTGTT GGCCTTTTTG CCCTGAGTGA CATCAAAGCA 5100
GGCACAGAAC TTACCTTCAA CTACAACCTG GAATGTCTTG GGAATGGAAA GACTGTTTGC 5160
AAATGTGGAG CCCCGAATTG CAGTGGCTTC CTGGGTGTAA GGCCAAAGAA TCAGCCCATT 5220
GCCACAGAAG AAAAGTCCAA GAAATTCAAG AAGAAGCAAC AGGGCAAGCG CAGAACCCAG 5280
GGTGAAGTCA CAAAGGAGCG AGAGGATGAG TGCTTCAGCT GTGGGGATGC CGGGCAGCTT 5340
GTCTCTTGCA AGAAGCCAGG CTGCCCAAAA GTCTACCACG CAGACTGTCT CAATCTAACC 5400
AAGCGCCCAG CAGGGAAATG GGAGTGTCCT TGGCATCAGT GTGACATATG TGGAAAAGAA 5460
GCAGCCTCCT TCTGTGAGAT GTGCCCCAGC TCGTTTTGCA AGCAGCATAG GGAAGGAATG 5520
CTCTTCATCT CCAAACTGGA TGGGCGTCTG TCTTGTACTG AGCATGATCC CTGTGGGCCC 5580
AACCCTCTGG AACCCGGGGA GATCCGTGAG TATGTGCCTC CCCCAGTACC ACTGCCTCCA 5640
GGCCCAAGCT CTCACCCAGT AGAGCAATCA TCAGGATTGG CTGCTCAGGG GCCCAAGATG 5700
TCGGAAAAGC CGCCTGCTGA CACCAACCAG ACACTGTCGC TGTCCAAGAA AGCTCTGGCA 5760
GGAACTTGTC AGAGGCCACT GCTGCCTGAA AGACCTCTTG AAAGAACTGA CTGCAGGCCC 5820
CAGCCTTTAG ACCGGGTCAG GGACCTTGCT GGGCCAGGGA CCAAACCCCA ATCCTTGGGA 5880
TCCAGCCAGA GGCCATTGGA CAGGCCTCCT GCAATGGCAG GACCAAGACC CCAGCTCTCT 5940
GACAAACCCT CTCCAGTAAC CAGCCCAAGT TCTTCACCTT CAGTTAGGTC CCAACCACTG 6000
GAAAGACCTC TGGGGATGGC CGGCTCAAGG CTGGATAAAT CCATAGGTGC TGCCAGCCCA 6060
AGGCATCAGC CAGTGGAGAA AGCCCCAATC CCCACTGGCC TGAGACTTCC GCCGCCAGAC 6120
AGACTGCTAA TTACCAGTGG TCCCAGGCCC CAGACTTCAG ACCGGCCCCC TGACAAATCC 6180
CATGCCTCTT TATCCCAGAG ACTCCCACCT CCTGAGAAAG TACTATCAGC TGTGGTCCAG 6240
ACACTTGTAG CTAAAGAAAA AGCGCTGAGG CCCGTGGACC AGAATACTCA GTCAAAAAAT 6300
AGAGCTGCTT TGGTTATGGA TCTCATAGAC CTAACTCCTC GCCAGAAGGA GCGAGGACCT 6360
TCTCCTCAGG AGGGCACGCC ACAGGCTGAT GAGAAGATGC CAGTGTTGGA GTCAAGCTCA 6420
TGGCCTGCGA GTAAATGTCT GGGGCAGATG CCTCGAGCTA GTGAGAGAGG TAGCGTGTCA 6480
GACCCTGTCA TCCCACCACC TGGGAAAGCA GCGGTCCCTT CAGAGCATCC CTGGCAAGCT 6540
GTTAAATCAC TCACCCAGGC CAGACTTCTT TCTCAGCCCC CTGCCAAGGG TTTTTTATAT 6600
GAGCCAGCAA CTCAGGCCTC AGGAAGAGCA CCTGCAGAGG CTGAACAGAC CCCTGGGCTT 6660
CCCAGCCAAG CCCCAGGCCT GGTGAAGCAG ATGGCTGGAG GCCAACAACT ACCTGGACTT 6720
GCTGCCAAAG GGACAACACT GAGTGGGCAG TCCTTCAGGC CTCTTGGGAA TGCCCCAGCC 6780
TCCCATCCTG CTGAGGAGAA GAAGTTGGCC ACCACAGAGC AGAGTCCCTG GGTCCTGGGA 6840
AAGGCCTCCC TGGGGCCAGG ACTCTGGCCC ATGGTGGCCG GACAGACACT GATGCCACCG 6900
TGCTGGTCCT CTGGGAGCAC ACAGACATTG GCACAGACTT GCTGGTCTCT TGGACGAGGG 6960
CAAGACCCTA AATCAGAGCA AAATACAGTT CCAGCTCTTA ACCAGGCTCC TTCCTGTCAC 7020
AAGGGTGCAG AGTCAGAACA GAAATAA 7047
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Aim-0211 ENSAMEP00000019234.1 Ailuropoda melanoleuca 91 0.0 4201
WERAM-Fec-0017 ENSFCAP00000001421.3 Felis catus 90 0.0 4184
WERAM-Mup-0139 ENSMPUP00000012178.1 Mustela putorius furo 90 0.0 4168
WERAM-Eqc-0074 ENSECAP00000009434.1 Equus caballus 90 0.0 4146
WERAM-Bot-0197 ENSBTAP00000034104.4 Bos taurus 90 0.0 4140
WERAM-Caf-0170 ENSCAFP00000024244.3 Canis familiaris 90 0.0 4127
WERAM-Ova-0029 ENSOARP00000004108.1 Ovis aries 90 0.0 4115
WERAM-Sus-0103 ENSSSCP00000014933.2 Sus scrofa 90 0.0 4108
WERAM-Pat-0143 ENSPTRP00000044805.4 Pan troglodytes 90 0.0 4103
WERAM-Nol-0141 ENSNLEP00000015392.2 Nomascus leucogenys 90 0.0 4093
WERAM-Hos-0181 ENSP00000395929.2 Homo sapiens 90 0.0 4088
WERAM-Chs-0105 ENSCSAP00000005185.1 Chlorocebus sabaeus 90 0.0 4085
WERAM-Poa-0143 ENSPPYP00000018000.2 Pongo abelii 89 0.0 4083
WERAM-Mam-0074 ENSMMUP00000011851.2 Macaca mulatta 89 0.0 4081
WERAM-Ict-0147 ENSSTOP00000023043.1 Ictidomys tridecemlineatus 89 0.0 4075
WERAM-Dan-0110 ENSDNOP00000013609.3 Dasypus novemcinctus 88 0.0 4058
WERAM-Orc-0022 ENSOCUP00000002408.2 Oryctolagus cuniculus 89 0.0 4057
WERAM-Myl-0080 ENSMLUP00000006392.2 Myotis lucifugus 88 0.0 4054
WERAM-Otg-0116 ENSOGAP00000010251.2 Otolemur garnettii 87 0.0 4005
WERAM-Paa-0116 ENSPANP00000000470.1 Papio anubis 87 0.0 3982
WERAM-Gog-0133 ENSGGOP00000011464.2 Gorilla gorilla 88 0.0 3976
WERAM-Tut-0110 ENSTTRP00000009069.1 Tursiops truncatus 88 0.0 3808
WERAM-Caj-0079 ENSCJAP00000013142.2 Callithrix jacchus 89 0.0 3781
WERAM-Ran-0107 ENSRNOP00000057648.2 Rattus norvegicus 82 0.0 3711
WERAM-Mum-0037 ENSMUSP00000097089.2 Mus musculus 81 0.0 3665
WERAM-Tas-0005 ENSTSYP00000000565.1 Tarsius syrichta 88 0.0 3613
WERAM-Sah-0142 ENSSHAP00000014900.1 Sarcophilus harrisii 70 0.0 3110
WERAM-Mod-0041 ENSMODP00000005995.3 Monodelphis domestica 69 0.0 3072
WERAM-Cap-0095 ENSCPOP00000007137.2 Cavia porcellus 87 0.0 2702
WERAM-Soa-0066 ENSSARP00000006577.1 Sorex araneus 79 0.0 2689
WERAM-Ocp-0021 ENSOPRP00000001916.1 Ochotona princeps 82 0.0 2299
WERAM-Anp-0015 ENSAPLP00000002635.1 Anas platyrhynchos 58 0.0 2277
WERAM-Meg-0043 ENSMGAP00000004050.2 Meleagris gallopavo 58 0.0 2257
WERAM-Gaga-0030 ENSGALP00000004690.4 Gallus gallus 57 0.0 2236
WERAM-Fia-0106 ENSFALP00000009080.1 Ficedula albicollis 56 0.0 2194
WERAM-Ptv-0030 ENSPVAP00000003619.1 Pteropus vampyrus 87 0.0 2100
WERAM-Pes-0121 ENSPSIP00000014467.1 Pelodiscus sinensis 59 0.0 2026
WERAM-Anc-0055 ENSACAP00000005495.2 Anolis carolinensis 51 0.0 1801
WERAM-Prc-0109 ENSPCAP00000009738.1 Procavia capensis 84 0.0 1737
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 75 0.0 1643
WERAM-Lac-0101 ENSLACP00000012395.1 Latimeria chalumnae 55 0.0 1498
WERAM-Chh-0111 ENSCHOP00000012034.1 Choloepus hoffmanni 77 0.0 1426
WERAM-Dio-0082 ENSDORP00000008270.1 Dipodomys ordii 81 0.0 1340
WERAM-Mim-0043 ENSMICP00000003979.1 Microcebus murinus 84 0.0 1336
WERAM-Vip-0024 ENSVPAP00000002299.1 Vicugna pacos 86 0.0 1300
WERAM-Tag-0006 ENSTGUP00000000444.1 Taeniopygia guttata 90 0.0 1276
WERAM-Dar-0135 ENSDARP00000106822.1 Danio rerio 63 0.0 1251
WERAM-Pof-0001 ENSPFOP00000000016.1 Poecilia formosa 62 0.0 1216
WERAM-Tar-0055 ENSTRUP00000012244.1 Takifugu rubripes 78 0.0 1200
WERAM-Ect-0087 ENSETEP00000010123.1 Echinops telfairi 87 0.0 1158
WERAM-Xim-0228 ENSXMAP00000018920.1 Xiphophorus maculatus 77 0.0 1136
WERAM-Mae-0019 ENSMEUP00000001571.1 Macropus eugenii 64 0.0 1074
WERAM-Gaa-0222 ENSGACP00000027499.1 Gasterosteus aculeatus 78 0.0 1073
WERAM-Orla-0054 ENSORLP00000006946.1 Oryzias latipes 77 0.0 1048
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 67 0.0 993
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 64 0.0 969
WERAM-Tub-0107 ENSTBEP00000012130.1 Tupaia belangeri 65 0.0 960
WERAM-Gam-0141 ENSGMOP00000014242.1 Gadus morhua 71 0.0 956
WERAM-Asm-0131 ENSAMXP00000012840.1 Astyanax mexicanus 63 0.0 951
WERAM-Orn-0148 ENSONIP00000015592.1 Oreochromis niloticus 63 0.0 937
WERAM-Ten-0149 ENSTNIP00000015371.1 Tetraodon nigroviridis 63 0.0 930
WERAM-Ere-0121 ENSEEUP00000012685.1 Erinaceus europaeus 82 0.0 919
WERAM-Pem-0013 ENSPMAP00000002102.1 Petromyzon marinus 57 0.0 913
WERAM-Cii-0053 ENSCINP00000025830.2 Ciona intestinalis 57 0.0 649
WERAM-Drm-0072 FBpp0084636 Drosophila melanogaster 37 5e-121 434
WERAM-Cis-0083 ENSCSAVP00000018342.1 Ciona savignyi 39 5e-56 218
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 41 4e-50 199
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 41 1e-49 197
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 41 1e-49 197
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 42 6e-49 195
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 38 6e-49 195
WERAM-Pot-0069 POPTR_0005s21720.1 Populus trichocarpa 41 1e-48 194
Created Date 25-Jun-2016
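
Note on reusing the sequence blocks: the Protein Sequence and Nucleotide Sequence sections are laid out in groups of 10 characters with a running count at the end of each line, and the Start/End columns in the Classification table appear to be 1-based and inclusive. The sketch below is one possible way to turn such a block into a contiguous string and cut a domain out of it. The function names (parse_numbered_blocks, slice_1based, expected_cds_length) are hypothetical helpers written for this illustration; they are not part of WERAM or Ensembl.

```python
def parse_numbered_blocks(text: str) -> str:
    """Join a sequence printed in space-separated groups, one line at a time.

    Assumption: the last token on a line is a running residue count
    whenever it consists only of digits, so it is dropped.
    """
    parts = []
    for line in text.splitlines():
        tokens = line.split()
        if tokens and tokens[-1].isdigit():
            tokens = tokens[:-1]  # drop the running-count column
        parts.extend(tokens)
    return "".join(parts)


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] for 1-based, inclusive coordinates."""
    return seq[start - 1:end]


def expected_cds_length(n_residues: int, with_stop: bool = True) -> int:
    """Nucleotides needed to encode n_residues, optionally plus a stop codon."""
    return 3 * (n_residues + (1 if with_stop else 0))


# First line of the Protein Sequence block, copied from this record.
protein_head = (
    "VSNRRPYRQY YVEAFGDPSE RAWVAGKAIV MFEGRHQFEE LPVLRRRGKQ KEKGYRHKVP 60"
)

head = parse_numbered_blocks(protein_head)
print(len(head))                   # 60
print(slice_1based(head, 4, 33))   # RRPYRQYYVEAFGDPSERAWVAGKAIVMFE
print(expected_cds_length(2348))   # 7047
```

The slice for positions 4 to 33 reproduces the first PWWP hit shown in the Domain Profile, which is what suggests the coordinates are 1-based and inclusive. The same helpers can be applied to the full blocks; for a 2348-residue protein whose coding sequence ends in a stop codon, the nucleotide string is expected to be 3 × (2348 + 1) characters long.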