WERAM Information


Tag Content
WERAM ID WERAM-Gog-0133
Ensembl Protein ID ENSGGOP00000011464.2
Gene Name NSD1
Ensembl Information
Ensembl Gene ID       Ensembl Transcript ID  Ensembl Protein ID
ENSGGOG00000011744.2  ENSGGOT00000011797.2   ENSGGOP00000011464.2
Status Unreviewed
Classification
Type       Family  E-value   Score  Start  End
HMT        SET2    1.80e-52  176.9  1632   1748
HMT        SET1    2.30e-29  102.3  1632   1748
Me_Reader  PWWP    2.60e-26  92     14     1506
Organism Gorilla gorilla
Domain Profile
  HMT SET2

              SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88  
+ve+++t ++G+Glr+k++ikk+ef++eYvGe+ide+e+++R++ ++e+++++fY+l+ldkd++iDa kGn+aRf+nh+C+Pncet
ENSGGOP00000011464.2 1632 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITNFYMLTLDKDRIIDAGPKGNYARFMNHCCQPNCET 1718
699************************************************************************************ PP
SET2.txt 89 qkwtvegelrvglfakkkikkgeeltfdYn 118
qkw+v+g++rvglfa ++ik+g+eltf+Yn
ENSGGOP00000011464.2 1719 QKWSVNGDTRVGLFALSDIKAGTELTFNYN 1748
*****************************8 PP

  HMT SET1

              SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNc 87  
e+e+ + +g+gl++k +i+k+e+v EYvGe+i +e+ r ++ +++ i+ y+ ld+d ++da kgn+arf+nh+c+pNc
ENSGGOP00000011464.2 1632 EVEIFRTLQRGWGLRTKTDIKKGEFVNEYVGELIDEEECRARIRYAQEHDITnFYMLTLDKD--RIIDAGPKGNYARFMNHCCQPNC 1716
6777788889************************99999888887777778789******99..*********************** PP
SET1.txt 88 eakvvavdgekkiviyakraIekgeeltydYk 119
e++ +v+g+++++++a +I++g+elt++Y+
ENSGGOP00000011464.2 1717 ETQKWSVNGDTRVGLFALSDIKAGTELTFNYN 1748
*******************************7 PP

  Me_Reader PWWP

              PWWP.txt  1 agdLVwaKlkgYpwWPalvisppleakkl...ktqeaeenkylVlFFgnkherawvkrkklvpys 62
+gdL+waK k+ pwWP++++s+pl ++ + ++ ++y V+ Fg+ erawv k +v ++
ENSGGOP00000011464.2 14 VGDLIWAKFKRRPWWPCRICSDPLINTHSkmkVSNRRPYRQYYVEAFGDPSERAWVAGKAIVMFE 78
69********************98766665466888999******************99998775 PP
PWWP.txt 13 pwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
+wWPa++++p+ ++++ ++ ++ ++++VlFFg ++++ w ++ +++py+e
ENSGGOP00000011464.2 1457 RWWPAEICHPRAVPSNIDKMRHDVGEFPVLFFG-SNDYLWTHQARVFPYME 1506
6********************************.***************87 PP

Protein Sequence
CQPKKKSTPL KYEVGDLIWA KFKRRPWWPC RICSDPLINT HSKMKVSNRR PYRQYYVEAF 60
GDPSERAWVA GKAIVMFEGR HQFEELPVLR RRGKQKEKGY RHKVPQKILS KWEASVGLAE 120
QYDVPKGSKN RKCIPGSIKL DSEEDMPFED CTNDPESEHD LLLNGCLKSL AFDSEHSADE 180
KEKPCAKSRA RKSSDNPKRT SVKKGHIQFE AHKDERRGKI PENLGLNFIS GDISDTQASN 240
ELSRIANSLT GSNTAPGSFL FSSCGKNTAK KEFETSNGDS LLGLPEGALI SKCSREKNKP 300
QRSLVCGSKV KLCYIGAGDE EKRSDSISIC TTSDDGSSDL DPIEHSSESD NSVLEIPDAF 360
DRTENMLSMQ KNEKIKYSRF AATNTRVKAK QKPLISNSHT DHLMGCTKSA EPGTETSQVN 420
LSDLKASTLV HKPQSDFTND ALSPKFNMSS SISSENSLIK GGAANQALLH SKSKQPKFRS 480
IKCKHKENPV MVEPPVINEE CSLKCCSSDT KGSPLASISK SGKVDGLKLL NNMHEKTRDS 540
SDIETAVVKH VLSELKELSY RSLGEDVSDS GTSKPSKPLL FSSASSQNHI PIEPDYKFST 600
LLMMLKDMHD SKTKEQRLMT AQNLVSYRSP GRGDCSTNSP VGVSKVLVSG GSTHNSEKKG 660
DGTQNSANPS PSGGDSALSG ELSASLPGLV SDKRDLPASG KSRSDCVTRR NCGRSKPSSK 720
LRDAFSAQMV KNTVNRKALK TERKRKLNQL ASVTLDAVLQ GDREHGGSLR GGAEVPSKED 780
PLQIMGHLTS EDGDHFSDVH FDNKVKQSDP GKISEKGLSF ENRKGPELDS VMNSENDELN 840
GVNQVVPKKR WQRLNQRRTK PRKRMNRFKE KENSECAFRV LLPSDPVQEG RDEFPEHRTP 900
PSASILEEPL TEQNHADCLD SVGPRLNVCD KSSASIGDME KEPGIPSLTP QAELPEPAVR 960
SEKKRLRKPS KWLLEYTEEY DQIFAPKKKQ KKVQEQVHKV SSRCEEESLL ARGRSSAQNK 1020
QVDENSLIST KEEPPVLERE APFLEGPLAQ SELGGGHAEL PQLTLSVPVA PEVSPRPALE 1080
SEELLVKTPG NYESKRQRKP TKKLLESNDL DPGFMPKKGD LGLSKKCYEA GHLENGITES 1140
CATSYSKDFG GGTTKIFDKP RKRKRQRHAA AKMQCKKVKN DDSSKEIPGS EGELMPHRTA 1200
TSPKETVEEG VEHDPGMPAS KKIQGERGGG AALKENVCQN CEKLGELLLC EAQCCGAFHL 1260
ECLGLTEMPR GKFICNECRT GIHTCFVCKQ SGEDVKRCLL PLCGKFYHEE CVQKYPPTVM 1320
QNKGFRCSLH ICITCHAANP ANVSASKGRL MRCVRCPVAY HANDFCLAAG SKILASNSII 1380
CPNHFTPRRG CRNHEHVNVS WCFVCSEGKT WLCCPIGFSL GCLNFCDPTA CCWHCCKQTS 1440
FTPHFRHFFF FKIFFPRWWP AEICHPRAVP SNIDKMRHDV GEFPVLFFGS NDYLWTHQAR 1500
VFPYMEGDVS SKDKMGKGVD GTYKKALQEA AARFEELKAQ KELRQLQEDR KNDKKPPPYK 1560
HIKVNRPIGR VQIFTADLSE IPRCNCKATD ENPCGIDSEC INRMLLYECH PTVCPAGGRC 1620
QNQCFSKRQY PEVEIFRTLQ RGWGLRTKTD IKKGEFVNEY VGELIDEEEC RARIRYAQEH 1680
DITNFYMLTL DKDRIIDAGP KGNYARFMNH CCQPNCETQK WSVNGDTRVG LFALSDIKAG 1740
TELTFNYNLE CLGNGKTVCK CGAPNCSGFL GVRPKNQPIA TEEKAKKFKK KQQGKRRTQG 1800
EITKEREDEC FSCGDAGQLV SCKKPGCPKV YHADCLNLTK RPAGKWECPW HQCDICGKEA 1860
ASFCEMCPSS FCKQHREGML FISKLDGRLS CTEHDPCGPN PLEPGEIREY VPPPVPLPPG 1920
PSTHLAEQST GMAAQAPKMS DKPPADTNQT LSLSKKALAG TCQRPLLPER PLERTDSRPQ 1980
PLDKVRDLAG SGTKSQSLVS SQRPLDRPPA VAGPRPQLSD KPSPVTSPSS SPSVRSQPLE 2040
RPLGTADPRL DKSIGAASPR PQSLEKTPVP TGLRLPPPDR LLITSSPKPQ TSDRPTDKPH 2100
ASLSQRLPPP EKVLSAVVQT LVAKEKALRP VDQNTQSKNR AALVMDLIDL TPRQKERAAS 2160
PHEVTPQADE KMPVLESSSW PASKGLGHMP RAVEKGCVSD PLQTSGKAAA PSEDPWQAVK 2220
SLTQARLLSQ PPAKAFLYEP TTQASGRASA GAEQTPGPLS QSSGLVKQAK QMVGGQQLPA 2280
LAAKSGQSFR SLGKAPASLP TEEKKLVTTE QSPWALGKAS SRAGLWPIVA GQTLAQSCWS 2340
AGSTQTLAQT CWSLGRGQDP KPEQNTLPAL NQAPSSHKCA ESEQK 2385
Nucleotide Sequence
TGTCAACCTA AGAAAAAGTC TACGCCACTG AAGTATGAAG TTGGAGATCT CATCTGGGCA 60
AAATTCAAGA GACGCCCATG GTGGCCCTGC AGGATTTGTT CTGATCCGTT GATTAACACA 120
CATTCAAAAA TGAAAGTTTC CAACCGGAGG CCCTATCGGC AGTACTACGT GGAGGCTTTT 180
GGAGATCCTT CTGAGAGAGC CTGGGTGGCT GGAAAAGCAA TCGTCATGTT TGAAGGCAGA 240
CATCAATTCG AAGAGCTACC TGTCCTTAGG AGAAGAGGGA AACAGAAAGA AAAAGGATAT 300
AGGCATAAGG TTCCTCAGAA AATTTTGAGT AAATGGGAAG CCAGTGTTGG ACTTGCAGAA 360
CAGTATGATG TTCCCAAGGG GTCAAAGAAC CGAAAATGTA TTCCTGGTTC AATCAAGTTG 420
GACAGTGAAG AAGATATGCC ATTTGAAGAC TGCACAAATG ATCCTGAGTC AGAACATGAC 480
CTGTTGCTTA ATGGCTGTTT GAAATCACTG GCTTTTGATT CTGAACATTC TGCAGATGAG 540
AAGGAAAAGC CTTGTGCTAA ATCTCGAGCC AGAAAGAGCT CTGATAATCC AAAAAGGACT 600
AGTGTGAAAA AGGGCCACAT ACAATTTGAA GCACATAAAG ATGAACGGAG GGGAAAGATT 660
CCAGAGAACC TTGGCCTAAA CTTTATCTCT GGGGATATAT CTGATACGCA GGCCTCTAAT 720
GAACTTTCCA GGATAGCAAA TAGCCTCACA GGGTCCAACA CTGCCCCAGG AAGTTTTCTG 780
TTTTCTTCCT GTGGAAAAAA CACTGCAAAG AAAGAATTTG AGACTTCAAA TGGTGACTCT 840
TTATTGGGCT TGCCTGAGGG TGCTTTGATC TCAAAGTGTT CTCGAGAGAA GAATAAACCC 900
CAACGAAGCC TGGTGTGTGG TTCAAAAGTG AAGCTCTGCT ATATTGGAGC AGGTGATGAG 960
GAAAAGCGAA GTGATTCCAT TAGTATCTGT ACCACTTCTG ATGATGGAAG CAGTGACCTG 1020
GATCCCATAG AACACAGCTC AGAGTCTGAT AACAGTGTCC TTGAAATTCC AGATGCTTTC 1080
GATAGAACAG AGAACATGTT ATCTATGCAG AAAAATGAAA AGATAAAGTA TTCTAGGTTT 1140
GCTGCCACAA ACACTAGGGT AAAAGCAAAA CAGAAGCCTC TCATTAGTAA CTCACATACA 1200
GACCACTTAA TGGGTTGTAC TAAGAGTGCA GAGCCTGGAA CTGAGACGTC TCAGGTTAAT 1260
CTCTCTGATC TGAAGGCATC TACTCTTGTT CACAAACCCC AATCAGATTT TACAAATGAT 1320
GCTCTCTCTC CAAAATTCAA CATGTCATCA AGCATATCCA GTGAGAACTC GTTAATAAAG 1380
GGTGGGGCAG CAAATCAAGC TCTATTACAT TCGAAAAGCA AACAGCCCAA GTTCCGAAGT 1440
ATAAAGTGCA AACACAAAGA AAATCCAGTT ATGGTAGAAC CCCCAGTTAT AAATGAGGAG 1500
TGCAGTTTGA AATGCTGCTC TTCTGATACC AAAGGCTCTC CTTTGGCCAG CATTTCTAAA 1560
AGTGGGAAAG TGGATGGTCT AAAACTACTG AACAATATGC ATGAGAAAAC CAGGGATTCA 1620
AGTGACATAG AAACAGCAGT GGTGAAACAT GTTTTATCCG AGTTGAAGGA ACTCTCTTAC 1680
AGATCCTTAG GTGAGGATGT CAGTGACTCT GGAACATCAA AGCCATCAAA ACCATTACTT 1740
TTCTCTTCTG CTTCTAGTCA GAATCACATA CCTATTGAAC CAGACTACAA ATTCAGTACA 1800
TTGCTAATGA TGTTGAAAGA TATGCATGAT AGTAAGACGA AGGAGCAGCG GTTGATGACT 1860
GCTCAAAACC TGGTCTCTTA CCGGAGTCCT GGTCGTGGGG ACTGTTCTAC TAATAGTCCT 1920
GTAGGAGTCT CTAAGGTTTT GGTTTCAGGA GGCTCCACAC ACAATTCAGA GAAAAAGGGA 1980
GATGGCACTC AGAACTCTGC CAATCCTAGC CCTAGTGGGG GTGACTCTGC ATTATCTGGG 2040
GAGTTGTCTG CTTCCCTACC TGGCTTAGTG TCCGACAAGA GAGACCTCCC TGCTTCTGGT 2100
AAAAGTCGTT CAGACTGTGT TACTAGGCGC AACTGTGGAC GATCAAAGCC TTCATCCAAA 2160
TTGCGAGATG CTTTTTCAGC TCAAATGGTA AAGAACACAG TGAACCGTAA AGCCTTAAAG 2220
ACCGAGCGCA AAAGAAAACT GAATCAGCTT GCAAGTGTGA CTCTTGATGC TGTACTGCAG 2280
GGAGACCGAG AACATGGAGG TTCATTGAGA GGTGGGGCAG AAGTTCCTAG TAAAGAGGAT 2340
CCCCTTCAGA TAATGGGCCA CTTAACAAGT GAAGATGGTG ACCATTTTTC TGATGTGCAT 2400
TTCGATAACA AGGTTAAGCA ATCTGATCCT GGTAAAATTT CTGAAAAAGG ACTCTCTTTT 2460
GAAAACAGAA AAGGCCCAGA GCTGGACTCT GTAATGAACA GTGAGAATGA TGAACTCAAT 2520
GGTGTAAATC AAGTGGTGCC TAAAAAGCGG TGGCAGCGTT TAAACCAAAG GCGCACTAAA 2580
CCTCGTAAGC GCATGAACAG ATTTAAAGAG AAAGAAAACT CTGAGTGTGC CTTTAGGGTC 2640
TTACTTCCTA GTGACCCTGT GCAGGAGGGG CGGGATGAGT TTCCAGAGCA TAGAACTCCT 2700
CCTTCAGCAA GCATACTTGA GGAACCACTG ACAGAGCAAA ATCATGCTGA CTGCTTAGAT 2760
TCAGTTGGGC CACGGTTAAA TGTTTGTGAT AAATCCAGTG CCAGCATTGG TGACATGGAA 2820
AAGGAGCCAG GAATTCCCAG TTTGACACCA CAGGCTGAGC TCCCTGAACC AGCTGTGCGG 2880
TCAGAGAAGA AACGCCTTAG GAAGCCAAGC AAGTGGCTTT TGGAATATAC AGAAGAATAT 2940
GATCAGATAT TTGCTCCTAA GAAAAAACAA AAGAAGGTAC AGGAGCAGGT GCACAAGGTA 3000
AGTTCCCGCT GTGAAGAGGA AAGCCTTCTA GCCCGAGGTC GATCTAGTGC TCAGAACAAG 3060
CAGGTGGACG AGAATTCTTT GATTTCAACC AAAGAAGAGC CTCCAGTTCT TGAAAGGGAG 3120
GCTCCGTTTT TGGAGGGCCC CTTGGCTCAG TCAGAACTTG GAGGTGGACA TGCTGAGTTG 3180
CCGCAGCTGA CCTTGTCTGT GCCTGTGGCT CCGGAAGTCT CTCCACGGCC TGCCCTTGAG 3240
TCTGAGGAAT TGCTAGTTAA AACGCCAGGA AATTATGAAA GTAAACGTCA AAGAAAACCA 3300
ACTAAGAAAC TTCTTGAATC CAATGATTTA GACCCTGGAT TTATGCCCAA GAAGGGGGAC 3360
CTTGGCCTTT CTAAAAAGTG CTATGAAGCT GGTCACCTGG AGAATGGCAT AACTGAATCT 3420
TGTGCCACAT CTTATTCAAA AGATTTTGGT GGAGGCACTA CCAAGATATT TGACAAACCA 3480
AGGAAGCGAA AACGACAGAG GCATGCTGCA GCCAAGATGC AGTGTAAAAA AGTGAAAAAT 3540
GATGACTCGT CAAAAGAGAT TCCAGGCTCA GAGGGAGAAC TAATGCCTCA CAGGACGGCC 3600
ACAAGCCCCA AGGAGACTGT TGAGGAAGGT GTAGAACACG ATCCCGGGAT GCCTGCCTCT 3660
AAAAAAATAC AGGGTGAACG CGGTGGAGGA GCTGCACTCA AGGAGAATGT CTGTCAGAAT 3720
TGTGAAAAAT TGGGTGAGCT GCTGTTATGT GAGGCTCAGT GCTGTGGGGC TTTCCACCTG 3780
GAGTGCCTTG GATTGACTGA GATGCCAAGA GGAAAATTTA TCTGCAATGA ATGTCGCACA 3840
GGAATCCATA CCTGTTTTGT ATGTAAGCAG AGTGGGGAAG ATGTTAAAAG GTGCCTTCTA 3900
CCCTTGTGTG GAAAGTTTTA CCATGAAGAG TGTGTCCAGA AGTACCCACC CACTGTTATG 3960
CAGAACAAGG GCTTCCGGTG CTCCCTCCAC ATCTGTATAA CCTGTCATGC TGCTAATCCA 4020
GCCAATGTTT CTGCATCTAA AGGTCGGTTG ATGCGCTGTG TCCGCTGTCC TGTGGCATAC 4080
CACGCCAATG ACTTTTGCCT GGCTGCTGGG TCAAAGATCC TTGCATCTAA TAGTATCATC 4140
TGCCCTAATC ACTTTACCCC TAGGCGGGGC TGCCGAAATC ATGAGCATGT TAATGTTAGC 4200
TGGTGCTTTG TGTGCTCAGA AGGTAAAACT TGGCTTTGTT GCCCAATAGG TTTTTCTCTT 4260
GGTTGTCTGA ACTTTTGTGA TCCCACAGCA TGCTGTTGGC ATTGTTGTAA ACAAACAAGC 4320
TTTACTCCCC ACTTCAGGCA CTTTTTCTTT TTTAAAATAT TCTTTCCCAG GTGGTGGCCA 4380
GCTGAGATCT GCCATCCTCG AGCTGTTCCT TCCAACATTG ATAAGATGAG ACATGATGTG 4440
GGAGAGTTCC CAGTCCTCTT TTTTGGATCT AATGACTATT TGTGGACTCA CCAGGCCCGA 4500
GTCTTCCCTT ACATGGAGGG TGACGTGAGC AGCAAGGATA AGATGGGCAA AGGAGTGGAT 4560
GGGACATATA AAAAAGCTCT TCAGGAAGCT GCAGCAAGGT TTGAGGAATT AAAGGCCCAA 4620
AAAGAGCTAA GACAGCTGCA GGAAGACCGA AAGAATGACA AGAAGCCACC ACCTTATAAA 4680
CATATAAAGG TAAACCGTCC TATTGGCAGG GTACAGATCT TCACTGCAGA CTTATCTGAA 4740
ATACCCCGTT GCAACTGTAA AGCTACTGAT GAGAACCCCT GTGGGATAGA CTCTGAATGC 4800
ATCAACCGCA TGCTGCTCTA TGAGTGCCAC CCCACAGTGT GTCCTGCCGG AGGGCGCTGT 4860
CAAAACCAGT GCTTTTCCAA GCGCCAATAT CCAGAGGTTG AAATTTTCCG CACGTTACAG 4920
CGGGGTTGGG GTCTACGGAC AAAAACAGAT ATTAAAAAGG GTGAATTTGT GAATGAGTAT 4980
GTGGGTGAGC TTATAGATGA AGAAGAATGC AGAGCTCGAA TTCGCTATGC TCAAGAACAT 5040
GATATCACTA ATTTCTATAT GCTCACCCTA GACAAAGACC GAATCATTGA TGCTGGTCCC 5100
AAAGGAAACT ATGCTCGGTT CATGAATCAT TGCTGCCAGC CCAACTGTGA AACACAGAAG 5160
TGGTCTGTGA ACGGAGATAC CCGTGTAGGC CTTTTTGCAC TAAGTGACAT TAAAGCAGGC 5220
ACTGAACTTA CCTTCAACTA CAACCTAGAA TGTCTTGGGA ATGGAAAGAC TGTTTGCAAA 5280
TGTGGAGCCC CGAACTGCAG TGGCTTCTTG GGTGTAAGGC CAAAGAACCA ACCCATTGCC 5340
ACGGAAGAAA AGGCAAAGAA ATTCAAGAAG AAGCAACAGG GAAAGCGCAG GACCCAGGGT 5400
GAAATCACAA AGGAGCGAGA AGATGAGTGT TTTAGTTGTG GGGATGCTGG CCAGCTCGTC 5460
TCCTGCAAGA AACCAGGCTG CCCAAAAGTT TACCATGCAG ACTGTCTCAA TCTGACCAAG 5520
CGACCAGCAG GGAAATGGGA ATGTCCGTGG CATCAGTGTG ACATCTGCGG GAAGGAAGCA 5580
GCCTCCTTCT GTGAGATGTG CCCCAGCTCC TTTTGTAAGC AGCATCGAGA AGGGATGCTT 5640
TTCATTTCCA AACTGGATGG GCGTCTGTCT TGTACTGAGC ATGACCCCTG TGGGCCCAAT 5700
CCTCTGGAAC CTGGGGAGAT CCGTGAGTAT GTGCCTCCCC CAGTACCGCT GCCTCCAGGG 5760
CCAAGCACTC ACCTGGCAGA GCAATCAACA GGAATGGCTG CTCAGGCACC CAAAATGTCA 5820
GATAAACCTC CTGCTGACAC CAACCAGACG CTGTCGCTCT CCAAAAAAGC TCTGGCAGGG 5880
ACTTGTCAGA GGCCACTGCT ACCTGAAAGA CCTCTTGAGA GAACTGACTC CAGGCCGCAG 5940
CCTTTAGATA AGGTCAGAGA CCTCGCTGGG TCAGGGACCA AATCCCAATC CTTGGTTTCC 6000
AGCCAGAGGC CACTGGACAG GCCACCAGCA GTGGCAGGAC CAAGACCCCA GCTAAGCGAC 6060
AAACCCTCTC CAGTGACCAG CCCAAGCTCC TCACCCTCAG TCAGGTCCCA ACCACTGGAA 6120
AGACCTCTGG GGACGGCTGA CCCAAGGCTG GATAAATCCA TAGGTGCTGC CAGCCCAAGG 6180
CCCCAGTCAC TGGAGAAAAC CCCAGTTCCC ACTGGCCTGA GACTTCCGCC GCCAGACAGA 6240
CTGCTCATTA CTAGCAGTCC CAAACCCCAG ACTTCAGACA GGCCTACTGA CAAACCCCAT 6300
GCCTCTTTGT CCCAGAGACT CCCACCTCCT GAGAAAGTAC TATCAGCTGT GGTCCAGACC 6360
CTTGTAGCTA AAGAAAAAGC ACTGAGGCCT GTGGACCAGA ATACTCAGTC AAAAAATAGA 6420
GCTGCTTTGG TGATGGATCT CATAGACCTA ACTCCTCGCC AGAAGGAGCG GGCAGCTTCA 6480
CCTCATGAGG TCACACCACA GGCTGATGAG AAGATGCCAG TGTTGGAGTC AAGTTCATGG 6540
CCTGCCAGCA AAGGTCTGGG GCATATGCCG AGAGCTGTTG AGAAAGGCTG TGTGTCAGAT 6600
CCTCTTCAGA CATCTGGGAA AGCAGCAGCC CCTTCAGAGG ACCCCTGGCA AGCTGTTAAA 6660
TCACTCACCC AGGCCAGACT TCTTTCTCAG CCTCCTGCCA AGGCCTTTTT ATATGAGCCA 6720
ACAACTCAGG CCTCAGGAAG AGCTTCTGCA GGGGCTGAGC AGACCCCAGG GCCTCTTAGC 6780
CAATCCTCGG GCCTGGTGAA GCAGGCGAAG CAGATGGTCG GAGGCCAGCA ACTACCTGCA 6840
CTTGCCGCCA AGAGTGGGCA ATCTTTTAGG TCTCTCGGGA AGGCCCCAGC CTCCCTCCCC 6900
ACTGAAGAAA AGAAGTTGGT AACCACAGAG CAAAGTCCCT GGGCCCTGGG AAAAGCCTCA 6960
TCACGGGCAG GGCTCTGGCC CATAGTGGCT GGACAGACAC TGGCACAGTC TTGCTGGTCT 7020
GCTGGGAGCA CACAGACATT GGCACAGACT TGCTGGTCTC TTGGAAGAGG GCAAGACCCC 7080
AAACCAGAGC AAAATACACT TCCAGCTCTT AACCAGGCTC CTTCCAGCCA CAAGTGTGCA 7140
GAATCAGAAC AGAAGTAGTA CCAATCAATG TCACATGAAC AAACAAGCTG CCCCCAGGGT 7200
ACCATTTGGG GAGGGGAAAT CTTTTCTTTC TTTCCCCCTT AAAAAAAAAA AACACATCTG 7260
CCCCGAACAC TTTCCCACTG TTATTCTTTC CTCATATCCC AACACTCAGA ACTCTTGTGA 7320
CATTAGCCAG TGGGGGCTTA TGGTTGTGTG AACCATGTAT GAAAATCCAG TGGGCCCCAA 7380
CCAAGGAGAC AGACAGACTT GGGTCTCTTT CCCCCAACTT TTCCACATGG TCATCGTGAA 7440
ATAAAAAGTC CACTCTGG 7459
Sequence Source Ensembl
Orthology
WERAM ID         Ensembl Protein ID                Species                     Identity  E-value  Score
WERAM-Pat-0143   ENSPTRP00000044805.4              Pan troglodytes             98        0.0      4520
WERAM-Hos-0181   ENSP00000395929.2                 Homo sapiens                98        0.0      4492
WERAM-Poa-0143   ENSPPYP00000018000.2              Pongo abelii                97        0.0      4461
WERAM-Nol-0141   ENSNLEP00000015392.2              Nomascus leucogenys         97        0.0      4450
WERAM-Mam-0074   ENSMMUP00000011851.2              Macaca mulatta              96        0.0      4425
WERAM-Chs-0105   ENSCSAP00000005185.1              Chlorocebus sabaeus         96        0.0      4417
WERAM-Paa-0116   ENSPANP00000000470.1              Papio anubis                94        0.0      4315
WERAM-Fec-0017   ENSFCAP00000001421.3              Felis catus                 89        0.0      4112
WERAM-Mup-0139   ENSMPUP00000012178.1              Mustela putorius furo       89        0.0      4111
WERAM-Aim-0211   ENSAMEP00000019234.1              Ailuropoda melanoleuca      90        0.0      4108
WERAM-Eqc-0074   ENSECAP00000009434.1              Equus caballus              89        0.0      4075
WERAM-Orc-0022   ENSOCUP00000002408.2              Oryctolagus cuniculus       89        0.0      4062
WERAM-Caf-0170   ENSCAFP00000024244.3              Canis familiaris            89        0.0      4058
WERAM-Bot-0197   ENSBTAP00000034104.4              Bos taurus                  89        0.0      4056
WERAM-Sus-0103   ENSSSCP00000014933.2              Sus scrofa                  89        0.0      4051
WERAM-Ova-0029   ENSOARP00000004108.1              Ovis aries                  89        0.0      4044
WERAM-Ict-0147   ENSSTOP00000023043.1              Ictidomys tridecemlineatus  88        0.0      4035
WERAM-Caj-0079   ENSCJAP00000013142.2              Callithrix jacchus          94        0.0      3999
WERAM-Myl-0080   ENSMLUP00000006392.2              Myotis lucifugus            87        0.0      3962
WERAM-Loa-0062   ENSLAFP00000004572.4              Loxodonta africana          88        0.0      3927
WERAM-Otg-0116   ENSOGAP00000010251.2              Otolemur garnettii          88        0.0      3920
WERAM-Dan-0110   ENSDNOP00000013609.3              Dasypus novemcinctus        85        0.0      3897
WERAM-Tut-0110   ENSTTRP00000009069.1              Tursiops truncatus          87        0.0      3731
WERAM-Ran-0107   ENSRNOP00000057648.2              Rattus norvegicus           82        0.0      3687
WERAM-Mum-0037   ENSMUSP00000097089.2              Mus musculus                81        0.0      3665
WERAM-Tas-0005   ENSTSYP00000000565.1              Tarsius syrichta            89        0.0      3620
WERAM-Sah-0142   ENSSHAP00000014900.1              Sarcophilus harrisii        68        0.0      2983
WERAM-Mod-0041   ENSMODP00000005995.3              Monodelphis domestica       67        0.0      2919
WERAM-Cap-0095   ENSCPOP00000007137.2              Cavia porcellus             87        0.0      2601
WERAM-Soa-0066   ENSSARP00000006577.1              Sorex araneus               76        0.0      2484
WERAM-Ocp-0021   ENSOPRP00000001916.1              Ochotona princeps           83        0.0      2207
WERAM-Anp-0015   ENSAPLP00000002635.1              Anas platyrhynchos          56        0.0      2180
WERAM-Meg-0043   ENSMGAP00000004050.2              Meleagris gallopavo         55        0.0      2161
WERAM-Gaga-0030  ENSGALP00000004690.4              Gallus gallus               56        0.0      2153
WERAM-Ptv-0030   ENSPVAP00000003619.1              Pteropus vampyrus           87        0.0      2077
WERAM-Fia-0106   ENSFALP00000009080.1              Ficedula albicollis         55        0.0      2019
WERAM-Pes-0121   ENSPSIP00000014467.1              Pelodiscus sinensis         59        0.0      2000
WERAM-Anc-0055   ENSACAP00000005495.2              Anolis carolinensis         49        0.0      1672
WERAM-Ora-0074   ENSOANP00000011168.1              Ornithorhynchus anatinus    70        0.0      1506
WERAM-Mim-0043   ENSMICP00000003979.1              Microcebus murinus          88        0.0      1424
WERAM-Lac-0101   ENSLACP00000012395.1              Latimeria chalumnae         55        0.0      1377
WERAM-Prc-0109   ENSPCAP00000009738.1              Procavia capensis           76        0.0      1318
WERAM-Vip-0024   ENSVPAP00000002299.1              Vicugna pacos               88        0.0      1316
WERAM-Dio-0082   ENSDORP00000008270.1              Dipodomys ordii             78        0.0      1239
WERAM-Chh-0111   ENSCHOP00000012034.1              Choloepus hoffmanni         73        0.0      1239
WERAM-Tag-0006   ENSTGUP00000000444.1              Taeniopygia guttata         85        0.0      1170
WERAM-Dar-0135   ENSDARP00000106822.1              Danio rerio                 60        0.0      1129
WERAM-Pof-0001   ENSPFOP00000000016.1              Poecilia formosa            73        0.0      1100
WERAM-Tar-0055   ENSTRUP00000012244.1              Takifugu rubripes           72        0.0      1099
WERAM-Ect-0087   ENSETEP00000010123.1              Echinops telfairi           83        0.0      1053
WERAM-Xim-0228   ENSXMAP00000018920.1              Xiphophorus maculatus       73        0.0      1033
WERAM-Gaa-0222   ENSGACP00000027499.1              Gasterosteus aculeatus      74        0.0      973
WERAM-Mae-0019   ENSMEUP00000001571.1              Macropus eugenii            61        0.0      950
WERAM-Orla-0054  ENSORLP00000006946.1              Oryzias latipes             73        0.0      950
WERAM-Leo-0066   ENSLOCP00000008456.1              Lepisosteus oculatus        62        0.0      900
WERAM-Xet-0165   ENSXETP00000005100.3              Xenopus tropicalis          61        0.0      879
WERAM-Tub-0107   ENSTBEP00000012130.1              Tupaia belangeri            62        0.0      870
WERAM-Asm-0131   ENSAMXP00000012840.1              Astyanax mexicanus          60        0.0      858
WERAM-Gam-0141   ENSGMOP00000014242.1              Gadus morhua                67        0.0      852
WERAM-Orn-0148   ENSONIP00000015592.1              Oreochromis niloticus       58        0.0      845
WERAM-Ten-0149   ENSTNIP00000015371.1              Tetraodon nigroviridis      57        0.0      838
WERAM-Pem-0013   ENSPMAP00000002102.1              Petromyzon marinus          55        0.0      823
WERAM-Ere-0029   ENSEEUP00000002285.1              Erinaceus europaeus         54        0.0      786
WERAM-Cii-0053   ENSCINP00000025830.2              Ciona intestinalis          53        7e-163   573
WERAM-Drm-0072   FBpp0084636                       Drosophila melanogaster     35        9e-104   377
WERAM-Art-0032   AT1G77300.1                       Arabidopsis thaliana        41        4e-50    199
WERAM-Bro-0120   Bo6g121240.1                      Brassica oleracea           41        1e-49    197
WERAM-Brr-0078   Bra015678.1-P                     Brassica rapa               41        1e-49    197
WERAM-Arl-0052   fgenesh2_kg.2__2024__AT1G77300.1  Arabidopsis lyrata          42        4e-49    195
WERAM-Glm-0058   GLYMA06G12391.1                   Glycine max                 38        5e-49    195
WERAM-Pot-0069   POPTR_0005s21720.1                Populus trichocarpa         41        8e-49    194
Created Date 25-Jun-2016