WERAM Information


Tag Content
WERAM ID WERAM-Myl-0046
Ensembl Protein ID ENSMLUP00000004017.2
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMLUG00000004399.2 ENSMLUT00000004414.2 ENSMLUP00000004017.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT nonSET 1.60e-182 605.7 16 330
Organism Myotis lucifugus
Domain Profile
  HMT nonSET

            nonSET.txt   1 dvivfkwplaiydkkydaaseiielikyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkipr 89 
+++v++wpl++ydk++daa+eiie+i++vCee+Pdlk+a+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+
ENSMLUP00000004017.2 16 EPAVYPWPLPVYDKHHDAAHEIIETIRWVCEEIPDLKLAMENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPS 103
689******************************************************************************.******* PP
nonSET.txt 90 tsllrdilqlvynrsvtdaeklnnyeafspevyGelsfdlvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaa 178
++llr+ilq+vyn+svtd+eklnnye+fspevyGe+sfdlvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+a
ENSMLUP00000004017.2 104 NGLLRHILQQVYNHSVTDPEKLNNYEPFSPEVYGETSFDLVAQMIDEIKMTEDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPA 192
***************************************************************************************** PP
nonSET.txt 179 kyaelmdeelrkrmklyGkrlaeyelekgdflvdenreriaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrins 267
kyae+md+e+rk+mk+yGk++aey+le+gdfl++e+reria+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrins
ENSMLUP00000004017.2 193 KYAETMDREFRKWMKWYGKKHAEYTLERGDFLSEEWRERIANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINS 281
***************************************************************************************** PP
nonSET.txt 268 rnlsdigtilkvveldllkgsvsWtgkpvsyylhtidrtilesyfsslk 316
rnlsdigti++vvel++lkgsvsWtgkpvsyylhtidrtile+yfsslk
ENSMLUP00000004017.2 282 RNLSDIGTIMRVVELSPLKGSVSWTGKPVSYYLHTIDRTILENYFSSLK 330
**********************************************997 PP

Protein Sequence
(Fasta)
MGEKLELRLK SPVGAEPAVY PWPLPVYDKH HDAAHEIIET IRWVCEEIPD LKLAMENYVL 60
IDYDTKSFES MQRLCDKYNR AIDSIHQLWK GTTQPMKLNT RPSNGLLRHI LQQVYNHSVT 120
DPEKLNNYEP FSPEVYGETS FDLVAQMIDE IKMTEDDLFV DLGSGVGQVV LQVAAATNCK 180
HHYGVEKADI PAKYAETMDR EFRKWMKWYG KKHAEYTLER GDFLSEEWRE RIANTSVIFV 240
NNFAFGPEVD HQLKERFANM KEGGRIVSSK PFAPLNFRIN SRNLSDIGTI MRVVELSPLK 300
GSVSWTGKPV SYYLHTIDRT ILENYFSSLK NPKLREEQEA ARRRQQRENK SNTTTPTKVP 360
ESKAAVPADT SVDSGAEEEK AATTAIKKPS PSKARKKKLN KKGRKLAGRK RGRPKKMSTA 420
NPERKPKKNP TALDLLHAQT VSQTATPSPQ DAYKSPHSPF YQLPPSVQRH PPDQLLLAPT 480
PPALQKLLES FKIQYLQFLA YTKTPQYKAS LQQLLDQEKE KNARLLGAAQ QLFGHCQAQK 540
EEIKKLFQQK LDELGVKALT YSDLIQAQKE ISAHNQQLRE QTEQLEKDNR ELRSQSLQLL 600
KARCEELKLD WSTLSLENLL KEKQALKSQI SEKQRHCLEL QISIVELEKS QRQQELLQLK 660
SCVPPDDAVS LHLRGKGGLG RELEAEPSRL HLELDCSKFS LPHFSSMSPE LSMNGHAASY 720
ELCNTLSRPS SKQNTPQYLA SPLDQEVVPC TPSHSGRPRL EKLSGLALPD YTRLSPAKLV 780
LRRHLSQDHT ASGKTPAGEL HPRTEHAKEN GLPYQSPGLA NGIKLSPQDP RPSSPVALQM 840
TGEKGSEKGL KERAYASSGE AITSLPVSIP LSTVQPSKLP VSIPLASVVL PNRAEKVRST 900
PSPVPQTRES SSTLEKQMGA NAHGAGSNAA GSKSLTLGPT GFYAGSVAIS GALAGSPAPL 960
APGVEPPVFD ESSSSGSLFT TMGSRSSTPQ HPQLLVQPRN SGPASPTHQL CVSPRLGAAQ 1020
GPLPDGSKGD LPSEVGFSDP ESEAKRRIIF TISAGASSTK QSPSNKHSPL PGGARGDSSQ 1080
SHGQDSRKRG RRKRASAGTP NLSSGVSPKR RALPSVAGLF TQSSGSPLNL NSMVNNINQP 1140
LEITAISSPE SSLKSSPVPY QDNDQPPVLK KEKPLSQTNG AHYSPLTSDE EPGSEDEQGS 1200
TRIERKIATI SLESKSPPKT LENGGSLVGR KPMPSSEPIN SSKWKSTFSP IADLGLAKAT 1260
DSPLQASSAL SQNSLFAFRP TLEELSSVDA KLATHPRKSF PGALSGAGGL SPISNPPNGF 1320
AFSGSLAADL SLHSFSDGAS LSHKAPEVAS LGAPLSFSAP RGKESSTEPG PFVNKRQLDG 1380
LGGPKGEGVK GREGEVSLPV CTPSDKASSH GKVGKGRDRE PDFKNGHNLF ISAAAVPPGG 1440
LLSGPGLTTA ASSASGTAPS SQTHRPFLGT FAPGPQFALG PMSLQANLGS SVLQSLFTSV 1500
PAAAGLVHVS SAATRLTNSH AMGSFSSGVA GGAVGGIFNH TVPSASTHPF GASFGSGAAC 1560
RSTTLSLTPM QAVASTPASS FQALSSVETR QPPPQPPLPL GRPPAGPPAL HAPPPPPNAA 1620
LPPAPALLPA NSEPVLLQNL APLPANQAFL PASSAASLPP ANASLSIKLA SLPHKVSRPS 1680
FMVHHQPLPG LALAQASPMI PQASSTGPSA VWVSLGMPPP YAARLAGVKP R 1731
Nucleotide Sequence
(Fasta)
ATGGGGGAGA AGCTGGAGCT GAGGCTCAAG TCGCCCGTGG GGGCCGAGCC CGCCGTCTAT 60
CCGTGGCCGC TGCCGGTCTA CGACAAACAC CACGATGCTG CTCATGAAAT CATCGAGACC 120
ATCCGATGGG TCTGTGAGGA AATCCCAGAC CTCAAGCTCG CTATGGAGAA TTATGTCCTA 180
ATCGACTACG ACACCAAAAG CTTCGAAAGC ATGCAGAGGC TCTGTGATAA GTACAACCGG 240
GCCATCGACA GCATCCACCA GCTGTGGAAG GGAACCACGC AGCCCATGAA GCTGAATACG 300
CGGCCATCCA ATGGGCTCCT GCGGCACATC CTGCAGCAGG TTTACAACCA CTCAGTGACG 360
GACCCTGAGA AGCTCAACAA CTACGAGCCC TTCTCCCCAG AGGTATATGG GGAGACCTCC 420
TTTGACCTGG TCGCACAGAT GATTGATGAG ATCAAGATGA CCGAGGACGA CCTGTTTGTG 480
GACCTGGGAA GTGGAGTGGG CCAGGTCGTG CTGCAAGTCG CCGCGGCCAC CAACTGCAAA 540
CATCACTACG GAGTCGAGAA AGCGGACATC CCAGCCAAGT ACGCGGAGAC CATGGACCGA 600
GAGTTCAGGA AGTGGATGAA ATGGTATGGA AAAAAGCATG CAGAATACAC ACTGGAAAGA 660
GGTGATTTCC TCTCGGAGGA GTGGAGAGAG CGGATCGCCA ACACAAGTGT TATATTTGTG 720
AATAACTTTG CCTTTGGTCC TGAGGTGGAT CACCAGCTGA AGGAGCGATT TGCAAACATG 780
AAGGAAGGTG GCAGAATTGT GTCCTCGAAG CCCTTTGCAC CTCTGAACTT CAGAATAAAC 840
AGTAGAAACT TGAGTGACAT CGGCACCATC ATGCGCGTCG TGGAGCTCTC GCCCCTGAAG 900
GGCTCAGTGT CGTGGACGGG GAAGCCCGTC TCCTACTACC TGCACACCAT CGACCGCACC 960
ATACTTGAAA ACTATTTTTC TAGTCTGAAA AATCCAAAAC TCAGGGAGGA ACAAGAGGCA 1020
GCTAGGCGCC GGCAGCAGCG AGAAAACAAG AGTAATACAA CCACCCCCAC GAAGGTGCCT 1080
GAGAGCAAGG CGGCTGTGCC TGCGGACACC TCTGTGGATT CTGGTGCTGA GGAAGAGAAA 1140
GCAGCGACCA CCGCTATCAA GAAGCCATCC CCCTCCAAAG CACGGAAGAA GAAGCTGAAC 1200
AAGAAGGGCC GGAAGCTGGC TGGACGGAAG CGCGGACGTC CCAAGAAGAT GAGCACTGCG 1260
AACCCTGAGC GTAAGCCCAA GAAGAACCCT ACTGCACTGG ACCTCCTGCA CGCCCAGACT 1320
GTGTCCCAGA CGGCGACGCC CTCACCACAA GATGCATACA AGTCACCTCA CAGCCCATTT 1380
TATCAGCTAC CTCCCAGCGT GCAGCGGCAC CCCCCTGACC AGCTGTTGCT GGCACCCACC 1440
CCGCCCGCAC TGCAGAAGCT GCTAGAGTCC TTCAAGATTC AGTACCTGCA GTTCCTGGCA 1500
TACACGAAGA CCCCTCAGTA CAAGGCCAGC CTGCAGCAGC TGCTGGACCA GGAGAAGGAG 1560
AAGAATGCCC GATTGCTGGG TGCCGCACAG CAGCTGTTCG GTCACTGCCA GGCCCAGAAA 1620
GAAGAAATAA AGAAGCTTTT TCAGCAGAAA CTGGATGAGT TGGGAGTGAA GGCGCTGACC 1680
TACAGTGACC TGATCCAAGC TCAGAAAGAG ATCTCGGCTC ACAACCAGCA GCTGCGAGAG 1740
CAGACGGAGC AGCTGGAGAA GGACAACAGG GAGCTGAGAA GCCAGAGCCT GCAGCTGCTG 1800
AAGGCTCGGT GTGAGGAGCT GAAGTTGGAC TGGTCCACAC TGTCCCTGGA GAACCTGCTA 1860
AAGGAGAAGC AGGCCCTGAA GAGCCAGATC TCCGAGAAAC AGAGGCACTG CCTGGAGCTG 1920
CAGATTAGCA TCGTGGAGCT GGAAAAGAGC CAGCGGCAGC AGGAGCTCCT GCAACTCAAG 1980
TCCTGCGTGC CGCCTGATGA TGCTGTGTCC CTGCACCTAC GAGGGAAAGG TGGCCTGGGC 2040
CGAGAACTGG AGGCGGAACC CAGCCGGCTG CACCTTGAAC TGGACTGCTC CAAATTCTCC 2100
CTGCCCCACT TCAGTAGCAT GAGCCCAGAG CTCTCCATGA ATGGCCACGC GGCCAGCTAT 2160
GAGCTCTGCA ACACACTGAG TCGGCCCTCG TCCAAGCAGA ACACCCCCCA GTACCTGGCC 2220
TCTCCACTGG ACCAGGAGGT GGTGCCATGT ACTCCCAGCC ACAGCGGCCG GCCCCGGCTC 2280
GAGAAGCTGT CTGGCTTGGC CTTGCCAGAC TACACCAGGC TCTCCCCGGC CAAGCTGGTG 2340
CTGAGACGCC ACCTGAGCCA GGACCACACT GCCAGTGGCA AGACACCTGC TGGTGAGCTA 2400
CACCCACGAA CTGAGCACGC CAAGGAGAAT GGCCTTCCAT ACCAGAGCCC GGGCCTCGCC 2460
AATGGCATCA AGCTGAGTCC TCAGGACCCA CGGCCCTCAT CCCCTGTGGC CTTACAGATG 2520
ACAGGAGAAA AAGGCAGTGA GAAGGGTCTG AAGGAGCGCG CCTATGCCAG CAGTGGGGAG 2580
GCCATCACCA GCCTACCCGT CAGCATTCCA CTCAGCACTG TGCAGCCCAG CAAGCTGCCC 2640
GTCAGCATCC CCCTGGCCAG CGTGGTGTTG CCCAACCGTG CCGAGAAGGT AAGAAGCACC 2700
CCCAGTCCTG TGCCTCAGAC CCGAGAGTCC TCGTCCACAC TTGAAAAGCA GATGGGTGCT 2760
AATGCCCATG GTGCTGGGAG CAACGCTGCT GGGAGCAAAA GCCTCACCCT GGGGCCCACA 2820
GGCTTCTATG CAGGCTCAGT CGCCATCAGT GGGGCCCTGG CAGGCAGCCC GGCCCCACTC 2880
GCTCCTGGAG TTGAGCCCCC TGTCTTCGAT GAGTCCTCCA GCTCGGGGAG CCTCTTCACC 2940
ACCATGGGCT CCCGCAGCTC CACCCCACAG CATCCTCAAC TGCTAGTGCA ACCCCGGAAC 3000
TCCGGTCCAG CCTCGCCCAC GCACCAGCTC TGCGTCAGCC CCCGACTCGG CGCTGCCCAG 3060
GGCCCACTCC CTGATGGGAG CAAAGGGGAC CTTCCCTCTG AGGTTGGCTT CTCAGATCCA 3120
GAGAGCGAGG CCAAGAGGAG GATCATCTTC ACCATCTCAG CTGGTGCCAG CAGCACCAAG 3180
CAGTCACCTT CCAACAAGCA CAGCCCTCTG CCTGGGGGTG CTCGTGGAGA CAGCAGTCAG 3240
AGCCATGGGC AGGACAGTCG AAAGCGGGGC AGGAGGAAGC GGGCATCAGC GGGGACCCCC 3300
AATCTGAGCT CAGGCGTGTC CCCCAAGCGC CGGGCCTTGC CATCCGTCGC CGGCCTCTTC 3360
ACTCAGTCTT CAGGGTCTCC CCTCAACCTG AACTCTATGG TCAACAACAT TAACCAACCT 3420
TTGGAAATCA CAGCCATCTC GTCCCCTGAG AGCTCCCTGA AGAGCTCCCC TGTACCTTAC 3480
CAGGACAACG ACCAGCCACC CGTGCTCAAG AAAGAGAAGC CCCTGAGCCA GACCAATGGA 3540
GCCCACTACT CCCCACTGAC CTCAGATGAG GAGCCGGGCT CTGAGGACGA GCAAGGCAGC 3600
ACCAGAATTG AGAGAAAAAT TGCAACTATC TCCTTAGAAA GCAAATCTCC TCCGAAAACC 3660
TTGGAAAATG GTGGCAGCCT GGTAGGAAGG AAGCCGATGC CCTCCAGCGA GCCCATCAAC 3720
AGCAGCAAGT GGAAGTCCAC CTTCTCACCC ATCGCTGACC TTGGCCTGGC CAAGGCTACT 3780
GACAGCCCGC TACAGGCCAG CTCCGCTCTG AGCCAGAACT CCCTGTTCGC TTTTCGGCCC 3840
ACCCTAGAGG AGCTCAGCTC AGTTGATGCC AAGCTGGCCA CCCACCCCAG GAAGAGCTTT 3900
CCGGGTGCAC TGTCGGGGGC TGGCGGGCTG AGTCCCATCA GCAACCCCCC CAACGGCTTT 3960
GCCTTCAGCG GGAGCCTGGC TGCCGACCTC AGTTTACACA GTTTCAGTGA TGGTGCTTCT 4020
CTCTCCCACA AGGCCCCTGA GGTGGCCAGC CTGGGCGCCC CCCTGAGCTT TTCCGCCCCG 4080
AGGGGTAAGG AGAGCAGCAC GGAGCCTGGC CCCTTTGTGA ACAAGAGGCA GCTGGATGGA 4140
CTGGGTGGCC CGAAGGGTGA GGGGGTCAAG GGCAGGGAGG GAGAAGTCAG CTTGCCCGTG 4200
TGCACACCCT CAGACAAGGC CTCATCACAT GGCAAGGTGG GCAAGGGCCG GGACCGCGAG 4260
CCTGATTTCA AAAATGGCCA CAATCTCTTC ATTTCTGCCG CTGCCGTGCC TCCTGGGGGC 4320
CTCCTCAGCG GGCCAGGCCT CACCACAGCA GCGTCCTCAG CAAGCGGCAC AGCCCCCTCC 4380
TCCCAGACAC ACCGGCCCTT CCTGGGCACT TTTGCACCGG GCCCACAGTT TGCCTTGGGC 4440
CCCATGTCCC TGCAGGCCAA CTTAGGCTCA TCTGTGCTGC AGTCCTTGTT CACCTCTGTG 4500
CCGGCTGCTG CTGGCCTAGT GCACGTCTCA TCCGCTGCAA CCAGACTGAC CAACTCTCAT 4560
GCCATGGGCA GCTTTTCCTC CGGTGTGGCA GGTGGTGCAG TTGGAGGTAT CTTTAACCAC 4620
ACGGTGCCTT CCGCCTCCAC TCATCCGTTT GGAGCCAGTT TCGGCAGCGG GGCTGCTTGT 4680
CGCAGCACCA CGCTGAGCTT AACCCCGATG CAGGCGGTGG CCAGTACCCC CGCCTCTTCC 4740
TTTCAGGCCC TGTCCTCTGT GGAGACCAGG CAGCCCCCAC CCCAGCCTCC TCTGCCCCTG 4800
GGTCGGCCCC CTGCGGGGCC ACCAGCACTT CACGCGCCCC CCCCTCCTCC TAACGCTGCC 4860
TTGCCTCCTG CTCCTGCACT GCTTCCTGCT AACTCTGAGC CTGTGCTCCT GCAGAATCTT 4920
GCGCCCCTCC CGGCTAACCA AGCTTTCTTA CCTGCCTCCT CTGCCGCCTC TCTGCCTCCT 4980
GCTAACGCCT CTCTATCTAT CAAGCTCGCC TCCCTCCCAC ACAAGGTCTC CCGCCCCTCC 5040
TTCATGGTGC ACCACCAGCC CCTGCCTGGA CTGGCTCTGG CCCAGGCCTC ACCCATGATC 5100
CCACAGGCCA GCTCCACGGG GCCATCCGCC GTGTGGGTTT CCCTTGGCAT GCCGCCTCCT 5160
TATGCCGCGC GCCTTGCGGG GGTTAAGCCG CGATAA 5197
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 87 0.0 2469
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 87 0.0 2418
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 83 0.0 2327
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 82 0.0 2326
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 85 0.0 2202
WERAM-Eqc-0099 ENSECAP00000011661.1 Equus caballus 90 0.0 2199
WERAM-Hos-0067 ENSP00000381657.3 Homo sapiens 85 0.0 2191
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 87 0.0 2176
WERAM-Pat-0084 ENSPTRP00000059247.2 Pan troglodytes 84 0.0 2174
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 83 0.0 2139
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 84 0.0 2120
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 83 0.0 2107
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 77 0.0 2068
WERAM-Sah-0123 ENSSHAP00000012830.1 Sarcophilus harrisii 75 0.0 2037
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 88 0.0 2030
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 73 0.0 2027
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 79 0.0 2016
WERAM-Nol-0022 ENSNLEP00000002551.1 Nomascus leucogenys 81 0.0 1999
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 72 0.0 1995
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 86 0.0 1958
WERAM-Tag-0004 ENSTGUP00000000304.1 Taeniopygia guttata 71 0.0 1947
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 79 0.0 1923
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 75 0.0 1914
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 91 0.0 1892
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 74 0.0 1878
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 87 0.0 1875
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 74 0.0 1709
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 80 0.0 1551
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 79 0.0 1532
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 65 0.0 1372
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 86 0.0 1174
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 58 0.0 1164
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 58 0.0 1118
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 68 0.0 1051
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 66 0.0 1045
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 68 0.0 1044
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 67 0.0 1043
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 66 0.0 1029
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 67 0.0 1026
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 82 0.0 740
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 72 4e-167 587
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 58 4e-102 371
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 4e-17 89.0
Created Date 25-Jun-2016