WERAM Information


Tag Content
WERAM ID WERAM-Eqc-0099
Ensembl Protein ID ENSECAP00000011661.1
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSECAG00000013711.1 ENSECAT00000014573.1 ENSECAP00000011661.1
ENSECAG00000013711.1 ENSECAT00000014621.1 ENSECAP00000011698.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT nonSET 1.30e-159 530.6 1 276
Organism Equus caballus
Domain Profile
  HMT nonSET

            nonSET.txt  40 fensvlididtksfesmirlvdkynraidsirilekgttlpleklnkiprtsllrdilqlvynrsvtdaeklnnyeafspevyGelsfd 128
+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+++llr++lq+vyn+svtd+eklnnye+fspevyGe+sfd
ENSECAP00000011661.1 1 MENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPSNGLLRHLLQQVYNHSVTDPEKLNNYEPFSPEVYGETSFD 88
89****************************************.********************************************** PP
nonSET.txt 129 lvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaakyaelmdeelrkrmklyGkrlaeyelekgdflvdenrer 217
lvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+akyae+md+e+rk+mk+yGk++aey+le+gdfl++e+rer
ENSECAP00000011661.1 89 LVAQMIDEIKMTEDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPAKYAETMDREFRKWMKWYGKKHAEYTLERGDFLSEEWRER 177
***************************************************************************************** PP
nonSET.txt 218 iaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrinsrnlsdigtilkvveldllkgsvsWtgkpvsyylhtidrt 306
ia+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrinsrnlsdigti++vvel++lkgsvsWtgkpvsyylhtidrt
ENSECAP00000011661.1 178 IANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINSRNLSDIGTIMRVVELSPLKGSVSWTGKPVSYYLHTIDRT 266
***************************************************************************************** PP
nonSET.txt 307 ilesyfsslk 316
ile+yfsslk
ENSECAP00000011661.1 267 ILENYFSSLK 276
*******997 PP

Protein Sequence
(Fasta)
MENYVLIDYD TKSFESMQRL CDKYNRAIDS IHQLWKGTTQ PMKLNTRPSN GLLRHLLQQV 60
YNHSVTDPEK LNNYEPFSPE VYGETSFDLV AQMIDEIKMT EDDLFVDLGS GVGQVVLQVA 120
AATNCKHHYG VEKADIPAKY AETMDREFRK WMKWYGKKHA EYTLERGDFL SEEWRERIAN 180
TSVIFVNNFA FGPEVDHQLK ERFANMKEGG RIVSSKPFAP LNFRINSRNL SDIGTIMRVV 240
ELSPLKGSVS WTGKPVSYYL HTIDRTILEN YFSSLKNPKL REEQEAARRR QQRDAKSNTT 300
TPTKVPEGKA AAPVDTPMDS GAEDEKAGAT AVKKPSPSKA RKKKLNKKGR KMAGRKRGRP 360
KKASTANPER KPKKTQTALD LLHAQTVSQA ASPSPQDAYK SPHSPFYQLP PSVQRHPPDQ 420
LLLAPTPPAL QKLLESFKIQ YLQFLAYTKT PQYKANLQQL LDQEKEKNAR LLGAAQQLFG 480
HCQAQKEEIK RLFQQKLDEL GVKALTYNDL IQAQKEISAH NQQLREQTEQ LEKDNQELRS 540
QSLQLLKARC EELKLDWSTL SLESLLKEKQ ALKSQISEKQ RHCLELQISI VELEKSQRQQ 600
ELLQLKSCVP PDDGLPLPLR GKGALGRELE AEPGRLPLEL DCSKLSLPHF SSMSPELSMN 660
GHAAGYELCS ALSRPSSKQN TPQYLASPLD QEVVPCTPSH GSRPRLEKLS GLALPDYTRL 720
SPAKLVLRRH LSQDHAVSSK AAAGELHPRA EHAKENGLPY QSPVIANGIK LSPQDPRPSS 780
PVALQMTGEK GSEKGLKERT YASSGEAITS LPVSIPLSTV QPSKLPVSIP LASVVLPSRA 840
EKARSTPSPV PQPRDSSSTL EKQPGAGAHG AASSAAGGRG LALAPAGFSY AGSVAISGAL 900
AGSPAPLAPG VEPTAFDESS SSGSLFATMG SRSSTPQQPP LLVQPRNSGS ASPAHQLCAS 960
PRLSAAQGPP PDAGKVDLPC ESGLSDPESE ARRRVVFTIS AGAPGSKQSP SSKHSPLPSG 1020
ARGDSGQSHG QDSRKRGRRK RASVGTPSLS SGVSPKRRAL PSVAGLFTQS SGSPLNLNSM 1080
VNNINQPLEI TAISSPESSL KSSPVPYQDN DQPPVLKKEK PLSQTNGAHY SPLTSDEEQG 1140
SEDEPGSARR RKIATISLES KSPPKTLENG GGLAGRKPTP AGEPVNSSKW KSTFSPISDL 1200
GLAKAADSPL QASSTLSQNS LFAFRPGLDE PGSADAKLAA HPRKSFPGAL AGAGGLSPSS 1260
TPPNGFAFSG GLAADLSAHS FSDGAALSHK APEVAGLGAA PSFPAPRGKE AGAAEPGSFV 1320
NKRQLDGLGG PKGEGGKGRD VGELGLPVVG PSDRALLVHS KVGKGRDREP DSKNGHNLFI 1380
SAATVPPGGL LSGPGLATAA SSAGSATPSA QTHRPFLGSF TPGPQFALGP MSLQANLGSS 1440
VLQSLFSSVP AAAGLVHVSS AATRLTNSHA MGSFSSGVAG GAVAGN 1486
Nucleotide Sequence
(Fasta)
GACAAGCACC ACGATGCTGC TCACGAGATC ATCGAGACCA TCCGATGGGT GTGTGAAGAA 60
ATCCCAGATC TCAAGCTTGC CATGGAGAAC TATGTTTTAA TCGACTACGA CACCAAAAGC 120
TTTGAAAGCA TGCAGAGGCT CTGTGACAAG TACAACCGGG CCATCGACAG CATCCACCAG 180
CTGTGGAAGG GGACCACGCA GCCCATGAAG CTGAACACGC GGCCGTCCAA CGGGCTCCTG 240
CGTCACCTGC TGCAGCAGGT GTACAACCAC TCGGTGACTG ACCCCGAGAA GCTCAACAAC 300
TACGAGCCCT TCTCCCCGGA GGTGTACGGC GAGACCTCCT TCGACCTGGT GGCGCAGATG 360
ATCGATGAGA TCAAGATGAC TGAGGACGAC CTATTCGTGG ACCTGGGCAG CGGAGTGGGC 420
CAGGTCGTAC TGCAGGTCGC CGCTGCCACC AACTGCAAAC ATCATTATGG CGTCGAGAAA 480
GCTGACATCC CAGCCAAGTA CGCAGAGACC ATGGACCGAG AGTTCAGGAA GTGGATGAAA 540
TGGTATGGAA AAAAGCATGC AGAATACACA CTGGAAAGAG GCGATTTCCT CTCGGAAGAG 600
TGGAGAGAGC GGATTGCCAA CACAAGTGTT ATATTTGTGA ATAACTTTGC CTTTGGTCCT 660
GAGGTGGATC ACCAGCTGAA GGAGCGATTT GCAAACATGA AGGAAGGTGG CAGAATTGTG 720
TCCTCGAAGC CCTTTGCACC TCTGAACTTC AGAATAAACA GTAGAAACTT GAGTGACATC 780
GGCACCATCA TGCGCGTTGT GGAGCTCTCG CCACTGAAGG GCTCGGTGTC GTGGACGGGG 840
AAGCCGGTCT CCTACTACCT GCACACCATC GACCGCACCA TACTTGAAAA CTATTTTTCT 900
AGTCTGAAAA ACCCAAAACT CAGGGAGGAG CAAGAGGCAG CCCGGCGCCG GCAGCAGCGG 960
GACGCCAAGA GCAACACGAC CACTCCCACC AAGGTGCCCG AGGGCAAGGC GGCCGCGCCT 1020
GTGGACACCC CCATGGATTC TGGTGCTGAG GACGAGAAAG CTGGGGCGAC CGCCGTCAAA 1080
AAGCCGTCCC CGTCCAAAGC GCGGAAGAAG AAGCTGAACA AGAAGGGCCG GAAGATGGCC 1140
GGCCGGAAGC GCGGGCGTCC CAAGAAGGCG AGCACTGCGA ACCCCGAGCG TAAGCCCAAG 1200
AAGACCCAAA CTGCACTGGA CCTCCTGCAC GCGCAGACCG TGTCTCAGGC GGCGTCGCCC 1260
TCGCCGCAGG ATGCGTACAA GTCACCTCAC AGCCCGTTCT ACCAGCTACC TCCCAGCGTG 1320
CAGCGGCACC CCCCCGACCA GCTGCTGCTG GCCCCCACCC CGCCCGCACT GCAGAAGCTG 1380
CTAGAGTCCT TCAAGATTCA GTACTTGCAG TTCTTGGCGT ACACGAAGAC CCCTCAGTAC 1440
AAGGCCAACC TGCAGCAGCT GCTGGACCAG GAGAAGGAGA AGAACGCTCG CTTGCTGGGC 1500
GCCGCGCAGC AGCTGTTCGG CCACTGCCAG GCCCAGAAGG AGGAGATCAA GAGGCTCTTC 1560
CAGCAGAAGC TGGATGAGCT GGGAGTGAAG GCGCTGACCT ACAATGACCT GATCCAAGCG 1620
CAGAAGGAGA TCTCGGCTCA CAACCAGCAG CTGAGGGAGC AGACGGAGCA GCTGGAGAAG 1680
GACAACCAGG AGCTGCGGAG CCAGAGCCTG CAGCTGCTCA AGGCTCGGTG TGAGGAGCTG 1740
AAGCTGGACT GGTCCACGCT GTCCCTGGAG AGCCTGCTGA AGGAGAAGCA GGCCCTGAAG 1800
AGCCAGATCT CCGAGAAGCA GCGGCACTGC CTGGAGCTGC AGATCAGCAT CGTGGAGCTG 1860
GAGAAGAGCC AGCGGCAGCA GGAGCTCCTG CAGCTCAAGT CCTGCGTGCC GCCCGACGAT 1920
GGCCTGCCCC TGCCCCTGCG CGGGAAGGGC GCGCTGGGCC GGGAGCTGGA GGCGGAGCCC 1980
GGCCGGCTGC CCCTGGAGCT GGACTGCTCC AAGCTCTCCC TGCCCCACTT CAGCAGCATG 2040
AGCCCGGAGC TCTCCATGAA CGGCCACGCG GCCGGCTACG AGCTCTGCAG CGCGCTGAGC 2100
CGGCCCTCGT CCAAGCAGAA CACCCCCCAG TACCTGGCCT CCCCGCTGGA CCAGGAGGTC 2160
GTGCCCTGCA CCCCCAGCCA CGGCAGCCGG CCGAGGCTCG AGAAGCTGTC CGGGCTGGCC 2220
TTGCCTGACT ACACCCGGCT CTCGCCGGCC AAGCTGGTGC TGAGGCGCCA CCTGAGCCAG 2280
GACCACGCGG TCAGCAGCAA GGCAGCCGCT GGCGAGCTGC ACCCGAGAGC TGAGCACGCC 2340
AAGGAGAACG GCCTTCCCTA CCAGAGCCCT GTCATCGCCA ACGGGATCAA GCTGAGCCCT 2400
CAGGACCCTC GGCCCTCGTC CCCCGTGGCC TTACAGATGA CGGGAGAGAA GGGCAGCGAG 2460
AAGGGCCTGA AGGAGCGCAC CTACGCCAGC AGCGGGGAGG CCATCACCAG CCTGCCCGTG 2520
AGCATCCCGC TCAGCACCGT GCAGCCCAGC AAGCTGCCTG TCAGCATCCC CCTGGCCAGC 2580
GTGGTGCTGC CCAGCCGCGC CGAGAAGGCG AGAAGCACCC CCAGCCCTGT GCCGCAGCCC 2640
CGAGACTCCT CATCCACACT GGAGAAGCAG CCGGGTGCCG GTGCCCATGG TGCCGCGAGC 2700
AGCGCTGCTG GAGGCAGAGG CCTCGCCCTG GCACCTGCGG GATTCTCCTA TGCTGGCTCC 2760
GTGGCCATCA GTGGGGCCCT GGCGGGCAGC CCGGCCCCAC TCGCTCCTGG AGTCGAGCCC 2820
ACCGCCTTTG ATGAGTCCTC CAGCTCGGGG AGCCTCTTCG CCACCATGGG GTCCCGCAGC 2880
TCCACCCCGC AGCAGCCCCC CCTGCTTGTG CAGCCGCGGA ACTCCGGCTC GGCCTCGCCC 2940
GCCCACCAGC TCTGCGCCAG TCCACGGCTC AGCGCCGCCC AGGGTCCGCC CCCCGATGCC 3000
GGCAAGGTGG ACCTTCCGTG CGAGTCCGGC CTCTCCGACC CGGAGAGCGA GGCCAGGAGG 3060
AGGGTCGTCT TCACCATCTC GGCAGGTGCC CCCGGCTCCA AGCAGTCGCC TTCCAGCAAG 3120
CACAGCCCCC TGCCCTCGGG TGCCCGTGGG GACAGCGGCC AGAGCCACGG GCAGGACAGC 3180
CGCAAGCGGG GCAGGAGGAA GCGGGCGTCG GTGGGGACCC CCAGCCTCAG CTCGGGTGTG 3240
TCCCCCAAGC GCCGGGCCCT GCCATCCGTC GCCGGCCTCT TCACGCAGTC TTCAGGGTCC 3300
CCCCTGAACC TCAACTCCAT GGTCAACAAC ATCAACCAGC CTCTGGAAAT CACGGCCATC 3360
TCGTCTCCAG AGAGCTCCCT GAAGAGCTCC CCGGTCCCTT ACCAGGACAA TGACCAGCCG 3420
CCCGTGCTCA AGAAGGAGAA GCCCCTGAGC CAGACCAATG GGGCCCACTA TTCCCCGCTG 3480
ACCTCGGACG AGGAGCAGGG CTCCGAGGAC GAGCCCGGCA GCGCCAGGCG AAGAAAAATT 3540
GCAACTATCT CCTTAGAAAG CAAATCTCCT CCAAAAACCT TGGAAAACGG TGGCGGCCTG 3600
GCGGGGAGGA AGCCGACGCC CGCCGGCGAG CCCGTCAACA GCAGCAAGTG GAAGTCCACC 3660
TTCTCCCCGA TCTCCGACCT CGGCCTGGCC AAGGCCGCCG ACAGCCCGCT GCAGGCCAGC 3720
TCCACTCTGA GCCAGAACTC CCTGTTCGCT TTCCGGCCCG GCCTGGACGA GCCCGGCTCG 3780
GCTGACGCCA AGCTGGCCGC CCACCCCAGG AAGAGCTTTC CCGGCGCCCT GGCAGGGGCT 3840
GGCGGGCTGA GCCCGAGCAG CACCCCTCCC AACGGCTTCG CCTTCAGCGG GGGCCTGGCT 3900
GCTGACCTCA GTGCACACAG CTTCAGTGAT GGCGCTGCTC TCTCCCACAA GGCCCCTGAG 3960
GTGGCCGGCC TGGGTGCCGC CCCGAGCTTT CCCGCGCCGA GGGGCAAGGA GGCCGGTGCC 4020
GCGGAGCCTG GCTCATTTGT GAACAAGAGG CAGCTGGATG GACTGGGTGG CCCGAAGGGC 4080
GAGGGGGGCA AGGGCAGGGA TGTGGGCGAG CTGGGCCTGC CCGTGGTCGG GCCCTCGGAC 4140
AGGGCCTTGC TGGTGCACAG CAAGGTGGGC AAGGGCCGCG ACCGCGAGCC CGACTCCAAG 4200
AATGGCCACA ATCTCTTCAT TTCTGCTGCC ACTGTGCCTC CCGGGGGCCT CCTCAGTGGA 4260
CCAGGCCTCG CCACAGCGGC ATCCTCGGCG GGCAGCGCAA CGCCCTCTGC CCAGACGCAC 4320
CGCCCCTTCC TGGGCTCCTT CACCCCCGGC CCGCAGTTTG CCCTGGGCCC CATGTCCCTG 4380
CAGGCCAACC TGGGCTCGTC TGTGCTGCAG TCCTTGTTCA GCTCCGTGCC GGCCGCCGCC 4440
GGCCTGGTGC ACGTCTCGTC CGCCGCAACC AGACTGACCA ACTCGCACGC CATGGGCAGC 4500
TTCTCCTCCG GGGTAGCAGG CGGCGCAGTT GCAGGTAACT AGGATTTCTA CCTCACCCGT 4560
GAGACCTATG CAAGGACGGG G 4582
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Myl-0046 ENSMLUP00000004017.2 Myotis lucifugus 89 0.0 2211
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 88 0.0 2127
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 88 0.0 2125
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 87 0.0 2118
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 85 0.0 2093
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 85 0.0 2077
WERAM-Pat-0084 ENSPTRP00000059247.2 Pan troglodytes 85 0.0 2074
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 84 0.0 2074
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 85 0.0 2070
WERAM-Hos-0067 ENSP00000381657.3 Homo sapiens 85 0.0 2065
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 81 0.0 1978
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 82 0.0 1958
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 87 0.0 1955
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 81 0.0 1936
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 79 0.0 1909
WERAM-Nol-0022 ENSNLEP00000002551.1 Nomascus leucogenys 80 0.0 1889
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 86 0.0 1888
WERAM-Sah-0123 ENSSHAP00000012830.1 Sarcophilus harrisii 76 0.0 1883
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 80 0.0 1882
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 75 0.0 1827
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 74 0.0 1821
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 90 0.0 1821
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 74 0.0 1815
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 74 0.0 1811
WERAM-Tag-0004 ENSTGUP00000000304.1 Taeniopygia guttata 74 0.0 1762
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 86 0.0 1758
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 73 0.0 1643
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 79 0.0 1529
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 81 0.0 1525
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 64 0.0 1222
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 85 0.0 1131
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 59 0.0 1068
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 59 0.0 978
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 66 0.0 941
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 66 0.0 937
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 64 0.0 912
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 66 0.0 910
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 65 0.0 908
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 64 0.0 905
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 64 0.0 892
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 80 0.0 704
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 61 1e-90 333
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 3e-16 85.9
Created Date 25-Jun-2016