WERAM Information


Tag Content
WERAM ID WERAM-Nol-0022
Ensembl Protein ID ENSNLEP00000002551.1
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSNLEG00000002122.2 ENSNLET00000002686.2 ENSNLEP00000002551.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT nonSET 1.30e-182 606 16 330
Organism Nomascus leucogenys
Domain Profile
  HMT nonSET

            nonSET.txt   1 dvivfkwplaiydkkydaaseiielikyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkipr 89 
+++v++wpl++ydk++daa+eiie+i++vCee+Pdlk+a+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+
ENSNLEP00000002551.1 16 EPAVYPWPLPVYDKHHDAAHEIIETIRWVCEEIPDLKLAMENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPS 103
689******************************************************************************.******* PP
nonSET.txt 90 tsllrdilqlvynrsvtdaeklnnyeafspevyGelsfdlvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaa 178
t+llr+ilq+vyn+svtd+eklnnye+fspevyGe+sfdlvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+a
ENSNLEP00000002551.1 104 TGLLRHILQQVYNHSVTDPEKLNNYEPFSPEVYGETSFDLVAQMIDEIKMTDDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPA 192
***************************************************************************************** PP
nonSET.txt 179 kyaelmdeelrkrmklyGkrlaeyelekgdflvdenreriaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrins 267
kyae+md+e+rk+mk+yGk++aey+le+gdfl++e+reria+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrins
ENSNLEP00000002551.1 193 KYAETMDREFRKWMKWYGKKHAEYTLERGDFLSEEWRERIANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINS 281
***************************************************************************************** PP
nonSET.txt 268 rnlsdigtilkvveldllkgsvsWtgkpvsyylhtidrtilesyfsslk 316
rnlsdigti++vvel++lkgsvsWtgkpvsyylhtidrtile+yfsslk
ENSNLEP00000002551.1 282 RNLSDIGTIMRVVELSPLKGSVSWTGKPVSYYLHTIDRTILENYFSSLK 330
**********************************************997 PP

Protein Sequence
(Fasta)
MGEKLELRLK SPVGAEPAVY PWPLPVYDKH HDAAHEIIET IRWVCEEIPD LKLAMENYVL 60
IDYDTKSFES MQRLCDKYNR AIDSIHQLWK GTTQPMKLNT RPSTGLLRHI LQQVYNHSVT 120
DPEKLNNYEP FSPEVYGETS FDLVAQMIDE IKMTDDDLFV DLGSGVGQVV LQVAAATNCK 180
HHYGVEKADI PAKYAETMDR EFRKWMKWYG KKHAEYTLER GDFLSEEWRE RIANTSVIFV 240
NNFAFGPEVD HQLKERFANM KEGGRIVSSK PFAPLNFRIN SRNLSDIGTI MRVVELSPLK 300
GSVSWTGKPV SYYLHTIDRT ILENYFSSLK NPKLREEQEA ARRRQQRDSK SNAATPTNGP 360
EGKVXXXXXX XXDSGAEEEK AGAATVKKPS PSKARKKKLN KKGRKMAGRK RGRPKKMNTA 420
NPERKPKKNQ TALDALHAQT VSQTAASSPQ DAYRSPHSPF YQLPPSVQRH SPNPLLVAPT 480
PPALQKLLES FKIQYLQFLA YTKTPQYKAS LQELLGQEKE KNAQLLGTAQ QLLSHCQAQK 540
EEIRRLFQQK LDELGVKALT YNDLIQAQKE ISAHNQQLRE QSEQLEQDNR ALRSQSLQLL 600
KARCEELQLD WATLSLEKLL KEKQALKSQI SEKQRHCLEL QISIVELEKS QRQQELLQLK 660
SCVPPDDALS LHLRGKGALG RELEPDASRL HLELDCAKFS LPHLSSMSPE LSMNGQAAGY 720
ELCGALSRPS SKQNTPQYLA SPLDQEVVPC TPSHGGRPRL EKLSGLAAPD YTRLSPAKIV 780
LRRHLSQDHT VPGRPAASEL HSRAEHTKEN GLPYQSPSVP GSMKLSPQDP RPLSPGALQL 840
AGEKSSEKGL RERAYGSSGE LITSLPISIP LSTVQPNKLP VSIPLASVVL PSRAERARST 900
PSPVLQPRDP SSTLEKQIGA NAHGAGSRSL ALAPAGFSYA GSVAISGALA GSPASLTPGA 960
EPATLDESSS SGSLFATVGS RSSTPQHPLL LAQPRNSLPA SPAHQLSSSP RLGGATQGPL 1020
PEASKGDLPS DSGFSDPESE AKRRIVFTIA TGAGSAKQSP SSKHSPLTSS ARGDCVLSHG 1080
QDSRKRGRRK RASTGTPSLS AGVSPKRRAL PSVAGLFTQP SGSPLNLNSM LSNIIQPLGS 1140
TPISSPETSL KSYLMLQDHD QMSVVKKERP LSQTNGAHYS PLTSDEEPGS EDEPSSARIE 1200
RKIATISLES KSPPKTLENG GGLAGRKPAP AGEPVNSSKW KSTFSPISDI GLAKSADSPL 1260
QASSTLSQNS LFAFRPALEP SADAKLVAHP RKGFPGSLSL ADGHTPETYP VNDSFINSVA 1320
NLNLHSFIDG ASLPHKGPEA PGLSPLSFPS MSQEGSKTNP FLNLRQLDLA GLNGQGSHGK 1380
EGRQGGLFLC GPTDKTPQLT CKAAARDHKL DHLNGHNLFI SAASVPPGSL LNERGLDPAA 1440
SSTGTVPSST QTHRPFQNPF PPGP 1464
Nucleotide Sequence
(Fasta)
TGGCGGAGGC GCTGAAAGCT CCGGGCCTGT GACTACAAAG AGGGAGCCGG GGGCCGGGCC 60
GGACCGGAGC GCGGCGGCGG CGGCGGCGGC CGAGGCCGAG GCCAGGCCCC CTCCCCTCAG 120
CCTCCCGCCC CTCCCTCCCG CCCGCCCTCC TCCGCCCACC GGCGGCCCCG CCCCTCCCCC 180
AACCGCCCGC CTAGCATGGT GCGGCGGCCG CGCGCGCGGA CATGGGGGAG AAGCTGGAGC 240
TGAGACTGAA GTCGCCCGTG GGGGCTGAGC CCGCCGTGTA CCCGTGGCCG CTGCCGGTCT 300
ACGATAAACA TCACGATGCT GCTCATGAAA TCATCGAGAC CATCCGATGG GTCTGTGAAG 360
AAATCCCGGA TCTCAAGCTC GCTATGGAGA ATTACGTTTT AATTGACTAT GACACCAAAA 420
GCTTCGAGAG CATGCAGAGG CTCTGTGACA AGTACAACCG TGCCATCGAC AGCATCCACC 480
AGCTGTGGAA GGGCACCACG CAGCCCATGA AGCTGAACAC GCGGCCGTCC ACCGGGCTCC 540
TGCGCCACAT CCTGCAGCAG GTCTACAACC ACTCGGTGAC CGACCCCGAG AAGCTCAACA 600
ACTACGAGCC CTTCTCCCCC GAGGTGTATG GGGAGACCTC CTTCGACCTG GTGGCCCAGA 660
TGATTGACGA GATCAAGATG ACCGACGACG ACCTGTTTGT GGACTTGGGG AGCGGTGTGG 720
GCCAGGTCGT GCTCCAGGTT GCCGCTGCCA CCAACTGCAA ACATCACTAT GGCGTCGAGA 780
AAGCAGACAT CCCGGCCAAG TATGCGGAGA CCATGGACCG CGAGTTCAGG AAGTGGATGA 840
AATGGTATGG AAAAAAGCAT GCAGAATACA CATTGGAGAG AGGCGATTTC CTCTCAGAAG 900
AGTGGAGGGA GCGAATCGCC AACACGAGTG TTATATTTGT GAATAATTTT GCCTTTGGTC 960
CTGAGGTGGA TCACCAGCTG AAGGAGCGGT TTGCAAACAT GAAGGAAGGT GGCAGAATCG 1020
TGTCCTCGAA ACCCTTTGCA CCTCTGAACT TCAGAATAAA CAGTAGAAAC TTGAGTGACA 1080
TCGGCACCAT CATGCGCGTG GTGGAGCTCT CGCCCCTGAA GGGCTCGGTG TCGTGGACGG 1140
GGAAGCCAGT CTCCTACTAC CTGCACACTA TCGACCGCAC CATACTTGAA AACTATTTTT 1200
CTAGTCTGAA AAACCCAAAA CTCAGGGAGG AACAGGAGGC AGCCCGGCGC CGCCAGCAGC 1260
GCGACAGCAA GAGCAATGCA GCCACGCCCA CTAATGGCCC AGAGGGCAAG GTGNNNNNNN 1320
NNNNNNNNNN NNNNNNNGAC TCTGGTGCTG AGGAAGAGAA GGCGGGAGCA GCCACCGTGA 1380
AGAAGCCGTC TCCCTCCAAA GCCCGCAAGA AGAAGCTAAA CAAGAAGGGG AGGAAGATGG 1440
CCGGCCGCAA GCGTGGGCGC CCCAAGAAGA TGAACACTGC AAACCCCGAG CGGAAGCCCA 1500
AGAAGAACCA AACTGCACTG GATGCCCTGC ACGCTCAGAC CGTGTCTCAG ACGGCGGCCT 1560
CCTCACCCCA GGATGCCTAC AGATCCCCTC ACAGCCCGTT CTACCAGCTA CCTCCGAGCG 1620
TGCAGCGGCA CTCCCCCAAC CCGCTGCTGG TGGCGCCCAC CCCGCCCGCG CTGCAGAAGC 1680
TGCTAGAGTC CTTCAAGATC CAGTACCTGC AGTTCCTGGC ATACACAAAG ACCCCCCAGT 1740
ACAAGGCCAG CCTGCAGGAG CTGCTGGGCC AGGAGAAGGA GAAGAATGCC CAGCTCCTGG 1800
GCACGGCTCA GCAGCTCCTC AGCCACTGCC AGGCCCAGAA GGAGGAGATC AGGAGACTGT 1860
TCCAGCAAAA ATTGGATGAG CTGGGCGTTA AGGCGCTGAC CTACAACGAC CTGATTCAAG 1920
CGCAGAAGGA GATCTCTGCC CATAACCAGC AGCTGCGGGA GCAGTCGGAG CAGCTGGAGC 1980
AGGACAACCG CGCGCTCCGC AGCCAGAGCT TGCAGCTGCT CAAGGCTCGC TGCGAGGAGC 2040
TGCAGCTGGA CTGGGCCACG CTGTCTCTAG AGAAGCTGCT GAAGGAGAAG CAGGCCTTGA 2100
AGAGCCAGAT CTCAGAGAAG CAGAGGCACT GCCTGGAGCT GCAGATCAGC ATTGTGGAGC 2160
TAGAGAAGAG CCAGCGGCAG CAGGAGCTCC TGCAGCTCAA GTCCTGTGTG CCGCCTGACG 2220
ACGCCTTGTC CCTGCACCTG CGTGGGAAGG GCGCCCTGGG CCGCGAGCTG GAGCCTGATG 2280
CCAGCCGGCT GCACCTGGAG CTGGACTGCG CCAAGTTCTC GCTGCCTCAC TTGAGCAGCA 2340
TGAGCCCGGA GCTCTCCATG AACGGCCAGG CTGCTGGCTA TGAGCTCTGC GGCGCGCTGA 2400
GCCGGCCCTC GTCCAAGCAG AACACACCCC AGTACCTGGC CTCACCCCTG GACCAGGAGG 2460
TGGTGCCCTG CACCCCCAGC CATGGCGGCC GGCCACGCCT GGAGAAGCTC TCTGGCCTAG 2520
CCGCACCCGA CTACACTAGG CTGTCCCCCG CCAAGATTGT GCTGAGGCGG CACCTGAGCC 2580
AGGACCACAC GGTGCCCGGC AGGCCGGCCG CCAGTGAGCT GCACTCGAGA GCTGAGCACA 2640
CCAAGGAGAA CGGCCTTCCC TACCAGAGCC CCAGCGTGCC TGGCAGCATG AAGCTGAGCC 2700
CTCAGGACCC GCGGCCCCTG TCCCCTGGGG CCTTGCAGCT TGCTGGAGAG AAGAGCAGTG 2760
AGAAGGGCCT GAGAGAGCGC GCCTACGGCA GCAGCGGGGA ACTCATCACC AGCCTGCCCA 2820
TCAGCATCCC GCTCAGCACC GTGCAGCCCA ACAAGCTCCC GGTCAGCATC CCCCTGGCCA 2880
GCGTGGTGCT GCCCAGCCGC GCCGAGAGGG CGAGGAGCAC CCCCAGTCCT GTGCTGCAGC 2940
CCCGTGACCC CTCGTCCACA CTTGAAAAGC AGATTGGTGC TAATGCCCAC GGTGCTGGGA 3000
GCAGAAGCCT TGCCCTGGCC CCCGCAGGCT TCTCCTACGC TGGCTCGGTG GCCATCAGCG 3060
GGGCCTTGGC GGGCAGCCCG GCCTCTCTCA CACCTGGAGC CGAGCCGGCC ACCTTGGATG 3120
AGTCCTCCAG CTCTGGGAGC CTTTTTGCCA CCGTGGGGTC CCGCAGCTCC ACGCCACAGC 3180
ACCCCCTGCT GCTGGCACAG CCCCGGAACT CGCTCCCGGC CTCTCCCGCC CACCAGCTCT 3240
CCTCCAGTCC CCGGCTTGGT GGGGCCACCC AGGGCCCGCT CCCCGAGGCC AGCAAGGGGG 3300
ACCTGCCCTC CGATTCCGGC TTCTCAGATC CTGAGAGTGA AGCCAAGAGG AGGATCGTGT 3360
TCACCATCGC CACTGGTGCG GGCAGTGCCA AGCAGTCGCC CTCCAGCAAG CACAGCCCCC 3420
TGACCTCCAG CGCCCGTGGG GACTGTGTGC TGAGCCACGG GCAGGACAGC CGCAAGCGCG 3480
GCCGGCGGAA GCGAGCATCT ACGGGGACGC CCAGCTTGAG TGCAGGCGTG TCCCCCAAGC 3540
GCCGAGCCCT GCCATCTGTC GCTGGCCTTT TCACACAGCC GTCGGGGTCT CCCCTCAACC 3600
TCAACTCCAT GCTCAGTAAC ATCATCCAGC CGTTGGGGAG CACACCCATC TCGTCTCCCG 3660
AGACCTCCCT AAAGAGCTAC CTCATGCTCC AGGACCACGA CCAGATGAGC GTGGTCAAGA 3720
AGGAGCGTCC TTTGAGCCAG ACCAATGGGG CACACTACTC CCCACTCACC TCAGATGAGG 3780
AGCCAGGCTC TGAGGACGAG CCCAGCAGTG CCCGAATTGA GAGAAAAATT GCAACAATCT 3840
CCTTAGAAAG CAAATCTCCC CCGAAAACCT TGGAAAATGG TGGTGGCTTG GCGGGAAGGA 3900
AGCCAGCGCC TGCCGGCGAG CCAGTCAATA GCAGCAAGTG GAAGTCCACC TTCTCGCCCA 3960
TCTCTGACAT CGGCCTGGCC AAGTCTGCAG ACAGCCCGCT GCAGGCCAGC TCCACCCTCA 4020
GCCAGAACTC CCTGTTCGCG TTCCGGCCCG CCCTGGAGCC CTCTGCCGAT GCCAAGCTGG 4080
TCGCTCACCC CAGGAAAGGC TTTCCCGGTT CCCTGTCTCT TGCTGATGGA CACACCCCGG 4140
AAACCTACCC TGTGAACGAT TCATTCATAA ATTCTGTCGC GAACCTGAAT TTACACAGCT 4200
TCATTGATGG TGCTTCTCTT CCCCACAAAG GCCCCGAGGC ACCCGGCCTG TCCCCCCTGA 4260
GCTTCCCCTC GATGAGCCAG GAAGGCTCGA AAACCAACCC TTTCCTGAAC TTGAGGCAGC 4320
TGGATCTGGC TGGGCTGAAC GGTCAGGGAA GCCACGGCAA GGAGGGACGG CAGGGAGGCC 4380
TGTTCCTGTG CGGGCCCACG GACAAAACCC CGCAGCTGAC CTGCAAGGCC GCCGCCCGGG 4440
ACCACAAGCT CGACCACCTC AATGGCCACA ACCTCTTCAT CTCAGCGGCG TCCGTGCCTC 4500
CCGGAAGCCT CCTCAACGAA CGCGGTCTGG ACCCGGCGGC CTCCTCCACA GGAACCGTGC 4560
CGTCCTCCAC CCAGACACAC CGGCCCTTCC AGAACCCCTT CCCGCCGGGA CCG 4614
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0084 ENSPTRP00000059247.2 Pan troglodytes 93 0.0 2464
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 93 0.0 2463
WERAM-Hos-0067 ENSP00000381657.3 Homo sapiens 93 0.0 2450
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 93 0.0 2439
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 93 0.0 2407
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 86 0.0 2236
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 87 0.0 2194
WERAM-Myl-0046 ENSMLUP00000004017.2 Myotis lucifugus 82 0.0 2138
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 80 0.0 2016
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 80 0.0 2012
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 80 0.0 1998
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 79 0.0 1981
WERAM-Eqc-0099 ENSECAP00000011661.1 Equus caballus 81 0.0 1962
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 80 0.0 1954
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 87 0.0 1929
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 77 0.0 1891
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 80 0.0 1889
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 76 0.0 1876
WERAM-Sah-0123 ENSSHAP00000012830.1 Sarcophilus harrisii 75 0.0 1872
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 86 0.0 1843
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 73 0.0 1837
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 73 0.0 1833
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 73 0.0 1829
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 72 0.0 1826
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 79 0.0 1822
WERAM-Tag-0004 ENSTGUP00000000304.1 Taeniopygia guttata 72 0.0 1792
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 72 0.0 1697
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 81 0.0 1586
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 77 0.0 1553
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 65 0.0 1345
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 59 0.0 1151
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 60 0.0 1089
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 81 0.0 1082
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 56 0.0 1077
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 58 0.0 1077
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 66 0.0 1029
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 63 0.0 1021
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 66 0.0 1014
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 67 0.0 1011
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 67 0.0 1007
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 83 0.0 768
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 58 5e-102 370
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 6e-17 88.2
Created Date 25-Jun-2016