WERAM Information


Tag Content
WERAM ID WERAM-Hos-0067
Ensembl Protein ID ENSP00000381657.3
Uniprot Accession Q8TEK3; DOT1L_HUMAN; O60379; Q96JL1
Genbank Protein ID NP_115871.1
Protein Name Histone-lysine N-methyltransferase, H3 lysine-79 specific
Genbank Nucleotide ID NM_032482.2
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000104885.17 ENST00000398665.7 ENSP00000381657.3
Details
Type Family Domain Substrates AA References (PMIDs)
HMT nonSET DOT1 H3K79 K 26807165; 20951770; 25537518; 20889125
Status Reviewed
Classification
Type Family E-value Score Start End
HMT nonSET 7.20e-182 605.9 16 330
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Histone methyltransferase. Methylates 'Lys-79' of histone H3. Nucleosomes are preferred as substrate compared to free histones. Binds to DNA.
Domain Profile
HMT nonSET

nonSET.txt 1 dvivfkwplaiydkkydaaseiielikyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkiprtsl 92
+++v++wpl++ydk++daa+eiie+i++vCee+Pdlk+a+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+t+l
ENSP00000381657.3 16 EPAVYPWPLPVYDKHHDAAHEIIETIRWVCEEIPDLKLAMENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPSTGL 106
689******************************************************************************.********** PP
nonSET.txt 93 lrdilqlvynrsvtdaeklnnyeafspevyGelsfdlvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaakyaelm 184
lr+ilq+vyn+svtd+eklnnye+fspevyGe+sfdlvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+akyae+m
ENSP00000381657.3 107 LRHILQQVYNHSVTDPEKLNNYEPFSPEVYGETSFDLVAQMIDEIKMTDDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPAKYAETM 198
******************************************************************************************** PP
nonSET.txt 185 deelrkrmklyGkrlaeyelekgdflvdenreriaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrinsrnlsdigti 276
d+e+rk+mk+yGk++aey+le+gdfl++e+reria+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrinsrnlsdigti
ENSP00000381657.3 199 DREFRKWMKWYGKKHAEYTLERGDFLSEEWRERIANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINSRNLSDIGTI 290
******************************************************************************************** PP
nonSET.txt 277 lkvveldllkgsvsWtgkpvsyylhtidrtilesyfsslk 316
++vvel++lkgsvsWtgkpvsyylhtidrtile+yfsslk
ENSP00000381657.3 291 MRVVELSPLKGSVSWTGKPVSYYLHTIDRTILENYFSSLK 330
*************************************997 PP

Protein Sequence
MGEKLELRLK SPVGAEPAVY PWPLPVYDKH HDAAHEIIET IRWVCEEIPD LKLAMENYVL 60
IDYDTKSFES MQRLCDKYNR AIDSIHQLWK GTTQPMKLNT RPSTGLLRHI LQQVYNHSVT 120
DPEKLNNYEP FSPEVYGETS FDLVAQMIDE IKMTDDDLFV DLGSGVGQVV LQVAAATNCK 180
HHYGVEKADI PAKYAETMDR EFRKWMKWYG KKHAEYTLER GDFLSEEWRE RIANTSVIFV 240
NNFAFGPEVD HQLKERFANM KEGGRIVSSK PFAPLNFRIN SRNLSDIGTI MRVVELSPLK 300
GSVSWTGKPV SYYLHTIDRT ILENYFSSLK NPKLREEQEA ARRRQQRESK SNAATPTKGP 360
EGKVAGPADA PMDSGAEEEK AGAATVKKPS PSKARKKKLN KKGRKMAGRK RGRPKKMNTA 420
NPERKPKKNQ TALDALHAQT VSQTAASSPQ DAYRSPHSPF YQLPPSVQRH SPNPLLVAPT 480
PPALQKLLES FKIQYLQFLA YTKTPQYKAS LQELLGQEKE KNAQLLGAAQ QLLSHCQAQK 540
EEIRRLFQQK LDELGVKALT YNDLIQAQKE ISAHNQQLRE QSEQLEQDNR ALRGQSLQLL 600
KARCEELQLD WATLSLEKLL KEKQALKSQI SEKQRHCLEL QISIVELEKS QRQQELLQLK 660
SCVPPDDALS LHLRGKGALG RELEPDASRL HLELDCTKFS LPHLSSMSPE LSMNGQAAGY 720
ELCGVLSRPS SKQNTPQYLA SPLDQEVVPC TPSHVGRPRL EKLSGLAAPD YTRLSPAKIV 780
LRRHLSQDHT VPGRPAASEL HSRAEHTKEN GLPYQSPSVP GSMKLSPQDP RPLSPGALQL 840
AGEKSSEKGL RERAYGSSGE LITSLPISIP LSTVQPNKLP VSIPLASVVL PSRAERARST 900
PSPVLQPRDP SSTLEKQIGA NAHGAGSRSL ALAPAGFSYA GSVAISGALA GSPASLTPGA 960
EPATLDESSS SGSLFATVGS RSSTPQHPLL LAQPRNSLPA SPAHQLSSSP RLGGAAQGPL 1020
PEASKGDLPS DSGFSDPESE AKRRIVFTIT TGAGSAKQSP SSKHSPLTAS ARGDCVPSHG 1080
QDSRRRGRRK RASAGTPSLS AGVSPKRRAL PSVAGLFTQP SGSPLNLNSM VSNINQPLEI 1140
TAISSPETSL KSSPVPYQDH DQPPVLKKER PLSQTNGAHY SPLTSDEEPG SEDEPSSARI 1200
ERKIATISLE SKSPPKTLEN GGGLAGRKPA PAGEPVNSSK WKSTFSPISD IGLAKSADSP 1260
LQASSALSQN SLFTFRPALE EPSADAKLAA HPRKGFPGSL SGADGLSPGT NPANGCTFGG 1320
GLAADLSLHS FSDGASLPHK GPEAAGLSSP LSFPSQRGKE GSDANPFLSK RQLDGLAGLK 1380
GEGSRGKEAG EGGLPLCGPT DKTPLLSGKA AKARDREVDL KNGHNLFISA AAVPPGSLLS 1440
GPGLAPAASS AGGAASSAQT HRSFLGPFPP GPQFALGPMS LQANLGSVAG SSVLQSLFSS 1500
VPAAAGLVHV SSAATRLTNS HAMGSFSGVA GGTVGGN 1537
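The sequence above is laid out in ten-residue blocks with a running count at the end of each line, so it has to be flattened before the domain coordinates from the Classification table (start 16, end 330) can be applied. The following is a minimal Python sketch of that step. The function names are illustrative and not part of WERAM, and only the first two lines of the record are included as sample data.

```python
def parse_numbered_blocks(text):
    """Flatten block-formatted sequence lines into one string.

    Each line is expected to hold whitespace-separated blocks followed by a
    running residue count; the count is checked against the parsed length.
    """
    residues = []
    total = 0
    for line in text.strip().splitlines():
        tokens = line.split()
        if not tokens:
            continue
        expected = None
        if tokens[-1].isdigit():
            expected = int(tokens.pop())  # trailing running count
        for block in tokens:
            residues.append(block)
            total += len(block)
        if expected is not None and expected != total:
            raise ValueError(f"running count {expected} != parsed length {total}")
    return "".join(residues)


def slice_region(sequence, start, end):
    """Return residues start..end using 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates outside the sequence")
    return sequence[start - 1:end]


# First two lines of the protein sequence, copied from the record.
SAMPLE = """
MGEKLELRLK SPVGAEPAVY PWPLPVYDKH HDAAHEIIET IRWVCEEIPD LKLAMENYVL 60
IDYDTKSFES MQRLCDKYNR AIDSIHQLWK GTTQPMKLNT RPSTGLLRHI LQQVYNHSVT 120
"""

sequence = parse_numbered_blocks(SAMPLE)
domain_start = slice_region(sequence, 16, 25)
```

With the full 1537-residue sequence, `slice_region(sequence, 16, 330)` would return the region matched by the nonSET profile. The first ten residues of that slice, EPAVYPWPLP, agree with the start of the alignment shown in the Domain Profile.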
Nucleotide Sequence
CCCGCCTAGC ATGGTGCGGC GGCCGCGCGC GCGGACATGG GGGAGAAGCT GGAGCTGAGA 60
CTGAAGTCGC CCGTGGGGGC TGAGCCCGCC GTCTACCCGT GGCCGCTGCC GGTCTACGAT 120
AAACATCACG ATGCTGCTCA TGAAATCATC GAGACCATCC GATGGGTCTG TGAAGAAATC 180
CCGGATCTCA AGCTCGCTAT GGAGAATTAC GTTTTAATTG ACTATGACAC CAAAAGCTTC 240
GAGAGCATGC AGAGGCTCTG CGACAAGTAC AACCGTGCCA TCGACAGCAT CCACCAGCTG 300
TGGAAGGGCA CCACGCAGCC CATGAAGCTG AACACGCGGC CGTCCACTGG ACTCCTGCGC 360
CATATCCTGC AGCAGGTCTA CAACCACTCG GTGACCGACC CCGAGAAGCT CAACAACTAC 420
GAGCCCTTCT CCCCCGAGGT GTACGGGGAG ACCTCCTTCG ACCTGGTGGC CCAGATGATT 480
GATGAGATCA AGATGACCGA CGACGACCTG TTTGTGGACT TGGGGAGCGG TGTGGGCCAG 540
GTCGTGCTCC AGGTTGCTGC TGCCACCAAC TGCAAACATC ACTATGGCGT CGAGAAAGCA 600
GACATCCCGG CCAAGTATGC GGAGACCATG GACCGCGAGT TCAGGAAGTG GATGAAATGG 660
TATGGAAAAA AGCATGCAGA ATACACATTG GAGAGAGGCG ATTTCCTCTC AGAAGAGTGG 720
AGGGAGCGAA TCGCCAACAC GAGTGTTATA TTTGTGAATA ATTTTGCCTT TGGTCCTGAG 780
GTGGATCACC AGCTGAAGGA GCGGTTTGCA AACATGAAGG AAGGTGGCAG AATCGTGTCC 840
TCGAAACCCT TTGCACCTCT GAACTTCAGA ATAAACAGTA GAAACTTGAG TGACATCGGC 900
ACCATCATGC GCGTGGTGGA GCTCTCGCCC CTGAAGGGCT CGGTGTCGTG GACGGGGAAG 960
CCAGTCTCCT ACTACCTGCA CACTATCGAC CGCACCATAC TTGAAAACTA TTTTTCTAGT 1020
CTGAAAAACC CAAAACTCAG GGAGGAACAG GAGGCAGCCC GGCGCCGCCA GCAGCGCGAG 1080
AGCAAGAGCA ACGCGGCCAC GCCCACTAAG GGCCCAGAGG GCAAGGTGGC CGGCCCCGCC 1140
GACGCCCCCA TGGACTCTGG TGCTGAGGAA GAGAAGGCGG GAGCAGCCAC CGTGAAGAAG 1200
CCGTCTCCCT CCAAAGCCCG CAAGAAGAAG CTAAACAAGA AGGGGAGGAA GATGGCTGGC 1260
CGCAAGCGCG GGCGCCCCAA GAAGATGAAC ACTGCGAACC CCGAGCGGAA GCCCAAGAAG 1320
AACCAAACTG CACTGGATGC CCTGCACGCT CAGACCGTGT CTCAGACGGC GGCCTCCTCA 1380
CCCCAGGATG CCTACAGATC CCCTCACAGC CCGTTCTACC AGCTACCTCC GAGCGTGCAG 1440
CGGCACTCCC CCAACCCGCT GCTGGTGGCG CCCACCCCGC CCGCGCTGCA GAAGCTTCTA 1500
GAGTCCTTCA AGATCCAGTA CCTGCAGTTC CTGGCATACA CAAAGACCCC CCAGTACAAG 1560
GCCAGCCTGC AGGAGCTGCT GGGCCAGGAG AAGGAGAAGA ACGCCCAGCT CCTGGGTGCG 1620
GCTCAGCAGC TCCTCAGCCA CTGCCAGGCC CAGAAGGAGG AGATCAGGAG GCTGTTTCAG 1680
CAAAAATTGG ATGAGCTGGG TGTGAAGGCG CTGACCTACA ACGACCTGAT TCAAGCGCAG 1740
AAGGAGATCT CCGCCCATAA CCAGCAGCTG CGGGAGCAGT CGGAGCAGCT GGAGCAGGAC 1800
AACCGCGCGC TCCGCGGCCA GAGCTTGCAG CTGCTCAAGG CTCGCTGCGA GGAGCTGCAG 1860
CTGGACTGGG CCACGCTGTC GCTGGAGAAG CTGTTGAAGG AGAAGCAGGC CCTGAAGAGC 1920
CAGATCTCGG AGAAGCAGAG GCACTGCCTG GAGCTGCAGA TCAGCATTGT GGAGCTAGAG 1980
AAGAGCCAGC GGCAGCAGGA GCTCCTGCAG CTCAAGTCCT GTGTGCCGCC TGACGACGCC 2040
CTGTCCCTGC ACCTGCGTGG GAAGGGCGCC CTGGGCCGCG AGCTGGAGCC TGACGCCAGC 2100
CGGCTGCACC TGGAGCTGGA CTGCACCAAG TTCTCGCTGC CTCACTTGAG CAGCATGAGC 2160
CCGGAGCTCT CCATGAACGG CCAGGCTGCT GGCTATGAGC TCTGCGGTGT GCTGAGCCGG 2220
CCTTCGTCGA AGCAGAACAC GCCCCAGTAC CTGGCCTCAC CCCTGGACCA GGAGGTGGTG 2280
CCCTGTACCC CTAGCCACGT CGGCCGGCCG CGCCTGGAGA AGCTGTCTGG CCTAGCCGCA 2340
CCCGACTACA CTAGGCTGTC CCCGGCCAAG ATTGTGCTGA GGCGGCACCT GAGCCAGGAC 2400
CACACGGTGC CCGGCAGGCC GGCTGCCAGT GAGCTGCATT CGAGAGCTGA GCACACCAAG 2460
GAGAACGGCC TTCCCTACCA GAGCCCCAGC GTGCCTGGCA GCATGAAGCT GAGCCCTCAG 2520
GACCCGCGGC CCCTGTCCCC TGGGGCCTTG CAGCTTGCTG GAGAGAAGAG CAGTGAGAAG 2580
GGCCTGAGAG AGCGCGCCTA CGGCAGCAGC GGGGAGCTCA TCACCAGCCT GCCCATCAGC 2640
ATCCCGCTCA GCACCGTGCA GCCCAACAAG CTCCCGGTCA GCATTCCCCT GGCCAGCGTG 2700
GTGCTGCCCA GCCGCGCCGA GAGGGCGAGG AGCACCCCCA GTCCCGTGCT GCAGCCCCGT 2760
GACCCCTCGT CCACACTTGA AAAGCAGATT GGTGCTAATG CCCACGGTGC TGGGAGCAGA 2820
AGCCTTGCCC TGGCCCCCGC AGGCTTCTCC TACGCTGGCT CGGTGGCCAT CAGCGGGGCC 2880
TTGGCGGGCA GCCCGGCCTC TCTCACACCT GGAGCCGAGC CGGCCACCTT GGATGAGTCC 2940
TCCAGCTCTG GGAGCCTTTT TGCCACCGTG GGGTCCCGCA GCTCCACGCC ACAGCACCCC 3000
CTGCTGCTGG CACAGCCCCG GAACTCGCTT CCTGCCTCTC CCGCCCACCA GCTCTCCTCC 3060
AGTCCCCGGC TTGGTGGGGC CGCCCAGGGC CCGTTGCCCG AGGCCAGCAA GGGAGACCTG 3120
CCCTCCGATT CCGGCTTCTC AGATCCTGAG AGTGAAGCCA AGAGGAGGAT TGTGTTCACC 3180
ATCACCACTG GTGCGGGCAG TGCCAAGCAG TCGCCCTCCA GCAAGCACAG CCCCCTGACC 3240
GCCAGCGCCC GTGGGGACTG TGTGCCGAGC CACGGGCAGG ACAGTCGCAG GCGCGGCCGG 3300
CGGAAGCGAG CATCTGCGGG GACGCCCAGC TTGAGCGCAG GCGTGTCCCC CAAGCGCCGA 3360
GCCCTGCCGT CCGTCGCTGG CCTTTTCACA CAGCCTTCGG GGTCTCCCCT CAACCTCAAC 3420
TCCATGGTCA GTAACATCAA CCAGCCCCTG GAGATTACAG CCATCTCGTC CCCGGAGACC 3480
TCCCTGAAGA GCTCCCCTGT GCCCTACCAG GACCACGACC AGCCCCCCGT GCTCAAGAAG 3540
GAGCGGCCTC TGAGCCAGAC CAATGGGGCA CACTACTCCC CACTCACCTC AGACGAGGAG 3600
CCAGGCTCTG AGGACGAGCC CAGCAGTGCT CGAATTGAGA GAAAAATTGC AACAATCTCC 3660
TTAGAAAGCA AATCTCCCCC GAAAACCTTG GAAAATGGTG GTGGCTTGGC GGGAAGGAAG 3720
CCCGCGCCCG CCGGCGAGCC AGTCAATAGC AGCAAGTGGA AGTCCACCTT CTCGCCCATC 3780
TCCGACATCG GCCTGGCCAA GTCGGCGGAC AGCCCGCTGC AGGCCAGCTC CGCCCTCAGC 3840
CAGAACTCCC TGTTCACGTT CCGGCCCGCC CTGGAGGAGC CCTCTGCCGA TGCCAAGCTG 3900
GCCGCTCACC CCAGGAAAGG CTTTCCCGGC TCCCTGTCGG GGGCTGACGG ACTCAGCCCG 3960
GGCACCAACC CTGCCAACGG CTGCACCTTC GGCGGGGGCC TGGCCGCGGA CCTGAGTTTA 4020
CACAGCTTCA GTGATGGTGC TTCTCTTCCC CACAAGGGCC CCGAGGCGGC CGGCCTGAGC 4080
TCCCCGCTGA GCTTCCCCTC GCAGCGCGGC AAGGAGGGCT CGGACGCCAA CCCTTTCCTG 4140
AGCAAGAGGC AGCTGGACGG CCTGGCTGGG CTGAAGGGCG AGGGCAGCCG CGGCAAGGAG 4200
GCAGGGGAGG GCGGCCTACC GCTGTGCGGG CCCACGGACA AGACCCCACT GCTGAGCGGC 4260
AAGGCCGCCA AGGCCCGGGA CCGCGAGGTC GACCTCAAGA ATGGCCACAA CCTCTTCATC 4320
TCTGCGGCGG CCGTGCCTCC CGGAAGCCTC CTCAGCGGCC CCGGCCTGGC CCCGGCGGCG 4380
TCCTCCGCAG GCGGCGCGGC GTCCTCCGCC CAGACGCACC GGTCCTTCCT GGGCCCCTTC 4440
CCGCCGGGAC CGCAGTTCGC GCTCGGCCCC ATGTCCCTGC AGGCCAACCT CGGCTCCGTG 4500
GCCGGCTCCT CCGTGCTGCA GTCGCTGTTC AGCTCTGTGC CGGCCGCCGC AGGCCTGGTG 4560
CACGTGTCGT CCGCTGCCAC CAGACTGACC AACTCGCACG CCATGGGCAG CTTTTCCGGG 4620
GTGGCAGGCG GCACAGTTGG AGGTAACTAG GATTTCTACC TCAACCGCGA GACCTATGCA 4680
AGGACGGTGT GGACCAACTC GCGCCCGCGG CATGGTGCCC GCCGGCCTGC CGGGCTCCCA 4740
CCCCTGGACG GCAGAGGCAA GGACGGACGG GAGCTCCACT GTGAATCGGC GGCACGCGCC 4800
GCAGGAGGCT GGGACTGGTC CAGTTTGTAC TGTCGATAGT TTTAGATAAA GTATTTATCA 4860
TTTTTTAAAA AGTATAAACA ATTCTGACTT ATTTTATTCC ATCTAAGTGG TAAAAGGCAA 4920
CTTATTGAGA AATATAAATA TCTATATATG AGAGCTCTAT ATAAAGACAC GTGTCTGCAG 4980
GGCGGGCCCG CCAGCGGATT CGCCACAGCC TGCCCCGGTG CTATCTCGTC CCCAGGCCCG 5040
CGCCTGCCTC CACCCGCTTG GTGCTGACTA GACGCTGACA ACGCCGAACC CCGTTCTCGG 5100
AAACGCCGCC CGGCCGGCTC CCCCGACGCG CTGCTCCCGT ACCAAAGGCA GGCCCGTCGC 5160
CACCACATTC CTCGGAGGCC TCCCCGCGGC CTGAGCCCCT TCCTGAGCGC CCTGGCGCCT 5220
GCCCTGAGCT CTTCACCTTT ACCCCGGCAC TGTGAACCCC CAGACTGTTC ACCCTCCGGG 5280
GCGTGGGTTG CGCCCTTGCA TGTGAAGGGG CCTGCGCGGT GACGCAGCTG GCCATGTGCT 5340
GCGCGATGGT GCTGTGAGGA CGGCGCGGGC ACGTTGAACA AGTGCATTTA CTTTTGTATT 5400
TCTCGGCTGT CCATGGCTCG CAGCATGCCC TGCGATGCGG GGCAGGCCTG TCGTGGGTCC 5460
CTTGGTGTTT CTGTACAGGA GAGAGTCACA CTAATGAGTG GCAGTATTTT ATAGAGATGT 5520
GATGAGAATT TATAAATTTC ATAGATTTGA CAGCTTTTAT TTTTAGATGG TATAATGCAC 5580
AGTGAAGAGG AAAGAAAAGC GAGGGGAAAA AACCTTATTT ATTCAAACAG TGCACAAAAT 5640
GGCCCCAGCG TCAGCCCCGA CCCTAGACCC CTCAGTTGCA GCTCCCAGCA GCCCAGACAG 5700
AGCTGCCGGC GCCCCTGCCT GCCCCACATC CCTTCCTGTC AGGGCCACGC CTGGCACCCA 5760
TCCCTTGGAG CCTGTGCTGG TTCTCCCAGC TGCTGTGGGT GTGCTGGGGC CAGGGTGCAC 5820
TGCTGAAACC TGGCCTCTCT GGCCCTAGGC CCCAGGGTGA CGTCGGCCCC CCACTCTGCA 5880
GCCTTGGCGG GTGCCTGGGA CTGGGTGTGG AAGGAGAGGA GCTGAGGCCG GGGTGTAGCA 5940
GGCAGGCAGG GCCACTCCAG TGCTTCTGGA GCCCTGAGCA GTCAGGGCCT GGGTTGTCTG 6000
AGCAGTGGTG GCTCTGTGCC CTCCCTGGAG GATGGGATCT GGGAGTCTGA GCTCCCCGCA 6060
TCTGGCCCTG GGCTGTGTGG CACTTGCTGA GCCCACCTTC TCAAGTGCTT GCTCCTGTGA 6120
GATGGCATCG GGGAGCCCCT TCCCCAAGGT GCCACAGATC CACCCTCCAG GGAGCTGCCA 6180
GCCCTGTGTT CTGGTTCCCA AGGGCAGGAT GGACACACGT CACATCCCTA CCACGTGGCC 6240
TCCAAAGGGA GCCACGGAGG AAAGGCTTCT GTGGTTGCTA GGTGGGGGAG TCCTGTGTGG 6300
GAGGGCCTGA AGACCCCTGC TTGTGCCTGG TGAGGGGGGT GCTGCCTCCC CCAGCCCCCA 6360
ACAACCTCTC AGACCCCCAC CCTCCAACAT AGCTGAGTTC TGAAGATGGT GCTCCGGACC 6420
TGTCCTCTTA AGTGGTGCCC AGTGCCCTCC CCACCCCACG TTGGTGCTCT CAGCTAGAAG 6480
GTGCTGTGCC TCTGCCTGAG CCCCAAGCCC CGAGCCTGGC CTTCAGGACA GGCAGCCTGC 6540
TCTGTGTCGC CACGGGCCGG ATACGCCACA GGGTTGATGG CAGAGACGGC CGAGTCCCTG 6600
GTCTAGAACA AGACACATTC TTTAAACACT GTATTACTTC TGCCTCCCTC TAGGTGACAG 6660
TGGCAGTCCG GGTGCCATCA CGGGTCCTGC AGATGGCCAT GCAGGGCTCC TGCCCACGCA 6720
GGCCACCGTA TGTTCAGGAC ACGCACTGGG TCTCAGAGCC ACTGGCCCAG GCAGAAGTCT 6780
CCTTGAGCCC ACTGGGTCAT ATGCGTGTCA CCACACGTGA ACTAGTGTGG TGGCTGCCTG 6840
CGGACACCCT CCTGTTCTGA GCCCTGGGCC TGTGTTCTTC TCAGACACTC CCAGACTGAG 6900
GGGTGGTGTG TGGCGGGTGG CAGGGTGGCT GTGGAGACTG GGGATCTGGA GCCTGGTGCT 6960
GGCACCTGGC CTGAGTTTCC GTGGGCAGCT GGCGGGGACC TGTGCTGCTG CTGCTGACTG 7020
TGGGTGGGCG GGCGGCGCCT GGGAGTGGCT CTTGCTCAGG AATTGATAGG AACCCTAAAA 7080
ACTAGGATAC CCCCTCCTCG GCCCATGAGG CACGCACAGT GACTTATTTA AGACTTCCCC 7140
CTTAATTTAT CTGCCCCCAG GATGCGTCAG TCTGTTCAGT GGTCAGCAGG CCCCCCACCC 7200
CCCGCCGACT GCCCTCGCCA TCGTGGTCAG ACCCCCCTCC CAACACAACA CGCTGCTGGT 7260
CTGTGTCAGC CTTTGTAACG TGGGAGGCTC TGCCGTGTCT TCCGGGTGAA CTGTATTTGG 7320
ATTGCGCGCA TTGTCACGGT CCGCCCCTGG GCTGCAGGCG CCCCTTCCTC TGGGCACCCC 7380
TGCATTCTGC ATCCCCACCT CTAGACGCTG TAATAAACAG ACTGTTTTCA CTCGGA 7437
Sequence Source Ensembl
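The nucleotide sequence is a full transcript, so the coding region has to be located before it can be compared with the protein. The sketch below is a hedged consistency check, not something the record states: it assumes the coding region starts at the ATG at position 37, because the codons that follow it (GGG GAG AAG) encode the G, E and K that follow the initial M of the protein. Given the protein length of 1537 residues, the stop codon should then end at base 4650. The helper names and constants are illustrative.

```python
def join_blocks(line):
    """Drop the spaces between the ten-base blocks of one sequence line."""
    return "".join(line.split())


def cds_end(cds_start, n_residues):
    """1-based position of the last base of the stop codon."""
    return cds_start + 3 * (n_residues + 1) - 1


# Lines copied from the record, without their trailing running counts.
LINE_1 = "CCCGCCTAGC ATGGTGCGGC GGCCGCGCGC GCGGACATGG GGGAGAAGCT GGAGCTGAGA"     # bases 1-60
LINE_4621 = "GTGGCAGGCG GCACAGTTGG AGGTAACTAG GATTTCTACC TCAACCGCGA GACCTATGCA"  # bases 4621-4680

CDS_START = 37         # assumption: inferred from the sequences, not a field in the record
PROTEIN_LENGTH = 1537  # final running count of the protein sequence

start_codon = join_blocks(LINE_1)[CDS_START - 1:CDS_START + 2]
end = cds_end(CDS_START, PROTEIN_LENGTH)
offset = (end - 2) - 4621  # 0-based index of the stop codon within LINE_4621
stop_codon = join_blocks(LINE_4621)[offset:offset + 3]
```

Under these assumptions the start codon is ATG and the codon at bases 4648 to 4650 is TAG, which is consistent with a 1537-residue open reading frame followed by a 3' untranslated region.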
Keyword

KW-0002--3D-structure
KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0181--Complete proteome
KW-0238--DNA-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0621--Polymorphism
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0808--Transferase

Interpro

IPR025789--DOT1_dom
IPR021169--DOT1L/grappa
IPR030445--H3-K79_meTrfase
IPR029063--SAM-dependent_MTases

PROSITE

PS51569--DOT1

Pfam

PF08123--DOT1

Gene Ontology

GO:0005654--C:nucleoplasm
GO:0003677--F:DNA binding
GO:0031151--F:histone methyltransferase activity (H3-K79 specific)
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0008134--F:transcription factor binding
GO:0006342--P:chromatin silencing
GO:0051726--P:regulation of cell cycle
GO:0046425--P:regulation of JAK-STAT cascade
GO:2000677--P:regulation of transcription regulatory region DNA binding
GO:0032200--P:telomere organization

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0084 ENSPTRP00000059247.2 Pan troglodytes 99 0.0 2706
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 99 0.0 2704
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 98 0.0 2643
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 98 0.0 2626
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 91 0.0 2394
WERAM-Nol-0022 ENSNLEP00000002551.1 Nomascus leucogenys 93 0.0 2384
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 89 0.0 2372
WERAM-Myl-0046 ENSMLUP00000004017.2 Myotis lucifugus 86 0.0 2269
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 84 0.0 2196
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 83 0.0 2178
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 84 0.0 2169
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 85 0.0 2164
WERAM-Eqc-0099 ENSECAP00000011661.1 Equus caballus 85 0.0 2117
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 85 0.0 2114
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 81 0.0 2065
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 80 0.0 2032
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 75 0.0 1966
WERAM-Sah-0123 ENSSHAP00000012830.1 Sarcophilus harrisii 77 0.0 1965
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 89 0.0 1957
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 75 0.0 1956
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 75 0.0 1954
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 75 0.0 1946
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 84 0.0 1940
WERAM-Tag-0004 ENSTGUP00000000304.1 Taeniopygia guttata 75 0.0 1903
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 81 0.0 1871
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 87 0.0 1846
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 74 0.0 1746
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 82 0.0 1615
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 78 0.0 1573
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 67 0.0 1377
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 59 0.0 1149
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 59 0.0 1101
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 58 0.0 1082
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 82 0.0 1082
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 66 0.0 1018
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 63 0.0 1011
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 67 0.0 1008
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 66 0.0 1003
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 66 0.0 999
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 67 0.0 999
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 83 0.0 747
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 58 5e-102 370
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 9e-17 87.4
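The rows of the orthology table are space-separated, but species names span two or three words (for example, Mustela putorius furo), so splitting on whitespace alone does not give fixed columns. A minimal Python sketch of one way to parse a row is shown below: it takes the two identifiers from the left, the three numeric fields from the right, and treats whatever remains as the species name. The `OrthologRow` type and `parse_ortholog_row` function are hypothetical names, and reading Identity as an integer is an assumption based on the values shown.

```python
from typing import NamedTuple


class OrthologRow(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int    # as listed in the Identity column (unit not stated in the record)
    e_value: float
    score: float


def parse_ortholog_row(line):
    """Parse one orthology row whose species name may contain spaces."""
    tokens = line.split()
    if len(tokens) < 6:
        raise ValueError(f"expected at least 6 fields, got {len(tokens)}")
    weram_id, protein_id = tokens[0], tokens[1]
    identity, e_value, score = tokens[-3:]
    species = " ".join(tokens[2:-3])  # everything between the IDs and the numbers
    return OrthologRow(weram_id, protein_id, species,
                       int(identity), float(e_value), float(score))


ferret = parse_ortholog_row(
    "WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 85 0.0 2114")
yeast = parse_ortholog_row(
    "WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 9e-17 87.4")
```

Parsing from both ends keeps the function independent of how many words the species name has, which matters here because the table mixes binomial and trinomial names.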
Created Date 25-Jun-2016