WERAM Information


Tag Content
WERAM ID WERAM-Pat-0084
Ensembl Protein ID ENSPTRP00000059247.2
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPTRG00000010225.5 ENSPTRT00000067663.2 ENSPTRP00000059247.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT nonSET 1.40e-182 605.8 16 330
Organism Pan troglodytes
Domain Profile
  HMT nonSET

            nonSET.txt   1 dvivfkwplaiydkkydaaseiielikyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkipr 89 
+++v++wpl++ydk++daa+eiie+i++vCee+Pdlk+a+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+
ENSPTRP00000059247.2 16 EPAVYPWPLPVYDKHHDAAHEIIETIRWVCEEIPDLKLAMENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPS 103
689******************************************************************************.******* PP
nonSET.txt 90 tsllrdilqlvynrsvtdaeklnnyeafspevyGelsfdlvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaa 178
t+llr+ilq+vyn+svtd+eklnnye+fspevyGe+sfdlvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+a
ENSPTRP00000059247.2 104 TGLLRHILQQVYNHSVTDPEKLNNYEPFSPEVYGETSFDLVAQMIDEIKMTDDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPA 192
***************************************************************************************** PP
nonSET.txt 179 kyaelmdeelrkrmklyGkrlaeyelekgdflvdenreriaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrins 267
kyae+md+e+rk+mk+yGk++aey+le+gdfl++e+reria+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrins
ENSPTRP00000059247.2 193 KYAETMDREFRKWMKWYGKKHAEYTLERGDFLSEEWRERIANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINS 281
***************************************************************************************** PP
nonSET.txt 268 rnlsdigtilkvveldllkgsvsWtgkpvsyylhtidrtilesyfsslk 316
rnlsdigti++vvel++lkgsvsWtgkpvsyylhtidrtile+yfsslk
ENSPTRP00000059247.2 282 RNLSDIGTIMRVVELSPLKGSVSWTGKPVSYYLHTIDRTILENYFSSLK 330
**********************************************997 PP

Protein Sequence
(Fasta)
MGEKLELRLK SPVGAEPAVY PWPLPVYDKH HDAAHEIIET IRWVCEEIPD LKLAMENYVL 60
IDYDTKSFES MQRLCDKYNR AIDSIHQLWK GTTQPMKLNT RPSTGLLRHI LQQVYNHSVT 120
DPEKLNNYEP FSPEVYGETS FDLVAQMIDE IKMTDDDLFV DLGSGVGQVV LQVAAATNCK 180
HHYGVEKADI PAKYAETMDR EFRKWMKWYG KKHAEYTLER GDFLSEEWRE RIANTSVIFV 240
NNFAFGPEVD HQLKERFANM KEGGRIVSSK PFAPLNFRIN SRNLSDIGTI MRVVELSPLK 300
GSVSWTGKPV SYYLHTIDRT ILENYFSSLK NPKLREEQEA ARRRQQRESK SNAATPTKGP 360
EGKVAGPADA PMDSGAEEEK AGAATVKKPS PSKARKKKLN KKGRKMAGRK RGRPKKMNTA 420
NPERKPKKNQ TALDALHAQT VSQTAASSPQ DAYRSPHSPF LQLPPSVQRH SPNPLLVAPT 480
PPALQKLLES FKIQYLQFLA YTKTPQYKAS LQELLGQEKE KNAQLLGAAQ QLLSHCQAQK 540
EEIRRLFQQK LDELGVKALT YNDLIQAQKE ISAHNQQLRE QSEQLEQDNR ALRGQSLQLL 600
KARCEELQLD WATLSLEKLL RRKQALKSQI SEKQRHCLEL QISIVELEKS QRQQELLQLK 660
SCVPPDDALS LHLRGKGALG RELEPDASRL HLELDCTKFS LPHLSSMSPE LSMNGQAAGY 720
ELCGALSRPP SKQNTPQYLA SPLDQEVVPC TPSHVGRPRL EKLSGLAAPD YTRLSPAKIV 780
LRRHLSQDHT VPGRPAASEL HSRAEHTKEN GLPYQSPSVP GSMKLSPQDP RPLSPGALQL 840
AGEKSSEKGL RERAYGSSGE LITSLPISIP LSTVQPNKLP VSIPLASVVL PSRAERARST 900
PSPVLQPRDP SSTLEKQIGA NAHGAGSRSL ALAPAGFSYA GSVAISGALA GSPASLTPGA 960
EPATLDESSS SGSLFATVGS RSSTPQHPLL LAQPRNSLPA SPAHQLSSSP RLGGATQGPL 1020
PEATKGDLPS DSGFSDPESE AKRRIVFTIT TGAGSAKQSP SSKHSPLTTS ARGDCVPSHG 1080
QDSRKRGRRK RASAGTPSLS AGVSPKRRAL PSVAGLFTQP SGSPLNLNSM VSNINQPLEI 1140
TAISSPETSL KSSPVPYQDH DQPPVLKKER PLSQTNGAHY SPLTSDEEPG SEDEPSSARI 1200
ERKIATISLE SKSPPKTLEN GGGLAGRKPA PAGEPVNSSK WKSTFSPISD IGLAKSADSP 1260
LQASSALSQN SLFTFRPALE EPSADAKLAA HPRKGFPGSL SGADGLSPGT NPANGCTFGG 1320
GLAADLSLHS FSDGASLPHK GPEAAGLSSP LSFPSQRGKE GSDANPFLSK RQLDGLAGLK 1380
GEGSRGKEAG EGGLPLCGPT DKTPLLSGKA AKARDRELDL KNGHNLFISA AAVPPGSLLS 1440
GPGLAPAASS AGGAASSAQT HRSFLGPFPP GPQFALGPMS LQANLGSVAG SSVLQSLFSS 1500
VPAARSLVHV SSAATRLTNS HAMGSFSGVA GGTVGG 1536
Nucleotide Sequence
(Fasta)
CCCGCCTAGC ATGGTGCGGC GGCCGCGCGC GCGGACATGG GGGAGAAGCT GGAGCTGAGA 60
CTGAAGTCGC CCGTGGGGGC TGAGCCCGCC GTCTACCCGT GGCCGCTGCC GGTCTACGAT 120
AAACATCACG ATGCTGCTCA TGAAATCATC GAGACCATCC GATGGGTCTG TGAAGAAATC 180
CCGGATCTCA AGCTCGCTAT GGAGAATTAC GTTTTAATTG ACTATGACAC CAAAAGCTTC 240
GAGAGCATGC AGAGGCTCTG CGACAAGTAC AACCGTGCCA TTGACAGCAT CCACCAGCTG 300
TGGAAGGGCA CCACGCAGCC CATGAAGCTG AACACGCGGC CGTCCACTGG ACTCCTGCGC 360
CATATCCTGC AGCAGGTCTA CAACCACTCG GTGACCGACC CCGAGAAGCT CAACAACTAC 420
GAGCCCTTCT CCCCCGAGGT GTACGGGGAG ACCTCCTTCG ACCTGGTGGC CCAGATGATT 480
GACGAGATCA AGATGACCGA CGACGACCTG TTTGTGGACT TGGGGAGCGG TGTGGGTCAG 540
GTCGTGCTCC AGGTTGCTGC TGCCACCAAC TGCAAACATC ACTATGGCGT CGAGAAAGCA 600
GACATCCCGG CCAAGTATGC GGAGACCATG GACCGCGAGT TCAGGAAGTG GATGAAATGG 660
TATGGAAAAA AGCATGCAGA ATACACATTG GAGAGAGGCG ATTTCCTCTC AGAAGAGTGG 720
AGGGAGCGAA TCGCCAACAC GAGTGTTATA TTTGTGAATA ATTTTGCCTT TGGTCCTGAG 780
GTGGATCACC AGCTGAAGGA GCGGTTTGCA AACATGAAGG AAGGTGGCAG AATCGTGTCC 840
TCGAAACCCT TTGCACCTCT GAACTTCAGA ATAAACAGTA GAAACTTGAG TGACATCGGC 900
ACCATCATGC GGGTGGTGGA GCTCTCGCCC CTGAAGGGCT CGGTGTCGTG GACGGGGAAG 960
CCAGTCTCCT ACTACCTGCA CACTATCGAC CGCACCATAC TTGAAAACTA TTTTTCTAGT 1020
CTGAAAAACC CAAAACTCAG GGAGGAACAG GAGGCAGCCC GGCGCCGCCA GCAGCGCGAG 1080
AGCAAGAGTA ACGCGGCCAC GCCCACTAAG GGCCCAGAGG GCAAGGTGGC CGGCCCCGCC 1140
GACGCCCCCA TGGACTCTGG TGCTGAGGAA GAGAAGGCGG GAGCAGCCAC CGTGAAGAAG 1200
CCGTCTCCCT CCAAAGCCCG CAAGAAGAAG CTAAACAAGA AGGGGAGGAA GATGGCTGGC 1260
CGCAAGCGCG GGCGCCCCAA GAAGATGAAC ACTGCGAACC CCGAGCGGAA GCCCAAGAAG 1320
AACCAAACTG CACTGGATGC CCTGCACGCT CAGACCGTGT CTCAGACGGC GGCCTCCTCA 1380
CCCCAGGATG CCTACAGATC CCCTCACAGC CCGTTTCTCC AGCTACCTCC GAGCGTGCAG 1440
CGGCACTCCC CCAACCCGCT GCTGGTGGCG CCCACCCCGC CCGCGCTGCA GAAGCTGCTA 1500
GAGTCCTTCA AGATCCAGTA CCTGCAGTTC CTGGCATACA CAAAGACCCC CCAGTACAAG 1560
GCCAGCCTGC AGGAGCTGCT GGGCCAGGAG AAGGAGAAGA ACGCCCAGCT CCTGGGTGCG 1620
GCTCAGCAGC TCCTCAGCCA CTGCCAGGCC CAGAAGGAGG AGATCAGGAG GCTGTTCCAG 1680
CAAAAATTGG ATGAGCTGGG TGTGAAGGCG CTGACCTACA ATGACCTGAT TCAAGCGCAG 1740
AAGGAGATCT CCGCCCATAA CCAGCAGCTG CGGGAGCAGT CGGAGCAGCT GGAGCAGGAC 1800
AACCGCGCGC TCCGCGGCCA GAGCTTGCAG CTGCTCAAGG CTCGCTGCGA GGAGCTGCAG 1860
CTGGACTGGG CCACGCTGTC GCTGGAGAAG CTGTTAAGGA GGAAGCAGGC CCTGAAGAGC 1920
CAGATCTCGG AGAAGCAGAG GCACTGCCTG GAGCTGCAGA TCAGCATTGT GGAGCTAGAG 1980
AAGAGCCAGC GGCAGCAGGA GCTCCTGCAG CTCAAGTCCT GTGTGCCGCC TGACGATGCC 2040
CTGTCCCTGC ACCTGCGTGG GAAGGGCGCC CTGGGCCGCG AGCTGGAGCC TGACGCCAGC 2100
CGGCTGCACC TGGAGCTGGA CTGCACCAAG TTCTCGCTGC CTCACTTGAG CAGCATGAGC 2160
CCGGAGCTCT CCATGAACGG CCAGGCTGCT GGCTATGAGC TCTGCGGTGC GCTGAGCCGG 2220
CCTCCGTCGA AGCAGAACAC GCCCCAGTAC CTGGCCTCAC CCCTGGACCA GGAGGTGGTG 2280
CCCTGTACCC CTAGCCACGT CGGCCGGCCG CGCCTGGAGA AGCTGTCTGG CCTAGCCGCA 2340
CCCGACTACA CTAGGCTGTC CCCGGCCAAG ATTGTGCTGA GGCGGCACCT GAGCCAGGAC 2400
CACACGGTGC CCGGCAGGCC GGCCGCCAGT GAGCTGCATT CGAGAGCTGA GCACACCAAG 2460
GAGAACGGCC TTCCCTACCA GAGCCCCAGC GTGCCTGGCA GCATGAAGCT GAGCCCTCAG 2520
GACCCGCGGC CCCTGTCCCC TGGGGCCTTG CAGCTTGCTG GAGAGAAGAG CAGTGAGAAG 2580
GGCCTGAGAG AGCGCGCCTA TGGCAGCAGC GGGGAGCTCA TCACCAGCCT GCCCATCAGC 2640
ATCCCGCTCA GCACCGTGCA GCCCAACAAG CTCCCGGTCA GCATTCCCCT GGCCAGCGTG 2700
GTGCTGCCCA GCCGCGCCGA GAGGGCGAGG AGCACCCCCA GTCCCGTGCT GCAGCCCCGT 2760
GACCCCTCGT CCACACTTGA AAAGCAGATT GGTGCTAATG CCCACGGTGC TGGGAGCAGA 2820
AGCCTTGCCC TGGCCCCCGC AGGCTTCTCC TACGCTGGCT CGGTGGCCAT CAGCGGGGCC 2880
TTGGCGGGCA GCCCGGCCTC TCTCACACCT GGAGCCGAGC CGGCCACCTT GGATGAGTCC 2940
TCCAGCTCTG GGAGCCTTTT TGCCACGGTG GGGTCCCGCA GCTCCACGCC ACAGCACCCC 3000
CTGCTGCTGG CACAGCCCCG GAACTCGCTT CCTGCCTCTC CCGCCCACCA GCTCTCCTCC 3060
AGTCCCCGGC TTGGTGGGGC CACACAGGGC CCGCTGCCCG AGGCCACCAA GGGGGACCTG 3120
CCCTCCGATT CCGGCTTCTC AGATCCTGAG AGTGAAGCCA AGAGGAGGAT TGTGTTCACC 3180
ATCACCACTG GTGCGGGCAG TGCCAAGCAG TCGCCCTCCA GCAAGCACAG CCCCCTGACC 3240
ACCAGCGCCC GTGGGGACTG TGTGCCGAGC CACGGGCAGG ACAGCCGCAA GCGCGGCCGG 3300
CGGAAGCGAG CATCTGCGGG GACGCCCAGC TTGAGCGCAG GCGTGTCCCC CAAGCGCCGA 3360
GCCCTGCCGT CCGTCGCTGG CCTTTTCACA CAGCCTTCGG GGTCTCCCCT CAACCTCAAC 3420
TCCATGGTCA GTAACATCAA CCAGCCCCTG GAGATTACAG CCATCTCGTC CCCGGAGACC 3480
TCCCTGAAGA GCTCCCCTGT GCCCTACCAG GACCACGACC AGCCCCCCGT GCTCAAGAAG 3540
GAGCGGCCTC TGAGCCAGAC CAATGGGGCA CACTACTCCC CACTCACCTC AGACGAGGAG 3600
CCAGGCTCTG AGGACGAGCC CAGCAGTGCT CGAATTGAGA GAAAAATTGC AACAATCTCC 3660
TTAGAAAGCA AATCTCCCCC GAAAACCTTG GAAAACGGTG GTGGCTTGGC GGGAAGGAAG 3720
CCCGCGCCCG CCGGCGAGCC AGTCAATAGC AGCAAGTGGA AGTCCACCTT CTCGCCCATC 3780
TCCGACATCG GCCTGGCCAA GTCGGCGGAC AGCCCGCTGC AGGCCAGCTC CGCCCTCAGC 3840
CAGAACTCCC TGTTCACGTT CCGGCCCGCC CTGGAGGAGC CCTCTGCCGA TGCCAAGCTG 3900
GCCGCTCACC CCAGGAAAGG CTTTCCCGGC TCCCTGTCGG GGGCTGATGG ACTCAGCCCG 3960
GGCACCAACC CTGCCAACGG CTGCACCTTC GGCGGGGGCC TGGCCGCGGA CCTGAGTTTA 4020
CACAGCTTCA GTGATGGTGC TTCTCTTCCC CACAAGGGCC CCGAGGCGGC CGGCCTGAGC 4080
TCCCCGCTGA GCTTCCCCTC GCAGCGCGGC AAGGAGGGCT CGGACGCCAA CCCTTTCCTG 4140
AGCAAGAGGC AGCTGGACGG CCTGGCTGGG CTGAAGGGCG AGGGCAGCCG CGGCAAGGAG 4200
GCAGGGGAGG GCGGCCTACC GCTGTGCGGG CCCACGGACA AGACCCCACT GCTGAGCGGC 4260
AAGGCCGCCA AGGCCCGGGA CCGCGAGCTC GACCTCAAGA ATGGCCACAA CCTCTTCATC 4320
TCTGCGGCGG CCGTGCCTCC CGGAAGCCTC CTCAGCGGCC CCGGCCTGGC CCCGGCGGCG 4380
TCGTCCGCAG GCGGCGCGGC GTCCTCCGCC CAGACGCACC GGTCCTTCCT GGGCCCCTTC 4440
CCGCCGGGGC CGCAGTTCGC GCTCGGCCCC ATGTCCCTGC AGGCCAACCT CGGCTCCGTG 4500
GCCGGCTCCT CCGTGCTGCA GTCGCTGTTC AGCTCTGTGC CGGCCGCCCG CAGCCTGGTG 4560
CACGTGTCGT CCGCTGCCAC CAGACTGACC AACTCGCACG CCATGGGCAG CTTTTCCGGG 4620
GTGGCAGGCG GCACAGTTGG AGGT 4645
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0067 ENSP00000381657.3 Homo sapiens 99 0.0 2736
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 99 0.0 2725
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 97 0.0 2663
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 98 0.0 2647
WERAM-Nol-0022 ENSNLEP00000002551.1 Nomascus leucogenys 93 0.0 2415
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 90 0.0 2411
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 89 0.0 2394
WERAM-Myl-0046 ENSMLUP00000004017.2 Myotis lucifugus 85 0.0 2285
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 84 0.0 2211
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 83 0.0 2194
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 84 0.0 2189
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 85 0.0 2186
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 84 0.0 2134
WERAM-Eqc-0099 ENSECAP00000011661.1 Equus caballus 85 0.0 2132
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 81 0.0 2088
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 80 0.0 2055
WERAM-Sah-0123 ENSSHAP00000012830.1 Sarcophilus harrisii 77 0.0 1985
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 75 0.0 1984
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 89 0.0 1978
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 75 0.0 1975
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 75 0.0 1972
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 84 0.0 1972
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 75 0.0 1966
WERAM-Tag-0004 ENSTGUP00000000304.1 Taeniopygia guttata 74 0.0 1922
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 81 0.0 1895
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 87 0.0 1870
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 73 0.0 1769
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 82 0.0 1638
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 77 0.0 1595
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 66 0.0 1397
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 59 0.0 1168
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 59 0.0 1119
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 83 0.0 1114
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 58 0.0 1100
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 66 0.0 1038
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 63 0.0 1034
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 67 0.0 1026
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 66 0.0 1024
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 66 0.0 1018
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 66 0.0 1013
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 82 0.0 769
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 58 4e-102 370
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 9e-17 87.8
Created Date 25-Jun-2016