WERAM Information


Tag Content
WERAM ID WERAM-Tag-0004
Ensembl Protein ID ENSTGUP00000000304.1
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSTGUG00000000295.1 ENSTGUT00000000309.1 ENSTGUP00000000304.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT nonSET 4.80e-175 581 8 319
Organism Taeniopygia guttata
Domain Profile
  HMT nonSET

            nonSET.txt   1 dvivfkwplaiydkkydaaseiielikyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkipr 89 
+++v++wp+++ydk++daa+eiie+i++vCee+Pdlk+a+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+
ENSTGUP00000000304.1 8 EPAVYPWPVPVYDKHHDAAHEIIETIRWVCEEIPDLKLAMENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPS 95
689******************************************************************************.******* PP
nonSET.txt 90 tsllrdilqlvynrsvtdaeklnnyeafspevyGelsfdlvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaa 178
+llr+il+ vyn+s d+eklnnye+f pevyGe+sfdlvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+a
ENSTGUP00000000304.1 96 HGLLRHILHAVYNHS--DPEKLNNYEPF-PEVYGETSFDLVAQMIDEIKMTEDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPA 181
**************8..***********.8*********************************************************** PP
nonSET.txt 179 kyaelmdeelrkrmklyGkrlaeyelekgdflvdenreriaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrins 267
kyae+md+e+rk+mk+yGk++aey+le+gdfl++e+reria+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrins
ENSTGUP00000000304.1 182 KYAETMDREFRKWMKWYGKKHAEYTLERGDFLSEEWRERIANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINS 270
***************************************************************************************** PP
nonSET.txt 268 rnlsdigtilkvveldllkgsvsWtgkpvsyylhtidrtilesyfsslk 316
rnlsdigti++vvel++lkgsvsWtgkpvsyylhtidrtile+yfsslk
ENSTGUP00000000304.1 271 RNLSDIGTIMRVVELSPLKGSVSWTGKPVSYYLHTIDRTILENYFSSLK 319
**********************************************997 PP

Protein Sequence
(Fasta)
XKSPVGAEPA VYPWPVPVYD KHHDAAHEII ETIRWVCEEI PDLKLAMENY VLIDYDTKSF 60
ESMQRLCDKY NRAIDSIHQL WKGTTQPMKL NTRPSHGLLR HILHAVYNHS DPEKLNNYEP 120
FPEVYGETSF DLVAQMIDEI KMTEDDLFVD LGSGVGQVVL QVAAATNCKH HYGVEKADIP 180
AKYAETMDRE FRKWMKWYGK KHAEYTLERG DFLSEEWRER IANTSVIFVN NFAFGPEVDH 240
QLKERFANMK EGGRIVSSKP FAPLNFRINS RNLSDIGTIM RVVELSPLKG SVSWTGKPVS 300
YYLHTIDRTI LENYFSSLKN PKLREEQEAA RRRQQRENKS NTTTPTKVQE NKVTLLHFWV 360
LSDSGAEEEK AAANSVKKPS PSKARKKKLS KKGRKMAGRK RGRPKKMNTA STERKTKKNQ 420
TALELLHAQT VSQTSSSSPQ DAYKSPHSPY YQLPPKVQRH SSNQLLVTPT PPALQKLLDS 480
FKIQYMQFMA YMKTPQYKAS LEQLLEQEKE KNAQLLGTAQ QLFTHCQAQK EEIKRLFQQK 540
LDELGVKALT YNDLIQAQKE ISAHNQQLKE QTKQLEKDNS ELRNQSLQLL KARCEELKLD 600
WSTLSLENLL KEKQALKNQI SEKQKHCLEL QISIVELEKS QRQQELLQLK SYTPSEEPLS 660
VQLHSKSHLS REPEAEHGRF QLELECSKFP MPHINGMSPE LSMNGHATPY ELQHAFSRPS 720
SKQNTPQYPN SHLDQDIVPC TPNHSSRQKA DKVTSLSLPD YTRFSPAKIA LRRHLNQDHG 780
VNGRATANEI QRRTEHVKEN GLTYPSPGIA NGIKLSPQET RPSSPAALPS AGEKGEKDVL 840
RERTSVSNGE TITSLPISIP LSTVQPNKLP VSIPLASVVL PSRVEKVRST PSPVHQSRDS 900
STLEKQIGAS SHNGISNAAG NKPLALATSG FSYSSGSVAV NGNLTNSPAH LNHSVDQAAL 960
DDSGSLFNSV GSRSSTPQHP LLMMRNSGQS SPAQQHSSPR LKAQKIFAEG TKGDLQSDAA 1020
FSDPENEAKR RIIFTISPNT GHVKQSPSSK HSPVPGSARP EGGQAHAQDG KKRGRRKRSS 1080
TGNASGNAAV SPKRKPLPSV AGLFTQPSGS PLNINSMVNN INQPLEITAI SSPENSLKNS 1140
PVPYQDNDQP PVLKKEKPLV QSNGVHYSPL TSDDEQGSED EHNSSRIERK IATISLESKS 1200
PQKTVENGGS ISGRKQAQSN ENMNSSKWKS TFSPISDINL TKTTDSPLQA VSALSQNSLF 1260
AFRPPPEDGL PMDTKVPGHP RKSLAVPSDG LSPGTNPPNG FSYNGGLSSE LGLHGFMDGA 1320
ALPHKAGDGS AASCSLGFPS QRGKELGVAD TNLFLNKRQL EALGTKGEEL SRPGAKGKEL 1380
SELSARAGGS VEKNSVQHNG KVGKGRDRDV EFKNGHNLFI SAAVSSGGLL NGKSLSTAVS 1440
SAGNPAPSVP THHPFLNTLT TGSQFPLGPM ALQANLNSVT NSSVLQSLFN SMPAAASLVH 1500
VSSAATRLTN SHTMGNFSPG VTGGTVGGIF NHAVPSASSP HQFGASFSSS AVSSSTVLSL 1560
NPLQAVASTS SSSFPPSSSN LVTSSETRPA QHLNRAPVQS VFHPPPPPPN VSLPPPPPLL 1620
ASNSEPALLQ NLPSIPPGET FLPSSSAPVQ SNSSLSIKLA SLQHKPSRPS FTVHHPPLPR 1680
VLPQPNAAGT AAMWVTLGMQ PPYASHLSGV KPR 1713
Nucleotide Sequence
(Fasta)
NTGAAGTCTC CCGTGGGAGC CGAGCCCGCC GTGTACCCCT GGCCGGTGCC CGTCTACGAC 60
AAACACCACG ATGCTGCTCA TGAAATTATT GAGACAATTC GGTGGGTCTG TGAAGAAATC 120
CCAGATCTCA AGCTTGCCAT GGAAAACTAT GTTTTAATTG ACTATGATAC AAAAAGCTTT 180
GAGAGCATGC AGAGACTGTG TGACAAGTAC AACCGAGCCA TCGACAGCAT CCACCAGTTG 240
TGGAAGGGGA CGACGCAGCC CATGAAGCTG AACACGCGTC CCTCCCACGG GCTCCTGAGG 300
CACATCCTGC ATGCAGTCTA CAACCACTCG GACCCCGAGA AACTCAACAA CTACGAGCCC 360
TTCCCCGAGG TCTACGGAGA AACTTCCTTT GACTTGGTGG CCCAAATGAT TGATGAGATT 420
AAAATGACAG AGGATGATTT GTTCGTTGAC TTGGGCAGTG GTGTGGGGCA GGTGGTGCTT 480
CAAGTGGCTG CAGCCACAAA CTGCAAACAT CACTACGGAG TGGAGAAAGC TGATATTCCA 540
GCCAAGTACG CCGAGACGAT GGACAGAGAG TTCAGAAAAT GGATGAAATG GTATGGGAAG 600
AAACATGCAG AATACACACT GGAAAGAGGT GACTTCCTCT CAGAAGAATG GAGGGAGAGG 660
ATTGCAAACA CAAGTGTTAT TTTTGTGAAC AATTTTGCCT TCGGGCCTGA GGTGGATCAC 720
CAGCTGAAGG AGCGCTTTGC AAACATGAAG GAAGGTGGGA GAATTGTGTC CTCAAAACCC 780
TTTGCACCTC TAAATTTTAG AATTAACAGT CGAAACTTGA GTGATATTGG CACTATCATG 840
AGGGTTGTGG AGCTGTCACC ACTGAAGGGC TCTGTGTCCT GGACGGGGAA ACCAGTCTCC 900
TACTACCTGC ACACCATTGA CAGAACCATA CTTGAAAACT ATTTTTCTAG TCTCAAAAAT 960
CCAAAACTCA GGGAGGAACA AGAGGCAGCT AGACGTCGTC AACAACGAGA AAACAAAAGT 1020
AACACAACGA CTCCAACGAA GGTCCAGGAA AACAAGGTGA CCCTCCTGCA TTTCTGGGTG 1080
CTGAGTGATT CTGGTGCTGA AGAAGAAAAA GCTGCAGCAA ATTCTGTTAA AAAGCCATCT 1140
CCCTCCAAGG CCCGGAAGAA GAAGCTGAGT AAGAAAGGCA GGAAAATGGC AGGGAGGAAA 1200
CGAGGGCGCC CCAAGAAAAT GAACACGGCG AGTACTGAGC GCAAGACCAA GAAGAACCAA 1260
ACTGCACTAG AACTTCTGCA TGCTCAGACT GTCTCCCAGA CATCTTCATC CTCTCCTCAG 1320
GATGCCTACA AGTCACCTCA TAGCCCATAT TACCAACTAC CTCCTAAAGT GCAACGGCAT 1380
TCGTCCAACC AGCTCCTGGT GACACCCACC CCCCCTGCAC TACAGAAGCT GCTAGACTCA 1440
TTTAAGATCC AATACATGCA GTTCATGGCA TACATGAAAA CTCCTCAGTA CAAAGCAAGT 1500
CTGGAACAGT TACTGGAGCA GGAAAAGGAG AAGAATGCTC AGCTGCTGGG AACAGCCCAG 1560
CAGTTATTCA CCCACTGCCA GGCCCAGAAA GAGGAAATCA AGCGCCTGTT CCAGCAGAAG 1620
CTCGATGAGC TGGGAGTTAA AGCTCTGACA TACAACGACC TGATCCAGGC TCAGAAGGAA 1680
ATCTCTGCTC ACAACCAGCA GCTGAAAGAA CAGACAAAAC AGCTGGAGAA GGACAACAGT 1740
GAACTCAGGA ACCAGAGCTT GCAGCTGCTG AAGGCTCGGT GTGAAGAGCT GAAGCTGGAT 1800
TGGTCAACGC TGTCACTGGA GAACTTGCTG AAAGAGAAGC AGGCCTTGAA GAATCAGATT 1860
TCTGAGAAGC AAAAGCACTG TTTGGAACTG CAGATCAGCA TCGTGGAACT CGAGAAGAGT 1920
CAGCGGCAGC AGGAGCTGCT GCAGCTCAAG TCCTACACAC CCTCAGAGGA GCCCCTGTCA 1980
GTGCAGCTGC ACAGCAAAAG CCACCTGAGC CGTGAGCCCG AGGCTGAGCA CGGCAGGTTC 2040
CAGCTGGAGC TGGAGTGCTC CAAGTTCCCC ATGCCCCACA TCAATGGCAT GAGCCCCGAG 2100
CTGTCCATGA ACGGCCACGC CACCCCCTAC GAGCTGCAGC ACGCCTTCAG CCGGCCCTCG 2160
TCCAAGCAGA ACACCCCCCA GTACCCCAAC TCCCACCTGG ACCAAGACAT CGTGCCCTGC 2220
ACCCCCAACC ACAGCAGCAG GCAGAAGGCA GACAAGGTGA CCAGCCTGTC CCTCCCTGAT 2280
TACACCAGGT TCTCCCCTGC TAAAATCGCC CTGCGCAGAC ACTTGAATCA GGACCACGGA 2340
GTCAATGGAA GAGCAACAGC TAATGAGATA CAGAGGAGGA CTGAACATGT CAAAGAGAAT 2400
GGCCTTACGT ATCCAAGCCC TGGAATTGCA AATGGCATAA AGCTGAGTCC TCAGGAAACT 2460
CGGCCCTCCT CCCCCGCGGC CTTACCCAGT GCAGGAGAGA AGGGAGAGAA AGATGTTTTG 2520
AGAGAGAGAA CCTCCGTGAG TAATGGAGAA ACTATCACCA GCCTCCCTAT CAGTATTCCT 2580
CTGAGCACAG TGCAGCCCAA TAAACTCCCT GTTAGTATCC CTCTGGCCAG CGTAGTCCTA 2640
CCTAGCCGTG TTGAGAAGGT GAGAAGCACA CCCAGCCCAG TTCACCAGAG CAGAGACTCG 2700
TCGACACTTG AAAAGCAGAT TGGTGCTAGT TCTCATAATG GCATAAGCAA TGCTGCTGGA 2760
AACAAGCCTC TGGCTTTGGC TACCTCAGGT TTTTCTTACT CTTCTGGCTC CGTGGCAGTC 2820
AATGGAAATC TTACAAACAG CCCAGCCCAT CTTAACCATA GTGTTGATCA GGCAGCTCTG 2880
GACGACTCTG GGAGTCTCTT TAACTCTGTA GGGTCTCGGA GTTCCACTCC ACAACATCCT 2940
TTGCTAATGA TGAGGAACTC TGGGCAGAGC TCCCCTGCTC AGCAGCACTC GAGCCCCCGC 3000
TTAAAGGCTC AGAAGATCTT TGCCGAAGGA ACAAAGGGGG ACCTGCAGTC TGATGCAGCG 3060
TTTTCAGATC CTGAGAACGA GGCCAAGAGG AGAATCATCT TTACAATCTC TCCTAATACA 3120
GGACATGTAA AACAATCCCC TTCCAGCAAG CACAGCCCCG TGCCCGGCAG CGCCCGGCCG 3180
GAGGGCGGCC AGGCCCACGC GCAGGACGGG AAGAAGCGGG GCCGGCGGAA GAGATCCTCC 3240
ACTGGGAACG CCAGCGGGAA CGCCGCCGTC TCCCCGAAAC GCAAACCCCT GCCCTCGGTG 3300
GCCGGGCTCT TTACGCAGCC CTCGGGGTCG CCGCTCAATA TCAACTCCAT GGTCAATAAC 3360
ATTAACCAGC CCTTAGAAAT AACAGCCATT TCCTCTCCTG AAAACTCCCT GAAGAACTCT 3420
CCTGTTCCTT ACCAGGACAA CGACCAGCCC CCAGTGCTGA AAAAAGAGAA GCCCCTCGTT 3480
CAGAGCAACG GTGTCCATTA CTCCCCCCTG ACATCGGATG ACGAGCAGGG CTCTGAGGAT 3540
GAGCACAATA GCAGCAGAAT TGAGAGGAAA ATTGCAACAA TATCATTGGA AAGCAAATCT 3600
CCACAGAAGA CTGTTGAAAA TGGTGGCAGC ATATCGGGGA GGAAACAAGC ACAAAGCAAC 3660
GAGAACATGA ACAGCAGCAA ATGGAAATCG ACTTTTTCAC CGATATCTGA CATCAACCTG 3720
ACCAAAACCA CAGACAGCCC CTTGCAGGCC GTGTCAGCTC TGAGCCAGAA CTCCCTGTTT 3780
GCCTTCAGGC CGCCCCCCGA GGATGGGCTG CCCATGGACA CCAAGGTCCC GGGACACCCC 3840
AGGAAGAGCC TGGCAGTGCC CTCGGATGGG CTGAGCCCTG GCACAAACCC CCCCAATGGC 3900
TTCAGCTACA ATGGGGGCCT GTCCTCCGAG CTGGGCCTGC ACGGCTTCAT GGACGGCGCT 3960
GCTCTTCCAC ACAAAGCCGG GGATGGTTCG GCTGCCAGCT GCTCCCTGGG CTTCCCCTCG 4020
CAGCGAGGCA AGGAGCTGGG GGTGGCAGAC ACGAACCTGT TCCTGAACAA GAGGCAGCTG 4080
GAAGCTCTGG GCACCAAGGG AGAAGAGCTG AGCCGGCCGG GGGCGAAGGG CAAAGAGCTC 4140
AGCGAGCTGA GCGCCCGAGC GGGGGGGTCT GTGGAGAAAA ACTCTGTGCA GCACAACGGA 4200
AAGGTCGGCA AGGGAAGGGA CCGAGACGTG GAGTTTAAAA ATGGCCACAA CCTTTTCATT 4260
TCTGCTGCTG TTTCGTCTGG TGGCCTTCTG AATGGTAAAA GCCTTTCTAC TGCTGTTTCC 4320
TCAGCAGGGA ACCCAGCGCC GTCTGTCCCG ACGCACCATC CTTTCCTCAA CACTCTGACC 4380
ACTGGATCAC AGTTCCCCCT CGGCCCCATG GCCTTGCAAG CAAACCTCAA CTCGGTGACA 4440
AATTCCTCAG TATTGCAGTC CTTATTTAAT TCAATGCCAG CTGCTGCCAG TCTGGTCCAC 4500
GTGTCATCAG CTGCAACCAG ACTGACTAAT TCTCACACTA TGGGGAACTT CTCTCCTGGG 4560
GTTACAGGTG GAACAGTTGG AGGTATTTTT AACCATGCGG TGCCTTCTGC CTCCTCTCCT 4620
CATCAATTTG GAGCCAGTTT CAGCAGCAGT GCTGTCTCTA GCAGCACCGT GCTAAGCTTA 4680
AACCCTCTGC AGGCTGTTGC CAGCACCTCA TCCTCATCCT TCCCACCCTC TTCCTCTAAT 4740
TTAGTAACAT CTAGTGAGAC TAGACCTGCT CAGCACCTCA ACAGAGCTCC AGTGCAATCT 4800
GTCTTTCATC CCCCTCCGCC CCCTCCTAAC GTGTCCTTGC CTCCCCCTCC TCCTTTACTC 4860
GCTTCTAACT CCGAGCCTGC TCTCCTGCAG AACCTGCCCT CCATCCCTCC TGGCGAGACG 4920
TTCCTGCCCT CCTCCTCTGC TCCTGTCCAG TCTAACTCTT CTTTGTCTAT TAAACTGGCT 4980
TCTCTCCAGC ACAAACCCTC CCGCCCCTCC TTTACAGTCC ATCACCCGCC CCTGCCCCGC 5040
GTGCTCCCGC AGCCCAACGC CGCTGGCACG GCCGCTATGT GGGTGACCCT TGGCATGCAG 5100
CCTCCTTATG CTTCGCACCT TTCGGGGGTT AAGCCACGAT AA 5143
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 90 0.0 2641
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 91 0.0 2521
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 90 0.0 2475
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 93 0.0 2390
WERAM-Sah-0123 ENSSHAP00000012830.1 Sarcophilus harrisii 79 0.0 2234
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 77 0.0 2211
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 72 0.0 2045
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 72 0.0 2035
WERAM-Myl-0046 ENSMLUP00000004017.2 Myotis lucifugus 75 0.0 1987
WERAM-Hos-0067 ENSP00000381657.3 Homo sapiens 75 0.0 1960
WERAM-Pat-0084 ENSPTRP00000059247.2 Pan troglodytes 74 0.0 1952
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 74 0.0 1952
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 70 0.0 1950
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 74 0.0 1907
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 73 0.0 1902
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 72 0.0 1871
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 71 0.0 1845
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 71 0.0 1844
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 70 0.0 1826
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 72 0.0 1825
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 89 0.0 1817
WERAM-Eqc-0099 ENSECAP00000011661.1 Equus caballus 74 0.0 1810
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 70 0.0 1806
WERAM-Nol-0022 ENSNLEP00000002551.1 Nomascus leucogenys 72 0.0 1789
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 73 0.0 1701
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 73 0.0 1680
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 77 0.0 1664
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 78 0.0 1624
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 69 0.0 1439
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 71 0.0 1396
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 59 0.0 1184
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 59 0.0 1097
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 58 0.0 1097
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 57 0.0 1064
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 66 0.0 1062
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 65 0.0 1043
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 64 0.0 1015
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 66 0.0 1011
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 64 0.0 997
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 69 0.0 925
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 73 0.0 674
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 56 1e-96 353
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 7e-17 88.2
Created Date 25-Jun-2016