WERAM Information


Tag Content
WERAM ID WERAM-Sah-0123
Ensembl Protein ID ENSSHAP00000012830.1
Gene Name DOT1L
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSSHAG00000010977.1 ENSSHAT00000012935.1 ENSSHAP00000012830.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HMT nonSET 1.70e-168 559.8 2 290
Organism Sarcophilus harrisii
Domain Profile
  HMT nonSET

            nonSET.txt  27 kyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkiprtsllrdilqlvynrsvtdaeklnnye 115
++vCee+Pdlk+a+en+vlid+dtksfesm+rl+dkynraidsi++l+kgtt+p+ kln++p+++llr+ilq+vyn+svtd+eklnnye
ENSSHAP00000012830.1 2 RWVCEEIPDLKLAMENYVLIDYDTKSFESMQRLCDKYNRAIDSIHQLWKGTTQPM-KLNTRPSNGLLRHILQQVYNHSVTDPEKLNNYE 89
8******************************************************.********************************* PP
nonSET.txt 116 afspevyGelsfdlvaqvldevklkkddtfvdlGsGvGqvvlqvaaetncklsfGvekadiaakyaelmdeelrkrmklyGkrlaeyel 204
+fspevyGe+sfdlvaq++de+k+++dd+fvdlGsGvGqvvlqvaa+tnck+++Gvekadi+akyae+md+e+rk+mk+yGk++aey+l
ENSSHAP00000012830.1 90 PFSPEVYGETSFDLVAQMIDEIKMTEDDLFVDLGSGVGQVVLQVAAATNCKHHYGVEKADIPAKYAETMDREFRKWMKWYGKKHAEYTL 178
***************************************************************************************** PP
nonSET.txt 205 ekgdflvdenreriaqtdvilvnnfafgeevdkklkerladlkeGarivsskslaplnfrinsrnlsdigtilkvveldllkgsvsWtg 293
e+gdfl++e+reria+t+vi+vnnfafg+evd++lker+a++keG+rivssk++aplnfrinsrnlsdigti++vvel++lkgsvsWtg
ENSSHAP00000012830.1 179 ERGDFLSEEWRERIANTSVIFVNNFAFGPEVDHQLKERFANMKEGGRIVSSKPFAPLNFRINSRNLSDIGTIMRVVELSPLKGSVSWTG 267
***************************************************************************************** PP
nonSET.txt 294 kpvsyylhtidrtilesyfsslk 316
kpvsyylhtidrtile+yfsslk
ENSSHAP00000012830.1 268 KPVSYYLHTIDRTILENYFSSLK 290
********************997 PP

Protein Sequence
(Fasta)
FRWVCEEIPD LKLAMENYVL IDYDTKSFES MQRLCDKYNR AIDSIHQLWK GTTQPMKLNT 60
RPSNGLLRHI LQQVYNHSVT DPEKLNNYEP FSPEVYGETS FDLVAQMIDE IKMTEDDLFV 120
DLGSGVGQVV LQVAAATNCK HHYGVEKADI PAKYAETMDR EFRKWMKWYG KKHAEYTLER 180
GDFLSEEWRE RIANTSVIFV NNFAFGPEVD HQLKERFANM KEGGRIVSSK PFAPLNFRIN 240
SRNLSDIGTI MRVVELSPLK GSVSWTGKPV SYYLHTIDRT ILENYFSSLK NPKLREEQEA 300
ARRRQQRENK SNTTTPTKVQ ENKGAVALEP RVDSGAEEDK NGTTTLKKPS PSKPRKKKLN 360
KKGRKMAGRK RGRPKKMSAA NPERKPKKTP TALELLHAQT VSQAAASSPQ DAYKSPHSPY 420
YQLPPKVQRH SSNQLLATPT PPALQKLLES FKVQYLQFLA YMKTPQYKAN LQQLVEQEKE 480
KNTQLLGTAQ QLFSHCQAQK EEIKRLFQQK LDELGVKALT YNDLIQAQKE ISAHNQQLKE 540
QTKQLESDNS ELRNQSLQLL KARCEELKLD WSTLSLENLL KEKQALKNQI SEKQKHCLEL 600
QISIVELEKS QRQQELLQLK SYAPADEALS VHLRSKSSFS RELETDPARF QLELECSKLS 660
LPHINSMSPE LSMNGHATTY EIHSMLSRPS SKQNTPQYLT SHLDQEIVPC TPIHNNRQKA 720
DKLSSLSLPD YTRFSPAKLA LRRHLNQDHV VHGKATTNEI HHRTEHAKEN GLTYQSAGLS 780
NGLKLSPQET RPSSPSALQI AGEKSSDKVL KERAYASNGE TITSLPISIP LSTVQPNKLP 840
VSIPLASVVL PSRAEKVRNT PSPVHQNRES TSATLEKQFG ANSHNGMNSA VGNKSLALAT 900
SGFSYSGSLA VNGTLSSSPA HLNHGAEQAA LDDSSNSGSL FNSVGSRSST PQHPSLLMMQ 960
SRNSGQNSPA HPLAASPRLN GASQKLFSEG NKGDLPLDSG FSDPESEVKR RILFTISTNT 1020
GNLKQSPSNK HSPLTTSIRP DCGQACVQEG KKRGRRKRSS TGNPSVNSAV SPKRKPLPSV 1080
AGLFTQSSGS PLNINSMVNN INQPLEITAI SSPENSLKNS PIPYQDNDQP PVLKKEKPLI 1140
QTNGAHYSPL TSDEEQGSED EHNSIRIERK IATISLESKS PQKTENGGGL TGRKQTQLTE 1200
GINSSKWKST FSPISDINLT KTSDSPLQAA SSLSQNSLFA FRPSSEDTMA IDAKISVHPR 1260
KSFPSSLSGS DGLSPNTNSS NGFSYTGGLS AADMSLHSFH DGASLSHKTT EAAGLSSPLG 1320
FQAQRSKDAS DANPFLNKRQ LDGLGSMKGE ELPRAGGRSK EVLDMALQLG GPPDKGSLQH 1380
IGKAGKGRDR EVEFKNGHNL FISAAAAVPP GGLLNGKGLS STASVVGHPA SSVSAHHPFL 1440
NTFTTGSQFP LGPMSLQANM PAAASLVHVS SAATRLTNSH AMGNFSSGVT GGTVGGVFNH 1500
AVPSASSHQF GASFSSSAVC SSTMLSLNPV QAVVTPTSSS FQAPSSSIGT CSEARAPQHL 1560
NQVPVQPALH PPPPAPPPPN VSLPPPPPLL TASNPEPGLL QTLTSIPANK AFVAPSSAAS 1620
LQANASLSIK LASLPHKASR PSFTVHHQPL PRLALAQAAP AVPQSDAAGP PAMWVTLGMQ 1680
SPYASHLSGV KPR 1693
Nucleotide Sequence
(Fasta)
TTCAGATGGG TCTGCGAAGA AATACCAGAT CTCAAGCTTG CCATGGAAAA TTATGTTTTA 60
ATTGACTATG ATACCAAAAG CTTTGAAAGC ATGCAGAGAC TCTGTGACAA GTACAACCGA 120
GCCATTGATA GCATTCATCA ACTGTGGAAA GGAACAACTC AGCCCATGAA ATTGAACACT 180
CGGCCATCCA ATGGGCTTCT TCGACATATC TTGCAGCAAG TATATAACCA CTCAGTCACA 240
GATCCAGAGA AACTCAACAA CTATGAGCCC TTTTCTCCAG AGGTGTATGG AGAAACTTCA 300
TTCGACTTAG TTGCCCAAAT GATTGATGAG ATTAAAATGA CAGAGGATGA TTTGTTTGTT 360
GATTTGGGCA GTGGAGTGGG GCAGGTGGTT TTGCAAGTGG CTGCAGCCAC AAATTGTAAA 420
CATCACTATG GTGTGGAGAA AGCAGACATC CCGGCAAAAT ATGCTGAGAC AATGGACCGA 480
GAATTCAGAA AATGGATGAA ATGGTATGGG AAGAAGCATG CAGAATACAC ACTGGAAAGA 540
GGTGACTTCC TTTCAGAAGA ATGGCGGGAA AGAATTGCAA ATACAAGTGT TATTTTTGTG 600
AATAATTTTG CCTTTGGTCC TGAGGTGGAT CACCAACTGA AGGAGCGATT TGCTAACATG 660
AAAGAAGGTG GCAGAATTGT GTCCTCCAAA CCTTTTGCAC CTCTAAATTT TAGAATAAAC 720
AGTAGAAACT TGAGTGATAT TGGCACTATA ATGAGAGTTG TGGAACTATC ACCACTGAAA 780
GGGTCGGTTT CATGGACTGG CAAGCCAGTT TCTTACTACC TGCATACTAT TGATCGAACC 840
ATACTTGAAA ACTATTTTTC TAGTCTCAAA AATCCAAAAC TCAGGGAGGA GCAGGAGGCA 900
GCCCGCCGCC GGCAACAAAG AGAGAACAAG AGCAACACGA CGACTCCCAC CAAGGTCCAA 960
GAAAATAAGG GTGCTGTTGC TTTGGAGCCT CGAGTGGATT CTGGTGCTGA AGAAGATAAA 1020
AATGGGACAA CTACTCTTAA AAAACCATCT CCTTCCAAGC CTCGCAAAAA GAAATTAAAC 1080
AAAAAAGGAA GGAAGATGGC AGGAAGAAAA CGTGGACGTC CCAAGAAAAT GAGTGCTGCA 1140
AACCCTGAAA GGAAACCCAA GAAGACTCCC ACTGCACTGG AGCTTCTGCA TGCTCAGACT 1200
GTGTCCCAGG CAGCTGCATC CTCTCCTCAG GATGCATACA AGTCACCTCA TAGTCCATAC 1260
TATCAACTAC CTCCTAAAGT ACAACGGCAT TCCTCCAACC AGCTATTGGC AACACCAACC 1320
CCACCTGCAC TCCAGAAGCT GTTAGAATCC TTTAAGGTTC AGTACTTGCA GTTTCTGGCC 1380
TATATGAAAA CCCCACAGTA CAAAGCTAAC CTACAGCAAT TGGTGGAGCA GGAGAAGGAA 1440
AAGAACACTC AATTGTTAGG TACAGCCCAG CAGTTGTTCA GCCACTGTCA GGCTCAGAAA 1500
GAGGAGATCA AGAGACTGTT TCAGCAGAAA CTTGATGAGT TGGGAGTTAA GGCTCTGACC 1560
TACAATGATC TGATACAAGC TCAGAAAGAG ATCTCTGCTC ACAACCAACA GCTAAAAGAA 1620
CAGACAAAGC AGCTAGAAAG TGATAACAGT GAACTCAGGA ATCAAAGCTT GCAGCTGCTT 1680
AAGGCTCGGT GTGAAGAGCT GAAGCTGGAT TGGTCTACTC TCTCACTGGA AAATTTGCTG 1740
AAGGAGAAGC AGGCACTGAA GAATCAAATT TCCGAGAAGC AAAAGCACTG TCTGGAGCTT 1800
CAGATCAGCA TTGTGGAGCT TGAGAAAAGC CAAAGGCAGC AGGAACTACT GCAGCTGAAA 1860
TCCTACGCGC CGGCAGATGA GGCCCTGTCC GTCCACCTGC GAAGTAAAAG CAGCTTCAGC 1920
CGTGAGCTGG AGACTGACCC TGCCAGGTTC CAGCTGGAGC TCGAGTGCTC CAAGTTGTCT 1980
TTGCCCCACA TCAATAGCAT GAGCCCCGAA CTTTCTATGA ATGGCCATGC AACCACCTAT 2040
GAGATTCATA GTATGCTTAG TCGACCTTCT TCCAAACAAA ACACTCCACA GTATCTGACC 2100
TCCCACCTGG ACCAAGAAAT AGTTCCTTGC ACCCCCATTC ATAACAACAG ACAGAAGGCA 2160
GACAAATTGT CAAGCTTGTC CTTGCCTGAT TACACCAGGT TTTCCCCCGC TAAGTTAGCC 2220
CTAAGGAGAC ACTTGAATCA GGATCATGTA GTCCATGGAA AAGCTACAAC TAATGAAATA 2280
CACCACAGAA CTGAACATGC AAAAGAGAAT GGCCTGACAT ATCAGAGTGC TGGTCTGTCA 2340
AATGGCCTAA AGCTGAGCCC TCAGGAAACT CGGCCTTCCT CCCCCTCGGC CTTACAGATT 2400
GCAGGAGAGA AGAGCAGTGA CAAGGTTTTA AAAGAGAGAG CCTATGCCAG CAATGGGGAA 2460
ACAATCACCA GCCTACCTAT CAGTATTCCT CTCAGTACAG TGCAGCCCAA TAAACTTCCT 2520
GTCAGTATCC CATTGGCCAG TGTAGTGTTG CCTAGCCGTG CGGAGAAAGT GAGGAACACA 2580
CCTAGTCCAG TTCATCAGAA TCGAGAGTCT ACTTCAGCAA CACTTGAAAA ACAGTTTGGT 2640
GCTAATTCCC ATAATGGTAT GAACAGTGCT GTAGGAAACA AGTCCCTTGC TTTGGCTACC 2700
TCAGGTTTTT CTTATAGTGG CTCCTTGGCA GTCAATGGCA CACTTTCCAG TAGCCCAGCC 2760
CATCTTAACC ATGGTGCTGA GCAGGCAGCC CTTGATGATT CCTCCAATTC AGGGAGCCTC 2820
TTTAATTCAG TGGGGTCCCG GAGTTCTACG CCCCAGCATC CTTCTTTACT CATGATGCAG 2880
TCCCGGAACT CTGGCCAGAA CTCTCCAGCC CATCCACTTG CTGCCAGCCC GCGCCTCAAT 2940
GGAGCTTCTC AAAAATTGTT CTCTGAAGGT AACAAGGGGG ATCTTCCACT AGATTCAGGT 3000
TTTTCAGATC CTGAAAGTGA AGTCAAGAGA AGAATCCTCT TTACAATCTC GACCAATACA 3060
GGGAACCTGA AGCAGTCTCC TTCCAACAAA CACAGCCCTC TGACCACTAG CATTCGCCCG 3120
GATTGTGGAC AGGCTTGTGT GCAGGAGGGC AAGAAGAGGG GCAGAAGGAA GCGGTCTTCC 3180
ACGGGGAACC CCAGTGTGAA CTCAGCGGTA TCTCCCAAGC GCAAGCCCTT ACCATCTGTG 3240
GCAGGCCTTT TTACCCAATC TTCAGGCTCA CCTCTCAACA TCAACTCTAT GGTCAATAAC 3300
ATTAATCAAC CTTTAGAAAT AACAGCCATT TCCTCCCCTG AAAACTCCCT GAAGAACTCT 3360
CCTATTCCTT ACCAGGACAA CGACCAGCCT CCTGTGCTGA AAAAGGAGAA GCCCTTGATC 3420
CAGACGAACG GGGCACATTA CTCGCCCCTG ACCTCGGATG AAGAGCAAGG ATCTGAAGAT 3480
GAGCACAACA GTATCAGGAT TGAAAGGAAA ATCGCAACAA TCTCTTTAGA GAGCAAGTCT 3540
CCCCAGAAAA CAGAAAATGG TGGTGGCCTT ACAGGAAGGA AGCAAACCCA GCTGACTGAA 3600
GGCATAAACA GTAGTAAATG GAAGTCTACC TTTTCACCAA TATCAGACAT CAACCTTACC 3660
AAAACATCCG ACAGCCCCTT ACAGGCAGCT TCGTCCCTGA GCCAGAATTC CCTGTTTGCC 3720
TTCCGGCCGT CCTCCGAGGA CACGATGGCC ATTGATGCCA AGATCTCTGT GCACCCAAGG 3780
AAGAGCTTCC CCAGTTCCTT GTCGGGGTCG GATGGTCTCA GCCCCAACAC AAACTCCTCC 3840
AATGGATTCT CCTATACCGG GGGCCTGTCT GCTGCTGACA TGAGTTTACA CAGCTTTCAC 3900
GATGGTGCTT CTCTTTCTCA CAAAACCACC GAGGCGGCCG GCCTAAGCTC CCCTCTGGGT 3960
TTTCAGGCAC AGCGGAGCAA AGATGCATCA GATGCCAATC CTTTTCTGAA CAAGAGGCAG 4020
CTGGACGGCC TGGGCAGTAT GAAGGGAGAA GAGTTGCCGA GGGCCGGAGG CAGAAGTAAA 4080
GAGGTGCTTG ACATGGCCCT GCAGCTGGGG GGGCCCCCCG ACAAAGGCTC CCTGCAGCAC 4140
ATTGGCAAGG CTGGCAAAGG GCGGGACCGG GAGGTGGAGT TTAAGAATGG CCACAACCTC 4200
TTCATTTCTG CTGCGGCCGC CGTACCTCCA GGGGGCCTCC TCAACGGCAA AGGGCTGTCT 4260
AGCACGGCCT CTGTGGTTGG TCACCCTGCT TCTTCTGTAT CGGCGCACCA TCCCTTCCTC 4320
AACACTTTCA CCACTGGATC CCAGTTTCCC CTGGGCCCCA TGTCCCTGCA GGCCAACATG 4380
CCCGCCGCTG CCAGTTTGGT CCACGTGTCA TCAGCCGCGA CCAGATTGAC AAATTCTCAC 4440
GCTATGGGAA ACTTCTCTTC TGGGGTTACA GGTGGAACCG TTGGAGGTGT TTTTAACCAT 4500
GCGGTGCCTT CTGCCTCTTC TCATCAGTTT GGAGCCAGTT TCAGCAGTAG TGCTGTGTGT 4560
AGCAGCACCA TGCTAAGCTT AAACCCAGTG CAGGCTGTGG TCACCCCCAC TTCCTCTTCC 4620
TTCCAGGCCC CTTCCTCTAG CATAGGAACA TGTAGCGAGG CTAGAGCCCC CCAGCACCTG 4680
AACCAAGTGC CTGTGCAGCC TGCCCTGCAT CCTCCTCCCC CCGCTCCTCC TCCTCCTAAC 4740
GTCTCATTAC CTCCCCCCCC TCCATTACTC ACTGCTTCTA ACCCGGAGCC GGGGCTTCTG 4800
CAGACCCTAA CGTCCATTCC TGCTAACAAA GCTTTTGTTG CCCCCTCCTC AGCTGCTTCT 4860
CTCCAGGCTA ACGCTTCTCT GTCTATCAAG CTGGCCTCCC TCCCCCACAA AGCTTCCCGC 4920
CCCTCCTTCA CAGTCCACCA TCAGCCTCTG CCCCGTTTGG CCCTGGCCCA GGCTGCACCC 4980
GCGGTCCCGC AGTCCGACGC CGCTGGCCCA CCCGCTATGT GGGTTACCCT TGGCATGCAG 5040
TCTCCTTACG CTTCGCACCT TTCTGGGGTT AAGCCGCGAT AAAGAGCTTG CTTAGCTAGC 5100
AGTTCATATT ATGTAAG 5118
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Mod-0038 ENSMODP00000005746.2 Monodelphis domestica 92 0.0 2702
WERAM-Anp-0075 ENSAPLP00000009183.1 Anas platyrhynchos 81 0.0 2309
WERAM-Meg-0021 ENSMGAP00000001612.2 Meleagris gallopavo 80 0.0 2307
WERAM-Gaga-0008 ENSGALP00000001234.4 Gallus gallus 82 0.0 2224
WERAM-Tag-0004 ENSTGUP00000000304.1 Taeniopygia guttata 79 0.0 2220
WERAM-Myl-0046 ENSMLUP00000004017.2 Myotis lucifugus 77 0.0 2071
WERAM-Otg-0049 ENSOGAP00000003300.2 Otolemur garnettii 77 0.0 2049
WERAM-Paa-0089 ENSPANP00000009219.1 Papio anubis 77 0.0 2045
WERAM-Hos-0067 ENSP00000381657.3 Homo sapiens 77 0.0 2040
WERAM-Gog-0076 ENSGGOP00000006459.2 Gorilla gorilla 77 0.0 2038
WERAM-Pat-0084 ENSPTRP00000059247.2 Pan troglodytes 77 0.0 2032
WERAM-Poa-0077 ENSPPYP00000010482.2 Pongo abelii 77 0.0 2030
WERAM-Fia-0168 ENSFALP00000014375.1 Ficedula albicollis 80 0.0 2017
WERAM-Aim-0067 ENSAMEP00000005765.1 Ailuropoda melanoleuca 77 0.0 2001
WERAM-Bot-0098 ENSBTAP00000013182.5 Bos taurus 75 0.0 1971
WERAM-Mup-0075 ENSMPUP00000006907.1 Mustela putorius furo 76 0.0 1966
WERAM-Eqc-0099 ENSECAP00000011661.1 Equus caballus 76 0.0 1953
WERAM-Caj-0129 ENSCJAP00000024230.1 Callithrix jacchus 74 0.0 1937
WERAM-Dan-0030 ENSDNOP00000002767.3 Dasypus novemcinctus 72 0.0 1916
WERAM-Nol-0022 ENSNLEP00000002551.1 Nomascus leucogenys 75 0.0 1897
WERAM-Loa-0036 ENSLAFP00000002270.4 Loxodonta africana 72 0.0 1895
WERAM-Mum-0211 ENSMUSP00000100973.2 Mus musculus 74 0.0 1891
WERAM-Ran-0179 ENSRNOP00000043691.3 Rattus norvegicus 73 0.0 1882
WERAM-Pes-0106 ENSPSIP00000013048.1 Pelodiscus sinensis 86 0.0 1841
WERAM-Ptv-0109 ENSPVAP00000009648.1 Pteropus vampyrus 77 0.0 1838
WERAM-Ova-0153 ENSOARP00000015227.1 Ovis aries 75 0.0 1784
WERAM-Caf-0201 ENSCAFP00000032181.2 Canis familiaris 81 0.0 1783
WERAM-Cap-0178 ENSCPOP00000016104.1 Cavia porcellus 79 0.0 1767
WERAM-Ocp-0012 ENSOPRP00000001490.2 Ochotona princeps 76 0.0 1555
WERAM-Xet-0090 ENSXETP00000029898.3 Xenopus tropicalis 66 0.0 1423
WERAM-Leo-0052 ENSLOCP00000007020.1 Lepisosteus oculatus 60 0.0 1215
WERAM-Pof-0009 ENSPFOP00000001322.2 Poecilia formosa 58 0.0 1133
WERAM-Xim-0156 ENSXMAP00000012884.1 Xiphophorus maculatus 57 0.0 1121
WERAM-Orn-0052 ENSONIP00000006006.1 Oreochromis niloticus 59 0.0 1117
WERAM-Lac-0083 ENSLACP00000010704.1 Latimeria chalumnae 69 0.0 1085
WERAM-Asm-0190 ENSAMXP00000018246.1 Astyanax mexicanus 61 0.0 1050
WERAM-Fec-0094 ENSFCAP00000007980.3 Felis catus 72 0.0 1047
WERAM-Tar-0217 ENSTRUP00000045719.1 Takifugu rubripes 64 0.0 1026
WERAM-Ten-0051 ENSTNIP00000006986.1 Tetraodon nigroviridis 64 0.0 1024
WERAM-Dar-0143 ENSDARP00000083509.4 Danio rerio 66 0.0 1009
WERAM-Prc-0009 ENSPCAP00000000843.1 Procavia capensis 76 0.0 781
WERAM-Drm-0098 FBpp0292800 Drosophila melanogaster 60 9e-96 350
WERAM-Sac-0010 YDR440W Saccharomyces cerevisiae 28 9e-17 87.8
Created Date 25-Jun-2016