WERAM Information
Tag | Content
---|---
WERAM ID | WERAM-Tag-0004
Ensembl Protein ID | ENSTGUP00000000304.1
Gene Name | DOT1L
Ensembl Information |
Status | Unreviewed
Classification |
Organism | Taeniopygia guttata
Domain Profile | HMT nonSET nonSET.txt 1 dvivfkwplaiydkkydaaseiielikyvCeelPdlkaafensvlididtksfesmirlvdkynraidsirilekgttlpleklnkipr 89 |
Protein Sequence (Fasta) | XKSPVGAEPA VYPWPVPVYD KHHDAAHEII ETIRWVCEEI PDLKLAMENY VLIDYDTKSF 60 ESMQRLCDKY NRAIDSIHQL WKGTTQPMKL NTRPSHGLLR HILHAVYNHS DPEKLNNYEP 120 FPEVYGETSF DLVAQMIDEI KMTEDDLFVD LGSGVGQVVL QVAAATNCKH HYGVEKADIP 180 AKYAETMDRE FRKWMKWYGK KHAEYTLERG DFLSEEWRER IANTSVIFVN NFAFGPEVDH 240 QLKERFANMK EGGRIVSSKP FAPLNFRINS RNLSDIGTIM RVVELSPLKG SVSWTGKPVS 300 YYLHTIDRTI LENYFSSLKN PKLREEQEAA RRRQQRENKS NTTTPTKVQE NKVTLLHFWV 360 LSDSGAEEEK AAANSVKKPS PSKARKKKLS KKGRKMAGRK RGRPKKMNTA STERKTKKNQ 420 TALELLHAQT VSQTSSSSPQ DAYKSPHSPY YQLPPKVQRH SSNQLLVTPT PPALQKLLDS 480 FKIQYMQFMA YMKTPQYKAS LEQLLEQEKE KNAQLLGTAQ QLFTHCQAQK EEIKRLFQQK 540 LDELGVKALT YNDLIQAQKE ISAHNQQLKE QTKQLEKDNS ELRNQSLQLL KARCEELKLD 600 WSTLSLENLL KEKQALKNQI SEKQKHCLEL QISIVELEKS QRQQELLQLK SYTPSEEPLS 660 VQLHSKSHLS REPEAEHGRF QLELECSKFP MPHINGMSPE LSMNGHATPY ELQHAFSRPS 720 SKQNTPQYPN SHLDQDIVPC TPNHSSRQKA DKVTSLSLPD YTRFSPAKIA LRRHLNQDHG 780 VNGRATANEI QRRTEHVKEN GLTYPSPGIA NGIKLSPQET RPSSPAALPS AGEKGEKDVL 840 RERTSVSNGE TITSLPISIP LSTVQPNKLP VSIPLASVVL PSRVEKVRST PSPVHQSRDS 900 STLEKQIGAS SHNGISNAAG NKPLALATSG FSYSSGSVAV NGNLTNSPAH LNHSVDQAAL 960 DDSGSLFNSV GSRSSTPQHP LLMMRNSGQS SPAQQHSSPR LKAQKIFAEG TKGDLQSDAA 1020 FSDPENEAKR RIIFTISPNT GHVKQSPSSK HSPVPGSARP EGGQAHAQDG KKRGRRKRSS 1080 TGNASGNAAV SPKRKPLPSV AGLFTQPSGS PLNINSMVNN INQPLEITAI SSPENSLKNS 1140 PVPYQDNDQP PVLKKEKPLV QSNGVHYSPL TSDDEQGSED EHNSSRIERK IATISLESKS 1200 PQKTVENGGS ISGRKQAQSN ENMNSSKWKS TFSPISDINL TKTTDSPLQA VSALSQNSLF 1260 AFRPPPEDGL PMDTKVPGHP RKSLAVPSDG LSPGTNPPNG FSYNGGLSSE LGLHGFMDGA 1320 ALPHKAGDGS AASCSLGFPS QRGKELGVAD TNLFLNKRQL EALGTKGEEL SRPGAKGKEL 1380 SELSARAGGS VEKNSVQHNG KVGKGRDRDV EFKNGHNLFI SAAVSSGGLL NGKSLSTAVS 1440 SAGNPAPSVP THHPFLNTLT TGSQFPLGPM ALQANLNSVT NSSVLQSLFN SMPAAASLVH 1500 VSSAATRLTN SHTMGNFSPG VTGGTVGGIF NHAVPSASSP HQFGASFSSS AVSSSTVLSL 1560 NPLQAVASTS SSSFPPSSSN LVTSSETRPA QHLNRAPVQS VFHPPPPPPN VSLPPPPPLL 1620 ASNSEPALLQ NLPSIPPGET FLPSSSAPVQ SNSSLSIKLA SLQHKPSRPS FTVHHPPLPR 1680 
VLPQPNAAGT AAMWVTLGMQ PPYASHLSGV KPR 1713 |
Nucleotide Sequence (Fasta) | NTGAAGTCTC CCGTGGGAGC CGAGCCCGCC GTGTACCCCT GGCCGGTGCC CGTCTACGAC 60 AAACACCACG ATGCTGCTCA TGAAATTATT GAGACAATTC GGTGGGTCTG TGAAGAAATC 120 CCAGATCTCA AGCTTGCCAT GGAAAACTAT GTTTTAATTG ACTATGATAC AAAAAGCTTT 180 GAGAGCATGC AGAGACTGTG TGACAAGTAC AACCGAGCCA TCGACAGCAT CCACCAGTTG 240 TGGAAGGGGA CGACGCAGCC CATGAAGCTG AACACGCGTC CCTCCCACGG GCTCCTGAGG 300 CACATCCTGC ATGCAGTCTA CAACCACTCG GACCCCGAGA AACTCAACAA CTACGAGCCC 360 TTCCCCGAGG TCTACGGAGA AACTTCCTTT GACTTGGTGG CCCAAATGAT TGATGAGATT 420 AAAATGACAG AGGATGATTT GTTCGTTGAC TTGGGCAGTG GTGTGGGGCA GGTGGTGCTT 480 CAAGTGGCTG CAGCCACAAA CTGCAAACAT CACTACGGAG TGGAGAAAGC TGATATTCCA 540 GCCAAGTACG CCGAGACGAT GGACAGAGAG TTCAGAAAAT GGATGAAATG GTATGGGAAG 600 AAACATGCAG AATACACACT GGAAAGAGGT GACTTCCTCT CAGAAGAATG GAGGGAGAGG 660 ATTGCAAACA CAAGTGTTAT TTTTGTGAAC AATTTTGCCT TCGGGCCTGA GGTGGATCAC 720 CAGCTGAAGG AGCGCTTTGC AAACATGAAG GAAGGTGGGA GAATTGTGTC CTCAAAACCC 780 TTTGCACCTC TAAATTTTAG AATTAACAGT CGAAACTTGA GTGATATTGG CACTATCATG 840 AGGGTTGTGG AGCTGTCACC ACTGAAGGGC TCTGTGTCCT GGACGGGGAA ACCAGTCTCC 900 TACTACCTGC ACACCATTGA CAGAACCATA CTTGAAAACT ATTTTTCTAG TCTCAAAAAT 960 CCAAAACTCA GGGAGGAACA AGAGGCAGCT AGACGTCGTC AACAACGAGA AAACAAAAGT 1020 AACACAACGA CTCCAACGAA GGTCCAGGAA AACAAGGTGA CCCTCCTGCA TTTCTGGGTG 1080 CTGAGTGATT CTGGTGCTGA AGAAGAAAAA GCTGCAGCAA ATTCTGTTAA AAAGCCATCT 1140 CCCTCCAAGG CCCGGAAGAA GAAGCTGAGT AAGAAAGGCA GGAAAATGGC AGGGAGGAAA 1200 CGAGGGCGCC CCAAGAAAAT GAACACGGCG AGTACTGAGC GCAAGACCAA GAAGAACCAA 1260 ACTGCACTAG AACTTCTGCA TGCTCAGACT GTCTCCCAGA CATCTTCATC CTCTCCTCAG 1320 GATGCCTACA AGTCACCTCA TAGCCCATAT TACCAACTAC CTCCTAAAGT GCAACGGCAT 1380 TCGTCCAACC AGCTCCTGGT GACACCCACC CCCCCTGCAC TACAGAAGCT GCTAGACTCA 1440 TTTAAGATCC AATACATGCA GTTCATGGCA TACATGAAAA CTCCTCAGTA CAAAGCAAGT 1500 CTGGAACAGT TACTGGAGCA GGAAAAGGAG AAGAATGCTC AGCTGCTGGG AACAGCCCAG 1560 CAGTTATTCA CCCACTGCCA GGCCCAGAAA GAGGAAATCA AGCGCCTGTT CCAGCAGAAG 1620 CTCGATGAGC TGGGAGTTAA AGCTCTGACA TACAACGACC TGATCCAGGC TCAGAAGGAA 
1680 ATCTCTGCTC ACAACCAGCA GCTGAAAGAA CAGACAAAAC AGCTGGAGAA GGACAACAGT 1740 GAACTCAGGA ACCAGAGCTT GCAGCTGCTG AAGGCTCGGT GTGAAGAGCT GAAGCTGGAT 1800 TGGTCAACGC TGTCACTGGA GAACTTGCTG AAAGAGAAGC AGGCCTTGAA GAATCAGATT 1860 TCTGAGAAGC AAAAGCACTG TTTGGAACTG CAGATCAGCA TCGTGGAACT CGAGAAGAGT 1920 CAGCGGCAGC AGGAGCTGCT GCAGCTCAAG TCCTACACAC CCTCAGAGGA GCCCCTGTCA 1980 GTGCAGCTGC ACAGCAAAAG CCACCTGAGC CGTGAGCCCG AGGCTGAGCA CGGCAGGTTC 2040 CAGCTGGAGC TGGAGTGCTC CAAGTTCCCC ATGCCCCACA TCAATGGCAT GAGCCCCGAG 2100 CTGTCCATGA ACGGCCACGC CACCCCCTAC GAGCTGCAGC ACGCCTTCAG CCGGCCCTCG 2160 TCCAAGCAGA ACACCCCCCA GTACCCCAAC TCCCACCTGG ACCAAGACAT CGTGCCCTGC 2220 ACCCCCAACC ACAGCAGCAG GCAGAAGGCA GACAAGGTGA CCAGCCTGTC CCTCCCTGAT 2280 TACACCAGGT TCTCCCCTGC TAAAATCGCC CTGCGCAGAC ACTTGAATCA GGACCACGGA 2340 GTCAATGGAA GAGCAACAGC TAATGAGATA CAGAGGAGGA CTGAACATGT CAAAGAGAAT 2400 GGCCTTACGT ATCCAAGCCC TGGAATTGCA AATGGCATAA AGCTGAGTCC TCAGGAAACT 2460 CGGCCCTCCT CCCCCGCGGC CTTACCCAGT GCAGGAGAGA AGGGAGAGAA AGATGTTTTG 2520 AGAGAGAGAA CCTCCGTGAG TAATGGAGAA ACTATCACCA GCCTCCCTAT CAGTATTCCT 2580 CTGAGCACAG TGCAGCCCAA TAAACTCCCT GTTAGTATCC CTCTGGCCAG CGTAGTCCTA 2640 CCTAGCCGTG TTGAGAAGGT GAGAAGCACA CCCAGCCCAG TTCACCAGAG CAGAGACTCG 2700 TCGACACTTG AAAAGCAGAT TGGTGCTAGT TCTCATAATG GCATAAGCAA TGCTGCTGGA 2760 AACAAGCCTC TGGCTTTGGC TACCTCAGGT TTTTCTTACT CTTCTGGCTC CGTGGCAGTC 2820 AATGGAAATC TTACAAACAG CCCAGCCCAT CTTAACCATA GTGTTGATCA GGCAGCTCTG 2880 GACGACTCTG GGAGTCTCTT TAACTCTGTA GGGTCTCGGA GTTCCACTCC ACAACATCCT 2940 TTGCTAATGA TGAGGAACTC TGGGCAGAGC TCCCCTGCTC AGCAGCACTC GAGCCCCCGC 3000 TTAAAGGCTC AGAAGATCTT TGCCGAAGGA ACAAAGGGGG ACCTGCAGTC TGATGCAGCG 3060 TTTTCAGATC CTGAGAACGA GGCCAAGAGG AGAATCATCT TTACAATCTC TCCTAATACA 3120 GGACATGTAA AACAATCCCC TTCCAGCAAG CACAGCCCCG TGCCCGGCAG CGCCCGGCCG 3180 GAGGGCGGCC AGGCCCACGC GCAGGACGGG AAGAAGCGGG GCCGGCGGAA GAGATCCTCC 3240 ACTGGGAACG CCAGCGGGAA CGCCGCCGTC TCCCCGAAAC GCAAACCCCT GCCCTCGGTG 3300 GCCGGGCTCT TTACGCAGCC CTCGGGGTCG CCGCTCAATA TCAACTCCAT GGTCAATAAC 3360 
ATTAACCAGC CCTTAGAAAT AACAGCCATT TCCTCTCCTG AAAACTCCCT GAAGAACTCT 3420 CCTGTTCCTT ACCAGGACAA CGACCAGCCC CCAGTGCTGA AAAAAGAGAA GCCCCTCGTT 3480 CAGAGCAACG GTGTCCATTA CTCCCCCCTG ACATCGGATG ACGAGCAGGG CTCTGAGGAT 3540 GAGCACAATA GCAGCAGAAT TGAGAGGAAA ATTGCAACAA TATCATTGGA AAGCAAATCT 3600 CCACAGAAGA CTGTTGAAAA TGGTGGCAGC ATATCGGGGA GGAAACAAGC ACAAAGCAAC 3660 GAGAACATGA ACAGCAGCAA ATGGAAATCG ACTTTTTCAC CGATATCTGA CATCAACCTG 3720 ACCAAAACCA CAGACAGCCC CTTGCAGGCC GTGTCAGCTC TGAGCCAGAA CTCCCTGTTT 3780 GCCTTCAGGC CGCCCCCCGA GGATGGGCTG CCCATGGACA CCAAGGTCCC GGGACACCCC 3840 AGGAAGAGCC TGGCAGTGCC CTCGGATGGG CTGAGCCCTG GCACAAACCC CCCCAATGGC 3900 TTCAGCTACA ATGGGGGCCT GTCCTCCGAG CTGGGCCTGC ACGGCTTCAT GGACGGCGCT 3960 GCTCTTCCAC ACAAAGCCGG GGATGGTTCG GCTGCCAGCT GCTCCCTGGG CTTCCCCTCG 4020 CAGCGAGGCA AGGAGCTGGG GGTGGCAGAC ACGAACCTGT TCCTGAACAA GAGGCAGCTG 4080 GAAGCTCTGG GCACCAAGGG AGAAGAGCTG AGCCGGCCGG GGGCGAAGGG CAAAGAGCTC 4140 AGCGAGCTGA GCGCCCGAGC GGGGGGGTCT GTGGAGAAAA ACTCTGTGCA GCACAACGGA 4200 AAGGTCGGCA AGGGAAGGGA CCGAGACGTG GAGTTTAAAA ATGGCCACAA CCTTTTCATT 4260 TCTGCTGCTG TTTCGTCTGG TGGCCTTCTG AATGGTAAAA GCCTTTCTAC TGCTGTTTCC 4320 TCAGCAGGGA ACCCAGCGCC GTCTGTCCCG ACGCACCATC CTTTCCTCAA CACTCTGACC 4380 ACTGGATCAC AGTTCCCCCT CGGCCCCATG GCCTTGCAAG CAAACCTCAA CTCGGTGACA 4440 AATTCCTCAG TATTGCAGTC CTTATTTAAT TCAATGCCAG CTGCTGCCAG TCTGGTCCAC 4500 GTGTCATCAG CTGCAACCAG ACTGACTAAT TCTCACACTA TGGGGAACTT CTCTCCTGGG 4560 GTTACAGGTG GAACAGTTGG AGGTATTTTT AACCATGCGG TGCCTTCTGC CTCCTCTCCT 4620 CATCAATTTG GAGCCAGTTT CAGCAGCAGT GCTGTCTCTA GCAGCACCGT GCTAAGCTTA 4680 AACCCTCTGC AGGCTGTTGC CAGCACCTCA TCCTCATCCT TCCCACCCTC TTCCTCTAAT 4740 TTAGTAACAT CTAGTGAGAC TAGACCTGCT CAGCACCTCA ACAGAGCTCC AGTGCAATCT 4800 GTCTTTCATC CCCCTCCGCC CCCTCCTAAC GTGTCCTTGC CTCCCCCTCC TCCTTTACTC 4860 GCTTCTAACT CCGAGCCTGC TCTCCTGCAG AACCTGCCCT CCATCCCTCC TGGCGAGACG 4920 TTCCTGCCCT CCTCCTCTGC TCCTGTCCAG TCTAACTCTT CTTTGTCTAT TAAACTGGCT 4980 TCTCTCCAGC ACAAACCCTC CCGCCCCTCC TTTACAGTCC ATCACCCGCC CCTGCCCCGC 5040 GTGCTCCCGC 
AGCCCAACGC CGCTGGCACG GCCGCTATGT GGGTGACCCT TGGCATGCAG 5100 CCTCCTTATG CTTCGCACCT TTCGGGGGTT AAGCCACGAT AA 5143 |
Sequence Source | Ensembl
Orthology |
Created Date | 25-Jun-2016 |