Tag | Content
WERAM ID | WERAM-Sah-0158
Ensembl Protein ID | ENSSHAP00000016639.1
Gene Name | LOC100933758
Ensembl Information |
Status | Unreviewed
Classification |
Type | Family | E-value | Score | Start | End
HMT | SET1 | 4.10e-28 | 98 | 231 | 342
HMT | HMT_other | 1.00e-141 | 499 | |
Organism | Sarcophilus harrisii
Domain Profile |
HMT SET1
SET1.txt 10 ikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedae.vvvdatkkgn.iarfinhscepNceakvvavd 95
kg+g+ a+k++ +e+v+EY G++i+++ a+kre+ y ++ ++ +y++ ++ ++ ++vdat++ + r+inhs Nc++k +d
ENSSHAP00000016639.1 231 GKGRGVIATKQFTRGEFVVEYHGDLIENTDAKKREALYAQDPSTgCYMYYFQYLSKtYCVDATRETDrLGRLINHSKWGNCQTKLHDID 319
579************************************999888***999875444*******998799******************* PP

SET1.txt 96 gekkiviyakraIekgeeltydY 118
g +++++a+r+I++geel+ydY
ENSSHAP00000016639.1 320 GVPHLILIASRDIKAGEELLYDY 342
*********************** PP
HMT HMT_other
Query: 1 MARGKKMSKTRXXXXXXXXXXXXXXXXXXXPVPDPETMDRRNPARPRTNGENVFMGQSKI 60
MARG+KMSK R P PE ++RR P RPRT+GENVF GQSKI
Sbjct: 1 MARGRKMSKPRAVEAAAAAAAVAATA------PGPEMVERRGPGRPRTDGENVFTGQSKI 54

Query: 61 YSYMSSHRSSGTCPPFQEENSVAHHDVKYQGKALTELHKKGEEKRTGDYASASTMDSEEQ 120
YSYMS ++ SG P QEENSV HH+VK QGK L +++K EEKR A S M SEEQ
Sbjct: 55 YSYMSPNKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQ 114

Query: 121 KGRELKRDFLEPLPNHRSIAAGNPKXXXXXXXXXXXXGVKHALKNPPQTKQASRRKAQGK 180
K ++ ++ L P PN +S AA PK K ALK P + KQA R+KAQGK
Sbjct: 115 KIKDARKGPLVPFPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGK 174

Query: 181 TQQNRKVTDYYPVRRSSRKSKAEIQSEEKRRLDELIESGKEEGMKIDIIDGKGRGVIATK 240
TQQNRK+TD+YPVRRSSRKSKAE+QSEE++R+DELIESGKEEGMKID+IDGKGRGVIATK
Sbjct: 175 TQQNRKLTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATK 234

Query: 241 QFTRGEFVVEYHGDLIENTDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETDRLG 300
QF+RG+FVVEYHGDLIE TDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRET+RLG
Sbjct: 235 QFSRGDFVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLG 294

Query: 301 RLINHSKWGNCQTKLHDIDGVPHLILIASRDIKAGEELLYDYGDRSKASLEAHPWLKH 358
RLINHSK GNCQTKLHDIDGVPHLILIASRDI AGEELLYDYGDRSKAS+EAHPWLKH
Sbjct: 295 RLINHSKCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH 352
|
Protein Sequence (Fasta) |
MARGKKMSKT RAEEAAAAAA AATPPGGTAA PVPDPETMDR RNPARPRTNG ENVFMGQSKI 60
YSYMSSHRSS GTCPPFQEEN SVAHHDVKYQ GKALTELHKK GEEKRTGDYA SASTMDSEEQ 120
KGRELKRDFL EPLPNHRSIA AGNPKAMALP ANAADAAGVK HALKNPPQTK QASRRKAQGK 180
TQQNRKVTDY YPVRRSSRKS KAEIQSEEKR RLDELIESGK EEGMKIDIID GKGRGVIATK 240
QFTRGEFVVE YHGDLIENTD AKKREALYAQ DPSTGCYMYY FQYLSKTYCV DATRETDRLG 300
RLINHSKWGN CQTKLHDIDG VPHLILIASR DIKAGEELLY DYGDRSKASL EAHPWLKH 358
Protein Fasta Sequence
>ENSSHAP00000016639.1|HMT_other|Sarcophilus harrisii
MARGKKMSKTRAEEAAAAAAAATPPGGTAAPVPDPETMDRRNPARPRTNGENVFMGQSKIYSYMSSHRSSGTCPPFQEENSVAHHDVKYQGKALTELHKKGEEKRTGDYASASTMDSEEQKGRELKRDFLEPLPNHRSIAAGNPKAMALPANAADAAGVKHALKNPPQTKQASRRKAQGKTQQNRKVTDYYPVRRSSRKSKAEIQSEEKRRLDELIESGKEEGMKIDIIDGKGRGVIATKQFTRGEFVVEYHGDLIENTDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETDRLGRLINHSKWGNCQTKLHDIDGVPHLILIASRDIKAGEELLYDYGDRSKASLEAHPWLKH
|
Nucleotide Sequence (Fasta) |
ATGGCCAGAG GCAAGAAGAT GTCCAAGACG CGCGCGGAGG AGGCGGCTGC GGCAGCGGCA 60
GCGGCGACTC CGCCCGGAGG AACAGCGGCC CCCGTTCCGG ATCCGGAGAC TATGGACCGA 120
AGGAACCCAG CGCGGCCCAG GACCAACGGG GAGAATGTAT TTATGGGCCA GTCAAAGATC 180
TATTCCTACA TGAGCTCCCA CAGATCCTCT GGAACTTGTC CTCCATTCCA GGAGGAAAAT 240
TCTGTTGCAC ATCATGACGT CAAATACCAG GGGAAAGCCT TAACGGAATT GCACAAGAAA 300
GGAGAGGAGA AAAGGACTGG CGACTACGCA TCAGCGAGCA CTATGGACTC AGAAGAGCAG 360
AAAGGCAGAG AATTGAAGAG GGATTTTCTG GAGCCCCTTC CCAACCACAG ATCCATAGCT 420
GCAGGAAACC CCAAAGCCAT GGCGCTGCCC GCCAACGCTG CTGATGCCGC CGGCGTGAAG 480
CACGCCCTGA AGAACCCCCC CCAAACCAAG CAGGCCTCAA GGAGAAAAGC TCAGGGAAAA 540
ACGCAACAGA ATCGCAAAGT CACTGACTAT TATCCCGTGA GACGGAGCTC CCGAAAAAGC 600
AAAGCCGAGA TACAGTCGGA AGAAAAGAGA AGACTTGACG AACTGATTGA AAGCGGAAAG 660
GAAGAAGGCA TGAAGATCGA TATCATCGAT GGCAAAGGCA GAGGAGTGAT CGCCACTAAA 720
CAGTTTACAA GAGGGGAGTT TGTGGTAGAG TATCACGGGG ACCTCATTGA GAACACTGAC 780
GCCAAAAAAC GCGAGGCTCT CTATGCTCAG GATCCTTCCA CTGGCTGCTA CATGTACTAT 840
TTTCAGTACC TGAGCAAAAC CTACTGTGTG GATGCCACTC GAGAAACAGA TCGTCTCGGA 900
AGACTGATCA ATCACAGCAA GTGGGGTAAC TGCCAAACCA AACTCCACGA CATTGACGGC 960
GTGCCTCACC TCATCCTCAT CGCCTCCAGG GACATCAAAG CTGGCGAAGA GCTGCTGTAT 1020
GACTATGGTG ATCGCAGCAA AGCTTCCCTG GAAGCCCACC CGTGGCTCAA ACACTGA 1077
Nucleotide Fasta Sequence
>ENSSHAP00000016639.1|HMT_other|Sarcophilus harrisii
ATGGCCAGAGGCAAGAAGATGTCCAAGACGCGCGCGGAGGAGGCGGCTGCGGCAGCGGCAGCGGCGACTCCGCCCGGAGGAACAGCGGCCCCCGTTCCGGATCCGGAGACTATGGACCGAAGGAACCCAGCGCGGCCCAGGACCAACGGGGAGAATGTATTTATGGGCCAGTCAAAGATCTATTCCTACATGAGCTCCCACAGATCCTCTGGAACTTGTCCTCCATTCCAGGAGGAAAATTCTGTTGCACATCATGACGTCAAATACCAGGGGAAAGCCTTAACGGAATTGCACAAGAAAGGAGAGGAGAAAAGGACTGGCGACTACGCATCAGCGAGCACTATGGACTCAGAAGAGCAGAAAGGCAGAGAATTGAAGAGGGATTTTCTGGAGCCCCTTCCCAACCACAGATCCATAGCTGCAGGAAACCCCAAAGCCATGGCGCTGCCCGCCAACGCTGCTGATGCCGCCGGCGTGAAGCACGCCCTGAAGAACCCCCCCCAAACCAAGCAGGCCTCAAGGAGAAAAGCTCAGGGAAAAACGCAACAGAATCGCAAAGTCACTGACTATTATCCCGTGAGACGGAGCTCCCGAAAAAGCAAAGCCGAGATACAGTCGGAAGAAAAGAGAAGACTTGACGAACTGATTGAAAGCGGAAAGGAAGAAGGCATGAAGATCGATATCATCGATGGCAAAGGCAGAGGAGTGATCGCCACTAAACAGTTTACAAGAGGGGAGTTTGTGGTAGAGTATCACGGGGACCTCATTGAGAACACTGACGCCAAAAAACGCGAGGCTCTCTATGCTCAGGATCCTTCCACTGGCTGCTACATGTACTATTTTCAGTACCTGAGCAAAACCTACTGTGTGGATGCCACTCGAGAAACAGATCGTCTCGGAAGACTGATCAATCACAGCAAGTGGGGTAACTGCCAAACCAAACTCCACGACATTGACGGCGTGCCTCACCTCATCCTCATCGCCTCCAGGGACATCAAAGCTGGCGAAGAGCTGCTGTATGACTATGGTGATCGCAGCAAAGCTTCCCTGGAAGCCCACCCGTGGCTCAAACACTGA
|
Sequence Source | Ensembl
Orthology |
Created Date | 25-Jun-2016
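
The protein and nucleotide entries in this record can be cross-checked against each other. The following is a minimal sketch, not part of the WERAM record: it assumes the FASTA header follows the `>ID|family|organism` layout seen above and that the standard genetic code applies. The function names `parse_header` and `translate` are hypothetical helpers introduced only for illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the standard nuclear code applies to this sequence.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def parse_header(header: str) -> dict:
    """Split a '>ID|family|organism' header (layout assumed from this record)."""
    protein_id, family, organism = header.lstrip(">").strip().split("|")
    return {"id": protein_id, "family": family, "organism": organism}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is removed first, so the space-separated 10-base blocks
    used in the record can be passed in directly.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


header = ">ENSSHAP00000016639.1|HMT_other|Sarcophilus harrisii"
first_blocks = "ATGGCCAGAG GCAAGAAGAT GTCCAAGACG"  # first 30 nt of the record

print(parse_header(header))
print(translate(first_blocks))
```

Passing the full nucleotide sequence to `translate` and comparing the result with the protein sequence is one way to confirm that the two entries describe the same coding region.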