Tag |
Content |
WERAM ID |
WERAM-Chs-0023 |
Ensembl Protein ID |
ENSCSAP00000015972.1 |
Gene Name |
ENSCSAP00000015972 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | HMT_other | 0 | 678 | | |
HMT | HMT_other | 8.00e-53 | 204 | | |
HMT | SET1 | 2.60e-28 | 98.4 | 219 | 330 |
|
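As a minimal sketch of how the Classification rows above could be handled programmatically, the Python snippet below stores the three hits and keeps those that pass an E-value cutoff and also carry Start/End coordinates. The `Hit` type, the `located_hits` helper, and the `1e-5` cutoff are assumptions made for illustration; they are not part of the WERAM record.

```python
from typing import List, NamedTuple, Optional


class Hit(NamedTuple):
    """One Classification row; start/end are None when the record leaves them blank."""
    hit_type: str
    family: str
    evalue: float
    score: float
    start: Optional[int]
    end: Optional[int]


# The three rows listed under Classification in this record.
hits = [
    Hit("HMT", "HMT_other", 0.0, 678.0, None, None),
    Hit("HMT", "HMT_other", 8.00e-53, 204.0, None, None),
    Hit("HMT", "SET1", 2.60e-28, 98.4, 219, 330),
]


def located_hits(rows: List[Hit], max_evalue: float = 1e-5) -> List[Hit]:
    """Keep hits at or below the (assumed) E-value cutoff that have coordinates."""
    return [h for h in rows if h.evalue <= max_evalue and h.start is not None]


domain = located_hits(hits)[0]
# Coordinates are 1-based and inclusive, so the span covers end - start + 1 residues.
domain_length = domain.end - domain.start + 1
```

With the rows above, only the SET1 hit has coordinates, and its span of 219 to 330 covers 112 residues.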
Organism |
Chlorocebus sabaeus |
Domain Profile |
HMT HMT_other
Query: 1   MSKPRXXXXXXXXXXXXXXXPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSPNKCSGM 60
           MSKPR               PGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSPNKCSGM
Sbjct: 7   MSKPRAVEAAAAAAAVAATAPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSPNKCSGM 66

Query: 61  RFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDARRGPLAP 120
           RFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDAR+GPL P
Sbjct: 67  RFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDARKGPLVP 126

Query: 121 FPNQKSEAAEPPKTPPSSCDSTNAAITKQALKKPIKGKQAPRKKAQGKTQQNRKLTDFYP 180
           FPNQKSEAAEPPKTPPSSCDSTNAAI KQALKKPIKGKQAPRKKAQGKTQQNRKLTDFYP
Sbjct: 127 FPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGKTQQNRKLTDFYP 186

Query: 181 VRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYH 240
           VRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYH
Sbjct: 187 VRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYH 246

Query: 241 GDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQ 300
           GDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQ
Sbjct: 247 GDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQ 306

Query: 301 TKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH 346
           TKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH
Sbjct: 307 TKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH 352
HMT HMT_other
Query: 136 PSSCDSTNAAITKQALKKPIKGKQAPRKKAQGK-------TQQNRKLTDFYPVRRSSRKS 188
           P++ +S  A +       P+K K   +   +G+       T  NR++TDF+PVRRS RK+
Sbjct: 474 PATANSNKAGMKTMLKPAPVKSKTKSKGPTKGQPPLPLAATNGNREMTDFFPVRRSVRKT 533

Query: 189 KAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYHGDLIEITD 248
           K  ++ E  + +++ +   + +G+++    GKGRGV+A + F R +FVVEY GDLI I +
Sbjct: 534 KTAVKEEWMRGLEQAVLEERCDGLQVRHFMGKGRGVVADRPFKRNEFVVEYVGDLISIGE 593

Query: 249 AKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQTKLHDIDG 308
           A +RE  YA D + GCYMYYF++ S+ YC+DAT +T +LGRLINHS+ GN  TK+  I
Sbjct: 594 AAEREKRYALDENAGCYMYYFKHKSQQYCIDATVDTGKLGRLINHSRAGNLMTKVVLIKQ 653

Query: 309 VPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWL 344
            PHL+L+A  DI  GEEL YDYGDRSK S+  HPWL
Sbjct: 654 RPHLVLLAKDDIEPGEELTYDYGDRSKESLLHHPWL 689
HMT SET1
SET1.txt              10 ikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedae.vvvdatkkgn.iarfinhscepNceakvvavd 95
                         kg+g+ a+k+++ +++v+EY G++i+ + a+kre+ y ++ ++ +y++ ++ ++ ++vdat++ n r+inhs Nc++k +d
ENSCSAP00000015972.1 219 GKGRGVIATKQFSRGDFVVEYHGDLIEITDAKKREALYAQDPSTgCYMYYFQYLSKtYCVDATRETNrLGRLINHSKCGNCQTKLHDID 307
                         4799***********************************999888***999875444**********889******************* PP

SET1.txt              96 gekkiviyakraIekgeeltydY 118
                         g +++++a+r+I++geel+ydY
ENSCSAP00000015972.1 308 GVPHLILIASRDIAAGEELLYDY 330
                         *********************** PP
|
Protein Sequence (Fasta) |
MSKPRAVEAA AAAAAVAATA PGPEMVERRG PGRPRTDGEN VFTGQSKIYS YMSPNKCSGM 60
RFPLQEENSV THHEVKCQGK PLAGIYRKRE EKRNAGNAVR SAMKSEEQKI KDARRGPLAP 120
FPNQKSEAAE PPKTPPSSCD STNAAITKQA LKKPIKGKQA PRKKAQGKTQ QNRKLTDFYP 180
VRRSSRKSKA ELQSEERKRI DELIESGKEE GMKIDLIDGK GRGVIATKQF SRGDFVVEYH 240
GDLIEITDAK KREALYAQDP STGCYMYYFQ YLSKTYCVDA TRETNRLGRL INHSKCGNCQ 300
TKLHDIDGVP HLILIASRDI AAGEELLYDY GDRSKASIEA HPWLKH 346
Protein Fasta Sequence
>ENSCSAP00000015972.1|SET1|Chlorocebus sabaeus
MSKPRAVEAAAAAAAVAATAPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSPNKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDARRGPLAPFPNQKSEAAEPPKTPPSSCDSTNAAITKQALKKPIKGKQAPRKKAQGKTQQNRKLTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH
|
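A minimal Python sketch for reading the protein entry above: it splits the FASTA header on `|` and strips the spacing and position counters from a row of the grouped listing. The field names `protein_id`, `family`, and `organism` are assumed labels for the three header fields, and both helper functions are hypothetical, not part of any WERAM tooling.

```python
def parse_header(header: str) -> dict:
    """Split a '>id|family|organism' header into named fields (field names are assumed)."""
    protein_id, family, organism = header.lstrip(">").strip().split("|")
    return {"protein_id": protein_id, "family": family, "organism": organism}


def strip_ruler(block: str) -> str:
    """Drop spaces and position counters, keeping only residue letters."""
    return "".join(ch for ch in block if ch.isalpha())


header = ">ENSCSAP00000015972.1|SET1|Chlorocebus sabaeus"
# First row of the grouped listing, copied from this record.
first_row = "MSKPRAVEAA AAAAAVAATA PGPEMVERRG PGRPRTDGEN VFTGQSKIYS YMSPNKCSGM 60"

fields = parse_header(header)
residues = strip_ruler(first_row)
```

Applying `strip_ruler` to every row of the listing and joining the results should give the same string as the sequence line under the FASTA header.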
Nucleotide Sequence (Fasta) | TCCCCCGGGT CCCCCTCTCC AGGCAGGAAG ATGTCCAAGC CCCGCGCGGT GGAGGCGGCG 60 GCGGCGGCGG CGGCGGTGGC AGCGACGGCC CCGGGCCCGG AGATGGTGGA GCGGAGGGGC 120 CCGGGGAGGC CCCGCACCGA CGGGGAGAAT GTGTTTACCG GGCAGTCAAA GATCTATTCC 180 TACATGAGCC CGAACAAATG CTCTGGAATG CGTTTCCCCC TTCAGGAAGA GAACTCAGTT 240 ACACATCACG AAGTCAAATG CCAGGGGAAA CCATTAGCTG GAATCTACAG GAAACGAGAA 300 GAGAAAAGAA ATGCTGGGAA TGCAGTGCGG AGCGCCATGA AGTCTGAGGA ACAGAAGATC 360 AAAGACGCCA GGAGAGGTCC CCTGGCACCT TTTCCAAACC AAAAATCTGA AGCAGCAGAA 420 CCTCCAAAAA CTCCACCCTC ATCTTGTGAT TCCACCAATG CAGCCATCAC CAAGCAAGCC 480 CTGAAAAAGC CCATCAAGGG CAAACAGGCC CCCCGAAAAA AAGCTCAAGG AAAAACGCAA 540 CAGAACCGCA AACTTACGGA TTTCTACCCT GTCCGAAGGA GCTCCAGGAA GAGCAAAGCC 600 GAGCTGCAGT CTGAAGAAAG GAAAAGAATA GATGAATTGA TTGAGAGTGG GAAGGAAGAA 660 GGGATGAAGA TCGACCTCAT CGATGGCAAA GGCAGGGGTG TGATTGCCAC CAAGCAGTTC 720 TCCCGAGGTG ACTTTGTGGT GGAATACCAC GGGGACCTCA TCGAGATCAC CGATGCCAAG 780 AAACGGGAGG CTCTGTACGC ACAGGACCCT TCCACGGGCT GTTACATGTA CTATTTTCAG 840 TATCTGAGCA AAACCTATTG CGTGGATGCA ACTAGAGAGA CAAATCGCCT AGGAAGACTG 900 ATCAATCATA GCAAATGTGG GAACTGCCAA ACCAAACTGC ACGACATTGA CGGCGTACCT 960 CACCTCATCC TCATCGCCTC CCGAGACATC GCGGCTGGGG AGGAGCTCCT GTATGACTAT 1020 GGGGATCGCA GCAAGGCTTC CATTGAAGCT CACCCGTGGC TGAAGCATTA ACCGGCTGAC 1080 CCCACACCCT CCCCGCCCCA CCTTCCCTTC TGCAAAGGAC AAAGTGCCCT CAAAGGGAAT 1140 TGAATTTTTT TTTTTTTACA CACTTACTTA ATCTTAGCGG ATTACTTCAG ATGTTTTTAA 1200 AAAGTATATT AAGATGCCTT TTCACTGTAG TATTTAAATA TCTGTTACAG GTTTCCAAGG 1260 TGGACTTGAA CAGATGGCCT TATATTACCA AAACTTTTAT ATTCTAGTTG TTTTTGTACT 1320 TTTTTTGCAT ACAAGCCGAA CGTTTGTGCT TCCCGTGCAT GCAGTCAAAG ACTCAGCACA 1380 GGTTTTAGAG GAAAGAGTCA AACATGAACT AGGAAGCCAG GTGAGTCTCC TTTCTCCAGT 1440 GGAAGAGCCG GGACCTTCCC CCTGCACCCC TGACATCCAG GGACGGGGTG TGAGGAAGAC 1500 GCTGCCTCCC AATGGCCTGG ACGGGATGTT TCCAAGCTCT TGTTCTCCTG ACGTCTCGAC 1560 AGGCGCTCAC TGAAGTGTAT GAATATTTTT TAAAAAGGTT TTGCAGTAAA CTAGTCTTCC 1620 CCTCTGCTTT CTCGAAAGCT TACTGAGCCC TGGGCCCCAA GCCTGGGCCG GGCATAGATT 
1680 TCCTCTTCCA CAAGCTGCCG CTTTTCTGGG CACCTTGAAG CATCAGGGCG CGAAATCGAA 1740 CTAGATGTGG GCAGGGAGAG TGTTGCTTAC CTGCCCTGCG GGGGCAGGGT TTCCTGAAAC 1800 TGGGTTAATT CTTTATAGAA ATGTGAACAC TGAATTTATT TTAAAAAAAT AATAAAAATT 1860 AAAAATAAAT AAAAATTAAA AAAAAAAAAC CACAGAAAAC AACTTACATG TATATAGGTC 1920 TTGAAGTGAG TGAAGTGGCT GCTTTTTTTT TCTTTTGCTT TTGCTTTTTT TTGCTTTTTG 1980 TAGAAGAGAT TGAGAATGGT ACTCTAATCA AAAATAAAGT TTTGTAGTGG GACCAGAAAT 2040 TACTTACCTA ACATCCACCC CCATTTCCCC TCATCCTGCT GGGGTTGAAG GTTCCAGACC 2100 TGCTGTCGAG GCCTTGTGTT TGTCAGACAC CCGGCATCCT CCTGCAAGGA CGCAACTGTG 2160 AGCTGAGGTG TGAGCCTAGG AGCCCAGGAC CCCGACCCCA GCCACTGCTG CCAGCCTCAG 2220 AAAGGCACCC AGGTGTGCAG GGGAGCACGC AGGGCCCGGC GGCCCCCAGG AATCACAGAT 2280 AGGGCTAAGG TTTTCACCTT AACTGTGAAG GTAGGAGGAG TAGGTGGCTG CCTCCTCCCG 2340 TCCTTCACAG AACTGATTCT CACTCTCTGT CCCTTCAGTC CAGGGGGCCA GGGCTCAGGA 2400 GCCATGACCT GGTGTCTCCT GCCCACCTTG GTCCCAGGTA AATGTGAATG GAGACAGGTA 2460 TGAGAGCCCG TCCTAGTCTT TGATTCCCCC CAACCCCACC TCGGGCCTCA CGACGGTGCT 2520 ACCTAAGAAA GTCTTCCCTC CCACCCCCCG CTAGCCTGGT CAGTGGTCAG CAAATTGGAA 2580 GAGGATCCGA CAGGAGTGTA AATGTGAGAC ACAATGTCGA TTATACCTAT TTGTGGTTTA 2640 GCTTTGTATT TAAACAAGGA AATAAACTTG AAAATTATTT GTCATCATAA AAATGAAACA 2700 AAAATTAAAA TATTTATTGC CAGGCAAGGC
2731
Nucleotide Fasta Sequence
>ENSCSAP00000015972.1|SET1|Chlorocebus sabaeus TCCCCCGGGTCCCCCTCTCCAGGCAGGAAGATGTCCAAGCCCCGCGCGGTGGAGGCGGCGGCGGCGGCGGCGGCGGTGGCAGCGACGGCCCCGGGCCCGGAGATGGTGGAGCGGAGGGGCCCGGGGAGGCCCCGCACCGACGGGGAGAATGTGTTTACCGGGCAGTCAAAGATCTATTCCTACATGAGCCCGAACAAATGCTCTGGAATGCGTTTCCCCCTTCAGGAAGAGAACTCAGTTACACATCACGAAGTCAAATGCCAGGGGAAACCATTAGCTGGAATCTACAGGAAACGAGAAGAGAAAAGAAATGCTGGGAATGCAGTGCGGAGCGCCATGAAGTCTGAGGAACAGAAGATCAAAGACGCCAGGAGAGGTCCCCTGGCACCTTTTCCAAACCAAAAATCTGAAGCAGCAGAACCTCCAAAAACTCCACCCTCATCTTGTGATTCCACCAATGCAGCCATCACCAAGCAAGCCCTGAAAAAGCCCATCAAGGGCAAACAGGCCCCCCGAAAAAAAGCTCAAGGAAAAACGCAACAGAACCGCAAACTTACGGATTTCTACCCTGTCCGAAGGAGCTCCAGGAAGAGCAAAGCCGAGCTGCAGTCTGAAGAAAGGAAAAGAATAGATGAATTGATTGAGAGTGGGAAGGAAGAAGGGATGAAGATCGACCTCATCGATGGCAAAGGCAGGGGTGTGATTGCCACCAAGCAGTTCTCCCGAGGTGACTTTGTGGTGGAATACCACGGGGACCTCATCGAGATCACCGATGCCAAGAAACGGGAGGCTCTGTACGCACAGGACCCTTCCACGGGCTGTTACATGTACTATTTTCAGTATCTGAGCAAAACCTATTGCGTGGATGCAACTAGAGAGACAAATCGCCTAGGAAGACTGATCAATCATAGCAAATGTGGGAACTGCCAAACCAAACTGCACGACATTGACGGCGTACCTCACCTCATCCTCATCGCCTCCCGAGACATCGCGGCTGGGGAGGAGCTCCTGTATGACTATGGGGATCGCAGCAAGGCTTCCATTGAAGCTCACCCGTGGCTGAAGCATTAACCGGCTGACCCCACACCCTCCCCGCCCCACCTTCCCTTCTGCAAAGGACAAAGTGCCCTCAAAGGGAATTGAATTTTTTTTTTTTTACACACTTACTTAATCTTAGCGGATTACTTCAGATGTTTTTAAAAAGTATATTAAGATGCCTTTTCACTGTAGTATTTAAATATCTGTTACAGGTTTCCAAGGTGGACTTGAACAGATGGCCTTATATTACCAAAACTTTTATATTCTAGTTGTTTTTGTACTTTTTTTGCATACAAGCCGAACGTTTGTGCTTCCCGTGCATGCAGTCAAAGACTCAGCACAGGTTTTAGAGGAAAGAGTCAAACATGAACTAGGAAGCCAGGTGAGTCTCCTTTCTCCAGTGGAAGAGCCGGGACCTTCCCCCTGCACCCCTGACATCCAGGGACGGGGTGTGAGGAAGACGCTGCCTCCCAATGGCCTGGACGGGATGTTTCCAAGCTCTTGTTCTCCTGACGTCTCGACAGGCGCTCACTGAAGTGTATGAATATTTTTTAAAAAGGTTTTGCAGTAAACTAGTCTTCCCCTCTGCTTTCTCGAAAGCTTACTGAGCCCTGGGCCCCAAGCCTGGGCCGGGCATAGATTTCCTCTTCCACAAGCTGCCGCTTTTCTGGGCACCTTGAAGCATCAGGGCGCGAAATCGAACTAGATGTGGGCAGGGAGAGTGTTGCTTACCTGCCCTGCGGGGGCAGGGTTTCCTGAAACTGGGTTAATTCTTTATAGAAATGTGAACACTGAATTTATTTTAAAAAAATAATAAAAATTAAAAATAAATAAAAATTAAAAAAAAAAAACCACAGAAAACAACTTACATGTATATAGGTCTTGAAGTGAGTGAAGTGGCTGCTTTTTTTTTCT
TTTGCTTTTGCTTTTTTTTGCTTTTTGTAGAAGAGATTGAGAATGGTACTCTAATCAAAAATAAAGTTTTGTAGTGGGACCAGAAATTACTTACCTAACATCCACCCCCATTTCCCCTCATCCTGCTGGGGTTGAAGGTTCCAGACCTGCTGTCGAGGCCTTGTGTTTGTCAGACACCCGGCATCCTCCTGCAAGGACGCAACTGTGAGCTGAGGTGTGAGCCTAGGAGCCCAGGACCCCGACCCCAGCCACTGCTGCCAGCCTCAGAAAGGCACCCAGGTGTGCAGGGGAGCACGCAGGGCCCGGCGGCCCCCAGGAATCACAGATAGGGCTAAGGTTTTCACCTTAACTGTGAAGGTAGGAGGAGTAGGTGGCTGCCTCCTCCCGTCCTTCACAGAACTGATTCTCACTCTCTGTCCCTTCAGTCCAGGGGGCCAGGGCTCAGGAGCCATGACCTGGTGTCTCCTGCCCACCTTGGTCCCAGGTAAATGTGAATGGAGACAGGTATGAGAGCCCGTCCTAGTCTTTGATTCCCCCCAACCCCACCTCGGGCCTCACGACGGTGCTACCTAAGAAAGTCTTCCCTCCCACCCCCCGCTAGCCTGGTCAGTGGTCAGCAAATTGGAAGAGGATCCGACAGGAGTGTAAATGTGAGACACAATGTCGATTATACCTATTTGTGGTTTAGCTTTGTATTTAAACAAGGAAATAAACTTGAAAATTATTTGTCATCATAAAAATGAAACAAAAATTAAAATATTTATTGCCAGGCAAGGC
|
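The nucleotide entry above can be checked against the protein entry by translating it. The Python sketch below builds the standard genetic code table and translates from the first `ATG` until a stop codon; it is run here only on the first 60 nucleotides of this record. The function name is hypothetical, and the code assumes an uppercase sequence containing only A, C, G, and T.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_from_first_atg(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of the input."""
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 60 nucleotides of the record, with grouping spaces removed.
first_row = "TCCCCCGGGTCCCCCTCTCCAGGCAGGAAGATGTCCAAGCCCCGCGCGGTGGAGGCGGCG"
n_terminus = translate_from_first_atg(first_row)
```

For this fragment the first `ATG` sits at 1-based position 31, and the ten codons that follow translate to `MSKPRAVEAA`, which matches the start of the protein sequence in this record.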
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |