Tag | Content |
WERAM ID | WERAM-Pat-0047 |
Ensembl Protein ID | ENSPTRP00000009519.4 |
Gene Name | KMT5A |
Ensembl Information | |
Status | Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | HMT_other | 0 | 693 | | |
HMT | HMT_other | 1.00e-52 | 204 | | |
HMT | SET1 | 3.00e-28 | 98.3 | 225 | 336 |
|
Organism | Pan troglodytes |
Domain Profile |
HMT HMT_other
Query: 1   MARGRKMSKPRXXXXXXXXXXXXXXXPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSP 60
           MARGRKMSKPR               PGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSP
Sbjct: 1   MARGRKMSKPRAVEAAAAAAAVAATAPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSP 60

Query: 61  NKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDAR 120
           NKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDAR
Sbjct: 61  NKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDAR 120

Query: 121 RGPLVPFPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGKTQQNRK 180
           +GPLVPFPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGKTQQNRK
Sbjct: 121 KGPLVPFPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGKTQQNRK 180

Query: 181 LTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGD 240
           LTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGD
Sbjct: 181 LTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGD 240

Query: 241 FVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHS 300
           FVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHS
Sbjct: 241 FVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHS 300

Query: 301 KCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH 352
           KCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH
Sbjct: 301 KCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH 352
HMT HMT_other
Query: 142 PSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGK-------TQQNRKLTDFYPVRRSSRKS 194 P++ +S A + P+K K + +G+ T NR++TDF+PVRRS RK+ Sbjct: 474 PATANSNKAGMKTMLKPAPVKSKTKSKGPTKGQPPLPLAATNGNREMTDFFPVRRSVRKT 533 Query: 195 KAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGDFVVEYHGDLIEITD 254 K ++ E + +++ + + +G+++ GKGRGV+A + F R +FVVEY GDLI I + Sbjct: 534 KTAVKEEWMRGLEQAVLEERCDGLQVRHFMGKGRGVVADRPFKRNEFVVEYVGDLISIGE 593 Query: 255 AKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHSKCGNCQTKLHDIDG 314 A +RE YA D + GCYMYYF++ S+ YC+DAT +T +LGRLINHS+ GN TK+ I Sbjct: 594 AAEREKRYALDENAGCYMYYFKHKSQQYCIDATVDTGKLGRLINHSRAGNLMTKVVLIKQ 653 Query: 315 VPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWL 350 PHL+L+A DI GEEL YDYGDRSK S+ HPWL Sbjct: 654 RPHLVLLAKDDIEPGEELTYDYGDRSKESLLHHPWL 689
HMT SET1
SET1.txt 10 ikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedae.vvvdatkkgn.iarfinhscepNceakvvavd 95 kg+g+ a+k+++ +++v+EY G++i+ + a+kre+ y ++ ++ +y++ ++ ++ ++vdat++ n r+inhs Nc++k +d ENSPTRP00000009519.4 225 GKGRGVIATKQFSRGDFVVEYHGDLIEITDAKKREALYAQDPSTgCYMYYFQYLSKtYCVDATRETNrLGRLINHSKCGNCQTKLHDID 313 4799***********************************999888***999875444**********889******************* PP SET1.txt 96 gekkiviyakraIekgeeltydY 118 g +++++a+r+I++geel+ydY ENSPTRP00000009519.4 314 GVPHLILIASRDIAAGEELLYDY 336 *********************** PP
|
Protein Sequence (Fasta) |
MARGRKMSKP RAVEAAAAAA AVAATAPGPE MVERRGPGRP RTDGENVFTG QSKIYSYMSP 60
NKCSGMRFPL QEENSVTHHE VKCQGKPLAG IYRKREEKRN AGNAVRSAMK SEEQKIKDAR 120
RGPLVPFPNQ KSEAAEPPKT PPSSCDSTNA AIAKQALKKP IKGKQAPRKK AQGKTQQNRK 180
LTDFYPVRRS SRKSKAELQS EERKRIDELI ESGKEEGMKI DLIDGKGRGV IATKQFSRGD 240
FVVEYHGDLI EITDAKKREA LYAQDPSTGC YMYYFQYLSK TYCVDATRET NRLGRLINHS 300
KCGNCQTKLH DIDGVPHLIL IASRDIAAGE ELLYDYGDRS KASIEAHPWL KH 352
Protein Fasta Sequence
>ENSPTRP00000009519.4|SET1|Pan troglodytes
MARGRKMSKPRAVEAAAAAAAVAATAPGPEMVERRGPGRPRTDGENVFTGQSKIYSYMSP
NKCSGMRFPLQEENSVTHHEVKCQGKPLAGIYRKREEKRNAGNAVRSAMKSEEQKIKDAR
RGPLVPFPNQKSEAAEPPKTPPSSCDSTNAAIAKQALKKPIKGKQAPRKKAQGKTQQNRK
LTDFYPVRRSSRKSKAELQSEERKRIDELIESGKEEGMKIDLIDGKGRGVIATKQFSRGD
FVVEYHGDLIEITDAKKREALYAQDPSTGCYMYYFQYLSKTYCVDATRETNRLGRLINHS
KCGNCQTKLHDIDGVPHLILIASRDIAAGEELLYDYGDRSKASIEAHPWLKH
|
Nucleotide Sequence (Fasta) | CTGGGTTTCC CGGGAGATCC CAGGCGGTGA CAGAGTGGAG CCATGGCTAG AGGCAGGAAG 60 ATGTCCAAGC CCCGCGCGGT GGAGGCGGCG GCGGCGGCGG CGGCGGTGGC AGCGACGGCC 120 CCGGGCCCGG AGATGGTGGA GCGGAGGGGC CCGGGGAGGC CCCGCACCGA CGGGGAAAAC 180 GTATTTACCG GGCAGTCAAA GATCTATTCC TACATGAGCC CGAACAAATG CTCTGGAATG 240 CGTTTCCCCC TTCAGGAAGA GAACTCAGTT ACACATCACG AAGTCAAATG CCAGGGGAAA 300 CCATTAGCCG GAATCTACAG GAAACGAGAA GAGAAAAGAA ATGCTGGGAA CGCAGTACGG 360 AGCGCCATGA AGTCCGAGGA ACAGAAGATC AAAGACGCCA GGAGAGGTCC CCTGGTACCT 420 TTTCCAAACC AAAAATCTGA AGCAGCAGAA CCTCCAAAAA CTCCACCCTC ATCTTGTGAT 480 TCCACCAATG CAGCCATTGC CAAGCAAGCC CTGAAAAAGC CCATCAAGGG CAAACAGGCC 540 CCTCGAAAAA AAGCTCAAGG AAAAACGCAA CAGAATCGCA AACTTACGGA TTTCTACCCT 600 GTCCGAAGGA GCTCCAGGAA GAGCAAAGCC GAGCTGCAGT CTGAAGAAAG GAAAAGAATA 660 GATGAATTGA TTGAAAGTGG GAAGGAAGAA GGAATGAAGA TTGACCTCAT CGATGGCAAA 720 GGCAGGGGTG TGATTGCCAC CAAGCAGTTC TCCCGGGGTG ACTTTGTGGT GGAATACCAC 780 GGGGACCTCA TCGAGATCAC CGACGCCAAG AAACGGGAGG CTCTGTACGC ACAGGACCCT 840 TCCACAGGCT GCTACATGTA CTATTTTCAG TATCTGAGCA AAACCTACTG CGTGGATGCA 900 ACTAGAGAGA CAAATCGCCT AGGAAGACTG ATCAATCACA GCAAATGTGG GAACTGCCAA 960 ACCAAACTGC ACGACATCGA CGGCGTACCT CACCTCATCC TCATCGCCTC CCGCGACATC 1020 GCGGCTGGGG AGGAGCTCCT GTATGACTAT GGGGACCGCA GCAAGGCTTC CATTGAAGCT 1080 CACCCGTGGC TGAAGCATTA ACCGGTGGGC CCCGTGCCCT CCCCGCCCCA CTTTCCCTTC 1140 TTCAAAGGAC AAAGTGCCCT CAAAGGGAAT TGAATTTTTT TTTACACACT TAATCTTAGC 1200 GGATTACTTC AGATGTTTTT AAAAAGTATA TTAAGATGCC TTTTCACTGT AGTATTTAAA 1260 TATCTGTTAC AGGTTTCCAA GGTGGACTTG AACAGATGGC CTTATATTAC CAAAACTTTT 1320 ATATTCTAGT TGTTTTTGTA CTTTTTTTGC ATACAAGCCG AACGTTTGTG CTTCCCGTGC 1380 ATGCAGTCAA AGACTCAGCA CAGGTTTTAG AGGAAATAGT CAAACATGAA CTAGGAAGCC 1440 AGGTGAGTCT CCTTTCTCCA GTGGAAGAGC CGGGACCTTC CCCCTGCACC CCCGACATCC 1500 AGGGACGGGG TGTAAGGAAG ACGCTGCCTC CCAATGGCCT GGACGGGATG TTTCCAAGCT 1560 CTTGTTCTCC TAACGTCTCA ACAGGCGCTC ACTGAAGTGT ATGAATATTT TTTAAAAAGG 1620 TTTTTGCAGT AAGCTAGTCT TCCCCTCTGC TTTCTCGAAA GCTTACTGAG CCCTGGGCCC 
1680 CAAGCACGGG CTGGGCATAG ATTTCCTCTT CCACAAGCTG CCGCTTTTCT GGGCACCTTG 1740 AAGCATCAGG GCGCGAAATC AAACTAGATG TGGGCAGGGA GAGTGTTGCT TACCTGCCCT 1800 GCTGGGGCAG GGTTTCCTGA AACTGGGTTA ATTCTTTATA GAAATGTGAA CACTGAATTT 1860 ATTTTAAAAA ATAATAATAA AAATTAAAAA AAATTAAAAA TAAAAAAAAA ACCCACAGAA 1920 AACAACTTAC ATGTATATAG GTCTTGAAGT GAGTGAAGTG GCTTTTTTTT TTTTTTTTTT 1980 TTTTTTTTTT GCTTTTTTTT TGCTTTTTGT AGAAGAGATT GAGAATGGTA CTCTAATCAA 2040 AAATAAAGTT TTGTAGTGGG ACCAGAAATT ACTTACCTGA CATCCACCCC CATTCCCCCT 2100 CATCCTGCTG GGGTTGAAAG TTCCAGACCT GCTGTCGAGG CCTTGTGTTT GTCAGACACC 2160 CAGTGTCCTC CTGCAAGGAC GCAACTGTGA GCTGAGGTGT GAGCCTAGGA GCCCAGGATC 2220 CCTGACCCCG GCCGCTGCTG CCAGCCTCAG AAAGGCACCC AGGTGTGCAG GGGAGCACGC 2280 AGGGCCCGGC AGCCCCCAGG AATCAAGGAT AGGGCTAAGG TTTTCACCTT AACTGTGAAG 2340 GCAGGAGGAA TAGGTGGCTG CTTCCTCCCG CCCTTCACAG AACTGATTCT CACACACTGT 2400 CCCTTCAGTC CAGGGGGCCG GGGCTCAGGA GCAATGACCT GGTGTCTCCT GCCCACCCTG 2460 GTCCCAGGTA AATGTGAATG GAGACAGGTA TGAGAGGCTG TCCTCGTCTT TGATTCCCCC 2520 CCAACCCCAC CTTGGGCCTC ACAACGGTGC TACCTAAGAA AGTCTTCCCT CCCACCCCCC 2580 GCTAGCCTGG TCAGTGGTCA GCAAATTGGA AGAGGATCCG ATGGGAGTGT AAATGTGAGA 2640 TACAATGTCT TGATTATACC TGTTTGTGGT TTAGCTTTGT ATTTAAACAA GGAAATAAAC 2700 TTGAAAATTA TTTGTCATCA TAAAAATGAA ACAAATTAAA ATATTTATTG CCAGGC
2757
Nucleotide Fasta Sequence
>ENSPTRP00000009519.4|SET1|Pan troglodytes CTGGGTTTCCCGGGAGATCCCAGGCGGTGACAGAGTGGAGCCATGGCTAGAGGCAGGAAGATGTCCAAGCCCCGCGCGGTGGAGGCGGCGGCGGCGGCGGCGGCGGTGGCAGCGACGGCCCCGGGCCCGGAGATGGTGGAGCGGAGGGGCCCGGGGAGGCCCCGCACCGACGGGGAAAACGTATTTACCGGGCAGTCAAAGATCTATTCCTACATGAGCCCGAACAAATGCTCTGGAATGCGTTTCCCCCTTCAGGAAGAGAACTCAGTTACACATCACGAAGTCAAATGCCAGGGGAAACCATTAGCCGGAATCTACAGGAAACGAGAAGAGAAAAGAAATGCTGGGAACGCAGTACGGAGCGCCATGAAGTCCGAGGAACAGAAGATCAAAGACGCCAGGAGAGGTCCCCTGGTACCTTTTCCAAACCAAAAATCTGAAGCAGCAGAACCTCCAAAAACTCCACCCTCATCTTGTGATTCCACCAATGCAGCCATTGCCAAGCAAGCCCTGAAAAAGCCCATCAAGGGCAAACAGGCCCCTCGAAAAAAAGCTCAAGGAAAAACGCAACAGAATCGCAAACTTACGGATTTCTACCCTGTCCGAAGGAGCTCCAGGAAGAGCAAAGCCGAGCTGCAGTCTGAAGAAAGGAAAAGAATAGATGAATTGATTGAAAGTGGGAAGGAAGAAGGAATGAAGATTGACCTCATCGATGGCAAAGGCAGGGGTGTGATTGCCACCAAGCAGTTCTCCCGGGGTGACTTTGTGGTGGAATACCACGGGGACCTCATCGAGATCACCGACGCCAAGAAACGGGAGGCTCTGTACGCACAGGACCCTTCCACAGGCTGCTACATGTACTATTTTCAGTATCTGAGCAAAACCTACTGCGTGGATGCAACTAGAGAGACAAATCGCCTAGGAAGACTGATCAATCACAGCAAATGTGGGAACTGCCAAACCAAACTGCACGACATCGACGGCGTACCTCACCTCATCCTCATCGCCTCCCGCGACATCGCGGCTGGGGAGGAGCTCCTGTATGACTATGGGGACCGCAGCAAGGCTTCCATTGAAGCTCACCCGTGGCTGAAGCATTAACCGGTGGGCCCCGTGCCCTCCCCGCCCCACTTTCCCTTCTTCAAAGGACAAAGTGCCCTCAAAGGGAATTGAATTTTTTTTTACACACTTAATCTTAGCGGATTACTTCAGATGTTTTTAAAAAGTATATTAAGATGCCTTTTCACTGTAGTATTTAAATATCTGTTACAGGTTTCCAAGGTGGACTTGAACAGATGGCCTTATATTACCAAAACTTTTATATTCTAGTTGTTTTTGTACTTTTTTTGCATACAAGCCGAACGTTTGTGCTTCCCGTGCATGCAGTCAAAGACTCAGCACAGGTTTTAGAGGAAATAGTCAAACATGAACTAGGAAGCCAGGTGAGTCTCCTTTCTCCAGTGGAAGAGCCGGGACCTTCCCCCTGCACCCCCGACATCCAGGGACGGGGTGTAAGGAAGACGCTGCCTCCCAATGGCCTGGACGGGATGTTTCCAAGCTCTTGTTCTCCTAACGTCTCAACAGGCGCTCACTGAAGTGTATGAATATTTTTTAAAAAGGTTTTTGCAGTAAGCTAGTCTTCCCCTCTGCTTTCTCGAAAGCTTACTGAGCCCTGGGCCCCAAGCACGGGCTGGGCATAGATTTCCTCTTCCACAAGCTGCCGCTTTTCTGGGCACCTTGAAGCATCAGGGCGCGAAATCAAACTAGATGTGGGCAGGGAGAGTGTTGCTTACCTGCCCTGCTGGGGCAGGGTTTCCTGAAACTGGGTTAATTCTTTATAGAAATGTGAACACTGAATTTATTTTAAAAAATAATAATAAAAATTAAAAAAAATTAAAAATAAAAAAAAAACCCACAGAAAACAACTTACATGTATATAGGTCTTGAAGTGAGTGAA
GTGGCTTTTTTTTTTTTTTTTTTTTTTTTTTTTGCTTTTTTTTTGCTTTTTGTAGAAGAGATTGAGAATGGTACTCTAATCAAAAATAAAGTTTTGTAGTGGGACCAGAAATTACTTACCTGACATCCACCCCCATTCCCCCTCATCCTGCTGGGGTTGAAAGTTCCAGACCTGCTGTCGAGGCCTTGTGTTTGTCAGACACCCAGTGTCCTCCTGCAAGGACGCAACTGTGAGCTGAGGTGTGAGCCTAGGAGCCCAGGATCCCTGACCCCGGCCGCTGCTGCCAGCCTCAGAAAGGCACCCAGGTGTGCAGGGGAGCACGCAGGGCCCGGCAGCCCCCAGGAATCAAGGATAGGGCTAAGGTTTTCACCTTAACTGTGAAGGCAGGAGGAATAGGTGGCTGCTTCCTCCCGCCCTTCACAGAACTGATTCTCACACACTGTCCCTTCAGTCCAGGGGGCCGGGGCTCAGGAGCAATGACCTGGTGTCTCCTGCCCACCCTGGTCCCAGGTAAATGTGAATGGAGACAGGTATGAGAGGCTGTCCTCGTCTTTGATTCCCCCCCAACCCCACCTTGGGCCTCACAACGGTGCTACCTAAGAAAGTCTTCCCTCCCACCCCCCGCTAGCCTGGTCAGTGGTCAGCAAATTGGAAGAGGATCCGATGGGAGTGTAAATGTGAGATACAATGTCTTGATTATACCTGTTTGTGGTTTAGCTTTGTATTTAAACAAGGAAATAAACTTGAAAATTATTTGTCATCATAAAAATGAAACAAATTAAAATATTTATTGCCAGGC
|
Sequence Source | Ensembl |
Orthology | |
Created Date | 25-Jun-2016 |
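The Start and End values in the Classification table (225 and 336 for the SET1 hit) can be related back to the protein FASTA record with a few lines of Python. The following is only a minimal sketch, not part of the WERAM record. The function names `parse_fasta_record` and `slice_region` are hypothetical helpers. Reading the header fields as protein ID, family, and organism is an assumption inferred from this entry, and so is treating the coordinates as 1-based and inclusive, which matches the numbering shown in the SET1 domain alignment.

```python
def parse_fasta_record(text):
    """Split one FASTA record into (header_fields, sequence).

    Assumption: the header uses '|' as a field separator, as in
    '>ENSPTRP00000009519.4|SET1|Pan troglodytes'.
    """
    lines = [ln.strip() for ln in text.strip().splitlines() if ln.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header line starting with '>'")
    fields = lines[0][1:].split("|")
    # Drop any whitespace inside sequence lines (e.g. blocks of ten residues).
    sequence = "".join("".join(ln.split()) for ln in lines[1:])
    return fields, sequence


def slice_region(sequence, start, end):
    """Return residues start..end, assuming 1-based inclusive coordinates."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError("coordinates fall outside the sequence")
    return sequence[start - 1:end]


# Protein record from this entry, written in blocks of ten residues.
RECORD = """>ENSPTRP00000009519.4|SET1|Pan troglodytes
MARGRKMSKP RAVEAAAAAA AVAATAPGPE MVERRGPGRP RTDGENVFTG QSKIYSYMSP
NKCSGMRFPL QEENSVTHHE VKCQGKPLAG IYRKREEKRN AGNAVRSAMK SEEQKIKDAR
RGPLVPFPNQ KSEAAEPPKT PPSSCDSTNA AIAKQALKKP IKGKQAPRKK AQGKTQQNRK
LTDFYPVRRS SRKSKAELQS EERKRIDELI ESGKEEGMKI DLIDGKGRGV IATKQFSRGD
FVVEYHGDLI EITDAKKREA LYAQDPSTGC YMYYFQYLSK TYCVDATRET NRLGRLINHS
KCGNCQTKLH DIDGVPHLIL IASRDIAAGE ELLYDYGDRS KASIEAHPWL KH
"""

fields, protein = parse_fasta_record(RECORD)
set1_region = slice_region(protein, 225, 336)  # SET1 hit from the table

print(fields)
print(len(protein), len(set1_region))
print(set1_region)
```

With inclusive coordinates the slice spans 336 - 225 + 1 = 112 residues, and it should begin and end with the same residues as the target rows of the SET1 alignment in the Domain Profile.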