Tag | Content |
WERAM ID | WERAM-Soa-0054 |
Ensembl Protein ID | ENSSARP00000005355.1 |
Gene Name | GTF3C4 |
Ensembl Information | |
Status | Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HAT | HAT_other | 0 | 1181 | | |
|
Organism | Sorex araneus |
Domain Profile |
HAT HAT_other
Query: 1 VGSKTEVAECKEKFATSKDPPVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGR 60 VGSKTEVAECKEKFA SKDP VSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGR Sbjct: 120 VGSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGR 179 Query: 61 CLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSFRLSTNEVPEGSLGDFAEF 120 CLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETS+RLS NE PEG+LGDFAEF Sbjct: 180 CLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEF 239 Query: 121 QRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKES 180 QRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKES Sbjct: 240 QRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKES 299 Query: 181 ISSCNTIESGISSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQP 240 ISSCNTIESGI+SPSVLFWWEYEH+NRKMSGLIVGSAFGP+KILPVNLKAVKGYFTLRQP Sbjct: 300 ISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQP 359 Query: 241 VVLWKETDQLPVHSI-CVPLYHPYQKCSCSLLV-ARG-YVFWCLLLISKAGLNVHNSHVT 297 V+LWKE DQLPVHSI CVPLYHPYQKCSCSL+V ARG YVFWCLLLISKAGLNVHNSHVT Sbjct: 360 VILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVT 419 Query: 298 GLHSLPIVSMAADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHG 357 GLHSLPIVSM ADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHG Sbjct: 420 GLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHG 479 Query: 358 IAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFRQVDL 417 IAVSPCGAYLA+ITTEGM+NGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLF+QVDL Sbjct: 480 IAVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDL 539 Query: 418 IDLVRWKILRDKHIPQFLQEALDKKIESSGTTYFWRFKLFLLRIWYQSMQKNPSEVLWKP 477 IDLVRWKIL+DKHIPQFLQEAL+KKIESSG TYFWRFKLFLLRI YQSMQK PSE LWKP Sbjct: 540 IDLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKP 599 Query: 478 PHEDAKVLLTDX-XXXXXXXXXXXXXXXXXXXXXXXXQDRSRESEPDEPAD-------EP 529 HED+K+LL D Q+RS+E + +EP D + Sbjct: 600 THEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQERSKEGDVEEPTDDSLPTTGDA 659 Query: 530 GGREPVEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSD 589 
GGREP+EEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSD Sbjct: 660 GGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSD 719 Query: 590 EDYEDRTARXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCFLTYQS 649 E+Y+DRTAR CFLTYQS Sbjct: 720 EEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQS 779 Query: 650 CQSLIYRRCLLHDSIARHPSPEDPD-IKRLLQSPCPFCDSPVF 691 CQSLIYRRCLLHDSIARHP+PEDPD IKRLLQSPCPFCDSPVF Sbjct: 780 CQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) |
VGSKTEVAEC KEKFATSKDP PVSQTFMLDR VFNPEGKALP PMRGFKYTSW SPMGCDANGR 60
CLLAALTMDN RLTIQANLNR LQWVQLVDLT EIYGERLYET SFRLSTNEVP EGSLGDFAEF 120
QRRHSMQTPV RMEWSGICTT QQVKHNNECR DVGSVLLAVL FENGNIAVWQ FQLPFVGKES 180
ISSCNTIESG ISSPSVLFWW EYEHSNRKMS GLIVGSAFGP VKILPVNLKA VKGYFTLRQP 240
VVLWKETDQL PVHSICVPLY HPYQKCSCSL LVARGYVFWC LLLISKAGLN VHNSHVTGLH 300
SLPIVSMAAD KQNGTVYTCS SDGKVRQLIP IFTDVALKFE HQLIKLSDVF GSVRTHGIAV 360
SPCGAYLAVI TTEGMVNGLH PVNKNYQVQF VTLKTFEEAA AQLLESSVQN LFRQVDLIDL 420
VRWKILRDKH IPQFLQEALD KKIESSGTTY FWRFKLFLLR IWYQSMQKNP SEVLWKPPHE 480
DAKVLLTDSP RPGPSEEEPH EEGAPPGRAR PGPQDRSRES EPDEPADEPG GREPVEEKLL 540
EIQGKIEAVE MHLTREHMKR VLGEVYLHTW ITENTSIPTR GLCNFLMSDE DYEDRTARXX 600
XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXCFLTYQSC QSLIYRRCLL 660
HDSIARHPSP EDPDIKRLLQ SPCPFCDSPV F 691
Protein Fasta Sequence
>ENSSARP00000005355.1|HAT_other|Sorex araneus VGSKTEVAECKEKFATSKDPPVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSFRLSTNEVPEGSLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGISSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWKETDQLPVHSICVPLYHPYQKCSCSLLVARGYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMAADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFRQVDLIDLVRWKILRDKHIPQFLQEALDKKIESSGTTYFWRFKLFLLRIWYQSMQKNPSEVLWKPPHEDAKVLLTDSPRPGPSEEEPHEEGAPPGRARPGPQDRSRESEPDEPADEPGGREPVEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEDYEDRTARXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXCFLTYQSCQSLIYRRCLLHDSIARHPSPEDPDIKRLLQSPCPFCDSPVF
|
Nucleotide Sequence (Fasta) |
GTCGGCTCGA AAACAGAAGT TGCCGAGTGT AAGGAGAAAT TCGCCACCTC GAAAGACCCC 60
CCGGTCAGCC AGACGTTCAT GCTGGACAGG GTGTTCAACC CCGAGGGCAA GGCGCTGCCG 120
CCCATGCGGG GGTTCAAGTA CACCAGCTGG TCCCCCATGG GCTGTGACGC CAACGGCAGG 180
TGCCTGCTGG CCGCGCTGAC CATGGACAAC CGCCTGACCA TTCAGGCCAA CCTCAACCGG 240
CTGCAGTGGG TGCAGCTGGT GGACCTGACG GAGATCTACG GGGAGCGTCT GTACGAGACC 300
AGCTTCCGGC TCTCCACCAA CGAGGTGCCC GAGGGCAGCC TGGGCGACTT CGCCGAGTTC 360
CAGCGGCGGC ACAGCATGCA GACGCCCGTG CGCATGGAGT GGTCGGGCAT CTGCACCACG 420
CAGCAGGTGA AGCACAACAA CGAGTGCCGG GACGTGGGCA GCGTGCTGCT GGCCGTGCTC 480
TTCGAGAACG GCAACATCGC CGTGTGGCAG TTCCAGCTGC CCTTCGTGGG CAAGGAGTCC 540
ATCTCCTCCT GCAACACCAT CGAGTCGGGC ATCAGCTCGC CCAGCGTGCT GTTCTGGTGG 600
GAGTACGAGC ACAGCAACCG CAAGATGAGC GGCCTGATCG TGGGCAGCGC CTTCGGGCCC 660
GTCAAGATCC TGCCCGTGAA CCTCAAGGCC GTGAAGGGCT ACTTCACGCT GCGGCAGCCC 720
GTGGTGCTGT GGAAAGAGAC GGACCAGCTG CCGGTGCACA GTATCTGCGT GCCGCTCTAC 780
CACCCCTACC AGAAGTGCAG CTGCAGCCTG TTGGTGGCCC GCGGCTACGT CTTCTGGTGC 840
CTGCTGCTCA TCTCCAAGGC GGGCCTCAAC GTGCACAACT CCCACGTCAC GGGCCTGCAC 900
TCGCTGCCCA TCGTCTCCAT GGCCGCCGAC AAGCAGAACG GCACGGTCTA CACGTGCTCG 960
AGCGACGGCA AGGTCCGGCA GCTCATCCCC ATCTTCACGG ACGTGGCGCT CAAGTTCGAG 1020
CACCAGCTCA TCAAGCTGTC GGACGTGTTC GGCTCGGTGC GGACGCACGG CATCGCCGTG 1080
AGCCCCTGCG GGGCCTACCT GGCCGTCATC ACGACCGAGG GGATGGTCAA CGGCCTCCAC 1140
CCCGTCAACA AGAACTACCA GGTCCAGTTC GTGACCCTGA AGACCTTCGA GGAGGCCGCG 1200
GCCCAGCTCC TGGAGTCCTC GGTGCAGAAC CTCTTCCGGC AGGTGGACCT CATCGACCTG 1260
GTGCGCTGGA AGATCCTCCG GGACAAGCAC ATCCCGCAGT TCCTGCAGGA GGCGCTGGAC 1320
AAGAAGATCG AGAGCAGCGG CACCACCTAC TTCTGGCGCT TCAAGCTCTT CCTGCTGCGC 1380
ATCTGGTACC AGTCCATGCA GAAGAACCCG TCCGAGGTCC TGTGGAAGCC CCCGCACGAG 1440
GACGCCAAGG TCCTGCTCAC GGACTCGCCC CGGCCCGGGC CCTCGGAGGA GGAGCCGCAC 1500
GAGGAAGGCG CTCCTCCCGG GCGGGCGAGG CCGGGCCCGC AGGACAGGAG CCGGGAGAGC 1560
GAGCCCGACG AGCCCGCCGA CGAGCCCGGT GGCCGGGAGC CCGTGGAGGA GAAGCTGCTG 1620
GAGATCCAGG GCAAGATCGA GGCCGTGGAG ATGCACCTGA CGCGGGAGCA CATGAAGCGC 1680
GTGCTGGGCG AGGTGTACCT GCACACGTGG ATCACGGAGA ACACCAGCAT CCCCACGCGG 1740
GGGCTCTGCA ACTTCCTCAT GTCCGACGAG GACTACGAGG ACAGGACAGC ACGGNNNNNN 1800
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1860
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1920
NNNNNGTGCT TCTTGACCTA CCAGTCCTGC CAGAGCCTGA TCTACAGGAG GTGTCTGCTG 1980
CACGACAGCA TCGCCCGCCA CCCGTCTCCA GAAGACCCCG ACATCAAGAG GCTGCTGCAG 2040
AGCCCCTGCC CCTTCTGCGA CTCACCCGTC TTCTGA 2077
Nucleotide Fasta Sequence
>ENSSARP00000005355.1|HAT_other|Sorex araneus GTCGGCTCGAAAACAGAAGTTGCCGAGTGTAAGGAGAAATTCGCCACCTCGAAAGACCCCCCGGTCAGCCAGACGTTCATGCTGGACAGGGTGTTCAACCCCGAGGGCAAGGCGCTGCCGCCCATGCGGGGGTTCAAGTACACCAGCTGGTCCCCCATGGGCTGTGACGCCAACGGCAGGTGCCTGCTGGCCGCGCTGACCATGGACAACCGCCTGACCATTCAGGCCAACCTCAACCGGCTGCAGTGGGTGCAGCTGGTGGACCTGACGGAGATCTACGGGGAGCGTCTGTACGAGACCAGCTTCCGGCTCTCCACCAACGAGGTGCCCGAGGGCAGCCTGGGCGACTTCGCCGAGTTCCAGCGGCGGCACAGCATGCAGACGCCCGTGCGCATGGAGTGGTCGGGCATCTGCACCACGCAGCAGGTGAAGCACAACAACGAGTGCCGGGACGTGGGCAGCGTGCTGCTGGCCGTGCTCTTCGAGAACGGCAACATCGCCGTGTGGCAGTTCCAGCTGCCCTTCGTGGGCAAGGAGTCCATCTCCTCCTGCAACACCATCGAGTCGGGCATCAGCTCGCCCAGCGTGCTGTTCTGGTGGGAGTACGAGCACAGCAACCGCAAGATGAGCGGCCTGATCGTGGGCAGCGCCTTCGGGCCCGTCAAGATCCTGCCCGTGAACCTCAAGGCCGTGAAGGGCTACTTCACGCTGCGGCAGCCCGTGGTGCTGTGGAAAGAGACGGACCAGCTGCCGGTGCACAGTATCTGCGTGCCGCTCTACCACCCCTACCAGAAGTGCAGCTGCAGCCTGTTGGTGGCCCGCGGCTACGTCTTCTGGTGCCTGCTGCTCATCTCCAAGGCGGGCCTCAACGTGCACAACTCCCACGTCACGGGCCTGCACTCGCTGCCCATCGTCTCCATGGCCGCCGACAAGCAGAACGGCACGGTCTACACGTGCTCGAGCGACGGCAAGGTCCGGCAGCTCATCCCCATCTTCACGGACGTGGCGCTCAAGTTCGAGCACCAGCTCATCAAGCTGTCGGACGTGTTCGGCTCGGTGCGGACGCACGGCATCGCCGTGAGCCCCTGCGGGGCCTACCTGGCCGTCATCACGACCGAGGGGATGGTCAACGGCCTCCACCCCGTCAACAAGAACTACCAGGTCCAGTTCGTGACCCTGAAGACCTTCGAGGAGGCCGCGGCCCAGCTCCTGGAGTCCTCGGTGCAGAACCTCTTCCGGCAGGTGGACCTCATCGACCTGGTGCGCTGGAAGATCCTCCGGGACAAGCACATCCCGCAGTTCCTGCAGGAGGCGCTGGACAAGAAGATCGAGAGCAGCGGCACCACCTACTTCTGGCGCTTCAAGCTCTTCCTGCTGCGCATCTGGTACCAGTCCATGCAGAAGAACCCGTCCGAGGTCCTGTGGAAGCCCCCGCACGAGGACGCCAAGGTCCTGCTCACGGACTCGCCCCGGCCCGGGCCCTCGGAGGAGGAGCCGCACGAGGAAGGCGCTCCTCCCGGGCGGGCGAGGCCGGGCCCGCAGGACAGGAGCCGGGAGAGCGAGCCCGACGAGCCCGCCGACGAGCCCGGTGGCCGGGAGCCCGTGGAGGAGAAGCTGCTGGAGATCCAGGGCAAGATCGAGGCCGTGGAGATGCACCTGACGCGGGAGCACATGAAGCGCGTGCTGGGCGAGGTGTACCTGCACACGTGGATCACGGAGAACACCAGCATCCCCACGCGGGGGCTCTGCAACTTCCTCATGTCCGACGAGGACTACGAGGACAGGACAGCACGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNGTGCTTCTTGACCTACCAGTCCTGCCAGA
GCCTGATCTACAGGAGGTGTCTGCTGCACGACAGCATCGCCCGCCACCCGTCTCCAGAAGACCCCGACATCAAGAGGCTGCTGCAGAGCCCCTGCCCCTTCTGCGACTCACCCGTCTTCTGA
|
Sequence Source | Ensembl |
Orthology | |
Created Date | 25-Jun-2016 |
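
The FASTA headers in this record appear to follow the pattern `>ID|family|organism`. The protein is 691 residues long, and the nucleotide sequence ends in a TGA stop codon. A minimal Python sketch for splitting such a header and computing the coding length implied by a protein length might look as follows. The function names `parse_weram_header` and `expected_cds_length` are hypothetical, and the three-field layout is an assumption drawn only from the two headers shown above, not from a documented WERAM specification.

```python
def parse_weram_header(header: str) -> dict:
    """Split a header such as '>ENSSARP00000005355.1|HAT_other|Sorex araneus'.

    Assumes exactly three pipe-separated fields (ID, family, organism);
    this layout is inferred from the record above, not from a formal spec.
    """
    fields = header.strip().lstrip(">").split("|", 2)
    if len(fields) != 3:
        raise ValueError(f"expected 3 pipe-separated fields, got {len(fields)}")
    protein_id, family, organism = (f.strip() for f in fields)
    return {"protein_id": protein_id, "family": family, "organism": organism}


def expected_cds_length(protein_length: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein: 3 per residue, plus 3 for a stop codon."""
    return 3 * protein_length + (3 if include_stop else 0)


record = parse_weram_header(">ENSSARP00000005355.1|HAT_other|Sorex araneus")
print(record["protein_id"], record["family"], record["organism"])
print(expected_cds_length(691))  # 3 * 691 + 3 = 2076
```

Splitting with a maximum of two splits keeps any further pipe characters inside the organism field instead of raising an error, which seems the safer choice when the header layout is only inferred.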