Tag |
Content |
WERAM ID |
WERAM-Prc-0045 |
Ensembl Protein ID |
ENSPCAP00000004132.1 |
Gene Name |
GTF3C4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HAT |
HAT_other |
0 |
1572 |
|
|
|
Organism |
Procavia capensis |
Domain Profile |
HAT HAT_other
Query: 38 DAVPGPSAAFRLMVTRREPAVKLQYSVSGLEPLSWSDDHRVSVSTARSIAVLELICDVHN 97 DA PGPSAAFRLMVTRREPAVKLQY+VSGLEPL+WS+DHRVSVSTARSIAVLELICDVHN Sbjct: 38 DAAPGPSAAFRLMVTRREPAVKLQYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHN 97 Query: 98 PGQDLVIHRTSVPAPLNSCFLKVGSKKEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKA 157 PGQDLVIHRTSVPAPLNSC LKVGSK EVAECKEKFAASKDPTVSQTFMLDRVFNPEGKA Sbjct: 98 PGQDLVIHRTSVPAPLNSCLLKVGSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKA 157 Query: 158 LPPMRGFKYASWSPMGCDANGRCLLAALTMDNRLTVQANLSRLQWVQLVDLTEIYGERLY 217 LPPMRGFKY SWSPMGCDANGRCLLAALTMDNRLT+QANL+RLQWVQLVDLTEIYGERLY Sbjct: 158 LPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLY 217 Query: 218 ETNYRRSKTEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLA 277 ET+YR SK EAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLA Sbjct: 218 ETSYRLSKNEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLA 277 Query: 278 VLFENGNIAVWQFQLPFVGKESISSCNTIESGINSPSVLFWWEYEHNNRKMNGLIVGSAF 337 VLFENGNIAVWQFQLPFVGKESISSCNTIESGI SPSVLFWWEYEHNNRKM+GLIVGSAF Sbjct: 278 VLFENGNIAVWQFQLPFVGKESISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAF 337 Query: 338 GPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGS 397 GP+KILPVNLKAVKGYFTLRQPV+LWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGS Sbjct: 338 GPIKILPVNLKAVKGYFTLRQPVILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGS 397 Query: 398 YVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDV 457 YVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDV Sbjct: 398 YVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDV 457 Query: 458 ALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKT 517 ALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLA+ITTEGM+NGLHPVNKNYQVQFVTLKT Sbjct: 458 ALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKT 517 Query: 518 FEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLHEALEKKIESSGATYFWRFK 577 FEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFL EALEKKIESSG TYFWRFK Sbjct: 518 FEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFK 577 Query: 578 LFLLRILYQSMQKIPSEALWKPSNEDSKILLVDSPGVGNAEDE-QEEGT-SKQMNKQSLQ 635 LFLLRILYQSMQK PSEALWKP++EDSKILLVDSPG+GNA+DE QEEGT SKQ+ KQ LQ Sbjct: 578 LFLLRILYQSMQKTPSEALWKPTHEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQ 637 Query: 636 ERSKEGDTEEPTDDSVTQSGDVGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYL 695 ERSKEGD EEPTDDS+ +GD GGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYL Sbjct: 638 ERSKEGDVEEPTDDSLPTTGDAGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYL 697 Query: 696 HTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFT 755 HTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFT Sbjct: 698 HTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFT 757 Query: 756 DRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFC 815 DRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFC Sbjct: 758 DRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFC 817 Query: 816 DSPVF 820 DSPVF Sbjct: 818 DSPVF 822
|
Protein Sequence (Fasta) | MSTSNKARVG PETDGPAPPG EEEGEGGGEA GAKEPTADAV PGPSAAFRLM VTRREPAVKL 60 QYSVSGLEPL SWSDDHRVSV STARSIAVLE LICDVHNPGQ DLVIHRTSVP APLNSCFLKV 120 GSKKEVAECK EKFAASKDPT VSQTFMLDRV FNPEGKALPP MRGFKYASWS PMGCDANGRC 180 LLAALTMDNR LTVQANLSRL QWVQLVDLTE IYGERLYETN YRRSKTEAPE GNLGDFAEFQ 240 RRHSMQTPVR MEWSGICTTQ QVKHNNECRD VGSVLLAVLF ENGNIAVWQF QLPFVGKESI 300 SSCNTIESGI NSPSVLFWWE YEHNNRKMNG LIVGSAFGPV KILPVNLKAV KGYFTLRQPV 360 VLWKEMDQLP VHSIKCVPLY HPYQKCSCSL VVAARGSYVF WCLLLISKAG LNVHNSHVTG 420 LHSLPIVSMT ADKQNGTVYT CSSDGKVRQL IPIFTDVALK FEHQLIKLSD VFGSVRTHGI 480 AVSPCGAYLA VITTEGMVNG LHPVNKNYQV QFVTLKTFEE AAAQLLESSV QNLFKQVDLI 540 DLVRWKILKD KHIPQFLHEA LEKKIESSGA TYFWRFKLFL LRILYQSMQK IPSEALWKPS 600 NEDSKILLVD SPGVGNAEDE QEEGTSKQMN KQSLQERSKE GDTEEPTDDS VTQSGDVGGR 660 EPMEEKLLEI QGKIEAVEMH LTREHMKRVL GEVYLHTWIT ENTSIPTRGL CNFLMSDEEY 720 DDRTARVLIG HISKKMNKQT FPEHCSLCKE ILPFTDRKQA VCSNGHIWLR CFLTYQSCQS 780 LIYRRCLLHD SIARHPAPED PDWIKRLLQS PCPFCDSPVF Protein Fasta Sequence
>ENSPCAP00000004132.1|HAT_other|Procavia capensis MSTSNKARVGPETDGPAPPGEEEGEGGGEAGAKEPTADAVPGPSAAFRLMVTRREPAVKLQYSVSGLEPLSWSDDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCFLKVGSKKEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYASWSPMGCDANGRCLLAALTMDNRLTVQANLSRLQWVQLVDLTEIYGERLYETNYRRSKTEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGINSPSVLFWWEYEHNNRKMNGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLHEALEKKIESSGATYFWRFKLFLLRILYQSMQKIPSEALWKPSNEDSKILLVDSPGVGNAEDEQEEGTSKQMNKQSLQERSKEGDTEEPTDDSVTQSGDVGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF
|
Nucleotide Sequence (Fasta) | ATGTCCACCT CTAACAAGGC CCGGGTCGGG CCCGAGACCG ACGGGCCTGC GCCACCCGGG 60 GAGGAGGAGG GCGAAGGGGG TGGCGAGGCG GGAGCGAAGG AACCAACGGC AGACGCGGTT 120 CCTGGGCCCA GCGCTGCTTT CCGCCTCATG GTGACTAGGC GGGAGCCGGC CGTGAAGCTG 180 CAGTACTCAG TGAGTGGCCT GGAGCCTCTG TCGTGGTCCG ACGACCACCG CGTGTCGGTA 240 TCCACGGCCC GAAGCATCGC TGTGCTGGAG CTCATCTGCG ATGTGCACAA CCCGGGCCAG 300 GACCTGGTCA TCCACCGCAC CTCGGTGCCC GCCCCTCTCA ACAGCTGCTT CCTCAAAGTT 360 GGCTCAAAAA AAGAAGTTGC TGAGTGTAAG GAGAAATTTG CAGCCTCTAA AGATCCCACA 420 GTCAGTCAAA CTTTCATGTT GGATAGGGTG TTCAATCCTG AAGGCAAAGC TTTGCCACCA 480 ATGAGAGGAT TCAAATATGC TAGCTGGTCC CCCATGGGTT GTGATGCTAA TGGCAGGTGC 540 TTGTTGGCAG CGCTGACTAT GGATAACCGT CTGACCGTCC AGGCGAACCT CAGCCGGCTA 600 CAGTGGGTGC AGCTGGTGGA TCTGACTGAG ATTTACGGAG AACGACTGTA TGAGACCAAT 660 TACAGGCGCT CCAAAACTGA AGCCCCAGAA GGAAATCTTG GGGACTTTGC TGAGTTTCAG 720 AGGAGACATA GCATGCAGAC ACCAGTCAGA ATGGAATGGT CAGGCATCTG TACCACACAG 780 CAGGTCAAGC ATAACAATGA ATGCCGTGAT GTTGGTAGTG TGCTTCTGGC AGTTCTCTTT 840 GAAAACGGCA ACATTGCTGT ATGGCAGTTC CAGCTACCGT TTGTGGGAAA GGAGTCTATC 900 TCTTCATGCA ACACAATTGA GTCAGGAATC AACTCTCCTA GCGTTTTGTT TTGGTGGGAA 960 TATGAGCACA ATAATCGGAA AATGAATGGC CTTATTGTGG GGAGTGCTTT TGGACCTGTA 1020 AAAATCCTTC CTGTCAATCT CAAAGCTGTT AAAGGGTACT TCACTTTAAG GCAGCCTGTT 1080 GTCTTATGGA AAGAAATGGA CCAGTTGCCT GTGCACAGTA TTAAGTGTGT ACCGCTTTAC 1140 CATCCTTACC AGAAGTGCAG TTGTAGCTTA GTGGTGGCTG CAAGAGGATC CTATGTATTT 1200 TGGTGTCTTC TTCTGATCTC CAAAGCAGGT CTGAATGTTC ACAATTCCCA TGTCACAGGG 1260 CTTCACTCCC TACCAATTGT CTCCATGACT GCAGACAAAC AGAATGGAAC AGTATATACT 1320 TGTTCCAGTG ATGGGAAGGT GAGGCAGTTG ATTCCCATTT TCACAGATGT TGCATTAAAG 1380 TTTGAGCACC AGTTGATTAA GCTGTCGGAT GTGTTTGGCT CAGTGAGGAC CCATGGGATA 1440 GCAGTGAGCC CCTGTGGTGC ATATCTTGCT GTCATTACAA CCGAAGGCAT GGTCAATGGC 1500 CTCCATCCTG TCAACAAAAA CTACCAGGTC CAGTTTGTTA CTCTGAAAAC CTTTGAAGAG 1560 GCAGCTGCTC AACTCCTGGA ATCTTCAGTT CAAAATCTCT TTAAGCAGGT AGACTTAATA 1620 GACCTAGTAC GCTGGAAAAT TTTAAAAGAT AAACATATCC CTCAATTTTT ACATGAAGCT 1680 TTGGAAAAAA AGATTGAAAG CAGTGGGGCC ACTTATTTTT GGCGTTTTAA ACTTTTCCTA 1740 CTGAGGATCT TGTATCAGTC AATGCAGAAA ATTCCTTCAG AAGCCTTATG GAAACCCTCC 1800 AATGAGGACT CAAAAATCTT ACTAGTTGAC TCACCTGGGG TGGGCAATGC TGAGGATGAA 1860 CAAGAAGAAG GCACTTCTAA ACAAATGAAT AAACAAAGCC TTCAAGAGAG GAGCAAAGAA 1920 GGTGATACCG AGGAGCCTAC TGATGACTCA GTTACCCAGT CTGGTGACGT TGGAGGCCGT 1980 GAGCCAATGG AAGAGAAGCT GCTTGAAATT CAGGGGAAGA TTGAAGCTGT GGAAATGCAC 2040 TTGACTAGGG AACACATGAA GCGAGTCTTA GGAGAAGTTT ACCTGCACAC CTGGATCACA 2100 GAGAACACGA GCATCCCTAC CAGAGGGCTC TGTAACTTCT TAATGTCTGA TGAAGAATAC 2160 GATGACAGAA CGGCACGGGT GCTGATTGGA CATATCTCGA AGAAGATGAA CAAACAGACT 2220 TTCCCTGAGC ATTGTAGTTT GTGTAAAGAG ATCTTGCCAT TCACAGATCG AAAACAGGCA 2280 GTCTGTTCCA ATGGCCACAT TTGGCTCCGG TGCTTTTTAA CCTACCAGTC CTGCCAGAGT 2340 TTGATATACA GAAGGTGTTT GCTTCATGAC AGCATTGCCC GACATCCAGC TCCAGAAGAT 2400 CCTGACTGGA TTAAGAGGCT ACTCCAGAGC CCCTGCCCTT TCTGTGACTC TCCTGTCTTC 2460 TAA
2464Nucleotide Fasta Sequence
>ENSPCAP00000004132.1|HAT_other|Procavia capensis ATGTCCACCTCTAACAAGGCCCGGGTCGGGCCCGAGACCGACGGGCCTGCGCCACCCGGGGAGGAGGAGGGCGAAGGGGGTGGCGAGGCGGGAGCGAAGGAACCAACGGCAGACGCGGTTCCTGGGCCCAGCGCTGCTTTCCGCCTCATGGTGACTAGGCGGGAGCCGGCCGTGAAGCTGCAGTACTCAGTGAGTGGCCTGGAGCCTCTGTCGTGGTCCGACGACCACCGCGTGTCGGTATCCACGGCCCGAAGCATCGCTGTGCTGGAGCTCATCTGCGATGTGCACAACCCGGGCCAGGACCTGGTCATCCACCGCACCTCGGTGCCCGCCCCTCTCAACAGCTGCTTCCTCAAAGTTGGCTCAAAAAAAGAAGTTGCTGAGTGTAAGGAGAAATTTGCAGCCTCTAAAGATCCCACAGTCAGTCAAACTTTCATGTTGGATAGGGTGTTCAATCCTGAAGGCAAAGCTTTGCCACCAATGAGAGGATTCAAATATGCTAGCTGGTCCCCCATGGGTTGTGATGCTAATGGCAGGTGCTTGTTGGCAGCGCTGACTATGGATAACCGTCTGACCGTCCAGGCGAACCTCAGCCGGCTACAGTGGGTGCAGCTGGTGGATCTGACTGAGATTTACGGAGAACGACTGTATGAGACCAATTACAGGCGCTCCAAAACTGAAGCCCCAGAAGGAAATCTTGGGGACTTTGCTGAGTTTCAGAGGAGACATAGCATGCAGACACCAGTCAGAATGGAATGGTCAGGCATCTGTACCACACAGCAGGTCAAGCATAACAATGAATGCCGTGATGTTGGTAGTGTGCTTCTGGCAGTTCTCTTTGAAAACGGCAACATTGCTGTATGGCAGTTCCAGCTACCGTTTGTGGGAAAGGAGTCTATCTCTTCATGCAACACAATTGAGTCAGGAATCAACTCTCCTAGCGTTTTGTTTTGGTGGGAATATGAGCACAATAATCGGAAAATGAATGGCCTTATTGTGGGGAGTGCTTTTGGACCTGTAAAAATCCTTCCTGTCAATCTCAAAGCTGTTAAAGGGTACTTCACTTTAAGGCAGCCTGTTGTCTTATGGAAAGAAATGGACCAGTTGCCTGTGCACAGTATTAAGTGTGTACCGCTTTACCATCCTTACCAGAAGTGCAGTTGTAGCTTAGTGGTGGCTGCAAGAGGATCCTATGTATTTTGGTGTCTTCTTCTGATCTCCAAAGCAGGTCTGAATGTTCACAATTCCCATGTCACAGGGCTTCACTCCCTACCAATTGTCTCCATGACTGCAGACAAACAGAATGGAACAGTATATACTTGTTCCAGTGATGGGAAGGTGAGGCAGTTGATTCCCATTTTCACAGATGTTGCATTAAAGTTTGAGCACCAGTTGATTAAGCTGTCGGATGTGTTTGGCTCAGTGAGGACCCATGGGATAGCAGTGAGCCCCTGTGGTGCATATCTTGCTGTCATTACAACCGAAGGCATGGTCAATGGCCTCCATCCTGTCAACAAAAACTACCAGGTCCAGTTTGTTACTCTGAAAACCTTTGAAGAGGCAGCTGCTCAACTCCTGGAATCTTCAGTTCAAAATCTCTTTAAGCAGGTAGACTTAATAGACCTAGTACGCTGGAAAATTTTAAAAGATAAACATATCCCTCAATTTTTACATGAAGCTTTGGAAAAAAAGATTGAAAGCAGTGGGGCCACTTATTTTTGGCGTTTTAAACTTTTCCTACTGAGGATCTTGTATCAGTCAATGCAGAAAATTCCTTCAGAAGCCTTATGGAAACCCTCCAATGAGGACTCAAAAATCTTACTAGTTGACTCACCTGGGGTGGGCAATGCTGAGGATGAACAAGAAGAAGGCACTTCTAAACAAATGAATAAACAAAGCCTTCAAGAGAGGAGCAAAGAAGGTGATACCGAGGAGCCTACTGATGACTCAGTTACCCAGTCTGGTGACGTTGGAGGCCGTGAGCCAATGGAAGAGAAGCTGCTTGAAATTCAGGGGAAGATTGAAGCTGTGGAAATGCACTTGACTAGGGAACACATGAAGCGAGTCTTAGGAGAAGTTTACCTGCACACCTGGATCACAGAGAACACGAGCATCCCTACCAGAGGGCTCTGTAACTTCTTAATGTCTGATGAAGAATACGATGACAGAACGGCACGGGTGCTGATTGGACATATCTCGAAGAAGATGAACAAACAGACTTTCCCTGAGCATTGTAGTTTGTGTAAAGAGATCTTGCCATTCACAGATCGAAAACAGGCAGTCTGTTCCAATGGCCACATTTGGCTCCGGTGCTTTTTAACCTACCAGTCCTGCCAGAGTTTGATATACAGAAGGTGTTTGCTTCATGACAGCATTGCCCGACATCCAGCTCCAGAAGATCCTGACTGGATTAAGAGGCTACTCCAGAGCCCCTGCCCTTTCTGTGACTCTCCTGTCTTCTAA
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |