Tag | Content |
WERAM ID | WERAM-Bot-0051 |
Ensembl Protein ID | ENSBTAP00000005344.4 |
Gene Name | GTF3C4 |
Ensembl Information | |
Status | Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HAT | HAT_other | 0 | 1558 | | |
|
Organism | Bos taurus |
Domain Profile |
HAT HAT_other

Query: 31  PAADAAPGPSTAFRLLVTRREPAVRLQYAVSGQETLAWSEDHRVSVSTARSIAVLELICD 90
           PAADAAPGPS AFRL+VTRREPAV+LQYAVSG E LAWSEDHRVSVSTARSIAVLELICD
Sbjct: 35  PAADAAPGPSAAFRLMVTRREPAVKLQYAVSGLEPLAWSEDHRVSVSTARSIAVLELICD 94

Query: 91  VHNPGQDLVIHRTSVPAPLHSCFLKVGSKREVAECKQRFATSEDPTVSQTFMLDRVFNPE 150
           VHNPGQDLVIHRTSVPAPL+SC LKVGSK EVAECK++FA S+DPTVSQTFMLDRVFNPE
Sbjct: 95  VHNPGQDLVIHRTSVPAPLNSCLLKVGSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPE 154

Query: 151 GKALPPLRGFKYTSWSPVGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGE 210
           GKALPP+RGFKYTSWSP+GCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGE
Sbjct: 155 GKALPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGE 214

Query: 211 RLYETSYRFSRNEAPEGSLRDFAEFQRRHSMQAPVRMEWSGICTTQQVKHNNECRDVGSV 270
           RLYETSYR S+NEAPEG+L DFAEFQRRHSMQ PVRMEWSGICTTQQVKHNNECRDVGSV
Sbjct: 215 RLYETSYRLSKNEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSV 274

Query: 271 LLAVLFENGNIAVWQFQLPFAGKESISSCNTIESGISSPSVLFWWEYEHNNRKMSGLIVG 330
           LLAVLFENGNIAVWQFQLPF GKESISSCNTIESGI+SPSVLFWWEYEHNNRKMSGLIVG
Sbjct: 275 LLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVG 334

Query: 331 SAFGPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAA 390
           SAFGP+KILPVNLKAVKGYFTLRQPV+LWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAA
Sbjct: 335 SAFGPIKILPVNLKAVKGYFTLRQPVILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAA 394

Query: 391 RGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIF 450
           RGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIF
Sbjct: 395 RGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIF 454

Query: 451 TDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVT 510
           TDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLA+ITTEGM+NGLHPVNKNYQVQFVT
Sbjct: 455 TDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVT 514

Query: 511 LKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLHEALEKKIESSGATYYW 570
           LKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFL EALEKKIESSG TY+W
Sbjct: 515 LKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKIESSGVTYFW 574

Query: 571 RFKLFLLRILYQSMQKTPSEALCKPTHEDSKILLVDSPGMGNAEDEQQEEGTSSKQISKQ 630
           RFKLFLLRILYQSMQKTPSEAL KPTHEDSKILLVDSPGMGNA+DEQQEEGTSSKQ+ KQ
Sbjct: 575 RFKLFLLRILYQSMQKTPSEALWKPTHEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQ 634

Query: 631 -----SKDGDPEDPTEDLLPQSADTGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGE 685
                SK+GD E+PT+D LP + D GGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGE
Sbjct: 635 GLQERSKEGDVEEPTDDSLPTTGDAGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGE 694

Query: 686 VYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEIL 745
           VYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEIL
Sbjct: 695 VYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEIL 754

Query: 746 PFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPIPEDPDWIKRLLQSPC 805
           PFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHP PEDPDWIKRLLQSPC
Sbjct: 755 PFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPC 814

Query: 806 PFCDSPVF 813
           PFCDSPVF
Sbjct: 815 PFCDSPVF 822
|
Protein Sequence (Fasta) |
>ENSBTAP00000005344.4|HAT_other|Bos taurus
MNTALTRVGPAAERPAPPEEVEGSETGGKEPAADAAPGPSTAFRLLVTRREPAVRLQYAVSGQETLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLHSCFLKVGSKREVAECKQRFATSEDPTVSQTFMLDRVFNPEGKALPPLRGFKYTSWSPVGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRFSRNEAPEGSLRDFAEFQRRHSMQAPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFAGKESISSCNTIESGISSPSVLFWWEYEHNNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLHEALEKKIESSGATYYWRFKLFLLRILYQSMQKTPSEALCKPTHEDSKILLVDSPGMGNAEDEQQEEGTSSKQISKQSKDGDPEDPTEDLLPQSADTGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPIPEDPDWIKRLLQSPCPFCDSPVF
|
Nucleotide Sequence (Fasta) |
>ENSBTAP00000005344.4|HAT_other|Bos taurus ATGAATACGGCCCTGACCCGGGTGGGGCCCGCGGCCGAGCGGCCCGCGCCGCCCGAGGAGGTGGAGGGCTCTGAGACCGGCGGGAAGGAGCCGGCGGCGGACGCGGCCCCGGGGCCCAGCACGGCGTTCCGCCTGCTGGTGACTCGGCGGGAGCCGGCGGTGAGGCTGCAGTACGCGGTGAGCGGCCAGGAGACGCTGGCGTGGTCCGAGGACCACCGCGTGTCGGTGTCCACGGCCCGCAGCATCGCCGTGCTGGAACTCATCTGCGACGTGCACAACCCGGGCCAGGACCTGGTTATCCACCGCACGTCGGTGCCCGCGCCTCTCCACAGCTGCTTCCTCAAAGTTGGCTCAAAAAGGGAAGTTGCCGAGTGTAAACAGAGATTTGCCACCTCTGAAGACCCCACAGTCAGTCAGACTTTCATGTTGGACAGGGTGTTCAACCCTGAGGGGAAGGCTCTGCCACCACTGAGAGGGTTCAAGTACACTAGCTGGTCCCCTGTGGGCTGTGACGCTAATGGCAGGTGCCTGCTGGCCGCACTGACCATGGACAATCGCCTGACCATCCAGGCTAACCTCAACAGACTGCAGTGGGTCCAGCTGGTGGACCTGACGGAGATTTATGGAGAGCGTCTCTATGAGACCAGCTACAGGTTCTCCAGAAACGAGGCCCCGGAGGGCAGTCTCAGGGATTTCGCCGAGTTTCAGAGGCGACACAGCATGCAGGCGCCGGTGCGCATGGAGTGGTCGGGCATCTGCACCACCCAGCAGGTCAAGCACAACAACGAGTGCCGCGACGTGGGCAGCGTGCTCCTGGCGGTCCTCTTCGAGAACGGGAACATCGCCGTGTGGCAGTTCCAGCTCCCCTTTGCAGGGAAGGAGTCCATCTCTTCATGCAACACCATCGAGTCAGGAATCAGCTCTCCTAGCGTTTTGTTCTGGTGGGAATATGAGCACAACAATCGGAAAATGAGCGGCCTTATAGTGGGGAGTGCTTTTGGGCCTGTGAAAATCCTGCCTGTCAACCTTAAAGCAGTCAAAGGCTATTTCACCTTGAGGCAGCCTGTCGTCTTGTGGAAAGAAATGGACCAGCTGCCAGTCCACAGCATCAAGTGCGTGCCGCTGTATCACCCTTACCAGAAGTGCAGCTGCAGCTTAGTGGTGGCCGCACGAGGGTCCTATGTGTTTTGGTGTCTTCTCCTGATCTCCAAAGCAGGTCTGAATGTTCACAACTCTCACGTCACAGGCCTGCACTCCCTGCCCATCGTCTCCATGACTGCCGACAAGCAGAATGGGACGGTGTATACTTGCTCCAGTGACGGCAAGGTGAGGCAGCTGATCCCCATTTTCACAGATGTGGCGTTGAAGTTTGAGCACCAGTTGATTAAACTCTCTGATGTGTTTGGCTCCGTGAGGACACACGGGATAGCCGTGAGCCCCTGCGGCGCATACCTGGCCGTCATCACCACTGAGGGCATGGTCAACGGCCTCCATCCCGTTAACAAAAACTACCAGGTTCAGTTCGTTACTCTCAAAACCTTTGAAGAGGCAGCTGCTCAGCTTCTGGAGTCTTCTGTCCAGAATCTCTTTAAGCAGGTAGATTTAATAGACCTAGTACGCTGGAAGATTTTAAAGGATAAGCATATCCCTCAATTTTTACATGAAGCTTTGGAAAAAAAGATCGAAAGCAGTGGGGCCACCTATTACTGGCGTTTCAAACTCTTCCTCTTGAGAATTTTATATCAGTCGATGCAGAAAACCCCTTCAGAAGCCTTGTGCAAACCCACCCACGAGGACTCAAAAATCTTGCTAGTTGACTCACCTGGGATGGGCAATGCTGAGGATGAACAGCAGGAGGAAGGCACCTCTTCCAAACAGATTAGTAAGCAAAGCAAAGACGGCGATCCAGAGGACCCCACAGAGGACTTACTCCCTCAATCTGCGGATACTGGAGGCC
GAGAGCCAATGGAAGAGAAACTCCTTGAAATCCAGGGAAAAATTGAAGCAGTGGAAATGCACTTGACGAGGGAGCACATGAAGCGAGTCCTGGGCGAAGTGTATCTACACACCTGGATCACAGAAAACACTAGCATCCCCACCAGGGGACTCTGTAACTTCCTGATGTCTGATGAGGAGTATGATGACAGAACCGCACGGGTGCTGATTGGACACATCTCAAAGAAGATGAACAAGCAGACCTTCCCTGAGCACTGCAGTTTGTGTAAGGAGATCTTGCCGTTTACAGATCGCAAGCAGGCCGTCTGCTCCAATGGCCACATCTGGCTCCGGTGCTTTTTAACCTACCAGTCCTGCCAGAGTTTGATATACAGAAGGTGTTTGCTCCATGACAGCATCGCTCGCCATCCAATTCCAGAAGACCCTGACTGGATCAAGAGGTTACTGCAGAGCCCCTGCCCTTTCTGCGACTCTCCTGTTTTCTGA
|
Sequence Source | Ensembl |
Orthology | |
Created Date | 25-Jun-2016 |