Tag |
Content |
WERAM ID |
WERAM-Sus-0038 |
Ensembl Protein ID |
ENSSSCP00000006140.2 |
Gene Name |
GTF3C4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HAT |
HAT_other |
0 |
1558 |
|
|
|
Organism |
Sus scrofa |
Domain Profile |
HAT HAT_other
Query: 1 MNTADKARVXXX----XXXXXXXXXXXXXXXXXXXXXDPAPGPSAAFRLLVTRREPAVRL 56 MNTAD+ARV D APGPSAAFRL+VTRREPAV+L Sbjct: 1 MNTADQARVGPADDGPAPSGEEEGEGGGEAGGKEPAADAAPGPSAAFRLMVTRREPAVKL 60 Query: 57 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 116 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV Sbjct: 61 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 120 Query: 117 GSKTEVAECKEKFATSKDPTISQTFMLDRVFNPEGKALPPLRGFKYTSWSPMGCDANGRC 176 GSKTEVAECKEKFA SKDPT+SQTFMLDRVFNPEGKALPP+RGFKYTSWSPMGCDANGRC Sbjct: 121 GSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRC 180 Query: 177 LLAALTMDNRLTVQANLNRLQWVQLADLTEIYGERLYETSYRFSKSEAPQGSLGDFAEFQ 236 LLAALTMDNRLT+QANLNRLQWVQL DLTEIYGERLYETSYR SK+EAP+G+LGDFAEFQ Sbjct: 181 LLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQ 240 Query: 237 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 296 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI Sbjct: 241 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 300 Query: 297 SSCNTIESGITSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPV 356 SSCNTIESGITSPSVLFWWEYEH+NRKMSGLIVGSAFGP+KILPVNLKAVKGYFTLRQPV Sbjct: 301 SSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQPV 360 Query: 357 VLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 416 +LWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG Sbjct: 361 ILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 Query: 417 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLADVFGSVRTHGI 476 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKL+DVFGSVRTHGI Sbjct: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI 480 Query: 477 AVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 536 AVSPCGAYLA+ITTEGM+NGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI Sbjct: 481 AVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 540 Query: 537 DLVRWKILKDKHIPQFLQEALEKKMESSGATYYWRFKLFLLRILYQSMQKTP-EALWKPT 595 DLVRWKILKDKHIPQFLQEALEKK+ESSG TY+WRFKLFLLRILYQSMQKTP EALWKPT Sbjct: 541 DLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKPT 600 Query: 596 HEDSKILLVDSPGMGNAEDEQQDEGTSSKPVNKPGLQERSKEGXXXXXXXXXXXSPTQAG 655 HEDSKILLVDSPGMGNA+DEQQ+EGTSSK V K GLQERSKEG PT G Sbjct: 601 HEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQERSKEGDVEEPTDDSL--PT-TG 657 Query: 656 DTGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLM 715 D GGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLM Sbjct: 658 DAGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLM 717 Query: 716 SDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTY 775 SDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTY Sbjct: 718 SDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTY 777 Query: 776 QSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF 820 QSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF Sbjct: 778 QSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) | MNTADKARVG PAADRPAPPE EEEGEAGGKG PVADPAPGPS AAFRLLVTRR EPAVRLQYAV 60 SGLEPLAWSE DHRVSVSTAR SIAVLELICD VHNPGQDLVI HRTSVPAPLN SCLLKVGSKT 120 EVAECKEKFA TSKDPTISQT FMLDRVFNPE GKALPPLRGF KYTSWSPMGC DANGRCLLAA 180 LTMDNRLTVQ ANLNRLQWVQ LADLTEIYGE RLYETSYRFS KSEAPQGSLG DFAEFQRRHS 240 MQTPVRMEWS GICTTQQVKH NNECRDVGSV LLAVLFENGN IAVWQFQLPF VGKESISSCN 300 TIESGITSPS VLFWWEYEHS NRKMSGLIVG SAFGPVKILP VNLKAVKGYF TLRQPVVLWK 360 EMDQLPVHSI KCVPLYHPYQ KCSCSLVVAA RGSYVFWCLL LISKAGLNVH NSHVTGLHSL 420 PIVSMTADKQ NGTVYTCSSD GKVRQLIPIF TDVALKFEHQ LIKLADVFGS VRTHGIAVSP 480 CGAYLAVITT EGMVNGLHPV NKNYQVQFVT LKTFEEAAAQ LLESSVQNLF KQVDLIDLVR 540 WKILKDKHIP QFLQEALEKK MESSGATYYW RFKLFLLRIL YQSMQKTPEA LWKPTHEDSK 600 ILLVDSPGMG NAEDEQQDEG TSSKPVNKPG LQERSKEGDA EDAAAAAAES PTQAGDTGGR 660 EPMEEKLLEI QGKIEAVEMH LTREHMKRVL GEVYLHTWIT ENTSIPTRGL CNFLMSDEEY 720 DDRTARVLIG HISKKMNKQT FPEHCSLCKE ILPFTDRKQA VCSNGHIWLR CFLTYQSCQS 780 LIYRRCLLHD SIARHPAPED PDWIKRLLQS PCPFCDSPVF Protein Fasta Sequence
>ENSSSCP00000006140.2|HAT_other|Sus scrofa MNTADKARVGPAADRPAPPEEEEGEAGGKGPVADPAPGPSAAFRLLVTRREPAVRLQYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKVGSKTEVAECKEKFATSKDPTISQTFMLDRVFNPEGKALPPLRGFKYTSWSPMGCDANGRCLLAALTMDNRLTVQANLNRLQWVQLADLTEIYGERLYETSYRFSKSEAPQGSLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGITSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLADVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKMESSGATYYWRFKLFLLRILYQSMQKTPEALWKPTHEDSKILLVDSPGMGNAEDEQQDEGTSSKPVNKPGLQERSKEGDAEDAAAAAAESPTQAGDTGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF
|
Nucleotide Sequence (Fasta) | CTCCGGGCCT AAGGGGAAGA AAACCGCCGC GGAGGGCGCT GGGGGCCGGC GGCCGCGGTG 60 CAAGAGGCGG CAACCGGACG GCGACGCGGA GTTGGCGCGC CCGGGGTCCG GCTTCAGCCT 120 CTGAGGGGAG ATGAATACGG CCGACAAGGC CCGGGTGGGG CCCGCGGCCG ACCGGCCTGC 180 GCCGCCCGAG GAAGAAGAGG GCGAGGCGGG CGGGAAGGGG CCGGTGGCGG ACCCGGCCCC 240 CGGGCCCAGC GCCGCGTTCC GCCTCCTGGT GACTCGGCGG GAGCCGGCCG TGAGGCTGCA 300 GTATGCGGTG AGCGGCCTGG AGCCGCTGGC GTGGTCCGAG GACCACCGCG TGTCGGTGTC 360 CACCGCCCGT AGCATCGCTG TGCTGGAGCT CATCTGCGAC GTGCACAACC CGGGCCAGGA 420 CCTGGTTATC CACCGCACCT CCGTGCCCGC GCCTCTCAAC AGCTGCCTCC TCAAAGTTGG 480 CTCAAAAACA GAAGTTGCTG AGTGCAAGGA GAAATTTGCC ACCTCTAAGG ACCCCACAAT 540 CAGTCAGACG TTCATGCTGG ATAGGGTGTT CAACCCTGAG GGGAAGGCTC TGCCCCCGCT 600 GAGAGGGTTC AAGTACACCA GCTGGTCCCC CATGGGTTGT GACGCGAACG GCAGGTGCCT 660 CTTGGCAGCA CTGACCATGG ACAACCGCCT GACCGTCCAG GCCAACCTCA ACAGACTGCA 720 GTGGGTCCAG CTGGCCGACC TGACGGAGAT TTACGGGGAA CGCCTTTACG AGACCAGTTA 780 CAGGTTCTCT AAAAGTGAGG CCCCGCAGGG GAGCCTCGGC GATTTTGCTG AGTTTCAGAG 840 GAGGCACAGC ATGCAGACCC CCGTCAGGAT GGAGTGGTCG GGCATCTGCA CCACGCAGCA 900 GGTCAAGCAC AACAACGAGT GCCGCGACGT GGGCAGCGTG CTCCTGGCAG TCCTCTTCGA 960 AAACGGCAAC ATCGCCGTGT GGCAGTTCCA GCTCCCGTTT GTAGGAAAGG AGTCTATCTC 1020 TTCGTGCAAC ACGATTGAGT CAGGAATCAC CTCTCCTAGC GTTTTGTTTT GGTGGGAGTA 1080 CGAGCACAGC AACCGGAAAA TGAGCGGCCT TATTGTGGGG AGTGCTTTCG GACCTGTAAA 1140 AATTCTTCCC GTCAACCTCA AAGCCGTCAA GGGCTACTTC ACCTTGAGGC AGCCCGTCGT 1200 CTTGTGGAAG GAAATGGACC AGCTGCCGGT TCACAGCATC AAGTGCGTGC CGCTCTATCA 1260 CCCCTACCAG AAGTGTAGCT GTAGCCTCGT GGTGGCCGCG AGAGGATCCT ACGTGTTTTG 1320 GTGTCTTCTT CTCATCTCCA AAGCGGGTCT GAATGTTCAC AATTCCCACG TCACAGGCCT 1380 CCACTCGCTG CCCATCGTCT CCATGACGGC GGACAAGCAG AACGGGACGG TCTACACCTG 1440 CTCCAGTGAC GGGAAGGTGC GGCAGCTGAT TCCCATCTTC ACGGACGTGG CCTTGAAGTT 1500 CGAACACCAG CTGATTAAGC TGGCGGACGT ATTCGGCTCC GTGAGGACAC ACGGGATCGC 1560 AGTGAGCCCC TGCGGCGCGT ACCTGGCCGT CATTACAACG GAGGGCATGG TCAACGGCCT 1620 CCACCCCGTG AACAAAAACT ACCAGGTTCA GTTCGTGACC CTCAAAACCT TTGAGGAGGC 1680 AGCGGCTCAG CTTCTGGAAT CTTCAGTTCA GAATCTCTTT AAGCAGGTTG ATTTAATAGA 1740 CCTGGTGCGC TGGAAGATTT TAAAAGATAA ACACATCCCG CAGTTTTTAC AAGAGGCTTT 1800 GGAAAAAAAG ATGGAAAGCA GCGGGGCCAC CTATTATTGG CGTTTCAAAC TCTTCCTCCT 1860 GAGGATTTTG TATCAGTCGA TGCAGAAAAC TCCAGAAGCC TTATGGAAAC CCACCCACGA 1920 GGACTCGAAA ATCTTACTCG TTGACTCACC TGGGATGGGC AACGCCGAGG ACGAACAGCA 1980 AGACGAAGGC ACGTCCTCCA AACCGGTGAA CAAGCCAGGC CTCCAGGAGC GGAGCAAAGA 2040 AGGAGACGCA GAGGACGCGG CGGCGGCGGC GGCGGAGTCG CCCACTCAGG CCGGGGACAC 2100 CGGAGGCCGC GAGCCCATGG AGGAGAAGCT CCTCGAGATC CAAGGGAAGA TCGAAGCTGT 2160 GGAAATGCAC CTGACCCGGG AGCACATGAA GCGCGTCCTG GGCGAAGTCT ACCTGCACAC 2220 CTGGATCACA GAAAACACTA GCATCCCCAC CAGGGGACTC TGCAACTTCT TAATGTCGGA 2280 TGAAGAGTAC GACGACCGGA CAGCCCGGGT GCTCATCGGA CACATCTCCA AGAAGATGAA 2340 CAAGCAGACG TTCCCTGAGC ACTGCAGCTT GTGTAAAGAG ATCCTGCCCT TCACAGATCG 2400 CAAGCAGGCC GTCTGCTCCA ATGGCCACAT TTGGCTCCGG TGCTTTTTAA CCTACCAGTC 2460 CTGCCAGAGT TTGATATACA GAAGGTGTTT GCTCCATGAC AGCATTGCCC GCCACCCAGC 2520 TCCAGAAGAT CCTGACTGGA TTAAGAGGTT ACTGCAAAGC CCCTGCCCCT TCTGCGATTC 2580 TCCTGTCTTC TGA
2594Nucleotide Fasta Sequence
>ENSSSCP00000006140.2|HAT_other|Sus scrofa CTCCGGGCCTAAGGGGAAGAAAACCGCCGCGGAGGGCGCTGGGGGCCGGCGGCCGCGGTGCAAGAGGCGGCAACCGGACGGCGACGCGGAGTTGGCGCGCCCGGGGTCCGGCTTCAGCCTCTGAGGGGAGATGAATACGGCCGACAAGGCCCGGGTGGGGCCCGCGGCCGACCGGCCTGCGCCGCCCGAGGAAGAAGAGGGCGAGGCGGGCGGGAAGGGGCCGGTGGCGGACCCGGCCCCCGGGCCCAGCGCCGCGTTCCGCCTCCTGGTGACTCGGCGGGAGCCGGCCGTGAGGCTGCAGTATGCGGTGAGCGGCCTGGAGCCGCTGGCGTGGTCCGAGGACCACCGCGTGTCGGTGTCCACCGCCCGTAGCATCGCTGTGCTGGAGCTCATCTGCGACGTGCACAACCCGGGCCAGGACCTGGTTATCCACCGCACCTCCGTGCCCGCGCCTCTCAACAGCTGCCTCCTCAAAGTTGGCTCAAAAACAGAAGTTGCTGAGTGCAAGGAGAAATTTGCCACCTCTAAGGACCCCACAATCAGTCAGACGTTCATGCTGGATAGGGTGTTCAACCCTGAGGGGAAGGCTCTGCCCCCGCTGAGAGGGTTCAAGTACACCAGCTGGTCCCCCATGGGTTGTGACGCGAACGGCAGGTGCCTCTTGGCAGCACTGACCATGGACAACCGCCTGACCGTCCAGGCCAACCTCAACAGACTGCAGTGGGTCCAGCTGGCCGACCTGACGGAGATTTACGGGGAACGCCTTTACGAGACCAGTTACAGGTTCTCTAAAAGTGAGGCCCCGCAGGGGAGCCTCGGCGATTTTGCTGAGTTTCAGAGGAGGCACAGCATGCAGACCCCCGTCAGGATGGAGTGGTCGGGCATCTGCACCACGCAGCAGGTCAAGCACAACAACGAGTGCCGCGACGTGGGCAGCGTGCTCCTGGCAGTCCTCTTCGAAAACGGCAACATCGCCGTGTGGCAGTTCCAGCTCCCGTTTGTAGGAAAGGAGTCTATCTCTTCGTGCAACACGATTGAGTCAGGAATCACCTCTCCTAGCGTTTTGTTTTGGTGGGAGTACGAGCACAGCAACCGGAAAATGAGCGGCCTTATTGTGGGGAGTGCTTTCGGACCTGTAAAAATTCTTCCCGTCAACCTCAAAGCCGTCAAGGGCTACTTCACCTTGAGGCAGCCCGTCGTCTTGTGGAAGGAAATGGACCAGCTGCCGGTTCACAGCATCAAGTGCGTGCCGCTCTATCACCCCTACCAGAAGTGTAGCTGTAGCCTCGTGGTGGCCGCGAGAGGATCCTACGTGTTTTGGTGTCTTCTTCTCATCTCCAAAGCGGGTCTGAATGTTCACAATTCCCACGTCACAGGCCTCCACTCGCTGCCCATCGTCTCCATGACGGCGGACAAGCAGAACGGGACGGTCTACACCTGCTCCAGTGACGGGAAGGTGCGGCAGCTGATTCCCATCTTCACGGACGTGGCCTTGAAGTTCGAACACCAGCTGATTAAGCTGGCGGACGTATTCGGCTCCGTGAGGACACACGGGATCGCAGTGAGCCCCTGCGGCGCGTACCTGGCCGTCATTACAACGGAGGGCATGGTCAACGGCCTCCACCCCGTGAACAAAAACTACCAGGTTCAGTTCGTGACCCTCAAAACCTTTGAGGAGGCAGCGGCTCAGCTTCTGGAATCTTCAGTTCAGAATCTCTTTAAGCAGGTTGATTTAATAGACCTGGTGCGCTGGAAGATTTTAAAAGATAAACACATCCCGCAGTTTTTACAAGAGGCTTTGGAAAAAAAGATGGAAAGCAGCGGGGCCACCTATTATTGGCGTTTCAAACTCTTCCTCCTGAGGATTTTGTATCAGTCGATGCAGAAAACTCCAGAAGCCTTATGGAAACCCACCCACGAGGACTCGAAAATCTTACTCGTTGACTCACCTGGGATGGGCAACGCCGAGGACGAACAGCAAGACGAAGGCACGTCCTCCAAACCGGTGAACAAGCCAGGCCTCCAGGAGCGGAGCAAAGAAGGAGACGCAGAGGACGCGGCGGCGGCGGCGGCGGAGTCGCCCACTCAGGCCGGGGACACCGGAGGCCGCGAGCCCATGGAGGAGAAGCTCCTCGAGATCCAAGGGAAGATCGAAGCTGTGGAAATGCACCTGACCCGGGAGCACATGAAGCGCGTCCTGGGCGAAGTCTACCTGCACACCTGGATCACAGAAAACACTAGCATCCCCACCAGGGGACTCTGCAACTTCTTAATGTCGGATGAAGAGTACGACGACCGGACAGCCCGGGTGCTCATCGGACACATCTCCAAGAAGATGAACAAGCAGACGTTCCCTGAGCACTGCAGCTTGTGTAAAGAGATCCTGCCCTTCACAGATCGCAAGCAGGCCGTCTGCTCCAATGGCCACATTTGGCTCCGGTGCTTTTTAACCTACCAGTCCTGCCAGAGTTTGATATACAGAAGGTGTTTGCTCCATGACAGCATTGCCCGCCACCCAGCTCCAGAAGATCCTGACTGGATTAAGAGGTTACTGCAAAGCCCCTGCCCCTTCTGCGATTCTCCTGTCTTCTGA
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |