Tag | Content |
WERAM ID | WERAM-Sah-0204 |
Ensembl Protein ID | ENSSHAP00000022249.1 |
Gene Name | GTF3C4 |
Ensembl Information | |
Status | Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HAT | HAT_other | 0 | 1362 | | |
Organism | Sarcophilus harrisii |
Domain Profile |
HAT HAT_other
Query: 1 PLMSDMLKVGSKKEVAECKEKFATSKDPTVSQSFMLDRVFNPEGKSLPPMRGFKYTSWSP 60
PL S +LKVGSK EVAECKEKFA SKDPTVSQ+FMLDRVFNPEGK+LPPMRGFKYTSWSP
Sbjct: 112 PLNSCLLKVGSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSP 171

Query: 61 MGCDANGRCLLSALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYKLTKNEAPEG 120
MGCDANGRCLL+ALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSY+L+KNEAPEG
Sbjct: 172 MGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKNEAPEG 231

Query: 121 DLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKNNNECHDVGSVLLAVLFENGNIAVWQFE 180
+LGDFAEFQRRHSMQTPVRMEWSGICTTQQVK+NNEC DVGSVLLAVLFENGNIAVWQF+
Sbjct: 232 NLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQ 291

Query: 181 LPFIGKESISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPVKILPVNLKAVK 240
LPF+GKESISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGP+KILPVNLKAVK
Sbjct: 292 LPFVGKESISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPIKILPVNLKAVK 351

Query: 241 GYFTLRQPVILWKEMDQLPVHSIKCIPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGL 300
GYFTLRQPVILWKEMDQLPVHSIKC+PLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGL
Sbjct: 352 GYFTLRQPVILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGL 411

Query: 301 NVHNSHVTGLHTLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSEV 360
NVHNSHVTGLH+LPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLS+V
Sbjct: 412 NVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDV 471

Query: 361 FGSVRTHGIAVSPCGAYLAVITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQ 420
FGSVRTHGIAVSPCGAYLA+ITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQ
Sbjct: 472 FGSVRTHGIAVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQ 531

Query: 421 NLFKQVDLLDLVRWKILKDKHIPQFLQEALDRKMEGSGSTYFWRFKLFLLRILYQSMQKT 480
NLFKQVDL+DLVRWKILKDKHIPQFLQEAL++K+E SG TYFWRFKLFLLRILYQSMQKT
Sbjct: 532 NLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFKLFLLRILYQSMQKT 591

Query: 481 PSEVLWKPTHEDNKILITDSLVLGHTDEEQQEEGTSFKQMSKAGGHEKNKERDVEDLADS 540
PSE LWKPTHED+KIL+ DS +G+ D+EQQEEGTS KQ+ K G E++KE DVE+ D
Sbjct: 592 PSEALWKPTHEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQERSKEGDVEEPTDD 651

Query: 541 SSIQSGDA-SREPMEEKLLEIQGRIEAVEMHLSREHMKRVLGEVYLHTWITENTGIPTKG 599
S +GDA REPMEEKLLEIQG+IEAVEMHL+REHMKRVLGEVYLHTWITENT IPT+G
Sbjct: 652 SLPTTGDAGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRG 711

Query: 600 LCDYLMSDEGYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWL 659
LC++LMSDE YDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWL
Sbjct: 712 LCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWL 771

Query: 660 RCFLTYQSCQSLIYRRCLLHDSIARHPTSRDPDWIKKLLQGPCPFCDSPVF 710
RCFLTYQSCQSLIYRRCLLHDSIARHP DPDWIK+LLQ PCPFCDSPVF
Sbjct: 772 RCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) |
PLMSDMLKVG SKKEVAECKE KFATSKDPTV SQSFMLDRVF NPEGKSLPPM RGFKYTSWSP 60
MGCDANGRCL LSALTMDNRL TIQANLNRLQ WVQLVDLTEI YGERLYETSY KLTKNEAPEG 120
DLGDFAEFQR RHSMQTPVRM EWSGICTTQQ VKNNNECHDV GSVLLAVLFE NGNIAVWQFE 180
LPFIGKESIS SCNTIESGIT SPSVLFWWEY EHNNRKMSGL IVGSAFGPVK ILPVNLKAVK 240
GYFTLRQPVI LWKEMDQLPV HSIKCIPLYH PYQKCSCSLV VAARGSYVFW CLLLISKAGL 300
NVHNSHVTGL HTLPIVSMTA DKQNGTVYTC SSDGKVRQLI PIFTDVALKF EHQLIKLSEV 360
FGSVRTHGIA VSPCGAYLAV ITTEGMINGL HPVNKNYQVQ FVTLKTFEEA AAQLLESSVQ 420
NLFKQVDLLD LVRWKILKDK HIPQFLQEAL DRKMEGSGST YFWRFKLFLL RILYQSMQKT 480
PSEVLWKPTH EDNKILITDS LVLGHTDEEQ QEEGTSFKQM SKAGGHEKNK ERDVEDLADS 540
SSIQSGDASR EPMEEKLLEI QGRIEAVEMH LSREHMKRVL GEVYLHTWIT ENTGIPTKGL 600
CDYLMSDEGY DDRTARVLIG HISKKMNKQT FPEHCSLCKE ILPFTDRKQA VCSNGHIWLR 660
CFLTYQSCQS LIYRRCLLHD SIARHPTSRD PDWIKKLLQG PCPFCDSPVF
Protein Fasta Sequence
>ENSSHAP00000022249.1|HAT_other|Sarcophilus harrisii
PLMSDMLKVGSKKEVAECKEKFATSKDPTVSQSFMLDRVFNPEGKSLPPMRGFKYTSWSP
MGCDANGRCLLSALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYKLTKNEAPEG
DLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKNNNECHDVGSVLLAVLFENGNIAVWQFE
LPFIGKESISSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPVKILPVNLKAVK
GYFTLRQPVILWKEMDQLPVHSIKCIPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGL
NVHNSHVTGLHTLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSEV
FGSVRTHGIAVSPCGAYLAVITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQ
NLFKQVDLLDLVRWKILKDKHIPQFLQEALDRKMEGSGSTYFWRFKLFLLRILYQSMQKT
PSEVLWKPTHEDNKILITDSLVLGHTDEEQQEEGTSFKQMSKAGGHEKNKERDVEDLADS
SSIQSGDASREPMEEKLLEIQGRIEAVEMHLSREHMKRVLGEVYLHTWITENTGIPTKGL
CDYLMSDEGYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLR
CFLTYQSCQSLIYRRCLLHDSIARHPTSRDPDWIKKLLQGPCPFCDSPVF
|
Nucleotide Sequence (Fasta) |
CCTTTGATGT CAGATATGTT AAAGGTTGGT TCAAAAAAGG AGGTTGCTGA ATGTAAGGAG 60
AAATTTGCCA CCTCTAAAGA CCCCACAGTC AGTCAAAGCT TCATGCTAGA TAGGGTATTC 120
AATCCTGAAG GAAAATCTTT GCCACCAATG AGAGGATTCA AGTATACCAG CTGGTCTCCA 180
ATGGGCTGTG ATGCTAATGG GAGATGTCTG TTGTCAGCAT TAACCATGGA CAATCGATTG 240
ACCATTCAGG CTAACCTCAA CAGACTGCAG TGGGTACAGC TGGTTGATCT AACTGAGATC 300
TATGGAGAGC GCCTTTACGA AACCAGCTAT AAGCTAACCA AAAATGAAGC CCCTGAAGGA 360
GATCTTGGGG ACTTTGCTGA GTTTCAGAGG AGACACAGCA TGCAGACTCC AGTACGGATG 420
GAGTGGTCAG GCATCTGCAC CACCCAGCAG GTCAAAAACA ATAATGAATG CCATGATGTC 480
GGCAGTGTGC TCCTGGCAGT CCTTTTCGAA AATGGAAACA TTGCAGTGTG GCAGTTTGAG 540
CTCCCATTTA TTGGCAAGGA ATCTATCTCT TCATGTAACA CCATTGAATC AGGGATTACC 600
TCTCCTAGTG TTTTGTTTTG GTGGGAGTAT GAACACAACA ATCGAAAAAT GAGCGGGCTC 660
ATTGTGGGGA GTGCATTTGG GCCTGTTAAA ATTCTTCCCG TTAACCTCAA AGCAGTTAAA 720
GGCTACTTCA CATTAAGACA ACCTGTTATT CTTTGGAAAG AAATGGACCA GTTGCCAGTA 780
CACAGCATTA AATGTATTCC TCTTTATCAT CCATACCAAA AATGTAGTTG TAGTTTAGTG 840
GTGGCTGCTA GGGGATCCTA TGTGTTTTGG TGTCTTCTTC TGATATCCAA AGCAGGTTTG 900
AATGTCCATA ATTCCCATGT CACAGGACTT CACACACTAC CCATTGTTTC CATGACTGCA 960
GACAAACAAA ATGGCACAGT ATATACTTGT TCAAGTGATG GAAAGGTGAG GCAACTGATT 1020
CCCATTTTTA CAGATGTTGC ATTAAAGTTT GAGCACCAAC TGATAAAACT ATCAGAAGTA 1080
TTTGGCTCAG TGAGGACTCA TGGGATAGCA GTAAGCCCTT GTGGTGCATA CCTGGCAGTC 1140
ATTACAACTG AAGGCATGAT CAATGGTCTC CACCCTGTGA ACAAAAACTA CCAGGTTCAG 1200
TTTGTCACTC TCAAAACATT TGAAGAGGCA GCAGCTCAAC TCCTAGAGTC TTCAGTTCAA 1260
AATCTCTTTA AGCAGGTAGA CTTATTAGAC CTGGTACGCT GGAAAATTTT GAAAGATAAA 1320
CATATTCCTC AGTTTTTACA AGAAGCTTTG GATAGAAAGA TGGAAGGTTC TGGGTCTACT 1380
TACTTTTGGC GTTTTAAACT TTTCCTCTTG CGGATTTTAT ATCAGTCAAT GCAGAAAACC 1440
CCTTCAGAGG TCTTATGGAA GCCTACCCAT GAAGACAACA AAATCTTAAT AACTGACTCT 1500
CTTGTGCTGG GCCATACTGA TGAGGAGCAA CAAGAAGAAG GAACATCTTT TAAACAAATG 1560
AGCAAAGCAG GTGGTCATGA GAAGAACAAA GAAAGAGATG TTGAAGATCT AGCTGACAGC 1620
TCATCCATCC AATCTGGAGA TGCAAGTCGT GAGCCAATGG AAGAGAAACT CCTAGAAATC 1680
CAAGGAAGAA TTGAAGCTGT TGAAATGCAC TTAAGTAGGG AACATATGAA GAGGGTATTG 1740
GGGGAAGTAT ACCTGCACAC ATGGATCACG GAGAACACTG GCATTCCCAC CAAAGGACTC 1800
TGTGACTATT TAATGTCTGA TGAAGGTTAT GATGACAGAA CAGCTCGGGT ACTCATTGGG 1860
CACATCTCAA AGAAGATGAA CAAACAAACT TTCCCTGAAC ACTGCAGTTT ATGTAAAGAA 1920
ATTTTGCCAT TCACAGATCG TAAGCAGGCA GTTTGTTCTA ATGGGCACAT TTGGCTCCGG 1980
TGCTTTTTAA CTTACCAGTC TTGCCAGAGT TTGATATACA GAAGGTGTTT GCTTCATGAC 2040
AGCATTGCAC GACATCCAAC TTCACGAGAT CCTGACTGGA TAAAGAAGTT GTTGCAAGGC 2100
CCTTGCCCAT TCTGTGACTC CCCTGTTTTC TAA 2134
Nucleotide Fasta Sequence
>ENSSHAP00000022249.1|HAT_other|Sarcophilus harrisii
CCTTTGATGTCAGATATGTTAAAGGTTGGTTCAAAAAAGGAGGTTGCTGAATGTAAGGAG
AAATTTGCCACCTCTAAAGACCCCACAGTCAGTCAAAGCTTCATGCTAGATAGGGTATTC
AATCCTGAAGGAAAATCTTTGCCACCAATGAGAGGATTCAAGTATACCAGCTGGTCTCCA
ATGGGCTGTGATGCTAATGGGAGATGTCTGTTGTCAGCATTAACCATGGACAATCGATTG
ACCATTCAGGCTAACCTCAACAGACTGCAGTGGGTACAGCTGGTTGATCTAACTGAGATC
TATGGAGAGCGCCTTTACGAAACCAGCTATAAGCTAACCAAAAATGAAGCCCCTGAAGGA
GATCTTGGGGACTTTGCTGAGTTTCAGAGGAGACACAGCATGCAGACTCCAGTACGGATG
GAGTGGTCAGGCATCTGCACCACCCAGCAGGTCAAAAACAATAATGAATGCCATGATGTC
GGCAGTGTGCTCCTGGCAGTCCTTTTCGAAAATGGAAACATTGCAGTGTGGCAGTTTGAG
CTCCCATTTATTGGCAAGGAATCTATCTCTTCATGTAACACCATTGAATCAGGGATTACC
TCTCCTAGTGTTTTGTTTTGGTGGGAGTATGAACACAACAATCGAAAAATGAGCGGGCTC
ATTGTGGGGAGTGCATTTGGGCCTGTTAAAATTCTTCCCGTTAACCTCAAAGCAGTTAAA
GGCTACTTCACATTAAGACAACCTGTTATTCTTTGGAAAGAAATGGACCAGTTGCCAGTA
CACAGCATTAAATGTATTCCTCTTTATCATCCATACCAAAAATGTAGTTGTAGTTTAGTG
GTGGCTGCTAGGGGATCCTATGTGTTTTGGTGTCTTCTTCTGATATCCAAAGCAGGTTTG
AATGTCCATAATTCCCATGTCACAGGACTTCACACACTACCCATTGTTTCCATGACTGCA
GACAAACAAAATGGCACAGTATATACTTGTTCAAGTGATGGAAAGGTGAGGCAACTGATT
CCCATTTTTACAGATGTTGCATTAAAGTTTGAGCACCAACTGATAAAACTATCAGAAGTA
TTTGGCTCAGTGAGGACTCATGGGATAGCAGTAAGCCCTTGTGGTGCATACCTGGCAGTC
ATTACAACTGAAGGCATGATCAATGGTCTCCACCCTGTGAACAAAAACTACCAGGTTCAG
TTTGTCACTCTCAAAACATTTGAAGAGGCAGCAGCTCAACTCCTAGAGTCTTCAGTTCAA
AATCTCTTTAAGCAGGTAGACTTATTAGACCTGGTACGCTGGAAAATTTTGAAAGATAAA
CATATTCCTCAGTTTTTACAAGAAGCTTTGGATAGAAAGATGGAAGGTTCTGGGTCTACT
TACTTTTGGCGTTTTAAACTTTTCCTCTTGCGGATTTTATATCAGTCAATGCAGAAAACC
CCTTCAGAGGTCTTATGGAAGCCTACCCATGAAGACAACAAAATCTTAATAACTGACTCT
CTTGTGCTGGGCCATACTGATGAGGAGCAACAAGAAGAAGGAACATCTTTTAAACAAATG
AGCAAAGCAGGTGGTCATGAGAAGAACAAAGAAAGAGATGTTGAAGATCTAGCTGACAGC
TCATCCATCCAATCTGGAGATGCAAGTCGTGAGCCAATGGAAGAGAAACTCCTAGAAATC
CAAGGAAGAATTGAAGCTGTTGAAATGCACTTAAGTAGGGAACATATGAAGAGGGTATTG
GGGGAAGTATACCTGCACACATGGATCACGGAGAACACTGGCATTCCCACCAAAGGACTC
TGTGACTATTTAATGTCTGATGAAGGTTATGATGACAGAACAGCTCGGGTACTCATTGGG
CACATCTCAAAGAAGATGAACAAACAAACTTTCCCTGAACACTGCAGTTTATGTAAAGAA
ATTTTGCCATTCACAGATCGTAAGCAGGCAGTTTGTTCTAATGGGCACATTTGGCTCCGG
TGCTTTTTAACTTACCAGTCTTGCCAGAGTTTGATATACAGAAGGTGTTTGCTTCATGAC
AGCATTGCACGACATCCAACTTCACGAGATCCTGACTGGATAAAGAAGTTGTTGCAAGGC
CCTTGCCCATTCTGTGACTCCCCTGTTTTCTAA
|
Sequence Source | Ensembl |
Orthology | |
Created Date | 25-Jun-2016 |