Tag | Content |
WERAM ID | WERAM-Anp-0004 |
Ensembl Protein ID | ENSAPLP00000000397.1 |
Gene Name | GTF3C4 |
Ensembl Information | |
Status | Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HAT | HAT_other | 0 | 1361 | | |
|
Organism | Anas platyrhynchos |
Domain Profile |
HAT HAT_other
Query: 1 VSTARSIAVLEQLSDVQG-GQEMVIHRTAVPAPAAACYLKVGPKKEVVECREKFSSSMDP 59
VSTARSIAVLE + DV GQ++VIHRT+VPAP +C LKVG K EV EC+EKF++S DP
Sbjct: 80 VSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKVGSKTEVAECKEKFAASKDP 139

Query: 60 TVSQTFMLDRVFNPEGKSLPPMRGFKYSSWSPLGCDANGRCLLAALTMDNRLTIHANLNR 119
TVSQTFMLDRVFNPEGK+LPPMRGFKY+SWSP+GCDANGRCLLAALTMDNRLTI ANLNR
Sbjct: 140 TVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNR 199

Query: 120 LQWVQLVDLTEIYGERLHEANYKLSKADTPRGELEDFAEFQRRHSMQTPVRMEWSGICTT 179
LQWVQLVDLTEIYGERL+E +Y+LSK + P G L DFAEFQRRHSMQTPVRMEWSGICTT
Sbjct: 200 LQWVQLVDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTT 259

Query: 180 QQVKHNNECRDVGSVLLAVLFENSNIAVWQFQLPFLGKESITSCNTIESGISSPSVLSWW 239
QQVKHNNECRDVGSVLLAVLFEN NIAVWQFQLPF+GKESI+SCNTIESGI+SPSVL WW
Sbjct: 260 QQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGITSPSVLFWW 319

Query: 240 EYEHNNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWQEMDQLPVHSIKCIPL 299
EYEHNNRKMSGLIVGSAFGP+KILPVNLKAVKGYFTLRQPV+LW+EMDQLPVHSIKC+PL
Sbjct: 320 EYEHNNRKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQPVILWKEMDQLPVHSIKCVPL 379

Query: 300 YHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVY 359
YHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVY
Sbjct: 380 YHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVY 439

Query: 360 TCSSDGKVRQLIPIFTDVALKFEHQLIKLSEVFGSVRTHGIAVSPCGAYLAVITTEGMTN 419
TCSSDGKVRQLIPIFTDVALKFEHQLIKLS+VFGSVRTHGIAVSPCGAYLA+ITTEGM N
Sbjct: 440 TCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAIITTEGMIN 499

Query: 420 GLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFRQVDLTDLVRWKILKEKHIPQFLQE 479
GLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLF+QVDL DLVRWKILK+KHIPQFLQE
Sbjct: 500 GLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQE 559

Query: 480 ALDKKIESCGSTYFWRFKLFLLRILYQSMQKAPSEVMWRPSHEDTKILVSDSPGMGSTED 539
AL+KKIES G TYFWRFKLFLLRILYQSMQK PSE +W+P+HED+KIL+ DSPGMG+ +D
Sbjct: 560 ALEKKIESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKPTHEDSKILLVDSPGMGNADD 619

Query: 540 D-QEEGT-SKRASKQSLCDTGKGMDIDDTADDSLPQSSEIGGHEPMEEKLLEVQAQIEAV 597
+ QEEGT SK+ KQ L + K D+++ DDSLP + + GG EPMEEKLLE+Q +IEAV
Sbjct: 620 EQQEEGTSSKQVVKQGLQERSKEGDVEEPTDDSLPTTGDAGGREPMEEKLLEIQGKIEAV 679

Query: 598 EMHLTREHMKRVLGEVYLHTWITENTSIPTRGVCDFLMSDDGYEDRTARVLIGHILKKMN 657
EMHLTREHMKRVLGEVYLHTWITENTSIPTRG+C+FLMSD+ Y+DRTARVLIGHI KKMN
Sbjct: 680 EMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMN 739

Query: 658 KQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLVYRRCLLHDSIARHPT 717
KQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSL+YRRCLLHDSIARHP
Sbjct: 740 KQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPA 799

Query: 718 PEDPEWIKRLLQGPCTFCDSPVF 740
PEDP+WIKRLLQ PC FCDSPVF
Sbjct: 800 PEDPDWIKRLLQSPCPFCDSPVF 822
|
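The match line of the alignment above repeats a residue wherever the Query and Sbjct rows agree. As a minimal sketch of how such identities can be counted, the Python below compares two aligned rows column by column. The function name `count_identities` is hypothetical, and the example input is only the final block of the alignment (Query 718-740 against Sbjct 800-822), not the full alignment.

```python
def count_identities(query: str, subject: str) -> tuple[int, int]:
    """Return (identical columns, total columns) for two aligned rows.

    Both rows must already be aligned, so they have equal length.
    A gap character ('-') never counts as an identity.
    """
    if len(query) != len(subject):
        raise ValueError("aligned rows must have equal length")
    identical = sum(1 for q, s in zip(query, subject) if q == s and q != "-")
    return identical, len(query)


if __name__ == "__main__":
    # Final alignment block copied from the record.
    query_block = "PEDPEWIKRLLQGPCTFCDSPVF"    # Query 718-740
    subject_block = "PEDPDWIKRLLQSPCPFCDSPVF"  # Sbjct 800-822
    identical, columns = count_identities(query_block, subject_block)
    print(f"{identical}/{columns} identical columns")
```

Applying the same function to each block and summing the two counts would give the identity over the whole alignment.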
Protein Sequence (Fasta) | VSTARSIAVL EQLSDVQGGQ EMVIHRTAVP APAAACYLKV GPKKEVVECR EKFSSSMDPT 60 VSQTFMLDRV FNPEGKSLPP MRGFKYSSWS PLGCDANGRC LLAALTMDNR LTIHANLNRL 120 QWVQLVDLTE IYGERLHEAN YKLSKADTPR GELEDFAEFQ RRHSMQTPVR MEWSGICTTQ 180 QVKHNNECRD VGSVLLAVLF ENSNIAVWQF QLPFLGKESI TSCNTIESGI SSPSVLSWWE 240 YEHNNRKMSG LIVGSAFGPV KILPVNLKAV KGYFTLRQPV VLWQEMDQLP VHSIKCIPLY 300 HPYQKCSCSL VVAARGSYVF WCLLLISKAG LNVHNSHVTG LHSLPIVSMT ADKQNGTVYT 360 CSSDGKVRQL IPIFTDVALK FEHQLIKLSE VFGSVRTHGI AVSPCGAYLA VITTEGMTNG 420 LHPVNKNYQV QFVTLKTFEE AAAQLLESSV QNLFRQVDLT DLVRWKILKE KHIPQFLQEA 480 LDKKIESCGS TYFWRFKLFL LRILYQSMQK APSEVMWRPS HEDTKILVSD SPGMGSTEDD 540 QEEGTSKRAS KQSLCDTGKG MDIDDTADDS LPQSSEIGGH EPMEEKLLEV QAQIEAVEMH 600 LTREHMKRVL GEVYLHTWIT ENTSIPTRGV CDFLMSDDGY EDRTARVLIG HILKKMNKQT 660 FPEHCSLCKE ILPFTDRKQA VCSNGHIWLR CFLTYQSCQS LVYRRCLLHD SIARHPTPED 720 PEWIKRLLQG PCTFCDSPVF
Protein Fasta Sequence
>ENSAPLP00000000397.1|HAT_other|Anas platyrhynchos VSTARSIAVLEQLSDVQGGQEMVIHRTAVPAPAAACYLKVGPKKEVVECREKFSSSMDPTVSQTFMLDRVFNPEGKSLPPMRGFKYSSWSPLGCDANGRCLLAALTMDNRLTIHANLNRLQWVQLVDLTEIYGERLHEANYKLSKADTPRGELEDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENSNIAVWQFQLPFLGKESITSCNTIESGISSPSVLSWWEYEHNNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWQEMDQLPVHSIKCIPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSEVFGSVRTHGIAVSPCGAYLAVITTEGMTNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFRQVDLTDLVRWKILKEKHIPQFLQEALDKKIESCGSTYFWRFKLFLLRILYQSMQKAPSEVMWRPSHEDTKILVSDSPGMGSTEDDQEEGTSKRASKQSLCDTGKGMDIDDTADDSLPQSSEIGGHEPMEEKLLEVQAQIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGVCDFLMSDDGYEDRTARVLIGHILKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLVYRRCLLHDSIARHPTPEDPEWIKRLLQGPCTFCDSPVF
|
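The FASTA header in this record appears to carry three pipe-separated fields: the Ensembl protein ID, the family, and the organism. The Python sketch below shows one way to split such a record into fields. It assumes the conventional FASTA layout, with the header on its own line followed by one or more sequence lines, and the names `FastaRecord` and `parse_fasta` are hypothetical. The example input uses only the first 20 residues of the sequence.

```python
from typing import NamedTuple


class FastaRecord(NamedTuple):
    protein_id: str
    family: str
    organism: str
    sequence: str


def parse_fasta(text: str) -> FastaRecord:
    """Parse a single FASTA record whose header is '>id|family|organism'."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("FASTA record must start with a '>' header line")
    fields = lines[0][1:].split("|")
    if len(fields) != 3:
        raise ValueError("expected header of the form id|family|organism")
    protein_id, family, organism = (field.strip() for field in fields)
    # Sequence lines may be wrapped or grouped; drop all internal whitespace.
    sequence = "".join("".join(line.split()) for line in lines[1:])
    return FastaRecord(protein_id, family, organism, sequence)


if __name__ == "__main__":
    # Header copied from the record; sequence truncated to 20 residues.
    example = (
        ">ENSAPLP00000000397.1|HAT_other|Anas platyrhynchos\n"
        "VSTARSIAVL\n"
        "EQLSDVQGGQ\n"
    )
    record = parse_fasta(example)
    print(record.protein_id, record.family, record.organism, len(record.sequence))
```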
Nucleotide Sequence (Fasta) | GTGAGCACGG CCCGCAGCAT CGCCGTGCTG GAGCAGCTGA GCGATGTGCA GGGAGGGCAG 60 GAGATGGTGA TCCACCGAAC CGCCGTGCCC GCGCCCGCCG CCGCCTGTTA CCTCAAGGTT 120 GGTCCAAAGA AAGAGGTAGT GGAATGTAGG GAAAAGTTCT CCAGTTCGAT GGATCCCACG 180 GTCAGTCAAA CATTTATGCT AGATCGAGTG TTCAATCCTG AGGGGAAGTC GCTGCCGCCA 240 ATGCGAGGCT TCAAGTACTC CAGCTGGTCG CCGCTGGGCT GTGACGCCAA CGGACGATGC 300 CTGCTGGCAG CTTTAACCAT GGACAATCGA TTGACCATCC ACGCAAACCT CAACAGACTG 360 CAGTGGGTGC AGCTGGTGGA CCTGACGGAA ATCTATGGCG AGCGTTTGCA CGAAGCCAAT 420 TACAAACTTT CCAAGGCTGA CACTCCCAGG GGAGAGCTAG AAGACTTCGC TGAATTTCAG 480 CGGCGGCACA GCATGCAGAC CCCAGTACGC ATGGAGTGGT CGGGTATCTG TACCACGCAG 540 CAGGTCAAAC ACAACAACGA GTGCCGGGAT GTTGGTAGCG TGCTCTTAGC AGTGCTCTTT 600 GAAAACAGTA ACATTGCGGT TTGGCAGTTT CAGCTTCCGT TTCTGGGTAA GGAATCAATT 660 ACTTCTTGCA ATACCATAGA GTCTGGAATA AGTTCTCCGA GTGTCCTGTC TTGGTGGGAA 720 TATGAACATA ACAACCGAAA GATGAGTGGA CTTATTGTAG GGAGTGCTTT TGGGCCAGTT 780 AAAATTCTTC CTGTCAATCT AAAAGCAGTC AAAGGCTACT TTACACTAAG ACAGCCCGTA 840 GTCTTATGGC AAGAAATGGA CCAGTTACCA GTGCACAGTA TCAAATGTAT TCCACTTTAC 900 CATCCCTACC AGAAATGTAG CTGTAGCTTA GTGGTGGCCG CAAGAGGATC TTACGTGTTT 960 TGGTGTCTTC TGTTGATATC CAAGGCAGGT CTGAACGTCC ATAATTCCCA CGTGACAGGG 1020 CTTCATTCGT TGCCAATTGT GTCTATGACT GCGGACAAAC AGAACGGCAC AGTGTATACG 1080 TGCTCCAGTG ACGGAAAGGT AAGGCAGCTG ATTCCTATAT TCACAGATGT TGCTTTAAAG 1140 TTCGAGCACC AGCTGATTAA GCTCTCAGAA GTGTTTGGCT CTGTGAGGAC TCACGGAATA 1200 GCTGTTAGCC CCTGTGGTGC GTACTTAGCA GTTATTACGA CAGAGGGCAT GACTAACGGT 1260 CTGCACCCCG TTAACAAAAA CTACCAAGTT CAGTTTGTTA CTCTTAAGAC TTTTGAGGAG 1320 GCAGCCGCGC AGCTCCTGGA ATCTTCTGTT CAGAACCTTT TCCGGCAAGT GGACTTGACA 1380 GATCTCGTAC GCTGGAAAAT TTTGAAGGAG AAGCATATTC CTCAATTTTT ACAGGAAGCG 1440 CTGGATAAAA AGATTGAGAG CTGCGGTTCT ACTTACTTCT GGCGGTTTAA GCTGTTTCTC 1500 CTGAGGATTT TGTACCAGTC AATGCAGAAA GCTCCCTCAG AGGTCATGTG GAGACCTTCA 1560 CATGAGGACA CAAAAATACT GGTATCTGAT TCTCCCGGAA TGGGCAGCAC TGAAGATGAT 1620 CAAGAGGAAG GAACTTCTAA ACGAGCCAGC AAGCAGAGCC TATGTGACAC GGGCAAAGGT 
1680 ATGGACATAG ATGACACTGC CGATGATTCT CTTCCTCAGT CAAGTGAGAT AGGAGGCCAC 1740 GAGCCAATGG AAGAAAAGCT GCTTGAAGTA CAGGCACAGA TTGAGGCTGT AGAAATGCAC 1800 TTGACACGAG AGCACATGAA ACGGGTGTTG GGAGAAGTTT ATCTACACAC ATGGATTACA 1860 GAAAACACCA GTATTCCCAC CAGAGGAGTC TGTGACTTCT TAATGTCCGA TGACGGATAT 1920 GAGGACAGAA CAGCACGAGT GCTGATTGGG CATATCTTAA AGAAAATGAA CAAACAGACT 1980 TTTCCAGAGC ACTGCAGCTT GTGTAAAGAG ATCCTGCCAT TCACTGATCG CAAACAGGCA 2040 GTCTGCTCCA ATGGACATAT TTGGCTCAGG TGCTTTCTAA CCTACCAGTC CTGTCAGAGC 2100 TTGGTGTACA GGAGGTGTTT GCTTCATGAC AGCATTGCAC GGCATCCAAC TCCAGAAGAT 2160 CCTGAGTGGA TCAAGAGGTT ACTGCAAGGA CCTTGCACGT TCTGTGATTC TCCGGTCTTC 2220 TAGAGAAGAG CTATGTAAAG ACTGAAAATA CTTTCACTAC TACATAAGCT CCCTTCAGTC 2280 TAGAAGGATA GTGGGCACAG AAATAAACAC TTTACAGAAA GGGAAGACAA TGTCTGTGAC 2340 CCTGTAAATA GAACATTGGA GGTTCTAGAA ACTCCTTGGG TCCAGAGGCC ATTCCATGCT 2400 CCAGAACCAG GAGCACACCT GGAAGATTGA GTTCATAAAG CAATTTGGAA ACACACCCCG 2460 TTGAAGTATC AGCAGCTGGA CACACTTCCT TTTATTGAAC AGAAGCTGAA GTCTTAAGGT 2520 TGCAGCGTGA ACTGTCCTTT CTGCCTGGAC ACTGTCTGAT TTCCCAAAAG CTAGGATAGG 2580 AACAGCGGAT CTGCCTCTTG AATAGTCTTG GTGGGTAGAG AAAGCTTCAG CTTGTAGTGC 2640 TTGTTTGCTT TTTAACAAGT ATTGGCACTT GCTGCAGAAC ATTAAACTAT CAAAATAATT 2700 TGTGGCTCTA ATTTCTTGCA GAGTGAAGCA AATGCCAGTT ACAGCCATAT GAAAGAAA
2759
Nucleotide Fasta Sequence
>ENSAPLP00000000397.1|HAT_other|Anas platyrhynchos GTGAGCACGGCCCGCAGCATCGCCGTGCTGGAGCAGCTGAGCGATGTGCAGGGAGGGCAGGAGATGGTGATCCACCGAACCGCCGTGCCCGCGCCCGCCGCCGCCTGTTACCTCAAGGTTGGTCCAAAGAAAGAGGTAGTGGAATGTAGGGAAAAGTTCTCCAGTTCGATGGATCCCACGGTCAGTCAAACATTTATGCTAGATCGAGTGTTCAATCCTGAGGGGAAGTCGCTGCCGCCAATGCGAGGCTTCAAGTACTCCAGCTGGTCGCCGCTGGGCTGTGACGCCAACGGACGATGCCTGCTGGCAGCTTTAACCATGGACAATCGATTGACCATCCACGCAAACCTCAACAGACTGCAGTGGGTGCAGCTGGTGGACCTGACGGAAATCTATGGCGAGCGTTTGCACGAAGCCAATTACAAACTTTCCAAGGCTGACACTCCCAGGGGAGAGCTAGAAGACTTCGCTGAATTTCAGCGGCGGCACAGCATGCAGACCCCAGTACGCATGGAGTGGTCGGGTATCTGTACCACGCAGCAGGTCAAACACAACAACGAGTGCCGGGATGTTGGTAGCGTGCTCTTAGCAGTGCTCTTTGAAAACAGTAACATTGCGGTTTGGCAGTTTCAGCTTCCGTTTCTGGGTAAGGAATCAATTACTTCTTGCAATACCATAGAGTCTGGAATAAGTTCTCCGAGTGTCCTGTCTTGGTGGGAATATGAACATAACAACCGAAAGATGAGTGGACTTATTGTAGGGAGTGCTTTTGGGCCAGTTAAAATTCTTCCTGTCAATCTAAAAGCAGTCAAAGGCTACTTTACACTAAGACAGCCCGTAGTCTTATGGCAAGAAATGGACCAGTTACCAGTGCACAGTATCAAATGTATTCCACTTTACCATCCCTACCAGAAATGTAGCTGTAGCTTAGTGGTGGCCGCAAGAGGATCTTACGTGTTTTGGTGTCTTCTGTTGATATCCAAGGCAGGTCTGAACGTCCATAATTCCCACGTGACAGGGCTTCATTCGTTGCCAATTGTGTCTATGACTGCGGACAAACAGAACGGCACAGTGTATACGTGCTCCAGTGACGGAAAGGTAAGGCAGCTGATTCCTATATTCACAGATGTTGCTTTAAAGTTCGAGCACCAGCTGATTAAGCTCTCAGAAGTGTTTGGCTCTGTGAGGACTCACGGAATAGCTGTTAGCCCCTGTGGTGCGTACTTAGCAGTTATTACGACAGAGGGCATGACTAACGGTCTGCACCCCGTTAACAAAAACTACCAAGTTCAGTTTGTTACTCTTAAGACTTTTGAGGAGGCAGCCGCGCAGCTCCTGGAATCTTCTGTTCAGAACCTTTTCCGGCAAGTGGACTTGACAGATCTCGTACGCTGGAAAATTTTGAAGGAGAAGCATATTCCTCAATTTTTACAGGAAGCGCTGGATAAAAAGATTGAGAGCTGCGGTTCTACTTACTTCTGGCGGTTTAAGCTGTTTCTCCTGAGGATTTTGTACCAGTCAATGCAGAAAGCTCCCTCAGAGGTCATGTGGAGACCTTCACATGAGGACACAAAAATACTGGTATCTGATTCTCCCGGAATGGGCAGCACTGAAGATGATCAAGAGGAAGGAACTTCTAAACGAGCCAGCAAGCAGAGCCTATGTGACACGGGCAAAGGTATGGACATAGATGACACTGCCGATGATTCTCTTCCTCAGTCAAGTGAGATAGGAGGCCACGAGCCAATGGAAGAAAAGCTGCTTGAAGTACAGGCACAGATTGAGGCTGTAGAAATGCACTTGACACGAGAGCACATGAAACGGGTGTTGGGAGAAGTTTATCTACACACATGGATTACAGAAAACACCAGTATTCCCACCAGAGGAGTCTGTGACTTCTTAATGTCCGATGACGGATATGAGGACAGAACAGCACGAGTGCTGATTGG
GCATATCTTAAAGAAAATGAACAAACAGACTTTTCCAGAGCACTGCAGCTTGTGTAAAGAGATCCTGCCATTCACTGATCGCAAACAGGCAGTCTGCTCCAATGGACATATTTGGCTCAGGTGCTTTCTAACCTACCAGTCCTGTCAGAGCTTGGTGTACAGGAGGTGTTTGCTTCATGACAGCATTGCACGGCATCCAACTCCAGAAGATCCTGAGTGGATCAAGAGGTTACTGCAAGGACCTTGCACGTTCTGTGATTCTCCGGTCTTCTAGAGAAGAGCTATGTAAAGACTGAAAATACTTTCACTACTACATAAGCTCCCTTCAGTCTAGAAGGATAGTGGGCACAGAAATAAACACTTTACAGAAAGGGAAGACAATGTCTGTGACCCTGTAAATAGAACATTGGAGGTTCTAGAAACTCCTTGGGTCCAGAGGCCATTCCATGCTCCAGAACCAGGAGCACACCTGGAAGATTGAGTTCATAAAGCAATTTGGAAACACACCCCGTTGAAGTATCAGCAGCTGGACACACTTCCTTTTATTGAACAGAAGCTGAAGTCTTAAGGTTGCAGCGTGAACTGTCCTTTCTGCCTGGACACTGTCTGATTTCCCAAAAGCTAGGATAGGAACAGCGGATCTGCCTCTTGAATAGTCTTGGTGGGTAGAGAAAGCTTCAGCTTGTAGTGCTTGTTTGCTTTTTAACAAGTATTGGCACTTGCTGCAGAACATTAAACTATCAAAATAATTTGTGGCTCTAATTTCTTGCAGAGTGAAGCAAATGCCAGTTACAGCCATATGAAAGAAA
|
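The nucleotide sequence can be checked against the protein sequence by translating it from the first base. The Python sketch below does this with the standard genetic code, stopping at the first stop codon. It assumes the coding region starts at position 1 in frame 1, and the function name `translate` is hypothetical. The example input is only the first 30 bases of the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino
    for codon, amino in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces used for grouping
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino = CODON_TABLE.get(dna[i:i + 3], "X")  # X marks an unknown codon
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the nucleotide sequence in this record.
    print(translate("GTGAGCACGG CCCGCAGCAT CGCCGTGCTG"))
```

The ten residues this yields, VSTARSIAVL, match the start of the protein sequence listed in the record.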
Sequence Source | Ensembl |
Orthology | |
Created Date | 25-Jun-2016 |