Tag |
Content |
WERAM ID |
WERAM-Ect-0059 |
Ensembl Protein ID |
ENSETEP00000006428.1 |
Gene Name |
GTF3C4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HAT |
HAT_other |
0 |
1414 |
|
|
|
Organism |
Echinops telfairi |
Domain Profile |
HAT HAT_other
Query: 1 MSTANQARVEPETDXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFRLMVTRREPAVKL 60 M+TA+QARV P D FRLMVTRREPAVKL Sbjct: 1 MNTADQARVGPADDGPAPSGEEEGEGGGEAGGKEPAADAAPGPSAAFRLMVTRREPAVKL 60 Query: 61 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCFLKV 120 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSC LKV Sbjct: 61 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 120 Query: 121 GSKKEVAECKEKFATSKDPVVSQTFMLDRVFNPEGKALPPLRGFKYTSWSPMGCDANGRC 180 GSK EVAECKEKFA SKDP VSQTFMLDRVFNPEGKALPP+RGFKYTSWSPMGCDANGRC Sbjct: 121 GSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRC 180 Query: 181 LLAALTMDNRLTIQANLSRLQWVQLVDLTEIYGERLYETSYRLSKTETSEGNLSDFAEFQ 240 LLAALTMDNRLTIQANL+RLQWVQLVDLTEIYGERLYETSYRLSK E EGNL DFAEFQ Sbjct: 181 LLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQ 240 Query: 241 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 300 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI Sbjct: 241 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 300 Query: 301 SSCNTIESGINSPSVLFWWEYEHNNRKMSGLIVGSDFGPVKILPVNLKAVKGYFTLRQPV 360 SSCNTIESGI SPSVLFWWEYEHNNRKMSGLIVGS FGP+KILPVNLKAVKGYFTLRQPV Sbjct: 301 SSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQPV 360 Query: 361 VLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 +LWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG Sbjct: 361 ILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 Query: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLADVFGSVRTHGI 480 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKL+DVFGSVRTHGI Sbjct: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI 480 Query: 481 AVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 540 AVSPCGAYLA+ITTEGM+NGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI Sbjct: 481 AVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 540 Query: 541 DLVRWKILKDKHIPQFLHEALEGKIESSGATYFWRFKLFLLRILYQSMQKIPSEPLWKPS 600 DLVRWKILKDKHIPQFL EALE KIESSG TYFWRFKLFLLRILYQSMQK PSE LWKP+ Sbjct: 541 DLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKPT 600 Query: 601 NEDSKILLLDSP-----EDEQQEEGTSSKQVNKQSLQEKSKEGEGEEPTDDPLAQPADAG 655 +EDSKILL+DSP +DEQQEEGTSSKQV KQ LQE+SKEG+ EEPTDD L DAG Sbjct: 601 HEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQERSKEGDVEEPTDDSLPTTGDAG 660 Query: 656 SREPLEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE 715 REP+EEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE Sbjct: 661 GREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE 720 Query: 716 EYDDRTARXXXXXXXXXXXXXXXXXXXXXXXXXXXFTDRKQKGCSNGHIWLRCFLTYQSC 775 EYDDRTAR FTDRKQ CSNGHIWLRCFLTYQSC Sbjct: 721 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC 780 Query: 776 QSXXXXXXXXXXXXXXXXXXXXPDWIKRLLQSPCPFCDSPVF 817 QS PDWIKRLLQSPCPFCDSPVF Sbjct: 781 QSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) | MSTANQARVE PETDGPAPSG EEEGEGGGEA GAKEPAADAA PGPSASFRLM VTRREPAVKL 60 QYAVSGLEPL AWSEDHRVSV STARSIAVLE LICDVHNPGQ DLVIHRTSVP APLNSCFLKV 120 GSKKEVAECK EKFATSKDPV VSQTFMLDRV FNPEGKALPP LRGFKYTSWS PMGCDANGRC 180 LLAALTMDNR LTIQANLSRL QWVQLVDLTE IYGERLYETS YRLSKTETSE GNLSDFAEFQ 240 RRHSMQTPVR MEWSGICTTQ QVKHNNECRD VGSVLLAVLF ENGNIAVWQF QLPFVGKESI 300 SSCNTIESGI NSPSVLFWWE YEHNNRKMSG LIVGSDFGPV KILPVNLKAV KGYFTLRQPV 360 VLWKEMDQLP VHSIKCVPLY HPYQKCSCSL VVAARGSYVF WCLLLISKAG LNVHNSHVTG 420 LHSLPIVSMT ADKQNGTVYT CSSDGKVRQL IPIFTDVALK FEHQLIKLAD VFGSVRTHGI 480 AVSPCGAYLA VITTEGMVNG LHPVNKNYQV QFVTLKTFEE AAAQLLESSV QNLFKQVDLI 540 DLVRWKILKD KHIPQFLHEA LEGKIESSGA TYFWRFKLFL LRILYQSMQK IPSEPLWKPS 600 NEDSKILLLD SPEDEQQEEG TSSKQVNKQS LQEKSKEGEG EEPTDDPLAQ PADAGSREPL 660 EEKLLEIQGK IEAVEMHLTR EHMKRVLGEV YLHTWITENT SIPTRGLCNF LMSDEEYDDR 720 TARXXXXXXX XXXXXXXXXX XXXXXXXXXX FTDRKQKGCS NGHIWLRCFL TYQSCQSXXX 780 XXXXXXXXXX XXXXXXXPDW IKRLLQSPCP FCDSPVF 817Protein Fasta Sequence
>ENSETEP00000006428.1|HAT_other|Echinops telfairi MSTANQARVEPETDGPAPSGEEEGEGGGEAGAKEPAADAAPGPSASFRLMVTRREPAVKLQYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCFLKVGSKKEVAECKEKFATSKDPVVSQTFMLDRVFNPEGKALPPLRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLSRLQWVQLVDLTEIYGERLYETSYRLSKTETSEGNLSDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGINSPSVLFWWEYEHNNRKMSGLIVGSDFGPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLADVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLHEALEGKIESSGATYFWRFKLFLLRILYQSMQKIPSEPLWKPSNEDSKILLLDSPEDEQQEEGTSSKQVNKQSLQEKSKEGEGEEPTDDPLAQPADAGSREPLEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARXXXXXXXXXXXXXXXXXXXXXXXXXXXFTDRKQKGCSNGHIWLRCFLTYQSCQSXXXXXXXXXXXXXXXXXXXXPDWIKRLLQSPCPFCDSPVF
|
Nucleotide Sequence (Fasta) | ATGTCCACCG CCAACCAGGC CCGGGTGGAG CCCGAGACCG ACGGGCCTGC GCCATCCGGG 60 GAGGAGGAGG GAGAAGGGGG CGGCGAGGCG GGCGCGAAGG AGCCTGCGGC GGACGCGGCT 120 CCCGGGCCCA GTGCCTCTTT CCGCCTAATG GTCACTCGGC GGGAGCCCGC CGTGAAGCTG 180 CAGTACGCCG TGAGCGGCCT GGAGCCGCTG GCCTGGTCGG AGGACCACCG CGTGTCGGTG 240 TCCACCGCCC GCAGCATCGC CGTGCTGGAG CTCATCTGCG ACGTGCACAA CCCGGGCCAG 300 GACCTGGTCA TCCATCGCAC CTCCGTGCCT GCCCCTCTCA ACAGCTGCTT CCTCAAAGTT 360 GGATCAAAAA AAGAAGTTGC TGAGTGTAAG GAGAAATTTG CCACCTCTAA AGACCCTGTA 420 GTCAGTCAGA CTTTCATGTT GGATAGGGTG TTCAATCCTG AAGGGAAGGC TTTGCCGCCA 480 CTGAGAGGAT TCAAATACAC CAGCTGGTCG CCCATGGGTT GTGATGCTAA TGGCAGGTGC 540 CTGTTGGCCG CACTCACCAT GGACAACCGC CTGACCATCC AGGCGAACCT CAGCCGGCTA 600 CAGTGGGTCC AGCTTGTGGA CCTGACTGAG ATTTATGGCG AACGCCTATA TGAGACCAGT 660 TACAGGCTTT CTAAGACCGA AACCTCTGAA GGAAACCTTT CGGACTTCGC TGAGTTTCAG 720 AGGAGGCACA GCATGCAGAC CCCAGTCAGA ATGGAGTGGT CGGGCATCTG TACCACCCAG 780 CAGGTCAAGC ATAACAACGA GTGTCGTGAT GTGGGGAGCG TGCTCCTGGC CGTGCTCTTT 840 GAAAATGGCA ACATTGCTGT GTGGCAGTTC CAGCTGCCGT TTGTGGGAAA GGAATCCATC 900 TCTTCCTGCA ACACAATTGA GTCAGGGATC AACTCTCCTA GTGTTCTGTT TTGGTGGGAA 960 TATGAGCACA ATAATCGGAA AATGAGTGGC CTTATTGTGG GAAGCGATTT CGGACCGGTG 1020 AAAATTCTTC CCGTCAATCT CAAAGCAGTT AAAGGCTACT TCACGCTAAG GCAGCCTGTT 1080 GTCTTGTGGA AAGAAATGGA CCAGTTGCCT GTGCACAGCA TTAAGTGTGT GCCACTCTAC 1140 CATCCTTACC AGAAGTGCAG CTGTAGTTTA GTGGTGGCTG CCAGAGGGTC CTATGTGTTT 1200 TGGTGTCTCC TTCTGATTTC CAAAGCAGGT CTGAACGTTC ACAATTCCCA TGTCACCGGC 1260 CTTCATTCGC TGCCCATTGT CTCAATGACT GCAGACAAAC AGAATGGAAC GGTATATACA 1320 TGTTCCAGTG ATGGGAAGGT GAGGCAGTTG ATTCCCATAT TCACGGATGT TGCATTAAAG 1380 TTTGAACACC AGTTGATTAA GCTTGCAGAT GTGTTTGGCT CCGTGAGGAC TCACGGGATA 1440 GCAGTGAGCC CTTGCGGTGC ATATCTGGCC GTCATTACCA CCGAGGGCAT GGTCAATGGC 1500 CTCCACCCCG TTAACAAAAA CTACCAGGTC CAGTTCGTTA CCTTGAAAAC CTTTGAAGAG 1560 GCAGCTGCTC AGCTCCTGGA ATCGTCCGTT CAAAATCTCT TCAAACAGGT GGATTTAATA 1620 GATCTAGTAC GGTGGAAGAT TTTAAAAGAT AAACACATCC CCCAATTTTT ACACGAAGCT 1680 TTGGAAGGAA AGATCGAAAG CAGTGGGGCC ACCTATTTTT GGCGGTTCAA GCTGTTCCTC 1740 CTGAGGATCT TATATCAGTC GATGCAGAAA ATTCCCTCGG AACCCTTATG GAAGCCCTCC 1800 AACGAGGACT CAAAAATCTT ACTGCTCGAC TCACCTGAGG ATGAACAGCA AGAAGAAGGC 1860 ACTTCTTCGA AACAGGTGAA CAAGCAGAGC CTTCAGGAGA AAAGCAAAGA AGGTGAGGGA 1920 GAGGAGCCTA CCGATGACCC CCTTGCCCAG CCCGCCGACG CTGGGAGCCG TGAGCCATTG 1980 GAGGAGAAAC TCCTGGAAAT CCAAGGGAAG ATTGAAGCTG TGGAGATGCA CTTGACGCGG 2040 GAACACATGA AGCGCGTCTT AGGAGAAGTC TACCTGCACA CCTGGATCAC AGAGAACACG 2100 AGCATCCCTA CCCGAGGACT CTGTAACTTC TTGATGTCCG ATGAAGAGTA CGATGACAGA 2160 ACAGCTCGGN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 2220 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN TTCACAGACC GAAAACAGAA AGGATGTTCC 2280 AACGGCCACA TTTGGCTCCG GTGCTTTTTA ACCTACCAGT CCTGCCAGAG TTTNNNNNNN 2340 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNA TCCTGACTGG 2400 ATTAAGCGGC TACTGCAGAG CCCCTGTCCT TTCTGTGACT CTCCTGTCTT CTAA
2455Nucleotide Fasta Sequence
>ENSETEP00000006428.1|HAT_other|Echinops telfairi ATGTCCACCGCCAACCAGGCCCGGGTGGAGCCCGAGACCGACGGGCCTGCGCCATCCGGGGAGGAGGAGGGAGAAGGGGGCGGCGAGGCGGGCGCGAAGGAGCCTGCGGCGGACGCGGCTCCCGGGCCCAGTGCCTCTTTCCGCCTAATGGTCACTCGGCGGGAGCCCGCCGTGAAGCTGCAGTACGCCGTGAGCGGCCTGGAGCCGCTGGCCTGGTCGGAGGACCACCGCGTGTCGGTGTCCACCGCCCGCAGCATCGCCGTGCTGGAGCTCATCTGCGACGTGCACAACCCGGGCCAGGACCTGGTCATCCATCGCACCTCCGTGCCTGCCCCTCTCAACAGCTGCTTCCTCAAAGTTGGATCAAAAAAAGAAGTTGCTGAGTGTAAGGAGAAATTTGCCACCTCTAAAGACCCTGTAGTCAGTCAGACTTTCATGTTGGATAGGGTGTTCAATCCTGAAGGGAAGGCTTTGCCGCCACTGAGAGGATTCAAATACACCAGCTGGTCGCCCATGGGTTGTGATGCTAATGGCAGGTGCCTGTTGGCCGCACTCACCATGGACAACCGCCTGACCATCCAGGCGAACCTCAGCCGGCTACAGTGGGTCCAGCTTGTGGACCTGACTGAGATTTATGGCGAACGCCTATATGAGACCAGTTACAGGCTTTCTAAGACCGAAACCTCTGAAGGAAACCTTTCGGACTTCGCTGAGTTTCAGAGGAGGCACAGCATGCAGACCCCAGTCAGAATGGAGTGGTCGGGCATCTGTACCACCCAGCAGGTCAAGCATAACAACGAGTGTCGTGATGTGGGGAGCGTGCTCCTGGCCGTGCTCTTTGAAAATGGCAACATTGCTGTGTGGCAGTTCCAGCTGCCGTTTGTGGGAAAGGAATCCATCTCTTCCTGCAACACAATTGAGTCAGGGATCAACTCTCCTAGTGTTCTGTTTTGGTGGGAATATGAGCACAATAATCGGAAAATGAGTGGCCTTATTGTGGGAAGCGATTTCGGACCGGTGAAAATTCTTCCCGTCAATCTCAAAGCAGTTAAAGGCTACTTCACGCTAAGGCAGCCTGTTGTCTTGTGGAAAGAAATGGACCAGTTGCCTGTGCACAGCATTAAGTGTGTGCCACTCTACCATCCTTACCAGAAGTGCAGCTGTAGTTTAGTGGTGGCTGCCAGAGGGTCCTATGTGTTTTGGTGTCTCCTTCTGATTTCCAAAGCAGGTCTGAACGTTCACAATTCCCATGTCACCGGCCTTCATTCGCTGCCCATTGTCTCAATGACTGCAGACAAACAGAATGGAACGGTATATACATGTTCCAGTGATGGGAAGGTGAGGCAGTTGATTCCCATATTCACGGATGTTGCATTAAAGTTTGAACACCAGTTGATTAAGCTTGCAGATGTGTTTGGCTCCGTGAGGACTCACGGGATAGCAGTGAGCCCTTGCGGTGCATATCTGGCCGTCATTACCACCGAGGGCATGGTCAATGGCCTCCACCCCGTTAACAAAAACTACCAGGTCCAGTTCGTTACCTTGAAAACCTTTGAAGAGGCAGCTGCTCAGCTCCTGGAATCGTCCGTTCAAAATCTCTTCAAACAGGTGGATTTAATAGATCTAGTACGGTGGAAGATTTTAAAAGATAAACACATCCCCCAATTTTTACACGAAGCTTTGGAAGGAAAGATCGAAAGCAGTGGGGCCACCTATTTTTGGCGGTTCAAGCTGTTCCTCCTGAGGATCTTATATCAGTCGATGCAGAAAATTCCCTCGGAACCCTTATGGAAGCCCTCCAACGAGGACTCAAAAATCTTACTGCTCGACTCACCTGAGGATGAACAGCAAGAAGAAGGCACTTCTTCGAAACAGGTGAACAAGCAGAGCCTTCAGGAGAAAAGCAAAGAAGGTGAGGGAGAGGAGCCTACCGATGACCCCCTTGCCCAGCCCGCCGACGCTGGGAGCCGTGAGCCATTGGAGGAGAAACTCCTGGAAATCCAAGGGAAGATTGAAGCTGTGGAGATGCACTTGACGCGGGAACACATGAAGCGCGTCTTAGGAGAAGTCTACCTGCACACCTGGATCACAGAGAACACGAGCATCCCTACCCGAGGACTCTGTAACTTCTTGATGTCCGATGAAGAGTACGATGACAGAACAGCTCGGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNTTCACAGACCGAAAACAGAAAGGATGTTCCAACGGCCACATTTGGCTCCGGTGCTTTTTAACCTACCAGTCCTGCCAGAGTTTNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNATCCTGACTGGATTAAGCGGCTACTGCAGAGCCCCTGTCCTTTCTGTGACTCTCCTGTCTTCTAA
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |