Tag |
Content |
WERAM ID |
WERAM-Dar-0157 |
Ensembl Protein ID |
ENSDARP00000092069.4 |
Gene Name |
gtf3c4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HAT |
HAT_other |
0 |
848 |
|
|
|
Organism |
Danio rerio |
Domain Profile |
HAT HAT_other
Query: 1 MELVCDIHSLSQNLVLHRTHIPVPTTTCELKVGSEKDVRAVKESFANSKDPVVSQMIMID 60 +EL+CD+H+ Q+LV+HRT +P P +C LKVGS+ +V KE FA SKDP VSQ M+D Sbjct: 89 LELICDVHNPGQDLVIHRTSVPAPLNSCLLKVGSKTEVAECKEKFAASKDPTVSQTFMLD 148 Query: 61 RVINPQ---CGPLHGVKYTSWSPLGCDKLGRCLLAFLTLDNRLTIHSSHXXXXXXXXXXX 117 RV NP+ P+ G KYTSWSP+GCD GRCLLA LT+DNRLTI ++ Sbjct: 149 RVFNPEGKALPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDL 208 Query: 118 XXXXXXXXKTRKYSAQNDGTPPVSFKDFDELQRRFRMQTPVRMEWSSLCSMQQVQSDNAH 177 Y + P + DF E QRR MQTPVRMEWS +C+ QQV+ +N Sbjct: 209 TEIYGERLYETSYRLSKNEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNEC 268 Query: 178 KDINTVLLAVLMENGDVVVWQFSLPLNGKESVVSCSTIKSGVSSPSALTWWQYEHGGRKM 237 +D+ +VLLAVL ENG++ VWQF LP GKES+ SC+TI+SG++SPS L WW+YEH RKM Sbjct: 269 RDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGITSPSVLFWWEYEHNNRKM 328 Query: 238 SGLIIGSSVGPVKILPVNIKAVKGYFTLRQPVVLWQESDQIPVHNIKCITLFHPHQKCNC 297 SGLI+GS+ GP+KILPVN+KAVKGYFTLRQPV+LW+E DQ+PVH+IKC+ L+HP+QKC+C Sbjct: 329 SGLIVGSAFGPIKILPVNLKAVKGYFTLRQPVILWKEMDQLPVHSIKCVPLYHPYQKCSC 388 Query: 298 SLVVAARGPYIFWCLLLISKAGLNVHNSHITGLHSTPIISMTACHNNGTVLTCSLDGKVK 357 SLVVAARG Y+FWCLLLISKAGLNVHNSH+TGLHS PI+SMTA NGTV TCS DGKV+ Sbjct: 389 SLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVR 448 Query: 358 KLTTVFTDLAVEFKQEEIVLPEGVAGRRIHGISISPNSAYLAVLSNEGMNNGFHPVALNY 417 +L +FTD+A++F+ + I L + R HGI++SP AYLA+++ EGM NG HPV NY Sbjct: 449 QLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAIITTEGMINGLHPVNKNY 508 Query: 418 QVQFITFKTPDEAAAKMLDAHC--LFRNSDLLDLLRWRILKDKSIPSAYENNLEEKLQTS 475 QVQF+T KT +EAAA++L++ LF+ DL+DL+RW+ILKDK IP + LE+K+++S Sbjct: 509 QVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKIESS 568 Query: 476 GSTYLWRLKLFFLRTLHKSMQKTPSEALWQPRRRSGQSLV---------------XXXXX 520 G TY WR KLF LR L++SMQKTPSEALW+P + L+ Sbjct: 569 GVTYFWRFKLFLLRILYQSMQKTPSEALWKPTHEDSKILLVDSPGMGNADDEQQEEGTSS 628 Query: 521 XXXXXXXXXXXXRQGDAEIXX---------------XXXXMKEIGSRIEAVESHLIRENM 565 ++GD E + EI +IEAVE HL RE+M Sbjct: 629 KQVVKQGLQERSKEGDVEEPTDDSLPTTGDAGGREPMEEKLLEIQGKIEAVEMHLTREHM 688 Query: 566 KKVLGEVYLHTCVTQNTRVPTKGVCDFLTNDPANEDRAAKVLIGHIMNKMNKQTFPEYCA 625 K+VLGEVYLHT +T+NT +PT+G+C+FL +D +DR A+VLIGHI KMNKQTFPE+C+ Sbjct: 689 KRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCS 748 Query: 626 LCKETLPFTDCKRAECKNGHRWLRCALSYQACQGVTYRRCLLQDSIASVAEPEDSDWIKK 685 LCKE LPFTD K+A C NGH WLRC L+YQ+CQ + YRRCLL DSIA PED DWIK+ Sbjct: 749 LCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKR 808 Query: 686 ILQGPCIFCDSPFY 699 +LQ PC FCDSP + Sbjct: 809 LLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) | MELVCDIHSL SQNLVLHRTH IPVPTTTCEL KVGSEKDVRA VKESFANSKD PVVSQMIMID 60 RVINPQCGPL HGVKYTSWSP LGCDKLGRCL LAFLTLDNRL TIHSSHLQLQ WTLLLDLTEL 120 YGEMLKTRKY SAQNDGTPPV SFKDFDELQR RFRMQTPVRM EWSSLCSMQQ VQSDNAHKDI 180 NTVLLAVLME NGDVVVWQFS LPLNGKESVV SCSTIKSGVS SPSALTWWQY EHGGRKMSGL 240 IIGSSVGPVK ILPVNIKAVK GYFTLRQPVV LWQESDQIPV HNIKCITLFH PHQKCNCSLV 300 VAARGPYIFW CLLLISKAGL NVHNSHITGL HSTPIISMTA CHNNGTVLTC SLDGKVKKLT 360 TVFTDLAVEF KQEEIVLPEG VAGRRIHGIS ISPNSAYLAV LSNEGMNNGF HPVALNYQVQ 420 FITFKTPDEA AAKMLDAHCL FRNSDLLDLL RWRILKDKSI PSAYENNLEE KLQTSGSTYL 480 WRLKLFFLRT LHKSMQKTPS EALWQPRRRS GQSLVGDGDE GGGGDEDEMM EDRQGDAEIR 540 EREERMKEIG SRIEAVESHL IRENMKKVLG EVYLHTCVTQ NTRVPTKGVC DFLTNDPANE 600 DRAAKVLIGH IMNKMNKQTF PEYCALCKET LPFTDCKRAE CKNGHRWLRC ALSYQACQGV 660 TYRRCLLQDS IASVAEPEDS DWIKKILQGP CIFCDSPFY 699Protein Fasta Sequence
>ENSDARP00000092069.4|HAT_other|Danio rerio MELVCDIHSLSQNLVLHRTHIPVPTTTCELKVGSEKDVRAVKESFANSKDPVVSQMIMIDRVINPQCGPLHGVKYTSWSPLGCDKLGRCLLAFLTLDNRLTIHSSHLQLQWTLLLDLTELYGEMLKTRKYSAQNDGTPPVSFKDFDELQRRFRMQTPVRMEWSSLCSMQQVQSDNAHKDINTVLLAVLMENGDVVVWQFSLPLNGKESVVSCSTIKSGVSSPSALTWWQYEHGGRKMSGLIIGSSVGPVKILPVNIKAVKGYFTLRQPVVLWQESDQIPVHNIKCITLFHPHQKCNCSLVVAARGPYIFWCLLLISKAGLNVHNSHITGLHSTPIISMTACHNNGTVLTCSLDGKVKKLTTVFTDLAVEFKQEEIVLPEGVAGRRIHGISISPNSAYLAVLSNEGMNNGFHPVALNYQVQFITFKTPDEAAAKMLDAHCLFRNSDLLDLLRWRILKDKSIPSAYENNLEEKLQTSGSTYLWRLKLFFLRTLHKSMQKTPSEALWQPRRRSGQSLVGDGDEGGGGDEDEMMEDRQGDAEIREREERMKEIGSRIEAVESHLIRENMKKVLGEVYLHTCVTQNTRVPTKGVCDFLTNDPANEDRAAKVLIGHIMNKMNKQTFPEYCALCKETLPFTDCKRAECKNGHRWLRCALSYQACQGVTYRRCLLQDSIASVAEPEDSDWIKKILQGPCIFCDSPFY
|
Nucleotide Sequence (Fasta) | AAACAGTTCT TCAGAGAAAG CAGGGACCGC GTCTGAAGCC CCGTCAGGAC CGGAGGATGA 60 TTTTTGGGCG GGACAGGGAC CAATAGTGCT GCGGGATCCT GCGATTAAAT TGTTGAGTCC 120 GGTTAGCGGT GTGGAGCCGC TGTCGTGGGC AGAAGATCAC AGACTGGCGG CCTGCAACAC 180 GAACAGCGTG TCTCTGATGG AGCTGGTGTG TGACATACAC AGTCTCAGTC AAAACCTGGT 240 GCTGCATCGC ACTCACATCC CCGTCCCCAC CACCACATGT GAACTCAAGG TGGGCTCAGA 300 AAAAGATGTG AGGGCTGTTA AAGAAAGCTT TGCCAATAGT AAAGACCCAG TAGTAAGCCA 360 AATGATCATG ATAGACCGAG TCATCAATCC TCAATGTGGA CCTCTCCATG GCGTCAAGTA 420 CACTAGTTGG TCACCGTTAG GCTGTGACAA ACTCGGCCGC TGCCTTCTTG CTTTCCTAAC 480 ACTTGACAAT CGTCTCACAA TCCACAGCAG TCACTTGCAA CTGCAGTGGA CGCTACTGCT 540 GGACCTTACA GAACTATATG GCGAAATGTT AAAAACCAGG AAATACTCCG CTCAGAATGA 600 TGGCACACCA CCGGTCTCCT TTAAAGACTT CGATGAACTG CAGCGTCGGT TTCGTATGCA 660 GACACCAGTG CGCATGGAGT GGTCCAGTTT GTGCAGTATG CAGCAAGTTC AGAGCGATAA 720 CGCTCATAAA GATATCAACA CAGTTCTTCT AGCTGTACTC ATGGAGAATG GAGACGTGGT 780 GGTGTGGCAG TTCAGCTTGC CCCTAAACGG AAAGGAATCA GTGGTGTCAT GCAGCACCAT 840 CAAATCTGGT GTGTCGTCAC CAAGCGCGCT CACCTGGTGG CAATATGAGC ACGGCGGGCG 900 GAAAATGAGC GGACTGATCA TAGGGAGCTC CGTAGGACCT GTAAAGATCC TCCCAGTCAA 960 CATTAAGGCA GTGAAGGGTT ACTTCACCCT CAGGCAGCCT GTGGTTTTGT GGCAGGAGAG 1020 CGACCAGATT CCTGTCCACA ACATCAAATG CATCACACTC TTCCATCCGC ACCAGAAATG 1080 CAACTGCAGT TTAGTGGTTG CGGCACGTGG TCCGTATATC TTCTGGTGCT TACTGCTGAT 1140 CTCCAAAGCT GGTCTGAATG TTCACAATTC CCACATCACG GGTCTTCATT CCACCCCTAT 1200 CATATCCATG ACGGCTTGCC ACAATAATGG GACTGTTTTA ACATGCTCAC TAGATGGGAA 1260 GGTTAAGAAG CTAACGACTG TTTTCACAGA CCTAGCTGTC GAGTTCAAGC AAGAAGAGAT 1320 TGTTCTGCCT GAAGGAGTCG CGGGTCGACG GATACACGGT ATATCAATCA GTCCAAACAG 1380 TGCCTACTTG GCTGTTCTCT CCAATGAGGG TATGAACAAT GGCTTCCATC CTGTCGCTCT 1440 CAACTACCAG GTACAGTTCA TCACCTTTAA GACTCCAGAT GAAGCGGCAG CTAAGATGCT 1500 TGATGCTCAC TGTCTCTTTA GAAACTCTGA TTTGCTGGAT TTGTTACGAT GGCGGATACT 1560 AAAAGACAAA AGCATCCCTT CGGCCTACGA GAATAATCTA GAAGAGAAGC TGCAAACCTC 1620 AGGGTCCACT TACCTGTGGC GTCTTAAGCT TTTCTTTCTC AGAACGCTTC ACAAGTCCAT 1680 GCAGAAAACT CCTTCAGAGG CTCTGTGGCA GCCAAGGCGA AGAAGTGGTC AGAGTTTAGT 1740 TGGTGATGGT GATGAAGGGG GTGGAGGTGA TGAAGATGAG ATGATGGAGG ACAGACAGGG 1800 GGATGCAGAA ATTAGGGAAA GAGAGGAGAG AATGAAAGAG ATTGGCAGCA GGATCGAAGC 1860 TGTGGAGTCC CATCTGATCC GGGAGAACAT GAAGAAGGTG CTGGGCGAGG TGTATCTGCA 1920 CACCTGTGTC ACTCAAAACA CACGTGTACC CACTAAAGGT GTGTGTGACT TCCTCACAAA 1980 TGACCCTGCT AATGAAGACA GGGCAGCCAA GGTGCTGATA GGTCATATCA TGAATAAGAT 2040 GAATAAACAG ACATTTCCAG AGTATTGCGC TCTCTGTAAA GAAACTCTGC CCTTCACCGA 2100 CTGCAAGCGG GCAGAGTGCA AGAATGGACA CAGGTGGCTC AGGTGTGCGC TGTCATATCA 2160 GGCCTGTCAG GGTGTGACAT ACCGCCGCTG TCTATTGCAG GACAGTATTG CCTCAGTAGC 2220 AGAGCCTGAG GATTCAGACT GGATCAAGAA AATCCTCCAA GGTCCCTGCA TCTTCTGCGA 2280 TTCTCCTTTC TACTGAAGCC TCGGGGCTAA ATAAAAGTCT CCACGACTGA CTACTGGATC 2340 ACTTAAGCGG GCTACATAAA TGACTAAAAC GCCTTTCTTT ACTTGAAAGA GAAACATTTG 2400 AAAGCCATAA AGTATTCTTT AATTATTTTG TAATTTAAGA CATTCGGGAG ATTTTTCCTC 2460 TCAACAGATC TTTATTTTAA AGCCAAAAAA GGGTCTAATT TTTATACAAT GAAAACTGTA 2520 TATGTTAATA AATTATTCCA CTGTA
2546Nucleotide Fasta Sequence
>ENSDARP00000092069.4|HAT_other|Danio rerio AAACAGTTCTTCAGAGAAAGCAGGGACCGCGTCTGAAGCCCCGTCAGGACCGGAGGATGATTTTTGGGCGGGACAGGGACCAATAGTGCTGCGGGATCCTGCGATTAAATTGTTGAGTCCGGTTAGCGGTGTGGAGCCGCTGTCGTGGGCAGAAGATCACAGACTGGCGGCCTGCAACACGAACAGCGTGTCTCTGATGGAGCTGGTGTGTGACATACACAGTCTCAGTCAAAACCTGGTGCTGCATCGCACTCACATCCCCGTCCCCACCACCACATGTGAACTCAAGGTGGGCTCAGAAAAAGATGTGAGGGCTGTTAAAGAAAGCTTTGCCAATAGTAAAGACCCAGTAGTAAGCCAAATGATCATGATAGACCGAGTCATCAATCCTCAATGTGGACCTCTCCATGGCGTCAAGTACACTAGTTGGTCACCGTTAGGCTGTGACAAACTCGGCCGCTGCCTTCTTGCTTTCCTAACACTTGACAATCGTCTCACAATCCACAGCAGTCACTTGCAACTGCAGTGGACGCTACTGCTGGACCTTACAGAACTATATGGCGAAATGTTAAAAACCAGGAAATACTCCGCTCAGAATGATGGCACACCACCGGTCTCCTTTAAAGACTTCGATGAACTGCAGCGTCGGTTTCGTATGCAGACACCAGTGCGCATGGAGTGGTCCAGTTTGTGCAGTATGCAGCAAGTTCAGAGCGATAACGCTCATAAAGATATCAACACAGTTCTTCTAGCTGTACTCATGGAGAATGGAGACGTGGTGGTGTGGCAGTTCAGCTTGCCCCTAAACGGAAAGGAATCAGTGGTGTCATGCAGCACCATCAAATCTGGTGTGTCGTCACCAAGCGCGCTCACCTGGTGGCAATATGAGCACGGCGGGCGGAAAATGAGCGGACTGATCATAGGGAGCTCCGTAGGACCTGTAAAGATCCTCCCAGTCAACATTAAGGCAGTGAAGGGTTACTTCACCCTCAGGCAGCCTGTGGTTTTGTGGCAGGAGAGCGACCAGATTCCTGTCCACAACATCAAATGCATCACACTCTTCCATCCGCACCAGAAATGCAACTGCAGTTTAGTGGTTGCGGCACGTGGTCCGTATATCTTCTGGTGCTTACTGCTGATCTCCAAAGCTGGTCTGAATGTTCACAATTCCCACATCACGGGTCTTCATTCCACCCCTATCATATCCATGACGGCTTGCCACAATAATGGGACTGTTTTAACATGCTCACTAGATGGGAAGGTTAAGAAGCTAACGACTGTTTTCACAGACCTAGCTGTCGAGTTCAAGCAAGAAGAGATTGTTCTGCCTGAAGGAGTCGCGGGTCGACGGATACACGGTATATCAATCAGTCCAAACAGTGCCTACTTGGCTGTTCTCTCCAATGAGGGTATGAACAATGGCTTCCATCCTGTCGCTCTCAACTACCAGGTACAGTTCATCACCTTTAAGACTCCAGATGAAGCGGCAGCTAAGATGCTTGATGCTCACTGTCTCTTTAGAAACTCTGATTTGCTGGATTTGTTACGATGGCGGATACTAAAAGACAAAAGCATCCCTTCGGCCTACGAGAATAATCTAGAAGAGAAGCTGCAAACCTCAGGGTCCACTTACCTGTGGCGTCTTAAGCTTTTCTTTCTCAGAACGCTTCACAAGTCCATGCAGAAAACTCCTTCAGAGGCTCTGTGGCAGCCAAGGCGAAGAAGTGGTCAGAGTTTAGTTGGTGATGGTGATGAAGGGGGTGGAGGTGATGAAGATGAGATGATGGAGGACAGACAGGGGGATGCAGAAATTAGGGAAAGAGAGGAGAGAATGAAAGAGATTGGCAGCAGGATCGAAGCTGTGGAGTCCCATCTGATCCGGGAGAACATGAAGAAGGTGCTGGGCGAGGTGTATCTGCACACCTGTGTCACTCAAAACACACGTGTACCCACTAAAGGTGTGTGTGACTTCCTCACAAATGACCCTGCTAATGAAGACAGGGCAGCCAAGGTGCTGATAGGTCATATCATGAATAAGATGAATAAACAGACATTTCCAGAGTATTGCGCTCTCTGTAAAGAAACTCTGCCCTTCACCGACTGCAAGCGGGCAGAGTGCAAGAATGGACACAGGTGGCTCAGGTGTGCGCTGTCATATCAGGCCTGTCAGGGTGTGACATACCGCCGCTGTCTATTGCAGGACAGTATTGCCTCAGTAGCAGAGCCTGAGGATTCAGACTGGATCAAGAAAATCCTCCAAGGTCCCTGCATCTTCTGCGATTCTCCTTTCTACTGAAGCCTCGGGGCTAAATAAAAGTCTCCACGACTGACTACTGGATCACTTAAGCGGGCTACATAAATGACTAAAACGCCTTTCTTTACTTGAAAGAGAAACATTTGAAAGCCATAAAGTATTCTTTAATTATTTTGTAATTTAAGACATTCGGGAGATTTTTCCTCTCAACAGATCTTTATTTTAAAGCCAAAAAAGGGTCTAATTTTTATACAATGAAAACTGTATATGTTAATAAATTATTCCACTGTA
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |