Tag |
Content |
WERAM ID |
WERAM-Fia-0020 |
Ensembl Protein ID |
ENSFALP00000001535.1 |
Gene Name |
GTF3C4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HAT | HAT_other | 0 | 1261 | | |
|
Organism |
Ficedula albicollis |
Domain Profile |
HAT HAT_other
Query: 1 MLDRVFNPEGKSLTPMRGFKYSSWSPLGCDANGRCLLAALTMDNRLTIHANLNRLQWVQL 60
MLDRVFNPEGK+L PMRGFKY+SWSP+GCDANGRCLLAALTMDNRLTI ANLNRLQWVQL
Sbjct: 146 MLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNRLQWVQL 205

Query: 61 VDLTELYGERLLEASYRLCKADTPCGELGDFPEFQRRHSMQAPVRMEWSGICTTQQVKHN 120
VDLTE+YGERL E SYRL K + P G LGDF EFQRRHSMQ PVRMEWSGICTTQQVKHN
Sbjct: 206 VDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHN 265

Query: 121 NECRDVGSVLLAVLFENGNIAIWQFQLPFLGKESITSCNTIESGIGSPSVLSWWEYEHNN 180
NECRDVGSVLLAVLFENGNIA+WQFQLPF+GKESI+SCNTIESGI SPSVL WWEYEHNN
Sbjct: 266 NECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGITSPSVLFWWEYEHNN 325

Query: 181 RKMSGLIVGSAYGPVKIIPVNLKAVKGYFTLRQPVVLWQEMDQLPVHSIKCIPLYHPYQK 240
RKMSGLIVGSA+GP+KI+PVNLKAVKGYFTLRQPV+LW+EMDQLPVHSIKC+PLYHPYQK
Sbjct: 326 RKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQPVILWKEMDQLPVHSIKCVPLYHPYQK 385

Query: 241 CSCSLVVAARGPYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDG 300
CSCSLVVAARG YVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDG
Sbjct: 386 CSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDG 445

Query: 301 KVRQLIPIFTDVALKFEHQLIKLSEVFGCVRTHGIAVSPCGAYLAVITTEGMANGLHPVN 360
KVRQLIPIFTDVALKFEHQLIKLS+VFG VRTHGIAVSPCGAYLA+ITTEGM NGLHPVN
Sbjct: 446 KVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAIITTEGMINGLHPVN 505

Query: 361 KNYQVQFVTLKTFEEAAAQLLESSVQNLFRQVDLTDLVRWKILKDKHIPQFLQEALDKKI 420
KNYQVQFVTLKTFEEAAAQLLESSVQNLF+QVDL DLVRWKILKDKHIPQFLQEAL+KKI
Sbjct: 506 KNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKI 565

Query: 421 ESCGSTYFWRFKLFLLRILYQSMQKAPSEVTWRPSHEDAKILISDSPGMGSTEDD-QEEG 479
ES G TYFWRFKLFLLRILYQSMQK PSE W+P+HED+KIL+ DSPGMG+ +D+ QEEG
Sbjct: 566 ESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKPTHEDSKILLVDSPGMGNADDEQQEEG 625

Query: 480 T-SKQASKQSLGDTGKGVDIDDPAEDSLHQSSDTGGREPMVEKLLEIQAQIEAVEMHLTR 538
T SKQ KQ L + K D+++P +DSL + D GGREPM EKLLEIQ +IEAVEMHLTR
Sbjct: 626 TSSKQVVKQGLQERSKEGDVEEPTDDSLPTTGDAGGREPMEEKLLEIQGKIEAVEMHLTR 685

Query: 539 EHMKRVLGEVYLHTWITENTSIPTRGVCDFLMSDDGYDDRTARVLIGHILKKMNKQTFPE 598
EHMKRVLGEVYLHTWITENTSIPTRG+C+FLMSD+ YDDRTARVLIGHI KKMNKQTFPE
Sbjct: 686 EHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPE 745

Query: 599 HCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLVYRRCLLHDSIARHPTPEDPEW 658
HCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSL+YRRCLLHDSIARHP PEDP+W
Sbjct: 746 HCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDW 805

Query: 659 IKRLLQGPCTFCDSPVF 675
IKRLLQ PC FCDSPVF
Sbjct: 806 IKRLLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) |
MLDRVFNPEG KSLTPMRGFK YSSWSPLGCD ANGRCLLAAL TMDNRLTIHA NLNRLQWVQL 60
VDLTELYGER LLEASYRLCK ADTPCGELGD FPEFQRRHSM QAPVRMEWSG ICTTQQVKHN 120
NECRDVGSVL LAVLFENGNI AIWQFQLPFL GKESITSCNT IESGIGSPSV LSWWEYEHNN 180
RKMSGLIVGS AYGPVKIIPV NLKAVKGYFT LRQPVVLWQE MDQLPVHSIK CIPLYHPYQK 240
CSCSLVVAAR GPYVFWCLLL ISKAGLNVHN SHVTGLHSLP IVSMTADKQN GTVYTCSSDG 300
KVRQLIPIFT DVALKFEHQL IKLSEVFGCV RTHGIAVSPC GAYLAVITTE GMANGLHPVN 360
KNYQVQFVTL KTFEEAAAQL LESSVQNLFR QVDLTDLVRW KILKDKHIPQ FLQEALDKKI 420
ESCGSTYFWR FKLFLLRILY QSMQKAPSEV TWRPSHEDAK ILISDSPGMG STEDDQEEGT 480
SKQASKQSLG DTGKGVDIDD PAEDSLHQSS DTGGREPMVE KLLEIQAQIE AVEMHLTREH 540
MKRVLGEVYL HTWITENTSI PTRGVCDFLM SDDGYDDRTA RVLIGHILKK MNKQTFPEHC 600
SLCKEILPFT DRKQAVCSNG HIWLRCFLTY QSCQSLVYRR CLLHDSIARH PTPEDPEWIK 660
RLLQGPCTFC DSPVF 675
>ENSFALP00000001535.1|HAT_other|Ficedula albicollis
MLDRVFNPEGKSLTPMRGFKYSSWSPLGCDANGRCLLAALTMDNRLTIHANLNRLQWVQLVDLTELYGERLLEASYRLCKADTPCGELGDFPEFQRRHSMQAPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAIWQFQLPFLGKESITSCNTIESGIGSPSVLSWWEYEHNNRKMSGLIVGSAYGPVKIIPVNLKAVKGYFTLRQPVVLWQEMDQLPVHSIKCIPLYHPYQKCSCSLVVAARGPYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSEVFGCVRTHGIAVSPCGAYLAVITTEGMANGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFRQVDLTDLVRWKILKDKHIPQFLQEALDKKIESCGSTYFWRFKLFLLRILYQSMQKAPSEVTWRPSHEDAKILISDSPGMGSTEDDQEEGTSKQASKQSLGDTGKGVDIDDPAEDSLHQSSDTGGREPMVEKLLEIQAQIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGVCDFLMSDDGYDDRTARVLIGHILKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLVYRRCLLHDSIARHPTPEDPEWIKRLLQGPCTFCDSPVF
|
Nucleotide Sequence (Fasta) |
TCCAGGGAAG TCTTCCTAAA AAACCAGCTG TACAATTGTT TTAAAGGTTC TTTTTGTTCT 60
CTTTTAGGTT GGCCCAAAGA AGGAGGTGGC TGAGTGTAAG GAGAAATTCG CCAGCTCAGT 120
GGATCCCACC GTCAGCCAGA CGTTTATGCT GGACCGAGTG TTCAATCCCG AGGGCAAGTC 180
CCTGACACCC ATGCGGGGGT TCAAGTATTC CAGCTGGTCC CCTCTGGGCT GCGATGCCAA 240
CGGGAGGTGC CTGCTGGCAG CCCTGACCAT GGACAACCGC CTGACCATCC ACGCCAACCT 300
CAACCGGCTG CAGTGGGTGC AGCTGGTGGA CCTGACCGAG CTGTACGGCG AGCGGCTGCT 360
GGAGGCCAGT TACAGGCTGT GCAAGGCTGA CACTCCCTGC GGGGAGCTGG GAGACTTCCC 420
CGAGTTCCAG CGGCGGCACA GCATGCAGGC CCCGGTGCGC ATGGAGTGGT CGGGCATCTG 480
CACCACCCAG CAGGTCAAGC ACAACAACGA GTGCCGGGAC GTGGGCAGCG TGCTGCTGGC 540
CGTGCTCTTC GAGAACGGCA ACATCGCCAT CTGGCAGTTC CAGCTGCCCT TTCTGGGGAA 600
GGAATCCATC ACTTCCTGCA ACACCATCGA GTCGGGAATC GGCTCCCCCA GCGTGTTGTC 660
CTGGTGGGAG TACGAGCACA ACAACCGGAA GATGAGCGGG CTGATCGTGG GCAGCGCCTA 720
TGGCCCGGTG AAGATCATCC CTGTCAACCT CAAGGCGGTC AAAGGCTACT TCACACTGAG 780
ACAGCCCGTG GTCTTGTGGC AGGAGATGGA CCAGCTGCCC GTGCACAGCA TCAAATGCAT 840
CCCTCTCTAC CACCCCTACC AGAAATGCAG CTGCAGCCTG GTGGTGGCTG CCAGAGGCCC 900
TTACGTGTTC TGGTGCCTCC TGCTGATATC CAAAGCGGGG CTGAACGTCC ACAATTCCCA 960
CGTGACAGGG CTCCACTCCT TGCCCATTGT CTCCATGACT GCGGACAAGC AGAATGGCAC 1020
AGTGTACACC TGCTCCAGCG ATGGCAAGGT CAGGCAGCTC ATCCCCATAT TCACAGACGT 1080
TGCTTTGAAG TTTGAGCACC AGCTGATCAA GCTCTCAGAG GTGTTTGGCT GTGTCAGGAC 1140
TCATGGAATT GCTGTCAGCC CCTGCGGGGC ATACCTGGCA GTCATCACCA CTGAGGGCAT 1200
GGCCAACGGG CTGCACCCCG TCAACAAAAA CTACCAGGTG CAGTTTGTCA CCCTCAAGAC 1260
TTTTGAGGAG GCAGCTGCAC AGCTCTTGGA ATCTTCCGTT CAGAACCTTT TCCGGCAAGT 1320
GGACTTGACG GATCTTGTAC GCTGGAAAAT CTTGAAGGAT AAGCACATTC CTCAGTTCTT 1380
ACAGGAAGCA CTGGATAAAA AGATAGAGAG CTGTGGTTCC ACTTATTTCT GGAGGTTTAA 1440
GTTGTTTCTC TTGAGGATTT TGTACCAGTC GATGCAGAAA GCCCCCTCAG AGGTCACGTG 1500
GAGACCTTCA CACGAGGATG CCAAAATCTT GATATCAGAT TCCCCTGGGA TGGGCAGCAC 1560
TGAAGATGAT CAAGAGGAAG GAACTTCGAA ACAAGCCAGC AAGCAAAGCC TGGGGGACAC 1620
AGGCAAAGGT GTGGACATTG ATGACCCTGC AGAGGATTCT CTCCATCAAT CAAGTGACAC 1680
TGGAGGCCGT GAGCCAATGG TAGAAAAGCT TCTTGAAATA CAGGCACAGA TTGAGGCAGT 1740
AGAAATGCAC TTGACACGAG AACACATGAA AAGGGTGCTG GGAGAAGTTT ACCTACACAC 1800
GTGGATTACA GAGAACACCA GCATTCCCAC CAGAGGGGTC TGTGACTTCT TAATGTCCGA 1860
CGATGGCTAC GATGACAGAA CAGCACGAGT GCTGATTGGG CACATCCTGA AGAAGATGAA 1920
CAAACAGACC TTCCCTGAGC ACTGCAGCTT GTGCAAGGAG ATCCTGCCCT TCACAGACCG 1980
CAAACAGGCC GTGTGCTCCA ACGGCCACAT CTGGCTCAGG TGCTTTCTAA CCTACCAGTC 2040
CTGCCAGAGT TTGGTGTACA GGAGGTGTTT GCTTCATGAC AGCATTGCAC GGCACCCAAC 2100
CCCAGAAGAT CCTGAATGGA TCAAGAGGTT ATTGCAGGGA CCTTGCACCT TCTGTGATTC 2160
TCCTGTGTTC TAG 2174
>ENSFALP00000001535.1|HAT_other|Ficedula albicollis
TCCAGGGAAGTCTTCCTAAAAAACCAGCTGTACAATTGTTTTAAAGGTTCTTTTTGTTCTCTTTTAGGTTGGCCCAAAGAAGGAGGTGGCTGAGTGTAAGGAGAAATTCGCCAGCTCAGTGGATCCCACCGTCAGCCAGACGTTTATGCTGGACCGAGTGTTCAATCCCGAGGGCAAGTCCCTGACACCCATGCGGGGGTTCAAGTATTCCAGCTGGTCCCCTCTGGGCTGCGATGCCAACGGGAGGTGCCTGCTGGCAGCCCTGACCATGGACAACCGCCTGACCATCCACGCCAACCTCAACCGGCTGCAGTGGGTGCAGCTGGTGGACCTGACCGAGCTGTACGGCGAGCGGCTGCTGGAGGCCAGTTACAGGCTGTGCAAGGCTGACACTCCCTGCGGGGAGCTGGGAGACTTCCCCGAGTTCCAGCGGCGGCACAGCATGCAGGCCCCGGTGCGCATGGAGTGGTCGGGCATCTGCACCACCCAGCAGGTCAAGCACAACAACGAGTGCCGGGACGTGGGCAGCGTGCTGCTGGCCGTGCTCTTCGAGAACGGCAACATCGCCATCTGGCAGTTCCAGCTGCCCTTTCTGGGGAAGGAATCCATCACTTCCTGCAACACCATCGAGTCGGGAATCGGCTCCCCCAGCGTGTTGTCCTGGTGGGAGTACGAGCACAACAACCGGAAGATGAGCGGGCTGATCGTGGGCAGCGCCTATGGCCCGGTGAAGATCATCCCTGTCAACCTCAAGGCGGTCAAAGGCTACTTCACACTGAGACAGCCCGTGGTCTTGTGGCAGGAGATGGACCAGCTGCCCGTGCACAGCATCAAATGCATCCCTCTCTACCACCCCTACCAGAAATGCAGCTGCAGCCTGGTGGTGGCTGCCAGAGGCCCTTACGTGTTCTGGTGCCTCCTGCTGATATCCAAAGCGGGGCTGAACGTCCACAATTCCCACGTGACAGGGCTCCACTCCTTGCCCATTGTCTCCATGACTGCGGACAAGCAGAATGGCACAGTGTACACCTGCTCCAGCGATGGCAAGGTCAGGCAGCTCATCCCCATATTCACAGACGTTGCTTTGAAGTTTGAGCACCAGCTGATCAAGCTCTCAGAGGTGTTTGGCTGTGTCAGGACTCATGGAATTGCTGTCAGCCCCTGCGGGGCATACCTGGCAGTCATCACCACTGAGGGCATGGCCAACGGGCTGCACCCCGTCAACAAAAACTACCAGGTGCAGTTTGTCACCCTCAAGACTTTTGAGGAGGCAGCTGCACAGCTCTTGGAATCTTCCGTTCAGAACCTTTTCCGGCAAGTGGACTTGACGGATCTTGTACGCTGGAAAATCTTGAAGGATAAGCACATTCCTCAGTTCTTACAGGAAGCACTGGATAAAAAGATAGAGAGCTGTGGTTCCACTTATTTCTGGAGGTTTAAGTTGTTTCTCTTGAGGATTTTGTACCAGTCGATGCAGAAAGCCCCCTCAGAGGTCACGTGGAGACCTTCACACGAGGATGCCAAAATCTTGATATCAGATTCCCCTGGGATGGGCAGCACTGAAGATGATCAAGAGGAAGGAACTTCGAAACAAGCCAGCAAGCAAAGCCTGGGGGACACAGGCAAAGGTGTGGACATTGATGACCCTGCAGAGGATTCTCTCCATCAATCAAGTGACACTGGAGGCCGTGAGCCAATGGTAGAAAAGCTTCTTGAAATACAGGCACAGATTGAGGCAGTAGAAATGCACTTGACACGAGAACACATGAAAAGGGTGCTGGGAGAAGTTTACCTACACACGTGGATTACAGAGAACACCAGCATTCCCACCAGAGGGGTCTGTGACTTCTTAATGTCCGACGATGGCTACGATGACAGAACAGCACGAGTGCTGATTGGGCACATCCTGAAGAAGATGAACAAACAGACCTTCCCTGAGCACTGCAGCTTGTGCAAGGAGATCCTGCCCTTCACAGACCGCAAACAGGCCGTGTGCTCCAACGGCCACATCTGGCTCAGGTGCTTTCTAACCTACCAGTCCTGCCAGAGTTTGGTGTACAGGAGGTGTTTGCTTCATGACAGCATTGCACGGCACCCAACCCCAGAAGATCCTGAATGGATCAAGAGGTTATTGCAGGGACCTTGCACCTTCTGTGATTCTCCTGTGTTCTAG
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |