Tag |
Content |
WERAM ID |
WERAM-Aim-0054 |
Ensembl Protein ID |
ENSAMEP00000004386.1 |
Gene Name |
GTF3C4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HAT |
HAT_other |
0 |
1598 |
|
|
|
Organism |
Ailuropoda melanoleuca |
Domain Profile |
HAT HAT_other
Query: 1 MSSADKARVGPAADGPVPXXXXXXXXXXXXXXKEPATDAAPGPSAAFRFLVSRREPAVKL 60 M++AD+ARVGPA DGP P KEPA DAAPGPSAAFR +V+RREPAVKL Sbjct: 1 MNTADQARVGPADDGPAPSGEEEGEGGGEAGGKEPAADAAPGPSAAFRLMVTRREPAVKL 60 Query: 61 QYAVSGLXXXXXXXXTRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 120 QYAVSGL RVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV Sbjct: 61 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 120 Query: 121 GSKAEVAECKEKFATSKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRC 180 GSK EVAECKEKFA SKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRC Sbjct: 121 GSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRC 180 Query: 181 LLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKSEAPEGNLGDFAEFQ 240 LLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSK+EAPEGNLGDFAEFQ Sbjct: 181 LLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQ 240 Query: 241 RRHSMQTPVRMEWSGICTTQQVTHNNECRDVGSVLLAVLLENGNIAVWQFQLPFVGKESI 300 RRHSMQTPVRMEWSGICTTQQV HNNECRDVGSVLLAVL ENGNIAVWQFQLPFVGKESI Sbjct: 241 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 300 Query: 301 SSCNTIESGISSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPV 360 SSCNTIESGI+SPSVLFWWEYEH+NRKMSGLIVGSAFGP+KILPVNLKAVKGYFTLRQPV Sbjct: 301 SSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQPV 360 Query: 361 VLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 +LWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG Sbjct: 361 ILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 Query: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI 480 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI Sbjct: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI 480 Query: 481 AVSPCGAYLAVITTEGMVNGLHPVNKNHQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 540 AVSPCGAYLA+ITTEGM+NGLHPVNKN+QVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI Sbjct: 481 AVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 540 Query: 541 DLVRWKILKDKHIPQFLQEALEKKIESSGATYFWRFKLFLLRILYQSMQKTPSEALWKPT 600 DLVRWKILKDKHIPQFLQEALEKKIESSG TYFWRFKLFLLRILYQSMQKTPSEALWKPT Sbjct: 541 DLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKPT 600 Query: 601 HEDSKILLVDSPGMGNAEEEQQEEGTSSKQANRQGLQERSREGDPEDPTDDSLTQAGDAG 660 HEDSKILLVDSPGMGNA++EQQEEGTSSKQ +QGLQERS+EGD E+PTDDSL GDAG Sbjct: 601 HEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQERSKEGDVEEPTDDSLPTTGDAG 660 Query: 661 GREPMEEKLLELQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE 720 GREPMEEKLLE+QGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE Sbjct: 661 GREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE 720 Query: 721 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC 780 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC Sbjct: 721 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC 780 Query: 781 QSLIYRRCLLHDSIARHPAPDDPDWIKRLLQSPCPFCDSPVF 822 QSLIYRRCLLHDSIARHPAP+DPDWIKRLLQSPCPFCDSPVF Sbjct: 781 QSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPFCDSPVF 822
|
Protein Sequence (Fasta) | MSSADKARVG PAADGPVPXX XXXXXXGGEA GGKEPATDAA PGPSAAFRFL VSRREPAVKL 60 QYAVSGLXXX XXXXXTRVSV STARSIAVLE LICDVHNPGQ DLVIHRTSVP APLNSCLLKV 120 GSKAEVAECK EKFATSKDPT VSQTFMLDRV FNPEGKALPP MRGFKYTSWS PMGCDANGRC 180 LLAALTMDNR LTIQANLNRL QWVQLVDLTE IYGERLYETS YRLSKSEAPE GNLGDFAEFQ 240 RRHSMQTPVR MEWSGICTTQ QVTHNNECRD VGSVLLAVLL ENGNIAVWQF QLPFVGKESI 300 SSCNTIESGI SSPSVLFWWE YEHSNRKMSG LIVGSAFGPV KILPVNLKAV KGYFTLRQPV 360 VLWKEMDQLP VHSIKCVPLY HPYQKCSCSL VVAARGSYVF WCLLLISKAG LNVHNSHVTG 420 LHSLPIVSMT ADKQNGTVYT CSSDGKVRQL IPIFTDVALK FEHQLIKLSD VFGSVRTHGI 480 AVSPCGAYLA VITTEGMVNG LHPVNKNHQV QFVTLKTFEE AAAQLLESSV QNLFKQVDLI 540 DLVRWKILKD KHIPQFLQEA LEKKIESSGA TYFWRFKLFL LRILYQSMQK TPSEALWKPT 600 HEDSKILLVD SPGMGNAEEE QQEEGTSSKQ ANRQGLQERS REGDPEDPTD DSLTQAGDAG 660 GREPMEEKLL ELQGKIEAVE MHLTREHMKR VLGEVYLHTW ITENTSIPTR GLCNFLMSDE 720 EYDDRTARVL IGHISKKMNK QTFPEHCSLC KEILPFTDRK QAVCSNGHIW LRCFLTYQSC 780 QSLIYRRCLL HDSIARHPAP DDPDWIKRLL QSPCPFCDSP VF 822Protein Fasta Sequence
>ENSAMEP00000004386.1|HAT_other|Ailuropoda melanoleuca MSSADKARVGPAADGPVPXXXXXXXXGGEAGGKEPATDAAPGPSAAFRFLVSRREPAVKLQYAVSGLXXXXXXXXTRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKVGSKAEVAECKEKFATSKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRCLLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKSEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVTHNNECRDVGSVLLAVLLENGNIAVWQFQLPFVGKESISSCNTIESGISSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAVKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNHQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLIDLVRWKILKDKHIPQFLQEALEKKIESSGATYFWRFKLFLLRILYQSMQKTPSEALWKPTHEDSKILLVDSPGMGNAEEEQQEEGTSSKQANRQGLQERSREGDPEDPTDDSLTQAGDAGGREPMEEKLLELQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPDDPDWIKRLLQSPCPFCDSPVF
|
Nucleotide Sequence (Fasta) | ATGAGCTCGG CCGACAAGGC CCGGGTGGGG CCCGCGGCCG ACGGGCCTGT GCCNNNNNNN 60 NNNNNNNNNN NNNNNNNNGG CGGCGAGGCG GGCGGGAAGG AGCCAGCAAC GGACGCGGCC 120 CCCGGGCCCA GTGCAGCGTT CCGCTTCCTG GTATCTCGGC GGGAGCCGGC CGTGAAGCTG 180 CAGTATGCAG TGAGCGGCTT GNNNNNNNNN NNNNNNNNNN NNNCCACCCG CGTGTCCGTG 240 TCCACGGCCC GCAGCATCGC TGTGCTGGAG CTCATCTGCG ACGTGCACAA CCCGGGCCAG 300 GACCTGGTTA TCCACCGCAC CTCGGTGCCC GCTCCTCTCA ACAGCTGCCT CCTCAAAGTT 360 GGCTCCAAAG CAGAAGTTGC TGAGTGTAAG GAGAAGTTTG CCACCTCCAA AGACCCCACA 420 GTCAGTCAGA CTTTCATGTT GGATAGGGTG TTCAACCCTG AGGGGAAGGC GTTGCCACCA 480 ATGAGAGGAT TCAAGTATAC TAGCTGGTCC CCCATGGGTT GCGATGCAAA TGGCAGGTGC 540 CTCTTGGCGG CGCTGACCAT GGACAATCGC CTGACCATCC AGGCCAACCT CAACAGACTG 600 CAGTGGGTCC AGCTGGTGGA TCTGACTGAG ATTTATGGAG AACGTCTTTA CGAGACCAGT 660 TACAGGCTCT CTAAAAGTGA GGCTCCAGAG GGGAATCTCG GGGACTTCGC GGAGTTTCAG 720 AGGAGGCACA GCATGCAGAC GCCAGTCCGG ATGGAGTGGT CGGGCATCTG TACCACTCAG 780 CAGGTTACGC ACAACAACGA GTGCCGGGAC GTGGGCAGCG TGCTCCTGGC GGTCCTCTTG 840 GAGAACGGGA ACATTGCTGT GTGGCAGTTC CAGCTGCCCT TCGTGGGGAA GGAGTCCATC 900 TCTTCGTGCA ACACCATTGA ATCAGGAATC AGCTCTCCTA GTGTTTTGTT TTGGTGGGAA 960 TATGAGCACA GTAATCGAAA AATGAGCGGC CTTATTGTGG GGAGTGCTTT TGGACCCGTG 1020 AAAATTCTTC CTGTCAATCT CAAAGCAGTT AAAGGCTATT TCACTTTAAG GCAGCCCGTT 1080 GTCTTGTGGA AAGAAATGGA CCAGTTGCCG GTGCACAGCA TTAAATGTGT TCCGCTCTAT 1140 CACCCTTACC AGAAGTGTAG TTGTAGCTTA GTGGTGGCTG CAAGAGGATC CTATGTGTTT 1200 TGGTGTCTCC TTCTGATCTC CAAAGCGGGT CTCAATGTCC ACAATTCCCA CGTCACAGGC 1260 CTTCACTCAC TGCCGATTGT CTCCATGACT GCAGACAAGC AGAATGGAAC AGTATATACC 1320 TGTTCCAGCG ATGGGAAGGT GAGGCAGCTG ATTCCCATTT TCACCGATGT CGCATTAAAG 1380 TTTGAGCACC AGTTAATTAA ACTCTCCGAT GTGTTTGGCT CAGTGAGGAC ACACGGGATA 1440 GCAGTGAGCC CCTGCGGCGC GTATCTGGCC GTCATCACCA CTGAGGGCAT GGTCAATGGC 1500 CTCCATCCCG TGAACAAAAA CCACCAGGTC CAGTTTGTCA CTCTCAAAAC CTTTGAAGAG 1560 GCAGCGGCTC AGCTCCTGGA ATCGTCAGTT CAGAATCTCT TCAAGCAGGT AGACTTAATA 1620 GATCTTGTAC GCTGGAAGAT TTTAAAAGAT AAACACATCC CTCAGTTTTT ACAGGAAGCT 1680 TTGGAAAAAA AGATTGAAAG CAGCGGGGCC ACCTATTTTT GGCGTTTCAA GCTGTTCCTC 1740 CTGCGGATTT TGTATCAGTC AATGCAGAAA ACCCCTTCGG AAGCATTATG GAAACCCACC 1800 CATGAGGACT CCAAAATCTT ATTAGTTGAC TCCCCTGGGA TGGGCAATGC TGAGGAAGAA 1860 CAGCAAGAGG AAGGCACGTC TTCCAAACAG GCGAATAGGC AGGGCCTTCA GGAGAGGAGC 1920 AGAGAGGGCG ACCCGGAGGA CCCCACCGAC GATTCACTGA CCCAGGCTGG AGATGCCGGG 1980 GGCCGTGAGC CAATGGAAGA GAAGCTTCTG GAGCTCCAAG GGAAAATCGA AGCTGTGGAA 2040 ATGCACTTGA CCAGGGAGCA CATGAAGCGA GTCTTAGGGG AAGTGTACTT GCACACCTGG 2100 ATCACAGAAA ACACTAGCAT CCCCACCAGG GGACTCTGTA ACTTCCTGAT GTCTGATGAG 2160 GAGTACGATG ACCGAACAGC ACGGGTGCTG ATTGGACACA TCTCAAAGAA GATGAACAAA 2220 CAGACTTTCC CTGAGCACTG TAGTTTGTGT AAAGAGATCT TGCCGTTCAC AGATCGCAAA 2280 CAGGCAGTCT GCTCCAATGG CCACATTTGG CTCCGGTGCT TTTTAACCTA CCAGTCCTGC 2340 CAGAGTTTGA TATACAGAAG GTGTTTGCTC CACGACAGCA TCGCCCGCCA TCCGGCTCCC 2400 GACGATCCTG ACTGGATTAA GAGGTTACTG CAAAGCCCCT GCCCTTTCTG CGATTCTCCT 2460 GTCTTC
2467Nucleotide Fasta Sequence
>ENSAMEP00000004386.1|HAT_other|Ailuropoda melanoleuca ATGAGCTCGGCCGACAAGGCCCGGGTGGGGCCCGCGGCCGACGGGCCTGTGCCNNNNNNNNNNNNNNNNNNNNNNNNNGGCGGCGAGGCGGGCGGGAAGGAGCCAGCAACGGACGCGGCCCCCGGGCCCAGTGCAGCGTTCCGCTTCCTGGTATCTCGGCGGGAGCCGGCCGTGAAGCTGCAGTATGCAGTGAGCGGCTTGNNNNNNNNNNNNNNNNNNNNNNCCACCCGCGTGTCCGTGTCCACGGCCCGCAGCATCGCTGTGCTGGAGCTCATCTGCGACGTGCACAACCCGGGCCAGGACCTGGTTATCCACCGCACCTCGGTGCCCGCTCCTCTCAACAGCTGCCTCCTCAAAGTTGGCTCCAAAGCAGAAGTTGCTGAGTGTAAGGAGAAGTTTGCCACCTCCAAAGACCCCACAGTCAGTCAGACTTTCATGTTGGATAGGGTGTTCAACCCTGAGGGGAAGGCGTTGCCACCAATGAGAGGATTCAAGTATACTAGCTGGTCCCCCATGGGTTGCGATGCAAATGGCAGGTGCCTCTTGGCGGCGCTGACCATGGACAATCGCCTGACCATCCAGGCCAACCTCAACAGACTGCAGTGGGTCCAGCTGGTGGATCTGACTGAGATTTATGGAGAACGTCTTTACGAGACCAGTTACAGGCTCTCTAAAAGTGAGGCTCCAGAGGGGAATCTCGGGGACTTCGCGGAGTTTCAGAGGAGGCACAGCATGCAGACGCCAGTCCGGATGGAGTGGTCGGGCATCTGTACCACTCAGCAGGTTACGCACAACAACGAGTGCCGGGACGTGGGCAGCGTGCTCCTGGCGGTCCTCTTGGAGAACGGGAACATTGCTGTGTGGCAGTTCCAGCTGCCCTTCGTGGGGAAGGAGTCCATCTCTTCGTGCAACACCATTGAATCAGGAATCAGCTCTCCTAGTGTTTTGTTTTGGTGGGAATATGAGCACAGTAATCGAAAAATGAGCGGCCTTATTGTGGGGAGTGCTTTTGGACCCGTGAAAATTCTTCCTGTCAATCTCAAAGCAGTTAAAGGCTATTTCACTTTAAGGCAGCCCGTTGTCTTGTGGAAAGAAATGGACCAGTTGCCGGTGCACAGCATTAAATGTGTTCCGCTCTATCACCCTTACCAGAAGTGTAGTTGTAGCTTAGTGGTGGCTGCAAGAGGATCCTATGTGTTTTGGTGTCTCCTTCTGATCTCCAAAGCGGGTCTCAATGTCCACAATTCCCACGTCACAGGCCTTCACTCACTGCCGATTGTCTCCATGACTGCAGACAAGCAGAATGGAACAGTATATACCTGTTCCAGCGATGGGAAGGTGAGGCAGCTGATTCCCATTTTCACCGATGTCGCATTAAAGTTTGAGCACCAGTTAATTAAACTCTCCGATGTGTTTGGCTCAGTGAGGACACACGGGATAGCAGTGAGCCCCTGCGGCGCGTATCTGGCCGTCATCACCACTGAGGGCATGGTCAATGGCCTCCATCCCGTGAACAAAAACCACCAGGTCCAGTTTGTCACTCTCAAAACCTTTGAAGAGGCAGCGGCTCAGCTCCTGGAATCGTCAGTTCAGAATCTCTTCAAGCAGGTAGACTTAATAGATCTTGTACGCTGGAAGATTTTAAAAGATAAACACATCCCTCAGTTTTTACAGGAAGCTTTGGAAAAAAAGATTGAAAGCAGCGGGGCCACCTATTTTTGGCGTTTCAAGCTGTTCCTCCTGCGGATTTTGTATCAGTCAATGCAGAAAACCCCTTCGGAAGCATTATGGAAACCCACCCATGAGGACTCCAAAATCTTATTAGTTGACTCCCCTGGGATGGGCAATGCTGAGGAAGAACAGCAAGAGGAAGGCACGTCTTCCAAACAGGCGAATAGGCAGGGCCTTCAGGAGAGGAGCAGAGAGGGCGACCCGGAGGACCCCACCGACGATTCACTGACCCAGGCTGGAGATGCCGGGGGCCGTGAGCCAATGGAAGAGAAGCTTCTGGAGCTCCAAGGGAAAATCGAAGCTGTGGAAATGCACTTGACCAGGGAGCACATGAAGCGAGTCTTAGGGGAAGTGTACTTGCACACCTGGATCACAGAAAACACTAGCATCCCCACCAGGGGACTCTGTAACTTCCTGATGTCTGATGAGGAGTACGATGACCGAACAGCACGGGTGCTGATTGGACACATCTCAAAGAAGATGAACAAACAGACTTTCCCTGAGCACTGTAGTTTGTGTAAAGAGATCTTGCCGTTCACAGATCGCAAACAGGCAGTCTGCTCCAATGGCCACATTTGGCTCCGGTGCTTTTTAACCTACCAGTCCTGCCAGAGTTTGATATACAGAAGGTGTTTGCTCCACGACAGCATCGCCCGCCATCCGGCTCCCGACGATCCTGACTGGATTAAGAGGTTACTGCAAAGCCCCTGCCCTTTCTGCGATTCTCCTGTCTTC
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |