Tag |
Content |
WERAM ID |
WERAM-Dan-0162 |
Ensembl Protein ID |
ENSDNOP00000018237.1 |
Gene Name |
GTF3C4 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HAT |
HAT_other |
0.00e+00 |
1546 |
|
|
|
Organism |
Dasypus novemcinctus |
Domain Profile |
HAT HAT_other
Query: 1 MSTAEKARIXXXXXXXXXXXXXXXXXXXXXXXXXXXXDAVPGPSASFRLMVTRREPAVKL 60 M+TA++AR+ DA PGPSA+FRLMVTRREPAVKL Sbjct: 1 MNTADQARVGPADDGPAPSGEEEGEGGGEAGGKEPAADAAPGPSAAFRLMVTRREPAVKL 60 Query: 61 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 120 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV Sbjct: 61 QYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKV 120 Query: 121 GSKKEVAECKEKFATSKDPTVSQTFMLDRVFNPEGKALPPMRGFKYASWSPMGCDANGRC 180 GSK EVAECKEKFA SKDPTVSQTFMLDRVFNPEGKALPPMRGFKY SWSPMGCDANGRC Sbjct: 121 GSKTEVAECKEKFAASKDPTVSQTFMLDRVFNPEGKALPPMRGFKYTSWSPMGCDANGRC 180 Query: 181 LLAALTMDNRLTIQANFNRLQWVQLIDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQ 240 LLAALTMDNRLTIQAN NRLQWVQL+DLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQ Sbjct: 181 LLAALTMDNRLTIQANLNRLQWVQLVDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQ 240 Query: 241 RRHSMQTPVRMEWSGICTTQQVKHNNECHDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 300 RRHSMQTPVRMEWSGICTTQQVKHNNEC DVGSVLLAVLFENGNIAVWQFQLPFVGKESI Sbjct: 241 RRHSMQTPVRMEWSGICTTQQVKHNNECRDVGSVLLAVLFENGNIAVWQFQLPFVGKESI 300 Query: 301 SSCNTIESGINSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAIKGYFTLRQPV 360 SSCNTIESGI SPSVLFWWEYEH+NRKMSGLIVGSAFGP+KILPVNLKA+KGYFTLRQPV Sbjct: 301 SSCNTIESGITSPSVLFWWEYEHNNRKMSGLIVGSAFGPIKILPVNLKAVKGYFTLRQPV 360 Query: 361 VLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 +LWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG Sbjct: 361 ILWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTG 420 Query: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI 480 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI Sbjct: 421 LHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGI 480 Query: 481 AVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKSFEEAAAQLLESSVQNLFKQVDLI 540 AVSPCGAYLA+ITTEGM+NGLHPVNKNYQVQFVTLK+FEEAAAQLLESSVQNLFKQVDLI Sbjct: 481 AVSPCGAYLAIITTEGMINGLHPVNKNYQVQFVTLKTFEEAAAQLLESSVQNLFKQVDLI 540 Query: 541 DLIRWKILKDKHIPQFLQEALEKKFESSGATYFWRFKLFLLRILYQSMQKTPSEALWKPS 600 DL+RWKILKDKHIPQFLQEALEKK ESSG TYFWRFKLFLLRILYQSMQKTPSEALWKP+ Sbjct: 541 DLVRWKILKDKHIPQFLQEALEKKIESSGVTYFWRFKLFLLRILYQSMQKTPSEALWKPT 600 Query: 601 HEDSKILLVDSPGMGNDEDEQQEEGTSSKKLNKQSLQERSKEGDADEPTDDLLTQSGDAG 660 HEDSKILLVDSPGMGN +DEQQEEGTSSK++ KQ LQERSKEGD +EPTDD L +GDAG Sbjct: 601 HEDSKILLVDSPGMGNADDEQQEEGTSSKQVVKQGLQERSKEGDVEEPTDDSLPTTGDAG 660 Query: 661 GREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE 720 GREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE Sbjct: 661 GREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDE 720 Query: 721 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC 780 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC Sbjct: 721 EYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSC 780 Query: 781 QSLIYRRCLLHDSIARHPAPE 801 QSLIYRRCLLHDSIARHPAPE Sbjct: 781 QSLIYRRCLLHDSIARHPAPE 801
|
Protein Sequence (Fasta) | MSTAEKARIG PAADGPAPPG EEEGEGGGEA GGKESGPDAV PGPSASFRLM VTRREPAVKL 60 QYAVSGLEPL AWSEDHRVSV STARSIAVLE LICDVHNPGQ DLVIHRTSVP APLNSCLLKV 120 GSKKEVAECK EKFATSKDPT VSQTFMLDRV FNPEGKALPP MRGFKYASWS PMGCDANGRC 180 LLAALTMDNR LTIQANFNRL QWVQLIDLTE IYGERLYETS YRLSKNEAPE GNLGDFAEFQ 240 RRHSMQTPVR MEWSGICTTQ QVKHNNECHD VGSVLLAVLF ENGNIAVWQF QLPFVGKESI 300 SSCNTIESGI NSPSVLFWWE YEHSNRKMSG LIVGSAFGPV KILPVNLKAI KGYFTLRQPV 360 VLWKEMDQLP VHSIKCVPLY HPYQKCSCSL VVAARGSYVF WCLLLISKAG LNVHNSHVTG 420 LHSLPIVSMT ADKQNGTVYT CSSDGKVRQL IPIFTDVALK FEHQLIKLSD VFGSVRTHGI 480 AVSPCGAYLA VITTEGMVNG LHPVNKNYQV QFVTLKSFEE AAAQLLESSV QNLFKQVDLI 540 DLIRWKILKD KHIPQFLQEA LEKKFESSGA TYFWRFKLFL LRILYQSMQK TPSEALWKPS 600 HEDSKILLVD SPGMGNDEDE QQEEGTSSKK LNKQSLQERS KEGDADEPTD DLLTQSGDAG 660 GREPMEEKLL EIQGKIEAVE MHLTREHMKR VLGEVYLHTW ITENTSIPTR GLCNFLMSDE 720 EYDDRTARVL IGHISKKMNK QTFPEHCSLC KEILPFTDRK QAVCSNGHIW LRCFLTYQSC 780 QSLIYRRCLL HDSIARHPAP EGK 803Protein Fasta Sequence
>ENSDNOP00000018237.1|HAT_other|Dasypus novemcinctus MSTAEKARIGPAADGPAPPGEEEGEGGGEAGGKESGPDAVPGPSASFRLMVTRREPAVKLQYAVSGLEPLAWSEDHRVSVSTARSIAVLELICDVHNPGQDLVIHRTSVPAPLNSCLLKVGSKKEVAECKEKFATSKDPTVSQTFMLDRVFNPEGKALPPMRGFKYASWSPMGCDANGRCLLAALTMDNRLTIQANFNRLQWVQLIDLTEIYGERLYETSYRLSKNEAPEGNLGDFAEFQRRHSMQTPVRMEWSGICTTQQVKHNNECHDVGSVLLAVLFENGNIAVWQFQLPFVGKESISSCNTIESGINSPSVLFWWEYEHSNRKMSGLIVGSAFGPVKILPVNLKAIKGYFTLRQPVVLWKEMDQLPVHSIKCVPLYHPYQKCSCSLVVAARGSYVFWCLLLISKAGLNVHNSHVTGLHSLPIVSMTADKQNGTVYTCSSDGKVRQLIPIFTDVALKFEHQLIKLSDVFGSVRTHGIAVSPCGAYLAVITTEGMVNGLHPVNKNYQVQFVTLKSFEEAAAQLLESSVQNLFKQVDLIDLIRWKILKDKHIPQFLQEALEKKFESSGATYFWRFKLFLLRILYQSMQKTPSEALWKPSHEDSKILLVDSPGMGNDEDEQQEEGTSSKKLNKQSLQERSKEGDADEPTDDLLTQSGDAGGREPMEEKLLEIQGKIEAVEMHLTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEGK
|
Nucleotide Sequence (Fasta) | AAAACTGCCG CGGAGGACGC TGGGGGCTGG AGGCGGTTGT GCTGAAGGCA GTAACCCGAC 60 CTCAACCCGT GTTTGGCGCG CCCCGGGATC AGGCTTCTGG ATCCTAGGGA AGATGTCCAC 120 GGCCGAGAAG GCCCGCATAG GGCCCGCGGC TGACGGGCCT GCGCCGCCCG GGGAGGAGGA 180 GGGCGAGGGG GGCGGCGAGG CGGGCGGGAA GGAGTCGGGA CCGGACGCGG TTCCCGGGCC 240 CAGCGCCTCG TTCCGCCTCA TGGTGACTCG GCGGGAGCCG GCGGTGAAGC TGCAGTACGC 300 GGTGAGCGGC CTGGAGCCGC TGGCCTGGTC CGAGGACCAC CGCGTATCCG TGTCCACCGC 360 CCGTAGCATC GCTGTGTTGG AGCTCATCTG CGACGTGCAC AACCCGGGAC AGGACCTGGT 420 CATCCACCGC ACATCGGTGC CCGCACCTCT CAACAGCTGC CTCCTCAAAG TTGGCTCAAA 480 AAAAGAAGTT GCTGAGTGTA AGGAGAAATT TGCCACCTCT AAAGACCCCA CAGTCAGTCA 540 AACTTTCATG TTGGATAGGG TGTTCAATCC TGAAGGGAAG GCTTTGCCAC CAATGAGAGG 600 ATTCAAATAC GCTAGTTGGT CCCCCATGGG TTGTGATGCT AATGGCAGGT GCCTCTTGGC 660 AGCATTGACC ATGGACAATC GCCTGACCAT CCAGGCTAAC TTCAACAGAC TCCAGTGGGT 720 ACAGTTGATT GATCTGACTG AGATTTATGG AGAACGTCTT TATGAGACCA GTTATAGACT 780 CTCTAAAAAT GAGGCCCCAG AAGGAAATCT TGGGGACTTT GCTGAGTTTC AGAGGAGACA 840 CAGCATGCAA ACACCGGTCA GAATGGAGTG GTCGGGCATC TGTACCACAC AGCAGGTCAA 900 GCATAACAAT GAATGCCACG ATGTTGGCAG TGTGCTCCTT GCAGTCCTCT TTGAAAATGG 960 TAATATTGCT GTATGGCAGT TTCAGCTACC CTTTGTAGGA AAGGAATCCA TCTCTTCGTG 1020 TAACACAATT GAATCGGGAA TCAACTCTCC TAGTGTTTTG TTTTGGTGGG AATATGAGCA 1080 CAGTAATCGG AAAATGAGTG GCCTTATAGT GGGGAGTGCT TTTGGACCTG TAAAAATTCT 1140 TCCAGTCAAT CTCAAAGCAA TTAAAGGCTA CTTCACCTTA AGGCAGCCTG TTGTCTTATG 1200 GAAAGAAATG GACCAATTGC CAGTGCATAG CATTAAATGT GTGCCACTTT ATCATCCTTA 1260 CCAAAAGTGT AGTTGTAGCT TAGTGGTTGC TGCAAGAGGA TCCTATGTGT TTTGGTGTCT 1320 TCTTCTGATC TCCAAAGCAG GTCTGAATGT TCACAATTCC CATGTCACAG GCCTTCACTC 1380 ACTGCCAATT GTCTCCATGA CTGCAGACAA ACAGAATGGA ACAGTATATA CTTGTTCCAG 1440 TGATGGGAAG GTGAGGCAGT TAATTCCCAT TTTCACAGAT GTTGCATTAA AGTTTGAACA 1500 CCAGTTGATT AAACTTTCAG ATGTTTTTGG CTCAGTAAGG ACTCACGGTA TAGCAGTGAG 1560 CCCCTGTGGT GCATATCTGG CTGTCATTAC AACTGAAGGT ATGGTCAATG GCCTCCATCC 1620 TGTTAACAAA AACTACCAGG TCCAGTTTGT TACTCTCAAA TCCTTTGAAG AGGCAGCTGC 1680 TCAACTCCTG GAATCTTCAG TTCAAAATCT CTTTAAGCAG GTAGACTTAA TAGACCTAAT 1740 ACGCTGGAAA ATTTTAAAAG ATAAACACAT CCCTCAGTTT TTACAGGAAG CTTTGGAAAA 1800 AAAGTTTGAA AGCAGTGGGG CCACCTATTT TTGGCGTTTT AAACTTTTCC TCTTGAGGAT 1860 TTTATATCAG TCAATGCAGA AAACCCCTTC AGAAGCCTTA TGGAAACCCA GCCATGAGGA 1920 CTCAAAAATC TTACTAGTTG ACTCACCTGG AATGGGCAAT GATGAGGATG AACAGCAAGA 1980 AGAAGGCACT TCTTCCAAAA AGTTGAATAA GCAAAGCCTT CAGGAAAGGA GCAAAGAAGG 2040 TGATGCAGAC GAACCAACTG ATGATCTACT CACCCAGTCT GGAGATGCTG GAGGCCGTGA 2100 GCCAATGGAA GAAAAACTCC TTGAAATCCA AGGGAAAATT GAAGCTGTGG AGATGCACTT 2160 GACTAGGGAA CACATGAAGC GAGTCTTAGG AGAAGTTTAC CTGCACACCT GGATCACAGA 2220 AAACACTAGC ATCCCTACCA GAGGACTCTG TAATTTTTTA ATGTCTGATG AAGAATATGA 2280 TGACAGAACA GCACGGGTGC TGATTGGACA TATCTCAAAG AAGATGAACA AACAAACTTT 2340 CCCTGAGCAC TGTAGTCTTT GTAAAGAGAT CTTGCCATTC ACAGATCGCA AACAGGCAGT 2400 CTGCTCCAAT GGTCACATAT GGCTCCGGTG CTTTTTAACT TACCAGTCTT GCCAGAGTTT 2460 GATATACAGA AGGTGTTTGC TCCATGACAG CATTGCCCGA CATCCAGCAC CAGAAGGTAA 2520 ATGATTTCCC AGCTTTGAGA CTGAGATTGA GCATGGTGGC AGAGCGTCAG CTTCTCAGAA 2580 TAATGACTTG GGTCTGACTC CACAATGCTA AGTATTATGC ACTCAGATGG TTAGCAGGCT 2640 TGTTCTGCTC TTTACAGTAT TATCTTGAAG CCACAATACT AATTTTAAAA TGTTAATTGG 2700 AAATTGTAAT GTAAAAGTCC GTATCACGTT TTAGTATGTG GGTTAAAAGT GGGTCTGCAG 2760 TTTGCTGGTT TTTTTTTTTA AGATTTTTTT AAAAATTTAT CTCCCCTTCC CCTCCCCCAC 2820 CCCAGTTGTC TGCGCTCTCT GTCCATTTGC TGTGTGTTCT TATGTCTGCT TGTATTCTTG 2880 TCAGCGGCAC CAGGAATCTG TGTCTCTTTG TTGT
2915Nucleotide Fasta Sequence
>ENSDNOP00000018237.1|HAT_other|Dasypus novemcinctus AAAACTGCCGCGGAGGACGCTGGGGGCTGGAGGCGGTTGTGCTGAAGGCAGTAACCCGACCTCAACCCGTGTTTGGCGCGCCCCGGGATCAGGCTTCTGGATCCTAGGGAAGATGTCCACGGCCGAGAAGGCCCGCATAGGGCCCGCGGCTGACGGGCCTGCGCCGCCCGGGGAGGAGGAGGGCGAGGGGGGCGGCGAGGCGGGCGGGAAGGAGTCGGGACCGGACGCGGTTCCCGGGCCCAGCGCCTCGTTCCGCCTCATGGTGACTCGGCGGGAGCCGGCGGTGAAGCTGCAGTACGCGGTGAGCGGCCTGGAGCCGCTGGCCTGGTCCGAGGACCACCGCGTATCCGTGTCCACCGCCCGTAGCATCGCTGTGTTGGAGCTCATCTGCGACGTGCACAACCCGGGACAGGACCTGGTCATCCACCGCACATCGGTGCCCGCACCTCTCAACAGCTGCCTCCTCAAAGTTGGCTCAAAAAAAGAAGTTGCTGAGTGTAAGGAGAAATTTGCCACCTCTAAAGACCCCACAGTCAGTCAAACTTTCATGTTGGATAGGGTGTTCAATCCTGAAGGGAAGGCTTTGCCACCAATGAGAGGATTCAAATACGCTAGTTGGTCCCCCATGGGTTGTGATGCTAATGGCAGGTGCCTCTTGGCAGCATTGACCATGGACAATCGCCTGACCATCCAGGCTAACTTCAACAGACTCCAGTGGGTACAGTTGATTGATCTGACTGAGATTTATGGAGAACGTCTTTATGAGACCAGTTATAGACTCTCTAAAAATGAGGCCCCAGAAGGAAATCTTGGGGACTTTGCTGAGTTTCAGAGGAGACACAGCATGCAAACACCGGTCAGAATGGAGTGGTCGGGCATCTGTACCACACAGCAGGTCAAGCATAACAATGAATGCCACGATGTTGGCAGTGTGCTCCTTGCAGTCCTCTTTGAAAATGGTAATATTGCTGTATGGCAGTTTCAGCTACCCTTTGTAGGAAAGGAATCCATCTCTTCGTGTAACACAATTGAATCGGGAATCAACTCTCCTAGTGTTTTGTTTTGGTGGGAATATGAGCACAGTAATCGGAAAATGAGTGGCCTTATAGTGGGGAGTGCTTTTGGACCTGTAAAAATTCTTCCAGTCAATCTCAAAGCAATTAAAGGCTACTTCACCTTAAGGCAGCCTGTTGTCTTATGGAAAGAAATGGACCAATTGCCAGTGCATAGCATTAAATGTGTGCCACTTTATCATCCTTACCAAAAGTGTAGTTGTAGCTTAGTGGTTGCTGCAAGAGGATCCTATGTGTTTTGGTGTCTTCTTCTGATCTCCAAAGCAGGTCTGAATGTTCACAATTCCCATGTCACAGGCCTTCACTCACTGCCAATTGTCTCCATGACTGCAGACAAACAGAATGGAACAGTATATACTTGTTCCAGTGATGGGAAGGTGAGGCAGTTAATTCCCATTTTCACAGATGTTGCATTAAAGTTTGAACACCAGTTGATTAAACTTTCAGATGTTTTTGGCTCAGTAAGGACTCACGGTATAGCAGTGAGCCCCTGTGGTGCATATCTGGCTGTCATTACAACTGAAGGTATGGTCAATGGCCTCCATCCTGTTAACAAAAACTACCAGGTCCAGTTTGTTACTCTCAAATCCTTTGAAGAGGCAGCTGCTCAACTCCTGGAATCTTCAGTTCAAAATCTCTTTAAGCAGGTAGACTTAATAGACCTAATACGCTGGAAAATTTTAAAAGATAAACACATCCCTCAGTTTTTACAGGAAGCTTTGGAAAAAAAGTTTGAAAGCAGTGGGGCCACCTATTTTTGGCGTTTTAAACTTTTCCTCTTGAGGATTTTATATCAGTCAATGCAGAAAACCCCTTCAGAAGCCTTATGGAAACCCAGCCATGAGGACTCAAAAATCTTACTAGTTGACTCACCTGGAATGGGCAATGATGAGGATGAACAGCAAGAAGAAGGCACTTCTTCCAAAAAGTTGAATAAGCAAAGCCTTCAGGAAAGGAGCAAAGAAGGTGATGCAGACGAACCAACTGATGATCTACTCACCCAGTCTGGAGATGCTGGAGGCCGTGAGCCAATGGAAGAAAAACTCCTTGAAATCCAAGGGAAAATTGAAGCTGTGGAGATGCACTTGACTAGGGAACACATGAAGCGAGTCTTAGGAGAAGTTTACCTGCACACCTGGATCACAGAAAACACTAGCATCCCTACCAGAGGACTCTGTAATTTTTTAATGTCTGATGAAGAATATGATGACAGAACAGCACGGGTGCTGATTGGACATATCTCAAAGAAGATGAACAAACAAACTTTCCCTGAGCACTGTAGTCTTTGTAAAGAGATCTTGCCATTCACAGATCGCAAACAGGCAGTCTGCTCCAATGGTCACATATGGCTCCGGTGCTTTTTAACTTACCAGTCTTGCCAGAGTTTGATATACAGAAGGTGTTTGCTCCATGACAGCATTGCCCGACATCCAGCACCAGAAGGTAAATGATTTCCCAGCTTTGAGACTGAGATTGAGCATGGTGGCAGAGCGTCAGCTTCTCAGAATAATGACTTGGGTCTGACTCCACAATGCTAAGTATTATGCACTCAGATGGTTAGCAGGCTTGTTCTGCTCTTTACAGTATTATCTTGAAGCCACAATACTAATTTTAAAATGTTAATTGGAAATTGTAATGTAAAAGTCCGTATCACGTTTTAGTATGTGGGTTAAAAGTGGGTCTGCAGTTTGCTGGTTTTTTTTTTTAAGATTTTTTTAAAAATTTATCTCCCCTTCCCCTCCCCCACCCCAGTTGTCTGCGCTCTCTGTCCATTTGCTGTGTGTTCTTATGTCTGCTTGTATTCTTGTCAGCGGCACCAGGAATCTGTGTCTCTTTGTTGT
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |