WERAM Information


Tag Content
WERAM ID WERAM-Chg-0009
Ensembl Protein ID EAQ93631
Gene Name CHGG_01866
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
CHGG_01866 EAQ93631 EAQ93631
Status Unreviewed
Classification
Type Family E-value Score Start End
HAT HAT_other 0.31 35
Organism Chaetomium globosum
Domain Profile
  HAT HAT_other

Query: 85  QDGDLVEPTNFPLPTLG---------PKLKELSGDI----------HNGRGFCVIRGLNP 125
++GD+ EPT+ LPT G KL E+ G I H R V+ +
Sbjct: 641 KEGDVEEPTDDSLPTTGDAGGREPMEEKLLEIQGKIEAVEMHLTREHMKR---VLGEVYL 697
Query: 126 ASYTVEDLTLVYLGVQSYIAEQRGRQDKCGNMLVHIVADNSTKQAAEHHRHSTKAIT-FH 184
++ E+ ++ G+ +++ D+ +L+ ++ KQ H K I F
Sbjct: 698 HTWITENTSIPTRGLCNFLMSDEEYDDRTARVLIGHISKKMNKQTFPEHCSLCKEILPFT 757
Query: 185 NEKLAMSSAGDSVAPPQYIVVHHCSATPMKRL----------AATRPDVIRTLSRSDWPF 234
+ K A+ S G + ++ C + +R A PD I+ L +S PF
Sbjct: 758 DRKQAVCSNG-HIWLRCFLTYQSCQSLIYRRCLLHDSIARHPAPEDPDWIKRLLQSPCPF 816
Query: 235 A 235

Sbjct: 817 C 817

Protein Sequence
(Fasta)
MSIASAGPVL AAATAPVVGT PNLALHAKTN ADVPTLPAGF PAHLPDKLAW TGSDFNKTSD 60
HILNLSGTHL AEIRAALGSY KSLGQDGDLV EPTNFPLPTL GPKLKELSGD IHNGRGFCVI 120
RGLNPASYTV EDLTLVYLGV QSYIAEQRGR QDKCGNMLVH IVADNSTKQA AEHHRHSTKA 180
ITFHNEKLAM SSAGDSVAPP QYIVVHHCSA TPMKRLAATR PDVIRTLSRS DWPFAMPRFQ 240
CRPVIFYRDS KLVMNFGRAA LLGSDAHPRP QHLPSLTARQ IEALDAIEAI AQATQLEIQT 300
QAGDMHFINN LAILHRREGF ANGQARTEKR HLVRMRLRSA KQGWSIPREL EGEWDEAFKK 360
HGVKHWHVEP MPSYFFPMRL YPN 383
Nucleotide Sequence
(Fasta)
ATGTCCATTG CAAGTGCCGG ACCCGTTCTC GCCGCGGCCA CCGCCCCAGT CGTTGGCACT 60
CCCAACTTGG CCCTGCACGC CAAAACTAAT GCCGATGTCC CCACCCTTCC GGCCGGATTC 120
CCCGCCCATC TGCCGGATAA GTTGGCCTGG ACCGGCTCCG ACTTCAACAA GACATCTGAC 180
CACATCCTGA ACTTGAGTGG TACCCACCTC GCCGAAATCA GAGCGGCTCT TGGAAGCTAC 240
AAATCTCTCG GTCAAGATGG TGACCTCGTG GAACCCACCA ACTTTCCACT CCCGACACTC 300
GGCCCGAAGC TCAAGGAGCT CAGCGGTGAC ATCCATAATG GCAGGGGATT CTGTGTCATC 360
CGGGGTCTCA ACCCTGCTTC CTACACCGTT GAGGACTTGA CTCTGGTCTA CCTTGGCGTT 420
CAGTCGTACA TTGCCGAGCA GCGCGGCCGC CAGGACAAGT GTGGAAACAT GCTCGTCCAT 480
ATTGTTGCGG ATAACAGCAC CAAGCAGGCG GCCGAGCATC ATCGGCATTC GACTAAAGCA 540
ATTACGTTCC ACAACGAGAA GCTGGCGATG TCGTCAGCTG GTGACTCCGT AGCACCGCCA 600
CAGTACATTG TGGTGCATCA TTGCTCTGCT ACACCGATGA AACGTCTGGC AGCTACCCGG 660
CCCGATGTGA TTCGCACTTT GTCCCGGTCT GACTGGCCTT TCGCCATGCC ACGCTTCCAG 720
TGCCGTCCGG TTATCTTCTA CCGGGATTCG AAGCTGGTCA TGAACTTTGG TCGCGCCGCT 780
CTTCTAGGCA GCGATGCTCA TCCCCGACCT CAGCATCTGC CCTCCCTGAC AGCCCGCCAG 840
ATCGAAGCCT TGGATGCCAT CGAGGCCATT GCGCAGGCTA CTCAGCTGGA GATCCAGACT 900
CAGGCAGGGG ACATGCATTT CATCAACAAC CTCGCCATCC TTCACCGCCG CGAAGGCTTT 960
GCTAATGGAC AAGCCCGTAC CGAGAAACGC CACCTGGTGC GCATGCGGCT TCGCAGTGCC 1020
AAGCAAGGAT GGTCGATCCC TCGCGAGCTC GAGGGGGAGT GGGACGAAGC GTTCAAGAAG 1080
CATGGCGTCA AGCACTGGCA CGTGGAGCCG ATGCCCTCGT ACTTTTTCCC CATGCGCCTC 1140
TATCCCAACT AG 1153
Sequence Source Ensembl
Orthology
Created Date 25-Jun-2016