WERAM Information


Tag Content
WERAM ID WERAM-Thc-0047
Ensembl Protein ID EOY03697
Gene Name TCM_018807
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
TCM_018807 EOY03697 EOY03697
Status Unreviewed
Classification
Type Family E-value Score Start End
HAT HAT_other 0.17 35
Organism Theobroma cacao
Domain Profile
  HAT HAT_other

Query: 112 NLLSSKGIYEAVELHPLTPE---LITSDDSASTIALQITLFPNMGFCIGITAHHAVLDGK 168
LL +G EAVE+H LT E + + T + T P G C + + D
Sbjct: 668 KLLEIQGKIEAVEMH-LTREHMKRVLGEVYLHTWITENTSIPTRGLCNFLMSDEEYDDRT 726
Query: 169 ATTLFMKSWAYLCNQGYTEHSSLPPELTPFLDR 201
A L + Q + EH SL E+ PF DR
Sbjct: 727 ARVLIGHISKKMNKQTFPEHCSLCKEILPFTDR 759

Protein Sequence
(Fasta)
MKILEFTRIK PSPDSPKSAA ELSLPLTFFD IFWFKLPPVE RLFFYQINNS TPAYFNSVIL 60
PKLKHSLSLT LLHYLPLAGN LKWPSTSPKP IILYTPNDGV SLTVAESAAD FNLLSSKGIY 120
EAVELHPLTP ELITSDDSAS TIALQITLFP NMGFCIGITA HHAVLDGKAT TLFMKSWAYL 180
CNQGYTEHSS LPPELTPFLD RSVIKDVTGL DLDMLYLNQW LASIGSDSGT NNKSLKILPN 240
KGEAPNLVRA TFEITREDFK KLRDRALAQL SDSGKELHLS TFVLTLAYVT RCIVKARGGE 300
DDRNVGVGFT IDCRPRLNPP VPENYFGNCN TITGDLIKAR DFLDENGFGF SVHKCCRITS 360
I 361
Nucleotide Sequence
(Fasta)
GCCCTTATAT ATTTACTTTC TTGATATTAT CATCCCAAAA GGGTCAAAGA AAAGAGCTGT 60
TACACGGCTG TGCCATTGCA ATCGGCCAGC AGAATGAAAA TCCTTGAGTT CACCAGAATC 120
AAACCATCCC CTGATTCACC AAAATCAGCA GCAGAGTTGT CCCTTCCTCT CACTTTCTTT 180
GACATTTTCT GGTTCAAACT CCCACCAGTT GAGCGACTCT TTTTTTACCA AATTAACAAC 240
TCAACCCCTG CGTATTTCAA CTCAGTAATC CTCCCAAAAC TCAAGCACTC TCTTTCTCTC 300
ACTCTCCTCC ATTACCTCCC TCTCGCTGGT AACCTCAAGT GGCCATCAAC TTCGCCTAAA 360
CCCATTATTT TATACACTCC AAATGATGGA GTTTCGCTCA CGGTTGCTGA GTCCGCTGCA 420
GACTTTAACC TTCTCTCAAG CAAGGGAATC TATGAAGCTG TCGAGTTACA TCCTTTGACA 480
CCTGAGCTGA TAACATCAGA TGATTCTGCA TCAACTATAG CTTTGCAAAT AACCCTTTTT 540
CCAAATATGG GGTTTTGCAT CGGAATCACT GCTCACCATG CTGTTCTTGA TGGTAAAGCT 600
ACAACCTTGT TCATGAAATC TTGGGCTTAT TTATGCAATC AAGGTTATAC AGAACACTCT 660
TCCTTGCCGC CAGAGCTAAC CCCATTTCTT GACAGGAGTG TTATCAAAGA TGTTACGGGG 720
CTTGACCTTG ATATGCTATA CTTGAACCAG TGGCTAGCTA GTATCGGGTC AGATTCAGGT 780
ACTAATAACA AAAGCTTGAA GATTTTGCCA AACAAAGGAG AAGCTCCTAA TTTAGTTCGA 840
GCAACGTTTG AGATTACTCG TGAAGATTTC AAGAAATTAA GAGACAGGGC ATTGGCTCAA 900
TTATCAGATA GTGGGAAAGA GCTTCATCTA TCAACTTTTG TGCTTACGCT TGCTTATGTA 960
ACACGTTGCA TAGTTAAAGC AAGAGGTGGA GAAGATGATA GAAATGTTGG TGTTGGATTC 1020
ACCATTGACT GCAGGCCGCG TTTAAACCCT CCTGTCCCCG AGAATTACTT TGGAAACTGC 1080
AATACAATTA CAGGAGACTT AATCAAAGCA AGAGATTTCT TGGACGAAAA CGGGTTTGGT 1140
TTTTCTGTTC ATAAGTGTTG CAGGATCACC TCGATTTGAA GTTTATGGTT CAGATTTTGG 1200
TTGGGGAAAA CCATTGAAGG TGGTAGTTGT TTCCATTGAC AAGAACGAAG CTATTTCCAT 1260
GGCTGAGAGC AGAGATGGGA GCAGGGGAGT CGAGGTTGGT TTGGCTTTGA AAAAGCATGA 1320
AATGGAGTTT TTTTCCTCTT TGTTTCTAAA AGATGTTTAA CAAGATGTTG AAATGTGGAA 1380
TAAAATGGAA ATAGTGAGTA CTTATTTTTA AAAAA 1416
Sequence Source Ensembl
Orthology
Created Date 25-Jun-2016