WERAM Information


Tag Content
WERAM ID WERAM-Thc-0046
Ensembl Protein ID EOY03628
Gene Name TCM_018720
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
TCM_018720 EOY03628 EOY03628
TCM_018720 EOY03629 EOY03629
TCM_018720 EOY03630 EOY03630
TCM_018720 EOY03631 EOY03631
Status Unreviewed
Classification
Type Family E-value Score Start End
Me_Reader PHD 8.90e-08 33.3 671 717
Organism Theobroma cacao
Domain Profile
  Me_Reader PHD

   PHD.txt   3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCk 51 
+C +C d++ ek + C++C++ fH++C+++ ++++p+ swyC+ C+
EOY03628 671 VCSIC-LDGKVEKVLYRCQGCQRFFHADCMGVREQEVPNR-SWYCQFCV 717
7****.55555566999**********************9.*******8 PP

Protein Sequence
(Fasta)
MSNPSSTSGV SDQPGPGLMG LAHRGIGLSN TIHSEVAQCL PLPSLPVFCG ASDPELLLFD 60
DPTGGASRSL NRPEIIAQSS RIADLLRETD VSYLNLRDEA SSATYDYVEP LELHVQVLQY 120
NPAAFEYVTP GLVKEQVSGG AVFERKPPES SFPHISQFQR DISSTYNQQT DVIANDAPKS 180
SSRKPKGKKK AANDVGSSVR PDPTELQDAI IGHFREMLED FCGRAQIPSD DRDETEWLSL 240
PVNDVRMLVN EIMSIRTKRL LHLVPVDILV KLLRVLDHQI HRAEGLSVDE CEHQDSDVFS 300
SVFCALESIH ASLAVMAHND MPKQLYHEEI IERILEFSRH QIMDVMSAYD PSYRALHKPS 360
ENGAVEDDED EELDAELGSA SKKRRSTKSV KAKKSALNKV SGAVNAILQK LCTILGLLKD 420
LLLIEKLSDS CVLQLLKTSF TTFLVDNIQL LQLKAIGLIT GIFYSYTQHR TYIIDEMVQL 480
LWKLPFSKRA LRAYHLPDEE QRQIQMVTAL LIQLVHGSAN LPEALKQTSS GSPILEVSVD 540
DSYLTKCHES VQDTCCHFWT RVLQRLASVK TQDASELKVM IENLVADLLT TLNLPEYPAA 600
APALEVLCVL LLQNAGLKSK DISARAMAID LVGTIAARLK HDSLLCRKDK FWISEELLSG 660
DNDHESYPNG VCSICLDGKV EKVLYRCQGC QRFFHADCMG VREQEVPNRS WYCQFCVCKK 720
QLLVLQSYCE SQYQDNENKN YGRSERSESS DPITKVEIVQ QMLLNYLQDA ASIDDIHLFV 780
RWCYLCLWYK DGPKSQQNFK YYLARLRSKA IVRDSGTVSS LLIRDSVKKI ALALGQNNSF 840
SRGFDKILYL LLVSLRENSP VIRAKALRAV SIIVEADPEV LGDKRVQVAV EGRFCDSAIS 900
VREAALELVG RHIASHPDVG LKYFEKVAER IKDTGVSVRK RAIKIIRDMC NANPNFSGFT 960
SACIEIISRV SDDESSIQDL VCKTFYEFWF EEPSGLQTQY PGDGSSVPLE VAKKTEQIVE 1020
MLRRLPNHQF LVTVIKRNLV LDFFPQSAKA AGINPVSLAA VRRRCELMCK CLLEKILQVE 1080
EMSNVEAEVP TLPYVLALHA FCVVDPSLCM PASDPSQFVI TLQPYLKSQV DNRVVAQLLE 1140
SIIFIIDAVV PLMRKLPPSV IEELKQDLKH MIVRHSFLTV VHACIKCLCS VTKKAGNGGT 1200
VVEYLIQLFF KLLDSQATDN KQQVGRSLFC LGLLIRYGNS LFSGPTNKNI DVASSLSLFK 1260
KYLLMDDFSI KVRSLQALGF ALIARPEYML EKDIGKILEA ALAPSSNVRL KMQVLQNLLE 1320
YLLDAESQMG TDKAGNDAVH YSVEGGGSVP VAAGAGDTNI CGGIVQLYWD NILGRCLDFN 1380
EEVRQSALKI VEVVLRQGLV HPITCVPYLI ALETDPLEVN QKLAHHLLMN MNEKYPAFFE 1440
SRLGDGLQMS FIFMRSISGN ARENLNEKSQ SKFSGNLKGK SDAGSLTQAR LGVSRIYKLI 1500
RGNRVARNKF MSSIVRKFDN PSWNDSVVPF LMYCTETLAL LPFSSPDEPL YLIYAINRVI 1560
QVRAGALEAN MKALSSNLLK ADAQKTTNEN GTVQLDHSRA VFNYMATVDL NGTIQEEAVV 1620
QPALYHMTSI DLNGAIQQKL THESISHYTP AVETTMHKMN HSETHTLSEE DMQKIQADCL 1680
AATALQLLMK LKRHLKIVYS LNDQRCQAFS PNEPIKPGDV LTRQNIPFDI SETHTSLPCT 1740
YQELVQRYQE FKNALREDSI DYSIFTANIK RKRPNPRRGG KAMRMTGGDE DDDYDDEDWK 1800
GGVRRLSNSG RKSYGSRGSR QRW 1823
Nucleotide Sequence
(Fasta)
GACGATTAAG AAATTGAAAA TCGAACATAG AGCTTCTCCA CCAAGCTAGA TAGTAGGAAA 60
ATAGAGTCCA CCAAGGGCAT TTTAGTAACC TAAGTAATAT CCCCCATTCC ATTTTTAAGC 120
ATGCTATAAG TAAAATTCAT CGAAAAAAAT CCCCCCAAAA ACTTTCACAC TTTTTCCCCG 180
CCATTTGCTT CTCTCTTCTT CCCAATCTTT TCTTAGGGCT CAAAAAGCTC TTCAACTTCT 240
CCAAGGAAAC CCTAATTCCT TTCAAATTAT CCCCAGATTT CTACATTATT ACGAAACTAC 300
CACTCCCTTC TACACTCTTC ATGCTCTCTC TCTGCAGCAA ATCCCTCGCT CGCCGATTAA 360
ACAATCGATG AGCAATCCAA GCAGCACGTC CGGGGTTTCG GACCAACCGG GTCCGGGACT 420
GATGGGTTTG GCCCACCGTG GTATCGGCCT TTCCAACACC ATCCACTCCG AGGTCGCGCA 480
GTGCTTGCCT CTTCCTTCGT TGCCCGTTTT CTGTGGCGCT TCGGATCCGG AGCTCCTACT 540
ATTCGACGAC CCGACTGGTG GAGCTTCCAG GTCTTTGAAT CGACCGGAAA TTATTGCACA 600
GTCTAGTCGA ATCGCCGATC TGCTTCGCGA AACTGATGTT TCATATCTGA ATCTTAGAGA 660
TGAGGCGAGT TCAGCTACAT ATGACTATGT GGAGCCCTTG GAACTTCATG TACAAGTTCT 720
CCAATACAAC CCTGCAGCAT TTGAGTATGT CACTCCTGGT CTTGTTAAGG AGCAAGTCTC 780
TGGTGGTGCA GTGTTTGAAA GGAAGCCACC CGAGTCAAGT TTTCCTCATA TTAGTCAATT 840
CCAGAGAGAC ATTAGCAGCA CTTACAATCA GCAGACTGAT GTCATAGCTA ATGATGCACC 900
GAAATCTTCT TCCAGGAAGC CAAAAGGCAA GAAAAAAGCT GCTAATGATG TTGGCTCATC 960
GGTTCGACCA GATCCTACAG AGCTTCAAGA TGCCATCATT GGGCACTTCC GTGAGATGCT 1020
AGAGGACTTT TGTGGCAGAG CTCAAATTCC CAGTGATGAT CGGGATGAGA CAGAATGGTT 1080
GTCATTGCCT GTCAACGATG TTAGAATGCT TGTAAATGAA ATTATGTCTA TACGTACAAA 1140
GAGACTTCTA CATTTGGTTC CTGTAGATAT CCTTGTGAAA TTATTACGGG TTCTAGATCA 1200
TCAGATACAT CGAGCAGAAG GTTTGTCAGT TGATGAATGT GAGCATCAAG ACTCAGATGT 1260
ATTCTCTTCA GTTTTTTGTG CCCTGGAGTC CATTCATGCT TCTTTGGCAG TAATGGCACA 1320
TAATGACATG CCAAAGCAAT TATATCATGA AGAGATCATT GAAAGGATTT TAGAGTTCTC 1380
CAGGCACCAG ATAATGGATG TTATGTCAGC TTATGATCCA TCATATCGTG CCTTGCATAA 1440
ACCAAGTGAA AATGGAGCAG TTGAAGATGA TGAAGACGAA GAGCTTGATG CTGAACTAGG 1500
TTCTGCTAGC AAGAAACGAC GTAGTACTAA GAGTGTTAAA GCCAAGAAAT CAGCATTGAA 1560
CAAGGTCTCT GGTGCTGTGA ATGCTATACT ACAAAAACTC TGCACTATTC TTGGTTTACT 1620
CAAGGACTTG TTGTTGATTG AGAAATTATC TGATAGTTGT GTTCTACAAT TGCTGAAGAC 1680
AAGCTTTACT ACTTTTTTGG TGGATAACAT ACAGCTCTTG CAACTCAAAG CAATTGGTTT 1740
GATAACTGGG ATATTCTACT CTTACACCCA ACATAGAACA TATATAATAG ATGAAATGGT 1800
TCAGCTGCTA TGGAAGTTAC CATTTTCAAA GCGAGCATTA AGAGCATATC ACCTACCTGA 1860
TGAAGAACAG AGGCAGATCC AGATGGTTAC GGCTCTGCTG ATTCAGTTGG TCCATGGCAG 1920
TGCAAACCTT CCTGAAGCTT TAAAGCAAAC ATCAAGTGGG AGTCCGATCT TGGAAGTCTC 1980
AGTTGATGAT AGTTATTTAA CCAAATGTCA TGAATCTGTC CAGGATACAT GCTGTCATTT 2040
CTGGACCCGT GTCCTTCAGC GCCTTGCTTC TGTAAAGACT CAAGATGCCT CTGAGTTGAA 2100
AGTGATGATT GAGAATCTAG TCGCTGATTT ACTGACAACA TTAAATCTAC CAGAATATCC 2160
TGCTGCAGCT CCTGCTTTGG AGGTTCTTTG TGTTTTATTG CTCCAAAATG CCGGTCTTAA 2220
ATCCAAGGAC ATCTCTGCAC GTGCAATGGC AATTGATCTT GTTGGCACAA TAGCAGCAAG 2280
GTTGAAGCAT GATTCTCTCC TCTGTAGGAA GGACAAGTTC TGGATATCCG AAGAATTGCT 2340
TAGTGGGGAT AACGATCACG AAAGTTACCC AAATGGTGTA TGTTCAATTT GTTTGGATGG 2400
AAAGGTAGAA AAAGTGTTGT ATAGGTGCCA AGGTTGTCAA AGATTTTTCC ATGCTGATTG 2460
TATGGGGGTA AGAGAACAAG AAGTTCCTAA TCGTAGTTGG TACTGCCAGT TTTGTGTCTG 2520
TAAGAAGCAA CTTCTTGTGT TGCAATCATA TTGTGAGTCA CAGTACCAGG ATAATGAGAA 2580
TAAGAATTAT GGTCGCTCAG AAAGGTCTGA ATCTTCTGAT CCAATTACGA AAGTTGAAAT 2640
TGTTCAGCAA ATGCTTTTGA ATTATCTTCA AGATGCTGCT TCTATTGATG ATATCCATCT 2700
CTTTGTTCGA TGGTGTTATC TATGCTTGTG GTATAAGGAT GGCCCCAAAT CTCAACAAAA 2760
TTTCAAGTAC TACCTTGCTA GACTGAGATC AAAGGCAATA GTGCGTGACT CAGGGACCGT 2820
TTCTTCACTG TTGATAAGGG ATTCGGTCAA GAAGATTGCT TTGGCACTGG GACAAAATAA 2880
TTCTTTCTCT AGAGGGTTTG ACAAGATTCT TTACTTGCTT CTGGTTAGCT TAAGAGAGAA 2940
CTCTCCTGTA ATTAGGGCTA AGGCTTTACG AGCAGTTAGT ATTATTGTAG AAGCTGATCC 3000
AGAGGTATTA GGTGACAAAC GTGTTCAAGT GGCTGTTGAG GGAAGGTTTT GTGACTCTGC 3060
AATATCTGTC AGAGAAGCAG CACTGGAACT TGTTGGCAGA CATATTGCTT CACATCCTGA 3120
TGTTGGTTTA AAGTACTTTG AGAAGGTGGC AGAGAGGATT AAAGACACTG GAGTCAGTGT 3180
GCGGAAACGA GCAATCAAAA TTATTCGAGA TATGTGCAAT GCGAATCCCA ACTTCTCAGG 3240
ATTTACAAGT GCTTGCATCG AGATTATTTC TCGTGTTAGT GATGATGAAT CAAGCATTCA 3300
GGATCTTGTC TGTAAGACAT TTTATGAGTT CTGGTTTGAG GAACCTTCTG GACTGCAGAC 3360
TCAGTATCCT GGAGATGGTA GTTCTGTTCC ATTGGAGGTG GCTAAGAAGA CCGAGCAGAT 3420
CGTTGAAATG CTAAGGCGGT TGCCTAATCA CCAGTTTCTT GTAACTGTGA TTAAGCGTAA 3480
CTTGGTCCTC GATTTTTTCC CTCAATCAGC GAAAGCTGCT GGAATCAACC CTGTCTCACT 3540
TGCAGCGGTA CGTAGGCGAT GTGAGTTGAT GTGCAAGTGC TTACTGGAAA AAATATTGCA 3600
AGTGGAGGAA ATGAGTAATG TGGAAGCAGA GGTTCCTACA CTTCCCTATG TGCTGGCCTT 3660
GCATGCTTTT TGTGTAGTTG ACCCATCACT TTGCATGCCA GCTTCTGATC CTTCCCAATT 3720
TGTGATTACT CTACAGCCGT ATCTTAAGAG TCAGGTTGAT AACAGAGTTG TTGCACAGTT 3780
ACTGGAGAGT ATAATCTTTA TAATTGATGC TGTTGTGCCT TTGATGCGGA AGTTGCCTCC 3840
TAGTGTTATC GAAGAACTAA AGCAGGACTT GAAGCACATG ATTGTCCGGC ATTCTTTTTT 3900
GACTGTTGTT CATGCTTGCA TCAAGTGTCT TTGTTCTGTG ACTAAAAAGG CTGGGAATGG 3960
TGGTACTGTT GTTGAGTACC TCATTCAGTT ATTTTTCAAA CTATTGGATT CCCAAGCAAC 4020
TGATAACAAG CAGCAAGTGG GGCGTTCGCT CTTCTGTCTT GGATTGCTAA TCCGCTATGG 4080
AAACTCTTTA TTTAGTGGTC CCACTAACAA AAATATTGAT GTTGCCAGCA GTCTTAGTTT 4140
GTTTAAAAAA TATCTTCTAA TGGATGATTT TAGTATAAAG GTTAGATCTC TGCAGGCATT 4200
AGGCTTTGCT CTAATTGCTA GGCCTGAATA TATGTTGGAA AAAGACATTG GGAAGATATT 4260
AGAGGCAGCA TTAGCACCAA GTTCTAATGT TCGTCTTAAG ATGCAAGTGT TGCAAAATTT 4320
GTTGGAATAT CTTCTTGATG CGGAAAGTCA AATGGGAACG GATAAAGCCG GTAATGATGC 4380
AGTTCATTAT TCTGTAGAAG GTGGCGGTAG TGTCCCTGTA GCTGCAGGTG CTGGTGATAC 4440
TAACATTTGT GGGGGTATAG TCCAGTTGTA CTGGGATAAT ATTCTGGGGA GATGCTTGGA 4500
CTTTAATGAA GAAGTTCGCC AATCTGCCCT AAAGATAGTG GAAGTTGTGC TTCGTCAAGG 4560
TCTTGTTCAT CCTATTACTT GTGTGCCATA CCTTATAGCT CTTGAAACAG ATCCTCTGGA 4620
AGTCAACCAA AAGTTGGCTC ATCATTTGCT AATGAATATG AATGAGAAAT ATCCTGCTTT 4680
TTTCGAAAGC CGTTTAGGAG ATGGCCTTCA GATGTCATTT ATCTTCATGC GTTCCATTAG 4740
TGGCAATGCC CGTGAAAATC TAAATGAAAA ATCCCAATCC AAGTTTTCTG GAAATTTGAA 4800
AGGGAAATCT GATGCTGGGT CTTTAACACA AGCAAGGCTG GGAGTTTCCA GAATTTACAA 4860
GCTCATTCGT GGAAATCGGG TTGCTAGAAA CAAATTTATG TCCTCAATTG TGCGCAAATT 4920
TGATAATCCT AGCTGGAATG ATTCAGTTGT GCCTTTCTTG ATGTATTGTA CAGAAACTCT 4980
TGCTTTGTTA CCATTCTCAT CTCCTGATGA ACCGCTCTAT TTGATCTATG CTATAAATCG 5040
AGTAATACAA GTTAGAGCTG GGGCACTTGA GGCAAATATG AAAGCCTTGA GTTCAAATTT 5100
GCTAAAGGCA GATGCTCAGA AGACAACTAA TGAAAATGGG ACTGTTCAAC TGGATCATAG 5160
TCGAGCTGTT TTCAATTATA TGGCTACAGT TGATTTGAAT GGAACAATTC AGGAGGAGGC 5220
TGTGGTTCAG CCTGCTCTCT ATCACATGAC ATCCATTGAT TTGAATGGTG CAATCCAACA 5280
AAAGCTCACT CATGAGTCTA TTTCACATTA TACTCCTGCA GTGGAGACAA CAATGCATAA 5340
GATGAACCAT TCTGAAACTC ATACTCTCTC CGAAGAGGAT ATGCAAAAAA TCCAGGCCGA 5400
CTGTCTTGCT GCTACTGCAC TACAGCTTCT CATGAAGCTG AAAAGACACC TAAAAATTGT 5460
TTATAGCCTG AATGATCAAA GATGCCAGGC ATTTTCTCCA AATGAACCTA TAAAGCCTGG 5520
GGACGTTCTC ACGAGGCAGA ACATTCCGTT TGACATCAGT GAAACACACA CTAGCCTGCC 5580
TTGCACTTAT CAAGAATTGG TGCAGAGATA TCAGGAATTT AAAAACGCAT TGAGGGAAGA 5640
TAGTATTGAT TACTCAATTT TCACAGCAAA CATCAAAAGG AAGCGCCCAA ATCCCAGGAG 5700
AGGAGGGAAA GCAATGCGCA TGACTGGTGG GGATGAAGAT GATGATTATG ATGATGAAGA 5760
CTGGAAAGGT GGTGTACGGA GACTGAGTAA TAGTGGAAGG AAAAGTTACG GCAGCAGAGG 5820
CAGTAGGCAG CGATGGTAGA TGTAAATACA ATGACAGGTA GGCTAACATG TACATTAGGT 5880
TGAAGGAAAA GAATAGAAAT TAGGATTAGG ATAGGTTAAG AGCAAAAAGG ATAGGGAGAA 5940
TAGGAAAGAG AGGAAAGCAA GCAGGCAGGC AGGGGGGATG CATTTTAATT TTTGTTGTAG 6000
CTGTTTCCAT TTGTAATATT GATTTGGCTC TAGAGTCAGA ATGTGCATTC TTTCCTTTTA 6060
ACCACTTGGT ATTTTAGACA GATTTGTAAC CTTTGTTCGA TAGGAAACTC TTATGAGATA 6120
TACAGAAAAA AGTAATGGAA ATATACTGCC TTTACTATTC AGATTGGATC TTGAAAATAA 6180
TACCAAGGGA AATACATTGC TTGATTTATG ATCGGGTGGT CTCATATGTA TGACAGGATT 6240
AAAGGGTGTG TTCCAAAGTA AATGAGCAAA AATAATGTAA CCAGTTTAAT TTATGATTCA 6300
GGGTTACATG TATAAATTAT TCTTATGCTT 6331
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Sot-0081 PGSC0003DMT400067527 Solanum tuberosum 42 1e-05 51.2
Created Date 25-Jun-2016