WERAM Information


Tag Content
WERAM ID WERAM-Hos-0130
Ensembl Protein ID ENSP00000259021.4
Uniprot Accession O95251; KAT7_HUMAN; B3KN74; B4DF85; B4DFB4; B4DFE0; B4DGY4; E7ER15; G5E9K7
Genbank Protein ID NP_001186084.1; NP_001186085.1; NP_001186086.1; NP_001186087.1; NP_008998.1
Protein Name Histone acetyltransferase KAT7
Genbank Nucleotide ID NM_001199155.1; NM_001199156.1; NM_001199157.1; NM_001199158.1; NM_007067.4
Gene Name KAT7;HBO1;HBOa;MYST2
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000136504.11 ENST00000259021.8 ENSP00000259021.4
ENSG00000136504.11 ENST00000424009.6 ENSP00000398961.2
ENSG00000136504.11 ENST00000435742.2 ENSP00000409477.2
ENSG00000136504.11 ENST00000454930.6 ENSP00000413415.2
ENSG00000136504.11 ENST00000510819.5 ENSP00000423385.1
ENSG00000136504.11 ENST00000509773.5 ENSP00000424577.1
Details
Type Family Domain Substrates AA References (PMIDs)
HAT MYST MYST-type HAT H3K14; H4K5; H4K8; H4K12; H3K23 K 24125069; 26620551
Status Reviewed
Classification
Type Family E-value Score Start End
HAT MYST 4.10e-130 435.3 335 607
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Component of the HBO1 complex, which has histone H4-specific acetyltransferase activity and reduced activity toward histone H3, and which is responsible for the bulk of histone H4 acetylation in vivo. Through chromatin acetylation, it may regulate DNA replication and act as a coactivator of TP53-dependent transcription. Specifically represses AR-mediated transcription.
Domain Profile
  HAT MYST

           MYST.txt   4 kniekielgkyeiktwysspfpeelkklkklyicefclkytkskeslkrhlkkCklrkpPgneiyrkdklsvfevdGkkqklycqnlCllak 95 
++i++i++g+ye++twy+sp+pee+++l ly+cefclky+ks+++l+rh++kC +++pPg+eiyrk+++svfevdGkk+k+ycqnlCllak
ENSP00000259021.4 335 NMIKTIAFGRYELDTWYHSPYPEEYARLGRLYMCEFCLKYMKSQTILRRHMAKCVWKHPPGDEIYRKGSISVFEVDGKKNKIYCQNLCLLAK 426
689***************************************************************************************** PP
MYST.txt 96 lfldhktlyydvepflFYvltetdekgahlvGyFskekesaekynlaCilvlPpyqrkGyGklLiefsYelsrkegkigsPekPLsdlglls 187
lfldhktlyydvepflFYv+te+d++g+hl+GyFskek+s +yn++Cil++P+y+r+GyGk+Li+fsY+ls++e+k+gsPe+PLsdlgl+s
ENSP00000259021.4 427 LFLDHKTLYYDVEPFLFYVMTEADNTGCHLIGYFSKEKNSFLNYNVSCILTMPQYMRQGYGKMLIDFSYLLSKVEEKVGSPERPLSDLGLIS 518
******************************************************************************************** PP
MYST.txt 188 YrsyWkevllellkekeekkkitikelskatgiaveDivstleslnlikekkgqyillksekiveel.lklakskkkklkvdeeklkwkp 276
YrsyWkevll++l ++ + k+i+ike+s++t++++ Divstl++l+++k++kg++ +lk++++++e+ +k+ak++++++++d+++lkw+p
ENSP00000259021.4 519 YRSYWKEVLLRYL-HNFQGKEISIKEISQETAVNPVDIVSTLQALQMLKYWKGKHLVLKRQDLIDEWiAKEAKRSNSNKTMDPSCLKWTP 607
*************.999*************************************************95566777888999********97 PP

Protein Sequence
MPRRKRNAGS SSDGTEDSDF STDLEHTDSS ESDGTSRRSA RVTRSSARLS QSSQDSSPVR 60
NLQSFGTEEP AYSTRRVTRS QQQPTPVTPK KYPLRQTRSS GSETEQVVDF SDRETKNTAD 120
HDESPPRTPT GNAPSSESDI DISSPNVSHD ESIAKDMSLK DSGSDLSHRP KRRRFHESYN 180
FNMKCPTPGC NSLGHLTGKH ERHFSISGCP LYHNLSADEC KVRAQSRDKQ IEERMLSHRQ 240
DDNNRHATRH QAPTERQLRY KEKVAELRKK RNSGLSKEQK EKYMEHRQTY GNTREPLLEN 300
LTSEYDLDLF RRAQARASED LEKLRLQGQI TEGSNMIKTI AFGRYELDTW YHSPYPEEYA 360
RLGRLYMCEF CLKYMKSQTI LRRHMAKCVW KHPPGDEIYR KGSISVFEVD GKKNKIYCQN 420
LCLLAKLFLD HKTLYYDVEP FLFYVMTEAD NTGCHLIGYF SKEKNSFLNY NVSCILTMPQ 480
YMRQGYGKML IDFSYLLSKV EEKVGSPERP LSDLGLISYR SYWKEVLLRY LHNFQGKEIS 540
IKEISQETAV NPVDIVSTLQ ALQMLKYWKG KHLVLKRQDL IDEWIAKEAK RSNSNKTMDP 600
SCLKWTPPKG T 611
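The numbered, ten-residue block layout above is easy to read but awkward to process. As a minimal sketch, the Python below rebuilds the contiguous protein sequence and cuts out residues 335-607, the MYST domain span listed under Classification. The helper names `parse_blocked_sequence` and `slice_region` are hypothetical and not part of WERAM; the sequence text is copied from this record.

```python
# Protein sequence exactly as printed in this record (blocks of 10, running counter at line end).
RECORD = """\
MPRRKRNAGS SSDGTEDSDF STDLEHTDSS ESDGTSRRSA RVTRSSARLS QSSQDSSPVR 60
NLQSFGTEEP AYSTRRVTRS QQQPTPVTPK KYPLRQTRSS GSETEQVVDF SDRETKNTAD 120
HDESPPRTPT GNAPSSESDI DISSPNVSHD ESIAKDMSLK DSGSDLSHRP KRRRFHESYN 180
FNMKCPTPGC NSLGHLTGKH ERHFSISGCP LYHNLSADEC KVRAQSRDKQ IEERMLSHRQ 240
DDNNRHATRH QAPTERQLRY KEKVAELRKK RNSGLSKEQK EKYMEHRQTY GNTREPLLEN 300
LTSEYDLDLF RRAQARASED LEKLRLQGQI TEGSNMIKTI AFGRYELDTW YHSPYPEEYA 360
RLGRLYMCEF CLKYMKSQTI LRRHMAKCVW KHPPGDEIYR KGSISVFEVD GKKNKIYCQN 420
LCLLAKLFLD HKTLYYDVEP FLFYVMTEAD NTGCHLIGYF SKEKNSFLNY NVSCILTMPQ 480
YMRQGYGKML IDFSYLLSKV EEKVGSPERP LSDLGLISYR SYWKEVLLRY LHNFQGKEIS 540
IKEISQETAV NPVDIVSTLQ ALQMLKYWKG KHLVLKRQDL IDEWIAKEAK RSNSNKTMDP 600
SCLKWTPPKG T 611
"""


def parse_blocked_sequence(text: str) -> str:
    """Join the sequence blocks, dropping the numeric position counter on each line."""
    residues = []
    for line in text.splitlines():
        for token in line.split():
            if not token.isdigit():  # purely numeric tokens are position counters
                residues.append(token)
    return "".join(residues)


def slice_region(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


protein = parse_blocked_sequence(RECORD)
myst_domain = slice_region(protein, 335, 607)  # Start/End from the Classification row

print(len(protein), len(myst_domain))
print(myst_domain[:10], "...", myst_domain[-10:])
```

The 1-based, inclusive slice matches the coordinate convention used in the Classification and Domain Profile sections, so the extracted region can be compared directly with the aligned target lines.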
Nucleotide Sequence
AAAATGGCGC CCACTAGCTT CACCAAAAGG ATCGAACTTC CCAGCACCCT CTCTGGCCCG 60
TAACGTCAGG TGACGCGAGA CCCAGCCGGA AGTGAAGGAA AAAGCGCTTC AGCCCGCGGC 120
GCCTGCGCAG AACGCTCCAG ACGCTGAGAG GCAGGAGGCA CTAGGGATCG TCCGCAGGAT 180
TGGGACTGAT ACAGAGGCCG CCACGGAGCC CGCCGGAGCC ACCGTTCCTG CTGCTGCCGC 240
CGCTGCCCGA ATCGGAACCG TCGGGCCGCA GCCGCCGGCA ATGCCGCGAA GGAAGAGGAA 300
TGCAGGCAGT AGTTCAGATG GAACCGAAGA TTCCGATTTT TCTACAGATC TCGAGCACAC 360
AGACAGTTCA GAAAGTGATG GCACATCCCG ACGATCTGCT CGAGTCACCC GCTCCTCAGC 420
CAGGCTAAGC CAGAGTTCTC AAGATTCCAG TCCTGTTCGA AATCTGCAGT CTTTTGGCAC 480
TGAGGAGCCT GCTTACTCTA CCAGAAGAGT GACCCGTAGT CAGCAGCAGC CTACCCCAGT 540
GACACCGAAA AAATACCCTC TTCGGCAGAC TCGTTCATCT GGTTCAGAAA CTGAGCAAGT 600
GGTTGATTTT TCAGATAGAG AAACTAAAAA TACAGCTGAT CATGATGAGT CACCGCCTCG 660
AACTCCAACT GGAAATGCGC CTTCTTCTGA GTCTGACATA GACATCTCCA GCCCCAATGT 720
ATCTCACGAT GAGAGCATTG CCAAGGACAT GTCCCTGAAG GACTCAGGCA GTGATCTCTC 780
TCATCGCCCC AAGCGCCGTC GCTTCCATGA AAGCTACAAC TTCAATATGA AGTGTCCTAC 840
ACCAGGCTGT AACTCTCTAG GACACCTTAC AGGAAAACAT GAGAGACATT TCTCCATCTC 900
AGGATGCCCA CTGTATCATA ACCTCTCAGC TGACGAATGC AAGGTGAGAG CACAGAGCCG 960
GGATAAGCAG ATAGAAGAAA GGATGCTGTC TCACAGGCAA GATGACAACA ACAGGCATGC 1020
AACCAGGCAC CAGGCACCAA CGGAGAGACA GCTTCGATAT AAGGAAAAAG TGGCTGAACT 1080
CAGGAAGAAA AGAAATTCTG GACTGAGCAA AGAACAGAAA GAGAAATATA TGGAACACAG 1140
ACAGACCTAT GGGAACACAC GGGAACCTCT TTTAGAAAAC CTGACAAGCG AGTATGACTT 1200
GGATCTTTTC CGAAGAGCAC AAGCCCGGGC TTCAGAGGAT TTGGAGAAGT TAAGGCTGCA 1260
AGGCCAAATC ACAGAGGGAA GCAACATGAT TAAAACAATT GCTTTTGGCC GCTATGAGCT 1320
TGATACCTGG TATCATTCTC CATATCCTGA AGAATATGCA CGGCTGGGAC GTCTCTATAT 1380
GTGTGAATTC TGTTTAAAAT ATATGAAGAG CCAAACGATA CTCCGCCGGC ACATGGCCAA 1440
ATGTGTGTGG AAACACCCAC CTGGTGATGA GATATATCGC AAAGGTTCAA TCTCTGTGTT 1500
TGAAGTGGAT GGCAAGAAAA ACAAGATCTA CTGCCAAAAC CTGTGCCTGT TGGCCAAACT 1560
TTTTCTGGAC CACAAGACAT TATATTATGA TGTGGAGCCC TTCCTGTTCT ATGTTATGAC 1620
AGAGGCGGAC AACACTGGCT GTCACCTGAT TGGATATTTT TCTAAGGAAA AGAATTCATT 1680
CCTCAACTAC AACGTCTCCT GTATCCTTAC TATGCCTCAG TACATGAGAC AGGGCTATGG 1740
CAAGATGCTT ATTGATTTCA GTTATTTGCT TTCCAAAGTC GAAGAAAAAG TTGGCTCCCC 1800
AGAACGTCCA CTCTCAGATC TGGGGCTTAT AAGCTATCGC AGTTACTGGA AAGAAGTACT 1860
TCTCCGCTAC CTGCATAATT TTCAAGGCAA AGAGATTTCT ATCAAAGAAA TCAGTCAGGA 1920
GACGGCTGTG AATCCTGTGG ACATTGTCAG CACTCTGCAA GCCCTTCAGA TGCTCAAATA 1980
CTGGAAGGGA AAACACCTAG TTTTAAAGAG ACAGGACCTG ATTGATGAGT GGATAGCCAA 2040
AGAGGCCAAA AGGTCCAACT CCAATAAAAC CATGGATCCC AGCTGCTTAA AATGGACCCC 2100
TCCCAAGGGC ACTTAAAGTG ACCTGTCATT CCGAGCCAGC GAACCCCAGC AGTAGGAATC 2160
CGTACCCTAG GGATCTGTCT GTCATTTCTC TGTTGCTCTT GTGATTGGCA AGTACAGTAT 2220
CCTTTGGGAA GGCCATCCCC CTCAGGACTG TCCTGGCTCC GACCTTTGTG TACACTGCAG 2280
ACGCTGGTTC TGAGGAACTG TTGTTTCGGC CTCAGTGAGG TTGCCTGGAT GGGATCTGTA 2340
TTAGACTTGA GTGCAGGTCT CTCAGCACTG ACCCAAGGAG TTCTGTTATG GTACTGTACC 2400
TGTCCAGTCA CTGGTTCTCT CCTCATGTCC TCTCGCCCCA TGAGGTTGTG TTGTGTCTTC 2460
TAAGCGTGGT ACTAGTGCTT GCCACCTGGT CACCAGACCT CCAAATATGG CTGCCACCAC 2520
CAGGACCTTT CCAGTTACTC CTTATATGTG TGTTCTATGG AGGGGCAGGG AAAAGGTGGC 2580
ACTTGTGAGT GTGTGTGGAT TGGCAGGGGG TCCATTCACT TTGGGTTCCA TCTTGCTTTA 2640
AATTTCTTCA TTTTGATTAA GAGACCTCTT TTTGATCTGT ATTGGGCTAA CCAGAGCCAA 2700
ATACTTTTGA AGAGTTTCCC AGGGACTAGT CATGGTAATA GCATATAATT GATCTGAATG 2760
AGATGGAGAG AAGAATGAAG GGGTGGTGGT TCTGGGTTTG ATTTGAGTTC ACCTGTGGGC 2820
AGTGGGCAGT GGGCAGTGTC TTGGTGAAAG GGAACGGATA CTACTTTTTG CCTCACCGTA 2880
AAGTACTCAC TAGTAAATAT TTCCTTCTCT CTTTACTCCC ACTTTTTACG TTTGCAGGTG 2940
CCAAAGTAAT GTCCACTTTT CCCTTTCATG CTGCATATTA ACTGGTTAAT TATACTGCAG 3000
AAACCTTTTC ACCTCCACTA GTCTGATACA GTACATCTGT ACTTCCATAT ACCTTGCACT 3060
GATTTTGTCT GAGTGCCCTG GGAGAAGTAG AAAATGATTG AAAGTGACTT CCGTATCTCA 3120
GCCCATGACT CAGCAAGGCA GAATGGCCAC CCCTGCCAAA GTTTGCTTCT CTTTTCAACA 3180
GTGCCTCACC CTCCCTCTAG GATTAAAGTG CTTCTGCCCT TCCACGAACT CCTCCTCCAT 3240
TTCCTTTTTG GGATTTGTCA CCATCCTTCT ATTCTCTGGT CTTCTATTTT TGGTGTTGTT 3300
CAAGTGAAGG AAGAGATGTT CCCTCTAATT TCTCTCTAGC CCATTATAAC CTGCTATCTT 3360
GGGGCAACTT TTGATGTATG ACATGTCACC CTTCCCAACT TGGTCTCCTC CAACATGCTG 3420
TCTTCATGTG GAGCCCTCAC CACAATCCCT GACTCCGGTC ATTTGTGCCT TTCTCTTGTC 3480
ATCTCTGTAC ACTACTTATA TTCACTGTGG GTTGGGGGAG CTAATTTTAA GCATGTTCAG 3540
TGGCAGCTCC CCTCCAGTTT CAGTGTCACT GTTAAAATTT ATCAAAAAGC AACTTCACTA 3600
GGGGTTTTCT TAAGGGATAA AGGCCTTTTA CAGAAGCTAA ACCCTTCCCC ACATGTGGTA 3660
GAATGTGCTC TTCTATATCT ACTCCTCAAT AAAGCATGTT CTCTGCTCAA GTCTGTTTCA 3720
TCTGGGGGCT CTCATTTATA TATGAAAATG ATGCACACGA TCTGCTACTA ATAGTAAATG 3780
CACTTGGGAT TTGCTTTCCC TAGCAGTAAA CTGTTGAGGG ATGTGGTTTG TGGCTATGGA 3840
ATGTTTTTCC CTGTGATACA GGCTGTCTGT AAAGATCAAG GGAGTGCTCA CTCTGAACTT 3900
CTCTAGATGG TGGCACAAAT TTGATCTGCC TCACTTTGGT TCCAGCTAAT CAGTATACGT 3960
AGCAATGATT AGTCAGTATT ACCCATTCTT TCACTAAGTG CCATTTTCCA CTGATTTTAG 4020
GGGCAAAGGA ACCAATAGGA AATTAGGATA TATGGGGGTA CAGTTGATGC CTGTAGGAGA 4080
TGGGAACAGA CATTCCTTCT CATCTCCAAG CTCATTCACC AGTATTGAGC AGTGTCACCT 4140
CTAATTATTG ACTCTCTCGC AGGTTGAAAT TATTCTTTTT GAAAATAGCT GCATTTTCAT 4200
GTAAGATATA CCCAGCACAG GAAAAGGGTG GCTGAGCACT AACCTCCGTA TGGTGGAAAG 4260
GAGGAGGCTG GGAATTGTAT GTGCTGGAAT GGTTTCACTC ACTGTGACCA GTAGTGGTGA 4320
GAACCCATAC AGTTGAAGTT TTTTGCACAG TCCTGATCCC AGGTCTCCAC TCGCTTTGCC 4380
ATCCCACTTT ACTCCCTAAA AATAAAAGGA TTTATTATCT CATTTAAACC CCCACAGGTG 4440
TGGAAACAGA GTTTCACTTG CCTTGGCAAC TTTGCATGAG ACTATCCCAT TTCATTCCGT 4500
TTTTTTTTTT TTGAGTCAGA GTCTGGCTCT GTTGCCCAGG TTGGAGTGCA GTGGCGCAGT 4560
TTTGGCTCAC AACCTCTGCC TCCCGGGTTC AAGTGATTCT TCTGTCTCAG CCTTCCGAAT 4620
AGCTGGGATT ACAGGTGCCT GTCACCATGC CCAGCTAATT TTTGTATTTT TAGTAGAGAC 4680
AGGGTTTCGT CATGTTGGTC AGGCTGATCT CGAACTCCTG ACCTCAGGTG ATCCGCCCAC 4740
CTTGGCCTCC CAAAGTGCTG GGATTACAGG CGTGAGCCAC TGCACCCGAC CTATTTTTTT 4800
TTTTTTTTTT TTTTTTTTTT TTAAAAAAAG ACAGTCTCAC TCTATCATCC AGTCCGGAAT 4860
GCAGTGGCAT GATCTCAGCT CACTGCAATG TCTGCCTCCT GGATTCCAGT GATTCTCCTG 4920
CCTCAGCCTC TCAAGTAGCT GGGATTACAG GTGCAGGCCA CCTGGCTAAT TTTTGTATGT 4980
TTAGTAGAGA CAGGGTTTTG CCATGTTGGC CAGGCCAGTC TCAAACTCTT GACCTCAAGT 5040
GATCACCCGC CTCATCCTCC CAAAGTGCTG GGATTACAGC CGTGAGCCTC TGCACCCAGC 5100
TTTTAACTCC CTCTTATCTG CATAACAGAA GCTTAGCTGC TTAAGCTCCT TTATTAGAAG 5160
AGCAAAAGTC TGAAATTATT CCTGAAACCT GCTCAATGGA AGTACCTACT CTATTGGTTG 5220
CTTCCCATAT GGTTGTCACT GTACCTTCAT ACTGCCTCAT TTGACCCTCA TATTAGCCCT 5280
GTACAGTAGA TGGGTACACT GGTTTGCCAA AGGAGACCTG GAATCCAAGG TGGAAGTAAG 5340
CAGCAAAGCC AGAAACTTCA ATTCTGGTCT GTCTACCTTG ATAGCCTGCA CCCTCCCCTC 5400
TACCGTTTTC TTCCACTATT TTTGATTCCT TAATGATGAA TCATCCTCTC CCTTCTAGTT 5460
GGATTTGTTT CTAATGGCTT CCATTACAAG GATAATAATG AAACTGGTGA AAACTTTCAG 5520
GCAAAAGGAT TTTCTTTTTA TATTTTTTCT TATTATTTTT TAATTATTAA CCAAATTAAC 5580
TCATTACAGT AAAAAGGACT GATTTTTAAG CCAGCTGTGA TAGCTCTGTA ATAGTCTGTA 5640
ATCTCAGCAC TTTGGGAGGC CAAGGCGGGC AGATCGCTTG AGTCCAGGAA TTCGAGACTA 5700
GCCTGGGCAG CATGGTGAAA CCCCAGCTCT ACAAAAAATA GAAAAATCAG ACGTGGGCAC 5760
ATGCCTGTAG TCTCAGCTAC TTGGGAGGCT GAGGCACGAG AATCGCCTGA ACCTGGGAGG 5820
CAGAAGTTGC AATGAGCTGA GATGATGCCA CTGCACTCCA GCCTGGGTGA CAGAGTGAGA 5880
CCCTGTCTCA AAAACAAAAA ACAGAATTGA TTGATGTTAG TTGGCTTTAG AAGCAGCAAG 5940
TTTAGGGGGC TACAGAGCTA AACCAGGAAG CAAAAGATGT GCCTCATTCT GGCATTGTTT 6000
CTGATTTAGG AATAAACTGT TCAGTAAGCA CTGTCCCTTT ACTTCCATGG TTTTCTTCAT 6060
TCCTCACCAC AGCACAGTAA GGTGGATATT ATAGTCTTCT TCTAGATGAA AAATTGAGGC 6120
TCATAGTGGT CTTGCTGCTG TGTCATAGCA ATAGAATGAG AGAGCCTTGC TTCCCTGAGT 6180
CCAAATCCCA TACTTTTGGC ATTGTTATGA GGTCTGGTCA CCTGATGCTT CCATGCTATT 6240
TTCCCATTTC TTATCTGGGG ATAATGAGTC ATATTAAGTA ATTTTTTTTT TTGAGACGGA 6300
GTTTCGTTCT GTCACCCAGG CTGGAGTGCA GTGGTGCGAT CTTGGCTCAC TGCAAGCTCT 6360
GCCTCCCGGG TTCATGCCAT TCTTCTGCTT CAGTCTCCCG AGTAGCTGGG ACTACAGGTG 6420
CCCACCACCA CGCCCAGCTA ATTTTTTGTA TTTTTAGTAG AATGAGGTTT CACCGTGTTA 6480
GCCAGGATGA TCTCGATCTC CTGACCTCGT GATCCACTCG CCTCAGCCTC CCAAAGTGCT 6540
GGGATTACAG GCGTGAGCCA TTGCACCCAG CCATTTTTTT TTTTTTTAAG ACGACGTCTC 6600
ACTCTGTCAC CTATGCTGGA GTGCAGTGGC GTGATCTAGG CTCATTGCAA CCTCTGCCTC 6660
CCAGGTTCAA GCGATTTTCC TGCCTCAGCC TCCCAAGTAG CTGGGATTAC AGGTGCCCAC 6720
CACCTCGCCT GGCTAATTTT TGTATTTTTA GTAGAGATGA GGTTTTGCCC TGTTGGCTAG 6780
GTTGGTCTTG AACTCCTGAC CTCAGGTGAT CCACTCACCT CAGCCTCCCA AAGTGCTGGG 6840
ATTACAGGCA GGAGCCACTG CGCCCAGCCA AGTAACTTTT AACAGTGTGG TATAACCTTT 6900
AAATGACAAG GTGATGCTTT TGACTTGTCC TCAACTTTGA TTTGTACTGA TTTGTCCCTA 6960
TAGTTCTGGG TGGGGTGGGT CAAAACAAAG TCTCGAGCTG TACCAGGATC AAGCAGCACA 7020
GCTCAGCCAT GATCCTTTTA CCACTTTTTT CTTCTGTCCT TGAGACTCTA ATTAAAGCAC 7080
TGGATTTTTA AAAATCACCC TTGTAAATAT GCACACATTT GTCTATAGTT GAGGAAATTG 7140
TGCCGTTGAA GTCCATTCTT GGACATGGAG TTAAGAAACC CTGGTTTGAG AAAAAGCCCC 7200
AGTGAGACAG CAGGAATCCT TTTACCATAC AACCCTCAAC TAGTTTAGTG TGCTCAAGCT 7260
CAAATAACCA ATCCCATCAA GTGAAAAGAA TGGCAGCAGG GAGAAGGCCT GGCTCACTGA 7320
GGCTCTCAGC ATTAGTTTCC TCTACCTCTT GTGTCTCACA GGTGCACATA TGTACAGCAT 7380
ATCAAAGTGT TGAATGTCAT GAGAATAAAA TATGAAAACT ACTTTGCTGA ATGATAGTAT 7440
GTGATGTGTG CTAGGACTTC TAGAAGCCAC CCTTTGCTTT GCTGTTCATT GGGATCATGG 7500
AATCGGACCT CAGCTGGTTT TGCCTCAGCA CTTTCTTTCA CAAAATTATG TGTGACTGCC 7560
TCCTCCAGAC TGTTTCCTGC TGATAGGGGC AGTTTAATAG CCTTCTTCCT GTGTGGTATC 7620
TGCAACAAAA TCCCAATGAA TGTCACCAAG AAGGAAACAA AGGATTGCCC AGCGATGAGA 7680
AATGTCCCTG GTGCCAAAAC ATCAGTTTGC CCCTAACCTC TTGTGCAATA CCTTTAAGTC 7740
CAGGTCATGT TGTTACCATT TGGGGGTTTG CGGATTTGTT TACTTGTGCC CAAGAATGGA 7800
GAAAATAACC TGTACTATTG TACAACTCTG GCTCCATGGC TCCTCACAAA TGTTCCATGT 7860
GAGATATAAA CATCTTTATC CTCGACAAGT CATGTTCATT CCAAGAAACC AGTCTTTGTT 7920
CTTAATTGGA CATTTGTTTC TGCAAACAGC TTACCATACA TTCAATTCCA AAGTTATCAG 7980
AAACCTACAC TCTTATCTCA CAAATTTAGA GGTGTGGTAG ATCATCTCCA AAGATGGCCA 8040
CCAACAGTTG CTCTCATCCT CTGTGCACGT GCTATTTCCA ATAAGGTCTA TTTTTTTCTA 8100
CAGTGAGGGC TGAACTTGTG ACTTGTTTTG ACTAATAGGA TATGGAAGTG ATATTTTGGC 8160
AGCTTCCACT TTTGCTCTTG AAAATAAGCT GACACGTTTC CAAACGAGAA GCTTGGGCTA 8220
AATTACTGAA TGGTGAGAGA CGATGCATAG GAGAACCAAG GTGCTCTAGT CACAGCACTA 8280
AAAGCCCAGA CTTGTGTCTC TTGAAGGTTT CAACCCCGCC AGACTCCCAG CTGAACGCAC 8340
CCTCATGAGT GGCCCTTGCT GGTACCACAT GACACTAAAG ACATCTAGCT GAGTCCTGTT 8400
AACCCAGAGA ATTATGAGAA TACTGTTTTA ACCACACGTT TTGGAATGGT TTGATACATG 8460
GCAATGGAGA AGTGAAACAA GGGGACTTCG GAAACTAAAG GGCTGGAATT CAGTTTGCCT 8520
TGTAGGTTGA TTGGAAGCCA GATGTGCCTA GAGGAAGGCT ACCACCTTGT GCAATTCCAG 8580
GGGACACTGT TTATGTTCCG TGTAAATGGC AGCCTCAGTT CACCTCATTT GGTTATTTAT 8640
CGTGTCTTCG CTGTCAGTCA AATTGCTTCT GAGATAACTG GCTGGCCTTG GAATTCTTAG 8700
CCACCTCCTT AAGCGGATCA GGAAAACTGA AGAATATCCT TCTGTATGTA TGTATGTATT 8760
TATTGATTGA TCGATTTATG AGACAGGGTC TCCTTCTGTC ACCCAGGCTG GAGTGCAGTG 8820
GTACGATCAC GGCTCACTGC TGCGTCGCCT TCCCAGGCTC CAGCTATCCT CCCACCTCAA 8880
CCTCCAGAGT AGTTGAGACC ACAGGCGTGC ACTACCACGC CCGGCTACCT TTTTGTATTT 8940
TCAGTAGAGA CGAGGTTTCG CCGTGTTGCC CAGGCTGGTT CAAGCGGAGC TCAAGCAATC 9000
AGCCTGCCTC GGCCTCCCAA AGTGTTGGGA TTACAGGCAT GAGCCGCTGC GCCCAACCTT 9060
CTTCTGCTGT CGAGATACTG CTCATCACCT GCCTGCTCCA GAATTCATGT GGCTTCTCAT 9120
TGCTCAATGG ATTAAGTTCA TGTTTATCCT GGCTTTCAAG TCTTTCCGTA AGCTGACTCA 9180
ACCTACATAG CTTTCATCAT TCCCTTACAC ATAACCTCAA CGTGCAACAG GATTAGTCTA 9240
TTATTCCCTT TCTTGTGTTT ACTGAGAAAG CCTCCACTTC AACGTTCCAT GAAGTGTGTT 9300
CCATTAAATA CCAAAGTATA GGCAAAAAGT TCTGTGGTCA AATAAATTTG GAAAACACAG 9360
AGTGTTTCCA AAGTTAGTAT CAGGCCAGGC ATGGTGGGAG GATCACTTGA GCCCAGGAGT 9420
TCGAGACCAG CCTGGGCAAC ATAGGGAGAC CCAATCCCTA CAAAAAAATT AGTTGGGCAT 9480
GGTGGTGTGC ACCCGTAGTG CCAGCTACTC AGGAGGCTGA GGTAGGAGGA TCACCTGAGC 9540
CCAGGAAGTC AAGGCTGTGG TCAGCTGAGA TCCCACCAGT GTGCTCCAGC CTGGGTGACA 9600
GAGCAAGACC CTGTCTCAAA AAATAAAAAA ATAAAGATAA TACC 9645
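To relate the transcript above to the protein sequence, the sketch below translates the first 30 nucleotides of the coding region using the standard genetic code. It assumes the coding sequence begins at the ATG at position 281 of this record, inferred by matching the start of the protein sequence; the 30 bases are copied from positions 281-310. The function `translate` is a hypothetical helper, not a WERAM or Ensembl API.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; spaces are ignored and a trailing partial codon is dropped."""
    dna = dna.upper().replace(" ", "")
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Assumed start of the coding region (positions 281-310 of the nucleotide sequence above).
CDS_START = "ATGCCGCGAA GGAAGAGGAA TGCAGGCAGT"

peptide = translate(CDS_START)
print(peptide)
```

The resulting peptide can be checked against the first ten residues of the Protein Sequence section to confirm the reading frame.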
Sequence Source Ensembl
Keyword

KW-0007--Acetylation
KW-0012--Acyltransferase
KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0181--Complete proteome
KW-0235--DNA replication
KW-0479--Metal-binding
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-1185--Reference proteome
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR016181--Acyl_CoA_acyltransferase
IPR002717--HAT_MYST-type
IPR002515--Znf_C2HC

PROSITE

PS51726--MYST_HAT

Pfam

PF01853--MOZ_SAS
PF01530--zf-C2HC

Gene Ontology

GO:0005737--C:cytoplasm
GO:0000123--C:histone acetyltransferase complex
GO:0005654--C:nucleoplasm
GO:0005634--C:nucleus
GO:0004402--F:histone acetyltransferase activity
GO:0003700--F:transcription factor activity, sequence-specific DNA binding
GO:0008270--F:zinc ion binding
GO:0006260--P:DNA replication
GO:0043966--P:histone H3 acetylation
GO:0043983--P:histone H4-K12 acetylation
GO:0043981--P:histone H4-K5 acetylation
GO:0043982--P:histone H4-K8 acetylation
GO:1900182--P:positive regulation of protein localization to nucleus
GO:0006355--P:regulation of transcription, DNA-templated
GO:0006351--P:transcription, DNA-templated

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Poa-0075 ENSPPYP00000010014.1 Pongo abelii 100 0.0 1200
WERAM-Pat-0076 ENSPTRP00000015934.4 Pan troglodytes 100 0.0 1200
WERAM-Gog-0085 ENSGGOP00000007314.2 Gorilla gorilla 100 0.0 1200
WERAM-Eqc-0063 ENSECAP00000008441.1 Equus caballus 100 0.0 1200
WERAM-Aim-0136 ENSAMEP00000012925.1 Ailuropoda melanoleuca 100 0.0 1200
WERAM-Tut-0008 ENSTTRP00000000444.1 Tursiops truncatus 100 0.0 1199
WERAM-Sus-0134 ENSSSCP00000018601.2 Sus scrofa 100 0.0 1199
WERAM-Chs-0057 ENSCSAP00000001508.1 Chlorocebus sabaeus 100 0.0 1199
WERAM-Fec-0034 ENSFCAP00000002433.3 Felis catus 100 0.0 1198
WERAM-Bot-0165 ENSBTAP00000024887.5 Bos taurus 100 0.0 1198
WERAM-Orc-0005 ENSOCUP00000000581.3 Oryctolagus cuniculus 100 0.0 1197
WERAM-Ict-0154 ENSSTOP00000015280.1 Ictidomys tridecemlineatus 100 0.0 1197
WERAM-Ova-0056 ENSOARP00000006067.1 Ovis aries 100 0.0 1197
WERAM-Caf-0180 ENSCAFP00000024952.2 Canis familiaris 100 0.0 1195
WERAM-Cap-0016 ENSCPOP00000001650.2 Cavia porcellus 99 0.0 1194
WERAM-Myl-0126 ENSMLUP00000010098.2 Myotis lucifugus 99 0.0 1193
WERAM-Sah-0111 ENSSHAP00000011807.1 Sarcophilus harrisii 99 0.0 1193
WERAM-Mod-0099 ENSMODP00000015208.2 Monodelphis domestica 99 0.0 1193
WERAM-Paa-0175 ENSPANP00000020135.1 Papio anubis 99 0.0 1191
WERAM-Loa-0037 ENSLAFP00000002342.3 Loxodonta africana 99 0.0 1191
WERAM-Mum-0158 ENSMUSP00000090441.5 Mus musculus 100 0.0 1189
WERAM-Dan-0041 ENSDNOP00000004227.2 Dasypus novemcinctus 98 0.0 1187
WERAM-Mim-0145 ENSMICP00000015020.1 Microcebus murinus 100 0.0 1182
WERAM-Pes-0018 ENSPSIP00000004151.1 Pelodiscus sinensis 98 0.0 1180
WERAM-Gaga-0100 ENSGALP00000016147.4 Gallus gallus 98 0.0 1176
WERAM-Anp-0058 ENSAPLP00000006419.1 Anas platyrhynchos 98 0.0 1167
WERAM-Meg-0019 ENSMGAP00000001561.1 Meleagris gallopavo 98 0.0 1165
WERAM-Lac-0093 ENSLACP00000011978.1 Latimeria chalumnae 95 0.0 1157
WERAM-Tag-0027 ENSTGUP00000001933.1 Taeniopygia guttata 96 0.0 1149
WERAM-Fia-0073 ENSFALP00000005432.1 Ficedula albicollis 96 0.0 1148
WERAM-Nol-0032 ENSNLEP00000003949.1 Nomascus leucogenys 100 0.0 1134
WERAM-Otg-0020 ENSOGAP00000000937.2 Otolemur garnettii 95 0.0 1129
WERAM-Anc-0047 ENSACAP00000004991.2 Anolis carolinensis 95 0.0 1127
WERAM-Mup-0176 ENSMPUP00000015299.1 Mustela putorius furo 100 0.0 1122
WERAM-Ocp-0075 ENSOPRP00000006602.1 Ochotona princeps 98 0.0 1114
WERAM-Leo-0131 ENSLOCP00000016384.1 Lepisosteus oculatus 91 0.0 1112
WERAM-Mae-0040 ENSMEUP00000003756.1 Macropus eugenii 90 0.0 1060
WERAM-Xet-0138 ENSXETP00000047416.3 Xenopus tropicalis 84 0.0 1018
WERAM-Dar-0160 ENSDARP00000093977.3 Danio rerio 84 0.0 1016
WERAM-Orn-0076 ENSONIP00000008231.1 Oreochromis niloticus 78 0.0 976
WERAM-Ect-0053 ENSETEP00000005742.1 Echinops telfairi 88 0.0 959
WERAM-Gaa-0064 ENSGACP00000007863.1 Gasterosteus aculeatus 78 0.0 956
WERAM-Tar-0174 ENSTRUP00000036778.1 Takifugu rubripes 76 0.0 941
WERAM-Pof-0033 ENSPFOP00000002754.2 Poecilia formosa 75 0.0 941
WERAM-Ere-0057 ENSEEUP00000004856.1 Erinaceus europaeus 82 0.0 929
WERAM-Orla-0180 ENSORLP00000020926.1 Oryzias latipes 77 0.0 922
WERAM-Gam-0139 ENSGMOP00000014177.1 Gadus morhua 81 0.0 904
WERAM-Xim-0096 ENSXMAP00000008646.1 Xiphophorus maculatus 81 0.0 897
WERAM-Tub-0022 ENSTBEP00000002206.1 Tupaia belangeri 83 0.0 879
WERAM-Asm-0232 ENSAMXP00000025043.1 Astyanax mexicanus 75 0.0 858
WERAM-Ten-0132 ENSTNIP00000014081.1 Tetraodon nigroviridis 70 0.0 856
WERAM-Pem-0064 ENSPMAP00000007030.1 Petromyzon marinus 72 0.0 813
WERAM-Caj-0080 ENSCJAP00000013077.2 Callithrix jacchus 100 0.0 694
WERAM-Ran-0147 ENSRNOP00000029608.6 Rattus norvegicus 100 2e-176 616
WERAM-Cis-0023 ENSCSAVP00000005035.1 Ciona savignyi 59 6e-164 575
WERAM-Cii-0017 ENSCINP00000006517.3 Ciona intestinalis 59 2e-162 570
WERAM-Chh-0033 ENSCHOP00000003799.1 Choloepus hoffmanni 100 1e-138 491
WERAM-Drm-0031 FBpp0304793 Drosophila melanogaster 54 8e-136 481
WERAM-Dio-0138 ENSDORP00000012841.1 Dipodomys ordii 57 7e-105 379
WERAM-Ptv-0164 ENSPVAP00000013957.1 Pteropus vampyrus 57 8e-105 378
WERAM-Ora-0008 ENSOANP00000001123.3 Ornithorhynchus anatinus 57 1e-104 377
WERAM-Mam-0215 ENSMMUP00000030925.2 Macaca mulatta 56 7e-104 375
WERAM-Tas-0059 ENSTSYP00000005370.1 Tarsius syrichta 56 7e-104 375
WERAM-Prc-0040 ENSPCAP00000003620.1 Procavia capensis 56 3e-103 373
WERAM-Vip-0046 ENSVPAP00000004212.1 Vicugna pacos 53 2e-95 347
WERAM-Zem-0080 GRMZM2G140288_P01 Zea mays 57 3e-93 340
WERAM-Art-0119 AT5G09740.1 Arabidopsis thaliana 58 1e-92 338
WERAM-Sob-0038 Sb02g039960.1 Sorghum bicolor 57 2e-92 337
WERAM-Mua-0086 GSMUA_Achr6P10570_001 Musa acuminata 59 2e-92 337
WERAM-Thc-0095 EOY16150 Theobroma cacao 58 3e-92 337
WERAM-Glm-0072 GLYMA06G47380.1 Glycine max 58 3e-92 337
WERAM-Arl-0101 fgenesh2_kg.6__927__AT5G09740.1 Arabidopsis lyrata 58 3e-92 337
WERAM-Sol-0133 Solyc11g013520.1.1 Solanum lycopersicum 58 3e-92 337
WERAM-Sem-0015 EFJ14590 Selaginella moellendorffii 57 4e-92 336
WERAM-Bro-0183 Bo9g173870.1 Brassica oleracea 58 4e-92 336
WERAM-Prp-0062 EMJ23350 Prunus persica 58 4e-92 336
WERAM-Brr-0045 Bra009399.1-P Brassica rapa 58 4e-92 336
WERAM-Pot-0027 POPTR_0002s09010.1 Populus trichocarpa 57 7e-92 335
WERAM-Sei-0114 Si029855m Setaria italica 57 8e-92 335
WERAM-Met-0059 AES68391 Medicago truncatula 58 8e-92 335
WERAM-Php-0014 PP1S13_111V6.1 Physcomitrella patens 57 9e-92 335
WERAM-Viv-0113 VIT_18s0001g04100.t01 Vitis vinifera 58 9e-92 335
WERAM-Orbr-0089 OB07G28490.1 Oryza brachyantha 57 1e-91 334
WERAM-Orl-0045 KN538920.1_FGP012 Oryza longistaminata 57 1e-91 334
WERAM-Crn-0008 AAW41586 Cryptococcus neoformans 54 2e-91 334
WERAM-Orp-0078 OPUNC07G21500.1 Oryza punctata 57 2e-91 334
WERAM-Ors-0074 OS07T0626600-01 Oryza sativa 57 2e-91 334
WERAM-Orr-0084 ORUFI07G24020.1 Oryza rufipogon 57 2e-91 334
WERAM-Orni-0080 ONIVA07G22490.1 Oryza nivara 57 2e-91 334
WERAM-Orm-0076 OMERI07G19290.1 Oryza meridionalis 57 2e-91 334
WERAM-Ori-0077 BGIOSGA023877-PA Oryza indica 57 2e-91 333
WERAM-Brd-0008 BRADI1G20770.1 Brachypodium distachyon 56 2e-91 333
WERAM-Orgl-0078 OGLUM07G22920.1 Oryza glumaepatula 57 3e-91 333
WERAM-Org-0080 ORGLA07G0178500.1 Oryza glaberrima 57 4e-91 333
WERAM-Orb-0081 OBART07G22890.1 Oryza barthii 57 4e-91 333
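Because species names in the orthology table contain one to three words, splitting each row on whitespace from the left is unreliable. A minimal sketch of one way to parse a row, assuming the protein ID contains no whitespace and the last three fields are always Identity, E-value and Score (`OrthologRow` and `parse_ortholog_row` are hypothetical names):

```python
from typing import NamedTuple


class OrthologRow(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int    # percent identity as listed
    e_value: float
    score: float


def parse_ortholog_row(line: str) -> OrthologRow:
    """Split the three numeric fields off the right, then the two IDs off the left."""
    head, identity, e_value, score = line.rsplit(None, 3)
    weram_id, protein_id, species = head.split(None, 2)
    return OrthologRow(weram_id, protein_id, species,
                       int(identity), float(e_value), float(score))


# Three rows copied from the table above.
rows = [
    "WERAM-Mup-0176 ENSMPUP00000015299.1 Mustela putorius furo 100 0.0 1122",
    "WERAM-Ran-0147 ENSRNOP00000029608.6 Rattus norvegicus 100 2e-176 616",
    "WERAM-Drm-0031 FBpp0304793 Drosophila melanogaster 54 8e-136 481",
]
parsed = [parse_ortholog_row(row) for row in rows]

for row in parsed:
    print(row.species, row.identity, row.score)
```

Splitting from the right first keeps multi-word species names such as "Mustela putorius furo" intact without needing a species lookup.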
Created Date 25-Jun-2016