WERAM Information


Tag Content
WERAM ID WERAM-Drm-0013
Ensembl Protein ID FBpp0297152
Uniprot Accession Q9VW15; ASH1_DROME; A8WHI9; M9NFL0; Q24189; Q8MQX5
Genbank Protein ID NP_001246834.1; NP_524160.2
Protein Name Histone-lysine N-methyltransferase ash1
Genbank Nucleotide ID NM_001259905.1; NM_079436.3
Gene Name ASH1
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
FBgn0005386 FBtr0306010 FBpp0297152
FBgn0005386 FBtr0306009 FBpp0297151
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SET2 SET H3K4; H3K9; H3K36; H4K20 K 20236312
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET2 3.30e-47 160 1392 1506
HMT SET1 1.70e-28 99.7 1398 1506
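The two classification hits overlap, so it can help to compare them programmatically. The following is a minimal sketch, assuming the rows are whitespace-separated exactly as listed above and that Start/End are 1-based inclusive coordinates; `DomainHit` and `parse_hits` are hypothetical helper names, not part of WERAM.

```python
from dataclasses import dataclass


@dataclass
class DomainHit:
    kind: str      # "Type" column, e.g. HMT
    family: str    # "Family" column, e.g. SET2
    evalue: float
    score: float
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumes 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_hits(text: str) -> list[DomainHit]:
    """Parse whitespace-separated classification rows into DomainHit records."""
    hits = []
    for line in text.strip().splitlines():
        kind, family, evalue, score, start, end = line.split()
        hits.append(
            DomainHit(kind, family, float(evalue), float(score), int(start), int(end))
        )
    return hits


ROWS = """
HMT SET2 3.30e-47 160 1392 1506
HMT SET1 1.70e-28 99.7 1398 1506
"""

hits = parse_hits(ROWS)
# The hit with the smallest E-value is the most significant one.
best = min(hits, key=lambda h: h.evalue)
print(best.family, best.length)
```

Under these assumptions the SET2 hit is the more significant of the two and spans 115 residues, which is consistent with the Family value given under Details.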
Organism Drosophila melanogaster
NCBI Taxa ID 7227
Functional Description
Trithorax group (TrxG) protein with histone methyltransferase activity. Specifically trimethylates 'Lys-4' of histone H3, a specific tag for epigenetic transcriptional activation. May also trimethylate H3 'Lys-9' and H4 'Lys-20'; however, the in vivo relevance of this activity is unclear. TrxG proteins are generally required to maintain the transcriptionally active state of homeotic genes throughout development. Does not act as a coactivator required for transcriptional activation, but specifically prevents inappropriate Polycomb group (PcG) silencing of homeotic genes in cells in which they must stay transcriptionally active.
Domain Profile
  HMT SET2

     SET2.txt    3 veliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncetqkwtvegelr 98  
ve+++t++kG+G+r+k i+k+++ileYvGev++eke+k+R+++++ ++ +++Y+l+ld + viD++++G+ +Rf+nhsCePnce+qkw+v+g r
FBpp0297152 1392 VERFMTADKGWGVRTKLPIAKGTYILEYVGEVVTEKEFKQRMASIYLND-THHYCLHLDGGLVIDGQRMGSDCRFVNHSCEPNCEMQKWSVNGLSR 1486
799********************************************99.9********************************************* PP
SET2.txt 99 vglfakkkikkgeeltfdYn 118
++lfak++i++geelt+dYn
FBpp0297152 1487 MVLFAKRAIEEGEELTYDYN 1506
*******************8 PP

  HMT SET1

     SET1.txt    9 kikglglvakkeiekeelviEYvGevirsevadkrek.eyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakvvavdgekkiviy 103 
+ kg+g+++k +i+k++ + EYvGev+ +++ ++r + y ++ + y +ld +v+d +++g+ rf+nhscepNce + +v+g ++v++
FBpp0297152 1398 ADKGWGVRTKLPIAKGTYILEYVGEVVTEKEFKQRMAsIYLND-THHYCLHLDGG--LVIDGQRMGSDCRFVNHSCEPNCEMQKWSVNGLSRMVLF 1490
568***********************99999888888456666.66799999999..*************************************** PP
SET1.txt 104 akraIekgeeltydYk 119
akraIe+geeltydY+
FBpp0297152 1491 AKRAIEEGEELTYDYN 1506
***************7 PP

Protein Sequence
MSCSQNETAA AKVLETQRAQ ESGSENEETD SITDQSSQSK SIKSATQFSV QRSDTDGLRM 60
RISAIRPTLG VVATKKPPKS RKMSTQDTES GCSEAKNRAV SKKVKVKRKK LASSSGISKS 120
DKVSKSKKSQ ISAFSSDSED DLPLKVHQQR APRVLLSAII QAAQSASKPT LDIGISSSDN 180
ELPNLVQAAI KRVESDTEDT TVEGSFRKAA KDKNLPQYQS TLLQDFMEKT QMLGQTVNAK 240
LAEEKVAKAK EETLVQTAVP RKRRGRPKKV VPTVPAPGNS GPAINESADS GVISTTSTTQ 300
STTPSPKMQN ENAVPTGSLP IASSSKPKID MAYLDKRMYA TERVLYPPPR SKRRQNNKKT 360
ACSSSNKEEL QLDPLWREID VNKKFRLRSM SVGAASGTGA STTICSKVLA AKSGYVSDYG 420
SVRHQRSSHN HNSGYKSDAS CKSRYSTKSC MSRRSRAKSC GYRSDCKESG KSGLRMRRKR 480
RASMLLKSSA DDTVEDQDIL QLAGLSLGQS SEESNEYISK PSLKSLPTTS ASKKYGEINR 540
YVTTGQYFGR GGSLSATNPD NFISKMMNQR KETPAPSKSS CKIKSRRSSA ASMCSSYVSG 600
VSRMRRRHRR KSFSHNKSLN IDSKLLTEIE IITSTFNSRC RIQDDRLTGS SGKEKLLADA 660
NKLQATLAAP SPAQQLTLNG GGPASTLSKP LKRGLKKRKL SEPLVDFAML SASASGTPNG 720
SGSSNGNTKR RHKKSQSNDS SSPDDHKLPL KKRHYLLTPG ERPPAEVAFA NGKLNAEAWA 780
AAAAAAKSTA STKSQAQFNA RSVKSALTPK KRHLLEQPTS VSGAGSSASN SPLRIVVDNN 840
SISGGKLLDI SPSSLCSLKQ QRRGGAAKQK VSAAKDLVQL QSPAGSYPPP GVFEPSVELE 900
IQIPLSKLNE SVITKAEVES PLLSALDIKE DTKKEVGQRV VETLLHKTGG NLLLKRKRKK 960
INRTGFPTVR RKKRKVSVEQ QTTAVIDEHE PEFDPDDEPL QSLRETRSSN NVNVQAAPNP 1020
PLDCERVPQA GEARETFVAR TNQKAPRLSV VALERLQRPQ TPARGRPRGR KPKNREQAEA 1080
APQPPPKSEP EIRPAKKRGR QPKQPVLEEP PPTPPPQQKK NKMEPNIRLP DGIDPNTNFS 1140
CKIRLKRRKN LEAGTQPKKE KPVQPVTVEE IPPEIPVSQE EIDAEAEAKR LDSIPTEHDP 1200
LPASESHNPG PQDYASCSES SEDKASTTSL RKLSKVKKTY LVAGLFSNHY KQSLMPPPAK 1260
VNKKPGLEEQ VGPASLLPPP PYCEKYLRRT EMDFELPYDI WWAYTNSKLP TRNVVPSWNY 1320
RKIRTNVYAE SVRPNLAGFD HPTCNCKNQG EKSCLDNCLN RMVYTECSPS NCPAGEKCRN 1380
QKIQRHAVAP GVERFMTADK GWGVRTKLPI AKGTYILEYV GEVVTEKEFK QRMASIYLND 1440
THHYCLHLDG GLVIDGQRMG SDCRFVNHSC EPNCEMQKWS VNGLSRMVLF AKRAIEEGEE 1500
LTYDYNFSLF NPSEGQPCRC NTPQCRGVIG GKSQRVKPLP AVEAKPSGEG LSGRNGRQRK 1560
QKAKKHAQRQ AGKDISSAVA VAKLQPLSEK EKKLVRQFNT FLVRNFEKIR RCKAKRASDA 1620
AATASSPALG TTNGDIPGRR PSTPSSPSLA AQISALCSPR NIKTRGLTQA VHDPELEKMA 1680
KMAVVLRDIC SAMETLKMSD LLTTVSSKKK KPIKTTLSGK LGSTAATSKV EFRSIQAQVE 1740
QGHYKTPQEF DDHMQQLFVE AKQQHGDDEG KEKALQSLKD SYEQQKIASY VQLVEILGDS 1800
ESLQSFKPKE VLSSEEEPGK IAVKKSPGAK ERDSPIVPLK VTPPPLLPIE ASPDEDVIRC 1860
ICGLYKDEGL MIQCSKCMVW QHTECTKADI DADNYQCERC EPREVDREIP LEEFTEEGHR 1920
YYLSLMRGDL QVRQGDAVYV LRDIPIKDES GKVLPTKKHT YETIGAIDYQ ECDIFRVEHL 1980
WKNELGKRFI FGHHFLRPHE TFHEPSRRFY PNEVVRVSLY EVVPIELVIG RCWVLDRTTF 2040
CKGRPMECND EDHCYICELR VDKTARFFSK AKANHPACTK SYAFRKFPEK IKISKSYAPH 2100
DVDPSLLKTR KQKTELDVGA GPTTMHKVSG RQEQHQAKMV GRKPRGISAP ADATAVHVVT 2160
PVAPNKQMLK KRKSRLENVL ITMKLKCLDA QTAQEQPIDL SYLLSGRGAR QRKTQQSSSS 2220
STANST 2226
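The sequence above is printed in blocks of ten residues with a running residue count at the end of each line. A minimal sketch of how such a listing could be turned back into a plain string and sliced at the SET2 coordinates from the Classification section (1392 to 1506) is shown here. It assumes 1-based inclusive coordinates and that the last line of the excerpt carries a counter; `parse_numbered_block` and `slice_region` are hypothetical helper names. Only three lines of the record are used so the example stays short.

```python
def parse_numbered_block(text: str) -> tuple[int, str]:
    """Return (1-based position of the first residue, ungrouped sequence).

    Each line holds space-separated residue groups followed by the
    cumulative residue count at the end of that line.
    """
    groups = []
    last_count = 0
    for line in text.strip().splitlines():
        tokens = line.split()
        if tokens and tokens[-1].isdigit():
            # Trailing number is the running count, not sequence data.
            last_count = int(tokens.pop())
        groups.extend(tokens)
    seq = "".join(groups)
    return last_count - len(seq) + 1, seq


def slice_region(first_pos: int, seq: str, start: int, end: int) -> str:
    """Extract residues start..end (1-based, inclusive) from seq."""
    return seq[start - first_pos : end - first_pos + 1]


# Three lines copied from the protein sequence listing above.
EXCERPT = """
QKIQRHAVAP GVERFMTADK GWGVRTKLPI AKGTYILEYV GEVVTEKEFK QRMASIYLND 1440
THHYCLHLDG GLVIDGQRMG SDCRFVNHSC EPNCEMQKWS VNGLSRMVLF AKRAIEEGEE 1500
LTYDYNFSLF NPSEGQPCRC NTPQCRGVIG GKSQRVKPLP AVEAKPSGEG LSGRNGRQRK 1560
"""

first_pos, seq = parse_numbered_block(EXCERPT)
domain = slice_region(first_pos, seq, 1392, 1506)
print(first_pos, len(domain))
print(domain)
```

The extracted region begins with VERFMTADKG and ends with EELTYDYN, matching the FBpp0297152 rows of the SET2 alignment in the Domain Profile section.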
Nucleotide Sequence
CACACTGAAA CGCAGTCTTT TTCTTTATTT ATGTTCGTAA ATTTTTGGTT GCCTGGCTTT 60
TTGTTGTTGC GCCACATTAA CAATAACAAT TGATGCGCCC CGCCACCGCC CACTAGTCGC 120
GAAAAAAGTG AATTATCGGT ACCGCGGATT AATGTGCCAC ATGCGAGAAG CGTTGCAACG 180
TTGCGCCGTC GTCGGGCGTC GAACAAAGCA GTTTTTGGGA AAGTGAAAAC GGCGCCATTT 240
TAGAAGCAGG CAAGCCACGG CCTAAGGCTC AAGAAAAGTG CAAATGCAAT GTGTGCAGCA 300
ATAGATTGTC ATTTCGGCAC AAACGAAACC GACTACTGAG GTCTACTACT AGCCCAGAAA 360
ACAATCAAGC GTAAGTGTGC TGTAGCTCCC CAGCGACTCC AAAATATGAG CTGTAGCCAA 420
AATGAGACGG CAGCAGCAAA GGTTCTGGAA ACGCAACGCG CACAAGAATC CGGCAGTGAA 480
AATGAGGAAA CCGATTCCAT TACGGATCAA TCGAGCCAGT CGAAGTCGAT CAAGTCAGCC 540
ACCCAGTTTA GCGTGCAGCG TTCGGACACC GATGGACTGC GAATGAGAAT CTCGGCCATC 600
CGCCCCACGT TAGGAGTCGT AGCAACCAAG AAACCCCCAA AGTCCAGAAA AATGTCTACC 660
CAGGACACCG AGTCCGGCTG CTCGGAGGCC AAAAATAGAG CGGTCAGCAA GAAAGTGAAG 720
GTCAAGCGCA AGAAGCTGGC AAGTTCCAGT GGGATTAGTA AATCGGACAA AGTGTCTAAG 780
TCTAAGAAGT CACAGATCTC GGCATTTTCC TCGGACTCAG AGGACGATCT GCCCTTGAAG 840
GTGCATCAGC AGAGAGCTCC GCGAGTGCTG CTAAGTGCTA TCATCCAGGC GGCACAGTCG 900
GCCAGCAAAC CCACCCTCGA TATCGGAATC TCGTCCAGCG ACAACGAGTT ACCTAACCTG 960
GTGCAGGCGG CTATCAAGCG AGTGGAGAGC GATACGGAGG ACACAACTGT GGAAGGAAGT 1020
TTCCGCAAAG CGGCCAAGGA CAAGAACCTA CCCCAGTACC AGTCAACTTT GCTGCAAGAC 1080
TTCATGGAGA AGACTCAGAT GCTGGGGCAG ACCGTCAATG CGAAGCTTGC GGAGGAGAAA 1140
GTGGCTAAAG CTAAGGAGGA GACCCTAGTC CAAACAGCTG TTCCTCGAAA GCGCAGAGGT 1200
AGACCCAAAA AAGTGGTTCC CACAGTTCCT GCCCCAGGAA ACTCTGGCCC TGCTATAAAC 1260
GAATCTGCCG ATTCGGGTGT GATAAGCACC ACTAGCACAA CGCAGAGCAC TACTCCCTCC 1320
CCGAAAATGC AAAATGAGAA TGCTGTGCCG ACGGGATCGC TGCCAATCGC CTCCAGCAGC 1380
AAGCCTAAGA TCGACATGGC GTATCTGGAC AAGCGAATGT ATGCCACTGA GCGGGTGCTC 1440
TATCCACCGC CCAGGAGTAA GCGACGGCAG AACAATAAAA AGACAGCCTG CAGTTCATCC 1500
AACAAAGAGG AACTTCAGCT TGATCCGCTG TGGCGAGAGA TCGACGTGAA CAAGAAGTTT 1560
AGGCTGAGGA GTATGAGTGT GGGTGCGGCT AGTGGAACAG GAGCAAGCAC CACTATTTGC 1620
AGTAAGGTCT TGGCCGCTAA GAGCGGTTAC GTCTCGGATT ACGGAAGCGT ACGACATCAG 1680
CGGAGCAGCC ACAACCACAA CTCCGGTTAC AAGTCCGATG CCAGCTGCAA GAGTCGATAC 1740
AGCACCAAGA GTTGTATGAG TCGCAGGAGC AGGGCAAAGA GCTGCGGCTA CCGGAGTGAT 1800
TGCAAGGAAT CTGGAAAGTC AGGCCTAAGG ATGAGGCGGA AGCGCCGGGC TTCCATGCTC 1860
TTGAAGAGCT CAGCAGACGA TACTGTCGAG GACCAGGACA TCCTTCAGCT AGCTGGATTA 1920
TCTCTGGGCC AGAGCAGTGA GGAGAGCAAC GAATACATCA GTAAGCCGAG CCTTAAAAGC 1980
CTTCCCACGA CAAGTGCCAG CAAGAAGTAC GGCGAAATCA ATCGCTATGT GACCACCGGG 2040
CAGTATTTCG GTCGAGGCGG TAGTTTGTCT GCCACCAACC CGGATAACTT CATTAGTAAA 2100
ATGATGAACC AGCGTAAAGA AACCCCGGCT CCCAGCAAAT CGTCCTGCAA GATTAAATCT 2160
CGCCGCTCAT CGGCAGCCAG TATGTGCAGC AGCTATGTGT CTGGGGTGTC TAGAATGCGT 2220
CGCAGACATC GCCGGAAGAG TTTCAGTCAC AACAAATCAT TGAACATTGA TTCCAAGCTG 2280
CTCACCGAAA TCGAGATAAT CACAAGCACC TTCAACTCAA GGTGTCGCAT CCAAGATGAT 2340
CGTTTGACCG GAAGCAGTGG CAAAGAGAAG CTATTGGCCG ATGCCAACAA GCTGCAGGCC 2400
ACCCTAGCAG CACCAAGCCC TGCCCAGCAG TTGACTCTGA ATGGAGGAGG ACCAGCTTCC 2460
ACCCTTTCCA AACCATTAAA ACGGGGCCTA AAGAAGAGAA AGCTGAGTGA GCCCCTAGTG 2520
GACTTCGCTA TGCTATCAGC TAGCGCTAGT GGAACTCCCA ATGGCAGCGG AAGCAGTAAT 2580
GGAAATACCA AACGACGACA CAAGAAATCT CAAAGCAATG ACAGTTCCAG TCCCGATGAT 2640
CACAAGTTGC CGTTGAAAAA GCGCCACTAT CTTTTGACTC CAGGAGAGCG TCCGCCGGCG 2700
GAAGTAGCAT TCGCCAATGG GAAATTGAAC GCAGAGGCCT GGGCTGCAGC CGCGGCGGCT 2760
GCTAAGAGCA CTGCGTCTAC CAAATCGCAA GCCCAGTTTA ATGCTAGGAG CGTAAAATCT 2820
GCGCTAACGC CCAAAAAAAG ACATCTCCTA GAGCAACCTA CGTCTGTGAG CGGAGCTGGC 2880
TCGTCGGCCA GTAATTCTCC CCTGAGAATT GTTGTCGATA ATAATTCCAT AAGTGGTGGA 2940
AAGTTGCTGG ATATAAGTCC TAGTTCCTTG TGTTCCCTCA AACAGCAGAG GAGAGGAGGA 3000
GCAGCTAAAC AGAAGGTGTC GGCAGCGAAG GACCTCGTTC AGCTTCAATC CCCAGCTGGT 3060
AGCTATCCTC CCCCCGGTGT GTTCGAGCCA TCTGTGGAGC TGGAAATCCA AATTCCTCTT 3120
AGTAAACTGA ACGAATCCGT CATAACCAAA GCAGAGGTCG AATCTCCACT GCTCTCAGCA 3180
TTGGACATTA AGGAGGATAC GAAAAAGGAG GTTGGCCAGC GCGTTGTCGA AACCTTGCTA 3240
CACAAAACGG GAGGCAATCT TCTTCTGAAA CGCAAGCGGA AAAAGATAAA TCGAACTGGA 3300
TTTCCCACTG TTCGCAGGAA GAAACGTAAA GTTAGCGTGG AGCAGCAAAC AACAGCCGTG 3360
ATTGATGAGC ACGAGCCGGA GTTTGATCCC GATGATGAGC CACTGCAATC CCTAAGAGAG 3420
ACCAGGAGTA GCAATAATGT CAATGTGCAG GCGGCACCTA ATCCTCCACT GGATTGCGAG 3480
CGAGTTCCCC AAGCAGGCGA GGCCAGAGAA ACCTTTGTGG CCAGGACCAA TCAAAAAGCC 3540
CCTCGATTAT CGGTGGTGGC CCTGGAGCGC CTACAGCGTC CTCAAACACC AGCTAGAGGA 3600
AGACCGCGAG GTAGAAAACC TAAGAACAGG GAACAAGCTG AAGCTGCACC TCAACCGCCG 3660
CCCAAATCGG AACCTGAGAT AAGGCCAGCC AAAAAACGTG GCCGGCAACC CAAGCAGCCG 3720
GTACTGGAAG AGCCACCACC CACACCACCT CCTCAACAGA AAAAAAACAA AATGGAACCA 3780
AATATTAGAC TACCAGATGG CATCGATCCC AATACGAATT TCAGCTGCAA GATTCGCTTG 3840
AAGCGGCGAA AGAACTTAGA GGCTGGAACC CAACCAAAAA AGGAGAAGCC AGTCCAGCCA 3900
GTGACGGTGG AAGAGATTCC ACCAGAAATT CCCGTCAGTC AAGAAGAAAT AGATGCAGAA 3960
GCAGAGGCTA AACGGCTAGA CAGTATTCCT ACCGAGCACG ATCCCTTGCC TGCCAGTGAG 4020
TCCCACAACC CCGGTCCGCA GGACTATGCC AGTTGCAGTG AATCCAGCGA GGACAAGGCA 4080
TCAACCACAT CCTTGCGGAA ACTATCTAAG GTCAAGAAGA CCTATCTTGT AGCAGGACTC 4140
TTCTCGAATC ATTACAAACA ATCCCTGATG CCGCCCCCAG CAAAGGTGAA CAAAAAACCA 4200
GGCCTGGAGG AGCAAGTGGG TCCTGCGAGC CTTCTACCAC CACCTCCCTA TTGCGAAAAG 4260
TACCTCCGAA GGACTGAAAT GGACTTTGAG CTACCTTACG ACATTTGGTG GGCTTACACT 4320
AATTCCAAAT TACCCACTCG AAACGTTGTG CCATCCTGGA ATTACCGAAA GATCCGCACC 4380
AATGTATACG CAGAGTCAGT GCGTCCAAAT TTGGCGGGAT TCGATCATCC CACCTGTAAC 4440
TGTAAGAACC AGGGCGAGAA GTCTTGTTTG GACAACTGTC TCAACCGCAT GGTTTATACG 4500
GAGTGCTCGC CCAGCAATTG CCCGGCTGGA GAGAAGTGCC GAAATCAGAA GATCCAGCGA 4560
CATGCCGTCG CACCCGGAGT GGAGCGCTTT ATGACGGCAG ATAAAGGATG GGGAGTGCGA 4620
ACTAAGCTAC CCATTGCGAA GGGAACCTAC ATTCTGGAGT ACGTGGGCGA GGTAGTCACG 4680
GAAAAGGAAT TCAAGCAGAG GATGGCCAGC ATCTACCTAA ATGACACCCA CCACTATTGC 4740
TTACATTTGG ATGGAGGACT GGTTATCGAC GGTCAGCGGA TGGGCAGCGA TTGTAGGTTT 4800
GTCAACCATT CTTGCGAACC CAATTGCGAG ATGCAAAAAT GGAGTGTTAA CGGCCTATCT 4860
AGAATGGTTT TATTCGCCAA AAGAGCCATA GAGGAGGGGG AGGAGCTGAC ATACGATTAC 4920
AACTTTTCGC TGTTCAATCC CTCAGAGGGT CAGCCCTGCA GGTGCAACAC GCCCCAGTGT 4980
CGTGGTGTCA TTGGTGGCAA GTCGCAGAGA GTTAAACCCC TTCCTGCTGT GGAGGCCAAG 5040
CCATCTGGAG AGGGACTTTC TGGCCGAAAC GGGCGCCAAC GGAAGCAGAA GGCCAAAAAG 5100
CATGCTCAAC GGCAGGCGGG AAAAGATATT TCATCAGCAG TGGCGGTGGC AAAGCTTCAA 5160
CCATTGTCCG AAAAAGAAAA GAAACTGGTC AGACAATTCA ACACATTTCT AGTCAGGAAC 5220
TTCGAAAAGA TACGCAGATG CAAGGCCAAG CGGGCGTCAG ATGCAGCGGC GACTGCATCC 5280
TCGCCAGCAC TTGGCACCAC TAATGGGGAC ATTCCTGGTA GACGTCCCTC CACACCATCT 5340
TCTCCTTCCT TGGCAGCGCA GATTTCCGCG CTCTGCTCGC CTCGCAACAT AAAAACCCGT 5400
GGACTCACAC AGGCAGTGCA TGATCCCGAA CTAGAGAAAA TGGCCAAAAT GGCTGTAGTT 5460
CTAAGGGATA TTTGCAGTGC CATGGAGACC CTCAAAATGT CCGATTTGTT GACGACAGTG 5520
TCCAGCAAGA AAAAGAAGCC TATAAAGACC ACTTTGAGTG GAAAATTGGG TTCTACAGCT 5580
GCAACTTCAA AAGTGGAATT CAGATCGATA CAAGCCCAGG TGGAGCAGGG ACATTACAAA 5640
ACGCCGCAGG AATTCGATGA CCACATGCAG CAGCTCTTTG TGGAGGCCAA GCAGCAACAC 5700
GGCGATGATG AGGGCAAGGA AAAAGCGCTG CAGTCCCTGA AGGATAGCTA TGAGCAACAG 5760
AAGATCGCCA GCTATGTTCA GCTGGTGGAG ATTCTTGGTG ATTCGGAATC GTTGCAGAGC 5820
TTTAAACCAA AGGAAGTGCT TTCGTCAGAG GAGGAACCTG GAAAGATAGC AGTGAAGAAA 5880
TCACCAGGAG CAAAGGAAAG AGATTCCCCA ATTGTGCCAT TGAAAGTGAC TCCGCCACCC 5940
CTGCTTCCTA TTGAGGCCTC TCCTGATGAG GATGTAATCC GATGTATTTG TGGATTGTAC 6000
AAAGATGAAG GCTTGATGAT TCAGTGCTCC AAATGCATGG TGTGGCAGCA TACTGAATGT 6060
ACCAAAGCTG ACATCGATGC GGATAATTAT CAGTGCGAGC GTTGCGAGCC AAGGGAAGTG 6120
GATCGAGAAA TTCCCCTGGA GGAATTTACC GAAGAGGGAC ACCGTTACTA TCTCTCCCTA 6180
ATGCGTGGTG ATCTGCAGGT ACGACAGGGC GATGCCGTCT ATGTCCTACG AGACATCCCC 6240
ATCAAGGATG AGTCCGGCAA GGTGTTACCG ACGAAGAAGC ACACCTATGA AACGATCGGA 6300
GCCATTGATT ACCAAGAGTG CGATATCTTT CGGGTGGAGC ACTTGTGGAA AAACGAACTG 6360
GGAAAGCGTT TTATATTCGG ACACCATTTC CTGCGTCCTC ATGAAACTTT CCACGAACCA 6420
TCGCGTCGTT TCTACCCCAA TGAGGTGGTA CGAGTTTCAC TTTATGAGGT GGTACCCATT 6480
GAGTTGGTCA TTGGACGCTG CTGGGTGTTG GATCGAACAA CTTTCTGTAA AGGACGCCCC 6540
ATGGAATGCA ACGATGAGGA TCATTGCTAC ATCTGCGAGC TGCGTGTGGA CAAGACGGCA 6600
AGGTTCTTCT CAAAGGCCAA GGCCAACCAT CCAGCCTGCA CCAAGAGCTA TGCCTTCAGA 6660
AAATTTCCCG AGAAGATTAA GATCTCCAAG AGCTATGCGC CCCATGATGT GGATCCGTCG 6720
TTGCTGAAGA CAAGGAAGCA AAAGACTGAA TTAGACGTAG GAGCCGGGCC AACAACGATG 6780
CACAAGGTGT CTGGCAGGCA GGAACAGCAT CAGGCCAAGA TGGTTGGGCG AAAGCCTCGT 6840
GGGATCAGTG CCCCAGCGGA TGCAACTGCT GTCCATGTCG TAACACCCGT GGCGCCCAAT 6900
AAACAGATGC TTAAGAAGAG GAAATCGCGT TTAGAGAACG TTTTGATAAC TATGAAGCTT 6960
AAGTGTCTGG ATGCACAAAC GGCACAGGAG CAACCCATTG ACTTGTCATA CCTGCTGTCC 7020
GGACGCGGCG CCCGGCAGCG GAAGACCCAG CAGTCCAGTA GCAGTTCTAC GGCCAACTCA 7080
ACATAACCGA TGTAACGCGA GTAGATTAAG CCGTGTATAT ACGCCTAAAG TTAGAGGCAT 7140
TGATTAATAT TTATGGAACT TCAATTAACA ATTAGCATTT AGCCATGTAA GATGTAACGG 7200
TGATAGTAGT AATCCTAAAG TGTACATTGT TATGAACAAA CTTCGTGAAA GGACTCCGGT 7260
CCCGGCTAGG ATGGAACAGA TTATGAATTA TGGTTAATAT AAGGATTTAT TTAAATACTA 7320
GTGCTTACGC CACCGCACCA AGCCCCCAAT TTAGATGAAT TTCTTTTGTT TTTAATTGTT 7380
TATATTGGCT CTCCATATTA ATTTTGTTTT ACAAAACTAT CCCCCTCTCG TCTTTGCAAT 7440
TGAAGGCGTT TGACTTTTGC ATTGTTATTT TAAATTATTG TATTATGTTA TGGCTGCTTA 7500
ATTATGTAAC ACCCGAGTAG GACAACTTAT TGCGCTATTA GTTTTTAGTT ATTTTCCAAT 7560
TGTTATTAAC TGCTAACAAA AATTTTAAAA GTGAAATAAA AAATATATTT TTATT 7616
Sequence Source Ensembl
Keyword

KW-0010--Activator
KW-0156--Chromatin regulator
KW-0158--Chromosome
KW-0181--Complete proteome
KW-0217--Developmental protein
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR006560--AWS_dom
IPR001025--BAH_dom
IPR003616--Post-SET_dom
IPR001214--SET_dom
IPR019786--Zinc_finger_PHD-type_CS
IPR011011--Znf_FYVE_PHD
IPR001965--Znf_PHD
IPR013083--Znf_RING/FYVE/PHD

PROSITE

PS51215--AWS
PS51038--BAH
PS50868--POST_SET
PS50280--SET
PS01359--ZF_PHD_1

Pfam

PF01426--BAH
PF00856--SET

Gene Ontology

GO:0000785--C:chromatin
GO:0005634--C:nucleus
GO:0005700--C:polytene chromosome
GO:0043234--C:protein complex
GO:0003682--F:chromatin binding
GO:0035035--F:histone acetyltransferase binding
GO:0042054--F:histone methyltransferase activity
GO:0042800--F:histone methyltransferase activity (H3-K4 specific)
GO:0046974--F:histone methyltransferase activity (H3-K9 specific)
GO:0042799--F:histone methyltransferase activity (H4-K20 specific)
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0042802--F:identical protein binding
GO:0008270--F:zinc ion binding
GO:0048096--P:chromatin-mediated maintenance of transcription
GO:0001700--P:embryonic development via the syncytial blastoderm
GO:0051568--P:histone H3-K4 methylation
GO:0051567--P:histone H3-K9 methylation
GO:0034770--P:histone H4-K20 methylation
GO:0016571--P:histone methylation
GO:0048477--P:oogenesis
GO:0018991--P:oviposition
GO:0010906--P:regulation of glucose metabolic process
GO:0006355--P:regulation of transcription, DNA-templated
GO:0006351--P:transcription, DNA-templated
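Each Gene Ontology line above follows the pattern `ID--A:term name`, where the single-letter prefix is C (cellular component), F (molecular function) or P (biological process). A minimal parsing sketch, assuming every line follows this pattern, is shown here; `GoTerm` and `parse_go_line` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple

# Single-letter aspect codes used in the listing.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


class GoTerm(NamedTuple):
    go_id: str
    aspect: str
    name: str


def parse_go_line(line: str) -> GoTerm:
    """Split 'GO:nnnnnnn--X:term name' into its three parts."""
    # Split only on the first "--" so hyphens inside term names survive.
    go_id, rest = line.strip().split("--", 1)
    # Split only on the first ":" so colons inside term names survive.
    code, name = rest.split(":", 1)
    return GoTerm(go_id, ASPECTS[code], name)


term = parse_go_line(
    "GO:0042800--F:histone methyltransferase activity (H3-K4 specific)"
)
print(term)
```

Splitting once on each delimiter keeps term names such as "regulation of transcription, DNA-templated" intact.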

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Orn-0231 ENSONIP00000024817.1 Oreochromis niloticus 41 0.0 670
WERAM-Asm-0099 ENSAMXP00000010273.1 Astyanax mexicanus 41 0.0 662
WERAM-Pof-0100 ENSPFOP00000008940.2 Poecilia formosa 41 0.0 655
WERAM-Dar-0162 ENSDARP00000052914.6 Danio rerio 40 0.0 654
WERAM-Meg-0006 ENSMGAP00000000305.2 Meleagris gallopavo 40 0.0 650
WERAM-Gaga-0134 ENSGALP00000023545.4 Gallus gallus 40 0.0 650
WERAM-Xim-0181 ENSXMAP00000014700.1 Xiphophorus maculatus 40 0.0 649
WERAM-Ova-0035 ENSOARP00000004767.1 Ovis aries 40 0.0 647
WERAM-Bot-0048 ENSBTAP00000005172.5 Bos taurus 40 0.0 646
WERAM-Pat-0013 ENSPTRP00000002411.3 Pan troglodytes 40 0.0 646
WERAM-Paa-0029 ENSPANP00000005654.1 Papio anubis 40 0.0 646
WERAM-Chs-0022 ENSCSAP00000016009.1 Chlorocebus sabaeus 40 0.0 644
WERAM-Tar-0134 ENSTRUP00000029370.1 Takifugu rubripes 40 0.0 643
WERAM-Mup-0063 ENSMPUP00000005424.1 Mustela putorius furo 40 0.0 642
WERAM-Orc-0146 ENSOCUP00000012882.3 Oryctolagus cuniculus 40 0.0 642
WERAM-Sah-0094 ENSSHAP00000010988.1 Sarcophilus harrisii 39 0.0 641
WERAM-Hos-0093 ENSP00000357330.3 Homo sapiens 40 0.0 641
WERAM-Caj-0092 ENSCJAP00000015181.2 Callithrix jacchus 39 0.0 640
WERAM-Ten-0204 ENSTNIP00000020694.1 Tetraodon nigroviridis 40 0.0 640
WERAM-Gaa-0026 ENSGACP00000004363.3 Gasterosteus aculeatus 40 0.0 639
WERAM-Orla-0009 ENSORLP00000000752.1 Oryzias latipes 41 0.0 639
WERAM-Aim-0001 ENSAMEP00000000048.1 Ailuropoda melanoleuca 39 0.0 639
WERAM-Otg-0100 ENSOGAP00000008607.2 Otolemur garnettii 39 0.0 639
WERAM-Myl-0149 ENSMLUP00000012265.2 Myotis lucifugus 40 0.0 638
WERAM-Cap-0050 ENSCPOP00000004321.2 Cavia porcellus 39 0.0 638
WERAM-Tut-0149 ENSTTRP00000012646.1 Tursiops truncatus 39 0.0 638
WERAM-Ran-0136 ENSRNOP00000027629.6 Rattus norvegicus 40 0.0 638
WERAM-Fia-0043 ENSFALP00000002960.1 Ficedula albicollis 40 0.0 636
WERAM-Mum-0089 ENSMUSP00000088451.4 Mus musculus 39 0.0 635
WERAM-Eqc-0151 ENSECAP00000016148.1 Equus caballus 39 0.0 635
WERAM-Dan-0138 ENSDNOP00000030955.1 Dasypus novemcinctus 40 0.0 635
WERAM-Anp-0113 ENSAPLP00000012903.1 Anas platyrhynchos 40 0.0 635
WERAM-Fec-0122 ENSFCAP00000011217.3 Felis catus 39 0.0 634
WERAM-Gam-0060 ENSGMOP00000006181.1 Gadus morhua 39 0.0 633
WERAM-Caf-0179 ENSCAFP00000024919.4 Canis familiaris 39 0.0 632
WERAM-Mod-0150 ENSMODP00000021257.2 Monodelphis domestica 39 1e-180 632
WERAM-Tag-0191 ENSTGUP00000017698.1 Taeniopygia guttata 40 5e-180 630
WERAM-Sus-0043 ENSSSCP00000006936.2 Sus scrofa 39 1e-179 629
WERAM-Anc-0028 ENSACAP00000003330.3 Anolis carolinensis 39 6e-179 626
WERAM-Mae-0076 ENSMEUP00000007694.1 Macropus eugenii 40 1e-177 622
WERAM-Vip-0083 ENSVPAP00000007858.1 Vicugna pacos 37 2e-165 582
WERAM-Ptv-0188 ENSPVAP00000016847.1 Pteropus vampyrus 39 2e-153 542
WERAM-Ocp-0053 ENSOPRP00000004587.1 Ochotona princeps 32 7e-107 387
WERAM-Nol-0132 ENSNLEP00000014766.2 Nomascus leucogenys 39 5e-101 368
WERAM-Pes-0038 ENSPSIP00000006003.1 Pelodiscus sinensis 40 8e-98 357
WERAM-Lac-0072 ENSLACP00000009517.1 Latimeria chalumnae 48 9e-92 337
WERAM-Prc-0146 ENSPCAP00000013700.1 Procavia capensis 39 4e-90 332
WERAM-Poa-0004 ENSPPYP00000000864.2 Pongo abelii 37 4e-90 332
WERAM-Tub-0092 ENSTBEP00000010737.1 Tupaia belangeri 36 3e-88 325
WERAM-Dio-0053 ENSDORP00000005023.1 Dipodomys ordii 35 7e-88 324
WERAM-Mam-0223 ENSMMUP00000038843.1 Macaca mulatta 42 6e-87 321
WERAM-Chh-0018 ENSCHOP00000001855.1 Choloepus hoffmanni 37 8e-86 317
WERAM-Soa-0011 ENSSARP00000001033.1 Sorex araneus 37 9e-85 313
WERAM-Gog-0128 ENSGGOP00000010946.2 Gorilla gorilla 41 7e-81 301
WERAM-Ect-0013 ENSETEP00000000939.1 Echinops telfairi 51 2e-75 282
WERAM-Mim-0070 ENSMICP00000006826.1 Microcebus murinus 53 2e-74 280
WERAM-Pem-0046 ENSPMAP00000005441.1 Petromyzon marinus 45 2e-69 263
WERAM-Cii-0041 ENSCINP00000016215.3 Ciona intestinalis 44 7e-58 224
WERAM-Ast-0016 CADATEAP00005891 Aspergillus terreus 38 2e-51 203
WERAM-Asf-0034 CADAFLAP00012044 Aspergillus flavus 36 1e-49 197
WERAM-Coi-0001 EAS35087 Coccidioides immitis 35 1e-49 197
WERAM-Asc-0014 CADACLAP00003400 Aspergillus clavatus 38 1e-49 197
WERAM-Asfu-0023 CADAFUAP00005795 Aspergillus fumigatus 38 1e-49 197
WERAM-Cis-0036 ENSCSAVP00000007852.1 Ciona savignyi 41 3e-49 196
WERAM-Pyt-0012 EFQ92879 Pyrenophora teres 39 1e-45 184
WERAM-Pytr-0024 EDU50267 Pyrenophora triticirepentis 39 7e-45 181
WERAM-Gag-0017 GGTG_04964T0 Gaeumannomyces graminis 36 2e-44 180
WERAM-Ict-0161 ENSSTOP00000016952.1 Ictidomys tridecemlineatus 38 5e-44 178
WERAM-Lem-0037 CBX95014 Leptosphaeria maculans 38 6e-44 178
WERAM-Tra-0101 Traes_2DS_F8E15CA0E.1 Triticum aestivum 42 8e-44 177
WERAM-Loa-0040 ENSLAFP00000002747.3 Loxodonta africana 38 2e-43 177
WERAM-Hov-0052 MLOC_53863.1 Hordeum vulgare 42 2e-43 176
WERAM-Zem-0120 GRMZM2G352431_P01 Zea mays 41 2e-43 176
WERAM-Tas-0101 ENSTSYP00000010617.1 Tarsius syrichta 38 2e-43 176
WERAM-Orbr-0026 OB02G27880.1 Oryza brachyantha 41 2e-43 176
WERAM-Brd-0085 BRADI3G45727.1 Brachypodium distachyon 41 3e-43 176
WERAM-Sei-0057 Si016071m Setaria italica 41 4e-43 176
WERAM-Lep-0028 LPERR02G16340.1 Leersia perrieri 41 4e-43 175
WERAM-Chg-0029 EAQ89993 Chaetomium globosum 38 4e-43 175
WERAM-Sob-0071 Sb04g022620.1 Sorghum bicolor 40 4e-43 175
WERAM-Orni-0022 ONIVA02G22370.1 Oryza nivara 41 2e-42 173
WERAM-Orr-0022 ORUFI02G21420.1 Oryza rufipogon 41 2e-42 173
WERAM-Ors-0021 OS02T0554000-01 Oryza sativa 41 3e-42 172
WERAM-Trv-0017 EHK22623 Trichoderma virens 37 3e-42 172
WERAM-Orp-0022 OPUNC02G18200.1 Oryza punctata 40 3e-42 172
WERAM-Org-0023 ORGLA02G0174000.1 Oryza glaberrima 41 3e-42 172
WERAM-Orgl-0024 OGLUM02G20630.1 Oryza glumaepatula 40 7e-42 171
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 43 7e-42 171
WERAM-Leo-0066 ENSLOCP00000008456.1 Lepisosteus oculatus 43 1e-41 171
WERAM-Aet-0014 EMT20452 Aegilops tauschii 38 2e-41 169
WERAM-Ora-0074 ENSOANP00000011168.1 Ornithorhynchus anatinus 42 2e-41 169
WERAM-Sem-0083 EFJ16223 Selaginella moellendorffii 39 2e-41 169
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 37 2e-40 167
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 37 2e-40 166
WERAM-Trr-0026 EGR50011 Trichoderma reesei 37 2e-40 166
WERAM-Glm-0058 GLYMA06G12391.1 Glycine max 41 3e-40 166
WERAM-Php-0036 PP1S183_22V6.1 Physcomitrella patens 38 4e-40 166
WERAM-Viv-0111 VIT_18s0001g01700.t01 Vitis vinifera 38 4e-40 165
WERAM-Scp-0013 SPAC29B12.02c.1:pep Schizosaccharomyces pombe 44 5e-40 165
WERAM-Prp-0004 EMJ23127 Prunus persica 39 5e-40 165
WERAM-Sol-0077 Solyc06g059960.2.1 Solanum lycopersicum 39 9e-40 164
WERAM-Met-0069 KEH35350 Medicago truncatula 34 1e-39 164
WERAM-Orm-0025 OMERI02G20080.1 Oryza meridionalis 37 1e-39 164
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 36 3e-39 162
WERAM-Usm-0014 UM02500P0 Ustilago maydis 41 3e-39 162
WERAM-Pot-0026 POPTR_0002s07930.1 Populus trichocarpa 38 1e-38 160
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 36 2e-38 159
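Because species names in the orthology table contain a variable number of words, splitting rows on whitespace alone is ambiguous. One way around this, sketched below, is to read the three numeric columns from the right and the two identifiers from the left, and treat whatever remains as the species name. This assumes the protein ID is a single token; `Ortholog` and `parse_ortholog_row` are hypothetical names introduced for illustration.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int   # "Identity" column as printed; the page does not state the unit
    evalue: float
    score: int


def parse_ortholog_row(line: str) -> Ortholog:
    """Parse one orthology row with a multi-word species name."""
    tokens = line.split()
    weram_id, protein_id = tokens[0], tokens[1]
    identity, evalue, score = tokens[-3:]
    # Everything between the identifiers and the numeric columns is the species.
    species = " ".join(tokens[2:-3])
    return Ortholog(
        weram_id, protein_id, species, int(identity), float(evalue), int(score)
    )


rows = [
    "WERAM-Mup-0063 ENSMPUP00000005424.1 Mustela putorius furo 40 0.0 642",
    "WERAM-Mod-0150 ENSMODP00000021257.2 Monodelphis domestica 39 1e-180 632",
]
parsed = [parse_ortholog_row(r) for r in rows]
for p in parsed:
    print(p.species, p.evalue, p.score)
```

A row whose protein ID itself contains spaces would need to be repaired before parsing, since the sketch treats the second token as the complete identifier.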
Created Date 25-Jun-2016