Tag |
Content |
WERAM ID |
WERAM-Sac-0021 |
Ensembl Protein ID |
YHR119W |
Uniprot Accession |
P38827; SET1_YEAST; D3DL69 |
Genbank Protein ID |
NP_011987.1 |
Protein Name |
Histone-lysine N-methyltransferase, H3 lysine-4 specific |
Genbank Nucleotide ID |
NM_001179249.1 |
Gene Name |
SET1 |
Ensembl Information |
|
Details |
Type | Family | Domain | Substrates | AA | References (PMIDs) |
HMT | SET1 | SET | H3K4 | K | 20236312 |
|
Status |
Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | SET1 | 1.00e-45 | 153.1 | 339 | 1055 |
|
Organism |
Saccharomyces cerevisiae |
NCBI Taxa ID |
559292 |
Functional Description |
Catalytic component of the COMPASS (Set1C) complex that specifically mono-, di- and trimethylates histone H3 to form H3K4me1/2/3, which subsequently plays a role in telomere length maintenance and transcription elongation regulation.
|
Domain Profile |
HMT SET1
SET1.txt   80 nhscepNceakvvavdge 97
              +hs   N+ +k v+++ +
 YHR119W  339 KHSILNNIISKFVEINVK 356
              79999******9999865 PP

SET1.txt    1 kelevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNceakvvavdgek 98
              k++++a+s+i+++gl+a ++i+++e++iEYvGe ir+ va++rek+y k++ig +ylfr+de+  +v+datkkg+iarfinh+c+pNc+ak+++v g++
 YHR119W  938 KPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGsSYLFRVDEN--TVIDATKKGGIARFINHCCDPNCTAKIIKVGGRR 1034
              689**************************************************9*********..********************************** PP

SET1.txt   99 kiviyakraIekgeeltydYk 119
              +iviya r+I++ eeltydYk
 YHR119W 1035 RIVIYALRDIAASEELTYDYK 1055
              ********************7 PP
|
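The Classification row reports a single Start/End pair (339 to 1055), while the Domain Profile aligns two separate segments (339 to 356 and 938 to 1055). The following is a minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive (which agrees with the 18-residue segment 339 to 356). The helper name `span_length` and the constants are illustrative and are not part of WERAM.

```python
def span_length(start: int, end: int) -> int:
    """Number of residues in a 1-based, inclusive coordinate range."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1


# Values copied from this record.
PROTEIN_LENGTH = 1080                   # residues in YHR119W
ENVELOPE = (339, 1055)                  # Start/End from the Classification row
SEGMENTS = [(339, 356), (938, 1055)]    # aligned segments in the Domain Profile

envelope_len = span_length(*ENVELOPE)
aligned_len = sum(span_length(start, end) for start, end in SEGMENTS)

print(f"envelope: {envelope_len} residues "
      f"({envelope_len / PROTEIN_LENGTH:.1%} of the protein)")
print(f"aligned:  {aligned_len} residues in {len(SEGMENTS)} segments")
```

Read this way, the Start/End pair describes an envelope spanning both hits rather than one contiguous domain: only 136 of its 717 residues are actually aligned to the SET1 profile.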
Protein Sequence (Fasta) |
>YHR119W|SET1|Saccharomyces cerevisiae MSNYYRRAHASSGSYRQPQEQPQYSRSGHYQYSNGHSHQQYSSQYNQRRRYNHNDGTRRRYNDDRPHSSNNASTRQYYATNNSQSGPYVNKKSDISSRRGMSQSRYSNSNVHNTLASSSGSLPTESALLLQQRPPSVLRYNTDNLKSKFHYFDPIKGEFFNKDKMLSWKATDKEFSETGYYVVKELQDGQFKFKIKHRHPEIKASDPRNENGIMTSGKVATHRKCRNSLILLPRISYDRYSLGPPPSCEIVVYPAQDSTTTNIQDISIKNYFKKYGEISHFEAFNDPNSALPLHVYLIKYASSDGKINDAAKAAFSAVRKHESSGCFIMGFKFEVILNKHSILNNIISKFVEINVKKLQKLQENLKKAKEKEAENEKAKELQGKDITLPKEPKVDTLSHSSGSEKRIPYDLLGVVNNRPVLHVSKIFVAKHRFCVEDFKYKLRGYRCAKFIDHPTGIYIIFNDIAHAQTCSNAESGNLTIMSRSRRIPILIKFHLILPRFQNRTRFNKSSSSSNSTNVPIKYESKEEFIEATAKQILKDLEKTLHVDIKKRLIGPTVFDALDHANFPELLAKRELKEKEKRQQIASKIAEDELKRKEEAKRDFDLFGLYGGYAKSNKRNLKRHNSLALDHTSLKRKKLSNGIKPMAHLLNEETDSKETTPLNDEGITRVSKEHDEEDENMTSSSSEEEEEEAPDKKFKSESEPTTPESDHLHGIKPLVPDQNGSSDVLDASSMYKPTATEIPEPVYPPEEYDLKYSQTLSSMDLQNAIKDEEDMLILKQLLSTYTPTVTPETSAALEYKIWQSRRKVLEEEKASDWQIELNGTLFDSELQPGSSFKAEGFRKIADKLKINYLPHRRRVHQPLNTVNIHNERNEYTPELCQREESSNKEPSDSVPQEVSSSRDNRASNRRFQQDIEAQKAAIGTESELLSLNQLNKRKKPVMFARSAIHNWGLYALDSIAAKEMIIEYVGERIRQPVAEMREKRYLKNGIGSSYLFRVDENTVIDATKKGGIARFINHCCDPNCTAKIIKVGGRRRIVIYALRDIAASEELTYDYKFEREKDDEERLPCLCGAPNCKGFLN
|
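The protein FASTA header above packs three fields into one line, separated by `|`: locus, gene name, and organism. A minimal parsing sketch follows, assuming standard FASTA layout with the header on its own line (the page shows header and sequence run together). `FastaRecord` and `parse_fasta_record` are hypothetical names, and the example sequence is truncated to the first 60 residues of this record.

```python
from typing import NamedTuple


class FastaRecord(NamedTuple):
    locus: str
    gene: str
    organism: str
    sequence: str


def parse_fasta_record(text: str) -> FastaRecord:
    """Parse one FASTA record whose header is 'locus|gene|organism'."""
    lines = [line.strip() for line in text.strip().splitlines() if line.strip()]
    if not lines or not lines[0].startswith(">"):
        raise ValueError("expected a FASTA header starting with '>'")
    fields = lines[0][1:].split("|")
    if len(fields) != 3:
        raise ValueError("expected three '|'-separated header fields")
    locus, gene, organism = (field.strip() for field in fields)
    # Sequence lines may be wrapped; join them into one string.
    return FastaRecord(locus, gene, organism, "".join(lines[1:]))


# Truncated example: only the first 60 residues of the record.
example = """>YHR119W|SET1|Saccharomyces cerevisiae
MSNYYRRAHASSGSYRQPQEQPQYSRSGHYQYSNGHSHQQYSSQYNQRRRYNHNDGTRRR
"""

record = parse_fasta_record(example)
print(record.locus, record.gene, record.organism, len(record.sequence))
```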
Nucleotide Sequence (Fasta) |
>YHR119W|SET1|Saccharomyces cerevisiae ATGTCAAATTACTATAGAAGAGCACACGCGTCTTCTGGTTCATACAGACAACCCCAGGAACAGCCTCAATATTCGCGTTCTGGTCACTATCAGTATTCAAACGGCCATTCTCACCAACAATATTCTAGTCAATATAATCAACGTCGACGTTATAACCATAATGATGGTACAAGGCGACGCTATAATGACGATCGCCCACATAGTTCAAACAATGCAAGTACGCGACAGTACTATGCTACTAACAACAGCCAAAGCGGCCCATATGTAAATAAGAAATCTGACATCAGTAGTCGGAGGGGCATGTCTCAATCACGGTATTCAAATAGCAATGTTCACAATACATTAGCGTCTTCGAGTGGATCTCTTCCCACAGAATCTGCTCTGCTTTTGCAACAAAGACCACCTTCAGTTTTGAGATACAACACAGATAATTTGAAGTCTAAGTTTCATTATTTTGATCCCATAAAAGGCGAGTTCTTCAATAAGGATAAGATGCTTTCGTGGAAGGCTACAGATAAAGAATTTTCTGAAACAGGTTATTACGTAGTCAAAGAGTTACAAGATGGACAGTTTAAGTTCAAAATAAAACACAGACATCCGGAGATAAAAGCATCCGACCCACGTAATGAAAACGGTATCATGACTAGCGGAAAAGTGGCAACCCACAGAAAATGCAGGAACTCACTAATTCTATTGCCTCGCATATCTTATGACAGGTACTCCTTAGGGCCTCCCCCTTCATGTGAAATAGTTGTCTATCCAGCGCAAGATTCAACAACAACCAATATCCAAGACATATCAATAAAAAACTATTTTAAAAAGTATGGAGAAATTTCTCATTTTGAAGCATTTAATGATCCTAATAGCGCTTTACCTTTGCATGTTTATCTTATAAAGTATGCCAGTTCTGATGGAAAAATTAATGATGCAGCAAAAGCAGCCTTTAGTGCCGTTAGAAAGCACGAATCTTCGGGTTGCTTTATCATGGGCTTCAAGTTCGAAGTGATTTTAAACAAGCATTCCATTTTGAATAATATCATTTCTAAATTTGTTGAAATAAATGTCAAAAAGCTACAGAAGTTACAAGAGAACCTGAAGAAGGCTAAAGAGAAAGAAGCAGAAAACGAAAAAGCAAAGGAATTACAGGGCAAAGATATTACCTTGCCCAAGGAACCTAAGGTAGACACATTATCTCATTCGTCCGGAAGTGAAAAAAGAATTCCATATGATCTCTTGGGGGTAGTTAATAACAGACCTGTTTTACATGTCTCCAAAATATTTGTTGCCAAACATAGGTTCTGCGTTGAGGACTTTAAATACAAGTTAAGGGGATACAGATGTGCGAAATTTATTGATCATCCAACTGGTATCTATATTATTTTTAATGACATTGCCCATGCGCAAACATGTTCGAATGCAGAGTCAGGAAATTTAACAATAATGTCTCGGAGCAGAAGAATTCCTATTCTAATAAAGTTTCATCTCATTCTCCCTAGGTTCCAAAACAGAACTAGATTCAATAAATCTAGCTCATCTTCAAATTCTACAAATGTACCTATAAAATACGAGTCCAAAGAGGAGTTCATTGAAGCTACAGCAAAACAAATATTAAAAGATTTGGAAAAGACTTTACATGTTGATATTAAGAAGAGATTGATTGGTCCTACGGTATTTGATGCTTTGGACCATGCAAATTTTCCTGAATTGTTAGCTAAAAGAGAACTAAAGGAGAAAGAGAAGAGACAACAGATTGCATCTAAAATTGCTGAAGATGAATTGAAACGTAAAGAAGAAGCCAAAAGAGATTTTGATTTGTTTGGTTTATATGGTGGCTATGCAAAATCTAATAAAAGAAATTTAAAAAGGCATAATTCACTCGCGTTGGATCATACTTCTTTAAAGAGGAAAAAGCTATCCAATGGTATCAAACCAATGGCACATTTACTGAACGAAGAAACCGA
TTCCAAAGAAACTACCCCATTGAACGATGAAGGGATCACTCGCGTATCAAAAGAACATGATGAAGAAGACGAAAATATGACATCTTCATCTTCTGAAGAAGAGGAAGAAGAAGCTCCAGATAAGAAATTCAAGAGTGAGTCTGAGCCAACCACCCCCGAATCTGATCACCTTCATGGTATTAAGCCGTTAGTACCCGATCAAAATGGGTCGTCTGACGTACTGGATGCTTCTTCGATGTATAAACCTACTGCTACCGAAATTCCCGAACCTGTATATCCACCTGAGGAATATGACTTGAAATATAGTCAGACTTTATCTTCTATGGATTTGCAGAATGCTATCAAAGATGAGGAAGATATGCTAATTTTAAAGCAGTTATTGAGCACATATACTCCTACCGTCACACCAGAAACAAGCGCAGCTCTGGAATATAAAATTTGGCAATCTCGCCGAAAAGTTCTTGAAGAAGAGAAGGCTTCCGATTGGCAAATAGAGCTTAATGGAACTTTATTTGATAGTGAACTACAACCAGGTAGCTCTTTTAAAGCTGAAGGGTTCAGGAAAATTGCGGATAAATTAAAAATTAATTACCTACCCCATCGTCGCAGAGTTCACCAACCTTTAAATACGGTGAATATTCACAATGAAAGGAATGAGTACACACCTGAACTTTGTCAAAGAGAAGAATCCTCGAATAAAGAACCTTCAGACTCAGTTCCTCAAGAAGTTTCATCCTCTAGAGATAATAGGGCATCAAATAGAAGATTTCAGCAGGACATAGAGGCACAGAAAGCCGCAATTGGTACGGAATCTGAGCTGCTATCACTAAATCAATTAAATAAAAGAAAAAAGCCAGTTATGTTCGCTCGTTCAGCAATTCACAACTGGGGTTTATATGCTCTAGACTCTATCGCAGCAAAGGAAATGATTATCGAGTACGTTGGTGAAAGGATCAGGCAACCTGTAGCAGAAATGAGAGAGAAAAGATATCTGAAAAATGGGATTGGATCCAGTTACCTTTTTAGGGTTGATGAAAACACGGTTATTGATGCCACCAAGAAAGGTGGTATAGCCCGTTTCATTAATCATTGTTGTGATCCAAATTGTACGGCAAAGATTATAAAGGTTGGCGGGAGAAGGAGAATTGTTATCTATGCACTGCGTGATATCGCGGCAAGCGAAGAGTTGACATATGATTACAAATTTGAGAGAGAAAAGGATGACGAGGAAAGACTTCCTTGTTTATGTGGAGCACCTAATTGTAAAGGTTTCTTGAACTGA
|
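The nucleotide and protein sequences in this record can be cross-checked: a coding sequence for a 1080-residue protein plus one stop codon should be 3 × (1080 + 1) = 3243 nucleotides long, and its first codons should translate to the start of the protein. A minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1); `translate` and `expected_cds_length` are illustrative helpers, and the test fragments are the first 30 and last 15 nucleotides of the sequence above.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


def expected_cds_length(protein_length: int) -> int:
    """Nucleotides needed for the protein plus one stop codon."""
    return 3 * (protein_length + 1)


first_30 = "ATGTCAAATTACTATAGAAGAGCACACGCG"  # start of the CDS above
last_15 = "GGTTTCTTGAACTGA"                  # end of the CDS above

print(translate(first_30))
print(translate(last_15))
print(expected_cds_length(1080))
```

Both fragments agree with the protein sequence in this record, which begins MSNYYRRAHA and ends GFLN.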
Sequence Source |
Ensembl |
Keyword |
KW-0002--3D-structure; KW-0156--Chromatin regulator; KW-0158--Chromosome; KW-0181--Complete proteome; KW-0489--Methyltransferase; KW-0539--Nucleus; KW-0597--Phosphoprotein; KW-1185--Reference proteome; KW-0949--S-adenosyl-L-methionine; KW-0808--Transferase
|
Interpro |
IPR024657--COMPASS_Set1_N-SET; IPR017111--Hist_H3-K4_MeTrfase_1_fun; IPR003616--Post-SET_dom; IPR024636--SET_assoc; IPR001214--SET_dom
|
PROSITE |
PS50868--POST_SET; PS51572--SAM_MT43_1; PS50280--SET
|
Pfam |
PF11764--N-SET; PF00856--SET; PF11767--SET_assoc
|
Gene Ontology |
GO:0000781--C:chromosome, telomeric region; GO:0048188--C:Set1C/COMPASS complex; GO:0042054--F:histone methyltransferase activity; GO:0018024--F:histone-lysine N-methyltransferase activity; GO:0016279--F:protein-lysine N-methyltransferase activity; GO:0003723--F:RNA binding; GO:0030437--P:ascospore formation; GO:0030466--P:chromatin silencing at silent mating-type cassette; GO:0006348--P:chromatin silencing at telomere; GO:0044648--P:histone H3-K4 dimethylation; GO:0051568--P:histone H3-K4 methylation; GO:0080182--P:histone H3-K4 trimethylation; GO:0018027--P:peptidyl-lysine dimethylation; GO:0035066--P:positive regulation of histone acetylation; GO:1903341--P:regulation of meiotic DNA double-strand break formation; GO:0043618--P:regulation of transcription from RNA polymerase II promoter in response to stress; GO:0000723--P:telomere maintenance
|
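Each Gene Ontology entry above follows the pattern `GO:<7 digits>--<aspect>:<term name>`, where the aspect letter is C (cellular component), F (molecular function), or P (biological process). Because term names contain spaces and commas, splitting on whitespace is unreliable. Below is a minimal sketch that splits on the `GO:` identifiers instead; `GoTerm` and `parse_go_cell` are hypothetical names, and the sample holds three entries copied from this record.

```python
import re
from typing import NamedTuple


class GoTerm(NamedTuple):
    go_id: str
    aspect: str  # "C", "F" or "P"
    name: str


# A term name runs until the next "GO:nnnnnnn--" identifier or the end of the cell.
_ENTRY = re.compile(
    r"(GO:\d{7})--([CFP]):(.*?)(?=\s*;?\s*GO:\d{7}--|$)",
    re.S,
)


def parse_go_cell(cell: str) -> list[GoTerm]:
    """Split a flattened Gene Ontology cell into structured terms."""
    return [
        GoTerm(m.group(1), m.group(2), m.group(3).strip(" ;\n\t"))
        for m in _ENTRY.finditer(cell)
    ]


sample = (
    "GO:0000781--C:chromosome, telomeric region "
    "GO:0048188--C:Set1C/COMPASS complex "
    "GO:0003723--F:RNA binding"
)

terms = parse_go_cell(sample)
for term in terms:
    print(term.go_id, term.aspect, term.name)
```

The same pattern tolerates an optional `;` between entries, so it works whether the cell is space-separated or semicolon-separated.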
Orthology |
|
Created Date |
25-Jun-2016 |