Tag | Content |
WERAM ID | WERAM-Art-0042 |
Ensembl Protein ID | AT2G23380.1 |
Uniprot Accession | P93831; CLF_ARATH; O80455 |
Genbank Protein ID | NP_179919.1 |
Protein Name | Histone-lysine N-methyltransferase CLF |
Genbank Nucleotide ID | NM_127902.5 |
Gene Name | CLF |
Ensembl Information |
|
Details |
Type | Family | Domain | Substrates | AA | References (PMIDs) |
HMT | EZ | SET | H3K27 | K | 20703330; 26484201 |
|
Status | Reviewed |
Classification |
Type | Family | E-value | Score | Start | End |
HMT | EZ | 2.60e-63 | 212 | 512 | 867 |
HMT | SET1 | 2.50e-31 | 109 | 754 | 867 |
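The Start and End columns of the Classification table are residue coordinates on the protein. As a minimal sketch, assuming they are 1-based and inclusive (a common convention for profile hit coordinates, but not stated in this record), the span of each hit and the overlap between the two hits can be computed as follows. The `Hit` class and the function names are illustrative and are not part of WERAM.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Hit:
    """One row of the Classification table (hypothetical container)."""
    family: str
    start: int  # assumed 1-based, inclusive
    end: int    # assumed 1-based, inclusive


def hit_length(hit: Hit) -> int:
    """Number of residues covered by a hit with inclusive coordinates."""
    return hit.end - hit.start + 1


def overlap(a: Hit, b: Hit) -> int:
    """Number of residues shared by two hits (0 if they do not overlap)."""
    return max(0, min(a.end, b.end) - max(a.start, b.start) + 1)


# Values copied from the Classification table of this record.
ez = Hit("EZ", 512, 867)
set1 = Hit("SET1", 754, 867)

print(hit_length(ez))     # 356
print(hit_length(set1))   # 114
print(overlap(ez, set1))  # 114
```

Under that assumption the SET1 hit lies entirely inside the EZ hit, so the two rows describe overlapping matches on the same C-terminal region rather than two separate domains.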
|
Organism | Arabidopsis thaliana |
NCBI Taxa ID | 3702 |
Functional Description |
Polycomb group (PcG) protein. Catalytic subunit of some PcG multiprotein complexes, which methylate 'Lys-27' of histone H3, leading to transcriptional repression of the affected target genes. Required to regulate floral development by repressing the AGAMOUS homeotic gene in leaves, inflorescence stems and flowers. Regulates the antero-posterior organization of the endosperm, as well as the division and elongation rates of leaf cells. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development. PcG proteins are not required to initiate repression, but to maintain it during later stages of development. |
|
Domain Profile |
HMT EZ
EZ.txt 19 esvekneylgey 30
es++k+e++ge
AT2G23380.1 512 ESLRKEEFMGET 523
78999**99985 PP

EZ.txt 1 krillgksdvaGwGlflkesvekneylgeytGelisddeadkrGkiydrakssflfnlndqlvidakrkGnklkfanhsakpncyakvllvaGdhriG 98
+r+llg+sdv+GwG+flk+sv+k+eylgeytGelis++eadkrGkiydr+++sflfnlndq+v+da+rkG+klkfanhs +pncyakv++vaGdhr+G
AT2G23380.1 752 QRVLLGISDVSGWGAFLKNSVSKHEYLGEYTGELISHKEADKRGKIYDRENCSFLFNLNDQFVLDAYRKGDKLKFANHSPEPNCYAKVIMVAGDHRVG 849
79************************************************************************************************ PP

EZ.txt 99 lfakrrieaseelffdyr 116
+fak+ri a+eelf+dyr
AT2G23380.1 850 IFAKERILAGEELFYDYR 867
*****************7 PP
HMT SET1
SET1.txt 3 levakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakvvavdgekki 100
+ + s++ g+g + k++++k+e + EY+Ge+i++++adkr k y++++++ +lf+l+++ +v+da +kg++ +f nhs+epNc+akv++v g++++
AT2G23380.1 754 VLLGISDVSGWGAFLKNSVSKHEYLGEYTGELISHKEADKRGKIYDRENCS-FLFNLNDQ--FVLDAYRKGDKLKFANHSPEPNCYAKVIMVAGDHRV 848
677899*****************************************9776.********..************************************ PP

SET1.txt 101 viyakraIekgeeltydYk 119
+i+ak +I +geel+ydY+
AT2G23380.1 849 GIFAKERILAGEELFYDYR 867
******************6 PP
|
Protein Sequence (Fasta) |
>AT2G23380.1|CLF|Arabidopsis thaliana MASEASPSSSATRSEPPKDSPAEERGPASKEVSEVIESLKKKLAADRCISIKKRIDENKKNLFAITQSFMRSSMERGGSCKDGSDLLVKRQRDSPGMKSGIDESNNNRYVEDGPASSGMVQGSSVPVKISLRPIKMPDIKRLSPYTTWVFLDRNQRMTEDQSVVGRRRIYYDQTGGEALICSDSEEEAIDDEEEKRDFLEPEDYIIRMTLEQLGLSDSVLAELASFLSRSTSEIKARHGVLMKEKEVSESGDNQAESSLLNKDMEGALDSFDNLFCRRCLVFDCRLHGCSQDLIFPAEKPAPWCPPVDENLTCGANCYKTLLKSGRFPGYGTIEGKTGTSSDGAGTKTTPTKFSSKLNGRKPKTFPSESASSNEKCALETSDSENGLQQDTNSDKVSSSPKVKGSGRRVGRKRNKNRVAERVPRKTQKRQKKTEASDSDSIASGSCSPSDAKHKDNEDATSSSQKHVKSGNSGKSRKNGTPAEVSNNSVKDDVPVCQSNEVASELDAPGSDESLRKEEFMGETVSRGRLATNKLWRPLEKSLFDKGVEIFGMNSCLIARNLLSGFKSCWEVFQYMTCSENKASFFGGDGLNPDGSSKFDINGNMVNNQVRRRSRFLRRRGKVRRLKYTWKSAAYHSIRKRITEKKDQPCRQFNPCNCKIACGKECPCLLNGTCCEKYCGCPKSCKNRFRGCHCAKSQCRSRQCPCFAADRECDPDVCRNCWVIGGDGSLGVPSQRGDNYECRNMKLLLKQQQRVLLGISDVSGWGAFLKNSVSKHEYLGEYTGELISHKEADKRGKIYDRENCSFLFNLNDQFVLDAYRKGDKLKFANHSPEPNCYAKVIMVAGDHRVGIFAKERILAGEELFYDYRYEPDRAPAWAKKPEAPGSKKDENVTPSVGRPKKLA
|
Nucleotide Sequence (Fasta) |
>AT2G23380.1|SET1|Arabidopsis thaliana GATCTGGTTTCTTGACAATGGCGTCAGAAGCTTCGCCTTCTTCTTCGGCCACCAGATCGGAGCCACCCAAAGACTCTCCGGCGGAGGAGAGAGGTCCAGCTTCTAAGGAAGTATCAGAAGTAATAGAATCGCTAAAGAAGAAGCTTGCAGCTGATAGGTGTATATCAATAAAGAAAAGGATTGATGAAAACAAGAAGAATTTGTTTGCTATTACTCAAAGTTTTATGAGGTCTTCTATGGAACGAGGAGGTAGCTGTAAAGATGGCAGTGATCTTTTAGTTAAGAGGCAAAGAGATTCGCCAGGTATGAAAAGCGGAATCGATGAAAGTAATAACAACAGATATGTAGAAGATGGACCTGCCAGTTCAGGAATGGTTCAAGGATCTAGTGTCCCTGTCAAAATTTCCTTACGTCCTATCAAAATGCCTGATATCAAACGTTTGTCACCTTATACCACATGGGTTTTTCTGGACAGAAATCAAAGAATGACTGAAGACCAGTCTGTAGTGGGTCGAAGGAGAATTTATTATGATCAAACTGGCGGGGAAGCGCTTATCTGCAGTGATAGTGAAGAGGAGGCCATTGACGACGAAGAAGAAAAAAGAGATTTTTTGGAGCCTGAAGATTATATTATTCGCATGACCCTTGAGCAACTAGGTCTTTCAGACTCAGTCCTGGCGGAACTAGCAAGTTTCTTGTCTAGAAGTACTAGTGAAATCAAGGCAAGACATGGAGTGCTTATGAAGGAAAAAGAAGTATCCGAGAGTGGCGATAATCAAGCAGAGAGCTCCCTTCTCAACAAAGATATGGAAGGAGCATTAGATTCTTTCGATAACCTGTTCTGCCGTAGATGCCTTGTATTTGATTGCCGGCTTCATGGGTGTTCACAGGATCTCATTTTTCCGGCTGAGAAACCAGCTCCATGGTGTCCTCCTGTAGATGAAAATTTAACCTGTGGTGCAAACTGCTATAAAACGCTTCTCAAGTCTGGAAGATTTCCGGGATATGGCACCATTGAAGGTAAAACTGGCACTTCATCAGATGGTGCAGGTACTAAAACCACACCCACGAAGTTCTCCAGCAAACTTAATGGGAGAAAACCAAAGACCTTCCCAAGTGAAAGTGCATCGTCTAATGAAAAGTGCGCACTAGAAACAAGTGACTCAGAGAATGGACTACAGCAGGATACCAATTCCGATAAAGTTTCATCATCGCCAAAGGTGAAAGGTAGTGGGAGACGAGTAGGTCGTAAGAGGAACAAAAACCGAGTTGCTGAGCGAGTTCCTCGTAAGACTCAGAAGAGGCAGAAGAAGACAGAAGCCTCGGATAGTGATTCCATCGCCAGTGGAAGTTGTTCACCCAGCGATGCAAAACATAAAGATAATGAAGATGCTACTTCCTCTTCTCAGAAGCATGTAAAATCTGGGAACTCCGGGAAGTCAAGGAAGAATGGCACTCCTGCCGAAGTCTCCAATAATTCTGTGAAGGATGACGTTCCTGTTTGCCAGTCAAATGAGGTTGCGTCAGAGCTTGATGCGCCGGGTAGTGATGAAAGTCTAAGGAAAGAAGAGTTTATGGGTGAAACTGTATCTCGAGGAAGATTGGCTACAAATAAGTTGTGGAGACCACTTGAGAAAAGCCTTTTTGATAAAGGTGTTGAGATTTTTGGAATGAATAGCTGCTTGATTGCTAGAAATCTTTTGAGTGGTTTCAAATCATGTTGGGAGGTCTTCCAATACATGACGTGCTCGGAAAATAAAGCTTCCTTCTTTGGAGGTGATGGATTGAATCCTGATGGCTCTTCCAAGTTCGATATCAATGGAAATATGGTTAATAACCAAGTGAGGAGAAGGTCAAGATTTCTACGTAGGAGAGGCAAAGTGCGGCGCTTGAAGTATACCTGGAAGTCTGCTGCATATCATTCAATTAGGAAAAGAATTACTGAGAAGAAAGACCAGCCC
TGCCGTCAGTTTAATCCATGTAACTGCAAAATTGCTTGTGGGAAGGAATGTCCTTGTTTGCTAAACGGGACTTGCTGCGAGAAGTACTGCGGTTGCCCAAAGAGCTGCAAGAATAGGTTTAGAGGTTGTCATTGTGCCAAAAGTCAGTGTCGAAGCCGCCAGTGTCCATGCTTTGCTGCAGATCGGGAATGTGACCCAGATGTTTGTAGAAACTGCTGGGTCATTGGTGGGGATGGTTCGCTTGGGGTCCCAAGCCAAAGAGGCGATAATTATGAGTGCAGGAATATGAAATTGCTCCTGAAACAACAACAAAGGGTTTTACTTGGAATATCTGATGTTTCTGGTTGGGGAGCTTTCTTAAAGAACAGTGTAAGTAAGCATGAATACCTTGGGGAATACACAGGAGAGCTGATCTCACATAAAGAGGCAGATAAACGCGGGAAGATATACGATCGCGAGAACTGCTCTTTTCTCTTCAATCTAAACGATCAGTTTGTGCTAGATGCTTACAGGAAAGGAGATAAACTGAAATTCGCCAACCATTCTCCTGAACCTAACTGTTACGCAAAGGTCATCATGGTTGCTGGAGATCACAGGGTGGGGATCTTCGCAAAAGAGAGGATACTGGCTGGAGAAGAACTATTTTACGATTACCGGTATGAGCCAGATCGAGCTCCAGCTTGGGCCAAAAAACCTGAAGCTCCTGGTTCTAAGAAAGACGAAAATGTTACACCTTCTGTTGGTAGACCCAAGAAGCTTGCTTAGCAACAAAAGAAACAACCATTTTTTTGTCAATTCTTTGGTTACAGGTGGAAGAACGCTTTAATCCTCATTACTCTCCACACGGAAGAACACATTGAAACAAATTCATACATTTTGCTGAGTCTAAAGAAAAATTGTATTCGTTGGATTAAAATTTCCTTTTTTCGTTTTACATTTCTGGATTATCATTTTATTGTACTGAGACTCGGGTTAAAGTTTTTAAATTACAGATGAGAAACTTGGTG
|
Sequence Source |
Ensembl |
Keyword |
KW-0181--Complete proteome KW-0217--Developmental protein KW-0221--Differentiation KW-0287--Flowering KW-0489--Methyltransferase KW-0539--Nucleus KW-1185--Reference proteome KW-0678--Repressor KW-0949--S-adenosyl-L-methionine KW-0804--Transcription KW-0805--Transcription regulation KW-0808--Transferase
|
Interpro |
IPR026489--CXC_dom IPR025778--Hist-Lys_N-MeTrfase_EZ IPR001214--SET_dom IPR033467--Tesmin/TSO1-like_CXC
|
PROSITE |
PS51633--CXC PS51576--SAM_MT43_EZ PS50280--SET
|
Pfam |
PF00856--SET
|
Gene Ontology |
GO:0005634--C:nucleus GO:0031519--C:PcG protein complex GO:0018024--F:histone-lysine N-methyltransferase activity GO:0003727--F:single-stranded RNA binding GO:0003700--F:transcription factor activity, sequence-specific DNA binding GO:0030154--P:cell differentiation GO:0009294--P:DNA mediated transformation GO:0009908--P:flower development GO:0034968--P:histone lysine methylation GO:0016571--P:histone methylation GO:0009965--P:leaf morphogenesis GO:0045857--P:negative regulation of molecular function, epigenetic GO:0006349--P:regulation of gene expression by genetic imprinting GO:0006351--P:transcription, DNA-templated GO:0010228--P:vegetative to reproductive phase transition of meristem
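Each Gene Ontology entry above follows the layout `GO:<id>--<aspect>:<name>`, where the aspect letters C, F and P are the usual abbreviations for cellular component, molecular function and biological process. Because term names themselves contain spaces and commas, splitting on whitespace is not enough. The sketch below groups terms by aspect with a regular expression; the pattern is inferred from this record's layout only, and `parse_go` is a hypothetical helper name.

```python
import re
from collections import defaultdict

# A few terms copied from this record; the full line parses the same way.
GO_LINE = (
    "GO:0005634--C:nucleus GO:0031519--C:PcG protein complex "
    "GO:0018024--F:histone-lysine N-methyltransferase activity "
    "GO:0003700--F:transcription factor activity, sequence-specific DNA binding "
    "GO:0009908--P:flower development"
)

# Assumed layout: ID, "--", one aspect letter, ":", then the name, which runs
# until the next "GO:NNNNNNN--" token or the end of the line.
TERM = re.compile(r"(GO:\d{7})--([CFP]):(.+?)(?=\s+GO:\d{7}--|$)")

ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go(line: str) -> dict:
    """Group (GO id, name) pairs by aspect."""
    grouped = defaultdict(list)
    for go_id, aspect, name in TERM.findall(line):
        grouped[ASPECTS[aspect]].append((go_id, name.strip()))
    return dict(grouped)


terms = parse_go(GO_LINE)
for aspect, entries in terms.items():
    print(aspect, entries)
```

The lookahead keeps multi-word names such as "transcription factor activity, sequence-specific DNA binding" intact instead of cutting them at the first space.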
|
Orthology |
|
Created Date |
25-Jun-2016 |