WERAM Information


Tag Content
WERAM ID WERAM-Hos-0263
Ensembl Protein ID ENSP00000398837.1
Uniprot Accession Q9UMN6; KMT2B_HUMAN; O15022; O95836; Q96GP2; Q96IP3; Q9UK25; Q9Y668; Q9Y669
Genbank Protein ID NP_055542.1
Protein Name Histone-lysine N-methyltransferase 2B
Genbank Nucleotide ID NM_014727.2
Gene Name KMT2B;HRX2;MLL2;MLL4;TRX2;WBP7;MLL1B;WBP-7
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000272333.5 ENST00000420124.2 ENSP00000398837.1
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SET1 SET H3K4 K 25537518; 20951770; 26886794
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET1 1.50e-45 156.4 2576 2691
Me_Reader PHD 1.40e-10 43.4 1202 1394
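The Start and End columns above appear to be 1-based, inclusive residue positions: the SET1 hit at 2576-2691 matches the residues shown in the Domain Profile alignment. Below is a minimal Python sketch, assuming that convention, of slicing a hit out of the protein sequence. The `window` string is residues 2521-2700 copied from the Protein Sequence section, and the helper name `domain_slice` is hypothetical.

```python
# Residues 2521-2700 of ENSP00000398837.1, copied from the Protein Sequence section.
WINDOW_START = 2521
window = (
    "FDMFNFLASQHRVLPEGATCDEEEDEVQLRSTRRATSLELPMAMRFRHLKKTSKEAVGVY"
    "RSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGCYMFRMDDFDVVD"
    "ATMHGNAARFINHSCEPNCFSRVIHVEGQKHIVIFALRRILRGEELTYDYKFPIEDASNK"
)


def domain_slice(seq, start, end, offset=1):
    """Return residues start..end (1-based, inclusive).

    `offset` is the sequence position of seq[0]; it is 1 for a full-length
    sequence and 2521 for the window used here.
    """
    return seq[start - offset : end - offset + 1]


# SET1 hit from the Classification table (assumed 1-based, inclusive).
set_domain = domain_slice(window, 2576, 2691, offset=WINDOW_START)
print(len(set_domain), set_domain[:5], set_domain[-5:])
```

With the full-length sequence, the same helper can be called with the default `offset=1`.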
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Histone methyltransferase. Methylates 'Lys-4' of histone H3. H3 'Lys-4' methylation represents a specific tag for epigenetic transcriptional activation. Plays a central role in beta-globin locus transcription regulation by being recruited by NFE2. Plays an important role in controlling bulk H3K4me during oocyte growth and preimplantation development. Required during the transcriptionally active period of oocyte growth for the establishment and/or maintenance of bulk H3K4 trimethylation (H3K4me3), global transcriptional silencing that precedes resumption of meiosis, oocyte survival and normal zygotic genome activation.
Domain Profile
  HMT SET1

           SET1.txt    2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNceakv 91  
++ v++s+i+g+gl++k++i+++e+viEY G virs ++dkrek y+ k+ig+y+fr+d+ vvdat++gn+arfinhscepNc ++v
ENSP00000398837.1 2576 AVGVYRSAIHGRGLFCKRNIDAGEMVIEYSGIVIRSVLTDKREKFYDGKGIGCYMFRMDDF--DVVDATMHGNAARFINHSCEPNCFSRV 2663
58899*******************************************************9..9************************** PP
SET1.txt 92 vavdgekkiviyakraIekgeeltydYk 119
++v+g+k+ivi+a r+I +geeltydYk
ENSP00000398837.1 2664 IHVEGQKHIVIFALRRILRGEELTYDYK 2691
***************************7 PP

  Me_Reader PHD

            PHD.txt    2 tiClvCgkddegekemvqCdeCddwfHlkCvklplsslpeg.kswyCpsCk 51  
++Cl+C ++ +e+v C+ C d fH C++ ++++lp+ + w+C++Ck
ENSP00000398837.1 1202 MVCLLCASKGL--HELVFCQVCCDPFHPFCLEEAERPLPQHhDTWCCRRCK 1250
68****44444..459******************9999966467******7 PP
PHD.txt 2 tiClvCgkddegekemvqCdeCddwfHlkCvklp.lsslp.egkswyCpsCke 52
++C vCg+++ g+k +++C+ C++++H C++++ ++++ + ++w+C C++
ENSP00000398837.1 1250 KFCHVCGRKGRGSKHLLECERCRHAYHPACLGPSyPTRATrKRRHWICSACVR 1302
59****999999999*******************54444434478******86 PP
PHD.txt 2 tiClvCgkddegeke...mvqCdeCddwfHlkCvklp......lsslpegkswyCpsCk 51
++C++C +++e++++ m+qC +Cd+w+H+kC +l+ ls lp++ ++C C
ENSP00000398837.1 1336 NYCPICTRCYEDNDYeskMMQCAQCDHWVHAKCEGLSdedyeiLSGLPDSVLYTCGPCA 1394
688888655555443444*******************7766666666666779999995 PP

Protein Sequence
MAAAAGGGSC PGPGSARGRF PGRPRGAGGG GGRGGRGNGA ERVRVALRRG GGATGPGGAE 60
PGEDTALLRL LGLRRGLRRL RRLWAGPRVQ RGRGRGRGRG WGPSRGCVPE EESSDGESDE 120
EEFQGFHSDE DVAPSSLRSA LRSQRGRAPR GRGRKHKTTP LPPPRLADVA PTPPKTPARK 180
RGEEGTERMV QALTELLRRA QAPQAPRSRA CEPSTPRRSR GRPPGRPAGP CRRKQQAVVV 240
AEAAVTIPKP EPPPPVVPVK HQTGSWKCKE GPGPGPGTPR RGGQSSRGGR GGRGRGRGGG 300
LPFVIKFVSR AKKVKMGQLS LGLESGQGQG QHEESWQDVP QRRVGSGQGG SPCWKKQEQK 360
LDDEEEEKKE EEEKDKEGEE KEERAVAEEM MPAAEKEEAK LPPPPLTPPA PSPPPPLPPP 420
STSPPPPLCP PPPPPVSPPP LPSPPPPPAQ EEQEESPPPV VPATCSRKRG RPPLTPSQRA 480
EREAARAGPE GTSPPTPTPS TATGGPPEDS PTVAPKSTTF LKNIRQFIMP VVSARSSRVI 540
KTPRRFMDED PPKPPKVEVS PVLRPPITTS PPVPQEPAPV PSPPRAPTPP STPVPLPEKR 600
RSILREPTFR WTSLTRELPP PPPAPPPPPA PSPPPAPATS SRRPLLLRAP QFTPSEAHLK 660
IYESVLTPPP LGAPEAPEPE PPPADDSPAE PEPRAVGRTN HLSLPRFAPV VTTPVKAEVS 720
PHGAPALSNG PQTQAQLLQP LQALQTQLLP QALPPPQPQL QPPPSPQQMP PLEKARIAGV 780
GSLPLSGVEE KMFSLLKRAK VQLFKIDQQQ QQKVAASMPL SPGGQMEEVA GAVKQISDRG 840
PVRSEDESVE AKRERPSGPE SPVQGPRIKH VCRHAAVALG QARAMVPEDV PRLSALPLRD 900
RQDLATEDTS SASETESVPS RSRRGKVEAA GPGGESEPTG SGGTLAHTPR RSLPSHHGKK 960
MRMARCGHCR GCLRVQDCGS CVNCLDKPKF GGPNTKKQCC VYRKCDKIEA RKMERLAKKG 1020
RTIVKTLLPW DSDESPEASP GPPGPRRGAG AGGPREEVVA HPGPEEQDSL LQRKSARRCV 1080
KQRPSYDIFE DSDDSEPGGP PAPRRRTPRE NELPLPEPEE QSRPRKPTLQ PVLQLKARRR 1140
LDKDALAPGP FASFPNGWTG KQKSPDGVHR VRVDFKEDCD LENVWLMGGL SVLTSVPGGP 1200
PMVCLLCASK GLHELVFCQV CCDPFHPFCL EEAERPLPQH HDTWCCRRCK FCHVCGRKGR 1260
GSKHLLECER CRHAYHPACL GPSYPTRATR KRRHWICSAC VRCKSCGATP GKNWDVEWSG 1320
DYSLCPRCTQ LYEKGNYCPI CTRCYEDNDY ESKMMQCAQC DHWVHAKCEG LSDEDYEILS 1380
GLPDSVLYTC GPCAGAAQPR WREALSGALQ GGLRQVLQGL LSSKVVGPLL LCTQCGPDGK 1440
QLHPGPCGLQ AVSQRFEDGH YKSVHSFMED MVGILMRHSE EGETPDRRAG GQMKGLLLKL 1500
LESAFGWFDA HDPKYWRRST RLPNGVLPNA VLPPSLDHVY AQWRQQEPET PESGQPPGDP 1560
SAAFQGKDPA AFSHLEDPRQ CALCLKYGDA DSKEAGRLLY IGQNEWTHVN CAIWSAEVFE 1620
ENDGSLKNVH AAVARGRQMR CELCLKPGAT VGCCLSSCLS NFHFMCARAS YCIFQDDKKV 1680
FCQKHTDLLD GKEIVNPDGF DVLRRVYVDF EGINFKRKFL TGLEPDAINV LIGSIRIDSL 1740
GTLSDLSDCE GRLFPIGYQC SRLYWSTVDA RRRCWYRCRI LEYRPWGPRE EPAHLEAAEE 1800
NQTIVHSPAP SSEPPGGEDP PLDTDVLVPG APERHSPIQN LDPPLRPDSG SAPPPAPRSF 1860
SGARIKVPNY SPSRRPLGGV SFGPLPSPGS PSSLTHHIPT VGDPDFPAPP RRSRRPSPLA 1920
PRPPPSRWAS PPLKTSPQLR VPPPTSVVTA LTPTSGELAP PGPAPSPPPP EDLGPDFEDM 1980
EVVSGLSAAD LDFAASLLGT EPFQEEIVAA GAMGSSHGGP GDSSEEESSP TSRYIHFPVT 2040
VVSAPGLAPS ATPGAPRIEQ LDGVDDGTDS EAEAVQQPRG QGTPPSGPGV VRAGVLGAAG 2100
DRARPPEDLP SEIVDFVLKN LGGPGDGGAG PREESLPPAP PLANGSQPSQ GLTASPADPT 2160
RTFAWLPGAP GVRVLSLGPA PEPPKPATSK IILVNKLGQV FVKMAGEGEP VPPPVKQPPL 2220
PPTISPTAPT SWTLPPGPLL GVLPVVGVVR PAPPPPPPPL TLVLSSGPAS PPRQAIRVKR 2280
VSTFSGRSPP APPPYKAPRL DEDGEASEDT PQVPGLGSGG FSRVRMKTPT VRGVLDLDRP 2340
GEPAGEESPG PLQERSPLLP LPEDGPPQVP DGPPDLLLES QWHHYSGEAS SSEEEPPSPD 2400
DKENQAPKRT GPHLRFEISS EDGFSVEAES LEGAWRTLIE KVQEARGHAR LRHLSFSGMS 2460
GARLLGIHHD AVIFLAEQLP GAQRCQHYKF RYHQQGEGQE EPPLNPHGAA RAEVYLRKCT 2520
FDMFNFLASQ HRVLPEGATC DEEEDEVQLR STRRATSLEL PMAMRFRHLK KTSKEAVGVY 2580
RSAIHGRGLF CKRNIDAGEM VIEYSGIVIR SVLTDKREKF YDGKGIGCYM FRMDDFDVVD 2640
ATMHGNAARF INHSCEPNCF SRVIHVEGQK HIVIFALRRI LRGEELTYDY KFPIEDASNK 2700
LPCNCGAKRC RRFLN 2715
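The sequence above is laid out in blocks of 10 residues, with a running position counter at the end of each line. Here is a minimal Python sketch of turning that layout back into one contiguous string, assuming every line follows the same pattern. The function name `parse_blocked_sequence` is hypothetical, and only the last two lines of the listing are used as input.

```python
def parse_blocked_sequence(text):
    """Join 10-residue blocks into one string and drop the trailing counters.

    Returns the sequence and the last position counter seen (or None).
    """
    residues = []
    last_counter = None
    for line in text.splitlines():
        tokens = line.split()
        # A purely numeric final token is the running position counter.
        if tokens and tokens[-1].isdigit():
            last_counter = int(tokens.pop())
        residues.extend(tokens)
    return "".join(residues), last_counter


tail = """ATMHGNAARF INHSCEPNCF SRVIHVEGQK HIVIFALRRI LRGEELTYDY KFPIEDASNK 2700
LPCNCGAKRC RRFLN 2715"""
sequence, last_counter = parse_blocked_sequence(tail)
print(len(sequence), last_counter)
```

Run on the whole listing, the returned length should equal the final counter (2715), which gives a quick check that no line was lost in copying.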
Nucleotide Sequence
ATGGCGGCGG CGGCGGGCGG CGGCAGTTGC CCCGGGCCTG GCTCCGCGCG GGGCCGCTTC 60
CCGGGCCGGC CGCGGGGCGC CGGCGGGGGC GGGGGCCGCG GCGGACGGGG CAACGGGGCC 120
GAAAGAGTGC GGGTAGCTCT GCGGCGCGGC GGTGGCGCGA CGGGGCCGGG CGGAGCCGAG 180
CCCGGGGAGG ACACGGCCCT GCTCCGTTTG CTGGGGCTCC GCCGGGGCCT GCGCCGGCTC 240
CGCCGCCTGT GGGCCGGCCC GCGGGTCCAG CGGGGCCGGG GACGGGGTCG GGGCCGGGGC 300
TGGGGCCCGA GTCGAGGCTG CGTGCCGGAG GAGGAGAGCA GTGACGGGGA ATCCGACGAG 360
GAGGAGTTTC AGGGTTTTCA TTCAGATGAA GATGTGGCCC CCAGTTCCCT GCGCTCTGCG 420
CTCCGATCCC AGCGAGGTCG AGCGCCCCGA GGTCGGGGTC GCAAGCATAA GACGACCCCC 480
CTTCCTCCTC CTCGCCTAGC AGATGTGGCT CCTACCCCCC CAAAGACCCC TGCCCGGAAA 540
CGGGGTGAGG AAGGCACAGA ACGGATGGTG CAGGCACTGA CTGAACTTCT CCGGCGGGCC 600
CAGGCACCCC AAGCACCCCG GAGCCGGGCA TGTGAGCCCT CCACCCCCCG GCGGTCTCGG 660
GGACGGCCCC CAGGACGGCC AGCAGGCCCC TGCAGGAGGA AGCAGCAAGC AGTAGTGGTG 720
GCAGAAGCAG CTGTGACAAT CCCCAAACCT GAGCCCCCAC CTCCTGTGGT TCCAGTGAAA 780
CATCAGACTG GCAGCTGGAA ATGCAAGGAG GGGCCCGGTC CAGGACCTGG GACCCCCAGG 840
CGTGGAGGAC AGTCAAGCCG TGGAGGCCGT GGAGGCAGGG GCCGCGGCCG AGGTGGTGGG 900
CTCCCCTTTG TGATCAAGTT TGTTTCAAGG GCCAAAAAAG TAAAGATGGG ACAATTGTCC 960
TTGGGACTCG AATCAGGTCA AGGTCAAGGT CAACATGAGG AAAGTTGGCA GGATGTCCCC 1020
CAAAGAAGAG TTGGATCTGG ACAGGGAGGG AGCCCTTGCT GGAAAAAGCA GGAACAGAAG 1080
CTGGATGACG AGGAAGAAGA GAAGAAAGAA GAAGAAGAAA AAGACAAGGA GGGAGAAGAG 1140
AAGGAAGAAA GAGCTGTAGC TGAGGAGATG ATGCCAGCTG CGGAAAAGGA AGAGGCAAAG 1200
CTGCCACCAC CGCCTCTGAC TCCTCCAGCC CCTTCACCTC CTCCACCCCT CCCACCCCCT 1260
TCGACATCTC CTCCACCCCC ACTCTGCCCT CCACCACCAC CCCCAGTGTC CCCACCACCT 1320
CTACCATCCC CTCCACCGCC TCCTGCCCAA GAGGAGCAGG AGGAATCCCC TCCTCCTGTG 1380
GTCCCAGCTA CGTGCTCCAG GAAGAGGGGC CGGCCTCCCC TGACTCCCAG CCAGCGGGCG 1440
GAGCGGGAAG CTGCTCGGGC AGGGCCAGAG GGCACCTCTC CTCCCACTCC AACCCCCAGC 1500
ACCGCCACGG GAGGCCCTCC GGAAGACAGT CCCACCGTGG CCCCCAAAAG CACCACCTTC 1560
CTGAAGAATA TCCGGCAGTT TATTATGCCT GTGGTGAGTG CCCGCTCCTC CCGTGTCATC 1620
AAGACACCCC GGCGATTTAT GGATGAAGAC CCCCCCAAAC CCCCAAAGGT GGAGGTCTCA 1680
CCTGTCCTGC GACCTCCCAT TACCACCTCC CCACCTGTTC CCCAGGAGCC AGCACCAGTC 1740
CCCTCTCCAC CACGTGCCCC AACTCCTCCA TCTACCCCAG TTCCACTCCC TGAGAAGAGA 1800
CGGTCCATCC TAAGGGAACC CACATTTCGC TGGACCTCAC TGACCCGGGA GCTGCCCCCT 1860
CCTCCCCCAG CCCCTCCACC TCCCCCGGCC CCCTCCCCAC CCCCTGCTCC TGCCACCTCC 1920
TCCCGGAGGC CCCTACTCCT TCGGGCCCCT CAGTTTACCC CAAGCGAAGC CCACCTGAAG 1980
ATCTACGAAT CGGTGCTTAC TCCTCCTCCT CTTGGGGCTC CTGAAGCCCC TGAGCCAGAG 2040
CCTCCTCCTG CCGATGACTC TCCAGCTGAG CCTGAGCCTC GGGCAGTGGG CCGCACCAAC 2100
CACCTCAGCC TGCCTCGATT CGCCCCTGTG GTCACCACTC CTGTTAAGGC CGAGGTGTCC 2160
CCTCACGGGG CTCCAGCTCT GAGCAACGGG CCACAGACAC AGGCTCAGCT ACTGCAGCCC 2220
CTGCAGGCCT TGCAAACCCA GCTCCTGCCC CAGGCACTAC CGCCACCACA GCCACAGCTG 2280
CAGCCACCGC CGTCACCACA GCAGATGCCT CCCCTGGAAA AAGCCCGGAT TGCGGGCGTG 2340
GGTTCCTTGC CGCTGTCTGG GGTAGAGGAG AAGATGTTCA GCCTCCTCAA GAGAGCCAAA 2400
GTGCAGCTAT TCAAGATCGA TCAGCAGCAG CAGCAGAAGG TGGCAGCTTC CATGCCGCTG 2460
AGCCCTGGAG GGCAGATGGA GGAGGTGGCC GGGGCTGTCA AGCAGATCTC CGACAGAGGC 2520
CCTGTCCGGT CTGAAGATGA GTCGGTGGAA GCTAAGAGAG AGCGGCCCTC AGGTCCCGAG 2580
TCCCCTGTGC AAGGTCCCCG CATCAAACAT GTCTGCCGTC ATGCTGCTGT GGCCCTGGGT 2640
CAGGCCCGGG CCATGGTGCC TGAAGATGTC CCTCGCCTCA GTGCCCTCCC TCTCCGGGAT 2700
CGGCAGGACC TCGCCACAGA GGATACATCA TCGGCGTCCG AGACTGAGAG TGTCCCGTCA 2760
CGGTCCCGGC GGGGAAAGGT GGAGGCAGCA GGCCCTGGGG GAGAATCAGA GCCCACAGGT 2820
TCTGGAGGGA CCCTGGCCCA CACACCCCGG CGCTCACTGC CCTCCCATCA CGGCAAGAAG 2880
ATGCGCATGG CTCGATGTGG ACACTGTCGG GGCTGCCTAC GTGTGCAGGA CTGTGGGTCC 2940
TGTGTCAACT GCCTAGACAA GCCCAAGTTT GGGGGCCCTA ACACCAAGAA GCAGTGCTGT 3000
GTATACCGGA AGTGTGACAA AATAGAGGCT CGGAAGATGG AACGACTGGC TAAAAAAGGC 3060
CGGACGATAG TGAAGACGCT GTTGCCCTGG GATTCCGATG AATCTCCTGA GGCCTCCCCT 3120
GGTCCTCCAG GCCCACGCCG GGGGGCGGGA GCTGGGGGGC CCCGGGAGGA GGTGGTGGCC 3180
CACCCAGGGC CCGAGGAGCA GGACTCCCTC CTGCAGCGCA AGTCAGCTCG GCGCTGCGTC 3240
AAACAGCGAC CCTCCTATGA TATCTTCGAG GATTCGGATG ACTCGGAGCC CGGGGGCCCC 3300
CCTGCTCCTC GGCGTCGGAC CCCCCGAGAA AATGAGCTGC CACTGCCAGA ACCTGAGGAG 3360
CAGAGCCGGC CCCGCAAACC TACCCTGCAG CCTGTGTTGC AGCTCAAGGC CCGAAGGCGC 3420
CTGGACAAGG ATGCTTTGGC CCCTGGCCCC TTTGCTTCTT TTCCCAATGG CTGGACTGGA 3480
AAGCAGAAGT CTCCCGATGG TGTGCACCGC GTCCGTGTGG ATTTTAAGGA GGATTGTGAT 3540
TTAGAGAACG TGTGGCTGAT GGGGGGCCTG AGTGTGCTCA CCTCTGTGCC AGGGGGCCCC 3600
CCGATGGTGT GCTTGCTGTG TGCCAGCAAA GGACTCCACG AGCTGGTGTT CTGTCAAGTC 3660
TGCTGTGACC CATTCCACCC ATTCTGCCTG GAGGAGGCCG AGCGGCCCCT GCCCCAGCAT 3720
CACGACACCT GGTGCTGCCG TCGCTGCAAA TTCTGCCACG TCTGTGGACG CAAAGGTCGT 3780
GGATCCAAGC ACCTCCTGGA GTGCGAGCGC TGCCGCCATG CATACCACCC GGCCTGTCTG 3840
GGGCCCAGCT ATCCAACCCG GGCCACGCGC AAACGGCGCC ACTGGATCTG TTCAGCCTGT 3900
GTGCGCTGTA AGAGCTGTGG GGCAACTCCA GGCAAGAACT GGGACGTCGA GTGGTCTGGA 3960
GATTACAGCC TCTGCCCCAG GTGCACCCAG CTATATGAGA AAGGAAACTA CTGCCCGATC 4020
TGTACACGCT GCTATGAAGA CAACGACTAT GAGAGCAAGA TGATGCAGTG CGCACAGTGC 4080
GATCACTGGG TGCATGCCAA GTGCGAGGGG CTCTCAGATG AAGACTACGA GATCCTTTCA 4140
GGACTGCCAG ACTCGGTGCT GTACACCTGC GGACCGTGTG CTGGGGCAGC GCAGCCCCGC 4200
TGGCGAGAGG CCCTGAGCGG GGCCCTCCAG GGGGGCCTGC GCCAGGTGCT CCAGGGCCTG 4260
CTGAGCTCCA AGGTGGTGGG CCCACTGCTG CTCTGCACCC AGTGTGGGCC AGATGGGAAG 4320
CAACTGCACC CAGGACCCTG CGGCCTGCAA GCTGTGAGTC AGCGCTTCGA GGATGGCCAC 4380
TACAAGTCTG TGCACAGCTT CATGGAGGAC ATGGTGGGCA TCCTCATGCG GCACTCGGAG 4440
GAGGGAGAGA CCCCGGACCG CCGGGCTGGA GGCCAGATGA AGGGGCTCCT GCTGAAGCTG 4500
CTAGAATCTG CGTTCGGCTG GTTCGACGCC CACGACCCCA AGTACTGGCG ACGGAGTACC 4560
CGGCTGCCAA ACGGAGTCCT TCCCAATGCG GTGTTGCCCC CATCCCTGGA TCATGTCTAT 4620
GCGCAGTGGA GACAGCAGGA ACCAGAGACC CCAGAATCAG GGCAGCCTCC AGGGGATCCC 4680
TCAGCAGCAT TCCAGGGCAA GGATCCGGCT GCCTTCTCAC ACCTGGAGGA CCCCCGTCAG 4740
TGTGCACTCT GCCTCAAATA CGGGGATGCA GACTCCAAGG AGGCGGGGCG GCTCTTGTAC 4800
ATCGGGCAGA ACGAGTGGAC ACACGTCAAC TGTGCCATCT GGTCGGCGGA AGTCTTCGAG 4860
GAGAACGACG GCTCCCTCAA GAATGTGCAT GCTGCTGTGG CCCGAGGGAG GCAGATGCGC 4920
TGCGAGCTCT GCCTGAAGCC TGGCGCCACG GTGGGCTGCT GCCTGTCCTC CTGCCTCAGC 4980
AACTTCCACT TCATGTGTGC CCGGGCCAGC TACTGCATCT TCCAGGATGA CAAGAAAGTC 5040
TTCTGCCAGA AACACACTGA TCTCCTGGAT GGCAAGGAAA TTGTGAACCC CGATGGTTTT 5100
GATGTTCTCC GCCGAGTCTA TGTGGACTTC GAGGGCATCA ACTTCAAGCG GAAGTTCTTG 5160
ACGGGGCTTG AACCCGATGC CATCAACGTG CTCATTGGTT CCATCCGCAT TGACTCCCTG 5220
GGTACTCTGT CTGATCTCTC GGACTGCGAG GGACGGCTCT TCCCCATTGG CTACCAGTGC 5280
TCCCGTCTGT ACTGGAGCAC AGTGGATGCT CGGAGGCGCT GCTGGTATCG GTGCCGAATT 5340
CTGGAGTATC GGCCATGGGG GCCGAGGGAA GAGCCAGCTC ACCTGGAGGC TGCAGAGGAG 5400
AACCAGACCA TTGTGCACAG CCCCGCCCCT TCCTCAGAGC CCCCAGGTGG TGAGGACCCC 5460
CCACTGGACA CAGATGTTCT TGTCCCTGGA GCTCCTGAGC GCCACTCGCC CATTCAGAAC 5520
CTGGACCCTC CACTGCGGCC AGATTCAGGC AGCGCCCCTC CTCCAGCCCC CCGTTCTTTT 5580
TCGGGGGCTC GAATCAAAGT GCCCAACTAC TCGCCATCCC GGAGGCCCTT GGGGGGTGTC 5640
TCCTTTGGCC CCCTGCCCTC CCCTGGAAGT CCATCTTCAC TGACCCACCA CATCCCCACA 5700
GTGGGAGACC CGGACTTCCC AGCTCCCCCC AGACGTTCCC GTCGTCCCAG CCCTTTGGCT 5760
CCCAGGCCGC CTCCATCACG GTGGGCCTCC CCTCCTCTAA AAACCTCCCC TCAGCTCAGG 5820
GTGCCCCCTC CTACCTCAGT CGTCACAGCC CTCACACCTA CCTCAGGGGA GCTGGCTCCC 5880
CCTGGCCCGG CCCCATCTCC ACCACCCCCT GAAGACCTGG GCCCAGACTT CGAGGACATG 5940
GAGGTGGTGT CAGGACTGAG TGCTGCTGAC CTGGACTTCG CGGCCAGCCT GCTGGGGACT 6000
GAGCCCTTCC AGGAAGAGAT TGTAGCCGCT GGGGCCATGG GGAGCAGCCA CGGGGGCCCG 6060
GGGGACAGCT CCGAGGAGGA GTCCAGCCCC ACCTCCCGCT ACATCCACTT CCCTGTGACT 6120
GTGGTGTCCG CCCCTGGTCT GGCCCCCAGC GCTACCCCTG GAGCCCCCCG CATTGAACAG 6180
CTGGACGGCG TGGACGACGG CACTGACAGT GAGGCTGAGG CGGTGCAGCA GCCTCGGGGC 6240
CAGGGCACGC CTCCTTCGGG GCCAGGAGTA GTCCGGGCAG GGGTCCTTGG GGCTGCAGGG 6300
GACAGGGCCC GGCCTCCTGA GGACCTGCCA TCGGAAATTG TGGATTTTGT GTTGAAGAAC 6360
CTAGGGGGTC CTGGGGATGG AGGTGCTGGC CCTAGAGAGG AGTCACTCCC CCCGGCGCCT 6420
CCCCTGGCTA ATGGCAGCCA GCCCTCCCAA GGCCTGACCG CCAGCCCAGC TGACCCCACC 6480
CGCACATTTG CCTGGCTCCC AGGGGCCCCA GGGGTCCGGG TGTTAAGCCT TGGCCCTGCC 6540
CCTGAGCCCC CCAAACCCGC CACATCCAAA ATCATACTTG TCAACAAGCT GGGGCAAGTA 6600
TTTGTGAAGA TGGCTGGGGA GGGTGAACCT GTCCCACCCC CAGTGAAGCA GCCACCTTTG 6660
CCCCCCACCA TTTCCCCCAC GGCTCCCACC TCCTGGACTC TGCCCCCAGG CCCCCTCCTC 6720
GGCGTGCTGC CCGTGGTCGG AGTGGTCCGC CCTGCCCCGC CCCCGCCACC CCCTCCCCTG 6780
ACGCTGGTGC TGAGCAGTGG GCCAGCCAGC CCGCCCCGCC AGGCCATCCG CGTCAAGAGG 6840
GTGTCCACTT TCTCCGGCCG GTCCCCGCCA GCACCTCCCC CATACAAAGC CCCCCGGCTG 6900
GATGAAGATG GAGAGGCCTC AGAGGATACC CCTCAGGTTC CAGGGCTTGG CAGTGGCGGG 6960
TTTAGCCGTG TGAGGATGAA AACCCCCACA GTGCGTGGGG TCCTTGACCT GGATCGGCCT 7020
GGGGAGCCCG CTGGGGAAGA AAGTCCTGGG CCCCTCCAGG AACGGTCCCC TTTGCTGCCA 7080
CTTCCGGAAG ATGGTCCTCC CCAGGTCCCC GATGGTCCCC CAGACCTGCT GCTTGAGTCC 7140
CAGTGGCACC ACTATTCAGG TGAGGCTTCG AGCTCTGAGG AAGAGCCTCC ATCCCCAGAT 7200
GATAAAGAGA ACCAGGCCCC AAAACGGACT GGCCCACATC TGCGCTTCGA GATCAGCAGT 7260
GAGGATGGGT TCAGCGTTGA GGCAGAGAGC TTGGAGGGGG CGTGGAGAAC TCTGATCGAG 7320
AAAGTGCAAG AGGCCCGAGG GCATGCCCGA CTCAGACATC TCTCCTTTAG TGGAATGAGT 7380
GGGGCGAGAC TCCTGGGCAT CCACCATGAT GCTGTCATCT TCCTGGCCGA GCAGCTCCCC 7440
GGAGCCCAGC GTTGCCAGCA CTATAAGTTC CGTTACCACC AGCAGGGAGA GGGCCAGGAG 7500
GAGCCGCCCC TGAATCCCCA TGGGGCTGCT CGGGCAGAGG TCTATCTCCG GAAGTGCACC 7560
TTTGACATGT TCAACTTCCT GGCCTCCCAG CACCGGGTGC TCCCTGAGGG GGCCACCTGT 7620
GATGAGGAAG AGGATGAGGT GCAGCTCAGG TCAACCAGAC GTGCCACCAG CCTGGAGCTG 7680
CCCATGGCCA TGCGTTTTCG TCACCTTAAG AAGACGTCCA AAGAAGCTGT GGGTGTCTAC 7740
AGATCAGCCA TCCACGGGCG AGGCCTGTTC TGTAAGCGCA ACATCGACGC GGGGGAGATG 7800
GTCATCGAGT ACTCTGGCAT TGTCATCCGC TCGGTGTTGA CTGACAAGCG GGAGAAGTTC 7860
TACGATGGGA AGGGCATCGG GTGCTATATG TTCCGCATGG ATGACTTTGA TGTAGTGGAC 7920
GCCACGATGC ATGGCAATGC CGCCCGCTTC ATCAACCACT CCTGTGAGCC CAACTGCTTC 7980
TCTCGGGTCA TCCACGTGGA GGGCCAGAAA CACATTGTTA TCTTCGCCCT GCGCCGCATC 8040
CTGCGTGGTG AGGAGCTCAC CTACGACTAC AAGTTCCCCA TCGAGGATGC CAGCAACAAG 8100
CTGCCCTGCA ACTGTGGCGC CAAGCGCTGC CGTCGGTTCC TTAACTGAGG CCGTGGCTGC 8160
CCACCACGAC CCCTCACACC TCCTGCTGCC GTCGCTGCCA TCTTGCCCCT AGCCTGGGGG 8220
CTCCCTAGCC CCTCCCAGAG CATCTCACCC CCACCCTCAT GTTCAGGGTG GATGTGGGCA 8280
TGCAGGTGAC AAGGGCCCTG CCTCCACCCC TCCAGCCCAT CCAGCAATCG CCCCCTTTCT 8340
GCCCTGGGGG CCCAGGATGT AGATATTGTA CAAAGGTTTC TAAATCCCTT CTTTTCTATG 8400
CACTTTTTTA TTTAAGAGGT GGGGTCCCAG GTGGGAACCC CCCCACAATA AAGTCTGTCA 8460
ATGTTTGGA
Sequence Source Ensembl
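The nucleotide and protein listings can be checked against each other by translation. The Python sketch below assumes the standard genetic code and that the coding sequence starts at base 1 of the listing. It translates the first 30 bases and the in-frame bases 8131-8148, both copied from the listing above. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    dna = "".join(dna.split()).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


first_codons = translate("ATGGCGGCGG CGGCGGGCGG CGGCAGTTGC")  # bases 1-30
last_codons = translate("CGTCGGTTCC TTAACTGA")                # bases 8131-8148
print(first_codons, last_codons)
```

The first result should match the opening block of the protein listing and the second should end in a stop codon after the final residues, with the remaining bases being 3' untranslated sequence.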
Keyword

KW-0002--3D-structure
KW-0007--Acetylation
KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0181--Complete proteome
KW-0238--DNA-binding
KW-0479--Metal-binding
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0621--Polymorphism
KW-1185--Reference proteome
KW-0677--Repeat
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0862--Zinc
KW-0863--Zinc-finger

Interpro

IPR003889--FYrich_C
IPR003888--FYrich_N
IPR003616--Post-SET_dom
IPR001214--SET_dom
IPR002857--Znf_CXXC
IPR011011--Znf_FYVE_PHD
IPR001965--Znf_PHD
IPR019787--Znf_PHD-finger
IPR013083--Znf_RING/FYVE/PHD

PROSITE

PS51543--FYRC
PS51542--FYRN
PS50868--POST_SET
PS50280--SET
PS51058--ZF_CXXC
PS01359--ZF_PHD_1
PS50016--ZF_PHD_2

Pfam

PF05965--FYRC
PF05964--FYRN
PF00628--PHD
PF00856--SET
PF02008--zf-CXXC

Gene Ontology

GO:0035097--C:histone methyltransferase complex
GO:0005654--C:nucleoplasm
GO:0005634--C:nucleus
GO:0003677--F:DNA binding
GO:0042800--F:histone methyltransferase activity (H3-K4 specific)
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0003700--F:transcription factor activity, sequence-specific DNA binding
GO:0008270--F:zinc ion binding
GO:0048096--P:chromatin-mediated maintenance of transcription
GO:0016458--P:gene silencing
GO:0051568--P:histone H3-K4 methylation
GO:0080182--P:histone H3-K4 trimethylation
GO:0007613--P:memory
GO:0009994--P:oocyte differentiation
GO:0001541--P:ovarian follicle development
GO:0030728--P:ovulation
GO:0051569--P:regulation of histone H3-K4 methylation
GO:0006351--P:transcription, DNA-templated

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0090 ENSPTRP00000018591.5 Pan troglodytes 100 0.0 3373
WERAM-Paa-0047 ENSPANP00000008627.1 Papio anubis 99 0.0 3356
WERAM-Chs-0048 ENSCSAP00000000728.1 Chlorocebus sabaeus 99 0.0 3354
WERAM-Caj-0107 ENSCJAP00000018740.2 Callithrix jacchus 96 0.0 3284
WERAM-Otg-0054 ENSOGAP00000003714.2 Otolemur garnettii 94 0.0 3231
WERAM-Mim-0097 ENSMICP00000009069.1 Microcebus murinus 93 0.0 3184
WERAM-Fec-0070 ENSFCAP00000005933.3 Felis catus 91 0.0 3177
WERAM-Aim-0005 ENSAMEP00000000356.1 Ailuropoda melanoleuca 92 0.0 3171
WERAM-Dan-0042 ENSDNOP00000004307.3 Dasypus novemcinctus 92 0.0 3169
WERAM-Ptv-0144 ENSPVAP00000012104.1 Pteropus vampyrus 93 0.0 3155
WERAM-Mup-0069 ENSMPUP00000006153.1 Mustela putorius furo 92 0.0 3144
WERAM-Caf-0080 ENSCAFP00000010253.3 Canis familiaris 92 0.0 3140
WERAM-Myl-0129 ENSMLUP00000010443.2 Myotis lucifugus 90 0.0 3110
WERAM-Sus-0021 ENSSSCP00000003118.2 Sus scrofa 91 0.0 3109
WERAM-Bot-0034 ENSBTAP00000003584.5 Bos taurus 92 0.0 3078
WERAM-Mam-0017 ENSMMUP00000003194.2 Macaca mulatta 98 0.0 2580
WERAM-Ict-0194 ENSSTOP00000017577.1 Ictidomys tridecemlineatus 91 0.0 2469
WERAM-Orc-0128 ENSOCUP00000011001.2 Oryctolagus cuniculus 90 0.0 2432
WERAM-Mum-0016 ENSMUSP00000006470.7 Mus musculus 90 0.0 2417
WERAM-Ocp-0011 ENSOPRP00000001256.2 Ochotona princeps 88 0.0 2397
WERAM-Ran-0238 ENSRNOP00000071806.1 Rattus norvegicus 90 0.0 2387
WERAM-Cap-0040 ENSCPOP00000003358.2 Cavia porcellus 89 0.0 2370
WERAM-Nol-0118 ENSNLEP00000013752.1 Nomascus leucogenys 96 0.0 2290
WERAM-Gog-0190 ENSGGOP00000016403.2 Gorilla gorilla 99 0.0 2233
WERAM-Ova-0040 ENSOARP00000005352.1 Ovis aries 90 0.0 2130
WERAM-Loa-0170 ENSLAFP00000015714.4 Loxodonta africana 93 0.0 2119
WERAM-Dio-0085 ENSDORP00000008518.1 Dipodomys ordii 85 0.0 2105
WERAM-Mod-0122 ENSMODP00000017499.3 Monodelphis domestica 78 0.0 2053
WERAM-Poa-0085 ENSPPYP00000011057.1 Pongo abelii 99 0.0 1931
WERAM-Eqc-0036 ENSECAP00000005758.1 Equus caballus 93 0.0 1900
WERAM-Mae-0133 ENSMEUP00000013331.1 Macropus eugenii 81 0.0 1802
WERAM-Soa-0016 ENSSARP00000001796.1 Sorex araneus 90 0.0 1707
WERAM-Tub-0051 ENSTBEP00000006325.1 Tupaia belangeri 87 0.0 1348
WERAM-Anc-0174 ENSACAP00000016400.2 Anolis carolinensis 63 0.0 1308
WERAM-Prc-0048 ENSPCAP00000004470.1 Procavia capensis 86 0.0 1238
WERAM-Tut-0028 ENSTTRP00000002118.1 Tursiops truncatus 94 0.0 1196
WERAM-Ere-0115 ENSEEUP00000012211.1 Erinaceus europaeus 92 0.0 1127
WERAM-Xet-0112 ENSXETP00000038145.3 Xenopus tropicalis 48 0.0 948
WERAM-Xim-0124 ENSXMAP00000010573.1 Xiphophorus maculatus 47 0.0 897
WERAM-Asm-0104 ENSAMXP00000010393.1 Astyanax mexicanus 44 0.0 829
WERAM-Tar-0169 ENSTRUP00000035798.1 Takifugu rubripes 45 0.0 817
WERAM-Leo-0024 ENSLOCP00000004822.1 Lepisosteus oculatus 49 0.0 810
WERAM-Ora-0035 ENSOANP00000005967.3 Ornithorhynchus anatinus 54 0.0 770
WERAM-Dar-0134 ENSDARP00000123759.1 Danio rerio 51 0.0 766
WERAM-Orn-0025 ENSONIP00000003449.1 Oreochromis niloticus 52 0.0 762
WERAM-Gaga-0075 ENSGALP00000011008.4 Gallus gallus 51 0.0 757
WERAM-Sah-0138 ENSSHAP00000014807.1 Sarcophilus harrisii 51 0.0 756
WERAM-Pes-0058 ENSPSIP00000007945.1 Pelodiscus sinensis 51 0.0 756
WERAM-Fia-0123 ENSFALP00000010386.1 Ficedula albicollis 51 0.0 754
WERAM-Tag-0002 ENSTGUP00000000072.1 Taeniopygia guttata 51 0.0 753
WERAM-Pof-0072 ENSPFOP00000006792.1 Poecilia formosa 53 0.0 753
WERAM-Meg-0028 ENSMGAP00000002448.2 Meleagris gallopavo 53 0.0 752
WERAM-Anp-0040 ENSAPLP00000004456.1 Anas platyrhynchos 51 0.0 749
WERAM-Lac-0176 ENSLACP00000020625.1 Latimeria chalumnae 51 0.0 745
WERAM-Orla-0062 ENSORLP00000008124.1 Oryzias latipes 51 0.0 733
WERAM-Gam-0001 ENSGMOP00000000116.1 Gadus morhua 47 0.0 728
WERAM-Gaa-0074 ENSGACP00000010144.1 Gasterosteus aculeatus 47 0.0 711
WERAM-Ten-0223 ENSTNIP00000002397.1 Tetraodon nigroviridis 47 0.0 672
WERAM-Vip-0054 ENSVPAP00000005197.1 Vicugna pacos 50 0.0 634
WERAM-Chh-0010 ENSCHOP00000000906.1 Choloepus hoffmanni 79 1e-170 599
WERAM-Pem-0014 ENSPMAP00000002218.1 Petromyzon marinus 48 1e-143 509
WERAM-Ect-0072 ENSETEP00000007880.1 Echinops telfairi 69 1e-127 457
WERAM-Cis-0045 ENSCSAVP00000009955.1 Ciona savignyi 40 1e-123 443
WERAM-Tas-0109 ENSTSYP00000011196.1 Tarsius syrichta 63 9e-114 410
WERAM-Cii-0034 ENSCINP00000025384.2 Ciona intestinalis 55 2e-99 363
WERAM-Drm-0010 FBpp0082406 Drosophila melanogaster 48 3e-92 338
WERAM-Tum-0027 CAZ85029 Tuber melanosporum 55 1e-41 171
WERAM-Php-0006 PP1S101_4V6.1 Physcomitrella patens 54 3e-40 166
WERAM-Org-0116 ORGLA12G0159200.1 Oryza glaberrima 55 4e-40 166
WERAM-Asc-0034 CADACLAP00008186 Aspergillus clavatus 52 5e-40 166
WERAM-Cae-0021 C26E6.9a Caenorhabditis elegans 53 5e-40 165
WERAM-Ors-0112 OS12T0613200-02 Oryza sativa 55 5e-40 165
WERAM-Scp-0045 SPCC306.04c.1:pep Schizosaccharomyces pombe 55 1e-39 164
WERAM-Ast-0003 CADATEAP00001100 Aspergillus terreus 52 1e-39 164
WERAM-Pot-0072 POPTR_0005s28130.1 Populus trichocarpa 44 1e-39 164
WERAM-Coi-0035 EAS31778 Coccidioides immitis 51 1e-39 164
WERAM-Asn-0015 CADANIAP00003254 Aspergillus nidulans 53 1e-39 164
WERAM-Lem-0021 CBX90234 Leptosphaeria maculans 52 1e-39 164
WERAM-Asni-0037 CADANGAP00014055 Aspergillus niger 43 1e-39 164
WERAM-Thc-0094 EOY15831 Theobroma cacao 45 2e-39 164
WERAM-Aso-0006 CADAORAP00000676 Aspergillus oryzae 52 2e-39 164
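Because species names in the table above span two or three words, rows cannot be split on whitespace into a fixed number of columns. The Python sketch below parses a row by taking the first two and last three tokens as fixed fields and joining whatever lies between them as the species. The `OrthologRow` type and `parse_ortholog_row` function are hypothetical, and it assumes the Identity and Score columns are always integers.

```python
from typing import NamedTuple


class OrthologRow(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int
    e_value: float
    score: int


def parse_ortholog_row(line):
    """Split one Orthology row; the species name may contain spaces."""
    tokens = line.split()
    weram_id, protein_id = tokens[0], tokens[1]
    identity, e_value, score = tokens[-3:]
    species = " ".join(tokens[2:-3])
    return OrthologRow(weram_id, protein_id, species,
                       int(identity), float(e_value), int(score))


row = parse_ortholog_row(
    "WERAM-Mup-0069 ENSMPUP00000006153.1 Mustela putorius furo 92 0.0 3144"
)
print(row.species, row.identity, row.score)
```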
Created Date 25-Jun-2016