WERAM Information


Tag Content
WERAM ID WERAM-Hos-0212
Ensembl Protein ID ENSP00000386759.3
Uniprot Accession Q9BYW2; SETD2_HUMAN; O75397; O75405; Q17RW8; Q5BKS9; Q5QGN2; Q69YI5; Q6IN64; Q6ZN53; Q6ZS25; Q8N3R0; Q8TCN0; Q9C0D1; Q9H696; Q9NZW9
Genbank Protein ID NP_054878.5
Protein Name Histone-lysine N-methyltransferase SETD2
Genbank Nucleotide ID NM_014159.6
Gene Name SETD2;HYPB;SET2;HIF-1;HIP-1;KMT3A;HBP231;HSPC069;p231HBP
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000181555.19 ENST00000409792.3 ENSP00000386759.3
ENSG00000181555.19 ENST00000445387.5 ENSP00000411901.1
Details
Type Family Domain Substrates AA References (PMIDs)
HMT SET2 SET H3K36 K 25537518; 26002201; 26807165; 20951770; 20889125
Status Reviewed
Classification
Type Family E-value Score Start End
HMT SET2 2.00e-52 178.5 1551 1667
HMT SET1 8.80e-30 105.5 806 1667
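Both classification hits end at residue 1667, so a simple way to read this table is to rank the hits by E-value and compare the spans they cover. The following is a minimal Python sketch, assuming the rows are whitespace-separated exactly as shown above. The `Hit` type and the `parse_hits` helper are illustrative names and are not part of WERAM.

```python
from typing import NamedTuple


class Hit(NamedTuple):
    kind: str      # enzyme type, e.g. "HMT"
    family: str
    evalue: float
    score: float
    start: int     # 1-based, inclusive
    end: int       # 1-based, inclusive


def parse_hits(text: str) -> list[Hit]:
    """Parse rows of the form 'Type Family E-value Score Start End'."""
    hits = []
    for line in text.strip().splitlines():
        kind, family, evalue, score, start, end = line.split()
        hits.append(Hit(kind, family, float(evalue), float(score), int(start), int(end)))
    return hits


rows = """
HMT SET2 2.00e-52 178.5 1551 1667
HMT SET1 8.80e-30 105.5 806 1667
"""

hits = parse_hits(rows)
best = min(hits, key=lambda h: h.evalue)  # lowest E-value is the most significant hit
span = best.end - best.start + 1          # inclusive residue count
print(best.family, span)
```

Ranking by E-value rather than by span length keeps the short but highly significant SET2 match ahead of the longer SET1 match.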
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Histone methyltransferase that specifically trimethylates 'Lys-36' of histone H3 (H3K36me3) using dimethylated 'Lys-36' (H3K36me2) as substrate. Represents the main enzyme generating H3K36me3, a specific tag for epigenetic transcriptional activation. Plays a role in chromatin structure modulation during elongation by coordinating recruitment of the FACT complex and by interacting with hyperphosphorylated POLR2A. Acts as a key regulator of DNA mismatch repair in G1 and early S phase by generating H3K36me3, a mark required to recruit the MSH6 subunit of the MutS alpha complex: early recruitment of the MutS alpha complex to chromatin that is about to be replicated allows quick identification of mismatched DNA to initiate the mismatch repair reaction. H3K36me3 also plays an essential role in the maintenance of a heterochromatic state by recruiting the DNA methyltransferase DNMT3A. H3K36me3 is also enhanced in intron-containing genes, suggesting that SETD2 recruitment is enhanced by splicing and that splicing is coupled to recruitment of elongating RNA polymerase. Required during angiogenesis. Recruited to the promoters of the adenovirus 12 E1A gene in case of infection, possibly leading to regulation of its expression.
Domain Profile
  HMT SET2

         SET2.txt    2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncetqkw 91
                       +ve+i tekkG+Glra++++++++f+leY+Gev+d+ke+k+R+key+++k+ ++Y+++l++de+iDat+kGn++Rf+nhsCePncetqkw
ENSP00000386759.3 1551 DVEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVKEYARNKNIHYYFMALKNDEIIDATQKGNCSRFMNHSCEPNCETQKW 1640
                       699*************************************************************************************** PP

         SET2.txt   92 tvegelrvglfakkkikkgeeltfdYn 118
                       tv+g+lrvg+f++k +++g+eltfdY+
ENSP00000386759.3 1641 TVNGQLRVGFFTTKLVPSGSELTFDYQ 1667
                       **************************7 PP

  HMT SET1

           SET1.txt  37 sevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhs 82 
+++a++ e++ k+++ +++ e++ v+ d ++ + ++f +
ENSP00000386759.3 806 NSEAENIEPSVMKISSNSFMNVHLESKPVICDSRNLTDHSKFACEE 851
56789999999999999***99999999999999999999999865 PP
SET1.txt 3 levakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNceakv 91
+ev ++kg+gl+a+k++ +++v EY+Gev+ +++ + r key +++ + y+++l++d ++dat+kgn +rf+nhscepNce++
ENSP00000386759.3 1552 VEVILTEKKGWGLRAAKDLPSNTFVLEYCGEVLDHKEFKARVKEYARNKNIhYYFMALKND--EIIDATQKGNCSRFMNHSCEPNCETQK 1639
77888899***********************************998776664888899999..9************************** PP
SET1.txt 92 vavdgekkiviyakraIekgeeltydYk 119
+v+g+ +++ + ++ ++ g elt+dY+
ENSP00000386759.3 1640 WTVNGQLRVGFFTTKLVPSGSELTFDYQ 1667
***************************6 PP

Protein Sequence
MKQLQPQPPP KMGDFYDPEH PTPEEEENEA KIENVQKTGF IKGPMFKGVA SSRFLPKGTK 60
TKVNLEEQGR QKVSFSFSLT KKTLQNRFLT ALGNEKQSDT PNPPAVPLQV DSTPKMKMEI 120
GDTLSTAEES SPPKSRVELG KIHFKKHLLH VTSRPLLATT TAVASPPTHA APLPAVIAES 180
TTVDSPPSSP PPPPPPAQAT TLSSPAPVTE PVALPHTPIT VLMAAPVPLP VDVAVRSLKE 240
PPIIIVPESL EADTKQDTIS NSLEEHVTQI LNEQADISSK KEDSHIGKDE EIPDSSKISL 300
SCKKTGSKKK SSQSEGIFLG SESDEDSVRT SSSQRSHDLK FSASIEKERD FKKSSAPLKS 360
EDLGKPSRSK TDRDDKYFSY SKLERDTRYV SSRCRSERER RRSRSHSRSE RGSRTNLSYS 420
RSERSHYYDS DRRYHRSSPY RERTRYSRPY TDNRARESSD SEEEYKKTYS RRTSSHSSSY 480
RDLRTSSYSK SDRDCKTETS YLEMERRGKY SSKLERESKR TSENEAIKRC CSPPNELGFR 540
RGSSYSKHDS SASRYKSTLS KPIPKSDKFK NSFCCTELNE EIKQSHSFSL QTPCSKGSEL 600
RMINKNPERE KAGSPAPSNR LNDSPTLKKL DELPIFKSEF ITHDSHDSIK ELDSLSKVKN 660
DQLRSFCPIE LNINGSPGAE SDLATFCTSK TDAVLMTSDD SVTGSELSPL VKACMLSSNG 720
FQNISRCKEK DLDDTCMLHK KSESPFRETE PLVSPHQDKL MSMPVMTVDY SKTVVKEPVD 780
TRVSCCKTKD SDIYCTLNDS NPSLCNSEAE NIEPSVMKIS SNSFMNVHLE SKPVICDSRN 840
LTDHSKFACE EYKQSIGSTS SASVNHFDDL YQPIGSSGIA SSLQSLPPGI KVDSLTLLKC 900
GENTSPVLDA VLKSKKSSEF LKHAGKETIV EVGSDLPDSG KGFASRENRR NNGLSGKCLQ 960
EAQEEGNSIL PERRGRPEIS LDERGEGGHV HTSDDSEVVF SSCDLNLTME DSDGVTYALK 1020
CDSSGHAPEI VSTVHEDYSG SSESSNDESD SEDTDSDDSS IPRNRLQSVV VVPKNSTLPM 1080
EETSPCSSRS SQSYRHYSDH WEDERLESRR HLYEEKFESI ASKACPQTDK FFLHKGTEKN 1140
PEISFTQSSR KQIDNRLPEL SHPQSDGVDS TSHTDVKSDP LGHPNSEETV KAKIPSRQQE 1200
ELPIYSSDFE DVPNKSWQQT TFQNRPDSRL GKTELSFSSS CEIPHVDGLH SSEELRNLGW 1260
DFSQEKPSTT YQQPDSSYGA CGGHKYQQNA EQYGGTRDYW QGNGYWDPRS GRPPGTGVVY 1320
DRTQGQVPDS LTDDREEEEN WDQQDGSHFS DQSDKFLLSL QKDKGSVQAP EISSNSIKDT 1380
LAVNEKKDFS KNLEKNDIKD RGPLKKRRQE IESDSESDGE LQDRKKVRVE VEQGETSVPP 1440
GSALVGPSCV MDDFRDPQRW KECAKQGKMP CYFDLIEENV YLTERKKNKS HRDIKRMQCE 1500
CTPLSKDERA QGEIACGEDC LNRLLMIECS SRCPNGDYCS NRRFQRKQHA DVEVILTEKK 1560
GWGLRAAKDL PSNTFVLEYC GEVLDHKEFK ARVKEYARNK NIHYYFMALK NDEIIDATQK 1620
GNCSRFMNHS CEPNCETQKW TVNGQLRVGF FTTKLVPSGS ELTFDYQFQR YGKEAQKCFC 1680
GSANCRGYLG GENRVSIRAA GGKMKKERSR KKDSVDGELE ALMENGEGLS DKNQVLSLSR 1740
LMVRIETLEQ KLTCLELIQN THSQSCLKSF LERHGLSLLW IWMAELGDGR ESNQKLQEEI 1800
IKTLEHLPIP TKNMLEESKV LPIIQRWSQT KTAVPPLSEG DGYSSENTSR AHTPLNTPDP 1860
STKLSTEADT DTPKKLMFRR LKIISENSMD SAISDATSEL EGKDGKEDLD QLENVPVEEE 1920
EELQSQQLLP QQLPECKVDS ETNIEASKLP TSEPEADAEI EPKESNGTKL EEPINEETPS 1980
QDEEEGVSDV ESERSQEQPD KTVDISDLAT KLLDSWKDLK EVYRIPKKSQ TEKENTTTER 2040
GRDAVGFRDQ TPAPKTPNRS RERDPDKQTQ NKEKRKRRSS LSPPSSAYER GTKRPDDRYD 2100
TPTSKKKVRI KDRNKLSTEE RRKLFEQEVA QREAQKQQQQ MQNLGMTSPL PYDSLGYNAP 2160
HHPFAGYPPG YPMQAYVDPS NPNAGKVLLP TPSMDPVCSP APYDHAQPLV GHSTEPLSAP 2220
PPVPVVPHVA APVEVSSSQY VAQSDGVVHQ DSSVAVLPVP APGPVQGQNY SVWDSNQQSV 2280
SVQQQYSPAQ SQATIYYQGQ TCPTVYGVTS PYSQTTPPIV QSYAQPSLQY IQGQQIFTAH 2340
PQGVVVQPAA AVTTIVAPGQ PQPLQPSEMV VTNNLLDLPP PSPPKPKTIV LPPNWKTARD 2400
PEGKIYYYHV ITRQTQWDPP TWESPGDDAS LEHEAEMDLG TPTYDENPMK ASKKPKTAEA 2460
DTSSELAKKS KEVFRKEMSQ FIVQCLNPYR KPDCKVGRIT TTEDFKHLAR KLTHGVMNKE 2520
LKYCKNPEDL ECNENVKHKT KEYIKKYMQK FGAVYKPKED TELE 2564
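The Start and End values in the Classification table are 1-based residue positions in this listing, so the SET domain can be cut out directly once the running counts at the end of each line are dropped. The sketch below is a hedged Python illustration that uses only the three lines covering residues 1501-1680. The helper names `parse_numbered_sequence` and `slice_1based` are hypothetical.

```python
def parse_numbered_sequence(text: str) -> str:
    """Join 10-residue blocks, dropping the trailing running count on each line."""
    residues = []
    for line in text.strip().splitlines():
        fields = line.split()
        if fields and fields[-1].isdigit():
            fields = fields[:-1]  # running residue count, not sequence
        residues.append("".join(fields))
    return "".join(residues)


def slice_1based(seq: str, start: int, end: int, offset: int = 1) -> str:
    """Return residues start..end (1-based, inclusive); offset is the position of seq[0]."""
    return seq[start - offset : end - offset + 1]


# Three lines copied from the listing above, covering residues 1501-1680.
excerpt = """
CTPLSKDERA QGEIACGEDC LNRLLMIECS SRCPNGDYCS NRRFQRKQHA DVEVILTEKK 1560
GWGLRAAKDL PSNTFVLEYC GEVLDHKEFK ARVKEYARNK NIHYYFMALK NDEIIDATQK 1620
GNCSRFMNHS CEPNCETQKW TVNGQLRVGF FTTKLVPSGS ELTFDYQFQR YGKEAQKCFC 1680
"""

seq = parse_numbered_sequence(excerpt)
set_domain = slice_1based(seq, 1551, 1667, offset=1501)
print(len(set_domain), set_domain[:10], set_domain[-7:])
```

The slice begins with DVEVILTEKK and ends with ELTFDYQ, which agrees with the first and last target residues shown in the SET2 domain profile.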
Nucleotide Sequence
GCTGCGGGGA GGGGGCTCGC GTCGCCGTCT CCAGCCGCTC CCGATGAAGC AGCTGCAGCC 60
GCAGCCGCCT CCGAAGATGG GGGATTTCTA CGACCCGGAG CACCCGACCC CTGAAGAAGA 120
AGAAAATGAG GCAAAGATTG AAAATGTGCA GAAAACAGGT TTCATCAAAG GACCAATGTT 180
CAAAGGTGTT GCTTCTAGTC GATTTTTGCC CAAAGGCACC AAAACAAAAG TTAATTTGGA 240
AGAACAGGGA CGACAGAAGG TGTCATTCAG CTTCAGCCTT ACAAAGAAAA CTTTGCAGAA 300
TAGGTTTCTC ACTGCACTTG GCAATGAAAA GCAAAGTGAT ACTCCAAACC CTCCAGCTGT 360
ACCTCTTCAG GTAGACTCGA CTCCTAAAAT GAAAATGGAA ATTGGTGATA CCTTATCTAC 420
TGCAGAAGAA TCTTCCCCAC CAAAGTCAAG GGTGGAATTG GGCAAAATTC ATTTTAAGAA 480
ACATCTGCTT CATGTAACAT CCAGGCCACT GCTGGCTACT ACCACAGCAG TAGCATCTCC 540
ACCTACTCAT GCAGCACCAT TACCAGCAGT GATAGCAGAA TCAACAACTG TAGACTCACC 600
GCCCTCATCT CCGCCTCCAC CGCCTCCACC TGCCCAAGCC ACAACACTCT CATCACCAGC 660
ACCAGTAACA GAGCCAGTGG CCTTGCCACA TACACCAATA ACAGTTCTAA TGGCAGCACC 720
AGTACCCTTA CCAGTAGATG TAGCAGTTAG ATCTCTGAAA GAACCACCAA TTATAATTGT 780
ACCAGAATCT TTAGAAGCAG ATACTAAGCA GGACACTATA TCTAATAGTT TAGAAGAACA 840
CGTAACTCAA ATATTGAATG AGCAAGCAGA TATTTCCTCA AAAAAAGAAG ATTCCCATAT 900
TGGGAAGGAT GAAGAAATTC CAGATAGTTC TAAGATTAGT CTGAGCTGTA AAAAAACAGG 960
TTCTAAGAAG AAATCCTCAC AATCTGAAGG CATCTTTCTT GGTTCAGAAT CTGATGAAGA 1020
TTCTGTACGG ACTTCTTCAA GTCAAAGATC ACATGATTTA AAATTTTCAG CAAGCATTGA 1080
AAAGGAAAGA GATTTTAAAA AGAGCTCAGC ACCTTTAAAA AGTGAGGATC TAGGGAAACC 1140
TTCACGATCT AAAACAGACA GAGATGATAA ATATTTTAGC TATTCAAAAC TTGAAAGAGA 1200
TACTCGGTAT GTATCTTCCC GATGTAGATC AGAAAGAGAG CGACGGCGGA GCAGATCTCA 1260
CTCTAGGTCT GAGAGAGGCT CTAGAACTAA TTTATCCTAT TCCAGGTCAG AACGATCTCA 1320
TTATTATGAC TCTGATCGTC GCTACCATAG GAGCTCCCCT TATCGAGAGA GGACGCGCTA 1380
TTCTCGGCCA TACACAGATA ACAGAGCACG AGAGAGTTCT GACTCAGAAG AAGAGTATAA 1440
GAAGACATAC TCAAGGCGTA CCTCATCTCA TTCCTCTTCT TACAGAGACC TAAGGACATC 1500
ATCCTATTCT AAATCTGATC GGGACTGTAA AACTGAGACC TCTTACTTAG AGATGGAAAG 1560
AAGAGGCAAG TATTCTTCAA AACTAGAAAG AGAATCTAAA AGGACTTCAG AAAATGAAGC 1620
AATTAAAAGA TGTTGTTCTC CCCCTAATGA ACTGGGATTC CGACGAGGGT CATCATATTC 1680
TAAGCATGAC AGTAGTGCTT CCCGTTATAA ATCTACCCTT TCAAAACCTA TACCCAAGTC 1740
TGATAAATTT AAAAATTCTT TCTGTTGTAC AGAATTAAAT GAAGAAATCA AACAGTCTCA 1800
TTCTTTTAGT TTACAGACAC CTTGTTCAAA AGGTAGTGAA TTAAGAATGA TTAATAAAAA 1860
TCCTGAAAGA GAAAAGGCTG GGTCTCCAGC TCCATCAAAT CGATTAAATG ATTCACCTAC 1920
TTTAAAAAAG CTAGATGAAT TGCCTATTTT TAAGTCCGAA TTTATAACAC ATGATAGCCA 1980
TGATAGTATT AAGGAATTAG ACTCTTTATC TAAAGTGAAG AATGATCAAT TAAGAAGTTT 2040
TTGTCCCATA GAATTAAATA TAAATGGATC TCCTGGGGCA GAATCTGATT TGGCAACATT 2100
TTGCACTTCT AAAACTGATG CTGTTTTAAT GACTTCTGAT GATAGTGTGA CTGGATCGGA 2160
ATTATCCCCT TTGGTCAAAG CATGCATGCT TTCATCAAAT GGATTTCAGA ATATTAGTAG 2220
GTGCAAAGAA AAAGACTTGG ATGATACCTG CATGCTGCAT AAGAAGTCAG AAAGCCCATT 2280
TAGAGAAACA GAACCTCTGG TGTCACCACA CCAAGATAAA CTCATGTCTA TGCCAGTTAT 2340
GACTGTGGAT TATTCCAAAA CAGTAGTTAA AGAACCAGTT GATACGAGGG TTTCTTGCTG 2400
CAAAACCAAA GATTCAGACA TATACTGTAC TTTGAACGAT AGCAACCCTT CTTTGTGTAA 2460
CTCTGAAGCT GAAAATATTG AGCCTTCAGT TATGAAGATT TCTTCAAATA GCTTTATGAA 2520
TGTGCATTTG GAATCAAAAC CAGTTATATG TGATAGTAGA AATTTGACAG ATCACTCAAA 2580
ATTTGCATGT GAAGAATATA AGCAGAGCAT CGGTAGCACT AGTTCAGCTT CTGTTAATCA 2640
TTTTGATGAT TTATATCAAC CTATTGGGAG TTCAGGTATT GCTTCATCTC TTCAGAGTCT 2700
TCCACCAGGA ATAAAGGTGG ACAGTCTAAC TCTCTTGAAA TGCGGAGAGA ACACATCTCC 2760
AGTTCTGGAT GCAGTGCTAA AGAGTAAAAA AAGTTCAGAG TTTTTAAAGC ATGCAGGGAA 2820
AGAAACAATA GTAGAAGTAG GTAGTGACCT TCCTGATTCA GGAAAGGGAT TTGCTTCCAG 2880
GGAGAACAGG CGTAATAATG GGTTATCTGG GAAATGTTTG CAAGAGGCTC AAGAAGAAGG 2940
GAATTCCATA TTGCCTGAAA GAAGAGGAAG ACCAGAAATC TCTTTAGATG AAAGAGGAGA 3000
AGGAGGACAT GTGCATACTT CTGATGACTC AGAAGTTGTA TTTTCTTCTT GTGATTTGAA 3060
TTTAACCATG GAAGACAGTG ATGGTGTAAC TTATGCATTA AAGTGTGACA GTAGTGGTCA 3120
TGCCCCAGAA ATTGTGTCTA CAGTTCATGA AGATTATTCT GGCTCTTCTG AAAGTTCAAA 3180
TGATGAAAGT GATTCAGAAG ATACAGATTC GGATGATAGC AGTATTCCAA GAAACCGTCT 3240
CCAGTCTGTT GTGGTTGTGC CAAAGAATTC TACTTTGCCC ATGGAAGAAA CAAGTCCTTG 3300
TTCTTCTCGG AGCAGTCAAA GTTATAGACA CTATTCTGAC CATTGGGAAG ATGAGAGATT 3360
GGAGTCAAGG AGACATTTGT ATGAGGAAAA ATTTGAAAGT ATAGCAAGTA AAGCCTGTCC 3420
TCAAACTGAT AAGTTTTTCC TTCATAAAGG AACAGAGAAG AATCCGGAAA TTTCTTTTAC 3480
ACAGTCCAGT AGAAAACAAA TAGATAATCG CCTGCCTGAA CTTTCTCATC CTCAGAGTGA 3540
TGGGGTTGAT AGTACAAGTC ATACAGATGT GAAATCTGAC CCTCTGGGTC ACCCAAATTC 3600
AGAGGAAACC GTGAAAGCCA AAATACCTTC TAGGCAGCAA GAAGAGCTGC CAATTTATTC 3660
TTCTGATTTT GAAGATGTCC CAAATAAGTC TTGGCAACAG ACCACTTTCC AAAACAGGCC 3720
AGATAGTAGA CTGGGAAAAA CAGAATTGAG TTTTTCTTCC TCTTGTGAGA TACCACATGT 3780
GGATGGCTTG CACTCATCAG AAGAGCTCAG AAACTTAGGT TGGGACTTCT CTCAAGAAAA 3840
GCCTTCTACC ACGTATCAGC AACCTGACAG TAGCTATGGA GCTTGTGGTG GACACAAGTA 3900
TCAGCAAAAT GCAGAACAGT ATGGTGGGAC ACGTGATTAC TGGCAAGGCA ATGGTTACTG 3960
GGATCCAAGA TCAGGTAGAC CTCCTGGAAC TGGGGTTGTG TATGATCGAA CTCAAGGACA 4020
AGTACCAGAT TCCCTAACAG ATGATCGTGA AGAAGAGGAG AATTGGGATC AACAGGATGG 4080
ATCCCATTTT TCAGACCAGT CCGATAAATT TCTTCTATCC CTTCAGAAAG ACAAGGGGTC 4140
AGTGCAAGCA CCTGAAATAA GCAGCAATTC CATTAAGGAC ACTTTAGCTG TGAATGAAAA 4200
GAAAGATTTT TCAAAAAACT TAGAAAAAAA TGATATCAAA GATAGAGGGC CTCTTAAAAA 4260
AAGGAGGCAG GAAATAGAGA GTGATTCTGA AAGTGATGGT GAGCTTCAGG ACAGAAAGAA 4320
AGTTAGAGTG GAGGTAGAGC AGGGAGAGAC ATCAGTGCCC CCAGGTTCAG CACTGGTTGG 4380
GCCCTCCTGT GTCATGGATG ACTTCAGGGA CCCACAGCGA TGGAAGGAAT GTGCCAAGCA 4440
AGGGAAAATG CCATGTTACT TTGATCTTAT TGAAGAAAAT GTTTATTTAA CAGAAAGAAA 4500
GAAGAATAAA TCTCATCGAG ATATTAAGCG AATGCAGTGT GAGTGTACAC CTCTTTCTAA 4560
AGATGAAAGA GCTCAAGGTG AAATAGCATG TGGGGAAGAT TGTCTTAATC GTCTTCTCAT 4620
GATTGAATGT TCTTCTCGGT GTCCAAATGG GGATTATTGT TCCAATAGAC GGTTTCAGAG 4680
AAAACAGCAT GCAGATGTGG AAGTCATACT CACAGAAAAG AAAGGCTGGG GCTTGAGAGC 4740
TGCCAAAGAC CTTCCTTCGA ACACCTTTGT CCTAGAATAT TGTGGAGAGG TACTCGATCA 4800
TAAAGAGTTT AAAGCTCGAG TGAAGGAGTA TGCACGAAAC AAAAACATCC ATTACTATTT 4860
CATGGCCCTG AAGAATGATG AGATAATAGA TGCCACTCAA AAAGGAAATT GCTCTCGTTT 4920
CATGAATCAC AGCTGTGAAC CAAATTGTGA AACCCAAAAA TGGACTGTGA ACGGACAACT 4980
GAGGGTTGGG TTTTTTACCA CCAAACTGGT TCCTTCAGGC TCAGAGTTAA CGTTTGACTA 5040
TCAGTTCCAG AGATATGGAA AAGAAGCCCA GAAATGTTTC TGCGGATCAG CCAATTGCCG 5100
GGGTTACCTG GGAGGAGAAA ACAGAGTCAG CATCAGAGCA GCAGGAGGGA AAATGAAGAA 5160
GGAACGATCT CGTAAGAAGG ATTCAGTGGA TGGAGAGCTA GAAGCTCTGA TGGAAAATGG 5220
TGAGGGTCTC TCTGATAAAA ACCAGGTGCT CAGCTTATCC CGGCTAATGG TTAGAATTGA 5280
AACTTTGGAG CAGAAACTTA CCTGTCTGGA ACTCATACAG AACACACACT CACAGTCCTG 5340
CCTGAAGTCC TTTCTGGAAC GTCATGGGCT GTCTTTGTTG TGGATCTGGA TGGCAGAGCT 5400
AGGTGACGGC CGGGAAAGTA ACCAGAAGCT TCAGGAAGAG ATTATAAAGA CTTTGGAACA 5460
CTTGCCCATT CCTACTAAAA ATATGTTGGA GGAAAGCAAA GTACTTCCAA TTATTCAACG 5520
CTGGTCTCAG ACTAAGACTG CTGTCCCTCC GTTGAGTGAA GGAGATGGGT ATTCTAGTGA 5580
GAATACATCG CGTGCTCATA CACCACTCAA CACACCTGAT CCTTCCACCA AGCTGAGCAC 5640
AGAAGCTGAC ACAGACACTC CCAAGAAACT AATGTTTCGC AGACTGAAAA TTATAAGTGA 5700
AAATAGCATG GACAGTGCAA TCTCTGATGC AACCAGTGAG CTAGAAGGCA AGGATGGCAA 5760
AGAGGATCTT GATCAATTAG AAAATGTCCC TGTAGAGGAA GAGGAAGAAT TGCAGTCACA 5820
ACAGCTACTC CCACAACAGC TGCCTGAATG CAAAGTTGAT AGTGAAACCA ACATAGAAGC 5880
TAGTAAGCTA CCTACATCTG AACCAGAAGC TGACGCTGAA ATAGAGCCCA AAGAGAGCAA 5940
CGGCACAAAA CTAGAAGAAC CTATTAATGA AGAAACACCA TCCCAAGATG AAGAGGAGGG 6000
TGTGTCTGAT GTGGAGAGTG AAAGGAGCCA AGAACAGCCA GATAAAACAG TGGATATAAG 6060
TGATTTGGCC ACCAAACTCC TGGACAGTTG GAAAGACCTA AAGGAGGTAT ATCGAATTCC 6120
AAAGAAAAGT CAAACTGAAA AGGAAAACAC AACAACTGAA CGAGGAAGGG ATGCTGTTGG 6180
CTTCAGAGAT CAAACACCTG CCCCGAAGAC TCCTAATAGG TCAAGAGAGA GAGACCCAGA 6240
CAAGCAAACT CAAAATAAAG AGAAAAGGAA ACGAAGAAGC TCCCTCTCAC CACCCTCTTC 6300
TGCCTATGAG CGGGGAACAA AAAGGCCAGA TGACAGATAT GATACACCAA CTTCTAAAAA 6360
GAAAGTACGA ATTAAAGACC GCAATAAACT TTCTACAGAG GAACGCCGGA AGTTGTTTGA 6420
GCAAGAGGTG GCTCAACGGG AGGCTCAGAA ACAACAGCAA CAGATGCAGA ACCTGGGAAT 6480
GACATCACCA CTGCCCTATG ACTCTCTTGG TTATAATGCC CCGCATCATC CCTTTGCTGG 6540
TTACCCACCA GGTTATCCCA TGCAGGCCTA TGTGGATCCC AGCAACCCTA ATGCTGGAAA 6600
GGTGCTCCTG CCCACACCCA GCATGGACCC AGTGTGTTCT CCTGCTCCTT ATGATCATGC 6660
TCAGCCCTTG GTGGGACATT CTACAGAACC CCTTTCTGCC CCTCCACCAG TACCAGTGGT 6720
GCCACATGTG GCAGCTCCTG TGGAAGTTTC CAGTTCCCAG TATGTGGCCC AGAGTGATGG 6780
TGTAGTACAC CAAGACTCCA GCGTTGCTGT CTTGCCAGTG CCGGCCCCCG GCCCAGTTCA 6840
GGGACAGAAT TATAGTGTTT GGGATTCAAA CCAACAGTCT GTCAGTGTAC AGCAGCAGTA 6900
CTCTCCTGCA CAGTCTCAAG CAACCATATA TTATCAAGGA CAGACATGTC CAACAGTCTA 6960
TGGTGTGACA TCACCTTATT CACAGACAAC TCCACCAATT GTACAGAGTT ATGCCCAGCC 7020
AAGTCTTCAG TATATCCAGG GGCAACAGAT TTTCACAGCT CATCCACAAG GAGTGGTGGT 7080
ACAGCCAGCC GCAGCAGTGA CTACAATAGT TGCACCAGGG CAGCCTCAGC CCTTGCAGCC 7140
ATCTGAAATG GTTGTGACAA ATAATCTCTT GGATCTGCCG CCCCCCTCTC CTCCCAAACC 7200
AAAAACCATT GTCTTACCTC CCAACTGGAA GACAGCTCGA GATCCAGAAG GGAAGATTTA 7260
TTACTACCAT GTGATCACAA GGCAGACTCA GTGGGATCCT CCTACTTGGG AAAGCCCAGG 7320
AGATGATGCC AGCCTTGAGC ATGAAGCTGA GATGGACCTG GGAACTCCAA CATATGATGA 7380
AAACCCCATG AAGGCCTCGA AAAAGCCCAA GACAGCAGAA GCAGACACCT CCAGTGAACT 7440
AGCAAAGAAA AGCAAAGAAG TATTCAGAAA AGAGATGTCC CAGTTCATCG TCCAGTGCCT 7500
GAACCCTTAC CGGAAACCTG ACTGCAAAGT GGGAAGAATT ACCACAACTG AAGACTTTAA 7560
ACATCTGGCT CGCAAGCTGA CTCACGGTGT TATGAATAAG GAGCTGAAGT ACTGTAAGAA 7620
TCCTGAGGAC CTGGAGTGCA ATGAGAATGT GAAACACAAA ACCAAGGAGT ACATTAAGAA 7680
GTACATGCAG AAGTTTGGGG CTGTTTACAA ACCCAAAGAG GACACTGAAT TAGAGTGACT 7740
GTTGGGCCAG GGTGGGAGGA TGGGTGGTCA GGTAAGACAG ACTCTAGGGA GAGGAAATCC 7800
TGTGGGCCTT TCTGTCCCAC CCCTGTCAGC ACTGTGCTAC TGATGATACA TCACCCTGGG 7860
GAATTCAACC CTGCAGATGT CAACTGAAGG CCACAAAAAT GAACTCCATC TACAAGTGAT 7920
TACCTAGTTG TGAGCTGTTG GCATGTGGTT AGAAGCCATC AGAGGTGCAA GGGCTTAGAA 7980
AAGACCCTGG CCAGACCTGA CTCCACTCTT AAACCTGGGT CTTCTCCTTG GCGGTGCTGT 8040
CAGCGCACAG ACCCATGCGC ATCCCCACCC ACAACCCTTT ACCCTGATGA TCTGTATTAT 8100
ATTTTAATGT ATATGTGAAT ATATTGAAAA TAATTTGTTT TT 8143
Sequence Source Ensembl
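The nucleotide listing is the full transcript, so the coding region starts some way into the first line. As a rough consistency check, the Python sketch below translates the first two lines with the standard genetic code and compares the result with the start of the protein sequence. It assumes that the first ATG is the start codon, which is a simplification that happens to hold for these two lines and is not guaranteed for transcripts in general.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate complete codons until the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i : i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two lines of the nucleotide listing, copied with spacing and counts.
raw = """
GCTGCGGGGA GGGGGCTCGC GTCGCCGTCT CCAGCCGCTC CCGATGAAGC AGCTGCAGCC 60
GCAGCCGCCT CCGAAGATGG GGGATTTCTA CGACCCGGAG CACCCGACCC CTGAAGAAGA 120
"""
mrna = "".join(ch for ch in raw if ch in "ACGT")  # keep bases only

start = mrna.find("ATG")  # assumption: the first ATG is the start codon
peptide = translate(mrna[start:])
print(start + 1, peptide)
```

The peptide begins MKQLQPQPPP, matching the first block of the protein sequence, with the start codon at transcript position 44.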
Keyword

KW-0002--3D-structure
KW-0010--Activator
KW-0025--Alternative splicing
KW-0156--Chromatin regulator
KW-0158--Chromosome
KW-0175--Coiled coil
KW-0181--Complete proteome
KW-1017--Isopeptide bond
KW-0489--Methyltransferase
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0621--Polymorphism
KW-1185--Reference proteome
KW-0949--S-adenosyl-L-methionine
KW-0804--Transcription
KW-0805--Transcription regulation
KW-0808--Transferase
KW-0832--Ubl conjugation

Interpro

IPR006560--AWS_dom
IPR003616--Post-SET_dom
IPR001214--SET_dom
IPR013257--SRI
IPR001202--WW_dom

PROSITE

PS51215--AWS
PS50868--POST_SET
PS50280--SET
PS01159--WW_DOMAIN_1
PS50020--WW_DOMAIN_2

Pfam

PF00856--SET
PF08236--SRI
PF00397--WW

Gene Ontology

GO:0005694--C:chromosome
GO:0005654--C:nucleoplasm
GO:0046975--F:histone methyltransferase activity (H3-K36 specific)
GO:0018024--F:histone-lysine N-methyltransferase activity
GO:0001525--P:angiogenesis
GO:0035441--P:cell migration involved in vasculogenesis
GO:0060977--P:coronary vasculature morphogenesis
GO:0048701--P:embryonic cranial skeleton morphogenesis
GO:0060669--P:embryonic placenta morphogenesis
GO:0030900--P:forebrain development
GO:0097676--P:histone H3-K36 dimethylation
GO:0097198--P:histone H3-K36 trimethylation
GO:0048332--P:mesoderm morphogenesis
GO:0006298--P:mismatch repair
GO:0001763--P:morphogenesis of a branching structure
GO:0001843--P:neural tube closure
GO:0034728--P:nucleosome organization
GO:0060039--P:pericardium development
GO:0010793--P:regulation of mRNA export from nucleus
GO:0006355--P:regulation of transcription, DNA-templated
GO:0048864--P:stem cell development
GO:0006368--P:transcription elongation from RNA polymerase II promoter

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Pat-0126 ENSPTRP00000025629.4 Pan troglodytes 100 0.0 4471
WERAM-Gog-0197 ENSGGOP00000025052.2 Gorilla gorilla 99 0.0 4466
WERAM-Poa-0125 ENSPPYP00000015574.2 Pongo abelii 99 0.0 4444
WERAM-Chs-0194 ENSCSAP00000012204.1 Chlorocebus sabaeus 98 0.0 4411
WERAM-Paa-0066 ENSPANP00000003313.1 Papio anubis 98 0.0 4384
WERAM-Nol-0062 ENSNLEP00000007605.1 Nomascus leucogenys 96 0.0 4318
WERAM-Mam-0194 ENSMMUP00000027793.2 Macaca mulatta 98 0.0 4084
WERAM-Eqc-0058 ENSECAP00000007791.1 Equus caballus 91 0.0 4061
WERAM-Caf-0145 ENSCAFP00000019740.3 Canis familiaris 92 0.0 4054
WERAM-Fec-0157 ENSFCAP00000015869.2 Felis catus 91 0.0 4032
WERAM-Aim-0141 ENSAMEP00000013463.1 Ailuropoda melanoleuca 91 0.0 4011
WERAM-Ict-0161 ENSSTOP00000016952.1 Ictidomys tridecemlineatus 91 0.0 4003
WERAM-Bot-0037 ENSBTAP00000003838.5 Bos taurus 89 0.0 3960
WERAM-Mup-0172 ENSMPUP00000014934.1 Mustela putorius furo 90 0.0 3949
WERAM-Ova-0043 ENSOARP00000005431.1 Ovis aries 89 0.0 3912
WERAM-Dan-0009 ENSDNOP00000001031.3 Dasypus novemcinctus 89 0.0 3810
WERAM-Mum-0177 ENSMUSP00000116313.2 Mus musculus 84 0.0 3690
WERAM-Myl-0114 ENSMLUP00000009203.2 Myotis lucifugus 85 0.0 3507
WERAM-Ran-0141 ENSRNOP00000028408.5 Rattus norvegicus 86 0.0 3461
WERAM-Tut-0177 ENSTTRP00000015037.1 Tursiops truncatus 91 0.0 3323
WERAM-Cap-0096 ENSCPOP00000007389.2 Cavia porcellus 89 0.0 3304
WERAM-Loa-0040 ENSLAFP00000002747.3 Loxodonta africana 90 0.0 3135
WERAM-Mod-0123 ENSMODP00000017559.3 Monodelphis domestica 74 0.0 3091
WERAM-Sah-0117 ENSSHAP00000012598.1 Sarcophilus harrisii 74 0.0 3065
WERAM-Pes-0163 ENSPSIP00000019522.1 Pelodiscus sinensis 66 0.0 2693
WERAM-Tas-0101 ENSTSYP00000010617.1 Tarsius syrichta 91 0.0 2680
WERAM-Anp-0041 ENSAPLP00000004480.1 Anas platyrhynchos 61 0.0 2500
WERAM-Vip-0107 ENSVPAP00000009702.1 Vicugna pacos 91 0.0 2499
WERAM-Orc-0024 ENSOCUP00000002465.3 Oryctolagus cuniculus 94 0.0 2486
WERAM-Gaga-0059 ENSGALP00000008825.4 Gallus gallus 63 0.0 2452
WERAM-Ptv-0035 ENSPVAP00000003990.1 Pteropus vampyrus 89 0.0 2443
WERAM-Tag-0003 ENSTGUP00000000172.1 Taeniopygia guttata 60 0.0 2350
WERAM-Fia-0033 ENSFALP00000002007.1 Ficedula albicollis 61 0.0 2294
WERAM-Caj-0020 ENSCJAP00000004202.1 Callithrix jacchus 95 0.0 2284
WERAM-Ocp-0056 ENSOPRP00000004800.1 Ochotona princeps 85 0.0 2167
WERAM-Anc-0016 ENSACAP00000002203.3 Anolis carolinensis 59 0.0 2144
WERAM-Prc-0063 ENSPCAP00000005892.1 Procavia capensis 84 0.0 1985
WERAM-Dio-0143 ENSDORP00000013341.1 Dipodomys ordii 77 0.0 1925
WERAM-Tub-0124 ENSTBEP00000013685.1 Tupaia belangeri 89 0.0 1760
WERAM-Lac-0058 ENSLACP00000008064.1 Latimeria chalumnae 64 0.0 1626
WERAM-Mim-0156 ENSMICP00000016078.1 Microcebus murinus 85 0.0 1622
WERAM-Meg-0030 ENSMGAP00000002517.2 Meleagris gallopavo 68 0.0 1287
WERAM-Orn-0031 ENSONIP00000003854.1 Oreochromis niloticus 55 0.0 1243
WERAM-Soa-0060 ENSSARP00000006011.1 Sorex araneus 91 0.0 1202
WERAM-Pof-0017 ENSPFOP00000001631.1 Poecilia formosa 54 0.0 911
WERAM-Tar-0164 ENSTRUP00000035102.1 Takifugu rubripes 50 0.0 896
WERAM-Xim-0177 ENSXMAP00000014226.1 Xiphophorus maculatus 56 0.0 890
WERAM-Asm-0171 ENSAMXP00000016940.1 Astyanax mexicanus 50 0.0 884
WERAM-Leo-0048 ENSLOCP00000006757.1 Lepisosteus oculatus 55 0.0 853
WERAM-Dar-0144 ENSDARP00000114457.1 Danio rerio 52 0.0 778
WERAM-Orla-0081 ENSORLP00000010328.1 Oryzias latipes 67 0.0 760
WERAM-Cii-0057 ENSCINP00000025949.2 Ciona intestinalis 62 1e-81 304
WERAM-Drm-0037 FBpp0290774 Drosophila melanogaster 52 2e-72 273
WERAM-Cis-0029 ENSCSAVP00000005982.1 Ciona savignyi 63 1e-70 267
WERAM-Art-0032 AT1G77300.1 Arabidopsis thaliana 46 2e-61 236
WERAM-Php-0036 PP1S183_22V6.1 Physcomitrella patens 47 3e-61 236
WERAM-Bro-0120 Bo6g121240.1 Brassica oleracea 47 4e-61 235
WERAM-Brr-0078 Bra015678.1-P Brassica rapa 47 5e-61 235
WERAM-Arl-0052 fgenesh2_kg.2__2024__AT1G77300.1 Arabidopsis lyrata 47 7e-61 234
WERAM-Crn-0030 AAW44778 Cryptococcus neoformans 35 1e-60 234
WERAM-Tum-0015 CAZ82931 Tuber melanosporum 37 9e-60 231
WERAM-Sem-0083 EFJ16223 Selaginella moellendorffii 47 2e-59 229
WERAM-Amt-0022 ERN08747 Amborella trichopoda 47 2e-59 229
WERAM-Prp-0004 EMJ23127 Prunus persica 47 6e-59 228
WERAM-Nec-0001 EFNCRP00000000016 Neurospora crassa 34 7e-59 228
WERAM-Zyt-0011 Mycgr3P20674 Zymoseptoria tritici 35 8e-59 228
WERAM-Chg-0015 EAQ91281 Chaetomium globosum 36 1e-58 227
WERAM-Chr-0033 EDP02327 Chlamydomonas reinhardtii 43 2e-58 226
WERAM-Fuo-0030 FOXG_09334P0 Fusarium oxysporum 35 2e-58 226
WERAM-Glm-0044 GLYMA04G42405.1 Glycine max 44 4e-58 226
WERAM-Put-0020 PTTG_08628P0 Puccinia triticina 36 4e-58 225
WERAM-Mel-0032 EGG05344 Melampsora larici-populina 36 5e-58 225
WERAM-Trr-0038 EGR45981 Trichoderma reesei 35 8e-58 224
WERAM-Sol-0077 Solyc06g059960.2.1 Solanum lycopersicum 45 1e-57 224
WERAM-Usm-0014 UM02500P0 Ustilago maydis 34 2e-57 223
WERAM-Pyt-0041 EFQ85104 Pyrenophora teres 36 3e-57 223
WERAM-Pug-0021 EFP82907 Puccinia graminis 36 5e-57 222
WERAM-Lem-0034 CBX96970 Leptosphaeria maculans 36 5e-57 222
WERAM-Asc-0008 CADACLAP00001395 Aspergillus clavatus 33 5e-57 222
WERAM-Pytr-0017 EDU46790 Pyrenophora tritici-repentis 35 6e-57 221
WERAM-Miv-0037 MVLG_05378T0 Microbotryum violaceum 34 1e-56 221
WERAM-Asni-0036 CADANGAP00013329 Aspergillus niger 34 1e-56 221
WERAM-Trv-0033 EHK18776 Trichoderma virens 35 1e-56 221
WERAM-Orp-0022 OPUNC02G18200.1 Oryza punctata 45 2e-56 220
WERAM-Asf-0029 CADAFLAP00010840 Aspergillus flavus 34 2e-56 220
WERAM-Orni-0022 ONIVA02G22370.1 Oryza nivara 45 2e-56 219
WERAM-Orgl-0024 OGLUM02G20630.1 Oryza glumaepatula 45 3e-56 219
WERAM-Sob-0071 Sb04g022620.1 Sorghum bicolor 45 3e-56 219
WERAM-Thc-0097 EOY16446 Theobroma cacao 44 3e-56 219
WERAM-Viv-0111 VIT_18s0001g01700.t01 Vitis vinifera 44 3e-56 219
WERAM-Orr-0022 ORUFI02G21420.1 Oryza rufipogon 45 3e-56 219
WERAM-Spr-0007 CBQ71244 Sporisorium reilianum 34 4e-56 219
WERAM-Mao-0006 MGG_01661T0 Magnaporthe oryzae 34 4e-56 219
WERAM-Ved-0022 EGY14913 Verticillium dahliae 34 6e-56 218
WERAM-Sei-0057 Si016071m Setaria italica 45 7e-56 218
WERAM-Met-0069 KEH35350 Medicago truncatula 44 7e-56 218
WERAM-Ors-0021 OS02T0554000-01 Oryza sativa 45 9e-56 218
WERAM-Asfu-0029 CADAFUAP00006520 Aspergillus fumigatus 34 9e-56 218
WERAM-Coi-0027 EAS30731 Coccidioides immitis 35 1e-55 217
WERAM-Org-0023 ORGLA02G0174000.1 Oryza glaberrima 45 1e-55 217
WERAM-Tra-0077 Traes_2BS_D44A3B19F.1 Triticum aestivum 44 2e-55 217
WERAM-Aso-0020 CADAORAP00004002 Aspergillus oryzae 35 2e-55 216
WERAM-Scj-0019 EEB07769 Schizosaccharomyces japonicus 34 2e-55 216
WERAM-Yal-0019 CAG79692 Yarrowia lipolytica 36 2e-55 216
WERAM-Zem-0120 GRMZM2G352431_P01 Zea mays 44 3e-55 216
WERAM-Mua-0138 GSMUA_Achr9P08570_001 Musa acuminata 45 4e-55 215
WERAM-Brd-0085 BRADI3G45727.1 Brachypodium distachyon 44 4e-55 215
WERAM-Hov-0052 MLOC_53863.1 Hordeum vulgare 44 5e-55 215
WERAM-Lab-0050 EDR09647 Laccaria bicolor 42 6e-55 215
WERAM-Asn-0025 CADANIAP00006232 Aspergillus nidulans 37 8e-55 214
WERAM-Phn-0008 SNOT_01198 Phaeosphaeria nodorum 35 1e-54 214
WERAM-Lep-0028 LPERR02G16340.1 Leersia perrieri 45 2e-54 213
WERAM-Scp-0013 SPAC29B12.02c.1:pep Schizosaccharomyces pombe 35 2e-54 213
WERAM-Map-0019 MAPG_06439T0 Magnaporthe poae 34 2e-54 213
WERAM-Orbr-0026 OB02G27880.1 Oryza brachyantha 44 6e-54 211
WERAM-Pot-0026 POPTR_0002s07930.1 Populus trichocarpa 45 2e-53 210
WERAM-Scs-0026 EDO04606 Sclerotinia sclerotiorum 33 2e-53 210
WERAM-Gag-0009 GGTG_01904T0 Gaeumannomyces graminis 33 2e-53 209
WERAM-Orm-0025 OMERI02G20080.1 Oryza meridionalis 42 3e-53 209
WERAM-Asg-0016 AAS52557 Ashbya gossypii 35 5e-53 208
WERAM-Cag-0005 CAG58098 Candida glabrata 36 3e-52 206
WERAM-Caa-0013 KGT70398 Candida albicans 46 1e-51 204
WERAM-Ori-0052 BGIOSGA016424-PA Oryza indica 43 2e-51 203
WERAM-Aet-0014 EMT20452 Aegilops tauschii 39 9e-51 201
WERAM-Osl-0018 ABP00208 Ostreococcus lucimarinus 47 2e-50 200
WERAM-Sac-0023 YJL168C Saccharomyces cerevisiae 43 2e-50 199
WERAM-Xet-0165 ENSXETP00000005100.3 Xenopus tropicalis 48 2e-50 199
WERAM-Kll-0003 CAH02882 Kluyveromyces lactis 35 1e-49 197
WERAM-Kop-0006 CAY68124 Komagataella pastoris 47 2e-49 197
WERAM-Sus-0116 ENSSSCP00000016766.2 Sus scrofa 41 1e-48 194
WERAM-Otg-0155 ENSOGAP00000013154.2 Otolemur garnettii 41 2e-48 193
WERAM-Gam-0039 ENSGMOP00000004113.1 Gadus morhua 48 2e-48 193
WERAM-Ora-0032 ENSOANP00000005155.1 Ornithorhynchus anatinus 42 3e-48 192
WERAM-Gaa-0041 ENSGACP00000006025.1 Gasterosteus aculeatus 48 8e-48 191
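Rows in the orthology table have a variable number of words in the species column, so splitting on whitespace needs to anchor on both ends of the line. The Python sketch below shows one way to do that, assuming that protein IDs contain no whitespace and that the last three fields are always identity, E-value and score. The `Ortholog` type and `parse_ortholog` function are illustrative names only.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int   # percent identity to the human protein
    evalue: float
    score: float


def parse_ortholog(line: str) -> Ortholog:
    """Split one orthology row; the species name may contain several words."""
    fields = line.split()
    weram_id, protein_id = fields[0], fields[1]
    identity, evalue, score = fields[-3:]
    species = " ".join(fields[2:-3])
    return Ortholog(weram_id, protein_id, species, int(identity), float(evalue), float(score))


rows = [
    "WERAM-Pat-0126 ENSPTRP00000025629.4 Pan troglodytes 100 0.0 4471",
    "WERAM-Mup-0172 ENSMPUP00000014934.1 Mustela putorius furo 90 0.0 3949",
    "WERAM-Sac-0023 YJL168C Saccharomyces cerevisiae 43 2e-50 199",
]

orthologs = [parse_ortholog(r) for r in rows]
close = [o.species for o in orthologs if o.identity >= 90]
print(close)
```

Filtering on identity alone can be misleading for fragmentary gene models, so the score column is worth checking alongside it.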
Created Date 25-Jun-2016