WERAM Information


Tag Content
WERAM ID WERAM-Hos-0089
Ensembl Protein ID ENSP00000234420.4
Uniprot Accession P52701; MSH6_HUMAN; B4DF41; B4E3I4; F5H2F9; O43706; O43917; Q8TCX4; Q9BTB5
Genbank Protein ID NP_000170.1; NP_001268421.1; NP_001268422.1; NP_001268423.1
Protein Name DNA mismatch repair protein Msh6
Genbank Nucleotide ID NM_000179.2; NM_001281492.1; NM_001281493.1; NM_001281494.1
Gene Name MSH6; GTBP
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSG00000116062.14 ENST00000234420.9 ENSP00000234420.4
ENSG00000116062.14 ENST00000420813.5 ENSP00000390382.1
ENSG00000116062.14 ENST00000455383.5 ENSP00000397484.1
ENSG00000116062.14 ENST00000445503.5 ENSP00000405294.1
ENSG00000116062.14 ENST00000411819.1 ENSP00000406248.1
ENSG00000116062.14 ENST00000606499.1 ENSP00000475605.1
ENSG00000116062.14 ENST00000616033.4 ENSP00000480261.1
Details
Type Family Domain Substrates AA References (PMIDs)
Me_Reader PWWP PWWP H3K36me3 K 23622243
Status Reviewed
Classification
Type Family E-value Score Start End
Me_Reader PWWP 1.80e-22 81.5 92 153
Organism Homo sapiens
NCBI Taxa ID 9606
Functional Description
Component of the post-replicative DNA mismatch repair system (MMR). Heterodimerizes with MSH2 to form MutS alpha, which binds to DNA mismatches and thereby initiates DNA repair. When bound, MutS alpha bends the DNA helix and shields approximately 20 base pairs, and recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. After mismatch binding, it forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. ATP binding and hydrolysis play a pivotal role in mismatch repair functions. The ATPase activity associated with MutS alpha regulates binding in a manner similar to a molecular switch: mismatched DNA provokes ADP-to-ATP exchange, resulting in a discernible conformational transition that converts MutS alpha into a sliding clamp capable of hydrolysis-independent diffusion along the DNA backbone. This transition is crucial for mismatch repair. MutS alpha may also play a role in DNA homologous recombination repair. It is recruited to chromatin in G1 and early S phase via its PWWP domain, which specifically binds trimethylated 'Lys-36' of histone H3 (H3K36me3). This early recruitment to chromatin that is about to be replicated allows quick identification of mismatches so that the DNA mismatch repair reaction can be initiated.
Domain Profile
  Me_Reader PWWP

         PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
                      +gdLVwaK++gYpwWP+lv+++p++++ + +++ ++ +++V+FF+++++r wv+++ l+py++
ENSP00000234420.4  92 PGDLVWAKMEGYPWWPCLVYNHPFDGTFI-REKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTG 153
                      69***************************.*******************************86 PP

Protein Sequence
MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60
ARSASPPKAK NLNGGLRRSV APAAPTSCDF SPGDLVWAKM EGYPWWPCLV YNHPFDGTFI 120
REKGKSVRVH VQFFDDSPTR GWVSKRLLKP YTGSKSKEAQ KGGHFYSAKP EILRAMQRAD 180
EALNKDKIKR LELAVCDEPS EPEEEEEMEV GTTYVTDKSE EDNEIESEEE VQPKTQGSRR 240
SSRQIKKRRV ISDSESDIGG SDVEFKPDTK EEGSSDEISS GVGDSESEGL NSPVKVARKR 300
KRMVTGNGSL KRKSSRKETP SATKQATSIS SETKNTLRAF SAPQNSESQA HVSGGGDDSS 360
RPTVWYHETL EWLKEEKRRD EHRRRPDHPD FDASTLYVPE DFLNSCTPGM RKWWQIKSQN 420
FDLVICYKVG KFYELYHMDA LIGVSELGLV FMKGNWAHSG FPEIAFGRYS DSLVQKGYKV 480
ARVEQTETPE MMEARCRKMA HISKYDRVVR REICRIITKG TQTYSVLEGD PSENYSKYLL 540
SLKEKEEDSS GHTRAYGVCF VDTSLGKFFI GQFSDDRHCS RFRTLVAHYP PVQVLFEKGN 600
LSKETKTILK SSLSCSLQEG LIPGSQFWDA SKTLRTLLEE EYFREKLSDG IGVMLPQVLK 660
GMTSESDSIG LTPGEKSELA LSALGGCVFY LKKCLIDQEL LSMANFEEYI PLDSDTVSTT 720
RSGAIFTKAY QRMVLDAVTL NNLEIFLNGT NGSTEGTLLE RVDTCHTPFG KRLLKQWLCA 780
PLCNHYAIND RLDAIEDLMV VPDKISEVVE LLKKLPDLER LLSKIHNVGS PLKSQNHPDS 840
RAIMYEETTY SKKKIIDFLS ALEGFKVMCK IIGIMEEVAD GFKSKILKQV ISLQTKNPEG 900
RFPDLTVELN RWDTAFDHEK ARKTGLITPK AGFDSDYDQA LADIRENEQS LLEYLEKQRN 960
RIGCRTIVYW GIGRNRYQLE IPENFTTRNL PEEYELKSTK KGCKRYWTKT IEKKLANLIN 1020
AEERRDVSLK DCMRRLFYNF DKNYKDWQSA VECIAVLDVL LCLANYSRGG DGPMCRPVIL 1080
LPEDTPPFLE LKGSRHPCIT KTFFGDDFIP NDILIGCEEE EQENGKAYCV LVTGPNMGGK 1140
STLMRQAGLL AVMAQMGCYV PAEVCRLTPI DRVFTRLGAS DRIMSGESTF FVELSETASI 1200
LMHATAHSLV LVDELGRGTA TFDGTAIANA VVKELAETIK CRTLFSTHYH SLVEDYSQNV 1260
AVRLGHMACM VENECEDPSQ ETITFLYKFI KGACPKSYGF NAARLANLPE EVIQKGHRKA 1320
REFEKMNQSL RLFREVCLAS ERSTVDAEAV HKLLTLIKEL
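The Start and End values in the Classification table (92 and 153) appear to be 1-based, inclusive residue positions: slicing the protein sequence above at those positions reproduces the target row of the Domain Profile alignment once its single gap character is removed. The following Python sketch illustrates that check. The helper names `parse_numbered_block` and `slice_1based` are hypothetical, and only the first three lines of the sequence (residues 1-180) are copied in.

```python
import re

# First three lines of the protein sequence block (residues 1-180),
# copied from this record. The trailing numbers are position counters.
PROTEIN_BLOCK = """
MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60
ARSASPPKAK NLNGGLRRSV APAAPTSCDF SPGDLVWAKM EGYPWWPCLV YNHPFDGTFI 120
REKGKSVRVH VQFFDDSPTR GWVSKRLLKP YTGSKSKEAQ KGGHFYSAKP EILRAMQRAD 180
"""

# Target row of the Domain Profile alignment, including its gap character.
ALIGNED_TARGET = "PGDLVWAKMEGYPWWPCLVYNHPFDGTFI-REKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTG"


def parse_numbered_block(block: str) -> str:
    """Drop whitespace and position counters, keeping only sequence letters."""
    return re.sub(r"[^A-Za-z]", "", block).upper()


def slice_1based(seq: str, start: int, end: int) -> str:
    """Return seq[start..end] using 1-based, inclusive coordinates."""
    return seq[start - 1:end]


protein = parse_numbered_block(PROTEIN_BLOCK)
pwwp = slice_1based(protein, 92, 153)

print(len(pwwp), pwwp)
print(pwwp == ALIGNED_TARGET.replace("-", ""))
```

The same two helpers work on the full 1360-residue block if all sequence lines are pasted into `PROTEIN_BLOCK`.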
Nucleotide Sequence
GGCGAGGCGC CTGTTGATTG GCCACTGGGG CCCGGGTTCC TCCGGCGGAG CGCGCCTCCC 60
CCCAGATTTC CCGCCAGCAG GAGCCGCGCG GTAGATGCGG TGCTTTTAGG AGCTCCGTCC 120
GACAGAACGG TTGGGCCTTG CCGGCTGTCG GTATGTCGCG ACAGAGCACC CTGTACAGCT 180
TCTTCCCCAA GTCTCCGGCG CTGAGTGATG CCAACAAGGC CTCGGCCAGG GCCTCACGCG 240
AAGGCGGCCG TGCCGCCGCT GCCCCCGGGG CCTCTCCTTC CCCAGGCGGG GATGCGGCCT 300
GGAGCGAGGC TGGGCCTGGG CCCAGGCCCT TGGCGCGCTC CGCGTCACCG CCCAAGGCGA 360
AGAACCTCAA CGGAGGGCTG CGGAGATCGG TAGCGCCTGC TGCCCCCACC AGTTGTGACT 420
TCTCACCAGG AGATTTGGTT TGGGCCAAGA TGGAGGGTTA CCCCTGGTGG CCTTGTCTGG 480
TTTACAACCA CCCCTTTGAT GGAACATTCA TCCGCGAGAA AGGGAAATCA GTCCGTGTTC 540
ATGTACAGTT TTTTGATGAC AGCCCAACAA GGGGCTGGGT TAGCAAAAGG CTTTTAAAGC 600
CATATACAGG TTCAAAATCA AAGGAAGCCC AGAAGGGAGG TCATTTTTAC AGTGCAAAGC 660
CTGAAATACT GAGAGCAATG CAACGTGCAG ATGAAGCCTT AAATAAAGAC AAGATTAAGA 720
GGCTTGAATT GGCAGTTTGT GATGAGCCCT CAGAGCCAGA AGAGGAAGAA GAGATGGAGG 780
TAGGCACAAC TTACGTAACA GATAAGAGTG AAGAAGATAA TGAAATTGAG AGTGAAGAGG 840
AAGTACAGCC TAAGACACAA GGATCTAGGC GAAGTAGCCG CCAAATAAAA AAACGAAGGG 900
TCATATCAGA TTCTGAGAGT GACATTGGTG GCTCTGATGT GGAATTTAAG CCAGACACTA 960
AGGAGGAAGG AAGCAGTGAT GAAATAAGCA GTGGAGTGGG GGATAGTGAG AGTGAAGGCC 1020
TGAACAGCCC TGTCAAAGTT GCTCGAAAGC GGAAGAGAAT GGTGACTGGA AATGGCTCTC 1080
TTAAAAGGAA AAGCTCTAGG AAGGAAACGC CCTCAGCCAC CAAACAAGCA ACTAGCATTT 1140
CATCAGAAAC CAAGAATACT TTGAGAGCTT TCTCTGCCCC TCAAAATTCT GAATCCCAAG 1200
CCCACGTTAG TGGAGGTGGT GATGACAGTA GTCGCCCTAC TGTTTGGTAT CATGAAACTT 1260
TAGAATGGCT TAAGGAGGAA AAGAGAAGAG ATGAGCACAG GAGGAGGCCT GATCACCCCG 1320
ATTTTGATGC ATCTACACTC TATGTGCCTG AGGATTTCCT CAATTCTTGT ACTCCTGGGA 1380
TGAGGAAGTG GTGGCAGATT AAGTCTCAGA ACTTTGATCT TGTCATCTGT TACAAGGTGG 1440
GGAAATTTTA TGAGCTGTAC CACATGGATG CTCTTATTGG AGTCAGTGAA CTGGGGCTGG 1500
TATTCATGAA AGGCAACTGG GCCCATTCTG GCTTTCCTGA AATTGCATTT GGCCGTTATT 1560
CAGATTCCCT GGTGCAGAAG GGCTATAAAG TAGCACGAGT GGAACAGACT GAGACTCCAG 1620
AAATGATGGA GGCACGATGT AGAAAGATGG CACATATATC CAAGTATGAT AGAGTGGTGA 1680
GGAGGGAGAT CTGTAGGATC ATTACCAAGG GTACACAGAC TTACAGTGTG CTGGAAGGTG 1740
ATCCCTCTGA GAACTACAGT AAGTATCTTC TTAGCCTCAA AGAAAAAGAG GAAGATTCTT 1800
CTGGCCATAC TCGTGCATAT GGTGTGTGCT TTGTTGATAC TTCACTGGGA AAGTTTTTCA 1860
TAGGTCAGTT TTCAGATGAT CGCCATTGTT CGAGATTTAG GACTCTAGTG GCACACTATC 1920
CCCCAGTACA AGTTTTATTT GAAAAAGGAA ATCTCTCAAA GGAAACTAAA ACAATTCTAA 1980
AGAGTTCATT GTCCTGTTCT CTTCAGGAAG GTCTGATACC CGGCTCCCAG TTTTGGGATG 2040
CATCCAAAAC TTTGAGAACT CTCCTTGAGG AAGAATATTT TAGGGAAAAG CTAAGTGATG 2100
GCATTGGGGT GATGTTACCC CAGGTGCTTA AAGGTATGAC TTCAGAGTCT GATTCCATTG 2160
GGTTGACACC AGGAGAGAAA AGTGAATTGG CCCTCTCTGC TCTAGGTGGT TGTGTCTTCT 2220
ACCTCAAAAA ATGCCTTATT GATCAGGAGC TTTTATCAAT GGCTAATTTT GAAGAATATA 2280
TTCCCTTGGA TTCTGACACA GTCAGCACTA CAAGATCTGG TGCTATCTTC ACCAAAGCCT 2340
ATCAACGAAT GGTGCTAGAT GCAGTGACAT TAAACAACTT GGAGATTTTT CTGAATGGAA 2400
CAAATGGTTC TACTGAAGGA ACCCTACTAG AGAGGGTTGA TACTTGCCAT ACTCCTTTTG 2460
GTAAGCGGCT CCTAAAGCAA TGGCTTTGTG CCCCACTCTG TAACCATTAT GCTATTAATG 2520
ATCGTCTAGA TGCCATAGAA GACCTCATGG TTGTGCCTGA CAAAATCTCC GAAGTTGTAG 2580
AGCTTCTAAA GAAGCTTCCA GATCTTGAGA GGCTACTCAG TAAAATTCAT AATGTTGGGT 2640
CTCCCCTGAA GAGTCAGAAC CACCCAGACA GCAGGGCTAT AATGTATGAA GAAACTACAT 2700
ACAGCAAGAA GAAGATTATT GATTTTCTTT CTGCTCTGGA AGGATTCAAA GTAATGTGTA 2760
AAATTATAGG GATCATGGAA GAAGTTGCTG ATGGTTTTAA GTCTAAAATC CTTAAGCAGG 2820
TCATCTCTCT GCAGACAAAA AATCCTGAAG GTCGTTTTCC TGATTTGACT GTAGAATTGA 2880
ACCGATGGGA TACAGCCTTT GACCATGAAA AGGCTCGAAA GACTGGACTT ATTACTCCCA 2940
AAGCAGGCTT TGACTCTGAT TATGACCAAG CTCTTGCTGA CATAAGAGAA AATGAACAGA 3000
GCCTCCTGGA ATACCTAGAG AAACAGCGCA ACAGAATTGG CTGTAGGACC ATAGTCTATT 3060
GGGGGATTGG TAGGAACCGT TACCAGCTGG AAATTCCTGA GAATTTCACC ACTCGCAATT 3120
TGCCAGAAGA ATACGAGTTG AAATCTACCA AGAAGGGCTG TAAACGATAC TGGACCAAAA 3180
CTATTGAAAA GAAGTTGGCT AATCTCATAA ATGCTGAAGA ACGGAGGGAT GTATCATTGA 3240
AGGACTGCAT GCGGCGACTG TTCTATAACT TTGATAAAAA TTACAAGGAC TGGCAGTCTG 3300
CTGTAGAGTG TATCGCAGTG TTGGATGTTT TACTGTGCCT GGCTAACTAT AGTCGAGGGG 3360
GTGATGGTCC TATGTGTCGC CCAGTAATTC TGTTGCCGGA AGATACCCCC CCCTTCTTAG 3420
AGCTTAAAGG ATCACGCCAT CCTTGCATTA CGAAGACTTT TTTTGGAGAT GATTTTATTC 3480
CTAATGACAT TCTAATAGGC TGTGAGGAAG AGGAGCAGGA AAATGGCAAA GCCTATTGTG 3540
TGCTTGTTAC TGGACCAAAT ATGGGGGGCA AGTCTACGCT TATGAGACAG GCTGGCTTAT 3600
TAGCTGTAAT GGCCCAGATG GGTTGTTACG TCCCTGCTGA AGTGTGCAGG CTCACACCAA 3660
TTGATAGAGT GTTTACTAGA CTTGGTGCCT CAGACAGAAT AATGTCAGGT GAAAGTACAT 3720
TTTTTGTTGA ATTAAGTGAA ACTGCCAGCA TACTCATGCA TGCAACAGCA CATTCTCTGG 3780
TGCTTGTGGA TGAATTAGGA AGAGGTACTG CAACATTTGA TGGGACGGCA ATAGCAAATG 3840
CAGTTGTTAA AGAACTTGCT GAGACTATAA AATGTCGTAC ATTATTTTCA ACTCACTACC 3900
ATTCATTAGT AGAAGATTAT TCTCAAAATG TTGCTGTGCG CCTAGGACAT ATGGCATGCA 3960
TGGTAGAAAA TGAATGTGAA GACCCCAGCC AGGAGACTAT TACGTTCCTC TATAAATTCA 4020
TTAAGGGAGC TTGTCCTAAA AGCTATGGCT TTAATGCAGC AAGGCTTGCT AATCTCCCAG 4080
AGGAAGTTAT TCAAAAGGGA CATAGAAAAG CAAGAGAATT TGAGAAGATG AATCAGTCAC 4140
TACGATTATT TCGGGAAGTT TGCCTGGCTA GTGAAAGGTC AACTGTAGAT GCTGAAGCTG 4200
TCCATAAATT GCTGACTTTG ATTAAGGAAT TATAGACTGA CTACATTGGA AGCTTTGAGT 4260
TGACTTCTGA CAAAGGTGGT AAATTCAGAC AACATTATGA TCTAATAAAC TTTATTTTTT 4320
AAAAATGACC ATTTTTCCAT TTTCTTTCTA GGAAATTAAA CCCTTTTAAT TCTTATCTAC 4380
CTTCTACATA ATGGTTATTG AATACTCCAC AATATATTAA GTCTAGATGT TATGGTACAT 4440
GCATACACTT TCAGGCTGTT TTATACCCAC TGTCACCAAT ACACATAAAT GGGGGAGGAA 4500
AAGCTATGAA ACTGTATAGG GCTGTATATA TACTTGTCTC AGCTTAATGC AGGAAATTGG 4560
TTTAATTTCC AGCAGTTTTG TCTAAACTGT TCAAAAAAAA ACTATGAACA GAGTTCAAAT 4620
ACAGGACTGT TTGTTTTGAA GAGACTTTCT AAAGTGTACT TAAAACATAG TAGTTTTTTA 4680
CCTTTCACAA AACTGAGTTA CAAGAATACT TTTGTTTTAC AGTGCATCCC TTCCTAGGAA 4740
GTCTCATTAA AACACTCACT TTTTCTAGGG GTGATTTTGA ATGCTGCACA GGGAAGGGAA 4800
GGAAATAATA GTCTTAACTT TTCTTAAAGG ATACCAGAAA CATTGCTGGA TATAATTTAA 4860
GATTAGTGTT TTCTCTTTCA TAGAAAGAAC GTACATACTG GGACATGAGT ACAGTTACAG 4920
CAAGTCTAGG TGTGCTAACA AAACAGGGCA CATTCAAGTA CAGTAAGATT TTGCTTGAAA 4980
TTAAAAACAA ACTACATGAG ATTAAAGCAT TAAAATCATA TTTCTCAATC TGAATACATG 5040
TTAAAAAAAA AAAATCAAAA GGAACGCAGA AGTGCTAGCT CACATTTTTA CCATATTACA 5100
AAAGCAATTG GTACCCATGT CCATAAAGGC AGCAACAAAG CTGCTTGTCT ATTGAAGATT 5160
ACTACTGCAA ATTGGACTGC ATTCAATGCT AGTTGTAAAA ACACCAGCTT TTCAGAAGTT 5220
GGTATCTGTA CAAAATTGCA GCTTATTTTC TTCACTTCTG TCCCTTCAAG TCTTTACACA 5280
GTAATGCTAA AACACCCAGC TTTGAGATCC TGAGTCAATA TATTGCCACT TTCTTTTTGG 5340
TAGCTTGAGC TTCATAGTGT CAACTGACCT TGTGTATCCA TTTTTAATAC AGTCTCTTCC 5400
TGTAGCATGG GCAAATATTT TAAATCTTCT TCCAAAAAAG TGTTTTAAGT TATGATGTTA 5460
CAATGGCAGG ACTTTTTCTT TAGGGAAGGA ATTCAGTTGT GCTGCAATGT ATTAGATTCT 5520
ATAGGTGGAG CAGAGTCATA TAGTGTATCT GTATCATGTG TAGGCTCACC AGCTAATGTA 5580
CAAGGATTAG ACAGTGTTCC AGCACCACAG TCACAGAAAA ACCTAAAGCA AAATGAAACC 5640
CAAATATTAG AAAAGTGAGG GGGAAAGTAA TTGGGTAATA TATCAAGCAA GTGTGCTACA 5700
TACCTATCAT GTCTAATAAA CTCTACATCA TGTCCCTGAT GGCACTTCTT AATGCAGTTC 5760
ACACATATGG CATTTCGATC TGTGGTGTTA CAAGTATGAC ATCTAAAAAG CAAAAGCTTA 5820
AATTACTTTT CTCAAACATG TCATTAATGC AAAACATTCC ATTCTGTTTA TATATTACTA 5880
TGACCTTTGG CTTTAAGAGG ACCAAAACAA AATTCTTTGT GGCTCCAGCC CAGATTAATT 5940
CTGAAAAGGA ACTTTAATGG AGTAAGTGAT TTTCCTGTCA TCTGTGTCTT CGGAGGGAAG 6000
AGAAATGATT TGTAAATTGT ATAAAGGCAG TTCTTTCCAC TTTAAAAGCC TCTCAAATGT 6060
TTCTGGGCTG AAAACAATTT TTGGAGGCGT GAAGAGTCAA AACTGTCACA GTGACTGGGA 6120
TATATCAAAC ACTTAACCCC GACATCTTTA CCTTGAAATT TCTAGGAAAA CATTACACAA 6180
CATGAGTTAC ATGAATGACA TCAGTTACTG TAGCATTAGG TTTTTCCATA GTTATGGTCT 6240
TTGTTTTGTT TTGTAGAGAC AGGGTCTCCC TATGTTGCCC AGGCTGGTTT AGAACTCCTG 6300
GGCTCCAGTG ATCCTCCCAC TTCAGCCTCC CAAAGTGCTA GGATTACAGG CATAAGCCAC 6360
CACGCCTACC CACAGTTACA GTCTTAAACA CGATCTTCAA GTAGATTGAT GATAAAATTT 6420
TCAGTTAGTT ATAGTCTCAA CACCGGCAAA TAGCCAAAAA TGCTAGGCAT TGCTAATTTA 6480
AAAAGGAAAT CAGTCTTCCT CTTTTCAGGA CTCAAATATA TTTCTAAGTT ACCTGTAGAA 6540
ATCATGCATG GGATAGCTGG TATAACTTGA TATTTTATAT AAACATTGGC CTCTACTAAC 6600
AGCCTTTTCT ATGGCATCTT GATTGTTCAT TATTTTGTTA TCTGTAATAA AAGAAAGAAT 6660
AAGTAAAAAT TCAGAGGAAT GTTAATATTT TAAAAACCAA AGATTATAGG ATTATTCTAA 6720
CAGAAGAGCC ACTATTTTTA AGAGCTTTAA ATGAAGCTAA CCAATGAAGT AATTGTAAGA 6780
AATCAGCTAA GAATAGAATT TTCCTTGTAT AAGATACTCC AACCATTTAG AACCAAAGCT 6840
CTGTTTCTTT CAAAATCTAT CTTAAACTGT TGCTAACTTG GAGAGTGACA TAAGGAATCA 6900
AGTTATAAAA CGGCTTCTGA TTATCTTTCA TGGCATATTG CATATATTTA TAGGTATAGC 6960
AGACTCCAAC ATACCTTTCA TTGTCACATT AACACCAGAT GCTAAAAATA AGCCTCCAAA 7020
CCGGTTGTTA AAAATCTGAT TGCCTTCTAG TGTTGCAGTT GCGTGATTTG TAATTTCAAT 7080
ACCTGAAGTA AAATTTACAA ACAAGTAGAT ACATCACTTT ATACTGCTTC TTAAAAACCT 7140
GAAATTAGCA AGCAAATGTA AACTGCTTCT TTTATAGAAG TACATTAACC CTCTTAATGT 7200
CTACTGAATA AAATGTAGAT ACCTATTTCA ACCACCAACA GTAACATTCA CTTATCAATG 7260
ACTATGGTCA AAACTGCAAT TAACTTTCGC ACCAACCTAA CTGTCTTAAA GTTTAAATAC 7320
ATGATACTTG GATTTCATTT GCATCCATTT TAACATCTCT TTTTCTGTTG CAGATTTAAA 7380
CTGGTAAATT CATCTGAGGA ATTGAATCTA TCTGTATTCC TAGTGGTAAT ACAAGCCTGC 7440
ATTTATTCTA TCCCAATAAA TGTTTCATAA TCACGA 7477
Sequence Source Ensembl
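The record does not state where the coding region begins in the nucleotide sequence. As a sketch, assuming the initiator ATG is the one at nucleotide 153 (an inference, not a value given in this record), translating from that position with the standard genetic code reproduces the first residues of the protein sequence. Only nucleotides 121-240 are copied in, and the constant and function names are hypothetical.

```python
import re
from itertools import product

# Nucleotides 121-240 of the sequence above (two lines copied from this record).
MRNA_121_240 = """
GACAGAACGG TTGGGCCTTG CCGGCTGTCG GTATGTCGCG ACAGAGCACC CTGTACAGCT 180
TCTTCCCCAA GTCTCCGGCG CTGAGTGATG CCAACAAGGC CTCGGCCAGG GCCTCACGCG 240
"""
FRAGMENT_START = 121  # 1-based position of the first base in the fragment
CDS_START = 153       # ASSUMPTION: 1-based position of the initiator ATG

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate complete codons only; a trailing partial codon is ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Strip position counters and whitespace, then start reading at the assumed ATG.
fragment = re.sub(r"[^ACGT]", "", MRNA_121_240.upper())
cds_prefix = fragment[CDS_START - FRAGMENT_START:]
peptide = translate(cds_prefix)

print(peptide)  # MSRQSTLYSFFPKSPALSDANKASARASR
```

The printed peptide matches the first 29 residues of the protein sequence in this record, which supports the assumed start position for this transcript.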
Keyword

KW-0002--3D-structure
KW-0007--Acetylation
KW-0025--Alternative splicing
KW-0067--ATP-binding
KW-0158--Chromosome
KW-0181--Complete proteome
KW-0903--Direct protein sequencing
KW-0225--Disease mutation
KW-0227--DNA damage
KW-0234--DNA repair
KW-0238--DNA-binding
KW-0362--Hereditary nonpolyposis colorectal cancer
KW-0945--Host-virus interaction
KW-0547--Nucleotide-binding
KW-0539--Nucleus
KW-0597--Phosphoprotein
KW-0621--Polymorphism
KW-1185--Reference proteome

Interpro

IPR007695--DNA_mismatch_repair_MutS-lik_N
IPR000432--DNA_mismatch_repair_MutS_C
IPR007861--DNA_mismatch_repair_MutS_clamp
IPR007696--DNA_mismatch_repair_MutS_core
IPR016151--DNA_mismatch_repair_MutS_N
IPR007860--DNA_mmatch_repair_MutS_con_dom
IPR027417--P-loop_NTPase
IPR000313--PWWP_dom

PROSITE

PS00486--DNA_MISMATCH_REPAIR_2
PS50812--PWWP

Pfam

PF01624--MutS_I
PF05188--MutS_II
PF05192--MutS_III
PF05190--MutS_IV
PF00488--MutS_V
PF00855--PWWP

Gene Ontology

GO:0005737--C:cytoplasm
GO:0005794--C:Golgi apparatus
GO:0043231--C:intracellular membrane-bounded organelle
GO:0032301--C:MutSalpha complex
GO:0000790--C:nuclear chromatin
GO:0000228--C:nuclear chromosome
GO:0005654--C:nucleoplasm
GO:0005886--C:plasma membrane
GO:0005524--F:ATP binding
GO:0003682--F:chromatin binding
GO:0003684--F:damaged DNA binding
GO:0032137--F:guanine/thymine mispair binding
GO:0035064--F:methylated histone binding
GO:0030983--F:mismatched DNA binding
GO:0008340--P:determination of adult lifespan
GO:0006281--P:DNA repair
GO:0097193--P:intrinsic apoptotic signaling pathway
GO:0008630--P:intrinsic apoptotic signaling pathway in response to DNA damage
GO:0045190--P:isotype switching
GO:0000710--P:meiotic mismatch repair
GO:0006298--P:mismatch repair
GO:0045910--P:negative regulation of DNA recombination
GO:0051096--P:positive regulation of helicase activity
GO:0045830--P:positive regulation of isotype switching
GO:0007131--P:reciprocal meiotic recombination
GO:0009411--P:response to UV
GO:0016446--P:somatic hypermutation of immunoglobulin genes
GO:0016447--P:somatic recombination of immunoglobulin gene segments
GO:0016032--P:viral process
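Each Gene Ontology line above follows the pattern `ID--aspect:term`, where the one-letter aspect code is C (cellular component), F (molecular function), or P (biological process). A minimal Python sketch for splitting such lines is shown below; the function name `parse_go_line` and the dictionary layout are hypothetical, and the three sample lines are copied from this record.

```python
# Three sample lines copied from the Gene Ontology list in this record.
GO_LINES = [
    "GO:0032301--C:MutSalpha complex",
    "GO:0035064--F:methylated histone binding",
    "GO:0006298--P:mismatch repair",
]

# One-letter aspect codes used in the list.
ASPECTS = {
    "C": "cellular component",
    "F": "molecular function",
    "P": "biological process",
}


def parse_go_line(line: str) -> dict:
    """Split 'GO:nnnnnnn--X:term name' into id, aspect, and term."""
    go_id, rest = line.strip().split("--", 1)
    code, term = rest.split(":", 1)
    return {"id": go_id, "aspect": ASPECTS[code], "term": term}


parsed = [parse_go_line(line) for line in GO_LINES]
for entry in parsed:
    print(entry["id"], entry["aspect"], entry["term"], sep="\t")
```

Splitting on the first `--` and then the first `:` keeps term names intact even when they contain further punctuation, such as `guanine/thymine mispair binding`.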

Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Chs-0168 ENSCSAP00000009745.1 Chlorocebus sabaeus 98 0.0 2514
WERAM-Mam-0208 ENSMMUP00000030085.2 Macaca mulatta 98 0.0 2510
WERAM-Poa-0108 ENSPPYP00000013873.2 Pongo abelii 99 0.0 2508
WERAM-Pat-0096 ENSPTRP00000020434.4 Pan troglodytes 99 0.0 2504
WERAM-Paa-0131 ENSPANP00000007626.1 Papio anubis 98 0.0 2492
WERAM-Gog-0195 ENSGGOP00000018924.1 Gorilla gorilla 96 0.0 2454
WERAM-Nol-0087 ENSNLEP00000010403.1 Nomascus leucogenys 99 0.0 2424
WERAM-Otg-0214 ENSOGAP00000021215.1 Otolemur garnettii 93 0.0 2376
WERAM-Ict-0166 ENSSTOP00000016600.1 Ictidomys tridecemlineatus 92 0.0 2368
WERAM-Aim-0076 ENSAMEP00000006662.1 Ailuropoda melanoleuca 92 0.0 2365
WERAM-Mup-0123 ENSMPUP00000010923.1 Mustela putorius furo 92 0.0 2364
WERAM-Dan-0137 ENSDNOP00000026714.1 Dasypus novemcinctus 91 0.0 2358
WERAM-Bot-0012 ENSBTAP00000001867.5 Bos taurus 92 0.0 2355
WERAM-Ova-0042 ENSOARP00000005406.1 Ovis aries 90 0.0 2321
WERAM-Ptv-0116 ENSPVAP00000010278.1 Pteropus vampyrus 90 0.0 2321
WERAM-Myl-0101 ENSMLUP00000008006.2 Myotis lucifugus 90 0.0 2320
WERAM-Tut-0091 ENSTTRP00000007640.1 Tursiops truncatus 91 0.0 2320
WERAM-Sus-0141 ENSSSCP00000023698.1 Sus scrofa 91 0.0 2312
WERAM-Orc-0143 ENSOCUP00000012569.2 Oryctolagus cuniculus 90 0.0 2306
WERAM-Prc-0145 ENSPCAP00000013528.1 Procavia capensis 89 0.0 2278
WERAM-Loa-0023 ENSLAFP00000001328.3 Loxodonta africana 93 0.0 2278
WERAM-Cap-0004 ENSCPOP00000000836.2 Cavia porcellus 88 0.0 2270
WERAM-Caf-0034 ENSCAFP00000003882.3 Canis familiaris 93 0.0 2252
WERAM-Ran-0105 ENSRNOP00000021923.6 Rattus norvegicus 85 0.0 2227
WERAM-Mum-0014 ENSMUSP00000005503.3 Mus musculus 85 0.0 2202
WERAM-Fec-0079 ENSFCAP00000006578.2 Felis catus 92 0.0 2176
WERAM-Sah-0081 ENSSHAP00000009277.1 Sarcophilus harrisii 81 0.0 2002
WERAM-Mod-0012 ENSMODP00000001344.2 Monodelphis domestica 80 0.0 1991
WERAM-Gaga-0092 ENSGALP00000038835.2 Gallus gallus 70 0.0 1757
WERAM-Xet-0147 ENSXETP00000049000.3 Xenopus tropicalis 66 0.0 1747
WERAM-Meg-0053 ENSMGAP00000005001.2 Meleagris gallopavo 70 0.0 1742
WERAM-Caj-0115 ENSCJAP00000020345.2 Callithrix jacchus 91 0.0 1741
WERAM-Pes-0030 ENSPSIP00000005328.1 Pelodiscus sinensis 71 0.0 1731
WERAM-Tag-0082 ENSTGUP00000005725.1 Taeniopygia guttata 71 0.0 1715
WERAM-Lac-0149 ENSLACP00000017187.1 Latimeria chalumnae 65 0.0 1715
WERAM-Anp-0154 ENSAPLP00000015904.1 Anas platyrhynchos 74 0.0 1634
WERAM-Anc-0146 ENSACAP00000013849.3 Anolis carolinensis 61 0.0 1627
WERAM-Leo-0169 ENSLOCP00000020114.1 Lepisosteus oculatus 62 0.0 1596
WERAM-Ocp-0051 ENSOPRP00000004458.1 Ochotona princeps 90 0.0 1594
WERAM-Orn-0123 ENSONIP00000012391.1 Oreochromis niloticus 60 0.0 1571
WERAM-Orla-0115 ENSORLP00000013893.1 Oryzias latipes 60 0.0 1554
WERAM-Tar-0161 ENSTRUP00000034377.1 Takifugu rubripes 66 0.0 1461
WERAM-Pof-0182 ENSPFOP00000015502.2 Poecilia formosa 67 0.0 1453
WERAM-Xim-0134 ENSXMAP00000011014.1 Xiphophorus maculatus 67 0.0 1446
WERAM-Asm-0007 ENSAMXP00000001159.1 Astyanax mexicanus 59 0.0 1437
WERAM-Ten-0197 ENSTNIP00000019708.1 Tetraodon nigroviridis 67 0.0 1422
WERAM-Gaa-0054 ENSGACP00000007160.1 Gasterosteus aculeatus 66 0.0 1417
WERAM-Gam-0173 ENSGMOP00000017118.1 Gadus morhua 68 0.0 1390
WERAM-Tub-0139 ENSTBEP00000014916.1 Tupaia belangeri 91 0.0 1379
WERAM-Dio-0122 ENSDORP00000011809.1 Dipodomys ordii 92 0.0 1278
WERAM-Pem-0088 ENSPMAP00000009336.1 Petromyzon marinus 59 0.0 1254
WERAM-Cis-0032 ENSCSAVP00000006844.1 Ciona savignyi 50 0.0 1021
WERAM-Ere-0070 ENSEEUP00000005594.1 Erinaceus europaeus 89 0.0 996
WERAM-Eqc-0026 ENSECAP00000004388.1 Equus caballus 96 0.0 972
WERAM-Tas-0125 ENSTSYP00000013319.1 Tarsius syrichta 93 0.0 847
WERAM-Ect-0048 ENSETEP00000005096.1 Echinops telfairi 89 0.0 791
WERAM-Mim-0103 ENSMICP00000009629.1 Microcebus murinus 83 0.0 724
WERAM-Met-0160 AES82183 Medicago truncatula 40 0.0 660
WERAM-Chh-0108 ENSCHOP00000011885.1 Choloepus hoffmanni 94 5e-179 626
WERAM-Vip-0079 ENSVPAP00000007177.1 Vicugna pacos 94 3e-168 590
WERAM-Dar-0233 ENSDARP00000130154.1 Danio rerio 63 5e-24 111
WERAM-Soa-0010 ENSSARP00000001031.1 Sorex araneus 46 4e-12 72.0
WERAM-Fia-0121 ENSFALP00000010353.1 Ficedula albicollis 46 9e-12 70.9
WERAM-Mae-0114 ENSMEUP00000010800.1 Macropus eugenii 32 3e-11 69.3
WERAM-Ora-0032 ENSOANP00000005155.1 Ornithorhynchus anatinus 34 7e-11 67.8
WERAM-Sei-0090 Si024584m Setaria italica 35 4e-06 52.0
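Because species names in the Orthology table contain one to three words, the rows cannot be split on whitespace alone. One workable approach, sketched below in Python, is to take the last three fields from the right (Identity, E-value, Score) and the first two from the left (WERAM ID, protein ID), leaving the species name in the middle. The function name `parse_orthology_row` is hypothetical, and treating Identity as an integer percentage is an assumption based on the values shown.

```python
# Three sample rows copied from the Orthology table in this record.
ROWS = [
    "WERAM-Pat-0096 ENSPTRP00000020434.4 Pan troglodytes 99 0.0 2504",
    "WERAM-Mup-0123 ENSMPUP00000010923.1 Mustela putorius furo 92 0.0 2364",
    "WERAM-Sei-0090 Si024584m Setaria italica 35 4e-06 52.0",
]


def parse_orthology_row(row: str) -> dict:
    """Parse one flattened row; the species name may span several words."""
    # The three numeric columns are always the last three fields.
    head, identity, evalue, score = row.rsplit(None, 3)
    # The two identifiers are always the first two fields.
    weram_id, protein_id, species = head.split(None, 2)
    return {
        "weram_id": weram_id,
        "protein_id": protein_id,
        "species": species,
        "identity": int(identity),   # ASSUMPTION: integer percent identity
        "evalue": float(evalue),
        "score": float(score),
    }


orthologs = [parse_orthology_row(row) for row in ROWS]
for o in orthologs:
    print(o["species"], o["identity"], o["evalue"], o["score"])
```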
Created Date 25-Jun-2016