WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Hos-0089 | ||||||||||||
Ensembl Protein ID | ENSP00000234420.4 | ||||||||||||
Uniprot Accession | P52701; MSH6_HUMAN; B4DF41; B4E3I4; F5H2F9; O43706; O43917; Q8TCX4; Q9BTB5 | ||||||||||||
Genbank Protein ID | NP_000170.1; NP_001268421.1; NP_001268422.1; NP_001268423.1 | ||||||||||||
Protein Name | DNA mismatch repair protein Msh6 | ||||||||||||
Genbank Nucleotide ID | NM_000179.2; NM_001281492.1; NM_001281493.1; NM_001281494.1 | ||||||||||||
Gene Name | MSH6;GTBP | ||||||||||||
Ensembl Information | |||||||||||||
Details |
|
||||||||||||
Status | Reviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Homo sapiens | ||||||||||||
NCBI Taxa ID | 9606 | ||||||||||||
Functional Description (View) |
Component of the post-replicative DNA mismatch repair system (MMR). Heterodimerizes with MSH2 to form MutS alpha, which binds to DNA mismatches thereby initiating DNA repair. When bound, MutS alpha bends the DNA helix and shields approximately 20 base pairs, and recognizes single base mismatches and dinucleotide insertion-deletion loops (IDL) in the DNA. After mismatch binding, forms a ternary complex with the MutL alpha heterodimer, which is thought to be responsible for directing the downstream MMR events, including strand discrimination, excision, and resynthesis. ATP binding and hydrolysis play a pivotal role in mismatch repair functions. The ATPase activity associated with MutS alpha regulates binding similar to a molecular switch: mismatched DNA provokes ADP-->ATP exchange, resulting in a discernible conformational transition that converts MutS alpha into a sliding clamp capable of hydrolysis-independent diffusion along the DNA backbone. This transition is crucial for mismatch repair. MutS alpha may also play a role in DNA homologous recombination repair. Recruited on chromatin in G1 and early S phase via its PWWP domain that specifically binds trimethylated 'Lys-36' of histone H3 (H3K36me3): early recruitment to chromatin to be replicated allowing a quick identification of mismatch repair to initiate the DNA mismatch repair reaction. | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60 ARSASPPKAK NLNGGLRRSV APAAPTSCDF SPGDLVWAKM EGYPWWPCLV YNHPFDGTFI 120 REKGKSVRVH VQFFDDSPTR GWVSKRLLKP YTGSKSKEAQ KGGHFYSAKP EILRAMQRAD 180 EALNKDKIKR LELAVCDEPS EPEEEEEMEV GTTYVTDKSE EDNEIESEEE VQPKTQGSRR 240 SSRQIKKRRV ISDSESDIGG SDVEFKPDTK EEGSSDEISS GVGDSESEGL NSPVKVARKR 300 KRMVTGNGSL KRKSSRKETP SATKQATSIS SETKNTLRAF SAPQNSESQA HVSGGGDDSS 360 RPTVWYHETL EWLKEEKRRD EHRRRPDHPD FDASTLYVPE DFLNSCTPGM RKWWQIKSQN 420 FDLVICYKVG KFYELYHMDA LIGVSELGLV FMKGNWAHSG FPEIAFGRYS DSLVQKGYKV 480 ARVEQTETPE MMEARCRKMA HISKYDRVVR REICRIITKG TQTYSVLEGD PSENYSKYLL 540 SLKEKEEDSS GHTRAYGVCF VDTSLGKFFI GQFSDDRHCS RFRTLVAHYP PVQVLFEKGN 600 LSKETKTILK SSLSCSLQEG LIPGSQFWDA SKTLRTLLEE EYFREKLSDG IGVMLPQVLK 660 GMTSESDSIG LTPGEKSELA LSALGGCVFY LKKCLIDQEL LSMANFEEYI PLDSDTVSTT 720 RSGAIFTKAY QRMVLDAVTL NNLEIFLNGT NGSTEGTLLE RVDTCHTPFG KRLLKQWLCA 780 PLCNHYAIND RLDAIEDLMV VPDKISEVVE LLKKLPDLER LLSKIHNVGS PLKSQNHPDS 840 RAIMYEETTY SKKKIIDFLS ALEGFKVMCK IIGIMEEVAD GFKSKILKQV ISLQTKNPEG 900 RFPDLTVELN RWDTAFDHEK ARKTGLITPK AGFDSDYDQA LADIRENEQS LLEYLEKQRN 960 RIGCRTIVYW GIGRNRYQLE IPENFTTRNL PEEYELKSTK KGCKRYWTKT IEKKLANLIN 1020 AEERRDVSLK DCMRRLFYNF DKNYKDWQSA VECIAVLDVL LCLANYSRGG DGPMCRPVIL 1080 LPEDTPPFLE LKGSRHPCIT KTFFGDDFIP NDILIGCEEE EQENGKAYCV LVTGPNMGGK 1140 STLMRQAGLL AVMAQMGCYV PAEVCRLTPI DRVFTRLGAS DRIMSGESTF FVELSETASI 1200 LMHATAHSLV LVDELGRGTA TFDGTAIANA VVKELAETIK CRTLFSTHYH SLVEDYSQNV 1260 AVRLGHMACM VENECEDPSQ ETITFLYKFI KGACPKSYGF NAARLANLPE EVIQKGHRKA 1320 REFEKMNQSL RLFREVCLAS ERSTVDAEAV HKLLTLIKEL |
||||||||||||
Nucleotide Sequence (Fasta) | GGCGAGGCGC CTGTTGATTG GCCACTGGGG CCCGGGTTCC TCCGGCGGAG CGCGCCTCCC 60 CCCAGATTTC CCGCCAGCAG GAGCCGCGCG GTAGATGCGG TGCTTTTAGG AGCTCCGTCC 120 GACAGAACGG TTGGGCCTTG CCGGCTGTCG GTATGTCGCG ACAGAGCACC CTGTACAGCT 180 TCTTCCCCAA GTCTCCGGCG CTGAGTGATG CCAACAAGGC CTCGGCCAGG GCCTCACGCG 240 AAGGCGGCCG TGCCGCCGCT GCCCCCGGGG CCTCTCCTTC CCCAGGCGGG GATGCGGCCT 300 GGAGCGAGGC TGGGCCTGGG CCCAGGCCCT TGGCGCGCTC CGCGTCACCG CCCAAGGCGA 360 AGAACCTCAA CGGAGGGCTG CGGAGATCGG TAGCGCCTGC TGCCCCCACC AGTTGTGACT 420 TCTCACCAGG AGATTTGGTT TGGGCCAAGA TGGAGGGTTA CCCCTGGTGG CCTTGTCTGG 480 TTTACAACCA CCCCTTTGAT GGAACATTCA TCCGCGAGAA AGGGAAATCA GTCCGTGTTC 540 ATGTACAGTT TTTTGATGAC AGCCCAACAA GGGGCTGGGT TAGCAAAAGG CTTTTAAAGC 600 CATATACAGG TTCAAAATCA AAGGAAGCCC AGAAGGGAGG TCATTTTTAC AGTGCAAAGC 660 CTGAAATACT GAGAGCAATG CAACGTGCAG ATGAAGCCTT AAATAAAGAC AAGATTAAGA 720 GGCTTGAATT GGCAGTTTGT GATGAGCCCT CAGAGCCAGA AGAGGAAGAA GAGATGGAGG 780 TAGGCACAAC TTACGTAACA GATAAGAGTG AAGAAGATAA TGAAATTGAG AGTGAAGAGG 840 AAGTACAGCC TAAGACACAA GGATCTAGGC GAAGTAGCCG CCAAATAAAA AAACGAAGGG 900 TCATATCAGA TTCTGAGAGT GACATTGGTG GCTCTGATGT GGAATTTAAG CCAGACACTA 960 AGGAGGAAGG AAGCAGTGAT GAAATAAGCA GTGGAGTGGG GGATAGTGAG AGTGAAGGCC 1020 TGAACAGCCC TGTCAAAGTT GCTCGAAAGC GGAAGAGAAT GGTGACTGGA AATGGCTCTC 1080 TTAAAAGGAA AAGCTCTAGG AAGGAAACGC CCTCAGCCAC CAAACAAGCA ACTAGCATTT 1140 CATCAGAAAC CAAGAATACT TTGAGAGCTT TCTCTGCCCC TCAAAATTCT GAATCCCAAG 1200 CCCACGTTAG TGGAGGTGGT GATGACAGTA GTCGCCCTAC TGTTTGGTAT CATGAAACTT 1260 TAGAATGGCT TAAGGAGGAA AAGAGAAGAG ATGAGCACAG GAGGAGGCCT GATCACCCCG 1320 ATTTTGATGC ATCTACACTC TATGTGCCTG AGGATTTCCT CAATTCTTGT ACTCCTGGGA 1380 TGAGGAAGTG GTGGCAGATT AAGTCTCAGA ACTTTGATCT TGTCATCTGT TACAAGGTGG 1440 GGAAATTTTA TGAGCTGTAC CACATGGATG CTCTTATTGG AGTCAGTGAA CTGGGGCTGG 1500 TATTCATGAA AGGCAACTGG GCCCATTCTG GCTTTCCTGA AATTGCATTT GGCCGTTATT 1560 CAGATTCCCT GGTGCAGAAG GGCTATAAAG TAGCACGAGT GGAACAGACT GAGACTCCAG 1620 AAATGATGGA GGCACGATGT AGAAAGATGG CACATATATC CAAGTATGAT AGAGTGGTGA 1680 GGAGGGAGAT CTGTAGGATC ATTACCAAGG GTACACAGAC TTACAGTGTG CTGGAAGGTG 1740 ATCCCTCTGA GAACTACAGT AAGTATCTTC TTAGCCTCAA AGAAAAAGAG GAAGATTCTT 1800 CTGGCCATAC TCGTGCATAT GGTGTGTGCT TTGTTGATAC TTCACTGGGA AAGTTTTTCA 1860 TAGGTCAGTT TTCAGATGAT CGCCATTGTT CGAGATTTAG GACTCTAGTG GCACACTATC 1920 CCCCAGTACA AGTTTTATTT GAAAAAGGAA ATCTCTCAAA GGAAACTAAA ACAATTCTAA 1980 AGAGTTCATT GTCCTGTTCT CTTCAGGAAG GTCTGATACC CGGCTCCCAG TTTTGGGATG 2040 CATCCAAAAC TTTGAGAACT CTCCTTGAGG AAGAATATTT TAGGGAAAAG CTAAGTGATG 2100 GCATTGGGGT GATGTTACCC CAGGTGCTTA AAGGTATGAC TTCAGAGTCT GATTCCATTG 2160 GGTTGACACC AGGAGAGAAA AGTGAATTGG CCCTCTCTGC TCTAGGTGGT TGTGTCTTCT 2220 ACCTCAAAAA ATGCCTTATT GATCAGGAGC TTTTATCAAT GGCTAATTTT GAAGAATATA 2280 TTCCCTTGGA TTCTGACACA GTCAGCACTA CAAGATCTGG TGCTATCTTC ACCAAAGCCT 2340 ATCAACGAAT GGTGCTAGAT GCAGTGACAT TAAACAACTT GGAGATTTTT CTGAATGGAA 2400 CAAATGGTTC TACTGAAGGA ACCCTACTAG AGAGGGTTGA TACTTGCCAT ACTCCTTTTG 2460 GTAAGCGGCT CCTAAAGCAA TGGCTTTGTG CCCCACTCTG TAACCATTAT GCTATTAATG 2520 ATCGTCTAGA TGCCATAGAA GACCTCATGG TTGTGCCTGA CAAAATCTCC GAAGTTGTAG 2580 AGCTTCTAAA GAAGCTTCCA GATCTTGAGA GGCTACTCAG TAAAATTCAT AATGTTGGGT 2640 CTCCCCTGAA GAGTCAGAAC CACCCAGACA GCAGGGCTAT AATGTATGAA GAAACTACAT 2700 ACAGCAAGAA GAAGATTATT GATTTTCTTT CTGCTCTGGA AGGATTCAAA GTAATGTGTA 2760 AAATTATAGG GATCATGGAA GAAGTTGCTG ATGGTTTTAA GTCTAAAATC CTTAAGCAGG 2820 TCATCTCTCT GCAGACAAAA AATCCTGAAG GTCGTTTTCC TGATTTGACT GTAGAATTGA 2880 ACCGATGGGA TACAGCCTTT GACCATGAAA AGGCTCGAAA GACTGGACTT ATTACTCCCA 2940 AAGCAGGCTT TGACTCTGAT TATGACCAAG CTCTTGCTGA CATAAGAGAA AATGAACAGA 3000 GCCTCCTGGA ATACCTAGAG AAACAGCGCA ACAGAATTGG CTGTAGGACC ATAGTCTATT 3060 GGGGGATTGG TAGGAACCGT TACCAGCTGG AAATTCCTGA GAATTTCACC ACTCGCAATT 3120 TGCCAGAAGA ATACGAGTTG AAATCTACCA AGAAGGGCTG TAAACGATAC TGGACCAAAA 3180 CTATTGAAAA GAAGTTGGCT AATCTCATAA ATGCTGAAGA ACGGAGGGAT GTATCATTGA 3240 AGGACTGCAT GCGGCGACTG TTCTATAACT TTGATAAAAA TTACAAGGAC TGGCAGTCTG 3300 CTGTAGAGTG TATCGCAGTG TTGGATGTTT TACTGTGCCT GGCTAACTAT AGTCGAGGGG 3360 GTGATGGTCC TATGTGTCGC CCAGTAATTC TGTTGCCGGA AGATACCCCC CCCTTCTTAG 3420 AGCTTAAAGG ATCACGCCAT CCTTGCATTA CGAAGACTTT TTTTGGAGAT GATTTTATTC 3480 CTAATGACAT TCTAATAGGC TGTGAGGAAG AGGAGCAGGA AAATGGCAAA GCCTATTGTG 3540 TGCTTGTTAC TGGACCAAAT ATGGGGGGCA AGTCTACGCT TATGAGACAG GCTGGCTTAT 3600 TAGCTGTAAT GGCCCAGATG GGTTGTTACG TCCCTGCTGA AGTGTGCAGG CTCACACCAA 3660 TTGATAGAGT GTTTACTAGA CTTGGTGCCT CAGACAGAAT AATGTCAGGT GAAAGTACAT 3720 TTTTTGTTGA ATTAAGTGAA ACTGCCAGCA TACTCATGCA TGCAACAGCA CATTCTCTGG 3780 TGCTTGTGGA TGAATTAGGA AGAGGTACTG CAACATTTGA TGGGACGGCA ATAGCAAATG 3840 CAGTTGTTAA AGAACTTGCT GAGACTATAA AATGTCGTAC ATTATTTTCA ACTCACTACC 3900 ATTCATTAGT AGAAGATTAT TCTCAAAATG TTGCTGTGCG CCTAGGACAT ATGGCATGCA 3960 TGGTAGAAAA TGAATGTGAA GACCCCAGCC AGGAGACTAT TACGTTCCTC TATAAATTCA 4020 TTAAGGGAGC TTGTCCTAAA AGCTATGGCT TTAATGCAGC AAGGCTTGCT AATCTCCCAG 4080 AGGAAGTTAT TCAAAAGGGA CATAGAAAAG CAAGAGAATT TGAGAAGATG AATCAGTCAC 4140 TACGATTATT TCGGGAAGTT TGCCTGGCTA GTGAAAGGTC AACTGTAGAT GCTGAAGCTG 4200 TCCATAAATT GCTGACTTTG ATTAAGGAAT TATAGACTGA CTACATTGGA AGCTTTGAGT 4260 TGACTTCTGA CAAAGGTGGT AAATTCAGAC AACATTATGA TCTAATAAAC TTTATTTTTT 4320 AAAAATGACC ATTTTTCCAT TTTCTTTCTA GGAAATTAAA CCCTTTTAAT TCTTATCTAC 4380 CTTCTACATA ATGGTTATTG AATACTCCAC AATATATTAA GTCTAGATGT TATGGTACAT 4440 GCATACACTT TCAGGCTGTT TTATACCCAC TGTCACCAAT ACACATAAAT GGGGGAGGAA 4500 AAGCTATGAA ACTGTATAGG GCTGTATATA TACTTGTCTC AGCTTAATGC AGGAAATTGG 4560 TTTAATTTCC AGCAGTTTTG TCTAAACTGT TCAAAAAAAA ACTATGAACA GAGTTCAAAT 4620 ACAGGACTGT TTGTTTTGAA GAGACTTTCT AAAGTGTACT TAAAACATAG TAGTTTTTTA 4680 CCTTTCACAA AACTGAGTTA CAAGAATACT TTTGTTTTAC AGTGCATCCC TTCCTAGGAA 4740 GTCTCATTAA AACACTCACT TTTTCTAGGG GTGATTTTGA ATGCTGCACA GGGAAGGGAA 4800 GGAAATAATA GTCTTAACTT TTCTTAAAGG ATACCAGAAA CATTGCTGGA TATAATTTAA 4860 GATTAGTGTT TTCTCTTTCA TAGAAAGAAC GTACATACTG GGACATGAGT ACAGTTACAG 4920 CAAGTCTAGG TGTGCTAACA AAACAGGGCA CATTCAAGTA CAGTAAGATT TTGCTTGAAA 4980 TTAAAAACAA ACTACATGAG ATTAAAGCAT TAAAATCATA TTTCTCAATC TGAATACATG 5040 TTAAAAAAAA AAAATCAAAA GGAACGCAGA AGTGCTAGCT CACATTTTTA CCATATTACA 5100 AAAGCAATTG GTACCCATGT CCATAAAGGC AGCAACAAAG CTGCTTGTCT ATTGAAGATT 5160 ACTACTGCAA ATTGGACTGC ATTCAATGCT AGTTGTAAAA ACACCAGCTT TTCAGAAGTT 5220 GGTATCTGTA CAAAATTGCA GCTTATTTTC TTCACTTCTG TCCCTTCAAG TCTTTACACA 5280 GTAATGCTAA AACACCCAGC TTTGAGATCC TGAGTCAATA TATTGCCACT TTCTTTTTGG 5340 TAGCTTGAGC TTCATAGTGT CAACTGACCT TGTGTATCCA TTTTTAATAC AGTCTCTTCC 5400 TGTAGCATGG GCAAATATTT TAAATCTTCT TCCAAAAAAG TGTTTTAAGT TATGATGTTA 5460 CAATGGCAGG ACTTTTTCTT TAGGGAAGGA ATTCAGTTGT GCTGCAATGT ATTAGATTCT 5520 ATAGGTGGAG CAGAGTCATA TAGTGTATCT GTATCATGTG TAGGCTCACC AGCTAATGTA 5580 CAAGGATTAG ACAGTGTTCC AGCACCACAG TCACAGAAAA ACCTAAAGCA AAATGAAACC 5640 CAAATATTAG AAAAGTGAGG GGGAAAGTAA TTGGGTAATA TATCAAGCAA GTGTGCTACA 5700 TACCTATCAT GTCTAATAAA CTCTACATCA TGTCCCTGAT GGCACTTCTT AATGCAGTTC 5760 ACACATATGG CATTTCGATC TGTGGTGTTA CAAGTATGAC ATCTAAAAAG CAAAAGCTTA 5820 AATTACTTTT CTCAAACATG TCATTAATGC AAAACATTCC ATTCTGTTTA TATATTACTA 5880 TGACCTTTGG CTTTAAGAGG ACCAAAACAA AATTCTTTGT GGCTCCAGCC CAGATTAATT 5940 CTGAAAAGGA ACTTTAATGG AGTAAGTGAT TTTCCTGTCA TCTGTGTCTT CGGAGGGAAG 6000 AGAAATGATT TGTAAATTGT ATAAAGGCAG TTCTTTCCAC TTTAAAAGCC TCTCAAATGT 6060 TTCTGGGCTG AAAACAATTT TTGGAGGCGT GAAGAGTCAA AACTGTCACA GTGACTGGGA 6120 TATATCAAAC ACTTAACCCC GACATCTTTA CCTTGAAATT TCTAGGAAAA CATTACACAA 6180 CATGAGTTAC ATGAATGACA TCAGTTACTG TAGCATTAGG TTTTTCCATA GTTATGGTCT 6240 TTGTTTTGTT TTGTAGAGAC AGGGTCTCCC TATGTTGCCC AGGCTGGTTT AGAACTCCTG 6300 GGCTCCAGTG ATCCTCCCAC TTCAGCCTCC CAAAGTGCTA GGATTACAGG CATAAGCCAC 6360 CACGCCTACC CACAGTTACA GTCTTAAACA CGATCTTCAA GTAGATTGAT GATAAAATTT 6420 TCAGTTAGTT ATAGTCTCAA CACCGGCAAA TAGCCAAAAA TGCTAGGCAT TGCTAATTTA 6480 AAAAGGAAAT CAGTCTTCCT CTTTTCAGGA CTCAAATATA TTTCTAAGTT ACCTGTAGAA 6540 ATCATGCATG GGATAGCTGG TATAACTTGA TATTTTATAT AAACATTGGC CTCTACTAAC 6600 AGCCTTTTCT ATGGCATCTT GATTGTTCAT TATTTTGTTA TCTGTAATAA AAGAAAGAAT 6660 AAGTAAAAAT TCAGAGGAAT GTTAATATTT TAAAAACCAA AGATTATAGG ATTATTCTAA 6720 CAGAAGAGCC ACTATTTTTA AGAGCTTTAA ATGAAGCTAA CCAATGAAGT AATTGTAAGA 6780 AATCAGCTAA GAATAGAATT TTCCTTGTAT AAGATACTCC AACCATTTAG AACCAAAGCT 6840 CTGTTTCTTT CAAAATCTAT CTTAAACTGT TGCTAACTTG GAGAGTGACA TAAGGAATCA 6900 AGTTATAAAA CGGCTTCTGA TTATCTTTCA TGGCATATTG CATATATTTA TAGGTATAGC 6960 AGACTCCAAC ATACCTTTCA TTGTCACATT AACACCAGAT GCTAAAAATA AGCCTCCAAA 7020 CCGGTTGTTA AAAATCTGAT TGCCTTCTAG TGTTGCAGTT GCGTGATTTG TAATTTCAAT 7080 ACCTGAAGTA AAATTTACAA ACAAGTAGAT ACATCACTTT ATACTGCTTC TTAAAAACCT 7140 GAAATTAGCA AGCAAATGTA AACTGCTTCT TTTATAGAAG TACATTAACC CTCTTAATGT 7200 CTACTGAATA AAATGTAGAT ACCTATTTCA ACCACCAACA GTAACATTCA CTTATCAATG 7260 ACTATGGTCA AAACTGCAAT TAACTTTCGC ACCAACCTAA CTGTCTTAAA GTTTAAATAC 7320 ATGATACTTG GATTTCATTT GCATCCATTT TAACATCTCT TTTTCTGTTG CAGATTTAAA 7380 CTGGTAAATT CATCTGAGGA ATTGAATCTA TCTGTATTCC TAGTGGTAAT ACAAGCCTGC 7440 ATTTATTCTA TCCCAATAAA TGTTTCATAA TCACGA 7477 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Keyword | KW-0002--3D-structure |
||||||||||||
Interpro | IPR007695--DNA_mismatch_repair_MutS-lik_N |
||||||||||||
PROSITE | |||||||||||||
Pfam | PF01624--MutS_I |
||||||||||||
Gene Ontology | GO:0005737--C:cytoplasm |
||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |