WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Cap-0004 | ||||||||||||
Ensembl Protein ID | ENSCPOP00000000836.2 | ||||||||||||
Gene Name | MSH6 | ||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Cavia porcellus | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLYSF FPKSPALNNA NKGPARSSRE GGSGAARGAS PSGGDAARRD AGPGPQPTAA 60 SVSPPEGKNL NGGLRRSTAP AASASNSSCD FSPGDLVWAK MEGYPWWPCL VYNHPFDGTC 120 IREKGKSVRV HVQFFDDSPT RGWVSKRLLK PYTVGSKSKD AQKGGHFYSS KPEILRAMQR 180 ADEALNKNKT ERLELAVCDE PSEPEEEMEV GATYASEKSE EDNEIESEEE VRPKMQGSRR 240 SSRQRVMSDS ESDIGGSDVE FKPDTKEEGS SDEISSGAGD SESEGLDSPV KAAPKRKKMG 300 IGKIGFKKNS SRKETPSVTK RATGILSETK NTSSAFSVPQ NSESQAQGSG GGDDSCGPTV 360 WYHETLEWLK KEKRRDEHRR RPDHPDFDPS TLYVPEDFLN SCTPGMRKWW QIKSQNFDLV 420 IFYKVGKFYE LYHMDALVGV SELGLVFMKG NWAHSGFPEI AFGRYSDSLV QKGYKVARVE 480 QTETPEMMEA RCRKMAHISK YDKVVRREIC RIITKGTQTY SVLEGDPSEN YSKYLLSLKE 540 KEDSSGHVRL YGVCFVDTSL GKFFIGQFSD DRHCSRFRTL VAHYPPVQVL FEKGNLSVET 600 KTILKGTLSS SLQEGLIPGS QFWDAAKTLR TLLEEGYFTE KLNEDTGVML PSVLKGMTSE 660 SDSIGLTPGE NSELALSALG GCVFYLKKCL IDQELLSMAN FEEYIPLDSD MVSVRPGAIF 720 TKANQRMVLD AVTLNNLEIF LNGTNGSTEG TLLERIDTCY TPFGKRLLKQ WLCAPLCSPF 780 AINDRLDAVE DLMDVPDKIS EVTDLLKKLP DLERLLSKIH NVGSPLKSQN HPDSRAIMYE 840 ETTYSKKKII DFLSALEGFK VMCKIIEIME EVVDGFKSKI LKQVVTLQIK NPEGRFPDLT 900 TELNRWDTAF DHEKARRTGL ITPKAGFDSD YDQALADIRE NEQSLLEYLD KQRSRIGCRT 960 IVYWGIGRNR YQLEIPENFT IHNLPEEYEL KSTKKGCKRY WTKTIEKKLA NLINAEERRD 1020 VSLKDCMRRL FYNFDKNYKD WQCAVECIAV LDVLLCLANY SQGGDGPMCR PALLLPGEHN 1080 PPFLELRGSR HPCITKTFFG DDFIPNDILI GCEEQQEDGR AYCVLVTGPN MGGKSTLIRQ 1140 AGLLAVMAQM GCYVPAEMCR LTPVDRVFTR LGASDRIMSG ESTFFVELSE TASILRHATA 1200 HSLVLVDELG RGTATFDGTA IASAVVKELA ETIRCRTLFS THYHSLVEDY SKNVAVRLGH 1260 MACMVENECE DPSQETITFL YKFVQGACPK SYGFNAARLA HLPEEVIQKG HRKAREFEKM 1320 NQSLRLFREV CLASERSSID AEALHKLLTL IKKL 1354 |
||||||||||||
Nucleotide Sequence (Fasta) | ATGTCGCGAC AGAGTACTCT GTACAGCTTC TTCCCCAAGT CTCCGGCGTT AAACAATGCC 60 AACAAGGGCC CAGCTAGGTC CTCGCGCGAA GGCGGCAGCG GCGCGGCCCG CGGGGCCTCC 120 CCCTCTGGCG GCGATGCGGC CCGGAGAGAC GCTGGGCCTG GGCCCCAGCC TACGGCGGCC 180 TCCGTGTCGC CGCCCGAGGG CAAGAACCTG AACGGAGGGC TGCGGCGGTC GACAGCTCCT 240 GCGGCCTCCG CTAGCAACAG TTCCTGTGAC TTCTCACCAG GTGATTTGGT CTGGGCCAAG 300 ATGGAGGGCT ACCCCTGGTG GCCTTGCCTG GTTTACAACC ACCCCTTTGA TGGAACATGC 360 ATCCGTGAGA AAGGGAAGTC AGTTCGGGTT CATGTACAGT TTTTTGATGA CAGCCCAACA 420 AGGGGCTGGG TTAGCAAAAG GCTGCTAAAA CCCTACACAG TAGGTTCAAA ATCTAAGGAC 480 GCCCAGAAGG GAGGTCACTT TTATAGTTCA AAGCCTGAGA TACTCAGAGC AATGCAACGT 540 GCAGATGAAG CCTTAAATAA AAACAAGACT GAAAGACTGG AGTTGGCAGT GTGTGATGAG 600 CCCTCAGAGC CAGAAGAAGA GATGGAGGTA GGTGCAACTT ATGCATCAGA GAAGAGTGAA 660 GAAGATAATG AAATTGAGAG TGAAGAAGAA GTGCGGCCTA AGATGCAAGG ATCTAGGAGA 720 AGTAGTCGTC AAAGGGTCAT GTCAGACTCT GAGAGTGACA TTGGTGGCTC TGATGTGGAA 780 TTCAAGCCAG ACACTAAGGA GGAAGGAAGC AGTGATGAGA TAAGCAGTGG AGCTGGGGAC 840 AGTGAGAGCG AAGGCTTGGA CAGCCCTGTC AAAGCTGCTC CAAAGCGGAA GAAAATGGGA 900 ATTGGAAAGA TTGGTTTTAA AAAGAATAGT TCAAGGAAGG AAACACCTTC AGTTACCAAG 960 CGAGCGACTG GCATTTTATC AGAGACCAAA AATACTTCGA GTGCTTTCTC TGTCCCTCAA 1020 AATTCTGAAT CTCAAGCCCA AGGTAGTGGA GGAGGTGATG ACAGTTGTGG CCCGACTGTC 1080 TGGTATCATG AAACTTTAGA ATGGCTGAAG AAGGAAAAGA GAAGAGATGA GCACAGGAGG 1140 CGCCCTGATC ACCCTGATTT TGATCCATCC ACACTTTATG TGCCTGAAGA TTTCCTTAAT 1200 TCGTGTACTC CTGGTATGAG GAAGTGGTGG CAGATTAAGT CTCAGAACTT TGATCTTGTC 1260 ATCTTTTATA AAGTAGGGAA ATTTTATGAA CTGTACCATA TGGATGCTCT TGTTGGAGTC 1320 AGTGAACTGG GGCTGGTATT TATGAAAGGC AACTGGGCCC ATTCTGGTTT TCCTGAAATA 1380 GCATTTGGCC GATACTCTGA TTCCCTTGTG CAGAAAGGCT ATAAAGTAGC ACGAGTGGAA 1440 CAAACTGAAA CACCGGAAAT GATGGAGGCA CGGTGCAGAA AGATGGCACA TATATCTAAG 1500 TATGACAAGG TGGTGCGGAG AGAGATTTGT AGGATCATTA CCAAGGGTAC ACAGACCTAT 1560 AGTGTGCTTG AAGGTGACCC TTCTGAGAAC TATAGTAAAT ATCTTCTTAG CCTCAAAGAA 1620 AAAGAAGATT CTTCTGGCCA CGTTCGACTA TATGGTGTGT GCTTTGTTGA CACTTCACTG 1680 GGAAAGTTTT TCATAGGTCA GTTTTCAGAC GATCGCCATT GTTCAAGATT TAGGACTCTT 1740 GTGGCACACT ATCCTCCAGT ACAAGTCTTG TTTGAGAAAG GAAATCTCTC AGTGGAAACT 1800 AAGACAATTC TGAAAGGAAC ATTATCCTCT TCTCTTCAGG AAGGTCTGAT ACCAGGCTCC 1860 CAATTTTGGG ATGCAGCCAA GACTTTGCGA ACTCTTCTTG AAGAAGGGTA CTTTACTGAA 1920 AAGCTAAATG AGGACACTGG GGTGATGTTA CCCTCGGTGC TTAAAGGTAT GACCTCAGAG 1980 TCCGATTCCA TTGGGTTGAC ACCAGGAGAG AACAGTGAAC TGGCTCTCTC TGCTCTAGGT 2040 GGTTGTGTCT TCTACCTCAA AAAATGCCTT ATTGATCAGG AGCTTTTATC AATGGCTAAT 2100 TTTGAAGAAT ACATACCTTT GGATTCTGAC ATGGTCAGTG TAAGACCTGG TGCTATTTTT 2160 ACTAAAGCCA ATCAACGAAT GGTGCTAGAT GCAGTGACAT TAAACAATTT AGAGATTTTT 2220 CTGAATGGGA CCAATGGTTC TACTGAAGGG ACCCTGTTAG AGAGGATTGA TACTTGCTAT 2280 ACTCCCTTTG GTAAGCGGCT CCTAAAGCAG TGGCTTTGTG CCCCACTCTG TAGTCCTTTT 2340 GCTATCAATG ATCGCTTAGA TGCAGTGGAA GATCTCATGG ATGTGCCTGA CAAAATCTCT 2400 GAAGTTACAG ACCTTCTAAA AAAGCTTCCA GATCTTGAGA GGTTATTGAG TAAAATTCAT 2460 AATGTTGGTT CTCCTCTAAA GAGCCAGAAC CACCCTGATA GTAGAGCTAT AATGTATGAG 2520 GAAACTACAT ACAGCAAGAA AAAGATCATT GATTTTCTTT CAGCCCTAGA AGGCTTCAAA 2580 GTAATGTGTA AAATTATAGA GATTATGGAA GAAGTTGTTG ATGGTTTTAA GTCTAAAATT 2640 CTTAAGCAAG TAGTTACTCT TCAGATCAAA AATCCTGAAG GACGCTTTCC TGATTTGACT 2700 ACAGAACTGA ATCGATGGGA TACAGCCTTT GACCATGAAA AGGCTCGAAG GACTGGACTG 2760 ATAACTCCCA AAGCAGGATT TGATTCTGAT TATGATCAAG CTCTTGCTGA CATAAGAGAA 2820 AATGAACAGA GCCTCCTAGA GTACCTAGAC AAACAGCGCA GTCGAATTGG CTGTAGGACT 2880 ATAGTCTACT GGGGGATTGG TAGAAACCGT TACCAGTTAG AAATTCCAGA GAATTTCACC 2940 ATCCATAATT TACCAGAAGA ATATGAACTG AAGTCTACCA AAAAGGGCTG TAAACGATAC 3000 TGGACCAAAA CTATTGAGAA GAAGTTGGCT AACCTGATAA ATGCTGAGGA ACGTAGAGAT 3060 GTGTCACTGA AGGACTGCAT GCGGCGACTG TTCTATAACT TTGACAAAAA TTACAAGGAC 3120 TGGCAATGTG CTGTTGAGTG CATCGCGGTG TTGGATGTCT TACTGTGCCT TGCTAACTAC 3180 AGTCAAGGGG GTGATGGTCC TATGTGTCGC CCAGCACTTC TGTTACCAGG AGAACACAAT 3240 CCACCTTTCT TAGAGCTTAG AGGATCACGA CACCCCTGTA TCACAAAGAC TTTTTTTGGG 3300 GATGATTTTA TTCCTAATGA CATCCTCATA GGCTGTGAAG AACAACAGGA AGACGGCAGA 3360 GCCTATTGTG TGCTTGTTAC TGGACCAAAT ATGGGGGGCA AGTCTACACT CATAAGACAG 3420 GCTGGCCTGT TGGCTGTAAT GGCCCAGATG GGCTGTTACG TCCCTGCGGA AATGTGCAGG 3480 CTCACACCAG TTGATAGAGT GTTTACTAGA CTTGGTGCCT CAGATAGAAT AATGTCTGGT 3540 GAAAGTACAT TTTTTGTTGA ATTGAGTGAA ACCGCCAGCA TACTTAGGCA TGCAACAGCA 3600 CATTCTCTGG TGCTTGTGGA TGAATTAGGA AGAGGTACTG CCACATTTGA CGGGACAGCA 3660 ATAGCAAGTG CAGTTGTTAA AGAACTTGCT GAGACCATAA GGTGTCGCAC ATTGTTTTCT 3720 ACCCACTACC ATTCATTAGT AGAGGATTAT TCTAAGAACG TTGCTGTGCG CCTAGGACAT 3780 ATGGCATGCA TGGTAGAAAA TGAATGTGAG GATCCCAGTC AGGAGACTAT TACCTTCCTT 3840 TATAAATTTG TTCAAGGAGC TTGTCCTAAG AGCTATGGCT TTAATGCAGC AAGACTTGCT 3900 CATCTCCCAG AAGAAGTTAT TCAAAAGGGA CATAGAAAAG CAAGAGAATT TGAGAAGATG 3960 AATCAGTCAC TACGACTATT TCGGGAAGTC TGCCTGGCTA GTGAAAGGTC GTCTATAGAT 4020 GCTGAAGCAC TCCATAAGTT GCTGACTTTG ATTAAGAAAT TA 4063 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |