WERAM Information
Tag | Content
---|---
WERAM ID | WERAM-Pat-0096
Ensembl Protein ID | ENSPTRP00000020434.4
Gene Name | MSH6
Ensembl Information |
Status | Unreviewed
Classification |
Organism | Pan troglodytes
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
Protein Sequence (Fasta) | ASARASREGG RAAAAPGASP SPGGDAAWSE AGPGPRPLAR SASPPKAKNL NGGLRRSAAP 60 AAPTSCDFSP GDLVWAKMEG YPWWPCLVYN HPFDGTFIRE KGKSVRVHVQ FFDDSPTRGW 120 VSKRLLKPYT GSKSKEAQKG GHFYSAKPEI LRAMQRADEA LNKDKIKRLE LAVCDEPSEP 180 EEEEEMEVGT TYVTDKSEED NEIESEEEVQ PKTQGSRRSS RQIKKRRVIS DSESDIGGSD 240 VEFKPDTKEE GSSDEISSGV GDSESEGPNS PVKVARKRKR MVTGNGSLKR KSSRKETSSA 300 TKQATSISSE TKNTLRAFSA PQNSESQSHV SGGGDDSSRP TVWYHETLEW LKEEKRRDEH 360 RRRPDHPDFD ASTLYVPEDF LNSCTPGMRK WWQIKSQNFD LVICYKVGKF YELYHMDALI 420 GVSELGLVFM KGNWAHSGFP EIAFGRYSDS LVQKGYKVAR VEQTETPEMM EARCRKMAHI 480 SKYDRVVRRE ICRIITKGTQ TYSVLEGDPS ENYSKYLLSL KEKEEDSSGH TRAYGVCFVD 540 TSLGKFFIGQ FSDDRHCSRF RTLVAHYPPV QVLFEKGNLS KETKTILKSS LSSSLQEGLI 600 PGSQFWDASK TLRTLLEEEY FREKLSDGIG VMLPQVLKGM TSESDSIGLT PGEKSELALS 660 ALGGCVFYLK KCLIDQELLS MANFEEYIPL DSDTVSTTRS GAIFTKAYQR MVLDAVTLNN 720 LEIFLNGTNG STEGTLLERV DTCHTPFGKR LLKQWLCAPL CNPYAINDRL DAIEDLMVVP 780 DKISEVVELL KKLPDLERLL SKIHNVGSPL KSQNHPDSRA IMYEETTYSK KKIIDFLSAL 840 EGFKVMCKII GIMEEVADGF KSKVLKQVIS LQTKNPEGRF PDLTVELNRW DTAFDHEKAR 900 KTGLITPKAG FDSDYDQALA DIRENEQSLL EYLEKQRNRI GCRTIVYWGI GRNRYQLEIP 960 ENFTTRNLPE EYELKSTKKG CKRYWTKTIE KKLANLINAE ERRDVSLKDC MRRLFYNFDK 1020 NYKDWQSAVE CIAVLDVLLC LANYSRGGDG PMCRPVILLP EDTPPFLELK GSRHPCITKT 1080 FFGDDFIPND ILIGCEEEEQ ENGKAYCVLV TGPNMGGKST LMRQAGLLAV MAQMGCYVPA 1140 EVCRLTPIDR VFTRLGASDR IMSGESTFFV ELSETASILM HATAHSLVLV DELGRGTATF 1200 DGTAIANAVV KELAETIKCR TLFSTHYHSL VEDYSQNVAV RLGHMACMVE NECEDPSQET 1260 ITFLYKFIKG ACPKSYGFNA ARLANLPEEV IQKGHRKARE FEKMNQSLRL FREVCLASER 1320 STVDAEAVHK LLTLIKEL 1338 |
Nucleotide Sequence (Fasta) | GATCCAACAG CCTCGGCCAG GGCCTCACGC GAAGGCGGCC GTGCCGCCGC TGCCCCCGGG 60 GCCTCTCCTT CCCCAGGCGG GGATGCGGCC TGGAGCGAGG CTGGGCCTGG GCCCAGGCCC 120 TTGGCGCGCT CCGCGTCGCC GCCCAAGGCG AAGAACCTCA ACGGAGGGCT GCGGAGATCG 180 GCAGCGCCTG CTGCCCCCAC CAGTTGTGAC TTCTCACCAG GAGATTTGGT TTGGGCCAAG 240 ATGGAGGGTT ACCCCTGGTG GCCTTGTCTG GTTTACAACC ACCCCTTTGA TGGAACATTC 300 ATCCGCGAGA AAGGGAAATC AGTCCGTGTT CATGTACAGT TTTTTGATGA CAGCCCAACA 360 AGGGGCTGGG TTAGCAAAAG GCTTTTAAAG CCATATACAG GTTCAAAATC AAAGGAAGCC 420 CAGAAGGGAG GTCATTTTTA CAGTGCAAAG CCTGAAATAC TGAGAGCAAT GCAACGTGCA 480 GACGAAGCCT TAAATAAAGA CAAGATTAAG AGGCTTGAAT TGGCAGTTTG TGATGAGCCC 540 TCAGAGCCAG AAGAGGAAGA AGAGATGGAG GTAGGCACAA CTTACGTAAC AGATAAGAGT 600 GAAGAAGATA ATGAAATTGA GAGTGAAGAG GAAGTACAGC CTAAGACACA AGGATCTAGG 660 CGAAGTAGCC GCCAAATAAA AAAACGAAGG GTCATATCAG ATTCTGAGAG TGACATTGGT 720 GGCTCTGATG TGGAATTTAA GCCAGACACT AAGGAGGAAG GAAGCAGTGA TGAAATAAGC 780 AGTGGAGTGG GGGATAGTGA GAGTGAAGGC CCGAACAGCC CTGTCAAAGT TGCTCGAAAG 840 CGGAAGAGAA TGGTGACTGG AAATGGCTCT CTTAAAAGGA AAAGCTCTAG GAAGGAAACG 900 TCCTCAGCCA CCAAACAAGC AACTAGCATT TCATCAGAAA CCAAGAATAC TTTGAGAGCT 960 TTCTCTGCCC CTCAAAATTC TGAATCCCAA TCCCACGTTA GTGGAGGTGG TGATGACAGT 1020 AGTCGCCCTA CTGTTTGGTA TCATGAAACT TTAGAATGGC TTAAGGAGGA AAAGAGAAGA 1080 GATGAGCACA GGAGGAGGCC TGATCACCCC GATTTTGATG CATCTACACT CTATGTGCCT 1140 GAGGATTTCC TCAATTCTTG TACTCCTGGG ATGAGGAAGT GGTGGCAGAT TAAGTCTCAG 1200 AACTTTGATC TTGTCATCTG TTACAAGGTG GGGAAATTTT ATGAGCTGTA CCACATGGAT 1260 GCTCTTATTG GAGTCAGTGA ACTGGGGCTG GTATTCATGA AAGGCAACTG GGCCCATTCT 1320 GGCTTTCCTG AAATTGCATT TGGCCGTTAT TCAGATTCCC TGGTGCAGAA GGGCTATAAA 1380 GTAGCACGAG TGGAACAGAC TGAGACTCCA GAAATGATGG AGGCACGATG TAGAAAGATG 1440 GCACATATAT CCAAGTATGA TAGAGTGGTG AGGAGGGAGA TCTGTAGGAT CATTACCAAG 1500 GGTACACAGA CTTACAGTGT GCTGGAAGGT GATCCCTCTG AGAACTACAG TAAGTATCTT 1560 CTTAGCCTCA AAGAAAAAGA GGAAGATTCT TCTGGCCATA CTCGTGCATA TGGTGTGTGC 1620 TTTGTTGATA CTTCGCTGGG AAAGTTTTTC ATAGGTCAGT TTTCAGATGA TCGCCATTGT 
1680 TCGAGATTTA GGACTCTAGT GGCACACTAT CCCCCAGTAC AAGTCTTATT TGAAAAAGGA 1740 AATCTCTCAA AGGAAACTAA AACAATTCTA AAGAGTTCAT TGTCCTCTTC TCTTCAGGAA 1800 GGTCTGATAC CCGGCTCCCA GTTTTGGGAT GCATCCAAAA CTTTGAGAAC TCTCCTTGAG 1860 GAAGAATATT TTAGGGAAAA GCTAAGTGAT GGCATTGGGG TGATGTTACC CCAGGTGCTT 1920 AAAGGTATGA CTTCAGAGTC TGATTCCATT GGGTTGACAC CAGGAGAGAA AAGTGAATTG 1980 GCCCTCTCTG CTCTAGGTGG TTGTGTCTTC TACCTCAAAA AATGCCTTAT TGATCAGGAG 2040 CTTTTATCAA TGGCTAATTT TGAAGAATAT ATTCCCTTGG ATTCTGACAC AGTCAGCACT 2100 ACAAGATCTG GTGCTATCTT CACCAAAGCC TATCAACGAA TGGTGCTAGA TGCAGTGACA 2160 TTAAACAACT TGGAGATTTT TCTGAATGGA ACAAATGGTT CTACTGAAGG AACCTTACTA 2220 GAGAGGGTTG ATACTTGCCA TACTCCTTTT GGTAAGCGGC TCCTAAAGCA ATGGCTTTGT 2280 GCCCCACTCT GTAACCCTTA TGCTATTAAT GATCGTCTAG ATGCCATAGA AGACCTCATG 2340 GTTGTGCCTG ACAAAATCTC CGAAGTTGTA GAGCTTCTAA AGAAGCTTCC AGATCTTGAG 2400 AGACTACTCA GTAAAATTCA TAATGTTGGG TCTCCCCTGA AGAGTCAGAA CCACCCAGAC 2460 AGCAGGGCTA TAATGTATGA AGAAACTACA TACAGCAAAA AGAAGATTAT TGATTTTCTT 2520 TCTGCTCTGG AAGGATTCAA AGTAATGTGT AAAATTATAG GGATCATGGA AGAAGTTGCT 2580 GATGGTTTTA AGTCTAAAGT CCTTAAGCAG GTCATCTCTC TGCAGACAAA AAATCCTGAA 2640 GGTCGTTTTC CTGATTTGAC TGTAGAATTG AACCGATGGG ATACAGCCTT TGACCATGAA 2700 AAGGCTCGAA AGACTGGACT TATTACTCCC AAAGCAGGCT TTGACTCTGA TTATGACCAA 2760 GCTCTTGCTG ACATAAGAGA AAATGAACAG AGCCTCCTGG AATACCTAGA GAAACAGCGC 2820 AACAGAATTG GCTGTAGGAC CATAGTCTAT TGGGGGATTG GTAGGAACCG TTACCAGCTG 2880 GAAATTCCTG AGAATTTCAC CACTCGCAAT TTGCCAGAAG AATACGAGTT GAAATCTACC 2940 AAGAAGGGCT GTAAACGATA CTGGACCAAA ACTATTGAAA AGAAGTTGGC TAATCTCATA 3000 AATGCTGAAG AACGGAGAGA TGTATCATTG AAGGACTGCA TGCGGCGACT GTTCTATAAC 3060 TTTGATAAAA ATTACAAGGA CTGGCAGTCT GCTGTAGAGT GTATCGCAGT GTTGGATGTT 3120 TTATTGTGCC TGGCTAACTA TAGTCGAGGG GGTGATGGTC CTATGTGTCG CCCAGTAATT 3180 CTGTTGCCAG AAGATACCCC CCCCTTCTTA GAGCTTAAAG GATCACGCCA TCCTTGCATT 3240 ACGAAGACTT TTTTTGGAGA TGATTTTATT CCTAATGACA TTCTAATAGG CTGTGAGGAA 3300 GAGGAGCAGG AAAATGGCAA AGCCTATTGT GTGCTTGTTA CTGGACCAAA TATGGGGGGC 3360 
AAGTCTACGC TTATGAGACA GGCTGGCTTA TTAGCTGTAA TGGCCCAGAT GGGTTGTTAC 3420 GTCCCTGCTG AAGTGTGCAG GCTCACACCA ATTGATAGAG TGTTTACTAG ACTTGGCGCC 3480 TCAGACAGAA TAATGTCAGG TGAAAGTACA TTTTTTGTTG AATTAAGTGA AACTGCCAGC 3540 ATACTCATGC ATGCAACAGC ACATTCTCTG GTGCTTGTGG ATGAATTAGG AAGAGGTACT 3600 GCAACATTTG ATGGGACAGC AATAGCAAAT GCAGTTGTTA AAGAACTTGC TGAGACTATA 3660 AAATGTCGTA CATTATTTTC AACTCACTAC CATTCATTAG TAGAAGATTA TTCTCAAAAT 3720 GTTGCTGTGC GCCTAGGACA TATGGCATGC ATGGTAGAAA ATGAATGTGA AGACCCCAGC 3780 CAGGAGACTA TTACCTTCCT CTATAAATTC ATTAAGGGAG CTTGTCCTAA AAGCTATGGC 3840 TTTAATGCAG CAAGGCTTGC TAATCTCCCA GAGGAAGTTA TTCAAAAGGG ACATAGAAAA 3900 GCAAGAGAAT TTGAGAAGAT GAATCAGTCA CTACGATTAT TTCGGGAAGT TTGCCTGGCT 3960 AGTGAAAGGT CAACTGTAGA TGCTGAAGCT GTCCATAAAT TGCTGACTTT GATTAAGGAA 4020 TTATAGACTA CATTGGAAGC TTTGAGTTGA CTTCTGACAA AGGTGGTAAA TTCAGACAAC 4080 ATTATGATCT AATAAACTTT ATTTTTTAAA AATG 4115 |
Sequence Source | Ensembl
Orthology |
Created Date | 25-Jun-2016 |
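
The sequence fields in this record are laid out in blocks of ten characters, with a running position count (60, 120, ...) after every sixth block and a final count at the end (1338 for the protein, 4115 for the nucleotide sequence). The following is a minimal sketch of how such a field might be turned back into a plain sequence string. The function name `parse_numbered_sequence` is hypothetical and not part of WERAM or Ensembl, and the sketch assumes the counters are the only all-digit tokens in the field.

```python
from typing import Optional, Tuple


def parse_numbered_sequence(field: str) -> Tuple[str, Optional[int]]:
    """Strip running position counters from a block-formatted sequence field.

    Hypothetical helper, assuming the layout used in this record:
    whitespace-separated blocks of letters interleaved with all-digit
    counters, possibly followed by table pipe characters.
    Returns the joined sequence and the last counter seen (or None).
    """
    # Table delimiters may trail the field; treat them as whitespace.
    tokens = field.replace("|", " ").split()

    # Letter-only tokens are sequence blocks; digit-only tokens are counters.
    blocks = [t for t in tokens if t.isalpha()]
    counters = [int(t) for t in tokens if t.isdigit()]

    sequence = "".join(blocks).upper()
    declared_length = counters[-1] if counters else None
    return sequence, declared_length


if __name__ == "__main__":
    # First row of the protein sequence field from this record.
    row = "ASARASREGG RAAAAPGASP SPGGDAAWSE AGPGPRPLAR SASPPKAKNL NGGLRRSAAP 60"
    seq, declared = parse_numbered_sequence(row)
    # Comparing the two values is a quick check that no block was lost.
    print(len(seq), declared)
```

Comparing the length of the returned string with the final counter gives a simple integrity check, which is useful here because the nucleotide field is wrapped across several lines in the flattened table.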