WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Paa-0131 | ||||||||||||
Ensembl Protein ID | ENSPANP00000007626.1 | ||||||||||||
Gene Name | MSH6 | ||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Papio anubis | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAVPGA SPSLGEDAAW SEAGPGPRPS 60 ARSASPPKAK NLNGGLRRSA APAAPTSNSC DFSPGDLVWA KMEGYPWWPC LVYNHPFDGT 120 FIREKGKSVR VHVQFFDDSP TRGWVSKRLL KPYTGSKSKE AQKGGHFYSA KPEILRAMQR 180 ADEALNKDKI KRLELAVCDE PSEPEEEEEM EVGTTYVTDK SEEDNEIESE EEVQPKTQGS 240 RRSSRQIKKR RVISDSESDI GGSDVEFKPD TKEEGSSDEI SSGVGDSESE GPNSPVKVAR 300 KRKRVVTGNG SLKRKSSRKE MPSATKRATG ISSETKNTLS AFSAPQNSES QAHVSGGGDD 360 SSRPTVWYHE TLEWLKEEKR RDEHRRRPDH PDFNASTLYV PEDFLNSCTP GMRKWWQIKS 420 QNFDLVICYK VGKFYELYHM DALIGVSELG LVFMKGNWAH SGFPEIAFGR YSDSLVQKGY 480 KVARVEQTET PEMMEARCRK MAHISKYDRV VRREICRIIT KGTQTYSVLE GDPSENYSKY 540 LLSLKEKEED SSGHTRAYGV CFVDTSLGKF FIGQFSDDRH CSRFRTLVAH YPPVQVLFEK 600 GNLSKETKTI LKSSLSSSLQ EGLIPGSQFW DASKTLRTLL EEGYFREKLS DDIGVLLPQV 660 LKGMTSESDS IGLTPGEKSE LALSALGGCI FYLKKCLIDQ ELLSMANFEE YIPLDSDIVS 720 TTRSGAIFTK AYQRMVLDAV TLNNLEIFLN GTNGSTEGTL LERVDTCHTP FGKRLLKQWL 780 CAPLCSPYAI NDRLDAIEDL MVVPDKISEV VELLKKLPDL ERLLSKIHNV GSPLKSQNHP 840 DSRAIMYEET TYSKKKIIDF LSALEGFKVM CKIIGIMEEV IDGFKSKILK QVISLQTKNP 900 EGRFPDLTTE LNRWDTAFDH EKARKTGLIT PKAGFDSDYD QALADIRENE QSLLEYLEKQ 960 RNRIGCRTIV YWGIGRNRYQ LEIPENFTTR NLPEEYELKS TKKGCKRYWT KTIEKKLANL 1020 INAEERRDVS LKDCMRRLFY NFDKNYKDWQ SAVECIAVLD VLLCLANYSR GGDGPMCRPV 1080 ILLPEDLELK GSRHPCITKT FFGDDFIPND ILIGCEEEEQ ENGKAYCVLV TGPNMGGKST 1140 LMRQAGLLAV MAQMGCYVPA EVCRLTPIDR VFTRLGASDR IMSVGESTFF VELSETASIL 1200 MHATAHSLVL VDELGRGTAT FDGTAIANAV VKELAETIKC RTLFSTHYHS LVEDYSQNVA 1260 VRLGHMACMV ENECEDPSQE TITFLYKFIK GACPKSYGFN AARLANLPEE VIQKGHRKAR 1320 EFEKMNQSLR LFREVCLASE RSTVDAEAVH KLLTLIKEL 1359 |
||||||||||||
Nucleotide Sequence (Fasta) | CACGGCGAGG CGCCTGTTGA TTGGCCACTG GGGCCCGGGT TCCTCCGGCG GAGCGCGCCT 60 CCCCCCCAGG TTTCCCGCCA GCAGGAGCCG CGCGGTAGGT GCGGTGCGTT TCGGCGCTCC 120 GTCCGACAGA ACGGTTGGGC CTTGTCGGCT GTCGCTATGT CGCGACAGAG CACCCTGTAC 180 AGCTTCTTCC CCAAGTCTCC GGCGCTGAGT GATGCCAACA AGGCCTCGGC CAGGGCCTCT 240 CGCGAAGGCG GCCGTGCCGC CGCTGTCCCC GGGGCCTCTC CTTCCCTAGG CGAGGATGCG 300 GCCTGGAGCG AGGCTGGGCC TGGGCCCAGG CCCTCGGCGC GCTCCGCGTC GCCGCCTAAG 360 GCGAAGAACC TCAACGGAGG GCTGCGGAGA TCCGCAGCGC CTGCGGCCCC CACCAGCAAC 420 AGTTGTGACT TCTCACCAGG AGATTTGGTT TGGGCCAAGA TGGAGGGTTA CCCCTGGTGG 480 CCTTGTCTGG TTTACAACCA CCCCTTTGAT GGAACATTCA TCCGCGAGAA AGGGAAATCA 540 GTCCGTGTTC ATGTACAGTT TTTTGATGAC AGCCCAACAA GGGGCTGGGT TAGCAAAAGA 600 CTTTTAAAGC CATACACAGG TTCAAAATCA AAGGAAGCCC AGAAGGGAGG TCATTTTTAC 660 AGTGCAAAGC CTGAAATACT GAGAGCAATG CAACGTGCAG ATGAAGCCTT AAATAAAGAC 720 AAGATTAAGA GGCTTGAATT GGCAGTTTGT GATGAGCCCT CAGAGCCAGA AGAGGAAGAA 780 GAGATGGAGG TAGGCACAAC TTATGTAACA GATAAGAGTG AAGAAGATAA TGAAATTGAG 840 AGTGAAGAGG AAGTACAGCC TAAGACACAA GGATCTAGGC GAAGTAGCCG CCAAATAAAA 900 AAACGAAGGG TCATATCAGA CTCTGAGAGT GACATTGGTG GCTCTGATGT GGAATTTAAG 960 CCAGATACTA AGGAGGAAGG AAGCAGTGAT GAAATAAGCA GTGGAGTGGG GGATAGTGAG 1020 AGTGAAGGCC CAAACAGTCC TGTCAAAGTT GCTCGAAAGC GGAAGAGAGT GGTCACTGGA 1080 AATGGCTCTC TTAAAAGGAA AAGTTCTAGG AAGGAAATGC CCTCAGCCAC CAAACGAGCA 1140 ACTGGCATTT CATCAGAAAC CAAGAATACT TTGAGTGCTT TCTCTGCCCC TCAAAATTCT 1200 GAATCCCAAG CCCATGTTAG TGGAGGTGGT GATGACAGTA GTCGCCCTAC TGTTTGGTAT 1260 CATGAAACTT TAGAATGGCT TAAGGAGGAA AAGAGAAGAG ATGAGCACAG GAGGCGGCCT 1320 GATCACCCCG ATTTTAATGC ATCTACACTC TATGTGCCTG AGGATTTCCT CAATTCTTGT 1380 ACTCCTGGGA TGAGGAAGTG GTGGCAGATT AAGTCTCAGA ACTTTGATCT TGTCATCTGT 1440 TATAAGGTGG GGAAATTTTA TGAACTGTAC CACATGGATG CTCTTATTGG AGTCAGTGAA 1500 CTGGGGCTGG TATTCATGAA AGGCAACTGG GCCCATTCTG GCTTTCCTGA AATTGCATTT 1560 GGCCGTTATT CAGACTCCTT GGTGCAGAAA GGCTATAAAG TAGCACGAGT GGAACAGACT 1620 GAGACTCCGG AAATGATGGA GGCACGATGC CGAAAGATGG CACATATATC CAAGTATGAT 1680 AGAGTGGTGA GGAGGGAGAT CTGTAGGATC ATTACCAAGG GTACACAGAC TTACAGTGTG 1740 CTGGAAGGTG ATCCCTCTGA GAACTACAGT AAGTATCTTC TTAGCCTCAA AGAAAAAGAG 1800 GAAGATTCTT CTGGCCATAC TCGTGCATAT GGTGTGTGCT TTGTTGATAC TTCACTGGGA 1860 AAGTTTTTCA TAGGTCAGTT TTCAGATGAT CGCCATTGTT CGAGATTTAG GACTCTAGTG 1920 GCACACTATC CCCCAGTACA AGTCTTATTT GAAAAAGGAA ATCTCTCAAA GGAAACGAAA 1980 ACAATTTTAA AGAGTTCATT GTCCTCTTCT CTTCAGGAAG GTCTGATACC TGGCTCCCAG 2040 TTTTGGGATG CATCCAAAAC TTTGAGAACT CTCCTTGAGG AAGGATATTT TCGGGAAAAG 2100 CTAAGTGATG ACATTGGGGT GTTGTTACCC CAGGTGCTTA AAGGTATGAC TTCAGAGTCA 2160 GATTCCATTG GGTTGACACC AGGAGAGAAA AGTGAATTGG CCCTCTCTGC TCTAGGTGGT 2220 TGTATCTTCT ACCTCAAAAA ATGCCTTATT GATCAGGAGC TTTTATCAAT GGCTAATTTT 2280 GAAGAATATA TTCCCTTGGA TTCTGACATA GTCAGCACTA CAAGATCTGG TGCTATCTTC 2340 ACCAAAGCCT ATCAACGAAT GGTGCTAGAT GCAGTGACAT TAAACAACTT GGAGATTTTT 2400 CTGAATGGAA CAAACGGTTC TACTGAAGGA ACCCTGCTAG AGAGGGTTGA TACTTGCCAT 2460 ACTCCCTTCG GTAAGCGGCT CCTAAAGCAA TGGCTTTGTG CCCCACTCTG TAGCCCTTAT 2520 GCTATCAATG ATCGTCTAGA TGCCATAGAA GACCTCATGG TTGTGCCTGA CAAAATCTCC 2580 GAAGTTGTAG AGCTTCTAAA GAAGCTTCCA GATCTTGAGA GACTACTCAG TAAAATTCAT 2640 AATGTTGGGT CTCCCCTTAA GAGTCAGAAC CATCCAGATA GCAGGGCTAT AATGTATGAA 2700 GAAACTACAT ACAGCAAAAA AAAGATCATT GATTTTCTTT CTGCTCTGGA AGGATTCAAA 2760 GTAATGTGTA AAATTATAGG GATCATGGAA GAAGTCATTG ATGGTTTTAA GTCTAAAATC 2820 CTTAAGCAGG TCATCTCTCT GCAGACAAAA AATCCTGAAG GCCGTTTTCC TGATTTGACT 2880 ACAGAACTGA ACCGATGGGA TACAGCGTTT GACCATGAAA AGGCTCGAAA GACTGGACTT 2940 ATTACTCCCA AAGCAGGCTT TGACTCTGAT TATGACCAAG CTCTTGCTGA CATAAGAGAA 3000 AATGAACAGA GCCTCCTGGA ATACCTAGAG AAACAGCGCA ACAGAATTGG CTGTAGGACC 3060 ATAGTCTATT GGGGGATTGG TAGGAACCGT TACCAGTTGG AAATTCCTGA GAATTTCACC 3120 ACTCGCAATT TGCCAGAAGA ATATGAATTG AAATCTACCA AGAAGGGCTG TAAACGATAC 3180 TGGACCAAAA CTATTGAGAA GAAGTTGGCT AATCTCATAA ATGCTGAAGA ACGGAGAGAT 3240 GTATCATTGA AGGACTGCAT GCGGCGACTG TTCTATAACT TTGATAAAAA TTACAAGGAC 3300 TGGCAGTCTG CTGTAGAGTG TATCGCAGTG TTGGATGTCT TACTGTGCCT GGCTAACTAC 3360 AGTCGAGGGG GTGATGGTCC TATGTGTCGC CCAGTAATTC TGTTGCCAGA AGATTTAGAG 3420 CTTAAAGGAT CACGCCATCC TTGCATTACG AAGACTTTTT TTGGAGATGA TTTTATTCCT 3480 AATGACATTC TAATAGGCTG TGAGGAAGAG GAGCAGGAAA ATGGCAAAGC CTATTGTGTG 3540 CTTGTTACTG GACCAAATAT GGGGGGCAAG TCTACACTCA TGAGACAGGC TGGCTTATTA 3600 GCTGTAATGG CCCAGATGGG TTGTTACGTC CCCGCTGAAG TGTGCAGGCT CACACCAATT 3660 GATAGAGTGT TTACTAGACT TGGTGCCTCA GATAGAATAA TGTCAGTAGG TGAAAGTACA 3720 TTTTTTGTTG AATTAAGTGA AACTGCCAGC ATACTTATGC ATGCAACAGC ACATTCTCTG 3780 GTGCTTGTGG ATGAATTAGG AAGAGGTACT GCAACATTTG ATGGGACAGC AATAGCAAAT 3840 GCAGTTGTTA AAGAACTTGC TGAGACTATA AAATGTCGTA CATTATTTTC AACTCACTAC 3900 CATTCATTAG TAGAAGATTA TTCTCAAAAT GTTGCTGTGC GCTTAGGACA TATGGCATGC 3960 ATGGTAGAAA ATGAATGTGA AGACCCCAGC CAGGAGACTA TTACCTTCCT CTATAAATTC 4020 ATTAAGGGAG CTTGTCCTAA AAGCTATGGC TTTAATGCAG CAAGGCTTGC TAATCTCCCA 4080 GAGGAAGTTA TTCAAAAGGG ACATAGAAAA GCAAGAGAAT TTGAGAAGAT GAATCAGTCT 4140 CTACGATTAT TTCGGGAAGT TTGCCTGGCT AGTGAAAGGT CAACTGTAGA TGCTGAAGCT 4200 GTCCATAAAT TGCTGACTTT GATTAAGGAA TTATAG 4237 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |