WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Prc-0145 | ||||||||||||
Ensembl Protein ID | ENSPCAP00000013528.1 | ||||||||||||
Gene Name | MSH6 | ||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Procavia capensis | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLYTF FPKSPALNNA NKAPARPSSG GSTAAATVAS PSPSGDAAWS EAGPGSLASS 60 ASPSEAKKLN GGLRRQAAPA VPASCDFSPG DLVWAKMEGY PWWPCLVYNH PFDGTFVREK 120 GKSVRIHVQF FDDSPTRGWV SKRLLKPYTG SKAKEAQKGG HFYSSKPEIL RAMQSADEAL 180 NKDKLKRLEL AVCDEPSEPE EEEEMEVDAT NTLEKREEPA ETESEEEVRP KIQGARRSGR 240 QIKKRRVISD SESDAGGSDV EFKPDAKEEA SSDEMSSGVG DSEGEGLYSP ARVAPKRKKT 300 VTGNSSLKRK NSRKEMSSAT KKATGIPSET KSTLSAFSAP QNSESQPHIS GSSDDGSRST 360 VWYHETLEWL KEEKRRDERR RRPDHPDFDA STLYVPEDFL NSCTPGMRKW WQIKSQNFDL 420 VIFYKVGKFY ELYHMDALVG VSELGLVFMK GNWAHSGFPE IAFGRYSDSL VQKGYKVARV 480 EQTETPEMME ARCRKMAHIS KHDRVVRREI CRVITRGTQT YSVLEGDPSE NHCKYLLSLK 540 EKEDDCYGHT RVYGVCFVDT SLGRFYVGQF SDDRHCSRFR TLVAHYPPVQ VLFEKSNLST 600 ETKTILRSSL SSSLQEGLIP GSQFWDAAKT LRVLLEDGYF TEKLNEDSGV RLPQVLKGMT 660 SESDSIGLTP GEKSELALSA LGGCVFYLKK CLIDQELLSM ANFEEYIPLD SDMVGATKPG 720 AVFAKASQRM VLDAVTLNNL EIFLNGTNGS TEGTLLERID TCHTPFGKRL LKQWLCAPLC 780 SPFPINDRLD AIEDLMAVPD KISELVDLLK RLPDLERLLS KIHNVGSPLK SQNHRDSRAI 840 MYEETTYSKK KIIDFLSALE GFKVMCKIIE VMEEVVSDFK SKLLKQVITL QAKNPEGRFP 900 DLTAELNRWD TAFDHEKARK TGLITPKAGF DADYDQALAD IRENEQSLLE YLEKQRNRIG 960 CRTIVYWGIG RNRYQLEIPE NFTTRNLPEE YELKSTKKGC KRYWTKTIEK KLANLINAEE 1020 RRDVSLKDCM RRLFYNFDKN YKDWQAAVEC IAVLDVLLCL ATYSQGGDGP MCRPVVLLPT 1080 EDTPPFLELQ GSRHPCITKT FFGDDFIPND ILIGCEEEEE RGKAYCVLVT GPNMGGKSTL 1140 MRQAGLLAVM AQVGCYVPAE VCRLTPIDRV FTRLGASDRI MSGESTFFVE LSETASILTH 1200 ATAHSLVLVD ELGRGTATFD GTAIASAVVK ELAENIKCRT LFSTHYHSLV ENYSQNVAVR 1260 LGHMACMVEN ECEDPSQETI TFLYKFIKGA CPKSYGFNAA RLANLPEEVI QKGHRKAREF 1320 EMTTKSLRLF REVCLASERS TLEAEAVHKL LTLIKDL 1357 |
||||||||||||
Nucleotide Sequence (Fasta) | ATGTCGCGAC AGAGCACCCT GTACACCTTC TTCCCCAAGT CTCCTGCGCT GAATAATGCC 60 AACAAAGCCC CAGCCAGGCC CTCAAGTGGA GGCAGCACTG CCGCCGCCAC TGTGGCCTCT 120 CCTTCCCCCA GCGGGGATGC GGCCTGGAGC GAGGCCGGGC CTGGGTCCCT GGCGAGCTCT 180 GCGTCGCCCT CTGAGGCGAA GAAGCTCAAC GGAGGCCTGC GGAGGCAGGC GGCGCCTGCA 240 GTCCCCGCCA GCTGTGACTT CTCGCCAGGT GACCTGGTCT GGGCCAAGAT GGAGGGCTAC 300 CCCTGGTGGC CCTGCTTGGT CTACAACCAC CCCTTTGATG GGACCTTCGT CCGGGAGAAA 360 GGGAAGTCTG TCCGCATTCA CGTGCAGTTC TTTGATGACA GCCCCACCAG GGGCTGGGTC 420 AGCAAAAGGC TGTTAAAGCC GTACACAGGT TCAAAAGCAA AGGAAGCCCA GAAGGGAGGT 480 CATTTTTACA GTTCAAAGCC TGAAATACTC CGAGCAATGC AAAGTGCAGA TGAAGCCTTA 540 AATAAAGACA AGCTGAAGAG GCTTGAATTG GCAGTGTGTG ACGAGCCCTC AGAGCCAGAA 600 GAGGAAGAAG AGATGGAGGT AGATGCAACG AACACCTTAG AGAAGCGTGA AGAACCTGCT 660 GAAACAGAAA GTGAAGAGGA AGTGCGGCCT AAGATACAAG GGGCTCGGCG TAGTGGCCGT 720 CAGATAAAAA AACGTAGAGT CATATCAGAC TCAGAGAGTG ATGCTGGTGG TTCTGATGTG 780 GAGTTCAAGC CAGATGCCAA GGAGGAGGCC AGCAGCGATG AAATGAGCAG TGGTGTTGGG 840 GACAGTGAGG GAGAAGGCCT CTACAGCCCC GCCAGGGTTG CCCCAAAGCG GAAGAAAACG 900 GTGACTGGAA ACAGCTCTCT TAAAAGGAAG AATTCAAGGA AGGAAATGTC TTCAGCTACT 960 AAAAAAGCAA CTGGCATTCC ATCAGAAACA AAGAGTACTT TGAGTGCTTT CTCCGCCCCT 1020 CAGAATTCTG AGTCCCAGCC CCACATTAGT GGGAGCAGTG ATGATGGAAG TCGTTCCACT 1080 GTGTGGTATC ATGAAACCTT GGAATGGCTT AAGGAGGAGA AGAGAAGAGA TGAACGCAGG 1140 AGGCGGCCTG ATCACCCTGA TTTTGATGCG TCCACACTCT ACGTGCCGGA AGACTTCCTT 1200 AATTCTTGCA CTCCTGGGAT GAGGAAGTGG TGGCAGATTA AGTCTCAGAA TTTTGATCTT 1260 GTCATCTTTT ACAAGGTGGG CAAGTTTTAT GAGTTGTATC ACATGGATGC ACTTGTGGGA 1320 GTCAGTGAAC TGGGCCTGGT GTTTATGAAA GGCAACTGGG CCCACTCTGG TTTCCCCGAG 1380 ATCGCCTTCG GCCGCTACTC AGACTCTCTG GTGCAGAAGG GCTATAAGGT GGCACGGGTG 1440 GAGCAGACGG AGACCCCTGA GATGATGGAG GCGCGGTGCC GGAAGATGGC CCACATCTCG 1500 AAGCATGATC GGGTGGTGAG GCGGGAGATC TGCAGGGTCA TCACCAGAGG CACCCAGACC 1560 TACAGTGTTC TAGAAGGTGA CCCCTCGGAG AACCACTGCA AGTATCTTCT GAGCCTCAAA 1620 GAAAAAGAGG ACGATTGTTA TGGCCACACA CGGGTGTACG GTGTGTGCTT TGTCGACACG 1680 TCCCTGGGAA GGTTTTATGT GGGCCAGTTT TCTGATGATC GCCATTGCTC CCGGTTCAGG 1740 ACTCTGGTGG CACACTATCC CCCGGTACAA GTCCTGTTTG AGAAAAGTAA TCTCTCCACA 1800 GAAACCAAGA CGATTCTGAG GAGTTCACTA TCCTCTTCTC TTCAGGAAGG GCTTATACCA 1860 GGCTCCCAGT TCTGGGATGC AGCCAAAACT TTGCGCGTTC TCCTTGAAGA TGGGTATTTT 1920 ACAGAAAAGC TAAATGAGGA CAGTGGGGTG CGGCTACCCC AGGTGCTTAA AGGCATGACC 1980 TCCGAGTCAG ATTCCATTGG GCTGACTCCA GGAGAGAAGA GTGAGCTGGC ACTGTCTGCT 2040 CTGGGCGGCT GTGTTTTCTA CCTCAAAAAA TGCCTTATCG ACCAGGAGCT TTTATCAATG 2100 GCTAATTTTG AAGAATATAT TCCCTTGGAT TCTGATATGG TTGGTGCCAC CAAGCCTGGT 2160 GCTGTCTTTG CTAAAGCCAG TCAGCGAATG GTGCTAGATG CAGTGACATT AAACAACTTG 2220 GAGATTTTTC TCAACGGGAC AAATGGTTCT ACTGAAGGGA CTCTGCTAGA GAGAATTGAC 2280 ACTTGCCATA CTCCCTTTGG GAAGCGGCTT CTAAAGCAGT GGCTTTGTGC CCCACTCTGC 2340 AGCCCTTTCC CTATCAATGA TCGACTAGAT GCCATAGAAG ACCTCATGGC TGTGCCTGAC 2400 AAAATCTCTG AGCTGGTGGA CCTGTTAAAG AGGCTTCCAG ACCTTGAGAG GCTGCTGAGC 2460 AAAATACACA ATGTTGGGTC TCCCCTCAAG AGCCAGAACC ACCGTGATAG CAGGGCCATC 2520 ATGTATGAAG AAACTACGTA TAGCAAAAAA AAGATTATTG ATTTTCTTTC TGCTCTGGAA 2580 GGATTCAAAG TAATGTGTAA AATTATAGAG GTAATGGAAG AGGTTGTCAG TGACTTCAAG 2640 TCTAAGCTCC TTAAGCAGGT CATTACTCTC CAGGCAAAAA ATCCTGAAGG TCGCTTTCCT 2700 GATTTGACTG CAGAACTGAA CCGGTGGGAT ACAGCCTTTG ATCATGAGAA GGCTCGTAAG 2760 ACTGGACTTA TTACTCCGAA AGCAGGATTT GATGCAGATT ATGACCAGGC TCTTGCTGAC 2820 ATAAGAGAAA ATGAACAGAG CCTCCTGGAG TATTTGGAGA AACAGCGCAA TCGAATTGGC 2880 TGCAGGACCA TAGTCTACTG GGGGATTGGT AGGAATCGGT ATCAGCTGGA GATTCCAGAG 2940 AACTTCACCA CCCGCAACTT GCCTGAAGAG TACGAGCTGA AGTCTACCAA GAAGGGCTGT 3000 AAACGGTACT GGACCAAAAC AATTGAGAAG AAGTTGGCTA ATCTGATCAA CGCTGAGGAG 3060 CGGAGAGATG TGTCACTGAA GGATTGCATG CGGCGACTCT TTTATAACTT TGATAAAAAC 3120 TACAAGGACT GGCAGGCTGC TGTTGAGTGC ATTGCAGTGC TGGATGTCTT GCTGTGTCTG 3180 GCTACCTACA GTCAAGGAGG TGACGGTCCC ATGTGCCGCC CAGTGGTTCT GTTGCCGACA 3240 GAGGACACTC CCCCCTTCTT AGAGCTTCAA GGGTCACGCC ATCCCTGCAT TACCAAGACC 3300 TTTTTTGGAG ATGACTTTAT TCCTAATGAC ATCCTAATAG GCTGTGAGGA AGAGGAGGAA 3360 AGGGGCAAAG CTTATTGTGT GCTTGTCACT GGACCTAATA TGGGGGGCAA ATCGACGCTC 3420 ATGAGGCAGG CTGGCCTGTT GGCAGTAATG GCCCAGGTGG GCTGCTACGT ACCTGCTGAA 3480 GTGTGTAGGC TCACCCCCAT CGACCGAGTG TTCACGAGGC TCGGTGCCTC AGACAGAATC 3540 ATGTCAGGGG AAAGCACATT TTTTGTTGAG TTGAGTGAAA CTGCCAGCAT ACTCACACAT 3600 GCCACGGCTC ATTCCCTGGT GCTGGTGGAT GAATTAGGAA GAGGGACGGC AACATTTGAT 3660 GGGACAGCAA TAGCAAGTGC AGTTGTTAAA GAACTTGCTG AGAACATCAA ATGTCGTACG 3720 TTGTTCTCAA CACACTACCA TTCCTTAGTA GAAAATTACT CTCAGAACGT TGCCGTGCGC 3780 CTAGGACACA TGGCCTGCAT GGTAGAAAAT GAGTGTGAAG ACCCCAGCCA GGAGACGATC 3840 ACCTTCCTGT ATAAATTCAT TAAGGGTGCT TGTCCCAAAA GCTACGGCTT TAATGCAGCA 3900 AGGCTTGCCA ATCTCCCGGA GGAAGTTATT CAGAAGGGAC ACAGAAAAGC AAGAGAATTT 3960 GAGATGACGA CCAAGTCGCT GCGGCTATTT CGGGAAGTTT GCCTGGCTAG TGAAAGGTCA 4020 ACCCTAGAGG CTGAAGCTGT CCATAAGTTG CTGACTTTGA TTAAGGATTT ATAG 4075 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |