WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Sus-0141 | ||||||||||||
Ensembl Protein ID | ENSSSCP00000023698.1 | ||||||||||||
Gene Name | MSH6 | ||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Sus scrofa | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLYSF FPKSPALTNA NKAPARVSSE GSAPAAAAGA SPSPGGDAAW SETGPGSGSP 60 AGSASRAEAS SSSCDFSPGD LVWAKMEGYP WWPCLVYNHP FDGTFIREKG KSARVHVQFF 120 DDSPTRGWVS RRLLKPYTGS KSKEAQKGGH FYSAKPEILR AMQRADEALN KDKIKRLELA 180 VCDEPSEPEE EEKSEEENEM ESEEEVQPKV QGSRRSSRHI KKRRVISDSE SDVGGSDVEF 240 KPDTKEEGSS DEMSSGVGDS DSEGLDSPVK IAPKRKRMVT GNGSLKRKSS RKEMPSATKR 300 AIGISSETRS TLSAFSAPQN SEPQAHISGG CDDSNRPTVW YHETLEWLKE ERRRDVHRRR 360 PDHPDFDAST LYVPEDFLNS CTPGMRKWWQ IKSQNFDLVI FYKVGKFYEL YHMDALIGVS 420 ELGLVFMKGN WAHSGFPEIA FGRYSDSLVQ KGYKVARVEQ TETPEMMEAR CRKMAHISKY 480 DRVVRREICR VITKGTQTYS VLEGDPSENY SKYLLSLKEK EDDSSGHTRV YGVCFVDTSL 540 GKFFIGQFSD DRHCSRFRTL VAHYPPVQVL FEKGSLSTET KMILKGSLSS SLQEGLIPGS 600 QFWDAGKTLR TLLEEGYFTD KLNEDSGVML PQVLKGMTSE SDSIGLTPGE KSELALSALG 660 GCVFYLKKCL IDQELLSMAN FEEYIPLDSD VVSASRPGAV FAKANQRMVL DAVTLNNLEI 720 FLNATNGSPE GTLLEKIDTC HTPFGKRLLK QWLCAPLCNP YAISDRLDAI EDLMVVPDKI 780 SEVVDLLKKL PDLERLLSKI HNVGSPLKSQ NHPDSRAIMY EETTYSKKKI IDFLSALEGF 840 KVICKIRGIM EEVIDDFKSK ILKQVLTLQT KNPEGRFPDL TVELNRWDTA FDHEKARKTG 900 LITPKAGFDS DYDQALADIR ENEQSLLEYL EKQRSRIGCR TIVYWGIGRN RYQLEIPENF 960 TTRNLPEEYE LKSTKKGCKR YWTKTIEKKL ANLINAEERR DVSLKDCMRR LFYNFDKNYK 1020 DWQAAVECIA VLDVLLCLAN YSRGGDGPMC RPVILLPGED TPPFLYLKGS RHPCITKTFF 1080 GDDFIPNDIL IGCEEEEEEN DKAYCVLVTG PNMGGKSTLM RQAGLLAVMA QMGCYVPAEV 1140 CRLTPIDRVF TRLGASDRIM SGESTFFVEL SETASILTHA TAHSLVLVDE LGRGTATFDG 1200 TAIANAVVKE LAENIKCRTL FSTHYHSLVE DYSQNVAVRL GHMACMVENE CEDPSQETIT 1260 FLYKFIKGAC PKSYGFNAAR LANLPEEVIQ KGHRKAREFE KMTQSLRLFR EVCLASERST 1320 VDAEAVHKLL TLME 1334 |
||||||||||||
Nucleotide Sequence (Fasta) | ATGTCGCGAC AGAGCACGCT GTACAGCTTC TTCCCCAAGT CTCCTGCGCT GACTAATGCC 60 AACAAAGCCC CAGCCAGGGT CTCAAGTGAA GGCAGCGCCC CCGCCGCTGC CGCCGGGGCC 120 TCTCCTTCCC CAGGCGGGGA TGCGGCCTGG AGCGAGACTG GGCCTGGCTC GGGGTCCCCG 180 GCGGGCTCCG CGTCGCGGGC TGAAGCGAGC AGCAGTTCTT GTGACTTCTC ACCAGGTGAT 240 TTGGTTTGGG CCAAGATGGA GGGTTACCCC TGGTGGCCTT GCCTTGTTTA CAACCACCCT 300 TTTGATGGAA CATTCATCCG GGAGAAAGGA AAGTCTGCCC GAGTTCATGT ACAGTTTTTT 360 GATGACAGCC CAACAAGAGG CTGGGTTAGC AGGAGGCTAT TAAAGCCATA TACAGGTTCA 420 AAATCAAAGG AAGCCCAGAA AGGAGGTCAT TTTTACAGTG CAAAGCCTGA AATACTCAGA 480 GCAATGCAAC GTGCAGATGA AGCCTTGAAT AAAGACAAGA TTAAGAGGCT TGAATTGGCG 540 GTGTGTGATG AGCCCTCAGA ACCAGAGGAG GAAGAAAAGA GTGAAGAAGA AAATGAAATG 600 GAGAGTGAAG AGGAAGTACA GCCTAAAGTG CAAGGATCCA GGCGAAGTAG CCGCCATATA 660 AAAAAACGAA GGGTCATATC AGACTCTGAG AGTGATGTTG GTGGCTCTGA CGTGGAATTC 720 AAGCCAGACA CTAAGGAGGA AGGAAGCAGT GATGAAATGA GCAGTGGAGT GGGAGATAGC 780 GACAGTGAAG GCCTGGACAG CCCTGTCAAA ATTGCTCCAA AGCGTAAGAG AATGGTAACT 840 GGAAATGGCT CCCTCAAAAG GAAAAGTTCA AGGAAGGAAA TGCCCTCAGC CACCAAACGA 900 GCAATTGGCA TTTCATCAGA AACCAGGAGT ACTTTGAGTG CTTTCTCTGC CCCTCAAAAT 960 TCTGAACCCC AGGCCCACAT TAGCGGAGGA TGTGATGACA GCAACCGCCC CACTGTCTGG 1020 TATCATGAAA CTTTAGAGTG GCTCAAGGAG GAAAGGAGGA GAGATGTGCA CAGGAGGCGA 1080 CCCGATCACC CTGATTTTGA TGCATCCACC CTCTACGTGC CTGAGGACTT CCTTAATTCC 1140 TGTACGCCTG GGATGAGGAA GTGGTGGCAA ATTAAGTCGC AGAACTTTGA TCTTGTCATA 1200 TTTTATAAAG TGGGGAAGTT CTATGAGCTG TACCACATGG ATGCTCTTAT TGGAGTTAGT 1260 GAACTAGGGC TTGTATTCAT GAAAGGCAAC TGGGCCCATT CTGGTTTCCC TGAAATTGCA 1320 TTTGGCCGAT ATTCAGATTC CCTGGTTCAG AAGGGCTATA AAGTAGCACG AGTGGAACAG 1380 ACTGAGACCC CGGAAATGAT GGAGGCACGA TGCCGAAAGA TGGCACATAT ATCGAAGTAT 1440 GATCGAGTGG TGAGGAGGGA GATCTGTAGG GTCATTACCA AGGGCACACA GACCTACAGT 1500 GTGCTGGAAG GTGACCCCTC TGAGAACTAC AGTAAGTATC TTCTTAGCCT CAAAGAAAAA 1560 GAGGATGATT CTTCCGGCCA CACTCGAGTG TACGGTGTAT GCTTTGTTGA CACCTCTCTG 1620 GGAAAGTTCT TTATAGGTCA GTTTTCAGAT GACCGCCATT GTTCCAGGTT CAGGACTCTA 1680 GTGGCACACT ATCCGCCAGT TCAAGTGTTG TTTGAGAAAG GAAGTCTCTC AACGGAAACG 1740 AAGATGATTC TGAAGGGTTC ATTATCCTCT TCTCTTCAGG AAGGTCTGAT ACCAGGCTCC 1800 CAGTTTTGGG ATGCAGGCAA AACTTTGAGA ACTCTCCTTG AAGAAGGATA TTTTACAGAC 1860 AAGTTGAATG AAGACAGTGG GGTGATGTTA CCCCAGGTGC TTAAAGGTAT GACATCAGAG 1920 TCTGATTCTA TTGGGTTGAC ACCAGGAGAG AAGAGTGAAT TGGCCCTCTC TGCTCTAGGT 1980 GGTTGTGTCT TCTACCTCAA AAAATGCCTT ATTGATCAGG AGCTTTTATC AATGGCTAAT 2040 TTTGAAGAAT ATATTCCCTT GGATTCTGAT GTGGTCAGTG CTTCAAGACC TGGTGCTGTC 2100 TTTGCTAAAG CCAATCAACG AATGGTGCTA GATGCTGTGA CATTAAACAA CTTGGAGATT 2160 TTTTTGAACG CAACAAATGG TTCTCCTGAA GGGACCCTGC TAGAGAAGAT TGATACTTGC 2220 CATACTCCCT TTGGCAAGCG GCTCCTAAAG CAATGGCTTT GTGCCCCCCT CTGTAACCCT 2280 TATGCTATCA GTGATCGTCT AGACGCCATA GAAGACCTCA TGGTTGTGCC TGACAAAATC 2340 TCTGAGGTTG TAGACCTTCT AAAGAAGCTT CCAGACCTTG AGCGGCTACT GAGTAAAATT 2400 CATAATGTTG GGTCTCCCCT GAAGAGCCAG AACCACCCAG ATAGCAGGGC TATAATGTAT 2460 GAAGAAACCA CATACAGCAA AAAAAAGATT ATTGATTTTC TTTCTGCTCT GGAAGGATTT 2520 AAAGTAATAT GTAAAATTAG AGGGATTATG GAAGAAGTCA TTGATGACTT TAAGTCTAAA 2580 ATCCTTAAGC AGGTCCTTAC TCTGCAGACA AAAAATCCTG AAGGTCGCTT TCCTGATTTG 2640 ACTGTAGAAC TGAACCGATG GGATACAGCC TTTGACCATG AAAAGGCTCG AAAGACTGGA 2700 CTCATTACTC CTAAAGCAGG ATTTGACTCT GATTATGACC AAGCTCTTGC TGACATAAGA 2760 GAAAATGAAC AGAGCCTCCT AGAATACTTG GAGAAACAGC GCAGTCGAAT TGGCTGTAGG 2820 ACCATAGTCT ACTGGGGGAT TGGTAGGAAT CGTTACCAGT TGGAAATTCC AGAGAATTTC 2880 ACCACCCGTA ATTTGCCAGA AGAATATGAG TTGAAATCCA CCAAGAAGGG CTGTAAACGA 2940 TACTGGACCA AAACAATTGA GAAGAAGTTG GCTAATCTGA TAAATGCTGA AGAACGGAGA 3000 GATGTTTCAC TGAAGGACTG CATGCGACGA CTATTTTATA ACTTTGATAA AAATTACAAG 3060 GACTGGCAGG CTGCTGTAGA GTGCATCGCA GTGTTGGATG TCTTATTGTG CCTGGCTAAC 3120 TACAGTCGAG GGGGTGATGG TCCTATGTGT CGTCCAGTAA TTCTGTTGCC AGGAGAAGAC 3180 ACTCCTCCCT TTCTGTACCT TAAAGGATCA CGCCATCCCT GCATTACAAA GACTTTTTTT 3240 GGTGATGACT TTATTCCTAA TGACATTCTC ATAGGCTGTG AGGAAGAAGA GGAGGAAAAT 3300 GACAAAGCTT ATTGTGTGCT TGTTACTGGG CCAAATATGG GGGGCAAGTC TACACTTATG 3360 AGACAGGCTG GCCTATTAGC TGTAATGGCC CAGATGGGTT GTTATGTACC AGCTGAAGTG 3420 TGCAGGCTCA CACCAATTGA TAGAGTGTTT ACTAGACTTG GTGCCTCAGA CAGAATAATG 3480 TCAGGTGAAA GTACATTTTT TGTTGAATTG AGTGAAACTG CCAGTATTCT TACACATGCA 3540 ACAGCACATT CTCTGGTGCT TGTGGATGAA TTAGGAAGAG GTACTGCAAC ATTTGATGGG 3600 ACAGCAATAG CAAATGCTGT TGTTAAAGAA CTGGCTGAGA ATATAAAGTG TCGTACGTTG 3660 TTTTCTACCC ACTACCATTC ATTAGTTGAA GACTATTCTC AAAATGTTGC AGTGCGCCTG 3720 GGACACATGG CATGCATGGT AGAAAATGAA TGCGAGGATC CCAGCCAGGA GACTATTACC 3780 TTCCTCTATA AATTCATTAA AGGAGCTTGT CCTAAAAGCT ATGGCTTTAA TGCAGCAAGG 3840 CTTGCTAATC TTCCAGAGGA GGTTATTCAA AAGGGACATA GAAAAGCAAG AGAATTCGAG 3900 AAGATGACTC AGTCATTACG ATTATTTCGG GAAGTTTGTC TGGCTAGTGA AAGGTCGACT 3960 GTAGATGCTG AAGCTGTCCA TAAGTTGCTG ACTTTGATGG AG 4003 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |