WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Gog-0195 | ||||||||||||
Ensembl Protein ID | ENSGGOP00000018924.1 | ||||||||||||
Gene Name | MSH6 | ||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Gorilla gorilla | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60 ARSASPPKAK NLNGGLRRSA APAAPTSNSC DFSPGDLVWA KMEGYPWWPC LVYNHPFDGT 120 FIREKGKSVR VHVQFFDDSP TRGWVSKRLL KPYTGKSHYW MKLSVFYPSK LQGLAVVKAS 180 GMGKDLAKKD SQTPFKSSAS PEENKLVQVG TTYVTDKSEE DNEIESEEEV QPKTQGSRRS 240 SRQIKKRRVI SDSESDIGGS DVEFKPDTKE EGSSDEISSG VGDSESEGLN SPVKVARKRK 300 RMVTGNGSLK RKSSRKETPS ATKQATSISS ETKNTLRAFS APQNSESQAH VSGGGDDSSR 360 PTVWYHETLE WLKEEKRRDE HRRRPDHPDF DASTLYVPED FLNSCTPGMR KWWQIKSQNF 420 DLVICYKVGK FYELYHMDAL IGVSELGLVF MKGNWAHSGF PEIAFGRYSD SLVQKGYKVA 480 RVEQTETPEM MEARCRKMAH ISKYDRVVRR EICRIITKGT QTYSVLEGDP SENYSKYLLS 540 LKEKEEDSSG HTRAYGVCFV DTSLGKFFIG QFSDDRHCSR FRTLVAHYPP VQVLFEKGNL 600 SKETKTILKS SLSSSLQEGL IPGSQFWDAS KTLRTLLEEE YFREKLSDGI GVMLPQVLKG 660 MTSESDSIGL TPGEKSELAL SALGGCVFYL KKCLIDQELL SMANFEEYIP LDSDTVSTTR 720 SGAIFTKAYQ RMVLDAVTLN NLEIFLNGTN GSTEGTLLER VDTCHTPFGK RLLKQWLCAP 780 LCNPYAINDR LDAIEDLMVV PDKISEVVEL LKKLPDLERL LSKIHNVGSP LKSQNHPDSR 840 AIMYEETTYS KKKIIDFLSA LEGFKVMCKI IGIMEEVADG FKSKILKQVI SLQTKNPEGR 900 FPDLTVELNR WDTAFDHEKA RKTGLITPKA GFDSDYDQAL ADIRENEQSL LEYLEKQRNR 960 IGCRTIVYWG IGRNRYQLEI PENFTTRNLP EEYELKSTKK GCKRYWTKTI EKKLANLINA 1020 EERRDVSLKD CMRRLFYNFD KNYKDWQSAV ECIAVLDVLL CLANYSRGGD GPMCRPVILL 1080 PEDTPPFLEL KGSRHPCITK TFFGDDFIPN DILIGCEEEE QENGKAYCVL VTGPNMGGKS 1140 TLMRQAGLLA VMAQMGCYVP AEVCRLTPID RVFTRLGASD RIMSGESTFF VELSETASIL 1200 MHATAHSLVL VDELGRGTAT FDGTAIANAV VKELAETIQC RTLFSTHYHS LVEDYSQNVA 1260 VRLGHMACMV ENECEDPSQE TITFLYKFIK GACPKSYGFN AARLANLPEE VIQKGHRKAR 1320 EFEKMNQSLR LFREVCLASE RSTVDAEAVH KLLTLIKEL 1359 |
||||||||||||
Nucleotide Sequence (Fasta) | ATGTCGCGAC AGAGCACCCT GTACAGCTTC TTCCCCAAGT CTCCGGCGCT GAGTGATGCC 60 AACAAGGCCT CGGCCCGGGC CTCACGCGAA GGCGGCCGTG CCGCCGCTGC CCCCGGGGCC 120 TCTCCTTCCC CAGGCGGGGA TGCGGCCTGG AGCGAGGCTG GGCCTGGGCC CAGGCCCTTG 180 GCGCGCTCCG CGTCGCCGCC CAAGGCGAAG AACCTCAACG GAGGGCTGCG GAGATCGGCA 240 GCGCCTGCTG CCCCCACCAG CAACAGTTGT GACTTCTCAC CAGGAGATTT GGTTTGGGCC 300 AAGATGGAGG GTTACCCCTG GTGGCCTTGT CTGGTTTACA ACCACCCCTT TGATGGAACA 360 TTCATCCGTG AGAAAGGGAA ATCAGTCCGT GTTCATGTAC AGTTTTTTGA TGACAGCCCA 420 ACAAGGGGCT GGGTTAGCAA AAGGCTTTTA AAGCCATATA CAGGTAAGAG TCACTACTGG 480 ATGAAATTAA GTGTATTTTA TCCCAGTAAA TTGCAAGGGT TGGCAGTTGT GAAAGCTTCC 540 GGCATGGGAA AGGATCTGGC TAAAAAAGAT TCTCAAACCC CTTTTAAATC TTCTGCCTCA 600 CCTGAAGAGA ACAAGCTTGT CCAGGTAGGC ACAACTTACG TAACAGATAA GAGTGAAGAA 660 GATAATGAAA TTGAGAGTGA AGAGGAAGTA CAGCCTAAGA CACAAGGATC TAGGCGAAGT 720 AGCCGCCAAA TAAAAAAACG AAGGGTCATA TCAGACTCTG AGAGTGACAT TGGTGGCTCT 780 GATGTGGAAT TTAAGCCAGA CACTAAGGAG GAAGGAAGCA GTGATGAAAT AAGCAGTGGA 840 GTGGGGGATA GTGAGAGTGA AGGCCTGAAC AGCCCTGTCA AAGTTGCTCG AAAGCGGAAG 900 AGAATGGTGA CTGGAAATGG CTCTCTTAAA AGGAAAAGCT CTAGGAAGGA AACGCCCTCG 960 GCCACCAAAC AAGCAACTAG CATTTCATCA GAAACCAAGA ATACTTTGAG AGCTTTCTCT 1020 GCCCCTCAAA ATTCTGAATC CCAAGCCCAT GTTAGTGGAG GTGGTGATGA CAGTAGTCGC 1080 CCTACTGTTT GGTATCATGA AACTTTAGAA TGGCTTAAGG AGGAAAAGAG AAGAGATGAG 1140 CACAGGAGGA GGCCTGATCA CCCCGATTTT GATGCATCTA CACTCTATGT GCCTGAGGAT 1200 TTCCTCAATT CTTGTACTCC TGGGATGAGG AAGTGGTGGC AGATTAAGTC TCAGAACTTT 1260 GATCTTGTCA TCTGTTACAA GGTGGGGAAA TTTTATGAGC TGTACCACAT GGATGCTCTT 1320 ATTGGAGTCA GTGAACTGGG GCTGGTATTC ATGAAAGGCA ACTGGGCCCA TTCTGGCTTT 1380 CCTGAAATTG CATTTGGCCG TTATTCAGAT TCCCTGGTGC AGAAGGGCTA TAAAGTAGCA 1440 CGAGTGGAAC AGACTGAGAC TCCAGAAATG ATGGAGGCAC GATGTAGAAA GATGGCACAT 1500 ATATCCAAGT ATGACAGAGT GGTGAGGAGG GAGATCTGTA GGATCATTAC CAAGGGTACA 1560 CAGACTTACA GTGTGCTGGA AGGTGATCCC TCTGAGAACT ACAGTAAGTA TCTTCTTAGC 1620 CTCAAAGAAA AAGAGGAAGA TTCTTCTGGC CATACTCGTG CATATGGTGT GTGCTTTGTT 1680 GATACTTCGC TGGGAAAGTT TTTCATAGGT CAGTTTTCAG ATGATCGCCA TTGTTCGAGA 1740 TTTAGGACTC TGGTGGCACA CTATCCCCCA GTACAAGTCT TATTTGAAAA AGGAAATCTC 1800 TCAAAGGAAA CTAAAACAAT TCTAAAGAGT TCATTGTCCT CTTCTCTTCA GGAAGGTCTG 1860 ATACCCGGCT CCCAGTTTTG GGATGCATCC AAAACTTTGA GAACTCTCCT TGAGGAAGAA 1920 TATTTTAGGG AAAAGCTAAG TGATGGCATT GGGGTGATGT TACCCCAGGT GCTTAAAGGT 1980 ATGACTTCAG AGTCTGATTC CATTGGGTTG ACACCAGGAG AGAAAAGTGA ATTGGCCCTC 2040 TCTGCTCTAG GTGGTTGTGT CTTCTACCTC AAAAAATGCC TTATTGATCA GGAGCTTTTA 2100 TCAATGGCTA ATTTTGAAGA ATATATTCCC TTGGATTCTG ACACAGTCAG CACTACAAGA 2160 TCTGGTGCTA TCTTCACCAA AGCCTATCAA CGAATGGTGC TAGATGCAGT GACATTAAAC 2220 AACTTGGAGA TTTTTCTGAA TGGAACAAAT GGTTCTACTG AAGGAACCCT ACTAGAGAGG 2280 GTTGATACTT GCCATACTCC TTTTGGTAAG CGGCTCCTAA AGCAATGGCT TTGTGCCCCA 2340 CTCTGTAACC CTTATGCTAT TAATGATCGT CTAGATGCCA TAGAAGACCT CATGGTTGTG 2400 CCTGACAAAA TCTCTGAAGT TGTAGAGCTT CTAAAGAAGC TTCCAGATCT TGAGAGACTA 2460 CTCAGTAAAA TTCATAATGT TGGGTCTCCC CTGAAGAGTC AGAACCACCC AGACAGCAGG 2520 GCTATAATGT ATGAAGAAAC TACATACAGC AAAAAGAAGA TTATTGATTT TCTTTCTGCT 2580 CTGGAAGGAT TCAAAGTAAT GTGTAAAATT ATAGGGATCA TGGAAGAAGT CGCTGATGGT 2640 TTTAAGTCTA AAATCCTTAA GCAGGTCATC TCTCTGCAGA CAAAAAATCC TGAAGGTCGT 2700 TTTCCTGATT TGACTGTAGA ATTGAACCGA TGGGATACAG CCTTTGACCA TGAAAAGGCT 2760 CGAAAGACTG GACTTATTAC TCCCAAAGCA GGCTTTGACT CTGATTATGA CCAAGCTCTT 2820 GCTGACATAA GAGAAAATGA ACAGAGCCTC CTGGAATACC TAGAGAAGCA GCGCAACAGA 2880 ATTGGCTGTA GGACCATAGT CTATTGGGGG ATTGGTAGGA ACCGTTACCA GCTGGAAATT 2940 CCTGAGAATT TCACCACTCG CAATTTGCCA GAAGAATACG AATTGAAATC TACCAAGAAG 3000 GGCTGTAAAC GATACTGGAC CAAAACTATT GAAAAGAAGT TGGCTAATCT CATAAATGCT 3060 GAGGAACGGA GAGATGTATC ATTGAAGGAC TGCATGCGGC GACTGTTCTA TAACTTTGAT 3120 AAAAATTACA AGGACTGGCA GTCTGCTGTA GAGTGTATCG CAGTGTTGGA TGTTTTACTG 3180 TGCCTGGCTA ACTATAGTCG AGGGGGTGAT GGTCCTATGT GTCGCCCAGT AATTCTGTTG 3240 CCAGAAGATA CCCCCCCCTT CTTAGAGCTT AAAGGATCAC GCCATCCTTG CATTACAAAG 3300 ACTTTTTTTG GAGATGATTT TATTCCTAAT GACATTCTAA TAGGCTGTGA GGAAGAGGAG 3360 CAGGAAAATG GCAAAGCCTA TTGTGTGCTT GTTACTGGAC CGAATATGGG GGGCAAGTCT 3420 ACGCTTATGA GACAGGCTGG CTTATTAGCT GTAATGGCCC AGATGGGTTG TTACGTCCCT 3480 GCTGAAGTGT GCAGGCTCAC ACCAATTGAT AGAGTGTTTA CTAGACTTGG TGCCTCAGAC 3540 AGAATAATGT CAGGTGAAAG TACATTTTTT GTTGAATTAA GTGAAACTGC CAGCATACTC 3600 ATGCATGCAA CAGCACATTC TCTGGTGCTT GTGGATGAAT TAGGAAGAGG TACTGCAACA 3660 TTTGATGGGA CAGCAATAGC AAATGCAGTT GTTAAAGAAC TTGCTGAGAC TATACAATGT 3720 CGTACATTAT TTTCAACTCA CTACCATTCA TTAGTAGAAG ATTATTCTCA AAATGTTGCT 3780 GTGCGCCTAG GACATATGGC ATGCATGGTA GAAAATGAAT GTGAAGACCC CAGCCAGGAG 3840 ACTATTACCT TCCTCTATAA ATTCATTAAG GGAGCTTGTC CTAAAAGCTA TGGCTTTAAT 3900 GCAGCAAGGC TTGCTAATCT CCCAGAGGAA GTTATTCAAA AGGGACATAG AAAAGCAAGA 3960 GAATTTGAGA AGATGAATCA GTCACTACGA TTATTTCGGG AAGTTTGCCT GGCTAGTGAA 4020 AGGTCAACTG TAGATGCTGA AGCTGTCCAT AAATTGCTGA CTTTGATTAA GGAATTATAG 4080 ACTACATTGG AAGCTTTGAG TTGACTTCTG ACAAAGGTGG TAAATTCAGA CAACATTATG 4140 ATCTAATAAA CTTTATTTTT T 4162 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |