WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Bot-0012 | ||||||||||||
Ensembl Protein ID | ENSBTAP00000001867.5 | ||||||||||||
Gene Name | MSH6 | ||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Bos taurus | ||||||||||||
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
||||||||||||
Protein Sequence (Fasta) | MSRQSTLFSF FPKSPAVNNA NKAPVRASSG SSAAAAATGA SPSPGGDAAW NEAGPEPLAG 60 TTSRAEARNL NGGLRKSASP AVPASSCDFS PGDLVWAKME GYPWWPCLVY NHPFDGTFIR 120 EKGKSARVHV QFFDDSPTRG WVSRRLLKPY TGSHSKEAQK GGHFYSAKPE ILRAMQRADE 180 ALNKDKIKRL ELAVCDEPSE PEEEEETEVG ATYTSDKSEE ENDIESEEEV RPKVQGSRRS 240 SRQIKKRRVI SDSESDVGGS DVEFKPDTKE EASSDEISSG VGDSDSEGLD TPVKVAPKRK 300 RMVTGNGSLK RKSSRKEMPS ATKRATGISS ETKNTLSAFS VPQNSEPQAH ISGGCDDSNR 360 PTVWYHETLE WLKEEKRRDV HRRRPDHPDF DASTLYVPED FLNSCTPGMR KWWQIKSQNF 420 DLVIFYKVGK FYEMYHMDAL IGVSELGLVF MKGNWAHSGF PEIAFGRYSD SLVQKGYKVA 480 RVEQTETPEM MEARCRKMAH ISKYDRVVRR EICRIITKGT QTYSVLEGDP SENYSKYLLS 540 LKEKEEESSG HTRVYGVCFV DTSLGRFFIG QFSDDRHCSR FRTLVAHYPP VQVLFEKGNL 600 SMDTKMILKS SLSSSLQEGL IPGSQFWDAA KTLRTLLEEG YFIDKLNEDG GVMLPQVLKG 660 MTSESDSIGL TPGEKSELAL SALGGCVFYL KKCLIDQELL SMANFEEYVP LDSDMVHATR 720 PGAVFAKANQ RMVLDAVTLN NLEIFLNGTN GSTEGTLLEK IDTCHTPFGK RLLKQWLCAP 780 LCNPHAINDR LDAIEDLMVV PDKISEVVDL LKKLPDLERL LSKIHNVGSP LKSQNHPDSR 840 AIMYEETTYS KKKIIDFLSA LEGFKVICKI IGIMEEVIDD FKSKILKQVL TLQTKSPEGR 900 FPDLTSELNR WDTAFDHEKA RKTGLITPKA GFDSDYDQAL ADIRENEQSL LEYLEKQRSR 960 IGCRTIVYWG IGRNRYQLEI PENFITRNLP EEYELKSTKK GCKRYWTKTI EKKLGNLINA 1020 EERRDASLKD CMRRLFYNFD KNYKDWQAAV ECIAVLDVLL CLTNYSRGGD GPMCRPIILL 1080 PEEDTPPFLD LKGSRHPCIT KTFFGDDFIP NDILIGCEEE EEENGKAYCV LVTGPNMGGK 1140 STLMRQAGLL AIMAQMGCYV PAEVCRLTPI DRVFTRLGAS DRIMSGESTF FVELSETASI 1200 LTHATAHSLV LVDELGRGTA TFDGTAIANA VVKELAENIK CRTLFSTHYH SLVEDYSQNV 1260 AVRLGHMACM VENECEDPSQ ETITFLYKFI KGACPKSYGF NAARLANLPE EVIQKGHRKA 1320 REFEKMTQSL RLFREVCLAS ERSTVDADAV HKLLTLIEEL |
||||||||||||
Nucleotide Sequence (Fasta) | ATGTCGCGAC AGAGCACGCT GTTTAGCTTC TTCCCTAAGT CTCCAGCCGT GAATAATGCC 60 AACAAAGCCC CCGTCAGAGC CTCAAGTGGA AGCAGCGCCG CCGCCGCTGC CACCGGGGCC 120 TCCCCTTCCC CAGGCGGGGA TGCGGCCTGG AACGAGGCCG GGCCTGAGCC CCTAGCGGGT 180 ACCACGTCGC GGGCCGAGGC CAGGAACCTC AACGGAGGGC TGCGGAAGTC GGCATCCCCT 240 GCGGTCCCCG CCAGTTCTTG TGACTTCTCA CCAGGTGATT TGGTTTGGGC CAAGATGGAG 300 GGTTACCCTT GGTGGCCTTG CCTGGTTTAC AACCACCCTT TTGATGGAAC ATTCATCCGT 360 GAGAAAGGAA AGTCTGCCCG AGTTCATGTA CAGTTTTTTG ATGACAGCCC AACACGGGGC 420 TGGGTTAGCA GAAGGCTATT AAAGCCATAT ACAGGTTCAC ATTCAAAGGA AGCCCAGAAA 480 GGAGGTCATT TTTATAGTGC AAAGCCTGAA ATACTCAGAG CAATGCAACG TGCAGATGAA 540 GCCTTGAATA AAGACAAGAT TAAGAGGCTT GAGTTGGCAG TATGTGATGA GCCCTCAGAG 600 CCAGAGGAGG AAGAAGAGAC GGAGGTAGGT GCCACTTACA CATCAGATAA GAGTGAAGAG 660 GAAAATGACA TTGAGAGTGA AGAGGAAGTG AGGCCAAAGG TGCAAGGATC CAGACGAAGT 720 AGCCGACAAA TAAAAAAACG AAGGGTCATA TCAGACTCTG AGAGTGACGT TGGTGGCTCT 780 GATGTGGAAT TCAAGCCAGA CACTAAGGAG GAAGCAAGCA GTGATGAAAT AAGCAGTGGC 840 GTGGGGGACA GTGATAGTGA AGGCCTGGAC ACCCCCGTCA AAGTTGCTCC AAAGCGGAAG 900 AGAATGGTAA CTGGGAATGG TTCCCTCAAG AGGAAGAGTT CAAGGAAGGA AATGCCTTCA 960 GCCACCAAAC GAGCAACTGG CATTTCATCA GAAACCAAGA ATACTTTGAG TGCTTTCTCT 1020 GTCCCTCAAA ATTCTGAACC CCAAGCCCAC ATTAGTGGGG GATGTGATGA CAGTAATCGC 1080 CCCACTGTCT GGTATCATGA AACTTTAGAG TGGCTTAAGG AGGAAAAGAG AAGAGATGTG 1140 CACAGGAGGC GGCCTGATCA CCCTGATTTT GATGCATCCA CACTCTATGT GCCCGAGGAT 1200 TTCCTTAATT CCTGTACTCC TGGGATGAGG AAGTGGTGGC AAATTAAGTC TCAGAACTTT 1260 GATCTGGTCA TATTTTATAA AGTGGGGAAG TTTTATGAGA TGTACCACAT GGATGCTCTT 1320 ATTGGGGTCA GTGAACTAGG GCTGGTATTC ATGAAAGGCA ACTGGGCCCA TTCTGGTTTC 1380 CCTGAAATTG CATTTGGCCG ATACTCAGAT TCCCTGGTCC AGAAGGGCTA TAAAGTAGCA 1440 CGAGTAGAAC AGACTGAGAC CCCAGAAATG ATGGAGGCAC GATGCCGAAA AATGGCACAC 1500 ATATCTAAGT ATGACAGAGT GGTGCGGAGG GAGATCTGTA GGATCATTAC GAAGGGTACA 1560 CAGACCTACA GTGTGCTGGA AGGTGACCCC TCGGAGAACT ACAGTAAATA CCTTCTTAGC 1620 CTCAAAGAGA AAGAAGAGGA GTCTTCTGGC CACACTCGCG TGTATGGAGT ATGCTTTGTT 1680 GATACCTCTC TAGGGAGGTT CTTCATAGGT CAGTTTTCAG ATGATCGCCA TTGTTCCAGG 1740 TTTAGGACTT TAGTGGCACA CTATCCTCCA GTACAAGTCT TATTTGAGAA GGGAAATCTC 1800 TCAATGGATA CTAAGATGAT TCTCAAGAGT TCATTATCCT CTTCTCTTCA GGAAGGTCTG 1860 ATACCGGGCT CCCAGTTTTG GGATGCAGCC AAAACTTTGA GAACTCTCCT TGAAGAAGGA 1920 TATTTTATAG ACAAGTTAAA TGAGGACGGT GGGGTGATGC TACCCCAGGT GCTTAAAGGT 1980 ATGACCTCGG AGTCAGATTC TATTGGATTG ACACCAGGAG AGAAGAGTGA ATTAGCCCTC 2040 TCTGCTCTTG GTGGTTGTGT CTTCTACCTC AAAAAATGCC TTATTGATCA GGAGCTTTTA 2100 TCAATGGCTA ATTTTGAAGA ATATGTTCCC TTGGATTCTG ACATGGTCCA TGCTACAAGA 2160 CCTGGTGCTG TCTTTGCTAA AGCCAATCAA CGAATGGTGC TAGATGCTGT GACATTAAAC 2220 AACTTGGAGA TTTTTCTGAA TGGAACAAAC GGTTCTACTG AAGGGACCTT GTTAGAGAAG 2280 ATTGACACCT GCCATACTCC CTTTGGTAAA CGGCTCTTAA AGCAGTGGCT CTGTGCCCCA 2340 CTCTGTAACC CTCATGCGAT CAATGATCGC CTGGATGCCA TAGAAGACCT AATGGTTGTG 2400 CCTGACAAAA TCTCTGAGGT TGTAGACCTT CTTAAGAAGC TTCCAGACCT TGAGAGACTC 2460 CTGAGTAAAA TTCATAATGT TGGGTCTCCC CTCAAGAGCC AGAACCACCC AGATAGCAGG 2520 GCTATAATGT ATGAAGAAAC CACATACAGC AAAAAAAAGA TTATTGATTT TCTTTCTGCT 2580 CTGGAAGGAT TCAAAGTAAT ATGTAAAATT ATAGGGATTA TGGAAGAAGT CATTGATGAC 2640 TTTAAGTCTA AAATCCTCAA GCAGGTCCTT ACTCTGCAGA CAAAAAGTCC TGAAGGGCGC 2700 TTTCCTGATT TGACTTCAGA ACTGAACCGA TGGGATACAG CCTTTGACCA TGAAAAGGCT 2760 CGAAAGACTG GACTGATTAC TCCCAAAGCA GGATTTGACT CTGATTATGA CCAAGCTCTT 2820 GCTGACATAA GAGAAAATGA ACAGAGCCTC CTGGAATACT TGGAGAAACA GCGCAGTCGA 2880 ATTGGCTGTA GAACCATAGT TTACTGGGGA ATTGGTAGGA ATCGTTACCA GTTGGAAATT 2940 CCAGAGAATT TCATCACCCG TAATTTGCCA GAAGAATATG AGTTGAAATC CACCAAAAAG 3000 GGTTGTAAAC GATACTGGAC CAAAACAATT GAGAAGAAGT TGGGTAATCT GATAAACGCC 3060 GAAGAACGGA GAGACGCATC ATTAAAGGAC TGCATGAGGC GACTGTTCTA TAACTTTGAT 3120 AAAAATTACA AGGACTGGCA GGCTGCTGTG GAGTGCATCG CAGTGTTGGA TGTGTTATTG 3180 TGCCTGACCA ACTACAGTCG AGGGGGTGAT GGTCCTATGT GTCGTCCAAT AATTCTGTTG 3240 CCAGAAGAAG ATACTCCTCC CTTTCTAGAC CTTAAAGGAT CACGCCATCC CTGCATTACG 3300 AAGACTTTTT TTGGTGATGA CTTTATTCCT AATGACATTC TAATAGGCTG TGAGGAAGAG 3360 GAGGAGGAAA ATGGCAAAGC TTATTGTGTG CTTGTTACTG GACCAAATAT GGGAGGCAAG 3420 TCTACACTTA TGAGACAGGC TGGCCTATTA GCTATAATGG CCCAGATGGG CTGTTACGTA 3480 CCAGCTGAAG TGTGTAGGCT CACACCAATT GATAGAGTGT TTACTAGACT TGGTGCCTCA 3540 GACAGAATAA TGTCAGGTGA AAGTACATTT TTTGTTGAAT TGAGTGAAAC TGCCAGTATA 3600 CTTACACATG CAACAGCACA TTCTCTGGTG CTTGTGGATG AATTAGGAAG AGGTACCGCA 3660 ACATTCGATG GGACAGCAAT AGCAAACGCA GTTGTGAAAG AACTTGCTGA GAACATAAAG 3720 TGTCGTACGT TGTTTTCCAC CCACTACCAT TCATTAGTTG AAGATTATTC TCAAAATGTT 3780 GCAGTGCGCC TAGGACACAT GGCATGCATG GTAGAAAATG AATGTGAAGA TCCCAGCCAG 3840 GAGACAATTA CCTTCCTCTA TAAATTCATT AAAGGAGCCT GTCCGAAAAG CTATGGTTTT 3900 AATGCAGCAA GGCTTGCTAA TCTTCCAGAG GAGGTTATTC AAAAGGGACA TAGAAAAGCA 3960 AGAGAATTTG AGAAGATGAC TCAGTCACTG CGGTTATTTC GGGAGGTTTG TTTGGCTAGT 4020 GAAAGGTCTA CTGTGGATGC TGATGCTGTC CATAAGTTGC TGACTTTGAT TGAGGAATTA 4080 TAGACTACAT TGCAAGCTTT GAGTTCACTT TTGACAAAGG TGGTAAATTC AGACAACATT 4140 ATGATCTAAT AAACTTTATT TTTT 4165 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |