WERAM Information


Tag Content
WERAM ID WERAM-Gog-0195
Ensembl Protein ID ENSGGOP00000018924.1
Gene Name MSH6
Ensembl Information
Ensembl Gene ID       Ensembl Transcript ID  Ensembl Protein ID
ENSGGOG00000021987.1  ENSGGOT00000031654.1   ENSGGOP00000018924.1
Status Unreviewed
Classification
Type       Family  E-value   Score  Start  End
Me_Reader  PWWP    6.10e-23  81.2   94     155
Organism Gorilla gorilla
Domain Profile
  Me_Reader PWWP

            PWWP.txt   1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63
                         +gdLVwaK++gYpwWP+lv+++p++++ + +++ ++ +++V+FF+++++r wv+++ l+py++
ENSGGOP00000018924.1  94 PGDLVWAKMEGYPWWPCLVYNHPFDGTFI-REKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTG 155
                         69***************************.*******************************85 PP
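
The coordinates of the PWWP hit can be cross-checked against the protein sequence listed in this record: residues 94 to 155 should equal the query row of the alignment once its single gap character is removed. The Python sketch below performs that check. The names `PROTEIN_PREFIX`, `ALIGNED_QUERY`, and `domain_slice` are illustrative and not part of WERAM, and only the first 180 residues are embedded to keep the example short.

```python
# First 180 residues of ENSGGOP00000018924.1, copied from the Protein Sequence block.
PROTEIN_PREFIX = (
    "MSRQSTLYSFFPKSPALSDANKASARASREGGRAAAAPGASPSPGGDAAWSEAGPGPRPL"
    "ARSASPPKAKNLNGGLRRSAAPAAPTSNSCDFSPGDLVWAKMEGYPWWPCLVYNHPFDGT"
    "FIREKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTGKSHYWMKLSVFYPSKLQGLAVVKAS"
)

# Query row of the domain alignment, including the single gap ("-").
ALIGNED_QUERY = "PGDLVWAKMEGYPWWPCLVYNHPFDGTFI-REKGKSVRVHVQFFDDSPTRGWVSKRLLKPYTG"

# Start and End from the Classification table (1-based, inclusive).
START, END = 94, 155


def domain_slice(sequence: str, start: int, end: int) -> str:
    """Return residues start..end using 1-based, inclusive coordinates."""
    return sequence[start - 1:end]


domain = domain_slice(PROTEIN_PREFIX, START, END)
ungapped = ALIGNED_QUERY.replace("-", "")

# The slice should be 62 residues long and identical to the ungapped query row.
print(len(domain), domain == ungapped)
```

The alignment spans 63 columns because the query row carries one gap; the underlying protein segment is 62 residues.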

Protein Sequence
MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60
ARSASPPKAK NLNGGLRRSA APAAPTSNSC DFSPGDLVWA KMEGYPWWPC LVYNHPFDGT 120
FIREKGKSVR VHVQFFDDSP TRGWVSKRLL KPYTGKSHYW MKLSVFYPSK LQGLAVVKAS 180
GMGKDLAKKD SQTPFKSSAS PEENKLVQVG TTYVTDKSEE DNEIESEEEV QPKTQGSRRS 240
SRQIKKRRVI SDSESDIGGS DVEFKPDTKE EGSSDEISSG VGDSESEGLN SPVKVARKRK 300
RMVTGNGSLK RKSSRKETPS ATKQATSISS ETKNTLRAFS APQNSESQAH VSGGGDDSSR 360
PTVWYHETLE WLKEEKRRDE HRRRPDHPDF DASTLYVPED FLNSCTPGMR KWWQIKSQNF 420
DLVICYKVGK FYELYHMDAL IGVSELGLVF MKGNWAHSGF PEIAFGRYSD SLVQKGYKVA 480
RVEQTETPEM MEARCRKMAH ISKYDRVVRR EICRIITKGT QTYSVLEGDP SENYSKYLLS 540
LKEKEEDSSG HTRAYGVCFV DTSLGKFFIG QFSDDRHCSR FRTLVAHYPP VQVLFEKGNL 600
SKETKTILKS SLSSSLQEGL IPGSQFWDAS KTLRTLLEEE YFREKLSDGI GVMLPQVLKG 660
MTSESDSIGL TPGEKSELAL SALGGCVFYL KKCLIDQELL SMANFEEYIP LDSDTVSTTR 720
SGAIFTKAYQ RMVLDAVTLN NLEIFLNGTN GSTEGTLLER VDTCHTPFGK RLLKQWLCAP 780
LCNPYAINDR LDAIEDLMVV PDKISEVVEL LKKLPDLERL LSKIHNVGSP LKSQNHPDSR 840
AIMYEETTYS KKKIIDFLSA LEGFKVMCKI IGIMEEVADG FKSKILKQVI SLQTKNPEGR 900
FPDLTVELNR WDTAFDHEKA RKTGLITPKA GFDSDYDQAL ADIRENEQSL LEYLEKQRNR 960
IGCRTIVYWG IGRNRYQLEI PENFTTRNLP EEYELKSTKK GCKRYWTKTI EKKLANLINA 1020
EERRDVSLKD CMRRLFYNFD KNYKDWQSAV ECIAVLDVLL CLANYSRGGD GPMCRPVILL 1080
PEDTPPFLEL KGSRHPCITK TFFGDDFIPN DILIGCEEEE QENGKAYCVL VTGPNMGGKS 1140
TLMRQAGLLA VMAQMGCYVP AEVCRLTPID RVFTRLGASD RIMSGESTFF VELSETASIL 1200
MHATAHSLVL VDELGRGTAT FDGTAIANAV VKELAETIQC RTLFSTHYHS LVEDYSQNVA 1260
VRLGHMACMV ENECEDPSQE TITFLYKFIK GACPKSYGFN AARLANLPEE VIQKGHRKAR 1320
EFEKMNQSLR LFREVCLASE RSTVDAEAVH KLLTLIKEL 1359
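
The protein sequence above is printed in groups of ten residues, with the cumulative position of the last residue at the end of each line. A minimal Python sketch for turning such a block back into a contiguous string follows. The function name `parse_numbered_block` is hypothetical, and only the first two lines of the block are embedded as sample input.

```python
import re

# First two lines of the Protein Sequence block, copied verbatim.
BLOCK = """\
MSRQSTLYSF FPKSPALSDA NKASARASRE GGRAAAAPGA SPSPGGDAAW SEAGPGPRPL 60
ARSASPPKAK NLNGGLRRSA APAAPTSNSC DFSPGDLVWA KMEGYPWWPC LVYNHPFDGT 120
"""


def parse_numbered_block(block: str) -> str:
    """Join a grouped, position-annotated sequence block into one string.

    Each line is expected to end with the cumulative position of its last
    residue; a mismatch raises ValueError so truncated copies are caught.
    """
    pieces = []
    total = 0
    for raw_line in block.splitlines():
        line = raw_line.strip()
        if not line:
            continue
        match = re.fullmatch(r"([A-Za-z ]+?)\s+(\d+)", line)
        if match is None:
            raise ValueError(f"unrecognised line: {line!r}")
        residues = match.group(1).replace(" ", "")
        total += len(residues)
        if total != int(match.group(2)):
            raise ValueError(
                f"expected {match.group(2)} residues, counted {total}"
            )
        pieces.append(residues)
    return "".join(pieces)


seq = parse_numbered_block(BLOCK)
print(len(seq), seq[:10])
```

The same function should also work on the nucleotide block, since it uses the same grouping and position counters.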
Nucleotide Sequence
ATGTCGCGAC AGAGCACCCT GTACAGCTTC TTCCCCAAGT CTCCGGCGCT GAGTGATGCC 60
AACAAGGCCT CGGCCCGGGC CTCACGCGAA GGCGGCCGTG CCGCCGCTGC CCCCGGGGCC 120
TCTCCTTCCC CAGGCGGGGA TGCGGCCTGG AGCGAGGCTG GGCCTGGGCC CAGGCCCTTG 180
GCGCGCTCCG CGTCGCCGCC CAAGGCGAAG AACCTCAACG GAGGGCTGCG GAGATCGGCA 240
GCGCCTGCTG CCCCCACCAG CAACAGTTGT GACTTCTCAC CAGGAGATTT GGTTTGGGCC 300
AAGATGGAGG GTTACCCCTG GTGGCCTTGT CTGGTTTACA ACCACCCCTT TGATGGAACA 360
TTCATCCGTG AGAAAGGGAA ATCAGTCCGT GTTCATGTAC AGTTTTTTGA TGACAGCCCA 420
ACAAGGGGCT GGGTTAGCAA AAGGCTTTTA AAGCCATATA CAGGTAAGAG TCACTACTGG 480
ATGAAATTAA GTGTATTTTA TCCCAGTAAA TTGCAAGGGT TGGCAGTTGT GAAAGCTTCC 540
GGCATGGGAA AGGATCTGGC TAAAAAAGAT TCTCAAACCC CTTTTAAATC TTCTGCCTCA 600
CCTGAAGAGA ACAAGCTTGT CCAGGTAGGC ACAACTTACG TAACAGATAA GAGTGAAGAA 660
GATAATGAAA TTGAGAGTGA AGAGGAAGTA CAGCCTAAGA CACAAGGATC TAGGCGAAGT 720
AGCCGCCAAA TAAAAAAACG AAGGGTCATA TCAGACTCTG AGAGTGACAT TGGTGGCTCT 780
GATGTGGAAT TTAAGCCAGA CACTAAGGAG GAAGGAAGCA GTGATGAAAT AAGCAGTGGA 840
GTGGGGGATA GTGAGAGTGA AGGCCTGAAC AGCCCTGTCA AAGTTGCTCG AAAGCGGAAG 900
AGAATGGTGA CTGGAAATGG CTCTCTTAAA AGGAAAAGCT CTAGGAAGGA AACGCCCTCG 960
GCCACCAAAC AAGCAACTAG CATTTCATCA GAAACCAAGA ATACTTTGAG AGCTTTCTCT 1020
GCCCCTCAAA ATTCTGAATC CCAAGCCCAT GTTAGTGGAG GTGGTGATGA CAGTAGTCGC 1080
CCTACTGTTT GGTATCATGA AACTTTAGAA TGGCTTAAGG AGGAAAAGAG AAGAGATGAG 1140
CACAGGAGGA GGCCTGATCA CCCCGATTTT GATGCATCTA CACTCTATGT GCCTGAGGAT 1200
TTCCTCAATT CTTGTACTCC TGGGATGAGG AAGTGGTGGC AGATTAAGTC TCAGAACTTT 1260
GATCTTGTCA TCTGTTACAA GGTGGGGAAA TTTTATGAGC TGTACCACAT GGATGCTCTT 1320
ATTGGAGTCA GTGAACTGGG GCTGGTATTC ATGAAAGGCA ACTGGGCCCA TTCTGGCTTT 1380
CCTGAAATTG CATTTGGCCG TTATTCAGAT TCCCTGGTGC AGAAGGGCTA TAAAGTAGCA 1440
CGAGTGGAAC AGACTGAGAC TCCAGAAATG ATGGAGGCAC GATGTAGAAA GATGGCACAT 1500
ATATCCAAGT ATGACAGAGT GGTGAGGAGG GAGATCTGTA GGATCATTAC CAAGGGTACA 1560
CAGACTTACA GTGTGCTGGA AGGTGATCCC TCTGAGAACT ACAGTAAGTA TCTTCTTAGC 1620
CTCAAAGAAA AAGAGGAAGA TTCTTCTGGC CATACTCGTG CATATGGTGT GTGCTTTGTT 1680
GATACTTCGC TGGGAAAGTT TTTCATAGGT CAGTTTTCAG ATGATCGCCA TTGTTCGAGA 1740
TTTAGGACTC TGGTGGCACA CTATCCCCCA GTACAAGTCT TATTTGAAAA AGGAAATCTC 1800
TCAAAGGAAA CTAAAACAAT TCTAAAGAGT TCATTGTCCT CTTCTCTTCA GGAAGGTCTG 1860
ATACCCGGCT CCCAGTTTTG GGATGCATCC AAAACTTTGA GAACTCTCCT TGAGGAAGAA 1920
TATTTTAGGG AAAAGCTAAG TGATGGCATT GGGGTGATGT TACCCCAGGT GCTTAAAGGT 1980
ATGACTTCAG AGTCTGATTC CATTGGGTTG ACACCAGGAG AGAAAAGTGA ATTGGCCCTC 2040
TCTGCTCTAG GTGGTTGTGT CTTCTACCTC AAAAAATGCC TTATTGATCA GGAGCTTTTA 2100
TCAATGGCTA ATTTTGAAGA ATATATTCCC TTGGATTCTG ACACAGTCAG CACTACAAGA 2160
TCTGGTGCTA TCTTCACCAA AGCCTATCAA CGAATGGTGC TAGATGCAGT GACATTAAAC 2220
AACTTGGAGA TTTTTCTGAA TGGAACAAAT GGTTCTACTG AAGGAACCCT ACTAGAGAGG 2280
GTTGATACTT GCCATACTCC TTTTGGTAAG CGGCTCCTAA AGCAATGGCT TTGTGCCCCA 2340
CTCTGTAACC CTTATGCTAT TAATGATCGT CTAGATGCCA TAGAAGACCT CATGGTTGTG 2400
CCTGACAAAA TCTCTGAAGT TGTAGAGCTT CTAAAGAAGC TTCCAGATCT TGAGAGACTA 2460
CTCAGTAAAA TTCATAATGT TGGGTCTCCC CTGAAGAGTC AGAACCACCC AGACAGCAGG 2520
GCTATAATGT ATGAAGAAAC TACATACAGC AAAAAGAAGA TTATTGATTT TCTTTCTGCT 2580
CTGGAAGGAT TCAAAGTAAT GTGTAAAATT ATAGGGATCA TGGAAGAAGT CGCTGATGGT 2640
TTTAAGTCTA AAATCCTTAA GCAGGTCATC TCTCTGCAGA CAAAAAATCC TGAAGGTCGT 2700
TTTCCTGATT TGACTGTAGA ATTGAACCGA TGGGATACAG CCTTTGACCA TGAAAAGGCT 2760
CGAAAGACTG GACTTATTAC TCCCAAAGCA GGCTTTGACT CTGATTATGA CCAAGCTCTT 2820
GCTGACATAA GAGAAAATGA ACAGAGCCTC CTGGAATACC TAGAGAAGCA GCGCAACAGA 2880
ATTGGCTGTA GGACCATAGT CTATTGGGGG ATTGGTAGGA ACCGTTACCA GCTGGAAATT 2940
CCTGAGAATT TCACCACTCG CAATTTGCCA GAAGAATACG AATTGAAATC TACCAAGAAG 3000
GGCTGTAAAC GATACTGGAC CAAAACTATT GAAAAGAAGT TGGCTAATCT CATAAATGCT 3060
GAGGAACGGA GAGATGTATC ATTGAAGGAC TGCATGCGGC GACTGTTCTA TAACTTTGAT 3120
AAAAATTACA AGGACTGGCA GTCTGCTGTA GAGTGTATCG CAGTGTTGGA TGTTTTACTG 3180
TGCCTGGCTA ACTATAGTCG AGGGGGTGAT GGTCCTATGT GTCGCCCAGT AATTCTGTTG 3240
CCAGAAGATA CCCCCCCCTT CTTAGAGCTT AAAGGATCAC GCCATCCTTG CATTACAAAG 3300
ACTTTTTTTG GAGATGATTT TATTCCTAAT GACATTCTAA TAGGCTGTGA GGAAGAGGAG 3360
CAGGAAAATG GCAAAGCCTA TTGTGTGCTT GTTACTGGAC CGAATATGGG GGGCAAGTCT 3420
ACGCTTATGA GACAGGCTGG CTTATTAGCT GTAATGGCCC AGATGGGTTG TTACGTCCCT 3480
GCTGAAGTGT GCAGGCTCAC ACCAATTGAT AGAGTGTTTA CTAGACTTGG TGCCTCAGAC 3540
AGAATAATGT CAGGTGAAAG TACATTTTTT GTTGAATTAA GTGAAACTGC CAGCATACTC 3600
ATGCATGCAA CAGCACATTC TCTGGTGCTT GTGGATGAAT TAGGAAGAGG TACTGCAACA 3660
TTTGATGGGA CAGCAATAGC AAATGCAGTT GTTAAAGAAC TTGCTGAGAC TATACAATGT 3720
CGTACATTAT TTTCAACTCA CTACCATTCA TTAGTAGAAG ATTATTCTCA AAATGTTGCT 3780
GTGCGCCTAG GACATATGGC ATGCATGGTA GAAAATGAAT GTGAAGACCC CAGCCAGGAG 3840
ACTATTACCT TCCTCTATAA ATTCATTAAG GGAGCTTGTC CTAAAAGCTA TGGCTTTAAT 3900
GCAGCAAGGC TTGCTAATCT CCCAGAGGAA GTTATTCAAA AGGGACATAG AAAAGCAAGA 3960
GAATTTGAGA AGATGAATCA GTCACTACGA TTATTTCGGG AAGTTTGCCT GGCTAGTGAA 4020
AGGTCAACTG TAGATGCTGA AGCTGTCCAT AAATTGCTGA CTTTGATTAA GGAATTATAG 4080
ACTACATTGG AAGCTTTGAG TTGACTTCTG ACAAAGGTGG TAAATTCAGA CAACATTATG 4140
ATCTAATAAA CTTTATTTTT T 4162
Sequence Source Ensembl
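
The nucleotide and protein sequences in this record can be checked against each other. The protein has 1359 residues, so a coding region that starts at position 1 and includes a stop codon would span 3 × 1360 = 4080 nt, leaving 82 nt of the 4162 nt transcript downstream of the stop. The Python sketch below translates the first 30 nt and the 60 nt ending at position 4080, assuming the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard genetic code in the conventional TCAG ordering (assumed here).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Positions 1-30 and 4021-4080 of the nucleotide sequence, spaces removed.
FIRST_30_NT = "ATGTCGCGACAGAGCACCCTGTACAGCTTC"
LAST_CDS_60_NT = (
    "AGGTCAACTGTAGATGCTGAAGCTGTCCATAAATTGCTGACTTTGATTAAGGAATTATAG"
)


def translate(dna: str) -> str:
    """Translate DNA in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


print(translate(FIRST_30_NT))     # compare with protein residues 1-10
print(translate(LAST_CDS_60_NT))  # compare with protein residues 1341-1359
```

Both translations agree with the listed protein: the first ten residues are MSRQSTLYSF, and the final stretch ends in KLLTLIKEL followed by a TAG stop codon at positions 4078 to 4080.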
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Hos-0089 ENSP00000234420.4 Homo sapiens 96 0.0 2431
WERAM-Poa-0108 ENSPPYP00000013873.2 Pongo abelii 96 0.0 2405
WERAM-Chs-0168 ENSCSAP00000009745.1 Chlorocebus sabaeus 95 0.0 2401
WERAM-Mam-0208 ENSMMUP00000030085.2 Macaca mulatta 95 0.0 2400
WERAM-Pat-0096 ENSPTRP00000020434.4 Pan troglodytes 96 0.0 2394
WERAM-Paa-0131 ENSPANP00000007626.1 Papio anubis 95 0.0 2393
WERAM-Nol-0087 ENSNLEP00000010403.1 Nomascus leucogenys 96 0.0 2319
WERAM-Ict-0166 ENSSTOP00000016600.1 Ictidomys tridecemlineatus 90 0.0 2274
WERAM-Otg-0214 ENSOGAP00000021215.1 Otolemur garnettii 90 0.0 2270
WERAM-Aim-0076 ENSAMEP00000006662.1 Ailuropoda melanoleuca 90 0.0 2267
WERAM-Mup-0123 ENSMPUP00000010923.1 Mustela putorius furo 89 0.0 2259
WERAM-Bot-0012 ENSBTAP00000001867.5 Bos taurus 89 0.0 2256
WERAM-Dan-0137 ENSDNOP00000026714.1 Dasypus novemcinctus 88 0.0 2243
WERAM-Ptv-0116 ENSPVAP00000010278.1 Pteropus vampyrus 88 0.0 2232
WERAM-Ova-0042 ENSOARP00000005406.1 Ovis aries 88 0.0 2224
WERAM-Tut-0091 ENSTTRP00000007640.1 Tursiops truncatus 88 0.0 2215
WERAM-Myl-0101 ENSMLUP00000008006.2 Myotis lucifugus 87 0.0 2214
WERAM-Orc-0143 ENSOCUP00000012569.2 Oryctolagus cuniculus 88 0.0 2206
WERAM-Sus-0141 ENSSSCP00000023698.1 Sus scrofa 88 0.0 2199
WERAM-Loa-0023 ENSLAFP00000001328.3 Loxodonta africana 90 0.0 2182
WERAM-Prc-0145 ENSPCAP00000013528.1 Procavia capensis 86 0.0 2178
WERAM-Cap-0004 ENSCPOP00000000836.2 Cavia porcellus 85 0.0 2171
WERAM-Ran-0105 ENSRNOP00000021923.6 Rattus norvegicus 83 0.0 2143
WERAM-Caf-0034 ENSCAFP00000003882.3 Canis familiaris 90 0.0 2141
WERAM-Mum-0014 ENSMUSP00000005503.3 Mus musculus 83 0.0 2105
WERAM-Fec-0079 ENSFCAP00000006578.2 Felis catus 89 0.0 2070
WERAM-Sah-0081 ENSSHAP00000009277.1 Sarcophilus harrisii 78 0.0 1920
WERAM-Mod-0012 ENSMODP00000001344.2 Monodelphis domestica 75 0.0 1919
WERAM-Caj-0115 ENSCJAP00000020345.2 Callithrix jacchus 92 0.0 1734
WERAM-Gaga-0092 ENSGALP00000038835.2 Gallus gallus 68 0.0 1692
WERAM-Xet-0147 ENSXETP00000049000.3 Xenopus tropicalis 64 0.0 1679
WERAM-Meg-0053 ENSMGAP00000005001.2 Meleagris gallopavo 68 0.0 1677
WERAM-Pes-0030 ENSPSIP00000005328.1 Pelodiscus sinensis 69 0.0 1662
WERAM-Tag-0082 ENSTGUP00000005725.1 Taeniopygia guttata 69 0.0 1655
WERAM-Lac-0149 ENSLACP00000017187.1 Latimeria chalumnae 62 0.0 1634
WERAM-Anp-0154 ENSAPLP00000015904.1 Anas platyrhynchos 74 0.0 1613
WERAM-Ocp-0051 ENSOPRP00000004458.1 Ochotona princeps 90 0.0 1573
WERAM-Anc-0146 ENSACAP00000013849.3 Anolis carolinensis 60 0.0 1561
WERAM-Orn-0123 ENSONIP00000012391.1 Oreochromis niloticus 59 0.0 1521
WERAM-Leo-0169 ENSLOCP00000020114.1 Lepisosteus oculatus 69 0.0 1450
WERAM-Tar-0161 ENSTRUP00000034377.1 Takifugu rubripes 67 0.0 1442
WERAM-Orla-0115 ENSORLP00000013893.1 Oryzias latipes 68 0.0 1437
WERAM-Pof-0182 ENSPFOP00000015502.2 Poecilia formosa 67 0.0 1434
WERAM-Xim-0134 ENSXMAP00000011014.1 Xiphophorus maculatus 67 0.0 1426
WERAM-Ten-0197 ENSTNIP00000019708.1 Tetraodon nigroviridis 68 0.0 1402
WERAM-Gaa-0054 ENSGACP00000007160.1 Gasterosteus aculeatus 66 0.0 1400
WERAM-Gam-0173 ENSGMOP00000017118.1 Gadus morhua 68 0.0 1372
WERAM-Asm-0007 ENSAMXP00000001159.1 Astyanax mexicanus 66 0.0 1307
WERAM-Tub-0139 ENSTBEP00000014916.1 Tupaia belangeri 87 0.0 1280
WERAM-Dio-0122 ENSDORP00000011809.1 Dipodomys ordii 92 0.0 1279
WERAM-Pem-0088 ENSPMAP00000009336.1 Petromyzon marinus 59 0.0 1237
WERAM-Cis-0032 ENSCSAVP00000006844.1 Ciona savignyi 50 0.0 1006
WERAM-Ere-0070 ENSEEUP00000005594.1 Erinaceus europaeus 90 0.0 972
WERAM-Eqc-0026 ENSECAP00000004388.1 Equus caballus 95 0.0 972
WERAM-Tas-0125 ENSTSYP00000013319.1 Tarsius syrichta 94 0.0 832
WERAM-Ect-0048 ENSETEP00000005096.1 Echinops telfairi 89 0.0 795
WERAM-Met-0160 AES82183 Medicago truncatula 41 0.0 652
WERAM-Chh-0108 ENSCHOP00000011885.1 Choloepus hoffmanni 94 3e-180 630
WERAM-Mim-0103 ENSMICP00000009629.1 Microcebus murinus 93 2e-168 591
WERAM-Vip-0079 ENSVPAP00000007177.1 Vicugna pacos 94 5e-168 589
WERAM-Dar-0233 ENSDARP00000130154.1 Danio rerio 63 6e-24 111
WERAM-Soa-0010 ENSSARP00000001031.1 Sorex araneus 44 2e-12 73.2
WERAM-Fia-0121 ENSFALP00000010353.1 Ficedula albicollis 44 3e-12 72.4
WERAM-Mae-0114 ENSMEUP00000010800.1 Macropus eugenii 39 4e-11 68.9
WERAM-Ora-0032 ENSOANP00000005155.1 Ornithorhynchus anatinus 37 6e-11 68.2
WERAM-Sei-0090 Si024584m Setaria italica 39 5e-06 51.6
Created Date 25-Jun-2016
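
Rows of the Orthology table are space-separated, but species names span two or three words (for example "Mustela putorius furo"), so splitting on whitespace alone does not give fixed columns. One way to parse a row, sketched in Python, is to take the two identifiers from the left and the three numeric fields from the right, and treat whatever remains as the species. The `Ortholog` type and `parse_row` function are hypothetical helpers, and the Identity column is assumed to be a whole-number percentage.

```python
from typing import NamedTuple


class Ortholog(NamedTuple):
    weram_id: str
    protein_id: str
    species: str
    identity: int   # assumed to be percent identity
    e_value: float
    score: float


def parse_row(line: str) -> Ortholog:
    """Parse one Orthology row; the species name may contain spaces."""
    fields = line.split()
    if len(fields) < 6:
        raise ValueError(f"too few fields: {line!r}")
    weram_id, protein_id = fields[0], fields[1]
    identity, e_value, score = fields[-3:]
    species = " ".join(fields[2:-3])
    return Ortholog(
        weram_id, protein_id, species,
        int(identity), float(e_value), float(score),
    )


# A few rows copied verbatim from the table.
ROWS = [
    "WERAM-Hos-0089 ENSP00000234420.4 Homo sapiens 96 0.0 2431",
    "WERAM-Mup-0123 ENSMPUP00000010923.1 Mustela putorius furo 89 0.0 2259",
    "WERAM-Chh-0108 ENSCHOP00000011885.1 Choloepus hoffmanni 94 3e-180 630",
    "WERAM-Soa-0010 ENSSARP00000001031.1 Sorex araneus 44 2e-12 73.2",
]

orthologs = [parse_row(row) for row in ROWS]

# Example filter: hits with an E-value of 0.0 and at least 90 identity.
strong = [o.species for o in orthologs if o.e_value == 0.0 and o.identity >= 90]
print(strong)
```

Parsing from both ends keeps the code independent of how many words a species name contains.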