WERAM Information
Tag | Content
---|---
WERAM ID | WERAM-Pem-0088
Ensembl Protein ID | ENSPMAP00000009336.1
Gene Name | msh6
Ensembl Information |
Status | Unreviewed
Classification |
Organism | Petromyzon marinus
Domain Profile | Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakklktqeaeenkylVlFFgnkherawvkrkklvpyse 63 |
Protein Sequence (Fasta) | MSQKNTLFKY FNKSPGLTAK AKSAGSPTEA DPPPPGASSP PTEKERGPRA ANAKRERGSD 60 VAQAREAAKA PKSSEESAHR AARSAAAIND NLACIEFQPG DLLWAKLEGY PWWPSLVYDH 120 PESGAHTRGR GKSLRIHVQF FDTPPTRSWV GAKYCKPYNG SSSKEAQRGG VFFSTNPQIL 180 GAMKQADKAI PLSMEKRLDL VVCVEPSDEE DSDEADEVDA IASDVEANGK GDGEEEEEED 240 DGGIEKNGRK NKRRRIVVPS DTDDSDAEFK PDTVDASSDE ASSGVDEKEA SEPESASDPE 300 SPVKKRKRTT TKAPALTPSR QPLMLGLSSV ASSGKAATPK QAQTGSNAKA KLSAFSSDTL 360 ESQNGGGQDG GKEGGGWDHE KLDWLRDGKR RDGKRRLQSD PEYDPSSIYV PDSFLSACTP 420 GMRRWWEIKS TLFDTVLFFK VGKFYELYHM DALTGVSELG LLFMKGTWAH SGFPETSFAR 480 FSDGLVQKGY KVARVEQTET PDMMEARCRA MARPATKLDK VVKREVCRII TKGTQTYSIL 540 DGDPSDAQNK FLLAVREREG PPGGSGNNSG ANETGASVRT YGVCFVDTSV GVFHLGQFTD 600 DRHCSRFRTL VAHHAPAQVL YERGGLSAET QKIVRVTLAS ALQDALTPST QFWDATKTLK 660 VEDERGGSGG GGGSSGALPP VLRAMTAASD SLGLAAADEA ELALSALGAM LFYLRKCLID 720 HELLSMGNFR RYQPPDLVLA GDAHNGGYTG LATRRMVLDG VTLTNLEVLQ SSTTGGVEGT 780 LLVRLDSCLT PFGRRLLRAW LVAPPCDPRA IDDRLDALED LMAVPDKVLQ VKDLLRKMPD 840 LERLLSKIHG LGTPLRASRD HPDSRAVLYE ETAYSKRKIA DFLSTLEGFR AASQAAGLLQ 900 ECAGDFKSRL LRKVVSTEDG GHFPDLSPEL QRWDTAFDHQ KARSSGVITP RPGFDPEYDQ 960 ALLDISANKK LLDEYLEKQR KRLSCKSIVY WGTGRNCFQM EVPDAVAQRN TPEEYQLKST 1020 KKGFRRFHSP EIERLMALLS DAEERRDIAM RDCMRRLFYN FDASYKQWSA AMHCMAVLDT 1080 DVLLSLAHYS EQGGEGANPM CRPSVRPPPS EGGSSGGGPF LELRGSRHPC VSTTFLGDSF 1140 IPNDVVVGLS DGDDDGGARC VLVTGPNMGG KSTLMRQAGL IVIMAQLGCY VPAEACRLTP 1200 VDRVFTRLGA SDRIMSVLRR AGESTFFVEL SETSSILQHA TRHSLVLLDE LGFLLRPRGR 1260 GTATYDGTAI ASAVVHELAE GVRCRTLFST HYHSLVEDFA ASPHVRLGHM ACMVEQENDD 1320 DPSQETITFL YKFVAGACPK SYGFNAARLA GLPDAVVRAG HRKAREFESS TNALRAFR 1378 |
Nucleotide Sequence (Fasta) | ATGTCACAGA AGAACACGCT CTTCAAGTAC TTCAACAAAT CGCCGGGCCT GACCGCCAAG 60 GCCAAGAGCG CTGGCTCGCC CACCGAGGCC GACCCGCCTC CCCCGGGGGC CTCGTCTCCG 120 CCTACCGAGA AGGAGCGCGG TCCGCGGGCG GCGAACGCTA AGCGAGAGCG CGGCTCCGAT 180 GTGGCCCAGG CCCGAGAGGC GGCGAAGGCG CCAAAGTCGT CGGAAGAGAG TGCTCATAGG 240 GCAGCACGTA GCGCCGCAGC CATCAATGAC AATCTGGCAT GCATTGAGTT CCAACCAGGT 300 GACTTGCTGT GGGCCAAGCT GGAGGGATAC CCGTGGTGGC CCAGCCTGGT GTATGACCAC 360 CCCGAGTCCG GAGCGCACAC ACGGGGCCGC GGAAAGTCCC TGAGAATCCA CGTGCAGTTT 420 TTTGATACTC CTCCTACCCG CAGCTGGGTC GGTGCAAAGT ACTGCAAGCC CTACAATGGA 480 TCCAGCTCGA AGGAGGCGCA GAGGGGAGGC GTCTTCTTCA GCACCAATCC TCAGATCCTG 540 GGGGCGATGA AGCAGGCCGA CAAGGCGATT CCGCTGAGCA TGGAAAAGCG CCTAGACCTC 600 GTCGTGTGTG TGGAGCCATC GGATGAAGAG GACAGCGATG AAGCTGATGA AGTGGATGCG 660 ATTGCGTCGG ACGTGGAAGC GAATGGAAAG GGTGACGGCG AGGAGGAAGA GGAGGAGGAC 720 GACGGGGGAA TCGAGAAAAA CGGTCGGAAG AACAAGCGGC GCCGCATTGT GGTGCCCTCC 780 GACACCGATG ACAGCGACGC CGAGTTCAAA CCGGACACGG TCGATGCGAG CAGCGATGAG 840 GCCAGCAGCG GTGTCGATGA GAAGGAAGCA AGCGAGCCCG AGTCCGCCTC CGACCCGGAG 900 TCGCCCGTCA AGAAGCGCAA GCGCACCACA ACCAAGGCTC CCGCCCTAAC GCCCAGTCGC 960 CAGCCGTTGA TGCTGGGTCT GAGCAGCGTT GCGTCGTCCG GAAAGGCGGC GACCCCCAAG 1020 CAAGCCCAGA CGGGTTCCAA TGCCAAGGCT AAGCTGTCCG CCTTCTCGTC GGACACCCTC 1080 GAGAGCCAGA ATGGCGGGGG GCAGGATGGC GGAAAAGAGG GTGGCGGGTG GGACCATGAG 1140 AAGCTGGACT GGTTGCGGGA CGGCAAGCGT CGCGACGGCA AGAGGCGGCT CCAGTCGGAC 1200 CCCGAGTACG ACCCTTCCTC CATCTACGTC CCCGATTCCT TCCTGTCCGC GTGCACGCCT 1260 GGCATGCGGC GCTGGTGGGA GATCAAGAGC ACGTTGTTCG ACACCGTGCT CTTCTTCAAG 1320 GTGGGCAAGT TCTACGAGCT GTACCACATG GACGCGCTGA CGGGCGTGTC GGAGCTGGGC 1380 CTGCTCTTCA TGAAGGGCAC ATGGGCCCAC TCGGGCTTCC CAGAGACCTC CTTTGCGCGC 1440 TTCTCGGATG GGCTGGTGCA GAAGGGCTAC AAGGTGGCGC GAGTGGAGCA GACGGAGACG 1500 CCCGACATGA TGGAGGCTCG CTGCCGCGCC ATGGCTCGCC CGGCCACAAA GTTAGACAAG 1560 GTGGTGAAGC GCGAGGTTTG CCGCATCATC ACCAAGGGCA CGCAGACGTA CAGCATCCTC 1620 GACGGCGATC CCTCAGACGC GCAGAACAAA TTCCTGCTGG CCGTGCGCGA GAGGGAGGGC 
1680 CCGCCCGGCG GCAGTGGCAA CAATAGCGGC GCGAACGAGA CGGGGGCGTC CGTGCGCACG 1740 TACGGCGTGT GTTTCGTCGA CACGTCTGTC GGGGTCTTCC ACCTGGGCCA GTTCACGGAC 1800 GACCGCCACT GCTCGCGCTT CCGCACGCTG GTCGCGCATC ACGCACCCGC GCAGGTTTTG 1860 TACGAGCGCG GGGGCTTGTC AGCCGAGACG CAGAAGATTG TCCGTGTGAC GCTGGCTTCG 1920 GCACTGCAGG ATGCTCTGAC GCCCTCCACT CAGTTCTGGG ATGCCACCAA GACGCTCAAG 1980 GTCGAGGACG AACGCGGAGG CAGCGGCGGC GGTGGGGGCA GTAGCGGTGC CCTGCCGCCC 2040 GTGCTGCGCG CCATGACGGC CGCCAGCGAC TCGCTGGGCC TGGCGGCGGC GGACGAGGCG 2100 GAGCTGGCGC TGTCCGCCCT GGGGGCCATG CTCTTCTACC TTCGCAAGTG CCTCATCGAC 2160 CACGAGCTCC TCTCCATGGG CAACTTCCGC CGCTACCAGC CGCCCGACCT TGTGCTCGCC 2220 GGCGATGCGC ACAACGGTGG TTACACGGGA CTCGCGACTC GCCGCATGGT GCTGGACGGC 2280 GTGACTCTCA CCAACCTGGA GGTTCTGCAG AGCAGCACGA CGGGCGGCGT GGAGGGCACG 2340 TTGCTGGTGC GCCTCGACTC GTGCCTCACG CCGTTCGGGC GCCGCCTGCT GCGCGCCTGG 2400 CTGGTGGCGC CGCCGTGCGA CCCACGAGCC ATCGACGACC GCCTCGACGC GCTGGAGGAC 2460 CTCATGGCCG TCCCCGACAA GGTGTTGCAG GTGAAGGATC TTCTGCGCAA AATGCCCGAC 2520 CTGGAGCGGC TGCTGAGCAA GATTCACGGA CTGGGCACCC CGCTGCGCGC GAGCCGCGAC 2580 CACCCGGACA GCCGCGCCGT CCTCTACGAG GAAACGGCGT ACAGCAAGCG CAAGATCGCC 2640 GACTTCCTCT CCACGCTCGA GGGTTTCCGC GCTGCCAGCC AGGCTGCTGG CCTGCTGCAG 2700 GAGTGCGCGG GCGACTTCAA GTCGCGGCTT CTGCGCAAGG TGGTGTCTAC GGAAGATGGG 2760 GGGCACTTCC CAGACCTGTC CCCCGAGTTG CAGCGCTGGG ACACGGCGTT CGACCACCAG 2820 AAGGCGCGCA GCTCAGGCGT CATCACCCCA CGTCCAGGCT TCGACCCCGA GTATGACCAG 2880 GCGCTCCTGG ACATCTCCGC CAACAAGAAA TTGCTCGATG AATACTTGGA GAAGCAGCGC 2940 AAGCGTCTGA GCTGCAAGTC CATCGTCTAC TGGGGCACGG GTCGCAACTG CTTCCAGATG 3000 GAGGTGCCGG ACGCCGTGGC GCAGAGGAAC ACTCCCGAGG AGTACCAGCT CAAGTCGACC 3060 AAGAAGGGCT TCCGCCGATT CCACTCGCCC GAGATCGAGC GGCTGATGGC GCTTCTCTCG 3120 GACGCCGAGG AGCGGCGCGA CATAGCCATG CGCGACTGCA TGCGCCGCCT CTTCTACAAC 3180 TTCGACGCCA GCTACAAGCA GTGGAGCGCC GCCATGCACT GCATGGCCGT CCTGGACACA 3240 GACGTGCTTC TCAGCCTGGC TCACTACAGC GAGCAGGGCG GAGAGGGCGC TAACCCCATG 3300 TGCCGCCCGA GCGTGCGGCC GCCCCCCTCG GAGGGCGGCT CGTCGGGGGG GGGGCCCTTC 3360 
CTGGAGCTGC GTGGCTCCCG CCACCCCTGC GTCTCCACCA CCTTCCTGGG CGACAGCTTC 3420 ATCCCCAACG ACGTGGTGGT GGGCCTTTCG GATGGAGACG ATGACGGCGG CGCCCGTTGC 3480 GTGCTCGTCA CGGGACCCAA CATGGGCGGC AAGTCGACGC TCATGCGCCA GGCGGGGCTC 3540 ATCGTCATCA TGGCACAGCT GGGCTGCTAC GTGCCGGCCG AGGCGTGCCG CCTGACGCCC 3600 GTGGACCGCG TGTTCACGCG GCTGGGGGCC TCTGACCGCA TCATGTCTGT GTTGCGACGC 3660 GCAGGGGAGA GCACGTTCTT CGTGGAGCTG AGCGAGACGT CCAGCATCCT GCAGCACGCG 3720 ACGCGCCACT CCCTCGTGCT GCTCGACGAG CTCGGATTTT TGTTGCGCCC CCGAGGCCGC 3780 GGCACGGCGA CGTACGACGG GACGGCGATC GCGAGCGCCG TGGTGCACGA GCTCGCCGAG 3840 GGCGTGCGCT GCCGCACGCT CTTCTCCACA CACTACCACT CGCTCGTGGA GGACTTCGCG 3900 GCCAGCCCAC ACGTGCGCCT GGGCCACATG GCGTGCATGG TGGAGCAGGA GAATGACGAC 3960 GACCCCAGCC AAGAGACCAT CACCTTCCTC TACAAGTTCG TTGCCGGCGC GTGCCCCAAG 4020 AGCTACGGCT TCAACGCCGC CCGTCTGGCC GGGCTGCCCG ACGCGGTTGT GCGGGCAGGA 4080 CACCGCAAGG CGCGCGAGTT CGAGAGCAGC ACCAACGCCC TCCGTGCATT CAGG 4135 |
Sequence Source | Ensembl
Orthology |
Created Date | 25-Jun-2016