WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Hos-0063 | ||||||||||||
Ensembl Protein ID | ENSP00000251900.4 | ||||||||||||
Uniprot Accession | Q9UQR0; SCML2_HUMAN; Q5JXE6; Q86U98; Q8IWD0; Q8NDP2; Q9UGC5 | ||||||||||||
Genbank Protein ID | NP_006080.1 | ||||||||||||
Protein Name | Sex comb on midleg-like protein 2 | ||||||||||||
Genbank Nucleotide ID | NM_006089.2 | ||||||||||||
Gene Name | SCML2 | ||||||||||||
Ensembl Information |
|
||||||||||||
Details |
|
||||||||||||
Status | Reviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Homo sapiens | ||||||||||||
NCBI Taxa ID | 9606 | ||||||||||||
Functional Description (View) |
Putative Polycomb group (PcG) protein. PcG proteins act by forming multiprotein complexes, which are required to maintain the transcriptionally repressive state of homeotic genes throughout development (By similarity). | ||||||||||||
Domain Profile | Me_Reader MBT MBT-2.txt 2 dwkeylrktldgakvapkslseklseskallsfkvgmrlEvvdkknsseirvatveeviggrlkvsfeesededddyWidesspfifpvGwa 93 |
||||||||||||
Protein Sequence (Fasta) | MGQTVNEDSM DVKKENQEKT PQSSTSSVQR DDFHWEEYLK ETGSISAPSE CFRQSQIPPV 60 NDFKVGMKLE ARDPRNATSV CIATVIGITG ARLRLRLDGS DNRNDFWRLV DSPDIQPVGT 120 CEKEGDLLQP PLGYQMNTSS WPMFLLKTLN GSEMASATLF KKEPPKPPLN NFKVGMKLEA 180 IDKKNPYLIC PATIGDVKGD EVHITFDGWS GAFDYWCKYD SRDIFPAGWC RLTGDVLQPP 240 GTSVPIVKNI AKTESSPSEA SQHSMQSPQK TTLILPTQQV RRSSRIKPPG PTAVPKRSSS 300 VKNITPRKKG PNSGKKEKPL PVICSTSAAS LKSLTRDRGM LYKDVASGPC KIVMSTVCVY 360 VNKHGNFGPH LDPKRIQQLP DHFGPGPVNV VLRRIVQACV DCALETKTVF GYLKPDNRGG 420 EVITASFDGE THSIQLPPVN SASFALRFLE NFCHSLQCDN LLSSQPFSSS RGHTHSSAEH 480 DKNQSAKEDV TERQSTKRSP QQTVPYVVPL SPKLPKTKEY ASEGEPLFAG GSAIPKEENL 540 SEDSKSSSLN SGNYLNPACR NPMYIHTSVS QDFSRSVPGT TSSPLVGDIS PKSSPHEVKF 600 QMQRKSEAPS YIAVPDPSVL KQGFSKDPST WSVDEVIQFM KHTDPQISGP LADLFRQHEI 660 DGKALFLLKS DVMMKYMGLK LGPALKLCYY IEKLKEGKYS |
||||||||||||
Nucleotide Sequence (Fasta) | GGCGGCAGTG GCGGTTGCGG GGCATGCGCG GCTCCGCGCG CGGCTTCTCA AACATGGCGG 60 CGGCGGTGTG AAGCTCGGTG CCGGCTCGCG CGATCGGTGG GACAGAATTT CGTTGTTTTC 120 ACCGACGAGA CTGGAGGAAA CAACACCAAA TAGGGATACC ATGGGACAAA CAGTGAATGA 180 AGATTCCATG GATGTCAAGA AGGAGAATCA AGAGAAAACT CCTCAGTCAA GTACATCTTC 240 TGTACAAAGG GATGATTTCC ACTGGGAGGA GTATTTGAAA GAGACTGGGT CTATAAGTGC 300 TCCTTCAGAG TGCTTCCGTC AGTCTCAGAT TCCACCTGTG AATGATTTCA AAGTTGGTAT 360 GAAATTGGAA GCCCGTGACC CTCGCAATGC CACTTCAGTA TGTATTGCTA CGGTTATTGG 420 AATTACTGGG GCCAGGTTAC GGTTACGACT GGATGGTAGT GACAACAGAA ATGATTTTTG 480 GAGGCTTGTC GATTCCCCAG ACATACAACC TGTTGGGACA TGTGAAAAGG AAGGAGACTT 540 ACTTCAACCT CCACTAGGGT ACCAGATGAA TACATCCTCC TGGCCGATGT TCCTCTTAAA 600 GACACTAAAT GGGTCTGAAA TGGCATCTGC CACATTATTT AAGAAGGAAC CACCAAAGCC 660 CCCACTAAAT AATTTTAAAG TGGGGATGAA ACTGGAAGCT ATTGACAAAA AGAACCCGTA 720 TCTCATCTGT CCTGCGACCA TTGGAGATGT TAAAGGGGAT GAAGTTCATA TCACATTTGA 780 TGGCTGGAGT GGAGCTTTTG ATTACTGGTG CAAGTATGAT TCTCGAGATA TTTTCCCAGC 840 TGGGTGGTGT CGCCTGACAG GAGATGTATT ACAACCCCCA GGAACTAGTG TTCCTATTGT 900 AAAGAATATA GCAAAAACAG AGTCTTCTCC TTCCGAAGCA AGCCAGCATT CAATGCAGTC 960 TCCACAGAAA ACTACTCTAA TATTACCAAC ACAGCAGGTC AGGAGATCAA GTCGAATTAA 1020 ACCACCTGGA CCTACTGCAG TCCCCAAAAG GAGCAGTTCT GTTAAAAATA TCACACCAAG 1080 GAAAAAAGGT CCAAACTCAG GAAAAAAGGA AAAACCTTTG CCCGTGATAT GTTCTACATC 1140 TGCAGCTTCT CTAAAATCGC TGACCAGAGA CCGTGGCATG TTATATAAAG ATGTCGCTTC 1200 TGGGCCATGT AAAATAGTGA TGTCTACAGT CTGTGTCTAT GTAAACAAAC ATGGAAACTT 1260 TGGCCCTCAT CTGGATCCCA AGAGAATCCA GCAGCTGCCT GACCACTTCG GCCCGGGCCC 1320 GGTGAATGTG GTGCTTCGCC GGATTGTGCA GGCCTGTGTG GATTGTGCCC TTGAAACTAA 1380 AACTGTTTTT GGATACCTGA AGCCAGATAA TCGTGGAGGA GAAGTGATAA CTGCCTCCTT 1440 TGATGGGGAA ACTCATTCCA TCCAGCTCCC TCCAGTGAAC AGTGCATCAT TTGCTCTTCG 1500 CTTTCTTGAG AACTTCTGCC ACAGTCTGCA GTGTGATAAC CTTTTGAGTA GCCAGCCTTT 1560 TAGTTCTTCC AGGGGTCATA CTCACAGCTC TGCAGAGCAT GATAAAAATC AGTCAGCAAA 1620 AGAAGATGTA ACAGAAAGGC AAAGCACCAA ACGATCTCCT CAGCAAACTG TACCATATGT 1680 TGTTCCTCTC TCTCCTAAGC TCCCCAAAAC AAAGGAGTAT GCGTCTGAAG GAGAACCATT 1740 GTTTGCTGGG GGAAGTGCCA TTCCCAAAGA GGAGAATCTT TCAGAAGATT CTAAGAGCTC 1800 ATCACTAAAT TCAGGAAATT ATTTGAATCC TGCCTGTAGA AATCCTATGT ATATTCATAC 1860 TTCAGTCTCC CAGGATTTTT CTCGAAGTGT GCCAGGCACC ACAAGTTCAC CACTAGTTGG 1920 GGACATATCC CCCAAGAGCA GTCCCCATGA AGTTAAATTC CAAATGCAGA GGAAAAGTGA 1980 AGCTCCAAGT TATATAGCTG TACCTGATCC CAGTGTCCTG AAACAAGGCT TCTCTAAGGA 2040 CCCTTCAACC TGGTCTGTGG ATGAAGTGAT ACAGTTTATG AAACATACAG ATCCTCAGAT 2100 ATCAGGCCCC CTCGCCGACC TCTTCAGGCA ACATGAAATT GATGGGAAGG CTCTGTTCCT 2160 ACTCAAGAGT GATGTGATGA TGAAGTATAT GGGGCTGAAG CTGGGGCCAG CATTAAAGCT 2220 GTGTTACTAC ATTGAAAAGC TTAAAGAAGG AAAATACAGT TAAAAAAATG TGTAAGTTTA 2280 GATTGGACAT AATTCTCAGG TGTACTGTTA ACATTTTAAT TTAAAAGTAT TTCTCTTAGC 2340 AGTTTTTGTT TTGTAGACAG TTCCCATAAA AATATTTTAT CAGAATTGCA GAACTGTAGT 2400 AACAGTTCAG TCAACTTTGT TTTTTTCCTG GAGTCACCAA CCAGCTTTGG GAGACACAGC 2460 CGCCACTCCC CCAGTCTACT TCTTTAAAAA GCATTTAACA GGTTAGTATT GGCATATTCA 2520 AATTGGCAGT TCTTTATGTC TTTTTAAATT TTCATTGTAC AGTTTACAAA TATACTTAAT 2580 GTAGTTAACA GAGAAAAACC TTTGATTTTG GTTAACCTTT ATATCTAGAA CCAAAACAGC 2640 TAAATCCCAA AGGGGAAAAT ATCAGGGATT GACAACTTCT ATAATTAAAT CCATGAGAAT 2700 TTTTCCTCAC TAGAGAATTT AAAGGTGCAC CTGTAGATAT CATCTTTTCT CAGATATTTT 2760 GTGTGATACT CTGTGGTGTT CTGTTCATGT TCTATCAGTA TATCTAGAAA GGGAATAGCC 2820 ATATAAATTA TTTTCCTTTT ATTATTTCTC TGTATGTTGT ATTTGATCAT ATTTAAAGGA 2880 AAAAAGCAAG CTTATAAGCT TTCATGAAGT GTTCTTACCA GTTTTTGATA AATTTTTTAA 2940 ATTATAGGAT AGAATTGTCA TTTTATGCAG GAGATATTTA TACTACGAGG GTTGTTTGGA 3000 TGGAGTCAGA TTAAATTTTT TCAGTGAAAT TCCTATTATT TTAAAACTTT CCATATTTTC 3060 ACTACGCTTG GACATTTAAC TAGGCATTCT TTTCTTACAT CTCTATATGA AGATACCTGT 3120 GTCCAAAATT TTTGAAGATA TATATTGTAT GTGTTTATTC TTCATATGGT ACTTTACCAT 3180 ATTTATATAT TGTTTTATAC CTGTAGGTTT ACACACAAGT AAATCTTTTT TTTCTTGAAT 3240 TTAATCTGGC ACTTTGCACT GCCACAGAGG TGACGATGAA CTATGTATAT AGTTAGATGT 3300 TTTGATTTCG TAAAAAATAT ATGTCCATCG TTTGCTATCA CCAGTACCTC TCAGCTTACT 3360 CTTCAGGGGA TATGAAACAA TCTGTAGATT GGTTTCCATA CAGGGAAGTT CTCTGTCCTA 3420 TGCAATGTTT CTAATTAATT TGCTTAGTTC TGAGCCATTT ATTCTGCTAC ACTTTGAAAG 3480 ATATATTAGT TCTGACTTAT TGTTTGGGGC TTTATTTTAT TTTTATTTTT TTGAGATGGA 3540 GTTTCACTCT TGTTGCCCAG GCTGGAGTGC AATAGCTCGA TCTCGGCTCA CTGCAACCTC 3600 GCCTCCCTGG TTCAAGCAAT TCTCCTGCCT CAGCGATTAC AGGAATGCAC CACCACGCCC 3660 AGCTAATTTT GTAGAGATGG GGTTTCTCCA TGTTGGTCAG GCTGGTCTCA AACTCCTGAC 3720 CTCAGGTGAT CCGCCCACCT CGGCCTTGCA AAATGCTGGG ATTACAGGCA TGAGCCACCA 3780 TGCCCAGCCA TGTTTGGGGC TTTATTTTAT AAGTTAGAAC TTTGAAGAGG AAATGGTGCT 3840 ATATGTTTAT TGTTATTACT TTGTGTAACT TTGATGTAAT GTTTATAAGC TATGAGAATC 3900 AGTTATAAAA GTATTAGCCA TTTGTTGTAA ATGCCAATAA AATATTCACC AGGGGCAAGA 3960 ATGTACATTT TCTTTTTAGA AAACCAAATG TACTTTAGAC ATGAATGCAA CTATTTAAAG 4020 AATAGCTTCA TTTATGTTAT TCCTTACATG TCATAAGATT CTTACTTAAA CTTGGTCTTC 4080 TTTCAAGTTG TTTGTATGAA GATGCTGTAC CCACTTGAAC AGTCCTCAGG TGTTTACATA 4140 AATACTATGT TTTACAGTTT TCATATTTTA AAATATTAAT AAAGTTAATC GCAACGATTC 4200 4201 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Keyword | KW-0002--3D-structure |
||||||||||||
Interpro | IPR021987--DUF3588 |
||||||||||||
PROSITE | PS51079--MBT |
||||||||||||
Pfam | PF12140--DUF3588 |
||||||||||||
Gene Ontology | GO:0005634--C:nucleus |
||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |