| Tag |
Content |
| WERAM ID |
WERAM-Mum-0060 |
| Ensembl Protein ID |
ENSMUSP00000025444.6 |
| Uniprot Accession |
Q9CWW7; CXXC1_MOUSE |
| Genbank Protein ID |
NP_083144.1 |
| Protein Name |
CXXC-type zinc finger protein 1 |
| Genbank Nucleotide ID |
NM_028868.3 |
| Gene Name |
CXXC1;Cgbp;Pccx1 |
| Ensembl Information |
|
| Details |
| Type |
Family |
Domain |
Substrates |
AA |
References (PMIDs) |
| Me_Reader |
PHD |
PHD-type |
H3K4me3 |
K |
23201125 |
|
| Status |
Reviewed |
| Classification |
| Type |
Family |
E-value |
Score |
Start |
End |
| Me_Reader |
PHD |
6.70e-12 |
46.9 |
28 |
75 |
|
| Organism |
Mus musculus |
| NCBI Taxa ID |
10090 |
Functional Description (View)Functional Description
Transcriptional activator that exhibits a unique DNA binding specificity for CpG unmethylated motifs with a preference for CpGG. |
Transcriptional activator that exhibits a unique DNA binding specificity for CpG unmethylated motifs with a preference for CpGG.
|
| Domain Profile |
Me_Reader PHD
PHD.txt 3 iClvCgkddegekemvqCdeCddwfHlkCvklplsslpegkswyCpsCke 52 +C+ C+k+d + +m+ Cd+C++wfH++C+ ++++ +++ ++wyC++C+e ENSMUSP00000025444.6 28 YCI-CRKPDINC-FMIGCDNCNEWFHGDCIRITEKMAKAIREWYCRECRE 75 8*9.******95.6*******************666665589******98 PP
|
Protein Sequence (Fasta) | MEGDGSDLEP PDAGDDSKSE NGENAPIYCI CRKPDINCFM IGCDNCNEWF HGDCIRITEK 60 MAKAIREWYC RECREKDPKL EIRYRHKKCR ERDGSERAGS EPRDEGGGRK RPASDPELQR 120 RAGSGTGVGA MLARGSASPH KSSPQPLVAT PSQHHHQQQQ QQQQQIKRSA RMCGECEACR 180 RTEDCGHCDF CRDMKKFGGP NKIRQKCRLR QCQLRARESY KYFPSSLSPV TPSEALPRPR 240 RPPPTQQQPQ QSQKLGRIRE DEGTVLSSVV KEPPEATATP EPLSDEDLAL DPDLYQDFCA 300 GAFDDHGLPW MSDAEESPFL DPALRKRAVK VKHVKRREKK SEKKKEERYK RHRQKQKHKD 360 KWKHPERADA KDPASLPQCL GPGCVRAAQP GSKYCSDDCG MKLAANRIYE ILPQRIQQWQ 420 QSPCIAEEHG KKLLERIRRE QQSARTRLQE MERRFHELEA IILRAKQQAV REDEENNEND 480 SDDTDLQIFC VSCGHPINPR VALRHMERCY AKYESQTSFG SMYPTRIEGA TRLFCDVYNP 540 QSKTYCKRLQ VLCPEHSRDP KVPADEVCGC PLVRDVFELT GDFCRLPKRQ CNRHYCWEKL 600 RRAEVDLERV RVWYKLDELF EQERNVRTAM TNRAGLLALM LHQTIQHDPL TTDLRSSADR 660 Protein Fasta Sequence
>ENSMUSP00000025444.6|CXXC1;Cgbp;Pccx1|Mus musculus MEGDGSDLEPPDAGDDSKSENGENAPIYCICRKPDINCFMIGCDNCNEWFHGDCIRITEKMAKAIREWYCRECREKDPKLEIRYRHKKCRERDGSERAGSEPRDEGGGRKRPASDPELQRRAGSGTGVGAMLARGSASPHKSSPQPLVATPSQHHHQQQQQQQQQIKRSARMCGECEACRRTEDCGHCDFCRDMKKFGGPNKIRQKCRLRQCQLRARESYKYFPSSLSPVTPSEALPRPRRPPPTQQQPQQSQKLGRIREDEGTVLSSVVKEPPEATATPEPLSDEDLALDPDLYQDFCAGAFDDHGLPWMSDAEESPFLDPALRKRAVKVKHVKRREKKSEKKKEERYKRHRQKQKHKDKWKHPERADAKDPASLPQCLGPGCVRAAQPGSKYCSDDCGMKLAANRIYEILPQRIQQWQQSPCIAEEHGKKLLERIRREQQSARTRLQEMERRFHELEAIILRAKQQAVREDEENNENDSDDTDLQIFCVSCGHPINPRVALRHMERCYAKYESQTSFGSMYPTRIEGATRLFCDVYNPQSKTYCKRLQVLCPEHSRDPKVPADEVCGCPLVRDVFELTGDFCRLPKRQCNRHYCWEKLRRAEVDLERVRVWYKLDELFEQERNVRTAMTNRAGLLALMLHQTIQHDPLTTDLRSSADR
|
Nucleotide Sequence (Fasta) | GGACTTGGCT GACAAGGTGG CGGCGCGCTC GGAGGACCTG AACTGTAGGC GCGGCCATGC 60 TTCCCGGAAC GTATGGTCTC CGCTGTCTTA CGGACTCGAT GGTCAAGATG GCGGCGACAG 120 AGAGACCTTC GCGGCTCTAG GTGGGAGACC TATTTGTTTG CCGCTAAAGC CGTCGCGCCG 180 GGCGTGAGGA GTTAAAGAGT GCGGGGCCGC GGAGGCGGGA CCAGTGGCGG AAGTAGTTGC 240 GGGCGCCTTT GCGGGCGCCT TTGCTCCCGC CGCGGACGCT GCCGGGGAGC TCGTTCGGCT 300 TCGCGGGTTG TTGCACGGGG TCCAGAGCGG GCGCTCTGTG AGCGGAGATA TGGAAGGAGA 360 TGGCTCAGAC CTGGAACCTC CGGATGCCGG GGACGACAGC AAGTCTGAGA ATGGGGAGAA 420 CGCTCCCATC TACTGCATCT GTCGCAAACC GGACATCAAT TGCTTCATGA TTGGATGTGA 480 CAACTGCAAC GAGTGGTTCC ATGGAGACTG CATCCGGATC ACTGAGAAGA TGGCCAAGGC 540 CATCCGGGAA TGGTACTGTC GGGAGTGCCG AGAGAAGGAC CCGAAGCTGG AGATTCGTTA 600 CCGCCACAAA AAGTGCCGGG AGAGGGATGG CAGTGAGCGG GCCGGCAGTG AGCCCCGGGA 660 TGAGGGAGGA GGGCGCAAGA GGCCGGCTTC AGATCCAGAG CTGCAGCGCC GGGCAGGGTC 720 AGGGACAGGG GTTGGGGCCA TGCTTGCTCG GGGCTCTGCT TCCCCCCACA AATCTTCTCC 780 ACAGCCCTTG GTGGCCACAC CTAGCCAGCA CCACCACCAA CAGCAGCAGC AGCAGCAGCA 840 GCAAATCAAA CGATCAGCTC GGATGTGTGG TGAGTGCGAG GCCTGCCGAC GCACTGAGGA 900 CTGTGGCCAC TGTGACTTCT GCCGTGACAT GAAGAAGTTT GGGGGCCCCA ACAAGATCCG 960 GCAGAAGTGC CGGCTTCGTC AGTGTCAGCT GCGGGCACGG GAATCGTACA AGTACTTCCC 1020 TTCCTCGCTC TCGCCGGTGA CACCCTCAGA GGCCCTGCCA AGGCCCCGCC GGCCACCACC 1080 CACTCAACAG CAGCCACAGC AGTCCCAGAA GCTGGGGCGT ATTCGTGAAG ATGAGGGGAC 1140 AGTGTTGTCA TCAGTGGTTA AGGAGCCACC AGAGGCTACA GCAACACCTG AGCCACTTTC 1200 AGATGAGGAC CTAGCACTGG ACCCCGATCT GTACCAGGAC TTCTGTGCTG GGGCCTTTGA 1260 TGATCACGGC CTACCCTGGA TGAGCGATGC AGAAGAGTCC CCGTTCCTGG ATCCTGCACT 1320 GAGGAAGCGG GCGGTGAAGG TGAAGCACGT GAAGCGCCGG GAGAAGAAGT CCGAGAAGAA 1380 GAAGGAGGAG AGGTACAAAC GGCATCGACA GAAGCAGAAA CATAAAGACA AATGGAAACA 1440 CCCAGAGAGG GCTGATGCCA AGGACCCTGC ATCTCTCCCG CAGTGCCTGG GGCCTGGCTG 1500 TGTGCGGGCT GCCCAGCCTG GGTCCAAGTA TTGTTCAGAT GACTGTGGCA TGAAGCTGGC 1560 AGCCAACCGA ATCTATGAGA TCCTCCCCCA GCGCATCCAG CAGTGGCAGC AAAGCCCCTG 1620 CATCGCTGAA GAGCACGGCA AGAAGCTACT TGAGCGGATC CGCCGTGAGC AGCAGAGCGC 1680 CCGCACCCGC CTTCAGGAAA TGGAGCGCAG ATTCCACGAG CTTGAGGCCA TCATTCTTCG 1740 CGCTAAGCAG CAAGCTGTGC GAGAGGATGA GGAGAACAAC GAGAACGACA GTGATGACAC 1800 AGATCTGCAG ATCTTCTGCG TCTCCTGCGG GCATCCCATC AACCCACGAG TTGCCTTGCG 1860 TCACATGGAA CGTTGCTATG CTAAGTATGA GAGCCAGACG TCTTTTGGGT CCATGTACCC 1920 CACACGCATC GAGGGGGCTA CCCGACTCTT CTGTGATGTC TACAATCCTC AGAGCAAGAC 1980 ATATTGTAAG CGGCTCCAGG TGCTGTGTCC TGAGCACTCC CGGGACCCCA AAGTGCCAGC 2040 TGATGAGGTC TGTGGGTGTC CACTTGTACG TGATGTCTTT GAGCTCACAG GTGACTTCTG 2100 CCGCCTGCCC AAGCGCCAGT GTAATCGCCA TTATTGCTGG GAGAAGCTTC GGCGTGCAGA 2160 AGTGGACTTG GAGCGCGTGC GGGTGTGGTA CAAGCTGGAC GAGCTCTTTG AGCAGGAACG 2220 TAATGTACGC ACAGCCATGA CCAACAGAGC AGGGCTACTT GCCCTGATGC TTCACCAAAC 2280 GATCCAACAC GATCCACTCA CTACCGACCT TCGCTCTAGT GCCGACCGCT GACTTCACCA 2340 GCCCAGAGCC TCTCGGCCCT GCATTCAGTT AGGGGAGAGG CCCTGCATTG CCAGCTGTCC 2400 GTTCCTCCAT TCATCTGTTT CTTCTGCTGC CCTTAGTTTG ATCCTCACCA TCCACGTTTT 2460 GGCTTCCGTT TGCCCTTATC AAAGGGTCTC TGCTTTCCCT TTGTCACCGT CTGTCTCCTC 2520 TTTCCATAGT CAGGGCTGGG GTGAGACTCA AACTTACTCA TCCTTGCCTA TACCCACCCC 2580 CCAAAAACAG GTTTTATTAA TAAAAAATGT GAAGAACC
2619Nucleotide Fasta Sequence
>ENSMUSP00000025444.6|PHD|Mus musculus GGACTTGGCTGACAAGGTGGCGGCGCGCTCGGAGGACCTGAACTGTAGGCGCGGCCATGCTTCCCGGAACGTATGGTCTCCGCTGTCTTACGGACTCGATGGTCAAGATGGCGGCGACAGAGAGACCTTCGCGGCTCTAGGTGGGAGACCTATTTGTTTGCCGCTAAAGCCGTCGCGCCGGGCGTGAGGAGTTAAAGAGTGCGGGGCCGCGGAGGCGGGACCAGTGGCGGAAGTAGTTGCGGGCGCCTTTGCGGGCGCCTTTGCTCCCGCCGCGGACGCTGCCGGGGAGCTCGTTCGGCTTCGCGGGTTGTTGCACGGGGTCCAGAGCGGGCGCTCTGTGAGCGGAGATATGGAAGGAGATGGCTCAGACCTGGAACCTCCGGATGCCGGGGACGACAGCAAGTCTGAGAATGGGGAGAACGCTCCCATCTACTGCATCTGTCGCAAACCGGACATCAATTGCTTCATGATTGGATGTGACAACTGCAACGAGTGGTTCCATGGAGACTGCATCCGGATCACTGAGAAGATGGCCAAGGCCATCCGGGAATGGTACTGTCGGGAGTGCCGAGAGAAGGACCCGAAGCTGGAGATTCGTTACCGCCACAAAAAGTGCCGGGAGAGGGATGGCAGTGAGCGGGCCGGCAGTGAGCCCCGGGATGAGGGAGGAGGGCGCAAGAGGCCGGCTTCAGATCCAGAGCTGCAGCGCCGGGCAGGGTCAGGGACAGGGGTTGGGGCCATGCTTGCTCGGGGCTCTGCTTCCCCCCACAAATCTTCTCCACAGCCCTTGGTGGCCACACCTAGCCAGCACCACCACCAACAGCAGCAGCAGCAGCAGCAGCAAATCAAACGATCAGCTCGGATGTGTGGTGAGTGCGAGGCCTGCCGACGCACTGAGGACTGTGGCCACTGTGACTTCTGCCGTGACATGAAGAAGTTTGGGGGCCCCAACAAGATCCGGCAGAAGTGCCGGCTTCGTCAGTGTCAGCTGCGGGCACGGGAATCGTACAAGTACTTCCCTTCCTCGCTCTCGCCGGTGACACCCTCAGAGGCCCTGCCAAGGCCCCGCCGGCCACCACCCACTCAACAGCAGCCACAGCAGTCCCAGAAGCTGGGGCGTATTCGTGAAGATGAGGGGACAGTGTTGTCATCAGTGGTTAAGGAGCCACCAGAGGCTACAGCAACACCTGAGCCACTTTCAGATGAGGACCTAGCACTGGACCCCGATCTGTACCAGGACTTCTGTGCTGGGGCCTTTGATGATCACGGCCTACCCTGGATGAGCGATGCAGAAGAGTCCCCGTTCCTGGATCCTGCACTGAGGAAGCGGGCGGTGAAGGTGAAGCACGTGAAGCGCCGGGAGAAGAAGTCCGAGAAGAAGAAGGAGGAGAGGTACAAACGGCATCGACAGAAGCAGAAACATAAAGACAAATGGAAACACCCAGAGAGGGCTGATGCCAAGGACCCTGCATCTCTCCCGCAGTGCCTGGGGCCTGGCTGTGTGCGGGCTGCCCAGCCTGGGTCCAAGTATTGTTCAGATGACTGTGGCATGAAGCTGGCAGCCAACCGAATCTATGAGATCCTCCCCCAGCGCATCCAGCAGTGGCAGCAAAGCCCCTGCATCGCTGAAGAGCACGGCAAGAAGCTACTTGAGCGGATCCGCCGTGAGCAGCAGAGCGCCCGCACCCGCCTTCAGGAAATGGAGCGCAGATTCCACGAGCTTGAGGCCATCATTCTTCGCGCTAAGCAGCAAGCTGTGCGAGAGGATGAGGAGAACAACGAGAACGACAGTGATGACACAGATCTGCAGATCTTCTGCGTCTCCTGCGGGCATCCCATCAACCCACGAGTTGCCTTGCGTCACATGGAACGTTGCTATGCTAAGTATGAGAGCCAGACGTCTTTTGGGTCCATGTACCCCACACGCATCGAGGGGGCTACCCGACTCTTCTGTGATGTCTACAATCCTCAGAGCAAGACATATTGTAAGCGGCTCCAGGTGCTGTGTCCTGAGCACTCCCGGGACCCCAAAGTGCCAGCTGATGAGGTCTGTGGGTGTCCACTTGTACGTGATGTCTTTGAGCTCACAGGTGACTTCTGCCGCCTGCCCAAGCGCCAGTGTAATCGCCATTATTGCTGGGAGAAGCTTCGGCGTGCAGAAGTGGACTTGGAGCGCGTGCGGGTGTGGTACAAGCTGGACGAGCTCTTTGAGCAGGAACGTAATGTACGCACAGCCATGACCAACAGAGCAGGGCTACTTGCCCTGATGCTTCACCAAACGATCCAACACGATCCACTCACTACCGACCTTCGCTCTAGTGCCGACCGCTGACTTCACCAGCCCAGAGCCTCTCGGCCCTGCATTCAGTTAGGGGAGAGGCCCTGCATTGCCAGCTGTCCGTTCCTCCATTCATCTGTTTCTTCTGCTGCCCTTAGTTTGATCCTCACCATCCACGTTTTGGCTTCCGTTTGCCCTTATCAAAGGGTCTCTGCTTTCCCTTTGTCACCGTCTGTCTCCTCTTTCCATAGTCAGGGCTGGGGTGAGACTCAAACTTACTCATCCTTGCCTATACCCACCCCCCAAAAACAGGTTTTATTAATAAAAAATGTGAAGAACC
|
| Sequence Source |
Ensembl |
| Keyword |
KW-0007--Acetylation KW-0010--Activator KW-0175--Coiled coil KW-0181--Complete proteome KW-0238--DNA-binding KW-1017--Isopeptide bond KW-0479--Metal-binding KW-0539--Nucleus KW-0597--Phosphoprotein KW-1185--Reference proteome KW-0804--Transcription KW-0805--Transcription regulation KW-0832--Ubl conjugation KW-0862--Zinc KW-0863--Zinc-finger --
|
| Interpro |
IPR022056--CpG-bd_C IPR019786--Zinc_finger_PHD-type_CS IPR002857--Znf_CXXC IPR011011--Znf_FYVE_PHD IPR001965--Znf_PHD IPR019787--Znf_PHD-finger IPR013083--Znf_RING/FYVE/PHD
|
| PROSITE |
PS51058--ZF_CXXC PS01359--ZF_PHD_1 PS50016--ZF_PHD_2
|
| Pfam |
PF00628--PHD PF12269--zf-CpG_bind_C PF02008--zf-CXXC
|
| Gene Ontology |
GO:0005737--C:cytoplasm GO:0035097--C:histone methyltransferase complex GO:0016363--C:nuclear matrix GO:0016607--C:nuclear speck GO:0005634--C:nucleus GO:0048188--C:Set1C/COMPASS complex GO:0000987--F:core promoter proximal region sequence-specific DNA binding GO:0042800--F:histone methyltransferase activity (H3-K4 specific) GO:0045322--F:unmethylated CpG binding GO:0008270--F:zinc ion binding GO:0051568--P:histone H3-K4 methylation GO:0045893--P:positive regulation of transcription, DNA-templated GO:0006355--P:regulation of transcription, DNA-templated GO:0006351--P:transcription, DNA-templated
|
| Orthology |
|
| Created Date |
25-Jun-2016 |