WERAM Information


Tag Content
WERAM ID WERAM-Tub-0037
Ensembl Protein ID ENSTBEP00000004417.1
Gene Name ZNF541
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSTBEG00000005105.1 ENSTBET00000005127.1 ENSTBEP00000004417.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HDAC HD2 0.071 37
Organism Tupaia belangeri
Domain Profile
  HDAC HD2

Query: 258 PHPKEE---NVAGGCNQQNGGPADWPEPRGTFVCKNCSQMFYTEKGLSSH 304
PHP ++ N GG + P+ G F CK+C++ F +E GL SH
Sbjct: 236 PHPSKQAGKNSGGGSTGETSKQQQTPKSAGAFGCKSCTRTFTSEMGLQSH 285

Protein Sequence
(Fasta)
MQVFQMITKS QRIFSHAQVA AASSQLPGPE GKQAAVKPPQ GPWPLQPPPP ASAADSLHTG 60
PGNLEPEGSP ARRRKNLPVA PRETSPGSTK RDSKGGPKAA TALPPLPAAA LDPPGNSDLS 120
SLAKQLRSSK GTLDLGDIFP TAGSRQLGGN ELAPGAQLSG KQAQSENGST SGATKGEKSL 180
ACSRGAGYRL FSGSSRAQRF SGFRKEKVKM DLCCAASPSQ VAMASFSSAG PPADAPRDSK 240
SKLTIFNRIQ GGNIYRLPHP KEENVAGGCN QQNGGPADWP EPRGTFVCKN CSQMFYTEKG 300
LSSHMCFHSE QWPSPRGKQE QQMFGMEFCK PPRQVLRPEG DGSSPGAKKP LDSSATAPLV 360
PPMSVPMAPE IRPPGSLAEG QEKDGEERDG RENSQHRKRK KRPPPSTSGE PGAGGCHQSC 420
LRSPVFLVDR LLKGLLQCSR YTPFPMLTFF RECSGLYFIS LCYRSNXXXX XXXXXXXXXH 480
VGSFDICVVD DISIEPXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 540
XXXXTELCNV ACSSVMPGGC TNLELALHCL HDARGDVQVA LETLLLGGPR KPRAHPLADY 600
RYTGSDVWTP MEKRLFKKAF CTHKKDFYLI HKTIQTETVA QCVEYYYIWK KMIKFDCGRA 660
SGLEKRVKRE PDEVERTEKK IPCSPRERPS HRPTPELKIK TKSYRRESIL NSSPSAGPKR 720
TPEAPGSVES QGVFPCRECE RVFDKIKSRN AHMKRHRLQD HVEPLVRVKW SVKPFQIKEE 780
DEEEEELGAD ISPLQW 796
Nucleotide Sequence
(Fasta)
ATGCAGGTGT TCCAGATGAT CACCAAGTCC CAAAGGATCT TCTCCCACGC CCAGGTGGCG 60
GCAGCCTCCT CCCAGCTCCC GGGGCCCGAG GGCAAGCAGG CTGCTGTGAA GCCACCGCAG 120
GGGCCATGGC CGCTGCAGCC CCCACCACCG GCGTCCGCTG CTGACTCTCT TCATACCGGC 180
CCTGGAAACC TGGAGCCAGA GGGATCCCCA GCCCGCAGGA GGAAAAACCT GCCAGTGGCT 240
CCCAGAGAGA CGTCCCCGGG CAGCACGAAG CGAGACTCAA AGGGAGGTCC AAAAGCGGCC 300
ACTGCTCTGC CTCCCCTCCC AGCGGCAGCC CTGGACCCTC CCGGGAACTC AGACCTCTCT 360
TCCCTGGCCA AGCAGTTGCG ATCATCGAAA GGGACCTTGG ACCTGGGGGA CATATTCCCT 420
ACTGCAGGCT CTCGGCAGCT GGGGGGCAAT GAGCTGGCGC CAGGAGCCCA GCTGTCCGGG 480
AAGCAGGCCC AGTCCGAAAA TGGCTCAACT TCTGGGGCCA CAAAAGGTGA AAAGAGCCTA 540
GCCTGCTCCC GGGGTGCAGG CTACAGGCTG TTCTCGGGCA GCTCCAGGGC CCAGCGGTTC 600
TCAGGCTTCC GGAAGGAGAA AGTGAAGATG GACTTGTGCT GCGCAGCGTC GCCCAGCCAA 660
GTAGCCATGG CCTCCTTCTC ATCCGCCGGG CCGCCAGCCG ACGCCCCCCG GGACTCGAAG 720
TCCAAGCTGA CAATATTCAA CAGGATCCAG GGTGGAAACA TCTACAGGCT CCCCCATCCG 780
AAGGAGGAGA ACGTGGCAGG CGGATGTAAC CAGCAAAATG GGGGCCCCGC AGACTGGCCA 840
GAGCCCAGGG GCACTTTCGT GTGCAAGAAC TGCAGCCAGA TGTTCTACAC CGAGAAGGGG 900
CTGAGCAGCC ACATGTGTTT TCACAGCGAG CAGTGGCCGT CGCCTCGAGG CAAGCAGGAG 960
CAGCAGATGT TTGGCATGGA GTTTTGCAAG CCACCGAGGC AGGTGCTGAG GCCAGAGGGG 1020
GACGGAAGTT CCCCAGGAGC CAAGAAGCCC TTGGACAGCT CAGCCACAGC CCCTTTAGTT 1080
CCCCCCATGT CAGTCCCCAT GGCTCCTGAA ATCCGACCCC CAGGGAGTCT GGCTGAGGGA 1140
CAAGAGAAAG ACGGGGAGGA GAGAGATGGC AGGGAGAACA GCCAACACAG GAAGCGGAAG 1200
AAGCGCCCCC CCCCCTCCAC GTCTGGGGAG CCGGGTGCCG GAGGGTGCCA CCAGAGCTGC 1260
CTGCGGTCTC CAGTGTTCCT GGTGGACCGC CTCCTGAAGG GCCTGCTTCA GTGCTCCCGC 1320
TACACGCCAT TCCCCATGCT CACCTTCTTC CGGGAATGCT CGGGGCTGTA CTTCATCTCG 1380
CTCTGTTACA GATCCAATCN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNGTCAC 1440
GTGGGCTCCT TTGACATCTG CGTGGTGGAT GACATCAGCA TTGAACCNNN NNNNNNNNNN 1500
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1560
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 1620
NNNNNNNNNN TGACAGAACT CTGCAACGTG GCGTGCTCTA GCGTGATGCC GGGCGGGTGC 1680
ACCAACCTGG AGCTGGCACT GCACTGCCTG CACGACGCCC GGGGAGATGT GCAGGTGGCC 1740
TTGGAGACCC TCTTACTCGG AGGACCCCGG AAGCCTCGGG CTCACCCGCT CGCTGACTAC 1800
CGCTACACAG GATCAGACGT CTGGACACCC ATGGAGAAGA GGCTCTTTAA GAAGGCGTTC 1860
TGTACCCACA AGAAGGACTT TTATCTGATA CACAAGACGA TCCAGACGGA GACCGTAGCC 1920
CAGTGCGTTG AGTATTACTA CATCTGGAAA AAAATGATCA AGTTTGACTG CGGCCGAGCC 1980
TCAGGGCTAG AAAAAAGGGT TAAGAGAGAG CCTGATGAGG TAGAAAGGAC AGAGAAAAAG 2040
ATCCCTTGCA GCCCTCGAGA GAGACCCAGC CACCGTCCAA CTCCCGAGCT AAAGATAAAG 2100
ACCAAGAGTT ACAGGAGGGA GTCCATTCTC AACTCCAGCC CCAGTGCAGG CCCCAAGCGG 2160
ACCCCAGAGG CGCCAGGGAG TGTGGAGAGT CAGGGCGTGT TCCCCTGCAG AGAGTGTGAG 2220
AGGGTGTTTG ACAAGATCAA GAGTCGAAAC GCCCACATGA AGCGGCACCG CCTTCAGGAC 2280
CACGTGGAGC CTCTGGTCAG GGTGAAGTGG TCCGTGAAGC CCTTCCAGAT TAAGGAGGAG 2340
GACGAGGAGG AGGAGGAGCT GGGCGCAGAC ATCAGCCCCC TGCAGTGGTG A 2392
Sequence Source Ensembl
Orthology
Created Date 25-Jun-2016