WERAM Information


Tag Content
WERAM ID WERAM-Pes-0035
Ensembl Protein ID ENSPSIP00000005673.1
Gene Name KIAA2026
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSPSIG00000005276.1 ENSPSIT00000005706.1 ENSPSIP00000005673.1
Status Unreviewed
Classification
Type Family E-value Score Start End
Ac_Reader Bromodomain 7.90e-06 25.9 101 811
Organism Pelodiscus sinensis
Domain Profile
  Ac_Reader Bromodomain

             BROMO.txt  30 mdLstikerleegnYsspeefvkDvrlifnNakaynenk 68 
m+L +++e+ +g+Y +++fv+D+rl+++ ++ +n+ +
ENSPSIP00000005673.1 101 MWLLKMEEKFANGQYVGITDFVADFRLMLETCYRLNGVD 139
*********************************999866 PP
BROMO.txt 5 vsepFrepvdpqleipdYydiikeP 29
+ ++F++ + + +dY d ik+P
ENSPSIP00000005673.1 789 LMKAFQRNRSR--LKKDYDDFIKQP 811
56778444444..569********9 PP

Protein Sequence
(Fasta)
MESPAPGESS EGPAPVGGLP EEEQQQENER DSSCRWRQNL SPELQQGYRI LCEFLLEKHR 60
PLTAPFLKPL GDHLIVPEEG GTTSPADRTG SDSLPQQADG MWLLKMEEKF ANGQYVGITD 120
FVADFRLMLE TCYRLNGVDH WLSKQAQKLE MMLEQKLALL SWHLREKTTI AVTSKGCYGL 180
EDEKGTVCTS TRRRSTARIL AGLTTGVFES VMVQILRQEE HQRAKEEKRL REQERKEAEE 240
VCQKEIEEWE KSLLAQAAPT RMETMWEIPA IGHFLCLAQQ ILNLPEIVFY ELERCLLMPQ 300
CNVFLSKIMT SLLSPPHRRA TLHRRPTLPY RTWEAALRQK VQQWYTVVGQ TDNPDGCAEK 360
LGLCPQFFKV LGEVNPLEGK PFHELPFYQK VWLLKGLCDF VYETQKDVQD AVLGQPIHEC 420
REVILGYDFL ENAYVHFPQF CGADVRIYKQ KPFQAPAFPA PPIKVKKVPR IKLEKVKCEY 480
LSKSNGEVRF GDREELLPHC KMEAGKSMDS LICCPAKIHL DSFSLTTEKE MKPNCEIKMH 540
RTCDIKKPGC SKENQEMPIS PGEIDGFGEP LSPGEIRVVE NGERDHEASL IETEPKPLKT 600
NTLKTCQVHV NGTHSNNPDI NCHRVARELI LENSLWNNKK LKLTKMRAKK KKKKKKKLKD 660
ILNEHLQRKR EIHPHPFKTY KPEIHNKFFI IKKKAKHKKH KSGKKSISKK AITKKRKAVT 720
KPTVPDFQLI CTNLDELREL ITKIENELKD LENIRKKSGR WYHRKQAVKE LHSTLIRLLN 780
ELLPWEPKLM KAFQRNRSRL KKDYDDFIKQ PDHDKFTREL WSNEESEVDN GKEPFSAVTS 840
KSSESAEHLE ILQKDHSEFD EMKLLEMDFS AGRSRLLKKD LTSKEIQKIL PKSIKRQSKQ 900
NSYLDDSTKE LSPRKKVKLS TNDAIVQSTE IEVQTNSCLN ELKQSELPPP ESFALTDSAT 960
SISNFLKGTN PIQALLAKNI GNKVTLTNQM TPPVGMDIIT SEKAVMSPVE SSPLRPAMPY 1020
QTNSKSPLQM VYKMPSGHCV PIDVHNSSVK IQMQPVIDPK TGEKVMQQVL MLPKNFLGQH 1080
KEGKGVTKLH CPSFSQTTDI NDSLSSVLAN PTVNATTQMS CTVFNKSITH LSQVTSPVSK 1140
PQPLSSVTST SNLLTSAVKM SQSETGQMAT AVSSAIFSVP LPSSTVSTLD QHLASTTLSE 1200
FTNTSCSIHS LAPQQTSSSC ESKQELKTVC IRESQSILVT TRGGNTGIVK VQTNPGQISP 1260
NALSPNSVFT YAPPLQAFLV PKSPTLSTST FPSAATVTSS LPLFGASSTS SSVAIPAGLN 1320
QLIGKNLKFP VGQLPTSGSL DRVIAKTQQV SSPTFLSSIS STSTLVSATP NSTVNVINIS 1380
SGSTEQSNAD ITHIPSIQQQ VDITTKKHPV MQPEIASATN GDMPHLKNPV KKFMLVTNPP 1440
ILSPCGATGV NIIPTPTSTG FSAQKLVFIN TPIPSSPSNT SLITESLKKT LPSSIGKTYV 1500
NAPEQPQLLL IPSTMGTPMK VSSSTTVSQV KDVKIGLNIG QAIINTTGNA PHVPPINILQ 1560
SAAPKGTDAI SNKGFILPSS TSGSLVPGCS SFVSQNITSV NESIGVATKA THDFTVTTAN 1620
ACLDSVSVTA SGTSAGTRPT VLVSGNDTSR IRPLLTNPLC ASNIGNTMAI STVKTGHLAS 1680
SVLISTTQPR LSSQSISSGF QFPVTVTLPG PIAASTKVIQ TVPCLTAVPA AAQCLPLPKG 1740
HPPTLVQFQS PGLSATVPSN ANVHKPQNVL PAASPNAGKK ISFSSFASLP CQQKPTNLSK 1800
LAQAYSCAPG ASVPQSTTAS TTVTSQAVAQ LNETCFQQKI VINTCTPLAP GTHIMISGTR 1860
FIVPPQGLGA GSHVLLISAN PKHGPPLVIN NGQGAQGVPF VNHIPQSIML APSNSLSWQT 1920
SKHPLKSSTK IVNSLGTANT LPIVHATPQI INTAAKSCAP PSTTTQFVSS VIKPPVNVLA 1980
KTSLFSAINA GNSQLPSSTS VLHLDTSIKK LLVSPEGAIL NAINTPASKA VSSLSSPLSP 2040
IAVSKSINPT SVFPAFQSSG LSKSDKAAS 2069
Nucleotide Sequence
(Fasta)
NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNC GCGCCCACCC 60
GCCGGCAGCT CGAGTGACTG GCCTGAGAGC TGATGCTAGC CGCGGAGCCC GGACACAGCG 120
CGAACCTGGA GTAGCCCAGG CGGGACATGG AGTCTCCGGC CCCCGGAGAG AGTTCAGAGG 180
GTCCAGCCCC AGTCGGCGGC TTGCCAGAAG AAGAGCAGCA GCAGGAGAAC GAGAGGGACA 240
GCAGCTGCCG CTGGAGGCAG AACCTGAGCC CGGAGCTCCA GCAGGGTTAC CGGATTCTCT 300
GCGAGTTTCT GCTGGAGAAG CACCGGCCCC TGACAGCTCC TTTTCTGAAG CCCCTGGGAG 360
ACCATTTAAT AGTGCCGGAA GAAGGCGGAA CTACCTCCCC AGCAGATCGC ACCGGCAGCG 420
ATTCCCTTCC ACAGCAAGCG GACGGAATGT GGCTGCTGAA GATGGAAGAG AAATTTGCCA 480
ATGGACAATA CGTTGGGATC ACGGATTTCG TGGCTGATTT TCGGCTAATG CTTGAGACGT 540
GTTACCGGCT GAACGGGGTG GATCACTGGC TCTCCAAACA GGCCCAGAAG CTGGAGATGA 600
TGCTGGAACA GAAGCTAGCT TTGCTCTCCT GGCACTTGAG AGAGAAGACA ACAATAGCAG 660
TAACATCGAA AGGGTGCTAT GGGTTGGAAG ATGAGAAAGG AACAGTATGT ACTTCAACAA 720
GGCGTCGTTC CACAGCTCGC ATTTTAGCAG GATTGACCAC AGGTGTTTTT GAGTCTGTAA 780
TGGTTCAGAT TCTGAGACAA GAAGAACACC AGAGAGCAAA AGAAGAAAAG AGGCTACGAG 840
AGCAAGAACG GAAAGAGGCA GAAGAAGTTT GTCAGAAAGA AATAGAAGAA TGGGAAAAGT 900
CACTTTTAGC TCAAGCAGCA CCTACAAGGA TGGAGACTAT GTGGGAAATT CCAGCTATTG 960
GTCATTTCCT CTGTTTAGCA CAACAAATTC TAAACTTGCC TGAAATAGTC TTCTATGAGT 1020
TGGAACGTTG TCTTCTAATG CCTCAATGTA ATGTTTTTTT ATCTAAAATA ATGACGTCTT 1080
TATTAAGTCC TCCTCATCGT AGAGCTACAT TGCATCGAAG ACCAACTCTT CCTTATAGGA 1140
CTTGGGAAGC AGCACTTAGA CAGAAGGTAC AACAGTGGTA CACAGTTGTA GGGCAAACTG 1200
ACAACCCCGA TGGCTGTGCA GAAAAGCTTG GACTATGCCC CCAGTTTTTT AAAGTACTTG 1260
GGGAAGTTAA TCCTTTGGAA GGAAAACCAT TTCATGAGCT CCCATTCTAC CAAAAAGTAT 1320
GGCTGCTAAA AGGCTTATGT GACTTTGTTT ATGAAACACA AAAAGATGTT CAAGATGCTG 1380
TTCTTGGACA GCCTATACAT GAGTGCAGGG AAGTGATCCT TGGTTATGAT TTCTTGGAGA 1440
ATGCCTATGT TCATTTTCCA CAGTTCTGTG GTGCTGATGT ACGGATTTAC AAACAGAAGC 1500
CTTTTCAAGC TCCTGCATTT CCAGCCCCAC CCATCAAAGT AAAAAAGGTA CCACGGATTA 1560
AATTGGAGAA AGTGAAATGT GAGTACCTCA GCAAGAGCAA TGGGGAAGTC AGATTTGGTG 1620
ATAGAGAAGA GCTGCTACCT CACTGTAAAA TGGAAGCAGG GAAAAGCATG GATTCTCTTA 1680
TTTGTTGCCC AGCAAAAATT CACTTGGATA GCTTCAGTCT CACTACAGAA AAAGAAATGA 1740
AACCTAACTG TGAAATTAAA ATGCATAGAA CCTGTGACAT AAAAAAACCT GGCTGCTCTA 1800
AAGAGAACCA GGAGATGCCA ATTAGTCCAG GAGAAATTGA TGGCTTTGGA GAACCTCTTA 1860
GCCCAGGAGA AATTAGGGTT GTAGAAAATG GAGAGAGAGA TCATGAAGCT TCCCTAATAG 1920
AAACCGAGCC CAAGCCATTA AAAACAAATA CCCTTAAAAC CTGCCAAGTA CATGTGAATG 1980
GGACTCATAG CAACAATCCA GACATAAATT GCCACAGAGT TGCTAGAGAA CTGATTTTGG 2040
AAAATTCATT ATGGAATAAT AAGAAACTAA AACTTACTAA GATGCGGGCA AAAAAGAAGA 2100
AAAAGAAAAA AAAGAAATTG AAAGACATTT TGAATGAACA TCTTCAGAGA AAGCGTGAGA 2160
TTCATCCTCA TCCATTCAAA ACTTACAAAC CTGAGATTCA TAATAAGTTT TTTATCATCA 2220
AAAAGAAAGC AAAACACAAG AAGCACAAAT CTGGAAAAAA ATCCATATCT AAAAAAGCAA 2280
TCACAAAGAA GAGGAAAGCT GTTACAAAGC CTACGGTGCC AGACTTTCAG CTAATTTGCA 2340
CTAATCTTGA TGAACTCAGG GAATTAATCA CAAAAATCGA GAATGAACTC AAAGATCTGG 2400
AGAACATTAG AAAGAAATCG GGAAGGTGGT ACCATCGGAA ACAAGCAGTA AAGGAATTGC 2460
ACAGCACACT GATACGATTG CTAAATGAAT TATTACCATG GGAACCAAAG CTAATGAAGG 2520
CTTTTCAGAG AAACAGGTCT CGATTGAAGA AGGACTATGA TGATTTCATA AAACAGCCAG 2580
ACCATGATAA ATTTACCAGA GAACTATGGA GTAATGAAGA AAGTGAAGTT GATAATGGAA 2640
AAGAACCTTT TAGTGCAGTA ACCAGTAAAT CTTCAGAATC TGCAGAGCAT CTGGAGATCC 2700
TGCAAAAAGA TCACTCAGAG TTTGATGAAA TGAAACTGTT AGAAATGGAT TTTTCTGCAG 2760
GAAGGAGCAG GCTACTAAAG AAAGACTTGA CTTCTAAAGA AATACAGAAG ATACTGCCCA 2820
AATCTATTAA ACGTCAGTCT AAGCAAAATA GTTACTTAGA TGATAGCACA AAAGAACTTT 2880
CACCAAGGAA GAAAGTTAAA TTAAGCACAA ATGACGCTAT AGTTCAGAGT ACAGAAATTG 2940
AAGTGCAGAC TAACAGTTGT TTGAATGAAC TGAAACAAAG CGAGCTACCA CCTCCAGAAT 3000
CATTTGCCCT GACTGATTCT GCAACATCAA TATCAAACTT TCTGAAAGGG ACCAACCCCA 3060
TCCAAGCTTT GCTTGCTAAA AACATTGGGA ACAAAGTGAC CTTAACGAAT CAGATGACAC 3120
CACCTGTGGG TATGGACATA ATTACTTCTG AAAAGGCAGT TATGTCACCT GTGGAGTCAT 3180
CCCCACTACG GCCAGCAATG CCCTACCAAA CCAATTCAAA GAGTCCTTTA CAAATGGTAT 3240
ACAAAATGCC AAGTGGCCAC TGTGTACCAA TAGATGTCCA TAACAGCTCA GTGAAGATTC 3300
AGATGCAACC AGTGATTGAC CCTAAAACAG GAGAAAAAGT CATGCAACAA GTTCTTATGT 3360
TACCCAAAAA TTTTCTCGGA CAACACAAAG AAGGAAAAGG TGTAACAAAG CTGCACTGTC 3420
CATCTTTCTC ACAGACAACA GATATTAATG ATTCTTTATC ATCTGTTCTT GCAAACCCAA 3480
CAGTAAATGC TACCACTCAA ATGTCTTGTA CAGTTTTTAA CAAGAGTATT ACACATTTGT 3540
CACAAGTGAC TAGTCCAGTG AGCAAACCAC AGCCTTTGTC CTCTGTAACA TCAACAAGTA 3600
ACTTACTAAC ATCAGCTGTT AAGATGAGCC AAAGTGAGAC TGGCCAAATG GCAACTGCAG 3660
TTTCATCTGC CATATTTTCT GTACCTTTGC CTTCATCTAC TGTTTCCACA TTAGATCAGC 3720
ATTTGGCGTC AACAACACTA AGTGAATTTA CAAATACTTC ATGTTCCATC CACAGTTTAG 3780
CACCTCAGCA GACTAGTAGC TCCTGTGAAT CTAAACAGGA GCTGAAAACT GTGTGTATAA 3840
GAGAATCACA ATCCATTCTT GTCACAACAC GAGGGGGGAA CACTGGAATT GTTAAAGTAC 3900
AGACAAACCC AGGCCAGATT TCACCTAATG CTTTATCTCC AAATTCAGTT TTCACTTATG 3960
CACCTCCGCT TCAGGCATTT TTGGTTCCAA AATCCCCAAC ATTGTCAACT TCTACTTTTC 4020
CATCTGCAGC AACAGTGACA TCTAGTCTCC CACTGTTTGG TGCCTCCTCA ACATCATCTT 4080
CTGTTGCCAT TCCTGCAGGC TTAAACCAGT TAATAGGGAA AAATCTCAAA TTTCCAGTAG 4140
GGCAGCTTCC CACTAGTGGC AGTTTGGATC GGGTGATAGC GAAAACTCAA CAGGTGTCCT 4200
CTCCTACTTT CTTGTCCTCC ATTTCATCAA CTAGCACTTT GGTGTCAGCA ACTCCAAATA 4260
GTACTGTTAA TGTAATTAAC ATTTCTTCAG GAAGTACTGA ACAGAGTAAT GCAGATATTA 4320
CCCATATTCC TTCTATTCAG CAACAAGTTG ATATCACAAC CAAAAAACAT CCTGTTATGC 4380
AACCTGAGAT TGCTTCAGCA ACAAATGGTG ATATGCCCCA TTTAAAAAAT CCTGTTAAAA 4440
AATTTATGTT GGTCACAAAT CCACCTATTC TTTCTCCTTG TGGAGCTACA GGAGTAAATA 4500
TAATACCAAC TCCAACATCC ACTGGGTTTA GTGCCCAGAA ATTAGTTTTC ATTAATACCC 4560
CAATTCCCAG TAGCCCATCA AACACCAGTC TAATTACAGA ATCATTAAAG AAGACACTGC 4620
CTTCCTCTAT TGGTAAAACA TATGTTAATG CTCCAGAACA ACCTCAGTTG CTTCTAATTC 4680
CATCTACAAT GGGCACACCA ATGAAAGTAA GCTCATCAAC TACTGTGTCT CAAGTAAAAG 4740
ATGTTAAAAT TGGACTAAAC ATAGGTCAGG CCATTATAAA TACCACAGGA AATGCACCAC 4800
ATGTTCCACC AATTAACATA TTGCAGTCTG CAGCTCCCAA AGGAACAGAT GCCATAAGCA 4860
ACAAGGGTTT TATTTTGCCT TCATCAACAA GTGGTAGTTT GGTTCCAGGA TGTTCTAGTT 4920
TTGTGAGTCA AAATATTACA TCTGTTAATG AATCTATAGG TGTTGCAACA AAAGCAACAC 4980
ATGATTTTAC AGTAACAACA GCGAATGCAT GTTTAGATTC TGTTTCAGTG ACTGCAAGTG 5040
GGACTTCTGC TGGGACACGA CCCACTGTTT TGGTTAGTGG AAATGACACT TCAAGAATAA 5100
GGCCACTTTT AACTAATCCA CTGTGTGCAT CAAATATTGG AAATACTATG GCCATATCAA 5160
CTGTTAAAAC AGGACATCTT GCTTCATCTG TTCTTATTTC AACTACACAA CCAAGACTAT 5220
CTTCTCAAAG TATATCATCT GGTTTCCAGT TTCCAGTTAC AGTTACCTTG CCTGGACCTA 5280
TTGCAGCCTC CACTAAAGTC ATACAAACAG TTCCTTGCTT AACAGCAGTT CCAGCTGCTG 5340
CACAGTGTCT CCCCCTCCCA AAAGGACATC CCCCCACACT AGTACAATTT CAGTCACCAG 5400
GACTTTCAGC TACAGTGCCA AGTAATGCAA ACGTTCATAA ACCTCAAAAT GTATTGCCAG 5460
CTGCTTCACC AAATGCAGGT AAAAAAATTA GTTTTTCCAG CTTTGCTTCT CTCCCATGCC 5520
AGCAGAAGCC TACTAATTTA TCAAAACTAG CTCAGGCTTA TTCTTGTGCT CCAGGTGCCT 5580
CTGTCCCTCA GTCTACTACT GCTTCTACTA CTGTAACTAG CCAGGCAGTA GCTCAGTTGA 5640
ATGAAACTTG TTTTCAACAG AAAATAGTTA TTAACACCTG CACCCCTTTG GCACCTGGAA 5700
CTCATATCAT GATTAGTGGC ACTCGATTTA TTGTTCCACC ACAAGGCCTT GGCGCAGGCA 5760
GTCATGTTCT TTTAATCTCT GCTAATCCAA AACATGGGCC TCCGTTAGTT ATTAACAATG 5820
GCCAAGGTGC TCAAGGAGTA CCATTTGTTA ATCATATTCC CCAAAGCATT ATGCTAGCAC 5880
CAAGTAATTC ATTAAGTTGG CAAACATCAA AACATCCTTT GAAAAGCTCT ACAAAAATTG 5940
TGAACTCTCT TGGGACAGCA AACACTCTGC CTATTGTACA TGCAACACCA CAGATAATAA 6000
ACACTGCTGC TAAATCATGT GCTCCACCCT CTACAACTAC CCAGTTTGTG TCTTCAGTGA 6060
TAAAACCTCC AGTTAATGTT TTAGCTAAAA CTTCTTTGTT TTCTGCCATA AATGCTGGTA 6120
ACTCTCAGTT ACCAAGTAGC ACTTCAGTAT TACACTTAGA TACATCTATC AAGAAACTGT 6180
TGGTCAGCCC AGAAGGAGCC ATTTTGAATG CTATAAATAC TCCAGCATCT AAGGCGGTTT 6240
CTTCTCTATC TTCACCCCTG TCACCCATTG CAGTGTCCAA AAGCATAAAT CCAACCAGTG 6300
TCTTCCCTGC TTTTCAGTCT TCTGGCCTAA GTAAATCTGA CAAAGCTGCA TCCTGA 6357
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Mup-0088 ENSMPUP00000008310.1 Mustela putorius furo 65 0.0 2178
WERAM-Sus-0034 ENSSSCP00000023408.1 Sus scrofa 65 0.0 2174
WERAM-Lac-0107 ENSLACP00000013276.1 Latimeria chalumnae 51 0.0 1552
WERAM-Dan-0200 ENSDNOP00000032165.1 Dasypus novemcinctus 69 0.0 1431
WERAM-Hos-0214 ENSP00000444993.1 Homo sapiens 77 1e-172 605
WERAM-Leo-0092 ENSLOCP00000012306.1 Lepisosteus oculatus 53 6e-159 560
WERAM-Orn-0173 ENSONIP00000017673.1 Oreochromis niloticus 66 8e-157 553
WERAM-Pof-0240 ENSPFOP00000019940.1 Poecilia formosa 62 3e-153 541
WERAM-Xim-0057 ENSXMAP00000005178.1 Xiphophorus maculatus 64 2e-150 532
WERAM-Asm-0198 ENSAMXP00000018724.1 Astyanax mexicanus 65 2e-146 519
WERAM-Caj-0064 ENSCJAP00000010906.1 Callithrix jacchus 29 6e-07 55.5
WERAM-Fia-0029 ENSFALP00000001836.1 Ficedula albicollis 29 8e-07 55.1
WERAM-Tar-0047 ENSTRUP00000011219.1 Takifugu rubripes 31 9e-07 55.1
WERAM-Ten-0212 ENSTNIP00000021345.1 Tetraodon nigroviridis 31 1e-06 54.3
WERAM-Meg-0088 ENSMGAP00000008554.2 Meleagris gallopavo 31 2e-06 53.5
Created Date 25-Jun-2016