WERAM Information
Tag | Content | ||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Pes-0170 | ||||||||||||
Ensembl Protein ID | ENSPSIP00000020005.1 | ||||||||||||
Gene Name | |||||||||||||
Ensembl Information |
|
||||||||||||
Status | Unreviewed | ||||||||||||
Classification |
|
||||||||||||
Organism | Pelodiscus sinensis | ||||||||||||
Domain Profile | HMT SET1 SET1.txt 2 elevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNce 88 |
||||||||||||
Protein Sequence (Fasta) | MKAKMVALKG INKVMAQSNM GMPPMVMNRF PFMGQPMPGT QTSEGQNLMP QAITQDGSLT 60 PQIARPTPPN FGPGFVNDSQ RKQYEEWLQE TQQLLQMQQK YLEEQIGVHR KSKKALSAKQ 120 RTAKKAGREF PEEDAEQLKH VTEQQSMVQK QLEQIRKQQK EHAELIEEYR IKQQQQCGIT 180 PHAIMPGVQS QPSMVPGGTP PAMNQQNFPI VAQQLQHQQH TVVIPGQSNP ARMPNLPGWQ 240 PAPAPPNHIP LNAPRMQPPM TQLPMNPGTP TPVPGSNANA QSGPPPRVEF DDNNPFSESF 300 QERERKERLR EQQERQRIQL MQEVDRQRAL QRMEMEQHGM IGSELNNRAS LSQIPFFNSD 360 LPCDFMQPPR PLQQSPQQQQ QQIGQVLQQG PVNSPPAVNF MQSSERRPLG PTSFGPEPPS 420 IPGGSPNFHS VKQSHGGISG TSFAQNQVRP PFAPGIPVVP PAAGSGLSCG QDTSTPHAPN 480 FPGSSQSLIQ LYSDIIPEEK GKKKRTRKKK KDDDAESIKA PSTPHSDITA PPTPAVSDST 540 STPTVSTPSE LIRHQNELES AELTGSSTTN TAENQPSLEL ECKHFNSGLL QKKFPQKSSV 600 NPETEQGKTD SPASIQEIKL EKAESDQCSG QAEPKTENQS GIMIEDKATL HPASSAQSPP 660 QSTSTPATKG ESGNELLKHL LKNKKSSSLL NQKSENSCRS EESAGDNKLV EKQNTAEAMP 720 TVGNQMQGVF GCGNSQLQRT DVGHEIKKQR NKRTQRTGEK AAPRSKKRKK EEEEKQAIYP 780 NTDTFIQLKQ QLSLLPLMEP IIGVSFAHFL PYGSGQLSGG NRLMGSFGSA TLDGVSDYYS 840 QLIYKQNNLS NPPTPPASLP PTPPPVACQK MVNGFATTEE LAGKAGMLGG HDVTKVLGPK 900 QFQLPFRPQD DLLARAMAQG PKTVDVPASL PTPPHNNQEE LRVQDHCDDR DTPDSFVPSS 960 SPESVVGMEV SRYPDLSLVK EETPEPVSSP IIPILPSSTG KGSEAKRNYV KSEPSSGTLP 1020 GSGFFGSQLG PSQNGPKSGL ISVAITLHPT AAENISSVVA AFSNLLHVRI PNSYEVSNAP 1080 DVPSSMGMIN NHRMHPALEY RQHLLLHGPQ AGSIGPTRLV GSYGLKQPNV PFPPNSNGLA 1140 SYKDHNQTIA EGSALRPQWC SHCKVVVLGS GVRKSFKDLS FLKQDSREIS DRVEKDVVFC 1200 SNNCFVLYSA AVQAKNSESK DSLPSFAQSP MKEMTQKPFH QYNNNISTLD VHCLPQLQEK 1260 VSPPSSPPIT FPPAFEAAKV EAKPDELKVT VKLKPRLKTI HSSLDDCRPL SKKWRGMKWK 1320 KWSIHVVIPK GSFKPPGEEE IDEFLKKLGT TLKPDPVPKD YRKCCFCHEE GDGLTDGPAR 1380 LLNLDLDLWV HLNCALWSTE VYETQAGALI NVELALRRGL QMKCMFCHKM GATSSCHRLR 1440 CTNIYHFTCA IKAQCMFFKD KTMLCPMHKP KGAHEQELSY FAVFRRVYIQ RDEVRQIASI 1500 VQRGEREHTF RVGSLIFHTI GQLLPQQMQA FHSSKALFPV GYEASRLYWS MRYANRRCRY 1560 LCSIEEKDGF PVFVIRIVEH GHDDLVLTDS TPKGVWDKIL ETVASVRKDS EMLQLFPGYL 1620 KGEDLFGLTV SAVARIAESL PGVEACENYT FRYGRNPLME LPLAINPTGC ARAEPKMSTH 1680 VKRFVLRPHT LNSTSTSKSF QSTVTGELNA PYSKQFVHSK SSQYRKMKTE WKSNVYLARS 1740 RIQGLGLYAA RDIEKHTMII EYIGTIIRNE VANRKEKLYE SQNRGVYMFR IDNDHVIDAT 1800 LTGGPARYIN HSCAPNCVAE VVTFERGHKI IISSNRRIQK GEELCYDYKF DFEDDQHKIP 1860 CHCGAVNCRK WMN 1873 |
||||||||||||
Nucleotide Sequence (Fasta) | ACTTTGATGC AATCACTGAT CCCATTATGA AAGCAAAAAT GGTAGCTCTT AAAGGTATCA 60 ATAAAGTAAT GGCACAGAGC AATATGGGAA TGCCACCAAT GGTTATGAAC AGGTTTCCCT 120 TTATGGGTCA GCCAATGCCA GGAACACAGA CTAGTGAAGG CCAGAATCTT ATGCCGCAAG 180 CAATCACACA GGATGGAAGT TTGACACCTC AGATTGCAAG GCCTACTCCT CCAAATTTTG 240 GTCCTGGTTT TGTTAATGAT TCACAAAGAA AGCAATATGA AGAATGGCTT CAAGAAACAC 300 AACAACTTCT TCAAATGCAA CAGAAATACC TGGAAGAGCA AATTGGCGTG CACAGAAAAT 360 CAAAAAAAGC GCTCTCAGCA AAACAACGTA CTGCCAAAAA AGCTGGCCGT GAATTCCCAG 420 AAGAAGATGC AGAACAGCTT AAGCATGTTA CCGAGCAGCA GAGTATGGTC CAAAAACAGC 480 TAGAACAGAT TCGAAAACAG CAAAAGGAAC ATGCAGAGCT CATTGAGGAA TATCGAATCA 540 AGCAGCAGCA GCAATGTGGG ATAACACCGC ATGCTATAAT GCCAGGCGTC CAGTCTCAGC 600 CATCTATGGT TCCAGGAGGA ACACCACCAG CAATGAATCA GCAAAATTTC CCCATAGTCG 660 CACAACAACT CCAGCATCAG CAGCACACAG TTGTAATTCC TGGGCAATCC AACCCAGCCA 720 GAATGCCAAA CTTACCTGGA TGGCAACCTG CACCTGCTCC TCCAAATCAC ATCCCCCTCA 780 ATGCTCCAAG GATGCAACCT CCAATGACAC AGTTACCAAT GAATCCTGGC ACCCCAACTC 840 CAGTGCCAGG TTCCAATGCA AATGCACAGT CAGGGCCACC ACCAAGGGTT GAATTTGATG 900 ATAATAATCC ATTTAGTGAA AGTTTTCAAG AGCGGGAGCG AAAGGAACGT TTGCGAGAGC 960 AGCAGGAACG GCAACGAATA CAGCTAATGC AGGAGGTGGA CCGACAGAGG GCTCTGCAGA 1020 GGATGGAAAT GGAACAACAT GGCATGATAG GGTCAGAATT AAATAACAGA GCTTCCTTGT 1080 CTCAGATACC TTTCTTTAAC TCTGACCTAC CCTGTGATTT CATGCAGCCT CCGCGTCCTC 1140 TTCAGCAGTC TCCACAGCAG CAGCAGCAGC AAATAGGGCA AGTTTTGCAA CAAGGCCCTG 1200 TGAACTCACC ACCTGCTGTA AATTTTATGC AAAGCAGTGA GCGAAGACCA CTGGGGCCTA 1260 CCTCTTTTGG ACCCGAGCCA CCTTCAATTC CTGGTGGATC CCCTAACTTC CATTCTGTAA 1320 AACAATCCCA TGGGGGTATC TCTGGGACCA GCTTTGCACA GAACCAAGTC AGGCCTCCAT 1380 TTGCGCCTGG TATACCTGTT GTGCCTCCAG CAGCTGGTAG TGGTCTTTCC TGTGGCCAAG 1440 ACACCAGCAC ACCCCATGCA CCAAATTTTC CTGGGTCAAG TCAGTCTCTT ATTCAGCTGT 1500 ACTCTGATAT AATTCCGGAA GAGAAAGGGA AAAAGAAAAG GACACGAAAA AAGAAAAAGG 1560 ATGATGATGC TGAGTCAATA AAAGCACCAT CAACTCCACA TTCGGATATA ACTGCACCTC 1620 CTACTCCAGC TGTTTCAGAT TCTACCTCTA CCCCAACAGT TAGCACACCT AGTGAACTTA 1680 TTCGTCATCA GAATGAGCTG GAGTCAGCAG AGCTAACAGG CTCATCAACA ACAAATACTG 1740 CAGAGAACCA GCCTTCCTTA GAGCTGGAAT GTAAGCATTT CAATAGTGGC TTGCTCCAAA 1800 AGAAATTTCC CCAGAAATCG AGTGTCAATC CTGAGACTGA ACAGGGTAAG ACAGATTCTC 1860 CAGCCAGCAT TCAGGAAATT AAACTGGAAA AGGCTGAATC TGATCAATGT TCTGGCCAAG 1920 CTGAGCCTAA AACAGAAAAT CAGAGTGGTA TTATGATAGA AGACAAGGCC ACACTACACC 1980 CTGCCTCTTC AGCACAGAGT CCACCACAAT CGACCAGCAC CCCTGCAACC AAGGGGGAGT 2040 CAGGGAATGA GCTGCTGAAA CACTTACTTA AAAATAAAAA ATCATCCTCT CTTCTAAATC 2100 AGAAATCAGA GAATAGCTGC CGATCAGAAG AAAGTGCTGG GGATAACAAA CTAGTGGAGA 2160 AACAGAACAC AGCTGAAGCA ATGCCAACTG TGGGGAATCA AATGCAAGGT GTATTTGGAT 2220 GTGGTAACAG CCAGCTTCAG AGAACAGATG TAGGACATGA AATAAAGAAA CAGCGAAATA 2280 AACGAACTCA GAGGACTGGA GAGAAGGCAG CCCCTCGGTC TAAGAAGAGA AAAAAAGAGG 2340 AAGAGGAGAA ACAGGCTATT TATCCTAACA CAGATACATT TATCCAACTC AAACAACAAC 2400 TCTCTCTGCT GCCTCTAATG GAACCAATAA TTGGAGTGAG CTTTGCACAC TTTCTCCCCT 2460 ATGGCAGTGG CCAGCTAAGT GGTGGAAATC GACTTATGGG AAGTTTTGGT AGTGCTACAC 2520 TTGATGGGGT TTCTGATTAC TATTCCCAGC TGATCTACAA GCAGAATAAT TTGAGTAATC 2580 CTCCAACACC CCCAGCCTCT CTTCCTCCAA CACCACCGCC AGTAGCTTGT CAAAAAATGG 2640 TGAATGGATT TGCAACTACT GAGGAACTTG CTGGAAAAGC TGGAATGCTG GGTGGACATG 2700 ATGTTACCAA AGTGCTTGGA CCAAAACAGT TCCAGTTACC TTTCAGGCCA CAAGATGATT 2760 TGTTAGCAAG AGCTATGGCT CAGGGCCCTA AAACTGTGGA TGTTCCTGCT TCACTTCCAA 2820 CACCACCTCA TAACAATCAG GAAGAATTAA GGGTTCAAGA TCACTGTGAT GACCGGGACA 2880 CTCCTGACAG CTTTGTTCCC TCTTCCTCTC CTGAAAGTGT GGTGGGAATG GAAGTAAGTA 2940 GGTATCCAGA TTTGTCTCTG GTAAAGGAAG AAACCCCGGA GCCTGTATCA TCTCCCATCA 3000 TTCCAATCCT ACCCAGCAGT ACTGGAAAAG GTTCAGAAGC AAAAAGGAAT TATGTAAAAT 3060 CAGAACCTAG TTCAGGAACT TTACCTGGTT CTGGCTTCTT TGGCTCTCAG CTTGGGCCAT 3120 CCCAAAATGG TCCGAAATCT GGCCTGATAT CTGTAGCAAT TACTTTACAT CCCACAGCTG 3180 CAGAGAATAT TAGTAGTGTC GTGGCTGCAT TTTCCAATCT ACTGCATGTG AGAATTCCCA 3240 ACAGCTACGA AGTTAGTAAT GCCCCAGATG TTCCATCCTC CATGGGAATG ATAAACAATC 3300 ACAGAATGCA TCCAGCTCTG GAATACAGAC AGCACTTATT ACTTCATGGT CCTCAGGCAG 3360 GATCCATAGG CCCCACCAGA TTAGTAGGGT CTTATGGATT GAAGCAACCT AATGTGCCAT 3420 TTCCTCCAAA CAGCAATGGT CTAGCCAGTT ATAAAGATCA CAATCAAACT ATTGCAGAAG 3480 GCTCAGCACT GAGACCTCAG TGGTGTTCTC ATTGCAAAGT GGTTGTACTT GGCAGTGGTG 3540 TGCGGAAATC TTTCAAAGAT CTATCTTTCC TTAAACAGGA TTCCCGAGAG ATCTCTGATA 3600 GAGTGGAAAA GGATGTTGTC TTTTGTAGCA ACAACTGCTT TGTTCTTTAT TCAGCAGCTG 3660 TGCAAGCAAA AAACTCAGAG AGCAAGGACT CACTTCCATC ATTCGCACAG TCACCGATGA 3720 AAGAAATGAC CCAGAAACCG TTCCATCAGT ACAACAACAA CATTTCCACA TTAGATGTAC 3780 ATTGTCTCCC TCAGCTGCAA GAAAAAGTTT CTCCGCCTTC ATCACCCCCT ATCACATTTC 3840 CTCCTGCATT TGAAGCAGCT AAGGTAGAAG CAAAGCCAGA TGAGCTTAAA GTAACAGTGA 3900 AGCTAAAACC TAGGTTAAAA ACAATACACA GTAGCCTTGA TGATTGTCGC CCTCTAAGCA 3960 AGAAATGGAG AGGAATGAAA TGGAAAAAAT GGAGCATTCA TGTTGTGATT CCTAAAGGAT 4020 CATTCAAACC TCCTGGTGAA GAGGAAATAG ATGAGTTCCT CAAGAAACTG GGCACAACCC 4080 TTAAACCTGA CCCTGTGCCT AAAGACTACA GAAAATGTTG CTTCTGTCAT GAAGAGGGTG 4140 ATGGACTAAC CGATGGACCA GCAAGGCTTC TTAACCTTGA CTTAGACCTT TGGGTCCATT 4200 TGAACTGTGC TCTTTGGTCT ACAGAGGTCT ATGAGACACA GGCTGGTGCC TTAATTAACG 4260 TGGAACTAGC ACTGCGAAGA GGCTTGCAAA TGAAATGCAT GTTCTGTCAC AAAATGGGTG 4320 CCACCAGCAG TTGTCACAGG TTAAGATGCA CCAATATTTA TCACTTTACC TGTGCCATTA 4380 AAGCACAATG CATGTTTTTT AAAGACAAAA CCATGCTTTG CCCCATGCAC AAACCAAAGG 4440 GAGCTCATGA GCAAGAACTT AGTTACTTTG CAGTTTTCAG GAGGGTCTAC ATTCAACGTG 4500 ATGAGGTACG GCAGATTGCT AGTATAGTAC AGCGGGGAGA ACGCGAGCAC ACTTTCCGTG 4560 TAGGAAGCCT GATCTTCCAC ACCATTGGTC AGCTGCTGCC ACAACAGATG CAGGCATTCC 4620 ATTCTTCAAA AGCTCTCTTC CCTGTTGGAT ACGAGGCCAG CCGGTTGTAT TGGAGCATGC 4680 GGTATGCAAA CAGGCGGTGT CGCTATCTGT GTTCTATTGA GGAGAAGGAT GGCTTTCCAG 4740 TGTTTGTGAT CAGGATTGTT GAGCATGGTC ATGATGATCT GGTTCTTACT GATTCAACAC 4800 CAAAAGGTGT GTGGGATAAA ATCTTGGAAA CTGTTGCTTC TGTTAGAAAG GATTCTGAAA 4860 TGCTGCAGCT CTTTCCTGGA TATTTGAAGG GTGAAGACCT CTTTGGTTTG ACAGTCTCTG 4920 CAGTGGCAAG AATCGCTGAA TCACTTCCTG GGGTTGAGGC TTGTGAGAAC TACACTTTCC 4980 GATATGGCCG AAACCCTTTA ATGGAACTTC CTCTTGCCAT CAACCCCACG GGCTGTGCCC 5040 GTGCTGAGCC TAAAATGAGT ACCCATGTCA AGAGGTTTGT GTTAAGGCCT CACACCTTGA 5100 ATAGCACCAG CACCTCAAAG TCATTTCAGA GCACAGTGAC AGGAGAGCTA AATGCGCCTT 5160 ACAGTAAACA GTTTGTCCAT TCCAAGTCCT CCCAATACCG CAAAATGAAG ACTGAATGGA 5220 AATCCAATGT ATATCTGGCT CGCTCTCGGA TTCAGGGCCT AGGCTTATAT GCTGCTAGAG 5280 ACATTGAAAA GCACACCATG ATCATTGAAT ATATTGGAAC TATCATCCGT AATGAGGTAG 5340 CAAACAGGAA AGAGAAGCTA TATGAATCTC AGAATCGTGG AGTGTACATG TTCCGCATTG 5400 ACAATGATCA TGTCATTGAT GCTACATTGA CAGGAGGGCC TGCGAGGTAT ATTAACCATT 5460 CATGTGCACC TAACTGTGTA GCTGAGGTGG TGACTTTTGA GAGAGGACAC AAGATTATCA 5520 TCAGCTCCAA CAGGAGAATC CAGAAGGGGG AGGAGCTTTG CTATGACTAT AAGTTTGATT 5580 TTGAAGATGA CCAGCACAAG ATCCCATGTC ACTGTGGAGC TGTAAACTGC CGAAAATGGA 5640 TGAACTAG 5649 |
||||||||||||
Sequence Source | Ensembl | ||||||||||||
Orthology | |||||||||||||
Created Date | 25-Jun-2016 |