WERAM Information
Tag | Content | ||||||||||||||||||||||||||||||
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
WERAM ID | WERAM-Soa-0010 | ||||||||||||||||||||||||||||||
Ensembl Protein ID | ENSSARP00000001031.1 | ||||||||||||||||||||||||||||||
Gene Name | WHSC1 | ||||||||||||||||||||||||||||||
Ensembl Information |
|
||||||||||||||||||||||||||||||
Status | Unreviewed | ||||||||||||||||||||||||||||||
Classification |
|
||||||||||||||||||||||||||||||
Organism | Sorex araneus | ||||||||||||||||||||||||||||||
Domain Profile | HMT SET2 SET2.txt 2 kveliktekkGyGlrakeeikkdefileYvGevidekeakkRlkeyeeekvkdfYlleldkdeviDatkkGnlaRfinhsCePncet 88 Me_Reader PWWP PWWP.txt 1 agdLVwaKlkgYpwWPalvisppleakkl..ktqeaeenkylVlFFgnkherawvkrkklvpys 62 HMT SET1 SET1.txt 5 vakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeig.vylfrldedaevvvdatkkgniarfinhscepNceak 90 Me_Reader PHD PHD.txt 2 tiClvCgkddegekemvqCde.CddwfHlkCvklplsslpegkswyCpsCk 51 |
||||||||||||||||||||||||||||||
Protein Sequence (Fasta) | MELSIKKSPL SVHKVVKCIK MKQAPEILGN TNGKTQNCEV SRECSVFLSK AQLSTSLHEG 60 MQKLNGHDAL PFIPTEKLKD LTPRVFNGIT GAQDANLRFK SQEKRTWXXX XXXXXXXXXX 120 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 180 XXXXXXXXXX XXXXXXXIPS KKESCLNTTG DKDHLLKYNV GDLVWSKVSG YPWWPCMVSA 240 DPLLHSYTKL KGQKKSARQY HVQFFGDAPE RAWIFEKSLV AFEGEGQFEK LCQESAKQAP 300 TKAEKIKLLK PISGKLRAQW EMGIVQAEEA ASMSVEERKA KFTFIYVGDQ LHLNPHVAKE 360 AGIATESPGG MVGTSGVSEE ASEGPKSPKE EGIPIKRRRR AKLASCAENH ESEPGTVKST 420 PQKTAEFGTK RGVGSPPGRK KAPASTPRSR KGDAASQFLV FCQKHRDEVV AEHPDASGEE 480 IEELLGSQWN MLNEKQKARY HTKFALVASA QAEDDSGNLN GKKRSHTKRT QDFAEDVEVE 540 DVPRKRLRTD KHSLRKREIT DKMARTSSCK AIEATSSLKS QAATKNLSDA CKPLKKRNRA 600 SAAAPSALGF SKSSSPSASL TENEVSDSPG DEPLESPYES ADEAQTEVSK KSERGVAAKK 660 EYVCQLCEKP GSLLLCEGPC CGAFHLACLG LSRRPEGRFT CSACASGKRV HSCFVCKESQ 720 ADVKRCVVSQ CGKFYHEACV RKFPLTVFES RGFRCPLHSC VSCHASNPSN PRPSKGKMMR 780 CVRCPVAYHG GDACLAAGCS VIASNSIICT SHFTARKGKR HHAHVNVSWC FVCSKGGSLL 840 CCESCPAAFH PDCLNIDMPD GSWFCNDCRA GKKLHFQDII WVKLGNYRWW PAEVCHPKNV 900 PPNIQKMKHE IGEFPVFFFG SKDYYWTHQA RVFPYMEGDR GSRHHGVRGI GRVFKNALQE 960 AEARFREVKL QREARETQES ERKPPPYKHI KVNKPYGKVQ IYTADISEIP KCNCKPTDES 1020 PCGFDSECLN RMLMFECHPQ VCPAGESCQN QCFTKRQYPE TKIIKTDGKG WGLVAKRDIR 1080 KGEFVNEYVG ELIDEEECMA RIKYAHENDI THFYMLTIDK DRIIDAGPKG NYSRFMNHSC 1140 QPNCETLKWT VNGDTRVGLF AVCDIPAGTE LTFNYNLDCL GDEKTVCRCG ASHCSGVLGD 1200 RAKXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 1260 XXXXXXXXXX XXKWECPWHH CDVCGKPSTS FCHFCPNSFC KEHQDGTAFV STQDGRSYCC 1320 EHDLGADTVR GAKTEKPFPE PGKAKGKRKK RRCWRRVTEG K 1361 |
||||||||||||||||||||||||||||||
Nucleotide Sequence (Fasta) | ATGGAATTGA GCATCAAAAA AAGTCCTCTT TCTGTTCACA AAGTTGTAAA GTGCATAAAG 60 ATGAAGCAGG CGCCAGAAAT CCTTGGCAAT ACAAATGGGA AGACTCAAAA CTGTGAAGTC 120 AGTCGTGAGT GCTCTGTGTT TCTCAGCAAA GCACAGCTTT CTACTAGTCT GCACGAGGGG 180 ATGCAAAAGC TTAATGGCCA TGATGCACTT CCCTTTATTC CAACGGAGAA ACTGAAAGAT 240 TTAACTCCCC GGGTGTTTAA TGGGATAACC GGTGCTCAAG ATGCTAACTT GCGTTTTAAG 300 TCCCAGGAAA AGAGGACTTG GNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 360 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 420 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 480 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 540 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NATTCCGTCT 600 AAGAAAGAGT CCTGTCTAAA TACTACAGGA GATAAAGACC ACTTGTTGAA ATACAATGTG 660 GGTGATTTGG TGTGGTCCAA AGTGTCCGGT TATCCTTGGT GGCCATGCAT GGTTTCTGCA 720 GATCCTCTCC TTCACAGCTA CACCAAACTT AAAGGTCAGA AAAAGAGTGC CCGCCAGTAT 780 CATGTCCAGT TCTTCGGTGA TGCCCCAGAA AGAGCGTGGA TATTTGAGAA GAGCCTGGTA 840 GCTTTTGAAG GAGAAGGACA GTTTGAAAAA TTATGCCAGG AGAGTGCCAA GCAGGCACCC 900 ACAAAAGCTG AGAAAATTAA GCTCTTGAAA CCTATTTCAG GGAAATTGAG AGCTCAGTGG 960 GAAATGGGCA TTGTCCAAGC TGAAGAAGCT GCCAGCATGT CTGTAGAGGA GCGGAAAGCC 1020 AAGTTTACCT TCATCTATGT GGGAGACCAG CTTCATCTCA ACCCTCATGT AGCTAAGGAG 1080 GCGGGCATTG CTACAGAGTC TCCAGGAGGA ATGGTGGGGA CCTCAGGAGT CAGTGAGGAA 1140 GCCTCTGAGG GGCCCAAGTC CCCAAAAGAA GAGGGCATTC CCATCAAGCG AAGGCGACGG 1200 GCCAAATTGG CGAGTTGTGC TGAGAACCAT GAATCTGAGC CTGGGACAGT AAAGAGCACT 1260 CCTCAAAAGA CGGCAGAGTT TGGCACTAAG AGAGGAGTCG GCTCTCCTCC TGGAAGGAAG 1320 AAGGCCCCAG CCTCTACTCC GAGAAGCAGA AAAGGAGATG CAGCATCCCA GTTTTTGGTC 1380 TTCTGTCAGA AGCACAGGGA TGAGGTGGTT GCAGAGCACC CAGACGCCTC AGGTGAGGAG 1440 ATTGAAGAGT TGCTTGGCTC CCAGTGGAAC ATGCTCAATG AGAAGCAGAA AGCGCGCTAT 1500 CATACAAAGT TTGCCCTGGT GGCCTCTGCC CAGGCAGAAG ACGACTCTGG TAATTTAAAT 1560 GGGAAAAAAA GAAGTCACAC CAAGAGGACA CAGGACTTTG CAGAAGATGT TGAAGTTGAG 1620 GATGTACCAA GGAAAAGACT CAGGACTGAC AAGCACAGTC TTCGGAAGAG AGAGATCACT 1680 GACAAAATGG CCAGAACAAG CTCTTGCAAG GCCATAGAGG CCACCTCCTC CCTGAAGAGC 1740 CAGGCAGCAA CGAAAAATCT GTCTGATGCA TGCAAACCAC TGAAGAAGCG AAATCGGGCT 1800 TCCGCTGCTG CACCTTCAGC TCTTGGGTTT AGCAAAAGTT CATCTCCTTC TGCGTCATTA 1860 ACTGAGAATG AGGTGTCGGA CAGCCCAGGA GATGAGCCCT TGGAGTCCCC GTATGAGAGT 1920 GCCGATGAAG CACAGACAGA AGTGTCCAAA AAGTCTGAGC GAGGAGTGGC TGCCAAAAAG 1980 GAGTATGTGT GCCAGCTGTG TGAGAAGCCT GGCAGCCTCC TGCTCTGCGA GGGCCCTTGC 2040 TGTGGGGCGT TCCACCTTGC CTGCTTGGGC CTTTCTCGGC GGCCGGAAGG GAGGTTCACC 2100 TGCAGCGCCT GTGCTTCAGG CAAGCGGGTT CACTCGTGTT TCGTGTGTAA GGAGAGCCAG 2160 GCCGACGTTA AGCGCTGTGT CGTGTCGCAG TGCGGAAAGT TCTACCATGA AGCTTGCGTG 2220 AGAAAATTCC CGCTGACTGT GTTTGAGAGC CGAGGCTTCC GCTGTCCTCT GCACAGCTGT 2280 GTGAGCTGCC ATGCATCCAA CCCTTCCAAC CCCAGACCAT CCAAAGGTAA AATGATGCGG 2340 TGTGTCCGCT GTCCCGTGGC CTATCATGGA GGGGATGCTT GTCTGGCAGC AGGCTGCTCA 2400 GTGATCGCTT CTAACAGCAT CATCTGCACC AGCCACTTCA CAGCCCGAAA GGGAAAAAGG 2460 CATCATGCCC ATGTCAATGT GAGCTGGTGC TTTGTGTGCT CCAAAGGGGG CAGCCTCTTG 2520 TGCTGTGAGT CCTGCCCGGC AGCCTTCCAC CCCGATTGCC TGAACATCGA CATGCCTGAT 2580 GGCAGCTGGT TCTGCAATGA CTGCAGGGCT GGGAAGAAGT TGCACTTCCA GGACATCATC 2640 TGGGTGAAGC TGGGCAACTA CAGATGGTGG CCGGCAGAAG TTTGCCATCC CAAAAATGTT 2700 CCCCCAAATA TTCAGAAAAT GAAGCACGAG ATTGGAGAAT TCCCTGTGTT TTTCTTTGGG 2760 TCTAAAGATT ATTATTGGAC GCATCAGGCG CGAGTGTTCC CGTACATGGA GGGGGACCGG 2820 GGCAGCCGCC ACCATGGGGT CCGAGGGATC GGCAGAGTCT TCAAGAACGC ACTGCAAGAA 2880 GCTGAGGCTC GCTTTCGAGA AGTCAAACTT CAGAGGGAAG CCAGAGAGAC GCAAGAGAGT 2940 GAGCGCAAGC CCCCGCCATA CAAGCACATT AAGGTGAATA AGCCTTATGG AAAAGTCCAG 3000 ATCTACACCG CTGATATCTC AGAGATCCCA AAGTGCAACT GCAAACCCAC GGACGAGAGC 3060 CCCTGTGGCT TCGACTCGGA GTGTCTGAAC CGGATGCTGA TGTTCGAGTG CCACCCCCAG 3120 GTGTGCCCTG CGGGGGAGTC TTGCCAGAAC CAGTGCTTCA CCAAGCGCCA GTACCCCGAG 3180 ACCAAGATCA TCAAGACCGA TGGCAAAGGG TGGGGCCTGG TTGCTAAAAG GGACATCAGA 3240 AAGGGAGAGT TTGTGAATGA GTATGTCGGT GAGCTGATTG ACGAGGAAGA GTGTATGGCA 3300 AGAATCAAAT ACGCACATGA GAACGACATC ACCCACTTTT ACATGCTCAC CATAGACAAG 3360 GATCGTATCA TTGATGCTGG CCCCAAAGGG AACTATTCTC GGTTCATGAA TCACAGCTGC 3420 CAACCCAACT GTGAGACACT CAAATGGACA GTGAACGGTG ACACTCGAGT GGGCCTGTTT 3480 GCTGTGTGTG ACATCCCAGC AGGAACTGAG CTGACGTTCA ACTACAACCT TGACTGTCTG 3540 GGGGACGAAA AGACGGTCTG TCGGTGTGGG GCGTCCCACT GCAGCGGGGT CCTCGGGGAC 3600 AGGGCAAAGN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3660 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3720 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 3780 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNGGAAAT GGGAATGTCC CTGGCATCAT 3840 TGTGACGTCT GTGGCAAACC ATCGACTTCC TTTTGCCACT TTTGCCCCAA TTCGTTTTGC 3900 AAGGAGCATC AAGACGGGAC AGCCTTCGTC TCAACCCAGG ACGGGCGTTC ATACTGCTGC 3960 GAGCATGACT TGGGGGCAGA CACAGTTCGA GGTGCCAAGA CTGAGAAACC CTTTCCAGAG 4020 CCTGGGAAGG CCAAGGGGAA GAGAAAGAAA AGGAGGTGTT GGCGGAGGGT CACAGAAGGC 4080 AAATAG 4087 |
||||||||||||||||||||||||||||||
Sequence Source | Ensembl | ||||||||||||||||||||||||||||||
Orthology | |||||||||||||||||||||||||||||||
Created Date | 25-Jun-2016 |