Tag |
Content |
WERAM ID |
WERAM-Soa-0096 |
Ensembl Protein ID |
ENSSARP00000008932.1 |
Gene Name |
EZH1 |
Ensembl Information |
|
Status |
Unreviewed |
Classification |
Type |
Family |
E-value |
Score |
Start |
End |
HMT |
EZ |
5.30e-62 |
206.5 |
214 |
729 |
HMT |
SET1 |
2.80e-42 |
143 |
614 |
729 |
|
Organism |
Sorex araneus |
Domain Profile |
HMT EZ
EZ.txt 58 lndqlvidakrkGnklkfanh 78 + ++i++++k k +f n ENSSARP00000008932.1 214 KRKRHAIESNKKSSKKQFSND 234 556778999999999999885 PP EZ.txt 1 krillgksdvaGwGlflkesvekneylgeytGelisddeadkrGkiydrakssflfnlndqlvidakrkGnklkfanhsakpncyakvl 89 k++ll++sdvaGwG f+kesv+kne+++ey+Gelis+dead+rGk+yd+++ssflfnln+++v+da+rkGnk++fanhs +pncyakv+ ENSSARP00000008932.1 614 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVV 702 789************************************************************************************** PP EZ.txt 90 lvaGdhriGlfakrrieaseelffdyr 116 +v+GdhriG+fakr+i+a+eelffdyr ENSSARP00000008932.1 703 MVNGDHRIGIFAKRAIQAGEELFFDYR 729 **************************7 PP
HMT SET1
SET1.txt 1 kelevakskikglglvakkeiekeelviEYvGevirsevadkrekeyekkeigvylfrldedaevvvdatkkgniarfinhscepNcea 89 k+l +a s+++g+g ++k++++k+e++ EY+Ge+i++++ad+r k y+k+ ++++lf+l++d +vvdat+kgn++rf nhs++pNc+a ENSSARP00000008932.1 614 KHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKY-MSSFLFNLNND--FVVDATRKGNKIRFANHSVNPNCYA 699 678999*******************************************9.778********..************************* PP SET1.txt 90 kvvavdgekkiviyakraIekgeeltydYk 119 kvv+v+g+++i+i+akraI++geel++dY+ ENSSARP00000008932.1 700 KVVMVNGDHRIGIFAKRAIQAGEELFFDYR 729 *****************************6 PP
|
Protein Sequence (Fasta) | MDIQNPPTSK CITYWKRKVK SEYMRLRQLK RLQANMGAKX XXXXXXXXXX XXXXXXXXXX 60 XXXXXXXXXX XXXXXXXXXX XXXXXXXXXP GFASQHMLMR SLNTVALVPI MYSWSPLQQN 120 FMVEDETVLC NIPYMGDEVK EEDETFIEEL INNYDGKVHX XXXMIPGSVL ISDAVFLELV 180 DALNQYSDEE EEGHNDTSDG KQDDSKEDLP VTRKRKRHAI ESNKKSSKKQ FSNDMIFSAI 240 ASMFPENGVP DDMKEXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX XXXXXXXXXX 300 XXXXXXXXXX XFHATPNVYK RRNKEIKIEP EPCGTDCFLL LEGAKEYAML HNPRSKCSGR 360 RRRRHHVVSA SCSITSSSAV SQTKEGDSDR DTGNDWASSS SEANSRCQTP TKQKASPAPP 420 QLCVVEAPSE PVEWTGAEES LFRVFHGTYF NNFCSIARLL GTKTCKQVFQ FAVRESLILK 480 LPTDELTNPS QKKKRKHRSE LWAAHCRKIQ LKKDNNSTQV YNYQPCDHPD RPCDSTCPCI 540 MTQNFCEKFC QCNPDCQNRF GCRCKTQCNT KQCPCYLAVR ECDPDLCLTC GASEHWDCKV 600 VSCKNCSIQR GLKKHLLLAP SDVAGWGTFI KESVQKNEFI SEYCGELISQ DEADRRGKVY 660 DKYMSSFLFN LNNDFVVDAT RKGNKIRFAN HSVNPNCYAK VVMVNGDHRI GIFAKRAIQA 720 GEELFFDYRY SQADALQYVG IER 743Protein Fasta Sequence
>ENSSARP00000008932.1|SET1|Sorex araneus MDIQNPPTSKCITYWKRKVKSEYMRLRQLKRLQANMGAKXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXPGFASQHMLMRSLNTVALVPIMYSWSPLQQNFMVEDETVLCNIPYMGDEVKEEDETFIEELINNYDGKVHXXXXMIPGSVLISDAVFLELVDALNQYSDEEEEGHNDTSDGKQDDSKEDLPVTRKRKRHAIESNKKSSKKQFSNDMIFSAIASMFPENGVPDDMKEXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXFHATPNVYKRRNKEIKIEPEPCGTDCFLLLEGAKEYAMLHNPRSKCSGRRRRRHHVVSASCSITSSSAVSQTKEGDSDRDTGNDWASSSSEANSRCQTPTKQKASPAPPQLCVVEAPSEPVEWTGAEESLFRVFHGTYFNNFCSIARLLGTKTCKQVFQFAVRESLILKLPTDELTNPSQKKKRKHRSELWAAHCRKIQLKKDNNSTQVYNYQPCDHPDRPCDSTCPCIMTQNFCEKFCQCNPDCQNRFGCRCKTQCNTKQCPCYLAVRECDPDLCLTCGASEHWDCKVVSCKNCSIQRGLKKHLLLAPSDVAGWGTFIKESVQKNEFISEYCGELISQDEADRRGKVYDKYMSSFLFNLNNDFVVDATRKGNKIRFANHSVNPNCYAKVVMVNGDHRIGIFAKRAIQAGEELFFDYRYSQADALQYVGIER
|
Nucleotide Sequence (Fasta) | ATGGATATAC AAAATCCCCC AACTTCCAAA TGTATCACTT ATTGGAAAAG AAAAGTCAAA 60 TCTGAATATA TGAGACTTCG ACAACTCAAA CGGCTTCAGG CAAATATGGG TGCAAAGNNN 120 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 180 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 240 NNNNNNNNNN NNNNNNNNNN NNNNNNNCCG GGCTTTGCAA GTCAACACAT GTTAATGAGG 300 TCTCTGAATA CGGTCGCCTT GGTTCCCATC ATGTATTCCT GGTCCCCTCT CCAGCAGAAC 360 TTTATGGTTG AAGATGAGAC CGTTCTGTGT AATATTCCCT ACATGGGAGA CGAGGTGAAA 420 GAAGAAGACG AGACTTTCAT CGAGGAGCTG ATCAATAACT ATGATGGGAA GGTCCATNNN 480 NNNNNNNAAA TGATCCCCGG GTCTGTCCTC ATTAGTGATG CTGTTTTCCT GGAGCTGGTC 540 GATGCTCTGA ATCAGTACTC GGATGAGGAG GAGGAAGGGC ACAACGATAC CTCAGATGGA 600 AAACAGGACG ACAGCAAAGA AGACCTGCCA GTAACGAGAA AAAGAAAACG GCATGCGATA 660 GAAAGCAACA AAAAGAGTTC CAAAAAACAG TTCTCAAATG ACATGATCTT CAGTGCAATT 720 GCCTCGATGT TTCCTGAGAA TGGGGTCCCG GATGACATGA AGGAGAGNNN NNNNNNNNNN 780 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 840 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN 900 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NCTTTTCATG CCACCCCTAA TGTATATAAA 960 CGCAGGAACA AAGAAATCAA GATTGAACCG GAACCCTGTG GCACAGACTG CTTCCTCCTG 1020 CTGGAAGGAG CTAAGGAGTA CGCCATGCTC CACAACCCTC GTTCCAAGTG CTCTGGTCGT 1080 CGCCGGAGAA GGCACCACGT GGTCAGTGCC TCCTGCTCCA TCACCTCATC CTCTGCCGTC 1140 TCACAGACTA AAGAAGGGGA CAGCGACAGG GACACGGGCA ACGACTGGGC CTCTAGTTCT 1200 TCAGAGGCTA ACTCCCGCTG TCAGACCCCC ACCAAGCAGA AGGCCAGCCC GGCGCCCCCC 1260 CAGCTCTGTG TCGTTGAGGC ACCCTCCGAG CCTGTGGAGT GGACGGGAGC TGAAGAATCC 1320 CTTTTTCGAG TCTTCCATGG CACTTATTTC AACAACTTTT GCTCCATAGC CCGGCTCCTG 1380 GGGACCAAGA CGTGCAAGCA GGTCTTCCAG TTTGCCGTCA GAGAATCTTT GATCCTGAAG 1440 CTGCCGACCG ACGAACTTAC GAACCCTTCA CAGAAGAAGA AAAGAAAGCA CAGGTCAGAG 1500 CTGTGGGCCG CGCACTGCAG GAAAATCCAG CTGAAGAAAG ATAACAACTC CACGCAGGTG 1560 TACAACTACC AGCCCTGTGA CCACCCGGAC CGCCCCTGCG ACAGCACCTG CCCCTGCATC 1620 ATGACTCAGA ATTTCTGTGA GAAGTTCTGC CAGTGCAACC CAGACTGCCA GAATCGCTTT 1680 GGCTGCCGCT GTAAGACGCA GTGCAACACC AAGCAGTGTC CCTGCTACCT GGCCGTGCGG 1740 GAGTGTGACC CCGACCTCTG TCTCACGTGC GGCGCCTCCG AGCACTGGGA CTGCAAAGTG 1800 GTTTCCTGCA AAAACTGCAG CATCCAGCGG GGTCTCAAGA AGCACCTGCT GCTGGCTCCC 1860 TCGGACGTGG CCGGATGGGG CACCTTCATC AAGGAGTCGG TGCAGAAGAA CGAATTCATT 1920 TCCGAGTACT GCGGTGAGCT CATCTCTCAG GATGAGGCTG ACCGGCGGGG GAAGGTCTAT 1980 GACAAATACA TGTCCAGCTT TCTCTTCAAC CTCAACAACG ATTTTGTGGT GGATGCTACC 2040 CGCAAAGGAA ACAAAATCCG ATTTGCAAAC CATTCAGTGA ATCCCAACTG TTACGCCAAA 2100 GTGGTCATGG TGAACGGGGA TCATCGGATT GGGATCTTTG CCAAGAGGGC AATTCAAGCT 2160 GGGGAAGAGC TTTTCTTTGA CTACAGGTAC AGCCAAGCAG ACGCTCTTCA GTACGTGGGG 2220 ATTGAGCGG
Nucleotide Fasta Sequence
>ENSSARP00000008932.1|SET1|Sorex araneus ATGGATATACAAAATCCCCCAACTTCCAAATGTATCACTTATTGGAAAAGAAAAGTCAAATCTGAATATATGAGACTTCGACAACTCAAACGGCTTCAGGCAAATATGGGTGCAAAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCCGGGCTTTGCAAGTCAACACATGTTAATGAGGTCTCTGAATACGGTCGCCTTGGTTCCCATCATGTATTCCTGGTCCCCTCTCCAGCAGAACTTTATGGTTGAAGATGAGACCGTTCTGTGTAATATTCCCTACATGGGAGACGAGGTGAAAGAAGAAGACGAGACTTTCATCGAGGAGCTGATCAATAACTATGATGGGAAGGTCCATNNNNNNNNNNAAATGATCCCCGGGTCTGTCCTCATTAGTGATGCTGTTTTCCTGGAGCTGGTCGATGCTCTGAATCAGTACTCGGATGAGGAGGAGGAAGGGCACAACGATACCTCAGATGGAAAACAGGACGACAGCAAAGAAGACCTGCCAGTAACGAGAAAAAGAAAACGGCATGCGATAGAAAGCAACAAAAAGAGTTCCAAAAAACAGTTCTCAAATGACATGATCTTCAGTGCAATTGCCTCGATGTTTCCTGAGAATGGGGTCCCGGATGACATGAAGGAGAGNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNNCTTTTCATGCCACCCCTAATGTATATAAACGCAGGAACAAAGAAATCAAGATTGAACCGGAACCCTGTGGCACAGACTGCTTCCTCCTGCTGGAAGGAGCTAAGGAGTACGCCATGCTCCACAACCCTCGTTCCAAGTGCTCTGGTCGTCGCCGGAGAAGGCACCACGTGGTCAGTGCCTCCTGCTCCATCACCTCATCCTCTGCCGTCTCACAGACTAAAGAAGGGGACAGCGACAGGGACACGGGCAACGACTGGGCCTCTAGTTCTTCAGAGGCTAACTCCCGCTGTCAGACCCCCACCAAGCAGAAGGCCAGCCCGGCGCCCCCCCAGCTCTGTGTCGTTGAGGCACCCTCCGAGCCTGTGGAGTGGACGGGAGCTGAAGAATCCCTTTTTCGAGTCTTCCATGGCACTTATTTCAACAACTTTTGCTCCATAGCCCGGCTCCTGGGGACCAAGACGTGCAAGCAGGTCTTCCAGTTTGCCGTCAGAGAATCTTTGATCCTGAAGCTGCCGACCGACGAACTTACGAACCCTTCACAGAAGAAGAAAAGAAAGCACAGGTCAGAGCTGTGGGCCGCGCACTGCAGGAAAATCCAGCTGAAGAAAGATAACAACTCCACGCAGGTGTACAACTACCAGCCCTGTGACCACCCGGACCGCCCCTGCGACAGCACCTGCCCCTGCATCATGACTCAGAATTTCTGTGAGAAGTTCTGCCAGTGCAACCCAGACTGCCAGAATCGCTTTGGCTGCCGCTGTAAGACGCAGTGCAACACCAAGCAGTGTCCCTGCTACCTGGCCGTGCGGGAGTGTGACCCCGACCTCTGTCTCACGTGCGGCGCCTCCGAGCACTGGGACTGCAAAGTGGTTTCCTGCAAAAACTGCAGCATCCAGCGGGGTCTCAAGAAGCACCTGCTGCTGGCTCCCTCGGACGTGGCCGGATGGGGCACCTTCATCAAGGAGTCGGTGCAGAAGAACGAATTCATTTCCGAGTACTGCGGTGAGCTCATCTCTCAGGATGAGGCTGACCGGCGGGGGAAGGTCTATGACAAATACATGTCCAGCTTTCTCTTCAACCTCAACAACGATTTTGTGGTGGATGCTACCCGCAAAGGAAACAAAATCCGATTTGCAAACCATTCAGTGAATCCCAACTGTTACGCCAAAGTGGTCATGGTGAACGGGGATCATCGGATTGGGATCTTTGCCAAGAGGGCAATTCAAGCTGGGGAAGAGCTTTTCTTTGACTACAGGTACAGCCAAGCAGACGCTCTTCAGTACGTGGGGATTGAGCGG
|
Sequence Source |
Ensembl |
Orthology |
|
Created Date |
25-Jun-2016 |