WERAM Information


Tag Content
WERAM ID WERAM-Mup-0105
Ensembl Protein ID ENSMPUP00000009565.1
Gene Name SI
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSMPUG00000009641.1 ENSMPUT00000009721.1 ENSMPUP00000009565.1
Status Unreviewed
Classification
Type Family E-value Score Start End
HAT HAT_other 1.4 32
Organism Mustela putorius furo
Domain Profile
  HAT HAT_other

Query: 442 AAYETYNRGNAKNVWVNESDGTTAVIGEVWPGLTVFPDFTNPNCIDWWADECNIFYQEVK 501
A +Y RG +W T+ + +FPD I A+E + +V
Sbjct: 250 ARVRSYLRGMKYPLWQVGDIFTSKENSLAVYNIPLFPDDPKARFIHQLAEEDRLL--KVS 307
Query: 502 YDGLWIDMNEVSSF---IQGSKNGCESNKLNYPPFTPDILDKLLYSKTICMDTVQYW--G 556
WI++ E F + S G L P P D ++ ++ + G
Sbjct: 308 LSSFWIELQERQEFKLSVTSSVMGISGYSLATPSLFPSSADVIVPKSRKQFRAIKKYITG 367
Query: 557 KHYDVHSLYGYSMAIATEKAIEKVFTNKRSFILTR 591
+ YD E AIE FTN R F+L R
Sbjct: 368 EEYDTE-----------EGAIE-AFTNIRDFLLLR 390

Protein Sequence
(Fasta)
MAKKKFTGLE ISLIVLFAIV TIIAIALIAV LATKTPAVEE ISNSTSTPPT TRTTTVYPGP 60
SDAKCPSELN DHINERINCI PEQYPYKEAV CAMRGCCWKP WNDSIIPWCF FADNHGYNAD 120
KVTTTSTGLE ATLNRISSPT LFGDDVSSAL FIAQNQTRNR FRFKITDPSN RRYEVPHQFV 180
GEFTGTAASD TLYEVQVTEN PFSIKVIRKS NRRILFDTSI GPLVYSDQYL QISTKLPSEY 240
IYGIGEHIHK RFRHDLNWKT WPIFTRDQLP GDNNNNLYGH HTFFMCIEDN SGKSFGVFLM 300
NSNAMEIFIQ PTPIITYRVI GGILDFYIFL GDTPEQVVQQ YQELIGRPAM PAYWSLGFQL 360
SRWNYKSLDV LKEVVKRNRD AGIPFDTQVT DIDYMEAKKD FTYDKVAFQG LPEFVQDLHN 420
NGQKYVIILD PAISINKLTN GAAYETYNRG NAKNVWVNES DGTTAVIGEV WPGLTVFPDF 480
TNPNCIDWWA DECNIFYQEV KYDGLWIDMN EVSSFIQGSK NGCESNKLNY PPFTPDILDK 540
LLYSKTICMD TVQYWGKHYD VHSLYGYSMA IATEKAIEKV FTNKRSFILT RSTFAGSGRY 600
AAHWLGDNTA SWEQMKWSIA GMLEFNLFGI PLVGADICGF VANTTEELCR RWMQLGAFYP 660
FSRNHNGDIY EHQDPAFFGQ NSLLVNSSKH YLNIRYTLLP FLYTLFYKAH MFGETVARPV 720
LHEFYEDMNT WSEDTQFLWG PALLITPVLK EGADTVSAYI PNATWFDYET GAKRPWRKQQ 780
VNMYLPGDKI GLHLRGGYII PIQQPAVTTT ASRKNPLGLI VALDEDNTAK GDFFWDDGET 840
KNTIQNGNYI LYTFSVSNNK LDILCTHSSY QEGTTLAFET IKILGLTDSV TQVTVVEDNQ 900
PIKDHYNFTY TASDQNLLIY NLNFNLGRNF TVQWNQNYSV NERFTCYPDA DPATKEKCEA 960
RGCLWETVPL SSQAPDCYFP RQYNPYLVSS TQYSSMGITT DLQLNPTRAQ IKLPSDPIST 1020
LRVEVKYHKN DMLQFKIYDP QNKRYEVPVP LNIPTTPTST YENRLYDVEI KENPFGIQVR 1080
RRSTGKVIWD SYLPGFAFNN QFIQISTRLP SEYIYGFGEV EHTTFKRDLN WHTWGMFTRD 1140
QPPGYKLNSY GFHPYYMALE DEGYAHGVLL LNSNAMDVTF QPTPALTYRI IGGILDFYVF 1200
LGPTPEIATQ QYHEVIGRPV MPPYWALGFQ LCRYGYRNTS EVEQVYNDMV AAQIPYDVQY 1260
TDIDYMERQL DFTIDENFHD LPQFVDKIRQ EGMRYIIILD PAISGNETKP YPAFERGMEK 1320
DVFVKWPNTS DICWAKVWPD LPNITIDESL TEDEAVNASR AYVAFPDFFR NATAEWWARE 1380
IIDFYNNQMR FDGLWIDMNE PSSFVHGTVS NQCRNKELNY PPYFPELTKR ANGLHFRTMC 1440
METEQILSDG SSVLHYDVHN LYGWSQMKPS YDALQKTTGK RGIVISRSTY PTGGQWGGHW 1500
LGDNYAQWDN LDKSIIGMME FSLFGISYTG ADICGFFNNS EYELCARWMQ LGAFYPYSRN 1560
HNIAFTRRQD PASWNATFSE MSRNILNIRY TLLPYFYTQM HEIHAHGGTV IRPLLHEFFN 1620
DKITWDIFKQ FLWGPAFLVT AVLEPSVKSV IGYVPDARWF DYHTGQDIKV RGQFHEFYTP 1680
LDTINLHVRG GHILPCQEPD KNTFHSRKNY MKLIVAADNN QTAQGSLFWD DGESIDTYEK 1740
GLYFLAQFNL NKNTLTSTIL KNGYINKNEM RLGFINIWGK GKTAVNEVIL IYNGNREVVK 1800
FSENLDKEIL NINLTVNDVT LDQPIQISWS
Nucleotide Sequence
(Fasta)
ATGGCAAAGA AAAAATTTAC TGGATTAGAA ATCTCTCTGA TTGTCCTTTT TGCTATAGTT 60
ACTATAATAG CTATTGCCCT AATTGCTGTT TTAGCAACTA AGACACCTGC TGTTGAAGAA 120
ATTAGTAATT CTACTTCAAC TCCACCTACC ACTCGTACAA CTACTGTGTA TCCTGGTCCA 180
AGTGATGCTA AATGTCCAAG TGAGCTAAAT GATCATATCA ACGAGAGAAT AAACTGCATT 240
CCAGAGCAAT ATCCATATAA AGAGGCAGTT TGTGCCATGC GAGGTTGCTG CTGGAAGCCA 300
TGGAATGACT CTATTATTCC CTGGTGCTTC TTCGCAGATA ACCATGGCTA TAATGCTGAC 360
AAAGTGACAA CAACAAGTAC TGGACTTGAA GCCACATTAA ACAGGATATC TTCACCTACA 420
CTATTTGGAG ATGATGTTTC TAGTGCTCTC TTCATAGCTC AAAATCAGAC ACGTAATCGT 480
TTCCGGTTTA AGATTACTGA TCCAAGTAAT AGAAGATATG AAGTTCCTCA TCAGTTTGTA 540
GGAGAATTTA CTGGAACTGC AGCCTCTGAT ACATTATATG AGGTGCAGGT TACAGAAAAT 600
CCATTTAGCA TCAAAGTTAT TAGAAAAAGC AATAGGAGAA TTTTGTTTGA CACCAGCATT 660
GGTCCCCTGG TATACTCTGA TCAATATTTA CAGATCTCAA CCAAACTTCC CAGTGAATAT 720
ATCTATGGTA TTGGGGAACA TATTCATAAG AGATTTCGTC ATGATTTAAA CTGGAAAACA 780
TGGCCAATTT TTACCCGTGA TCAACTTCCT GGTGATAATA ATAATAATTT ATATGGCCAT 840
CACACATTCT TCATGTGCAT TGAAGATAAT TCTGGAAAGT CATTTGGTGT TTTCTTAATG 900
AACAGCAATG CAATGGAAAT TTTCATCCAG CCTACTCCAA TAATAACTTA CAGAGTTATT 960
GGTGGAATTC TGGACTTTTA CATCTTTCTA GGAGATACAC CAGAACAAGT AGTTCAACAG 1020
TATCAAGAGC TCATTGGACG ACCAGCAATG CCAGCATATT GGAGTCTTGG ATTCCAACTT 1080
AGTCGCTGGA ATTATAAGTC ACTTGATGTA CTGAAAGAAG TAGTAAAAAG AAACCGGGAT 1140
GCTGGCATAC CATTTGATAC ACAGGTCACT GATATTGACT ATATGGAAGC CAAGAAAGAC 1200
TTTACTTATG ATAAAGTTGC ATTTCAGGGG CTCCCTGAAT TTGTTCAAGA TTTACATAAC 1260
AATGGACAGA AATATGTCAT CATCTTGGAC CCTGCAATTT CAATAAATAA GCTTACCAAT 1320
GGAGCAGCAT ATGAGACCTA TAATAGGGGA AATGCAAAAA ATGTGTGGGT AAATGAATCA 1380
GATGGGACTA CAGCAGTTAT TGGAGAGGTA TGGCCAGGAT TAACAGTATT CCCTGATTTT 1440
ACTAATCCGA ACTGCATTGA TTGGTGGGCA GATGAGTGCA ACATTTTCTA TCAAGAAGTG 1500
AAATATGATG GACTTTGGAT TGACATGAAT GAAGTTTCCA GCTTTATTCA AGGTTCAAAG 1560
AATGGATGTG AAAGCAACAA ACTAAATTAT CCACCTTTTA CTCCTGATAT TCTTGACAAA 1620
CTCCTGTATT CCAAAACAAT TTGCATGGAC ACTGTGCAAT ACTGGGGGAA GCATTATGAT 1680
GTTCACAGCC TCTATGGGTA CAGCATGGCT ATTGCCACAG AGAAAGCCAT AGAAAAAGTT 1740
TTTACCAATA AGAGAAGCTT TATTCTCACC CGGTCAACTT TTGCTGGATC TGGACGTTAT 1800
GCTGCACATT GGTTAGGAGA TAATACTGCT TCATGGGAAC AAATGAAATG GTCTATTGCA 1860
GGAATGCTGG AGTTCAACCT GTTTGGAATA CCATTGGTTG GAGCGGACAT CTGTGGGTTT 1920
GTGGCTAATA CCACAGAAGA GCTTTGCAGA AGGTGGATGC AACTGGGGGC ATTTTATCCA 1980
TTTTCCAGGA ACCATAATGG TGATATATAT GAGCATCAGG ATCCAGCATT TTTTGGGCAG 2040
AATTCTCTTT TGGTTAATTC ATCAAAGCAT TATTTGAATA TTCGCTATAC ATTATTACCT 2100
TTCCTCTATA CTCTGTTTTA CAAAGCCCAT ATGTTTGGAG AAACTGTAGC AAGGCCAGTT 2160
CTTCATGAGT TTTATGAGGA TATGAATACC TGGAGTGAGG ACACTCAGTT CTTATGGGGT 2220
CCCGCATTAC TTATTACTCC CGTCTTGAAA GAGGGAGCAG ATACAGTGAG TGCATACATC 2280
CCTAATGCTA CTTGGTTTGA TTATGAAACT GGTGCAAAGA GGCCGTGGAG AAAACAACAA 2340
GTTAATATGT ACCTTCCAGG AGACAAAATA GGATTACATC TTAGAGGAGG TTATATTATC 2400
CCCATTCAAC AACCTGCTGT AACTACAACT GCAAGCCGAA AGAACCCTCT AGGACTTATA 2460
GTTGCATTAG ATGAAGATAA TACAGCAAAA GGAGATTTTT TCTGGGATGA TGGAGAAACG 2520
AAAAATACCA TACAAAATGG CAACTACATT TTATATACTT TTTCAGTTTC TAATAACAAA 2580
TTAGATATTT TATGCACACA TTCATCATAC CAGGAAGGAA CCACTTTAGC TTTTGAGACT 2640
ATAAAAATCC TTGGTTTGAC AGACTCTGTT ACACAAGTTA CAGTGGTGGA AGATAATCAG 2700
CCAATAAAAG ATCACTACAA TTTTACTTAC ACTGCTTCTG ACCAGAATCT CCTAATTTAC 2760
AATCTCAACT TTAACCTTGG AAGGAACTTT ACCGTTCAAT GGAATCAAAA TTACTCAGTA 2820
AATGAAAGGT TTACTTGTTA TCCAGATGCA GACCCTGCAA CTAAAGAAAA GTGTGAAGCA 2880
CGTGGCTGTT TATGGGAAAC GGTTCCTCTT AGCTCCCAAG CACCTGATTG TTACTTTCCT 2940
AGACAATACA ACCCTTATTT GGTCAGTTCA ACTCAGTATT CATCAATGGG TATAACAACT 3000
GACCTACAGC TGAATCCTAC AAGAGCTCAA ATAAAGCTAC CTTCTGACCC CATCTCAACT 3060
CTTCGTGTGG AGGTGAAATA TCACAAAAAT GATATGCTGC AGTTTAAGAT CTATGATCCC 3120
CAAAATAAGA GATATGAAGT TCCAGTACCT TTAAACATTC CAACCACACC AACAAGTACT 3180
TACGAAAACA GACTTTATGA TGTTGAAATC AAAGAAAATC CTTTTGGCAT CCAGGTTCGA 3240
AGGAGAAGTA CAGGAAAAGT TATTTGGGAT TCTTACCTGC CTGGATTTGC TTTTAATAAC 3300
CAGTTTATTC AAATATCTAC TCGCCTGCCA TCAGAATATA TATACGGTTT TGGGGAAGTG 3360
GAACATACAA CATTTAAGAG AGACCTGAAC TGGCATACTT GGGGAATGTT TACAAGAGAC 3420
CAACCCCCTG GTTATAAACT TAATTCCTAT GGATTTCATC CCTATTACAT GGCTCTGGAA 3480
GATGAAGGCT ATGCTCATGG AGTTCTCTTA CTTAACAGCA ATGCAATGGA CGTTACATTT 3540
CAGCCAACTC CTGCTCTAAC TTACCGTATA ATTGGAGGAA TCTTGGACTT TTATGTGTTT 3600
TTGGGCCCAA CTCCAGAAAT TGCAACACAA CAATACCATG AAGTAATTGG TCGACCTGTC 3660
ATGCCACCTT ACTGGGCTTT AGGATTCCAA TTATGTCGTT ATGGGTACAG AAATACTTCA 3720
GAAGTTGAGC AAGTATATAA TGATATGGTG GCTGCTCAGA TCCCCTATGA TGTTCAATAT 3780
ACAGACATTG ATTACATGGA AAGGCAATTA GACTTTACAA TTGATGAAAA CTTCCATGAC 3840
CTTCCTCAGT TTGTTGACAA AATAAGACAG GAAGGAATGA GATACATTAT TATCCTGGAT 3900
CCAGCAATTT CAGGAAATGA AACAAAGCCT TACCCTGCAT TTGAAAGAGG AATGGAGAAA 3960
GATGTTTTTG TCAAATGGCC TAATACCAGT GATATTTGTT GGGCAAAGGT TTGGCCAGAT 4020
TTGCCCAATA TAACAATAGA TGAAAGTCTA ACGGAAGATG AAGCTGTTAA TGCCTCCAGA 4080
GCTTATGTAG CTTTTCCAGA CTTTTTCAGG AATGCCACAG CAGAGTGGTG GGCAAGAGAA 4140
ATTATAGATT TCTACAATAA CCAGATGAGG TTTGATGGTT TGTGGATTGA CATGAATGAA 4200
CCATCAAGTT TTGTACATGG AACAGTCAGC AATCAATGCA GAAATAAAGA GTTAAATTAT 4260
CCACCTTATT TCCCAGAACT CACAAAAAGA GCTAATGGAT TACATTTCAG AACTATGTGT 4320
ATGGAAACCG AGCAAATTCT TAGTGATGGA TCATCTGTTT TGCATTACGA TGTTCATAAT 4380
CTGTATGGAT GGTCACAAAT GAAACCTAGT TATGATGCAT TGCAGAAGAC AACTGGCAAA 4440
AGAGGAATTG TTATTTCTCG TTCCACATAT CCTACTGGTG GACAATGGGG AGGGCATTGG 4500
CTTGGAGACA ACTATGCACA ATGGGACAAT TTGGACAAAT CAATCATTGG TATGATGGAA 4560
TTTAGTCTCT TTGGAATCTC ATATACTGGA GCAGATATTT GTGGTTTCTT CAACAATTCA 4620
GAATATGAGC TCTGTGCCCG CTGGATGCAA CTTGGAGCAT TTTATCCGTA CTCAAGAAAT 4680
CACAACATTG CTTTTACTAG GAGACAAGAT CCTGCTTCCT GGAATGCAAC TTTTTCTGAA 4740
ATGTCAAGGA ATATTCTAAA TATTAGATAC ACTTTATTGC CTTACTTCTA TACACAGATG 4800
CATGAAATTC ATGCTCATGG GGGCACTGTT ATTCGACCCC TTTTGCATGA GTTCTTTAAT 4860
GACAAGATAA CATGGGATAT ATTCAAGCAG TTCTTGTGGG GGCCAGCATT TCTGGTTACT 4920
GCTGTGTTGG AACCTAGTGT TAAGTCTGTA ATAGGCTATG TTCCTGATGC TCGGTGGTTT 4980
GATTATCATA CGGGCCAAGA TATTAAAGTC AGAGGGCAGT TTCATGAATT TTATACTCCT 5040
TTAGATACAA TAAACCTACA TGTCCGTGGT GGTCACATCC TACCATGTCA GGAGCCTGAT 5100
AAAAACACAT TTCACAGTCG AAAAAATTAC ATGAAGCTCA TTGTTGCTGC AGATAATAAT 5160
CAGACGGCAC AGGGATCTCT GTTTTGGGAT GATGGAGAGA GTATCGACAC CTATGAAAAA 5220
GGCTTATATT TCTTGGCACA ATTTAATTTA AATAAGAATA CCTTGACAAG CACTATACTG 5280
AAAAATGGTT ACATAAACAA AAATGAAATG AGGCTTGGAT TCATCAACAT ATGGGGGAAA 5340
GGAAAGACTG CTGTTAATGA GGTTATTCTT ATATATAACG GAAATAGAGA AGTAGTTAAA 5400
TTTAGTGAAA ACCTAGACAA GGAGATACTA AATATTAATC TGACAGTGAA TGATGTTACT 5460
CTAGATCAAC CAATACAAAT CAGCTGGTCA TGA 5494
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Caj-0026 ENSCJAP00000005224.2 Callithrix jacchus 84 0.0 3126
WERAM-Ran-0173 ENSRNOP00000072960.1 Rattus norvegicus 76 0.0 2323
Created Date 25-Jun-2016