WERAM Information


Tag Content
WERAM ID WERAM-Caj-0026
Ensembl Protein ID ENSCJAP00000005224.2
Gene Name SI
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
ENSCJAG00000002837.2 ENSCJAT00000005507.2 ENSCJAP00000005224.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HAT HAT_other 0.58 33
Organism Callithrix jacchus
Domain Profile
  HAT HAT_other

Query: 471 LTVYPDFTNPNCIDWWANECSIFHQQVQYDGLWIDMNEVSSF---IQGSTKGCNSNNLNY 527
+ ++PD I A E + +V WI++ E F + S G + +L
Sbjct: 282 IPLFPDDPKARFIHQLAEEDRLL--KVSLSSFWIELQERQEFKLSVTSSVMGISGYSLAT 339
Query: 528 PPFTPDILDKLMYSKTICMDSVQNW--GKQYDVHSLYGYSMAIATEEAVKRVFPNKRSFI 585
P P D ++ +++ + G++YD TEE F N R F+
Sbjct: 340 PSLFPSSADVIVPKSRKQFRAIKKYITGEEYD------------TEEGAIEAFTNIRDFL 387
Query: 586 LTR 588
L R
Sbjct: 388 LLR 390

Protein Sequence
(Fasta)
MARRKFSGLE ISLIVLFVIV TIIAIALIVV LATKTPAVDE ISDSTSTPAT TRTTTNPSGS 60
GRCPNELNDN VNLRINCIPE QFATEETCKQ RGCCWRPWND SLIPWCFFVD NHGYNGEGIT 120
STDLGLQDTL NRIPSPTLFG NDIGSVSVTT QNQTSSRFRF KITDPNNKRH EVPHQYVKEF 180
TGPAVSDTLY DVSITENPFS IKVIRKSNGR TLFDTSIGPL VYSDQYLQIS TRLPSEYIYG 240
IGEQVHKRFR HDLYWKTWPI FTRDQLPGDN NNNLYGHQTF FMCIEDPSGE SFGVFLMNSN 300
AMEIFIQPTP IVTYRVTGGI LDFYIFLGDT PEQVVQQYQQ LVGLPAMPAY WSLGFQLSRW 360
NYKSLDVVKE VVRRNREAGI PFDTQVTDID YMENKKDFTY DEVAFQGLPE FVQDLHNNGQ 420
KYVIILDPAI SINQRANGTA YATYERGNAQ NVWVNESDGI TPIIGEVWPG LTVYPDFTNP 480
NCIDWWANEC SIFHQQVQYD GLWIDMNEVS SFIQGSTKGC NSNNLNYPPF TPDILDKLMY 540
SKTICMDSVQ NWGKQYDVHS LYGYSMAIAT EEAVKRVFPN KRSFILTRST FAGSGRHAAH 600
WLGDNTASWE QMEWSITGML EFSLFGIPLV GADICGFEAE TTEELCRRWM QLGAFYPFSR 660
NHNSDGYEHQ DPAFFGQNSL LVNSSRHYLT IRYTLLPFLY TLFYKAHMFG ETVARPVLHE 720
FYQDTNSWIE DLEFLWGPAL LITPVLKQGA DTVSAYIPDA VWYDYESGAK RPWRKQRVDM 780
YLPADKIGLH LRGGYIIPIQ EPDVTTTASR KNPLGLIVAL NENNTAKGDF FWDDGETKDI 840
LQNDNYILYT FSVSDNKLDI VCTHSTYQEG TTLAFKTVKI LGLTDTVTQV TVAENNQPIS 900
AHNNFTYDAS NQVLLITDLN LNLGKNFSVQ WNQVFSENET FNCYPDADFA SEEKCIQRGC 960
LWRTGSLSKA PECYFPRQDN PYSITSTQYS SMGVTADLQL NPANTRIKLP SDPISTLRVE 1020
VKYHKNDMLQ FKIYDPQNKR YEVPVPLNIP TTPISTYENR LYDVEIKENP FGIQIRRRST 1080
GRVIWDSHLP GFAFNDQFIQ ISTRLPSEYI YGFGEVEHTA FKRDLNWHTW GMFTRDQPPG 1140
YKLNSYGFHP YYMALEEEGN AHSVLLLNSN AMDVTFQPTP ALTYRTVGGI LDFYMFLGPT 1200
PEVATKQYHE VIGHPVMPPY WALGFQLCRY GYANTSEVIE VYEAMVNASI PYDVQYTDID 1260
YMERQLDFTI GEAFQDLPQF VDKIRGEGMR YIIILDPAIS GNETKPYPAF QRGQQEDVFV 1320
KWPNTNDICW AKVWPDLPNI TIDKTLTEDE AVNASRAHVA FPDFFRTSTA GWWAREILDF 1380
YNDQMKFDGL WIDMNEPSSF VNGTTSNQCR NDKLNYPPYF PELTKRTDGL HFRTMCMETE 1440
QILSDGSSVL HYNVHNLYGW SQMKPSYDAL QKTTGKRGIV ISRSTFPTGG RWGGHWLGDN 1500
YARWDNLDKS IIGMMEFSLF GISYTGADIC GFFNNSEYHL CTRWMQLGAF YPYSRNHNIA 1560
NTRRQDPASW NETFAEMSRN ILNIRYTLLP YFYTQMHEIH AHGGTVIRPL LHEFFSEKPT 1620
WDIFRQFLWG PAFMVTPVLE PYVQSVNAYV PNARWFDYHT SEDIKVREQF HTFNASYETI 1680
NLHVRGGHIL PCQEPAQNTF HSRQNYMKLI VAADDNQMAQ GFLFWDDGES IDTYERDLYF 1740
YVQFNLNKTI LTSTVLKRGY INKNEMMLGV INVWGKGPTP VTAVTLTYNG NTNSLAFSQD 1800
NNKEILTIDL TNYNVTLDEP IEINWS 1826
Nucleotide Sequence
(Fasta)
TATTTTGGCA GCCTTATCCA ACTCTGGTAC AACATAGCAA ACACAACAGC TTATGAAATA 60
AGATGGCAAG AAGAAAATTT AGTGGATTAG AAATCTCTCT GATCGTCCTC TTTGTCATAG 120
TTACTATAAT TGCTATTGCC CTAATTGTTG TTTTAGCAAC TAAGACACCT GCTGTTGATG 180
AAATTAGTGA TTCTACTTCA ACTCCAGCTA CTACTCGTAC AACTACAAAT CCTTCTGGTT 240
CAGGAAGATG TCCAAATGAG TTAAATGATA ATGTCAATTT GAGAATAAAC TGCATTCCAG 300
AACAATTCGC AACAGAGGAA ACTTGTAAAC AGAGAGGCTG CTGCTGGAGG CCATGGAATG 360
ACTCTCTTAT TCCTTGGTGC TTCTTCGTAG ATAATCATGG TTATAATGGT GAAGGAATTA 420
CATCAACAGA TCTTGGACTT CAAGACACAT TAAACAGGAT ACCTTCACCT ACACTATTTG 480
GAAATGATAT TGGCAGTGTT TCCGTCACAA CTCAAAATCA GACATCCAGT CGTTTCCGGT 540
TCAAGATTAC TGATCCAAAT AATAAAAGAC ATGAAGTTCC TCATCAGTAT GTAAAAGAGT 600
TTACTGGTCC TGCAGTTTCT GACACATTGT ATGATGTGAG CATTACAGAA AACCCATTTA 660
GCATCAAAGT TATTAGGAAA AGCAATGGTA GAACTTTGTT TGACACCAGC ATTGGTCCCT 720
TAGTGTACTC TGACCAGTAC TTACAGATCT CAACCCGTCT TCCCAGTGAA TATATTTACG 780
GTATTGGGGA ACAGGTTCAT AAGAGATTTC GTCATGATTT ATACTGGAAA ACATGGCCAA 840
TTTTTACTCG AGATCAACTT CCTGGTGATA ATAATAATAA TTTATATGGC CATCAAACAT 900
TTTTTATGTG TATTGAAGAT CCATCTGGAG AGTCATTTGG TGTGTTTTTA ATGAATAGCA 960
ATGCAATGGA GATTTTTATC CAGCCTACTC CAATAGTAAC ATATAGAGTT ACTGGTGGCA 1020
TTCTGGATTT TTACATCTTT CTAGGAGATA CGCCAGAACA AGTGGTTCAA CAGTATCAAC 1080
AGCTTGTTGG ACTACCAGCA ATGCCAGCAT ATTGGAGTCT TGGATTTCAA CTAAGTCGCT 1140
GGAATTATAA GTCATTAGAT GTAGTGAAAG AAGTGGTAAG GAGAAACCGG GAAGCCGGCA 1200
TACCATTTGA TACACAAGTA ACTGATATTG ACTACATGGA AAACAAGAAA GACTTTACTT 1260
ATGATGAAGT TGCATTTCAA GGACTCCCTG AATTTGTTCA AGATTTGCAT AACAATGGAC 1320
AGAAATATGT CATCATCTTG GACCCTGCAA TTTCCATAAA TCAACGTGCC AATGGAACAG 1380
CATATGCCAC CTATGAGAGG GGAAATGCAC AAAATGTGTG GGTAAATGAG TCAGACGGAA 1440
TAACACCAAT TATCGGAGAG GTGTGGCCAG GATTGACAGT GTACCCTGAT TTCACCAATC 1500
CAAACTGCAT TGATTGGTGG GCAAATGAAT GCAGTATTTT CCATCAACAA GTGCAGTATG 1560
ATGGACTTTG GATTGACATG AATGAAGTTT CCAGCTTTAT TCAAGGTTCA ACAAAAGGAT 1620
GTAATTCCAA CAACTTGAAT TATCCACCTT TTACTCCTGA TATTCTTGAC AAACTCATGT 1680
ATTCCAAAAC AATTTGCATG GATTCTGTGC AGAACTGGGG GAAACAGTAT GATGTTCACA 1740
GCCTCTATGG ATACAGTATG GCTATAGCCA CAGAGGAAGC TGTAAAAAGA GTTTTTCCTA 1800
ATAAGAGAAG CTTCATTCTT ACTCGGTCAA CATTTGCTGG ATCTGGAAGA CATGCTGCAC 1860
ATTGGTTAGG AGACAACACT GCTTCATGGG AACAAATGGA ATGGTCTATA ACTGGAATGC 1920
TGGAGTTCAG TTTGTTTGGA ATACCTTTGG TTGGAGCAGA TATCTGTGGA TTTGAGGCTG 1980
AAACCACGGA AGAGCTTTGC AGAAGATGGA TGCAACTTGG GGCATTTTAT CCATTTTCCA 2040
GAAACCATAA TTCTGATGGA TATGAACATC AGGATCCTGC ATTTTTTGGG CAGAATTCAC 2100
TTTTGGTTAA TTCATCAAGG CACTATCTAA CCATTCGCTA CACCTTATTA CCTTTCCTCT 2160
ATACTCTGTT TTATAAAGCC CATATGTTTG GAGAAACAGT AGCAAGGCCA GTTCTTCATG 2220
AGTTTTATCA GGATACGAAC AGCTGGATTG AGGACCTTGA GTTTTTGTGG GGCCCCGCAT 2280
TACTTATTAC TCCTGTTTTA AAACAGGGAG CAGATACTGT GAGTGCCTAC ATCCCTGATG 2340
CTGTTTGGTA TGATTATGAA TCTGGTGCAA AAAGGCCATG GAGGAAACAA CGTGTTGATA 2400
TGTATCTTCC AGCAGACAAG ATAGGATTAC ATCTTAGAGG AGGTTATATC ATCCCCATTC 2460
AAGAACCAGA TGTAACAACA ACAGCAAGCC GTAAGAATCC TCTAGGACTC ATAGTTGCAT 2520
TAAATGAAAA TAACACCGCA AAAGGAGACT TTTTCTGGGA TGATGGGGAA ACTAAAGATA 2580
TCTTACAAAA TGACAACTAC ATATTATATA CATTTTCAGT GTCTGACAAC AAATTAGATA 2640
TTGTGTGCAC ACATTCAACA TACCAGGAAG GAACTACCTT AGCATTCAAG ACTGTAAAAA 2700
TCCTTGGGTT GACAGACACT GTTACACAAG TAACAGTGGC GGAAAATAAT CAACCAATAA 2760
GCGCTCATAA CAATTTCACT TATGATGCTT CTAACCAGGT TCTCCTAATT ACAGATCTCA 2820
ACCTTAATCT TGGAAAAAAC TTTAGTGTTC AATGGAATCA AGTTTTCTCA GAAAATGAAA 2880
CATTTAATTG TTATCCAGAT GCAGATTTTG CATCTGAAGA AAAGTGTATA CAACGTGGCT 2940
GTTTATGGCG AACGGGTTCT CTATCCAAAG CACCTGAGTG TTACTTTCCC AGACAAGATA 3000
ACCCTTATTC AATCACCTCA ACTCAATATT CATCAATGGG TGTAACAGCT GACCTCCAAC 3060
TAAATCCTGC AAATACTAGA ATAAAGTTAC CTTCTGATCC CATCTCAACT CTTCGCGTGG 3120
AGGTGAAATA TCACAAAAAT GACATGTTGC AGTTTAAGAT TTATGATCCC CAAAATAAGA 3180
GATACGAAGT TCCAGTACCA TTAAACATTC CAACCACCCC AATAAGTACT TATGAAAACA 3240
GACTTTATGA TGTTGAAATT AAGGAAAATC CTTTTGGCAT CCAGATTCGA CGGAGAAGCA 3300
CTGGGAGGGT TATTTGGGAT TCTCACCTGC CTGGATTTGC TTTCAATGAT CAGTTCATTC 3360
AAATATCGAC TCGCCTGCCA TCAGAATATA TATATGGTTT TGGGGAGGTG GAACACACAG 3420
CATTTAAGCG AGATCTGAAC TGGCATACTT GGGGAATGTT CACAAGAGAC CAACCCCCTG 3480
GTTATAAACT TAATTCCTAC GGATTTCATC CCTATTACAT GGCTCTAGAA GAGGAAGGCA 3540
ATGCTCACAG TGTTCTCTTA CTCAACAGCA ATGCAATGGA TGTTACATTC CAGCCAACCC 3600
CTGCTCTAAC TTATCGCACA GTTGGAGGAA TCTTGGATTT TTATATGTTT TTGGGCCCAA 3660
CCCCAGAAGT TGCAACAAAG CAATACCATG AAGTAATTGG CCATCCAGTC ATGCCACCGT 3720
ATTGGGCTTT GGGATTCCAA TTATGTCGTT ATGGATATGC AAATACTTCA GAGGTTATTG 3780
AAGTATATGA AGCTATGGTG AATGCTAGCA TCCCCTATGA TGTTCAGTAC ACAGATATTG 3840
ACTACATGGA AAGGCAACTA GACTTTACAA TTGGTGAAGC ATTCCAGGAC CTTCCTCAGT 3900
TTGTTGACAA AATAAGGGGA GAAGGAATGA GATACATTAT TATCCTGGAT CCAGCAATTT 3960
CAGGAAATGA AACAAAGCCT TATCCTGCAT TTCAAAGAGG ACAGCAGGAA GATGTCTTTG 4020
TCAAATGGCC TAACACCAAT GATATCTGCT GGGCAAAGGT TTGGCCAGAT TTGCCCAACA 4080
TAACAATAGA TAAAACTCTA ACTGAAGATG AAGCTGTTAA TGCTTCCAGA GCTCATGTAG 4140
CTTTTCCAGA TTTCTTCAGG ACTTCCACAG CAGGGTGGTG GGCAAGAGAA ATTCTAGACT 4200
TTTACAATGA CCAGATGAAA TTTGATGGTT TGTGGATTGA TATGAATGAG CCATCAAGTT 4260
TTGTAAATGG AACAACTTCT AATCAATGCA GAAATGATAA ACTAAATTAT CCACCTTATT 4320
TCCCAGAACT CACAAAAAGA ACTGATGGAT TACATTTCAG AACAATGTGC ATGGAAACTG 4380
AGCAGATTCT TAGTGATGGA TCGTCAGTTT TGCATTACAA TGTTCACAAT CTCTATGGAT 4440
GGTCACAAAT GAAACCTAGT TATGATGCAC TGCAGAAGAC AACCGGAAAA AGAGGAATTG 4500
TTATTTCTCG TTCCACGTTT CCTACTGGTG GACGATGGGG AGGACACTGG CTTGGAGACA 4560
ACTATGCACG ATGGGACAAC CTGGACAAAT CGATCATTGG TATGATGGAA TTTAGTCTCT 4620
TTGGAATATC ATATACTGGA GCAGACATCT GTGGTTTCTT CAACAACTCA GAATATCACC 4680
TCTGTACCCG CTGGATGCAA CTTGGGGCAT TTTACCCATA CTCAAGGAAT CACAACATTG 4740
CAAATACCAG AAGACAAGAT CCCGCTTCCT GGAATGAAAC TTTTGCTGAA ATGTCAAGGA 4800
ATATTCTAAA TATTAGATAC ACATTATTGC CCTATTTCTA TACACAAATG CATGAAATTC 4860
ATGCTCATGG TGGCACTGTT ATTCGACCCC TTTTGCATGA GTTCTTTAGT GAAAAACCAA 4920
CCTGGGATAT ATTCAGGCAG TTTTTATGGG GTCCAGCATT TATGGTTACC CCGGTACTGG 4980
AACCTTATGT TCAAAGTGTA AATGCCTATG TCCCCAATGC TCGTTGGTTT GACTACCATA 5040
CGAGTGAAGA TATTAAGGTC AGAGAACAAT TTCATACGTT TAATGCTTCT TATGAGACAA 5100
TAAACCTACA TGTCCGTGGT GGTCACATCC TACCATGTCA AGAGCCAGCT CAAAACACAT 5160
TTCACAGTCG ACAAAATTAC ATGAAGCTCA TTGTTGCCGC AGATGATAAT CAGATGGCAC 5220
AGGGTTTTCT GTTTTGGGAT GATGGAGAGA GTATAGATAC CTATGAAAGA GACCTATATT 5280
TTTATGTACA ATTTAATTTA AACAAGACCA TCTTAACAAG CACTGTATTG AAAAGAGGTT 5340
ACATAAATAA AAATGAAATG ATGCTTGGGG TCATTAATGT ATGGGGGAAA GGACCAACTC 5400
CCGTCACAGC GGTTACTCTA ACATATAATG GAAATACAAA TTCGCTTGCA TTTTCTCAAG 5460
ACAATAACAA GGAGATACTA ACCATTGATC TGACCAACTA CAATGTTACT CTAGATGAGC 5520
CAATAGAAAT CAACTGGTCA TGAAGATCAC CATCAGTTTT AGTTTTCAGT GGGAGAAAAC 5580
ACCAGGATTC AAGTTTCACA GCACTTACAA CTTCCCTCTT CATTTGGTTC TTGTACTCTA 5640
CAAAATACAA CTTTCATAAC GTGGAAAAGC TATTTCACAG CATACATCAA TGATAATGCT 5700
AATTTTGTTA TATTCATGTG ACTTGGATTC AATTTTAAGA CATTTAATAA AATTTTAATA 5760
GTTCTATTTA TACTATTAAG TTTCAGCTGC AATTGTAAAT TAGTTACTAA ACATATATGA 5820
CATAGCTAAG ATATAATTCA AACTACTGAT TTTTAAATTA AACCAATTTT TGTGTAATTG 5880
TAAATTGTAT GTTATATTTT ATCAATGTTT ACCAGATTTA ATATATGTGA CAACAATTAT 5940
CACAAGATTT AATTATTTCT TAGTATGTAT ATTTAATTAG AAAAAGAGAA TAAAAATATG 6000
TAAGTGT 6008
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Mup-0105 ENSMPUP00000009565.1 Mustela putorius furo 84 0.0 3110
WERAM-Ran-0173 ENSRNOP00000072960.1 Rattus norvegicus 77 0.0 2313
Created Date 25-Jun-2016