WERAM Information


Tag Content
WERAM ID WERAM-Sus-0131
Ensembl Protein ID ENSSSCP00000018442.2
Gene Name
Ensembl Information
Ensembl Gene ID        Ensembl Transcript ID  Ensembl Protein ID
ENSSSCG00000017402.2   ENSSSCT00000018947.2   ENSSSCP00000018442.2
Status Unreviewed
Classification
Type Family E-value Score Start End
HAT HAT_other 3.6 29
Organism Sus scrofa
Domain Profile
  HAT HAT_other

Query: 27  PIEVRHYLSQWIESQAWDSIDLDNPQENIKATQLLEGLVQELQKKAEHQVGEDGFLLKIK 86
           P  VR YL + ++   W   D+   +EN  A   +     + + +  HQ+ E+  LLK+
Sbjct: 249 PARVRSYL-RGMKYPLWQVGDIFTSKENSLAVYNIPLFPDDPKARFIHQLAEEDRLLKVS 307

Query: 87  LGHYATQLQKPYNF 100
           L  +  +LQ+   F
Sbjct: 308 LSSFWIELQERQEF 321

Protein Sequence
(Fasta)
MAVWIQAQQL QGDALHQMQA LYGQHFPIEV RHYLSQWIES QAWDSIDLDN PQENIKATQL 60
LEGLVQELQK KAEHQVGEDG FLLKIKLGHY ATQLQKPYNF PPLDFVYCIL TVLFNERQLV 120
EHGGNGSSSM GAPAAAAAEE PNTVSKLLEK VRKVSVNVKT KLEKLQQTQE YFIIQYQESL 180
RIQAQFAQLA QLNPQERLSR ETALQQKQVT LEAWLQREAQ TLQQYRVELA EKHQKTLQLL 240
RKQQTIILDD ELIQWKRRQQ LAGNGGPPEG SLDVLQSWCE KLAEIIWQNR QQIRRAEHLC 300
QQLPIPGPVE EMLAEVNATI TDIISALVTS TFIIEKQPPQ VLKTQTKFAA TVRLLVGGKL 360
NVHMNPPQVK ATIISEQQAK SLLKNESTRN ESSGEILNNC CVMEYHQATG TLSAHFRNMS 420
LKRIKRADRR GAESVTEEKF TVLFESQFSV GSNELVFQVK TLSLPVVVIV HGSQDHNATA 480
TVLWDNAFAE PGRVPFAVPD KVLWPQLCEA LNMKFKAEVQ SNRGLTKENL VFLAQKLFNS 540
SSSHLEDYSG MSVSWSQFNR ENLPGWNYTF WQWFDGVMEV LKKHHKPHWN DGAILGFVNK 600
QQAHDLLINK PDGTFLLRFS DSEIGGITIA WKFDSPDRNL WNLKPFTTRD FSIRSLADRL 660
GDLSYLIYVF PDRPKDEVFS KYYTPVLAPA SAAKAVDGYV KPQIKQVVPE FVSASSDSAG 720
GNATYMDQAP SPAVCPQAHY SIYPQNPDPV LDQDGEFDLD ETMDVARHVE ELLRRPMDSL 780
DPRLSPPAGL FASTRGSLS 799
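
The listing above groups residues in blocks of ten and ends each line with a running position counter, so it cannot be pasted directly into most tools. A minimal Python sketch for joining it into one contiguous sequence follows; the function name join_blocks is hypothetical, and it assumes every all-digit token is a position counter rather than sequence.

```python
def join_blocks(listing: str) -> str:
    """Join a block-formatted sequence listing into one contiguous string.

    Assumes the layout used in this record: space-separated blocks of
    residues, with an all-digit running position counter ending each line.
    """
    residues = []
    for line in listing.splitlines():
        for token in line.split():
            if token.isdigit():  # position counter, not sequence
                continue
            residues.append(token.upper())
    return "".join(residues)


# First and last lines of the protein listing in this record.
first_line = "MAVWIQAQQL QGDALHQMQA LYGQHFPIEV RHYLSQWIES QAWDSIDLDN PQENIKATQL 60"
last_line = "DPRLSPPAGL FASTRGSLS 799"

print(len(join_blocks(first_line)))   # 60
print(len(join_blocks(last_line)))    # 19
print(join_blocks(first_line)[26:30])  # PIEV (residues 27-30, start of the domain hit)
```

The same helper should work for the nucleotide listing, since it uses the same block-and-counter layout.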
Nucleotide Sequence
(Fasta)
AGATTGTAAA CCATGGCTGT GTGGATACAA GCCCAGCAGC TCCAAGGAGA TGCCCTTCAT 60
CAGATGCAAG CACTGTATGG TCAGCATTTC CCCATCGAGG TGCGGCATTA TTTATCCCAG 120
TGGATTGAAA GCCAAGCTTG GGACTCAATA GATCTTGATA ATCCACAGGA GAACATTAAG 180
GCCACCCAGC TCCTGGAGGG CCTGGTGCAG GAGCTGCAGA AGAAGGCAGA GCACCAAGTG 240
GGGGAAGATG GGTTCTTACT GAAGATCAAG CTGGGGCACT ACGCCACGCA GCTCCAGAAA 300
CCCTACAATT TCCCACCTCT GGACTTTGTT TATTGCATAC TTACAGTGCT GTTTAATGAG 360
AGACAACTGG TAGAACATGG AGGAAACGGG TCCTCCTCCA TGGGGGCTCC GGCGGCCGCG 420
GCGGCCGAGG AGCCCAACAC GGTGAGCAAG TTGCTGGAGA AGGTGCGCAA GGTCAGCGTC 480
AACGTGAAGA CCAAGCTGGA GAAGCTGCAG CAGACTCAAG AGTACTTCAT CATCCAGTAT 540
CAAGAGAGCC TGAGGATCCA GGCCCAGTTT GCCCAGCTGG CCCAGCTGAA CCCCCAGGAG 600
CGTCTGAGCC GGGAGACGGC CCTCCAGCAG AAGCAGGTGA CCCTGGAGGC CTGGCTGCAG 660
CGCGAGGCCC AGACCCTGCA GCAGTACCGC GTGGAGCTGG CCGAGAAGCA CCAGAAGACC 720
CTGCAGCTGC TGCGGAAGCA GCAGACCATC ATTCTGGATG ACGAGCTGAT CCAGTGGAAG 780
CGGCGGCAGC AGCTGGCGGG GAACGGAGGG CCCCCCGAGG GCAGCCTGGA CGTGCTGCAG 840
TCCTGGTGTG AGAAGTTGGC GGAGATCATC TGGCAGAACC GGCAGCAGAT CCGCAGAGCT 900
GAGCACCTCT GCCAGCAGCT GCCCATCCCC GGCCCCGTGG AGGAGATGCT GGCTGAGGTC 960
AACGCCACCA TCACAGACAT CATCTCAGCC CTGGTGACCA GCACATTCAT CATCGAGAAG 1020
CAGCCCCCTC AGGTCCTGAA GACGCAGACC AAGTTCGCGG CCACCGTGCG CCTGCTGGTG 1080
GGCGGGAAGC TGAACGTGCA CATGAACCCC CCGCAGGTGA AGGCCACCAT CATCAGTGAG 1140
CAGCAGGCCA AGTCGCTGCT CAAGAACGAG AGCACCCGCA ACGAGTCCAG CGGCGAGATC 1200
TTGAACAACT GCTGTGTGAT GGAGTATCAC CAGGCCACCG GCACCCTCAG CGCCCACTTC 1260
CGGAACATGT CACTAAAGAG GATCAAGCGC GCTGACCGGC GAGGTGCAGA GTCCGTGACG 1320
GAGGAGAAGT TCACGGTCCT GTTCGAGTCT CAGTTCAGTG TCGGCAGCAA TGAGCTGGTG 1380
TTCCAGGTGA AGACCCTGTC CCTTCCCGTG GTTGTCATCG TTCATGGCAG CCAGGACCAC 1440
AATGCTACCG CCACCGTGCT GTGGGACAAT GCCTTTGCTG AGCCGGGCAG GGTGCCCTTT 1500
GCGGTGCCTG ACAAAGTCCT GTGGCCGCAG CTGTGCGAGG CGCTCAACAT GAAATTCAAG 1560
GCCGAGGTGC AGAGCAACCG GGGCCTGACC AAGGAGAACC TCGTGTTCCT GGCGCAGAAG 1620
CTCTTCAACA GCAGCAGCAG CCACCTGGAG GACTACAGCG GCATGTCCGT GTCCTGGTCC 1680
CAGTTCAACA GGGAGAACTT ACCAGGCTGG AACTACACCT TCTGGCAGTG GTTTGACGGG 1740
GTCATGGAGG TGCTGAAGAA GCATCACAAG CCCCATTGGA ATGACGGGGC CATCCTAGGT 1800
TTCGTGAATA AGCAACAGGC CCATGACCTG CTCATCAACA AGCCCGATGG GACCTTCCTG 1860
CTGCGCTTTA GCGACTCAGA AATCGGGGGC ATCACCATTG CCTGGAAGTT TGACTCCCCT 1920
GACCGTAACC TGTGGAATCT GAAGCCTTTC ACCACGCGGG ATTTCTCCAT CCGGTCCCTG 1980
GCGGACCGGC TGGGGGACCT GAGCTATCTC ATCTATGTGT TTCCCGACCG GCCCAAGGAC 2040
GAGGTCTTCT CCAAGTACTA CACTCCTGTG CTCGCTCCGG CCTCGGCAGC TAAAGCAGTG 2100
GACGGATACG TGAAGCCACA GATCAAGCAA GTGGTCCCTG AGTTTGTGAG CGCCTCTTCA 2160
GACTCTGCCG GGGGCAATGC CACCTACATG GACCAGGCCC CCTCCCCAGC CGTGTGCCCC 2220
CAGGCTCATT ACAGCATATA CCCACAGAAC CCGGACCCCG TCCTCGACCA GGATGGAGAA 2280
TTCGACCTGG ATGAGACCAT GGATGTAGCC CGGCATGTGG AGGAACTCTT ACGCCGCCCG 2340
ATGGACAGTC TGGACCCCCG CCTCTCCCCG CCTGCTGGTC TCTTCGCTTC TACCAGGGGC 2400
TCACTCTCCT GA 2413
Sequence Source Ensembl
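
As a consistency check between the two listings, the nucleotide sequence can be translated and compared with the protein sequence. The Python sketch below is an illustration only: the function name translate_from_first_atg is hypothetical, it uses the standard genetic code, and it assumes the first ATG in the transcript is the start codon, which the record itself does not state.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate_from_first_atg(nucleotides: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input.

    Assumption: the first ATG is the translation start. Codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    seq = nucleotides.upper()
    start = seq.find("ATG")
    if start == -1:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases of the nucleotide listing, with blocks joined.
first_60 = "AGATTGTAAACCATGGCTGTGTGGATACAAGCCCAGCAGCTCCAAGGAGATGCCCTTCAT"
print(translate_from_first_atg(first_60))  # MAVWIQAQQLQGDALH
```

The printed residues match the start of the protein listing (MAVWIQAQQL QGDALH...), which suggests the coding region begins at base 13 of the transcript.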
Orthology
WERAM ID        Ensembl Protein ID    Species              Identity  E-value  Score
WERAM-Fia-0007  ENSFALP00000000694.1  Ficedula albicollis  84        0.0      1374
WERAM-Sah-0037  ENSSHAP00000004406.1  Sarcophilus harrisii 87        0.0      1342
WERAM-Tag-0038  ENSTGUP00000002670.1  Taeniopygia guttata  83        0.0      1297
WERAM-Ocp-0086  ENSOPRP00000007636.1  Ochotona princeps    74        0.0      644
Created Date 25-Jun-2016