WERAM Information


Tag Content
WERAM ID WERAM-Arl-0126
Ensembl Protein ID scaffold_401535.1
Gene Name HSI2
Ensembl Information
Ensembl Gene ID Ensembl Transcript ID Ensembl Protein ID
scaffold_401535.1 scaffold_401535.1 scaffold_401535.1
Status Unreviewed
Classification
Type Family E-value Score Start End
Me_Reader ZF-CW 5.80e-18 65.3 59 595
Organism Arabidopsis lyrata
Domain Profile
  Me_Reader ZF-CW

          ZF-CW.txt  4 ekvWvqCddClK 15
++ W +C C K
scaffold_401535.1 59 QSGWRECYLCNK 70
677999999988 PP
ZF-CW.txt 2 geekvWvqCddClKWRrLpgevdasvlsekWiCinNs.dvrynnCsvpeEsl 52
ge+++W++CddC+KWRrLp vda l++kW+Ci+N+ dv++++Cs+peEsl
scaffold_401535.1 547 GEQERWATCDDCSKWRRLP--VDAL-LPFKWTCIDNVwDVSRCSCSAPEESL 595
689****************..9996.************************97 PP

Protein Sequence
(Fasta)
MFEVKMVSKM CMNASCGTTS TVEWKKGWPL RSGLLADLCY RCGSAYESSL FCEQFHKDQS 60
GWRECYLCNK RLHCGCIASK VTIELMDYGG VGCSTCTCCH QLNLNTRGEN PGVFSRLPMK 120
PLADRQHVNG ESGMNIDGGR NEAGLFSQPL VMGGDKREEF MPHRGFGQLM NSENTTTGYR 180
LDAAGEMHES SPSQPSLNMG LAVNPFSPSF ATEGLEGKKH IGASQPNMVH GSASNILQKP 240
SKPAISTPPV ASKSAQARIG RPPVEGRGKG HLLPRYWPKY TDKEVQQISG NLNLNIVPLF 300
EKTLSASDAG RIGRLVLPKA CAEAYFPPIS QSEGIPLKIQ DVRGKEWTFQ FRYWPNNNSR 360
MYVLEGVTPC IQSMMLQAGD TVTFSRVDPG GKLIMGSRKA ANAGDMQGCG LTNGTSTEDT 420
SSSGVTENPP SINGSSCPSL IPQELNGMPE NLSSQKSETN GGRIGDDPAR VKEKKRTRTI 480
GAKNKRLLLH SEESMELRLT WEEAQDLLRP SPSAKPTIVV IEEKEIEEYD EPPVFGKRTI 540
VTTRPSGEQE RWATCDDCSK WRRLPVDALL PFKWTCIDNV WDVSRCSCSA PEESLKELEN 600
VLKVGREYKK RRTGESQAAK SQQEPCGLDA LASAAVLGDT IGEPEVATTT RHPRHRAGCS 660
CIVCIQPPSG KGRHKPTCGC TVCSTVKRRF KTLMMRRKKK QLERDVTAAE DKKKKDMELA 720
ESDKSKEEKE VNTARIDLNS DPYNKEDAEA VAVEKEESRK RAIGQCSGVV AQGAGDVLGV 780
TELEGEAKKV GEEPRVSS 798
Nucleotide Sequence
(Fasta)
AAAATCTAGG TGGGTTTGTG ATGTTTGAAG TCAAAATGGT GTCAAAGATG TGCATGAACG 60
CTTCATGTGG TACGACTTCT ACTGTTGAAT GGAAGAAAGG TTGGCCTCTT CGATCTGGTC 120
TTCTCGCTGA TCTCTGTTAT CGTTGCGGAT CTGCGTATGA GAGTTCTCTT TTCTGTGAAC 180
AATTCCATAA GGACCAATCT GGTTGGAGGG AATGCTATTT GTGTAACAAG AGACTACATT 240
GTGGATGCAT TGCTTCTAAG GTAACGATTG AGCTTATGGA CTATGGTGGT GTTGGTTGTA 300
GTACTTGTAC TTGCTGCCAT CAACTCAATT TGAACACGAG GGGTGAGAAT CCAGGTGTTT 360
TTAGCAGATT GCCAATGAAA CCGTTAGCTG ATAGACAACA TGTAAATGGC GAAAGCGGAA 420
TGAATATTGA CGGAGGAAGA AACGAAGCTG GTCTCTTTTC TCAGCCACTA GTCATGGGCG 480
GAGATAAAAG GGAAGAGTTC ATGCCTCACC GTGGGTTTGG TCAGCTAATG AATTCAGAAA 540
ATACCACCAC CGGGTATAGG CTGGATGCTG CTGGGGAAAT GCATGAATCA TCACCTTCAC 600
AGCCATCTTT AAACATGGGT TTGGCTGTAA ATCCATTTAG CCCATCTTTT GCAACCGAGG 660
GTCTCGAGGG AAAGAAACAC ATCGGTGCTT CTCAGCCCAA CATGGTCCAT GGCTCTGCCT 720
CTAATATACT GCAAAAACCA TCAAAACCTG CTATTTCAAC TCCTCCTGTG GCTAGTAAAT 780
CCGCTCAGGC GCGGATTGGA AGACCTCCTG TCGAAGGGCG AGGGAAAGGC CACTTACTTC 840
CTCGGTATTG GCCAAAATAT ACAGATAAAG AGGTTCAGCA GATCTCGGGA AACTTGAATT 900
TAAACATTGT ACCTCTTTTT GAGAAAACTC TTAGTGCCAG TGATGCTGGT CGCATTGGTC 960
GTCTAGTTCT TCCCAAAGCC TGTGCAGAGG CATATTTTCC TCCGATTAGT CAATCGGAAG 1020
GCATTCCTCT GAAAATCCAA GATGTGAGGG GTAAGGAGTG GACGTTCCAG TTCAGATATT 1080
GGCCCAATAA CAACAGTAGA ATGTATGTTT TAGAAGGGGT CACTCCATGC ATACAGTCCA 1140
TGATGCTACA GGCTGGTGAT ACAGTAACTT TCAGTCGGGT TGATCCTGGT GGAAAACTAA 1200
TCATGGGTTC CAGAAAGGCA GCTAATGCTG GAGACATGCA GGGTTGTGGT CTCACCAATG 1260
GAACTTCAAC CGAGGACACA TCATCGTCTG GTGTTACAGA AAACCCACCC TCCATAAATG 1320
GGTCATCGTG TCCCTCACTA ATACCGCAAG AATTGAACGG TATGCCTGAA AATTTGAGCT 1380
CACAGAAGAG TGAGACTAAC GGGGGCAGGA TAGGTGATGA TCCTGCACGA GTTAAAGAGA 1440
AGAAGAGAAC TCGAACCATT GGGGCAAAAA ATAAGAGACT TCTTTTGCAT AGTGAAGAAT 1500
CTATGGAGCT GAGACTCACC TGGGAGGAAG CTCAGGACTT GCTTCGTCCC TCTCCTAGTG 1560
CAAAGCCTAC CATCGTCGTC ATTGAGGAGA AAGAAATTGA AGAATATGAC GAACCTCCTG 1620
TCTTTGGAAA GAGGACTATA GTCACTACAA GACCTTCAGG TGAACAGGAA CGATGGGCAA 1680
CTTGCGATGA CTGCTCTAAA TGGAGAAGGT TACCTGTAGA CGCTCTTCTT CCCTTTAAAT 1740
GGACATGTAT AGACAATGTT TGGGACGTGA GCAGGTGTTC ATGTTCTGCA CCGGAGGAGA 1800
GTCTGAAGGA ACTTGAGAAT GTTCTTAAAG TAGGAAGAGA GTACAAGAAG AGAAGAACTG 1860
GGGAAAGCCA GGCAGCAAAA AGTCAGCAAG AACCGTGTGG TTTGGATGCA CTGGCGAGTG 1920
CAGCAGTCTT AGGAGACACA ATAGGCGAGC CAGAGGTTGC CACCACGACC AGACATCCAA 1980
GGCACAGGGC TGGATGCTCT TGCATCGTGT GCATTCAGCC ACCAAGTGGG AAAGGTAGAC 2040
ACAAGCCTAC ATGTGGGTGC ACTGTGTGTA GCACCGTGAA GAGAAGGTTC AAAACGCTTA 2100
TGATGAGGAG GAAGAAGAAG CAGTTGGAGC GCGATGTAAC AGCAGCAGAA GATAAGAAGA 2160
AGAAGGACAT GGAACTGGCT GAGTCTGATA AGAGTAAGGA GGAGAAGGAA GTGAACACAG 2220
CGAGGATAGA TCTGAACAGT GATCCATACA ACAAAGAAGA TGCTGAAGCT GTGGCGGTGG 2280
AGAAAGAAGA GAGTCGAAAA AGAGCAATAG GACAGTGTTC GGGCGTGGTG GCTCAAGGCG 2340
CTGGTGATGT ATTAGGAGTT ACGGAGTTAG AAGGAGAGGC TAAGAAAGTT GGTGAAGAGC 2400
CGAGAGTTTC AAGCTAATAT GGAAGGAGAA AAACGAA 2438
Sequence Source Ensembl
Orthology
WERAM ID Ensembl Protein ID Species Identity E-value Score
WERAM-Art-0048 AT2G30470.1 Arabidopsis thaliana 94 0.0 1474
Created Date 25-Jun-2016