WERAM Information


Tag                   Content
WERAM ID              WERAM-Miv-0015
Ensembl Protein ID    MVLG_02319T0
Gene Name
Ensembl Information
  Ensembl Gene ID   Ensembl Transcript ID   Ensembl Protein ID
  MVLG_02319        MVLG_02319T0            MVLG_02319T0
Status                Unreviewed
Classification
  Type   Family      E-value   Score   Start   End
  HAT    HAT_other   2.3       33
Organism              Microbotryum violaceum
Domain Profile
  HAT HAT_other

Query: 755 EHCPACSSPIALEGVRAARCERGHVWERC 783
           EHC  C   +     + A C  GH+W RC
Sbjct: 745 EHCSLCKEILPFTDRKQAVCSNGHIWLRC 773
Database: Homo_sapiens.GRCh38.pep.all.fa
Posted date: Jun 9, 2016 10:45 AM
Number of letters in database: 38,475,758
Number of sequences in database: 102,450

Lambda     K        H
0.316      0.131    0.396
Gapped
Lambda     K        H
0.267      0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 102450
Number of Hits to DB: 63,161,615
Number of extensions: 2503209
Number of successful extensions: 6328
Number of sequences better than 10.0: 1
Number of HSP's gapped: 6414
Number of HSP's successfully gapped: 1
Length of query: 871
Length of database: 38,475,758
Length adjustment: 116
Effective length of query: 755
Effective length of database: 26,591,558
Effective search space: 20076626290
Effective search space used: 20076626290
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 69 (31.2 bits)
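
The effective lengths and search space reported above can be re-derived from the other figures in the same summary. The sketch below assumes the usual BLAST convention, in which the length adjustment is subtracted once from the query and once per database sequence, and that the expected number of chance hits is the search space times 2 to the power of minus the bit score. The variable names are illustrative; the rounded bit score of 33 is taken from the Classification entry, so the E-value is only approximate.

```python
# Values copied from the BLAST summary in this record.
query_length = 871
db_letters = 38_475_758
db_sequences = 102_450
length_adjustment = 116

# Assumption: the adjustment is removed once from the query and once from
# every database sequence before the search space is computed.
effective_query = query_length - length_adjustment
effective_db = db_letters - db_sequences * length_adjustment
search_space = effective_query * effective_db

# Assumption: E = search_space * 2 ** (-bit_score), using the rounded
# bit score of 33 listed under Classification.
bit_score = 33
e_value = search_space * 2.0 ** (-bit_score)

print(effective_query)    # 755
print(effective_db)       # 26591558
print(search_space)       # 20076626290
print(round(e_value, 1))  # 2.3
```

Under these assumptions the computed values agree with the reported effective query length, effective database length, effective search space, and the E-value of 2.3.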

Protein Sequence
(Fasta)
MAASMQDASM RDDPAPDPNN FDEPELPSLP STSNFELPRY NISTYNFTPL VSSPSPLSLN 60
HAGQILLLTR EQINILTPAT GYQTEQPIVQ SGALANLTAN ANTSKGKERA NADDPTPLPA 120
FRTTIPIEKK DLINWGGWAN EIHVGTAASG SLEHFWRDAA WSPLGLGPLG GCALVALSNN 180
NEAIVYEPQK NAHKSQWVTA FDISSHLIRL LLRDEDPTTP ASDKPIETQE DRRRLATKIW 240
ECQSTAVAWS SAVTGAYGDF SILAIGHRSG HVSLWRRAHN GDLSILHRYR LDPQASWITL 300
LSWSNWIPSH DGTTTTGLLA ATDSKGQTWV VEVSQFVSAG RPLLSDQEAA PSEKTEDVWA 360
TKPIVVCETD QRPPTLAFTK LGTVTFIDLE PNTSEDPTST YQPESSYELE LTTQGDWMGT 420
TPFAPCSGIE WYASSKILIL SLESSAFYIL AKTEAQWHFI DSSESVEDGY PVTTTSAQVT 480
RIARQVFLRT FSQTYATRAR RQELQAAGGT WKKGARIYGL TGFGEDGELG WLYENVVPDV 540
MNYKAIGHTR TQFVVLPILG DVKGEKQRER LQKLLGEPRN PLESSPLAVL RAFLRYADET 600
ISNFAWTSSI LTLLEAESIP LETPIINSLI DRPTTAQAIY DHLYLDAALN ALRYRELVTR 660
FLARHTALPK GLKRRTGEVH RALARRIVHQ VVTRLGELMS SAQEHLMPTE LPISGRILLT 720
SASFTNAPLK FEDPSDVSSS LPDPDTLQHT FGGSEHCPAC SSPIALEGVR AARCERGHVW 780
ERCSITLKVV DTVVVRTCVG CERKALMGLG SKKEKKAVLP AETEGQEGAA EGEGRGIVET 840
MLRESAKPND EKKAQKRANN FNLAERERVS T 871
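
The sequence blocks in this record are printed in groups of ten with a running position counter at the end of each line. A minimal sketch for recovering the plain sequence from such a block follows; the helper name `join_numbered_block` is hypothetical, and it assumes the last field on each line is the counter whenever that field is purely numeric.

```python
def join_numbered_block(lines):
    """Concatenate sequence lines, dropping group spaces and trailing counters."""
    parts = []
    for line in lines:
        fields = line.split()
        # Assumption: a purely numeric last field is the position counter.
        if fields and fields[-1].isdigit():
            fields = fields[:-1]
        parts.append("".join(fields))
    return "".join(parts)


# First line of the protein block above.
first_line = "MAASMQDASM RDDPAPDPNN FDEPELPSLP STSNFELPRY NISTYNFTPL VSSPSPLSLN 60"
sequence = join_numbered_block([first_line])
print(len(sequence))  # 60
```

Passing all lines of a block should return a string whose length matches the final counter, which gives a quick consistency check on a copied sequence.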
Nucleotide Sequence
(Fasta)
ATGGCCGCGT CCATGCAAGA CGCGTCCATG CGCGATGACC CTGCTCCCGA CCCCAACAAC 60
TTTGACGAAC CCGAACTACC CTCCCTCCCT TCGACGTCAA ACTTTGAACT ACCCCGATAC 120
AACATCAGCA CCTACAACTT CACGCCCTTG GTCTCATCCC CAAGTCCATT GTCGCTCAAC 180
CATGCCGGCC AGATCCTGCT CCTGACGCGA GAGCAAATCA ACATTCTGAC CCCAGCAACA 240
GGCTACCAGA CCGAGCAACC GATCGTACAG TCCGGCGCAC TCGCCAACCT CACCGCCAAT 300
GCGAACACCT CAAAGGGCAA GGAGAGGGCC AATGCAGACG ATCCCACGCC TTTACCTGCC 360
TTTCGGACGA CTATCCCCAT AGAAAAAAAG GATCTTATCA ACTGGGGTGG ATGGGCGAAT 420
GAGATCCACG TCGGGACGGC CGCGTCGGGT TCACTGGAAC ACTTCTGGAG GGATGCAGCT 480
TGGTCACCAT TGGGTTTGGG CCCCCTAGGA GGGTGTGCCC TCGTGGCTTT GTCCAACAAC 540
AACGAAGCCA TCGTATATGA ACCTCAAAAG AACGCACACA AGTCACAATG GGTCACTGCC 600
TTTGACATTT CGTCGCACTT GATCAGGTTA CTTTTGCGCG ATGAAGATCC GACGACGCCC 660
GCGAGTGATA AACCGATCGA GACGCAGGAG GATCGACGCC GGCTCGCCAC CAAAATTTGG 720
GAATGTCAAA GCACAGCCGT CGCTTGGTCG TCTGCGGTAA CCGGTGCTTA TGGAGACTTT 780
TCGATTCTCG CCATCGGTCA CAGGAGTGGG CATGTTTCGC TGTGGAGACG AGCTCACAAC 840
GGCGACCTAT CGATACTGCA TCGGTACCGG CTCGACCCCC AAGCGAGCTG GATTACCCTC 900
CTCTCGTGGT CCAACTGGAT ACCGTCCCAC GACGGCACAA CTACAACAGG CCTACTAGCC 960
GCCACCGACT CGAAAGGTCA GACATGGGTC GTCGAAGTCT CGCAGTTTGT CAGCGCGGGT 1020
CGGCCGTTGT TATCGGATCA AGAGGCTGCA CCGTCTGAAA AGACGGAGGA TGTTTGGGCG 1080
ACGAAACCTA TTGTCGTTTG TGAGACCGAT CAGCGACCCC CTACTCTCGC ATTCACCAAG 1140
CTCGGTACAG TCACCTTCAT CGACTTGGAG CCTAACACCT CTGAGGACCC TACGTCGACC 1200
TACCAACCTG AAAGCTCTTA CGAGCTCGAA TTGACCACAC AAGGGGACTG GATGGGTACG 1260
ACGCCCTTCG CTCCATGCAG CGGGATCGAA TGGTACGCCT CGTCCAAAAT CCTCATCCTC 1320
TCCCTCGAAT CATCGGCTTT CTATATCCTC GCCAAAACGG AGGCTCAATG GCACTTTATC 1380
GACTCTTCCG AATCCGTCGA GGATGGATAC CCAGTCACGA CGACGAGTGC CCAAGTGACC 1440
CGAATAGCAC GCCAAGTCTT CCTCCGCACC TTTAGCCAAA CCTACGCCAC CCGTGCCAGA 1500
AGACAAGAAC TTCAAGCCGC CGGAGGCACA TGGAAGAAGG GTGCTCGGAT CTACGGCCTG 1560
ACCGGTTTCG GGGAGGATGG TGAATTGGGT TGGTTGTATG AGAACGTCGT GCCCGATGTG 1620
ATGAATTATA AGGCCATTGG GCATACGAGA ACGCAGTTTG TAGTGTTGCC AATTTTGGGG 1680
GATGTGAAAG GGGAGAAGCA GAGGGAGAGG TTGCAGAAAC TTTTAGGCGA GCCGAGGAAT 1740
CCTCTCGAAT CCTCACCACT CGCTGTCCTG CGCGCCTTCC TACGCTACGC CGATGAAACC 1800
ATTTCCAACT TTGCTTGGAC GTCCTCCATC CTCACCCTAC TTGAAGCCGA ATCGATACCT 1860
CTCGAAACCC CCATCATCAA TTCCCTTATA GACCGTCCAA CCACAGCCCA AGCCATCTAC 1920
GATCACCTCT ACCTCGACGC CGCGCTCAAC GCTCTGCGGT ACCGAGAACT CGTCACTCGA 1980
TTCTTGGCCC GACATACGGC TCTGCCGAAG GGGTTGAAGA GGAGGACAGG AGAGGTTCAT 2040
CGTGCATTGG CGAGGAGAAT TGTGCATCAG GTTGTGACGA GGTTGGGAGA GCTGATGAGC 2100
AGTGCTCAGG AACATCTGAT GCCTACCGAA CTTCCCATCT CCGGTCGCAT CCTCCTCACC 2160
TCGGCCTCCT TCACCAACGC CCCACTCAAG TTCGAAGACC CATCCGACGT CTCCTCCTCC 2220
CTACCCGACC CGGACACCCT ACAACACACA TTCGGCGGTT CCGAACATTG TCCCGCCTGT 2280
TCCTCCCCCA TCGCCTTGGA AGGCGTTCGT GCAGCGAGGT GTGAAAGGGG TCATGTTTGG 2340
GAGAGGTGTT CGATTACGTT GAAGGTGGTG GATACGGTGG TGGTGAGGAC CTGTGTGGGG 2400
TGTGAGAGGA AGGCTTTGAT GGGGTTGGGG TCGAAGAAGG AGAAGAAGGC GGTGCTCCCG 2460
GCGGAGACGG AGGGGCAGGA AGGGGCGGCT GAAGGGGAAG GGAGAGGAAT AGTTGAGACT 2520
ATGCTGAGGG AGTCTGCGAA ACCAAATGAC GAAAAGAAGG CGCAGAAGAG GGCAAACAAC 2580
TTCAATCTAG CAGAACGTGA GCGCGTATCC ACTTAG 2616
Sequence Source       Ensembl
Orthology
WERAM ID         Ensembl Protein ID   Species                      Identity   E-value   Score
WERAM-Pug-0039   EHS64598             Puccinia graminis            29         1e-19     96.3
WERAM-Lab-0048   EDR10144             Laccaria bicolor             25         2e-09     62.0
WERAM-Mel-0033   EGG04126             Melampsora larici-populina   48         3e-07     55.1
Created Date          25-Jun-2016