WERAM Information


Tag                 Content
WERAM ID            WERAM-Pug-0039
Ensembl Protein ID  EHS64598
Gene Name
Ensembl Information
Ensembl Gene ID  Ensembl Transcript ID  Ensembl Protein ID
PGTG_22386       EHS64598               EHS64598
Status Unreviewed
Classification
Type  Family     E-value  Score  Start  End
HAT   HAT_other  0.076    38
Organism Puccinia graminis
Domain Profile
  HAT HAT_other

Query: 923 EQCPACLQSVCFDSLRFAICRVGHVWDRCSVTFQILSTIKVRIC 966
           E C  C + + F   + A+C  GH+W RC +T+Q   ++  R C
Sbjct: 745 EHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRC 788
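
The alignment above can be summarised by counting identical positions between the query and subject segments. The following is a minimal sketch, not part of the WERAM record; the function name `alignment_stats` is hypothetical, and the two strings are copied from the Query and Sbjct lines.

```python
def alignment_stats(query: str, subject: str) -> tuple[int, int, float]:
    """Return (identical positions, aligned length, percent identity)."""
    if len(query) != len(subject):
        raise ValueError("aligned segments must have equal length")
    # Count columns where both segments carry the same residue (gaps never count).
    identical = sum(q == s and q != "-" for q, s in zip(query, subject))
    length = len(query)
    return identical, length, 100.0 * identical / length


QUERY = "EQCPACLQSVCFDSLRFAICRVGHVWDRCSVTFQILSTIKVRIC"    # query residues 923-966
SUBJECT = "EHCSLCKEILPFTDRKQAVCSNGHIWLRCFLTYQSCQSLIYRRC"  # subject residues 745-788

identical, length, pct = alignment_stats(QUERY, SUBJECT)
print(f"{identical}/{length} identical ({pct:.0f}%)")  # prints "15/44 identical (34%)"
```

The 44 aligned columns agree with the coordinate ranges 923-966 and 745-788 shown in the alignment.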
Database: Homo_sapiens.GRCh38.pep.all.fa
Posted date: Jun 9, 2016 10:45 AM
Number of letters in database: 38,475,758
Number of sequences in database: 102,450

Lambda  K       H
0.318   0.133   0.406
Gapped
Lambda  K       H
0.267   0.0410  0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Sequences: 102450
Number of Hits to DB: 88,443,981
Number of extensions: 4040516
Number of successful extensions: 12714
Number of sequences better than 10.0: 9
Number of HSP's gapped: 12939
Number of HSP's successfully gapped: 12
Length of query: 1046
Length of database: 38,475,758
Length adjustment: 117
Effective length of query: 929
Effective length of database: 26,489,108
Effective search space: 24608381332
Effective search space used: 24608381332
Neighboring words threshold: 11
Window for multiple hits: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 70 (31.6 bits)
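
The search-space and bit-score figures in the statistics above follow from simple arithmetic, which can be checked with a short sketch. It assumes the usual Karlin-Altschul conversion, bits = (lambda * raw - ln K) / ln 2 for score thresholds and bits = lambda * raw / ln 2 for X drop-off values, and it assumes that X1 and S1 use the ungapped parameters while X2, X3 and S2 use the gapped ones; that assignment is inferred from the fact that it reproduces the reported numbers. All function names are hypothetical.

```python
import math

# Values copied from the BLAST statistics above.
QUERY_LEN = 1046
DB_LETTERS = 38_475_758
DB_SEQS = 102_450
LENGTH_ADJ = 117
UNGAPPED = {"lam": 0.318, "k": 0.133}
GAPPED = {"lam": 0.267, "k": 0.0410}


def effective_search_space(query_len, db_letters, db_seqs, length_adj):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - length_adj
    # The length adjustment is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * length_adj
    return eff_query, eff_db, eff_query * eff_db


def score_bits(raw, lam, k):
    """Convert a raw score threshold to bits."""
    return (lam * raw - math.log(k)) / math.log(2)


def dropoff_bits(raw, lam):
    """Convert a raw X drop-off value to bits (no K term)."""
    return lam * raw / math.log(2)


print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, LENGTH_ADJ))
print(f"X1 {dropoff_bits(16, UNGAPPED['lam']):.1f}  S1 {score_bits(41, **UNGAPPED):.1f}")
print(f"X2 {dropoff_bits(38, GAPPED['lam']):.1f}  S2 {score_bits(70, **GAPPED):.1f}")
```

Under these assumptions the sketch reproduces the effective lengths 929 and 26,489,108 and the effective search space 24608381332 reported above.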

Protein Sequence
MYARCRLSEF CDEDRLSVGR GGDVHETEPE RYKDTDDMSS DTAAGQTTQP IDLPKPTDSS 60
YGHADILQTL PIDSSPSCLE WSNDGQAFAV TKANIYLLTP ILGYLVSPED RHTHTQAQPT 120
ENGSHSQDAP STPSRSNPSG IQESIADSTT KSAGGQPKIP FFTTIIEINK QLGVNWSTHS 180
NDISTITPPN DDRFWRAATW SPSGLSALGS CLLAGLSTTC DVFVYSPCQN YQAGLWAIKE 240
TLNLSEELLT IFYDFYPSII GHQEDPTKSI DVSPETAWDI SEEGEEKRSR FTAGVLRTQA 300
TCIAWSPAYS HSNCDTPVLN QHDDLYDVDF SLLAVGHRRG DLSLWRHTSK GHMELESFNP 360
ICSDGRTLNL LSWSDWKLSA RRRVDDTSAQ QYQLTAHLAV ANSKGVVYLM QVCRPFERPT 420
KPVSPISRME IQIIGIYQDP LNQSSITYFK WLPPSGNTLS RLIISRLGEI VLVPLSSTST 480
EGQSDSQSGL FLGPTQVIQL PVLQPHDDRL CWADCNSWAT CSGMSIIPTS LPGITQIIAM 540
LSNGLIFVLR ESLDPNPSSS SSAMANPLEL DLAHSVQLSL DFRAKFRAIG YSAHPPPNST 600
PITKQNVMSV YGSHVLSGLQ HVSGDHQQLH SIDPISLLSS CIMSWLYEID APNKFRYKPE 660
NHQILQFCLA YFNPVGSVQR RVQLLGMLEN QITAILPAML SKSASSILVS PSSKLLNIFN 720
ILHSIFSDPG SRESPSFDLL NSTLSNLVDL LPLGTETAYF ALPDHQPILN YDGGTTSMAL 780
KSQLINQLFY NLNLDHLRLK VNLCNFLLGR ADKAKFPSLR SRLVEAKVSL SRVIHRLVLQ 840
TIAQFFLRHV GQLSSDERPV FNRYQCATKA IDRFPEPTIE QLLEPDEVDH KLLDQDRLSS 900
APDDSLGNGN PLISFGGEEN EVEQCPACLQ SVCFDSLRFA ICRVGHVWDR CSVTFQILST 960
IKVRICTGCG RKSMVKGGGE DQTTSRPPEA SQEGAQMETG DHQHPSTSTQ QEDADVGRLG 1020
KPSLVQVLLD SSICCWHCGG RWRLSS 1046
Nucleotide Sequence
ATGTATGCGC GTTGTCGGTT GTCGGAGTTT TGTGACGAAG ATCGTTTGAG TGTGGGCCGG 60
GGAGGAGATG TGCACGAAAC CGAACCGGAA CGATATAAGG ATACCGATGA CATGAGCTCA 120
GATACGGCAG CTGGACAAAC AACACAACCC ATCGACCTGC CCAAGCCGAC GGACTCGAGC 180
TATGGACATG CGGACATCCT CCAAACCCTC CCGATCGATA GCTCCCCTTC ATGTTTGGAA 240
TGGTCGAACG ACGGCCAAGC CTTTGCAGTC ACCAAGGCGA ACATATACCT CTTGACACCC 300
ATTCTCGGAT ACTTGGTTTC TCCGGAAGAC CGTCATACGC ATACACAAGC CCAACCCACT 360
GAGAATGGCT CTCATTCGCA AGACGCTCCA TCTACGCCTT CAAGATCTAA TCCTTCGGGC 420
ATACAAGAAT CAATAGCAGA CTCGACTACA AAATCGGCCG GTGGTCAGCC TAAGATACCA 480
TTCTTTACGA CGATTATCGA GATCAATAAA CAACTTGGCG TTAATTGGTC AACCCACTCT 540
AATGATATAT CGACCATCAC ACCGCCTAAT GATGATCGAT TCTGGCGAGC AGCTACCTGG 600
TCTCCTAGTG GCTTATCAGC CTTGGGATCG TGCTTGCTTG CTGGATTGTC TACCACTTGC 660
GATGTCTTTG TATATTCACC CTGTCAGAAT TACCAAGCTG GATTGTGGGC AATCAAAGAA 720
ACGCTAAACC TCTCCGAAGA ATTGTTGACA ATCTTTTACG ATTTTTATCC ATCTATCATT 780
GGCCATCAAG AGGACCCGAC GAAGTCAATT GATGTCAGTC CAGAGACTGC ATGGGACATA 840
TCGGAAGAGG GAGAGGAGAA ACGAAGCAGA TTTACTGCGG GCGTTCTCAG AACTCAAGCC 900
ACTTGTATTG CCTGGTCGCC TGCCTATTCA CATTCCAACT GCGACACACC CGTCTTGAAT 960
CAGCATGACG ATTTATATGA TGTCGATTTT TCGCTGCTTG CGGTAGGCCA CAGAAGAGGG 1020
GATCTATCTC TGTGGAGACA CACATCCAAA GGACACATGG AGCTTGAGTC CTTTAACCCA 1080
ATTTGCTCCG ACGGCCGGAC TCTCAACTTG CTATCGTGGT CAGATTGGAA ATTGAGCGCT 1140
AGGCGAAGAG TTGACGATAC ATCGGCACAG CAGTACCAGT TAACTGCCCA TCTAGCAGTG 1200
GCTAATTCGA AAGGGGTAGT TTATCTCATG CAAGTCTGTC GACCGTTCGA GAGACCGACG 1260
AAGCCAGTCT CTCCCATTTC CCGGATGGAG ATACAGATCA TAGGGATATA CCAGGACCCA 1320
CTGAACCAAT CATCGATCAC CTACTTCAAA TGGTTACCTC CCTCTGGGAA TACTCTGTCG 1380
AGATTGATCA TCAGTCGGTT GGGCGAGATC GTTCTTGTTC CGTTGTCTTC GACTTCTACT 1440
GAAGGCCAGA GTGATTCTCA GTCTGGATTG TTCCTCGGAC CCACTCAGGT GATTCAATTG 1500
CCTGTCTTGC AACCTCATGA CGATCGCTTG TGTTGGGCGG ACTGCAACAG CTGGGCAACA 1560
TGTTCAGGAA TGAGTATCAT CCCAACAAGT TTACCTGGGA TTACGCAAAT CATAGCGATG 1620
CTTTCGAACG GCTTGATATT CGTTTTGAGG GAATCTTTAG ATCCCAACCC GTCATCATCA 1680
TCTTCGGCTA TGGCCAACCC ACTCGAGCTC GACTTGGCAC ATTCCGTGCA ACTTTCACTA 1740
GACTTTCGAG CCAAGTTTCG CGCGATCGGC TATTCTGCCC ACCCTCCACC CAACAGCACG 1800
CCGATCACCA AGCAGAACGT CATGAGTGTC TATGGATCAC ATGTTCTCAG TGGTCTACAA 1860
CATGTCTCAG GTGACCACCA GCAGCTCCAT TCAATCGATC CGATTTCTCT ACTGAGCAGC 1920
TGCATCATGA GTTGGTTGTA CGAAATCGAC GCCCCCAACA AGTTTCGCTA CAAACCGGAG 1980
AACCACCAAA TTCTCCAGTT TTGTTTAGCA TATTTCAATC CGGTTGGATC GGTCCAAAGG 2040
AGGGTGCAGC TACTAGGGAT GCTCGAGAAT CAAATCACTG CGATATTGCC GGCCATGCTT 2100
TCCAAATCGG CTTCGTCCAT CTTAGTGTCG CCTTCTTCAA AACTGTTGAA CATTTTCAAC 2160
ATCCTCCACT CGATTTTCTC TGATCCTGGG TCTCGGGAAT CTCCGTCTTT CGATCTCCTC 2220
AACTCCACTC TTTCGAATTT GGTTGATTTG CTTCCACTGG GAACAGAAAC TGCTTACTTC 2280
GCGCTACCGG ACCATCAACC CATTCTAAAC TATGATGGAG GGACGACGTC TATGGCCTTG 2340
AAAAGTCAAC TAATTAACCA ACTCTTCTAT AATCTCAATC TTGACCACCT TCGCTTGAAG 2400
GTCAACCTTT GTAACTTTTT GCTGGGTCGC GCCGATAAAG CTAAATTCCC CTCATTGAGA 2460
TCTCGGCTCG TTGAGGCCAA AGTCTCTCTC TCACGAGTCA TCCATCGCCT GGTCTTGCAA 2520
ACCATTGCGC AGTTCTTTCT TCGACATGTA GGGCAACTTT CGAGCGATGA AAGGCCAGTT 2580
TTCAATCGAT ATCAATGTGC GACCAAAGCT ATTGACCGGT TCCCAGAGCC AACAATCGAG 2640
CAACTTTTGG AACCTGATGA GGTCGACCAC AAGCTGTTAG ACCAAGATCG GCTTTCATCG 2700
GCCCCTGATG ATTCACTGGG CAATGGAAAT CCTCTGATCT CGTTCGGAGG AGAAGAAAAC 2760
GAAGTGGAGC AGTGTCCCGC TTGTCTTCAA TCTGTTTGTT TTGATAGTCT GCGATTTGCG 2820
ATCTGTCGCG TTGGCCATGT TTGGGACCGA TGCTCAGTCA CCTTTCAAAT CTTATCGACG 2880
ATTAAAGTGA GGATCTGTAC CGGCTGCGGG AGAAAATCGA TGGTAAAAGG TGGTGGCGAG 2940
GATCAGACGA CCAGTCGACC TCCTGAGGCC AGTCAAGAAG GGGCTCAAAT GGAGACTGGT 3000
GATCATCAAC ATCCTTCAAC TTCTACTCAG CAGGAGGATG CGGATGTCGG TCGGCTGGGC 3060
AAGCCTTCGT TGGTTCAGGT CTTGCTGGAT TCATCCATCT GTTGTTGGCA TTGCGGTGGG 3120
AGATGGAGAC TTTCTTCTTG A 3141
Sequence Source Ensembl
Orthology
WERAM ID        Ensembl Protein ID  Species                     Identity  E-value  Score
WERAM-Mel-0033  EGG04126            Melampsora larici-populina  37        4e-28    124
WERAM-Miv-0015  MVLG_02319T0        Microbotryum violaceum      29        1e-19    96.7
WERAM-Lab-0048  EDR10144            Laccaria bicolor            35        1e-09    63.5
Created Date 25-Jun-2016