BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= YIR004W__[Saccharomyces_cerevisiae]
(432 letters)
Database: nr.pal
6,348,806 sequences; 2,166,943,470 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gi|6322194|ref|NP_012269.1| Cytosolic J-domain-containing p... 675 0.0
gi|45269663|gb|AAS56212.1| YIR004W [Saccharomyces cerevisiae] 674 0.0
gi|151943162|gb|EDN61497.1| dnaJ protein [Saccharomyces cer... 673 0.0
gi|156840966|ref|XP_001643860.1| hypothetical protein Kpol_... 456 e-126
gi|50290713|ref|XP_447789.1| unnamed protein product [Candi... 456 e-126
gi|45187888|ref|NP_984111.1| ADR015Wp [Ashbya gossypii ATCC... 424 e-117
gi|50307369|ref|XP_453663.1| unnamed protein product [Kluyv... 420 e-116
gi|50416962|ref|XP_457597.1| hypothetical protein DEHA0B150... 276 2e-72
gi|146417314|ref|XP_001484626.1| hypothetical protein PGUG_... 259 3e-67
gi|145246054|ref|XP_001395276.1| hypothetical protein An12g... 245 6e-63
gi|149246760|ref|XP_001527805.1| conserved hypothetical pro... 231 6e-59
gi|68489878|ref|XP_711232.1| peroxisomal protein import pro... 214 8e-54
gi|67525835|ref|XP_660979.1| hypothetical protein AN3375.2 ... 214 1e-53
gi|50421801|ref|XP_459458.1| hypothetical protein DEHA0E036... 209 4e-52
gi|146421607|ref|XP_001486748.1| hypothetical protein PGUG_... 207 1e-51
gi|19115249|ref|NP_594337.1| DNAJ domain protein Caj1/Djp1 ... 205 5e-51
gi|6320888|ref|NP_010967.1| Nuclear type II J heat shock pr... 202 4e-50
gi|156841245|ref|XP_001643997.1| hypothetical protein Kpol_... 199 5e-49
gi|126273851|ref|XP_001387305.1| DnaJ-like protein [Pichia ... 192 4e-47
gi|164656675|ref|XP_001729465.1| hypothetical protein MGL_3... 192 4e-47
gi|150864850|ref|XP_001383838.2| hypothetical protein PICST... 190 1e-46
gi|50290783|ref|XP_447824.1| unnamed protein product [Candi... 190 2e-46
gi|169867178|ref|XP_001840170.1| hypothetical protein CC1G_... 189 5e-46
gi|68478826|ref|XP_716575.1| peroxisomal protein import pro... 188 7e-46
gi|71016108|ref|XP_758866.1| hypothetical protein UM02719.1... 181 1e-43
gi|71010807|ref|XP_758417.1| hypothetical protein UM02270.1... 174 1e-41
gi|149234463|ref|XP_001523111.1| conserved hypothetical pro... 162 4e-38
gi|46135729|ref|XP_389556.1| hypothetical protein FG09380.1... 155 6e-36
gi|50308287|ref|XP_454145.1| unnamed protein product [Kluyv... 154 9e-36
gi|85094073|ref|XP_959815.1| hypothetical protein NCU06052 ... 153 2e-35
gi|116200638|ref|XP_001226131.1| hypothetical protein CHGG_... 152 7e-35
gi|27764299|emb|CAD60579.1| unnamed protein product [Podosp... 151 1e-34
gi|169596008|ref|XP_001791428.1| hypothetical protein SNOG_... 148 7e-34
gi|39958186|ref|XP_364390.1| hypothetical protein MGG_09235... 147 1e-33
gi|154322250|ref|XP_001560440.1| hypothetical protein BC1G_... 147 2e-33
gi|50555818|ref|XP_505317.1| hypothetical protein [Yarrowia... 146 4e-33
gi|156060771|ref|XP_001596308.1| hypothetical protein SS1G_... 145 4e-33
gi|119178585|ref|XP_001240954.1| hypothetical protein CIMG_... 144 1e-32
gi|115402369|ref|XP_001217261.1| conserved hypothetical pro... 140 2e-31
gi|121709452|ref|XP_001272423.1| DnaJ domain protein [Asper... 138 1e-30
gi|159122951|gb|EDP48071.1| DnaJ domain protein [Aspergillu... 137 1e-30
gi|70982562|ref|XP_746809.1| DnaJ domain protein [Aspergill... 137 1e-30
gi|119488622|ref|XP_001262761.1| DnaJ domain protein [Neosa... 137 1e-30
gi|169785547|ref|XP_001827234.1| [Aspergillus oryzae] >gi|8... 136 3e-30
gi|154271919|ref|XP_001536812.1| conserved hypothetical pro... 136 3e-30
gi|50292765|ref|XP_448815.1| unnamed protein product [Candi... 134 1e-29
gi|19112890|ref|NP_596098.1| DNAJ protein Caj1/Djp1-type [S... 132 4e-29
gi|164661659|ref|XP_001731952.1| hypothetical protein MGL_1... 132 7e-29
gi|170096332|ref|XP_001879386.1| predicted protein [Laccari... 124 2e-26
gi|125553182|gb|EAY98891.1| hypothetical protein OsI_020124... 113 3e-23
gi|125595081|gb|EAZ35140.1| hypothetical protein OsJ_018623... 111 8e-23
gi|125571724|gb|EAZ13239.1| hypothetical protein OsJ_003064... 110 2e-22
gi|169849199|ref|XP_001831303.1| hypothetical protein CC1G_... 108 6e-22
gi|115477372|ref|NP_001062282.1| Os08g0522600 [Oryza sativa... 108 1e-21
gi|169596010|ref|XP_001791429.1| hypothetical protein SNOG_... 107 1e-21
gi|115465213|ref|NP_001056206.1| Os05g0543700 [Oryza sativa... 107 1e-21
gi|125527401|gb|EAY75515.1| hypothetical protein OsI_003362... 107 2e-21
gi|15226572|ref|NP_179746.1| DNAJ heat shock N-terminal dom... 106 3e-21
gi|115479909|ref|NP_001063548.1| Os09g0493800 [Oryza sativa... 106 4e-21
gi|15234962|ref|NP_195626.1| DNAJ heat shock N-terminal dom... 105 5e-21
gi|58258647|ref|XP_566736.1| hypothetical protein [Cryptoco... 105 8e-21
gi|2230757|emb|CAA72705.1| dnaJ-like protein [Arabidopsis t... 103 2e-20
gi|157335094|emb|CAO60924.1| unnamed protein product [Vitis... 103 2e-20
gi|168040786|ref|XP_001772874.1| predicted protein [Physcom... 103 2e-20
gi|4680190|gb|AAD27555.1|AF111710_1 putative dnaJ-like prot... 103 3e-20
gi|168060184|ref|XP_001782078.1| predicted protein [Physcom... 102 4e-20
gi|115446689|ref|NP_001047124.1| Os02g0555700 [Oryza sativa... 102 5e-20
gi|58264958|ref|XP_569635.1| chaperone regulator [Cryptococ... 102 6e-20
gi|168010215|ref|XP_001757800.1| predicted protein [Physcom... 102 6e-20
gi|15223142|ref|NP_177796.1| DNAJ heat shock N-terminal dom... 102 6e-20
gi|45187762|ref|NP_983985.1| ADL111Wp [Ashbya gossypii ATCC... 102 6e-20
gi|125539880|gb|EAY86275.1| hypothetical protein OsI_007508... 102 7e-20
gi|125582507|gb|EAZ23438.1| hypothetical protein OsJ_006921... 101 8e-20
gi|168064859|ref|XP_001784375.1| predicted protein [Physcom... 101 9e-20
gi|168065214|ref|XP_001784549.1| predicted protein [Physcom... 100 2e-19
gi|147798803|emb|CAN63215.1| hypothetical protein [Vitis vi... 100 2e-19
gi|125604054|gb|EAZ43379.1| hypothetical protein OsJ_026862... 100 3e-19
gi|157340777|emb|CAO47582.1| unnamed protein product [Vitis... 100 4e-19
gi|118486373|gb|ABK95027.1| unknown [Populus trichocarpa] 100 4e-19
gi|125562232|gb|EAZ07680.1| hypothetical protein OsI_028912... 99 6e-19
gi|26449747|dbj|BAC41997.1| putative DnaJ protein [Arabidop... 99 6e-19
gi|57340266|gb|AAW50121.1| DnaJ-like protein [Brassica juncea] 98 1e-18
gi|123501575|ref|XP_001328100.1| DnaJ domain containing pro... 98 1e-18
gi|147818705|emb|CAN76186.1| hypothetical protein [Vitis vi... 98 1e-18
gi|170086698|ref|XP_001874572.1| predicted protein [Laccari... 97 2e-18
gi|18394951|ref|NP_564134.1| DNAJ heat shock N-terminal dom... 97 3e-18
gi|170098883|ref|XP_001880660.1| predicted protein [Laccari... 96 4e-18
gi|61842931|ref|XP_590020.1| PREDICTED: similar to DnaJ (Hs... 95 1e-17
gi|76655162|ref|XP_883889.1| PREDICTED: similar to DnaJ (Hs... 95 1e-17
gi|149642569|ref|NP_001092591.1| DnaJ (Hsp40) homolog, subf... 95 1e-17
gi|164655807|ref|XP_001729032.1| hypothetical protein MGL_3... 95 1e-17
gi|145354587|ref|XP_001421562.1| predicted protein [Ostreoc... 94 2e-17
gi|20067161|gb|AAM09527.1|AF490904_1 macrothioredoxin [Homo... 94 2e-17
gi|118093487|ref|XP_421968.2| PREDICTED: similar to DnaJ (H... 94 2e-17
gi|30268341|emb|CAD89982.1| hypothetical protein [Homo sapi... 94 2e-17
gi|30699227|ref|NP_177828.2| DNAJ heat shock N-terminal dom... 94 3e-17
gi|109100283|ref|XP_001102034.1| PREDICTED: similar to DnaJ... 94 3e-17
gi|119631367|gb|EAX10962.1| DnaJ (Hsp40) homolog, subfamily... 94 3e-17
gi|119631364|gb|EAX10959.1| DnaJ (Hsp40) homolog, subfamily... 94 3e-17
gi|114582040|ref|XP_515961.2| PREDICTED: DnaJ (Hsp40) homol... 94 3e-17
>gi|6322194|ref|NP_012269.1| Cytosolic J-domain-containing protein, required for peroxisomal
protein import and involved in peroxisome assembly,
homologous to E. coli DnaJ; Djp1p [Saccharomyces
cerevisiae]
gi|731907|sp|P40564|YIS4_YEAST Uncharacterized J domain-containing protein YIR004W
gi|557853|emb|CAA86206.1| unnamed protein product [Saccharomyces cerevisiae]
Length = 432
Score = 675 bits (1741), Expect = 0.0, Method: Composition-based stats.
Identities = 388/432 (89%), Positives = 388/432 (89%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD
Sbjct: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQK
Sbjct: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKTEELNAED 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
SPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN
Sbjct: 121 EAEKEKENVETMEESPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
EQVGA RVDQLSKTLIERLSILTESVYDDACKDSFKKKF
Sbjct: 181 EQVGAEAKKKKTKLEQFEEEQEVEKQKRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT
Sbjct: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH
Sbjct: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE
Sbjct: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
Query: 421 LVAEATKKKRHT 432
LVAEATKKKRHT
Sbjct: 421 LVAEATKKKRHT 432
>gi|45269663|gb|AAS56212.1| YIR004W [Saccharomyces cerevisiae]
Length = 432
Score = 674 bits (1740), Expect = 0.0, Method: Composition-based stats.
Identities = 387/432 (89%), Positives = 387/432 (89%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD
Sbjct: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQK
Sbjct: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKTEELNAED 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
SPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN
Sbjct: 121 EAEKEKENVETMEESPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
EQVGA RVDQLSKTLIERLSILTESVYDDACKDSFKKKF
Sbjct: 181 EQVGAEAKKKKTKLEQFEEEQEVEKQKRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT
Sbjct: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
VSAAIDAQNTMKELEKMKEASTNN PLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH
Sbjct: 301 VSAAIDAQNTMKELEKMKEASTNNGPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE
Sbjct: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
Query: 421 LVAEATKKKRHT 432
LVAEATKKKRHT
Sbjct: 421 LVAEATKKKRHT 432
>gi|151943162|gb|EDN61497.1| dnaJ protein [Saccharomyces cerevisiae YJM789]
Length = 432
Score = 673 bits (1736), Expect = 0.0, Method: Composition-based stats.
Identities = 387/432 (89%), Positives = 388/432 (89%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD
Sbjct: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQK
Sbjct: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKTEELNAED 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
SPADGKT+GTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN
Sbjct: 121 EAEKEKENVETMEESPADGKTDGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
EQVGA RVDQLSKTLIERLSILTESVYDDACKDSFKKKF
Sbjct: 181 EQVGAEAKKKKTKLEQFEEEQEVEKQKRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT
Sbjct: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH
Sbjct: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE
Sbjct: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
Query: 421 LVAEATKKKRHT 432
LVAEATKKKRHT
Sbjct: 421 LVAEATKKKRHT 432
>gi|156840966|ref|XP_001643860.1| hypothetical protein Kpol_499p30 [Vanderwaltozyma polyspora DSM
70294]
gi|156114487|gb|EDO16002.1| hypothetical protein Kpol_499p30 [Vanderwaltozyma polyspora DSM
70294]
Length = 408
Score = 456 bits (1173), Expect = e-126, Method: Composition-based stats.
Identities = 266/432 (61%), Positives = 316/432 (73%), Gaps = 24/432 (5%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDTEYYDLL + TA++IEIKKAYRKKSI+EHPDKNPNDPTATERFQAISEAYQVL D
Sbjct: 1 MVVDTEYYDLLDIDITATAIEIKKAYRKKSIKEHPDKNPNDPTATERFQAISEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
+LRA YDK+G+++AIP+GGFEDAAEQFS IFGG+AF YIGEL LLKNLQK
Sbjct: 61 KNLRANYDKFGKEKAIPKGGFEDAAEQFSAIFGGEAFIPYIGELTLLKNLQKTEEL---- 116
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
+ + + ++ +NK +T +V K +
Sbjct: 117 -----------------NAEDEAEKQREEEEKQKKEKEAKENKDSST---FSVSTDVKND 156
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
+V V++L +TLIE+LSILTES YD+ CK SF+KKF
Sbjct: 157 SEVKKDEPKKKTKMEQFEEEQQLEKDKTVEKLKQTLIEKLSILTESAYDEDCKMSFEKKF 216
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
EEEANLLKMESFGLDILHTIGD Y E+A IFL SQNLFG GG+F SMKAKGGV MDTLRT
Sbjct: 217 EEEANLLKMESFGLDILHTIGDAYCERARIFLGSQNLFGFGGMFQSMKAKGGVVMDTLRT 276
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
VSAAIDAQ+TMKELE+MK A+ ++EPL DK G E+ KPT EELA+QE LLMGKVLSAAWH
Sbjct: 277 VSAAIDAQHTMKELERMKLATESDEPLVDKHGKEEPKPTAEELAEQEHLLMGKVLSAAWH 336
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
GSK+EI STLR VC KVLED++V K TL++RAE++KLLG+VF++ +RTK EQEEAQ+FEE
Sbjct: 337 GSKFEIMSTLRAVCDKVLEDNTVDKGTLVKRAESLKLLGKVFQRAYRTKAEQEEAQVFEE 396
Query: 421 LVAEATKKKRHT 432
LVAEATKKK+H+
Sbjct: 397 LVAEATKKKQHS 408
>gi|50290713|ref|XP_447789.1| unnamed protein product [Candida glabrata]
gi|49527100|emb|CAG60738.1| unnamed protein product [Candida glabrata CBS 138]
Length = 425
Score = 456 bits (1173), Expect = e-126, Method: Composition-based stats.
Identities = 275/442 (62%), Positives = 320/442 (72%), Gaps = 27/442 (6%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVD+ YYDLLG+ TA+++EIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVL
Sbjct: 1 MVVDSTYYDLLGIGPTATAVEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLSS 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
++LRAKYDK+G++EAIP+GGFEDAAEQFS IFGG+AFASYIGEL LLKNLQK
Sbjct: 61 EELRAKYDKFGKQEAIPKGGFEDAAEQFSAIFGGEAFASYIGELTLLKNLQKTEELNAED 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVH-DGNKK 179
+ + E + + ++TVH D
Sbjct: 121 -----------------EAQKQKEAEEAQKRKEKEEEMKKNGHVQGSGQDITVHPDPEGT 163
Query: 180 NEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKK 239
+ A R+++LSKTLIERLSILTESVYDDACK+SF+KK
Sbjct: 164 KPKDDAVNQKKKTKLEEFEEQQKIEREKRIEELSKTLIERLSILTESVYDDACKNSFQKK 223
Query: 240 FEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLR 299
FEEEAN+LKMESFG+DILHTIGD+Y EKA+IFLASQNLFG GGIFHS+KAKGGV MDTLR
Sbjct: 224 FEEEANMLKMESFGVDILHTIGDIYCEKAKIFLASQNLFGFGGIFHSVKAKGGVLMDTLR 283
Query: 300 TVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNE---------QIKPTTEELAQQEQLL 350
TVSAAIDAQNTMKELEKMKEAST + K+ + + KPT EELAQQEQLL
Sbjct: 284 TVSAAIDAQNTMKELEKMKEASTEDTEENSKNQQKTETETTTAPKPKPTAEELAQQEQLL 343
Query: 351 MGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKV 410
MGKVLSAAWHG+K+E+TSTLR VC KVL+D + T I+RAEA++LLG+VF+KT+RTK
Sbjct: 344 MGKVLSAAWHGTKFEMTSTLRSVCDKVLDDQKIDLNTRIKRAEALRLLGKVFQKTYRTKS 403
Query: 411 EQEEAQIFEELVAEATKKKRHT 432
EQEEAQIFEELVAEATKK R+T
Sbjct: 404 EQEEAQIFEELVAEATKKHRNT 425
>gi|45187888|ref|NP_984111.1| ADR015Wp [Ashbya gossypii ATCC 10895]
gi|44982672|gb|AAS51935.1| ADR015Wp [Ashbya gossypii ATCC 10895]
Length = 436
Score = 424 bits (1089), Expect = e-117, Method: Composition-based stats.
Identities = 248/435 (57%), Positives = 299/435 (68%), Gaps = 5/435 (1%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YYDLLGVS A +IEIKKAYRKKS+QEHPDKNPNDP ATERFQAISEAYQVL
Sbjct: 1 MVVDTAYYDLLGVSPDAKAIEIKKAYRKKSVQEHPDKNPNDPKATERFQAISEAYQVLSS 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
D+LRAKYDK+G++EA+PQ GFEDA EQF+ IFGG+AFASYIGEL LLKN+QK
Sbjct: 61 DELRAKYDKFGKEEAVPQNGFEDAGEQFAAIFGGEAFASYIGELTLLKNIQKTEELVQQD 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
+ K G T A + G + + G K
Sbjct: 121 EEEKQREKQRVHEKTQDQKK--GATPQTGAPAAGEPAAAAERVKPNGRGAIEENGGAKAA 178
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXR---VDQLSKTLIERLSILTESVYDDACKDSFK 237
G + +D+LSK L +RLS++TES YD+ CK +F+
Sbjct: 179 SDKGDGETQDERKKTKLEQFEEQQRLDKEKMIDKLSKILCDRLSVVTESSYDEPCKRAFE 238
Query: 238 KKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDT 297
KKFEEEAN+LKMESFGLDILHTIG+VY +KAEIFL +Q + G+GG FHS++AK G +DT
Sbjct: 239 KKFEEEANMLKMESFGLDILHTIGEVYCQKAEIFLKNQRILGIGGFFHSVRAKCGFVVDT 298
Query: 298 LRTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSA 357
+RTVSAA+DAQNTM+ELEK+K A ++EPL D GNE KPT EELA EQL+MGKVLSA
Sbjct: 299 VRTVSAALDAQNTMQELEKLKLAVDSDEPLRDDKGNELPKPTVEELAHMEQLVMGKVLSA 358
Query: 358 AWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQI 417
AWHGSK+EI STL+ VC +VLED + +T IRRAEA+ +LG VFK+T+RT VEQE+AQ+
Sbjct: 359 AWHGSKFEIMSTLKSVCTRVLEDKNAELETRIRRAEALIMLGRVFKRTYRTPVEQEDAQV 418
Query: 418 FEELVAEATKKKRHT 432
FEEL AEATK K +
Sbjct: 419 FEELAAEATKNKSRS 433
>gi|50307369|ref|XP_453663.1| unnamed protein product [Kluyveromyces lactis]
gi|49642797|emb|CAH00759.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
Length = 433
Score = 420 bits (1080), Expect = e-116, Method: Composition-based stats.
Identities = 251/431 (58%), Positives = 304/431 (70%), Gaps = 9/431 (2%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YYDLLGV+T A I+IKKAYRKKS++EHPDKNP+DPTATERFQAISEAYQVL
Sbjct: 1 MVVDTTYYDLLGVATDAKQIDIKKAYRKKSVKEHPDKNPDDPTATERFQAISEAYQVLSS 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
++LR KYDK+G++EA+P+ GFEDA EQF+ IFGG+AF SYIGEL LLKN+Q
Sbjct: 61 EELRMKYDKFGKEEAMPKNGFEDAGEQFAAIFGGEAFTSYIGELTLLKNIQNTQELSEED 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
A K + ++ +N DD A ++ N T
Sbjct: 121 ERAKEQE---------ASEKATQQAHNLNNNNNASNGDDDTKSAEDSTNNGTAKKLTSGG 171
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
+ +++LSKTL +RLSILTES YDDACK+SF KKF
Sbjct: 172 DDSSNPETKKKGKLEEFEEQQMLDKEKSIEELSKTLSDRLSILTESAYDDACKESFDKKF 231
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
EEEAN+LKMESFGLDILHTIG++Y EKA IFL SQ L+G GG +HS+KAKGG+ MDT+RT
Sbjct: 232 EEEANMLKMESFGLDILHTIGEIYCEKANIFLKSQYLWGFGGFYHSVKAKGGLVMDTVRT 291
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
VSAA+DAQ+TM ELEK+KE + + EPL D+ GN KPT EELAQ EQLLMGKVLSAAW+
Sbjct: 292 VSAALDAQSTMTELEKLKETANSEEPLKDEAGNVVEKPTVEELAQLEQLLMGKVLSAAWY 351
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
GSK+EI STLR VC KVLED++ T IRRAEA+K LG+VF++ +RTK EQE+AQ+FEE
Sbjct: 352 GSKFEIMSTLRSVCDKVLEDETAEMSTRIRRAEALKRLGKVFRRAYRTKTEQEDAQVFEE 411
Query: 421 LVAEATKKKRH 431
LVAEA+KKK H
Sbjct: 412 LVAEASKKKGH 422
>gi|50416962|ref|XP_457597.1| hypothetical protein DEHA0B15048g [Debaryomyces hansenii CBS767]
gi|49653262|emb|CAG85608.1| unnamed protein product [Debaryomyces hansenii CBS767]
Length = 523
Score = 276 bits (707), Expect = 2e-72, Method: Composition-based stats.
Identities = 193/502 (38%), Positives = 268/502 (53%), Gaps = 76/502 (15%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YYDLL + A+S++IKKAYRK +I+ HPDKNP DPTA +FQ + EAYQVL D
Sbjct: 1 MVVDTTYYDLLSLQPDATSLDIKKAYRKAAIKLHPDKNPGDPTAAAKFQEVGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
D+LR+KYDKYG++E+IP GFED +E FS+IFGGDAF +IGEL LL+ L K
Sbjct: 61 DNLRSKYDKYGKQESIPSEGFEDPSEFFSMIFGGDAFKDWIGELSLLQELSKSAELSGYG 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGT--------TNAVDAALGNTNEKDDKN---------- 162
+ +T+ + T N+N +D+++
Sbjct: 121 DEEKKDSTEEEKTGDDSKTQTSESKAQDATASTTTTGDTTKNSNSQDEEDVKLRDKTQKL 180
Query: 163 --KARTTSGNLTVHDGNKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRV-DQLSKTLIER 219
+ + GN+ +DG++K + ++LSK L ++
Sbjct: 181 YLEDTSGGGNVGTNDGDRKELTKEEKEELKRKEELEKFEEECRIKKIEMREELSKKLTDK 240
Query: 220 LSILTESVYDDACKDSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFG 279
LS+ TE+ + +SFK+K + EA LKMESFGL+ILHTIG VY K +IFL +Q G
Sbjct: 241 LSLFTETDMKEDVAESFKQKLKYEAESLKMESFGLEILHTIGSVYKTKLKIFLKNQTFLG 300
Query: 280 MGGIFHSMKAKGGVFMDTLRTVSAAIDAQNTMKELEKM---------KEASTNNEPLFDK 330
GG++ S+K KGGV DT +T+++A+DAQ TM+E KM KEA E
Sbjct: 301 FGGLWWSVKEKGGVVRDTFKTITSALDAQLTMQEYAKMQEDNEYHAKKEAEEQAEAKQKA 360
Query: 331 DGNEQIKP----------------------------------------------TTEELA 344
+ +QI+ T E++A
Sbjct: 361 EEVDQIEKEIEEMKKEKEQKEAAAANNSADKVVKEGEGIKSDEKKEEEKVPEKHTAEDIA 420
Query: 345 QQEQLLMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKK 404
E+ LMGKVL+AAW GS++EI ST+RGVC +L+D V + RA+ ++L+GEVF
Sbjct: 421 DMEKYLMGKVLAAAWSGSRFEIQSTVRGVCDNILQDKEVPLNVRVARAKGLRLIGEVFSS 480
Query: 405 TFRTKVEQEEAQIFEELVAEAT 426
RT+ E EEA++FEELVAEAT
Sbjct: 481 ITRTEAEDEEARVFEELVAEAT 502
>gi|146417314|ref|XP_001484626.1| hypothetical protein PGUG_02355 [Pichia guilliermondii ATCC 6260]
gi|146390099|gb|EDK38257.1| hypothetical protein PGUG_02355 [Pichia guilliermondii ATCC 6260]
Length = 507
Score = 259 bits (662), Expect = 3e-67, Method: Composition-based stats.
Identities = 182/491 (37%), Positives = 255/491 (51%), Gaps = 63/491 (12%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YYDLLGV+T A+S+EIKKAYRK +I+ HPDKNP+DP A +FQ + EAYQVL D
Sbjct: 1 MVVDTTYYDLLGVATDATSLEIKKAYRKAAIRLHPDKNPDDPQAAAKFQEVGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
D+LR+KYDK+G++E+IP GFED +E FS IFGG+AF +IGEL L++ L K
Sbjct: 61 DNLRSKYDKHGKQESIPSEGFEDPSEFFSAIFGGEAFRPWIGELSLIQELTKSAELSGYG 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
P + T + + +E + + + +K+
Sbjct: 121 EDEATKTGGESTGEDPGMNRDEKTRLFI-LHEADKSEVVGEGSGEKNDDKVATVNADKEL 179
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
Q +L+K L+++LS+ TE+ +SF++K
Sbjct: 180 TQKEKEEQRRKEELEKFEEECAQKKIETRKELTKNLVDKLSLFTETDMKPDVVESFRQKL 239
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
EA LKMESFGL+ILHT+G +Y K +IFL Q G+GG++ SMK KGG+ +T T
Sbjct: 240 HYEAESLKMESFGLEILHTLGLIYKSKLKIFLKKQTFMGLGGLWWSMKEKGGMVKETFST 299
Query: 301 VSAAIDAQNTMKELEKMKE----------------ASTNNEPLFD----KDGNEQI---- 336
++ A+DAQ TM+E KM+E A TN+E D K+ EQ
Sbjct: 300 ITTALDAQLTMQEYAKMQEDNEYHAKKEEEEAKNAAGTNSESETDDKAAKEAKEQATKDM 359
Query: 337 --------------------------------------KPTTEELAQQEQLLMGKVLSAA 358
K + EEL E+ L+ KVL+AA
Sbjct: 360 NDISKKLEEIRVLAEEEEQKKEGDVTEKKGAKAEVIPQKHSKEELEDMERYLLAKVLAAA 419
Query: 359 WHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIF 418
W GSK+EI T+RGVC +L + V I +A +++G+VF K RT+ E EEA++F
Sbjct: 420 WSGSKFEIQGTVRGVCDNILYNKDVPLDKRIAQANGPRIIGDVFSKITRTEEEDEEARVF 479
Query: 419 EELVAEATKKK 429
EE+VA A+KK+
Sbjct: 480 EEIVATASKKR 490
>gi|145246054|ref|XP_001395276.1| hypothetical protein An12g01900 [Aspergillus niger]
gi|134079988|emb|CAK48472.1| unnamed protein product [Aspergillus niger]
Length = 474
Score = 245 bits (625), Expect = 6e-63, Method: Composition-based stats.
Identities = 165/461 (35%), Positives = 234/461 (50%), Gaps = 49/461 (10%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV DT YYD LGV TA+ +EIKKAYRK +I HPDKNP D TA RFQ I EAYQVL D
Sbjct: 1 MVADTSYYDALGVPPTATELEIKKAYRKLAIIHHPDKNPGDETAHARFQEIGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNL--------QK 112
++LR +YDK+G+++A+P GGFED +E F +IFGG+AF IGE+ L+K+L Q+
Sbjct: 61 EELRKRYDKFGKEDAVPGGGFEDPSEFFGMIFGGNAFVDLIGEISLMKDLTTTMDITMQQ 120
Query: 113 XXXXXXXXXXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLT 172
A+G T G A A + A +SG T
Sbjct: 121 MEEEELAASAEEKLNIHDETQAHAAEGTTEGAAQATPAESTTAASEKPAAAASPSSGTST 180
Query: 173 ---------VHDGNKKNEQV-------------------GAXXXXXXXXXXXXXXXXXXX 204
+ D +++ ++ G
Sbjct: 181 PRRYMGQQAIMDKSEEEARMEAAGLSQEEKELRKKEKKKGGLSREQQERLAAYELERKKA 240
Query: 205 XXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKFEEEANLLKMESFGLDILHTIGDVY 264
RV+ L+ L++++S+ TE+ +F++K E LKMESFGL+ILH IG Y
Sbjct: 241 REERVNTLATKLVDKISVWTETDKSPDMTRAFEEKIRLEVENLKMESFGLEILHAIGATY 300
Query: 265 YEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRTVSAAIDAQNTMKELEKMKEASTNN 324
+KA FL SQ G+ G F +K KG + +T T+S AIDAQ TM+E+ K++E +
Sbjct: 301 VQKATSFLKSQKFLGISGFFSRLKDKGTLAKETWTTISTAIDAQMTMEEMAKLEERGGED 360
Query: 325 EPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVS 384
T E+ A+ E+ + GK+L+AAW GSK+EI S LR VC +VL D +
Sbjct: 361 W-------------TDEKKAEYEKKVTGKILAAAWRGSKFEIQSVLRDVCDQVLGDKRIK 407
Query: 385 KKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEELVAEA 425
+ RA A+ + G ++ K R E+ + FE+L+AE+
Sbjct: 408 LDKRVERAHALVIAGNIYSKAERDPDEEGDFMAFEQLMAES 448
>gi|149246760|ref|XP_001527805.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
gi|146447759|gb|EDK42147.1| conserved hypothetical protein [Lodderomyces elongisporus NRRL
YB-4239]
Length = 460
Score = 231 bits (590), Expect = 6e-59, Method: Composition-based stats.
Identities = 153/342 (44%), Positives = 198/342 (57%), Gaps = 25/342 (7%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDTEYYDLLG+ TA+S+EIKKAYRK +I+ HPDKNPNDPTA +FQ + +AYQVL D
Sbjct: 1 MVVDTEYYDLLGIEVTATSLEIKKAYRKAAIRLHPDKNPNDPTAAAKFQEVGQAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
D LRAKYDKYG++E+IP GFED AE FS+IFGG+AF +IGEL LL+ L K
Sbjct: 61 DALRAKYDKYGKQESIPSEGFEDPAEFFSMIFGGEAFKDWIGELSLLQELTKSAELSGYG 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTN-EKDDKN----------------- 162
+ +T G + + GN KD KN
Sbjct: 121 DDDDDNNGKDDVTKK-EENETQGKNEKKEGSGGNEKGAKDSKNTPTYSEHAAQESQDTDE 179
Query: 163 KARTTSGNLTV-----HDGNKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLI 217
+ ++S N + HD K + R ++L+K LI
Sbjct: 180 RPFSSSSNQKLLSNAAHDDTKLTRKEQEEKRRKEELEKFEEECRIKKIETR-NELAKKLI 238
Query: 218 ERLSILTESVYDDACKDSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNL 277
E+LS+LTE+ D +SFK K + EA LKMESFGL+ILHTIG VY K++I L +Q
Sbjct: 239 EKLSLLTETDMKDDVVESFKAKIKYEAESLKMESFGLEILHTIGLVYKSKSKILLKNQTF 298
Query: 278 FGMGGIFHSMKAKGGVFMDTLRTVSAAIDAQNTMKELEKMKE 319
FG GG++ S+K KGGV DT RT+SAA+DAQ TM+E +M++
Sbjct: 299 FGWGGLWASVKEKGGVVKDTFRTISAALDAQRTMEEYAQMQQ 340
Score = 38.9 bits (89), Expect = 0.80, Method: Composition-based stats.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 4/82 (4%)
Query: 295 MDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFD--KDGNEQIKPT--TEELAQQEQLL 350
+D + A+ T+ E+ K T NE L + K Q +PT T E + ++L
Sbjct: 372 LDKVHREQEEAQAKETLGEIPKETSKDTVNETLGEDSKGLKPQAEPTKHTAEDWPRCKVL 431
Query: 351 MGKVLSAAWHGSKYEITSTLRG 372
+ KVL+AAW+GSK+EI TLR
Sbjct: 432 LAKVLAAAWNGSKFEIQGTLRA 453
>gi|68489878|ref|XP_711232.1| peroxisomal protein import protein [Candida albicans SC5314]
gi|68489923|ref|XP_711209.1| peroxisomal protein import protein [Candida albicans SC5314]
gi|46432491|gb|EAK91970.1| potential peroxisomal protein import protein [Candida albicans
SC5314]
gi|46432516|gb|EAK91994.1| potential peroxisomal protein import protein [Candida albicans
SC5314]
Length = 508
Score = 214 bits (546), Expect = 8e-54, Method: Composition-based stats.
Identities = 145/322 (45%), Positives = 189/322 (58%), Gaps = 9/322 (2%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YYDLL + T+A+S+EIKKAYRK +I+ HPDKNPNDP A +FQ + EAYQVL D
Sbjct: 1 MVVDTTYYDLLNIETSATSLEIKKAYRKAAIKLHPDKNPNDPDAAAKFQEVGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
+ LRAKYDKYG++E+IPQ GFED AE FS+IFGG+AF +IGEL LL L K
Sbjct: 61 ETLRAKYDKYGKQESIPQEGFEDPAEFFSMIFGGEAFKDWIGELSLLSELSKSAELSGYS 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
+ K +N +T E K+ +T L + N
Sbjct: 121 DNADKKKDTKTAGTGSDESK---ESNESKKQKLDTTENGHKS---STEPRLLANGENTPG 174
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVD---QLSKTLIERLSILTESVYDDACKDSFK 237
Q+ +++ +LSK LI +LS+ TE+ D +SF+
Sbjct: 175 NQLTEKELEEKKRKEELEKFEEECRVKKIETRNELSKKLINKLSLFTETDMKDDVVESFQ 234
Query: 238 KKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDT 297
K + EA LKMESFGL+ILHTIG +Y K++IFL +Q FG GG + SMK KGGV DT
Sbjct: 235 TKIKYEAESLKMESFGLEILHTIGHIYKTKSKIFLKNQTFFGWGGFWWSMKEKGGVVKDT 294
Query: 298 LRTVSAAIDAQNTMKELEKMKE 319
+TVSAA+DAQ TM+E +M++
Sbjct: 295 FKTVSAALDAQRTMEEYTQMQQ 316
Score = 100 bits (249), Expect = 2e-19, Method: Composition-based stats.
Identities = 57/114 (50%), Positives = 80/114 (70%)
Query: 316 KMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWHGSKYEITSTLRGVCK 375
++KE + + E K T EELA+ E+ LMGKVL+AAW+GSK+EI T+R VC
Sbjct: 375 QLKEGQEAEGSVSKFETKEPAKHTAEELAEMEKYLMGKVLAAAWNGSKFEIQGTVRAVCD 434
Query: 376 KVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEELVAEATKKK 429
+LED VS +T + RA+A++L+G+VF RT+ E EEA++FEELVAEA+KK+
Sbjct: 435 NILEDKDVSLETRVARAKALRLIGDVFVSMTRTEAEAEEARVFEELVAEASKKR 488
>gi|67525835|ref|XP_660979.1| hypothetical protein AN3375.2 [Aspergillus nidulans FGSC A4]
gi|40744163|gb|EAA63343.1| hypothetical protein AN3375.2 [Aspergillus nidulans FGSC A4]
Length = 466
Score = 214 bits (545), Expect = 1e-53, Method: Composition-based stats.
Identities = 160/451 (35%), Positives = 224/451 (49%), Gaps = 42/451 (9%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YYD LGV TA+ +EIKKAYRK ++ HPDKNP D TA ERFQAI EAYQVL D
Sbjct: 1 MVVDTSYYDALGVPPTATELEIKKAYRKLAVVTHPDKNPGDETAHERFQAIGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNL-QKXXXXXXX 119
+LR +YD +G++ A+P GFED E F +IFGGDAF IGE+ LL++L +
Sbjct: 61 AELRKRYDTHGKEGAVPDQGFEDPNEFFGMIFGGDAFYDLIGEISLLQDLTTRMEITTEE 120
Query: 120 XXXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTS-----GNLTVH 174
+ +G + G T + A + + T++ G +
Sbjct: 121 AEEDLAASTEEKLNINEQEGTSVGETTSSGAGSRASAASPSPAASGTSTPRPRLGQQAIM 180
Query: 175 DGNKKNEQV---GAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTE------ 225
D K +E++ A + ++L +ER E
Sbjct: 181 D--KSDEEIRMQAAGVTEEERELRKKEKKKGGLTREQAERLQAFELERQKAREERVDMLA 238
Query: 226 -------SVYDDACKDS-FKKKFEEEANL----LKMESFGLDILHTIGDVYYEKAEIFLA 273
SV+ + K + + FEE+ L LK++SFG++ILH IG Y KA FL
Sbjct: 239 TKLIDKISVWTETDKGADVTRAFEEKIKLEVENLKIQSFGIEILHAIGATYVSKATSFLK 298
Query: 274 SQNLFGMGGIFHSMKAKGGVFMDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGN 333
SQ G+ G F +K KG + + T+S IDAQ TM+E+ K++E N
Sbjct: 299 SQKFLGISGFFSRLKDKGTLAKEAWTTISTVIDAQLTMEEMAKLEEKGGENW-------- 350
Query: 334 EQIKPTTEELAQQEQLLMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAE 393
T E A+ + GK+L+AAW GSK EI S LR VC KVL D + + I RA
Sbjct: 351 -----TDEMRAEYSVKVTGKLLAAAWRGSKLEIQSVLRDVCDKVLGDKKIKLEKRIERAH 405
Query: 394 AMKLLGEVFKKTFRTKVEQEEAQIFEELVAE 424
AM + G ++ K R ++ + FE+L+A+
Sbjct: 406 AMIIAGNIYSKAERDPDDEGDYMAFEQLMAD 436
>gi|50421801|ref|XP_459458.1| hypothetical protein DEHA0E03685g [Debaryomyces hansenii CBS767]
gi|49655126|emb|CAG87674.1| unnamed protein product [Debaryomyces hansenii CBS767]
Length = 451
Score = 209 bits (531), Expect = 4e-52, Method: Composition-based stats.
Identities = 155/437 (35%), Positives = 227/437 (51%), Gaps = 29/437 (6%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
M+ DT+YYD+LGV TA+ +E+KKAYRK++I+ HPDKN NDP A +FQ + EAY +L D
Sbjct: 1 MIKDTKYYDILGVEPTATDVELKKAYRKQAIKCHPDKNGNDPDAAAKFQELGEAYGILQD 60
Query: 61 DDLRAKYDKYG----RKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXX 116
+ RA YD+ G + + D AE FS+IFGG+ F +IGEL +L + K
Sbjct: 61 KEKRALYDEMGVEGMQSNNVAGEADIDPAEFFSMIFGGEVFKDWIGELSMLNEVSKTADI 120
Query: 117 XXXXXXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDG 176
+ ++ T + +A D N EKDD L+
Sbjct: 121 LGDEEGTELESQTADSTTATSEVATQ-SESASDVTKTN-EEKDDI---------LSTEAI 169
Query: 177 NKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSF 236
NKK +Q RV LS+ L+ R+ T + + D F
Sbjct: 170 NKKKKQ--KMTQHQREEILKLHEETKKAQEERVRVLSENLLSRIEQYTSASTNQDSLDRF 227
Query: 237 KKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMD 296
K K EE LK+ESFG+++LH IG +Y +A + S FG+ IF S+K+K F +
Sbjct: 228 KTKLNEELEDLKIESFGIELLHLIGKIYTNQAHATINSCKTFGVSKIFSSVKSKTNSFKN 287
Query: 297 TLRTVSAAIDAQNT----MKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMG 352
+ A+DAQ + ++E E ++EA E L D + Q+ + E+L+ G
Sbjct: 288 GFSILKTALDAQASVEAMVREQEDIQEAIEKGEELSDSQKHRQV--------EMERLITG 339
Query: 353 KVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQ 412
KVL+AAW +K+E+T L VC +VL D S+ KK I R++A+ +GE K R+ E
Sbjct: 340 KVLAAAWASTKFEVTGILNKVCTRVLNDKSLGKKVRISRSQAVLYIGETMLKVQRSPEEA 399
Query: 413 EEAQIFEELVAEATKKK 429
E+A+IFEE++A+AT KK
Sbjct: 400 EDARIFEEMMADATAKK 416
>gi|146421607|ref|XP_001486748.1| hypothetical protein PGUG_00125 [Pichia guilliermondii ATCC 6260]
gi|146387869|gb|EDK36027.1| hypothetical protein PGUG_00125 [Pichia guilliermondii ATCC 6260]
Length = 456
Score = 207 bits (528), Expect = 1e-51, Method: Composition-based stats.
Identities = 147/435 (33%), Positives = 224/435 (51%), Gaps = 19/435 (4%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV DT YYD+LGVS TA+ +E+KKAYRK++I+ HPDKN NDP A +FQ + EAY +L +
Sbjct: 1 MVKDTTYYDILGVSPTATDVELKKAYRKQAIKLHPDKNANDPNAAAKFQELGEAYGILQN 60
Query: 61 DDLRAKYDKYGRK--EAIPQGGFE---DAAEQFSVIFGGDAFASYIGELMLLKNLQKXXX 115
DLRA YD+ G + + P+ G D +E F ++FGGD+F +IGEL +L + K
Sbjct: 61 ADLRATYDEVGIEGLKNNPEAGEAADIDPSEFFGMVFGGDSFKDWIGELSMLNEMAKTAE 120
Query: 116 XXXXXXXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGN--TNEKDDKNKARTTSGNLTV 173
+ T ++ + G + ++ TS + +
Sbjct: 121 VLGDEEDKEGGKPESVQGADSSASATGAGSSTAASGTGTDVVHHNGEQLDVSHTSDSHML 180
Query: 174 HDGNKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACK 233
+ + RV++LSK LI R+ + +
Sbjct: 181 SSEEIERRKKKKISKQQREEILRLHDEAKQAKRLRVEELSKVLIARIEKYNSAKANPDGL 240
Query: 234 DSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGV 293
SF K +E LK+ESFGL++LH IG +Y +A + S FG+ I+ S+K K
Sbjct: 241 ASFTAKLNQELEDLKIESFGLELLHLIGKIYTNQANAAIRSSKTFGVSKIYSSVKQKTDT 300
Query: 294 FMDTLRTVSAAIDAQNTM----KELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQL 349
+ V +A+DAQ++M KE E+M E N L D + ++Q+ + E+L
Sbjct: 301 VKNGYSIVKSALDAQSSMEAMVKEQEEMAERRDPNVELTDSEKSQQV--------EMEKL 352
Query: 350 LMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTK 409
+MGK L+ AW +K+E+T L VC+KVL+D +SKK + RA+A+ LG+ K R+
Sbjct: 353 MMGKFLATAWASTKFEVTGVLNKVCEKVLQDKLLSKKERLSRADALLYLGKHLLKVERSA 412
Query: 410 VEQEEAQIFEELVAE 424
E EEA+IFE+++AE
Sbjct: 413 DEAEEARIFEDIMAE 427
>gi|19115249|ref|NP_594337.1| DNAJ domain protein Caj1/Djp1 type [Schizosaccharomyces pombe
972h-]
gi|1723277|sp|Q10209|YAY1_SCHPO Uncharacterized J domain-containing protein C4H3.01
gi|1184014|emb|CAA93340.1| DNAJ domain protein Caj1/Djp1 type [Schizosaccharomyces pombe]
Length = 392
Score = 205 bits (522), Expect = 5e-51, Method: Composition-based stats.
Identities = 139/429 (32%), Positives = 230/429 (53%), Gaps = 53/429 (12%)
Query: 3 VDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPT-ATERFQAISEAYQVLGDD 61
VDTEYYDLLG+ST A++++IKKAYRK +++ HPDKNP+DP A+E+FQ ISEAYQVLGD+
Sbjct: 5 VDTEYYDLLGISTDATAVDIKKAYRKLAVKYHPDKNPDDPQGASEKFQKISEAYQVLGDE 64
Query: 62 DLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXXX 121
LR++YD++G+++A+P+ GF DA + F+ +FGG F ++GEL +K + +
Sbjct: 65 KLRSQYDQFGKEKAVPEQGFTDAYDFFTNLFGGAPFREWVGELSFVKEMFREED------ 118
Query: 122 XXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKNE 181
+AV+ N ++ + T + KKN
Sbjct: 119 ------------------------SAVEQGQMNDKQQLLLESSEPTPTIKQQFNDRKKNA 154
Query: 182 QVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKFE 241
Q+ R+ ++++ L +RL + ++ ++K+
Sbjct: 155 QI-----REREALAKREQEMIEDRRQRIKEVTENLEKRLDDWIAKATTEEGLNALREKYT 209
Query: 242 EEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRTV 301
+EAN L++ESFG++ILH IG+VY +K L S FG+GG + MK KG + T TV
Sbjct: 210 QEANTLRIESFGVEILHAIGEVYTQKGRTVLKSSK-FGIGGFWSRMKEKGKIARATWDTV 268
Query: 302 SAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWHG 361
SAA+DA+ ++ +++K+++ + + + EE A+ E + GK+L A+W G
Sbjct: 269 SAAMDAKLSIDQMQKLEDKGED-------------QASAEERAKLELDITGKILRASWCG 315
Query: 362 SKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEEL 421
++Y+I LR C +L+ V + ++RA A+ +G +F + +IFE L
Sbjct: 316 ARYDIQGVLREACSNLLK-KRVPTELRLKRAHALLEIGTIFSNV--EADPDDPNRIFENL 372
Query: 422 VAEATKKKR 430
+ E KK++
Sbjct: 373 ILENKKKRK 381
>gi|6320888|ref|NP_010967.1| Nuclear type II J heat shock protein of the E. coli dnaJ family,
contains a leucine zipper-like motif, binds to
non-native substrates for presentation to Ssa3p, may
function during protein translocation, assembly and
disassembly; Caj1p [Saccharomyces cerevisiae]
gi|729007|sp|P39101|CAJ1_YEAST Protein CAJ1
gi|560126|dbj|BAA04700.1| CAJ1 [Saccharomyces cerevisiae]
gi|603281|gb|AAB64583.1| Caj1p [Saccharomyces cerevisiae]
gi|151944759|gb|EDN63018.1| conserved protein [Saccharomyces cerevisiae YJM789]
Length = 391
Score = 202 bits (514), Expect = 4e-50, Method: Composition-based stats.
Identities = 145/435 (33%), Positives = 222/435 (51%), Gaps = 50/435 (11%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV +TEYYD+LG+ A+ EIKKAYR+K+++ HPDK+P+DP A +FQA+ EAYQVL D
Sbjct: 1 MVKETEYYDILGIKPEATPTEIKKAYRRKAMETHPDKHPDDPDAQAKFQAVGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
LR+KYD++G+++A+PQ GFEDA+E F+ IFGGD F +IGE L K L +
Sbjct: 61 PGLRSKYDQFGKEDAVPQQGFEDASEYFTAIFGGDGFKDWIGEFSLFKELNEAT------ 114
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
+ G + A T + D+ +T G + HD NK
Sbjct: 115 -------------------EMFGKEDEEGTAATETEKADE-----STDGGMVKHDTNKAE 150
Query: 181 E-QVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKK 239
+ +VD+L++ L E++S +V + ++ F +K
Sbjct: 151 SLKKDKLSKEQREKLMEMEKKRREDMMKQVDELAEKLNEKISRYLIAVKSNNLEE-FTRK 209
Query: 240 FEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLR 299
++E LK+ESFGL++L+ + VY KA F+ S+ +G+ IF +
Sbjct: 210 LDQEIEDLKLESFGLELLYLLARVYKTKANNFIMSKKTYGISKIFTGTRDNARSVKSAYN 269
Query: 300 TVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAW 359
+S ++AQ K +EKM E +T+ +++ A+ E + GK L W
Sbjct: 270 LLSTGLEAQ---KAMEKMSEVNTDELDQYER-------------AKFESTMAGKALGVMW 313
Query: 360 HGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFE 419
SK+E+ L+ VC K+L D V K I +A+AM + F R+ E EEA++FE
Sbjct: 314 AMSKFELERKLKDVCNKILNDKKVPSKERIAKAKAMLFIAHKFASARRSPEEAEEARVFE 373
Query: 420 ELVAEATKK--KRHT 432
EL+ +K K+HT
Sbjct: 374 ELILGEQEKEHKKHT 388
>gi|156841245|ref|XP_001643997.1| hypothetical protein Kpol_1070p22 [Vanderwaltozyma polyspora DSM
70294]
gi|156114629|gb|EDO16139.1| hypothetical protein Kpol_1070p22 [Vanderwaltozyma polyspora DSM
70294]
Length = 380
Score = 199 bits (505), Expect = 5e-49, Method: Composition-based stats.
Identities = 140/430 (32%), Positives = 212/430 (49%), Gaps = 51/430 (11%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV DT+YYD+LGV A+ EIKKAYR+++++ HPDK+P+DP A +FQA+ EAYQVL D
Sbjct: 1 MVKDTQYYDILGVKPEATPAEIKKAYRRRAMETHPDKHPDDPEAQSKFQAVGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
LR++YD++G+ +A+PQ GFEDA E F+ IFGGD F +IGE L K L +
Sbjct: 61 PGLRSRYDEFGKDDAVPQHGFEDATEFFTTIFGGDGFKDWIGEFSLFKELNE-------- 112
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
P G T ++ N + D K A G LT +K
Sbjct: 113 -----AVEGFDENGQPTTGGPGATDDS------NMVKHDGKASAADRKGKLTKEQRDKLM 161
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
E +V++LS L +L + + D F+ K
Sbjct: 162 EM---------------EQKRREDIARQVNELSLKLDAKLKNYLLASREKHL-DEFQLKL 205
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
++E LK+ESFG+++LH + VY KA F+ S+ G +F + T
Sbjct: 206 DQEIEELKLESFGMELLHVLAKVYKNKANNFIMSKKTHGFSKLFTGPRDNARSVKQTYNL 265
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
+S ++AQ TM+++ ++ N E L E A+ E ++ GK L W
Sbjct: 266 LSTGLEAQKTMEQMSEV-----NPEEL-----------DQYERAKFESMMAGKALGVMWA 309
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
SK+E+ L+ VC ++L D +V K + +A+AM + F++ R+ E EEA++FEE
Sbjct: 310 MSKFELERKLKEVCSRILTDRNVPSKERLAKAKAMLYFADKFERAKRSPEEAEEARVFEE 369
Query: 421 LVAEATKKKR 430
++ K R
Sbjct: 370 MILGEKAKHR 379
>gi|126273851|ref|XP_001387305.1| DnaJ-like protein [Pichia stipitis CBS 6054]
gi|126213175|gb|EAZ63282.1| DnaJ-like protein [Pichia stipitis CBS 6054]
Length = 460
Score = 192 bits (488), Expect = 4e-47, Method: Composition-based stats.
Identities = 151/437 (34%), Positives = 226/437 (51%), Gaps = 22/437 (5%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV DT YYD+LGV TA+++E+KKAYRK++I+ HPDKN NDP A +FQ + EAY VL D
Sbjct: 1 MVKDTTYYDILGVEPTATAVELKKAYRKQAIKLHPDKNANDPQAAAKFQELGEAYGVLQD 60
Query: 61 DDLRAKYDKYGRK--EAIPQGGFE---DAAEQFSVIFGGDAFASYIGELMLLKNLQKXXX 115
+ RA YD+ G + + GG + D E F +IFGG++F +IGEL +LK + +
Sbjct: 61 SNSRAAYDELGVEGMKKSDVGGVDQDVDPVEMFGMIFGGNSFNEWIGELSMLKEVSQ--- 117
Query: 116 XXXXXXXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSG---NLT 172
S G + + + ++ + + D NK TSG LT
Sbjct: 118 ------TAEVLDEKEDDTISIDSGNGSVSGSGLEGKVAELSVSDQDNKVNQTSGANVELT 171
Query: 173 VHDGNKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDAC 232
NKK +Q RVD+LSK LI R+ +V +
Sbjct: 172 SESINKKKKQ--RMTPEQRAEILRLHEESKKAKQARVDELSKNLISRIEKYQSAVTNKDS 229
Query: 233 KDSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGG 292
F+ K +E LK+ESFG+ +LH +G +Y +A + + FG+ IF S+K K
Sbjct: 230 LAQFQSKLLQEFEDLKIESFGIQLLHLMGKIYTHQANATIQASRTFGVSKIFTSVKTKTD 289
Query: 293 VFMDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMG 352
+ + +DAQ +++E+ K +EA+ G E + A+ E+ +MG
Sbjct: 290 NVKNGYNILKTGLDAQASVEEMVKEQEAAQAAA---LASGEELSELERYRQAEMEKFIMG 346
Query: 353 KVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQ 412
K L+ AW +K+E+T L V VL D +SKK ++RAEA+ + ++ + RT E+
Sbjct: 347 KFLATAWATTKFEVTGILNKVSNVVLNDKKLSKKERVKRAEAVLYMAKLMSQMKRTAEEE 406
Query: 413 EEAQIFEELVAEATKKK 429
EEAQIFE+++AEAT KK
Sbjct: 407 EEAQIFEQMMAEATAKK 423
>gi|164656675|ref|XP_001729465.1| hypothetical protein MGL_3500 [Malassezia globosa CBS 7966]
gi|159103356|gb|EDP42251.1| hypothetical protein MGL_3500 [Malassezia globosa CBS 7966]
Length = 448
Score = 192 bits (488), Expect = 4e-47, Method: Composition-based stats.
Identities = 133/433 (30%), Positives = 215/433 (49%), Gaps = 34/433 (7%)
Query: 2 VVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGDD 61
+ D EYY+LLGV A+ +++KKAYRK +I+ HPDK ++ E+F+ I EAY+VL D
Sbjct: 20 IADMEYYELLGVRGDATELDLKKAYRKAAIRNHPDKGGDE----EKFKMIGEAYRVLSDS 75
Query: 62 DLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXX-XXXXX 120
+ RA YD+YG+K+ + G ++A E F +FGG+ F IGE+ L+K+ +
Sbjct: 76 NERAVYDRYGKKKPTDEVGLKEATEMFGNLFGGERFVDLIGEISLIKDFGRASEIMMTDE 135
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
SP TT+ V A + E + A + + +K
Sbjct: 136 EREEMERQMNEHLKSP-------TTDTVTGASDESGEANPATSASESQSQVASSSASKAE 188
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTES----VYDDACKDSF 236
Q R+ L++ L +R+ + DD F
Sbjct: 189 RQKHKLTPEQRKKLDEFEKEKVEKEQKRITDLTEKLKDRIRPFVNARNPGAEDDNETKIF 248
Query: 237 KKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQ--NLFGMGGIFHSMKAKGGVF 294
K+ EEA LK+ESFG+++LHTIG VY K+ +L ++ N GM G ++ +K +GG+
Sbjct: 249 TKRMREEAEDLKLESFGVELLHTIGSVYLTKSNTWLKTKRGNFLGMPGFWNRLKERGGLI 308
Query: 295 MDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKV 354
+T + +A++ Q +M+EL + +E +E E+ Q EQ + GK+
Sbjct: 309 KETWNVMGSAVNVQMSMEELARRQEKGDLSEA---------------EMQQLEQDVNGKM 353
Query: 355 LSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEE 414
L A W G+++E+ LR VC VL + VS K L++RA A+ LLG ++ + + ++E
Sbjct: 354 LLATWRGTRWEVNGVLRRVCDNVLNEKGVSDKVLMQRARALALLGSIYSEVVPDESDEER 413
Query: 415 AQIFEELVAEATK 427
++ E LVA A +
Sbjct: 414 REL-ERLVARAAQ 425
>gi|150864850|ref|XP_001383838.2| hypothetical protein PICST_57157 [Pichia stipitis CBS 6054]
gi|149386106|gb|ABN65809.2| predicted protein [Pichia stipitis CBS 6054]
Length = 414
Score = 190 bits (483), Expect = 1e-46, Method: Composition-based stats.
Identities = 113/223 (50%), Positives = 151/223 (67%), Gaps = 5/223 (2%)
Query: 211 QLSKTLIERLSILTESVYDDACKDSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEI 270
+LS LI++LS+ TE+ D SFK K + EA LKMESFGL+ILHT+G +Y K++I
Sbjct: 192 ELSNKLIDKLSLFTETDMKDDVAQSFKGKLQYEAESLKMESFGLEILHTLGSIYKTKSKI 251
Query: 271 FLASQNLFGMGGIFHSMKAKGGVFMDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFDK 330
FL +Q FG GG +HS+K KGGV DT TVS A+DAQ TM+E KM++ + + +
Sbjct: 252 FLKNQTFFGWGGFWHSVKEKGGVVKDTFSTVSTALDAQRTMEEYSKMQQDNEYHALKEAE 311
Query: 331 DGNEQI-----KPTTEELAQQEQLLMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSK 385
+ + + T EELA+ E+ LMGKVL+AAW GSK+EI T+RGVC +L D+ V
Sbjct: 312 EEEAKKSAAEQEHTPEELAEMEKYLMGKVLAAAWSGSKFEIQGTIRGVCDNILYDEEVPL 371
Query: 386 KTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEELVAEATKK 428
K I RA A+KL+GEVF RT+ E EEA+IFE+LVAEA++K
Sbjct: 372 KKRIDRANALKLIGEVFSAVTRTEDEDEEARIFEQLVAEASQK 414
Score = 142 bits (358), Expect = 4e-32, Method: Composition-based stats.
Identities = 72/112 (64%), Positives = 91/112 (81%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MVVDT YY+LLGV A+S+EIKKAYRK +I+ HPDKNP+DP+A +FQ + EAYQVL D
Sbjct: 1 MVVDTAYYELLGVQANATSLEIKKAYRKAAIRLHPDKNPDDPSAAAKFQEVGEAYQVLSD 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQK 112
+ LRAKYDK+G++E+IP GFED +E FS+IFGG+AF +IGEL LL+ L K
Sbjct: 61 EKLRAKYDKFGKQESIPTEGFEDPSEFFSMIFGGEAFKEWIGELTLLQELSK 112
>gi|50290783|ref|XP_447824.1| unnamed protein product [Candida glabrata]
gi|49527135|emb|CAG60773.1| unnamed protein product [Candida glabrata CBS 138]
Length = 382
Score = 190 bits (482), Expect = 2e-46, Method: Composition-based stats.
Identities = 141/428 (32%), Positives = 208/428 (48%), Gaps = 51/428 (11%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV DTE+YD+LG+S A+ EIKKAYRK ++ HPDK+P+DP A +FQA+ EAYQVL D
Sbjct: 1 MVKDTEFYDVLGISPEATPSEIKKAYRKMAMLTHPDKHPDDPEAQAKFQAVGEAYQVLND 60
Query: 61 DDLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXX 120
LR +YD++G+ A+PQ GFEDA E F+ IFGGD F +IG+ L K L +
Sbjct: 61 PALRKQYDEFGKDNAVPQQGFEDAEEYFTAIFGGDGFKDWIGDFSLFKELNEATDMMSED 120
Query: 121 XXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKN 180
K +G T+A D + T E+ +K K+
Sbjct: 121 ATTDATAAATTSEAGMV--KHDGKTDAKDKSGKMTKEQREK----------LWEMEKKRR 168
Query: 181 EQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVYDDACKDSFKKKF 240
E+V +V++L++ L E+L +V D F +K
Sbjct: 169 EEVA----------------------KQVEELARKLKEKLLQYNLAV-KGGHLDDFNRKL 205
Query: 241 EEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSMKAKGGVFMDTLRT 300
++E LK+ESFGL++L+ I VY KA +L ++ FG IF S +
Sbjct: 206 DQEVEELKLESFGLELLYLIARVYKTKANNYLMAKKTFGFSKIFTSTRDNARTVKSAYNL 265
Query: 301 VSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAAWH 360
+S ++AQ M+++ K+ D+D +Q E A+ E + GK L W
Sbjct: 266 LSTGMEAQKAMEQMSKV-----------DEDQLDQY-----ERAKFENEMAGKALGVMWA 309
Query: 361 GSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTFRTKVEQEEAQIFEE 420
+K+E+ L+ VC VL D SVS RA+ + + F R+ E EEA++FEE
Sbjct: 310 MNKFELERKLKDVCNTVLSDKSVSSSERRERAKGLLFIASRFASAKRSPEEAEEAKVFEE 369
Query: 421 LVAEATKK 428
+ +K
Sbjct: 370 FILGEKQK 377
>gi|169867178|ref|XP_001840170.1| hypothetical protein CC1G_02633 [Coprinopsis cinerea okayama7#130]
gi|116498722|gb|EAU81617.1| hypothetical protein CC1G_02633 [Coprinopsis cinerea okayama7#130]
Length = 443
Score = 189 bits (479), Expect = 5e-46, Method: Composition-based stats.
Identities = 141/404 (34%), Positives = 202/404 (50%), Gaps = 59/404 (14%)
Query: 3 VDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGDDD 62
++T YYD+LGV A++ +IKKAYR+ +I+ HPDKNP+DPTA RF I AYQ L D
Sbjct: 44 LETGYYDILGVPVDATTDDIKKAYRRLAIKHHPDKNPDDPTAAARFTEIGIAYQTLSDPA 103
Query: 63 LRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXXXX 122
LR KY+++G KE+ P+GGF D E F IFGG+ F IG + L + ++
Sbjct: 104 LRKKYNEFGAKESQPEGGFVDPEEVFGAIFGGERFVPIIGHIGLAQEMKAAMQE------ 157
Query: 123 XXXXXXXXXXXXSPADGKTNGTTNAVDAALGNTNEKDDKNKARTTSGNLTVHDGNKKNEQ 182
DG+ G+T EK D +T + +K E+
Sbjct: 158 ---------------DGEDEE---------GDTKEKKDPK-------TMTPEERARKEEK 186
Query: 183 VGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVY---DDACKDSFKKK 239
RV QL + LI +LSI TES D S+K
Sbjct: 187 ----DRIKAEKERQRNAEKAAARAERVGQLVENLIRKLSIFTESATGPNDPDVTRSWKTI 242
Query: 240 FEEEANLLKMESFGLDILHTIGDVYYEKAEIFLAS-QNLFGMGGIFHSMKAKGGVFMDTL 298
E EA LK ES+G+D+LH IG VY KA+ LA+ Q +FGMGG H+++ K VF +T+
Sbjct: 243 CELEAEDLKRESYGVDLLHAIGFVYAAKAKHHLATNQTIFGMGGWLHNVQGKYHVFSETV 302
Query: 299 RTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQEQLLMGKVLSAA 358
T+ AAI+ + ++ ++++ P EE + E+ K L A
Sbjct: 303 STLRAAIELKAVFDQIAAAEKSANGLSP--------------EERRKLEEQAAEKGLQAL 348
Query: 359 WHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVF 402
+ G+K EI S LR VC +VL + S+S+ L RA A+++LGE +
Sbjct: 349 FKGTKLEIESILREVCDRVLSEPSLSRDKLALRAVALQMLGEAY 392
>gi|68478826|ref|XP_716575.1| peroxisomal protein import protein [Candida albicans SC5314]
gi|68478933|ref|XP_716521.1| peroxisomal protein import protein [Candida albicans SC5314]
gi|46438191|gb|EAK97526.1| potential peroxisomal protein import protein [Candida albicans
SC5314]
gi|46438246|gb|EAK97580.1| potential peroxisomal protein import protein [Candida albicans
SC5314]
Length = 461
Score = 188 bits (477), Expect = 7e-46, Method: Composition-based stats.
Identities = 145/440 (32%), Positives = 227/440 (51%), Gaps = 30/440 (6%)
Query: 1 MVVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGD 60
MV DT YYD+L V TA+ +E+KKAYRK++I+ HPDKN NDP A E+FQ + EAY +L +
Sbjct: 1 MVKDTTYYDILQVEVTATDVELKKAYRKQAIKLHPDKNANDPKAAEKFQELGEAYGILSN 60
Query: 61 DDLRAKYDKYG-----RKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXX 115
+ R YD++G + Q D AE F++IFGGD+F +IGEL +L ++ +
Sbjct: 61 PESRKIYDEFGVEGMKENPTMQQAADIDPAEFFNMIFGGDSFKQWIGELSMLNDMSRMGE 120
Query: 116 XXXXXXXXXXXXXXXXXXXSPADGKTNGTTNAVDAALGN--------TNEKDDKNKARTT 167
GK +G T++++ ++ N +
Sbjct: 121 IITEDEVELDNVEEESK-----SGKQHGATDSLNESVQTLTISDNNNNNNTKVSSSNNKP 175
Query: 168 SGNLTVHDGNKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESV 227
+ NLT + K +Q R+D L+ L++R+ S
Sbjct: 176 TSNLTSEEIKKLKKQ--KINEQQREQMLKYQEEAKQAKLKRIDDLTSALLKRIENYQLSK 233
Query: 228 YDDACKDSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQNLFGMGGIFHSM 287
+ DSF +K + E +K+ESFG+ +LH IG +Y +KA + + FG+ IF S+
Sbjct: 234 NNKEALDSFTRKLQTEFEDMKIESFGIQLLHLIGKIYIDKANATIHASKTFGVSKIFTSV 293
Query: 288 KAKGGVFMDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQ- 346
K+K + + A+DAQ +++++ K +E F E +PT EEL +Q
Sbjct: 294 KSKTETVKNGYSILKTAVDAQLSIEQMVKEQEQ-------FLLAQEEGHQPTQEELVKQA 346
Query: 347 --EQLLMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKK 404
E+++ GK L+ AW +K+E+T LR VC VL D ++SKK + RAEA+ +G+ K
Sbjct: 347 EMERIITGKFLATAWATTKFEVTDILRKVCHNVLRDKTISKKERVARAEALLYIGKEMSK 406
Query: 405 TFRTKVEQEEAQIFEELVAE 424
R+ E+EEA+IFEE++AE
Sbjct: 407 VERSPEEEEEARIFEEMMAE 426
>gi|71016108|ref|XP_758866.1| hypothetical protein UM02719.1 [Ustilago maydis 521]
gi|46098384|gb|EAK83617.1| hypothetical protein UM02719.1 [Ustilago maydis 521]
Length = 481
Score = 181 bits (458), Expect = 1e-43, Method: Composition-based stats.
Identities = 138/439 (31%), Positives = 215/439 (48%), Gaps = 37/439 (8%)
Query: 2 VVDTEYYDLLGVSTTASSIEIKKAYRKKSIQEHPDKNPNDPTATERFQAISEAYQVLGDD 61
+ D EYYDLLGV AS +++KKAYRK +I+ HPDK ++ E F+ I EAY+VL D+
Sbjct: 35 IADMEYYDLLGVRGDASDLDLKKAYRKAAIKNHPDKGGDE----ETFKMIGEAYRVLSDN 90
Query: 62 DLRAKYDKYGRKEAIPQGGFEDAAEQFSVIFGGDAFASYIGELMLLKNLQKXXXXXXXXX 121
LRA YDKYG+K+ + G ++A + F +FGG+ F IGE+ L+K+ K
Sbjct: 91 HLRADYDKYGKKKPTDEVGLKEATDMFGSLFGGERFVDLIGEISLIKDFGKASEIMMTEE 150
Query: 122 XXXXXXXXXXXXXSPADGKTNGTTNAVDAALGN---------TNEKDDKNKARTTSGNLT 172
A+ T AVDA + +A TS +
Sbjct: 151 EKEELEAQMKAEHKKAN--PTDTPAAVDATKTDLPSEAASAGDAAARKAAEAGATSEEIA 208
Query: 173 VHDGNKKNEQVGAXXXXXXXXXXXXXXXXXXXXXXRVDQLSKTLIERLSILTESVY---- 228
+ +Q RV+ L++ L ER+ ++
Sbjct: 209 EAKKKEDAKQRQKLTPEQKAKLEELEKEKEENERKRVEDLAEKLKERIRPFVDARKPGDK 268
Query: 229 DDACKDSFKKKFEEEANLLKMESFGLDILHTIGDVYYEKAEIFLASQ--NLFGMGGIFHS 286
DD+ F++K +EEA LK+ESFG+++LH IG++Y KA ++ ++ ++ G GG
Sbjct: 269 DDSQTQIFERKMKEEAEDLKLESFGVELLHAIGNIYVMKATTWIKTKKHSMLGFGGFMSR 328
Query: 287 MKAKGGVFMDTLRTVSAAIDAQNTMKELEKMKEASTNNEPLFDKDGNEQIKPTTEELAQQ 346
MK +G V +T + +A++ + +M EL + +E E +EL
Sbjct: 329 MKERGAVVKETWGMLGSALNVKASMDELARRQEKGEIPE---------------DELRAL 373
Query: 347 EQLLMGKVLSAAWHGSKYEITSTLRGVCKKVLEDDSVSKKTLIRRAEAMKLLGEVFKKTF 406
EQ + GK+L A W G+++EI+ LR VC KVL + V+ K L RA+A+ LG ++K
Sbjct: 374 EQDMSGKMLLATWRGTRFEISGILRQVCDKVLNEKGVNDKVLFNRAQAILFLGMIYKSVQ 433
Query: 407 RTKVEQEEAQIFEELVAEA 425
+ + E ++ E LVAEA
Sbjct: 434 PDESDDERREL-ERLVAEA 451