BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= YHR160C__[Saccharomyces_cerevisiae]
         (283 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           6,899,187 sequences; 2,350,152,223 total letters

Searching..................................................done


Results from round 1


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_012030.1|  Peroxin required for targeting of peroxiso...   509   e-143
gb|EDN62399.1|  peroxin [Saccharomyces cerevisiae YJM789]         506   e-142
ref|XP_001647071.1|  hypothetical protein Kpol_1050p71 [Vand...    60   2e-07
ref|XP_446317.1|  unnamed protein product [Candida glabrata]...    39   0.30 
ref|XP_001799579.1|  hypothetical protein SNOG_09281 [Phaeos...    36   3.7  
ref|NP_207583.1|  anti-codon nuclease masking agent (prrB) [...    35   4.4  
ref|NP_983945.1|  ADL151Wp [Ashbya gossypii ATCC 10895] >gi|...    35   8.1  
dbj|BAC98110.1|  mKIAA1153 protein [Mus musculus]                  34   9.3  
>ref|NP_012030.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex21p; Pex18p [Saccharomyces cerevisiae]
 sp|P38855|PEX18_YEAST Peroxisomal membrane protein PEX18 (Peroxin-18)
 gb|AAB68992.1| Yhr160cp [Saccharomyces cerevisiae]
          Length = 283

 Score =  509 bits (1312), Expect = e-143,   Method: Composition-based stats.
 Identities = 283/283 (100%), Positives = 283/283 (100%)

Query: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60
           MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL
Sbjct: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60

Query: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120
           NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN
Sbjct: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180
           GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS
Sbjct: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180

Query: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS
Sbjct: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240

Query: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA
Sbjct: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
>gb|EDN62399.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 283

 Score =  506 bits (1304), Expect = e-142,   Method: Composition-based stats.
 Identities = 281/283 (99%), Positives = 283/283 (100%)

Query: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60
           MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSF+KIGSRLNSPPILKDKIELKFLQHSEDL
Sbjct: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFSKIGSRLNSPPILKDKIELKFLQHSEDL 60

Query: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120
           NQS+SYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN
Sbjct: 61  NQSQSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180
           GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS
Sbjct: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180

Query: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS
Sbjct: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240

Query: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA
Sbjct: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
>ref|XP_001647071.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO19213.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 283

 Score = 60.1 bits (144), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 40/122 (32%), Positives = 65/122 (53%), Gaps = 3/122 (2%)

Query: 162 NSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANK 221
           ++ +D  +  EF+++E    E D H+  E +  S   E+   K +A+DI++ C  N+ N 
Sbjct: 163 HNDLDMYLEKEFNDIELELQELDEHSHSEFA--SGVQEQQKFKEIATDIVDTCITNNKNS 220

Query: 222 DVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDR 281
            +  +L  SKFMGLM  ISDG + LK D   + N      +    +G   G +FHD++D 
Sbjct: 221 TMTSKLAKSKFMGLMMQISDGEVTLKIDEHPDANTNAKELYS-TKTGKTVGNDFHDIQDT 279

Query: 282 IA 283
           I+
Sbjct: 280 IS 281
>ref|XP_446317.1| unnamed protein product [Candida glabrata]
 sp|Q6FTX7|PEX21_CANGA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG59241.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 232

 Score = 39.3 bits (90), Expect = 0.30,   Method: Composition-based stats.
 Identities = 51/212 (24%), Positives = 86/212 (40%), Gaps = 46/212 (21%)

Query: 49  IELKFLQHSEDLNQSRSYVNIRPRTLEDQSYKFEAPNL----NDNETSWAKDFRYNFPKN 104
           +E  FLQ++  +N       + P TL  Q++   A       + +  SW  DF       
Sbjct: 49  VEATFLQNNSGMNA------VHPTTLNVQTHNAHATGRTIAKSVDTNSWVNDFS---SMK 99

Query: 105 VEPPIENQIANLNINNGLRTSQTDFPLGFYSQKNFNI-------ASFPVVDHQIFKTTGL 157
           V+ P+E   +  N+       Q    +     +N  I        +  +VDH   K+   
Sbjct: 100 VQDPLEFADSYKNLYKQYEQKQFHNTVPAQCPRNLYIPTVTRPQTTLNIVDHHEDKS--- 156

Query: 158 EHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNN 217
                S +D LI+ EF ++E          E ++    LE +E+A        IE C+ +
Sbjct: 157 -----SAVDQLIDDEFEKIEE---------EVKDQNQKLEFQESA-----QSFIEICERS 197

Query: 218 SANKDVKERLNSSKFMGLMGSISDGSIVLKKD 249
                +KE+L  SKF  +M  +SDG   +++D
Sbjct: 198 Q----MKEKLQRSKFFQVMQKVSDGEATIRQD 225
>ref|XP_001799579.1| hypothetical protein SNOG_09281 [Phaeosphaeria nodorum SN15]
 gb|EAT83473.2| hypothetical protein SNOG_09281 [Phaeosphaeria nodorum SN15]
          Length = 1422

 Score = 35.8 bits (81), Expect = 3.7,   Method: Composition-based stats.
 Identities = 52/193 (26%), Positives = 78/193 (40%), Gaps = 25/193 (12%)

Query: 16   SSTEKGPFTGRDNTLSFNKIGSRL---NSPPILKDKIELKFLQHSEDLNQSRSYVNIRPR 72
            ++T   PF GR    S  K  S L       + + K     +QH  D  +S   +NI  R
Sbjct: 1140 AATASSPFNGRS---SLGKSASTLIEQAKQVLARSKPSPPSVQH--DFGRSVPNLNIHRR 1194

Query: 73   TLEDQSYKF--EAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINNGLRTSQTDFP 130
            T+  Q+          ND    W +  R+  PK       N + +  I  GL +     P
Sbjct: 1195 TMSAQTSVLGRSMGARNDRPAYWNRPSRF-VPKECYGQGANAVRDYRIKYGLSSPANTRP 1253

Query: 131  LGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLE-EDVHTEE 189
                S +    AS P+     +       P+NS+       ++SE E+S +E  DV  EE
Sbjct: 1254 ---NSIEPVVSASSPISTQMSYA------PVNSYTQE----QYSEEESSGIEIVDVDAEE 1300

Query: 190  ENSGTSLEDEETA 202
            EN G + E+E +A
Sbjct: 1301 ENPGATEEEENSA 1313
>ref|NP_207583.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori
           26695]
 gb|AAD07838.1| anti-codon nuclease masking agent (prrB) [Helicobacter pylori
           26695]
          Length = 431

 Score = 35.4 bits (80), Expect = 4.4,   Method: Composition-based stats.
 Identities = 30/107 (28%), Positives = 52/107 (48%), Gaps = 12/107 (11%)

Query: 129 FPLGFYSQKNFNIASFPVVDHQIFKTTGLEHP---INSHIDSLINAEFSELEASSLEEDV 185
           F LG + + N N++ F  VD   FK      P   I   I  +++A F+EL  + L  ++
Sbjct: 131 FLLGEWCKNNINVSGFASVDMTAFKKYKFPIPPLEIQQEIVKILDA-FTELN-TELNTEL 188

Query: 186 HTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKF 232
           +TE       L+  +   +   + +++F D N ++KD KERL    +
Sbjct: 189 NTE-------LKARKKQYEYYQNMLLDFNDINQSHKDAKERLAQKTY 228
>ref|NP_983945.1| ADL151Wp [Ashbya gossypii ATCC 10895]
 sp|Q75AS1|PEX21_ASHGO Peroxisomal membrane protein PEX21 (Peroxin-21)
 gb|AAS51769.1| ADL151Wp [Ashbya gossypii ATCC 10895]
          Length = 307

 Score = 34.7 bits (78), Expect = 8.1,   Method: Composition-based stats.
 Identities = 19/59 (32%), Positives = 26/59 (44%), Gaps = 6/59 (10%)

Query: 224 KERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
           K++  +SKFMGLM  +S G + L K       L    G      G   G E+  V D +
Sbjct: 209 KDKFGASKFMGLMRQVSTGDVTLSKSESGYTGLHMTAG------GEAVGAEYRAVTDEV 261
>dbj|BAC98110.1| mKIAA1153 protein [Mus musculus]
          Length = 556

 Score = 34.3 bits (77), Expect = 9.3,   Method: Composition-based stats.
 Identities = 22/64 (34%), Positives = 37/64 (57%), Gaps = 2/64 (3%)

Query: 158 EHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNN 217
           E P+N  ++SL+N  FS    SS  +D    EE++G  LE++E A +    +I++  +NN
Sbjct: 286 EFPLNK-VNSLLNGTFSAEMLSSEPKDSTPVEESNG-ELEEKEIAEQADEDNIVDAAENN 343

Query: 218 SANK 221
           S  +
Sbjct: 344 SGEQ 347
Searching..................................................done Results from round 2


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

ref|NP_012030.1|  Peroxin required for targeting of peroxiso...   522   e-146
gb|EDN62399.1|  peroxin [Saccharomyces cerevisiae YJM789]         522   e-146
ref|XP_001647071.1|  hypothetical protein Kpol_1050p71 [Vand...   189   2e-46
Sequences not found previously or not previously below threshold:

ref|NP_983945.1|  ADL151Wp [Ashbya gossypii ATCC 10895] >gi|...    55   9e-06
ref|XP_446317.1|  unnamed protein product [Candida glabrata]...    52   4e-05
ref|XP_454648.1|  unnamed protein product [Kluyveromyces lac...    50   2e-04
ref|NP_011755.1|  Peroxin required for targeting of peroxiso...    49   3e-04
gb|AAS56603.1|  YGR239C [Saccharomyces cerevisiae]                 47   0.002
ref|XP_001524003.1|  hypothetical protein LELG_04816 [Lodder...    42   0.077
ref|NP_683055.1|  aminopeptidase P [Thermosynechococcus elon...    40   0.17 
ref|ZP_00514737.1|  Amino acid adenylation [Crocosphaera wat...    38   0.67 
ref|XP_001799579.1|  hypothetical protein SNOG_09281 [Phaeos...    38   0.74 
ref|XP_447344.1|  hypothetical protein CAGL0I02134g [Candida...    38   0.88 
emb|CAN61965.1|  hypothetical protein [Vitis vinifera]             37   1.2  
ref|XP_001558155.1|  hypothetical protein BC1G_03187 [Botryo...    36   3.4  
ref|YP_001318346.1|  Xylose isomerase domain protein TIM bar...    36   3.4  
ref|XP_447342.1|  hypothetical protein CAGL0I02090g [Candida...    36   3.5  
gb|EDN64196.1|  mitochondrial translational activator [Sacch...    35   4.5  
gb|AAA02906.1|  PET111 protein                                     35   4.8  
ref|NP_013984.1|  Mitochondrial translational activator spec...    35   4.9  
ref|YP_001195790.1|  hypothetical protein Fjoh_3457 [Flavoba...    35   5.2  
ref|ZP_01166677.1|  hypothetical protein MED92_11509 [Oceano...    35   6.0  
ref|ZP_01835295.1|  Fibril-like structure subunit FibA, puta...    35   7.1  
ref|NP_690455.1|  DNA ligase III [Heliothis zea virus 1] >gi...    35   8.4  
>ref|NP_012030.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex21p; Pex18p [Saccharomyces cerevisiae]
 sp|P38855|PEX18_YEAST Peroxisomal membrane protein PEX18 (Peroxin-18)
 gb|AAB68992.1| Yhr160cp [Saccharomyces cerevisiae]
          Length = 283

 Score =  522 bits (1345), Expect = e-146,   Method: Composition-based stats.
 Identities = 283/283 (100%), Positives = 283/283 (100%)

Query: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60
           MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL
Sbjct: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60

Query: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120
           NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN
Sbjct: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180
           GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS
Sbjct: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180

Query: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS
Sbjct: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240

Query: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA
Sbjct: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
>gb|EDN62399.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 283

 Score =  522 bits (1344), Expect = e-146,   Method: Composition-based stats.
 Identities = 281/283 (99%), Positives = 283/283 (100%)

Query: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60
           MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSF+KIGSRLNSPPILKDKIELKFLQHSEDL
Sbjct: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFSKIGSRLNSPPILKDKIELKFLQHSEDL 60

Query: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120
           NQS+SYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN
Sbjct: 61  NQSQSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180
           GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS
Sbjct: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180

Query: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS
Sbjct: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240

Query: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA
Sbjct: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
>ref|XP_001647071.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO19213.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 283

 Score =  189 bits (480), Expect = 2e-46,   Method: Composition-based stats.
 Identities = 74/298 (24%), Positives = 132/298 (44%), Gaps = 35/298 (11%)

Query: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL-- 60
           S+ CQ+N + +F    E+   TGR   L+  ++ +R      +++ +E +FL++ E L  
Sbjct: 2   SSVCQSNPLQQFARKGEQA--TGRWR-LNEGRVQNRGGES--IQNSVENEFLRNDESLSI 56

Query: 61  ---NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLN 117
              NQ  +     P  +   S   E       + SW  +F       ++ P+E       
Sbjct: 57  RGGNQPHNMFGRAPAQMGHSSGVVE-------KESWLNEFS---KIAIKDPLEFSDRYKQ 106

Query: 118 INNGLRTSQTDFPL---GFYS---------QKNFNIASFPVVDHQIFKTTGLEHPINSHI 165
           + +   T QT   +   G+Y          Q+N+++    +      + T      ++ +
Sbjct: 107 LYSNYETRQTVTSMHNQGYYQPMSNMNLHVQQNWSMMEGQINQSMNSQETAHIDQAHNDL 166

Query: 166 DSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKE 225
           D  +  EF+++E    E D H+  E +  S   E+   K +A+DI++ C  N+ N  +  
Sbjct: 167 DMYLEKEFNDIELELQELDEHSHSEFA--SGVQEQQKFKEIATDIVDTCITNNKNSTMTS 224

Query: 226 RLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           +L  SKFMGLM  ISDG + LK D   + N      +    +G   G +FHD++D I+
Sbjct: 225 KLAKSKFMGLMMQISDGEVTLKIDEHPDANTNAKELYS-TKTGKTVGNDFHDIQDTIS 281
>ref|NP_983945.1| ADL151Wp [Ashbya gossypii ATCC 10895]
 sp|Q75AS1|PEX21_ASHGO Peroxisomal membrane protein PEX21 (Peroxin-21)
 gb|AAS51769.1| ADL151Wp [Ashbya gossypii ATCC 10895]
          Length = 307

 Score = 54.6 bits (130), Expect = 9e-06,   Method: Composition-based stats.
 Identities = 29/119 (24%), Positives = 45/119 (37%), Gaps = 12/119 (10%)

Query: 164 HIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDV 223
            ++  +  EF  LE      +V   E  S   L+ E+   +  A  I        +    
Sbjct: 155 ELERYLEREFDVLEGELAPPEV--PEALSAPLLDHEQLGFQESAKAIYATL----SAPIH 208

Query: 224 KERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
           K++  +SKFMGLM  +S G + L K          + G      G   G E+  V D +
Sbjct: 209 KDKFGASKFMGLMRQVSTGDVTLSKSE------SGYTGLHMTAGGEAVGAEYRAVTDEV 261
>ref|XP_446317.1| unnamed protein product [Candida glabrata]
 sp|Q6FTX7|PEX21_CANGA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG59241.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 232

 Score = 52.3 bits (124), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 55/261 (21%), Positives = 97/261 (37%), Gaps = 51/261 (19%)

Query: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSF---NKIGSRLNSPPILKDKIELKFLQHSED 59
           S  C  + + + +S   KGP+        +   +  G   N        +E  FLQ++  
Sbjct: 2   SQYCAADPIQRMVS--RKGPYLDTQQQRVYKHRSGTGHGTNIRQPTTGIVEATFLQNNSG 59

Query: 60  LNQSRSYVNIRPRTLEDQSYKFEAPNL----NDNETSWAKDFRYNFPKNVEPPIENQIAN 115
           +N       + P TL  Q++   A       + +  SW  DF       V+ P+E   + 
Sbjct: 60  MNA------VHPTTLNVQTHNAHATGRTIAKSVDTNSWVNDFS---SMKVQDPLEFADSY 110

Query: 116 LNINNGLRTSQTDFPLGFYSQKNFNI-------ASFPVVDHQIFKTTGLEHPINSHIDSL 168
            N+       Q    +     +N  I        +  +VDH   K++         +D L
Sbjct: 111 KNLYKQYEQKQFHNTVPAQCPRNLYIPTVTRPQTTLNIVDHHEDKSSA--------VDQL 162

Query: 169 INAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLN 228
           I+ EF ++E    +++   E               +  A   IE C+       +KE+L 
Sbjct: 163 IDDEFEKIEEEVKDQNQKLE--------------FQESAQSFIEICE----RSQMKEKLQ 204

Query: 229 SSKFMGLMGSISDGSIVLKKD 249
            SKF  +M  +SDG   +++D
Sbjct: 205 RSKFFQVMQKVSDGEATIRQD 225
>ref|XP_454648.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CN41|PEX21_KLULA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG99735.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 306

 Score = 50.0 bits (118), Expect = 2e-04,   Method: Composition-based stats.
 Identities = 45/195 (23%), Positives = 88/195 (45%), Gaps = 34/195 (17%)

Query: 111 NQIANLNINNGLRTSQTDFPLGF---YSQKNFNIASF----PV-VDH--------QIFKT 154
           +Q AN+ +++     +T+FP  +   YSQ     AS+    PV V H        Q    
Sbjct: 95  DQFANMQVHD-----KTEFPTDYKQMYSQYEARGASYSMGAPVMVSHSQMFPRHQQSLMV 149

Query: 155 TGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEEN-------SGTSLEDEETAMKGLA 207
             LE    +   + ++ +F+ELE    +++   +++          + L++++  +K  A
Sbjct: 150 NQLESQAYASSSAKLDEQFAELERQVQDDEKEQQQDKDDDFHLKETSPLDEDQRQLKEAA 209

Query: 208 SDIIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNS 267
             I  +   +  +     + ++SKF+GLM +ISDG I LKK+   ++       +    +
Sbjct: 210 QSI--YTTLSDKSSTTSSKFSNSKFLGLMRNISDGVITLKKNPDEDKYT---ELYSPS-T 263

Query: 268 GNWAGLEFHDVEDRI 282
           G   G E+  V+D +
Sbjct: 264 GETFGEEYFPVQDSV 278
>ref|NP_011755.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex18p; Pex21p [Saccharomyces cerevisiae]
 sp|P50091|PEX21_YEAST Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAA61191.1| ORF 288 [Saccharomyces cerevisiae]
 emb|CAA97267.1| unnamed protein product [Saccharomyces cerevisiae]
 gb|EDN61824.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 288

 Score = 49.2 bits (116), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 31/121 (25%), Positives = 57/121 (47%), Gaps = 12/121 (9%)

Query: 170 NAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFC--------DNNSANK 221
           +AEF  LE    +E             + +++ ++ +A+DI++ C         +++ + 
Sbjct: 166 DAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSS 225

Query: 222 DVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDR 281
            V+ +L+ SKF+ LM +IS G + LKK+   + N      F   N+G   G     V+D 
Sbjct: 226 SVESKLSESKFIQLMRNISSGDVTLKKN--ADGN-SASELFS-SNNGELVGNRHIFVKDE 281

Query: 282 I 282
           I
Sbjct: 282 I 282
>gb|AAS56603.1| YGR239C [Saccharomyces cerevisiae]
          Length = 288

 Score = 46.9 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 30/121 (24%), Positives = 56/121 (46%), Gaps = 12/121 (9%)

Query: 170 NAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFC--------DNNSANK 221
           +AEF  LE    +E             + +++ ++ +A+DI++ C         +++ + 
Sbjct: 166 DAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSS 225

Query: 222 DVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDR 281
            V+ +L+  KF+ LM +IS G + LKK+   + N      F   N+G   G     V+D 
Sbjct: 226 SVESKLSEWKFIQLMRNISSGDVTLKKN--ADGN-SASELFS-SNNGELVGNRHIFVKDE 281

Query: 282 I 282
           I
Sbjct: 282 I 282
>ref|XP_001524003.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
 gb|EDK46635.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 489

 Score = 41.5 bits (96), Expect = 0.077,   Method: Composition-based stats.
 Identities = 23/102 (22%), Positives = 50/102 (49%), Gaps = 12/102 (11%)

Query: 149 HQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLAS 208
            Q+ K    E  + S  D+ +  E  E+E+S  E ++H ++  + T+    E++M  L  
Sbjct: 218 QQVHKLEDQETQLESEFDA-VEKELQEVESSMQETNLHDKDLFAQTA-RKVESSMNSL-- 273

Query: 209 DIIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLKKDN 250
                   N+ + ++K +  +S+F+ LM  +++  + L+ D 
Sbjct: 274 --------NTTDAEMKSKFENSEFLKLMNRVANKQVELQDDK 307
>ref|NP_683055.1| aminopeptidase P [Thermosynechococcus elongatus BP-1]
 dbj|BAC09817.1| aminopeptidase P [Thermosynechococcus elongatus BP-1]
          Length = 435

 Score = 40.4 bits (93), Expect = 0.17,   Method: Composition-based stats.
 Identities = 14/37 (37%), Positives = 19/37 (51%)

Query: 242 GSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDV 278
           G I    + G E   QK+  F    +G+W GL+ HDV
Sbjct: 328 GEITTLINEGKENQTQKYRTFFMHGTGHWLGLDVHDV 364
>ref|ZP_00514737.1| Amino acid adenylation [Crocosphaera watsonii WH 8501]
 gb|EAM52575.1| Amino acid adenylation [Crocosphaera watsonii WH 8501]
          Length = 1119

 Score = 38.1 bits (87), Expect = 0.67,   Method: Composition-based stats.
 Identities = 25/106 (23%), Positives = 46/106 (43%), Gaps = 3/106 (2%)

Query: 100 NFPKNVEPPIENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEH 159
           N+P  +EP ++  +  L INN   T QT++PL  Y+     +    + D   F ++ +  
Sbjct: 360 NYP--IEPTLKQSLQGLEINNIQTTEQTNYPLTLYAAITPELTLRLLYDSDRFSSSSINR 417

Query: 160 PINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKG 205
            +  H+++L+       E    E  + T  E     +E     +KG
Sbjct: 418 LL-GHLETLLTNISLNPEQQLWEVSLLTNSEKKQLLIEGNREEIKG 462
>ref|XP_001799579.1| hypothetical protein SNOG_09281 [Phaeosphaeria nodorum SN15]
 gb|EAT83473.2| hypothetical protein SNOG_09281 [Phaeosphaeria nodorum SN15]
          Length = 1422

 Score = 38.1 bits (87), Expect = 0.74,   Method: Composition-based stats.
 Identities = 53/196 (27%), Positives = 79/196 (40%), Gaps = 25/196 (12%)

Query: 16   SSTEKGPFTGRDNTLSFNKIGSRL--NSPPIL-KDKIELKFLQHSEDLNQSRSYVNIRPR 72
            ++T   PF GR    S  K  S L   +  +L + K     +QH  D  +S   +NI  R
Sbjct: 1140 AATASSPFNGRS---SLGKSASTLIEQAKQVLARSKPSPPSVQH--DFGRSVPNLNIHRR 1194

Query: 73   TLEDQSYKF--EAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINNGLRTSQTDFP 130
            T+  Q+          ND    W +  R+  PK       N + +  I  GL +     P
Sbjct: 1195 TMSAQTSVLGRSMGARNDRPAYWNRPSRF-VPKECYGQGANAVRDYRIKYGLSSPANTRP 1253

Query: 131  LGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEE-DVHTEE 189
                S +    AS P+     +       P+NS        ++SE E+S +E  DV  EE
Sbjct: 1254 ---NSIEPVVSASSPISTQMSYA------PVNS----YTQEQYSEEESSGIEIVDVDAEE 1300

Query: 190  ENSGTSLEDEETAMKG 205
            EN G + E+E +A   
Sbjct: 1301 ENPGATEEEENSATTE 1316
>ref|XP_447344.1| hypothetical protein CAGL0I02134g [Candida glabrata CBS138]
 emb|CAG60281.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 259

 Score = 37.7 bits (86), Expect = 0.88,   Method: Composition-based stats.
 Identities = 31/143 (21%), Positives = 56/143 (39%), Gaps = 24/143 (16%)

Query: 158 EHPINSHI-----DSLINAEFSELEASSLEED-----------VHTEEENSGTSLEDEET 201
           EHPI   +     +   N EF +LE     +D            +   E       ++++
Sbjct: 121 EHPIIDRLSPEAREDYYNNEFEQLEKELENDDDVQDGDQFQDLENKTYEYYNEFAFNDQS 180

Query: 202 AMKGLASDIIEFCDNNSA--NKDVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKH 259
             +  A DI+    N++   + ++ E+L+ S F+ L+ SI  G + L+       N Q  
Sbjct: 181 HFQQSARDILNNMSNSTHEYSSELNEKLSGSTFVNLLKSIDIGDVSLQD------NRQGP 234

Query: 260 VGFCFQNSGNWAGLEFHDVEDRI 282
                 N     G ++ D+ D I
Sbjct: 235 KELKNSNDNTTLGNKYADIPDYI 257
>emb|CAN61965.1| hypothetical protein [Vitis vinifera]
          Length = 2703

 Score = 37.3 bits (85), Expect = 1.2,   Method: Composition-based stats.
 Identities = 36/139 (25%), Positives = 53/139 (38%), Gaps = 16/139 (11%)

Query: 1    MNSNRCQTNEVNKFISSTEKGPFT-GRDNTLSFNKIGSRLNSPPILKDKIELK------- 52
            +NSN   T +VN         PF   ++ +    K  S+L +P IL+D ++ K       
Sbjct: 883  LNSNSTDTFKVNGHHLKPFMEPFNQDKEESFQLAKKLSKLRNPKILEDPLQRKPRRRVSE 942

Query: 53   -----FLQHSEDLNQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVE- 106
                  L        S S  NIRPR    +   FEAP       S     R    +  E 
Sbjct: 943  EDHPAHLHTRGSQTPSPSARNIRPRASLARDSMFEAPQAPTIPPSEGGVLRSPPQQRYET 1002

Query: 107  --PPIENQIANLNINNGLR 123
              PP   + +NL++   +R
Sbjct: 1003 RRPPTTPEASNLHLKKSIR 1021
>ref|XP_001558155.1| hypothetical protein BC1G_03187 [Botryotinia fuckeliana B05.10]
 gb|EDN19554.1| hypothetical protein BC1G_03187 [Botryotinia fuckeliana B05.10]
          Length = 958

 Score = 35.8 bits (81), Expect = 3.4,   Method: Composition-based stats.
 Identities = 47/207 (22%), Positives = 81/207 (39%), Gaps = 37/207 (17%)

Query: 31  SFNKIGSRLNSPPILKD-KIELKFL--QHSEDLN--QSRSYVNIRPRTLEDQSYKFEAPN 85
           S + +   L  P ++++ ++E   L  +HS DL     R    +RP   +D     E  +
Sbjct: 148 SIDTLKHPLLEPRVVRNLRVEFPLLLSEHSFDLRGFARRDGFEVRP---QDIKLPLELLS 204

Query: 86  LNDNE-TSWAKDFRYNFPKNVEPPIENQIANLNINNGL---------------------- 122
           + +NE   +  DF YNF       I N+   +  N+ +                      
Sbjct: 205 IENNEGLGFPSDF-YNFEIEAFETIANEKIEVTRNSMVYLQTALKTTLTVENKQEIWEDD 263

Query: 123 RTSQTDFPLGFYSQKNFNIASFPVVDHQI-FKTTGLEHPINSHIDSLINAEFSELEASSL 181
           RT    FP+  +     +  S PV D  + F ++G E P+ S  DSL + +   +E    
Sbjct: 264 RTHPRKFPVSNHVTPPLSPISAPVPDSPLPFVSSGCEIPVLSDPDSLTSLDLRRIEDEVF 323

Query: 182 EEDVHT----EEENSGTSLEDEETAMK 204
           +ED+ T    +   + T    +E   K
Sbjct: 324 KEDLPTPLRNQSMTAATISTSDEQCFK 350
>ref|YP_001318346.1| Xylose isomerase domain protein TIM barrel [Alkaliphilus
           metalliredigens QYMF]
 gb|ABR46687.1| Xylose isomerase domain protein TIM barrel [Alkaliphilus
           metalliredigens QYMF]
          Length = 311

 Score = 35.8 bits (81), Expect = 3.4,   Method: Composition-based stats.
 Identities = 21/86 (24%), Positives = 37/86 (43%), Gaps = 4/86 (4%)

Query: 133 FYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENS 192
           F  +KN  I+S    D+ + +   L    + H+  +INA  S L    +   V   +   
Sbjct: 64  FLKEKNMFISSLAYYDNHLHEDLNLRKSYHDHLKKVINAA-SMLGVELVGTFVGRHQ--- 119

Query: 193 GTSLEDEETAMKGLASDIIEFCDNNS 218
           G S+ +     K + SDI++F    +
Sbjct: 120 GLSIVENFQEFKVVFSDILDFAQEKN 145
>ref|XP_447342.1| hypothetical protein CAGL0I02090g [Candida glabrata CBS138]
 emb|CAG60279.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 1136

 Score = 35.8 bits (81), Expect = 3.5,   Method: Composition-based stats.
 Identities = 47/228 (20%), Positives = 95/228 (41%), Gaps = 37/228 (16%)

Query: 45  LKDKIELKFLQHSEDLNQSRSYVNIRPRTLEDQS----------YKFEAPNLNDNETSWA 94
           + +  E K + HS D++   + ++ R   +ED+            K ++P L D   +  
Sbjct: 699 MSEGTEAKTVTHSSDMDDQTTMISSRHNVVEDRQSPPQLKEAVDMKIDSPTLEDTSANLE 758

Query: 95  KDFRYNFPKNVEPPIENQIANL--NINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQI- 151
           +     FP+ V   +   + NL  N + G+ ++     LG   + N +   F  +  ++ 
Sbjct: 759 RAQVEKFPEVVARQVGAVVPNLVPNQSTGMASNSV---LGMKKENNMSREHFENLRDELS 815

Query: 152 -FKTTGLE--HPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKG--- 205
             K   LE  +  ++HI   + +E ++L A   ++    ++      +EDE +  +    
Sbjct: 816 KIKQAALEKANAADNHIRE-LESELNKLRAEQQKQKEDNKK------MEDEISQFRKRYD 868

Query: 206 -LASDIIEFCDNNSANKDV--KERLNSSKFMGLMGSISDGSIVLKKDN 250
            L +D  E  D     +D+   + L++ KF G++   SD     K D 
Sbjct: 869 LLDADYKEMEDTLQQYEDILAAKVLDAEKFNGIIKVQSD-----KIDE 911
>gb|EDN64196.1| mitochondrial translational activator [Saccharomyces cerevisiae
           YJM789]
          Length = 801

 Score = 35.4 bits (80), Expect = 4.5,   Method: Composition-based stats.
 Identities = 13/45 (28%), Positives = 29/45 (64%)

Query: 109 IENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFK 153
           ++ Q+ +L +N+ +R++ T   + FY+Q   N  +FP+ + Q+F+
Sbjct: 206 VKPQMISLGLNSFIRSNNTQLAVEFYTQAITNPDTFPITEKQLFE 250
>gb|AAA02906.1| PET111 protein
          Length = 800

 Score = 35.4 bits (80), Expect = 4.8,   Method: Composition-based stats.
 Identities = 13/45 (28%), Positives = 29/45 (64%)

Query: 109 IENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFK 153
           ++ Q+ +L +N+ +R++ T   + FY+Q   N  +FP+ + Q+F+
Sbjct: 206 VKPQMISLGLNSFIRSNNTQLAVEFYTQAITNPDTFPITEKQLFE 250
>ref|NP_013984.1| Mitochondrial translational activator specific for the COX2 mRNA;
           located in the mitochondrial inner membrane; Pet111p
           [Saccharomyces cerevisiae]
 sp|P08468|PT111_YEAST Protein PET111, mitochondrial precursor
 emb|CAA88584.1| PetIIIp [Saccharomyces cerevisiae]
 gb|AAT93022.1| YMR257C [Saccharomyces cerevisiae]
          Length = 800

 Score = 35.4 bits (80), Expect = 4.9,   Method: Composition-based stats.
 Identities = 13/45 (28%), Positives = 29/45 (64%)

Query: 109 IENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFK 153
           ++ Q+ +L +N+ +R++ T   + FY+Q   N  +FP+ + Q+F+
Sbjct: 206 VKPQMISLGLNSFIRSNNTQLAVEFYTQAITNPDTFPITEKQLFE 250
>ref|YP_001195790.1| hypothetical protein Fjoh_3457 [Flavobacterium johnsoniae UW101]
 gb|ABQ06471.1| hypothetical protein Fjoh_3457 [Flavobacterium johnsoniae UW101]
          Length = 207

 Score = 35.4 bits (80), Expect = 5.2,   Method: Composition-based stats.
 Identities = 25/75 (33%), Positives = 38/75 (50%), Gaps = 2/75 (2%)

Query: 77  QSYKFEAPNLNDN-ETSWAKDFRYNFPKNVEPPIENQIANLNINNGLRTSQ-TDFPLGFY 134
           + Y+FE+P L +  E   AK     F +  EP I+  + N  IN+GLR  Q  + P+ F 
Sbjct: 82  EQYEFESPVLKEEPEMYKAKCPHCRFQEEWEPKIKYILKNQIINDGLRRDQWYNLPIWFQ 141

Query: 135 SQKNFNIASFPVVDH 149
            + N N+     +DH
Sbjct: 142 KEVNGNLFWAYNLDH 156
>ref|ZP_01166677.1| hypothetical protein MED92_11509 [Oceanospirillum sp. MED92]
 gb|EAR61351.1| hypothetical protein MED92_11509 [Oceanospirillum sp. MED92]
          Length = 811

 Score = 35.0 bits (79), Expect = 6.0,   Method: Composition-based stats.
 Identities = 34/131 (25%), Positives = 53/131 (40%), Gaps = 16/131 (12%)

Query: 58  EDLNQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLN 117
           E L+   SY++  P   E+    F  P L  +   WA  +      +  P  E Q+ NL 
Sbjct: 64  EILDAVASYISNTPNPTEESYRSFINPILKRHPEFWAVHWAPRVLHDQRPFFEEQLKNLG 123

Query: 118 INNGLR----TSQTDFPLG----FYS-------QKNFNIASFPVVDHQIFKTTGLEHPIN 162
            N G+     T+ T  P      +Y        +KN +I  F V   +I K   ++  ++
Sbjct: 124 YNQGITELQATTNTLAPSSHQEEYYPVLLTQPLEKNRSIIGFNVASRKINKEVMIK-TLS 182

Query: 163 SHIDSLINAEF 173
             ID L +A F
Sbjct: 183 QDIDFLSSAPF 193
>ref|ZP_01835295.1| Fibril-like structure subunit FibA, putative [Streptococcus
           pneumoniae SP23-BS72]
 gb|EDK81687.1| Fibril-like structure subunit FibA, putative [Streptococcus
           pneumoniae SP23-BS72]
          Length = 390

 Score = 35.0 bits (79), Expect = 7.1,   Method: Composition-based stats.
 Identities = 22/88 (25%), Positives = 38/88 (43%), Gaps = 8/88 (9%)

Query: 71  PRTLEDQSYKFEAPNLNDNETSWAKDFRYNFP---KNVEPPIENQIANLNINNGLRTSQT 127
           P+  E+     + P+L DNET+ +     N P   K  +PP   QI            + 
Sbjct: 287 PQVTEETIKVGQTPDLTDNETNLS-----NLPAGTKVADPPPAGQIDTTKPGTYTGKVRV 341

Query: 128 DFPLGFYSQKNFNIASFPVVDHQIFKTT 155
           D+P G  ++ + ++   P  + Q +K T
Sbjct: 342 DYPDGSSTEVSVSVNDLPAPETQTYKVT 369
>ref|NP_690455.1| DNA ligase III [Heliothis zea virus 1]
 gb|AAN04331.1|AF451898_35 DNA ligase III [Heliothis zea virus 1]
          Length = 1411

 Score = 34.6 bits (78), Expect = 8.4,   Method: Composition-based stats.
 Identities = 41/152 (26%), Positives = 65/152 (42%), Gaps = 19/152 (12%)

Query: 6   CQTNEVNKFISSTEKGPFTGRDNTLSFNKIGS-RLNS-PPILKDKIELKFLQHSEDLN-- 61
           C +N V   + ++E  PF   +N  SFN +     NS  P+  + +E   L   E  N  
Sbjct: 331 CTSNNVLGDVVNSESLPF---NNVESFNCMEPLPFNSMEPLPFNSVESLPLNSVESFNYM 387

Query: 62  QSRSYVNIRPRTLEDQSY-KFEA-PNLNDNETSWAKDFRYN----------FPKNVEPPI 109
           +   + N+ P  + + S  ++ A  +L+   T    DF++N          F  N+    
Sbjct: 388 EPLPFTNVEPTNIMNHSIDQYSAIDSLSLFNTFNNLDFQFNNNQVFDVDQAFDSNLNQSC 447

Query: 110 ENQIANLNINNGLRTSQTDFPLGFYSQKNFNI 141
            NQ+ N+N N    T QT F      Q +FNI
Sbjct: 448 NNQVFNVNPNQSFNTPQTSFNTNHTPQPSFNI 479
Searching..................................................done Results from round 3


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

ref|NP_012030.1|  Peroxin required for targeting of peroxiso...   423   e-117
gb|EDN62399.1|  peroxin [Saccharomyces cerevisiae YJM789]         422   e-116
ref|XP_001647071.1|  hypothetical protein Kpol_1050p71 [Vand...   307   5e-82
ref|XP_446317.1|  unnamed protein product [Candida glabrata]...   250   1e-64
ref|NP_983945.1|  ADL151Wp [Ashbya gossypii ATCC 10895] >gi|...   180   1e-43
ref|NP_011755.1|  Peroxin required for targeting of peroxiso...   145   5e-33
gb|AAS56603.1|  YGR239C [Saccharomyces cerevisiae]                142   2e-32
ref|XP_454648.1|  unnamed protein product [Kluyveromyces lac...   139   2e-31
Sequences not found previously or not previously below threshold:

ref|XP_447344.1|  hypothetical protein CAGL0I02134g [Candida...    51   8e-05
gb|AAX14715.1|  Pex20p [Pichia angusta]                            48   6e-04
ref|XP_001226956.1|  hypothetical protein CHGG_09029 [Chaeto...    44   0.012
ref|XP_459664.1|  hypothetical protein DEHA0E08701g [Debaryo...    42   0.042
ref|XP_569156.1|  peroxisome targeting sequence binding prot...    42   0.050
ref|YP_070357.1|  hypothetical protein YPTB1831 [Yersinia ps...    41   0.074
ref|XP_001385675.2|  hypothetical protein PICST_84845 [Pichi...    41   0.080
ref|XP_001731443.1|  hypothetical protein MGL_1626 [Malassez...    40   0.17 
ref|XP_001524003.1|  hypothetical protein LELG_04816 [Lodder...    40   0.17 
emb|CAA97269.1|  unnamed protein product [Saccharomyces cere...    40   0.21 
ref|NP_683055.1|  aminopeptidase P [Thermosynechococcus elon...    40   0.25 
ref|XP_001877777.1|  predicted protein [Laccaria bicolor S23...    39   0.32 
ref|XP_001906106.1|  unnamed protein product [Podospora anse...    39   0.35 
ref|XP_001321780.1|  hypothetical protein TVAG_276300 [Trich...    39   0.41 
gb|EDU39990.1|  conserved hypothetical protein [Pyrenophora ...    39   0.51 
ref|XP_001029521.1|  hypothetical protein TTHERM_01466260 [T...    38   0.55 
ref|XP_368788.1|  hypothetical protein MGG_00456 [Magnaporth...    38   0.57 
ref|YP_001816720.1|  hypothetical protein VF_2649 [Vibrio fi...    38   0.61 
ref|ZP_01105153.1|  transcription elongation factor NusA [Fl...    38   0.77 
ref|ZP_01119447.1|  transcription elongation factor NusA [Ro...    38   0.81 
ref|XP_001482477.1|  hypothetical protein PGUG_05497 [Pichia...    38   0.92 
emb|CAF97724.1|  unnamed protein product [Tetraodon nigrovir...    37   1.2  
ref|YP_001336098.1|  hypothetical protein KPN_02450 [Klebsie...    37   1.8  
ref|XP_001679672.1|  Hypothetical protein CBG02534 [Caenorha...    36   2.5  
ref|ZP_02427325.1|  hypothetical protein CLORAM_00703 [Clost...    36   3.6  
ref|XP_001556458.1|  hypothetical protein BC1G_05227 [Botryo...    35   4.6  
ref|XP_001797478.1|  hypothetical protein SNOG_07125 [Phaeos...    35   4.6  
ref|XP_001141615.1|  PREDICTED: adducin 3 (gamma) isoform 21...    35   4.9  
ref|NP_001012836.1|  peroxisomal biogenesis factor 5 [Gallus...    35   4.9  
ref|XP_001019542.2|  DEAD/DEAH box helicase family protein [...    35   5.4  
ref|XP_977174.1|  hypothetical protein TTHERM_00037740 [Tetr...    35   5.6  
gb|EEH49084.1|  peroxin 20 [Paracoccidioides brasiliensis Pb18]    35   6.1  
ref|ZP_03840659.1|  DNA repair protein RecN [Proteus mirabil...    35   7.3  
ref|NP_859786.1|  hypothetical protein HH0255 [Helicobacter ...    35   8.1  
ref|XP_503644.1|  YlPEX20 [Yarrowia lipolytica] >gi|49649513...    35   8.4  
ref|XP_001652339.1|  embryonic polarity dorsal [Aedes aegypt...    35   8.7  
>ref|NP_012030.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex21p; Pex18p [Saccharomyces cerevisiae]
 sp|P38855|PEX18_YEAST Peroxisomal membrane protein PEX18 (Peroxin-18)
 gb|AAB68992.1| Yhr160cp [Saccharomyces cerevisiae]
          Length = 283

 Score =  423 bits (1087), Expect = e-117,   Method: Composition-based stats.
 Identities = 283/283 (100%), Positives = 283/283 (100%)

Query: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60
           MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL
Sbjct: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60

Query: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120
           NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN
Sbjct: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180
           GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS
Sbjct: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180

Query: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS
Sbjct: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240

Query: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA
Sbjct: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
>gb|EDN62399.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 283

 Score =  422 bits (1085), Expect = e-116,   Method: Composition-based stats.
 Identities = 281/283 (99%), Positives = 283/283 (100%)

Query: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL 60
           MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSF+KIGSRLNSPPILKDKIELKFLQHSEDL
Sbjct: 1   MNSNRCQTNEVNKFISSTEKGPFTGRDNTLSFSKIGSRLNSPPILKDKIELKFLQHSEDL 60

Query: 61  NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120
           NQS+SYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN
Sbjct: 61  NQSQSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINN 120

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180
           GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS
Sbjct: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASS 180

Query: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS
Sbjct: 181 LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240

Query: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA
Sbjct: 241 DGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
>ref|XP_001647071.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO19213.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 283

 Score =  307 bits (787), Expect = 5e-82,   Method: Composition-based stats.
 Identities = 74/298 (24%), Positives = 132/298 (44%), Gaps = 35/298 (11%)

Query: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDL-- 60
           S+ CQ+N + +F    E+   TGR   L+  ++ +R      +++ +E +FL++ E L  
Sbjct: 2   SSVCQSNPLQQFARKGEQA--TGRWR-LNEGRVQNRGGES--IQNSVENEFLRNDESLSI 56

Query: 61  ---NQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLN 117
              NQ  +     P  +   S   E       + SW  +F       ++ P+E       
Sbjct: 57  RGGNQPHNMFGRAPAQMGHSSGVVE-------KESWLNEFS---KIAIKDPLEFSDRYKQ 106

Query: 118 INNGLRTSQTDFPL---GFYS---------QKNFNIASFPVVDHQIFKTTGLEHPINSHI 165
           + +   T QT   +   G+Y          Q+N+++    +      + T      ++ +
Sbjct: 107 LYSNYETRQTVTSMHNQGYYQPMSNMNLHVQQNWSMMEGQINQSMNSQETAHIDQAHNDL 166

Query: 166 DSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKE 225
           D  +  EF+++E    E D H+  E +  S   E+   K +A+DI++ C  N+ N  +  
Sbjct: 167 DMYLEKEFNDIELELQELDEHSHSEFA--SGVQEQQKFKEIATDIVDTCITNNKNSTMTS 224

Query: 226 RLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRIA 283
           +L  SKFMGLM  ISDG + LK D   + N      +    +G   G +FHD++D I+
Sbjct: 225 KLAKSKFMGLMMQISDGEVTLKIDEHPDANTNAKELYS-TKTGKTVGNDFHDIQDTIS 281
>ref|XP_446317.1| unnamed protein product [Candida glabrata]
 sp|Q6FTX7|PEX21_CANGA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG59241.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 232

 Score =  250 bits (638), Expect = 1e-64,   Method: Composition-based stats.
 Identities = 55/261 (21%), Positives = 97/261 (37%), Gaps = 51/261 (19%)

Query: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSF---NKIGSRLNSPPILKDKIELKFLQHSED 59
           S  C  + + + +S   KGP+        +   +  G   N        +E  FLQ++  
Sbjct: 2   SQYCAADPIQRMVS--RKGPYLDTQQQRVYKHRSGTGHGTNIRQPTTGIVEATFLQNNSG 59

Query: 60  LNQSRSYVNIRPRTLEDQSYKFEAPNL----NDNETSWAKDFRYNFPKNVEPPIENQIAN 115
           +N       + P TL  Q++   A       + +  SW  DF       V+ P+E   + 
Sbjct: 60  MNA------VHPTTLNVQTHNAHATGRTIAKSVDTNSWVNDFS---SMKVQDPLEFADSY 110

Query: 116 LNINNGLRTSQTDFPLGFYSQKNFNI-------ASFPVVDHQIFKTTGLEHPINSHIDSL 168
            N+       Q    +     +N  I        +  +VDH   K++         +D L
Sbjct: 111 KNLYKQYEQKQFHNTVPAQCPRNLYIPTVTRPQTTLNIVDHHEDKSSA--------VDQL 162

Query: 169 INAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLN 228
           I+ EF ++E    +++   E               +  A   IE C+       +KE+L 
Sbjct: 163 IDDEFEKIEEEVKDQNQKLE--------------FQESAQSFIEICE----RSQMKEKLQ 204

Query: 229 SSKFMGLMGSISDGSIVLKKD 249
            SKF  +M  +SDG   +++D
Sbjct: 205 RSKFFQVMQKVSDGEATIRQD 225
>ref|NP_983945.1| ADL151Wp [Ashbya gossypii ATCC 10895]
 sp|Q75AS1|PEX21_ASHGO Peroxisomal membrane protein PEX21 (Peroxin-21)
 gb|AAS51769.1| ADL151Wp [Ashbya gossypii ATCC 10895]
          Length = 307

 Score =  180 bits (456), Expect = 1e-43,   Method: Composition-based stats.
 Identities = 60/287 (20%), Positives = 92/287 (32%), Gaps = 35/287 (12%)

Query: 4   NRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDLNQS 63
           + CQ + + K I+ TE GP  G      FN+    L      +       LQ       S
Sbjct: 2   SLCQGSAMQKLIAKTE-GPMAGVGRVGGFNRPSGGLGQSSAEQQ------LQARAGERAS 54

Query: 64  RSYVNIRPRTLEDQSYKFEAPNLNDN-ETSWAKDFRYNFPKNVEPPIENQIANLNINNGL 122
           ++        LE Q          D  +  W + F       VE P+           G 
Sbjct: 55  QNRFM---AVLEPQRELGGRMARGDGLQADWVRQFS---SMQVEDPLAFSAEYQRAYAGY 108

Query: 123 RTSQTDFP----LGFYSQKNFNIASFPVVDHQIFKTTGLEHPI---NSHIDSLINAEFSE 175
              Q   P    LG+    +  + + P    Q               + ++  +  EF  
Sbjct: 109 EQRQAARPAARVLGYGG--SMFMPTMPQQQLQQQVAPQQTAQAALQEAELERYLEREFDV 166

Query: 176 LEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGL 235
           LE      +V   E  S   L+ E+   +  A  I        +    K++  +SKFMGL
Sbjct: 167 LEGELAPPEV--PEALSAPLLDHEQLGFQESAKAIYATL----SAPIHKDKFGASKFMGL 220

Query: 236 MGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
           M  +S G + L K          + G      G   G E+  V D +
Sbjct: 221 MRQVSTGDVTLSKSE------SGYTGLHMTAGGEAVGAEYRAVTDEV 261
>ref|NP_011755.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex18p; Pex21p [Saccharomyces cerevisiae]
 sp|P50091|PEX21_YEAST Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAA61191.1| ORF 288 [Saccharomyces cerevisiae]
 emb|CAA97267.1| unnamed protein product [Saccharomyces cerevisiae]
 gb|EDN61824.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 288

 Score =  145 bits (365), Expect = 5e-33,   Method: Composition-based stats.
 Identities = 55/295 (18%), Positives = 112/295 (37%), Gaps = 31/295 (10%)

Query: 4   NRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIG---SRLNSPPILKDK-IELKFLQHSED 59
           + C T+ + K I   ++G     D+ +   +     + L +    +D  +E  FL +  +
Sbjct: 3   SVCHTSPIEKII---QQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSN 59

Query: 60  LNQSRSY-VNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNI 118
            +   +     +P  L      F  PN   + + W   F       ++  +E       +
Sbjct: 60  FDGIDNVRYQNQPSPL-----TFITPNNTVDSSDWVPQFS---SMKIDDSLEFSSEYKRL 111

Query: 119 NNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPIN--SHIDSL-INAEFSE 175
            +   + Q       +      +       +   KT   +   N  +  D+   +AEF  
Sbjct: 112 YSNYESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQV 171

Query: 176 LEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCD--------NNSANKDVKERL 227
           LE    +E             + +++ ++ +A+DI++ C         +++ +  V+ +L
Sbjct: 172 LEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKL 231

Query: 228 NSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
           + SKF+ LM +IS G + LKK+   + N      F   N+G   G     V+D I
Sbjct: 232 SESKFIQLMRNISSGDVTLKKN--ADGN-SASELFS-SNNGELVGNRHIFVKDEI 282
>gb|AAS56603.1| YGR239C [Saccharomyces cerevisiae]
          Length = 288

 Score =  142 bits (359), Expect = 2e-32,   Method: Composition-based stats.
 Identities = 54/295 (18%), Positives = 111/295 (37%), Gaps = 31/295 (10%)

Query: 4   NRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIG---SRLNSPPILKDK-IELKFLQHSED 59
           + C T+ + K I   ++G     D+ +   +     + L +    +D  +E  FL +  +
Sbjct: 3   SVCHTSPIEKII---QQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSN 59

Query: 60  LNQSRSY-VNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNI 118
            +   +     +P  L      F  PN   + + W   F       ++  +E       +
Sbjct: 60  FDGIDNVRYQNQPSPL-----TFITPNNTVDSSDWVPQFS---SMKIDDSLEFSSEYKRL 111

Query: 119 NNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPIN--SHIDSL-INAEFSE 175
            +   + Q       +      +       +   KT   +   N  +  D+   +AEF  
Sbjct: 112 YSNYESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQV 171

Query: 176 LEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCD--------NNSANKDVKERL 227
           LE    +E             + +++ ++ +A+DI++ C         +++ +  V+ +L
Sbjct: 172 LEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKL 231

Query: 228 NSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
           +  KF+ LM +IS G + LKK+   + N      F   N+G   G     V+D I
Sbjct: 232 SEWKFIQLMRNISSGDVTLKKN--ADGN-SASELFS-SNNGELVGNRHIFVKDEI 282
>ref|XP_454648.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CN41|PEX21_KLULA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG99735.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 306

 Score =  139 bits (350), Expect = 2e-31,   Method: Composition-based stats.
 Identities = 47/293 (16%), Positives = 102/293 (34%), Gaps = 30/293 (10%)

Query: 4   NRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQ--HSEDLN 61
           + CQTN +N  +S +       R    + ++            + ++ +F Q   +   +
Sbjct: 2   SVCQTNPLNTLVSKSGY-----RFGQPNQHQQSIAQQQSRPRNNALDHQFSQFTGASGPS 56

Query: 62  QSRSYVNIRP-----RTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANL 116
            +       P           +          +   W   F       V    E      
Sbjct: 57  FAHPQHQQLPMAAVVANATTAASTTSNTATASDHHVWIDQFAN---MQVHDKTEFPTDYK 113

Query: 117 NINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSEL 176
            + +        + +G     + +   FP    Q      LE    +   + ++ +F+EL
Sbjct: 114 QMYSQYEARGASYSMGAPVMVSHSQM-FPRHQ-QSLMVNQLESQAYASSSAKLDEQFAEL 171

Query: 177 EASSLEEDVHTEEENS-------GTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNS 229
           E    +++   +++          + L++++  +K  A  I      +  +     + ++
Sbjct: 172 ERQVQDDEKEQQQDKDDDFHLKETSPLDEDQRQLKEAAQSIYT--TLSDKSSTTSSKFSN 229

Query: 230 SKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
           SKF+GLM +ISDG I LKK+   ++       +    +G   G E+  V+D +
Sbjct: 230 SKFLGLMRNISDGVITLKKNPDEDKYT---ELYSPS-TGETFGEEYFPVQDSV 278
>ref|XP_447344.1| hypothetical protein CAGL0I02134g [Candida glabrata CBS138]
 emb|CAG60281.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 259

 Score = 51.5 bits (122), Expect = 8e-05,   Method: Composition-based stats.
 Identities = 50/302 (16%), Positives = 104/302 (34%), Gaps = 69/302 (22%)

Query: 4   NRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSE--DLN 61
           + C +N +    S T    FT R    ++     R      + + +E +FL      D  
Sbjct: 2   SACHSNPLQSLSSKTH--TFTTRTRAHNYEHNHDR------ISNPLEREFLHQDSIYDTA 53

Query: 62  QSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINNG 121
              +    +PR +         P      T W  +F+     +  P ++           
Sbjct: 54  TPVAIPIQQPRQV---------PRDTKKNTQWLSEFQ-QLSISDVPSVKPSFT------- 96

Query: 122 LRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDS--------LINAEF 173
                          +    ++F ++++ + K +  +   +  ID           N EF
Sbjct: 97  ---------------RPAPTSTFRIINNDVRKDSLFKFQEHPIIDRLSPEAREDYYNNEF 141

Query: 174 SELEASSLEEDVHTE----EENSGTSLE-------DEETAMKGLASDIIEFCDNNSA--N 220
            +LE     +D   +    ++    + E       ++++  +  A DI+    N++   +
Sbjct: 142 EQLEKELENDDDVQDGDQFQDLENKTYEYYNEFAFNDQSHFQQSARDILNNMSNSTHEYS 201

Query: 221 KDVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVED 280
            ++ E+L+ S F+ L+ SI  G + L+       N Q        N     G ++ D+ D
Sbjct: 202 SELNEKLSGSTFVNLLKSIDIGDVSLQ------DNRQGPKELKNSNDNTTLGNKYADIPD 255

Query: 281 RI 282
            I
Sbjct: 256 YI 257
>gb|AAX14715.1| Pex20p [Pichia angusta]
          Length = 309

 Score = 48.4 bits (114), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 19/86 (22%), Positives = 33/86 (38%), Gaps = 3/86 (3%)

Query: 169 INAEFSELEASSLEED-VHTEEENSGTSLEDEETAMKGLASDIIEFCDN--NSANKDVKE 225
           ++A F ELE    +E     + E      ED++     LA  +           +K   +
Sbjct: 172 VDAAFDELEHEMHDEKMEEQQAEQVKPQNEDDKIKFAQLARSVFLTMSQPPADISKQTSD 231

Query: 226 RLNSSKFMGLMGSISDGSIVLKKDNG 251
           +   S F+ LM  IS   + + +D  
Sbjct: 232 KFQKSNFLKLMNRISTREVEMNEDRD 257
>ref|XP_001226956.1| hypothetical protein CHGG_09029 [Chaetomium globosum CBS 148.51]
 gb|EAQ85015.1| hypothetical protein CHGG_09029 [Chaetomium globosum CBS 148.51]
          Length = 368

 Score = 44.2 bits (103), Expect = 0.012,   Method: Composition-based stats.
 Identities = 30/163 (18%), Positives = 53/163 (32%), Gaps = 35/163 (21%)

Query: 117 NINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSEL 176
            + +     Q        +Q  +     P VD     TTG+  P           E   +
Sbjct: 220 ELADWTEKQQVANTEFAEAQDKWMAQHGPRVD----PTTGVTVPSP--------DEMEAI 267

Query: 177 EASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLM 236
           +A+  +     ++  S   L          A DI+     NS + +  E+  +S+F  LM
Sbjct: 268 DANLEKLAQEQDKRRSDDDLVH-------AAIDIV-----NSVSDNTSEKFKNSRFFELM 315

Query: 237 GSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVE 279
             I +  +V++ D           GF    +G        D +
Sbjct: 316 RRIGNREVVVEGD-----------GFVNAATGEAVNTSHDDEQ 347
>ref|XP_459664.1| hypothetical protein DEHA0E08701g [Debaryomyces hansenii CBS767]
 emb|CAG87898.1| unnamed protein product [Debaryomyces hansenii CBS767]
          Length = 413

 Score = 42.2 bits (98), Expect = 0.042,   Method: Composition-based stats.
 Identities = 38/267 (14%), Positives = 84/267 (31%), Gaps = 35/267 (13%)

Query: 8   TNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQ--HSEDLNQSRS 65
           +N +      T++      +   +  + G+   +  I+  ++   F Q  + +  NQ + 
Sbjct: 9   SNALQNLSKHTQRDNTLQNEIIRNQPQQGNGFRNQAIVDQRLNQDFQQFSNGQAFNQYQP 68

Query: 66  YVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRY--------NFPKNVEPPIENQIANLN 117
               +      + +     + +  +  W +DF          +   N++        +  
Sbjct: 69  EPVHQQFHQHHRQHNQAQHSQHHPQQGWVQDFSKMSINQQPRHQAGNIQNGKVQNDWHQQ 128

Query: 118 INNGLRTSQTDFPLGFYSQKN---------------FNIASFPVVDHQIFKTTGLEHPIN 162
             N          + +  Q                        V +H+       E+ I 
Sbjct: 129 FMNQTSQHPAQNNVAYQPQHQMGGFRMNMRTNLSNSLYQTRGNVPEHKEMHKMEEENQIF 188

Query: 163 SHIDSLINAEFSELEASSLEEDVHTEE--ENSGTSLEDEETAMKGLASDIIEFCDNNSAN 220
                  +A+F ++E     E++  E+  E      E E+      A  + E       N
Sbjct: 189 -------DAQFDQMERELQHENMSNEQTAEVQMDGNEFEKEQFAKTAKQVQESM-LKGEN 240

Query: 221 KDVKERLNSSKFMGLMGSISDGSIVLK 247
            +   +  +S F+ LM SIS+ S+ L 
Sbjct: 241 AETNAKFQNSNFLKLMSSISNRSVELS 267
>ref|XP_569156.1| peroxisome targeting sequence binding protein [Cryptococcus
           neoformans var. neoformans JEC21]
 ref|XP_777112.1| hypothetical protein CNBB3440 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gb|EAL22465.1| hypothetical protein CNBB3440 [Cryptococcus neoformans var.
           neoformans B-3501A]
 gb|AAW41849.1| peroxisome targeting sequence binding protein, putative
           [Cryptococcus neoformans var. neoformans JEC21]
          Length = 799

 Score = 41.9 bits (97), Expect = 0.050,   Method: Composition-based stats.
 Identities = 22/158 (13%), Positives = 49/158 (31%), Gaps = 10/158 (6%)

Query: 90  ETSWAKDFRYNFPKNVEPPIENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDH 149
           ++ W ++F  +      PP+    +   + N            +  Q+   +       H
Sbjct: 118 KSGWQEEFSQH--VAAAPPLGTTRSQKPLYNAGLAPWEVPETQYQLQRPA-LMRPSFRSH 174

Query: 150 QIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASD 209
            I  +          I +   A  S+ E             +    L++ +  +   A  
Sbjct: 175 DIPISEASLTFPRPDIHAA--APVSKYEEQI-----GQPTGSETLPLDESQQLLARTARS 227

Query: 210 IIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLK 247
            +   +  S       +   SKF+ L+  + D  +V+K
Sbjct: 228 FVSNLETQSDILSANPKFAQSKFLSLLRGLGDEQVVVK 265
>ref|YP_070357.1| hypothetical protein YPTB1831 [Yersinia pseudotuberculosis IP 32953]
 emb|CAH21070.1| bacteriophage hypothetical protein [Yersinia pseudotuberculosis IP
            32953]
          Length = 2133

 Score = 41.5 bits (96), Expect = 0.074,   Method: Composition-based stats.
 Identities = 42/180 (23%), Positives = 67/180 (37%), Gaps = 27/180 (15%)

Query: 4    NRCQTNEVNKFISSTEKGPF-TGRDNTLSFNKIGSRLNSPPILKDKIELKFLQ-----HS 57
            N   T +V+   S    G F T R         G R      ++ ++   F+Q     ++
Sbjct: 1206 NYFGTTDVHGEASGKRSGAFVTDRKRPRYEFSEGKRSEITHPVRQEVHDAFMQVVNVVNN 1265

Query: 58   EDLNQSRSYVNIR------PRTLEDQSYKFEAPNLNDNETSWA-KDFRYNFPKNVEP--- 107
             D+ +  + ++           LE  +  FE   L+  E S    D+  N  K  E    
Sbjct: 1266 SDMMKRATVLDASRSKPYWTTKLEMSARAFERYLLDKAEKSGISNDYMANLRKADEHANP 1325

Query: 108  -----PIENQI------ANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTG 156
                 P E ++      A  ++ N L+T  TD  + FYS+KN NI    V+       TG
Sbjct: 1326 ETYAYPTEAELSGGVRQAFEHLFNTLKTKPTDKGIAFYSRKNTNIEQGNVISDTGHTVTG 1385
>ref|XP_001385675.2| hypothetical protein PICST_84845 [Pichia stipitis CBS 6054]
 gb|ABN67646.2| predicted protein [Pichia stipitis CBS 6054]
          Length = 402

 Score = 41.5 bits (96), Expect = 0.080,   Method: Composition-based stats.
 Identities = 26/133 (19%), Positives = 47/133 (35%), Gaps = 10/133 (7%)

Query: 126 QTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDV 185
           +T+     Y+Q+        V            H +      L ++ F +LE    ++  
Sbjct: 163 RTNLSTPLYAQQ-------QVQTGASLTEHQEIHKMEQE-KQLFDSHFDQLEKELNQQSQ 214

Query: 186 HTEEENSGTSLEDEETAMKGLASDIIEFC-DNNSANKDVKERLNSSKFMGLMGSISDGSI 244
              E        + E      A  I       ++A+   K ++ +S F+ LM SIS+  +
Sbjct: 215 EKPEVEVQVDKVENE-QFAETARQIENSLRQFDTADAATKAKIENSDFLKLMSSISNKQV 273

Query: 245 VLKKDNGTERNLQ 257
           VL  D   +   Q
Sbjct: 274 VLDGDKLVDSTGQ 286
>ref|XP_001731443.1| hypothetical protein MGL_1626 [Malassezia globosa CBS 7966]
 gb|EDP44229.1| hypothetical protein MGL_1626 [Malassezia globosa CBS 7966]
          Length = 813

 Score = 40.3 bits (93), Expect = 0.17,   Method: Composition-based stats.
 Identities = 38/230 (16%), Positives = 76/230 (33%), Gaps = 44/230 (19%)

Query: 55  QHSEDLNQSRSYVNIRPRT--LEDQSYKFEAPNLNDNETS----WAKDF-----RYNFPK 103
           Q  E L Q   +    P    +E   ++ +   +  + TS    WA+DF           
Sbjct: 106 QLRESLPQVVPHEQWAPSQAVMETSFHRQQHTPVPVSSTSATPAWAQDFLHATRTSTKHT 165

Query: 104 NVEP-----PIENQIANLNINNGLR--TSQTDFPLGFYSQKNFNIASFPVVDHQ---IFK 153
            V+P     P+    A   +  G+    SQ   PL             PV+         
Sbjct: 166 AVQPASNIHPVPAGAAAATVPRGMPPLASQFHRPLHA----------VPVMGQHLQDSLV 215

Query: 154 TTGLEHPINSHIDSLINAEFSELEASSLEEDVH----TEEENSGTSLEDEETAMKGLASD 209
           +      +     S  ++ F+ ++A + + +      +E +    + +      + L   
Sbjct: 216 SRAHNDDLAYVSQSTWDSAFTAIDAQTQQPNESAAIPSETQLDTMNADQLAQTAERL--- 272

Query: 210 IIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKH 259
                  N+   D  E+   S+F+ LM  + D    ++ D   +R+ +  
Sbjct: 273 ------LNAVQHDSSEKFQQSEFLQLMRKLRDRVAEVRGDQIVDRDAKGK 316
>ref|XP_001524003.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
 gb|EDK46635.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 489

 Score = 40.3 bits (93), Expect = 0.17,   Method: Composition-based stats.
 Identities = 16/108 (14%), Positives = 42/108 (38%), Gaps = 13/108 (12%)

Query: 148 DHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEEN--SGTSLEDEETAMKG 205
             +  +   LE       ++ + +EF  +E    E +   +E N             ++ 
Sbjct: 214 QSEHQQVHKLEDQ-----ETQLESEFDAVEKELQEVESSMQETNLHDKDLFAQTARKVES 268

Query: 206 LASDIIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLKKDNGTE 253
             +        N+ + ++K +  +S+F+ LM  +++  + L+ D   +
Sbjct: 269 SMNS------LNTTDAEMKSKFENSEFLKLMNRVANKQVELQDDKLVD 310
>emb|CAA97269.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 123

 Score = 39.9 bits (92), Expect = 0.21,   Method: Composition-based stats.
 Identities = 20/132 (15%), Positives = 45/132 (34%), Gaps = 16/132 (12%)

Query: 4   NRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIG---SRLNSPPILKDK-IELKFLQHSED 59
           + C T+ + K I   ++G     D+ +   +     + L +    +D  +E  FL +  +
Sbjct: 3   SVCHTSPIEKII---QQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSN 59

Query: 60  LNQSRSY-VNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNI 118
            +   +     +P  L      F  PN   + + W   F       ++  +E       +
Sbjct: 60  FDGIDNVRYQNQPSPL-----TFITPNNTVDSSDWVPQFS---SMKIDDSLEFSSEYKRL 111

Query: 119 NNGLRTSQTDFP 130
            +   + Q    
Sbjct: 112 YSNYESQQRLNS 123
>ref|NP_683055.1| aminopeptidase P [Thermosynechococcus elongatus BP-1]
 dbj|BAC09817.1| aminopeptidase P [Thermosynechococcus elongatus BP-1]
          Length = 435

 Score = 39.5 bits (91), Expect = 0.25,   Method: Composition-based stats.
 Identities = 16/46 (34%), Positives = 23/46 (50%), Gaps = 4/46 (8%)

Query: 233 MGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDV 278
           +GL+     G I    + G E   QK+  F    +G+W GL+ HDV
Sbjct: 323 LGLLR----GEITTLINEGKENQTQKYRTFFMHGTGHWLGLDVHDV 364
>ref|XP_001877777.1| predicted protein [Laccaria bicolor S238N-H82]
 gb|EDR11880.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 702

 Score = 39.2 bits (90), Expect = 0.32,   Method: Composition-based stats.
 Identities = 30/131 (22%), Positives = 49/131 (37%), Gaps = 13/131 (9%)

Query: 136 QKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLI--NAEFS----ELEASSLEEDVHTEE 189
           Q N  +  F +     +             +  I  + EF+     LE SS  E    +E
Sbjct: 154 QWNLGLTQFRMNQTPAYAPALHMQQQPVISNKRISWDREFNAQVLSLETSSTMEIEQKQE 213

Query: 190 ENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLKKD 249
               T  E ++ A    A  ++E    N  N+    +   S+F+GLM  + DG +V+  +
Sbjct: 214 IYQQTPREQDDLA--RTAGMLLE----NIKNEQ-NPKFQKSQFLGLMKQLRDGEVVVDGN 266

Query: 250 NGTERNLQKHV 260
              E N Q   
Sbjct: 267 QMVENNGQTSQ 277
>ref|XP_001906106.1| unnamed protein product [Podospora anserina]
 emb|CAP66772.1| unnamed protein product [Podospora anserina]
          Length = 374

 Score = 39.2 bits (90), Expect = 0.35,   Method: Composition-based stats.
 Identities = 26/147 (17%), Positives = 52/147 (35%), Gaps = 20/147 (13%)

Query: 110 ENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLI 169
           E + A  ++ +     Q D    +  Q+            Q               D+ +
Sbjct: 198 EVKDAFADLFDQYEQ-QADQLTDYQRQEQEFEQE------QAKWMAEHGPKALPPTDAEM 250

Query: 170 NAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNS 229
            A  SE+E  + E++ H+           E   +   A DI+     N++     ++   
Sbjct: 251 AAINSEMERIADEQEEHSRRR--------ENADLARAAEDILRAVSGNNS-----DKFKH 297

Query: 230 SKFMGLMGSISDGSIVLKKDNGTERNL 256
           S F+ LM  I+   IV+ ++N  + + 
Sbjct: 298 SNFLELMRRIAASEIVVNEENFIDADT 324
>ref|XP_001321780.1| hypothetical protein TVAG_276300 [Trichomonas vaginalis G3]
 gb|EAY09557.1| hypothetical protein TVAG_276300 [Trichomonas vaginalis G3]
          Length = 2819

 Score = 38.8 bits (89), Expect = 0.41,   Method: Composition-based stats.
 Identities = 38/163 (23%), Positives = 62/163 (38%), Gaps = 28/163 (17%)

Query: 48   KIELKFLQHSEDLNQ-SRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDF--RYNFPKN 104
             ++ KF  + E LN     Y                  NL +   S A+D        + 
Sbjct: 2474 SVDDKF--NDESLNIFPTIYFGR---------------NLTEYSQSCAEDLIKSPGLKEI 2516

Query: 105  VEPPIENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSH 164
            V+   ENQ+ N+ I+   R      PL  Y   +F      V  H  F+++ + H  N  
Sbjct: 2517 VQDK-ENQLFNV-ISKVFRLVSRVTPLTRY--FDFMRTFKYVYRHLSFESSHVHHLANES 2572

Query: 165  ID--SLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKG 205
            ID    I  + ++LE      ++    EN  T  +D+E+ +K 
Sbjct: 2573 IDFVMTIENQVAQLEQEIK--NLEGCSENKDTEFDDKESKLKE 2613
>gb|EDU39990.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 489

 Score = 38.8 bits (89), Expect = 0.51,   Method: Composition-based stats.
 Identities = 17/73 (23%), Positives = 36/73 (49%), Gaps = 7/73 (9%)

Query: 177 EASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLM 236
           E      +   EE++      D+E  M   A+ ++E   +N++     E+  +S+F+GLM
Sbjct: 355 EHILQRPEPEQEEDSQPPPNTDDE--MAETAARLLERVADNTS-----EKFQNSQFLGLM 407

Query: 237 GSISDGSIVLKKD 249
             + D  + +++D
Sbjct: 408 RRLRDREVRVQED 420
>ref|XP_001029521.1| hypothetical protein TTHERM_01466260 [Tetrahymena thermophila SB210]
 gb|EAR81858.1| hypothetical protein TTHERM_01466260 [Tetrahymena thermophila SB210]
          Length = 2847

 Score = 38.4 bits (88), Expect = 0.55,   Method: Composition-based stats.
 Identities = 41/175 (23%), Positives = 68/175 (38%), Gaps = 25/175 (14%)

Query: 51   LKFLQHSEDLNQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIE 110
            LK LQ+ +   QS + + I+     +Q+     P   +N      +F     +  +   E
Sbjct: 2518 LKVLQNDQKYQQSLAEITIK-NDCNNQNKNTSQPANLENSPQAKNEFFDARKEIYQDKKE 2576

Query: 111  NQIANLNI----NNGLRTS-------------QTDF-PLGFYSQKNFNIASFPVVDHQIF 152
            N  AN  I    N  L T              Q +  PL   SQ N     F  V+ Q+F
Sbjct: 2577 NIFANNEISTEKNENLETKNQNQLKKSDSIFSQNENQPLQEQSQNNKKQKIFSNVEFQMF 2636

Query: 153  KTTGLEHPINSHIDSLINAEFSELE-----ASSLEEDVHTEEENSGTSLEDEETA 202
            KT  L   IN  I    + +F++ E     + S ++D++ +      S+E ++  
Sbjct: 2637 KTEKLSEQINDQI-FKSDIQFTKNESIKQKSESQKDDLNIQNLTKTISIESKQEE 2690
>ref|XP_368788.1| hypothetical protein MGG_00456 [Magnaporthe grisea 70-15]
 gb|EDK02931.1| hypothetical protein MGG_00456 [Magnaporthe grisea 70-15]
          Length = 1598

 Score = 38.4 bits (88), Expect = 0.57,   Method: Composition-based stats.
 Identities = 26/137 (18%), Positives = 57/137 (41%), Gaps = 15/137 (10%)

Query: 155  TGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMK---GLASDI- 210
            T +E  + +H+  L + E   L+   L+E V   + N  +  + E+  ++    LA+DI 
Sbjct: 1072 TQMETMLENHL--LDDLEEDVLQE--LDEQVKVNQANHASKCDSEQRLLETYTDLATDIE 1127

Query: 211  ----IEFCDNNSANKDVKE--RLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCF 264
                +   +        ++  +L++S F G +GS+ DG++     +  +R   ++  F  
Sbjct: 1128 EERMVRVKEMAYKASQKEDDRKLSAS-FKGRVGSLDDGTMTTSTPDKGKRKSTRNEPFSP 1186

Query: 265  QNSGNWAGLEFHDVEDR 281
                  +  +     D 
Sbjct: 1187 SLRPKVSTADLMFAMDE 1203
>ref|YP_001816720.1| hypothetical protein VF_2649 [Vibrio fischeri ES114]
 gb|ACB55688.1| hypothetical protein VF_2649 [Vibrio fischeri ES114]
          Length = 201

 Score = 38.4 bits (88), Expect = 0.61,   Method: Composition-based stats.
 Identities = 10/90 (11%), Positives = 31/90 (34%), Gaps = 10/90 (11%)

Query: 139 FNIASFPVVDHQIFKTTGLEHPINS-HIDSLINAEFSELEASSLEEDVHTEEENSGTSLE 197
           + +     +      +    +P+++ +I+S I   F + +          E         
Sbjct: 75  WKLKQLNKLASIELLSDADLNPLHNFNIESYITRHFDKRKRHISRNTQENETRR------ 128

Query: 198 DEETAMKGLASDIIEFCDNNSANKDVKERL 227
                ++ + + +I+  + +  NK    +L
Sbjct: 129 ---YELQEICNSLIDTLEKHHFNKITSSKL 155
>ref|ZP_01105153.1| transcription elongation factor NusA [Flavobacteriales bacterium
           HTCC2170]
 gb|EAR02238.1| transcription elongation factor NusA [Flavobacteriales bacterium
           HTCC2170]
          Length = 410

 Score = 38.0 bits (87), Expect = 0.77,   Method: Composition-based stats.
 Identities = 24/112 (21%), Positives = 40/112 (35%), Gaps = 20/112 (17%)

Query: 157 LEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAM-----KGLASD-- 209
           L   I+ H ++ I  +F +LE      +VH     +   L+DE   +     + + SD  
Sbjct: 117 LISKIHEHDNTTIYKQFKDLEGEIYTAEVHHIRHKAVILLDDEGNEIILPKDRQIPSDFF 176

Query: 210 --------IIEFCDNNSANKDV-----KERLNSSKFMGLMGSISDGSIVLKK 248
                   IIE  +       +       R     F   +  + DG I +KK
Sbjct: 177 RKGDNVRGIIESVELKGNKPSIIMSRSSPRFLEQLFFQEIPEVFDGLITIKK 228
>ref|ZP_01119447.1| transcription elongation factor NusA [Robiginitalea biformata
           HTCC2501]
 gb|EAR15975.1| transcription elongation factor NusA [Robiginitalea biformata
           HTCC2501]
          Length = 410

 Score = 38.0 bits (87), Expect = 0.81,   Method: Composition-based stats.
 Identities = 24/112 (21%), Positives = 39/112 (34%), Gaps = 20/112 (17%)

Query: 157 LEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAM-----KGLASD-- 209
           L   I+ H ++ I  +F +LE      +VH     +   L+DE   +     K + SD  
Sbjct: 117 LIAKIHEHDNTTIYKQFKDLEGEIYTAEVHHIRHKAIILLDDEGNELIMPKEKQIPSDFF 176

Query: 210 --------IIEFCDNNSAN-----KDVKERLNSSKFMGLMGSISDGSIVLKK 248
                   IIE  +               +     F   +  + DG I +KK
Sbjct: 177 RKGDNVRGIIESVELKGNKPLIIMSRTSPKFLEQLFHQEIPEVFDGLITIKK 228
>ref|XP_001482477.1| hypothetical protein PGUG_05497 [Pichia guilliermondii ATCC 6260]
 gb|EDK41399.1| hypothetical protein PGUG_05497 [Pichia guilliermondii ATCC 6260]
          Length = 439

 Score = 37.6 bits (86), Expect = 0.92,   Method: Composition-based stats.
 Identities = 20/106 (18%), Positives = 36/106 (33%), Gaps = 18/106 (16%)

Query: 144 FPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAM 203
            P+ +HQ                   + +F  LE    + + H E +             
Sbjct: 181 GPITEHQEIHRDDHRQLAF-------DEQFDRLEQELAQAETHNEMDKE---------EF 224

Query: 204 KGLASDIIEF--CDNNSANKDVKERLNSSKFMGLMGSISDGSIVLK 247
              A  + E    DN   +++  ++   S F+ LM SI D S+ + 
Sbjct: 225 ARTAGRVKETMLVDNPERSEETTQKFQQSNFLKLMTSIHDRSVEIS 270
>emb|CAF97724.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 1412

 Score = 37.2 bits (85), Expect = 1.2,   Method: Composition-based stats.
 Identities = 18/95 (18%), Positives = 36/95 (37%), Gaps = 1/95 (1%)

Query: 146 VVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKG 205
           VVD  +     L     S  +  +N  FS +                  + ED   A++ 
Sbjct: 393 VVDSMVSLCRELCPMSCSAAER-LNPPFSSISEQVSIPRASIRNALMERAAEDINRALEE 451

Query: 206 LASDIIEFCDNNSANKDVKERLNSSKFMGLMGSIS 240
           +   ++ +  N+  ++ ++E   S K + LM  +S
Sbjct: 452 VKLSVVSYLTNSIVDQILQELYTSHKTLVLMRQVS 486
>ref|YP_001336098.1| hypothetical protein KPN_02450 [Klebsiella pneumoniae subsp.
           pneumoniae MGH 78578]
 gb|ABR77868.1| hypothetical protein KPN_02450 [Klebsiella pneumoniae subsp.
           pneumoniae MGH 78578]
          Length = 323

 Score = 36.8 bits (84), Expect = 1.8,   Method: Composition-based stats.
 Identities = 13/52 (25%), Positives = 25/52 (48%)

Query: 132 GFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEE 183
                    + SF ++DHQ    +  E  I S + +++  EF  L+A++ +E
Sbjct: 117 PVQLPDGATVGSFCLMDHQPRSFSAHEMQILSDLAAIVEDEFKVLDAATSDE 168
>ref|XP_001679672.1| Hypothetical protein CBG02534 [Caenorhabditis briggsae AF16]
 emb|CAP23997.1| C. briggsae CBR-VPS-35 protein [Caenorhabditis briggsae]
          Length = 787

 Score = 36.5 bits (83), Expect = 2.5,   Method: Composition-based stats.
 Identities = 15/89 (16%), Positives = 36/89 (40%), Gaps = 8/89 (8%)

Query: 113 IANLNINNGLRTSQTDFPLGFYSQK-NFNIASFPVVDHQIFKTTGLEHPINSHIDSLINA 171
               N+       +    + ++  +   NIA++ +V + + + T L H  +      ++A
Sbjct: 390 DEYKNVLRLSELPEYIKVMNYFDYRGQCNIAAY-MVQNMLEQETILRHQED------VDA 442

Query: 172 EFSELEASSLEEDVHTEEENSGTSLEDEE 200
            FS + A   +++   E  +      DE+
Sbjct: 443 AFSLISALLKDQEKQPESSHETEEFADEQ 471
>ref|ZP_02427325.1| hypothetical protein CLORAM_00703 [Clostridium ramosum DSM 1402]
 gb|EDS19827.1| hypothetical protein CLORAM_00703 [Clostridium ramosum DSM 1402]
          Length = 1149

 Score = 35.7 bits (81), Expect = 3.6,   Method: Composition-based stats.
 Identities = 29/174 (16%), Positives = 51/174 (29%), Gaps = 14/174 (8%)

Query: 12  NKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDLNQSRSYVNIRP 71
           N+  S TE           +     S  N     +++I+     +    N    Y     
Sbjct: 253 NQMESLTE---VKTTFRGQNSGTSQSHGNYIQATENEIKNNAASNGVTPNSDGQYDFSER 309

Query: 72  RTLEDQSY------KFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQIANLNINNGLRTS 125
           R +E Q              LN +      D       NV     +      I N    +
Sbjct: 310 RDMETQGNYYYLYNNVLFVTLNTSAYPGGNDEENANNPNVSSASRDNSEAEAIVNNFAQT 369

Query: 126 QTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEAS 179
                  + +Q  +      +V H     T  +H  +S I++ ++A F +L   
Sbjct: 370 LKSATSEYQNQYQWI-----IVTHHKSTQTVAKHAADSDIENYVDAGFEKLMDE 418
>ref|XP_001556458.1| hypothetical protein BC1G_05227 [Botryotinia fuckeliana B05.10]
 gb|EDN24065.1| hypothetical protein BC1G_05227 [Botryotinia fuckeliana B05.10]
          Length = 355

 Score = 35.3 bits (80), Expect = 4.6,   Method: Composition-based stats.
 Identities = 8/46 (17%), Positives = 22/46 (47%)

Query: 216 NNSANKDVKERLNSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVG 261
            +S + +  ++  +S+F+ LM    D  + ++ D   +  ++   G
Sbjct: 291 LSSVSNNQSDKFQNSQFLQLMRQFRDKEVTVEGDKVVDGPMKTKQG 336
>ref|XP_001797478.1| hypothetical protein SNOG_07125 [Phaeosphaeria nodorum SN15]
 gb|EAT85776.2| hypothetical protein SNOG_07125 [Phaeosphaeria nodorum SN15]
          Length = 362

 Score = 35.3 bits (80), Expect = 4.6,   Method: Composition-based stats.
 Identities = 16/71 (22%), Positives = 33/71 (46%), Gaps = 5/71 (7%)

Query: 187 TEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVL 246
             E+       ++E  M   A  ++E   +N++     E+  +S+F+ LM  + D  + +
Sbjct: 233 QPEQKQEQPPLNDEDEMAATAGRLLERVADNTS-----EKFQNSQFLQLMRRLRDHEVRV 287

Query: 247 KKDNGTERNLQ 257
           ++D   E N Q
Sbjct: 288 EEDKIVEINAQ 298
>ref|XP_001141615.1| PREDICTED: adducin 3 (gamma) isoform 21 [Pan troglodytes]
          Length = 699

 Score = 35.3 bits (80), Expect = 4.9,   Method: Composition-based stats.
 Identities = 34/176 (19%), Positives = 57/176 (32%), Gaps = 18/176 (10%)

Query: 60  LNQSRSYVNIRPRTLEDQSYKFEAPNLNDNETSWAK-----DFRYNFPKNVEPPIENQIA 114
           LN   +Y+ +    + ++S   E       + +W K           P  +E P +    
Sbjct: 440 LNSPNTYMKV---NVPEESRNGETSPRT--KITWMKAEDSSKVSGGTPIKIEDPNQFVPL 494

Query: 115 NLNINNGLRTSQTDF-PLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEF 173
           N N N     ++ D    G  SQ    I          F+      P   +        F
Sbjct: 495 NTNPNEIREQNRYDLKTAGPQSQLLAGIVVDKPPSTMQFEDDDHGPPAPPN-------PF 547

Query: 174 SELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNS 229
           S L    LEE   T E       + E+  +   AS + +      + ++V E+L  
Sbjct: 548 SHLTEGELEEYKRTIERKQQGLEDAEQELLSDDASSVSQIQSQTQSPQNVPEKLEE 603
>ref|NP_001012836.1| peroxisomal biogenesis factor 5 [Gallus gallus]
 sp|Q5ZMQ9|PEX5_CHICK Peroxisomal targeting signal 1 receptor (Peroxisome receptor 1)
           (Peroxisomal C-terminal targeting signal import
           receptor) (PTS1-BP) (Peroxin-5) (PTS1 receptor)
 emb|CAG30984.1| hypothetical protein [Gallus gallus]
          Length = 645

 Score = 35.3 bits (80), Expect = 4.9,   Method: Composition-based stats.
 Identities = 19/92 (20%), Positives = 37/92 (40%), Gaps = 10/92 (10%)

Query: 171 AEFSELEASSLEEDVHTEEENSGTSLEDEETA--MKGLASDIIEFCDNNSANKDVKERLN 228
            E+ E     L      ++  +    E+ +    +K  ASD          +K    +LN
Sbjct: 163 EEYLEQSEEKLWLGESEDQSLADKWYEEYQPEDDLKKTASDF--------LSKVDDPKLN 214

Query: 229 SSKFMGLMGSISDGSIVLKKDNGTERNLQKHV 260
           SS+F+  +  I DG + ++ +  T R+  +  
Sbjct: 215 SSEFLKFVRQIGDGRVSIEANQVTHRDQDQAE 246
>ref|XP_001019542.2| DEAD/DEAH box helicase family protein [Tetrahymena thermophila
           SB210]
 gb|EAR99297.2| DEAD/DEAH box helicase family protein [Tetrahymena thermophila
           SB210]
          Length = 1392

 Score = 35.3 bits (80), Expect = 5.4,   Method: Composition-based stats.
 Identities = 34/142 (23%), Positives = 54/142 (38%), Gaps = 12/142 (8%)

Query: 103 KNVEPPIENQIANLNINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPIN 162
           KN+   +ENQ  + +I+     S+  F +GF     FN  +  +VD          +  N
Sbjct: 115 KNLNKELENQFTSKSIDEFNINSKNPFSVGFQ----FNELTNELVDFNNEMIN--VNFKN 168

Query: 163 SHIDSLINAEFSELEASS----LEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNS 218
              +      F   + SS     +    T++    ++ E  E     LA +I EFC  N 
Sbjct: 169 FSSEGYTGKIFQIGDKSSNVEDEDLSYKTKKYLPYSNKEQIEQDFLDLAKNIDEFCKQNE 228

Query: 219 ANKDVK--ERLNSSKFMGLMGS 238
              + K   R +  K   LM S
Sbjct: 229 KTTETKKNSRSDKGKNENLMES 250
>ref|XP_977174.1| hypothetical protein TTHERM_00037740 [Tetrahymena thermophila
           SB210]
 gb|EAR86437.1| hypothetical protein TTHERM_00037740 [Tetrahymena thermophila
           SB210]
          Length = 4844

 Score = 35.3 bits (80), Expect = 5.6,   Method: Composition-based stats.
 Identities = 19/96 (19%), Positives = 37/96 (38%), Gaps = 2/96 (2%)

Query: 136 QKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTS 195
           Q+NF       V     ++ G     N  +    N +  ++E    +     E+E     
Sbjct: 53  QENFKQNLNQNVQDIQIESNGKIDE-NQGLLQKTNEDKQQMEIEMQKLKEE-EQEKDKGK 110

Query: 196 LEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSK 231
            +  E A+K   +  I+    ++  K VK++ N +K
Sbjct: 111 QDKFEQALKEEDAIPIDIGRTSTKAKTVKDKGNKNK 146
>gb|EEH49084.1| peroxin 20 [Paracoccidioides brasiliensis Pb18]
          Length = 808

 Score = 34.9 bits (79), Expect = 6.1,   Method: Composition-based stats.
 Identities = 43/283 (15%), Positives = 82/283 (28%), Gaps = 44/283 (15%)

Query: 8   TNEVNKFISSTEKGPFTGRDNTLSFNKIGSRLNSPPILKDKIELKFLQHSEDLNQSRSYV 67
           +N +  F   +       +D  LS         S       ++ +F      L       
Sbjct: 9   SNALQNFQKHSSVDRTLQQDRILSRQSPAQGFRSQNHNDGALDPEFNAFEAGLPGPPLSD 68

Query: 68  NIRPRTLEDQSYKFEAPNLNDNET-SWAKDF--------RYNFPKNVEPPIENQIANLN- 117
              P +    +     P LN  +   WA DF          +  +    P  +  +  N 
Sbjct: 69  IQHPASFLPPA--AHGPALNHIQNPGWASDFQNLQISSPSQHVSQQQFRPESSASSWQND 126

Query: 118 ----INN--GLRTSQTDFP-LGF-YSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLI 169
               ++         T  P   F Y      +    +  H        +  +  H +  +
Sbjct: 127 FMREVHQKPQSEPPATHTPQTPFRYGPVMGVMGFGGMSSHHPVYQDPYQSHVQEHKEQFV 186

Query: 170 NAEFSELEASSLE------EDVHTEEENSGTSLEDEET----------------AMKGLA 207
             E +   A +        ++  ++EE +   LE ++T                    LA
Sbjct: 187 FDESAFEAAFAEARAEVELQEQKSQEEGTTEPLEVQQTIRIGSDTIRPPDEGTNESDELA 246

Query: 208 SDIIEFCDNNSANKDVKERLNSSKFMGLMGSISDGSIVLKKDN 250
               +  D  S + D  ++   S F+ LM  + D  +V+  D 
Sbjct: 247 KTAGQLLD--SVSHDQSQKFKESNFLALMRQLRDREVVIDGDE 287
>ref|ZP_03840659.1| DNA repair protein RecN [Proteus mirabilis ATCC 29906]
 gb|EEI48302.1| DNA repair protein RecN [Proteus mirabilis ATCC 29906]
          Length = 553

 Score = 34.9 bits (79), Expect = 7.3,   Method: Composition-based stats.
 Identities = 22/112 (19%), Positives = 43/112 (38%), Gaps = 13/112 (11%)

Query: 144 FPVVDHQIFKTTGLEHPINSHIDSLINAEFSELEASSLEEDVHTEEENSGTSLEDEETAM 203
             V  H+I + + +E   NS +D L        EAS    +V  E  +    LE +   +
Sbjct: 246 LNVAKHEISELSTMESQFNSLLDMLE-------EASIQISEVSDELRHYSDRLEMDPNRL 298

Query: 204 KGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGS-ISDGSIVLKKDNGTER 254
             L   I ++   +  ++   E L       L    I +   +L++++  + 
Sbjct: 299 FELEKRISKYISLSRKHRVTPEELYE-----LHQQLIEEKEALLRQNDDCDA 345
>ref|NP_859786.1| hypothetical protein HH0255 [Helicobacter hepaticus ATCC 51449]
 gb|AAP76852.1| hypothetical protein HH_0255 [Helicobacter hepaticus ATCC 51449]
          Length = 1299

 Score = 34.5 bits (78), Expect = 8.1,   Method: Composition-based stats.
 Identities = 26/119 (21%), Positives = 43/119 (36%), Gaps = 8/119 (6%)

Query: 72  RTLEDQSYKFEAPNLNDNETSWAK--DFRYNFPKNVEPPIENQIANLNINNGLRTSQTDF 129
            T +  +Y+ E  N  D+E    +   F       ++   EN  +       L+   T +
Sbjct: 465 TTRDYSTYRIEI-NKKDDEPYMVQLSPFVRVDMNKIKASFENNESYKKAQKELQQLHTQY 523

Query: 130 PLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEF---SELEASSLEEDV 185
              FYS  +    +    D    KT+ L H   S+    IN  F    E+E    +E+ 
Sbjct: 524 DTSFYSNYSPGNMTL--YDLMNKKTSQLNHIKKSYGIDYINQFFKTPKEIEDIIKKEEQ 580
>ref|XP_503644.1| YlPEX20 [Yarrowia lipolytica]
 emb|CAG79226.1| YlPEX20 [Yarrowia lipolytica CLIB122]
          Length = 417

 Score = 34.5 bits (78), Expect = 8.4,   Method: Composition-based stats.
 Identities = 27/131 (20%), Positives = 38/131 (29%), Gaps = 8/131 (6%)

Query: 121 GLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTT--GLEHPINSHIDSLINAEFSELEA 178
                QT       SQ     A+F  V+  I KT+  G E   +    +    +   L  
Sbjct: 175 QTEHQQTHKTETKSSQDAAFEAAFGAVEESITKTSDKGKEVEKDPMEQTYRYDQADALNR 234

Query: 179 SSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFCDNNSANKDVKERLNSSKFMGLMGS 238
              E               DE      +A  I    +        K +   S FM LM  
Sbjct: 235 Q-AEHISDNISREEVDIKTDENGEFASIARQIASSLEEAD-----KSKFEKSTFMNLMRR 288

Query: 239 ISDGSIVLKKD 249
           I +  + L  D
Sbjct: 289 IGNHEVTLDGD 299
>ref|XP_001652339.1| embryonic polarity dorsal [Aedes aegypti]
 gb|EAT41423.1| embryonic polarity dorsal [Aedes aegypti]
          Length = 515

 Score = 34.5 bits (78), Expect = 8.7,   Method: Composition-based stats.
 Identities = 32/177 (18%), Positives = 56/177 (31%), Gaps = 38/177 (21%)

Query: 54  LQHSEDLNQSRSYVNIRPRTLE-DQSYKFEAPNLNDNETSWAKDFRYNFPKNVEPPIENQ 112
           +QH+  +      + I P      Q Y  +    N  E+          P     P E  
Sbjct: 329 MQHAASMEPKERMIKIEPMDSHFQQQYTAQNAYRNPTESPDGN----MTPMTAASPYEGA 384

Query: 113 IANLN------INNGLRTSQTDFPLGF----YSQK------------NFNIASFPVVDHQ 150
           I          +N+  +++  +F +G     Y Q             N   AS PV   Q
Sbjct: 385 IGYQAGNEYYVVNHRYQSNYNNFSVGGPSNGYQQYHYQQTGVHYGMNNLAGASLPVNGQQ 444

Query: 151 IFKTTGLEHPINSHIDSLINAEFSE-----------LEASSLEEDVHTEEENSGTSL 196
            F     +  +  ++D L++ E ++           +      + +  EEEN   S 
Sbjct: 445 GFNLQQQQSQMPDNLDCLLDMELAKGFPMNSSEIKTMIGQIEPQRMQQEEENLSNSF 501
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  May 23, 2008  5:56 PM
  Number of letters in database: 883,778,997
  Number of sequences in database:  2,617,685
  
  Database: /host/Blast/data/nr_perl/nr.01
    Posted date:  May 23, 2008  5:54 PM
  Number of letters in database: 976,759,346
  Number of sequences in database:  2,761,413
  
  Database: /host/Blast/data/nr_perl/nr.02
    Posted date:  May 23, 2008  5:48 PM
  Number of letters in database: 374,670,760
  Number of sequences in database:  1,165,270
  
  Database: /host/Blast/data/nr_perl/nr.03
    Posted date:  Apr 28, 2009  5:40 PM
  Number of letters in database: 114,943,120
  Number of sequences in database:  354,819
  
Lambda     K      H
   0.309    0.146    0.383 

Lambda     K      H
   0.267   0.0445    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,778,864,638
Number of Sequences: 6899187
Number of extensions: 159559585
Number of successful extensions: 467618
Number of sequences better than 10.0: 300
Number of HSP's better than 10.0 without gapping: 111
Number of HSP's successfully gapped in prelim test: 455
Number of HSP's that attempted gapping in prelim test: 463620
Number of HSP's gapped (non-prelim): 3581
length of query: 283
length of database: 2,350,152,223
effective HSP length: 132
effective length of query: 151
effective length of database: 1,439,459,539
effective search space: 217358390389
effective search space used: 217358390389
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.5 bits)
S2: 78 (34.5 bits)