BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= YGR239C__[Saccharomyces_cerevisiae]
         (288 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           6,899,187 sequences; 2,350,152,223 total letters

Searching..................................................done


Results from round 1


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|AAS56603.1|  YGR239C [Saccharomyces cerevisiae]                505   e-141
ref|NP_011755.1|  Peroxin required for targeting of peroxiso...   504   e-141
emb|CAA97269.1|  unnamed protein product [Saccharomyces cere...   226   1e-57
ref|XP_001647071.1|  hypothetical protein Kpol_1050p71 [Vand...    88   8e-16
ref|XP_454648.1|  unnamed protein product [Kluyveromyces lac...    77   1e-12
ref|NP_983945.1|  ADL151Wp [Ashbya gossypii ATCC 10895] >gi|...    76   3e-12
ref|XP_446317.1|  unnamed protein product [Candida glabrata]...    66   3e-09
ref|XP_447344.1|  hypothetical protein CAGL0I02134g [Candida...    49   5e-04
ref|XP_001564347.1|  ABC transporter-like protein [Leishmani...    37   1.3  
ref|YP_001194277.1|  hypothetical protein Fjoh_1926 [Flavoba...    35   6.5  
ref|NP_012030.1|  Peroxin required for targeting of peroxiso...    35   6.7  
gb|EDN62399.1|  peroxin [Saccharomyces cerevisiae YJM789]          35   7.0  
>gb|AAS56603.1| YGR239C [Saccharomyces cerevisiae]
          Length = 288

 Score =  505 bits (1301), Expect = e-141,   Method: Composition-based stats.
 Identities = 287/288 (99%), Positives = 287/288 (99%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180
           LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER
Sbjct: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180

Query: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240
           YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSE KFIQLM
Sbjct: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSEWKFIQLM 240

Query: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
           RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD
Sbjct: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
>ref|NP_011755.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex18p; Pex21p [Saccharomyces cerevisiae]
 sp|P50091|PEX21_YEAST Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAA61191.1| ORF 288 [Saccharomyces cerevisiae]
 emb|CAA97267.1| unnamed protein product [Saccharomyces cerevisiae]
 gb|EDN61824.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 288

 Score =  504 bits (1299), Expect = e-141,   Method: Composition-based stats.
 Identities = 288/288 (100%), Positives = 288/288 (100%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180
           LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER
Sbjct: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180

Query: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240
           YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM
Sbjct: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240

Query: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
           RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD
Sbjct: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
>emb|CAA97269.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 123

 Score =  226 bits (577), Expect = 1e-57,   Method: Composition-based stats.
 Identities = 123/123 (100%), Positives = 123/123 (100%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNS 123
           LNS
Sbjct: 121 LNS 123
>ref|XP_001647071.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO19213.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 283

 Score = 87.8 bits (216), Expect = 8e-16,   Method: Composition-based stats.
 Identities = 82/298 (27%), Positives = 144/298 (48%), Gaps = 34/298 (11%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           M SVC ++P+++  ++G +      +   R +    E      +  + VE  FL N  + 
Sbjct: 1   MSSVCQSNPLQQFARKGEQATGRWRLNEGRVQNRGGE------SIQNSVENEFLRNDESL 54

Query: 61  DGIDNVRYQNQP------SPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSN 114
               ++R  NQP      +P      +  V+   W+ +FS + I D LEFS  YK+LYSN
Sbjct: 55  ----SIRGGNQPHNMFGRAPAQMGHSSGVVEKESWLNEFSKIAIKDPLEFSDRYKQLYSN 110

Query: 115 YESQQRLNSSR-----QHLPFKNCMIRKTSCTYPPQ--KTLRQQRQGNRDNPTDAFQFDA 167
           YE++Q + S       Q +   N  +++       Q  +++  Q   + D   +      
Sbjct: 111 YETRQTVTSMHNQGYYQPMSNMNLHVQQNWSMMEGQINQSMNSQETAHIDQAHNDLDMYL 170

Query: 168 EFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV 227
           E +  + E++ +  +  +  +     Q+Q + + IATDIV  C         ++  +S++
Sbjct: 171 EKEFNDIELELQELDEHSHSEFASGVQEQQKFKEIATDIVDTCI--------TNNKNSTM 222

Query: 228 ESKLSESKFIQLMRNISSGDVTLK--KNADGNS-ASELFSSNNGELVGNRHIFVKDEI 282
            SKL++SKF+ LM  IS G+VTLK  ++ D N+ A EL+S+  G+ VGN    ++D I
Sbjct: 223 TSKLAKSKFMGLMMQISDGEVTLKIDEHPDANTNAKELYSTKTGKTVGNDFHDIQDTI 280
>ref|XP_454648.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CN41|PEX21_KLULA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG99735.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 306

 Score = 77.4 bits (189), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 67/211 (31%), Positives = 109/211 (51%), Gaps = 22/211 (10%)

Query: 85  DSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQHLPFKNCMIRKTSCTYPP 144
           D   W+ QF++M++ D  EF ++YK++YS YE+  R  S     P    M+  +      
Sbjct: 89  DHHVWIDQFANMQVHDKTEFPTDYKQMYSQYEA--RGASYSMGAP---VMVSHSQMFPRH 143

Query: 145 QKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKW-------FDQDQS 197
           Q++L   +  ++   + + + D +F  LER++Q +  E    +D+ +        D+DQ 
Sbjct: 144 QQSLMVNQLESQAYASSSAKLDEQFAELERQVQDDEKEQQQDKDDDFHLKETSPLDEDQR 203

Query: 198 ELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGN 257
           +L+  A  I           ++ S  SS+  SK S SKF+ LMRNIS G +TLKKN D +
Sbjct: 204 QLKEAAQSIY----------TTLSDKSSTTSSKFSNSKFLGLMRNISDGVITLKKNPDED 253

Query: 258 SASELFSSNNGELVGNRHIFVKDEIHKDILD 288
             +EL+S + GE  G  +  V+D +  D LD
Sbjct: 254 KYTELYSPSTGETFGEEYFPVQDSVLGDPLD 284
>ref|NP_983945.1| ADL151Wp [Ashbya gossypii ATCC 10895]
 sp|Q75AS1|PEX21_ASHGO Peroxisomal membrane protein PEX21 (Peroxin-21)
 gb|AAS51769.1| ADL151Wp [Ashbya gossypii ATCC 10895]
          Length = 307

 Score = 75.9 bits (185), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 64/200 (32%), Positives = 93/200 (46%), Gaps = 22/200 (11%)

Query: 87  SDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQHLPFKNCMIRKT--SCTYPP 144
           +DWV QFSSM+++D L FS+EY+R Y+ YE +Q    + + L +   M   T        
Sbjct: 80  ADWVRQFSSMQVEDPLAFSAEYQRAYAGYEQRQAARPAARVLGYGGSMFMPTMPQQQLQQ 139

Query: 145 QKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQ-KERYEPITRRDEKWFDQDQSELQRIA 203
           Q   +Q  Q            + EF VLE E+   E  E ++       D +Q   Q  A
Sbjct: 140 QVAPQQTAQAALQEAELERYLEREFDVLEGELAPPEVPEALS---APLLDHEQLGFQESA 196

Query: 204 TDIVKCCTPPPSSASSSSTLSSSV-ESKLSESKFIQLMRNISSGDVTLKKNADGNSASEL 262
             I              +TLS+ + + K   SKF+ LMR +S+GDVTL K+  G +   L
Sbjct: 197 KAIY-------------ATLSAPIHKDKFGASKFMGLMRQVSTGDVTLSKSESGYTG--L 241

Query: 263 FSSNNGELVGNRHIFVKDEI 282
             +  GE VG  +  V DE+
Sbjct: 242 HMTAGGEAVGAEYRAVTDEV 261
>ref|XP_446317.1| unnamed protein product [Candida glabrata]
 sp|Q6FTX7|PEX21_CANGA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG59241.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 232

 Score = 65.9 bits (159), Expect = 3e-09,   Method: Composition-based stats.
 Identities = 60/210 (28%), Positives = 92/210 (43%), Gaps = 34/210 (16%)

Query: 49  VEKHFLHNGSNFDGI--DNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSS 106
           VE  FL N S  + +    +  Q   +  T  T   +VD++ WV  FSSMK+ D LEF+ 
Sbjct: 49  VEATFLQNNSGMNAVHPTTLNVQTHNAHATGRTIAKSVDTNSWVNDFSSMKVQDPLEFAD 108

Query: 107 EYKRLYSNYESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFD 166
            YK LY  YE +Q  N+     P +N  I   +    PQ TL         +       D
Sbjct: 109 SYKNLYKQYEQKQFHNTVPAQCP-RNLYIPTVT---RPQTTLNIVDHHEDKSSAVDQLID 164

Query: 167 AEFQVLEREIQKERYEPITRRDEKWFDQDQS-ELQRIATDIVKCCTPPPSSASSSSTLSS 225
            EF+ +E E++               DQ+Q  E Q  A   ++ C              S
Sbjct: 165 DEFEKIEEEVK---------------DQNQKLEFQESAQSFIEICE------------RS 197

Query: 226 SVESKLSESKFIQLMRNISSGDVTLKKNAD 255
            ++ KL  SKF Q+M+ +S G+ T++++ +
Sbjct: 198 QMKEKLQRSKFFQVMQKVSDGEATIRQDGE 227
>ref|XP_447344.1| hypothetical protein CAGL0I02134g [Candida glabrata CBS138]
 emb|CAG60281.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 259

 Score = 48.9 bits (115), Expect = 5e-04,   Method: Composition-based stats.
 Identities = 67/292 (22%), Positives = 130/292 (44%), Gaps = 48/292 (16%)

Query: 3   SVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDG 62
           S CH++P++ +  + H         + RT+ AH     H    +  +E+ FLH  S +D 
Sbjct: 2   SACHSNPLQSLSSKTHTF-------TTRTR-AHNYEHNHDRISNP-LEREFLHQDSIYDT 52

Query: 63  IDNVRYQ-NQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRL 121
              V     QP  +    P +T  ++ W+ +F  + I D       + R       +   
Sbjct: 53  ATPVAIPIQQPRQV----PRDTKKNTQWLSEFQQLSISDVPSVKPSFTRPAPTSTFRIIN 108

Query: 122 NSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKE-- 179
           N  R+   FK          +     + +     R++      ++ EF+ LE+E++ +  
Sbjct: 109 NDVRKDSLFK----------FQEHPIIDRLSPEARED-----YYNNEFEQLEKELENDDD 153

Query: 180 -----RYEPITRRDEKWFDQ----DQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESK 230
                +++ +  +  +++++    DQS  Q+ A DI+       + ++S+   SS +  K
Sbjct: 154 VQDGDQFQDLENKTYEYYNEFAFNDQSHFQQSARDILN------NMSNSTHEYSSELNEK 207

Query: 231 LSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEI 282
           LS S F+ L+++I  GDV+L+ N  G    EL +SN+   +GN++  + D I
Sbjct: 208 LSGSTFVNLLKSIDIGDVSLQDNRQG--PKELKNSNDNTTLGNKYADIPDYI 257
>ref|XP_001564347.1| ABC transporter-like protein [Leishmania braziliensis
            MHOM/BR/75/M2904]
 emb|CAM38406.1| ABC transporter-like protein [Leishmania braziliensis]
          Length = 2084

 Score = 37.4 bits (85), Expect = 1.3,   Method: Composition-based stats.
 Identities = 20/66 (30%), Positives = 31/66 (46%), Gaps = 5/66 (7%)

Query: 55   HNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMK-----IDDSLEFSSEYK 109
            H  +  +  D+  YQN P+P  F  PN   ++S      SS++     ID  L  +S+  
Sbjct: 1293 HQAAGAEAADSSTYQNNPTPPLFTKPNQVTEASRKSSAMSSVQTPAAGIDSLLPRASDLH 1352

Query: 110  RLYSNY 115
            RL+  Y
Sbjct: 1353 RLFIEY 1358
>ref|YP_001194277.1| hypothetical protein Fjoh_1926 [Flavobacterium johnsoniae UW101]
 gb|ABQ04958.1| hypothetical protein Fjoh_1926 [Flavobacterium johnsoniae UW101]
          Length = 657

 Score = 35.0 bits (79), Expect = 6.5,   Method: Composition-based stats.
 Identities = 20/66 (30%), Positives = 34/66 (51%)

Query: 59  NFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQ 118
           N+   D +RY + P+PLT +  N T++    V  F ++    +L FS  Y+ L S  +  
Sbjct: 122 NYIQADEIRYYSVPTPLTELFFNTTINKGQNVDSFITLNTSKNLNFSIAYRGLRSEGDYI 181

Query: 119 QRLNSS 124
            +L S+
Sbjct: 182 NQLVSA 187
>ref|NP_012030.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex21p; Pex18p [Saccharomyces cerevisiae]
 sp|P38855|PEX18_YEAST Peroxisomal membrane protein PEX18 (Peroxin-18)
 gb|AAB68992.1| Yhr160cp [Saccharomyces cerevisiae]
          Length = 283

 Score = 35.0 bits (79), Expect = 6.7,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 46/88 (52%), Gaps = 8/88 (9%)

Query: 166 DAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSS 225
           +AEF  LE    +E             + +++ ++ +A+DI++ C         +++ + 
Sbjct: 170 NAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFC--------DNNSANK 221

Query: 226 SVESKLSESKFIQLMRNISSGDVTLKKN 253
            V+ +L+ SKF+ LM +IS G + LKK+
Sbjct: 222 DVKERLNSSKFMGLMGSISDGSIVLKKD 249
>gb|EDN62399.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 283

 Score = 35.0 bits (79), Expect = 7.0,   Method: Composition-based stats.
 Identities = 23/88 (26%), Positives = 46/88 (52%), Gaps = 8/88 (9%)

Query: 166 DAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSS 225
           +AEF  LE    +E             + +++ ++ +A+DI++ C         +++ + 
Sbjct: 170 NAEFSELEASSLEEDVHTEEENSGTSLEDEETAMKGLASDIIEFC--------DNNSANK 221

Query: 226 SVESKLSESKFIQLMRNISSGDVTLKKN 253
            V+ +L+ SKF+ LM +IS G + LKK+
Sbjct: 222 DVKERLNSSKFMGLMGSISDGSIVLKKD 249
Searching..................................................done Results from round 2


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

gb|AAS56603.1|  YGR239C [Saccharomyces cerevisiae]                425   e-117
ref|NP_011755.1|  Peroxin required for targeting of peroxiso...   422   e-116
ref|XP_001647071.1|  hypothetical protein Kpol_1050p71 [Vand...   309   1e-82
ref|XP_447344.1|  hypothetical protein CAGL0I02134g [Candida...   258   3e-67
ref|NP_983945.1|  ADL151Wp [Ashbya gossypii ATCC 10895] >gi|...   215   2e-54
ref|XP_446317.1|  unnamed protein product [Candida glabrata]...   209   2e-52
ref|XP_454648.1|  unnamed protein product [Kluyveromyces lac...   206   1e-51
emb|CAA97269.1|  unnamed protein product [Saccharomyces cere...   196   1e-48
Sequences not found previously or not previously below threshold:

ref|NP_012030.1|  Peroxin required for targeting of peroxiso...    68   6e-10
gb|EDN62399.1|  peroxin [Saccharomyces cerevisiae YJM789]          68   7e-10
gb|AAX14715.1|  Pex20p [Pichia angusta]                            54   2e-05
ref|XP_001524003.1|  hypothetical protein LELG_04816 [Lodder...    46   0.002
gb|AAX11696.1|  peroxin 20 [Pichia pastoris]                       45   0.007
ref|XP_459664.1|  hypothetical protein DEHA0E08701g [Debaryo...    45   0.009
ref|ZP_02134834.1|  hypothetical protein VFMJ11_16570 [Vibri...    43   0.017
gb|EEH49084.1|  peroxin 20 [Paracoccidioides brasiliensis Pb18]    43   0.023
ref|ZP_02183337.1|  hypothetical protein FBALC1_11662 [Flavo...    43   0.025
ref|YP_692667.1|  chromosome segregation SMC protein [Alcani...    41   0.092
gb|ABH11425.1|  peroxin 20 [Penicillium chrysogenum]               41   0.12 
ref|XP_001830051.1|  hypothetical protein CC1G_04484 [Coprin...    41   0.13 
gb|AAC69180.1|  peroxisomal targeting sequence 1 receptor [N...    40   0.15 
ref|XP_001385675.2|  hypothetical protein PICST_84845 [Pichi...    40   0.24 
gb|AAC69177.1|  peroxisomal targeting signal 1 receptor; PTS...    39   0.35 
ref|XP_001025150.1|  DNA repair helicase (rad3) [Tetrahymena...    39   0.49 
emb|CAL54022.1|  putative peroxisomal targeting signal 1 rec...    38   0.95 
ref|XP_001877777.1|  predicted protein [Laccaria bicolor S23...    37   2.1  
ref|ZP_01062455.1|  glutamyl-tRNA synthetase [Flavobacterium...    36   2.8  
gb|EEH37053.1|  peroxin 20 [Paracoccidioides brasiliensis Pb01]    36   2.9  
ref|XP_001443227.1|  hypothetical protein GSPATT00011430001 ...    36   2.9  
ref|XP_001482477.1|  hypothetical protein PGUG_05497 [Pichia...    35   5.3  
ref|XP_001556458.1|  hypothetical protein BC1G_05227 [Botryo...    35   6.0  
ref|XP_001033074.1|  hypothetical protein TTHERM_00471900 [T...    35   6.1  
ref|ZP_01624021.1|  hypothetical protein L8106_27826 [Lyngby...    35   6.6  
ref|XP_001015711.1|  hypothetical protein TTHERM_00077700 [T...    35   7.2  
gb|AAC23564.2|  peroxin [Yarrowia lipolytica]                      35   8.5  
ref|XP_001215991.1|  conserved hypothetical protein [Aspergi...    35   9.8  
ref|XP_503644.1|  YlPEX20 [Yarrowia lipolytica] >gi|49649513...    35   10.0 
>gb|AAS56603.1| YGR239C [Saccharomyces cerevisiae]
          Length = 288

 Score =  425 bits (1092), Expect = e-117,   Method: Composition-based stats.
 Identities = 287/288 (99%), Positives = 287/288 (99%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180
           LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER
Sbjct: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180

Query: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240
           YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSE KFIQLM
Sbjct: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSEWKFIQLM 240

Query: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
           RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD
Sbjct: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
>ref|NP_011755.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex18p; Pex21p [Saccharomyces cerevisiae]
 sp|P50091|PEX21_YEAST Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAA61191.1| ORF 288 [Saccharomyces cerevisiae]
 emb|CAA97267.1| unnamed protein product [Saccharomyces cerevisiae]
 gb|EDN61824.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 288

 Score =  422 bits (1086), Expect = e-116,   Method: Composition-based stats.
 Identities = 288/288 (100%), Positives = 288/288 (100%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180
           LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER
Sbjct: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180

Query: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240
           YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM
Sbjct: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240

Query: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
           RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD
Sbjct: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
>ref|XP_001647071.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO19213.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 283

 Score =  309 bits (793), Expect = 1e-82,   Method: Composition-based stats.
 Identities = 82/298 (27%), Positives = 144/298 (48%), Gaps = 34/298 (11%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           M SVC ++P+++  ++G +      +   R +    E      +  + VE  FL N  + 
Sbjct: 1   MSSVCQSNPLQQFARKGEQATGRWRLNEGRVQNRGGE------SIQNSVENEFLRNDESL 54

Query: 61  DGIDNVRYQNQP------SPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSN 114
               ++R  NQP      +P      +  V+   W+ +FS + I D LEFS  YK+LYSN
Sbjct: 55  ----SIRGGNQPHNMFGRAPAQMGHSSGVVEKESWLNEFSKIAIKDPLEFSDRYKQLYSN 110

Query: 115 YESQQRLNSSR-----QHLPFKNCMIRKTSCTYPPQ--KTLRQQRQGNRDNPTDAFQFDA 167
           YE++Q + S       Q +   N  +++       Q  +++  Q   + D   +      
Sbjct: 111 YETRQTVTSMHNQGYYQPMSNMNLHVQQNWSMMEGQINQSMNSQETAHIDQAHNDLDMYL 170

Query: 168 EFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV 227
           E +  + E++ +  +  +  +     Q+Q + + IATDIV  C         ++  +S++
Sbjct: 171 EKEFNDIELELQELDEHSHSEFASGVQEQQKFKEIATDIVDTCI--------TNNKNSTM 222

Query: 228 ESKLSESKFIQLMRNISSGDVTLK--KNADGNS-ASELFSSNNGELVGNRHIFVKDEI 282
            SKL++SKF+ LM  IS G+VTLK  ++ D N+ A EL+S+  G+ VGN    ++D I
Sbjct: 223 TSKLAKSKFMGLMMQISDGEVTLKIDEHPDANTNAKELYSTKTGKTVGNDFHDIQDTI 280
>ref|XP_447344.1| hypothetical protein CAGL0I02134g [Candida glabrata CBS138]
 emb|CAG60281.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 259

 Score =  258 bits (660), Expect = 3e-67,   Method: Composition-based stats.
 Identities = 65/293 (22%), Positives = 127/293 (43%), Gaps = 48/293 (16%)

Query: 3   SVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDG 62
           S CH++P++ +  + H     +         AH     H    +  +E+ FLH  S +D 
Sbjct: 2   SACHSNPLQSLSSKTHTFTTRTR--------AHNYEHNHDRISNP-LEREFLHQDSIYDT 52

Query: 63  IDNVRYQ-NQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRL 121
              V     QP  +    P +T  ++ W+ +F  + I D       + R       +   
Sbjct: 53  ATPVAIPIQQPRQV----PRDTKKNTQWLSEFQQLSISDVPSVKPSFTRPAPTSTFRIIN 108

Query: 122 NSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQK--- 178
           N  R+   FK          +     + +     R++      ++ EF+ LE+E++    
Sbjct: 109 NDVRKDSLFK----------FQEHPIIDRLSPEARED-----YYNNEFEQLEKELENDDD 153

Query: 179 ----ERYEPITRRDEKWFDQ----DQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESK 230
               ++++ +  +  +++++    DQS  Q+ A DI+       + ++S+   SS +  K
Sbjct: 154 VQDGDQFQDLENKTYEYYNEFAFNDQSHFQQSARDILN------NMSNSTHEYSSELNEK 207

Query: 231 LSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIH 283
           LS S F+ L+++I  GDV+L+ N  G    EL +SN+   +GN++  + D I 
Sbjct: 208 LSGSTFVNLLKSIDIGDVSLQDNRQG--PKELKNSNDNTTLGNKYADIPDYIL 258
>ref|NP_983945.1| ADL151Wp [Ashbya gossypii ATCC 10895]
 sp|Q75AS1|PEX21_ASHGO Peroxisomal membrane protein PEX21 (Peroxin-21)
 gb|AAS51769.1| ADL151Wp [Ashbya gossypii ATCC 10895]
          Length = 307

 Score =  215 bits (549), Expect = 2e-54,   Method: Composition-based stats.
 Identities = 75/290 (25%), Positives = 114/290 (39%), Gaps = 32/290 (11%)

Query: 3   SVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDG 62
           S+C  S ++K+I +        +    R                S  E+  L   +    
Sbjct: 2   SLCQGSAMQKLIAKTE----GPMAGVGRV---GGFNRPSGGLGQSSAEQQ-LQARAGERA 53

Query: 63  IDN--VRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
             N  +        L           +DWV QFSSM+++D L FS+EY+R Y+ YE +Q 
Sbjct: 54  SQNRFMAVLEPQRELGGRMARGDGLQADWVRQFSSMQVEDPLAFSAEYQRAYAGYEQRQA 113

Query: 121 LNSSRQHLPFKNCMIRKT--SCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQ- 177
              + + L +   M   T        Q   +Q  Q            + EF VLE E+  
Sbjct: 114 ARPAARVLGYGGSMFMPTMPQQQLQQQVAPQQTAQAALQEAELERYLEREFDVLEGELAP 173

Query: 178 KERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV-ESKLSESKF 236
            E  E ++       D +Q   Q  A  I              +TLS+ + + K   SKF
Sbjct: 174 PEVPEALS---APLLDHEQLGFQESAKAIY-------------ATLSAPIHKDKFGASKF 217

Query: 237 IQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDI 286
           + LMR +S+GDVTL K+  G +   L  +  GE VG  +  V DE+ +  
Sbjct: 218 MGLMRQVSTGDVTLSKSESGYTG--LHMTAGGEAVGAEYRAVTDEVVQVP 265
>ref|XP_446317.1| unnamed protein product [Candida glabrata]
 sp|Q6FTX7|PEX21_CANGA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG59241.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 232

 Score =  209 bits (532), Expect = 2e-52,   Method: Composition-based stats.
 Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 34/260 (13%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           M   C   PI++++ +     +       + +      T         VE  FL N S  
Sbjct: 1   MSQYCAADPIQRMVSRKGPYLDTQQQRVYKHRSGTGHGTNIRQPTTGIVEATFLQNNSGM 60

Query: 61  DGI--DNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQ 118
           + +    +  Q   +  T  T   +VD++ WV  FSSMK+ D LEF+  YK LY  YE +
Sbjct: 61  NAVHPTTLNVQTHNAHATGRTIAKSVDTNSWVNDFSSMKVQDPLEFADSYKNLYKQYEQK 120

Query: 119 QRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQK 178
           Q  N+     P +N  I   +    PQ TL         +       D EF+ +E E++ 
Sbjct: 121 QFHNTVPAQCP-RNLYIPTVT---RPQTTLNIVDHHEDKSSAVDQLIDDEFEKIEEEVK- 175

Query: 179 ERYEPITRRDEKWFDQDQS-ELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFI 237
                         DQ+Q  E Q  A   ++ C              S ++ KL  SKF 
Sbjct: 176 --------------DQNQKLEFQESAQSFIEICE------------RSQMKEKLQRSKFF 209

Query: 238 QLMRNISSGDVTLKKNADGN 257
           Q+M+ +S G+ T++++ +  
Sbjct: 210 QVMQKVSDGEATIRQDGENY 229
>ref|XP_454648.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CN41|PEX21_KLULA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG99735.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 306

 Score =  206 bits (525), Expect = 1e-51,   Method: Composition-based stats.
 Identities = 77/298 (25%), Positives = 136/298 (45%), Gaps = 27/298 (9%)

Query: 3   SVCHTSPIEKIIQQ-GHRI----QNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNG 57
           SVC T+P+  ++ + G+R     Q+   I  ++++  +  L   ++         F H  
Sbjct: 2   SVCQTNPLNTLVSKSGYRFGQPNQHQQSIAQQQSRPRNNALDHQFSQFTGASGPSFAHPQ 61

Query: 58  SNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYES 117
                +  V      +  T        D   W+ QF++M++ D  EF ++YK++YS YE+
Sbjct: 62  HQQLPMAAVVANATTAASTTSNTATASDHHVWIDQFANMQVHDKTEFPTDYKQMYSQYEA 121

Query: 118 QQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQ 177
           +    S     P    M+  +      Q++L   +  ++   + + + D +F  LER++Q
Sbjct: 122 R--GASYSMGAP---VMVSHSQMFPRHQQSLMVNQLESQAYASSSAKLDEQFAELERQVQ 176

Query: 178 KERYEPITRRDEKW-------FDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESK 230
            +  E    +D+ +        D+DQ +L+  A  I    +   S+ SS  + S      
Sbjct: 177 DDEKEQQQDKDDDFHLKETSPLDEDQRQLKEAAQSIYTTLSDKSSTTSSKFSNS------ 230

Query: 231 LSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
               KF+ LMRNIS G +TLKKN D +  +EL+S + GE  G  +  V+D +  D LD
Sbjct: 231 ----KFLGLMRNISDGVITLKKNPDEDKYTELYSPSTGETFGEEYFPVQDSVLGDPLD 284
>emb|CAA97269.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 123

 Score =  196 bits (499), Expect = 1e-48,   Method: Composition-based stats.
 Identities = 123/123 (100%), Positives = 123/123 (100%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNS 123
           LNS
Sbjct: 121 LNS 123
>ref|NP_012030.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex21p; Pex18p [Saccharomyces cerevisiae]
 sp|P38855|PEX18_YEAST Peroxisomal membrane protein PEX18 (Peroxin-18)
 gb|AAB68992.1| Yhr160cp [Saccharomyces cerevisiae]
          Length = 283

 Score = 68.4 bits (166), Expect = 6e-10,   Method: Composition-based stats.
 Identities = 51/295 (17%), Positives = 103/295 (34%), Gaps = 29/295 (9%)

Query: 2   PSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFD 61
            + C T+ + K I    +        +       + L          +E  FL +  + +
Sbjct: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRL-NSPPILKDKIELKFLQHSEDLN 61

Query: 62  GIDN-VRYQNQPSPLT-----FITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNY 115
              + V    +P  L      F  PN   + + W   F   + +         +   +N 
Sbjct: 62  QSRSYVNI--RPRTLEDQSYKFEAPNLNDNETSWAKDF---RYNFPKNVEPPIENQIANL 116

Query: 116 ESQQRLNSSRQHLP---FKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVL 172
                L +S+   P   +        S      +  +     +  N       +AEF  L
Sbjct: 117 NINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSEL 176

Query: 173 E-REIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKL 231
           E   ++++ +           D++ + ++ +A+DI++ C          ++ +  V+ +L
Sbjct: 177 EASSLEEDVHTEEENSGTSLEDEE-TAMKGLASDIIEFCDN--------NSANKDVKERL 227

Query: 232 SESKFIQLMRNISSGDVTLKKNADGNSASELFS----SNNGELVGNRHIFVKDEI 282
           + SKF+ LM +IS G + LKK+       +        N+G   G     V+D I
Sbjct: 228 NSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
>gb|EDN62399.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 283

 Score = 68.0 bits (165), Expect = 7e-10,   Method: Composition-based stats.
 Identities = 51/295 (17%), Positives = 103/295 (34%), Gaps = 29/295 (9%)

Query: 2   PSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFD 61
            + C T+ + K I    +        +       + L          +E  FL +  + +
Sbjct: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSFSKIGSRL-NSPPILKDKIELKFLQHSEDLN 61

Query: 62  GIDN-VRYQNQPSPLT-----FITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNY 115
              + V    +P  L      F  PN   + + W   F   + +         +   +N 
Sbjct: 62  QSQSYVNI--RPRTLEDQSYKFEAPNLNDNETSWAKDF---RYNFPKNVEPPIENQIANL 116

Query: 116 ESQQRLNSSRQHLP---FKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVL 172
                L +S+   P   +        S      +  +     +  N       +AEF  L
Sbjct: 117 NINNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSEL 176

Query: 173 E-REIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKL 231
           E   ++++ +           D++ + ++ +A+DI++ C          ++ +  V+ +L
Sbjct: 177 EASSLEEDVHTEEENSGTSLEDEE-TAMKGLASDIIEFCDN--------NSANKDVKERL 227

Query: 232 SESKFIQLMRNISSGDVTLKKNADGNSASELFS----SNNGELVGNRHIFVKDEI 282
           + SKF+ LM +IS G + LKK+       +        N+G   G     V+D I
Sbjct: 228 NSSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
>gb|AAX14715.1| Pex20p [Pichia angusta]
          Length = 309

 Score = 53.8 bits (128), Expect = 2e-05,   Method: Composition-based stats.
 Identities = 43/227 (18%), Positives = 80/227 (35%), Gaps = 30/227 (13%)

Query: 42  YATEDSHVEKH---FLHNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKI 98
           + T D  +E     F     ++       +Q  P+P         ++ + WV  F  M +
Sbjct: 48  HQTVDQQLESEYRAFQQQTPSYQA----SFQTLPTPAF-------IEENQWVNHFRDMNL 96

Query: 99  DDSLEFSSEYKRLYSNYESQQRLNSSRQHLPFK---------NCMIRKTSCTYPPQKTLR 149
            D+       +    +  SQQ  +   Q    +         +   R  +       +L 
Sbjct: 97  HDTPGAQHLQQVPTQSLHSQQWRHEFAQSQQQQQQQETADTYSSAFRMRTGAPQLATSLA 156

Query: 150 QQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQS-ELQRIATDIVK 208
            Q         D    DA F  LE E+  E+ E       K  ++D   +  ++A  +  
Sbjct: 157 PQVVSAATVREDFSAVDAAFDELEHEMHDEKMEEQQAEQVKPQNEDDKIKFAQLARSVFL 216

Query: 209 CCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNAD 255
             + PP      + +S     K  +S F++LM  IS+ +V + ++ D
Sbjct: 217 TMSQPP------ADISKQTSDKFQKSNFLKLMNRISTREVEMNEDRD 257
>ref|XP_001524003.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
 gb|EDK46635.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 489

 Score = 46.5 bits (109), Expect = 0.002,   Method: Composition-based stats.
 Identities = 21/117 (17%), Positives = 51/117 (43%), Gaps = 14/117 (11%)

Query: 137 KTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQ 196
             + +    +    Q+    ++     Q ++EF  +E+E+Q      +    ++    D+
Sbjct: 204 TNTASPQFHQQSEHQQVHKLEDQET--QLESEFDAVEKELQ-----EVESSMQETNLHDK 256

Query: 197 SELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKN 253
               + A  +        SS +S +T  + ++SK   S+F++LM  +++  V L+ +
Sbjct: 257 DLFAQTARKV-------ESSMNSLNTTDAEMKSKFENSEFLKLMNRVANKQVELQDD 306
>gb|AAX11696.1| peroxin 20 [Pichia pastoris]
          Length = 323

 Score = 44.9 bits (105), Expect = 0.007,   Method: Composition-based stats.
 Identities = 35/256 (13%), Positives = 96/256 (37%), Gaps = 37/256 (14%)

Query: 9   PIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDNVRY 68
           P  ++I++  R  ++      R +  + +    +   +  +E   +    +F  +D    
Sbjct: 74  PSAELIRKNTRFGDNWS-SEFRQQQENNKWVEDFGAMN--LENVRMEQEQSFQTMD---- 126

Query: 69  QNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQHL 128
           Q         T N       W  +F+ M+ +  +E+    +        +     +R+  
Sbjct: 127 QRTNMQAMSSTAN------AWTQEFNHMQHNKEMEYEMRIRE-----TQRLVNAPAREVF 175

Query: 129 PFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREI---------QKE 179
              N   ++ +  +       Q + G++ +      F+ +F+ ++ EI         ++ 
Sbjct: 176 SMVNQHSQQHNQNFTH----NQTQLGDQSHYMTNLAFERQFETVQNEIDGMDMFSDSEQV 231

Query: 180 RYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQL 239
             +   ++++   D + ++   IA  +           +  +T+S + E K  +S F++L
Sbjct: 232 EQQKELQKEDDNNDIETTKFAEIAQQVFN------QMNNVDTTVSQNTEHKFKQSNFLRL 285

Query: 240 MRNISSGDVTLKKNAD 255
           M  ++  +V +  + D
Sbjct: 286 MDKVAQREVEINGSGD 301
>ref|XP_459664.1| hypothetical protein DEHA0E08701g [Debaryomyces hansenii CBS767]
 emb|CAG87898.1| unnamed protein product [Debaryomyces hansenii CBS767]
          Length = 413

 Score = 44.5 bits (104), Expect = 0.009,   Method: Composition-based stats.
 Identities = 38/190 (20%), Positives = 70/190 (36%), Gaps = 21/190 (11%)

Query: 66  VRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSR 125
           +    QP         N    +DW  QF +       + +  Y+     ++      + R
Sbjct: 103 MSINQQPRH-QAGNIQNGKVQNDWHQQFMNQTSQHPAQNNVAYQ---PQHQMGGFRMNMR 158

Query: 126 QHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPIT 185
            +L      + +T    P  K + +  + N+        FDA+F  +ERE+Q E      
Sbjct: 159 TNLSNS---LYQTRGNVPEHKEMHKMEEENQ-------IFDAQFDQMERELQHENMSNEQ 208

Query: 186 RRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISS 245
             + +  D ++ E ++ A       T      S     ++   +K   S F++LM +IS+
Sbjct: 209 TAEVQ-MDGNEFEKEQFAK------TAKQVQESMLKGENAETNAKFQNSNFLKLMSSISN 261

Query: 246 GDVTLKKNAD 255
             V L    D
Sbjct: 262 RSVELSAEGD 271
>ref|ZP_02134834.1| hypothetical protein VFMJ11_16570 [Vibrio fischeri MJ11]
 gb|EDQ23433.1| hypothetical protein VFMJ11_16570 [Vibrio fischeri MJ11]
          Length = 1695

 Score = 43.4 bits (101), Expect = 0.017,   Method: Composition-based stats.
 Identities = 34/238 (14%), Positives = 76/238 (31%), Gaps = 45/238 (18%)

Query: 71  QPSPLTFITPNNTVDSSDWVP----QFSSMKIDDSL------EFSSEYKRLYSNYESQQR 120
            P    F       +++ W+     + SS+     L      EF + + +L++  E    
Sbjct: 691 HPKH-EFSYNQQVKENAFWIKSVPQEKSSLTFVRKLVKEKIDEFQTAFSQLFTGSEKSII 749

Query: 121 LNSSRQHL---PFKNC--MIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLERE 175
           +N+  Q      F      IR          ++                +D +       
Sbjct: 750 VNAVNQDCAEELFMGLVDYIRSNQ---EQSASIHIN------------LYDDKLAFN--- 791

Query: 176 IQKERYEPITRRDE--KWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSE 233
              +R+      DE  +W + ++ +++ +A  I+       + +  ++T +       + 
Sbjct: 792 -AFDRFAETANYDELKEWLELNKGKVREVADTIIDILRTRLTYSKFTNTQAEKDGQSYAH 850

Query: 234 SKFIQLMRNISSGDVTLKKNADG--------NSASELFSSNNGELVGNRHIFVKDEIH 283
             F +    +   D+ ++  A G          ASE  S +     G +    +D  H
Sbjct: 851 LSFFRNNDKVDCSDIAIEDMASGIACDGLLTGDASESKSGSYFTGFGLKGTDYQDYPH 908
>gb|EEH49084.1| peroxin 20 [Paracoccidioides brasiliensis Pb18]
          Length = 808

 Score = 43.0 bits (100), Expect = 0.023,   Method: Composition-based stats.
 Identities = 40/283 (14%), Positives = 83/283 (29%), Gaps = 39/283 (13%)

Query: 7   TSPIEKIIQQG--HRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGID 64
           ++ ++   +     R      I S+++         H    D  ++  F    +   G  
Sbjct: 9   SNALQNFQKHSSVDRTLQQDRILSRQSPAQGFRSQNHN---DGALDPEFNAFEAGLPG-P 64

Query: 65  NVRYQNQPS----PLTFITPNNTVDSSDWVPQFSSMKIDDSLE-FSSEYKRLYSNYESQQ 119
            +     P+    P       N + +  W   F +++I    +  S +  R  S+  S Q
Sbjct: 65  PLSDIQHPASFLPPAAHGPALNHIQNPGWASDFQNLQISSPSQHVSQQQFRPESSASSWQ 124

Query: 120 --RLNSSRQ------------HLPFKNCMIRKTSC---TYPPQKTLRQQRQGNRDNPTDA 162
              +    Q              PF+   +                +   Q +     + 
Sbjct: 125 NDFMREVHQKPQSEPPATHTPQTPFRYGPVMGVMGFGGMSSHHPVYQDPYQSHVQEHKEQ 184

Query: 163 FQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSST 222
           F FD           +   E   ++ ++    +  E+Q+          PP    + S  
Sbjct: 185 FVFDESAFEAAFAEARAEVELQEQKSQEEGTTEPLEVQQTIRIGSDTIRPPDEGTNESDE 244

Query: 223 LSSSV-----------ESKLSESKFIQLMRNISSGDVTLKKNA 254
           L+ +              K  ES F+ LMR +   +V +  + 
Sbjct: 245 LAKTAGQLLDSVSHDQSQKFKESNFLALMRQLRDREVVIDGDE 287
>ref|ZP_02183337.1| hypothetical protein FBALC1_11662 [Flavobacteriales bacterium
           ALC-1]
 gb|EDP70188.1| hypothetical protein FBALC1_11662 [Flavobacteriales bacterium
           ALC-1]
          Length = 190

 Score = 43.0 bits (100), Expect = 0.025,   Method: Composition-based stats.
 Identities = 25/80 (31%), Positives = 33/80 (41%), Gaps = 5/80 (6%)

Query: 193 DQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLK- 251
           D +  E   +A  I          A  SS + S+V   LS SK  +LMR I  G+ T+K 
Sbjct: 54  DPEAKEFVEMAKSITGFKVFTTGDAKISSDMKSTVAKYLSSSKLEELMR-IKDGNQTVKF 112

Query: 252 ---KNADGNSASELFSSNNG 268
              +  D N   EL     G
Sbjct: 113 YVREGKDENHVKELLMFITG 132
>ref|YP_692667.1| chromosome segregation SMC protein [Alcanivorax borkumensis SK2]
 emb|CAL16395.1| chromosome segregation SMC protein [Alcanivorax borkumensis SK2]
          Length = 1166

 Score = 41.1 bits (95), Expect = 0.092,   Method: Composition-based stats.
 Identities = 24/127 (18%), Positives = 48/127 (37%), Gaps = 12/127 (9%)

Query: 117 SQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREI 176
           ++ + +S RQ+L      ++        +K L +++    D P    Q   E + LE  +
Sbjct: 810 ARTQADSLRQNLSRLEAQVQ----MMAERKALLEEQTNEGDEPGTELQMQLE-EKLEVRL 864

Query: 177 QKERYEPITRRDEKWFDQDQS-------ELQRIATDIVKCCTPPPSSASSSSTLSSSVES 229
           + E      RR+ +  D +         + +R A ++              ST   +V  
Sbjct: 865 EAEHKLAEARRELETVDHEMRNLEGQRGQFERQAQEVRSTMEQKRMQWQDLSTRRQTVAE 924

Query: 230 KLSESKF 236
           +L+E  F
Sbjct: 925 QLAEQNF 931
>gb|ABH11425.1| peroxin 20 [Penicillium chrysogenum]
          Length = 347

 Score = 40.7 bits (94), Expect = 0.12,   Method: Composition-based stats.
 Identities = 32/212 (15%), Positives = 71/212 (33%), Gaps = 17/212 (8%)

Query: 66  VRYQNQP-SPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSS 124
           +   + P S L             W  +F + +    +  + +  +++  Y       S 
Sbjct: 106 ISGPSHPVSQLQRPHIAPAAAHGGWHAEFMAQQ-QGPMSLAQQTPQMHGAY-QPSFTPSY 163

Query: 125 RQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQ-KERYEP 183
           + +    N M +        Q+  +Q  +   +   D  + D E Q ++   Q  + +  
Sbjct: 164 QMYGSSMNQM-QPLQGMQTEQQNTQQFDESAFEAAFDQARADMELQGVDIAQQDNQEHLQ 222

Query: 184 ITRRDE--KWFDQDQSEL----------QRIATDIVKCCTPPPSSASSSSTLSSSVESKL 231
            T + +  +   Q+Q  L          Q   T          ++     ++ +    K 
Sbjct: 223 ETNKVDISEPAIQEQIRLGSDLIPQSDKQNPETQSRDADELARTAGQLLDSVRNEQSEKF 282

Query: 232 SESKFIQLMRNISSGDVTLKKNADGNSASELF 263
            +S F+ LMR I   +V ++ +    +A  L 
Sbjct: 283 QQSNFLALMRRIRDREVEVEGDDFRETAQSLH 314
>ref|XP_001830051.1| hypothetical protein CC1G_04484 [Coprinopsis cinerea okayama7#130]
 gb|EAU91716.1| hypothetical protein CC1G_04484 [Coprinopsis cinerea okayama7#130]
          Length = 726

 Score = 40.7 bits (94), Expect = 0.13,   Method: Composition-based stats.
 Identities = 35/221 (15%), Positives = 63/221 (28%), Gaps = 29/221 (13%)

Query: 46  DSHVEKHFLHNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFS 105
           D H +   L              Q Q              ++DW   F    +  +    
Sbjct: 83  DGHFDVSALRQALPMHAHGPPALQQQ------AELPGAAATADWGMDFMQQAMGGTP--- 133

Query: 106 SEYKRLYSNYESQQRLNSSRQHLPFKN-----------CMIRKTSCTYPP--QKTLRQQR 152
               + Y   ++ Q    +  H P  N              R  +    P     ++ Q 
Sbjct: 134 --MTQTYQPAQTSQAHA-AHVHAPSINIQQGPTWNPGMSAFRMNTMQMQPGFMPQMQMQP 190

Query: 153 QGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTP 212
           Q      +    +D EF  LE  +            E   +Q Q E    A         
Sbjct: 191 QATAATTSKRISWDKEFSALELNVGPTS----EHVPEVVQEQPQQEQSAPAQKQFDGDEL 246

Query: 213 PPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKN 253
             ++      +      K  +S+F++LM+ +  G+V ++ N
Sbjct: 247 ARTAGMLLDHVQHETNPKFKKSQFLELMKQLRDGEVIVEGN 287
>gb|AAC69180.1| peroxisomal targeting sequence 1 receptor [Nicotiana tabacum]
          Length = 552

 Score = 40.3 bits (93), Expect = 0.15,   Method: Composition-based stats.
 Identities = 24/112 (21%), Positives = 44/112 (39%), Gaps = 10/112 (8%)

Query: 152 RQGNRDNPTDAFQFDAEFQVLEREIQK-----ERYEPI--TRRDEKWFDQDQSELQRIAT 204
            Q  R  P  A  +  EF     E        + +E           F+ +QS+L  I  
Sbjct: 41  SQAMRPAPHVADGWINEFAQNRVEHADPNAWAQSFEQQHGANGWASEFEHEQSQLGLIGQ 100

Query: 205 ---DIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKN 253
                +        +   + TL+ + + K   SKF+Q +  +S G++T+++N
Sbjct: 101 MRGANIPNLAAMEQTRMLAHTLAQNNDPKFQNSKFLQFVSKMSRGEITIEEN 152
>ref|XP_001385675.2| hypothetical protein PICST_84845 [Pichia stipitis CBS 6054]
 gb|ABN67646.2| predicted protein [Pichia stipitis CBS 6054]
          Length = 402

 Score = 39.9 bits (92), Expect = 0.24,   Method: Composition-based stats.
 Identities = 27/143 (18%), Positives = 55/143 (38%), Gaps = 21/143 (14%)

Query: 112 YSNYESQQRLNSSRQHLPFKNCMIRKTSC--TYPPQKTLRQQRQGNRDNPTDAFQFDAEF 169
             NY S     + R +L       ++     +    + + +  Q           FD+ F
Sbjct: 150 APNYASSAFSMNMRTNLSTPLYAQQQVQTGASLTEHQEIHKMEQE-------KQLFDSHF 202

Query: 170 QVLEREI--QKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV 227
             LE+E+  Q +    +  + +K  ++   +    A  I        +S     T  ++ 
Sbjct: 203 DQLEKELNQQSQEKPEVEVQVDKVENE---QFAETARQI-------ENSLRQFDTADAAT 252

Query: 228 ESKLSESKFIQLMRNISSGDVTL 250
           ++K+  S F++LM +IS+  V L
Sbjct: 253 KAKIENSDFLKLMSSISNKQVVL 275
>gb|AAC69177.1| peroxisomal targeting signal 1 receptor; PTS1 receptor; Pex5p
           [Nicotiana tabacum]
          Length = 741

 Score = 39.1 bits (90), Expect = 0.35,   Method: Composition-based stats.
 Identities = 23/112 (20%), Positives = 43/112 (38%), Gaps = 10/112 (8%)

Query: 152 RQGNRDNPTDAFQFDAEFQVLEREIQK-----ERYEPI--TRRDEKWFDQDQSELQRIAT 204
               R  P  A  +  EF     E        + +E           F+ +QS+L  I  
Sbjct: 230 SLAMRPAPHVADGWINEFAQNRVEHADPNAWAQSFEQQHGANGWASEFEHEQSQLGMIGQ 289

Query: 205 ---DIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKN 253
                +        +   + TL+ + + K   SKF+Q +  +S G++T+++N
Sbjct: 290 MRGANIPNLAAMEQTRMLAHTLAQNNDPKFQNSKFLQFVSKMSRGEITIEEN 341
>ref|XP_001025150.1| DNA repair helicase (rad3) [Tetrahymena thermophila SB210]
 gb|EAS04905.1| DNA repair helicase (rad3) [Tetrahymena thermophila SB210]
          Length = 1032

 Score = 38.8 bits (89), Expect = 0.49,   Method: Composition-based stats.
 Identities = 30/152 (19%), Positives = 61/152 (40%), Gaps = 8/152 (5%)

Query: 95   SMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQHLPFKNCMIRKTSCTYP-PQKTLRQQRQ 153
            ++K  D+LE     ++ +SN +  Q      Q   F    ++        PQK  +Q   
Sbjct: 864  NLKKKDNLELEQAEQQSFSNTQKPQHGIQVVQKRAFDFTSVKSLQNQIQKPQKLFKQNIL 923

Query: 154  GNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPP 213
             ++D  +         Q LE+ +Q  +  P  +  +   +Q + + Q +           
Sbjct: 924  VSKDEESQKSY-----QTLEKPVQNIKKNPFCQSSDDEENQPEQKFQIVKPSQSIKQENN 978

Query: 214  PSSASSSSTLSSSVESKLSESKFIQLMRNISS 245
             ++ SS    S  ++ K S+ K +  ++N+SS
Sbjct: 979  QNTTSSQFDFSKPIKIKSSQDKIM--IKNVSS 1008
>emb|CAL54022.1| putative peroxisomal targeting signal 1 rece (ISS) [Ostreococcus
           tauri]
          Length = 607

 Score = 37.6 bits (86), Expect = 0.95,   Method: Composition-based stats.
 Identities = 27/113 (23%), Positives = 44/113 (38%), Gaps = 19/113 (16%)

Query: 144 PQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIA 203
           P   + +  Q      T   Q+  EF  + R    + +     R           +Q   
Sbjct: 116 PVPRMDRWAQE-FQQSTQGAQWGDEFDSMNRNKWAQEFSEQQTRS----------IQAPL 164

Query: 204 TDIVKCCTPPPSSASSSSTLSSSVES--KLSESKFIQLMRNISSGDVTLKKNA 254
           TDI         +A  SS L++++ +  K + SKF QLM  + SG V +K + 
Sbjct: 165 TDI------AAHTAKQSSELAATLNADPKFANSKFAQLMSKLGSGQVVVKDDG 211
>ref|XP_001877777.1| predicted protein [Laccaria bicolor S238N-H82]
 gb|EDR11880.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 702

 Score = 36.8 bits (84), Expect = 2.1,   Method: Composition-based stats.
 Identities = 31/171 (18%), Positives = 61/171 (35%), Gaps = 24/171 (14%)

Query: 115 YESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLER 174
            + QQ L+ + QHL   N    K    Y     +  Q+ G + N         +      
Sbjct: 114 SQHQQILSPTSQHLVHNNIEEAKQDLHYRGDTAI-IQQPGVQWNLGLTQFRMNQTPAYAP 172

Query: 175 EIQKERYEPITRRDEKW---FDQDQSELQRIA-------TDIVKCCTPPPSSASSSS--- 221
            +  ++   I+ +   W   F+     L+  +        +I +         + ++   
Sbjct: 173 ALHMQQQPVISNKRISWDREFNAQVLSLETSSTMEIEQKQEIYQQTPREQDDLARTAGML 232

Query: 222 --TLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGEL 270
              + +    K  +S+F+ LM+ +  G+V +    DGN   E    NNG+ 
Sbjct: 233 LENIKNEQNPKFQKSQFLGLMKQLRDGEVVV----DGNQMVE----NNGQT 275
>ref|ZP_01062455.1| glutamyl-tRNA synthetase [Flavobacterium sp. MED217]
 gb|EAQ47845.1| glutamyl-tRNA synthetase [Leeuwenhoekiella blandensis MED217]
          Length = 1123

 Score = 36.1 bits (82), Expect = 2.8,   Method: Composition-based stats.
 Identities = 29/108 (26%), Positives = 44/108 (40%), Gaps = 9/108 (8%)

Query: 150 QQRQGNRDNPTDAFQ--FDAEFQVLEREIQKERYEPITRRDEKWFDQDQS---ELQRIAT 204
           Q+++G+ DNP    Q      F+ L+RE+   + E           +DQ    E+Q+   
Sbjct: 661 QEKEGSSDNPQLDKQKELYESFESLKRELADLKQENQELSKPIDIPEDQKLEEEIQQAQE 720

Query: 205 DIVKCCTPPPSSASSSSTLSSSVES--KLS--ESKFIQLMRNISSGDV 248
             VK    P  S S     SS   +  K+     K +Q M +   GDV
Sbjct: 721 KAVKNLENPEDSKSQEDAKSSQKNAGKKMQKMGQKLMQQMASGGGGDV 768
>gb|EEH37053.1| peroxin 20 [Paracoccidioides brasiliensis Pb01]
          Length = 355

 Score = 36.1 bits (82), Expect = 2.9,   Method: Composition-based stats.
 Identities = 37/251 (14%), Positives = 76/251 (30%), Gaps = 34/251 (13%)

Query: 46  DSHVEKHFLHNGSNFDGIDNVRYQNQPS----PLTFITPNNTVDSSDWVPQFSSMKIDDS 101
           D  ++  F    +   G   +     P+    P       N + +  W   F +++I   
Sbjct: 47  DGALDPEFNAFEAGLHGA-PLSDIQHPASFLPPAAHGPVLNHIHNPGWASDFQNLQISGP 105

Query: 102 ----------LEFSSE------YKRLYSNYESQQRLN-SSRQHLPFKNCM-IRKTSCTYP 143
                      E S+        ++ +   +SQ     +S+  L +   M +        
Sbjct: 106 SQHVSQQQFRPESSASGWQNDFMRQAHPKPQSQTPATQTSQMPLRYGPVMGVMGFGGMSS 165

Query: 144 PQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIA 203
                +   Q +     + F FD           +   E   ++ ++    +  E+Q+  
Sbjct: 166 HHSVYQDPYQSHVQEHKEQFVFDESAFEAAFAEARAEVELQEQKSQEEGTTEPLEIQQTI 225

Query: 204 TDIVKCCTPPPSSASSSSTLSSSV-----------ESKLSESKFIQLMRNISSGDVTLKK 252
                   PP    + S  L+ +              K  ES F+ LMR +   +V +  
Sbjct: 226 RIGSDTIRPPVEGTNESDELAKTAGQLLDSVSHDQSQKFKESNFLALMRQLRDREVVVDG 285

Query: 253 NADGNSASELF 263
           +    +A  L 
Sbjct: 286 DEFLPTAQPLH 296
>ref|XP_001443227.1| hypothetical protein GSPATT00011430001 [Paramecium tetraurelia
           strain d4-2]
 emb|CAK75830.1| unnamed protein product [Paramecium tetraurelia]
          Length = 675

 Score = 36.1 bits (82), Expect = 2.9,   Method: Composition-based stats.
 Identities = 32/170 (18%), Positives = 62/170 (36%), Gaps = 37/170 (21%)

Query: 129 PFKNCMIRKTSCTYPPQKTLRQQRQGN-------RDNPTDAFQFDAEF---------QVL 172
           P K   I +       QK L+     N       R +  + +    +F         + +
Sbjct: 255 PQKRLSIEQILTQLQSQKKLQCTSHQNIVCKILRRSSAHEDYHSPNQFSKPLKTLPNEEI 314

Query: 173 EREIQKE--RYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESK 230
           E++I+    R+  I  +    +DQ   E + +A     C      +    +T SS  + +
Sbjct: 315 EQQIEPNSCRHYQIQHKI-NLYDQ---EFKELAKGF--CSQQQTDNKIKENTKSSKRKVQ 368

Query: 231 LS----ESKFIQLMRNISSGDVT-LKKNADGNSASELFSSNNGELVGNRH 275
           +     +S +IQ +      + + L++NA         SS+  E V N +
Sbjct: 369 IQIPIVQSNYIQKLEKPKISENSDLQQNA--------KSSSTNETVKNSY 410
>ref|XP_001482477.1| hypothetical protein PGUG_05497 [Pichia guilliermondii ATCC 6260]
 gb|EDK41399.1| hypothetical protein PGUG_05497 [Pichia guilliermondii ATCC 6260]
          Length = 439

 Score = 35.3 bits (80), Expect = 5.3,   Method: Composition-based stats.
 Identities = 25/127 (19%), Positives = 40/127 (31%), Gaps = 18/127 (14%)

Query: 129 PFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRD 188
           PF+  M R    T         Q     D       FD +F  LE+E+ +        ++
Sbjct: 166 PFRLNM-RTDLSTPLQGPITEHQEIHRDD--HRQLAFDEQFDRLEQELAQAETHNEMDKE 222

Query: 189 EKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDV 248
                    E  R A  + +           +   S     K  +S F++LM +I    V
Sbjct: 223 ---------EFARTAGRVKET------MLVDNPERSEETTQKFQQSNFLKLMTSIHDRSV 267

Query: 249 TLKKNAD 255
            +    D
Sbjct: 268 EISDEGD 274
>ref|XP_001556458.1| hypothetical protein BC1G_05227 [Botryotinia fuckeliana B05.10]
 gb|EDN24065.1| hypothetical protein BC1G_05227 [Botryotinia fuckeliana B05.10]
          Length = 355

 Score = 35.3 bits (80), Expect = 6.0,   Method: Composition-based stats.
 Identities = 27/133 (20%), Positives = 48/133 (36%), Gaps = 8/133 (6%)

Query: 152 RQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCT 211
            Q  +    +    D +    E E++ +R E      +     D  +    AT       
Sbjct: 221 SQAAKAQNEEIHAMDQQHVTKEAELELDRAEK-QLLGQSLLGADTIQDPATATSEQNAHA 279

Query: 212 PPPSSASSSSTLSSSVE---SKLSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNG 268
           P   S ++   LSS       K   S+F+QLMR     +VT++ +   +    + +   G
Sbjct: 280 PDALSRTAGELLSSVSNNQSDKFQNSQFLQLMRQFRDKEVTVEGDKVVDGP--MKTKQGG 337

Query: 269 ELVGNRHIFVKDE 281
              GN+  F +  
Sbjct: 338 --FGNQTRFEQQY 348
>ref|XP_001033074.1| hypothetical protein TTHERM_00471900 [Tetrahymena thermophila
           SB210]
 gb|EAR85411.1| hypothetical protein TTHERM_00471900 [Tetrahymena thermophila
           SB210]
          Length = 965

 Score = 34.9 bits (79), Expect = 6.1,   Method: Composition-based stats.
 Identities = 46/240 (19%), Positives = 76/240 (31%), Gaps = 24/240 (10%)

Query: 6   HTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDN 65
               +E   Q+   +   +   SK  ++  T +       +S   K FL N S       
Sbjct: 717 QKQKLEYEKQRQESMSPKTRFDSK-GQIRSTTIRPGSNILNSSQHKTFLQNLSRQLRDTT 775

Query: 66  VRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSR 125
           +  Q   S           ++ D    FS  KID          +L SNY  Q+  N  R
Sbjct: 776 IINQQNQSSAQ-------EENEDNQNSFSQNKIDS----KDSRAKL-SNYHFQEDNNQKR 823

Query: 126 Q--HLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKE--RY 181
           Q       +      S  Y  ++ L Q   G  D  T+  +       ++  I  +   Y
Sbjct: 824 QVDDCNNNSINFSSVSQHYLDEENLGQSELGITDQETNIQKLYQPQFNVDSYIGNKFSSY 883

Query: 182 EPITRRDEKWFDQDQSELQRIA-------TDIVKCCTPPPSSASSSSTLSSSVESKLSES 234
                ++  + + D+S  Q            I K  +   S +   +    SV  +L  S
Sbjct: 884 FSKQSQEIAFIEGDKSNSQHSNLLLGLGDKSISKSNSDEMSQSYFINEKRDSVNEQLYSS 943
>ref|ZP_01624021.1| hypothetical protein L8106_27826 [Lyngbya sp. PCC 8106]
 gb|EAW34016.1| hypothetical protein L8106_27826 [Lyngbya sp. PCC 8106]
          Length = 873

 Score = 34.9 bits (79), Expect = 6.6,   Method: Composition-based stats.
 Identities = 25/156 (16%), Positives = 63/156 (40%), Gaps = 10/156 (6%)

Query: 56  NGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNY 115
             S+F+       + Q +  TF++  + V          ++K+ D L+   E+ +L++  
Sbjct: 161 QDSHFNDSIEDFGRLQAALNTFVSQADQVKGE------QAIKLIDELKLKVEHYQLFTQE 214

Query: 116 ESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRD----NPTDAFQFDAEFQV 171
           + ++ L    +  P K    +        Q+    +RQ            + ++  + + 
Sbjct: 215 KLEKVLPQEAEVKPQKQLESKHQQAEDIQQQISDLKRQHQEQLQNREAELSVKYKEQIRQ 274

Query: 172 LEREIQKERYEPITRRDEKWFDQDQSELQRIATDIV 207
            E E+  +  E + + + +   Q Q +L+R  T+++
Sbjct: 275 REAELSLQHQEQLEKSESQLKHQHQEQLKRRETELL 310
>ref|XP_001015711.1| hypothetical protein TTHERM_00077700 [Tetrahymena thermophila SB210]
 gb|EAR95466.1| hypothetical protein TTHERM_00077700 [Tetrahymena thermophila SB210]
          Length = 1206

 Score = 34.9 bits (79), Expect = 7.2,   Method: Composition-based stats.
 Identities = 28/188 (14%), Positives = 67/188 (35%), Gaps = 12/188 (6%)

Query: 49   VEKHFLHNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKI-DDSLEFSSE 107
            +E+  L N     G+  +    Q +        + +  +        ++I  +  EF++ 
Sbjct: 896  LEQLTLQNSQQQGGLLQLEIPQQQNEHFIRFTESNLQYNKESDDIEQIQIKQNQDEFNT- 954

Query: 108  YKRLYSNYESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDA 167
             K+ Y  Y ++Q      Q+   +N    +T   +  +    Q    N    +D    + 
Sbjct: 955  -KQNYDQYNTKQSDYQFEQNAAEQNLY--QTQTNFTIKPIPVQNTLSNLTESSDQNFKNR 1011

Query: 168  EFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV 227
            E +  +   Q E+++            ++S +Q+I   + +        ++S  +  S  
Sbjct: 1012 EIKYEQIRDQDEQFQERQSS-------NKSYIQQINQILNQMQNLKQLQSTSQKSKDSQQ 1064

Query: 228  ESKLSESK 235
             S  + S+
Sbjct: 1065 NSISNNSQ 1072
>gb|AAC23564.2| peroxin [Yarrowia lipolytica]
          Length = 417

 Score = 34.5 bits (78), Expect = 8.5,   Method: Composition-based stats.
 Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 14/97 (14%)

Query: 154 GNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPP 213
               +P +      +   L R+ +    + I+R +      +  E   IA  I       
Sbjct: 214 EVEKDPMEQTYRYDQADALNRQAE-HISDNISREEVDIKTDENGEFASIARQI------- 265

Query: 214 PSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTL 250
                 +S+L  + +SK  +S F+ LMR I + +VTL
Sbjct: 266 ------ASSLEEADKSKFEKSTFMNLMRRIGNHEVTL 296
>ref|XP_001215991.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gb|EAU33357.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 376

 Score = 34.5 bits (78), Expect = 9.8,   Method: Composition-based stats.
 Identities = 35/241 (14%), Positives = 74/241 (30%), Gaps = 49/241 (20%)

Query: 72  PSPLTFITPNNTVDSSDWVPQFSSMKIDDS-------LEFSSE---------YKRLYSNY 115
           P         +  ++  W   F  ++I             S+          ++  + N 
Sbjct: 74  PFTAQGPHFMHPAENQSWAADFQKLQISGPSHPIHQQPGLSASSAPMMAQQGWQNEFMNQ 133

Query: 116 ES-QQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRD---------NPTDAFQF 165
           +  QQR   ++QH P+        + +YP   +        ++            D   F
Sbjct: 134 QQPQQRHTPNQQHQPYTQGFQSSFAPSYPMHASPMNAFPAAQEAIANQHPQTETFDESAF 193

Query: 166 DAEFQVLEREIQKERYEPITRRDEKWFDQ----------DQSELQRIATDIV-------- 207
           +A F+    ++  +  E      E+   +          +  E  RI +D +        
Sbjct: 194 EAAFEQARADMASQAAEITQEHTEETTVEATHIETTTSFEPQETVRIGSDTIPQSNKEDP 253

Query: 208 -----KCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASEL 262
                       ++     ++S     K  ES F+ LMR I   +V ++ +    ++  L
Sbjct: 254 QSRVNDADELARTAGQLLDSVSHDQSQKFRESNFLALMRRIRDREVHVEGDEFRETSQSL 313

Query: 263 F 263
            
Sbjct: 314 H 314
>ref|XP_503644.1| YlPEX20 [Yarrowia lipolytica]
 emb|CAG79226.1| YlPEX20 [Yarrowia lipolytica CLIB122]
          Length = 417

 Score = 34.5 bits (78), Expect = 10.0,   Method: Composition-based stats.
 Identities = 22/97 (22%), Positives = 39/97 (40%), Gaps = 14/97 (14%)

Query: 154 GNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPP 213
               +P +      +   L R+ +    + I+R +      +  E   IA  I       
Sbjct: 214 EVEKDPMEQTYRYDQADALNRQAE-HISDNISREEVDIKTDENGEFASIARQI------- 265

Query: 214 PSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTL 250
                 +S+L  + +SK  +S F+ LMR I + +VTL
Sbjct: 266 ------ASSLEEADKSKFEKSTFMNLMRRIGNHEVTL 296
Searching..................................................done Results from round 3


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

gb|AAS56603.1|  YGR239C [Saccharomyces cerevisiae]                379   e-103
ref|NP_011755.1|  Peroxin required for targeting of peroxiso...   378   e-103
ref|XP_001647071.1|  hypothetical protein Kpol_1050p71 [Vand...   278   3e-73
ref|NP_983945.1|  ADL151Wp [Ashbya gossypii ATCC 10895] >gi|...   265   4e-69
ref|NP_012030.1|  Peroxin required for targeting of peroxiso...   262   3e-68
gb|EDN62399.1|  peroxin [Saccharomyces cerevisiae YJM789]         259   2e-67
ref|XP_454648.1|  unnamed protein product [Kluyveromyces lac...   239   1e-61
ref|XP_446317.1|  unnamed protein product [Candida glabrata]...   235   2e-60
ref|XP_447344.1|  hypothetical protein CAGL0I02134g [Candida...   229   2e-58
gb|AAX14715.1|  Pex20p [Pichia angusta]                           180   1e-43
emb|CAA97269.1|  unnamed protein product [Saccharomyces cere...   167   9e-40
Sequences not found previously or not previously below threshold:

ref|XP_459664.1|  hypothetical protein DEHA0E08701g [Debaryo...    55   7e-06
gb|EEH49084.1|  peroxin 20 [Paracoccidioides brasiliensis Pb18]    53   3e-05
ref|XP_001401400.1|  hypothetical protein An04g00370 [Asperg...    51   9e-05
ref|XP_001524003.1|  hypothetical protein LELG_04816 [Lodder...    49   3e-04
gb|AAX11696.1|  peroxin 20 [Pichia pastoris]                       49   3e-04
ref|XP_001821473.1|  hypothetical protein [Aspergillus oryza...    48   0.001
ref|XP_503644.1|  YlPEX20 [Yarrowia lipolytica] >gi|49649513...    47   0.002
ref|XP_001830051.1|  hypothetical protein CC1G_04484 [Coprin...    46   0.003
ref|XP_661922.1|  hypothetical protein AN4318.2 [Aspergillus...    45   0.008
ref|XP_001482477.1|  hypothetical protein PGUG_05497 [Pichia...    44   0.013
ref|XP_752214.2|  conserved hypothetical protein [Aspergillu...    44   0.017
ref|XP_001215991.1|  conserved hypothetical protein [Aspergi...    43   0.019
gb|ABH11425.1|  peroxin 20 [Penicillium chrysogenum]               43   0.023
ref|XP_001906106.1|  unnamed protein product [Podospora anse...    43   0.029
gb|AAC23564.2|  peroxin [Yarrowia lipolytica]                      43   0.029
ref|XP_001385675.2|  hypothetical protein PICST_84845 [Pichi...    42   0.060
ref|XP_001271502.1|  conserved hypothetical protein [Aspergi...    42   0.065
ref|XP_001267407.1|  conserved hypothetical protein [Neosart...    40   0.16 
ref|XP_001015711.1|  hypothetical protein TTHERM_00077700 [T...    39   0.34 
gb|EEH37053.1|  peroxin 20 [Paracoccidioides brasiliensis Pb01]    39   0.41 
ref|XP_001033074.1|  hypothetical protein TTHERM_00471900 [T...    38   0.72 
ref|XP_001586173.1|  hypothetical protein SS1G_12748 [Sclero...    35   5.3  
ref|XP_001556458.1|  hypothetical protein BC1G_05227 [Botryo...    35   7.0  
>gb|AAS56603.1| YGR239C [Saccharomyces cerevisiae]
          Length = 288

 Score =  379 bits (975), Expect = e-103,   Method: Composition-based stats.
 Identities = 287/288 (99%), Positives = 287/288 (99%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180
           LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER
Sbjct: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180

Query: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240
           YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSE KFIQLM
Sbjct: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSEWKFIQLM 240

Query: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
           RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD
Sbjct: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
>ref|NP_011755.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex18p; Pex21p [Saccharomyces cerevisiae]
 sp|P50091|PEX21_YEAST Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAA61191.1| ORF 288 [Saccharomyces cerevisiae]
 emb|CAA97267.1| unnamed protein product [Saccharomyces cerevisiae]
 gb|EDN61824.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 288

 Score =  378 bits (970), Expect = e-103,   Method: Composition-based stats.
 Identities = 288/288 (100%), Positives = 288/288 (100%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180
           LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER
Sbjct: 121 LNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKER 180

Query: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240
           YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM
Sbjct: 181 YEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLM 240

Query: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
           RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD
Sbjct: 241 RNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
>ref|XP_001647071.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO19213.1| hypothetical protein Kpol_1050p71 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 283

 Score =  278 bits (712), Expect = 3e-73,   Method: Composition-based stats.
 Identities = 82/298 (27%), Positives = 144/298 (48%), Gaps = 34/298 (11%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           M SVC ++P+++  ++G +      +   R +    E      +  + VE  FL N  + 
Sbjct: 1   MSSVCQSNPLQQFARKGEQATGRWRLNEGRVQNRGGE------SIQNSVENEFLRNDESL 54

Query: 61  DGIDNVRYQNQP------SPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSN 114
               ++R  NQP      +P      +  V+   W+ +FS + I D LEFS  YK+LYSN
Sbjct: 55  ----SIRGGNQPHNMFGRAPAQMGHSSGVVEKESWLNEFSKIAIKDPLEFSDRYKQLYSN 110

Query: 115 YESQQRLNSSR-----QHLPFKNCMIRKTSCTYPPQ--KTLRQQRQGNRDNPTDAFQFDA 167
           YE++Q + S       Q +   N  +++       Q  +++  Q   + D   +      
Sbjct: 111 YETRQTVTSMHNQGYYQPMSNMNLHVQQNWSMMEGQINQSMNSQETAHIDQAHNDLDMYL 170

Query: 168 EFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV 227
           E +  + E++ +  +  +  +     Q+Q + + IATDIV  C         ++  +S++
Sbjct: 171 EKEFNDIELELQELDEHSHSEFASGVQEQQKFKEIATDIVDTCI--------TNNKNSTM 222

Query: 228 ESKLSESKFIQLMRNISSGDVTLK--KNADGNS-ASELFSSNNGELVGNRHIFVKDEI 282
            SKL++SKF+ LM  IS G+VTLK  ++ D N+ A EL+S+  G+ VGN    ++D I
Sbjct: 223 TSKLAKSKFMGLMMQISDGEVTLKIDEHPDANTNAKELYSTKTGKTVGNDFHDIQDTI 280
>ref|NP_983945.1| ADL151Wp [Ashbya gossypii ATCC 10895]
 sp|Q75AS1|PEX21_ASHGO Peroxisomal membrane protein PEX21 (Peroxin-21)
 gb|AAS51769.1| ADL151Wp [Ashbya gossypii ATCC 10895]
          Length = 307

 Score =  265 bits (677), Expect = 4e-69,   Method: Composition-based stats.
 Identities = 73/289 (25%), Positives = 110/289 (38%), Gaps = 30/289 (10%)

Query: 3   SVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDG 62
           S+C  S ++K+I +        +    R                S  E+  L   +    
Sbjct: 2   SLCQGSAMQKLIAKTE----GPMAGVGRV---GGFNRPSGGLGQSSAEQQ-LQARAGERA 53

Query: 63  IDN--VRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
             N  +        L           +DWV QFSSM+++D L FS+EY+R Y+ YE +Q 
Sbjct: 54  SQNRFMAVLEPQRELGGRMARGDGLQADWVRQFSSMQVEDPLAFSAEYQRAYAGYEQRQA 113

Query: 121 LNSSRQHLPFKNCMIRKT--SCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQ- 177
              + + L +   M   T        Q   +Q  Q            + EF VLE E+  
Sbjct: 114 ARPAARVLGYGGSMFMPTMPQQQLQQQVAPQQTAQAALQEAELERYLEREFDVLEGELAP 173

Query: 178 KERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFI 237
            E  E ++       D +Q   Q  A  I    + P              + K   SKF+
Sbjct: 174 PEVPEALSA---PLLDHEQLGFQESAKAIYATLSAPIH------------KDKFGASKFM 218

Query: 238 QLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDI 286
            LMR +S+GDVTL K+  G +   L  +  GE VG  +  V DE+ +  
Sbjct: 219 GLMRQVSTGDVTLSKSESGYTG--LHMTAGGEAVGAEYRAVTDEVVQVP 265
>ref|NP_012030.1| Peroxin required for targeting of peroxisomal matrix proteins
           containing PTS2; interacts with Pex7p; partially
           redundant with Pex21p; Pex18p [Saccharomyces cerevisiae]
 sp|P38855|PEX18_YEAST Peroxisomal membrane protein PEX18 (Peroxin-18)
 gb|AAB68992.1| Yhr160cp [Saccharomyces cerevisiae]
          Length = 283

 Score =  262 bits (669), Expect = 3e-68,   Method: Composition-based stats.
 Identities = 50/294 (17%), Positives = 102/294 (34%), Gaps = 27/294 (9%)

Query: 2   PSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFD 61
            + C T+ + K I    +        +       + L          +E  FL +  + +
Sbjct: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSFNKIGSRL-NSPPILKDKIELKFLQHSEDLN 61

Query: 62  GIDNVRYQNQPSPLT-----FITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYE 116
              +     +P  L      F  PN   + + W   F   + +         +   +N  
Sbjct: 62  QSRSY-VNIRPRTLEDQSYKFEAPNLNDNETSWAKDF---RYNFPKNVEPPIENQIANLN 117

Query: 117 SQQRLNSSRQHLP---FKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLE 173
               L +S+   P   +        S      +  +     +  N       +AEF  LE
Sbjct: 118 INNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELE 177

Query: 174 -REIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLS 232
              ++++ +           D++ + ++ +A+DI++ C          ++ +  V+ +L+
Sbjct: 178 ASSLEEDVHTEEENSGTSLEDEE-TAMKGLASDIIEFCDN--------NSANKDVKERLN 228

Query: 233 ESKFIQLMRNISSGDVTLKKNADGNSASELFS----SNNGELVGNRHIFVKDEI 282
            SKF+ LM +IS G + LKK+       +        N+G   G     V+D I
Sbjct: 229 SSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
>gb|EDN62399.1| peroxin [Saccharomyces cerevisiae YJM789]
          Length = 283

 Score =  259 bits (662), Expect = 2e-67,   Method: Composition-based stats.
 Identities = 50/294 (17%), Positives = 102/294 (34%), Gaps = 27/294 (9%)

Query: 2   PSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFD 61
            + C T+ + K I    +        +       + L          +E  FL +  + +
Sbjct: 3   SNRCQTNEVNKFISSTEKGPFTGRDNTLSFSKIGSRL-NSPPILKDKIELKFLQHSEDLN 61

Query: 62  GIDNVRYQNQPSPLT-----FITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYE 116
              +     +P  L      F  PN   + + W   F   + +         +   +N  
Sbjct: 62  QSQSY-VNIRPRTLEDQSYKFEAPNLNDNETSWAKDF---RYNFPKNVEPPIENQIANLN 117

Query: 117 SQQRLNSSRQHLP---FKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLE 173
               L +S+   P   +        S      +  +     +  N       +AEF  LE
Sbjct: 118 INNGLRTSQTDFPLGFYSQKNFNIASFPVVDHQIFKTTGLEHPINSHIDSLINAEFSELE 177

Query: 174 -REIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLS 232
              ++++ +           D++ + ++ +A+DI++ C          ++ +  V+ +L+
Sbjct: 178 ASSLEEDVHTEEENSGTSLEDEE-TAMKGLASDIIEFCDN--------NSANKDVKERLN 228

Query: 233 ESKFIQLMRNISSGDVTLKKNADGNSASELFS----SNNGELVGNRHIFVKDEI 282
            SKF+ LM +IS G + LKK+       +        N+G   G     V+D I
Sbjct: 229 SSKFMGLMGSISDGSIVLKKDNGTERNLQKHVGFCFQNSGNWAGLEFHDVEDRI 282
>ref|XP_454648.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CN41|PEX21_KLULA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG99735.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 306

 Score =  239 bits (611), Expect = 1e-61,   Method: Composition-based stats.
 Identities = 77/298 (25%), Positives = 134/298 (44%), Gaps = 27/298 (9%)

Query: 3   SVCHTSPIEKIIQQ-----GHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNG 57
           SVC T+P+  ++ +     G   Q+   I  ++++  +  L   ++         F H  
Sbjct: 2   SVCQTNPLNTLVSKSGYRFGQPNQHQQSIAQQQSRPRNNALDHQFSQFTGASGPSFAHPQ 61

Query: 58  SNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYES 117
                +  V      +  T        D   W+ QF++M++ D  EF ++YK++YS YE+
Sbjct: 62  HQQLPMAAVVANATTAASTTSNTATASDHHVWIDQFANMQVHDKTEFPTDYKQMYSQYEA 121

Query: 118 QQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQ 177
           +    S     P    M+  +      Q++L   +  ++   + + + D +F  LER++Q
Sbjct: 122 R--GASYSMGAP---VMVSHSQMFPRHQQSLMVNQLESQAYASSSAKLDEQFAELERQVQ 176

Query: 178 KERYEPITRRDEKW-------FDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESK 230
            +  E    +D+ +        D+DQ +L+  A  I    +   S+ SS           
Sbjct: 177 DDEKEQQQDKDDDFHLKETSPLDEDQRQLKEAAQSIYTTLSDKSSTTSSKF--------- 227

Query: 231 LSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHKDILD 288
            S SKF+ LMRNIS G +TLKKN D +  +EL+S + GE  G  +  V+D +  D LD
Sbjct: 228 -SNSKFLGLMRNISDGVITLKKNPDEDKYTELYSPSTGETFGEEYFPVQDSVLGDPLD 284
>ref|XP_446317.1| unnamed protein product [Candida glabrata]
 sp|Q6FTX7|PEX21_CANGA Peroxisomal membrane protein PEX21 (Peroxin-21)
 emb|CAG59241.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 232

 Score =  235 bits (601), Expect = 2e-60,   Method: Composition-based stats.
 Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 34/260 (13%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           M   C   PI++++ +     +       + +      T         VE  FL N S  
Sbjct: 1   MSQYCAADPIQRMVSRKGPYLDTQQQRVYKHRSGTGHGTNIRQPTTGIVEATFLQNNSGM 60

Query: 61  DGI--DNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQ 118
           + +    +  Q   +  T  T   +VD++ WV  FSSMK+ D LEF+  YK LY  YE +
Sbjct: 61  NAVHPTTLNVQTHNAHATGRTIAKSVDTNSWVNDFSSMKVQDPLEFADSYKNLYKQYEQK 120

Query: 119 QRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQK 178
           Q  N+     P +N  I   +    PQ TL         +       D EF+ +E E++ 
Sbjct: 121 QFHNTVPAQCP-RNLYIPTVT---RPQTTLNIVDHHEDKSSAVDQLIDDEFEKIEEEVK- 175

Query: 179 ERYEPITRRDEKWFDQDQS-ELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFI 237
                         DQ+Q  E Q  A   ++ C              S ++ KL  SKF 
Sbjct: 176 --------------DQNQKLEFQESAQSFIEICE------------RSQMKEKLQRSKFF 209

Query: 238 QLMRNISSGDVTLKKNADGN 257
           Q+M+ +S G+ T++++ +  
Sbjct: 210 QVMQKVSDGEATIRQDGENY 229
>ref|XP_447344.1| hypothetical protein CAGL0I02134g [Candida glabrata CBS138]
 emb|CAG60281.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 259

 Score =  229 bits (585), Expect = 2e-58,   Method: Composition-based stats.
 Identities = 65/294 (22%), Positives = 128/294 (43%), Gaps = 48/294 (16%)

Query: 3   SVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDG 62
           S CH++P++ +  + H     +         AH     H    +  +E+ FLH  S +D 
Sbjct: 2   SACHSNPLQSLSSKTHTFTTRTR--------AHNYEHNHDRISNP-LEREFLHQDSIYDT 52

Query: 63  IDNVRYQ-NQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRL 121
              V     QP  +    P +T  ++ W+ +F  + I D       + R       +   
Sbjct: 53  ATPVAIPIQQPRQV----PRDTKKNTQWLSEFQQLSISDVPSVKPSFTRPAPTSTFRIIN 108

Query: 122 NSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQK--- 178
           N  R+   FK          +     + +     R++      ++ EF+ LE+E++    
Sbjct: 109 NDVRKDSLFK----------FQEHPIIDRLSPEARED-----YYNNEFEQLEKELENDDD 153

Query: 179 ----ERYEPITRRDEKWFDQ----DQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESK 230
               ++++ +  +  +++++    DQS  Q+ A DI+       + ++S+   SS +  K
Sbjct: 154 VQDGDQFQDLENKTYEYYNEFAFNDQSHFQQSARDILN------NMSNSTHEYSSELNEK 207

Query: 231 LSESKFIQLMRNISSGDVTLKKNADGNSASELFSSNNGELVGNRHIFVKDEIHK 284
           LS S F+ L+++I  GDV+L+ N  G    EL +SN+   +GN++  + D I +
Sbjct: 208 LSGSTFVNLLKSIDIGDVSLQDNRQG--PKELKNSNDNTTLGNKYADIPDYILE 259
>gb|AAX14715.1| Pex20p [Pichia angusta]
          Length = 309

 Score =  180 bits (457), Expect = 1e-43,   Method: Composition-based stats.
 Identities = 50/262 (19%), Positives = 93/262 (35%), Gaps = 33/262 (12%)

Query: 8   SPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAH-YATEDSHVEKH---FLHNGSNFDGI 63
           + + K  Q+ +   ++SL    R       L A  + T D  +E     F     ++   
Sbjct: 15  NALNKFTQRAN--VDNSLANQLRANSDSQRLQAQPHQTVDQQLESEYRAFQQQTPSYQAS 72

Query: 64  DNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNS 123
               +Q  P+P         ++ + WV  F  M + D+       +    +  SQQ  + 
Sbjct: 73  ----FQTLPTPAF-------IEENQWVNHFRDMNLHDTPGAQHLQQVPTQSLHSQQWRHE 121

Query: 124 SRQHLPFK---------NCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLER 174
             Q    +         +   R  +       +L  Q         D    DA F  LE 
Sbjct: 122 FAQSQQQQQQQETADTYSSAFRMRTGAPQLATSLAPQVVSAATVREDFSAVDAAFDELEH 181

Query: 175 EIQKERYEPITRRDEKWFDQDQS-ELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSE 233
           E+  E+ E       K  ++D   +  ++A  +    + PP      + +S     K  +
Sbjct: 182 EMHDEKMEEQQAEQVKPQNEDDKIKFAQLARSVFLTMSQPP------ADISKQTSDKFQK 235

Query: 234 SKFIQLMRNISSGDVTLKKNAD 255
           S F++LM  IS+ +V + ++ D
Sbjct: 236 SNFLKLMNRISTREVEMNEDRD 257
>emb|CAA97269.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 123

 Score =  167 bits (423), Expect = 9e-40,   Method: Composition-based stats.
 Identities = 123/123 (100%), Positives = 123/123 (100%)

Query: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60
           MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF
Sbjct: 1   MPSVCHTSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNF 60

Query: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
           DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR
Sbjct: 61  DGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120

Query: 121 LNS 123
           LNS
Sbjct: 121 LNS 123
>ref|XP_459664.1| hypothetical protein DEHA0E08701g [Debaryomyces hansenii CBS767]
 emb|CAG87898.1| unnamed protein product [Debaryomyces hansenii CBS767]
          Length = 413

 Score = 54.9 bits (131), Expect = 7e-06,   Method: Composition-based stats.
 Identities = 36/247 (14%), Positives = 83/247 (33%), Gaps = 25/247 (10%)

Query: 11  EKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDNVRYQN 70
           +++ Q   +  N       + +  H +   H+   +                   +    
Sbjct: 48  QRLNQDFQQFSNGQAFNQYQPEPVHQQFHQHHRQHNQAQHSQHHPQQGWVQDFSKMSINQ 107

Query: 71  QPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQHLPF 130
           QP         N    +DW  QF +       + +  Y+  +     +  + ++  +  +
Sbjct: 108 QPRH-QAGNIQNGKVQNDWHQQFMNQTSQHPAQNNVAYQPQHQMGGFRMNMRTNLSNSLY 166

Query: 131 KNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEK 190
           +      T    P  K + +  + N+        FDA+F  +ERE+Q E        + +
Sbjct: 167 Q------TRGNVPEHKEMHKMEEENQI-------FDAQFDQMERELQHENMSNEQTAEVQ 213

Query: 191 W-FDQ-DQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDV 248
              ++ ++ +  + A  + +               ++   +K   S F++LM +IS+  V
Sbjct: 214 MDGNEFEKEQFAKTAKQVQESMLKGE---------NAETNAKFQNSNFLKLMSSISNRSV 264

Query: 249 TLKKNAD 255
            L    D
Sbjct: 265 ELSAEGD 271
>gb|EEH49084.1| peroxin 20 [Paracoccidioides brasiliensis Pb18]
          Length = 808

 Score = 52.5 bits (125), Expect = 3e-05,   Method: Composition-based stats.
 Identities = 36/284 (12%), Positives = 76/284 (26%), Gaps = 41/284 (14%)

Query: 7   TSPIEKII--QQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGID 64
           ++ ++         R      I S+++         H    D  ++  F    +      
Sbjct: 9   SNALQNFQKHSSVDRTLQQDRILSRQSPAQGFRSQNHN---DGALDPEFNAFEAGL-PGP 64

Query: 65  NVRYQNQPS----PLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
            +     P+    P       N + +  W   F +++I  S       ++      +   
Sbjct: 65  PLSDIQHPASFLPPAAHGPALNHIQNPGWASDFQNLQIS-SPSQHVSQQQFRPESSASSW 123

Query: 121 LNSSRQHLPFKNCMIRKTSCTYP-------------------PQKTLRQQRQGNRDNPTD 161
            N   + +  K       + T                          +   Q +     +
Sbjct: 124 QNDFMREVHQKPQSEPPATHTPQTPFRYGPVMGVMGFGGMSSHHPVYQDPYQSHVQEHKE 183

Query: 162 AFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSS 221
            F FD           +   E   ++ ++    +  E+Q+          PP    + S 
Sbjct: 184 QFVFDESAFEAAFAEARAEVELQEQKSQEEGTTEPLEVQQTIRIGSDTIRPPDEGTNESD 243

Query: 222 TL-----------SSSVESKLSESKFIQLMRNISSGDVTLKKNA 254
            L           S     K  ES F+ LMR +   +V +  + 
Sbjct: 244 ELAKTAGQLLDSVSHDQSQKFKESNFLALMRQLRDREVVIDGDE 287
>ref|XP_001401400.1| hypothetical protein An04g00370 [Aspergillus niger]
 emb|CAK38492.1| unnamed protein product [Aspergillus niger]
          Length = 378

 Score = 51.0 bits (121), Expect = 9e-05,   Method: Composition-based stats.
 Identities = 40/302 (13%), Positives = 81/302 (26%), Gaps = 49/302 (16%)

Query: 7   TSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDNV 66
           ++ ++   +     +          +    +           ++  F    S   G    
Sbjct: 9   SNALQNFQKHASVDRTLQQDRIISRQSPSQQGFRSQNPTQGVLDPEFAAFESGLAGAPPP 68

Query: 67  RYQN-----QPSPLTFITPNNTVDSSDWVPQFSSMKIDDSL----EFSSEYKRLYSNYES 117
             Q+      PS    ++     +SS+W   F +M I        + S       +    
Sbjct: 69  DVQHAVPFVPPSHHFPVSS--PAESSNWAADFQNMHISGPPHSLHQTSGPAVAPMAAVSQ 126

Query: 118 QQRLNSSRQH---------------LPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDA 162
           Q   N   +                 P        TS  +PP +     +Q   D   D 
Sbjct: 127 QGWHNEFAKQQQSSYQQHQPHVQGLQPSFTPRFPMTSAAFPPMQATTANQQPVAD-TFDE 185

Query: 163 FQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSST 222
             F+A F+    ++  +  E      E+  ++    +     + +K  +         + 
Sbjct: 186 SAFEAAFEQARADMMSQETETTPVHAEEVMEETDQVVAESTEEKIKIGSDTIPQTD-KND 244

Query: 223 LSSSVES---------------------KLSESKFIQLMRNISSGDVTLKKNADGNSASE 261
             + V                       K  ES F+ LMR I   +V ++ +        
Sbjct: 245 QQARVNDADELARTAGQLLDSVKHDQSQKFKESNFLALMRRIRDREVQVEGDEFREIPHP 304

Query: 262 LF 263
           L 
Sbjct: 305 LH 306
>ref|XP_001524003.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
 gb|EDK46635.1| hypothetical protein LELG_04816 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 489

 Score = 49.5 bits (117), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 30/202 (14%), Positives = 64/202 (31%), Gaps = 60/202 (29%)

Query: 89  WVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQ---------------------- 126
           WV  FS M I +        + +     S       +Q                      
Sbjct: 128 WVDDFSKMNIHN-------QQLMVQQRNS--WGQEFQQNHQHAQQVSHHHQQQQQQQQPQ 178

Query: 127 -------HLPFKNCMIRK--------TSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQV 171
                     +   M+           + +    +    Q+    ++     Q ++EF  
Sbjct: 179 LMQNRYIGSSYGGGMMNMQRPMTMGTNTASPQFHQQSEHQQVHKLEDQE--TQLESEFDA 236

Query: 172 LEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKL 231
           +E+E+Q+                D+    + A  +        SS +S +T  + ++SK 
Sbjct: 237 VEKELQEVESSMQETNL-----HDKDLFAQTARKV-------ESSMNSLNTTDAEMKSKF 284

Query: 232 SESKFIQLMRNISSGDVTLKKN 253
             S+F++LM  +++  V L+ +
Sbjct: 285 ENSEFLKLMNRVANKQVELQDD 306
>gb|AAX11696.1| peroxin 20 [Pichia pastoris]
          Length = 323

 Score = 49.5 bits (117), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 27/181 (14%), Positives = 65/181 (35%), Gaps = 10/181 (5%)

Query: 77  FITPNNTVDSSDWVPQFSSMKIDDSLEFSSEY--KRLYSNYESQQRLNSSRQHLPFKNCM 134
                 +  ++ W  +F+ M+ +  +E+       +   N  +++  +   QH    N  
Sbjct: 129 TNMQAMSSTANAWTQEFNHMQHNKEMEYEMRIRETQRLVNAPAREVFSMVNQHSQQHNQN 188

Query: 135 IRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQ 194
                     Q          R   T   + D      + E  +++ E    +++   D 
Sbjct: 189 FTHNQTQLGDQSHYMTNLAFERQFETVQNEIDGMDMFSDSEQVEQQKELQ--KEDDNNDI 246

Query: 195 DQSELQRIATDIVKCCTPPPSSASSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNA 254
           + ++   IA  +           +  +T+S + E K  +S F++LM  ++  +V +  + 
Sbjct: 247 ETTKFAEIAQQVFN------QMNNVDTTVSQNTEHKFKQSNFLRLMDKVAQREVEINGSG 300

Query: 255 D 255
           D
Sbjct: 301 D 301
>ref|XP_001821473.1| hypothetical protein [Aspergillus oryzae RIB40]
 dbj|BAE59471.1| unnamed protein product [Aspergillus oryzae]
          Length = 306

 Score = 47.9 bits (113), Expect = 0.001,   Method: Composition-based stats.
 Identities = 48/295 (16%), Positives = 94/295 (31%), Gaps = 52/295 (17%)

Query: 7   TSPIEKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDNV 66
           ++ ++   Q+   I          ++ +H++        +  ++  F    SN   I   
Sbjct: 9   SNALQNF-QKHASIDRTLQQDRLVSRQSHSQGFRSQNPNEGTLDPEFAAFESN---IAGA 64

Query: 67  RYQNQPSPLTFITP-----NNTVDSSDWVPQFSSMKIDDSLEFSSEY----KRLYSNYES 117
              +   P  F TP     +N V+  +W   F  ++I  SL  + +         SN   
Sbjct: 65  SLPDSQHPAHFATPPHFAVHNQVEKHNWAADFQRLQISGSLPLAHQQAGPASSPASNLAQ 124

Query: 118 QQRLNSS----------RQHLPFKNCMIRKTSCTYP----------PQKTLRQQRQGNRD 157
               N            +QH PF        + +YP          P + L   +  +  
Sbjct: 125 HGWRNDFLAQQRQPRPAQQHQPFAQGFQPFLTPSYPIHEAAGNTLSPAQDLTTTQLPS-T 183

Query: 158 NPTDAFQFDAEFQVLEREIQK---ERYEPITRRDEKWFDQDQSELQRI------------ 202
              D   F+A F+  + ++     +  + +T           S+++ I            
Sbjct: 184 EAFDESAFEAAFEQAKADLASQTADHAQELTNETTNPDATVTSQVEAIRIGSDTIPQTEK 243

Query: 203 --ATDIVKCCTPPPSSASS-SSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNA 254
             A            +A     ++S     K  ES F+ LMR I   +V ++ + 
Sbjct: 244 DDARAHYNDPDELARTAGQLLESVSHDQSQKFRESNFLALMRRIRDREVHIEGDE 298
>ref|XP_503644.1| YlPEX20 [Yarrowia lipolytica]
 emb|CAG79226.1| YlPEX20 [Yarrowia lipolytica CLIB122]
          Length = 417

 Score = 46.8 bits (110), Expect = 0.002,   Method: Composition-based stats.
 Identities = 47/282 (16%), Positives = 79/282 (28%), Gaps = 64/282 (22%)

Query: 28  SKRTKLAHTELTAHYATEDSH--VEKHFLHNGSNFDGIDNVRYQNQPSPLTFITPNNTVD 85
             R++    +L   +            F  +            Q     +    P     
Sbjct: 40  QFRSQTQGGQLNNEFQQFAQAGPAHNSFEQSQMGPHFGQQHFGQPHQPQMGQHAPMAHGQ 99

Query: 86  SSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQHLPFK-----NCMIRKTSC 140
            SDW   FS + +          +  +     +Q   S  Q  P         M   +S 
Sbjct: 100 QSDWAQSFSQLNLGPQTGPQHTQQSNWGQDFMRQSPQS-HQVQPQMANGVMGSMSGMSSF 158

Query: 141 TYP-----------PQKTLRQQRQGNRDNPTDAFQFDAEFQVLE----------REIQKE 179
                           +T  QQ        +    F+A F  +E          +E++K+
Sbjct: 159 GPMYSNSQLMNSTYGLQTEHQQTHKTETKSSQDAAFEAAFGAVEESITKTSDKGKEVEKD 218

Query: 180 RYEP--------------------ITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASS 219
             E                     I+R +      +  E   IA  I             
Sbjct: 219 PMEQTYRYDQADALNRQAEHISDNISREEVDIKTDENGEFASIARQI------------- 265

Query: 220 SSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASE 261
           +S+L  + +SK  +S F+ LMR I + +VTL  + D     E
Sbjct: 266 ASSLEEADKSKFEKSTFMNLMRRIGNHEVTL--DGDKLVNKE 305
>ref|XP_001830051.1| hypothetical protein CC1G_04484 [Coprinopsis cinerea okayama7#130]
 gb|EAU91716.1| hypothetical protein CC1G_04484 [Coprinopsis cinerea okayama7#130]
          Length = 726

 Score = 46.0 bits (108), Expect = 0.003,   Method: Composition-based stats.
 Identities = 32/205 (15%), Positives = 60/205 (29%), Gaps = 15/205 (7%)

Query: 58  SNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYES 117
           S       +     P+            ++DW   F    +  +    +      S   +
Sbjct: 89  SALRQALPMHAHGPPALQQQAELPGAAATADWGMDFMQQAMGGTPMTQTYQPAQTSQAHA 148

Query: 118 QQRLNSSR--QHLPFKN---CMIRKTSCTYPP----QKTLRQQRQGNRDNPTDAFQFDAE 168
                 S   Q  P  N      R  +    P    Q  ++ Q      +   +  +D E
Sbjct: 149 AHVHAPSINIQQGPTWNPGMSAFRMNTMQMQPGFMPQMQMQPQATAATTSKRIS--WDKE 206

Query: 169 FQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSVE 228
           F  LE  +            E   +Q Q E    A           ++      +     
Sbjct: 207 FSALELNVGPTS----EHVPEVVQEQPQQEQSAPAQKQFDGDELARTAGMLLDHVQHETN 262

Query: 229 SKLSESKFIQLMRNISSGDVTLKKN 253
            K  +S+F++LM+ +  G+V ++ N
Sbjct: 263 PKFKKSQFLELMKQLRDGEVIVEGN 287
>ref|XP_661922.1| hypothetical protein AN4318.2 [Aspergillus nidulans FGSC A4]
 gb|EAA60479.1| hypothetical protein AN4318.2 [Aspergillus nidulans FGSC A4]
          Length = 400

 Score = 44.8 bits (105), Expect = 0.008,   Method: Composition-based stats.
 Identities = 35/235 (14%), Positives = 64/235 (27%), Gaps = 43/235 (18%)

Query: 55  HNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSN 114
            N         + YQ  PS       +      +++ Q                 R    
Sbjct: 99  QNLHISGPSRAINYQPGPSAAPMSNASQQGWHGEFLRQQQQ------------QHRPALQ 146

Query: 115 YESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLER 174
                     +   P              P+         +R    D   FDA F+  + 
Sbjct: 147 ENQALGQRFQQSFTPNYTMNNSMAVYAPGPET-----VHVSRTEEFDESAFDAAFEQAKA 201

Query: 175 EIQ---KERYEPITRRDEKWFDQDQSE---LQRIATDIVKCCTPPPSSASSS-------- 220
           ++    +E        +    ++DQ E    Q+I  + ++  +   S             
Sbjct: 202 DMASQFEESLVETNVTESNANNEDQKEEAPAQQIGHETIRIGSDTISQVDKDDPQAAAND 261

Query: 221 ------------STLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASELF 263
                       +++S     K  ES F+ LMR I   +V L+ +    +A  L 
Sbjct: 262 ADELARTAGHLLNSVSHETNQKFRESNFLALMRRIRDHEVQLEGDEFRETAQSLH 316
>ref|XP_001482477.1| hypothetical protein PGUG_05497 [Pichia guilliermondii ATCC 6260]
 gb|EDK41399.1| hypothetical protein PGUG_05497 [Pichia guilliermondii ATCC 6260]
          Length = 439

 Score = 44.1 bits (103), Expect = 0.013,   Method: Composition-based stats.
 Identities = 29/148 (19%), Positives = 46/148 (31%), Gaps = 23/148 (15%)

Query: 108 YKRLYSNYESQQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDA 167
           Y     N+   Q LN      PF+   +R    T         Q     D       FD 
Sbjct: 150 YLNQSQNHPQNQYLN-----QPFR-LNMRTDLSTPLQGPITEHQEIHRDD--HRQLAFDE 201

Query: 168 EFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSSTLSSSV 227
           +F  LE+E+ +        ++         E  R A  + +           +   S   
Sbjct: 202 QFDRLEQELAQAETHNEMDKE---------EFARTAGRVKET------MLVDNPERSEET 246

Query: 228 ESKLSESKFIQLMRNISSGDVTLKKNAD 255
             K  +S F++LM +I    V +    D
Sbjct: 247 TQKFQQSNFLKLMTSIHDRSVEISDEGD 274
>ref|XP_752214.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gb|EAL90176.2| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gb|EDP49990.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
          Length = 405

 Score = 43.7 bits (102), Expect = 0.017,   Method: Composition-based stats.
 Identities = 37/310 (11%), Positives = 91/310 (29%), Gaps = 58/310 (18%)

Query: 7   TSPIEKIIQQGHRIQNDS-LIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDN 65
           ++ ++ +  Q H   + +       ++ + ++      + D  ++  F    S+  G   
Sbjct: 9   SNALQNL--QKHASVDRTLQQDRLISRRSPSQGFRSQNSSDGVLDPEFAAFESDLAGPSL 66

Query: 66  VRYQNQ---PSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKR-------LYSNY 115
              Q+     +    +   +  ++  W   F  + I      S   ++         S  
Sbjct: 67  PNLQHAGPFSAHPHRLPVASHAENVTWAADFQRLHISGP---SHPIQQHHGPSATPASAM 123

Query: 116 ESQQRLNSSRQHLPFKNCMIRKTSCTYPP-QKTLRQ-------------------QRQGN 155
             Q   +   +    +    ++        Q +                        Q +
Sbjct: 124 SQQGWHSEFLKQQQQRQAPTQQHQQYTGGFQPSFSPHHPIHGGVINALPSVHRPATDQQH 183

Query: 156 RDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFD---------QDQSELQRIATDI 206
                D   F+A F+    ++  +       + E   +         Q+ +E  +I +D 
Sbjct: 184 ATETFDESAFEAAFEQARADMALQETNATEEKAEPASETIQPDLSTAQEPTEQIKIGSDT 243

Query: 207 VKCCT--PPPSSASSSSTL-----------SSSVESKLSESKFIQLMRNISSGDVTLKKN 253
           +       P S  + +  L           S     K  +S F+ LMR I   +V ++ +
Sbjct: 244 IPQIDKNDPLSRVNDADELARTAGQLLDSVSHDQSEKFRKSSFLALMRRIRDREVRIEDD 303

Query: 254 ADGNSASELF 263
               ++  L 
Sbjct: 304 EFRETSQSLH 313
>ref|XP_001215991.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
 gb|EAU33357.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
          Length = 376

 Score = 43.3 bits (101), Expect = 0.019,   Method: Composition-based stats.
 Identities = 47/308 (15%), Positives = 96/308 (31%), Gaps = 53/308 (17%)

Query: 7   TSPIEKIIQQGHRIQNDSLIPSKRT-KLAHTELTAHYATEDSHVEKHFLHNGSNFDGID- 64
           ++ ++    Q H   + +L   +   + + ++        +  ++  F    SN  G   
Sbjct: 9   SNALQNF--QKHASVDRTLQQDRIVSRSSPSQGFRSPNPHEGVLDPEFAAFESNIPGAPL 66

Query: 65  NVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDS-------LEFSSE---------Y 108
                  P         +  ++  W   F  ++I             S+          +
Sbjct: 67  PELRHAAPFTAQGPHFMHPAENQSWAADFQKLQISGPSHPIHQQPGLSASSAPMMAQQGW 126

Query: 109 KRLYSNYES-QQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLR---QQRQGNRDNPT---- 160
           +  + N +  QQR   ++QH P+        + +YP   +        Q    N      
Sbjct: 127 QNEFMNQQQPQQRHTPNQQHQPYTQGFQSSFAPSYPMHASPMNAFPAAQEAIANQHPQTE 186

Query: 161 --DAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQ----------DQSELQRIATDIV- 207
             D   F+A F+    ++  +  E      E+   +          +  E  RI +D + 
Sbjct: 187 TFDESAFEAAFEQARADMASQAAEITQEHTEETTVEATHIETTTSFEPQETVRIGSDTIP 246

Query: 208 -KCCTPPPSSASSSSTL-----------SSSVESKLSESKFIQLMRNISSGDVTLKKNAD 255
                 P S  + +  L           S     K  ES F+ LMR I   +V ++ +  
Sbjct: 247 QSNKEDPQSRVNDADELARTAGQLLDSVSHDQSQKFRESNFLALMRRIRDREVHVEGDEF 306

Query: 256 GNSASELF 263
             ++  L 
Sbjct: 307 RETSQSLH 314
>gb|ABH11425.1| peroxin 20 [Penicillium chrysogenum]
          Length = 347

 Score = 43.3 bits (101), Expect = 0.023,   Method: Composition-based stats.
 Identities = 41/312 (13%), Positives = 93/312 (29%), Gaps = 61/312 (19%)

Query: 7   TSPIEKIIQQGHRIQNDS-LIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDN 65
           ++ ++    Q H   + +       ++ +  +      +++  ++  F     N  G   
Sbjct: 9   SNALQNF--QKHTSVDRTLQQDRLTSRQSPVQGFRSQNSQEGILDPEFAAFEGNLAGP-- 64

Query: 66  VRYQNQPSPLTFITPN----------NTVDSSDWVPQFSSMKIDDSLEFSSEYKRL---- 111
               N   P  F  P+          +   ++ W   F +++I       S+ +R     
Sbjct: 65  -ALPNLQHPAHFGAPSQHAAVHHPAAHGASNASWASDFQNLQISGPSHPVSQLQRPHIAP 123

Query: 112 --------YSNYESQQRLNSSRQHLPFKNCMIRKT---------SCTYPPQKTLRQQRQG 154
                         QQ   S  Q  P  +   + +         S     Q     Q + 
Sbjct: 124 AAAHGGWHAEFMAQQQGPMSLAQQTPQMHGAYQPSFTPSYQMYGSSMNQMQPLQGMQTEQ 183

Query: 155 NRDNPTDAFQFDAEFQVLEREIQ----------KERYEPITRRDEKWFDQDQSELQRIAT 204
                 D   F+A F     +++           + +   T + +      Q ++ R+ +
Sbjct: 184 QNTQQFDESAFEAAFDQARADMELQGVDIAQQDNQEHLQETNKVDISEPAIQEQI-RLGS 242

Query: 205 DIVKCCTPPPSSASSSS-------------TLSSSVESKLSESKFIQLMRNISSGDVTLK 251
           D++           S               ++ +    K  +S F+ LMR I   +V ++
Sbjct: 243 DLIPQSDKQNPETQSRDADELARTAGQLLDSVRNEQSEKFQQSNFLALMRRIRDREVEVE 302

Query: 252 KNADGNSASELF 263
            +    +A  L 
Sbjct: 303 GDDFRETAQSLH 314
>ref|XP_001906106.1| unnamed protein product [Podospora anserina]
 emb|CAP66772.1| unnamed protein product [Podospora anserina]
          Length = 374

 Score = 42.9 bits (100), Expect = 0.029,   Method: Composition-based stats.
 Identities = 26/245 (10%), Positives = 73/245 (29%), Gaps = 47/245 (19%)

Query: 62  GIDNVRYQNQPSPLTFITPNNTVDSSD-WVPQFSSMKIDD------------------SL 102
            +         + +   +P      +  W+ +F+SMKI +                    
Sbjct: 81  PMAARLAAEHHAAMRLESPATGGVGNQAWINEFASMKIANGSSVQAGQTPGMAMSHPGVP 140

Query: 103 EFSSEYKRLYSNYES---QQRLNSSRQHLPFKNCMIRKTSCTYPPQKTLRQQRQGNRDNP 159
                     +   S    Q   + + +LP  +  +  +             +  + +  
Sbjct: 141 VVQQPMMLPTTGVNSFAPYQTGMAFQSYLPALSAPMTHSQLPGQEATAAAAVQVESAEVK 200

Query: 160 TDAFQFDAEFQVLEREIQKER-------------YEPITRRDEKWFDQD----QSELQRI 202
                   +++    ++   +                   +     D +     SE++RI
Sbjct: 201 DAFADLFDQYEQQADQLTDYQRQEQEFEQEQAKWMAEHGPKALPPTDAEMAAINSEMERI 260

Query: 203 ATDIVKCCTPPPSSASSSST------LSSSVESKLSESKFIQLMRNISSGDVTLKKNA-- 254
           A +  +      ++  + +       +S +   K   S F++LMR I++ ++ + +    
Sbjct: 261 ADEQEEHSRRRENADLARAAEDILRAVSGNNSDKFKHSNFLELMRRIAASEIVVNEENFI 320

Query: 255 DGNSA 259
           D ++ 
Sbjct: 321 DADTG 325
>gb|AAC23564.2| peroxin [Yarrowia lipolytica]
          Length = 417

 Score = 42.9 bits (100), Expect = 0.029,   Method: Composition-based stats.
 Identities = 49/284 (17%), Positives = 80/284 (28%), Gaps = 68/284 (23%)

Query: 28  SKRTKLAHTELTAHYATEDSH--VEKHFLHNGSNFDGIDNVRYQNQPSPLTFITPNNTVD 85
             R++    +L   +            F  +            Q     +    P     
Sbjct: 40  QFRSQTQGGQLNNEFQQFAQAGPAHNSFEQSQMGPHFGQQHFGQPHQPQMGQHAPMAHGQ 99

Query: 86  SSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNS--SRQHLPFK-----NCMIRKT 138
            SDW   FS + +          +   SN+       S  S Q  P         M   +
Sbjct: 100 QSDWAQSFSQLNLGPQTGPQHTQQ---SNWGEDFMGESPQSHQGQPQMANGVMGSMSGMS 156

Query: 139 SCTYP-----------PQKTLRQQRQGNRDNPTDAFQFDAEFQVLE----------REIQ 177
           S                 +T  QQ        +    F+A F  +E          +E++
Sbjct: 157 SFGPMYSNSQLMNSTYGLQTEHQQTHKTETKSSQDAPFEAAFGAVEESITKTSDKGKEVE 216

Query: 178 KERYEP--------------------ITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSA 217
           K+  E                     I+R +      +  E   IA  I           
Sbjct: 217 KDPMEQTYRYDQADALNRQAEHISDNISREEVDIKTDENGEFASIARQI----------- 265

Query: 218 SSSSTLSSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASE 261
             +S+L  + +SK  +S F+ LMR I + +VTL  + D     E
Sbjct: 266 --ASSLEEADKSKFEKSTFMNLMRRIGNHEVTL--DGDKLVNKE 305
>ref|XP_001385675.2| hypothetical protein PICST_84845 [Pichia stipitis CBS 6054]
 gb|ABN67646.2| predicted protein [Pichia stipitis CBS 6054]
          Length = 402

 Score = 41.8 bits (97), Expect = 0.060,   Method: Composition-based stats.
 Identities = 21/93 (22%), Positives = 42/93 (45%), Gaps = 12/93 (12%)

Query: 163 FQFDAEFQVLEREI--QKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSS 220
             FD+ F  LE+E+  Q +    +  + +K  ++   +    A  I        +S    
Sbjct: 196 QLFDSHFDQLEKELNQQSQEKPEVEVQVDKVENE---QFAETARQI-------ENSLRQF 245

Query: 221 STLSSSVESKLSESKFIQLMRNISSGDVTLKKN 253
            T  ++ ++K+  S F++LM +IS+  V L  +
Sbjct: 246 DTADAATKAKIENSDFLKLMSSISNKQVVLDGD 278
>ref|XP_001271502.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
 gb|EAW10076.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
          Length = 398

 Score = 41.8 bits (97), Expect = 0.065,   Method: Composition-based stats.
 Identities = 45/309 (14%), Positives = 89/309 (28%), Gaps = 57/309 (18%)

Query: 7   TSPIEKIIQQGHRIQNDS-LIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDN 65
           ++ ++    Q H   + +       ++ + ++        +  ++  F    SN  G+  
Sbjct: 9   SNALQNF--QKHAAVDRTLQQDRLISRQSPSQGFRSQKPSEGTLDPEFAAFESNVAGLPL 66

Query: 66  VRYQNQ---PSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKR-------LYSNY 115
              Q+     +    +  +   +S +W   F  + I      S   ++         S+ 
Sbjct: 67  PNLQHAGPFHAHAPQMPISGPAESMNWATDFQQLHISGP---SHPVQQHRVPSAAPASSL 123

Query: 116 ESQQRLNSS----------RQHLPFKNCMIRKTSCTYPPQKTLR-----QQRQGNRDNP- 159
             Q   N            +QH PF     +  S  YP            Q       P 
Sbjct: 124 SQQGWHNEFLSQQQQRAHVQQHQPFSGGFQQSLSHGYPMHGATMNAFPTTQETAASQQPL 183

Query: 160 ---TDAFQFDAEFQVL--EREIQKERYEPITRRDEKWFDQ-DQSELQRIATDIVKCCTPP 213
               D   F+A F+    +  +Q+               Q D + +Q     I       
Sbjct: 184 AETFDESAFEAAFKQARADMAVQETTLTADQPEQTSAEVQLDPTAIQETPEQIKIGSDTI 243

Query: 214 PSSASSS-------------------STLSSSVESKLSESKFIQLMRNISSGDVTLKKNA 254
           P    S                     ++S     K  +S F+ LMR I   +V ++ + 
Sbjct: 244 PQIDKSDPQARVNDADELARTAGQLLDSVSHDQSEKFRQSNFLALMRRIRDREVQIEGDK 303

Query: 255 DGNSASELF 263
              ++  L 
Sbjct: 304 FRETSQPLH 312
>ref|XP_001267407.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
 gb|EAW25510.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
          Length = 363

 Score = 40.2 bits (93), Expect = 0.16,   Method: Composition-based stats.
 Identities = 39/315 (12%), Positives = 85/315 (26%), Gaps = 68/315 (21%)

Query: 7   TSPIEKIIQQGHRIQNDS-LIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGS------- 58
           ++ ++    Q H   + +       ++ + ++      + D  ++  F    S       
Sbjct: 9   SNALQNF--QKHASVDRTLQQDRLISRRSPSQGFRSQNSSDGVLDPEFAAFESDLAGPSL 66

Query: 59  -NFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKR------- 110
            N              P+     N T     W   F  + I      S   ++       
Sbjct: 67  PNLQHAGPFSANPHQLPVASHAENVT-----WAADFQRLHISGP---SHPIQQHHGPSAT 118

Query: 111 LYSNYESQQRLNSSRQHLPFKNCMIRKTSC--------TYPPQKT----LRQ-------- 150
             S    Q   +   +    +    ++                      +          
Sbjct: 119 PASAMSQQGWHSEFLKQQQQRPARTQQHQQYTGGFQPSFSHHHPIHGGVMNALPSVHGPA 178

Query: 151 QRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFD---------QDQSELQR 201
             Q +     D   F+A F+    ++  +       + E   +         Q+ +E  +
Sbjct: 179 TDQQHATETFDESAFEAAFEQARADMALQETNTTEEKTEPASETIQPDLSTAQEPTEQIK 238

Query: 202 IATDIVKCCT--PPPSSASSSSTL-----------SSSVESKLSESKFIQLMRNISSGDV 248
           I +D +       P S  + +  L           S     K  +S F+ LMR I   +V
Sbjct: 239 IGSDTIPQIDKNDPLSRVNDADELARTAGQLLDSVSHDQSEKFRQSNFLALMRRIRDREV 298

Query: 249 TLKKNADGNSASELF 263
            ++ +    +A  L 
Sbjct: 299 RIEDDEFRETAQSLH 313
>ref|XP_001015711.1| hypothetical protein TTHERM_00077700 [Tetrahymena thermophila SB210]
 gb|EAR95466.1| hypothetical protein TTHERM_00077700 [Tetrahymena thermophila SB210]
          Length = 1206

 Score = 39.4 bits (91), Expect = 0.34,   Method: Composition-based stats.
 Identities = 26/172 (15%), Positives = 57/172 (33%), Gaps = 6/172 (3%)

Query: 25   LIPSKRTKLAHTELTAHYATEDS-HVEKHFLHNGSNFDGIDNVRYQNQPSPLTFITPNNT 83
                 +    H           S  +E+  L N     G+  +    Q +        + 
Sbjct: 871  QNNEYQYVSNHQFGYEENQNFFSRQLEQLTLQNSQQQGGLLQLEIPQQQNEHFIRFTESN 930

Query: 84   VDSSDWVPQFSSMKI-DDSLEFSSEYKRLYSNYESQQRLNSSRQHLPFKNCMIRKTSCTY 142
            +  +        ++I  +  EF++  K+ Y  Y ++Q      Q+   +N    +T+ T 
Sbjct: 931  LQYNKESDDIEQIQIKQNQDEFNT--KQNYDQYNTKQSDYQFEQNAAEQNLYQTQTNFTI 988

Query: 143  PPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQ 194
             P     Q    N    +D    + E +  +   Q E+++     ++ +  Q
Sbjct: 989  KP--IPVQNTLSNLTESSDQNFKNREIKYEQIRDQDEQFQERQSSNKSYIQQ 1038
>gb|EEH37053.1| peroxin 20 [Paracoccidioides brasiliensis Pb01]
          Length = 355

 Score = 39.1 bits (90), Expect = 0.41,   Method: Composition-based stats.
 Identities = 37/293 (12%), Positives = 78/293 (26%), Gaps = 41/293 (13%)

Query: 7   TSPIEKII--QQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGID 64
           ++ ++         R      I S+++              D  ++  F    +   G  
Sbjct: 9   SNALQNFQKHSSVDRTLQQDRILSRQSPTQGFRSQNQN---DGALDPEFNAFEAGLHGA- 64

Query: 65  NVRYQNQPSPLTFITPNNTVDSSD----WVPQFSSMKIDDSLEFSSEYKRLYSNYESQQR 120
            +     P+       +  V +      W   F +++I    +     ++      +   
Sbjct: 65  PLSDIQHPASFLPPAAHGPVLNHIHNPGWASDFQNLQISGPSQ-HVSQQQFRPESSASGW 123

Query: 121 LNSSRQHL---PFKNCMIRKTSCTY----------------PPQKTLRQQRQGNRDNPTD 161
            N   +     P       +TS                         +   Q +     +
Sbjct: 124 QNDFMRQAHPKPQSQTPATQTSQMPLRYGPVMGVMGFGGMSSHHSVYQDPYQSHVQEHKE 183

Query: 162 AFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQSELQRIATDIVKCCTPPPSSASSSS 221
            F FD           +   E   ++ ++    +  E+Q+          PP    + S 
Sbjct: 184 QFVFDESAFEAAFAEARAEVELQEQKSQEEGTTEPLEIQQTIRIGSDTIRPPVEGTNESD 243

Query: 222 TL-----------SSSVESKLSESKFIQLMRNISSGDVTLKKNADGNSASELF 263
            L           S     K  ES F+ LMR +   +V +  +    +A  L 
Sbjct: 244 ELAKTAGQLLDSVSHDQSQKFKESNFLALMRQLRDREVVVDGDEFLPTAQPLH 296
>ref|XP_001033074.1| hypothetical protein TTHERM_00471900 [Tetrahymena thermophila
           SB210]
 gb|EAR85411.1| hypothetical protein TTHERM_00471900 [Tetrahymena thermophila
           SB210]
          Length = 965

 Score = 38.3 bits (88), Expect = 0.72,   Method: Composition-based stats.
 Identities = 46/237 (19%), Positives = 75/237 (31%), Gaps = 44/237 (18%)

Query: 11  EKIIQQGHRIQNDSLIPSKRTKLAHTELTAHYATEDSHVEKHFLHNGSNFDGIDNVRYQN 70
           E +  +  R  +   I S   +             +S   K FL N S       +  Q 
Sbjct: 729 ESMSPKT-RFDSKGQIRSTTIRPGSN-------ILNSSQHKTFLQNLSRQLRDTTIINQQ 780

Query: 71  QPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRLYSNYESQQRLNSSRQ--HL 128
             S           ++ D    FS  KID     S + +   SNY  Q+  N  RQ    
Sbjct: 781 NQSSAQ-------EENEDNQNSFSQNKID-----SKDSRAKLSNYHFQEDNNQKRQVDDC 828

Query: 129 PFKNCMIRKTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRD 188
              +      S  Y  ++ L Q   G  D  T+  +                Y+P    D
Sbjct: 829 NNNSINFSSVSQHYLDEENLGQSELGITDQETNIQKL---------------YQPQFNVD 873

Query: 189 EKWFDQDQSELQRIATDIVKCCTPPPSSASSS-------STLSSSVESKLSESKFIQ 238
               ++  S   + + +I        +S  S+        ++S S   ++S+S FI 
Sbjct: 874 SYIGNKFSSYFSKQSQEIAFIEGDKSNSQHSNLLLGLGDKSISKSNSDEMSQSYFIN 930
>ref|XP_001586173.1| hypothetical protein SS1G_12748 [Sclerotinia sclerotiorum 1980]
 gb|EDN97894.1| hypothetical protein SS1G_12748 [Sclerotinia sclerotiorum 1980]
          Length = 356

 Score = 35.2 bits (80), Expect = 5.3,   Method: Composition-based stats.
 Identities = 38/323 (11%), Positives = 84/323 (26%), Gaps = 80/323 (24%)

Query: 7   TSPIEKIIQQGHRIQNDSLIP---SKRTKLAHTELTAHYATEDSHVEKHFL--------- 54
           ++ ++    Q H   + +L       R   +              ++  F          
Sbjct: 9   SNALQNF--QKHSNVDRTLQQDRLVGRASPSQGF-RNSLGPNAGQLDAEFAAFQAGHQTF 65

Query: 55  HNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSL------------ 102
               +        +Q          P+       W   F +M I                
Sbjct: 66  SPSPSIQNFSPRHFQTPSPQSFQNAPSQFAAGQGWATDFQNMSISGPQAQFQQQELHQTN 125

Query: 103 ----------EFSSEY---------KRLYSNYE-------SQQRLNSSRQHLPFKNCMIR 136
                     EF+  Y         K+   +              N   + L  +N +I 
Sbjct: 126 QSQMGGGWHQEFAQSYGGSGSIAQGKQPVMSMNGLNGPSMMNHGYNPMPRVLMQQNQLIS 185

Query: 137 KTSCTYPPQKTLRQQRQGNRDNPTDAFQFDAEFQVLEREIQKERYEPITRRDEKWFDQDQ 196
               +    +           +     + +   +  E+E++  R +       +  + DQ
Sbjct: 186 TRQSSVENNEAFDDAAFARAFDEAANAEMERAREQ-EQELEITRAQQEAAIVSEQANADQ 244

Query: 197 SE---------------------LQRIATDIVKCCTPPPSSASSSS-----TLSSSVESK 230
                                  +Q  A   ++  T    + S ++     ++S +   K
Sbjct: 245 KAEFDLDRAENDLLNQAPVGADTIQDPAVATIEENTHATDALSRTAGELLSSVSDNQSDK 304

Query: 231 LSESKFIQLMRNISSGDVTLKKN 253
              S+F+QLMR     +VT++ +
Sbjct: 305 FQNSQFLQLMRQFRDKEVTVEGD 327
>ref|XP_001556458.1| hypothetical protein BC1G_05227 [Botryotinia fuckeliana B05.10]
 gb|EDN24065.1| hypothetical protein BC1G_05227 [Botryotinia fuckeliana B05.10]
          Length = 355

 Score = 34.8 bits (79), Expect = 7.0,   Method: Composition-based stats.
 Identities = 44/322 (13%), Positives = 85/322 (26%), Gaps = 81/322 (25%)

Query: 7   TSPIEKIIQQGHRIQNDSLIP---SKRTKLAHTELTAHYATEDSHVEKHF---------L 54
           ++ ++    Q H   + +L       R   +     A       H++  F         L
Sbjct: 9   SNALQNF--QKHSNVDRTLQQDRLVGRAPSSQGF-RASPGPNAGHLDAEFAAFQAGHQSL 65

Query: 55  HNGSNFDGIDNVRYQNQPSPLTFITPNNTVDSSDWVPQFSSMKIDDSLEFSSEYKRL-YS 113
               +        +Q  P                W   F +M I       +++++    
Sbjct: 66  IPSPSVQNFSPHHFQTPPPQHFQNVAPQNSTGQGWASDFQNMSISGP---QAQFQQQGVH 122

Query: 114 NYESQQRLNSSRQHLPF-----------------------KNCMIRKTSCTYPPQKTLRQ 150
                Q      Q                            N            Q     
Sbjct: 123 QINQPQMGAGWHQEFAQSHGAPGLIAQGSQPVMAMNRPSMMNVGYNPMPGVLMQQNQFAS 182

Query: 151 QRQGNRDNPTD----------AFQFDAEFQ-----VLEREIQKERYEPITRRD------E 189
             Q   +N                 +AE +      LE +  K + E I   D      E
Sbjct: 183 TSQSLVENNEAFDDAAFARAFEQAANAESERMQQQELESQAAKAQNEEIHAMDQQHVTKE 242

Query: 190 KWFDQDQSELQRIATDIV--KCCTPPPSSASSSS----------------TLSSSVESKL 231
              + D++E Q +   ++       P ++ S  +                ++S++   K 
Sbjct: 243 AELELDRAEKQLLGQSLLGADTIQDPATATSEQNAHAPDALSRTAGELLSSVSNNQSDKF 302

Query: 232 SESKFIQLMRNISSGDVTLKKN 253
             S+F+QLMR     +VT++ +
Sbjct: 303 QNSQFLQLMRQFRDKEVTVEGD 324
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  May 23, 2008  5:56 PM
  Number of letters in database: 883,778,997
  Number of sequences in database:  2,617,685
  
  Database: /host/Blast/data/nr_perl/nr.01
    Posted date:  May 23, 2008  5:54 PM
  Number of letters in database: 976,759,346
  Number of sequences in database:  2,761,413
  
  Database: /host/Blast/data/nr_perl/nr.02
    Posted date:  May 23, 2008  5:48 PM
  Number of letters in database: 374,670,760
  Number of sequences in database:  1,165,270
  
  Database: /host/Blast/data/nr_perl/nr.03
    Posted date:  Apr 28, 2009  5:40 PM
  Number of letters in database: 114,943,120
  Number of sequences in database:  354,819
  
Lambda     K      H
   0.307    0.155    0.385 

Lambda     K      H
   0.267   0.0475    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,684,767,792
Number of Sequences: 6899187
Number of extensions: 147205221
Number of successful extensions: 873019
Number of sequences better than 10.0: 300
Number of HSP's better than 10.0 without gapping: 2073
Number of HSP's successfully gapped in prelim test: 6114
Number of HSP's that attempted gapping in prelim test: 678427
Number of HSP's gapped (non-prelim): 75344
length of query: 288
length of database: 2,350,152,223
effective HSP length: 133
effective length of query: 155
effective length of database: 1,432,560,352
effective search space: 222046854560
effective search space used: 222046854560
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.3 bits)
S2: 78 (34.4 bits)