BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= YAL055W__[Saccharomyces_cerevisiae]
         (180 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           6,899,187 sequences; 2,350,152,223 total letters

Searching..................................................done


Results from round 1


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_009346.1|  Putative peroxisomal membrane protein requ...   361   1e-98
gb|EDN59708.1|  peroxisome-related protein [Saccharomyces ce...   356   5e-97
ref|XP_001643611.1|  hypothetical protein Kpol_1049p11 [Vand...    58   3e-07
ref|XP_448019.1|  unnamed protein product [Candida glabrata]...    55   3e-06
ref|NP_984505.1|  ADR410Cp [Ashbya gossypii ATCC 10895] >gi|...    54   4e-06
ref|XP_455093.1|  unnamed protein product [Kluyveromyces lac...    48   3e-04
ref|XP_460423.1|  hypothetical protein DEHA0F01595g [Debaryo...    43   0.009
ref|XP_001729924.1|  hypothetical protein MGL_2910 [Malassez...    38   0.31 
ref|YP_344555.1|  Carbamoyl-phosphate synthase, large subuni...    36   1.4  
dbj|BAE20140.1|  beta subunit of acetyl-coenzyme A carboxyla...    35   3.2  
ref|YP_742813.1|  carbamoyl-phosphate synthase large subunit...    33   7.6  
ref|ZP_01771256.1|  Hypothetical protein COLAER_00234 [Colli...    33   9.2  
ref|ZP_02406302.1|  fumarylacetoacetate hydrolase family pro...    33   9.2  
gb|EDO87026.1|  fumarylacetoacetate hydrolase family protein...    33   9.5  
ref|ZP_01765621.1|  fumarylacetoacetate hydrolase family pro...    33   9.7  
ref|YP_001078967.1|  fumarylacetoacetate hydrolase family pr...    33   9.8  
ref|ZP_01337165.1|  hypothetical protein Bpse4_03000368 [Bur...    33   10.0 
gb|EDS82204.1|  fumarylacetoacetate hydrolase family protein...    33   10.0 
>ref|NP_009346.1| Putative peroxisomal membrane protein required for import of
           peroxisomal proteins, functionally complements a Pichia
           pastoris pex22 mutation; Pex22p [Saccharomyces
           cerevisiae]
 sp|P39718|PEX22_YEAST Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAC04978.1| Yal055wp [Saccharomyces cerevisiae]
 gb|AAS56493.1| YAL055W [Saccharomyces cerevisiae]
          Length = 180

 Score =  361 bits (926), Expect = 1e-98,   Method: Composition-based stats.
 Identities = 180/180 (100%), Positives = 180/180 (100%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60
           MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK
Sbjct: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60

Query: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
           SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS
Sbjct: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120

Query: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
           MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN
Sbjct: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
>gb|EDN59708.1| peroxisome-related protein [Saccharomyces cerevisiae YJM789]
          Length = 180

 Score =  356 bits (913), Expect = 5e-97,   Method: Composition-based stats.
 Identities = 177/180 (98%), Positives = 179/180 (99%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60
           MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK
Sbjct: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60

Query: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
           SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAI DAFRKTKNEHKIIYCDS
Sbjct: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIEDAFRKTKNEHKIIYCDS 120

Query: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
           MDGLWSCV+RLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSD+EDVLIDTLCN
Sbjct: 121 MDGLWSCVKRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDIEDVLIDTLCN 180
>ref|XP_001643611.1| hypothetical protein Kpol_1049p11 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO15753.1| hypothetical protein Kpol_1049p11 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 177

 Score = 58.2 bits (139), Expect = 3e-07,   Method: Composition-based stats.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 13/117 (11%)

Query: 61  SKCIIMSKSI-QGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCD 119
           SK II++KS+ +   I W+       V+LV    T G   Q   D+   T N++KII+CD
Sbjct: 66  SKTIILTKSLCENKGIDWKRLFDTNCVVLV----TPGLKFQPSKDSVNST-NDYKIIHCD 120

Query: 120 SMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLID 176
           +M+GL+SC + LGK + +L   D   SG      +P DI RFV+ VV+ D +++  D
Sbjct: 121 TMEGLFSCAKHLGK-KKLLYVPDELESG------LPVDISRFVREVVELDSDNLRKD 170
>ref|XP_448019.1| unnamed protein product [Candida glabrata]
 sp|Q6FP25|PEX22_CANGA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG60970.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 174

 Score = 54.7 bits (130), Expect = 3e-06,   Method: Composition-based stats.
 Identities = 34/107 (31%), Positives = 54/107 (50%), Gaps = 16/107 (14%)

Query: 56  KKARKSKCIIMSKSIQGL-PIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHK 114
           K+  +S C+I+S  +  L  + W+E   +++V L+  S  D      I        + HK
Sbjct: 63  KRLHRSLCVIISNKLSNLVELDWDEILQEDIVFLILPSVNDFQNSNDI------KTDTHK 116

Query: 115 IIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRF 161
           II CD+  GLW+CVR L K + ++ + D           VP+DIGR+
Sbjct: 117 IINCDTELGLWACVRTLKKNELLVCADDIK---------VPDDIGRY 154
>ref|NP_984505.1| ADR410Cp [Ashbya gossypii ATCC 10895]
 sp|Q758W8|PEX22_ASHGO Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAS52329.1| ADR410Cp [Ashbya gossypii ATCC 10895]
          Length = 144

 Score = 54.3 bits (129), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 43/171 (25%), Positives = 78/171 (45%), Gaps = 31/171 (18%)

Query: 5   SRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKS--K 62
           S++  ++ R  G V  A  ++  S   Y+   +A            S++  KK R+S  +
Sbjct: 3   SKASRDQLRKYGAVSLASLLVAASIVAYRWWNAAP-----------SIEVEKKLRRSVSR 51

Query: 63  CIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMD 122
           C+++++ IQ   +  +    D V+LL P    +G +K A        +N +K+I C +  
Sbjct: 52  CVVVTQGIQNEDMIHDLLFEDTVMLLAPGCTAEGRLKSA------SRENAYKVISCTTWQ 105

Query: 123 GLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDV 173
            +W+CVR   K   ++ + +  S        VP DIG +V     SD+ D+
Sbjct: 106 SVWACVRHFRKHTLLVRTSEVPSG-------VPADIGGYV-----SDISDI 144
>ref|XP_455093.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CLU6|PEX22_KLULA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG97800.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 156

 Score = 47.8 bits (112), Expect = 3e-04,   Method: Composition-based stats.
 Identities = 41/150 (27%), Positives = 71/150 (47%), Gaps = 30/150 (20%)

Query: 15  LGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCIIMSKSIQGLP 74
           +G++ T    +    Y+Y+         G R P    VK      KSKC ++++ +    
Sbjct: 29  VGVLSTVAVTVGYLLYLYR---------GQRNPNIRDVKP-----KSKCYVLTQDLFDKI 74

Query: 75  IKW-EEYAADEVVLLVP-TSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLG 132
             W EE + D V+L++P  +H    +K       + +  EHKI+  ++   +WS VR L 
Sbjct: 75  ENWQEELSKDSVMLVLPEVAHLGNHLK------LQLSSIEHKIVIFNNSSAVWSAVRHLK 128

Query: 133 KFQCILNSRDFTSSGGSDAAVVPEDIGRFV 162
           K++ ++ SRD TS        +P D+ R+V
Sbjct: 129 KYELVI-SRDKTSD-------MPVDLRRYV 150
>ref|XP_460423.1| hypothetical protein DEHA0F01595g [Debaryomyces hansenii CBS767]
 sp|Q6BMZ7|PEX22_DEBHA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG88727.1| unnamed protein product [Debaryomyces hansenii CBS767]
          Length = 196

 Score = 43.1 bits (100), Expect = 0.009,   Method: Composition-based stats.
 Identities = 44/186 (23%), Positives = 82/186 (44%), Gaps = 23/186 (12%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVT-SYYIYQKVTSAKEDNGARPPEGDSVKENK--- 56
           MP     +  +   L I     A +VT SY +Y     A+E+   +  EGD   E K   
Sbjct: 1   MPQAQIRKTQRNPKLWIAALVAASIVTISYKVYSSYI-AEENIEDKKTEGDGGVEEKLRI 59

Query: 57  ---KARKSKCIIMSKSI--QGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTK- 110
                +KS  + +S S+    LP+      ++ V  ++P + +   +   IG+A    + 
Sbjct: 60  AKRYTKKSIALTLSHSVLSSQLPLNEILLNSENVTFILPPNLSMDDLVCNIGNADEVERY 119

Query: 111 -------NEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVK 163
                  N +K+++C ++DG ++ ++ L     ++ S D   +       VP D+ RFVK
Sbjct: 120 NLPKTLLNNYKLLHCSNIDGYFNILKNLKPDTLLVCSEDLGIANN-----VPRDLHRFVK 174

Query: 164 FVVDSD 169
            +++ D
Sbjct: 175 EIINID 180
>ref|XP_001729924.1| hypothetical protein MGL_2910 [Malassezia globosa CBS 7966]
 gb|EDP42710.1| hypothetical protein MGL_2910 [Malassezia globosa CBS 7966]
          Length = 906

 Score = 38.1 bits (87), Expect = 0.31,   Method: Composition-based stats.
 Identities = 22/53 (41%), Positives = 29/53 (54%), Gaps = 1/53 (1%)

Query: 2   PPPSRSRINKTR-TLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVK 53
           P P  S  ++ R  L +VGT+ AV  TS  +Y K+  A ED  A P  GD V+
Sbjct: 411 PHPWLSDADRARLVLAVVGTSEAVDATSLSMYSKLVHALEDGHASPLHGDEVE 463
>ref|YP_344555.1| Carbamoyl-phosphate synthase, large subunit, glutamine-dependent
            [Nitrosococcus oceani ATCC 19707]
 gb|ABA59025.1| carbamoyl-phosphate synthase large subunit [Nitrosococcus oceani ATCC
            19707]
          Length = 1073

 Score = 35.8 bits (81), Expect = 1.4,   Method: Composition-based stats.
 Identities = 20/71 (28%), Positives = 41/71 (57%), Gaps = 6/71 (8%)

Query: 63   CIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAF--RKTKNEHKIIYCDS 120
            CI ++K ++G P   +    DE+ L++ T  T+G  ++A+ D++  R++  +HK+ Y  +
Sbjct: 987  CIQINKVLEGQPHIVDMIKNDEIALIINT--TEG--RKAVSDSYTIRRSALQHKVTYTTT 1042

Query: 121  MDGLWSCVRRL 131
            + G W+    L
Sbjct: 1043 VAGAWATCEAL 1053
>dbj|BAE20140.1| beta subunit of acetyl-coenzyme A carboxylase [Gymnogrammitis
           dareiformis]
          Length = 197

 Score = 34.7 bits (78), Expect = 3.2,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 36/76 (47%), Gaps = 4/76 (5%)

Query: 104 DAFRKTKNEHKIIYCDSMDGLWSCVRR----LGKFQCILNSRDFTSSGGSDAAVVPEDIG 159
           D F +  ++H++++     GL   V+     L      L   DF   GGS  +VV E I 
Sbjct: 108 DFFDEDSHQHRLVFSQEKTGLTDAVQTGIGYLNGTLIALGVMDFHFMGGSMGSVVGEKIT 167

Query: 160 RFVKFVVDSDVEDVLI 175
           R +++  D+ +  VLI
Sbjct: 168 RLIEYAADNSLPLVLI 183
>ref|YP_742813.1| carbamoyl-phosphate synthase large subunit [Alkalilimnicola ehrlichei
            MLHE-1]
 gb|ABI57323.1| carbamoyl-phosphate synthase large subunit [Alkalilimnicola ehrlichei
            MLHE-1]
          Length = 1078

 Score = 33.5 bits (75), Expect = 7.6,   Method: Composition-based stats.
 Identities = 20/64 (31%), Positives = 39/64 (60%), Gaps = 6/64 (9%)

Query: 62   KCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAF--RKTKNEHKIIYCD 119
            +C +++K ++G P   +    DE+ L+V T  T+G  +QAI D++  R+   +HK+ Y  
Sbjct: 991  ECDVINKVLEGRPHIVDAIKNDEIDLIVNT--TEG--RQAIADSYSIRREALQHKVCYTT 1046

Query: 120  SMDG 123
            +++G
Sbjct: 1047 TING 1050
>ref|ZP_01771256.1| Hypothetical protein COLAER_00234 [Collinsella aerofaciens ATCC
           25986]
 gb|EBA40563.1| Hypothetical protein COLAER_00234 [Collinsella aerofaciens ATCC
           25986]
          Length = 439

 Score = 33.1 bits (74), Expect = 9.2,   Method: Composition-based stats.
 Identities = 16/62 (25%), Positives = 29/62 (46%), Gaps = 2/62 (3%)

Query: 111 NEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDV 170
           + H  + C ++DG+  C +  G  +C   +    +   +D A  P  I   + F+ D D+
Sbjct: 146 DHHTNVTCHTLDGIPVCGKSAG--ECAACAERVVAERAADNADTPMSIETIIDFIEDGDI 203

Query: 171 ED 172
           ED
Sbjct: 204 ED 205
>ref|ZP_02406302.1| fumarylacetoacetate hydrolase family protein [Burkholderia
           pseudomallei DM98]
          Length = 227

 Score = 33.1 bits (74), Expect = 9.2,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 119 DSMDGLWSCVRRLGKFQCI-LNSRDFTSSGGSDAAVVPEDIGRFVKFVV--DSDVE 171
           D    L +CV R+GKF CI LN  D  +  G D    P   G++   +   D DVE
Sbjct: 5   DGAPRLGACVGRVGKFVCIGLNYSDHAAESGMDVPSEPVVFGKWTSAICGPDDDVE 60
>gb|EDO87026.1| fumarylacetoacetate hydrolase family protein [Burkholderia
           pseudomallei 406e]
          Length = 296

 Score = 33.1 bits (74), Expect = 9.5,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 119 DSMDGLWSCVRRLGKFQCI-LNSRDFTSSGGSDAAVVPEDIGRFVKFVV--DSDVE 171
           D    L +CV R+GKF CI LN  D  +  G D    P   G++   +   D DVE
Sbjct: 74  DGAPRLGACVGRVGKFVCIGLNYSDHAAESGMDVPSEPVVFGKWTSAICGPDDDVE 129
>ref|ZP_01765621.1| fumarylacetoacetate hydrolase family protein [Burkholderia
           pseudomallei 305]
 gb|EBA49408.1| fumarylacetoacetate hydrolase family protein [Burkholderia
           pseudomallei 305]
          Length = 296

 Score = 33.1 bits (74), Expect = 9.7,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 119 DSMDGLWSCVRRLGKFQCI-LNSRDFTSSGGSDAAVVPEDIGRFVKFVV--DSDVE 171
           D    L +CV R+GKF CI LN  D  +  G D    P   G++   +   D DVE
Sbjct: 74  DGAPRLGACVGRVGKFVCIGLNYSDHAAESGMDVPSEPVVFGKWTSAICGPDDDVE 129
>ref|YP_001078967.1| fumarylacetoacetate hydrolase family protein [Burkholderia mallei
           NCTC 10247]
 gb|ABO02776.1| fumarylacetoacetate hydrolase family protein [Burkholderia mallei
           NCTC 10247]
 gb|EDK82808.1| fumarylacetoacetate hydrolase family protein [Burkholderia mallei
           2002721280]
 gb|EDU10793.1| fumarylacetoacetate hydrolase family protein [Burkholderia
           pseudomallei 1655]
          Length = 287

 Score = 33.1 bits (74), Expect = 9.8,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 119 DSMDGLWSCVRRLGKFQCI-LNSRDFTSSGGSDAAVVPEDIGRFVKFVV--DSDVE 171
           D    L +CV R+GKF CI LN  D  +  G D    P   G++   +   D DVE
Sbjct: 65  DGAPRLGACVGRVGKFVCIGLNYSDHAAESGMDVPSEPVVFGKWTSAICGPDDDVE 120
>ref|ZP_01337165.1| hypothetical protein Bpse4_03000368 [Burkholderia pseudomallei
           406e]
          Length = 280

 Score = 33.1 bits (74), Expect = 10.0,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 119 DSMDGLWSCVRRLGKFQCI-LNSRDFTSSGGSDAAVVPEDIGRFVKFVV--DSDVE 171
           D    L +CV R+GKF CI LN  D  +  G D    P   G++   +   D DVE
Sbjct: 58  DGAPRLGACVGRVGKFVCIGLNYSDHAAESGMDVPSEPVVFGKWTSAICGPDDDVE 113
>gb|EDS82204.1| fumarylacetoacetate hydrolase family protein [Burkholderia
           pseudomallei S13]
          Length = 287

 Score = 33.1 bits (74), Expect = 10.0,   Method: Composition-based stats.
 Identities = 21/56 (37%), Positives = 27/56 (48%), Gaps = 3/56 (5%)

Query: 119 DSMDGLWSCVRRLGKFQCI-LNSRDFTSSGGSDAAVVPEDIGRFVKFVV--DSDVE 171
           D    L +CV R+GKF CI LN  D  +  G D    P   G++   +   D DVE
Sbjct: 65  DGAPRLGACVGRVGKFVCIGLNYSDHAAESGMDVPSEPVVFGKWTSAICGPDDDVE 120
Searching..................................................done Results from round 2


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

ref|NP_009346.1|  Putative peroxisomal membrane protein requ...   315   7e-85
gb|EDN59708.1|  peroxisome-related protein [Saccharomyces ce...   314   2e-84
ref|NP_984505.1|  ADR410Cp [Ashbya gossypii ATCC 10895] >gi|...   190   3e-47
ref|XP_455093.1|  unnamed protein product [Kluyveromyces lac...   146   7e-34
ref|XP_001643611.1|  hypothetical protein Kpol_1049p11 [Vand...   137   3e-31
ref|XP_448019.1|  unnamed protein product [Candida glabrata]...   129   1e-28
Sequences not found previously or not previously below threshold:

ref|XP_460423.1|  hypothetical protein DEHA0F01595g [Debaryo...    58   2e-07
ref|XP_001385228.2|  hypothetical protein PICST_32287 [Pichi...    51   4e-05
ref|XP_001483257.1|  hypothetical protein PGUG_03986 [Pichia...    50   6e-05
ref|YP_344555.1|  Carbamoyl-phosphate synthase, large subuni...    39   0.12 
ref|XP_001438526.1|  hypothetical protein GSPATT00000841001 ...    38   0.31 
ref|ZP_01061160.1|  iron(III) ABC transporter ATP-binding pr...    37   0.47 
emb|CAN13139.1|  orthologue of H. sapiens chromsome 20 open ...    37   0.66 
ref|XP_001740377.1|  hypothetical protein EDI_255300 [Entamo...    36   1.2  
ref|YP_001193226.1|  transcriptional regulator, LysR family ...    36   1.4  
gb|AAY82758.1|  conserved hypothetical protein [uncultured b...    35   1.9  
ref|YP_001307156.1|  FAD dependent oxidoreductase [Clostridi...    35   3.1  
ref|XP_590415.3|  PREDICTED: similar to C20orf32 [Bos taurus]      35   3.3  
gb|EEH44305.1|  conserved hypothetical protein [Paracoccidio...    35   3.5  
gb|EEH39732.1|  conserved hypothetical protein [Paracoccidio...    34   3.8  
ref|XP_001527082.1|  hypothetical protein LELG_01911 [Lodder...    34   4.8  
ref|YP_001245311.1|  Glu/Leu/Phe/Val dehydrogenase, C termin...    34   5.5  
ref|YP_001739804.1|  Glu/Leu/Phe/Val dehydrogenase [Thermoto...    33   6.8  
>ref|NP_009346.1| Putative peroxisomal membrane protein required for import of
           peroxisomal proteins, functionally complements a Pichia
           pastoris pex22 mutation; Pex22p [Saccharomyces
           cerevisiae]
 sp|P39718|PEX22_YEAST Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAC04978.1| Yal055wp [Saccharomyces cerevisiae]
 gb|AAS56493.1| YAL055W [Saccharomyces cerevisiae]
          Length = 180

 Score =  315 bits (808), Expect = 7e-85,   Method: Composition-based stats.
 Identities = 180/180 (100%), Positives = 180/180 (100%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60
           MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK
Sbjct: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60

Query: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
           SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS
Sbjct: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120

Query: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
           MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN
Sbjct: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
>gb|EDN59708.1| peroxisome-related protein [Saccharomyces cerevisiae YJM789]
          Length = 180

 Score =  314 bits (805), Expect = 2e-84,   Method: Composition-based stats.
 Identities = 177/180 (98%), Positives = 179/180 (99%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60
           MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK
Sbjct: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60

Query: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
           SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAI DAFRKTKNEHKIIYCDS
Sbjct: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIEDAFRKTKNEHKIIYCDS 120

Query: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
           MDGLWSCV+RLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSD+EDVLIDTLCN
Sbjct: 121 MDGLWSCVKRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDIEDVLIDTLCN 180
>ref|NP_984505.1| ADR410Cp [Ashbya gossypii ATCC 10895]
 sp|Q758W8|PEX22_ASHGO Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAS52329.1| ADR410Cp [Ashbya gossypii ATCC 10895]
          Length = 144

 Score =  190 bits (483), Expect = 3e-47,   Method: Composition-based stats.
 Identities = 43/171 (25%), Positives = 78/171 (45%), Gaps = 31/171 (18%)

Query: 5   SRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKS--K 62
           S++  ++ R  G V  A  ++  S   Y+   +A            S++  KK R+S  +
Sbjct: 3   SKASRDQLRKYGAVSLASLLVAASIVAYRWWNAAP-----------SIEVEKKLRRSVSR 51

Query: 63  CIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMD 122
           C+++++ IQ   +  +    D V+LL P    +G +K A        +N +K+I C +  
Sbjct: 52  CVVVTQGIQNEDMIHDLLFEDTVMLLAPGCTAEGRLKSA------SRENAYKVISCTTWQ 105

Query: 123 GLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDV 173
            +W+CVR   K   ++ + +  S        VP DIG +V     SD+ D+
Sbjct: 106 SVWACVRHFRKHTLLVRTSEVPSG-------VPADIGGYV-----SDISDI 144
>ref|XP_455093.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CLU6|PEX22_KLULA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG97800.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 156

 Score =  146 bits (368), Expect = 7e-34,   Method: Composition-based stats.
 Identities = 41/153 (26%), Positives = 72/153 (47%), Gaps = 30/153 (19%)

Query: 15  LGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCIIMSKSIQGLP 74
           +G++ T    +    Y+Y+         G R P    VK      KSKC ++++ +    
Sbjct: 29  VGVLSTVAVTVGYLLYLYR---------GQRNPNIRDVKP-----KSKCYVLTQDLFDKI 74

Query: 75  IKW-EEYAADEVVLLVP-TSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLG 132
             W EE + D V+L++P  +H    +K       + +  EHKI+  ++   +WS VR L 
Sbjct: 75  ENWQEELSKDSVMLVLPEVAHLGNHLK------LQLSSIEHKIVIFNNSSAVWSAVRHLK 128

Query: 133 KFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFV 165
           K++ ++ SRD TS        +P D+ R+V  +
Sbjct: 129 KYELVI-SRDKTSD-------MPVDLRRYVGQI 153
>ref|XP_001643611.1| hypothetical protein Kpol_1049p11 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO15753.1| hypothetical protein Kpol_1049p11 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 177

 Score =  137 bits (346), Expect = 3e-31,   Method: Composition-based stats.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 13/117 (11%)

Query: 61  SKCIIMSKSI-QGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCD 119
           SK II++KS+ +   I W+       V+LV    T G   Q   D+   T N++KII+CD
Sbjct: 66  SKTIILTKSLCENKGIDWKRLFDTNCVVLV----TPGLKFQPSKDSVNST-NDYKIIHCD 120

Query: 120 SMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLID 176
           +M+GL+SC + LGK + +L   D   SG      +P DI RFV+ VV+ D +++  D
Sbjct: 121 TMEGLFSCAKHLGK-KKLLYVPDELESG------LPVDISRFVREVVELDSDNLRKD 170
>ref|XP_448019.1| unnamed protein product [Candida glabrata]
 sp|Q6FP25|PEX22_CANGA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG60970.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 174

 Score =  129 bits (324), Expect = 1e-28,   Method: Composition-based stats.
 Identities = 46/192 (23%), Positives = 75/192 (39%), Gaps = 40/192 (20%)

Query: 5   SRSRINK--TRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDS--------VKE 54
           S+ R N+     +  V    A        Y    +  ED+                 +KE
Sbjct: 2   SKRRKNQGYLAIIAAVSIGAA-------AYLWWRNHSEDDSVSKFNQQDDGTAQDNQIKE 54

Query: 55  N-------KKARKSKCIIMSKSIQGL-PIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAF 106
                   K+  +S C+I+S  +  L  + W+E   +++V L+  S  D      I    
Sbjct: 55  KPAIKLGKKRLHRSLCVIISNKLSNLVELDWDEILQEDIVFLILPSVNDFQNSNDIK--- 111

Query: 107 RKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVV 166
               + HKII CD+  GLW+CVR L K + ++ + D           VP+DIGR+   + 
Sbjct: 112 ---TDTHKIINCDTELGLWACVRTLKKNELLVCADDIK---------VPDDIGRYCDKIS 159

Query: 167 DSDVEDVLIDTL 178
                  L++ L
Sbjct: 160 QIKSSQQLMNYL 171
>ref|XP_460423.1| hypothetical protein DEHA0F01595g [Debaryomyces hansenii CBS767]
 sp|Q6BMZ7|PEX22_DEBHA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG88727.1| unnamed protein product [Debaryomyces hansenii CBS767]
          Length = 196

 Score = 58.1 bits (139), Expect = 2e-07,   Method: Composition-based stats.
 Identities = 40/185 (21%), Positives = 76/185 (41%), Gaps = 21/185 (11%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENK---- 56
           MP     +  +   L I     A +VT  Y       A+E+   +  EGD   E K    
Sbjct: 1   MPQAQIRKTQRNPKLWIAALVAASIVTISYKVYSSYIAEENIEDKKTEGDGGVEEKLRIA 60

Query: 57  --KARKSKCIIMSKSI--QGLPIKWEEYAADEVVLLVPT--------SHTDGSMKQAIGD 104
               +KS  + +S S+    LP+      ++ V  ++P          +   + +    +
Sbjct: 61  KRYTKKSIALTLSHSVLSSQLPLNEILLNSENVTFILPPNLSMDDLVCNIGNADEVERYN 120

Query: 105 AFRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKF 164
             +   N +K+++C ++DG ++ ++ L     ++ S D        A  VP D+ RFVK 
Sbjct: 121 LPKTLLNNYKLLHCSNIDGYFNILKNLKPDTLLVCSEDL-----GIANNVPRDLHRFVKE 175

Query: 165 VVDSD 169
           +++ D
Sbjct: 176 IINID 180
>ref|XP_001385228.2| hypothetical protein PICST_32287 [Pichia stipitis CBS 6054]
 gb|ABN67199.2| predicted protein [Pichia stipitis CBS 6054]
          Length = 201

 Score = 50.8 bits (120), Expect = 4e-05,   Method: Composition-based stats.
 Identities = 35/178 (19%), Positives = 73/178 (41%), Gaps = 11/178 (6%)

Query: 3   PPSRSRIN-KTRTLGIVGTAIAVLVTSYY-IY-QKVTSAKEDNGARPPEGDSVKENKKAR 59
           P   SR N K     +V T++  +    Y +Y Q+    +E +G      +     K   
Sbjct: 6   PAGGSRSNPKLWLAAVVATSVVTIGYQIYQLYSQEDERIRERSGKVSEAIEGSVAKKYGN 65

Query: 60  KSKCIIMSKSIQG--LPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIY 117
           KS  + +S SI    LP+       + ++ ++P + ++  +   +  +        K++ 
Sbjct: 66  KSIALTLSHSILNSKLPLNEILINNENMIFILPPNLSEDDVTYNLDPSKSGIMPNFKLLK 125

Query: 118 CDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSD--VEDV 173
           C ++ G    ++ L     ++ S D     G     + +D+  F+K +V+ D   ED+
Sbjct: 126 CSNIQGYVQVLKNLKPDMLLICSDDL----GLSFMNLSKDLNNFIKTIVNMDQNSEDI 179
>ref|XP_001483257.1| hypothetical protein PGUG_03986 [Pichia guilliermondii ATCC 6260]
 gb|EDK39888.1| hypothetical protein PGUG_03986 [Pichia guilliermondii ATCC 6260]
          Length = 178

 Score = 50.4 bits (119), Expect = 6e-05,   Method: Composition-based stats.
 Identities = 31/140 (22%), Positives = 60/140 (42%), Gaps = 16/140 (11%)

Query: 41  DNGARPPEGDS-VKENKKARKSKCIIMSKSIQG--LPIKWEEYAADEVVLLVPTSHTDGS 97
           +N  +P E  S +   K   KS  + +S S+    LP+      ++ V  ++P + +   
Sbjct: 28  NNYLKPSEKTSEISRKKYTNKSVALTLSHSVLNSQLPLNDILMNSENVTFILPPNLSLDD 87

Query: 98  MKQAIGDA--------FRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGS 149
           +   I D          R     +K++ C ++ G ++ ++ L     ++ S D       
Sbjct: 88  LACNISDENDGLATKLPRTLIQNYKLLNCKNIQGYFNLLKNLKPDTLLVCSDDL-----G 142

Query: 150 DAAVVPEDIGRFVKFVVDSD 169
             + VP DI RFVK +++ D
Sbjct: 143 ITSAVPRDINRFVKEIINID 162
>ref|YP_344555.1| Carbamoyl-phosphate synthase, large subunit, glutamine-dependent
            [Nitrosococcus oceani ATCC 19707]
 gb|ABA59025.1| carbamoyl-phosphate synthase large subunit [Nitrosococcus oceani ATCC
            19707]
          Length = 1073

 Score = 39.2 bits (90), Expect = 0.12,   Method: Composition-based stats.
 Identities = 17/70 (24%), Positives = 35/70 (50%), Gaps = 2/70 (2%)

Query: 63   CIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMD 122
            CI ++K ++G P   +    DE+ L++  + T+G    +     R++  +HK+ Y  ++ 
Sbjct: 987  CIQINKVLEGQPHIVDMIKNDEIALII--NTTEGRKAVSDSYTIRRSALQHKVTYTTTVA 1044

Query: 123  GLWSCVRRLG 132
            G W+    L 
Sbjct: 1045 GAWATCEALR 1054
>ref|XP_001438526.1| hypothetical protein GSPATT00000841001 [Paramecium tetraurelia
           strain d4-2]
 emb|CAK71129.1| unnamed protein product [Paramecium tetraurelia]
          Length = 590

 Score = 38.1 bits (87), Expect = 0.31,   Method: Composition-based stats.
 Identities = 20/90 (22%), Positives = 28/90 (31%), Gaps = 9/90 (10%)

Query: 42  NGARPPEGDSVKENKKARKSKCIIMSKSIQ-------GLPIKWEEYAADEVVLLVPTSHT 94
           N  + P   +   NKK  K K I + KS+                +     VLL+  S  
Sbjct: 413 NILQNPNIINGTPNKKYHKDKLIQLIKSVFGSQLTIINKLKIENLHIDLNSVLLIFISLF 472

Query: 95  DGSMKQAIGDAFRKTKNEHKI--IYCDSMD 122
               K           NE K+  I+   M 
Sbjct: 473 QFREKNEGHIKISSKINEDKVESIHFTLMQ 502
>ref|ZP_01061160.1| iron(III) ABC transporter ATP-binding protein [Flavobacterium sp.
           MED217]
 gb|EAQ49211.1| iron(III) ABC transporter ATP-binding protein [Leeuwenhoekiella
           blandensis MED217]
          Length = 262

 Score = 37.3 bits (85), Expect = 0.47,   Method: Composition-based stats.
 Identities = 18/70 (25%), Positives = 30/70 (42%), Gaps = 3/70 (4%)

Query: 56  KKARKSKCIIMSKSIQGLPIKWEEYAADE--VVLLVPTSHTDGSMKQAIGDAFRKTKNEH 113
           ++ R + C  +S       +     A D   ++L  PT+H D   K +I    +   +EH
Sbjct: 139 EQLRHTACYALSDGQFQRALIARALAQDTPIIMLDEPTTHLDLHHKASIFSLLKNIAHEH 198

Query: 114 -KIIYCDSMD 122
            K I C + D
Sbjct: 199 NKTILCTTHD 208
>emb|CAN13139.1| orthologue of H. sapiens chromsome 20 open reading frame 32
           (C20orf32) [Sus scrofa]
          Length = 784

 Score = 36.9 bits (84), Expect = 0.66,   Method: Composition-based stats.
 Identities = 22/75 (29%), Positives = 34/75 (45%), Gaps = 3/75 (4%)

Query: 94  TDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLG--KFQCILNSRDFTSSGGSDA 151
           TD +++  I D  +   N ++I+  ++ + L SC   L       + NS D        A
Sbjct: 505 TDSNLQTKIRDQLQTIANAYQIL-LETKERLESCGWSLEVLATDKVQNSPDDLERFVLVA 563

Query: 152 AVVPEDIGRFVKFVV 166
             VPEDI RF   V+
Sbjct: 564 RTVPEDIKRFASIVI 578
>ref|XP_001740377.1| hypothetical protein EDI_255300 [Entamoeba dispar SAW760]
 gb|EDR23238.1| hypothetical protein EDI_255300 [Entamoeba dispar SAW760]
          Length = 435

 Score = 36.1 bits (82), Expect = 1.2,   Method: Composition-based stats.
 Identities = 18/104 (17%), Positives = 34/104 (32%), Gaps = 3/104 (2%)

Query: 72  GLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRL 131
                  E   +    LV    +   +   I    +      +II C S+ G+    R L
Sbjct: 100 DKETDINEILKETRESLVSYYRSMNLIFSTIQRQSQSNLGTFRIIICTSVGGI---KRYL 156

Query: 132 GKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLI 175
               C+  S     +      V P D+  F   + ++ +  +L+
Sbjct: 157 QSVHCLFQSSSKPVTVEYGDDVHPGDVILFGDELTENGITSLLV 200
>ref|YP_001193226.1| transcriptional regulator, LysR family [Flavobacterium johnsoniae
           UW101]
 gb|ABQ03907.1| transcriptional regulator, LysR family [Flavobacterium johnsoniae
           UW101]
          Length = 299

 Score = 35.8 bits (81), Expect = 1.4,   Method: Composition-based stats.
 Identities = 19/60 (31%), Positives = 32/60 (53%), Gaps = 3/60 (5%)

Query: 65  IMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGL 124
           I+S +I+   IK+ E   DEVV+++  +H     K  + + F    NEH II+   M+ +
Sbjct: 145 IVSDTIKDHHIKYTELFQDEVVVVISENHPWAGKKYVVAEDF---VNEHLIIHSLPMETV 201
>gb|AAY82758.1| conserved hypothetical protein [uncultured bacterium eBACmed18B02]
          Length = 361

 Score = 35.4 bits (80), Expect = 1.9,   Method: Composition-based stats.
 Identities = 22/76 (28%), Positives = 32/76 (42%), Gaps = 5/76 (6%)

Query: 34  KVTSAKEDNGARPPEGDSVKENKKARKSKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSH 93
              S     G +   G   KE KK +KS  +++ K +   P   +  A      L+    
Sbjct: 244 WFNSEYHQPGKKNVLG---KEYKKGKKSLAVVI-KDLVNHPNCRDFVADRLCKFLITDEP 299

Query: 94  TDGSMKQAIGDAFRKT 109
           T   MKQ I DAF+K+
Sbjct: 300 TK-QMKQPIIDAFKKS 314
>ref|YP_001307156.1| FAD dependent oxidoreductase [Clostridium beijerinckii NCIMB 8052]
 gb|ABR32200.1| FAD dependent oxidoreductase [Clostridium beijerinckii NCIMB 8052]
          Length = 461

 Score = 34.6 bits (78), Expect = 3.1,   Method: Composition-based stats.
 Identities = 18/62 (29%), Positives = 27/62 (43%), Gaps = 2/62 (3%)

Query: 51  SVKENKKARKSKCIIMSKSIQG--LPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRK 108
           +VK   K  K +CII+ KSI      I  E+   +  ++ VP S  +  +   I D F  
Sbjct: 210 NVKSKMKNEKVRCIIVKKSIGDNLKKIIIEDIDEENSMIKVPISENENLIGIKISDNFSA 269

Query: 109 TK 110
             
Sbjct: 270 KS 271
>ref|XP_590415.3| PREDICTED: similar to C20orf32 [Bos taurus]
          Length = 785

 Score = 34.6 bits (78), Expect = 3.3,   Method: Composition-based stats.
 Identities = 24/79 (30%), Positives = 35/79 (44%), Gaps = 4/79 (5%)

Query: 91  TSHTDGSMKQA-IGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGKF--QCILNSRDFTSSG 147
           T +   S  QA I D  +   N ++I+  +S + L SC   L       I NS D     
Sbjct: 502 TCNLTNSNLQARIRDQLQTISNSYQIL-LESKESLESCNWSLEVLVTDKIQNSPDDLERF 560

Query: 148 GSDAAVVPEDIGRFVKFVV 166
              A +VPED+ RF   ++
Sbjct: 561 VQVARMVPEDVKRFASIII 579
>gb|EEH44305.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb18]
          Length = 577

 Score = 34.6 bits (78), Expect = 3.5,   Method: Composition-based stats.
 Identities = 16/56 (28%), Positives = 24/56 (42%), Gaps = 6/56 (10%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENK 56
           +PPP  S+ N++       TA   L       +   +    N ++PP G SVK  K
Sbjct: 409 VPPPETSQANRSELFRSASTAALGLS------KDTPATPAGNASKPPNGHSVKAEK 458
>gb|EEH39732.1| conserved hypothetical protein [Paracoccidioides brasiliensis Pb01]
          Length = 577

 Score = 34.2 bits (77), Expect = 3.8,   Method: Composition-based stats.
 Identities = 16/54 (29%), Positives = 23/54 (42%), Gaps = 6/54 (11%)

Query: 3   PPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENK 56
           PP  S+ +K+       TA   L       +   +    NG++PP G SVK  K
Sbjct: 411 PPESSQADKSELFRSASTAALGLS------KDTPATPAGNGSKPPNGHSVKAEK 458
>ref|XP_001527082.1| hypothetical protein LELG_01911 [Lodderomyces elongisporus NRRL
           YB-4239]
 gb|EDK43732.1| hypothetical protein LELG_01911 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 232

 Score = 33.8 bits (76), Expect = 4.8,   Method: Composition-based stats.
 Identities = 17/97 (17%), Positives = 41/97 (42%), Gaps = 10/97 (10%)

Query: 81  AADEVVLLVPTSHTD---GSMKQAIGDAFRKTKN-----EHKIIYCDSMDGLWSCVRRLG 132
           + ++++ ++P +  +   G +  A     +  +        KI+ C ++ G    ++ L 
Sbjct: 124 SNEKLIYIIPPNLNEEDVGLLNTAGNIVLQNKQELQLPPNFKILKCSNIQGYLQILKNLK 183

Query: 133 KFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSD 169
               +L S D      +   VV  D+  F+K +++ D
Sbjct: 184 PDCLLLCSDDL--GISNFNGVVLRDLSNFIKNIINID 218
>ref|YP_001245311.1| Glu/Leu/Phe/Val dehydrogenase, C terminal [Thermotoga petrophila
           RKU-1]
 gb|ABQ47735.1| glutamate dehydrogenase (NAD/NADP) [Thermotoga petrophila RKU-1]
          Length = 416

 Score = 33.8 bits (76), Expect = 5.5,   Method: Composition-based stats.
 Identities = 23/89 (25%), Positives = 41/89 (46%), Gaps = 2/89 (2%)

Query: 4   PSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKC 63
           P ++ +   +  G VG   A+L++     + V  +    G   PEG  V+E  + +K   
Sbjct: 208 PRKATV-AVQGFGNVGQFAALLISQELGSKVVAVSDSKGGIYNPEGFDVEELIRYKKEHG 266

Query: 64  IIMSKSIQGLPIKWEEYAADEVVLLVPTS 92
            I++   +G  I  EE    +V +LVP +
Sbjct: 267 TIVTYP-KGERITNEELLELDVDVLVPAA 294
>ref|YP_001739804.1| Glu/Leu/Phe/Val dehydrogenase [Thermotoga sp. RQ2]
 gb|ACB10121.1| Glu/Leu/Phe/Val dehydrogenase [Thermotoga sp. RQ2]
          Length = 416

 Score = 33.5 bits (75), Expect = 6.8,   Method: Composition-based stats.
 Identities = 22/89 (24%), Positives = 41/89 (46%), Gaps = 2/89 (2%)

Query: 4   PSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKC 63
           P ++ +   +  G VG   A+L++     + V  +    G   PEG  V+E  + +K   
Sbjct: 208 PKKATV-AVQGFGNVGQFAALLISQELGSKVVAVSDSRGGIYNPEGFDVEELIRYKKEHG 266

Query: 64  IIMSKSIQGLPIKWEEYAADEVVLLVPTS 92
            +++   +G  I  EE    +V +LVP +
Sbjct: 267 TVVTYP-KGERITNEELLELDVDVLVPAA 294
Searching..................................................done Results from round 3


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

ref|NP_009346.1|  Putative peroxisomal membrane protein requ...   283   4e-75
gb|EDN59708.1|  peroxisome-related protein [Saccharomyces ce...   282   8e-75
ref|XP_460423.1|  hypothetical protein DEHA0F01595g [Debaryo...   188   1e-46
ref|XP_001385228.2|  hypothetical protein PICST_32287 [Pichi...   188   2e-46
ref|NP_984505.1|  ADR410Cp [Ashbya gossypii ATCC 10895] >gi|...   170   3e-41
ref|XP_448019.1|  unnamed protein product [Candida glabrata]...   160   5e-38
ref|XP_001483257.1|  hypothetical protein PGUG_03986 [Pichia...   144   3e-33
ref|XP_455093.1|  unnamed protein product [Kluyveromyces lac...   129   7e-29
ref|XP_001643611.1|  hypothetical protein Kpol_1049p11 [Vand...   119   8e-26
Sequences not found previously or not previously below threshold:

ref|XP_710304.1|  hypothetical protein CaO19.8811 [Candida a...    78   2e-13
ref|XP_001527082.1|  hypothetical protein LELG_01911 [Lodder...    73   8e-12
sp|Q9UW82|PEX22_PICPA  Peroxisome assembly protein 22 (Perox...    44   0.004
gb|ABD37672.1|  peroxin 22 [Pichia angusta]                        39   0.12 
gb|EEJ80676.1|  type II secretory pathway, ATPase PulE/Tfp p...    37   0.63 
ref|YP_344555.1|  Carbamoyl-phosphate synthase, large subuni...    37   0.68 
ref|YP_841293.1|  multidrug resistance efflux pump [Ralstoni...    37   0.69 
emb|CAN13139.1|  orthologue of H. sapiens chromsome 20 open ...    37   0.77 
gb|EDU43621.1|  conserved hypothetical protein [Pyrenophora ...    36   0.81 
ref|XP_505069.1|  hypothetical protein [Yarrowia lipolytica]...    36   1.1  
ref|ZP_01075545.1|  DNA topoisomerase IV subunit B [Marinomo...    36   1.2  
ref|XP_001169779.1|  PREDICTED: HEF-like protein isoform 1 [...    35   2.1  
ref|XP_514736.2|  PREDICTED: HEF-like protein isoform 5 [Pan...    35   2.4  
ref|XP_001169795.1|  PREDICTED: HEF-like protein isoform 2 [...    35   2.7  
ref|XP_001908882.1|  unnamed protein product [Podospora anse...    35   2.7  
ref|XP_976974.1|  TPR Domain containing protein [Tetrahymena...    35   2.9  
ref|YP_236117.1|  PqiB family protein [Pseudomonas syringae ...    34   3.2  
ref|XP_001663892.1|  hypothetical protein AaeL_AAEL013710 [A...    34   3.4  
ref|XP_001662137.1|  hypothetical protein AaeL_AAEL012007 [A...    34   3.4  
ref|NP_824569.1|  hypothetical protein SAV_3392 [Streptomyce...    34   3.7  
ref|XP_955253.1|  glucose-6-phosphate-1-dehydrogenase, putat...    34   5.4  
ref|ZP_02064345.1|  hypothetical protein BACOVA_01311 [Bacte...    33   8.3  
ref|YP_001692481.1|  conserved hypothetical protein [Finegol...    33   9.7  
>ref|NP_009346.1| Putative peroxisomal membrane protein required for import of
           peroxisomal proteins, functionally complements a Pichia
           pastoris pex22 mutation; Pex22p [Saccharomyces
           cerevisiae]
 sp|P39718|PEX22_YEAST Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAC04978.1| Yal055wp [Saccharomyces cerevisiae]
 gb|AAS56493.1| YAL055W [Saccharomyces cerevisiae]
          Length = 180

 Score =  283 bits (725), Expect = 4e-75,   Method: Composition-based stats.
 Identities = 180/180 (100%), Positives = 180/180 (100%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60
           MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK
Sbjct: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60

Query: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
           SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS
Sbjct: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120

Query: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
           MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN
Sbjct: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
>gb|EDN59708.1| peroxisome-related protein [Saccharomyces cerevisiae YJM789]
          Length = 180

 Score =  282 bits (722), Expect = 8e-75,   Method: Composition-based stats.
 Identities = 177/180 (98%), Positives = 179/180 (99%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60
           MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK
Sbjct: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARK 60

Query: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
           SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAI DAFRKTKNEHKIIYCDS
Sbjct: 61  SKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIEDAFRKTKNEHKIIYCDS 120

Query: 121 MDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLIDTLCN 180
           MDGLWSCV+RLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSD+EDVLIDTLCN
Sbjct: 121 MDGLWSCVKRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDIEDVLIDTLCN 180
>ref|XP_460423.1| hypothetical protein DEHA0F01595g [Debaryomyces hansenii CBS767]
 sp|Q6BMZ7|PEX22_DEBHA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG88727.1| unnamed protein product [Debaryomyces hansenii CBS767]
          Length = 196

 Score =  188 bits (478), Expect = 1e-46,   Method: Composition-based stats.
 Identities = 43/185 (23%), Positives = 79/185 (42%), Gaps = 21/185 (11%)

Query: 1   MPPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENK---- 56
           MP     +  +   L I     A +VT  Y       A+E+   +  EGD   E K    
Sbjct: 1   MPQAQIRKTQRNPKLWIAALVAASIVTISYKVYSSYIAEENIEDKKTEGDGGVEEKLRIA 60

Query: 57  --KARKSKCIIMSKSI--QGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDA------- 105
               +KS  + +S S+    LP+      ++ V  ++P + +   +   IG+A       
Sbjct: 61  KRYTKKSIALTLSHSVLSSQLPLNEILLNSENVTFILPPNLSMDDLVCNIGNADEVERYN 120

Query: 106 -FRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKF 164
             +   N +K+++C ++DG ++ ++ L     ++ S D        A  VP D+ RFVK 
Sbjct: 121 LPKTLLNNYKLLHCSNIDGYFNILKNLKPDTLLVCSEDL-----GIANNVPRDLHRFVKE 175

Query: 165 VVDSD 169
           +++ D
Sbjct: 176 IINID 180
>ref|XP_001385228.2| hypothetical protein PICST_32287 [Pichia stipitis CBS 6054]
 gb|ABN67199.2| predicted protein [Pichia stipitis CBS 6054]
          Length = 201

 Score =  188 bits (477), Expect = 2e-46,   Method: Composition-based stats.
 Identities = 35/178 (19%), Positives = 73/178 (41%), Gaps = 11/178 (6%)

Query: 3   PPSRSRIN-KTRTLGIVGTAIAVLVTSYY-IY-QKVTSAKEDNGARPPEGDSVKENKKAR 59
           P   SR N K     +V T++  +    Y +Y Q+    +E +G      +     K   
Sbjct: 6   PAGGSRSNPKLWLAAVVATSVVTIGYQIYQLYSQEDERIRERSGKVSEAIEGSVAKKYGN 65

Query: 60  KSKCIIMSKSIQG--LPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIY 117
           KS  + +S SI    LP+       + ++ ++P + ++  +   +  +        K++ 
Sbjct: 66  KSIALTLSHSILNSKLPLNEILINNENMIFILPPNLSEDDVTYNLDPSKSGIMPNFKLLK 125

Query: 118 CDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSD--VEDV 173
           C ++ G    ++ L     ++ S D     G     + +D+  F+K +V+ D   ED+
Sbjct: 126 CSNIQGYVQVLKNLKPDMLLICSDDL----GLSFMNLSKDLNNFIKTIVNMDQNSEDI 179
>ref|NP_984505.1| ADR410Cp [Ashbya gossypii ATCC 10895]
 sp|Q758W8|PEX22_ASHGO Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAS52329.1| ADR410Cp [Ashbya gossypii ATCC 10895]
          Length = 144

 Score =  170 bits (432), Expect = 3e-41,   Method: Composition-based stats.
 Identities = 41/166 (24%), Positives = 75/166 (45%), Gaps = 26/166 (15%)

Query: 5   SRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKS--K 62
           S++  ++ R  G V  A  ++  S   Y+   +A            S++  KK R+S  +
Sbjct: 3   SKASRDQLRKYGAVSLASLLVAASIVAYRWWNAAP-----------SIEVEKKLRRSVSR 51

Query: 63  CIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMD 122
           C+++++ IQ   +  +    D V+LL P    +G +K A        +N +K+I C +  
Sbjct: 52  CVVVTQGIQNEDMIHDLLFEDTVMLLAPGCTAEGRLKSA------SRENAYKVISCTTWQ 105

Query: 123 GLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDS 168
            +W+CVR   K   ++ + +  S        VP DIG +V  + D 
Sbjct: 106 SVWACVRHFRKHTLLVRTSEVPSG-------VPADIGGYVSDISDI 144
>ref|XP_448019.1| unnamed protein product [Candida glabrata]
 sp|Q6FP25|PEX22_CANGA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG60970.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 174

 Score =  160 bits (404), Expect = 5e-38,   Method: Composition-based stats.
 Identities = 46/192 (23%), Positives = 75/192 (39%), Gaps = 40/192 (20%)

Query: 5   SRSRINK--TRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDS--------VKE 54
           S+ R N+     +  V    A        Y    +  ED+                 +KE
Sbjct: 2   SKRRKNQGYLAIIAAVSIGAA-------AYLWWRNHSEDDSVSKFNQQDDGTAQDNQIKE 54

Query: 55  N-------KKARKSKCIIMSKSIQGL-PIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAF 106
                   K+  +S C+I+S  +  L  + W+E   +++V L+  S  D      I    
Sbjct: 55  KPAIKLGKKRLHRSLCVIISNKLSNLVELDWDEILQEDIVFLILPSVNDFQNSNDIK--- 111

Query: 107 RKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVV 166
               + HKII CD+  GLW+CVR L K + ++ + D           VP+DIGR+   + 
Sbjct: 112 ---TDTHKIINCDTELGLWACVRTLKKNELLVCADDIK---------VPDDIGRYCDKIS 159

Query: 167 DSDVEDVLIDTL 178
                  L++ L
Sbjct: 160 QIKSSQQLMNYL 171
>ref|XP_001483257.1| hypothetical protein PGUG_03986 [Pichia guilliermondii ATCC 6260]
 gb|EDK39888.1| hypothetical protein PGUG_03986 [Pichia guilliermondii ATCC 6260]
          Length = 178

 Score =  144 bits (363), Expect = 3e-33,   Method: Composition-based stats.
 Identities = 31/140 (22%), Positives = 60/140 (42%), Gaps = 16/140 (11%)

Query: 41  DNGARPPEGDS-VKENKKARKSKCIIMSKSIQG--LPIKWEEYAADEVVLLVPTSHTDGS 97
           +N  +P E  S +   K   KS  + +S S+    LP+      ++ V  ++P + +   
Sbjct: 28  NNYLKPSEKTSEISRKKYTNKSVALTLSHSVLNSQLPLNDILMNSENVTFILPPNLSLDD 87

Query: 98  MKQAIGDA--------FRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGS 149
           +   I D          R     +K++ C ++ G ++ ++ L     ++ S D       
Sbjct: 88  LACNISDENDGLATKLPRTLIQNYKLLNCKNIQGYFNLLKNLKPDTLLVCSDDL-----G 142

Query: 150 DAAVVPEDIGRFVKFVVDSD 169
             + VP DI RFVK +++ D
Sbjct: 143 ITSAVPRDINRFVKEIINID 162
>ref|XP_455093.1| unnamed protein product [Kluyveromyces lactis]
 sp|Q6CLU6|PEX22_KLULA Peroxisome assembly protein 22 (Peroxin-22)
 emb|CAG97800.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 156

 Score =  129 bits (325), Expect = 7e-29,   Method: Composition-based stats.
 Identities = 43/166 (25%), Positives = 75/166 (45%), Gaps = 32/166 (19%)

Query: 7   SRINKTRTL--GIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCI 64
           +R  K   +  G++ T    +    Y+Y+         G R P    VK      KSKC 
Sbjct: 19  ARQKKLSIIAVGVLSTVAVTVGYLLYLYR---------GQRNPNIRDVKP-----KSKCY 64

Query: 65  IMSKSIQGLPIKW-EEYAADEVVLLVP-TSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMD 122
           ++++ +      W EE + D V+L++P  +H    +K       + +  EHKI+  ++  
Sbjct: 65  VLTQDLFDKIENWQEELSKDSVMLVLPEVAHLGNHLK------LQLSSIEHKIVIFNNSS 118

Query: 123 GLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDS 168
            +WS VR L K++ ++ SRD TS        +P D+ R+V  +   
Sbjct: 119 AVWSAVRHLKKYELVI-SRDKTSD-------MPVDLRRYVGQISHI 156
>ref|XP_001643611.1| hypothetical protein Kpol_1049p11 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO15753.1| hypothetical protein Kpol_1049p11 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 177

 Score =  119 bits (299), Expect = 8e-26,   Method: Composition-based stats.
 Identities = 44/117 (37%), Positives = 66/117 (56%), Gaps = 13/117 (11%)

Query: 61  SKCIIMSKSI-QGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCD 119
           SK II++KS+ +   I W+       V+LV    T G   Q   D+   T N++KII+CD
Sbjct: 66  SKTIILTKSLCENKGIDWKRLFDTNCVVLV----TPGLKFQPSKDSVNST-NDYKIIHCD 120

Query: 120 SMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVEDVLID 176
           +M+GL+SC + LGK + +L   D   SG      +P DI RFV+ VV+ D +++  D
Sbjct: 121 TMEGLFSCAKHLGK-KKLLYVPDELESG------LPVDISRFVREVVELDSDNLRKD 170
>ref|XP_710304.1| hypothetical protein CaO19.8811 [Candida albicans SC5314]
 ref|XP_710296.1| hypothetical protein CaO19.1225 [Candida albicans SC5314]
 gb|EAK91032.1| hypothetical protein CaO19.1225 [Candida albicans SC5314]
 gb|EAK91040.1| hypothetical protein CaO19.8811 [Candida albicans SC5314]
          Length = 207

 Score = 78.4 bits (192), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 23/154 (14%), Positives = 50/154 (32%), Gaps = 27/154 (17%)

Query: 7   SRINKTRTLGIVGTAIAVLVTSYYI-YQKVTSAKEDNGARPPEGDSVK-----------E 54
           SR  K     ++ T+   L    Y  Y   T    +   +  E D               
Sbjct: 7   SRNPKLYVSALIATSAIALTYKLYSTYLAPTPTPTNKDKKQRELDGSSTDIELKPKFRIS 66

Query: 55  NKKARKSKCIIMSKSI--QGLPIKWEEYAADEVVLLVPTSHTDGSM-------------K 99
            + + KS  I +S +     LP+      ++ ++ ++P +  +  +              
Sbjct: 67  KRYSNKSIAITLSSTFLSSNLPLNEILINSENMIFIIPPNLNEDDLPIIETKDTDTDNAN 126

Query: 100 QAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGK 133
                +F K ++  K++ C +  G    ++ L  
Sbjct: 127 TNTNTSFYKDQSNFKLLKCSNFQGYLQILKNLKP 160
>ref|XP_001527082.1| hypothetical protein LELG_01911 [Lodderomyces elongisporus NRRL
           YB-4239]
 gb|EDK43732.1| hypothetical protein LELG_01911 [Lodderomyces elongisporus NRRL
           YB-4239]
          Length = 232

 Score = 73.0 bits (178), Expect = 8e-12,   Method: Composition-based stats.
 Identities = 23/128 (17%), Positives = 49/128 (38%), Gaps = 12/128 (9%)

Query: 55  NKKARKSKCIIMSKSIQGLPIKWEEY--AADEVVLLVPTSHTD---GSMKQAIGDAFRK- 108
            K   KS  I +S S     +   +   + ++++ ++P +  +   G +  A     +  
Sbjct: 96  KKFLNKSIAITLSSSFLSSQLPLVDLITSNEKLIYIIPPNLNEEDVGLLNTAGNIVLQNK 155

Query: 109 ----TKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKF 164
                    KI+ C ++ G    ++ L     +L S D      +   VV  D+  F+K 
Sbjct: 156 QELQLPPNFKILKCSNIQGYLQILKNLKPDCLLLCSDDL--GISNFNGVVLRDLSNFIKN 213

Query: 165 VVDSDVED 172
           +++ D   
Sbjct: 214 IINIDQNQ 221
>sp|Q9UW82|PEX22_PICPA Peroxisome assembly protein 22 (Peroxin-22)
 gb|AAD45664.1|AF133103_1 peroxisomal integral membrane protein [Pichia pastoris]
          Length = 187

 Score = 44.5 bits (104), Expect = 0.004,   Method: Composition-based stats.
 Identities = 26/140 (18%), Positives = 50/140 (35%), Gaps = 12/140 (8%)

Query: 30  YIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCIIMSKSIQ--GLPIKWEEYAADEVVL 87
           Y   K     +   +    G + +     R+   II+S+SI    LPI+       +VV 
Sbjct: 22  YSVYKSFITSDKPSSLINLGINQERKSYTRQKVAIIVSESILAIQLPIQEILKNTKDVVF 81

Query: 88  LVPTSHT--DGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCILNSRDFTS 145
           ++  +    +   +  +           K+I   +  G +  ++ +     I N  DF  
Sbjct: 82  VLAPTIAKDEFLRENEVDSGLS-----FKVIETGTAIGCFHVLKHIKATYNIFNLHDFLP 136

Query: 146 SG---GSDAAVVPEDIGRFV 162
           S     SD   +  D+  +V
Sbjct: 137 SSTKTSSDNEQLTFDLEEYV 156
>gb|ABD37672.1| peroxin 22 [Pichia angusta]
          Length = 160

 Score = 39.1 bits (90), Expect = 0.12,   Method: Composition-based stats.
 Identities = 19/134 (14%), Positives = 48/134 (35%), Gaps = 12/134 (8%)

Query: 7   SRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCIIM 66
               K  T  +  T +A LV    I         + G       + + +K   +S  +++
Sbjct: 3   RNSRKLWTRAVAATVVAGLV----IGGAWALKTINPGLFEEPAKTSEASKSNGQSVSLVL 58

Query: 67  SKSIQGLPIKWEEYAAD--EVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGL 124
           ++    L      Y  +   + +++  S      K      F   +N H++I   + + +
Sbjct: 59  TQK--DLDFFSAAYLNEYPNLTVILHPSV----DKSEFLSRFNVQRNSHQVIQVRTEESI 112

Query: 125 WSCVRRLGKFQCIL 138
           +  +++L     ++
Sbjct: 113 FHVLKQLSSNINLI 126
>gb|EEJ80676.1| type II secretory pathway, ATPase PulE/Tfp pilus assembly pathway,
           ATPase PilB [Kangiella koreensis DSM 16069]
          Length = 659

 Score = 36.8 bits (84), Expect = 0.63,   Method: Composition-based stats.
 Identities = 18/68 (26%), Positives = 27/68 (39%), Gaps = 9/68 (13%)

Query: 116 IYCDSMDGLWSCVRRLGKFQCILNSR-------DFTSSGGSDAAVVPEDIGRFVKFVVDS 168
           +YC++ DGLW  +R       I+          D   S G       ED  R+   + + 
Sbjct: 100 VYCENEDGLWKVLRLHKNHIFIILISEKNVTNMDIIQSNGRSTFGREED--RYTGEIKNI 157

Query: 169 DVEDVLID 176
           D+  V ID
Sbjct: 158 DLSTVKID 165
>ref|YP_344555.1| Carbamoyl-phosphate synthase, large subunit, glutamine-dependent
            [Nitrosococcus oceani ATCC 19707]
 gb|ABA59025.1| carbamoyl-phosphate synthase large subunit [Nitrosococcus oceani ATCC
            19707]
          Length = 1073

 Score = 36.8 bits (84), Expect = 0.68,   Method: Composition-based stats.
 Identities = 17/73 (23%), Positives = 35/73 (47%), Gaps = 2/73 (2%)

Query: 63   CIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMD 122
            CI ++K ++G P   +    DE+ L++  + T+G    +     R++  +HK+ Y  ++ 
Sbjct: 987  CIQINKVLEGQPHIVDMIKNDEIALII--NTTEGRKAVSDSYTIRRSALQHKVTYTTTVA 1044

Query: 123  GLWSCVRRLGKFQ 135
            G W+    L    
Sbjct: 1045 GAWATCEALRAET 1057
>ref|YP_841293.1| multidrug resistance efflux pump [Ralstonia eutropha H16]
 emb|CAJ96563.1| multidrug resistance efflux pump [Ralstonia eutropha H16]
          Length = 368

 Score = 36.8 bits (84), Expect = 0.69,   Method: Composition-based stats.
 Identities = 12/37 (32%), Positives = 17/37 (45%)

Query: 3  PPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAK 39
          P S+S+ +  R L      +A L    Y Y   TSA+
Sbjct: 16 PASKSKASPRRLLATGALVVAGLAAIGYGYHWWTSAR 52
>emb|CAN13139.1| orthologue of H. sapiens chromsome 20 open reading frame 32
           (C20orf32) [Sus scrofa]
          Length = 784

 Score = 36.8 bits (84), Expect = 0.77,   Method: Composition-based stats.
 Identities = 22/79 (27%), Positives = 37/79 (46%), Gaps = 5/79 (6%)

Query: 91  TSHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSC---VRRLGKFQCILNSRDFTSSG 147
            + TD +++  I D  +   N ++I+  ++ + L SC   +  L   + + NS D     
Sbjct: 502 GNLTDSNLQTKIRDQLQTIANAYQIL-LETKERLESCGWSLEVLATDK-VQNSPDDLERF 559

Query: 148 GSDAAVVPEDIGRFVKFVV 166
              A  VPEDI RF   V+
Sbjct: 560 VLVARTVPEDIKRFASIVI 578
>gb|EDU43621.1| conserved hypothetical protein [Pyrenophora tritici-repentis
           Pt-1C-BFP]
          Length = 443

 Score = 36.4 bits (83), Expect = 0.81,   Method: Composition-based stats.
 Identities = 21/95 (22%), Positives = 35/95 (36%), Gaps = 8/95 (8%)

Query: 77  WEEYAADEVVLLVPTSH--TDGSMKQAI-GDAFRKTKNEHKIIYCDSMDGLWSCVRRLGK 133
            E  + D VV     S   +D S+  ++   A R  ++   ++   +  G    VR LG 
Sbjct: 289 DELRSMDPVVFTPAPSTRGSDNSLWNSLHSQAVRLVEDPTMVMPFSTQTGFVHMVRHLGP 348

Query: 134 FQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDS 168
              ++   D  S          EDI ++V   V  
Sbjct: 349 D--LVYVVDALSGVNGQNV---EDIRKWVGQTVVI 378
>ref|XP_505069.1| hypothetical protein [Yarrowia lipolytica]
 emb|CAG77876.1| unnamed protein product [Yarrowia lipolytica CLIB122]
          Length = 195

 Score = 36.0 bits (82), Expect = 1.1,   Method: Composition-based stats.
 Identities = 16/111 (14%), Positives = 40/111 (36%), Gaps = 17/111 (15%)

Query: 58  ARKSKCIIMSKSIQG---LPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEH- 113
                 +++S+ +     +  K        +V++VP         +A+  +     ++H 
Sbjct: 78  TNSKITVVVSQELVQSQLVDFKHLMSVHPNLVVIVPPMVA-NKFHRALKSSVG---HDHG 133

Query: 114 -KIIYCDSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGRFVK 163
            K+I CD+  G+   ++ +     ++               +  +I RFV 
Sbjct: 134 VKVIRCDTDVGVIHVIKHIRPDLALI--------ADGVGDNIQGEIKRFVG 176
>ref|ZP_01075545.1| DNA topoisomerase IV subunit B [Marinomonas sp. MED121]
 gb|EAQ66392.1| DNA topoisomerase IV subunit B [Marinomonas sp. MED121]
          Length = 632

 Score = 36.0 bits (82), Expect = 1.2,   Method: Composition-based stats.
 Identities = 27/123 (21%), Positives = 44/123 (35%), Gaps = 10/123 (8%)

Query: 2   PPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKS 61
           P  S     +  +  IV    A +  ++ +   +    ED G +  +       K+ R S
Sbjct: 330 PQFSGQTKERLSSRQIVAFISATVKDAFSL--WLNKHVED-GQKIADIVISSAQKRMRAS 386

Query: 62  KCIIMSKSIQGLPIKWEEYA---ADEV---VLLVPTSHTDGSMKQAIGDAFRKTKN-EHK 114
           K ++  K  QG  +  +       D V   + LV      GS KQA    F+       K
Sbjct: 387 KKVVRKKITQGPALPGKLADCAGQDTVRSELFLVEGDSAGGSAKQARDKEFQAIMPLRGK 446

Query: 115 IIY 117
           I+ 
Sbjct: 447 ILN 449
>ref|XP_001169779.1| PREDICTED: HEF-like protein isoform 1 [Pan troglodytes]
 ref|XP_001169818.1| PREDICTED: HEF-like protein isoform 3 [Pan troglodytes]
          Length = 786

 Score = 35.2 bits (80), Expect = 2.1,   Method: Composition-based stats.
 Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 9/80 (11%)

Query: 92  SHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCIL-----NSRDFTSS 146
           + TD +++  I D  +   N ++I+  ++ + L +C       + ++     NS D    
Sbjct: 506 NLTDSNLQNRIRDQMQTISNSYRIL-LETKESLDNCN---WPLEVLVTDSVQNSPDDLER 561

Query: 147 GGSDAAVVPEDIGRFVKFVV 166
               A ++PEDI RF   V+
Sbjct: 562 FVMVARMLPEDIKRFASIVI 581
>ref|XP_514736.2| PREDICTED: HEF-like protein isoform 5 [Pan troglodytes]
          Length = 778

 Score = 34.9 bits (79), Expect = 2.4,   Method: Composition-based stats.
 Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 9/80 (11%)

Query: 92  SHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCIL-----NSRDFTSS 146
           + TD +++  I D  +   N ++I+  ++ + L +C       + ++     NS D    
Sbjct: 498 NLTDSNLQNRIRDQMQTISNSYRIL-LETKESLDNCN---WPLEVLVTDSVQNSPDDLER 553

Query: 147 GGSDAAVVPEDIGRFVKFVV 166
               A ++PEDI RF   V+
Sbjct: 554 FVMVARMLPEDIKRFASIVI 573
>ref|XP_001169795.1| PREDICTED: HEF-like protein isoform 2 [Pan troglodytes]
          Length = 667

 Score = 34.9 bits (79), Expect = 2.7,   Method: Composition-based stats.
 Identities = 19/80 (23%), Positives = 37/80 (46%), Gaps = 9/80 (11%)

Query: 92  SHTDGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGKFQCIL-----NSRDFTSS 146
           + TD +++  I D  +   N ++I+  ++ + L +C       + ++     NS D    
Sbjct: 506 NLTDSNLQNRIRDQMQTISNSYRIL-LETKESLDNCN---WPLEVLVTDSVQNSPDDLER 561

Query: 147 GGSDAAVVPEDIGRFVKFVV 166
               A ++PEDI RF   V+
Sbjct: 562 FVMVARMLPEDIKRFASIVI 581
>ref|XP_001908882.1| unnamed protein product [Podospora anserina]
 emb|CAP69555.1| unnamed protein product [Podospora anserina]
          Length = 1700

 Score = 34.9 bits (79), Expect = 2.7,   Method: Composition-based stats.
 Identities = 18/79 (22%), Positives = 35/79 (44%), Gaps = 2/79 (2%)

Query: 95   DGSMKQAIGDAFRKTKNEHKIIYCDSMDGLWSCVRRLGKF-QCILNSRDFTSSGGSDAAV 153
            D              K+E +I + D++  +WS +R L K  + I+ S++   +    A V
Sbjct: 1451 DRQKAVDRFWELANLKDE-EINHSDNLQLVWSLLRPLKKNMERIMCSKELFPAAKQRAKV 1509

Query: 154  VPEDIGRFVKFVVDSDVED 172
            +  +I  F  F+ D   ++
Sbjct: 1510 LGAEIREFGNFLKDLRAKN 1528
>ref|XP_976974.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
 gb|EAR86545.1| TPR Domain containing protein [Tetrahymena thermophila SB210]
          Length = 917

 Score = 34.9 bits (79), Expect = 2.9,   Method: Composition-based stats.
 Identities = 20/110 (18%), Positives = 41/110 (37%), Gaps = 14/110 (12%)

Query: 29  YYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCIIMSKSIQGLPIKWEEYAADEVVLL 88
           YY Y  +  A ED          V +     ++KC++  +    +   + E   +  V +
Sbjct: 795 YYAYYSLAIAIEDQLKSTHSQAEVSK----LRAKCLVNYQKAIQVKPDYIEALNNIAVFI 850

Query: 89  VPTSHTDGSMKQAIGD---------AFRKTKNEHKIIYCDSMDGLWSCVR 129
              S  +  +K+   +                 HKII  D+++ L++ V+
Sbjct: 851 KTVSLQENKLKKNSKEYEIAEYFLKQAEGLAPNHKIII-DNLEKLYNEVQ 899
>ref|YP_236117.1| PqiB family protein [Pseudomonas syringae pv. syringae B728a]
 gb|AAY38079.1| PqiB family protein [Pseudomonas syringae pv. syringae B728a]
          Length = 559

 Score = 34.5 bits (78), Expect = 3.2,   Method: Composition-based stats.
 Identities = 11/93 (11%), Positives = 30/93 (32%), Gaps = 19/93 (20%)

Query: 2  PPPS------RSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKEN 55
          PP S      + R  +   + +V    A++  S  ++  +              + ++ N
Sbjct: 8  PPASVPRPEVKRRRLRVSLIWLVPIVAAIIGISMALHDWMNIGP-RITVSFLTAEGLEAN 66

Query: 56 KKARKSKCIIM------------SKSIQGLPIK 76
          K   K K +++            +  +  + + 
Sbjct: 67 KTQVKYKNVVIGMVTDISLSEDRTHVLANIELN 99
>ref|XP_001663892.1| hypothetical protein AaeL_AAEL013710 [Aedes aegypti]
 gb|EAT34026.1| hypothetical protein AaeL_AAEL013710 [Aedes aegypti]
          Length = 878

 Score = 34.5 bits (78), Expect = 3.4,   Method: Composition-based stats.
 Identities = 20/123 (16%), Positives = 47/123 (38%), Gaps = 12/123 (9%)

Query: 59  RKSKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYC 118
            K + + +S+ +       + Y ++  + +V   +        I        +    +  
Sbjct: 343 NKVRRLRLSQVL-------DMYYSNRGIPVVSIENLGKRKLPNIKQTPNGKLSRTSELNE 395

Query: 119 DSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGR--FVKFVVD-SDVEDVLI 175
           DS+  +++  R+   +   + S D       DA  VP+DI R  + K ++   D + +  
Sbjct: 396 DSLHRMFNADRKTKLWN--VFSEDEAQDPHGDAQNVPDDINRACYRKQILQTIDKDRLKQ 453

Query: 176 DTL 178
           +T 
Sbjct: 454 ETF 456
>ref|XP_001662137.1| hypothetical protein AaeL_AAEL012007 [Aedes aegypti]
 gb|EAT35855.1| hypothetical protein AaeL_AAEL012007 [Aedes aegypti]
          Length = 874

 Score = 34.5 bits (78), Expect = 3.4,   Method: Composition-based stats.
 Identities = 20/123 (16%), Positives = 47/123 (38%), Gaps = 12/123 (9%)

Query: 59  RKSKCIIMSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYC 118
            K + + +S+ +       + Y ++  + +V   +        I        +    +  
Sbjct: 343 NKVRRLRLSQVL-------DMYYSNRGIPVVSIENLGKRKLPNIKQTPNGKLSRTSELNE 395

Query: 119 DSMDGLWSCVRRLGKFQCILNSRDFTSSGGSDAAVVPEDIGR--FVKFVVD-SDVEDVLI 175
           DS+  +++  R+   +   + S D       DA  VP+DI R  + K ++   D + +  
Sbjct: 396 DSLHRMFNADRKTKLWN--VFSEDEAQDPHGDAQNVPDDINRACYRKQILQTIDKDRLKQ 453

Query: 176 DTL 178
           +T 
Sbjct: 454 ETF 456
>ref|NP_824569.1| hypothetical protein SAV_3392 [Streptomyces avermitilis MA-4680]
 dbj|BAC71104.1| hypothetical protein [Streptomyces avermitilis MA-4680]
          Length = 353

 Score = 34.5 bits (78), Expect = 3.7,   Method: Composition-based stats.
 Identities = 13/39 (33%), Positives = 20/39 (51%), Gaps = 2/39 (5%)

Query: 2   PPPSRSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKE 40
           P PSR+RIN    + +V      L ++   Y+K  +A E
Sbjct: 271 PRPSRARINAWWAVWVVSLLADRLAST--AYRKAETAPE 307
>ref|XP_955253.1| glucose-6-phosphate-1-dehydrogenase, putative [Theileria annulata
           strain Ankara]
 emb|CAI75777.1| glucose-6-phosphate-1-dehydrogenase, putative [Theileria annulata]
          Length = 878

 Score = 33.7 bits (76), Expect = 5.4,   Method: Composition-based stats.
 Identities = 13/60 (21%), Positives = 23/60 (38%), Gaps = 3/60 (5%)

Query: 115 IIYCDSMDGLWSCVRRL---GKFQCILNSRDFTSSGGSDAAVVPEDIGRFVKFVVDSDVE 171
           ++ C + +G       L      + +  S D T   G      P  I RF+  + D D++
Sbjct: 100 LLDCHTEEGFVLTASHLILHKIRRKLSKSPDVTVKIGLSGGSTPRGIYRFLGNMKDLDID 159
>ref|ZP_02064345.1| hypothetical protein BACOVA_01311 [Bacteroides ovatus ATCC 8483]
 gb|EDO13071.1| hypothetical protein BACOVA_01311 [Bacteroides ovatus ATCC 8483]
          Length = 882

 Score = 33.3 bits (75), Expect = 8.3,   Method: Composition-based stats.
 Identities = 11/50 (22%), Positives = 20/50 (40%), Gaps = 1/50 (2%)

Query: 12  TRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKS 61
              +  V   + +++T  YIY +    + +      +   V E K  RKS
Sbjct: 711 NAVMWSVTVLVILIITLAYIYYRRRMKRNNLIQSKEKQQPVIEEKY-RKS 759
>ref|YP_001692481.1| conserved hypothetical protein [Finegoldia magna ATCC 29328]
 dbj|BAG08591.1| conserved hypothetical protein [Finegoldia magna ATCC 29328]
          Length = 235

 Score = 32.9 bits (74), Expect = 9.7,   Method: Composition-based stats.
 Identities = 23/115 (20%), Positives = 43/115 (37%), Gaps = 4/115 (3%)

Query: 6   RSRINKTRTLGIVGTAIAVLVTSYYIYQKVTSAKEDNGARPPEGDSVKENKKARKSKCII 65
           +    K      +     V  ++ Y  + V    + N    PE DS+K  K+ +  +  I
Sbjct: 8   KKSTKKLTIALCMVIFAVVFSSTIYAVKLVNEQSKVNELELPELDSMKITKQIKNGERNI 67

Query: 66  MSKSIQGLPIKWEEYAADEVVLLVPTSHTDGSMKQAIGDAFRKTKNEHKIIYCDS 120
            S+S   L  K      +++ + +  S      +  I D       ++ II CD+
Sbjct: 68  KSQSTSRLNYKQI---QEQLGVSLLNSELSKDDRYMITD-VETDNKDYAIIKCDN 118
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  May 23, 2008  5:56 PM
  Number of letters in database: 883,778,997
  Number of sequences in database:  2,617,685
  
  Database: /host/Blast/data/nr_perl/nr.01
    Posted date:  May 23, 2008  5:54 PM
  Number of letters in database: 976,759,346
  Number of sequences in database:  2,761,413
  
  Database: /host/Blast/data/nr_perl/nr.02
    Posted date:  May 23, 2008  5:48 PM
  Number of letters in database: 374,670,760
  Number of sequences in database:  1,165,270
  
  Database: /host/Blast/data/nr_perl/nr.03
    Posted date:  Apr 28, 2009  5:40 PM
  Number of letters in database: 114,943,120
  Number of sequences in database:  354,819
  
Lambda     K      H
   0.312    0.152    0.404 

Lambda     K      H
   0.267   0.0464    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 2,451,484,767
Number of Sequences: 6899187
Number of extensions: 90731898
Number of successful extensions: 245888
Number of sequences better than 10.0: 47
Number of HSP's better than 10.0 without gapping: 27
Number of HSP's successfully gapped in prelim test: 71
Number of HSP's that attempted gapping in prelim test: 245793
Number of HSP's gapped (non-prelim): 104
length of query: 180
length of database: 2,350,152,223
effective HSP length: 125
effective length of query: 55
effective length of database: 1,487,753,848
effective search space: 81826461640
effective search space used: 81826461640
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 74 (32.9 bits)