BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= YOL044W__[Saccharomyces_cerevisiae]
         (383 letters)

Database: nr.pal 
           6,348,806 sequences; 2,166,943,470 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gi|151945589|gb|EDN63830.1|  peroxisome-related protein [Sac...   655   0.0  
gi|6324529|ref|NP_014598.1|  Phosphorylated tail-anchored ty...   655   0.0  
gi|50287169|ref|XP_446014.1|  unnamed protein product [Candi...    64   3e-08
gi|50310809|ref|XP_455427.1|  unnamed protein product [Kluyv...    62   9e-08
gi|156840853|ref|XP_001643804.1|  hypothetical protein Kpol_...    55   6e-06
gi|45185966|ref|NP_983682.1|  ACR280Cp [Ashbya gossypii ATCC...    49   6e-04
gi|26449580|dbj|BAC41916.1|  unknown protein [Arabidopsis th...    35   6.3  
gi|15227582|ref|NP_180523.1|  unknown protein [Arabidopsis t...    35   8.1  
>gi|151945589|gb|EDN63830.1| peroxisome-related protein [Saccharomyces cerevisiae YJM789]
          Length = 383

 Score =  655 bits (1690), Expect = 0.0,   Method: Composition-based stats.
 Identities = 344/383 (89%), Positives = 345/383 (90%)

Query: 1   MAASEIMNNLPMHXXXXXXXXXXXXXXFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60
           MAASEIMNNLPMH              FIESDESTKSVNDQRSEVFQECVNLFIKRDIKD
Sbjct: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60

Query: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120
           CLEKMSEVGFIDITVF+SNPMILDLFVSACDIMPSFTKLGLTLQ EILNIFTLDTPQCIE
Sbjct: 61  CLEKMSEVGFIDITVFRSNPMILDLFVSACDIMPSFTKLGLTLQGEILNIFTLDTPQCIE 120

Query: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180
           TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT
Sbjct: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180

Query: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240
           TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA
Sbjct: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240

Query: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300
           ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ
Sbjct: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300

Query: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVXXXXXXXXXXXXXXXXXXXXXXXMAIFK 360
           STGTAPRKKNNDITVLAGSFWAVLKHHFTRSV                       MAIFK
Sbjct: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360

Query: 361 HVPAAFHTVYPQIVGLLKLLASI 383
           HVPAAFHTVYPQIVGLLKLLASI
Sbjct: 361 HVPAAFHTVYPQIVGLLKLLASI 383
>gi|6324529|ref|NP_014598.1| Phosphorylated tail-anchored type II integral peroxisomal membrane
           protein required for peroxisome biogenesis, cells
           lacking Pex15p mislocalize peroxisomal matrix proteins
           to cytosol, overexpression results in impaired
           peroxisome assembly; Pex15p [Saccharomyces cerevisiae]
 gi|74583689|sp|Q08215|PEX15_YEAST Peroxisomal membrane protein PEX15 (Peroxin-15) (Peroxisome
           biosynthesis protein PAS21)
 gi|1419845|emb|CAA99046.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 383

 Score =  655 bits (1690), Expect = 0.0,   Method: Composition-based stats.
 Identities = 346/383 (90%), Positives = 346/383 (90%)

Query: 1   MAASEIMNNLPMHXXXXXXXXXXXXXXFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60
           MAASEIMNNLPMH              FIESDESTKSVNDQRSEVFQECVNLFIKRDIKD
Sbjct: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60

Query: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120
           CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE
Sbjct: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120

Query: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180
           TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT
Sbjct: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180

Query: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240
           TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA
Sbjct: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240

Query: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300
           ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ
Sbjct: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300

Query: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVXXXXXXXXXXXXXXXXXXXXXXXMAIFK 360
           STGTAPRKKNNDITVLAGSFWAVLKHHFTRSV                       MAIFK
Sbjct: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360

Query: 361 HVPAAFHTVYPQIVGLLKLLASI 383
           HVPAAFHTVYPQIVGLLKLLASI
Sbjct: 361 HVPAAFHTVYPQIVGLLKLLASI 383
>gi|50287169|ref|XP_446014.1| unnamed protein product [Candida glabrata]
 gi|49525321|emb|CAG58938.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 374

 Score = 63.5 bits (153), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 56/200 (28%), Positives = 97/200 (48%), Gaps = 12/200 (6%)

Query: 31  SDESTKSV---NDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNPMILDLFV 87
           SDE   S    +D  S+ +  C++LFIK + K CLE M   G ++ +    N   ++LF+
Sbjct: 32  SDEPAMSPYQNDDNTSDEYYNCLDLFIKGEPKQCLEAMLSCGLLNESQIFQNMDSVELFI 91

Query: 88  SACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQ 147
           +AC  +     LG++LQ++I+ +F     + +E  +      S L +I K     I+ I 
Sbjct: 92  NACSRVSDLATLGISLQNKIIQLFIYS--EILEFVRRNSPAASALALITKLHGNIIRAIG 149

Query: 148 FNLTDHTEQEEKTL-ELESIMSDFIFVYITKMRTTIDV----VGLQELIEIFIFQVKVKL 202
                  E+ +    E ++++ D  F    K RT        V +  L E+++F V+++L
Sbjct: 150 LMRGSRDERYQIIADEKDALIHDIGFH--VKKRTNESRRQYNVEMLMLAELYLFDVQIQL 207

Query: 203 HHKKPSPNMYWALCKTLPKL 222
             KK SP +Y  LC  + +L
Sbjct: 208 EGKKKSPKLYEDLCDKVLQL 227
>gi|50310809|ref|XP_455427.1| unnamed protein product [Kluyveromyces lactis]
 gi|49644563|emb|CAG98135.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 357

 Score = 61.6 bits (148), Expect = 9e-08,   Method: Composition-based stats.
 Identities = 49/190 (25%), Positives = 89/190 (46%), Gaps = 7/190 (3%)

Query: 40  DQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKL 99
           D   E  QEC++L++K D+K+CLE M E G ++    +++     L +     M +   +
Sbjct: 32  DIEKEHAQECLDLYVKGDLKECLELMYEYGLLNSNKMQTSLKSWQLMMDCVSQMNNVGVI 91

Query: 100 GLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVIN-KFFRCCIKVIQFNLTDHTEQEE 158
           G +L   +   FT +    +  R I +  LS  L+I  +FF   +K  + N+  + E  +
Sbjct: 92  GTSLDKRLKEWFTNEE---LLLRLIKMKPLSDQLIITYQFFYSSLKFWKRNVKQNYEHID 148

Query: 159 KTLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKT 218
              EL     + +     + +T  ++  L ++++  IF V+++   KK S  MY   C+ 
Sbjct: 149 ---ELSISCKELLLQTSRRCQTVTEIQNLSQILDFLIFDVQIETLQKKASITMYTRFCQL 205

Query: 219 LPKLSPTLKG 228
             KL   LK 
Sbjct: 206 DDKLQSKLKA 215
>gi|156840853|ref|XP_001643804.1| hypothetical protein Kpol_1044p4 [Vanderwaltozyma polyspora DSM
           70294]
 gi|156114430|gb|EDO15946.1| hypothetical protein Kpol_1044p4 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 370

 Score = 55.5 bits (132), Expect = 6e-06,   Method: Composition-based stats.
 Identities = 56/231 (24%), Positives = 110/231 (47%), Gaps = 20/231 (8%)

Query: 32  DESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNPM-ILDLFVSAC 90
           DES   V+D  +  +Q+C+N F+  D   C+E M++ GF+D  + + + + IL+LF + C
Sbjct: 33  DESEVKVDDT-TRKYQQCLNTFVGGDPIKCIELMNKYGFLDQNLMEDSDIPILELFFNVC 91

Query: 91  DIMPSFTKL---GLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQ 147
           + +P+F  +      +   ILN +  D    +          + L +  KF +  IK ++
Sbjct: 92  ENIPNFKSIKSEDFVIVESILNKYLEDNDSSLN---------NDLTLYVKFLKSYIKFLK 142

Query: 148 FNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDV-VGLQELIEIFIFQVKVKLHHKK 206
            +     + E ++++L   + + I      + ++ DV + + E+IEI+   +++KL    
Sbjct: 143 SDSI--KDNEVRSIDLGYKVKNVI--AKINVESSEDVHMEICEMIEIYFVHIEIKLQENS 198

Query: 207 PSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILNSIDNKIQKDKAKSK 257
            + + Y   CK+ P +   L         +  + I+N +  K QK   KSK
Sbjct: 199 LTTSRYELFCKSNPVIHQLLN-TKTKNGQTYYNMIMNQLTPKDQKVSKKSK 248
>gi|45185966|ref|NP_983682.1| ACR280Cp [Ashbya gossypii ATCC 10895]
 gi|44981756|gb|AAS51506.1| ACR280Cp [Ashbya gossypii ATCC 10895]
          Length = 357

 Score = 48.9 bits (115), Expect = 6e-04,   Method: Composition-based stats.
 Identities = 42/227 (18%), Positives = 103/227 (45%), Gaps = 5/227 (2%)

Query: 40  DQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKL 99
           D + E  +EC +L++K      L K+ + G ++    +    +    ++A + + S  ++
Sbjct: 30  DSQEERLRECRDLYLKAHFGGFLVKVYQYGLLEDGAQRYTADVWGWVLAAVNGLRSANEI 89

Query: 100 GLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEK 159
             ++  ++    +  +    +     L  L +  ++  +FR  +++      D  E  + 
Sbjct: 90  PSSVLRQLRTELSRSSGGVYDVVSA-LSVLERARLLLSYFRSAVRLASL---DTAENADY 145

Query: 160 TLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTL 219
              +E  +   +   +  + +  ++  L +L+E+++  ++V+  H++   ++YW LC+  
Sbjct: 146 LRRVEGNLCRELGRLVLDVHSEQELGYLVKLVELYLLDLQVRCLHREMDKSLYWTLCRKF 205

Query: 220 PKLSPTLKGLYLSKD-VSIEDAILNSIDNKIQKDKAKSKGKQRGVKQ 265
           P +S  L G   S++ VS E+ IL  +  K +  K K    +R V +
Sbjct: 206 PLMSRKLSGSPQSRNGVSCEEHILLQLQPKKKTIKNKHASSERRVAR 252
>gi|26449580|dbj|BAC41916.1| unknown protein [Arabidopsis thaliana]
          Length = 512

 Score = 35.4 bits (80), Expect = 6.3,   Method: Composition-based stats.
 Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 4/111 (3%)

Query: 217 KTLPKLSPTLKGLYLSKDVSIEDAILNSIDN-KIQKDKAKSKGKQRGVKQKIHHFHEPML 275
           + L    P     +  +D  +++   NS D  KI  D  + +  +R   Q+   F EP  
Sbjct: 334 RNLEPTVPQSDSAFFKRDEELKELSENSADEIKISYDSDEHEPSERTTDQE---FEEPYE 390

Query: 276 HNSSEEQVKVEDAFNQRTSTDSRLQSTGTAPRKKNNDITVLAGSFWAVLKH 326
            N  EE+ ++ +A     +     + + T+PR    D+  L  + W VL H
Sbjct: 391 RNDGEERQQLVEAEASDVNHHGNSEESVTSPRSVLPDMLHLDQTAWEVLDH 441
>gi|15227582|ref|NP_180523.1| unknown protein [Arabidopsis thaliana]
 gi|3582336|gb|AAC35233.1| hypothetical protein [Arabidopsis thaliana]
          Length = 747

 Score = 35.4 bits (80), Expect = 8.1,   Method: Composition-based stats.
 Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 4/111 (3%)

Query: 217 KTLPKLSPTLKGLYLSKDVSIEDAILNSIDN-KIQKDKAKSKGKQRGVKQKIHHFHEPML 275
           + L    P     +  +D  +++   NS D  KI  D  + +  +R   Q+   F EP  
Sbjct: 569 RNLEPTVPQSDSAFFKRDEELKELSENSADEIKISYDSDEHEPSERTTDQE---FEEPYE 625

Query: 276 HNSSEEQVKVEDAFNQRTSTDSRLQSTGTAPRKKNNDITVLAGSFWAVLKH 326
            N  EE+ ++ +A     +     + + T+PR    D+  L  + W VL H
Sbjct: 626 RNDGEERQQLVEAEASDVNHHGNSEESVTSPRSVLPDMLHLDQTAWEVLDH 676