BLASTP 2.2.17 [Aug-26-2007]


Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden, 
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,  
Eugene V. Koonin, and Stephen F. Altschul (2001), 
"Improving the accuracy of PSI-BLAST protein database searches with 
composition-based statistics and other refinements",  Nucleic Acids Res. 29:2994-3005.

Query= YOL044W__[Saccharomyces_cerevisiae]
         (383 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           6,899,187 sequences; 2,350,152,223 total letters

Searching..................................................done


Results from round 1


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EDN63830.1|  peroxisome-related protein [Saccharomyces ce...   696   0.0  
ref|NP_014598.1|  Phosphorylated tail-anchored type II integ...   696   0.0  
ref|XP_446014.1|  unnamed protein product [Candida glabrata]...    73   3e-11
ref|XP_455427.1|  unnamed protein product [Kluyveromyces lac...    63   3e-08
ref|XP_001643804.1|  hypothetical protein Kpol_1044p4 [Vande...    57   2e-06
ref|NP_983682.1|  ACR280Cp [Ashbya gossypii ATCC 10895] >gi|...    51   1e-04
ref|XP_001481573.1|  conserved hypothetical protein [Aspergi...    38   1.5  
ref|ZP_01924428.1|  SSS sodium solute transporter superfamil...    37   1.8  
ref|NP_001080024.1|  hypothetical protein LOC379716 [Xenopus...    36   4.0  
dbj|BAC41916.1|  unknown protein [Arabidopsis thaliana]            36   5.9  
ref|NP_180523.1|  unknown protein [Arabidopsis thaliana] >gi...    35   7.4  
>gb|EDN63830.1| peroxisome-related protein [Saccharomyces cerevisiae YJM789]
          Length = 383

 Score =  696 bits (1797), Expect = 0.0,   Method: Composition-based stats.
 Identities = 381/383 (99%), Positives = 382/383 (99%)

Query: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60
           MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD
Sbjct: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60

Query: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120
           CLEKMSEVGFIDITVF+SNPMILDLFVSACDIMPSFTKLGLTLQ EILNIFTLDTPQCIE
Sbjct: 61  CLEKMSEVGFIDITVFRSNPMILDLFVSACDIMPSFTKLGLTLQGEILNIFTLDTPQCIE 120

Query: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180
           TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT
Sbjct: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180

Query: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240
           TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA
Sbjct: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240

Query: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300
           ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ
Sbjct: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300

Query: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360
           STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK
Sbjct: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360

Query: 361 HVPAAFHTVYPQIVGLLKLLASI 383
           HVPAAFHTVYPQIVGLLKLLASI
Sbjct: 361 HVPAAFHTVYPQIVGLLKLLASI 383
>ref|NP_014598.1| Phosphorylated tail-anchored type II integral peroxisomal membrane
           protein required for peroxisome biogenesis, cells
           lacking Pex15p mislocalize peroxisomal matrix proteins
           to cytosol, overexpression results in impaired
           peroxisome assembly; Pex15p [Saccharomyces cerevisiae]
 sp|Q08215|PEX15_YEAST Peroxisomal membrane protein PEX15 (Peroxin-15) (Peroxisome
           biosynthesis protein PAS21)
 emb|CAA99046.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 383

 Score =  696 bits (1796), Expect = 0.0,   Method: Composition-based stats.
 Identities = 383/383 (100%), Positives = 383/383 (100%)

Query: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60
           MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD
Sbjct: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60

Query: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120
           CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE
Sbjct: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120

Query: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180
           TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT
Sbjct: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180

Query: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240
           TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA
Sbjct: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240

Query: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300
           ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ
Sbjct: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300

Query: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360
           STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK
Sbjct: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360

Query: 361 HVPAAFHTVYPQIVGLLKLLASI 383
           HVPAAFHTVYPQIVGLLKLLASI
Sbjct: 361 HVPAAFHTVYPQIVGLLKLLASI 383
>ref|XP_446014.1| unnamed protein product [Candida glabrata]
 emb|CAG58938.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 374

 Score = 73.2 bits (178), Expect = 3e-11,   Method: Composition-based stats.
 Identities = 96/389 (24%), Positives = 179/389 (46%), Gaps = 36/389 (9%)

Query: 7   MNNLPMHSLDSSLRDLLNDDLFIESDESTKSV---NDQRSEVFQECVNLFIKRDIKDCLE 63
           M+N+   S  +++++LL+DD F  SDE   S    +D  S+ +  C++LFIK + K CLE
Sbjct: 10  MSNVLPESTMTTVQNLLDDDDF--SDEPAMSPYQNDDNTSDEYYNCLDLFIKGEPKQCLE 67

Query: 64  KMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRK 123
            M   G ++ +    N   ++LF++AC  +     LG++LQ++I+ +F     + +E  +
Sbjct: 68  AMLSCGLLNESQIFQNMDSVELFINACSRVSDLATLGISLQNKIIQLFIYS--EILEFVR 125

Query: 124 IILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTL-ELESIMSDFIFVYITKMRTTI 182
                 S L +I K     I+ I        E+ +    E ++++ D  F    K RT  
Sbjct: 126 RNSPAASALALITKLHGNIIRAIGLMRGSRDERYQIIADEKDALIHDIGFH--VKKRTNE 183

Query: 183 DV----VGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIE 238
                 V +  L E+++F V+++L  KK SP +Y  LC  + +L        L +   + 
Sbjct: 184 SRRQYNVEMLMLAELYLFDVQIQLEGKKKSPKLYEDLCDKVLQLKSVFDETVLDEK-PLS 242

Query: 239 DAILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHE-PMLHNSSEEQVKVEDAFNQRTSTDS 297
             IL  ++ K    + K    ++ +++        P   +   +Q+K++   N   S  +
Sbjct: 243 QIILAKLEQKKDSVEKKKSSSRKSLRESTKVLQAIPTPSDEVRDQIKMD---NLALSKGA 299

Query: 298 RLQSTGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKN---GLLLTGLLLLLCLKKYKS 354
               T   P+K            W  LK   T+  + KN     L+  + ++L +K+Y+ 
Sbjct: 300 FSSITSHIPQK------------W--LKALTTQKWIYKNMKQLSLVLVVAVILLVKRYRM 345

Query: 355 LMAIFKHVPAAFHTVYPQIVGLLKLLASI 383
           +   F  +P+    + P IV +L+LL+S+
Sbjct: 346 ITKWFGEIPSTLSQLKPAIVEILRLLSSL 374
>ref|XP_455427.1| unnamed protein product [Kluyveromyces lactis]
 emb|CAG98135.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 357

 Score = 63.2 bits (152), Expect = 3e-08,   Method: Composition-based stats.
 Identities = 55/213 (25%), Positives = 99/213 (46%), Gaps = 11/213 (5%)

Query: 17  SSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVF 76
           + L  LL+DD F  + +      D   E  QEC++L++K D+K+CLE M E G ++    
Sbjct: 13  TRLEQLLDDDRFRLNLQPV----DIEKEHAQECLDLYVKGDLKECLELMYEYGLLNSNKM 68

Query: 77  KSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVIN 136
           +++     L +     M +   +G +L   +   FT +    +  R I +  LS  L+I 
Sbjct: 69  QTSLKSWQLMMDCVSQMNNVGVIGTSLDKRLKEWFTNEE---LLLRLIKMKPLSDQLIIT 125

Query: 137 -KFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFI 195
            +FF   +K  + N+  + E  +   EL     + +     + +T  ++  L ++++  I
Sbjct: 126 YQFFYSSLKFWKRNVKQNYEHID---ELSISCKELLLQTSRRCQTVTEIQNLSQILDFLI 182

Query: 196 FQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKG 228
           F V+++   KK S  MY   C+   KL   LK 
Sbjct: 183 FDVQIETLQKKASITMYTRFCQLDDKLQSKLKA 215
>ref|XP_001643804.1| hypothetical protein Kpol_1044p4 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO15946.1| hypothetical protein Kpol_1044p4 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 370

 Score = 57.0 bits (136), Expect = 2e-06,   Method: Composition-based stats.
 Identities = 59/242 (24%), Positives = 118/242 (48%), Gaps = 22/242 (9%)

Query: 21  DLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNP 80
           ++L++D F+  DES   V+D  +  +Q+C+N F+  D   C+E M++ GF+D  + + + 
Sbjct: 24  EVLDEDEFL--DESEVKVDDT-TRKYQQCLNTFVGGDPIKCIELMNKYGFLDQNLMEDSD 80

Query: 81  M-ILDLFVSACDIMPSFTKL---GLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVIN 136
           + IL+LF + C+ +P+F  +      +   ILN +  D    +          + L +  
Sbjct: 81  IPILELFFNVCENIPNFKSIKSEDFVIVESILNKYLEDNDSSLN---------NDLTLYV 131

Query: 137 KFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDV-VGLQELIEIFI 195
           KF +  IK ++ +     + E ++++L   + + I      + ++ DV + + E+IEI+ 
Sbjct: 132 KFLKSYIKFLKSDSI--KDNEVRSIDLGYKVKNVI--AKINVESSEDVHMEICEMIEIYF 187

Query: 196 FQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILNSIDNKIQKDKAK 255
             +++KL     + + Y   CK+ P +   L         +  + I+N +  K QK   K
Sbjct: 188 VHIEIKLQENSLTTSRYELFCKSNPVIHQLLN-TKTKNGQTYYNMIMNQLTPKDQKVSKK 246

Query: 256 SK 257
           SK
Sbjct: 247 SK 248
>ref|NP_983682.1| ACR280Cp [Ashbya gossypii ATCC 10895]
 gb|AAS51506.1| ACR280Cp [Ashbya gossypii ATCC 10895]
          Length = 357

 Score = 51.2 bits (121), Expect = 1e-04,   Method: Composition-based stats.
 Identities = 53/256 (20%), Positives = 115/256 (44%), Gaps = 12/256 (4%)

Query: 11  PMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGF 70
           P  SLDS    LL  +LF   D     V D + E  +EC +L++K      L K+ + G 
Sbjct: 8   PSLSLDS----LLQHELF--QDARAGKV-DSQEERLRECRDLYLKAHFGGFLVKVYQYGL 60

Query: 71  IDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLS 130
           ++    +    +    ++A + + S  ++  ++  ++    +  +    +     L  L 
Sbjct: 61  LEDGAQRYTADVWGWVLAAVNGLRSANEIPSSVLRQLRTELSRSSGGVYDVVSA-LSVLE 119

Query: 131 KLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQEL 190
           +  ++  +FR  +++      D  E  +    +E  +   +   +  + +  ++  L +L
Sbjct: 120 RARLLLSYFRSAVRLASL---DTAENADYLRRVEGNLCRELGRLVLDVHSEQELGYLVKL 176

Query: 191 IEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKD-VSIEDAILNSIDNKI 249
           +E+++  ++V+  H++   ++YW LC+  P +S  L G   S++ VS E+ IL  +  K 
Sbjct: 177 VELYLLDLQVRCLHREMDKSLYWTLCRKFPLMSRKLSGSPQSRNGVSCEEHILLQLQPKK 236

Query: 250 QKDKAKSKGKQRGVKQ 265
           +  K K    +R V +
Sbjct: 237 KTIKNKHASSERRVAR 252
>ref|XP_001481573.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
 gb|EBA27381.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
          Length = 692

 Score = 37.7 bits (86), Expect = 1.5,   Method: Composition-based stats.
 Identities = 33/110 (30%), Positives = 52/110 (47%), Gaps = 10/110 (9%)

Query: 242 LNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQS 301
           L +I  ++Q D +  KG Q G       F + +L  ++  + ++ D   +RT+  SRL  
Sbjct: 16  LKAILEQMQADHSDEKGVQAGASSS--SFQDALLKANTVLE-EILDDLQRRTTRQSRLAK 72

Query: 302 TGTAPRKKNNDITVLAGSFWAV--LKHHFTRSVLNKNGLLLTGLLLLLCL 349
            G  P KK +    L GS   +  LK +F   +LN    +LT L+   CL
Sbjct: 73  LGW-PSKKGD----LEGSIAQLERLKTYFILVILNDRSCVLTSLIYCPCL 117
>ref|ZP_01924428.1| SSS sodium solute transporter superfamily [Victivallis vadensis
           ATCC BAA-548]
 gb|EDM95109.1| SSS sodium solute transporter superfamily [Victivallis vadensis
           ATCC BAA-548]
          Length = 495

 Score = 37.4 bits (85), Expect = 1.8,   Method: Composition-based stats.
 Identities = 27/84 (32%), Positives = 44/84 (52%), Gaps = 11/84 (13%)

Query: 306 PRKKNNDITVLAGSFWA------VLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIF 359
           P++  ND+T++  S +        L   FTR V   NG LLTGL++ LC      +++ F
Sbjct: 393 PKESINDMTIILASLFGGGLLSIYLLGFFTRRV--GNGALLTGLVIALCFNVLM-MLSSF 449

Query: 360 KHVPAAFHTVYPQIV--GLLKLLA 381
             +   FH+ +  I+  G+L L+A
Sbjct: 450 GVIRMPFHSYWTSILVNGILALIA 473
>ref|NP_001080024.1| hypothetical protein LOC379716 [Xenopus laevis]
 gb|AAH59330.1| MGC69071 protein [Xenopus laevis]
          Length = 246

 Score = 36.2 bits (82), Expect = 4.0,   Method: Composition-based stats.
 Identities = 34/109 (31%), Positives = 52/109 (47%), Gaps = 17/109 (15%)

Query: 20  RDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSN 79
           RDL    L +E++ S K ++D+ SE      N  + R        ++    + I +F+SN
Sbjct: 137 RDLYEIKLLMEAETSGKHLSDKLSED-----NGVVSRGSSSSRHPIALQIRLLIHIFRSN 191

Query: 80  P-MILDLFVSACDIMPSFTKLGL-----------TLQSEILNIFTLDTP 116
           P ++LDL  ++CD+     KLGL            L S IL+I TL  P
Sbjct: 192 PPLLLDLLKNSCDLFIPLDKLGLYKTNPGFVGLCGLTSSILSILTLLHP 240
>dbj|BAC41916.1| unknown protein [Arabidopsis thaliana]
          Length = 512

 Score = 35.8 bits (81), Expect = 5.9,   Method: Composition-based stats.
 Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 4/111 (3%)

Query: 217 KTLPKLSPTLKGLYLSKDVSIEDAILNSIDN-KIQKDKAKSKGKQRGVKQKIHHFHEPML 275
           + L    P     +  +D  +++   NS D  KI  D  + +  +R   Q+   F EP  
Sbjct: 334 RNLEPTVPQSDSAFFKRDEELKELSENSADEIKISYDSDEHEPSERTTDQE---FEEPYE 390

Query: 276 HNSSEEQVKVEDAFNQRTSTDSRLQSTGTAPRKKNNDITVLAGSFWAVLKH 326
            N  EE+ ++ +A     +     + + T+PR    D+  L  + W VL H
Sbjct: 391 RNDGEERQQLVEAEASDVNHHGNSEESVTSPRSVLPDMLHLDQTAWEVLDH 441
>ref|NP_180523.1| unknown protein [Arabidopsis thaliana]
 gb|AAC35233.1| hypothetical protein [Arabidopsis thaliana]
          Length = 747

 Score = 35.4 bits (80), Expect = 7.4,   Method: Composition-based stats.
 Identities = 27/111 (24%), Positives = 47/111 (42%), Gaps = 4/111 (3%)

Query: 217 KTLPKLSPTLKGLYLSKDVSIEDAILNSIDN-KIQKDKAKSKGKQRGVKQKIHHFHEPML 275
           + L    P     +  +D  +++   NS D  KI  D  + +  +R   Q+   F EP  
Sbjct: 569 RNLEPTVPQSDSAFFKRDEELKELSENSADEIKISYDSDEHEPSERTTDQE---FEEPYE 625

Query: 276 HNSSEEQVKVEDAFNQRTSTDSRLQSTGTAPRKKNNDITVLAGSFWAVLKH 326
            N  EE+ ++ +A     +     + + T+PR    D+  L  + W VL H
Sbjct: 626 RNDGEERQQLVEAEASDVNHHGNSEESVTSPRSVLPDMLHLDQTAWEVLDH 676
Searching..................................................done


Results from round 2


                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value
Sequences used in model and found again:

gb|EDN63830.1|  peroxisome-related protein [Saccharomyces ce...   569   e-161
ref|NP_014598.1|  Phosphorylated tail-anchored type II integ...   566   e-160
ref|XP_446014.1|  unnamed protein product [Candida glabrata]...   380   e-103
ref|NP_983682.1|  ACR280Cp [Ashbya gossypii ATCC 10895] >gi|...   314   9e-84
ref|XP_455427.1|  unnamed protein product [Kluyveromyces lac...   254   7e-66
ref|XP_001643804.1|  hypothetical protein Kpol_1044p4 [Vande...   247   9e-64
Sequences not found previously or not previously below threshold:

ref|YP_001304172.1|  cysteinyl-tRNA synthetase [Parabacteroi...    41   0.11 
ref|YP_988614.1|  FtsK/SpoIIIE family protein [Bartonella ba...    39   0.49 
ref|XP_001518189.1|  PREDICTED: similar to Vps39/Vam6-like p...    38   1.5  
ref|ZP_01924428.1|  SSS sodium solute transporter superfamil...    38   1.6  
ref|XP_001459777.1|  hypothetical protein GSPATT00025114001 ...    38   1.8  
ref|XP_001642383.1|  hypothetical protein Kpol_274p8 [Vander...    37   2.2  
ref|XP_956670.1|  hypothetical protein NCU00157 [Neurospora ...    36   4.6  
ref|XP_001887599.1|  predicted protein [Laccaria bicolor S23...    36   6.0  
ref|XP_001459513.1|  hypothetical protein GSPATT00024847001 ...    36   6.3  
emb|CAO80294.1|  hypothetical protein [Candidatus Cloacamona...    35   7.6  
CONVERGED!
>gb|EDN63830.1| peroxisome-related protein [Saccharomyces cerevisiae YJM789]
          Length = 383

 Score =  569 bits (1468), Expect = e-161,   Method: Composition-based stats.
 Identities = 381/383 (99%), Positives = 382/383 (99%)

Query: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60
           MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD
Sbjct: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60

Query: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120
           CLEKMSEVGFIDITVF+SNPMILDLFVSACDIMPSFTKLGLTLQ EILNIFTLDTPQCIE
Sbjct: 61  CLEKMSEVGFIDITVFRSNPMILDLFVSACDIMPSFTKLGLTLQGEILNIFTLDTPQCIE 120

Query: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180
           TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT
Sbjct: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180

Query: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240
           TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA
Sbjct: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240

Query: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300
           ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ
Sbjct: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300

Query: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360
           STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK
Sbjct: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360

Query: 361 HVPAAFHTVYPQIVGLLKLLASI 383
           HVPAAFHTVYPQIVGLLKLLASI
Sbjct: 361 HVPAAFHTVYPQIVGLLKLLASI 383
>ref|NP_014598.1| Phosphorylated tail-anchored type II integral peroxisomal membrane
           protein required for peroxisome biogenesis, cells
           lacking Pex15p mislocalize peroxisomal matrix proteins
           to cytosol, overexpression results in impaired
           peroxisome assembly; Pex15p [Saccharomyces cerevisiae]
 sp|Q08215|PEX15_YEAST Peroxisomal membrane protein PEX15 (Peroxin-15) (Peroxisome
           biosynthesis protein PAS21)
 emb|CAA99046.1| unnamed protein product [Saccharomyces cerevisiae]
          Length = 383

 Score =  566 bits (1460), Expect = e-160,   Method: Composition-based stats.
 Identities = 383/383 (100%), Positives = 383/383 (100%)

Query: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60
           MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD
Sbjct: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKD 60

Query: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120
           CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE
Sbjct: 61  CLEKMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIE 120

Query: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180
           TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT
Sbjct: 121 TRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRT 180

Query: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240
           TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA
Sbjct: 181 TIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDA 240

Query: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300
           ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ
Sbjct: 241 ILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQ 300

Query: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360
           STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK
Sbjct: 301 STGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFK 360

Query: 361 HVPAAFHTVYPQIVGLLKLLASI 383
           HVPAAFHTVYPQIVGLLKLLASI
Sbjct: 361 HVPAAFHTVYPQIVGLLKLLASI 383
>ref|XP_446014.1| unnamed protein product [Candida glabrata]
 emb|CAG58938.1| unnamed protein product [Candida glabrata CBS 138]
          Length = 374

 Score =  380 bits (976), Expect = e-103,   Method: Composition-based stats.
 Identities = 96/389 (24%), Positives = 179/389 (46%), Gaps = 36/389 (9%)

Query: 7   MNNLPMHSLDSSLRDLLNDDLFIESDESTKSV---NDQRSEVFQECVNLFIKRDIKDCLE 63
           M+N+   S  +++++LL+DD F  SDE   S    +D  S+ +  C++LFIK + K CLE
Sbjct: 10  MSNVLPESTMTTVQNLLDDDDF--SDEPAMSPYQNDDNTSDEYYNCLDLFIKGEPKQCLE 67

Query: 64  KMSEVGFIDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRK 123
            M   G ++ +    N   ++LF++AC  +     LG++LQ++I+ +F     + +E  +
Sbjct: 68  AMLSCGLLNESQIFQNMDSVELFINACSRVSDLATLGISLQNKIIQLFIYS--EILEFVR 125

Query: 124 IILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTL-ELESIMSDFIFVYITKMRTTI 182
                 S L +I K     I+ I        E+ +    E ++++ D  F    K RT  
Sbjct: 126 RNSPAASALALITKLHGNIIRAIGLMRGSRDERYQIIADEKDALIHDIGFH--VKKRTNE 183

Query: 183 DV----VGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIE 238
                 V +  L E+++F V+++L  KK SP +Y  LC  + +L        L +   + 
Sbjct: 184 SRRQYNVEMLMLAELYLFDVQIQLEGKKKSPKLYEDLCDKVLQLKSVFDETVLDEK-PLS 242

Query: 239 DAILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHE-PMLHNSSEEQVKVEDAFNQRTSTDS 297
             IL  ++ K    + K    ++ +++        P   +   +Q+K++   N   S  +
Sbjct: 243 QIILAKLEQKKDSVEKKKSSSRKSLRESTKVLQAIPTPSDEVRDQIKMD---NLALSKGA 299

Query: 298 RLQSTGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNG---LLLTGLLLLLCLKKYKS 354
               T   P+K            W  LK   T+  + KN     L+  + ++L +K+Y+ 
Sbjct: 300 FSSITSHIPQK------------W--LKALTTQKWIYKNMKQLSLVLVVAVILLVKRYRM 345

Query: 355 LMAIFKHVPAAFHTVYPQIVGLLKLLASI 383
           +   F  +P+    + P IV +L+LL+S+
Sbjct: 346 ITKWFGEIPSTLSQLKPAIVEILRLLSSL 374
>ref|NP_983682.1| ACR280Cp [Ashbya gossypii ATCC 10895]
 gb|AAS51506.1| ACR280Cp [Ashbya gossypii ATCC 10895]
          Length = 357

 Score =  314 bits (804), Expect = 9e-84,   Method: Composition-based stats.
 Identities = 68/354 (19%), Positives = 144/354 (40%), Gaps = 28/354 (7%)

Query: 11  PMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGF 70
           P  SLDS    LL  +LF   D     V D + E  +EC +L++K      L K+ + G 
Sbjct: 8   PSLSLDS----LLQHELF--QDARAGKV-DSQEERLRECRDLYLKAHFGGFLVKVYQYGL 60

Query: 71  IDITVFKSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLS 130
           ++    +    +    ++A + + S  ++  ++  ++    +  +    +     L  L 
Sbjct: 61  LEDGAQRYTADVWGWVLAAVNGLRSANEIPSSVLRQLRTELSRSSGGVYDVVSA-LSVLE 119

Query: 131 KLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQEL 190
           +  ++  +FR  +++      D  E  +    +E  +   +   +  + +  ++  L +L
Sbjct: 120 RARLLLSYFRSAVRLASL---DTAENADYLRRVEGNLCRELGRLVLDVHSEQELGYLVKL 176

Query: 191 IEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKD-VSIEDAILNSIDNKI 249
           +E+++  ++V+  H++   ++YW LC+  P +S  L G   S++ VS E+ IL  +  K 
Sbjct: 177 VELYLLDLQVRCLHREMDKSLYWTLCRKFPLMSRKLSGSPQSRNGVSCEEHILLQLQPKK 236

Query: 250 QKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQSTGTAPRKK 309
           +  K K    +R V +             + +Q +V    N    T      T     + 
Sbjct: 237 KTIKNKHASSERRVARPS---SAAPPRPGARQQDRVARP-NLPLMTPGSPMLTA---DRS 289

Query: 310 NNDITVLAGSFWAVLKHHFTRSVL---NKNGLLLTGLLLLLCLKKYKSLMAIFK 360
            N       S+ A+L +      L     +  +   ++L + L+K +     FK
Sbjct: 290 ENCERKPRHSYAAILNYL--PKWLTDFTDSRFIALVVMLAIALRKLR----WFK 337
>ref|XP_455427.1| unnamed protein product [Kluyveromyces lactis]
 emb|CAG98135.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
          Length = 357

 Score =  254 bits (650), Expect = 7e-66,   Method: Composition-based stats.
 Identities = 56/222 (25%), Positives = 102/222 (45%), Gaps = 11/222 (4%)

Query: 17  SSLRDLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVF 76
           + L  LL+DD F  + +      D   E  QEC++L++K D+K+CLE M E G ++    
Sbjct: 13  TRLEQLLDDDRFRLNLQPV----DIEKEHAQECLDLYVKGDLKECLELMYEYGLLNSNKM 68

Query: 77  KSNPMILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVIN 136
           +++     L +     M +   +G +L   +   FT +    +  R I +  LS  L+I 
Sbjct: 69  QTSLKSWQLMMDCVSQMNNVGVIGTSLDKRLKEWFTNEE---LLLRLIKMKPLSDQLIIT 125

Query: 137 -KFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFI 195
            +FF   +K  + N+  + E  +   EL     + +     + +T  ++  L ++++  I
Sbjct: 126 YQFFYSSLKFWKRNVKQNYEHID---ELSISCKELLLQTSRRCQTVTEIQNLSQILDFLI 182

Query: 196 FQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSI 237
           F V+++   KK S  MY   C+   KL   LK   +    S+
Sbjct: 183 FDVQIETLQKKASITMYTRFCQLDDKLQSKLKANKVKHAQSV 224
>ref|XP_001643804.1| hypothetical protein Kpol_1044p4 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO15946.1| hypothetical protein Kpol_1044p4 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 370

 Score =  247 bits (632), Expect = 9e-64,   Method: Composition-based stats.
 Identities = 75/368 (20%), Positives = 162/368 (44%), Gaps = 26/368 (7%)

Query: 21  DLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCLEKMSEVGFIDITVFKSNP 80
           ++L++D F+  DES   V+D  +  +Q+C+N F+  D   C+E M++ GF+D  + + + 
Sbjct: 24  EVLDEDEFL--DESEVKVDDT-TRKYQQCLNTFVGGDPIKCIELMNKYGFLDQNLMEDSD 80

Query: 81  M-ILDLFVSACDIMPSFTKL---GLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVIN 136
           + IL+LF + C+ +P+F  +      +   ILN +  D    +          + L +  
Sbjct: 81  IPILELFFNVCENIPNFKSIKSEDFVIVESILNKYLEDNDSSLN---------NDLTLYV 131

Query: 137 KFFRCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDV-VGLQELIEIFI 195
           KF +  IK ++ +     + E ++++L   + + I      + ++ DV + + E+IEI+ 
Sbjct: 132 KFLKSYIKFLKSDSI--KDNEVRSIDLGYKVKNVI--AKINVESSEDVHMEICEMIEIYF 187

Query: 196 FQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILNSIDNKIQKDKAK 255
             +++KL     + + Y   CK+ P +   L         +  + I+N +  K QK   K
Sbjct: 188 VHIEIKLQENSLTTSRYELFCKSNPVIHQLLN-TKTKNGQTYYNMIMNQLTPKDQKVSKK 246

Query: 256 SKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTDSRLQSTGTAPRKKNNDITV 315
           SK      + +  H H+   +++  +  +             +     +  +   N I+ 
Sbjct: 247 SKPTVSSQQHQHQHQHQHNHNHNQHQHKRSNSNNKTMGDKKDKNAEQSSNNQSLTNQISR 306

Query: 316 LAGSFWAVLKHHFTRSVLNKNGLLLTGLLLLLCLKKYKSLMAIFKHVPAAFHTVYPQIVG 375
                   L   + R  ++   L++  LL++   ++ + L      +      + P ++ 
Sbjct: 307 TLRR----LSIFYNRLEISHRSLIVIILLIVTLSRRLRFLGKAKDLILNIKGKLAPSLMQ 362

Query: 376 LLKLLASI 383
           LL +LAS+
Sbjct: 363 LLNILASV 370
>ref|YP_001304172.1| cysteinyl-tRNA synthetase [Parabacteroides distasonis ATCC 8503]
 sp|A6LFT6|SYC_PARD8 Cysteinyl-tRNA synthetase (Cysteine--tRNA ligase) (CysRS)
 gb|ABR44550.1| cysteinyl-tRNA synthetase [Parabacteroides distasonis ATCC 8503]
          Length = 491

 Score = 41.4 bits (96), Expect = 0.11,   Method: Composition-based stats.
 Identities = 19/81 (23%), Positives = 32/81 (39%)

Query: 180 TTIDVVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIED 239
           +  D+  LQE+  +F+F +         S N Y A  K +  L    +    +KD +  D
Sbjct: 406 SEEDLKELQEVFHLFLFDILGMKDEASASGNHYEAFGKAVDLLLSIRQQAKANKDWATSD 465

Query: 240 AILNSIDNKIQKDKAKSKGKQ 260
            I N +     + K    G +
Sbjct: 466 KIRNELTAMGFEIKDTKDGAE 486
>ref|YP_988614.1| FtsK/SpoIIIE family protein [Bartonella bacilliformis KC583]
 gb|ABM45556.1| FtsK/SpoIIIE family protein [Bartonella bacilliformis KC583]
          Length = 872

 Score = 39.5 bits (91), Expect = 0.49,   Method: Composition-based stats.
 Identities = 50/258 (19%), Positives = 94/258 (36%), Gaps = 49/258 (18%)

Query: 93  MPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQ--FNL 150
           M +        QS+ ++ F+ +  + +  R I   + +    I+K F     V +  F L
Sbjct: 4   MRNLKT-----QSDSVSFFSDNNAESVSERSINASEENSFSNISKMF-SYPAVWEKAFTL 57

Query: 151 TDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVV-GLQELIEIFIFQVKVKLHHKKPSP 209
             +  +  +T E+E +        I   +    +    Q+L +I      +K   +    
Sbjct: 58  GQNV-RFTRTPEVEILRRRIEKDPIFAKQFEAFIQQESQKLTDI------IKCDQESI-- 108

Query: 210 NMYWALCKTLPKLSPTLKGLYLSKDVS-IEDAILNSIDNKIQKDKAKSKGKQ-------R 261
                    LP +   L    +    S     IL   + +I   + +    Q        
Sbjct: 109 --------KLPLVEKELNAQSVDNKSSFYSSTILKQSEQQIIATRTEHASTQYVHAVQAE 160

Query: 262 GVKQKIHHFHEPMLHNSS--------EEQVKVEDAFNQRTSTDSRLQSTGTAPRKKNNDI 313
            V+Q I  F    L +++         EQ+KV+D   + TS DS    T +   +  +D+
Sbjct: 161 SVEQGIKPF--CYLSDTAFFECEPLMLEQIKVQDPQREATSIDSANNETLS---EAVSDM 215

Query: 314 TVLAGSFWAVLKHHFTRS 331
           ++   S + VLK  F +S
Sbjct: 216 SIT--SLYRVLKCRFPQS 231
>ref|XP_001518189.1| PREDICTED: similar to Vps39/Vam6-like protein, partial
           [Ornithorhynchus anatinus]
          Length = 1309

 Score = 37.6 bits (86), Expect = 1.5,   Method: Composition-based stats.
 Identities = 11/48 (22%), Positives = 24/48 (50%)

Query: 1   MAASEIMNNLPMHSLDSSLRDLLNDDLFIESDESTKSVNDQRSEVFQE 48
           +A++  +  L   S+ + ++ LL D  F  + +  +  +D  SE  Q+
Sbjct: 720 VASNHFVWRLLPVSIATQIQQLLQDKQFELALQLAEMKDDSDSEKLQQ 767
>ref|ZP_01924428.1| SSS sodium solute transporter superfamily [Victivallis vadensis
           ATCC BAA-548]
 gb|EDM95109.1| SSS sodium solute transporter superfamily [Victivallis vadensis
           ATCC BAA-548]
          Length = 495

 Score = 37.6 bits (86), Expect = 1.6,   Method: Composition-based stats.
 Identities = 32/111 (28%), Positives = 51/111 (45%), Gaps = 13/111 (11%)

Query: 281 EQVKVEDAFNQRTSTDSRLQSTG--TAPRKKNNDITVLAGSFWA------VLKHHFTRSV 332
           E ++   A +   S      + G    P++  ND+T++  S +        L   FTR V
Sbjct: 366 EALRFAKAVSLLVSAGMIGGAVGIHYIPKESINDMTIILASLFGGGLLSIYLLGFFTRRV 425

Query: 333 LNKNGLLLTGLLLLLCLKKYKSLMAIFKHVPAAFHTVYPQIV--GLLKLLA 381
              NG LLTGL++ LC      L + F  +   FH+ +  I+  G+L L+A
Sbjct: 426 --GNGALLTGLVIALCFNVLMMLSS-FGVIRMPFHSYWTSILVNGILALIA 473
>ref|XP_001459777.1| hypothetical protein GSPATT00025114001 [Paramecium tetraurelia strain
            d4-2]
 emb|CAK92380.1| unnamed protein product [Paramecium tetraurelia]
          Length = 1319

 Score = 37.6 bits (86), Expect = 1.8,   Method: Composition-based stats.
 Identities = 27/119 (22%), Positives = 52/119 (43%), Gaps = 14/119 (11%)

Query: 231  LSKDVSIEDAILNSIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVK-VEDAF 289
             S    + D I   +  K        K K++ +  K +  + P+   + E+ V   ++A+
Sbjct: 1082 TSNKQWLADNIQLILSPK------TLKSKRKIILDKFNAIYGPLEKKNEEDPVAGFDNAW 1135

Query: 290  NQR--TSTDSRLQSTGTAPRKKNNDITVLAGSFWAVLKHHFTRSVLNKNGLLLTGLLLL 346
            N R  T   SR +   T+P+K N  I  ++ S  A+LK+      L ++  + T +  +
Sbjct: 1136 NFRQFTKNSSRTEMARTSPQKFNKRIDNMSESTKAILKY-----WLYRSRHMKTAIKYI 1189
>ref|XP_001642383.1| hypothetical protein Kpol_274p8 [Vanderwaltozyma polyspora DSM
           70294]
 gb|EDO14525.1| hypothetical protein Kpol_274p8 [Vanderwaltozyma polyspora DSM
           70294]
          Length = 477

 Score = 37.2 bits (85), Expect = 2.2,   Method: Composition-based stats.
 Identities = 22/110 (20%), Positives = 46/110 (41%), Gaps = 6/110 (5%)

Query: 184 VVGLQELIEIFIFQVKVKLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILN 243
              +++L +    QV++   H K     + +L +++P+        YLSK+ SI    L+
Sbjct: 283 RQEIEQLDQFIQKQVQIS-QHLKADEEEHMSLVQSIPR-----DITYLSKNQSITKQTLS 336

Query: 244 SIDNKIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRT 293
               KI   K  +       +Q    F + +  +S    ++++  F Q+ 
Sbjct: 337 QDLRKIAFIKETTDQSISNTQQFTILFQQLLTPDSKVSSIELDKFFQQKI 386
>ref|XP_956670.1| hypothetical protein NCU00157 [Neurospora crassa OR74A]
 sp|Q7RXQ1|CSN1_NEUCR COP9 signalosome complex subunit 1 (CSN complex subunit 1)
 gb|EAA27434.1| conserved hypothetical protein [Neurospora crassa OR74A]
 gb|ABB36580.1| CSN-1 [Neurospora crassa]
          Length = 425

 Score = 36.0 bits (82), Expect = 4.6,   Method: Composition-based stats.
 Identities = 33/192 (17%), Positives = 61/192 (31%), Gaps = 31/192 (16%)

Query: 6   IMNNLPMHSLDSSLR---DLLNDDLFIESDESTKSVNDQRSEVFQECVNLFIKRDIKDCL 62
           I   L   +         +LL++D F E  +        R    +  +  F+      C+
Sbjct: 239 IYGGLLALATMDRHELQANLLDNDSFREFLQ--------REPHIRRAITQFVNGRYAACI 290

Query: 63  EKMSEVG---FIDITVFKSNPM---------ILDLF--VSAC--DIM-PSFTKLGLTLQS 105
           E +        +DI + K  P          I+      S    D M  +F   G +++ 
Sbjct: 291 EILESYRPDYLLDIYLQKHVPKLYADIRTKSIVQYLKPFSCVRLDTMQKAFNGPGPSIED 350

Query: 106 EILNIFTLDTPQCIETRKIILGDLSKLLVINKFFRCCIKVIQFNLTDHTEQEEKTLELES 165
           E+   FT+     +  R   +     L  +  + +  +  I+       + E K      
Sbjct: 351 EL---FTMIKDGKLNARIDAINKSKALQTLENYEKQALDRIRRMNIMAADLEVKGSRKPG 407

Query: 166 IMSDFIFVYITK 177
            M+D  F   T 
Sbjct: 408 GMNDIPFSMTTD 419
>ref|XP_001887599.1| predicted protein [Laccaria bicolor S238N-H82]
 gb|EDR01786.1| predicted protein [Laccaria bicolor S238N-H82]
          Length = 470

 Score = 35.6 bits (81), Expect = 6.0,   Method: Composition-based stats.
 Identities = 18/114 (15%), Positives = 38/114 (33%), Gaps = 14/114 (12%)

Query: 82  ILDLFVSACDIMPSFTKLGLTLQSEILNIFTLDTPQCIETRKIILGDLSKLLV-----IN 136
            LD+   AC+  P+   L + L +++  I +    +            + + V     + 
Sbjct: 312 SLDIAFYACETFPNLNVLPI-LITQLRKITSDSNLEEFRLLADYSPPPNDIHVVDDGWLT 370

Query: 137 KFFRCCIKVIQ-FNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQE 189
           +F  C +   Q  +L     +  K   ++            +M +T     LQ 
Sbjct: 371 RF--CRLPFWQELDLILTHPRFSKLRRVDVSFKR-----TDEMESTSSEFELQM 417
>ref|XP_001459513.1| hypothetical protein GSPATT00024847001 [Paramecium tetraurelia
           strain d4-2]
 emb|CAK92116.1| unnamed protein product [Paramecium tetraurelia]
          Length = 354

 Score = 35.6 bits (81), Expect = 6.3,   Method: Composition-based stats.
 Identities = 35/169 (20%), Positives = 58/169 (34%), Gaps = 19/169 (11%)

Query: 140 RCCIKVIQFNLTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQELIEIFIFQVK 199
           +  +K +Q       E+ +   +   I  +             DV    E I+  I   K
Sbjct: 120 KSTVKKLQIQEIQ--EKVQDLQDEIIICKERQKNTEIAKHQAEDV---LEQIDDDILHAK 174

Query: 200 VKLHH-KKPSPNMYWALCKTLPKLS---------PTLKGLYLSKDVSIEDAI--LNSIDN 247
           +KL   K     +Y ++ K   +            TLK    S+          LN + N
Sbjct: 175 IKLQQVKSQGEQIYESMSKLEDQQHDKKYYMGRTSTLKR-KRSEIQPPNQIKNQLNQLKN 233

Query: 248 KIQKDKAKSKGKQRGVKQKIHHFHEPMLHNSSEEQVKVEDAFNQRTSTD 296
           +I++ K   K K+    QKI    + +    + E +  ED  N   S D
Sbjct: 234 EIEQLKQSKKEKEEYYAQKIREIEQQLDQRKTNESIP-EDQLNASLSPD 281
>emb|CAO80294.1| hypothetical protein [Candidatus Cloacamonas acidaminovorans]
          Length = 552

 Score = 35.2 bits (80), Expect = 7.6,   Method: Composition-based stats.
 Identities = 33/140 (23%), Positives = 53/140 (37%), Gaps = 16/140 (11%)

Query: 150 LTDHTEQEEKTLELESIMSDFIFVYITKMRTTIDVVGLQELIEI--FIF---QVKV---- 200
           L    E       LESI S+   + I  ++TT + + L EL EI  F+F    ++     
Sbjct: 70  LKAIKESRALEARLESIFSELSLLPIKHLKTT-ERLELNELFEIKSFLFSYLHLQEILQE 128

Query: 201 -KLHHKKPSPNMYWALCKTLPKLSPTLKGLYLSKDVSIEDAILNSIDNKIQKDKAKSKGK 259
            KL+H+ P P+M     K    L P    L   +      ++L  +  K      + K  
Sbjct: 129 HKLNHQHPLPDMQ----KMFSLLDPEGNKLPTFRIYPSYSSVLKKLTQKQFAIAKRLKEA 184

Query: 260 QRGVKQKIHH-FHEPMLHNS 278
           ++   +K       P L   
Sbjct: 185 RKRDLEKAKQELGMPTLKEE 204
  Database: All non-redundant GenBank CDS
  translations+PDB+SwissProt+PIR+PRF excluding environmental samples
  from WGS projects
    Posted date:  May 23, 2008  5:56 PM
  Number of letters in database: 883,778,997
  Number of sequences in database:  2,617,685
  
  Database: /host/Blast/data/nr_perl/nr.01
    Posted date:  May 23, 2008  5:54 PM
  Number of letters in database: 976,759,346
  Number of sequences in database:  2,761,413
  
  Database: /host/Blast/data/nr_perl/nr.02
    Posted date:  May 23, 2008  5:48 PM
  Number of letters in database: 374,670,760
  Number of sequences in database:  1,165,270
  
  Database: /host/Blast/data/nr_perl/nr.03
    Posted date:  Apr 28, 2009  5:40 PM
  Number of letters in database: 114,943,120
  Number of sequences in database:  354,819
  
Lambda     K      H
   0.312    0.151    0.395 

Lambda     K      H
   0.267   0.0462    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 3,265,560,962
Number of Sequences: 6899187
Number of extensions: 135341797
Number of successful extensions: 414799
Number of sequences better than 10.0: 220
Number of HSP's better than 10.0 without gapping: 23
Number of HSP's successfully gapped in prelim test: 238
Number of HSP's that attempted gapping in prelim test: 414458
Number of HSP's gapped (non-prelim): 483
length of query: 383
length of database: 2,350,152,223
effective HSP length: 136
effective length of query: 247
effective length of database: 1,411,862,791
effective search space: 348730109377
effective search space used: 348730109377
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.2 bits)
S2: 80 (35.3 bits)