BLASTP 2.2.17 [Aug-26-2007]
Reference:
Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schäffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for composition-based statistics:
Schäffer, Alejandro A., L. Aravind, Thomas L. Madden,
Schaffer, Alejandro A., L. Aravind, Thomas L. Madden,
Sergei Shavirin, John L. Spouge, Yuri I. Wolf,
Eugene V. Koonin, and Stephen F. Altschul (2001),
"Improving the accuracy of PSI-BLAST protein database searches with
composition-based statistics and other refinements", Nucleic Acids Res. 29:2994-3005.
Query= YMR204C__[Saccharomyces_cerevisiae]
(420 letters)
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
6,899,187 sequences; 2,350,152,223 total letters
Searching..................................................done
Results from round 1
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_013931.1| Peripheral membrane protein of peroxisomes... 710 0.0
sp|A6ZMR6|INP1_YEAS7 Inheritance of peroxisomes protein 1 >... 710 0.0
ref|XP_449234.1| unnamed protein product [Candida glabrata]... 129 5e-28
ref|XP_001647343.1| hypothetical protein Kpol_1018p13 [Vand... 114 2e-23
ref|NP_985366.1| AFL184Wp [Ashbya gossypii ATCC 10895] >gi|... 74 3e-11
ref|XP_455364.1| unnamed protein product [Kluyveromyces lac... 66 5e-09
emb|CAN73633.1| hypothetical protein [Vitis vinifera] 39 0.85
ref|XP_567573.1| histone deacetylase [Cryptococcus neoforma... 39 1.0
ref|XP_458910.1| hypothetical protein DEHA0D11176g [Debaryo... 38 1.2
ref|XP_721363.1| hypothetical protein CaO19.6525 [Candida a... 37 2.2
ref|XP_001482722.1| hypothetical protein PGUG_04677 [Pichia... 36 6.6
ref|ZP_01306084.1| Integrins alpha chain [Oceanobacter sp. ... 35 8.1
ref|XP_001392840.1| hypothetical protein An08g07250 [Asperg... 35 9.0
>ref|NP_013931.1| Peripheral membrane protein of peroxisomes involved in peroxisomal
inheritance; Inp1p [Saccharomyces cerevisiae]
sp|Q03694|INP1_YEAST Inheritance of peroxisomes protein 1
emb|CAA88645.1| unknown [Saccharomyces cerevisiae]
Length = 420
Score = 710 bits (1832), Expect = 0.0, Method: Composition-based stats.
Identities = 420/420 (100%), Positives = 420/420 (100%)
Query: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK
Sbjct: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
Query: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT
Sbjct: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
Query: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH
Sbjct: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
Query: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS
Sbjct: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN
Sbjct: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA
Sbjct: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL
Sbjct: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
>sp|A6ZMR6|INP1_YEAS7 Inheritance of peroxisomes protein 1
gb|EDN64140.1| inheritance of peroxisomes [Saccharomyces cerevisiae YJM789]
Length = 420
Score = 710 bits (1832), Expect = 0.0, Method: Composition-based stats.
Identities = 419/420 (99%), Positives = 420/420 (100%)
Query: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK
Sbjct: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
Query: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT
Sbjct: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
Query: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH
Sbjct: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
Query: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS
Sbjct: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDD+
Sbjct: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDD 300
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA
Sbjct: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL
Sbjct: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
>ref|XP_449234.1| unnamed protein product [Candida glabrata]
sp|Q6FKL0|INP1_CANGA Inheritance of peroxisomes protein 1
emb|CAG62208.1| unnamed protein product [Candida glabrata CBS 138]
Length = 342
Score = 129 bits (324), Expect = 5e-28, Method: Composition-based stats.
Identities = 120/372 (32%), Positives = 180/372 (48%), Gaps = 67/372 (18%)
Query: 11 NSVRLTAKQEKKPQ-STFQTLKQSLKLSNNKKLKQDS-----TQHSNDTNKSVKAKKNGT 64
++V+ + E KPQ Q++K +LKL K+ + S KS ++NG
Sbjct: 2 STVKQRTRTEGKPQYPLMQSIKDTLKLKTTKRTNSEKKVSTLNDKSGSPIKSNSERRNGI 61
Query: 65 SSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSG 124
+ K G+ KR+S QR +LF Y NVQV N + N + R++S S N + +
Sbjct: 62 TRKVNGS--KRLSAQRVALFKYNNVQVFNY-------VRNGNNSRKSSMSSLN--SSGTV 110
Query: 125 VFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHV 184
V N+ S+ LKP L+ KG +E+YQI K+ A
Sbjct: 111 VMCNEYDSE----VTNLKPQMLIGKGVLELYQI---------------KTPAALDRKEQT 151
Query: 185 VNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELEN 244
+NY+S+GR G IVHP+LPKL+IT+L FKY I+F NPERYW+IEFL + Q EL
Sbjct: 152 MNYISLGRGGQIVHPILPKLKITKLRNENFKYLITFSNPERYWQIEFLQINGQLHDELLK 211
Query: 245 SVKAFENVISKICQF--SHINEGATIGNNESLSDKFKLPPTS-DIEPPNTEIINNDDDND 301
FE++IS +C F +I + A++ +N K PP + + E + +ND
Sbjct: 212 ITDEFESIISSVCIFIDENITKLASLKSN-------KTPPNNVNTLDQIKEKVQTATEND 264
Query: 302 DDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLY--PSDPTDAVSISINE 359
DD + ++L+YLL++ EQ T +SF Y P+D T+ ++
Sbjct: 265 DD----EAEELDYLLEDPIEQEPT---------------SSFEYDVPTDLTEVFQRTMGR 305
Query: 360 AFKNAIRRTAPV 371
N RR + V
Sbjct: 306 INSNGSRRYSSV 317
>ref|XP_001647343.1| hypothetical protein Kpol_1018p13 [Vanderwaltozyma polyspora DSM
70294]
gb|EDO19485.1| hypothetical protein Kpol_1018p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 511
Score = 114 bits (284), Expect = 2e-23, Method: Composition-based stats.
Identities = 98/286 (34%), Positives = 150/286 (52%), Gaps = 29/286 (10%)
Query: 12 SVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKN----GTSSK 67
S + + +KK QST + +K+SLK ++ K + ++ D ++S K+ + S
Sbjct: 29 SSQTSKTPKKKQQSTLRLIKESLKFKSSSKNSSKPSNNNIDKSESASMFKDTLKRSSISN 88
Query: 68 KTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFF 127
K KR S QR SLF Y NVQV+N P+ S R VS+ + + S
Sbjct: 89 KPRNLEKRSSAQRVSLFKYSNVQVVNCSAPL------SPVQIRKDSVSSGSTSTSRNYDI 142
Query: 128 NDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNY 187
N + S S +T+ +K SL++ G +E+YQI T +K + ++NY
Sbjct: 143 NSSLSVKSTSTVTMKSISLVSHGVMEVYQILT--------------PTKDPNETPQMMNY 188
Query: 188 LSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVK 247
S+GR+G IVHP+LP+LQ+TRL+G Y + FYNP RYW I+FLP+ + + NS+
Sbjct: 189 FSLGRNGKIVHPILPRLQVTRLHGYEASYSVLFYNPTRYWRIDFLPVENLVEKNQLNSIL 248
Query: 248 A-FENVISKICQFSHINEGATIGNNESLSDKFKLP----PTSDIEP 288
A FE+VIS IC+FS +E ++ L + P P ++ EP
Sbjct: 249 ADFESVISSICEFSKESEYTIHNDSRYLQKQIIEPKLLTPVTNNEP 294
>ref|NP_985366.1| AFL184Wp [Ashbya gossypii ATCC 10895]
sp|Q755Q4|INP1_ASHGO Inheritance of peroxisomes protein 1
gb|AAS53190.1| AFL184Wp [Ashbya gossypii ATCC 10895]
Length = 300
Score = 73.6 bits (179), Expect = 3e-11, Method: Composition-based stats.
Identities = 79/288 (27%), Positives = 121/288 (42%), Gaps = 78/288 (27%)
Query: 76 ISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQDS 135
+S R +LF Y +V+V S+ + NS RR + S V D ++
Sbjct: 91 LSNDRVTLFRYEHVRV-TSYQSVRKKYRNSGDSRRGQE------RMESAVKGTDVLRRER 143
Query: 136 QNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGD 195
+ I+ +GP+EIYQI T K P QK V YL +GR
Sbjct: 144 ASVIE-------PRGPLEIYQIITPISK-----EPSQK-----------VTYLCLGRKEQ 180
Query: 196 IVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISK 255
I+ P+LPKL+IT L G ++ + +NPE W+IEFL + S N + AFEN +
Sbjct: 181 IIKPILPKLRITMLTREGLQFSVLSFNPENSWKIEFLGALGDSAVPC-NVILAFENAVKN 239
Query: 256 ICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYL 315
IC++ T +NND + + DDDDL YL
Sbjct: 240 ICRY-------------------------------TSELNNDPIEEVGTE--DDDDLEYL 266
Query: 316 LDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEAFKN 363
L ++ + +T S++ + + S+P+D ++E F N
Sbjct: 267 LYSDFAED---------DDTGSDITLNREHSSEPSD-----LHELFDN 300
>ref|XP_455364.1| unnamed protein product [Kluyveromyces lactis]
sp|Q6CL25|INP1_KLULA Inheritance of peroxisomes protein 1
emb|CAG98072.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
Length = 343
Score = 65.9 bits (159), Expect = 5e-09, Method: Composition-based stats.
Identities = 73/239 (30%), Positives = 110/239 (46%), Gaps = 36/239 (15%)
Query: 185 VNYLSIGRHGDIVHPVLPKLQITRL---NGAGFKYFISFYNPERYWEIEFLPLISQSQSE 241
+NYLS+G+ VHP+LPKL++ R+ FK IS +NPER+WE+ FL +
Sbjct: 96 MNYLSLGKKDQFVHPILPKLKVGRVPSDQKQEFKIIISLFNPERFWEVTFLSI--NHGRV 153
Query: 242 LENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDND 301
+ E V+SKICQ+++ + +N SL D T I P +I+
Sbjct: 154 PRKVINDLEQVLSKICQYNNKDP-----DNASLKDTKN--DTKAIPSPKIDILE------ 200
Query: 302 DDDDNYDDDDLNYLLDEEY-EQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
+ DDL YLL E +Q + S+ N S +++ + S +N+
Sbjct: 201 ------EVDDLEYLLAENVPDQEQIPEAESLELNESSTISSISMEHSLVQSG--DVVNKT 252
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSY--PFIDSPPYLQDRHRRFQ--RRSIS 415
FK A+ R P+ A S + KR+SS+ F+D P + Q RRSIS
Sbjct: 253 FKQALGRIKPIDFNTRAESS-----RYKRFSSFHSAFLDVPRISAEYQHSLQDKRRSIS 306
>emb|CAN73633.1| hypothetical protein [Vitis vinifera]
Length = 275
Score = 38.9 bits (89), Expect = 0.85, Method: Composition-based stats.
Identities = 34/105 (32%), Positives = 51/105 (48%), Gaps = 7/105 (6%)
Query: 288 PPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPS 347
PP+TE+ D+D+DD D+ +D D + DE Y +SFS S SNL+ + L
Sbjct: 58 PPDTELPEADEDHDDGDERWDPDSDMEIDDERYFVSPV-SSFSHCS-IPSNLHFTCLLVL 115
Query: 348 DPTDAVSISINEAFKNAIR-----RTAPVLNIPIAAPSIHSKQQN 387
T +S EA R R + V +PI PS+ +Q++
Sbjct: 116 QRTRKELLSTQEALIQLRRRLHAQRPSDVCPMPIDEPSVKVEQES 160
>ref|XP_567573.1| histone deacetylase [Cryptococcus neoformans var. neoformans JEC21]
ref|XP_773280.1| hypothetical protein CNBJ0580 [Cryptococcus neoformans var.
neoformans B-3501A]
gb|EAL18633.1| hypothetical protein CNBJ0580 [Cryptococcus neoformans var.
neoformans B-3501A]
gb|AAW46056.1| histone deacetylase, putative [Cryptococcus neoformans var.
neoformans JEC21]
Length = 596
Score = 38.5 bits (88), Expect = 1.0, Method: Composition-based stats.
Identities = 22/55 (40%), Positives = 35/55 (63%), Gaps = 2/55 (3%)
Query: 273 SLSDKFKLPPTSDIEPPNTEIINNDD-DNDDDDDNYDDDDLNYLLDEEYEQGCTD 326
SLS++ LPP +I P +T I++++ D+D + DN D + + L+ EE E G TD
Sbjct: 10 SLSEEVPLPPAQEIRPGDTYQIDSEEYDSDQERDNLSDGEYDKLV-EEAENGYTD 63
>ref|XP_458910.1| hypothetical protein DEHA0D11176g [Debaryomyces hansenii CBS767]
emb|CAG87064.1| unnamed protein product [Debaryomyces hansenii CBS767]
Length = 860
Score = 38.1 bits (87), Expect = 1.2, Method: Composition-based stats.
Identities = 47/148 (31%), Positives = 67/148 (45%), Gaps = 33/148 (22%)
Query: 121 ESSGVFFNDTQSQDSQNTI-KLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASS 179
+SS + D Q D++ + KL L+ G EI+Q+
Sbjct: 261 KSSRILVFDEQLNDNKTDMEKLSSGRLLGHGEFEIFQL---------------------- 298
Query: 180 HDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLP----LI 235
H+G V YLS G V+P+LPKL+I R+N F + NPERYW+I LI
Sbjct: 299 HNGDV-TYLSCGP--SFVYPLLPKLKILRINFNQF--ILPLVNPERYWKISINSDDKNLI 353
Query: 236 SQSQSELENSVKAFENVISKICQFSHIN 263
+ LE +VK + N+ I Q S I+
Sbjct: 354 HLLERTLERNVK-YRNLFFGINQLSSID 380
>ref|XP_721363.1| hypothetical protein CaO19.6525 [Candida albicans SC5314]
ref|XP_720850.1| hypothetical protein CaO19_13878 [Candida albicans SC5314]
ref|XP_888816.1| hypothetical protein CaJ7.0223 [Candida albicans SC5314]
gb|EAL02027.1| hypothetical protein CaO19.13878 [Candida albicans SC5314]
gb|EAL02561.1| hypothetical protein CaO19.6525 [Candida albicans SC5314]
dbj|BAE44713.1| hypothetical protein [Candida albicans]
Length = 684
Score = 37.4 bits (85), Expect = 2.2, Method: Composition-based stats.
Identities = 34/118 (28%), Positives = 51/118 (43%), Gaps = 16/118 (13%)
Query: 185 VNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELEN 244
V YLS G + ++P+LPK+++ R+N F + NPERYW+I SE N
Sbjct: 219 VTYLSCG--NNFIYPLLPKIKVFRINSNQF--LLPLVNPERYWKIFI-------NSEELN 267
Query: 245 SVKAFENVISKICQFSHINEGATIGNNE-----SLSDKFKLPPTSDIEPPNTEIINND 297
+ NV + QF ++E E +D P PP I+N+
Sbjct: 268 VINNLINVFQRNVQFISLHESENKNAPEPKSQFEFTDSINTKPKELESPPRNIYISNE 325
>ref|XP_001482722.1| hypothetical protein PGUG_04677 [Pichia guilliermondii ATCC 6260]
gb|EDK40579.1| hypothetical protein PGUG_04677 [Pichia guilliermondii ATCC 6260]
Length = 714
Score = 35.8 bits (81), Expect = 6.6, Method: Composition-based stats.
Identities = 51/197 (25%), Positives = 83/197 (42%), Gaps = 41/197 (20%)
Query: 60 KKNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVP---IHNDIPNSSC--IRRNSQV 114
K+ KK+ KR + F L GN++ NSF+ + +P + N +V
Sbjct: 154 KRTLMRHKKSDNDMKR---ESFILDAKGNLRNRNSFIEGADTYLQLPKEPLGPLNINEKV 210
Query: 115 SANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKS 174
+ + V +++ DS +I + +L+ G E++Q+
Sbjct: 211 TLFQFNSAKVVLYHEMV--DSNQSIPKESGTLLGHGSFEVFQL----------------- 251
Query: 175 SKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEI----E 230
H G V YLS G V+P+LPKL+I R N F + NPERYW I E
Sbjct: 252 -----HKGDV-TYLSCG--SSFVYPLLPKLKILRTNFNQF--ILPLANPERYWRISLNTE 301
Query: 231 FLPLISQSQSELENSVK 247
+ +I + + LE ++
Sbjct: 302 DVKIIRRLERTLEEKIQ 318
>ref|ZP_01306084.1| Integrins alpha chain [Oceanobacter sp. RED65]
gb|EAT13274.1| Integrins alpha chain [Oceanobacter sp. RED65]
Length = 710
Score = 35.4 bits (80), Expect = 8.1, Method: Composition-based stats.
Identities = 25/67 (37%), Positives = 33/67 (49%), Gaps = 4/67 (5%)
Query: 276 DKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEY----EQGCTDNSFSV 331
D FK P+ ++ N I NN D +DD D D DD L EE+ E G DN+
Sbjct: 118 DAFKNDPSETLDTDNDGIGNNADTDDDGDGVADIDDPFPLNSEEWADVDEDGVGDNADLH 177
Query: 332 ISNTCSN 338
++NT N
Sbjct: 178 LNNTSQN 184
>ref|XP_001392840.1| hypothetical protein An08g07250 [Aspergillus niger]
emb|CAK39972.1| unnamed protein product [Aspergillus niger]
Length = 946
Score = 35.4 bits (80), Expect = 9.0, Method: Composition-based stats.
Identities = 23/70 (32%), Positives = 34/70 (48%)
Query: 9 KKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSSKK 68
+K V+ T+KQE KP + L QSLK + L + + H K K KKN + +
Sbjct: 291 QKQPVKKTSKQELKPDGDDRPLWQSLKAGVRRVLTRKAVSHDGGALKHSKTKKNKYQNVR 350
Query: 69 TGTQRKRIST 78
+ R+ST
Sbjct: 351 RASIIVRMST 360
Searching..................................................done
Results from round 2
Score E
Sequences producing significant alignments: (bits) Value
Sequences used in model and found again:
ref|NP_013931.1| Peripheral membrane protein of peroxisomes... 615 e-174
sp|A6ZMR6|INP1_YEAS7 Inheritance of peroxisomes protein 1 >... 615 e-174
ref|XP_449234.1| unnamed protein product [Candida glabrata]... 342 4e-92
ref|XP_001647343.1| hypothetical protein Kpol_1018p13 [Vand... 282 4e-74
ref|NP_985366.1| AFL184Wp [Ashbya gossypii ATCC 10895] >gi|... 247 1e-63
ref|XP_455364.1| unnamed protein product [Kluyveromyces lac... 243 2e-62
Sequences not found previously or not previously below threshold:
ref|XP_001482722.1| hypothetical protein PGUG_04677 [Pichia... 51 2e-04
ref|XP_721363.1| hypothetical protein CaO19.6525 [Candida a... 48 0.001
ref|XP_458910.1| hypothetical protein DEHA0D11176g [Debaryo... 48 0.002
ref|XP_001523637.1| hypothetical protein LELG_05053 [Lodder... 44 0.020
ref|XP_388140.1| hypothetical protein FG07964.1 [Gibberella... 41 0.17
ref|YP_944121.1| protein containing tetratricopeptide (TPR)... 40 0.39
ref|XP_001881199.1| predicted protein [Laccaria bicolor S23... 38 0.95
ref|YP_796256.1| ABC-type oligopeptide transport system, pe... 38 0.96
ref|XP_001734409.1| hypothetical protein EDI_325820 [Entamo... 38 1.5
ref|NP_784891.1| cell surface protein precursor [Lactobacil... 38 1.7
ref|ZP_03916763.1| transcription termination factor NusA [A... 37 2.6
ref|YP_001406022.1| anaerobic ribonucleoside-triphosphate r... 37 3.0
ref|XP_001793300.1| hypothetical protein SNOG_02702 [Phaeos... 37 3.1
ref|XP_001434978.1| hypothetical protein GSPATT00006888001 ... 37 4.1
ref|XP_001349291.1| histone acetyltransferase Gcn5, putativ... 36 5.4
ref|XP_001329685.1| hypothetical protein TVAG_453780 [Trich... 36 5.4
ref|XP_635620.1| hypothetical protein DDBDRAFT_0216167 [Dic... 36 6.3
gb|AAR20863.1| histone acetyltransferase [Plasmodium falcip... 36 6.4
ref|XP_001645330.1| hypothetical protein Kpol_1058p9 [Vande... 36 6.7
ref|YP_926979.1| AAA ATPase containing von Willebrand facto... 36 6.9
ref|XP_393101.3| PREDICTED: similar to pawn CG11101-PA, par... 36 6.9
ref|ZP_02368955.1| antigen, P35, putative [Borrelia valaisi... 36 7.0
>ref|NP_013931.1| Peripheral membrane protein of peroxisomes involved in peroxisomal
inheritance; Inp1p [Saccharomyces cerevisiae]
sp|Q03694|INP1_YEAST Inheritance of peroxisomes protein 1
emb|CAA88645.1| unknown [Saccharomyces cerevisiae]
Length = 420
Score = 615 bits (1585), Expect = e-174, Method: Composition-based stats.
Identities = 420/420 (100%), Positives = 420/420 (100%)
Query: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK
Sbjct: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
Query: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT
Sbjct: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
Query: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH
Sbjct: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
Query: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS
Sbjct: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN
Sbjct: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA
Sbjct: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL
Sbjct: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
>sp|A6ZMR6|INP1_YEAS7 Inheritance of peroxisomes protein 1
gb|EDN64140.1| inheritance of peroxisomes [Saccharomyces cerevisiae YJM789]
Length = 420
Score = 615 bits (1585), Expect = e-174, Method: Composition-based stats.
Identities = 419/420 (99%), Positives = 420/420 (100%)
Query: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK
Sbjct: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
Query: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT
Sbjct: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
Query: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH
Sbjct: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
Query: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS
Sbjct: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDD+
Sbjct: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDD 300
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA
Sbjct: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL
Sbjct: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
>ref|XP_449234.1| unnamed protein product [Candida glabrata]
sp|Q6FKL0|INP1_CANGA Inheritance of peroxisomes protein 1
emb|CAG62208.1| unnamed protein product [Candida glabrata CBS 138]
Length = 342
Score = 342 bits (877), Expect = 4e-92, Method: Composition-based stats.
Identities = 117/372 (31%), Positives = 178/372 (47%), Gaps = 67/372 (18%)
Query: 11 NSVRLTAKQEKKPQ-STFQTLKQSLKLSNNKKLKQDS-----TQHSNDTNKSVKAKKNGT 64
++V+ + E KPQ Q++K +LKL K+ + S KS ++NG
Sbjct: 2 STVKQRTRTEGKPQYPLMQSIKDTLKLKTTKRTNSEKKVSTLNDKSGSPIKSNSERRNGI 61
Query: 65 SSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSG 124
+ K G+ KR+S QR +LF Y NVQV N + N + R+ ++ + +
Sbjct: 62 TRKVNGS--KRLSAQRVALFKYNNVQVFNY-------VRNGNNSRK--SSMSSLNSSGTV 110
Query: 125 VFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHV 184
V N+ S+ LKP L+ KG +E+YQI K+ A
Sbjct: 111 VMCNEYDSE----VTNLKPQMLIGKGVLELYQI---------------KTPAALDRKEQT 151
Query: 185 VNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELEN 244
+NY+S+GR G IVHP+LPKL+IT+L FKY I+F NPERYW+IEFL + Q EL
Sbjct: 152 MNYISLGRGGQIVHPILPKLKITKLRNENFKYLITFSNPERYWQIEFLQINGQLHDELLK 211
Query: 245 SVKAFENVISKICQF--SHINEGATIGNNESLSDKFKLPPTS-DIEPPNTEIINNDDDND 301
FE++IS +C F +I + A++ +N K PP + + E + +ND
Sbjct: 212 ITDEFESIISSVCIFIDENITKLASLKSN-------KTPPNNVNTLDQIKEKVQTATEND 264
Query: 302 DDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLY--PSDPTDAVSISINE 359
DD + ++L+YLL++ EQ T +SF Y P+D T+ ++
Sbjct: 265 DD----EAEELDYLLEDPIEQEPT---------------SSFEYDVPTDLTEVFQRTMGR 305
Query: 360 AFKNAIRRTAPV 371
N RR + V
Sbjct: 306 INSNGSRRYSSV 317
>ref|XP_001647343.1| hypothetical protein Kpol_1018p13 [Vanderwaltozyma polyspora DSM
70294]
gb|EDO19485.1| hypothetical protein Kpol_1018p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 511
Score = 282 bits (721), Expect = 4e-74, Method: Composition-based stats.
Identities = 103/316 (32%), Positives = 159/316 (50%), Gaps = 37/316 (11%)
Query: 12 SVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKN----GTSSK 67
S + + +KK QST + +K+SLK ++ K + ++ D ++S K+ + S
Sbjct: 29 SSQTSKTPKKKQQSTLRLIKESLKFKSSSKNSSKPSNNNIDKSESASMFKDTLKRSSISN 88
Query: 68 KTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFF 127
K KR S QR SLF Y NVQV+N P+ S R VS+ + + S
Sbjct: 89 KPRNLEKRSSAQRVSLFKYSNVQVVNCSAPL------SPVQIRKDSVSSGSTSTSRNYDI 142
Query: 128 NDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNY 187
N + S S +T+ +K SL++ G +E+YQI T +K + ++NY
Sbjct: 143 NSSLSVKSTSTVTMKSISLVSHGVMEVYQILTP--------------TKDPNETPQMMNY 188
Query: 188 LSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVK 247
S+GR+G IVHP+LP+LQ+TRL+G Y + FYNP RYW I+FLP+ + + NS+
Sbjct: 189 FSLGRNGKIVHPILPRLQVTRLHGYEASYSVLFYNPTRYWRIDFLPVENLVEKNQLNSIL 248
Query: 248 A-FENVISKICQFSHINEGATIGNNESLSDKFKLP----PTSDIEPPNT--------EII 294
A FE+VIS IC+FS +E ++ L + P P ++ EP II
Sbjct: 249 ADFESVISSICEFSKESEYTIHNDSRYLQKQIIEPKLLTPVTNNEPNRLFENLKHTDAII 308
Query: 295 NNDDDNDDDDDNYDDD 310
+ + N+ N + D
Sbjct: 309 SAANINNIHTSNNEKD 324
Score = 42.0 bits (97), Expect = 0.098, Method: Composition-based stats.
Identities = 39/131 (29%), Positives = 50/131 (38%), Gaps = 20/131 (15%)
Query: 275 SDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISN 334
+ T I P + D DD DDLNYLL +E+ I +
Sbjct: 349 KEVINEIETISIPSPIETVKPIPSTEDVDDQ----DDLNYLL---FEEEEQPQKDLKIYD 401
Query: 335 TCSNLNASFL--------YPSDPTDAVSI--SINEAFKNAIRRTAPVLNI--PIAAPSIH 382
S +N+ + S TD IN+AFK AIR P N+ I S H
Sbjct: 402 DISRVNSISERILYDNNSFYSSTTDYFEQDAEINQAFKRAIRNFGPNRNLKEKIYEHSNH 461
Query: 383 SKQQNKRYSSY 393
S KR+SSY
Sbjct: 462 SVA-TKRFSSY 471
>ref|NP_985366.1| AFL184Wp [Ashbya gossypii ATCC 10895]
sp|Q755Q4|INP1_ASHGO Inheritance of peroxisomes protein 1
gb|AAS53190.1| AFL184Wp [Ashbya gossypii ATCC 10895]
Length = 300
Score = 247 bits (631), Expect = 1e-63, Method: Composition-based stats.
Identities = 79/288 (27%), Positives = 121/288 (42%), Gaps = 78/288 (27%)
Query: 76 ISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQDS 135
+S R +LF Y +V+V S+ + NS RR + S V D ++
Sbjct: 91 LSNDRVTLFRYEHVRV-TSYQSVRKKYRNSGDSRRGQE------RMESAVKGTDVLRRER 143
Query: 136 QNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGD 195
+ I+ +GP+EIYQI T K P QK V YL +GR
Sbjct: 144 ASVIE-------PRGPLEIYQIITPISK-----EPSQK-----------VTYLCLGRKEQ 180
Query: 196 IVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISK 255
I+ P+LPKL+IT L G ++ + +NPE W+IEFL + S N + AFEN +
Sbjct: 181 IIKPILPKLRITMLTREGLQFSVLSFNPENSWKIEFLGALGDSAVPC-NVILAFENAVKN 239
Query: 256 ICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYL 315
IC++ T +NND + + DDDDL YL
Sbjct: 240 ICRY-------------------------------TSELNNDPIEEVGTE--DDDDLEYL 266
Query: 316 LDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEAFKN 363
L ++ + +T S++ + + S+P+D ++E F N
Sbjct: 267 LYSDFAED---------DDTGSDITLNREHSSEPSD-----LHELFDN 300
>ref|XP_455364.1| unnamed protein product [Kluyveromyces lactis]
sp|Q6CL25|INP1_KLULA Inheritance of peroxisomes protein 1
emb|CAG98072.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
Length = 343
Score = 243 bits (619), Expect = 2e-62, Method: Composition-based stats.
Identities = 87/329 (26%), Positives = 139/329 (42%), Gaps = 64/329 (19%)
Query: 96 VPIHNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLM-AKGPIEI 154
P + + + R++ Q + +F +D S + K +++ +K EI
Sbjct: 33 QPWNPQVRTDNTKRKSVQ--------RTTLFRHDYVKVRSWPSDNPKNINVILSKTSFEI 84
Query: 155 YQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRL---NG 211
Y+I G K + +NYLS+G+ VHP+LPKL++ R+
Sbjct: 85 YEIDMGNSKER-------------------MNYLSLGKKDQFVHPILPKLKVGRVPSDQK 125
Query: 212 AGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNN 271
FK IS +NPER+WE+ FL + + E V+SKICQ+++ + +N
Sbjct: 126 QEFKIIISLFNPERFWEVTFLSI--NHGRVPRKVINDLEQVLSKICQYNNKDP-----DN 178
Query: 272 ESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEY-EQGCTDNSFS 330
SL D T I P +I+ + DDL YLL E +Q + S
Sbjct: 179 ASLKDTKN--DTKAIPSPKIDILE------------EVDDLEYLLAENVPDQEQIPEAES 224
Query: 331 VISNTCSNLNASFLYPSDPTDAVSISINEAFKNAIRRTAPVLNIPIAAPSIHSKQQNKRY 390
+ N S +++ + S +N+ FK A+ R P+ A S + KR+
Sbjct: 225 LELNESSTISSISMEHSLVQSG--DVVNKTFKQALGRIKPIDFNTRAESS-----RYKRF 277
Query: 391 SSY--PFIDSPPYLQDRHRRFQ--RRSIS 415
SS+ F+D P + Q RRSIS
Sbjct: 278 SSFHSAFLDVPRISAEYQHSLQDKRRSIS 306
>ref|XP_001482722.1| hypothetical protein PGUG_04677 [Pichia guilliermondii ATCC 6260]
gb|EDK40579.1| hypothetical protein PGUG_04677 [Pichia guilliermondii ATCC 6260]
Length = 714
Score = 51.2 bits (121), Expect = 2e-04, Method: Composition-based stats.
Identities = 62/305 (20%), Positives = 112/305 (36%), Gaps = 74/305 (24%)
Query: 19 QEKKPQSTFQTLKQSLKLSN-------NKKLKQDSTQHSNDTNKSVKAKKN----GTSSK 67
Q+ +PQS Q + S ++S+ K K+D T+ + + ++ + +S
Sbjct: 90 QKTEPQS--QLNEDSSRISSRLNEPTFTKDTKEDKTKDTKEEIDALALVETLNPPTLTSN 147
Query: 68 KTGTQRKR-----------ISTQRFSLFTYGNVQVMNSFVPIHNDI------PNSSCIRR 110
+ RKR + + F L GN++ NSF+ + P
Sbjct: 148 VPMSPRKRTLMRHKKSDNDMKRESFILDAKGNLRNRNSFIEGADTYLQLPKEPLGPLNI- 206
Query: 111 NSQVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAP 170
N +V+ + V +++ DS +I + +L+ G E++Q+ G
Sbjct: 207 NEKVTLFQFNSAKVVLYHEM--VDSNQSIPKESGTLLGHGSFEVFQLHKGD--------- 255
Query: 171 FQKSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIE 230
V YLS G V+P+LPKL+I R N ++ + NPERYW I
Sbjct: 256 --------------VTYLSCG--SSFVYPLLPKLKILRTNFN--QFILPLANPERYWRIS 297
Query: 231 FLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLP-------PT 283
+E ++ E + + Q+ + + + + PT
Sbjct: 298 -------LNTEDVKIIRRLERTLEEKIQYRNFYLEDSFKDEDYTQTIANGSTHTPLPYPT 350
Query: 284 SDIEP 288
S P
Sbjct: 351 SATPP 355
>ref|XP_721363.1| hypothetical protein CaO19.6525 [Candida albicans SC5314]
ref|XP_720850.1| hypothetical protein CaO19_13878 [Candida albicans SC5314]
ref|XP_888816.1| hypothetical protein CaJ7.0223 [Candida albicans SC5314]
gb|EAL02027.1| hypothetical protein CaO19.13878 [Candida albicans SC5314]
gb|EAL02561.1| hypothetical protein CaO19.6525 [Candida albicans SC5314]
dbj|BAE44713.1| hypothetical protein [Candida albicans]
Length = 684
Score = 48.1 bits (113), Expect = 0.001, Method: Composition-based stats.
Identities = 38/154 (24%), Positives = 65/154 (42%), Gaps = 31/154 (20%)
Query: 145 SLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGDIVHPVLPKL 204
SL+A G EI+Q+ G V YLS G + ++P+LPK+
Sbjct: 202 SLLAHGIFEIFQLHQGD-----------------------VTYLSCGNN--FIYPLLPKI 236
Query: 205 QITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINE 264
++ R+N ++ + NPERYW+I + S+ + + N + F+ + I N+
Sbjct: 237 KVFRINSN--QFLLPLVNPERYWKIF---INSEELNVINNLINVFQRNVQFISLHESENK 291
Query: 265 GATI-GNNESLSDKFKLPPTSDIEPPNTEIINND 297
A + +D P PP I+N+
Sbjct: 292 NAPEPKSQFEFTDSINTKPKELESPPRNIYISNE 325
>ref|XP_458910.1| hypothetical protein DEHA0D11176g [Debaryomyces hansenii CBS767]
emb|CAG87064.1| unnamed protein product [Debaryomyces hansenii CBS767]
Length = 860
Score = 47.7 bits (112), Expect = 0.002, Method: Composition-based stats.
Identities = 57/237 (24%), Positives = 96/237 (40%), Gaps = 43/237 (18%)
Query: 114 VSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQK 173
V+ S + F++ + + + KL L+ G EI+Q+ G
Sbjct: 255 VTLFKYKSSRILVFDEQLNDNKTDMEKLSSGRLLGHGEFEIFQLHNGD------------ 302
Query: 174 SSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLP 233
V YLS G V+P+LPKL+I R+N ++ + NPERYW+I
Sbjct: 303 -----------VTYLSCG--PSFVYPLLPKLKILRINFN--QFILPLVNPERYWKISINS 347
Query: 234 ----LISQSQSELENSVKAFENVISKICQFSHIN--EGATIGNNESLSDKFKLPPTSDIE 287
LI + LE +VK + N+ I Q S I+ E I N + + K P S
Sbjct: 348 DDKNLIHLLERTLERNVK-YRNLFFGINQLSSIDVCEDMEIPKNVANGKETKQLPVSGT- 405
Query: 288 PPNTEIINNDDDNDDDDDNYDDDDLNY---LLDEEYEQGCTDNSFSVISNTCSNLNA 341
I+N+ ++ + ++ +NY ++ E + S + +N+N
Sbjct: 406 -----ILNSSSGSEKHEIPVTENPMNYQFPIIFNEIPESPPSAPLSPHNENITNVNL 457
>ref|XP_001523637.1| hypothetical protein LELG_05053 [Lodderomyces elongisporus NRRL
YB-4239]
gb|EDK46872.1| hypothetical protein LELG_05053 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 944
Score = 44.3 bits (103), Expect = 0.020, Method: Composition-based stats.
Identities = 50/214 (23%), Positives = 90/214 (42%), Gaps = 54/214 (25%)
Query: 142 KPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGDIVHPVL 201
+ SL+A G IEI+Q+ G V YLS G ++P+L
Sbjct: 270 QSGSLLAHGEIEIFQLHNGD-----------------------VTYLSCGGK-LFIYPLL 305
Query: 202 PKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSH 261
PK+++ R++ ++ + NPERYW+I E +++ E V+ Q+ +
Sbjct: 306 PKIKVLRISFN--QFLLPLLNPERYWKI-------YVDCEDVSTLDCLEGVLKWNVQYIN 356
Query: 262 INEGATIGNNESLSDKFKLPPTSDIEPPNTEI-INNDDDNDDDDDNYDDDDLNYLLDEEY 320
I+ +DK DI+ P + I + +D +D +D D +D +
Sbjct: 357 IS-----------NDKSDKSDKLDIDDPKHQKHIKHQNDQNDRNDQNDKNDQVF------ 399
Query: 321 EQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVS 354
+++S V NT S L P++ ++ +S
Sbjct: 400 ---QSNSSKLVSRNTFSTPKHLDLQPAELSENLS 430
>ref|XP_388140.1| hypothetical protein FG07964.1 [Gibberella zeae PH-1]
Length = 948
Score = 41.2 bits (95), Expect = 0.17, Method: Composition-based stats.
Identities = 32/140 (22%), Positives = 56/140 (40%), Gaps = 5/140 (3%)
Query: 260 SHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEE 319
SH +T N + T+ + ++ND D + D D D + + + E
Sbjct: 23 SHEPIMSTRRTNAIEVRESIEAKTAGTDVEMKNTMDNDIDAEGDPDVDMDAEGDEDAEGE 82
Query: 320 YEQGCTDNSFSVISNTCSNLNASF-LYPSDPTDAVSISINEAFKNAIRRTAP----VLNI 374
+ + + +I N + L + Y D D+ + RRT P +++
Sbjct: 83 VDDEGRPDMYRLIHNLSTYLCSVEDEYAWDLFDSGEQLAAGFQRIPNRRTLPDYFEIISE 142
Query: 375 PIAAPSIHSKQQNKRYSSYP 394
PIA +I K Q K+YSS+
Sbjct: 143 PIAFSTIRGKTQKKQYSSFA 162
>ref|YP_944121.1| protein containing tetratricopeptide (TPR) repeat [Psychromonas
ingrahamii 37]
gb|ABM04522.1| protein containing tetratricopeptide (TPR) repeat [Psychromonas
ingrahamii 37]
Length = 657
Score = 40.0 bits (92), Expect = 0.39, Method: Composition-based stats.
Identities = 31/148 (20%), Positives = 64/148 (43%), Gaps = 8/148 (5%)
Query: 7 ETKKNSVRLTAKQEKK---PQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNG 63
E +KN+ + +++K Q + K S + NN++ K + Q +N+ K+ + +KN
Sbjct: 473 EQQKNNEQQKNSEQQKNNEQQKNSEQQKNSEQQKNNEQQKNNEQQKNNEQQKNNEQQKNN 532
Query: 64 TSSK-----KTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANN 118
K K Q+K Q+ S N Q NS +N+ ++ ++N++ N+
Sbjct: 533 EQQKNNEQQKNNEQQKNNEQQKNSEQQKNNEQQKNSEQQKNNEQQKNNEQQKNNEQQTND 592
Query: 119 VTESSGVFFNDTQSQDSQNTIKLKPTSL 146
+ + T+ + ++ P L
Sbjct: 593 KQQMTAKEVQSTEQPEVNQELEALPNWL 620
Score = 35.4 bits (80), Expect = 9.1, Method: Composition-based stats.
Identities = 32/144 (22%), Positives = 60/144 (41%), Gaps = 12/144 (8%)
Query: 7 ETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSS 66
E +KNS + + Q + K + + N+++ K Q +N+ K+ + +KN
Sbjct: 467 EQQKNSEQQK---NNEQQKNSEQQKNNEQQKNSEQQKNSEQQKNNEQQKNNEQQKNNEQQ 523
Query: 67 K-----KTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTE 121
K K Q+K Q+ + N Q NS +N+ +S ++N++ NN +
Sbjct: 524 KNNEQQKNNEQQKNNEQQKNNEQQKNNEQQKNSEQQKNNEQQKNSEQQKNNEQQKNNEQQ 583
Query: 122 SSGVFFNDTQSQDSQNTIKLKPTS 145
N+ Q+ D Q + S
Sbjct: 584 ----KNNEQQTNDKQQMTAKEVQS 603
>ref|XP_001881199.1| predicted protein [Laccaria bicolor S238N-H82]
gb|EDR08129.1| predicted protein [Laccaria bicolor S238N-H82]
Length = 1974
Score = 38.5 bits (88), Expect = 0.95, Method: Composition-based stats.
Identities = 23/132 (17%), Positives = 44/132 (33%), Gaps = 8/132 (6%)
Query: 232 LPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNT 291
LP+ Q L V + I + Q + + E SD P + +
Sbjct: 1653 LPI--QEHRGLRRIVYSVVEDIPALLQSGFEVPLSISESQELTSDVVSEPLNEGSKDDIS 1710
Query: 292 EIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTD 351
+ ++ND D + + D +L Y L E + + + +P++
Sbjct: 1711 DPVDNDVHEDPTEHDADPLNLGYKLVSE------PLNEGREDDIADPIYGIHEHPTEYET 1764
Query: 352 AVSISINEAFKN 363
SI I + +
Sbjct: 1765 DHSIVIEDISQQ 1776
>ref|YP_796256.1| ABC-type oligopeptide transport system, periplasmic component
[Lactobacillus brevis ATCC 367]
gb|ABJ65225.1| ABC-type oligopeptide transport system, periplasmic component
[Lactobacillus brevis ATCC 367]
Length = 547
Score = 38.5 bits (88), Expect = 0.96, Method: Composition-based stats.
Identities = 38/147 (25%), Positives = 63/147 (42%), Gaps = 12/147 (8%)
Query: 10 KNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSSKKT 69
K +V LT E+ +ST S L N KK+ D T S K+ K K+ T
Sbjct: 239 KKAVHLTKINEQVTEST----TTSYNLYNAKKV--DETLLSGQQIKNNKNSKDFVKRLPT 292
Query: 70 GTQRKRISTQRFSLFTYGNVQ-----VMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSG 124
G+QR ++ ++ S F NV+ +N +++ + + S + + +G
Sbjct: 293 GSQRLDLNEKKVSAFKNVNVRKAFSLAINRTQLVNDVLQDGSVASKGFVPAGMGNNPKTG 352
Query: 125 VFFNDTQSQDSQNTIKLK-PTSLMAKG 150
FN S S + +K +L+AKG
Sbjct: 353 EAFNQEASIKSAVSYNVKQAKTLLAKG 379
>ref|XP_001734409.1| hypothetical protein EDI_325820 [Entamoeba dispar SAW760]
gb|EDR29440.1| hypothetical protein EDI_325820 [Entamoeba dispar SAW760]
Length = 469
Score = 38.1 bits (87), Expect = 1.5, Method: Composition-based stats.
Identities = 19/84 (22%), Positives = 35/84 (41%), Gaps = 6/84 (7%)
Query: 213 GFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNE 272
GF Y + E +I F+P S+ + + N + FE++I KI F+ I++ ++
Sbjct: 310 GFSYLLK----ENVIQIIFIPKNSEREVQPLNEL--FESMIHKINNFNKIDKLKITNESK 363
Query: 273 SLSDKFKLPPTSDIEPPNTEIINN 296
L +E + N
Sbjct: 364 ELIKLINNWKIESVEGNVIDAFKN 387
>ref|NP_784891.1| cell surface protein precursor [Lactobacillus plantarum WCFS1]
emb|CAD63738.1| cell surface protein precursor [Lactobacillus plantarum WCFS1]
Length = 1010
Score = 37.7 bits (86), Expect = 1.7, Method: Composition-based stats.
Identities = 28/132 (21%), Positives = 50/132 (37%), Gaps = 15/132 (11%)
Query: 12 SVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDT---NKSVKAKKNGTSSKK 68
+ + + + Q+T + ++S N + S Q +N N++ + K +SS K
Sbjct: 105 TSQENSSVQNTSQATSTSKQESSSTKNTSQTTSTSNQEANSAKSINQTTRTSKQESSSTK 164
Query: 69 TGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFN 128
+Q S Q NS I+ S+ +S + + T +S N
Sbjct: 165 NTSQTTSTSNQE-----------ANSAKSINQTTRTSN-QESSSAKNTSQTTSTSSRKIN 212
Query: 129 DTQSQDSQNTIK 140
T+SQ TI
Sbjct: 213 STKSQAQSLTIT 224
>ref|ZP_03916763.1| transcription termination factor NusA [Anaerococcus lactolyticus
ATCC 51172]
gb|EEI85643.1| transcription termination factor NusA [Anaerococcus lactolyticus
ATCC 51172]
Length = 407
Score = 37.0 bits (84), Expect = 2.6, Method: Composition-based stats.
Identities = 49/275 (17%), Positives = 90/275 (32%), Gaps = 17/275 (6%)
Query: 79 QRFSLFT-YGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQDSQN 137
QR + Y N+ + VP+ +P S I +SG S+ S+N
Sbjct: 143 QRVDKYNVYVNLDRIEGIVPLKEQVPTESYIPNERMKFLIKEVRNSGKDPQIVLSRSSEN 202
Query: 138 TIKLKPTSLMAK---GPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHG 194
+ + + G IEIY I + + + IG G
Sbjct: 203 LVTRLFELEVPEITDGVIEIYSIAREAGSRTKMAVFSNDETIDAVGA-------CIGYKG 255
Query: 195 DIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVIS 254
V+ ++ LQ +++ + I + + + + +++ V + +
Sbjct: 256 ARVNSIVDSLQGEKIDIINYSKDIESFISNALSPADIIEVYVNERNKQSLVVVEDDQLSL 315
Query: 255 KICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLN- 313
I + A + K KL S + EI++ +++ D+ + DD DL
Sbjct: 316 AIGKEGQNARLAARLTGWKIDIKSKLEFDSLSQEEIDEILSINEEEVSDEKSTDDKDLEA 375
Query: 314 YLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSD 348
Y L EE D S P+D
Sbjct: 376 YDLAEENASDDKDEEISDEDKE-----EISKEPTD 405
>ref|YP_001406022.1| anaerobic ribonucleoside-triphosphate reductase [Campylobacter
hominis ATCC BAA-381]
gb|ABS52535.1| anaerobic ribonucleoside-triphosphate reductase [Campylobacter
hominis ATCC BAA-381]
Length = 893
Score = 37.0 bits (84), Expect = 3.0, Method: Composition-based stats.
Identities = 32/127 (25%), Positives = 52/127 (40%), Gaps = 13/127 (10%)
Query: 20 EKKPQSTFQTLKQSLKLSNNKKLKQ---DSTQHSNDTNKSVKAKKNG---TSSKKTGTQR 73
+K FQ L +L +N+K Q + ++ N KNG S+ K
Sbjct: 70 KKYKNEKFQNLADNL--GDNRKNNQNLSEKNENEISQNDETIQNKNGEIKISNDKNTQNL 127
Query: 74 KRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQ 133
K++S + S N Q + S DI NSS I +N ++S + E + +N++
Sbjct: 128 KKVSNENLS----EN-QNLKSDFFKETDIKNSSIILQNDEISNLTLKEKNFAKYNESAQN 182
Query: 134 DSQNTIK 140
S T
Sbjct: 183 SSGKTQN 189
>ref|XP_001793300.1| hypothetical protein SNOG_02702 [Phaeosphaeria nodorum SN15]
gb|EAT89433.2| hypothetical protein SNOG_02702 [Phaeosphaeria nodorum SN15]
Length = 582
Score = 37.0 bits (84), Expect = 3.1, Method: Composition-based stats.
Identities = 25/110 (22%), Positives = 47/110 (42%), Gaps = 4/110 (3%)
Query: 194 GDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVI 253
G ++H + P+ Q ++G F+ + Y+ IE LP E + + F+ V+
Sbjct: 118 GTLLHTIFPRSQCWCVDGQ--SIFVLRVRQDSYYRIE-LPYDRVMTDEDKAQISEFKRVL 174
Query: 254 SKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPN-TEIINNDDDNDD 302
S++ Q+ G+N ++ K PP ++ T+ N D D
Sbjct: 175 SQVLQYEKTQCPFRRGSNVEEIERPKSPPRKQLKKQQPTQKAKNVDLRQD 224
>ref|XP_001434978.1| hypothetical protein GSPATT00006888001 [Paramecium tetraurelia strain
d4-2]
emb|CAK67581.1| unnamed protein product [Paramecium tetraurelia]
Length = 5605
Score = 36.6 bits (83), Expect = 4.1, Method: Composition-based stats.
Identities = 52/340 (15%), Positives = 111/340 (32%), Gaps = 39/340 (11%)
Query: 8 TKKNSVRLTAKQEKKPQSTFQTLKQSLKL-SNNKKLKQDSTQHSNDTNKSVKAKKNGTSS 66
T + + ++ + ++ Q QT +++ ++ K+ Q++ + S +T + + K T
Sbjct: 1167 TNQETQQVNQETKQTNQEVKQTTQETKQINQETKQTNQETREVSQETKQVNQEIKQTTQE 1226
Query: 67 KKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRR---------NSQV--- 114
K Q + TQ+ + QV + + ++ + N +V
Sbjct: 1227 NKQVNQEVKQETQQVN---QQTKQVSQETQQTNQETRQTTQEVKQTNQESKQVNQEVKQT 1283
Query: 115 -----SANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFD----KLK 165
N T+ + + Q Q T + K + K + Q +D K
Sbjct: 1284 TQETKQTNEQTKQTNEQIKQSNEQIKQATQETKQITQEIKQVDQKQQTAVPYDHDTIKEG 1343
Query: 166 ENIAPFQKSSKASSHDGHVVNYLSIGRHGDIVHPVLPK-----LQITRLNGAGFKYFI-- 218
+ P K K + + + + P+ K +Q + I
Sbjct: 1344 YSDTPGLKPGKIDQSEQQECQEQTNDKDTPLDRPIEKKSVSRTVQQQEQTSEEAQAIIID 1403
Query: 219 -----SFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNES 273
S N E+ EI + + Q +N+ + E V SK+ + + T N+
Sbjct: 1404 SKVDQSLSNSEQNQEI--IKKVDQKVESSQNNAQETEQVTSKVTETTSQTNSITQQTNDQ 1461
Query: 274 LSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLN 313
S K ++ + + +N + N
Sbjct: 1462 SSITNKQTQQTNETIQQNNKTIQETNESISQNNKTVQETN 1501
>ref|XP_001349291.1| histone acetyltransferase Gcn5, putative [Plasmodium falciparum
3D7]
emb|CAD51140.1| histone acetyltransferase Gcn5, putative [Plasmodium falciparum
3D7]
Length = 1484
Score = 36.2 bits (82), Expect = 5.4, Method: Composition-based stats.
Identities = 21/126 (16%), Positives = 44/126 (34%), Gaps = 19/126 (15%)
Query: 217 FISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSD 276
+ FYN Y + L + S + N++ E+ + ++G N SL
Sbjct: 735 LVDFYNNANYIFKDKLDFVCNSNTPTCNNINDLED--------NKKDDGHVDKQNYSLEK 786
Query: 277 KFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTC 336
K T++ P + + D ++ +++ E+ N + NT
Sbjct: 787 KDNEEHTNNNSDPIMDGVE-----------LDVKKISEKKEDQIEKDKPKNDNTKNDNTK 835
Query: 337 SNLNAS 342
+ N +
Sbjct: 836 EDKNQN 841
>ref|XP_001329685.1| hypothetical protein TVAG_453780 [Trichomonas vaginalis G3]
gb|EAY17550.1| hypothetical protein TVAG_453780 [Trichomonas vaginalis G3]
Length = 977
Score = 36.2 bits (82), Expect = 5.4, Method: Composition-based stats.
Identities = 22/69 (31%), Positives = 32/69 (46%), Gaps = 1/69 (1%)
Query: 10 KNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSSKKT 69
KNSV+ + E Q T T + K N++ + + S+ +KA K+G SS K
Sbjct: 471 KNSVKCIKRSENSTQ-TESTYEDKRKQMNSRLTSRKPSNKSSSRQSPIKANKSGKSSDKL 529
Query: 70 GTQRKRIST 78
TQR S
Sbjct: 530 STQRLNPSA 538
>ref|XP_635620.1| hypothetical protein DDBDRAFT_0216167 [Dictyostelium discoideum AX4]
gb|EAL62181.1| hypothetical protein DDBDRAFT_0216167 [Dictyostelium discoideum AX4]
Length = 5650
Score = 35.8 bits (81), Expect = 6.3, Method: Composition-based stats.
Identities = 59/346 (17%), Positives = 113/346 (32%), Gaps = 53/346 (15%)
Query: 24 QSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSSKKTGTQRKRISTQRFSL 83
Q+ Q L + + + Q S D N+S+ +N ++ K++S
Sbjct: 4733 QNYLDDFSQELSTLVSNIINLTNEQQSIDVNQSIFHFRNSITTFVPLDLAKQLSINMVGT 4792
Query: 84 FTYGNVQVMNSFVPIHNDIPN-------------------SSCIRRNSQVSANNVTESSG 124
++ ++NS + N S I ++ A+N+
Sbjct: 4793 I---SIMLINSSLKQKKLYRNFCPEPKIIDGKYIEFFDRFKSNIIEKAKNEADNLNFLRN 4849
Query: 125 V--FFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTG-FDKLKENIAPFQKSSKASSHD 181
+ + + S+ N + LK S+ I + QI + + K S +
Sbjct: 4850 LCEYIREVHSKLVSNDV-LKEISVSTSESITLAQIIQKRIPEFSFDKEVISKKSFCLNGT 4908
Query: 182 GHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSE 241
+ I DI + + P +Q N Y F N F + S+ + +
Sbjct: 4909 PTSYYRILIKTITDIYNHINPMIQ----NKKNTVYSEIFTN-------HFNSINSEEEIK 4957
Query: 242 LENSVKAFENVISKICQFSHINEGATIGNNESL-SDKFKLPPTSDIEPPNTEIINNDDDN 300
N I I ++ + NE L ++ ++ N I++ DDN
Sbjct: 4958 QIN--------IDSIRTSNNSSPNLIQKINEQLDNNNNNNNNNNNNNNNNNNNIDHGDDN 5009
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYP 346
+D + D +D E C +I NT ++ +P
Sbjct: 5010 NDGESESDIED-----YENINSTC--GKIKIIKNTTRKIDQIDNHP 5048
>gb|AAR20863.1| histone acetyltransferase [Plasmodium falciparum]
gb|AAR22527.1| histone acetyltransferase [Plasmodium falciparum]
gb|AAR88436.1| histone acetyltransferase [Plasmodium falciparum]
Length = 1464
Score = 35.8 bits (81), Expect = 6.4, Method: Composition-based stats.
Identities = 21/126 (16%), Positives = 44/126 (34%), Gaps = 19/126 (15%)
Query: 217 FISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSD 276
+ FYN Y + L + S + N++ E+ + ++G N SL
Sbjct: 735 LVDFYNNANYIFKDKLDFVCNSNTPTCNNINDLED--------NKKDDGHVDKQNYSLEK 786
Query: 277 KFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTC 336
K T++ P + + D ++ +++ E+ N + NT
Sbjct: 787 KDNEEHTNNNSDPIMDGVE-----------LDVKKISEKKEDQIEKDKPKNDNTKNDNTK 835
Query: 337 SNLNAS 342
+ N +
Sbjct: 836 EDKNQN 841
>ref|XP_001645330.1| hypothetical protein Kpol_1058p9 [Vanderwaltozyma polyspora DSM
70294]
gb|EDO17472.1| hypothetical protein Kpol_1058p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 646
Score = 35.8 bits (81), Expect = 6.7, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 338 NLNASFLYPSDPTDAVSISINEAFKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFID 397
N+ SF S P+ + I E + +R++PV+ PS + K+ NKR+SS I
Sbjct: 71 NIFTSFSGKS-PSSPLDNIIEEKNSSGEKRSSPVIISHRRQPSGNVKRSNKRHSSSSSIG 129
Query: 398 SPPYLQDRHRRFQRRSIS 415
SP D+ + R +S
Sbjct: 130 SPVLDSDKGKDKSERHVS 147
>ref|YP_926979.1| AAA ATPase containing von Willebrand factor type A (vWA) domain
[Shewanella amazonensis SB2B]
gb|ABL99309.1| AAA ATPase containing von Willebrand factor type A (vWA) domain
[Shewanella amazonensis SB2B]
Length = 2848
Score = 35.8 bits (81), Expect = 6.9, Method: Composition-based stats.
Identities = 17/56 (30%), Positives = 26/56 (46%), Gaps = 3/56 (5%)
Query: 259 FSHINEGATIGNNESLSDKFKLPPTSDIEPPNTE---IINNDDDNDDDDDNYDDDD 311
F + T +N+ + D P E +T+ I NN D++DD D+ D DD
Sbjct: 1272 FPNDATETTDSDNDGVGDNSDAFPNDANEQLDTDNDGIGNNADEDDDGDNVADTDD 1327
>ref|XP_393101.3| PREDICTED: similar to pawn CG11101-PA, partial [Apis mellifera]
Length = 6029
Score = 35.8 bits (81), Expect = 6.9, Method: Composition-based stats.
Identities = 30/136 (22%), Positives = 58/136 (42%), Gaps = 16/136 (11%)
Query: 3 LSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKN 62
++R + S+ ++ K + +T +S +L+ NK + ++ D N S + K
Sbjct: 1871 VARYNNRGRSLTVSRSPTKSSNNLEKTSAESAELNQNKARNRSKQGNNGDRNTSSRIKPT 1930
Query: 63 GTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTES 122
TSS ++F+L +S+ PNSS R N + ++NNV
Sbjct: 1931 STSSN----------GRQFTLREKDGTSSRSSY-----KRPNSSTSRSNGRSNSNNVRPP 1975
Query: 123 SGVFFNDTQSQDSQNT 138
++ +S +S +T
Sbjct: 1976 RLRNGSN-KSSESTST 1990
>ref|ZP_02368955.1| antigen, P35, putative [Borrelia valaisiana VS116]
Length = 291
Score = 35.8 bits (81), Expect = 7.0, Method: Composition-based stats.
Identities = 34/149 (22%), Positives = 56/149 (37%), Gaps = 17/149 (11%)
Query: 234 LISQSQSELENSVKAFENVISKICQFSHINEGAT--IGNNESLSDKFKLPPTSDIEPPNT 291
+++ Q E N ENVI KI Q + NE IG N + + IEP N
Sbjct: 57 ILNLGQKEPSN--NGKENVIQKITQRFNENEKLINKIGPNIEILAQKINTDVQKIEPINQ 114
Query: 292 EIINNDDDNDDDDDNYDD----DDLNYLLDE--EYEQGCTDNSFSVISNTCSN------- 338
IN + + DDN D + L EY++ ++++ S+
Sbjct: 115 FEINKNTFPEKQDDNIDAILENNLFRRLFYSSLEYDENNIKRLITILAQASSSNGDHYRV 174
Query: 339 LNASFLYPSDPTDAVSISINEAFKNAIRR 367
+ F ++ ++N K+ RR
Sbjct: 175 IGLIFWTGFKIQESFEKAVNLLTKDEQRR 203
Searching..................................................done
Results from round 3
Score E
Sequences producing significant alignments: (bits) Value
Sequences used in model and found again:
ref|NP_013931.1| Peripheral membrane protein of peroxisomes... 578 e-163
sp|A6ZMR6|INP1_YEAS7 Inheritance of peroxisomes protein 1 >... 578 e-163
ref|XP_449234.1| unnamed protein product [Candida glabrata]... 314 1e-83
ref|XP_455364.1| unnamed protein product [Kluyveromyces lac... 271 6e-71
ref|XP_001647343.1| hypothetical protein Kpol_1018p13 [Vand... 268 6e-70
ref|NP_985366.1| AFL184Wp [Ashbya gossypii ATCC 10895] >gi|... 214 7e-54
ref|XP_001482722.1| hypothetical protein PGUG_04677 [Pichia... 212 5e-53
ref|XP_458910.1| hypothetical protein DEHA0D11176g [Debaryo... 179 5e-43
ref|XP_721363.1| hypothetical protein CaO19.6525 [Candida a... 128 8e-28
Sequences not found previously or not previously below threshold:
ref|XP_001383052.2| Phosphatidylinositol-4-phosphate 5-kina... 86 5e-15
ref|XP_001523637.1| hypothetical protein LELG_05053 [Lodder... 83 4e-14
ref|XP_506086.1| hypothetical protein [Yarrowia lipolytica]... 45 0.013
ref|XP_001219610.1| hypothetical protein CHGG_00389 [Chaeto... 41 0.17
ref|ZP_02002225.1| conserved hypothetical protein [Beggiato... 40 0.34
ref|XP_001644941.1| hypothetical protein Kpol_1025p2 [Vande... 40 0.38
ref|YP_082369.1| hypothetical protein BCZK0765 [Bacillus ce... 39 0.61
ref|NP_014227.1| Largest subunit of the Anaphase-Promoting ... 39 0.67
ref|XP_745847.1| carbamoyl phosphate synthetase [Plasmodium... 38 0.99
gb|AAG10786.1| carbamoyl phosphate synthetase II [Plasmodiu... 38 1.5
ref|YP_796256.1| ABC-type oligopeptide transport system, pe... 37 2.1
ref|YP_001922312.1| putative glycerol-3-phosphate ABC trans... 37 2.2
ref|XP_001614588.1| E1-E2_ATPase/hydrolase, putative [Plasm... 37 2.2
ref|XP_001653742.1| hypothetical protein AaeL_AAEL001651 [A... 37 2.8
ref|YP_001887378.1| putative glycerol-3-phosphate ABC trans... 37 3.4
gb|ABZ96112.1| Esterase [Leptospira biflexa serovar Patoc s... 37 3.4
ref|XP_001033353.1| hypothetical protein TTHERM_00420980 [T... 37 3.8
ref|XP_001608456.1| hypothetical protein PVX_111155 [Plasmo... 37 3.9
emb|CAY17345.1| expressed protein [Schistosoma mansoni] 37 3.9
ref|YP_944121.1| protein containing tetratricopeptide (TPR)... 36 5.5
ref|YP_015868.1| replicative DNA helicase [Mycoplasma mobil... 36 6.3
ref|XP_420592.2| PREDICTED: similar to shroom family member... 36 6.6
ref|ZP_01049630.1| hypothetical protein MED134_08926 [Cellu... 36 6.9
ref|XP_001123188.1| PREDICTED: similar to CDK5 regulatory s... 36 7.5
ref|NP_001043616.1| Os01g0623500 [Oryza sativa (japonica cu... 35 7.9
ref|XP_001645330.1| hypothetical protein Kpol_1058p9 [Vande... 35 8.3
ref|XP_001300798.1| GAF domain containing protein [Trichomo... 35 9.4
>ref|NP_013931.1| Peripheral membrane protein of peroxisomes involved in peroxisomal
inheritance; Inp1p [Saccharomyces cerevisiae]
sp|Q03694|INP1_YEAST Inheritance of peroxisomes protein 1
emb|CAA88645.1| unknown [Saccharomyces cerevisiae]
Length = 420
Score = 578 bits (1489), Expect = e-163, Method: Composition-based stats.
Identities = 420/420 (100%), Positives = 420/420 (100%)
Query: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK
Sbjct: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
Query: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT
Sbjct: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
Query: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH
Sbjct: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
Query: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS
Sbjct: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN
Sbjct: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA
Sbjct: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL
Sbjct: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
>sp|A6ZMR6|INP1_YEAS7 Inheritance of peroxisomes protein 1
gb|EDN64140.1| inheritance of peroxisomes [Saccharomyces cerevisiae YJM789]
Length = 420
Score = 578 bits (1489), Expect = e-163, Method: Composition-based stats.
Identities = 419/420 (99%), Positives = 420/420 (100%)
Query: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK
Sbjct: 1 MVLSRGETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAK 60
Query: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT
Sbjct: 61 KNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVT 120
Query: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH
Sbjct: 121 ESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSH 180
Query: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS
Sbjct: 181 DGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQS 240
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDD+
Sbjct: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDD 300
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA
Sbjct: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEA 360
Query: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL
Sbjct: 361 FKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQRRSISGLGDL 420
>ref|XP_449234.1| unnamed protein product [Candida glabrata]
sp|Q6FKL0|INP1_CANGA Inheritance of peroxisomes protein 1
emb|CAG62208.1| unnamed protein product [Candida glabrata CBS 138]
Length = 342
Score = 314 bits (804), Expect = 1e-83, Method: Composition-based stats.
Identities = 118/372 (31%), Positives = 180/372 (48%), Gaps = 67/372 (18%)
Query: 11 NSVRLTAKQEKKPQ-STFQTLKQSLKLSNNKKLKQDS-----TQHSNDTNKSVKAKKNGT 64
++V+ + E KPQ Q++K +LKL K+ + S KS ++NG
Sbjct: 2 STVKQRTRTEGKPQYPLMQSIKDTLKLKTTKRTNSEKKVSTLNDKSGSPIKSNSERRNGI 61
Query: 65 SSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSG 124
+ K G+ KR+S QR +LF Y NVQV N + N + R++S ++ + +
Sbjct: 62 TRKVNGS--KRLSAQRVALFKYNNVQVFNY-------VRNGNNSRKSS--MSSLNSSGTV 110
Query: 125 VFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHV 184
V N+ S+ LKP L+ KG +E+YQI K+ A
Sbjct: 111 VMCNEYDSE----VTNLKPQMLIGKGVLELYQI---------------KTPAALDRKEQT 151
Query: 185 VNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELEN 244
+NY+S+GR G IVHP+LPKL+IT+L FKY I+F NPERYW+IEFL + Q EL
Sbjct: 152 MNYISLGRGGQIVHPILPKLKITKLRNENFKYLITFSNPERYWQIEFLQINGQLHDELLK 211
Query: 245 SVKAFENVISKICQF--SHINEGATIGNNESLSDKFKLPPTS-DIEPPNTEIINNDDDND 301
FE++IS +C F +I + A++ +N K PP + + E + +ND
Sbjct: 212 ITDEFESIISSVCIFIDENITKLASLKSN-------KTPPNNVNTLDQIKEKVQTATEND 264
Query: 302 DDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLY--PSDPTDAVSISINE 359
DD + ++L+YLL++ EQ T +SF Y P+D T+ ++
Sbjct: 265 DD----EAEELDYLLEDPIEQEPT---------------SSFEYDVPTDLTEVFQRTMGR 305
Query: 360 AFKNAIRRTAPV 371
N RR + V
Sbjct: 306 INSNGSRRYSSV 317
>ref|XP_455364.1| unnamed protein product [Kluyveromyces lactis]
sp|Q6CL25|INP1_KLULA Inheritance of peroxisomes protein 1
emb|CAG98072.1| unnamed protein product [Kluyveromyces lactis NRRL Y-1140]
Length = 343
Score = 271 bits (694), Expect = 6e-71, Method: Composition-based stats.
Identities = 92/358 (25%), Positives = 148/358 (41%), Gaps = 68/358 (18%)
Query: 70 GTQRKRISTQRFSLFTYGNVQVMNS---FVPIHNDIPNSSCIRRNSQVSANNVTESSGVF 126
G+ +K+ S SL ++ P + + + R++ Q + +F
Sbjct: 5 GSPKKKSSGPFQSL-KNSFLRNKGQPPQLQPWNPQVRTDNTKRKSVQ--------RTTLF 55
Query: 127 FNDTQSQDSQNTIKLKPTSLM-AKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVV 185
+D S + K +++ +K EIY+I G K + +
Sbjct: 56 RHDYVKVRSWPSDNPKNINVILSKTSFEIYEIDMGNSKER-------------------M 96
Query: 186 NYLSIGRHGDIVHPVLPKLQITRL---NGAGFKYFISFYNPERYWEIEFLPLISQSQSEL 242
NYLS+G+ VHP+LPKL++ R+ FK IS +NPER+WE+ FL +
Sbjct: 97 NYLSLGKKDQFVHPILPKLKVGRVPSDQKQEFKIIISLFNPERFWEVTFLSIN--HGRVP 154
Query: 243 ENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDD 302
+ E V+SKICQ+++ + +N SL D T I P +I+
Sbjct: 155 RKVINDLEQVLSKICQYNNKDP-----DNASLKDTKN--DTKAIPSPKIDILE------- 200
Query: 303 DDDNYDDDDLNYLLDEEYE-QGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEAF 361
+ DDL YLL E Q + S+ N S +++ + S +N+ F
Sbjct: 201 -----EVDDLEYLLAENVPDQEQIPEAESLELNESSTISSISMEHSLVQSG--DVVNKTF 253
Query: 362 KNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSY--PFIDSPPYLQDRHRRFQ--RRSIS 415
K A+ R P+ A S + KR+SS+ F+D P + Q RRSIS
Sbjct: 254 KQALGRIKPIDFNTRAESS-----RYKRFSSFHSAFLDVPRISAEYQHSLQDKRRSIS 306
>ref|XP_001647343.1| hypothetical protein Kpol_1018p13 [Vanderwaltozyma polyspora DSM
70294]
gb|EDO19485.1| hypothetical protein Kpol_1018p13 [Vanderwaltozyma polyspora DSM
70294]
Length = 511
Score = 268 bits (685), Expect = 6e-70, Method: Composition-based stats.
Identities = 103/316 (32%), Positives = 158/316 (50%), Gaps = 37/316 (11%)
Query: 12 SVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGT----SSK 67
S + + +KK QST + +K+SLK ++ K + ++ D ++S K+ S
Sbjct: 29 SSQTSKTPKKKQQSTLRLIKESLKFKSSSKNSSKPSNNNIDKSESASMFKDTLKRSSISN 88
Query: 68 KTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFF 127
K KR S QR SLF Y NVQV+N P+ S R VS+ + + S
Sbjct: 89 KPRNLEKRSSAQRVSLFKYSNVQVVNCSAPL------SPVQIRKDSVSSGSTSTSRNYDI 142
Query: 128 NDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNY 187
N + S S +T+ +K SL++ G +E+YQI T +K + ++NY
Sbjct: 143 NSSLSVKSTSTVTMKSISLVSHGVMEVYQILTP--------------TKDPNETPQMMNY 188
Query: 188 LSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVK 247
S+GR+G IVHP+LP+LQ+TRL+G Y + FYNP RYW I+FLP+ + + NS+
Sbjct: 189 FSLGRNGKIVHPILPRLQVTRLHGYEASYSVLFYNPTRYWRIDFLPVENLVEKNQLNSIL 248
Query: 248 A-FENVISKICQFSHINEGATIGNNESLSDKFKLP----PTSDIEPPNT--------EII 294
A FE+VIS IC+FS +E ++ L + P P ++ EP II
Sbjct: 249 ADFESVISSICEFSKESEYTIHNDSRYLQKQIIEPKLLTPVTNNEPNRLFENLKHTDAII 308
Query: 295 NNDDDNDDDDDNYDDD 310
+ + N+ N + D
Sbjct: 309 SAANINNIHTSNNEKD 324
Score = 41.5 bits (96), Expect = 0.14, Method: Composition-based stats.
Identities = 43/132 (32%), Positives = 52/132 (39%), Gaps = 20/132 (15%)
Query: 274 LSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVIS 333
+ T I P + D D D DDLNYLL EE EQ D I
Sbjct: 348 TKEVINEIETISIPSPIETVKPIPSTEDVD----DQDDLNYLLFEEEEQPQKDLK---IY 400
Query: 334 NTCSNLNASFL--------YPSDPTDAVSI--SINEAFKNAIRRTAPVLNI--PIAAPSI 381
+ S +N+ + S TD IN+AFK AIR P N+ I S
Sbjct: 401 DDISRVNSISERILYDNNSFYSSTTDYFEQDAEINQAFKRAIRNFGPNRNLKEKIYEHSN 460
Query: 382 HSKQQNKRYSSY 393
HS KR+SSY
Sbjct: 461 HSVA-TKRFSSY 471
>ref|NP_985366.1| AFL184Wp [Ashbya gossypii ATCC 10895]
sp|Q755Q4|INP1_ASHGO Inheritance of peroxisomes protein 1
gb|AAS53190.1| AFL184Wp [Ashbya gossypii ATCC 10895]
Length = 300
Score = 214 bits (546), Expect = 7e-54, Method: Composition-based stats.
Identities = 79/288 (27%), Positives = 121/288 (42%), Gaps = 78/288 (27%)
Query: 76 ISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQDS 135
+S R +LF Y +V+V S+ + NS RR + S V D ++
Sbjct: 91 LSNDRVTLFRYEHVRV-TSYQSVRKKYRNSGDSRRGQE------RMESAVKGTDVLRRER 143
Query: 136 QNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGD 195
+ I+ +GP+EIYQI T K P QK V YL +GR
Sbjct: 144 ASVIEP-------RGPLEIYQIITPISK-----EPSQK-----------VTYLCLGRKEQ 180
Query: 196 IVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENSVKAFENVISK 255
I+ P+LPKL+IT L G ++ + +NPE W+IEFL + S N + AFEN +
Sbjct: 181 IIKPILPKLRITMLTREGLQFSVLSFNPENSWKIEFLGALGDSAVPC-NVILAFENAVKN 239
Query: 256 ICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYL 315
IC++ T +NND + + DDDDL YL
Sbjct: 240 ICRY-------------------------------TSELNNDPIEEVGTE--DDDDLEYL 266
Query: 316 LDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEAFKN 363
L ++ + +T S++ + + S+P+D ++E F N
Sbjct: 267 LYSDFAED---------DDTGSDITLNREHSSEPSD-----LHELFDN 300
>ref|XP_001482722.1| hypothetical protein PGUG_04677 [Pichia guilliermondii ATCC 6260]
gb|EDK40579.1| hypothetical protein PGUG_04677 [Pichia guilliermondii ATCC 6260]
Length = 714
Score = 212 bits (539), Expect = 5e-53, Method: Composition-based stats.
Identities = 61/341 (17%), Positives = 117/341 (34%), Gaps = 79/341 (23%)
Query: 19 QEKKPQSTFQTLKQSLKLSN-------NKKLKQDSTQHSNDTNKSVKAKKN----GTSSK 67
Q+ +PQS Q + S ++S+ K K+D T+ + + ++ + +S
Sbjct: 90 QKTEPQS--QLNEDSSRISSRLNEPTFTKDTKEDKTKDTKEEIDALALVETLNPPTLTSN 147
Query: 68 KTGTQRKR-----------ISTQRFSLFTYGNVQVMNSFVPIHNDI------PNSSCIRR 110
+ RKR + + F L GN++ NSF+ + P
Sbjct: 148 VPMSPRKRTLMRHKKSDNDMKRESFILDAKGNLRNRNSFIEGADTYLQLPKEPLGPLNI- 206
Query: 111 NSQVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAP 170
N +V+ + V +++ DS +I + +L+ G E++Q+ G
Sbjct: 207 NEKVTLFQFNSAKVVLYHEM--VDSNQSIPKESGTLLGHGSFEVFQLHKGD--------- 255
Query: 171 FQKSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIE 230
V YLS G V+P+LPKL+I R N ++ + NPERYW I
Sbjct: 256 --------------VTYLSCG--SSFVYPLLPKLKILRTNFN--QFILPLANPERYWRIS 297
Query: 231 FLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPN 290
+E ++ E + + Q+ + + + + + + P
Sbjct: 298 -------LNTEDVKIIRRLERTLEEKIQYRNFYLEDSFKDEDYTQTIANGSTHTPLPYPT 350
Query: 291 TEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSV 331
+ + + E + S
Sbjct: 351 SA---TPPFSVHHYLP---------IAHELPESPPSAPISP 379
>ref|XP_458910.1| hypothetical protein DEHA0D11176g [Debaryomyces hansenii CBS767]
emb|CAG87064.1| unnamed protein product [Debaryomyces hansenii CBS767]
Length = 860
Score = 179 bits (453), Expect = 5e-43, Method: Composition-based stats.
Identities = 57/238 (23%), Positives = 97/238 (40%), Gaps = 43/238 (18%)
Query: 113 QVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQ 172
+V+ S + F++ + + + KL L+ G EI+Q+ G
Sbjct: 254 KVTLFKYKSSRILVFDEQLNDNKTDMEKLSSGRLLGHGEFEIFQLHNGD----------- 302
Query: 173 KSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFL 232
V YLS G V+P+LPKL+I R+N ++ + NPERYW+I
Sbjct: 303 ------------VTYLSCG--PSFVYPLLPKLKILRINFN--QFILPLVNPERYWKISIN 346
Query: 233 P----LISQSQSELENSVKAFENVISKICQFSHIN--EGATIGNNESLSDKFKLPPTSDI 286
LI + LE +VK + N+ I Q S I+ E I N + + K P S
Sbjct: 347 SDDKNLIHLLERTLERNVK-YRNLFFGINQLSSIDVCEDMEIPKNVANGKETKQLPVSGT 405
Query: 287 EPPNTEIINNDDDNDDDDDNYDDDDLNY---LLDEEYEQGCTDNSFSVISNTCSNLNA 341
I+N+ ++ + ++ +NY ++ E + S + +N+N
Sbjct: 406 ------ILNSSSGSEKHEIPVTENPMNYQFPIIFNEIPESPPSAPLSPHNENITNVNL 457
>ref|XP_721363.1| hypothetical protein CaO19.6525 [Candida albicans SC5314]
ref|XP_720850.1| hypothetical protein CaO19_13878 [Candida albicans SC5314]
ref|XP_888816.1| hypothetical protein CaJ7.0223 [Candida albicans SC5314]
gb|EAL02027.1| hypothetical protein CaO19.13878 [Candida albicans SC5314]
gb|EAL02561.1| hypothetical protein CaO19.6525 [Candida albicans SC5314]
dbj|BAE44713.1| hypothetical protein [Candida albicans]
Length = 684
Score = 128 bits (322), Expect = 8e-28, Method: Composition-based stats.
Identities = 42/187 (22%), Positives = 76/187 (40%), Gaps = 40/187 (21%)
Query: 112 SQVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPF 171
+V+ S + + +Q++ N SL+A G EI+Q+ G
Sbjct: 178 EKVTLFKYKSSKIL---ELVNQNNAN------GSLLAHGIFEIFQLHQGD---------- 218
Query: 172 QKSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEF 231
V YLS G + ++P+LPK+++ R+N ++ + NPERYW+I
Sbjct: 219 -------------VTYLSCGNN--FIYPLLPKIKVFRINSN--QFLLPLVNPERYWKIF- 260
Query: 232 LPLISQSQSELENSVKAFENVISKICQFSHINEGATI-GNNESLSDKFKLPPTSDIEPPN 290
+ S+ + + N + F+ + I N+ A + +D P PP
Sbjct: 261 --INSEELNVINNLINVFQRNVQFISLHESENKNAPEPKSQFEFTDSINTKPKELESPPR 318
Query: 291 TEIINND 297
I+N+
Sbjct: 319 NIYISNE 325
>ref|XP_001383052.2| Phosphatidylinositol-4-phosphate 5-kinase and related FYVE
finger-containing proteins Signal transduction
mechanisms [Pichia stipitis CBS 6054]
gb|ABN65023.2| Phosphatidylinositol-4-phosphate 5-kinase and related FYVE
finger-containing proteins Signal transduction
mechanisms [Pichia stipitis CBS 6054]
Length = 839
Score = 85.8 bits (211), Expect = 5e-15, Method: Composition-based stats.
Identities = 47/273 (17%), Positives = 96/273 (35%), Gaps = 55/273 (20%)
Query: 64 TSSKKTGTQRKR-----------ISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCI---- 108
T+ + RK+ +S + + N++ N P+ I S
Sbjct: 207 TTKSTNLSPRKQAIMRYKKSDEFLSERSYIAEAKENLKASNRS-PVSMAIHESPDKPIVR 265
Query: 109 ---RRNSQVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLK 165
+ +V+ S + ++ ++ + N+ L+ G E++Q+ G
Sbjct: 266 ANLNMDEKVTLFKYKSSKIIVVDEQHNESNSNS---SSGRLLGHGEFEVFQLHNGD---- 318
Query: 166 ENIAPFQKSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPER 225
V YLS G ++P+ K ++ R+ ++ + NPER
Sbjct: 319 -------------------VTYLSCG--PSFIYPLFSKQKLLRIGFN--QFILPLVNPER 355
Query: 226 YWEIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSD 285
YW+I + S + +E +FE V+ + E T + P S+
Sbjct: 356 YWKIF---INSDEPNVIEVLESSFERVVKYRNLYFKSKETETPRVGHY---PIESSPPSN 409
Query: 286 IEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDE 318
I E+ + ++ D +N + + L+E
Sbjct: 410 INDQPHELKSLSIIDESDANNSNSTPVLPTLEE 442
>ref|XP_001523637.1| hypothetical protein LELG_05053 [Lodderomyces elongisporus NRRL
YB-4239]
gb|EDK46872.1| hypothetical protein LELG_05053 [Lodderomyces elongisporus NRRL
YB-4239]
Length = 944
Score = 83.1 bits (204), Expect = 4e-14, Method: Composition-based stats.
Identities = 44/200 (22%), Positives = 80/200 (40%), Gaps = 54/200 (27%)
Query: 113 QVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQ 172
+V+ V + +Q +Q SL+A G IEI+Q+ G
Sbjct: 250 KVTLFKYKSLRIV---ELINQHTQ------SGSLLAHGEIEIFQLHNGD----------- 289
Query: 173 KSSKASSHDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFL 232
V YLS G ++P+LPK+++ R++ ++ + NPERYW+I
Sbjct: 290 ------------VTYLSCGGK-LFIYPLLPKIKVLRISFN--QFLLPLLNPERYWKIY-- 332
Query: 233 PLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTE 292
E +++ E V+ Q+ +I+ + + DI+ P +
Sbjct: 333 -----VDCEDVSTLDCLEGVLKWNVQYINISNDKSDKS-----------DKLDIDDPKHQ 376
Query: 293 I-INNDDDNDDDDDNYDDDD 311
I + +D +D +D D +D
Sbjct: 377 KHIKHQNDQNDRNDQNDKND 396
>ref|XP_506086.1| hypothetical protein [Yarrowia lipolytica]
emb|CAG78899.1| unnamed protein product [Yarrowia lipolytica CLIB122]
Length = 544
Score = 45.0 bits (105), Expect = 0.013, Method: Composition-based stats.
Identities = 41/241 (17%), Positives = 83/241 (34%), Gaps = 49/241 (20%)
Query: 90 QVMNSFVPI----HNDIPNSSCIRRNSQVSANNVTESSGVFFNDTQSQDSQNTIKLKPT- 144
+ +SF P + + + R S A++++ + + D + +L P
Sbjct: 61 RQTSSFAPGSELANVQLRSRPGSRCASGTYASSISSGATLTEPDEADPNLDPWEQLPPVH 120
Query: 145 -----SLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYLSIGRHGDIVHP 199
++A G +++ + V Y+ G D V
Sbjct: 121 CAISERIVAHGRFQVFTLHN-----------------------DKVTYIKCG---DAVQA 154
Query: 200 VLPKLQITRLNGAGFKYFISFYNPERYWEIEFL----PLISQSQSELENS------VKAF 249
+LPKL++ R + + ++ P RYW +E + + LE S +KA
Sbjct: 155 ILPKLRLWRTSLS--QFIFPQPIPGRYWRVELFRSTYKIADDLRMALEQSCCFLDQIKAP 212
Query: 250 ENVISKICQFSHINEGATIGNNESLSDKFKLPP-TSDIEPPNTEIINNDDDNDDDDDNYD 308
E ++ I ++ L P S PP E ++ +DD++ +
Sbjct: 213 ELEEGAKEEYIENYLDQLIPEHDLKEKLVSLNPNNSGYIPPPKEKDEDETKVEDDNNTVE 272
Query: 309 D 309
+
Sbjct: 273 E 273
>ref|XP_001219610.1| hypothetical protein CHGG_00389 [Chaetomium globosum CBS 148.51]
gb|EAQ92154.1| hypothetical protein CHGG_00389 [Chaetomium globosum CBS 148.51]
Length = 476
Score = 41.2 bits (95), Expect = 0.17, Method: Composition-based stats.
Identities = 15/51 (29%), Positives = 23/51 (45%), Gaps = 4/51 (7%)
Query: 180 HDGHVVNYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIE 230
V +LS G P+LPK Q+ ++ K+ + P+ YW IE
Sbjct: 87 RAPGSVAFLSCGSALQ---PILPKSQVWCVDEESSKFILQIRRPQ-YWRIE 133
>ref|ZP_02002225.1| conserved hypothetical protein [Beggiatoa sp. PS]
gb|EDN67777.1| conserved hypothetical protein [Beggiatoa sp. PS]
Length = 352
Score = 40.0 bits (92), Expect = 0.34, Method: Composition-based stats.
Identities = 27/120 (22%), Positives = 43/120 (35%), Gaps = 13/120 (10%)
Query: 126 FFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVV 185
D Q + +I + SL +G YQI G KL + G++
Sbjct: 71 LGYDYQQDEVDTSIAYEIDSLDNQGFFAEYQIPMGTGKLTTGLRQDDNDQFGKHITGNM- 129
Query: 186 NYLSIGRHGDIVHPVLPKLQITRLNGAGFKYFISFYNPERYWEIEFLPLISQSQSELENS 245
S+G +P+ P +I G + + +N E Y+ EF + E NS
Sbjct: 130 ---SLG------YPISPTARIFF--SYGTAFKVPSFN-ELYYRNEFYAGNPNLEPEESNS 177
>ref|XP_001644941.1| hypothetical protein Kpol_1025p2 [Vanderwaltozyma polyspora DSM
70294]
gb|EDO17083.1| hypothetical protein Kpol_1025p2 [Vanderwaltozyma polyspora DSM
70294]
Length = 1507
Score = 40.0 bits (92), Expect = 0.38, Method: Composition-based stats.
Identities = 26/121 (21%), Positives = 51/121 (42%), Gaps = 11/121 (9%)
Query: 39 NKKLKQDSTQHSNDTNKSVKAKK------NGTSSKKTGTQRKRISTQRFSLFTYGNVQVM 92
+KK Q + +N+T+++ + G+S K G F +
Sbjct: 1243 SKKTTQPPSSGTNETSETAVEAEKKGKGITGSSKKVPGAIHLGRRVDYLISFLKDEPKDT 1302
Query: 93 NSFVPIHNDIPNSSCIRRNSQVSANNVT--ESSGVFFNDTQSQDSQNTIKLKPTSLMAKG 150
+S ++ +PN +R +++A N + S G+ DT + ++ +LKP + KG
Sbjct: 1303 DSSGSNNDKLPNGPTKKRPRKITATNSSKTNSPGLEKQDTGNGTEPSSKRLKP---LPKG 1359
Query: 151 P 151
P
Sbjct: 1360 P 1360
>ref|YP_082369.1| hypothetical protein BCZK0765 [Bacillus cereus E33L]
gb|AAU19478.1| conserved hypothetical protein [Bacillus cereus E33L]
Length = 898
Score = 39.2 bits (90), Expect = 0.61, Method: Composition-based stats.
Identities = 28/144 (19%), Positives = 61/144 (42%), Gaps = 12/144 (8%)
Query: 239 QSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDD 298
Q E VK +N Q + +NE + +S ++ K+ D+ EI+++++
Sbjct: 162 QVVDEEIVKQSDN--FSKKQINKLNEETKVSEKKSTDEQVKVCTEEDLSHKKAEIVSSEE 219
Query: 299 DNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISIN 358
+ + + + LN + + Y + +D +F+ S N+ + P D D + ++
Sbjct: 220 ETEI---SLQKEVLNEIDENTYGEEQSDVAFNKGSTEMDNITINGEEPYDEEDENAEIVD 276
Query: 359 E-------AFKNAIRRTAPVLNIP 375
+ ++A R +LN P
Sbjct: 277 DRKQMATIPVEHADRSYESILNYP 300
>ref|NP_014227.1| Largest subunit of the Anaphase-Promoting Complex/Cyclosome
(APC/C), which is a ubiquitin-protein ligase required
for degradation of anaphase inhibitors, including
mitotic cyclins, during the metaphase/anaphase
transition; Apc1p [Saccharomyces cerevisiae]
sp|P53886|APC1_YEAST Anaphase-promoting complex subunit 1
emb|CAA96060.1| APC1 [Saccharomyces cerevisiae]
Length = 1748
Score = 39.2 bits (90), Expect = 0.67, Method: Composition-based stats.
Identities = 71/420 (16%), Positives = 137/420 (32%), Gaps = 95/420 (22%)
Query: 25 STFQTLKQSLKLSNNKKLKQDS---TQHSNDTNKSVKAKKNGT--SSKKTGTQRKRISTQ 79
S+ Q+ K L +S+ KK D+ +Q S +K ++ + T S+ + +
Sbjct: 222 SSDQSRKGELTISSTKKTGLDAAGNSQKSGGISKDLRKFSHLTRRSTSINSNSHDFNAAE 281
Query: 80 RFSLFTYGNVQVMNSFVPIHNDI--PNSSCIRRNSQVSANN--------VTESSGVFFND 129
R GNV DI SSC RR+ + + + FF+
Sbjct: 282 RVL---SGNVGN----ASGRTDIFALPSSCSRRSLSATLDRMGNNIAPTNRAAPSGFFDS 334
Query: 130 TQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKASSHDGHVVNYL- 188
+ S NT T + + + A+S V+ +
Sbjct: 335 S----SANTATHSNI--------------TPVSQPMQQQQQEYLNQTATSSKDIVLTEIS 376
Query: 189 SIGRHGDIVH------PVLPKLQITRLNGAGFKYFISFYNPERY---WEIEFLPLISQS- 238
S+ DI+ +L KL+ L + + F+ P + W I+ LP + S
Sbjct: 377 SLKLPDDIIFTSRRLSSILSKLKFLSLRFERREGLLIFHEPTHFCKIWLIDLLPDVLDSI 436
Query: 239 -----QSELENSVKAFENV-------ISKICQFSHINEGATIGNNESLSDKFKL----PP 282
+ +N ++ EN+ I + + + + +++K P
Sbjct: 437 PFKIYGNSPQNMIR-LENLKLKEPSRIQAMYIHELLESCLILVSEGQNKEEYKACLYDPF 495
Query: 283 TSDIEPPNTEIINNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTD----NSFSVISNTCSN 338
P + +++ L L Y + + I++ N
Sbjct: 496 VKITSPSK-------NISEELTKQNSLPSLQKLF--PYPETSFTKLCFEAVKYITSPAFN 546
Query: 339 LNASFLYPSDPTDAVSISINEAFKNAIRRTA----------PVLNIPIAAPSIHSKQQNK 388
++ FL+ A SI ++ A + + +L +PI + S Q+ K
Sbjct: 547 ISFIFLW----QSAYSILLSRANDDVVGGLKMEHDAFSLVLSLLILPIPSSSAQEYQEYK 602
>ref|XP_745847.1| carbamoyl phosphate synthetase [Plasmodium chabaudi chabaudi]
emb|CAH75934.1| carbamoyl phosphate synthetase, putative [Plasmodium chabaudi]
Length = 2105
Score = 38.5 bits (88), Expect = 0.99, Method: Composition-based stats.
Identities = 36/173 (20%), Positives = 74/173 (42%), Gaps = 20/173 (11%)
Query: 206 ITRLNGAGFKYFISFYNPERYWEIEFLP--LISQSQSELENSVKAFENVISKICQFSHIN 263
I + + + S YN E ++I + + +++E + K F N+ + C++ +I+
Sbjct: 1377 INNIANNSYYFRDSVYNNE--YKINKMREFMNKDNENEFIKNGKNFANITNGKCEYINID 1434
Query: 264 EGATIGNNESLSDKFKLPPTSDI----EPPNTEIINNDDDNDDD-------DDNYDDDDL 312
+ T N + ++ + P S+ N+ ++N+ +D+ D N+ +D
Sbjct: 1435 KEHTKNINHNEDNEGRSPTNSNTRRMASKNNSSLMNSYNDDKSDCFSELSYIQNHKKNDY 1494
Query: 313 NYLLDEEYEQGCTDNSFSVISNT--CSNLNASFLYPSDPTDAVSISINEAFKN 363
Y E+ +DN +S S T N ++ Y D ++ SIN N
Sbjct: 1495 EY---EDLYSRSSDNCYSSHSITMKGDNDSSVLDYEEDEFNSELSSINSENDN 1544
>gb|AAG10786.1| carbamoyl phosphate synthetase II [Plasmodium chabaudi adami]
Length = 1418
Score = 38.1 bits (87), Expect = 1.5, Method: Composition-based stats.
Identities = 38/173 (21%), Positives = 74/173 (42%), Gaps = 20/173 (11%)
Query: 206 ITRLNGAGFKYFISFYNPERYWEIEFLP--LISQSQSELENSVKAFENVISKICQFSHIN 263
I + + + S YN E +EI + + +++E S K F N+ + C++ +I+
Sbjct: 1203 INNIANNSYYFRDSVYNNE--YEINKMREFMNKDNENEFIKSGKNFANITNGKCEYINID 1260
Query: 264 EGATIGNNESLSDKFKLPPTSDI----EPPNTEIINNDDDNDDD-------DDNYDDDDL 312
+ T N + ++ + P S+ N+ ++N+ +D+ D N+ +D
Sbjct: 1261 KEHTKNINYNEDNECRSPNNSNTRRMASKNNSSLMNSYNDDKSDCFSELSYIQNHKKNDY 1320
Query: 313 NYLLDEEYEQGCTDNSFSVISNT--CSNLNASFLYPSDPTDAVSISINEAFKN 363
Y E+ +DN +S S T N ++ Y D ++ SIN N
Sbjct: 1321 EY---EDLYSRSSDNCYSSHSITMKGDNDSSVLDYEEDEFNSELSSINSENDN 1370
>ref|YP_796256.1| ABC-type oligopeptide transport system, periplasmic component
[Lactobacillus brevis ATCC 367]
gb|ABJ65225.1| ABC-type oligopeptide transport system, periplasmic component
[Lactobacillus brevis ATCC 367]
Length = 547
Score = 37.3 bits (85), Expect = 2.1, Method: Composition-based stats.
Identities = 38/147 (25%), Positives = 63/147 (42%), Gaps = 12/147 (8%)
Query: 10 KNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSSKKT 69
K +V LT E+ +ST S L N KK+ D T S K+ K K+ T
Sbjct: 239 KKAVHLTKINEQVTESTTT----SYNLYNAKKV--DETLLSGQQIKNNKNSKDFVKRLPT 292
Query: 70 GTQRKRISTQRFSLFTYGNVQ-----VMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSG 124
G+QR ++ ++ S F NV+ +N +++ + + S + + +G
Sbjct: 293 GSQRLDLNEKKVSAFKNVNVRKAFSLAINRTQLVNDVLQDGSVASKGFVPAGMGNNPKTG 352
Query: 125 VFFNDTQSQDSQNTIKLK-PTSLMAKG 150
FN S S + +K +L+AKG
Sbjct: 353 EAFNQEASIKSAVSYNVKQAKTLLAKG 379
>ref|YP_001922312.1| putative glycerol-3-phosphate ABC transporter [Clostridium
botulinum E3 str. Alaska E43]
gb|ACD52177.1| putative glycerol-3-phosphate ABC transporter [Clostridium
botulinum E3 str. Alaska E43]
Length = 439
Score = 37.3 bits (85), Expect = 2.2, Method: Composition-based stats.
Identities = 23/94 (24%), Positives = 35/94 (37%), Gaps = 13/94 (13%)
Query: 226 YWEIEFLPLISQSQSELENSVKAFENV-----ISKICQFSHINEGATIGNNESLSDKFKL 280
YW L + ++N + F N + I Q + AT N ++ K
Sbjct: 43 YW----YGLGGKLGKNMDNIINDFNNSQDKYEVKGISQ---EDYDATFKNLQAAIAAKKT 95
Query: 281 PPTSDIEPPNTEIINNDDDNDDDDDNYDDDDLNY 314
P +EP I+NN D NY +D N+
Sbjct: 96 PALVVLEPDKGAILNNK-GALADISNYTAEDSNF 128
>ref|XP_001614588.1| E1-E2_ATPase/hydrolase, putative [Plasmodium vivax SaI-1]
gb|EDL44861.1| E1-E2_ATPase/hydrolase, putative [Plasmodium vivax]
Length = 2068
Score = 37.3 bits (85), Expect = 2.2, Method: Composition-based stats.
Identities = 19/106 (17%), Positives = 31/106 (29%), Gaps = 1/106 (0%)
Query: 235 ISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEII 294
+ F ++S I N+ IG+N P S P ++
Sbjct: 1639 NDMEHTPCSGKQSIFNRIVSFINDKRKKNKYNQIGDNSLKEYFINNPEVSISSPDDSYTS 1698
Query: 295 NNDDDNDDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLN 340
+N+D D D E +Q D F +N +
Sbjct: 1699 ETSSENNDTTITDTQPDGQLSYDGELKQNDED-PFDSENNLSEKIT 1743
>ref|XP_001653742.1| hypothetical protein AaeL_AAEL001651 [Aedes aegypti]
gb|EAT47213.1| conserved hypothetical protein [Aedes aegypti]
Length = 1132
Score = 36.9 bits (84), Expect = 2.8, Method: Composition-based stats.
Identities = 21/116 (18%), Positives = 38/116 (32%), Gaps = 4/116 (3%)
Query: 265 GATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDD--DDNYDDDDLNYLLDEEYEQ 322
+ S S L + P N I+ND D DN + D++ +E +
Sbjct: 791 LMSNQTESSPSPPPSLMDVTASTPKNAAQISNDADEHSGPPTDNNESVDMDLSDEEYVPE 850
Query: 323 GCTDNSFSVISNTCSNLNASFL--YPSDPTDAVSISINEAFKNAIRRTAPVLNIPI 376
T+ S ++ + + D DA +N + A + P+
Sbjct: 851 HPTEASTALHDEPIEDSTEPYEPFEGEDIDDAFMPLMNPTLSSQDESVASIPTPPV 906
>ref|YP_001887378.1| putative glycerol-3-phosphate ABC transporter,
glycerol-3-phosphate-binding protein [Clostridium
botulinum B str. Eklund 17B]
gb|ACD23238.1| putative glycerol-3-phosphate ABC transporter,
glycerol-3-phosphate-binding protein [Clostridium
botulinum B str. Eklund 17B]
Length = 440
Score = 36.9 bits (84), Expect = 3.4, Method: Composition-based stats.
Identities = 22/99 (22%), Positives = 38/99 (38%), Gaps = 16/99 (16%)
Query: 226 YWEIEFLPLISQSQSELENSVKAFENV-----ISKICQFSHINEGATIGNNESLSDKFKL 280
YW L + ++N + F N + I Q + AT N ++ K
Sbjct: 43 YW----YGLGGKLGKNMDNIINDFNNSQDKYEVKGISQ---EDYDATFKNLQAAIAAKKT 95
Query: 281 PPTSDIEPPNTEIINN----DDDNDDDDDNYDDDDLNYL 315
P +EP I+NN D ++ ++ D + NY+
Sbjct: 96 PTLVVLEPDKGAILNNKGALADISNYTAEDSDFNSDNYI 134
>gb|ABZ96112.1| Esterase [Leptospira biflexa serovar Patoc strain 'Patoc 1 (Ames)']
gb|ABZ99836.1| Putative hydrolase; putative signal peptide [Leptospira biflexa
serovar Patoc strain 'Patoc 1 (Paris)']
Length = 574
Score = 36.9 bits (84), Expect = 3.4, Method: Composition-based stats.
Identities = 28/142 (19%), Positives = 53/142 (37%), Gaps = 11/142 (7%)
Query: 57 VKAKKNGTSSKKTGTQRKRISTQRFSLFTYGN---------VQVMNSFVPIHNDIPNSSC 107
+K +NG +K T +KR S+ R S + N ++ M F P ++
Sbjct: 349 LKGIQNGIMTKPKVTFQKRFSSDRVSYSAWPNPSKIEKTYHLRPMGFFSPGKITT-TANT 407
Query: 108 IRRNSQVSANNVTESSGVFFNDTQSQDSQNTIKLKPTSLMAKGPIEIYQICTGFDKLKEN 167
N + + + ++GV + + +L+ + +Y + LK
Sbjct: 408 SNGNDTILSGGTSATTGVPLLSEILDGAVSVPVKTNVNLIDRTNAMVYLSDSLSSPLKLR 467
Query: 168 IAPFQKSS-KASSHDGHVVNYL 188
F K +S + HV+ YL
Sbjct: 468 GRTFYKGRINSSDYAPHVMVYL 489
>ref|XP_001033353.1| hypothetical protein TTHERM_00420980 [Tetrahymena thermophila
SB210]
gb|EAR85690.1| hypothetical protein TTHERM_00420980 [Tetrahymena thermophila
SB210]
Length = 822
Score = 36.5 bits (83), Expect = 3.8, Method: Composition-based stats.
Identities = 38/185 (20%), Positives = 66/185 (35%), Gaps = 9/185 (4%)
Query: 241 ELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDN 300
L+ KA + I K + + I +N L P S +P N +II+N D
Sbjct: 113 NLQKFQKALQEQIFKYEEKLNELFSKYIPDNLVFQQNKHLAPISGWKPNNLQIISNMDYQ 172
Query: 301 DDDDDNYDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNA---SFLYPSDPTDAVSISI 357
+ + D + L ++D Q + T + S Y S
Sbjct: 173 NIAEHTNDPNKLIDMIDCRIIQNPQTAKAILQKLTEKYIQFTCNSNQYSSQSQFNNQSLQ 232
Query: 358 NEAFKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHRRFQ--RRSIS 415
N +N I P N P + N + +++ ++D ++ Q R S++
Sbjct: 233 NGFNQNGI----PQNNTPTVQLTTLFSNDNPNNINLKNLNTQKQIKDFKQQMQDKRNSLT 288
Query: 416 GLGDL 420
LG+L
Sbjct: 289 DLGNL 293
>ref|XP_001608456.1| hypothetical protein PVX_111155 [Plasmodium vivax SaI-1]
gb|EDL42432.1| hypothetical protein, conserved [Plasmodium vivax]
Length = 966
Score = 36.5 bits (83), Expect = 3.9, Method: Composition-based stats.
Identities = 23/121 (19%), Positives = 45/121 (37%), Gaps = 7/121 (5%)
Query: 247 KAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDNDDDDDN 306
+A EN + I + I +NES ++ E + +++ N++ +
Sbjct: 181 RALENQLKSIVELREILNNK--DDNESTKTFYETTDNELTEDNEDD-LSDHTGNENKNVG 237
Query: 307 YDDDDLNYLLDEEYEQGCTDNSFSVISNTCSNLNASFLYPSDPTDAVSISINEAFKNAIR 366
D+ + L+ E+ + +N N F D T+ S I + N I+
Sbjct: 238 EWVDEREFTLNSSVEEDYLQYDYPQENNR----NNIFSLSDDETEPCSFDILQNIHNGIK 293
Query: 367 R 367
R
Sbjct: 294 R 294
>emb|CAY17345.1| expressed protein [Schistosoma mansoni]
Length = 2167
Score = 36.5 bits (83), Expect = 3.9, Method: Composition-based stats.
Identities = 30/138 (21%), Positives = 50/138 (36%), Gaps = 9/138 (6%)
Query: 7 ETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSS 66
+T ++ + ++ P++T ++K S +NK S+ TN+ +K
Sbjct: 664 DTTNSATKDVQNKKTIPENTSTSVKDS----SNKNNNTKSSDTHLTTNEPAPLQKET--- 716
Query: 67 KKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRR-NSQVSANNVTESSGV 125
K T ST N+ +NS D S + N + N S V
Sbjct: 717 -KPMTNVSSKSTDSKIAAETENLSKVNSTDRKTTDKHESETSAKNNVATTHNESNSISNV 775
Query: 126 FFNDTQSQDSQNTIKLKP 143
DTQS ++ IK
Sbjct: 776 LKPDTQSNKKESDIKTNS 793
>ref|YP_944121.1| protein containing tetratricopeptide (TPR) repeat [Psychromonas
ingrahamii 37]
gb|ABM04522.1| protein containing tetratricopeptide (TPR) repeat [Psychromonas
ingrahamii 37]
Length = 657
Score = 36.1 bits (82), Expect = 5.5, Method: Composition-based stats.
Identities = 33/139 (23%), Positives = 60/139 (43%), Gaps = 14/139 (10%)
Query: 7 ETKKNSVRLTAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSS 66
E +KNS + + Q + K S + NN++ K + Q +N+ K+ + +KN
Sbjct: 479 EQQKNSEQQK---NNEQQKNSEQQKNSEQQKNNEQQKNNEQQKNNEQQKNNEQQKNN-EQ 534
Query: 67 KKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANNVTESSGVF 126
+K Q+K Q N Q NS +N+ +S ++N++ NN + +
Sbjct: 535 QKNNEQQKNNEQQ------KNNEQQKNSEQQKNNEQQKNSEQQKNNEQQKNNEQQKN--- 585
Query: 127 FNDTQSQDSQNTIKLKPTS 145
N+ Q+ D Q + S
Sbjct: 586 -NEQQTNDKQQMTAKEVQS 603
>ref|YP_015868.1| replicative DNA helicase [Mycoplasma mobile 163K]
gb|AAT27657.1| replicative DNA helicase [Mycoplasma mobile 163K]
Length = 452
Score = 35.8 bits (81), Expect = 6.3, Method: Composition-based stats.
Identities = 18/67 (26%), Positives = 32/67 (47%)
Query: 228 EIEFLPLISQSQSELENSVKAFENVISKICQFSHINEGATIGNNESLSDKFKLPPTSDIE 287
+I+F + +Q + E E ++ NV++++ +S E A + E L+ K E
Sbjct: 57 KIDFNSIKNQLEKEPEKNIANINNVLAELIPWSSFEENALYFHKEILNYSRKRKVKEVFE 116
Query: 288 PPNTEII 294
N EII
Sbjct: 117 NWNKEII 123
>ref|XP_420592.2| PREDICTED: similar to shroom family member 3 protein [Gallus gallus]
Length = 2026
Score = 35.8 bits (81), Expect = 6.6, Method: Composition-based stats.
Identities = 31/155 (20%), Positives = 55/155 (35%), Gaps = 18/155 (11%)
Query: 35 KLSNNKKLKQDSTQHSNDTNKSVKAKKNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNS 94
K+ K + D + K + S + G K +S + L Y N Q N+
Sbjct: 1192 KMEVTKGTQTDPLRPQGQPCYGKKER----SFETPGKSGKSVSVEDL-LDRYDN-QQRNA 1245
Query: 95 FVPIHNDIPNSSCIRRNSQVSANNVTESSGVFFND-----------TQSQDSQNTIKLKP 143
+VP+H +S +N Q + G D + ++ ++ K+
Sbjct: 1246 YVPVHTRSRSSPTADKNHQDLLRRESSEFGAMVRDPFYVISAGARSSNKKERSHSEKMTF 1305
Query: 144 TSLMAKGPIEIYQICTGFDKLKENIAPFQKSSKAS 178
TS P + +I G L +N + S + S
Sbjct: 1306 TSYYPHPP-HVSEIGAGSSSLTKNHKVSELSRQDS 1339
>ref|ZP_01049630.1| hypothetical protein MED134_08926 [Cellulophaga sp. MED134]
gb|EAQ39602.1| hypothetical protein MED134_08926 [Dokdonia donghaensis MED134]
Length = 481
Score = 35.8 bits (81), Expect = 6.9, Method: Composition-based stats.
Identities = 19/142 (13%), Positives = 56/142 (39%), Gaps = 4/142 (2%)
Query: 2 VLSRGETKKNSVRL---TAKQEKKPQSTFQTLKQSLKLSNNKKLKQDSTQHSNDTNKSVK 58
+ ++ ET ++ V+ T +Q+ Q+T + +Q + +N+ + + + T+ +
Sbjct: 82 ITTQSETNQSVVKNSIDTREQQTSDQTTITSKEQVIAATNSSQRNSQTKSNITTTSNASG 141
Query: 59 AKKNGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSANN 118
K S + +S + L Y +V + ++ + ++ + +
Sbjct: 142 LKSGKISRSNSKINTSPLSNESNGLIAYSDVNNTTTS-STEKKATTTNQVTKDQKTTIGK 200
Query: 119 VTESSGVFFNDTQSQDSQNTIK 140
+ V N+T ++ ++
Sbjct: 201 EDNPTTVAANETVHTKEKDLVE 222
>ref|XP_001123188.1| PREDICTED: similar to CDK5 regulatory subunit associated protein 3
[Apis mellifera]
Length = 487
Score = 35.8 bits (81), Expect = 7.5, Method: Composition-based stats.
Identities = 36/192 (18%), Positives = 63/192 (32%), Gaps = 25/192 (13%)
Query: 244 NSVKAFENVISKICQFSHINEGATI--------------GNNESLSDKFKLPPTSDIEPP 289
+K+ E V+ C F + G GN + + P S EPP
Sbjct: 200 KKIKSLEKVVEFYCAFVNYTLGRQHDGGCVPMIKYIIDKGNTTTYEWIYGEAPLSVNEPP 259
Query: 290 NTEIINNDDDNDDDDDNYDDDDLNYLLD--EEYEQGCTDNSFSVISNTCSNLNASFLYPS 347
+NNDD + ++ D + +N +L+ E + G + ++ + N S
Sbjct: 260 LNISLNNDDLENKKIEDID-NIVNIMLEVGGEIDWGDENIDEAIEDENI-DYNISLEESG 317
Query: 348 DPTDAVSISINEAFKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFIDSPPYLQDRHR 407
+A A + + + N PS + N+ + F+ Y
Sbjct: 318 IVVEAAGHEGGIA--TGSKAYSILDN-----PSTRNDFINQLFELEAFLKVRLYEFKGDD 370
Query: 408 RFQRRSISGLGD 419
S S L D
Sbjct: 371 SVNFLSFSQLQD 382
>ref|NP_001043616.1| Os01g0623500 [Oryza sativa (japonica cultivar-group)]
dbj|BAB21091.1| cell division cycle gene CDC48-like [Oryza sativa Japonica Group]
dbj|BAF05530.1| Os01g0623500 [Oryza sativa (japonica cultivar-group)]
gb|EAZ12740.1| hypothetical protein OsJ_002565 [Oryza sativa (japonica
cultivar-group)]
Length = 812
Score = 35.4 bits (80), Expect = 7.9, Method: Composition-based stats.
Identities = 19/122 (15%), Positives = 41/122 (33%), Gaps = 12/122 (9%)
Query: 7 ETKKNSVRLTAKQEKKPQSTFQT-LKQSLKLSNNKKLKQDSTQHSNDTNKSVKAKK---- 61
+ +KN V+ E K + +T ++++ S++ K ++D + KK
Sbjct: 693 QKEKNKVKKETAPETKQEPKEKTKIQENGTKSSDSKTEKDKLDNKEGKKDKPADKKDKSD 752
Query: 62 -----NGTSSKKTGTQRKRISTQRFSLFTYGNVQVMNSFVPIHNDIPNSSCIRRNSQVSA 116
T ++ + + F V VMN + R+ Q++
Sbjct: 753 KGDAGETTLRPLNMEDLRKAKDEVAASFASEGV-VMNQIKEWNELYGKGG-SRKREQLTY 810
Query: 117 NN 118
Sbjct: 811 FL 812
>ref|XP_001645330.1| hypothetical protein Kpol_1058p9 [Vanderwaltozyma polyspora DSM
70294]
gb|EDO17472.1| hypothetical protein Kpol_1058p9 [Vanderwaltozyma polyspora DSM
70294]
Length = 646
Score = 35.4 bits (80), Expect = 8.3, Method: Composition-based stats.
Identities = 24/78 (30%), Positives = 38/78 (48%), Gaps = 1/78 (1%)
Query: 338 NLNASFLYPSDPTDAVSISINEAFKNAIRRTAPVLNIPIAAPSIHSKQQNKRYSSYPFID 397
N+ SF S P+ + I E + +R++PV+ PS + K+ NKR+SS I
Sbjct: 71 NIFTSFSGKS-PSSPLDNIIEEKNSSGEKRSSPVIISHRRQPSGNVKRSNKRHSSSSSIG 129
Query: 398 SPPYLQDRHRRFQRRSIS 415
SP D+ + R +S
Sbjct: 130 SPVLDSDKGKDKSERHVS 147
>ref|XP_001300798.1| GAF domain containing protein [Trichomonas vaginalis G3]
gb|EAX87868.1| GAF domain containing protein [Trichomonas vaginalis G3]
Length = 1554
Score = 35.4 bits (80), Expect = 9.4, Method: Composition-based stats.
Identities = 20/135 (14%), Positives = 48/135 (35%), Gaps = 10/135 (7%)
Query: 260 SHINEGATIGNNESLSDKFKLPPTSDIEPPNTEIINNDDDND------DDDDNYDDDDLN 313
S+ + T ++E+ + + + E I + ++++ D++ + D +
Sbjct: 1392 SNETKEETKDSDETKEETKNSGENKEETKNSNEEIQSSNESNMDSKEIDNESKEESKDND 1451
Query: 314 YLLDEEYEQGCTDNSFSVIS----NTCSNLNASFLYPSDPTDAVSISINEAFKNAIRRTA 369
+ + E + N IS N N S ++ S+ ++ N +
Sbjct: 1452 EIKENETDSNQNSNELKEISIENIKNEKNQNVSEIHSSENQNSQEDKEEPINSNKNSQEK 1511
Query: 370 PVLNIPIAAPSIHSK 384
N + S+H K
Sbjct: 1512 LEDNETLDNNSVHIK 1526
Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
Posted date: May 23, 2008 5:56 PM
Number of letters in database: 883,778,997
Number of sequences in database: 2,617,685
Database: /host/Blast/data/nr_perl/nr.01
Posted date: May 23, 2008 5:54 PM
Number of letters in database: 976,759,346
Number of sequences in database: 2,761,413
Database: /host/Blast/data/nr_perl/nr.02
Posted date: May 23, 2008 5:48 PM
Number of letters in database: 374,670,760
Number of sequences in database: 1,165,270
Database: /host/Blast/data/nr_perl/nr.03
Posted date: Apr 28, 2009 5:40 PM
Number of letters in database: 114,943,120
Number of sequences in database: 354,819
Lambda K H
0.307 0.138 0.352
Lambda K H
0.267 0.0422 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,787,804,674
Number of Sequences: 6899187
Number of extensions: 181707701
Number of successful extensions: 1084212
Number of sequences better than 10.0: 300
Number of HSP's better than 10.0 without gapping: 1069
Number of HSP's successfully gapped in prelim test: 3037
Number of HSP's that attempted gapping in prelim test: 1022189
Number of HSP's gapped (non-prelim): 40149
length of query: 420
length of database: 2,350,152,223
effective HSP length: 137
effective length of query: 283
effective length of database: 1,404,963,604
effective search space: 397604699932
effective search space used: 397604699932
T: 11
A: 40
X1: 16 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (21.4 bits)
S2: 80 (35.4 bits)