
BLAST2 result
BLASTP 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= AC144729.7 - phase: 0
(226 letters)
Database: nr
2,540,612 sequences; 863,360,394 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_565077.2| peptidase U7 family protein [Arabidopsis thalia... 160 3e-38
gb|AAN17406.1| Unknown protein [Arabidopsis thaliana] gi|2805891... 160 3e-38
gb|AAF24059.1| putative protease SppA [Arabidopsis thaliana] 160 3e-38
ref|XP_467587.1| putative protease IV [Oryza sativa (japonica cu... 154 2e-36
ref|ZP_00521854.1| Peptidase S49, SppA 67 kDa type:Peptidase S49... 105 1e-21
emb|CAH08968.1| putative protease IV [Bacteroides fragilis NCTC ... 98 2e-19
ref|YP_100728.1| protease IV [Bacteroides fragilis YCH46] gi|522... 98 2e-19
gb|AAO76986.1| protease IV [Bacteroides thetaiotaomicron VPI-548... 94 2e-18
ref|ZP_00530536.1| Peptidase S49, SppA 67 kDa type:Peptidase S49... 92 1e-17
ref|ZP_00144963.1| Putative signal peptide peptidase sppA [Fusob... 85 2e-15
gb|AAQ65823.1| signal peptide peptidase SppA, 67K type [Porphyro... 84 2e-15
ref|ZP_00311036.1| COG0616: Periplasmic serine proteases (ClpP c... 83 7e-15
ref|ZP_00327330.1| COG0616: Periplasmic serine proteases (ClpP c... 82 9e-15
ref|NP_441055.1| protease IV [Synechocystis sp. PCC 6803] gi|165... 82 9e-15
ref|ZP_00160311.2| COG0616: Periplasmic serine proteases (ClpP c... 82 1e-14
dbj|BAB76289.1| protease IV [Nostoc sp. PCC 7120] gi|17232082|re... 81 2e-14
ref|NP_662527.1| protease IV [Chlorobium tepidum TLS] gi|2164765... 80 4e-14
dbj|BAB75712.1| protease IV [Nostoc sp. PCC 7120] gi|17231505|re... 75 2e-12
ref|ZP_00160734.1| COG0616: Periplasmic serine proteases (ClpP c... 74 2e-12
gb|EAK84449.1| hypothetical protein UM03629.1 [Ustilago maydis 5... 72 9e-12
>ref|NP_565077.2| peptidase U7 family protein [Arabidopsis thaliana]
gi|12325146|gb|AAG52522.1| putative protease IV;
48713-44371 [Arabidopsis thaliana]
gi|25406310|pir||F96767 proteinase IV F2P9.14 [imported]
- Arabidopsis thaliana
Length = 677
Score = 160 bits (405), Expect = 3e-38
Identities = 100/226 (44%), Positives = 130/226 (57%), Gaps = 42/226 (18%)
Query: 2 ISFRKYSKVRKWTVGISEAKEQIAIIRASGTMD----------SDIVTSNFIEKIGMVKD 51
+ ++KYS V+KWT+G++ ++QIAIIRA G++ S I+ IEKI V++
Sbjct: 354 VDYKKYSGVKKWTLGLTGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRE 413
Query: 52 SKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIV 111
SKK+KA I+RIDS GGD AS W I+ LA KPVIASMSD A S GYYMAM A AIV
Sbjct: 414 SKKYKAAIIRIDSPGGDALASDLMWREIKLLAETKPVIASMSDVAASGGYYMAMAANAIV 473
Query: 112 AENLTLTGSIRGTVSQNFNL---------DNELPSFIPY-------------DDAKVLDN 149
AENLTLTGSI G V+ F L + E S Y ++A++ +
Sbjct: 474 AENLTLTGSI-GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPLKPEEAELFEK 532
Query: 150 VV---------RLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVDA 186
+ L +DKM++VA GR+WTGKDA S GL+DA
Sbjct: 533 SAQHAYQLFRDKAALSRSMPVDKMEEVAQGRVWTGKDAHSRGLIDA 578
>gb|AAN17406.1| Unknown protein [Arabidopsis thaliana] gi|28058917|gb|AAO29968.1|
Unknown protein [Arabidopsis thaliana]
Length = 388
Score = 160 bits (405), Expect = 3e-38
Identities = 100/226 (44%), Positives = 130/226 (57%), Gaps = 42/226 (18%)
Query: 2 ISFRKYSKVRKWTVGISEAKEQIAIIRASGTMD----------SDIVTSNFIEKIGMVKD 51
+ ++KYS V+KWT+G++ ++QIAIIRA G++ S I+ IEKI V++
Sbjct: 65 VDYKKYSGVKKWTLGLTGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRE 124
Query: 52 SKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIV 111
SKK+KA I+RIDS GGD AS W I+ LA KPVIASMSD A S GYYMAM A AIV
Sbjct: 125 SKKYKAAIIRIDSPGGDALASDLMWREIKLLAETKPVIASMSDVAASGGYYMAMAANAIV 184
Query: 112 AENLTLTGSIRGTVSQNFNL---------DNELPSFIPY-------------DDAKVLDN 149
AENLTLTGSI G V+ F L + E S Y ++A++ +
Sbjct: 185 AENLTLTGSI-GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPLKPEEAELFEK 243
Query: 150 VV---------RLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVDA 186
+ L +DKM++VA GR+WTGKDA S GL+DA
Sbjct: 244 SAQHAYQLFRDKAALSRSMPVDKMEEVAQGRVWTGKDAHSRGLIDA 289
>gb|AAF24059.1| putative protease SppA [Arabidopsis thaliana]
Length = 680
Score = 160 bits (405), Expect = 3e-38
Identities = 100/226 (44%), Positives = 130/226 (57%), Gaps = 42/226 (18%)
Query: 2 ISFRKYSKVRKWTVGISEAKEQIAIIRASGTMD----------SDIVTSNFIEKIGMVKD 51
+ ++KYS V+KWT+G++ ++QIAIIRA G++ S I+ IEKI V++
Sbjct: 357 VDYKKYSGVKKWTLGLTGGRDQIAIIRAGGSISRVKGPLSTPGSAIIAEQLIEKIRSVRE 416
Query: 52 SKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIV 111
SKK+KA I+RIDS GGD AS W I+ LA KPVIASMSD A S GYYMAM A AIV
Sbjct: 417 SKKYKAAIIRIDSPGGDALASDLMWREIKLLAETKPVIASMSDVAASGGYYMAMAANAIV 476
Query: 112 AENLTLTGSIRGTVSQNFNL---------DNELPSFIPY-------------DDAKVLDN 149
AENLTLTGSI G V+ F L + E S Y ++A++ +
Sbjct: 477 AENLTLTGSI-GVVTARFTLAKLYEKIGFNKETISRGKYAELLGAEERPLKPEEAELFEK 535
Query: 150 VV---------RLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVDA 186
+ L +DKM++VA GR+WTGKDA S GL+DA
Sbjct: 536 SAQHAYQLFRDKAALSRSMPVDKMEEVAQGRVWTGKDAHSRGLIDA 581
>ref|XP_467587.1| putative protease IV [Oryza sativa (japonica cultivar-group)]
gi|46390611|dbj|BAD16095.1| putative protease IV [Oryza
sativa (japonica cultivar-group)]
Length = 674
Score = 154 bits (389), Expect = 2e-36
Identities = 97/227 (42%), Positives = 129/227 (56%), Gaps = 42/227 (18%)
Query: 1 MISFRKYSKVRKWTVGISEAKEQIAIIRASGTM----------DSDIVTSNFIEKIGMVK 50
M+ + KYS+V KWT+G+ EQIA+IRASG++ S I+ IEKI V+
Sbjct: 350 MVDYSKYSRVSKWTLGLQGGGEQIAVIRASGSITRTRSPLSVPSSGIIAEQLIEKIRTVR 409
Query: 51 DSKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAI 110
+S+K+KAVI+RIDS GGD AS W IR LA KPV+ASMSD A S GYYMAM A I
Sbjct: 410 ESEKYKAVILRIDSPGGDALASDLMWREIRLLADTKPVVASMSDVAASGGYYMAMAAPVI 469
Query: 111 VAENLTLTGSIRGTVSQNF---------NLDNELPSFIPY-------------DDAKVLD 148
VAE LTLTGSI G V+ F + + E+ S Y D+A++ +
Sbjct: 470 VAEKLTLTGSI-GVVTGKFILQKLYERIDFNKEIISKGRYAELNAADQRPLRPDEAELFE 528
Query: 149 NVV---------RLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVDA 186
+ + +D+M+ VA GR+W+G+DAAS GLVD+
Sbjct: 529 KSAQNAYALFRDKAAMSRSMNVDQMETVAQGRVWSGQDAASRGLVDS 575
>ref|ZP_00521854.1| Peptidase S49, SppA 67 kDa type:Peptidase S49, SppA [Solibacter
usitatus Ellin6076] gi|67864233|gb|EAM59240.1| Peptidase
S49, SppA 67 kDa type:Peptidase S49, SppA [Solibacter
usitatus Ellin6076]
Length = 574
Score = 105 bits (261), Expect = 1e-21
Identities = 75/224 (33%), Positives = 107/224 (47%), Gaps = 41/224 (18%)
Query: 2 ISFRKYSKVRKWTVGISEAKEQIAIIRASGTM-----------DSDIVTSNFIEKIGMVK 50
+S KYSKV +VG+ + K Q+A++ +G + +S++ + F + + V
Sbjct: 272 VSMDKYSKVTAESVGL-QGKSQVALVVGAGDIVRGDANDDGSGESNLTSYGFNKILRRVG 330
Query: 51 DSKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAI 110
+ KAVIVRIDS GG+ AS W + LA KKP++ SMSD A S GYYMAM I
Sbjct: 331 SDSQIKAVIVRIDSPGGEVTASDEIWREMNLLAKKKPMVISMSDTAASGGYYMAMTGDPI 390
Query: 111 VAENLTLTGS---------IRGTV------------SQNFNLDNELPSFIPYDDAKV--- 146
VA T TGS IRG +N ++D++ P AK+
Sbjct: 391 VAYPATFTGSIGVVFGKPNIRGLYEKLGISKDAIQRGKNADIDSDYTPLTPEQRAKLRAG 450
Query: 147 -----LDNVVRLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVD 185
D V ++ D++ +VA GR+W G A GLVD
Sbjct: 451 IDESYQDFVTKVANARHRTFDQIDQVAQGRVWLGSQAKPRGLVD 494
>emb|CAH08968.1| putative protease IV [Bacteroides fragilis NCTC 9343]
gi|60682742|ref|YP_212886.1| putative protease IV
[Bacteroides fragilis NCTC 9343]
Length = 592
Score = 97.8 bits (242), Expect = 2e-19
Identities = 67/197 (34%), Positives = 97/197 (49%), Gaps = 33/197 (16%)
Query: 20 AKEQIAIIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWEAI 79
A +I S D I+ S I + +K+ KAV++R++S GG AS+ W A+
Sbjct: 314 ASGEITDYAGSAASDEGIIGSKMIRDLRKLKEDDDVKAVVLRVNSPGGSAFASEQIWHAV 373
Query: 80 RSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIR--GTVSQNFNLDNELPS 137
+ L +KKPVI SM D A S GYY++ A +I+AE TLTGSI G + L ++
Sbjct: 374 KELKAKKPVIVSMGDYAASGGYYISCAADSIIAEPTTLTGSIGIFGMIPNVKGLTEKIG- 432
Query: 138 FIPYDDAKV-----LDNVVR--------LRLMTLGK----------------LDKMKKVA 168
+ YD K N++R L M +G+ DK++K+A
Sbjct: 433 -LTYDVVKTNQFSDFGNLMRPVNSDERALLQMMIGQGYDLFVSRCAEGRHMSKDKIEKIA 491
Query: 169 HGRIWTGKDAASHGLVD 185
GR+WTG+ A GLVD
Sbjct: 492 EGRVWTGEMAKKIGLVD 508
>ref|YP_100728.1| protease IV [Bacteroides fragilis YCH46]
gi|52217601|dbj|BAD50194.1| protease IV [Bacteroides
fragilis YCH46]
Length = 592
Score = 97.8 bits (242), Expect = 2e-19
Identities = 67/197 (34%), Positives = 97/197 (49%), Gaps = 33/197 (16%)
Query: 20 AKEQIAIIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWEAI 79
A +I S D I+ S I + +K+ KAV++R++S GG AS+ W A+
Sbjct: 314 ASGEITDYAGSAASDEGIIGSKMIRDLRKLKEDDDVKAVVLRVNSPGGSAFASEQIWHAV 373
Query: 80 RSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIR--GTVSQNFNLDNELPS 137
+ L +KKPVI SM D A S GYY++ A +I+AE TLTGSI G + L ++
Sbjct: 374 KELKAKKPVIVSMGDYAASGGYYISCAADSIIAEPTTLTGSIGIFGMIPNVKGLTEKIG- 432
Query: 138 FIPYDDAKV-----LDNVVR--------LRLMTLGK----------------LDKMKKVA 168
+ YD K N++R L M +G+ DK++K+A
Sbjct: 433 -LTYDVVKTNQFSDFGNLMRPVNSDERALLQMMIGQGYDLFVSRCAEGRHMSKDKIEKIA 491
Query: 169 HGRIWTGKDAASHGLVD 185
GR+WTG+ A GLVD
Sbjct: 492 EGRVWTGEMAKKIGLVD 508
>gb|AAO76986.1| protease IV [Bacteroides thetaiotaomicron VPI-5482]
gi|29347289|ref|NP_810792.1| protease IV [Bacteroides
thetaiotaomicron VPI-5482]
Length = 592
Score = 94.4 bits (233), Expect = 2e-18
Identities = 67/202 (33%), Positives = 98/202 (48%), Gaps = 33/202 (16%)
Query: 15 VGISEAKEQIAIIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQS 74
V + A +I S T + IV + I + +KD + KAV++R++S GG AS+
Sbjct: 309 VAVYYASGEITDYSGSSTSEEGIVGTKVIRDLRKLKDDEDVKAVVLRVNSPGGSAFASEQ 368
Query: 75 TWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIR--GTVSQNFNLD 132
W A++ L ++KPVI SM D A S GYY++ A IVAE TLTGSI G V L
Sbjct: 369 IWHAVKELKTEKPVIVSMGDYAASGGYYISCVADTIVAEPTTLTGSIGIFGMVPNVKELS 428
Query: 133 NELPSFIPYDDAKV-----LDNVVR----------LRLMTLG--------------KLDK 163
++ + YD K N++R ++T G +
Sbjct: 429 EKIG--LTYDVVKTNKFSDFGNIMRPFNQDEKTLMQMMITQGYDTFVNRCAEGRHMSKEA 486
Query: 164 MKKVAHGRIWTGKDAASHGLVD 185
++K+A GR+WTG+ A GLVD
Sbjct: 487 IEKIAEGRVWTGEAAKELGLVD 508
>ref|ZP_00530536.1| Peptidase S49, SppA 67 kDa type:Peptidase S49, SppA [Chlorobium
phaeobacteroides BS1] gi|67915867|gb|EAM65185.1|
Peptidase S49, SppA 67 kDa type:Peptidase S49, SppA
[Chlorobium phaeobacteroides BS1]
Length = 596
Score = 92.0 bits (227), Expect = 1e-17
Identities = 78/234 (33%), Positives = 112/234 (47%), Gaps = 49/234 (20%)
Query: 1 MISFRKYSKVRKWTVGISEAKEQIAIIRASGTM--DSDIVTSN----FIEK-----IGMV 49
++S +Y W E KE IA+I ASG + SD + + F E+ +
Sbjct: 279 LVSGVRYKAAVPWPYK-PETKEHIAVITASGPIVRSSDDMAAGTEQGFDEETLRSSVQAA 337
Query: 50 KDSKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGA 109
D + KA+++RIDS GGD AS + + + S KKP++ SMS A S GY +A+ A +
Sbjct: 338 LDDESVKAIVLRIDSPGGDALASANMLQVLDSARVKKPIVTSMSSVAASGGYMIALAADS 397
Query: 110 IVAENLTLTGSIR--------GTVSQNFNLDNEL----------PSFIPYDDAKV----- 146
I AE LT+TGSI + + L+ E+ F P D+A +
Sbjct: 398 IFAEPLTVTGSIGVYALKPEISKLQEKIALNREVFTRGKNADAYTVFKPLDEAGMAKFME 457
Query: 147 ---------LDNVVRLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVD--AGMP 189
LD VVR R MT ++D VA GR+W G+ A +GLVD G+P
Sbjct: 458 TTGWIYDDFLDKVVRSRKMTREEVD---AVAGGRVWMGEAAVKNGLVDRIGGLP 508
>ref|ZP_00144963.1| Putative signal peptide peptidase sppA [Fusobacterium nucleatum
subsp. vincentii ATCC 49256] gi|27886138|gb|EAA23435.1|
Putative signal peptide peptidase sppA [Fusobacterium
nucleatum subsp. vincentii ATCC 49256]
Length = 263
Score = 84.7 bits (208), Expect = 2e-15
Identities = 61/189 (32%), Positives = 91/189 (47%), Gaps = 30/189 (15%)
Query: 26 IIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASK 85
+IR ++ + S +EK+ + K++ K KAV++R++S GG S E ++ LA +
Sbjct: 2 LIRKEVSLGEEYNVSETLEKLNIAKENDKIKAVVLRVNSPGGSALTSDIIAEKVKELAEE 61
Query: 86 KPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIRGTVS-----QNFNLDN----ELP 136
KPV SMS A S GYY++ A I + T+TGSI G VS DN E
Sbjct: 62 KPVYVSMSSVAASGGYYISANANKIFVDRNTITGSI-GVVSILPDFSKLITDNGVNIEKI 120
Query: 137 SFIPYDDAKVLDNVVR-----------------LRLMTLGK---LDKMKKVAHGRIWTGK 176
S Y D +D+ L +++ G+ +K+K +A GRIWTG
Sbjct: 121 SDGEYSDLYSVDSFTEKKYNKIYNSNLKVYEDFLNVVSKGRKIDKEKLKTIAEGRIWTGD 180
Query: 177 DAASHGLVD 185
+A GL D
Sbjct: 181 EAIKIGLAD 189
>gb|AAQ65823.1| signal peptide peptidase SppA, 67K type [Porphyromonas gingivalis
W83] gi|34540445|ref|NP_904924.1| signal peptide
peptidase SppA, 67K type [Porphyromonas gingivalis W83]
Length = 595
Score = 84.3 bits (207), Expect = 2e-15
Identities = 56/189 (29%), Positives = 86/189 (44%), Gaps = 29/189 (15%)
Query: 26 IIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWEAIRSLASK 85
II+ D +T ++I D KAV++R++S GG S+ W+ + L +K
Sbjct: 321 IIKKPFDTDGSSITQELAKEIKAAADDDDIKAVVLRVNSPGGSAFTSEQIWKQVADLKAK 380
Query: 86 KPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIR--------GTVSQNFNLDNELPS 137
KP++ SM D A S GYY+A A +IVAE+ TLTGSI V++ ++ ++
Sbjct: 381 KPIVVSMGDVAASGGYYIACAANSIVAEHTTLTGSIGIFGMFPNFAGVAKKIGVNMDVVQ 440
Query: 138 FIPYDD-------AKVLDNVVRLRLMTLG--------------KLDKMKKVAHGRIWTGK 176
Y D V D + R + G ++ +A GR+W G
Sbjct: 441 TSKYADLGNTFAPMTVEDRALIQRYIEQGYDLFLTRVSEGRNRTKAQIDSIAQGRVWLGD 500
Query: 177 DAASHGLVD 185
A + GLVD
Sbjct: 501 KALALGLVD 509
>ref|ZP_00311036.1| COG0616: Periplasmic serine proteases (ClpP class) [Cytophaga
hutchinsonii]
Length = 583
Score = 82.8 bits (203), Expect = 7e-15
Identities = 67/221 (30%), Positives = 101/221 (45%), Gaps = 37/221 (16%)
Query: 2 ISFRKYSKVRKWTVGISEAKE-QIAIIRASGTMDSD------IVTSNFIEKIGMVKDSKK 54
I F Y K+ K + + E IA++ A+G + S I + F + +++ K
Sbjct: 280 IDFVSYKKIIKGKPEKTTSSEPHIAVLFANGEIQSGKGDNETIGSETFCTDLRRLREDKN 339
Query: 55 FKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAEN 114
KA+++R++S GG AS W I KPVIASM + A S GYY+AM IVA
Sbjct: 340 VKAIVLRVNSPGGSALASDLIWREIMLAREVKPVIASMGNVAASGGYYIAMACDTIVASP 399
Query: 115 LTLTGSIR------------------GTVSQNFNLDNELPSFI-PYDDAK--VLDNVVR- 152
T+TGSI T + L ++L S P DA+ ++ N V
Sbjct: 400 ATITGSIGVFGLLMNTEDLLNNKLGISTDREKTGLYSDLGSLTRPVTDAERMIIQNEVNA 459
Query: 153 -----LRLMTLGKLDKMKKV---AHGRIWTGKDAASHGLVD 185
+R G+ ++ + A GR+W GKDA + L+D
Sbjct: 460 IYATFIRKAAEGRHTSVETIEVHASGRVWAGKDAKENNLID 500
>ref|ZP_00327330.1| COG0616: Periplasmic serine proteases (ClpP class) [Trichodesmium
erythraeum IMS101]
Length = 608
Score = 82.4 bits (202), Expect = 9e-15
Identities = 67/223 (30%), Positives = 113/223 (50%), Gaps = 40/223 (17%)
Query: 2 ISFRKYSKVRKWTVGISE---AKEQIAIIRASGTMDSDIVTSNFI------EKIGMVKDS 52
IS YSKV + +S+ + +IA++ A G + + TS I +K+ ++
Sbjct: 297 ISLNNYSKVPRVAKTLSKNIKSNNKIAVVYAQGEVVNGSGTSRQIGGDRLAKKLRQLRLD 356
Query: 53 KKFKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVA 112
+K KAV++R++S GG AS+ ++ ++ +KPVI SM + A S GY+++M A IVA
Sbjct: 357 EKVKAVVLRVNSPGGSASASEVISREVKLMSEEKPVIVSMGNIAASGGYWISMNADRIVA 416
Query: 113 ENLTLTGSIRGTVSQNFNLDN-ELPSFIPYDDAKV-----LDNVVRLR----LMTLGKL- 161
E T+TGSI G FN+ + I +D K+ L+ R + L+ + K+
Sbjct: 417 EVNTITGSI-GVFGVLFNIQEIANQNGITWDVVKIGKFADLNTTSRPKTEDELVLIQKMV 475
Query: 162 -------------------DKMKKVAHGRIWTGKDAASHGLVD 185
+K++++A GR+W+G +A GLVD
Sbjct: 476 DSIYERFIKNVATARNLAPEKVEEIAQGRVWSGANAQKLGLVD 518
>ref|NP_441055.1| protease IV [Synechocystis sp. PCC 6803] gi|1652816|dbj|BAA17735.1|
protease IV [Synechocystis sp. PCC 6803]
gi|2499882|sp|P73689|SPPA_SYNY3 Protease IV homolog
(Endopeptidase IV) (Signal peptide peptidase)
Length = 610
Score = 82.4 bits (202), Expect = 9e-15
Identities = 68/223 (30%), Positives = 103/223 (45%), Gaps = 43/223 (19%)
Query: 2 ISFRKYSKVRKWTVGISEAKEQIAIIRASGTMDS------DIVTSNFIEKIGMVKDSKKF 55
IS +Y +++ W + +IAI+ G++ + +I + E + ++
Sbjct: 299 ISLAEYHRLQNWETENHDQDPKIAIVYLEGSIVNGRGTWENIGGDRYGELLRTIRQDDDI 358
Query: 56 KAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENL 115
KAV++RI+S GG A+ W + L ++KPVI SM + A S GY++A IVA+
Sbjct: 359 KAVVLRINSPGGSASAADIIWREVELLQAQKPVIISMGNVAASGGYWIATAGEKIVAQPN 418
Query: 116 TLTGSIRGTVSQNFNLDN---------------EL----PSFIPYDDAKV---------- 146
T+TGSI G S FN++N EL S P + ++
Sbjct: 419 TVTGSI-GVFSILFNVENLGDRLGLNWDEVATGELANVGSSIKPKTELELAIFQRSVDQV 477
Query: 147 ----LDNVVRLRLMTLGKLDKMKKVAHGRIWTGKDAASHGLVD 185
LD V R R ++ LD VA GR+WTG A GLVD
Sbjct: 478 YEIFLDKVGRARNLSPTALD---SVAQGRVWTGLAAQKVGLVD 517
>ref|ZP_00160311.2| COG0616: Periplasmic serine proteases (ClpP class) [Anabaena
variabilis ATCC 29413]
Length = 609
Score = 81.6 bits (200), Expect = 1e-14
Identities = 66/222 (29%), Positives = 103/222 (45%), Gaps = 40/222 (18%)
Query: 2 ISFRKYSKVRKWTVGISE-AKEQIAIIRASGTM------DSDIVTSNFIEKIGMVKDSKK 54
IS R+Y++V ++G++ +K +IA++ A G + D I F ++ +
Sbjct: 294 ISLRRYAQVPGQSLGVARNSKNKIAVVYAEGDIVDGKGDDGQIGGDRFARIFNKIRQDEN 353
Query: 55 FKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAEN 114
KAV++RI+S GG AS+ IR KPV+ SM D A S GY++A + I AE
Sbjct: 354 VKAVVLRINSPGGSATASEVMQREIRLTRESKPVVVSMGDYAASGGYWIATDSNRIFAEP 413
Query: 115 LTLTGSIR--GTVSQNFNLDNELPSFIPYDDAKVL---------------------DNVV 151
T+TGSI G + L N+ + I +D K +V
Sbjct: 414 NTITGSIGVFGVLFNGQKLAND--NGITWDAVKTARYADSQTVARPKSPQEIAIYQRSVD 471
Query: 152 RLRLMTLGKL--------DKMKKVAHGRIWTGKDAASHGLVD 185
R+ M + K+ K+ ++A GR+W+G A GLVD
Sbjct: 472 RIYNMFVNKVAQGRKLPTQKVAQIAQGRVWSGVTAKQIGLVD 513
>dbj|BAB76289.1| protease IV [Nostoc sp. PCC 7120] gi|17232082|ref|NP_488630.1|
protease IV [Nostoc sp. PCC 7120]
gi|25290101|pir||AF2379 proteinase IV [imported] -
Nostoc sp. (strain PCC 7120)
Length = 609
Score = 81.3 bits (199), Expect = 2e-14
Identities = 66/222 (29%), Positives = 103/222 (45%), Gaps = 40/222 (18%)
Query: 2 ISFRKYSKVRKWTVGISE-AKEQIAIIRASGTM------DSDIVTSNFIEKIGMVKDSKK 54
IS R+Y++V ++G+ + +K +IA++ A G + D I F ++ +
Sbjct: 294 ISLRRYAQVPGQSLGLEKNSKNKIAVVYAEGDIVDGKGDDGQIGGDRFARIFNKIRQDEN 353
Query: 55 FKAVIVRIDSVGGDFHASQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAEN 114
KAV++RI+S GG AS+ IR KPV+ SM D A S GY++A + I AE
Sbjct: 354 VKAVVLRINSPGGSATASEVMQREIRLTRESKPVVVSMGDYAASGGYWIATDSNRIFAEP 413
Query: 115 LTLTGSIR--GTVSQNFNLDNELPSFIPYDDAKVL---------------------DNVV 151
T+TGSI G + L N+ + I +D K +V
Sbjct: 414 NTITGSIGVFGVLFNGQKLAND--NGITWDAVKTARYADSQTVARPKSPQEIAIYQRSVD 471
Query: 152 RLRLMTLGKL--------DKMKKVAHGRIWTGKDAASHGLVD 185
R+ M + K+ K+ ++A GR+W+G A GLVD
Sbjct: 472 RIYNMFVNKVAQGRKLPTQKVAEIAQGRVWSGVTAKQIGLVD 513
>ref|NP_662527.1| protease IV [Chlorobium tepidum TLS] gi|21647650|gb|AAM72869.1|
protease IV [Chlorobium tepidum TLS]
Length = 597
Score = 80.1 bits (196), Expect = 4e-14
Identities = 65/207 (31%), Positives = 94/207 (45%), Gaps = 47/207 (22%)
Query: 22 EQIAIIRASGTMDSDIV----------TSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHA 71
++IA+I +G + SD + E + D K KA+++RIDS GGD A
Sbjct: 298 DRIAVINITGMIVSDGAGGMSEGDGTDVATVKEALQTAIDDLKVKAIVLRIDSPGGDALA 357
Query: 72 SQSTWEAIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSI---------- 121
+ + E + +KKP++ASMS A S GY +A+ I AE LT+TGSI
Sbjct: 358 ASTMLELLNEAKAKKPIVASMSGLAASGGYMVALAGDKIFAEPLTITGSIGVFSLKPDLS 417
Query: 122 ---------RGTVSQNFNLDNELPSFIPYDDAK--------------VLDNVVRLRLMTL 158
R + + D E P F +DDA + V + R MT
Sbjct: 418 SLLEKTGIRREVLIRGRFADAETP-FRAFDDASFRKFVELTGTVYEDFIAKVAKGRHMTP 476
Query: 159 GKLDKMKKVAHGRIWTGKDAASHGLVD 185
++D VA GR+W+GK A GL+D
Sbjct: 477 AQVD---AVAGGRVWSGKRALEVGLID 500
>dbj|BAB75712.1| protease IV [Nostoc sp. PCC 7120] gi|17231505|ref|NP_488053.1|
protease IV [Nostoc sp. PCC 7120]
gi|25332287|pir||AF2307 proteinase IV [imported] -
Nostoc sp. (strain PCC 7120)
Length = 273
Score = 74.7 bits (182), Expect = 2e-12
Identities = 60/198 (30%), Positives = 98/198 (49%), Gaps = 33/198 (16%)
Query: 18 SEAKEQIAIIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWE 77
S+ ++QIA I +G + S +E + V++ +KF A+++RIDS GG SQ +
Sbjct: 7 SKFRKQIARIEITGAIASG-TRKRVLEALKTVEE-RKFPALLLRIDSPGGTVGDSQEIYS 64
Query: 78 AIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIRGTVSQNFNLDNELPS 137
A++ L K ++AS + + S G Y+ MGA I+A T+TGSI G + + NL+ L
Sbjct: 65 ALKRLREKIKIVASFGNISASGGVYIGMGAEHIMANPGTITGSI-GVILRGNNLERLLDK 123
Query: 138 FI---------PYDDA----------------KVLDNVVRLRLMTLGK-----LDKMKKV 167
PY D +++D + + T+ + +DK+K
Sbjct: 124 IGVSFKVIKSGPYKDILSFDRELTEPEQDILQELIDTSYQQFVQTVAEGRSLAVDKVKSF 183
Query: 168 AHGRIWTGKDAASHGLVD 185
A GRI+TG+ A G+VD
Sbjct: 184 ADGRIFTGQQALELGVVD 201
>ref|ZP_00160734.1| COG0616: Periplasmic serine proteases (ClpP class) [Anabaena
variabilis ATCC 29413]
Length = 273
Score = 74.3 bits (181), Expect = 2e-12
Identities = 62/198 (31%), Positives = 100/198 (50%), Gaps = 33/198 (16%)
Query: 18 SEAKEQIAIIRASGTMDSDIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWE 77
S+ ++QIA I +G + S +E + V++ KKF A+++RIDS GG SQ +
Sbjct: 7 SKFRKQIARIEITGAIASG-TRKRVLEALKTVEE-KKFPALLLRIDSPGGTVGDSQEIYS 64
Query: 78 AIRSLASKKPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSIRGTVSQNFNLDNELP- 136
A++ L K ++AS + + S G Y+ MGA I+A T+TGSI G + + NL+ L
Sbjct: 65 ALKRLREKIKIVASFGNISASGGVYIGMGAEHIMANPGTITGSI-GVILRGNNLERLLDK 123
Query: 137 ---SFI-----PYDDA----------------KVLDNVVRLRLMTLGK-----LDKMKKV 167
SF PY D +++D + + T+ + ++K+K
Sbjct: 124 VGVSFKVIKSGPYKDILSFDRELTEPEQDILQELIDTSYQQFVQTVAEGRSLAVEKVKSF 183
Query: 168 AHGRIWTGKDAASHGLVD 185
A GRI+TG+ A G+VD
Sbjct: 184 ADGRIFTGQQALELGVVD 201
>gb|EAK84449.1| hypothetical protein UM03629.1 [Ustilago maydis 521]
gi|49074184|ref|XP_401244.1| hypothetical protein
UM03629.1 [Ustilago maydis 521]
Length = 949
Score = 72.4 bits (176), Expect = 9e-12
Identities = 43/118 (36%), Positives = 69/118 (58%), Gaps = 10/118 (8%)
Query: 22 EQIAIIRASGTMDS---DIVTSNFIEKIGMVKDSKKFKAVIVRIDSVGGDFHASQSTWEA 78
E++ ++ G + S + TS+ ++ + + K K++++RIDS GGD AS+S W+A
Sbjct: 382 ERVGVVYLLGGISSAPGEFSTSSVLKGLKEAAEHKDIKSIVLRIDSGGGDVVASESIWDA 441
Query: 79 IRSLASK--KPVIASMSDAATSAGYYMAMGAGAIVAENLTLTGSI-----RGTVSQNF 129
+R + KPV+AS + A S GYY A A AI+A T+TGSI R T+++ F
Sbjct: 442 VRRVREDYGKPVVASFGNTAASGGYYAASAADAILACENTVTGSIGVASLRPTITRAF 499
Database: nr
Posted date: Jul 5, 2005 12:34 AM
Number of letters in database: 863,360,394
Number of sequences in database: 2,540,612
Lambda K H
0.321 0.134 0.393
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 359,914,221
Number of Sequences: 2540612
Number of extensions: 13696503
Number of successful extensions: 34610
Number of sequences better than 10.0: 420
Number of HSP's better than 10.0 without gapping: 245
Number of HSP's successfully gapped in prelim test: 175
Number of HSP's that attempted gapping in prelim test: 33978
Number of HSP's gapped (non-prelim): 624
length of query: 226
length of database: 863,360,394
effective HSP length: 123
effective length of query: 103
effective length of database: 550,865,118
effective search space: 56739107154
effective search space used: 56739107154
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 73 (32.7 bits)
Medicago: description of AC144729.7