Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC005077A_C03 KMC005077A_c03
(1103 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                    Score    E
Sequences producing significant alignments:                         (bits)  Value
sp|Q02060|PSBS_SPIOL Photosystem II 22 kDa protein, chloroplast ...   398  e-110
sp|P54773|PSBS_LYCES Photosystem II 22 kDa protein, chloroplast ...   382  e-105
sp|Q9FPP4|PSBS_SOLSG Photosystem II 22 kDa protein, chloroplast ...   381  e-105
sp|Q9SMB4|PSBS_TOBAC Photosystem II 22 kDa protein, chloroplast ...   375  e-103
gb|AAK95290.1|AF410304_1 unknown protein [Arabidopsis thaliana] ...   367  e-100
>sp|Q02060|PSBS_SPIOL Photosystem II 22 kDa protein, chloroplast precursor (CP22)
gi|282837|pir||S26953 photosystem II 22K protein
precursor - spinach gi|21307|emb|CAA48557.1| 22kD-protein
of PSII [Spinacia oleracea] gi|260917|gb|AAB24338.1|
photosystem II 22 kda polypeptide [Spinacia oleracea]
Length = 274
Score = 398 bits (1023), Expect = e-110
Identities = 212/270 (78%), Positives = 237/270 (87%), Gaps = 2/270 (0%)
Frame = -1
Query: 1052 VLMSSVSSSYSVDLKKDPLLHLQSQRLRPKFS--QLSFNPLPSNSSLFSSRTFTTLALFK 879
++M VS++ ++DLK++ LL LQ Q+++PK S L F+PLPS+SS SS F TLALFK
Sbjct: 7 LMMPGVSTTNTIDLKRNALLKLQIQKIKPKSSTSNLFFSPLPSSSSS-SSTVFKTLALFK 65
Query: 878 SKAKAPPPKVVKQKPKVEDGVFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEALTGKGI 699
SKAKAP KV K K KVEDG+FGTSGGIGFTK+NELFVGRVAMIGFAASLLGE +TGKGI
Sbjct: 66 SKAKAPK-KVEKPKLKVEDGLFGTSGGIGFTKENELFVGRVAMIGFAASLLGEGITGKGI 124
Query: 698 LAQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGKFVDDESPTGLDKAVIPPGKGL 519
L+QLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRG+FV DE TGL+KAVIPPGK +
Sbjct: 125 LSQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGRFV-DEPTTGLEKAVIPPGKDV 183
Query: 518 RGALGLSEGGPLFGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLNIETGIPINEI 339
R ALGL GPLFGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLNIETG+PINEI
Sbjct: 184 RSALGLKTKGPLFGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLNIETGVPINEI 243
Query: 338 EPLVLFNVAFFFIAALNPGTGKFVSDDGED 249
EPLVL NV FFFIAA+NPGTGKF++DD E+
Sbjct: 244 EPLVLLNVVFFFIAAINPGTGKFITDDEEE 273
>sp|P54773|PSBS_LYCES Photosystem II 22 kDa protein, chloroplast precursor (CP22)
gi|7489039|pir||T06331 photosystem II 22K protein -
tomato gi|706853|gb|AAA63649.1| 22 kDa component of
photosystem II
Length = 276
Score = 382 bits (980), Expect = e-105
Identities = 203/258 (78%), Positives = 223/258 (85%), Gaps = 3/258 (1%)
Frame = -1
Query: 1013 LKKDPLLHLQSQRLRPKFSQLSFNPLPSNSSLFSSRTFTTLALFKSKAKAPPPKVV--KQ 840
LK PL L L +FS S N ++SS F+S TT+ALFKSKAKAPP KV K+
Sbjct: 25 LKPKPLSSLFLPSLPLRFSSSSTN---ASSSKFTS---TTVALFKSKAKAPPKKVAPPKE 78
Query: 839 KPKVEDGVFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEALTGKGILAQLNLETGIPIY 660
K KVEDG+FGTSGGIGFTKQNELFVGRVAMIGFAASLLGEA+TGKGILAQLNLETGIPIY
Sbjct: 79 KQKVEDGIFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEAITGKGILAQLNLETGIPIY 138
Query: 659 EAEPLLLFFILFTLLGAIGALGDRGKFVDDESP-TGLDKAVIPPGKGLRGALGLSEGGPL 483
EAEPLLLFFILF LLGAIGALGDRGKFVDD +P TGL+KAVIPPGK + ALGLSEGGPL
Sbjct: 139 EAEPLLLFFILFNLLGAIGALGDRGKFVDDPTPPTGLEKAVIPPGKSFKSALGLSEGGPL 198
Query: 482 FGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLNIETGIPINEIEPLVLFNVAFFF 303
FGFTK+NELFVGRLAQLG AFS+IGEIITGKGALAQLN ETG+PINEIEPL+LFN+AFFF
Sbjct: 199 FGFTKANELFVGRLAQLGIAFSIIGEIITGKGALAQLNFETGVPINEIEPLLLFNIAFFF 258
Query: 302 IAALNPGTGKFVSDDGED 249
AA+NPGTGKF++D+ ED
Sbjct: 259 FAAINPGTGKFITDEEED 276
>sp|Q9FPP4|PSBS_SOLSG Photosystem II 22 kDa protein, chloroplast precursor (CP22)
gi|12082782|gb|AAG48610.1|AF311720_1 photosystem II 22
kDa protein precursor [Solanum sogarandinum]
Length = 276
Score = 381 bits (979), Expect = e-105
Identities = 204/273 (74%), Positives = 232/273 (84%), Gaps = 10/273 (3%)
Frame = -1
Query: 1037 VSSSYSVDLKKDPLLHLQSQRLRPK-FSQLSFNPLP----SNSSLFSSRTFT--TLALFK 879
++++ VDL+ L +RL+PK S L LP S+++ FSS FT T+ALFK
Sbjct: 7 LTANAKVDLRSKESL---VERLKPKPLSSLFLPSLPLRFSSSTTNFSSSKFTSTTVALFK 63
Query: 878 SKAKAPPPKVV--KQKPKVEDGVFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEALTGK 705
SKAKAPP KV K+K KVEDG+FGTSGGIGFTKQNELFVGRVAMIGFAASLLGEA+TGK
Sbjct: 64 SKAKAPPKKVAPPKEKQKVEDGIFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEAITGK 123
Query: 704 GILAQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGKFVDDESP-TGLDKAVIPPG 528
GILAQLNLETGIPIYEAEPLLLFFILF LLGAIGALGDRG+F+DD +P TGL+KAVIPPG
Sbjct: 124 GILAQLNLETGIPIYEAEPLLLFFILFNLLGAIGALGDRGRFIDDPAPATGLEKAVIPPG 183
Query: 527 KGLRGALGLSEGGPLFGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLNIETGIPI 348
K + ALGLSEGGPLFGFTK+NELFVGRLAQLG AFS+IGEIITGKGALAQLN ETG+PI
Sbjct: 184 KSFKSALGLSEGGPLFGFTKANELFVGRLAQLGIAFSIIGEIITGKGALAQLNFETGVPI 243
Query: 347 NEIEPLVLFNVAFFFIAALNPGTGKFVSDDGED 249
NEIEPL+LFN+AFFF AA+NPGTGKF++D+ ED
Sbjct: 244 NEIEPLLLFNIAFFFFAAINPGTGKFITDEEED 276
>sp|Q9SMB4|PSBS_TOBAC Photosystem II 22 kDa protein, chloroplast precursor (CP22)
gi|6103011|emb|CAA59007.1| precursor of photosystem II
subunit (22KDa) [Nicotiana tabacum]
Length = 274
Score = 375 bits (962), Expect = e-103
Identities = 203/271 (74%), Positives = 227/271 (82%), Gaps = 8/271 (2%)
Frame = -1
Query: 1037 VSSSYSVDLKKDPLLHLQSQRLRPKFSQLSFNP-----LPSNSSLFSSR-TFTTLALFKS 876
++++ VDL+ L +RL+PK F P PS S+ SS T TT+ALFKS
Sbjct: 7 LTANAKVDLRSKESL---VERLKPKPLSSFFLPSLPLKYPSASASASSHFTSTTVALFKS 63
Query: 875 KAKAPPPKVV-KQKPKVEDGVFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEALTGKGI 699
KAKAP KVV K K KVEDG+FGTSGGIGFTKQNELFVGRVAMIGFAASLLGEA+TGKGI
Sbjct: 64 KAKAPAKKVVPKPKEKVEDGIFGTSGGIGFTKQNELFVGRVAMIGFAASLLGEAITGKGI 123
Query: 698 LAQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGKFVDDESP-TGLDKAVIPPGKG 522
LAQLNLETGIPIYEAEPLLLFFILF LLGAIGAL DRGKF+DD +P TGLDKAVIPPGKG
Sbjct: 124 LAQLNLETGIPIYEAEPLLLFFILFNLLGAIGALEDRGKFIDDPAPPTGLDKAVIPPGKG 183
Query: 521 LRGALGLSEGGPLFGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLNIETGIPINE 342
+ ALGLSEGGPLF FTK+NELFVGRLAQLG AFS+IGEIITGKGALAQLN ETG+PINE
Sbjct: 184 FKSALGLSEGGPLFEFTKANELFVGRLAQLGIAFSIIGEIITGKGALAQLNFETGVPINE 243
Query: 341 IEPLVLFNVAFFFIAALNPGTGKFVSDDGED 249
IEPL+LFN+ FFF+AA+NPGTGKFV+D+ E+
Sbjct: 244 IEPLLLFNIVFFFVAAINPGTGKFVTDEEEE 274
>gb|AAK95290.1|AF410304_1 unknown protein [Arabidopsis thaliana] gi|25090250|gb|AAN72262.1|
At1g44575/T18F15 [Arabidopsis thaliana]
Length = 265
Score = 367 bits (942), Expect = e-100
Identities = 188/220 (85%), Positives = 199/220 (90%)
Frame = -1
Query: 908 RTFTTLALFKSKAKAPPPKVVKQKPKVEDGVFGTSGGIGFTKQNELFVGRVAMIGFAASL 729
++F LALFK K KA P KV K K KVEDG+FGTSGGIGFTK NELFVGRVAMIGFAASL
Sbjct: 46 QSFVPLALFKPKTKAAPKKVEKPKSKVEDGIFGTSGGIGFTKANELFVGRVAMIGFAASL 105
Query: 728 LGEALTGKGILAQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGKFVDDESPTGLD 549
LGEALTGKGILAQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGKFVDD PTGL+
Sbjct: 106 LGEALTGKGILAQLNLETGIPIYEAEPLLLFFILFTLLGAIGALGDRGKFVDD-PPTGLE 164
Query: 548 KAVIPPGKGLRGALGLSEGGPLFGFTKSNELFVGRLAQLGFAFSLIGEIITGKGALAQLN 369
KAVIPPGK +R ALGL E GPLFGFTK+NELFVGRLAQLG AFSLIGEIITGKGALAQLN
Sbjct: 165 KAVIPPGKNVRSALGLKEQGPLFGFTKANELFVGRLAQLGIAFSLIGEIITGKGALAQLN 224
Query: 368 IETGIPINEIEPLVLFNVAFFFIAALNPGTGKFVSDDGED 249
IETGIPI +IEPLVL NVAFFF AA+NPG GKF++DDGE+
Sbjct: 225 IETGIPIQDIEPLVLLNVAFFFFAAINPGNGKFITDDGEE 264
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda     K        H
 0.318     0.135    0.401
Gapped
Lambda     K        H
 0.267     0.0410   0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 930,874,920
Number of Sequences: 1393205
Number of extensions: 21186523
Number of successful extensions: 66281
Number of sequences better than 10.0: 316
Number of HSP's better than 10.0 without gapping: 59463
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 65606
length of database: 448,689,247
effective HSP length: 125
effective length of database: 274,538,622
effective search space used: 66438346524
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
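The footer statistics above can be tied back to the scores reported for each hit. The sketch below is a minimal illustration, not part of the BLAST output: it assumes the standard Karlin-Altschul relations (bit score = (lambda * S - ln K) / ln 2, E = search space * 2^-bits) and assumes the BLASTX query length is counted in translated residues (1103 // 3). All function and constant names are hypothetical; the numeric values are copied from the report.

```python
import math

# Values copied from the report above (gapped Lambda/K, database size, query size).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
DB_LETTERS = 448_689_247
DB_SEQUENCES = 1_393_205
QUERY_LETTERS = 1103          # nucleotides in the query
EFFECTIVE_HSP_LENGTH = 125


def effective_db_length() -> int:
    """Database length minus one effective HSP length per sequence."""
    return DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH


def effective_search_space() -> int:
    """Effective query length times effective database length."""
    # Assumption: the translated query length is nucleotides // 3.
    query_residues = QUERY_LETTERS // 3
    effective_query = query_residues - EFFECTIVE_HSP_LENGTH
    return effective_query * effective_db_length()


def bit_score(raw_score: int) -> float:
    """Convert a raw alignment score to a normalized bit score."""
    return (LAMBDA_GAPPED * raw_score - math.log(K_GAPPED)) / math.log(2)


def expect_value(raw_score: int) -> float:
    """Expected number of chance hits scoring at least raw_score."""
    return effective_search_space() * 2.0 ** (-bit_score(raw_score))


if __name__ == "__main__":
    # Raw scores are the parenthesized values on the "Score =" lines.
    for raw in (1023, 980, 979, 962, 942):
        print(raw, math.floor(bit_score(raw)), f"{expect_value(raw):.1e}")
```

Under these assumptions the computed search space equals the "effective search space used" value in the footer, and the floored bit scores agree with the five reported hits. That the report truncates rather than rounds the bit score is an observation from this output, not documented behavior.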