Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC020463A_C01 KMC020463A_c01
(554 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
sp|P29344|RR1_SPIOL 30S ribosomal protein S1, chloroplast precur... 243 9e-64
ref|NP_198266.1| ribosomal protein S1; protein id: At5g29771.1, ... 243 1e-63
gb|ZP_00113919.1| hypothetical protein [Prochlorococcus marinus ... 104 6e-22
ref|ZP_00116164.1| hypothetical protein [Synechococcus sp. WH 8102] 103 2e-21
ref|NP_440890.1| 30S ribosomal protein S1 [Synechocystis sp. PCC... 102 4e-21
>sp|P29344|RR1_SPIOL 30S ribosomal protein S1, chloroplast precursor (CS1)
gi|322404|pir||A44121 ribosomal protein S1 precursor,
chloroplast - spinach gi|18060|emb|CAA46927.1| ribosomal
protein S1 [Spinacia oleracea] gi|170143|gb|AAA34045.1|
chloroplast ribosomal protein S1
Length = 411
Score = 243 bits (621), Expect = 9e-64
Identities = 127/192 (66%), Positives = 155/192 (80%), Gaps = 8/192 (4%)
Frame = +1
Query: 1 MASVAQQLSA-VRWSPMLWRRSQKQRRGAGT-------IVCSVAISNAQNKERAKLKQLF 156
MAS+AQQL+ +R P+ K T IV +VA+SNAQ +ER KLKQLF
Sbjct: 1 MASLAQQLAGGLRCPPLSNSNLSKPFSPKHTLKPRFSPIVSAVAVSNAQTRERQKLKQLF 60
Query: 157 EDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKST 336
EDAYERCR AP +GVSFT++ F TAL+KYDF++E+G++VKGTVF TDA+GA VDITAKS+
Sbjct: 61 EDAYERCRNAPMEGVSFTIDDFHTALDKYDFNSEMGSRVKGTVFCTDANGALVDITAKSS 120
Query: 337 AYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQL 516
AYLPL EACI++IK+VEEAG++PGVR+EFVIIGENE+DD+L LSL+ IQ+ LAWERCRQL
Sbjct: 121 AYLPLAEACIYRIKNVEEAGIIPGVREEFVIIGENEADDSLILSLRQIQYELAWERCRQL 180
Query: 517 QAEDAVVKGKIV 552
QAED VVKGKIV
Sbjct: 181 QAEDVVVKGKIV 192
>ref|NP_198266.1| ribosomal protein S1; protein id: At5g29771.1, supported by cDNA:
4565., supported by cDNA: gi_13877938, supported by
cDNA: gi_16649088 [Arabidopsis thaliana]
gi|13877939|gb|AAK44047.1|AF370232_1 putative ribosomal
protein S1 [Arabidopsis thaliana]
gi|16649089|gb|AAL24396.1| Unknown protein [Arabidopsis
thaliana] gi|21593804|gb|AAM65771.1| ribosomal protein
S1 [Arabidopsis thaliana] gi|23296539|gb|AAN13122.1|
putative ribosomal protein S1 [Arabidopsis thaliana]
Length = 416
Score = 243 bits (619), Expect = 1e-63
Identities = 122/195 (62%), Positives = 160/195 (81%), Gaps = 11/195 (5%)
Frame = +1
Query: 1 MASVAQQLSAVRWSPM-----LWRRSQK---QRRGAG---TIVCSVAISNAQNKERAKLK 147
MAS+AQQ S +R SP+ L RR+ K Q + A TIV +VA+S+ Q KER +LK
Sbjct: 1 MASLAQQFSGLRCSPLSSSSRLSRRASKNFPQNKSASVSPTIVAAVAMSSGQTKERLELK 60
Query: 148 QLFEDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITA 327
++FEDAYERCRT+P +GV+FT++ F A+E+YDF++EIGT+VKGTVF TDA+GA VDI+A
Sbjct: 61 KMFEDAYERCRTSPMEGVAFTVDDFAAAIEQYDFNSEIGTRVKGTVFKTDANGALVDISA 120
Query: 328 KSTAYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERC 507
KS+AYL +++ACIH+IKHVEEAG+VPG+ +EFVIIGENESDD+L LSL++IQ+ LAWERC
Sbjct: 121 KSSAYLSVEQACIHRIKHVEEAGIVPGMVEEFVIIGENESDDSLLLSLRNIQYELAWERC 180
Query: 508 RQLQAEDAVVKGKIV 552
RQLQAED +VK K++
Sbjct: 181 RQLQAEDVIVKAKVI 195
>gb|ZP_00113919.1| hypothetical protein [Prochlorococcus marinus str. MIT 9313]
Length = 367
Score = 104 bits (260), Expect = 6e-22
Identities = 52/131 (39%), Positives = 80/131 (60%)
Frame = +1
Query: 157 EDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKST 336
+D R G FTL++F + L KYD++ + G V GTVF ++ GA +DI AK+
Sbjct: 52 DDPSSRAAKNDLSGAGFTLDEFASLLSKYDYNFKPGDIVNGTVFALESKGAMIDIGAKTA 111
Query: 337 AYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQL 516
A++PLQE I++++ + + L+PG EF I+ E D L LS++ I++ AWER RQL
Sbjct: 112 AFMPLQEVSINRVEGLSDV-LLPGEIREFFIMSEENEDGQLSLSIRRIEYQRAWERVRQL 170
Query: 517 QAEDAVVKGKI 549
Q EDA + ++
Sbjct: 171 QKEDATIYSEV 181
>ref|ZP_00116164.1| hypothetical protein [Synechococcus sp. WH 8102]
Length = 367
Score = 103 bits (256), Expect = 2e-21
Identities = 52/131 (39%), Positives = 79/131 (59%)
Frame = +1
Query: 157 EDAYERCRTAPTDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKST 336
+D R + D FT+++F L KYD++ + G V GTVF +A GA +DI AK+
Sbjct: 52 DDPGSRASSRNLDDAGFTIDEFAALLSKYDYNFKPGDIVNGTVFALEAKGAMIDIGAKTA 111
Query: 337 AYLPLQEACIHKIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQL 516
A++PLQE I++++ + + L PG EF I+ E D L LS++ I++ AWER RQL
Sbjct: 112 AFMPLQEVSINRVEGLSDV-LQPGEIREFFIMSEENEDGQLALSVRRIEYQRAWERVRQL 170
Query: 517 QAEDAVVKGKI 549
Q EDA + ++
Sbjct: 171 QKEDATIYSEV 181
>ref|NP_440890.1| 30S ribosomal protein S1 [Synechocystis sp. PCC 6803]
gi|2500385|sp|P73530|RS1A_SYNY3 30S ribosomal protein S1
homolog A gi|7447089|pir||S77236 ribosomal protein S1 -
Synechocystis sp. (strain PCC 6803)
gi|1652650|dbj|BAA17570.1| 30S ribosomal protein S1
[Synechocystis sp. PCC 6803]
Length = 328
Score = 102 bits (253), Expect = 4e-21
Identities = 53/120 (44%), Positives = 73/120 (60%)
Frame = +1
Query: 190 TDGVSFTLEQFTTALEKYDFDAEIGTKVKGTVFGTDASGAYVDITAKSTAYLPLQEACIH 369
T + FTLE F L+KYD+ G V GTVF ++ GA +DI AK+ AY+P+QE I+
Sbjct: 7 TATIGFTLEDFAALLDKYDYHFSPGDIVAGTVFSMESRGALIDIGAKTAAYIPIQEMSIN 66
Query: 370 KIKHVEEAGLVPGVRDEFVIIGENESDDTLFLSLKSIQFGLAWERCRQLQAEDAVVKGKI 549
++ EE L P EF I+ + D L LS++ I++ AWER RQLQAEDA V+ +
Sbjct: 67 RVDDPEEV-LQPNETREFFILTDENEDGQLTLSIRRIEYMRAWERVRQLQAEDATVRSNV 125
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 468,392,961
Number of Sequences: 1393205
Number of extensions: 9625805
Number of successful extensions: 33639
Number of sequences better than 10.0: 79
Number of HSP's better than 10.0 without gapping: 32387
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 33596
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19521267756
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)