Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC001172A_C01 KMC001172A_c01
(552 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
dbj|BAA06523.1| cp33 [Arabidopsis thaliana] 69 9e-16
sp|P19684|ROC5_NICSY 33 kDa ribonucleoprotein, chloroplast precu... 74 1e-12
sp|P49313|ROC1_NICPL 30 kDa ribonucleoprotein, chloroplast precu... 55 3e-12
sp|Q08935|ROC1_NICSY 29 kDa ribonucleoprotein A, chloroplast pre... 55 4e-12
pir||S18883 ribonucleoprotein precursor - common tobacco (fragme... 72 6e-12
>dbj|BAA06523.1| cp33 [Arabidopsis thaliana]
Length = 321
Score = 68.6 bits (166), Expect(2) = 9e-16
Identities = 31/49 (63%), Positives = 41/49 (83%)
Frame = +1
Query: 322 LAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDGT 468
L+Q+FGEAG VV V++V D TDRSRGF FVTMGS+E A++A+QMF+ +
Sbjct: 124 LSQIFGEAGTVVDVQIVYDKVTDRSRGFGFVTMGSIEEAKEAMQMFNSS 172
Score = 37.7 bits (86), Expect = 0.093
Identities = 17/48 (35%), Positives = 27/48 (55%)
Frame = +1
Query: 322 LAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDG 465
L FG+ V+ +V+ + T RSRGF F++ S EN + A+ +G
Sbjct: 227 LKDAFGDQPGVLGAKVIYERNTGRSRGFGFISFESAENVQSALATMNG 274
Score = 36.2 bits (82), Expect(2) = 9e-16
Identities = 31/105 (29%), Positives = 46/105 (43%), Gaps = 14/105 (13%)
Frame = +3
Query: 51 FTPSSIPLTLNIPHRPKPPKPLNLKPKLFSPSSLSLYRFHLPS------------VSFHG 194
FTP S L N P+ PL L S++ +RF + S
Sbjct: 32 FTPKSFKLVANCPN------PLILH------SNIRRHRFFCAAETEASSADDEIQASVEE 79
Query: 195 FQETEEEPES--ETFQNSEPKVSHSNQSRRLFVGNLPYSLSSSQL 323
+E EEE + E + + S + RL+VGNLPY+++SS+L
Sbjct: 80 EEEVEEEGDEGEEEVEEEKQTTQASGEEWRLYVGNLPYTITSSEL 124
>sp|P19684|ROC5_NICSY 33 kDa ribonucleoprotein, chloroplast precursor
gi|100390|pir||S12111 ribonucleoprotein, 33K, precursor
- common tobacco gi|20005|emb|CAA37879.1| 33 kDa
ribonucleoprotein precursor [Nicotiana tabacum]
Length = 324
Score = 73.9 bits (180), Expect = 1e-12
Identities = 38/77 (49%), Positives = 56/77 (72%), Gaps = 1/77 (1%)
Frame = +1
Query: 241 QSQRCHTQTNQGGSLWETCLTHCLPLS-LAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVT 417
+ +R +++ +GG L+ L + S L+++F EAG V +VE+V D TDRSRGFAFVT
Sbjct: 102 KEERVESESVEGGRLYVGNLPFSMTSSQLSEIFAEAGTVANVEIVYDRVTDRSRGFAFVT 161
Query: 418 MGSVENAEQAIQMFDGT 468
MGSVE A++AI++FDG+
Sbjct: 162 MGSVEEAKEAIRLFDGS 178
Score = 36.6 bits (83), Expect = 0.21
Identities = 22/49 (44%), Positives = 30/49 (60%), Gaps = 7/49 (14%)
Frame = +3
Query: 198 QETEEEP------ESETFQNSEPKV-SHSNQSRRLFVGNLPYSLSSSQL 323
QE +EE E E + E +V S S + RL+VGNLP+S++SSQL
Sbjct: 82 QEDDEEEVALSAEEEEEIEEKEERVESESVEGGRLYVGNLPFSMTSSQL 130
Score = 31.6 bits (70), Expect = 6.7
Identities = 15/43 (34%), Positives = 22/43 (50%)
Frame = +1
Query: 322 LAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAI 450
L F + +S +V+ D + RSRGF F+T S E A+
Sbjct: 233 LRDAFADQPGFMSAKVIYDRSSGRSRGFGFITFSSAEAMNSAL 275
>sp|P49313|ROC1_NICPL 30 kDa ribonucleoprotein, chloroplast precursor (CP-RBP30)
gi|280402|pir||S26203 RNA-binding protein 30 -
curled-leaved tobacco gi|19708|emb|CAA46234.1| RNA
binding protein 30 [Nicotiana plumbaginifolia]
Length = 279
Score = 55.5 bits (132), Expect(2) = 3e-12
Identities = 28/49 (57%), Positives = 33/49 (67%)
Frame = +1
Query: 319 SLAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDG 465
+LA+LF AG+V VEV+ D T RSRGF FVTM S E E A Q F+G
Sbjct: 102 ALAELFERAGNVEMVEVIYDKLTGRSRGFGFVTMSSKEEVEAACQQFNG 150
Score = 46.2 bits (108), Expect = 3e-04
Identities = 24/49 (48%), Positives = 28/49 (56%)
Frame = +1
Query: 319 SLAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDG 465
+L LF E G VV +VV D + RSRGF FVT S E AI+ DG
Sbjct: 209 ALETLFSEQGKVVDAKVVYDRDSGRSRGFGFVTYSSAEEVNNAIESLDG 257
Score = 37.4 bits (85), Expect(2) = 3e-12
Identities = 32/109 (29%), Positives = 45/109 (40%)
Frame = +3
Query: 15 SYSFLPICKQLYFTPSSIPLTLNIPHRPKPPKPLNLKPKLFSPSSLSLYRFHLPSVSFHG 194
S FL + Q P T ++ PP LNL L SPSS RF + V+
Sbjct: 7 SLHFLSLTPQTLLLPKPTSQTTSLSFFSLPPSSLNLS--LSSPSSCFSSRF-VRKVTLPD 63
Query: 195 FQETEEEPESETFQNSEPKVSHSNQSRRLFVGNLPYSLSSSQLGSALRR 341
F + E+ + + E S ++FVGNL +S S+ L R
Sbjct: 64 FDQIEDVEDGDEGVEEERNFS---PDLKIFVGNLLFSADSAALAELFER 109
>sp|Q08935|ROC1_NICSY 29 kDa ribonucleoprotein A, chloroplast precursor (CP29A)
gi|100293|pir||S20069 ribonucleoprotein A, 29K - wood
tobacco gi|19754|emb|CAA43427.1| 29kD A
ribonucleoprotein [Nicotiana sylvestris]
Length = 273
Score = 55.5 bits (132), Expect(2) = 4e-12
Identities = 28/49 (57%), Positives = 33/49 (67%)
Frame = +1
Query: 319 SLAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDG 465
+LA+LF AG+V VEV+ D T RSRGF FVTM S E E A Q F+G
Sbjct: 102 ALAELFERAGNVEMVEVIYDKLTGRSRGFGFVTMSSKEEVEAACQQFNG 150
Score = 48.5 bits (114), Expect = 5e-05
Identities = 33/99 (33%), Positives = 45/99 (45%), Gaps = 7/99 (7%)
Frame = +1
Query: 190 MDSKRLKKNLNLKPSKIQSQRCHTQTNQGGSLWETCLTHCLPLS-------LAQLFGEAG 348
+D + L+ N P K ++ + GGS + + L+ L LF E G
Sbjct: 153 LDGRALRVNSGPPPEKRENSSFRGGSRGGGSFDSSNRVYVGNLAWGVDQDALETLFSEQG 212
Query: 349 DVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDG 465
VV +VV D + RSRGF FVT S E AI+ DG
Sbjct: 213 KVVDAKVVYDRDSGRSRGFGFVTYSSAEEVNNAIESLDG 251
Score = 37.0 bits (84), Expect(2) = 4e-12
Identities = 32/105 (30%), Positives = 46/105 (43%), Gaps = 6/105 (5%)
Frame = +3
Query: 45 LYFTPSSIPL------TLNIPHRPKPPKPLNLKPKLFSPSSLSLYRFHLPSVSFHGFQET 206
L TP ++PL T ++ PP LNL L S SS RF + V+ F +
Sbjct: 11 LSLTPQTLPLPKPTSQTTSLSFFSLPPSSLNLS--LSSSSSCFSSRF-VRKVTLSDFDQI 67
Query: 207 EEEPESETFQNSEPKVSHSNQSRRLFVGNLPYSLSSSQLGSALRR 341
E+ + + E S ++FVGNLP+S S+ L R
Sbjct: 68 EDVEDGDDGVEEERNFS---PDLKIFVGNLPFSADSAALAELFER 109
>pir||S18883 ribonucleoprotein precursor - common tobacco (fragment)
gi|20009|emb|CAA43429.1| ribonucleoprotein [Nicotiana
tabacum]
Length = 292
Score = 71.6 bits (174), Expect = 6e-12
Identities = 37/77 (48%), Positives = 55/77 (71%), Gaps = 1/77 (1%)
Frame = +1
Query: 241 QSQRCHTQTNQGGSLWETCLTHCLPLS-LAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVT 417
+ + +++ +GG L+ L + S L+++F EAG V +VE+V D TDRSRGFAFVT
Sbjct: 91 KEESVESESVEGGRLYVGNLPFSMTSSQLSEIFAEAGTVANVEIVYDRVTDRSRGFAFVT 150
Query: 418 MGSVENAEQAIQMFDGT 468
MGSVE A++AI++FDG+
Sbjct: 151 MGSVEEAKEAIRLFDGS 167
Score = 36.6 bits (83), Expect = 0.21
Identities = 18/37 (48%), Positives = 24/37 (64%)
Frame = +3
Query: 213 EPESETFQNSEPKVSHSNQSRRLFVGNLPYSLSSSQL 323
E E E + E S S + RL+VGNLP+S++SSQL
Sbjct: 83 EEEEEIEEKEESVESESVEGGRLYVGNLPFSMTSSQL 119
Score = 33.5 bits (75), Expect = 1.8
Identities = 17/54 (31%), Positives = 27/54 (49%)
Frame = +1
Query: 322 LAQLFGEAGDVVSVEVVCDDFTDRSRGFAFVTMGSVENAEQAIQMFDGTVSLIS 483
L F + +S +V+ D + RSRGF F+T S E + A+ + V + S
Sbjct: 222 LRDAFADQPGFMSAKVIYDRSSGRSRGFGFITFSSAEAMKSALDTMNEVVRIHS 275
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 491,317,423
Number of Sequences: 1393205
Number of extensions: 11139256
Number of successful extensions: 58362
Number of sequences better than 10.0: 1139
Number of HSP's better than 10.0 without gapping: 53461
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 58065
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 19234190289
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)