Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC002938A_C03 KMC002938A_c03
(1561 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                      Score   E
Sequences producing significant alignments:                          (bits) Value
gb|AAM66970.1| putative RNA-binding protein [Arabidopsis thaliana]      301 2e-80
ref|NP_181259.1| chloroplast RNA-binding protein cp29, putative;...     299 7e-80
sp|P49313|ROC1_NICPL 30 kDa ribonucleoprotein, chloroplast precu...     294 3e-78
sp|Q43349|ROC2_ARATH 29 kDa ribonucleoprotein, chloroplast precu...     293 4e-78
dbj|BAA06519.1| cp29 [Arabidopsis thaliana]                             292 9e-78
>gb|AAM66970.1| putative RNA-binding protein [Arabidopsis thaliana]
Length = 289
Score = 301 bits (770), Expect = 2e-80
Identities = 173/306 (56%), Positives = 217/306 (70%), Gaps = 10/306 (3%)
Frame = -3
Query: 1484 SATSLALPILQTRQ-PLGSSQRFSSS-------FCLNPNFKSISISAVFLPSTLSLSSSR 1329
SA+SLAL + P G S+ S S F LN + S SI+A + + +SR
Sbjct: 4 SASSLALSSFNPKSLPFGVSRPASVSLLSPSLSFKLNSDSVSFSIAAKW-----NSPASR 58
Query: 1328 FVTRVAVSSEFDQEEDTFSDDA--ETRSFAPDHKVFVGNLPFSVDSAQLAELFQDAGNVE 1155
FV VA++SEF+ EED F+D A + +SF+ D K+FVGNLPF+VDSAQLA+LF+ AGNVE
Sbjct: 59 FVRNVAITSEFEVEEDGFADVAPPKEQSFSADLKLFVGNLPFNVDSAQLAQLFESAGNVE 118
Query: 1154 VVEVIYDKMTGNSRGFAFVTMSSAAEAEAAAQQFNNYELEGRALRVNSGPPPKNENRGFN 975
+VEVIYDK+TG SRGF FVTMSS +E EAAAQQFN YEL+GR LRVN+GPPP GF+
Sbjct: 119 MVEVIYDKITGRSRGFGFVTMSSVSEVEAAAQQFNGYELDGRPLRVNAGPPPPKREDGFS 178
Query: 974 ENPRFRNNSFNRGGSDSYRGGSDGYRGGGSDGYRGGGSSSYSENRVHVGNLAWGVDNAAL 795
PR +SF GS GY G GGGS + S NRV+VGNL+WGVD+ AL
Sbjct: 179 RGPR---SSFGSSGS--------GYGG-------GGGSGAGSGNRVYVGNLSWGVDDMAL 220
Query: 794 ESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRSLDGADLNGRAIKVSQA 615
ESLF EQG+VV+A+VIYDR+SGRS+GFGFVT+ S EV +AI+SLDGADL+GR I+VS+A
Sbjct: 221 ESLFSEQGKVVEARVIYDRDSGRSKGFGFVTYDSSQEVQNAIKSLDGADLDGRQIRVSEA 280
Query: 614 DSKPKR 597
+++P R
Sbjct: 281 EARPPR 286
Score = 80.5 bits (197), Expect = 7e-14
Identities = 40/83 (48%), Positives = 55/83 (66%)
Frame = -3
Query: 842 RVHVGNLAWGVDNAALESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRS 663
++ VGNL + VD+A L LF G V +VIYD+ +GRSRGFGFVT SS EV +A +
Sbjct: 92 KLFVGNLPFNVDSAQLAQLFESAGNVEMVEVIYDKITGRSRGFGFVTMSSVSEVEAAAQQ 151
Query: 662 LDGADLNGRAIKVSQADSKPKRE 594
+G +L+GR ++V+ PKRE
Sbjct: 152 FNGYELDGRPLRVNAGPPPPKRE 174
>ref|NP_181259.1| chloroplast RNA-binding protein cp29, putative; protein id:
At2g37220.1, supported by cDNA: 8849., supported by cDNA:
gi_13877808, supported by cDNA: gi_15081716, supported by
cDNA: gi_16323481 [Arabidopsis thaliana]
gi|12230623|sp|Q9ZUU4|ROC1_ARATH Putative
ribonucleoprotein At2g37220, chloroplast precursor
gi|25386550|pir||A84790 probable RNA-binding protein
[imported] - Arabidopsis thaliana
gi|4056477|gb|AAC98043.1| putative RNA-binding protein
[Arabidopsis thaliana]
gi|13877809|gb|AAK43982.1|AF370167_1 putative RNA-binding
protein [Arabidopsis thaliana] gi|15081717|gb|AAK82513.1|
At2g37220/F3G5.1 [Arabidopsis thaliana]
gi|16323482|gb|AAL15235.1| putative RNA-binding protein
[Arabidopsis thaliana] gi|20197716|gb|AAM15222.1|
putative RNA-binding protein [Arabidopsis thaliana]
Length = 289
Score = 299 bits (766), Expect = 7e-80
Identities = 172/306 (56%), Positives = 216/306 (70%), Gaps = 10/306 (3%)
Frame = -3
Query: 1484 SATSLALPILQTRQ-PLGSSQRFSSS-------FCLNPNFKSISISAVFLPSTLSLSSSR 1329
SA+SLAL + P G S+ S S F LN + S SI+A + + +SR
Sbjct: 4 SASSLALSSFNPKSLPFGVSRPASVSLLSPSLSFKLNSDSVSFSIAAKW-----NSPASR 58
Query: 1328 FVTRVAVSSEFDQEEDTFSDDA--ETRSFAPDHKVFVGNLPFSVDSAQLAELFQDAGNVE 1155
F VA++SEF+ EED F+D A + +SF+ D K+FVGNLPF+VDSAQLA+LF+ AGNVE
Sbjct: 59 FARNVAITSEFEVEEDGFADVAPPKEQSFSADLKLFVGNLPFNVDSAQLAQLFESAGNVE 118
Query: 1154 VVEVIYDKMTGNSRGFAFVTMSSAAEAEAAAQQFNNYELEGRALRVNSGPPPKNENRGFN 975
+VEVIYDK+TG SRGF FVTMSS +E EAAAQQFN YEL+GR LRVN+GPPP GF+
Sbjct: 119 MVEVIYDKITGRSRGFGFVTMSSVSEVEAAAQQFNGYELDGRPLRVNAGPPPPKREDGFS 178
Query: 974 ENPRFRNNSFNRGGSDSYRGGSDGYRGGGSDGYRGGGSSSYSENRVHVGNLAWGVDNAAL 795
PR +SF GS GY G GGGS + S NRV+VGNL+WGVD+ AL
Sbjct: 179 RGPR---SSFGSSGS--------GYGG-------GGGSGAGSGNRVYVGNLSWGVDDMAL 220
Query: 794 ESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRSLDGADLNGRAIKVSQA 615
ESLF EQG+VV+A+VIYDR+SGRS+GFGFVT+ S EV +AI+SLDGADL+GR I+VS+A
Sbjct: 221 ESLFSEQGKVVEARVIYDRDSGRSKGFGFVTYDSSQEVQNAIKSLDGADLDGRQIRVSEA 280
Query: 614 DSKPKR 597
+++P R
Sbjct: 281 EARPPR 286
Score = 80.5 bits (197), Expect = 7e-14
Identities = 40/83 (48%), Positives = 55/83 (66%)
Frame = -3
Query: 842 RVHVGNLAWGVDNAALESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRS 663
++ VGNL + VD+A L LF G V +VIYD+ +GRSRGFGFVT SS EV +A +
Sbjct: 92 KLFVGNLPFNVDSAQLAQLFESAGNVEMVEVIYDKITGRSRGFGFVTMSSVSEVEAAAQQ 151
Query: 662 LDGADLNGRAIKVSQADSKPKRE 594
+G +L+GR ++V+ PKRE
Sbjct: 152 FNGYELDGRPLRVNAGPPPPKRE 174
>sp|P49313|ROC1_NICPL 30 kDa ribonucleoprotein, chloroplast precursor (CP-RBP30)
gi|280402|pir||S26203 RNA-binding protein 30 -
curled-leaved tobacco gi|19708|emb|CAA46234.1| RNA
binding protein 30 [Nicotiana plumbaginifolia]
Length = 279
Score = 294 bits (752), Expect = 3e-78
Identities = 178/306 (58%), Positives = 211/306 (68%), Gaps = 4/306 (1%)
Frame = -3
Query: 1502 LAMSTTSATSLALPILQTRQPLGSSQRFSSSF-CLNPNFKSISISAVFLPSTLSLSSSRF 1326
+A S +S L+L P +SQ S SF L P+ ++S+S+ PS S SSRF
Sbjct: 1 MASSASSLHFLSLTPQTLLLPKPTSQTTSLSFFSLPPSSLNLSLSS---PS--SCFSSRF 55
Query: 1325 VTRVAVSSEFDQEEDTFSDDA---ETRSFAPDHKVFVGNLPFSVDSAQLAELFQDAGNVE 1155
V +V + +FDQ ED D E R+F+PD K+FVGNL FS DSA LAELF+ AGNVE
Sbjct: 56 VRKVTLP-DFDQIEDVEDGDEGVEEERNFSPDLKIFVGNLLFSADSAALAELFERAGNVE 114
Query: 1154 VVEVIYDKMTGNSRGFAFVTMSSAAEAEAAAQQFNNYELEGRALRVNSGPPPKNENRGFN 975
+VEVIYDK+TG SRGF FVTMSS E EAA QQFN YEL+GRALRVNSGPPP+
Sbjct: 115 MVEVIYDKLTGRSRGFGFVTMSSKEEVEAACQQFNGYELDGRALRVNSGPPPEK-----R 169
Query: 974 ENPRFRNNSFNRGGSDSYRGGSDGYRGGGSDGYRGGGSSSYSENRVHVGNLAWGVDNAAL 795
EN FR NS S+RGGS RGGGS S NRV+VGNLAWGVD AL
Sbjct: 170 ENSSFRENS-------SFRGGS-----------RGGGSFD-SSNRVYVGNLAWGVDQDAL 210
Query: 794 ESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRSLDGADLNGRAIKVSQA 615
E+LF EQG+VVDAKV+YDR+SGRSRGFGFVT+SS +EVN+AI SLDG DLNGRAI+VS A
Sbjct: 211 ETLFSEQGKVVDAKVVYDRDSGRSRGFGFVTYSSAEEVNNAIESLDGVDLNGRAIRVSPA 270
Query: 614 DSKPKR 597
+++P R
Sbjct: 271 EARPPR 276
Score = 82.0 bits (201), Expect = 2e-14
Identities = 40/83 (48%), Positives = 56/83 (67%)
Frame = -3
Query: 842 RVHVGNLAWGVDNAALESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRS 663
++ VGNL + D+AAL LF G V +VIYD+ +GRSRGFGFVT SS +EV +A +
Sbjct: 88 KIFVGNLLFSADSAALAELFERAGNVEMVEVIYDKLTGRSRGFGFVTMSSKEEVEAACQQ 147
Query: 662 LDGADLNGRAIKVSQADSKPKRE 594
+G +L+GRA++V+ KRE
Sbjct: 148 FNGYELDGRALRVNSGPPPEKRE 170
>sp|Q43349|ROC2_ARATH 29 kDa ribonucleoprotein, chloroplast precursor (RNA-binding protein
cp29) gi|1076303|pir||S53490 RNA-binding protein cp29
precursor - Arabidopsis thaliana
gi|681902|dbj|BAA06518.1| cp29 [Arabidopsis thaliana]
Length = 334
Score = 293 bits (751), Expect = 4e-78
Identities = 175/331 (52%), Positives = 213/331 (63%), Gaps = 30/331 (9%)
Frame = -3
Query: 1499 AMSTTSATSLALPILQTRQPLGSSQRFSSSFCLNPNFKSISISAVFLPSTLSLSS----- 1335
A S ++ +LP+ +R S S SF L+ S + ++F S L SS
Sbjct: 5 ASSLSAFNPKSLPLCVSRPASVSVLPPSLSFKLH----SDHLVSIFASSALKCSSPAEYP 60
Query: 1334 SRFVTRVAVSSEFDQEEDTFSDDA------ETRSFAPDHKVFVGNLPFSVDSAQLAELFQ 1173
SRFV VAVSS+F+ EED D E SF+PD K+FVGNL F+VDSAQLA+LF+
Sbjct: 61 SRFVRNVAVSSDFEVEEDDMFADGDDSAPVERNSFSPDLKLFVGNLSFNVDSAQLAQLFE 120
Query: 1172 DAGNVEVVEVIYDKMTGNSRGFAFVTMSSAAEAEAAAQQFNNYELEGRALRVNSGPPPKN 993
AGNVE+VEVIYDK+TG SRGF FVTMS+AAE EAAAQQFN YE EGR LRVN+GPPP
Sbjct: 121 SAGNVEMVEVIYDKVTGRSRGFGFVTMSTAAEVEAAAQQFNGYEFEGRPLRVNAGPPPPK 180
Query: 992 ENRGFNENPRFRNNSFNRGGS-DSYRGGSDG-YRGGGSDGYR-----------------G 870
F+ PR RGG S RGG G RGGG R G
Sbjct: 181 REESFSRGPRSGGYGSERGGGYGSERGGGYGSERGGGYGSQRSGGGYGGSQRSSYGSGSG 240
Query: 869 GGSSSYSENRVHVGNLAWGVDNAALESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSP 690
GS S S NR++VGNL+WGVD+ ALE+LF EQG+VV+A+VIYDR+SGRS+GFGFVT SS
Sbjct: 241 SGSGSGSGNRLYVGNLSWGVDDMALENLFNEQGKVVEARVIYDRDSGRSKGFGFVTLSSS 300
Query: 689 DEVNSAIRSLDGADLNGRAIKVSQADSKPKR 597
EV AI SL+GADL+GR I+VS+A+++P R
Sbjct: 301 QEVQKAINSLNGADLDGRQIRVSEAEARPPR 331
Score = 79.0 bits (193), Expect = 2e-13
Identities = 43/113 (38%), Positives = 62/113 (54%)
Frame = -3
Query: 932 SDSYRGGSDGYRGGGSDGYRGGGSSSYSENRVHVGNLAWGVDNAALESLFREQGRVVDAK 753
S + D G D +S + ++ VGNL++ VD+A L LF G V +
Sbjct: 70 SSDFEVEEDDMFADGDDSAPVERNSFSPDLKLFVGNLSFNVDSAQLAQLFESAGNVEMVE 129
Query: 752 VIYDRESGRSRGFGFVTFSSPDEVNSAIRSLDGADLNGRAIKVSQADSKPKRE 594
VIYD+ +GRSRGFGFVT S+ EV +A + +G + GR ++V+ PKRE
Sbjct: 130 VIYDKVTGRSRGFGFVTMSTAAEVEAAAQQFNGYEFEGRPLRVNAGPPPPKRE 182
>dbj|BAA06519.1| cp29 [Arabidopsis thaliana]
Length = 326
Score = 292 bits (748), Expect = 9e-78
Identities = 173/321 (53%), Positives = 209/321 (64%), Gaps = 30/321 (9%)
Frame = -3
Query: 1469 ALPILQTRQPLGSSQRFSSSFCLNPNFKSISISAVFLPSTLSLSS-----SRFVTRVAVS 1305
+LP+ +R S S SF L+ S + ++F S L SS SRFV VAVS
Sbjct: 7 SLPLCVSRPASVSVLPPSLSFKLH----SDHLVSIFASSALKCSSPAEYPSRFVRNVAVS 62
Query: 1304 SEFDQEEDTFSDDA------ETRSFAPDHKVFVGNLPFSVDSAQLAELFQDAGNVEVVEV 1143
S+F+ EED D E SF+PD K+FVGNL F+VDSAQLA+LF+ AGNVE+VEV
Sbjct: 63 SDFEVEEDDMFADGDDSAPVERNSFSPDLKLFVGNLSFNVDSAQLAQLFESAGNVEMVEV 122
Query: 1142 IYDKMTGNSRGFAFVTMSSAAEAEAAAQQFNNYELEGRALRVNSGPPPKNENRGFNENPR 963
IYDK+TG SRGF FVTMS+AAE EAAAQQFN YE EGR LRVN+GPPP F+ PR
Sbjct: 123 IYDKVTGRSRGFGFVTMSTAAEVEAAAQQFNGYEFEGRPLRVNAGPPPPKREESFSRGPR 182
Query: 962 FRNNSFNRGGS-DSYRGGSDG-YRGGGSDGYR-----------------GGGSSSYSENR 840
RGG S RGG G RGGG R G GS S S NR
Sbjct: 183 SGGYGSERGGGYGSERGGGYGSERGGGYGSQRSGGGYGGSQRSSYGSGSGSGSGSGSGNR 242
Query: 839 VHVGNLAWGVDNAALESLFREQGRVVDAKVIYDRESGRSRGFGFVTFSSPDEVNSAIRSL 660
++VGNL+WGVD+ ALE+LF EQG+VV+A+VIYDR+SGRS+GFGFVT SS EV AI SL
Sbjct: 243 LYVGNLSWGVDDMALENLFNEQGKVVEARVIYDRDSGRSKGFGFVTLSSSQEVQKAINSL 302
Query: 659 DGADLNGRAIKVSQADSKPKR 597
+GADL+GR I+VS+A+++P R
Sbjct: 303 NGADLDGRQIRVSEAEARPPR 323
Score = 79.0 bits (193), Expect = 2e-13
Identities = 43/113 (38%), Positives = 62/113 (54%)
Frame = -3
Query: 932 SDSYRGGSDGYRGGGSDGYRGGGSSSYSENRVHVGNLAWGVDNAALESLFREQGRVVDAK 753
S + D G D +S + ++ VGNL++ VD+A L LF G V +
Sbjct: 62 SSDFEVEEDDMFADGDDSAPVERNSFSPDLKLFVGNLSFNVDSAQLAQLFESAGNVEMVE 121
Query: 752 VIYDRESGRSRGFGFVTFSSPDEVNSAIRSLDGADLNGRAIKVSQADSKPKRE 594
VIYD+ +GRSRGFGFVT S+ EV +A + +G + GR ++V+ PKRE
Sbjct: 122 VIYDKVTGRSRGFGFVTMSTAAEVEAAAQQFNGYEFEGRPLRVNAGPPPPKRE 174
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 1,325,374,605
Number of Sequences: 1393205
Number of extensions: 31210035
Number of successful extensions: 169744
Number of sequences better than 10.0: 2824
Number of HSP's better than 10.0 without gapping: 103559
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 149094
length of database: 448,689,247
effective HSP length: 128
effective length of database: 270,359,007
effective search space used: 105710371737
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
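
The statistics above can be cross-checked against the scores reported for each HSP. The following is a minimal sketch, assuming the standard Karlin-Altschul relations (bit score S' = (lambda * S - ln K) / ln 2 and E = search space * 2^-S') and assuming the effective database length is the raw length minus one effective HSP length per sequence. The function and constant names are illustrative and are not part of the BLAST output; small differences from the printed E-values are expected because Lambda and K are rounded in the report.

```python
import math

# Values copied from the report footer (gapped Lambda and K, search space).
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410
SEARCH_SPACE = 105710371737

# Database figures from the report footer.
DB_LETTERS = 448689247
DB_SEQUENCES = 1393205
EFFECTIVE_HSP_LENGTH = 128


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to a bit score."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance HSPs with at least this bit score."""
    return search_space * 2.0 ** (-bits)


# Assumption: each database sequence loses one effective HSP length.
effective_db_length = DB_LETTERS - DB_SEQUENCES * EFFECTIVE_HSP_LENGTH

if __name__ == "__main__":
    print("effective length of database:", effective_db_length)
    # Raw scores 770 and 197 are the two HSPs of the first hit.
    for raw in (770, 197):
        bits = bit_score(raw, LAMBDA_GAPPED, K_GAPPED)
        e = expect_value(bits, SEARCH_SPACE)
        print(f"raw {raw}: {bits:.1f} bits, E ~ {e:.0e}")
```

For the first hit, the raw scores 770 and 197 should come out close to the reported 301 bits and 80.5 bits, and the computed effective database length should match the 270,359,007 printed above.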