Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004023A_C02 KMC004023A_c02
(713 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_171637.1| chloroplast nucleoid DNA binding protein, putat... 228 5e-59
gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putativ... 224 7e-58
ref|NP_191741.1| putative protein; protein id: At3g61820.1, supp... 208 7e-53
dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like protein ... 178 8e-44
ref|NP_173922.1| hypothetical protein; protein id: At1g25510.1, ... 145 4e-34
>ref|NP_171637.1| chloroplast nucleoid DNA binding protein, putative; protein id:
At1g01300.1, supported by cDNA: 7567. [Arabidopsis
thaliana] gi|25518405|pir||C86143 hypothetical protein
F6F3.10 - Arabidopsis thaliana
gi|9665144|gb|AAF97328.1|AC023628_9 Unknown protein
[Arabidopsis thaliana] gi|22135930|gb|AAM91547.1|
chloroplast nucleoid DNA binding protein, putative
[Arabidopsis thaliana]
Length = 485
Score = 228 bits (582), Expect = 5e-59
Identities = 112/128 (87%), Positives = 117/128 (90%)
Frame = -2
Query: 712 GNGGVIIDSGTSVTRLTRPAYTALRDAFRLGASHLKRAPEFSLFDTCFDLSGQTEVKVPT 533
GNGGVIIDSGTSVTRL RPAY A+RDAFR+GA LKRAP+FSLFDTCFDLS EVKVPT
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPDFSLFDTCFDLSNMNEVKVPT 417
Query: 532 VVLHFRGADVSLPATNYLIPVDSSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLGSSRV 353
VVLHFRGADVSLPATNYLIPVD++G FCFAFAGTM GLSIIGNIQQQGFRVVYDL SSRV
Sbjct: 418 VVLHFRGADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 477
Query: 352 GFAPRGCA 329
GFAP GCA
Sbjct: 478 GFAPGGCA 485
>gb|AAM66061.1| chloroplast nucleoid DNA binding protein, putative [Arabidopsis
thaliana]
Length = 485
Score = 224 bits (572), Expect = 7e-58
Identities = 111/128 (86%), Positives = 115/128 (89%)
Frame = -2
Query: 712 GNGGVIIDSGTSVTRLTRPAYTALRDAFRLGASHLKRAPEFSLFDTCFDLSGQTEVKVPT 533
GNGGVIIDSGTSVTRL RPAY A+RDAFR+GA LKRAP FSLFDTCFDLS EVKVPT
Sbjct: 358 GNGGVIIDSGTSVTRLIRPAYIAMRDAFRVGAKTLKRAPNFSLFDTCFDLSNMNEVKVPT 417
Query: 532 VVLHFRGADVSLPATNYLIPVDSSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLGSSRV 353
VVLHFR ADVSLPATNYLIPVD++G FCFAFAGTM GLSIIGNIQQQGFRVVYDL SSRV
Sbjct: 418 VVLHFRRADVSLPATNYLIPVDTNGKFCFAFAGTMGGLSIIGNIQQQGFRVVYDLASSRV 477
Query: 352 GFAPRGCA 329
GFAP GCA
Sbjct: 478 GFAPGGCA 485
>ref|NP_191741.1| putative protein; protein id: At3g61820.1, supported by cDNA:
gi_14532549 [Arabidopsis thaliana]
gi|11357465|pir||T47974 hypothetical protein F15G16.210
- Arabidopsis thaliana gi|6850873|emb|CAB71112.1|
putative protein [Arabidopsis thaliana]
Length = 483
Score = 208 bits (529), Expect = 7e-53
Identities = 102/127 (80%), Positives = 109/127 (85%)
Frame = -2
Query: 712 GNGGVIIDSGTSVTRLTRPAYTALRDAFRLGASHLKRAPEFSLFDTCFDLSGQTEVKVPT 533
GNGGVIIDSGTSVTRLT+PAY ALRDAFRLGA+ LKRAP +SLFDTCFDLSG T VKVPT
Sbjct: 357 GNGGVIIDSGTSVTRLTQPAYVALRDAFRLGATKLKRAPSYSLFDTCFDLSGMTTVKVPT 416
Query: 532 VVLHFRGADVSLPATNYLIPVDSSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLGSSRV 353
VV HF G +VSLPA+NYLIPV++ G FCFAFAGTM LSIIGNIQQQGFRV YDL SRV
Sbjct: 417 VVFHFGGGEVSLPASNYLIPVNTEGRFCFAFAGTMGSLSIIGNIQQQGFRVAYDLVGSRV 476
Query: 352 GFAPRGC 332
GF R C
Sbjct: 477 GFLSRAC 483
>dbj|BAB63755.1| nucleoid DNA-binding protein cnd41-like protein [Oryza sativa
(japonica cultivar-group)]
Length = 500
Score = 178 bits (451), Expect = 8e-44
Identities = 90/129 (69%), Positives = 103/129 (79%), Gaps = 2/129 (1%)
Frame = -2
Query: 712 GNGGVIIDSGTSVTRLTRPAYTALRDAFRLGASHLKRAPE-FSLFDTCFDLSGQTEVKVP 536
G GGVI+DSGTSVTRL RPAY ALRDAFR A+ L+ +P FSLFDTC+DLSG VKVP
Sbjct: 372 GRGGVIVDSGTSVTRLARPAYAALRDAFRAAAAGLRLSPGGFSLFDTCYDLSGLKVVKVP 431
Query: 535 TVVLHFRG-ADVSLPATNYLIPVDSSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLGSS 359
TV +HF G A+ +LP NYLIPVDS G+FCFAFAGT G+SIIGNIQQQGFRVV+D
Sbjct: 432 TVSMHFAGGAEAALPPENYLIPVDSRGTFCFAFAGTDGGVSIIGNIQQQGFRVVFDGDGQ 491
Query: 358 RVGFAPRGC 332
R+GF P+GC
Sbjct: 492 RLGFVPKGC 500
>ref|NP_173922.1| hypothetical protein; protein id: At1g25510.1, supported by cDNA:
gi_20466515 [Arabidopsis thaliana]
gi|25518510|pir||D86385 hypothetical protein F2J7.6 -
Arabidopsis thaliana
gi|12321511|gb|AAG50814.1|AC079281_16 hypothetical
protein [Arabidopsis thaliana]
gi|20466516|gb|AAM20575.1| unknown protein [Arabidopsis
thaliana] gi|23198172|gb|AAN15613.1| unknown protein
[Arabidopsis thaliana]
Length = 483
Score = 145 bits (367), Expect = 4e-34
Identities = 71/128 (55%), Positives = 94/128 (72%), Gaps = 1/128 (0%)
Frame = -2
Query: 712 GNGGVIIDSGTSVTRLTRPAYTALRDAFRLGASHLKRAPEFSLFDTCFDLSGQTEVKVPT 533
G+GG+IIDSGT+VTRL Y +LRD+F G L++A ++FDTC++LS +T V+VPT
Sbjct: 356 GSGGIIIDSGTAVTRLQTEIYNSLRDSFVKGTLDLEKAAGVAMFDTCYNLSAKTTVEVPT 415
Query: 532 VVLHFRGADV-SLPATNYLIPVDSSGSFCFAFAGTMSGLSIIGNIQQQGFRVVYDLGSSR 356
V HF G + +LPA NY+IPVDS G+FC AFA T S L+IIGN+QQQG RV +DL +S
Sbjct: 416 VAFHFPGGKMLALPAKNYMIPVDSVGTFCLAFAPTASSLAIIGNVQQQGTRVTFDLANSL 475
Query: 355 VGFAPRGC 332
+GF+ C
Sbjct: 476 IGFSSNKC 483
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 601,944,962
Number of Sequences: 1393205
Number of extensions: 13750123
Number of successful extensions: 71455
Number of sequences better than 10.0: 395
Number of HSP's better than 10.0 without gapping: 56217
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 69581
length of database: 448,689,247
effective HSP length: 120
effective length of database: 281,504,647
effective search space used: 32936043699
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)