Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC016848A_C01 KMC016848A_c01
(655 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_191902.1| putative protein; protein id: At3g63430.1 [Arab... 134 1e-30
pir||S50755 hypothetical protein VSP-3 - Chlamydomonas reinhardt... 59 4e-08
ref|NP_173297.1| unknown protein; protein id: At1g18620.1, suppo... 58 1e-07
ref|NP_194482.1| putative protein; protein id: At4g27520.1, supp... 57 2e-07
gb|AAM64815.1| unknown [Arabidopsis thaliana] 57 2e-07
>ref|NP_191902.1| putative protein; protein id: At3g63430.1 [Arabidopsis thaliana]
gi|11288849|pir||T49184 hypothetical protein MAA21.60 -
Arabidopsis thaliana gi|7573326|emb|CAB87796.1| putative
protein [Arabidopsis thaliana]
Length = 540
Score = 134 bits (336), Expect = 1e-30
Identities = 73/153 (47%), Positives = 102/153 (65%), Gaps = 7/153 (4%)
Frame = -2
Query: 648 LPEESDVFVLLEKR---KGKDTSRASKLPRRLIFDTLQEILNRNQKLPPWKXAARGE--- 487
LP+ESD F LEK+ KGK SRA+ RRLIFD +QEI+ R + LPPW +
Sbjct: 397 LPQESDSFSFLEKQQYLKGKCASRAAAQERRLIFDAVQEIVARRRSLPPWMMVGEADNKM 456
Query: 486 ETVWSEFRRIRDREESASED-MFGVICGVLRKDMAAEMSGWGEWTVEMGDVVLDIERLVF 310
+ +WSEF++IRD++ S ED + G +CGVL +D++ + W ++ VEM + VLD+ERL+F
Sbjct: 457 QVIWSEFQKIRDKKSSTEEDDLVGYVCGVLGRDLSEDR--WRDFQVEMSEAVLDVERLIF 514
Query: 309 KDLIGETIQQLASFGPQCNNSNKVSALRRKLVF 211
KDLIGETI+QLA N+ +LRR+L+F
Sbjct: 515 KDLIGETIRQLAFL-------NRSDSLRRRLLF 540
>pir||S50755 hypothetical protein VSP-3 - Chlamydomonas reinhardtii
gi|530876|gb|AAB53953.1| amino acid feature: Rod protein
domain, aa 266 .. 468; amino acid feature: globular
protein domain, aa 32 .. 265
Length = 473
Score = 59.3 bits (142), Expect = 4e-08
Identities = 45/111 (40%), Positives = 54/111 (48%), Gaps = 11/111 (9%)
Frame = +3
Query: 345 PSP-PSIHPNHSSPPPCPSSTRRISPRTYPPTPTP-----PCHGSSETPT-----TPSPP 491
PSP PS+ P S P P PS + SPR PP P+P P S +P+ +PSP
Sbjct: 335 PSPSPSVQPA-SKPSPSPSPSPSPSPRPSPPLPSPSPSPSPSPSPSPSPSPKPSPSPSPS 393
Query: 492 PSLPXSTAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPP 644
PS P S S S SP+ K + S + S PSP ASP K SPP
Sbjct: 394 PS-PSPKPSPSPSPSPSPSPSPKVSPSPSPSPSPSPSPKASPSPAKKPSPP 443
Score = 52.8 bits (125), Expect = 4e-06
Identities = 39/107 (36%), Positives = 53/107 (49%), Gaps = 5/107 (4%)
Frame = +3
Query: 345 PSP---PSIHPN-HSSPPPCPSSTRRISPRTYP-PTPTPPCHGSSETPTTPSPPPSLPXS 509
PSP PS P+ +SP P PS + SP+ P P+P+P +S+ +PSP PS
Sbjct: 301 PSPKASPSPSPSPKASPSPSPSPSPSPSPKASPSPSPSPSVQPASKPSPSPSPSPSPSPR 360
Query: 510 TAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPPAS 650
+ S S SP+ + S + K S PSPS SP + SP S
Sbjct: 361 PSPPLPSPSPSPSPSPSPSPSPSPKPSPSPSPSPSPSPKPSPSPSPS 407
Score = 46.6 bits (109), Expect = 3e-04
Identities = 42/111 (37%), Positives = 50/111 (44%), Gaps = 9/111 (8%)
Frame = +3
Query: 345 PSP---PSIHPNHSSPPPCPSSTRRISPRTYP-----PTPTPPCHGSSETPTTPSPPPSL 500
PSP PS P +SP P PS SP P P+P+P S + +PSP PS+
Sbjct: 283 PSPKASPSPSPK-ASPSPSPSPKASPSPSPSPKASPSPSPSPSPSPSPKASPSPSPSPSV 341
Query: 501 -PXSTAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPPAS 650
P S S S S SP+ R S + S PSPS SP SP S
Sbjct: 342 QPASKPSPSPSPSPSPSP-RPSPPLPSPSPSPSPSPSPSPSPSPKPSPSPS 391
Score = 45.8 bits (107), Expect = 5e-04
Identities = 36/101 (35%), Positives = 45/101 (43%), Gaps = 1/101 (0%)
Frame = +3
Query: 345 PSPPSIHPNHSSPPPCPSSTRRISPRTYP-PTPTPPCHGSSETPTTPSPPPSLPXSTAGV 521
PSP + SP P P ++ SP+ P P+P+P S SP PS P +
Sbjct: 269 PSPKASPSPKVSPSPSPKASPSPSPKASPSPSPSPKASPSPSPSPKASPSPS-PSPSPSP 327
Query: 522 SGSGSESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPP 644
S S SP+ + A K S PSPS SP R SPP
Sbjct: 328 SPKASPSPSP--SPSVQPASKPSPSPSPSPSPSPRP--SPP 364
Score = 44.3 bits (103), Expect = 0.001
Identities = 32/96 (33%), Positives = 44/96 (45%), Gaps = 2/96 (2%)
Frame = +3
Query: 369 NHSSPPPCPSSTRRISPRTYPPTPTPPCHGSSETPTTPSPPPSLPXSTAGVSGSGSESPA 548
N + P PS SP+ P+P+P S +PSP PS P ++ S S SP+
Sbjct: 261 NRTGASPSPSPKASPSPKV-SPSPSPKASPSPSPKASPSPSPS-PKASPSPSPSPKASPS 318
Query: 549 ACRKSTASVAWKLSTCPSPSAS--PVTRKHRSPPAS 650
+ S + K S PSPS S P ++ SP S
Sbjct: 319 PSPSPSPSPSPKASPSPSPSPSVQPASKPSPSPSPS 354
Score = 32.3 bits (72), Expect = 5.7
Identities = 25/61 (40%), Positives = 33/61 (53%), Gaps = 10/61 (16%)
Frame = +3
Query: 342 RPSP-PSIHPNHS---SPPPCPSSTRRISPRTYPPTPTP-----PCHGSSETPT-TPSPP 491
+PSP PS P+ S SP P PS + SP+ P+P+P P +S +P PSPP
Sbjct: 385 KPSPSPSPSPSPSPKPSPSPSPSPSPSPSPKV-SPSPSPSPSPSPSPKASPSPAKKPSPP 443
Query: 492 P 494
P
Sbjct: 444 P 444
Score = 32.0 bits (71), Expect = 7.5
Identities = 19/51 (37%), Positives = 25/51 (48%), Gaps = 2/51 (3%)
Frame = +3
Query: 345 PSP-PSIHPNHSSPPPCPSSTRRISPR-TYPPTPTPPCHGSSETPTTPSPP 491
PSP P + P+ SP P PS + + SP P+P PP + P PP
Sbjct: 410 PSPSPKVSPS-PSPSPSPSPSPKASPSPAKKPSPPPPVEEGAPPPIEGPPP 459
>ref|NP_173297.1| unknown protein; protein id: At1g18620.1, supported by cDNA:
gi_20856608 [Arabidopsis thaliana]
gi|25518757|pir||H86319 hypothetical protein F26I16.4 -
Arabidopsis thaliana gi|9795593|gb|AAF98411.1|AC026238_3
Unknown protein [Arabidopsis thaliana]
gi|20856609|gb|AAM26675.1| At1g18620/F25I16_13
[Arabidopsis thaliana]
Length = 978
Score = 57.8 bits (138), Expect = 1e-07
Identities = 39/148 (26%), Positives = 68/148 (45%), Gaps = 28/148 (18%)
Frame = -2
Query: 645 PEESDVFVLLEKRKGKDTSRASKLPRRLIFDTLQEILNRN-----QKLPPWKXAARGEET 481
P ++F+++E+ KG +S K+ R+L+FD + E+L + + PW A+ +
Sbjct: 813 PINPELFLVIEQTKGCSSSSNEKINRKLVFDAVNEMLGKKLAFVESYVDPWMKQAKARKK 872
Query: 480 VWSEFRRIRD-----------------------REESASEDMFGVICGVLRKDMAAEMSG 370
V S +++ EE ED I L +DMA +
Sbjct: 873 VLSAQNLLKELCSEIEILQKQAKKRSENLLLLEEEEEEEEDFLKCI---LDEDMAIQSEK 929
Query: 369 WGEWTVEMGDVVLDIERLVFKDLIGETI 286
W ++ + +VLD+ERL+FKDL+ E +
Sbjct: 930 WTDFDDAIPGLVLDMERLLFKDLVKEIV 957
>ref|NP_194482.1| putative protein; protein id: At4g27520.1, supported by cDNA:
33380., supported by cDNA: gi_13358224 [Arabidopsis
thaliana] gi|7487504|pir||T05857 hypothetical protein
T29A15.10 - Arabidopsis thaliana
gi|4469003|emb|CAB38264.1| putative protein [Arabidopsis
thaliana] gi|7269606|emb|CAB81402.1| putative protein
[Arabidopsis thaliana]
gi|11762218|gb|AAG40387.1|AF325035_1 AT4g27520
[Arabidopsis thaliana] gi|23397249|gb|AAN31906.1|
unknown protein [Arabidopsis thaliana]
gi|24417234|gb|AAN60227.1| unknown [Arabidopsis
thaliana]
Length = 349
Score = 57.4 bits (137), Expect = 2e-07
Identities = 43/115 (37%), Positives = 55/115 (47%), Gaps = 4/115 (3%)
Frame = +3
Query: 318 GVRYPKRRRPSPPSIHPNHSSPPP----CPSSTRRISPRTYPPTPTPPCHGSSETPTTPS 485
G PK P P+ P S+ PP P S+ +SP T PP P GS +PTT
Sbjct: 158 GAHSPKSSSPVSPTTSPPGSTTPPGGAHSPKSSSAVSPATSPPGSMAPKSGSPVSPTT-- 215
Query: 486 PPPSLPXSTAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPPAS 650
PP+ P ST+ V S S A A +A K S+ PS++P+T SPP S
Sbjct: 216 SPPAPPKSTSPV----SPSSAPMTSPPAPMAPKSSSTIPPSSAPMT----SPPGS 262
Score = 46.6 bits (109), Expect = 3e-04
Identities = 41/108 (37%), Positives = 53/108 (48%), Gaps = 8/108 (7%)
Frame = +3
Query: 348 SPPSIHPNHSSPPP---CPSSTRRISPRTYPPTPTPPCHG--SSETPTTPSPPPSLPXST 512
+P S P +PP P S+ +SP T PP T P G S ++ + SP S P S
Sbjct: 144 APGSSTPGSMTPPGGAHSPKSSSPVSPTTSPPGSTTPPGGAHSPKSSSAVSPATSPPGSM 203
Query: 513 AGVSGSG---SESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPPA 647
A SGS + SP A KST+ V SPS++P+T SPPA
Sbjct: 204 APKSGSPVSPTTSPPAPPKSTSPV--------SPSSAPMT----SPPA 239
Score = 43.9 bits (102), Expect = 0.002
Identities = 40/108 (37%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Frame = +3
Query: 318 GVRYPKRRRPSPPSIHPNHSSPPPCPSSTRRIS----PRTYPPTPTPPCHGSSETPTT-- 479
G PK P P+ +SPP P ST +S P T PP P P S+ P++
Sbjct: 201 GSMAPKSGSPVSPT-----TSPPAPPKSTSPVSPSSAPMTSPPAPMAPKSSSTIPPSSAP 255
Query: 480 -PSPPPSL-PXSTAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASP 617
SPP S+ P S++ VS S + SP S+A ST SPS SP
Sbjct: 256 MTSPPGSMAPKSSSPVSNSPTVSP--------SLAPGGSTSSSPSDSP 295
Score = 33.9 bits (76), Expect = 2.0
Identities = 27/98 (27%), Positives = 38/98 (38%)
Frame = +3
Query: 330 PKRRRPSPPSIHPNHSSPPPCPSSTRRISPRTYPPTPTPPCHGSSETPTTPSPPPSLPXS 509
PK PPS P S PP + + SP + PT +P T ++PS PS S
Sbjct: 243 PKSSSTIPPSSAPMTS--PPGSMAPKSSSPVSNSPTVSPSLAPGGSTSSSPSDSPS--GS 298
Query: 510 TAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVT 623
G SG G + + K S+ + +T
Sbjct: 299 AMGPSGDGPSAAGDISTPAGAPGQKKSSANGMTVMSIT 336
>gb|AAM64815.1| unknown [Arabidopsis thaliana]
Length = 344
Score = 57.0 bits (136), Expect = 2e-07
Identities = 43/115 (37%), Positives = 55/115 (47%), Gaps = 4/115 (3%)
Frame = +3
Query: 318 GVRYPKRRRPSPPSIHPNHSSPPP----CPSSTRRISPRTYPPTPTPPCHGSSETPTTPS 485
G PK P P+ P S+ PP P S+ +SP T PP P GS +PTT
Sbjct: 153 GAHSPKSSSPVSPTTSPPGSTTPPGGAHSPKSSSAVSPATSPPGSMAPKSGSPVSPTT-- 210
Query: 486 PPPSLPXSTAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPPAS 650
PP+ P ST+ V S S A A +A K S+ PS++P+T SPP S
Sbjct: 211 XPPAPPKSTSPV----SPSSAPMTSPPAPMAPKSSSTIPPSSAPMT----SPPGS 257
Score = 45.1 bits (105), Expect = 9e-04
Identities = 40/108 (37%), Positives = 52/108 (48%), Gaps = 8/108 (7%)
Frame = +3
Query: 348 SPPSIHPNHSSPPP---CPSSTRRISPRTYPPTPTPPCHG--SSETPTTPSPPPSLPXST 512
+P S P +PP P S+ +SP T PP T P G S ++ + SP S P S
Sbjct: 139 APGSSTPGSMTPPGGAHSPKSSSPVSPTTSPPGSTTPPGGAHSPKSSSAVSPATSPPGSM 198
Query: 513 AGVSGSG---SESPAACRKSTASVAWKLSTCPSPSASPVTRKHRSPPA 647
A SGS + P A KST+ V SPS++P+T SPPA
Sbjct: 199 APKSGSPVSPTTXPPAPPKSTSPV--------SPSSAPMT----SPPA 234
Score = 42.4 bits (98), Expect = 0.006
Identities = 39/108 (36%), Positives = 50/108 (46%), Gaps = 8/108 (7%)
Frame = +3
Query: 318 GVRYPKRRRPSPPSIHPNHSSPPPCPSSTRRIS----PRTYPPTPTPPCHGSSETPTT-- 479
G PK P P+ + PP P ST +S P T PP P P S+ P++
Sbjct: 196 GSMAPKSGSPVSPT-----TXPPAPPKSTSPVSPSSAPMTSPPAPMAPKSSSTIPPSSAP 250
Query: 480 -PSPPPSL-PXSTAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASP 617
SPP S+ P S++ VS S + SP S+A ST SPS SP
Sbjct: 251 MTSPPGSMAPKSSSPVSNSPTVSP--------SLAPGGSTSSSPSDSP 290
Score = 33.9 bits (76), Expect = 2.0
Identities = 27/98 (27%), Positives = 38/98 (38%)
Frame = +3
Query: 330 PKRRRPSPPSIHPNHSSPPPCPSSTRRISPRTYPPTPTPPCHGSSETPTTPSPPPSLPXS 509
PK PPS P S PP + + SP + PT +P T ++PS PS S
Sbjct: 238 PKSSSTIPPSSAPMTS--PPGSMAPKSSSPVSNSPTVSPSLAPGGSTSSSPSDSPS--GS 293
Query: 510 TAGVSGSGSESPAACRKSTASVAWKLSTCPSPSASPVT 623
G SG G + + K S+ + +T
Sbjct: 294 AMGPSGDGPSAAGDISTPAGAPGQKKSSANGMTVMSIT 331
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 588,669,533
Number of Sequences: 1393205
Number of extensions: 15153927
Number of successful extensions: 179486
Number of sequences better than 10.0: 5076
Number of HSP's better than 10.0 without gapping: 85562
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 136675
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 28144814643
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)