Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC003464A_C01 KMC003464A_c01
(667 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
                                                                    Score     E
Sequences producing significant alignments:                        (bits)  Value
pir||T45722 hypothetical protein F1P2.170 - Arabidopsis thaliana...    59  4e-08
ref|NP_190346.2| putative protein; protein id: At3g47620.1, supp...    59  4e-08
ref|NP_564973.1| expressed protein; protein id: At1g69690.1, sup...    56  4e-07
pir||T03371 glycine-rich protein grp3 - maize gi|1532071|emb|CAA...    52  5e-06
emb|CAC24662.1| ala-pro rich protein [Leishmania major]                52  7e-06
>pir||T45722 hypothetical protein F1P2.170 - Arabidopsis thaliana
gi|6522545|emb|CAB61988.1| putative protein [Arabidopsis
thaliana]
Length = 477
Score = 59.3 bits (142), Expect = 4e-08
Identities = 54/184 (29%), Positives = 74/184 (39%), Gaps = 41/184 (22%)
Frame = -3
Query: 665 SNVGSLPASHASNTAA--FWMVA------GNGGNQGVSGG-------GGASNGDPIWAIP 531
SN GS + A+ FWMVA G GGN +GG G G+P+W P
Sbjct: 300 SNSGSTATAAAAQQIPGNFWMVAAAAAAGGGGGNNNQTGGLMTASIGTGGGGGEPVWTFP 359
Query: 530 SVGNSG--VYRGAVP-----AAAGGIHFMNFASPMMPFMPGSQ---------LGSSLMGN 399
S+ + +YR V A + G+HFMNFA+P M F+ G Q + N
Sbjct: 360 SINTAAAALYRSGVSGVPSGAVSSGLHFMNFAAP-MAFLTGQQQLATTSNHEINEDSNNN 418
Query: 398 GGGGSGGSA----------FMGESNLGMLSALNGYRQNPANGLPESPTSAGQHHGGGGGD 249
GG S G + + +LS LN Y + + + A GGG +
Sbjct: 419 EGGRSDGGGDHHNTQRHHHHQQQHHHNILSGLNQYGRQVS-----GDSQASGSLGGGDEE 473
Query: 248 DGHD 237
D D
Sbjct: 474 DQQD 477
>ref|NP_190346.2| putative protein; protein id: At3g47620.1, supported by cDNA:
gi_16604510 [Arabidopsis thaliana]
gi|16604511|gb|AAL24261.1| AT3g47620/F1P2_170
[Arabidopsis thaliana] gi|21655289|gb|AAM65356.1|
AT3g47620/F1P2_170 [Arabidopsis thaliana]
Length = 489
Score = 59.3 bits (142), Expect = 4e-08
Identities = 54/184 (29%), Positives = 74/184 (39%), Gaps = 41/184 (22%)
Frame = -3
Query: 665 SNVGSLPASHASNTAA--FWMVA------GNGGNQGVSGG-------GGASNGDPIWAIP 531
SN GS + A+ FWMVA G GGN +GG G G+P+W P
Sbjct: 312 SNSGSTATAAAAQQIPGNFWMVAAAAAAGGGGGNNNQTGGLMTASIGTGGGGGEPVWTFP 371
Query: 530 SVGNSG--VYRGAVP-----AAAGGIHFMNFASPMMPFMPGSQ---------LGSSLMGN 399
S+ + +YR V A + G+HFMNFA+P M F+ G Q + N
Sbjct: 372 SINTAAAALYRSGVSGVPSGAVSSGLHFMNFAAP-MAFLTGQQQLATTSNHEINEDSNNN 430
Query: 398 GGGGSGGSA----------FMGESNLGMLSALNGYRQNPANGLPESPTSAGQHHGGGGGD 249
GG S G + + +LS LN Y + + + A GGG +
Sbjct: 431 EGGRSDGGGDHHNTQRHHHHQQQHHHNILSGLNQYGRQVS-----GDSQASGSLGGGDEE 485
Query: 248 DGHD 237
D D
Sbjct: 486 DQQD 489
>ref|NP_564973.1| expressed protein; protein id: At1g69690.1, supported by cDNA:
gi_15912212, supported by cDNA: gi_19547990 [Arabidopsis
thaliana] gi|25404829|pir||G96718 unknown protein,
54453-53476 [imported] - Arabidopsis thaliana
gi|12325189|gb|AAG52540.1|AC013289_7 unknown protein;
54453-53476 [Arabidopsis thaliana]
gi|15912213|gb|AAL08240.1| At1g69690/T6C23_11
[Arabidopsis thaliana] gi|19547991|gb|AAL87359.1|
At1g69690/T6C23_11 [Arabidopsis thaliana]
Length = 325
Score = 56.2 bits (134), Expect = 4e-07
Identities = 56/160 (35%), Positives = 68/160 (42%), Gaps = 11/160 (6%)
Frame = -3
Query: 665 SNVGSLPASHASNTAAFWMVAGNGGNQGVSGGGGASNGDPIWAIP-SVGNSGVYRGAV-- 495
S GSLP S + TA FW N N +WA + +SGV G V
Sbjct: 190 STAGSLPTSQSPATAPFWSSGDNTQN--------------LWAFNINPHHSGVVAGDVYN 235
Query: 494 PAAAG-----GIHFMNFASPMMPFMPGSQLGSSLMGNGGGGSGGSAFMGESNLGMLSALN 330
P + G G+H MNFA+P+ F G L S G GGGG GG S+ G+L+ALN
Sbjct: 236 PNSGGSGGGSGVHLMNFAAPIALF-SGQPLAS---GYGGGGGGGGE---HSHYGVLAALN 288
Query: 329 -GYR--QNPANGLPESPTSAGQHHGGGGGDDGHDDSTSQH 219
YR N G HH + D STS H
Sbjct: 289 AAYRPVAETGNHNNNQQNRDGDHH----HNHQEDGSTSHH 324
>pir||T03371 glycine-rich protein grp3 - maize gi|1532071|emb|CAA69104.1|
glycine-rich protein [Zea mays]
Length = 256
Score = 52.4 bits (124), Expect = 5e-06
Identities = 42/121 (34%), Positives = 52/121 (42%), Gaps = 2/121 (1%)
Frame = -3
Query: 608 VAGNGGNQGVSGGGGASNGDPIWAIPSVGNSGVYRGAVPAAAGGIHFMNFASPMMPFMPG 429
VAG GG G GGGG +NG S G SG G AA G N+A+ G
Sbjct: 81 VAGGGG--GGQGGGGGTNGGS----GSGGGSGYGSGTSSTAASGPSSGNYANAEGKGAGG 134
Query: 428 SQLGSS--LMGNGGGGSGGSAFMGESNLGMLSALNGYRQNPANGLPESPTSAGQHHGGGG 255
G + G+G GG G GES + + + +GY A + AG HGGG
Sbjct: 135 GMGGGADGAYGSGAGGGVGKG-QGESGVALAPSSDGYYNGGAADATGGGSGAGGGHGGGA 193
Query: 254 G 252
G
Sbjct: 194 G 194
Score = 42.7 bits (99), Expect = 0.004
Identities = 43/138 (31%), Positives = 54/138 (38%), Gaps = 1/138 (0%)
Frame = -3
Query: 653 SLPASHASNTAAFWMVAGNGGNQGVSGGGGASNGDPIWAIPSVGNSGVYRGAVPAAAGGI 474
S+ S A+ A GG G GGG S G A G SG G GG
Sbjct: 17 SVGFSDAARVVRLGSYASAGGGGGGGGGGSGSTG----AAGYGGGSGGGGGYGIGKGGGD 72
Query: 473 HFMNFASPMMPFMPGSQLGSSLMGNGGGGSGGSAF-MGESNLGMLSALNGYRQNPANGLP 297
+ NF S + G Q G G G GGS + G S+ +A +G P++G
Sbjct: 73 WWNNFVSSVAGGGGGGQGGGGGTNGGSGSGGGSGYGSGTSS----TAASG----PSSGNY 124
Query: 296 ESPTSAGQHHGGGGGDDG 243
+ G G GGG DG
Sbjct: 125 ANAEGKGAGGGMGGGADG 142
Score = 36.6 bits (83), Expect = 0.31
Identities = 42/148 (28%), Positives = 52/148 (34%), Gaps = 7/148 (4%)
Frame = -3
Query: 665 SNVGSLPASHASNTAAFWMVAGNGGNQGVSGGGGASNGDPIWAIPSVGNSGVYRGAVPAA 486
S GS S S+TAA +GN N G GG G A S GV +G +
Sbjct: 101 SGGGSGYGSGTSSTAASGPSSGNYANAEGKGAGGGMGGGADGAYGSGAGGGVGKGQGESG 160
Query: 485 AGGIHFMNFASPMMPFMPGSQLGSSL------MGNGGGGSGGSAFMGESNLGMLSALNGY 324
+ P G G + G GGG GG+ G L+
Sbjct: 161 VA----------LAPSSDGYYNGGAADATGGGSGAGGGHGGGAGAPSYGTGGGLAEARAR 210
Query: 323 RQNPANGLP-ESPTSAGQHHGGGGGDDG 243
RQ + G + AG GGGGG G
Sbjct: 211 RQRRSWGSGYAAGIGAGTGGGGGGGFQG 238
>emb|CAC24662.1| ala-pro rich protein [Leishmania major]
Length = 356
Score = 52.0 bits (123), Expect = 7e-06
Identities = 45/143 (31%), Positives = 61/143 (42%), Gaps = 8/143 (5%)
Frame = +1
Query: 235 SSCPSSPPPPP*CCPALVGDSGRPLAGFCL*PLRAESMPKLLSPINALPP------LPPP 396
+S S PPPPP P + PL C+ P + L P A PP +PPP
Sbjct: 2 TSSMSVPPPPPAAIPLPSSATAVPLPPSCVVPPPPPAAAVPLPPAEATPPPPSAPSVPPP 61
Query: 397 PLPIRLEPN*LPGIKGIIGEAKFMK*IPPAAAGTAPLYTPLFPTDGIAQMGSPLLAPPPP 576
P + P PGI + A PP+A APL P + + ++APPPP
Sbjct: 62 PASVIPLPQ-PPGINAAVSAAAPPPPPPPSAIAAAPL-----PPEAAPPAPTSVVAPPPP 115
Query: 577 L--TPWLPPLPATIQNAAVLEAW 639
+ P + P +Q +V EAW
Sbjct: 116 MQAAPGV-TAPPPVQPISV-EAW 136
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 648,661,368
Number of Sequences: 1393205
Number of extensions: 17095309
Number of successful extensions: 155167
Number of sequences better than 10.0: 1892
Number of HSP's better than 10.0 without gapping: 73297
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 119252
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 28855580904
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
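Note on the statistics above: the reported bit scores and E-values appear consistent with the standard Karlin-Altschul relations, S' = (Lambda*S - ln K) / ln 2 and E = N * 2^(-S'), where S is the raw score shown in parentheses on each "Score =" line and N is the "effective search space used". The sketch below is a minimal cross-check using only numbers printed in this report. The function and constant names are hypothetical, and the assumption that this BLAST release applied exactly these formulas (with the gapped Lambda and K) is inferred from the agreement of the numbers, not stated by the report.

```python
import math

# Values copied from the statistics block of this report.
LAMBDA_GAPPED = 0.267          # gapped Lambda
K_GAPPED = 0.0410              # gapped K
SEARCH_SPACE = 28855580904     # "effective search space used"
DB_LETTERS = 448689247         # "length of database"
DB_SEQUENCES = 1393205         # "Number of Sequences"
EFFECTIVE_HSP_LENGTH = 119     # "effective HSP length"


def bit_score(raw_score, lam=LAMBDA_GAPPED, k=K_GAPPED):
    """Convert a raw alignment score to bits (Karlin-Altschul normalization)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space=SEARCH_SPACE):
    """Expected number of chance HSPs scoring at least `bits` in the search space."""
    return search_space * 2.0 ** (-bits)


def effective_db_length(letters=DB_LETTERS, sequences=DB_SEQUENCES,
                        hsp_len=EFFECTIVE_HSP_LENGTH):
    """Database length after subtracting the effective HSP length per sequence."""
    return letters - sequences * hsp_len


if __name__ == "__main__":
    # Raw scores taken from the "Score = ... bits (raw)" lines above.
    for raw in (142, 134, 124, 123, 99, 83):
        bits = bit_score(raw)
        print(f"raw={raw:4d}  bits={bits:5.1f}  E={expect_value(bits):.1e}")
    print("effective length of database:", effective_db_length())
```

The recomputed E-values can differ slightly in the last digit from the printed ones because the report rounds Lambda and K to three significant figures.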