Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC020468A_C01 KMC020468A_c01
(545 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_193655.1| putative protein; protein id: At4g19200.1, supp... 104 6e-22
gb|AAH05782.1| Unknown (protein for MGC:12025) [Mus musculus] 102 3e-21
emb|CAB61840.1| putative glycine and proline-rich protein [Sporo... 100 2e-20
ref|NP_568642.1| expressed protein; protein id: At5g45350.1, sup... 94 1e-18
ref|NP_197267.1| glycine/proline-rich protein; protein id: At5g1... 92 4e-18
>ref|NP_193655.1| putative protein; protein id: At4g19200.1, supported by cDNA: 8188.
[Arabidopsis thaliana] gi|25407569|pir||A85217
hypothetical protein AT4g19200 [imported] - Arabidopsis
thaliana gi|7268715|emb|CAB78922.1| putative protein
[Arabidopsis thaliana] gi|21595622|gb|AAM66118.1|
unknown [Arabidopsis thaliana]
gi|24417344|gb|AAN60282.1| unknown [Arabidopsis
thaliana] gi|27311843|gb|AAO00887.1| Unknown protein
[Arabidopsis thaliana]
Length = 179
Score = 104 bits (260), Expect = 6e-22
Identities = 60/102 (58%), Positives = 66/102 (63%), Gaps = 14/102 (13%)
Frame = -1
Query: 446 SPGGHQ-GH--GGMGAMLAGGAAAAAAAYGAHGA-HAAHGAHGYA--HGGYAQG-----G 300
+PG H GH GG+G M+AG A AAAAAYGAH HA+H +G+A HGGY G
Sbjct: 80 APGAHHSGHSGGGLGGMIAGAAGAAAAAYGAHHVGHASHNPYGHAVGHGGYGHAPAHGFG 139
Query: 299 HMGHGKFKQH---GKFKHGKHGKFGKHKHGKFGKHGGFKKWK 183
H GHGKFK GKFKHGKHGK G KHG FG G FKKWK
Sbjct: 140 HGGHGKFKHGKHGGKFKHGKHGKHG--KHGMFGGGGKFKKWK 179
>gb|AAH05782.1| Unknown (protein for MGC:12025) [Mus musculus]
Length = 198
Score = 102 bits (254), Expect = 3e-21
Identities = 58/94 (61%), Positives = 60/94 (63%), Gaps = 8/94 (8%)
Frame = -1
Query: 440 GGHQGHGGMGAMLAGGAAAAAAAYGAHGAHAAHGAHGYAHGGYAQGGHMGHGKFKQHGKF 261
GG G G MG MLAGGAAAAAAAYG H H G+HG+ HGG GH G G GKF
Sbjct: 113 GGSGGMGAMGGMLAGGAAAAAAAYGVH--HLTSGSHGH-HGGGGPLGHFGGG--HHGGKF 167
Query: 260 KHGKHGKFGKHKHGKFGKHGG--------FKKWK 183
KHGKHGKF KHGKFGKHGG FKKWK
Sbjct: 168 KHGKHGKF---KHGKFGKHGGGMFGGGKKFKKWK 198
Score = 38.1 bits (87), Expect = 0.069
Identities = 29/89 (32%), Positives = 31/89 (34%), Gaps = 8/89 (8%)
Frame = -1
Query: 440 GGHQGHGGMGAMLAG--GAAAAAAAYGAHGAHAAHGAHGYAHGGY------AQGGHMGHG 285
GGH GHG GA Y G HG + HGGY QGG+ G
Sbjct: 25 GGHGGHGYPPGQYPPPPGAYPPQQGYPPQGYPPQHGGYPPQHGGYPPSGYPPQGGYPPSG 84
Query: 284 KFKQHGKFKHGKHGKFGKHKHGKFGKHGG 198
Q G G G G H G H G
Sbjct: 85 YPPQAGYPPGGYPGAHGSHSGGHGSHHAG 113
Score = 32.0 bits (71), Expect = 5.0
Identities = 21/65 (32%), Positives = 26/65 (39%), Gaps = 10/65 (15%)
Frame = -1
Query: 365 AHGAHAAHGAHGYAHGGY--------AQGGHMGHGKFKQHGKF--KHGKHGKFGKHKHGK 216
AHG HG HGY G Y Q G+ G QHG + +HG + G G
Sbjct: 20 AHGLAGGHGGHGYPPGQYPPPPGAYPPQQGYPPQGYPPQHGGYPPQHGGYPPSGYPPQGG 79
Query: 215 FGKHG 201
+ G
Sbjct: 80 YPPSG 84
>emb|CAB61840.1| putative glycine and proline-rich protein [Sporobolus stapfianus]
Length = 197
Score = 99.8 bits (247), Expect = 2e-20
Identities = 67/108 (62%), Positives = 71/108 (65%), Gaps = 21/108 (19%)
Frame = -1
Query: 443 PGG-HQG-----HGG--MGAMLAGGAAAAAAAYGAHG-AHAAHGAHGY--AHGGYAQG-- 303
PGG HQG HGG MG +LAGGAAAAAAAYGAH +H G HG+ HGGYA G
Sbjct: 93 PGGSHQGGHSSSHGGGNMG-LLAGGAAAAAAAYGAHKLSHGHSGGHGFPGGHGGYAVGGY 151
Query: 302 --GHMGHGKFKQ----HGKFKHGKHGKF--GKHKHGKFGKHGGFKKWK 183
G+ GHGKFK HGKFKHG HGKF GKH HG FG G FKKWK
Sbjct: 152 GHGYGGHGKFKHGHGGHGKFKHG-HGKFKHGKHGHGMFG-GGKFKKWK 197
>ref|NP_568642.1| expressed protein; protein id: At5g45350.1, supported by cDNA:
22538., supported by cDNA: gi_15529251 [Arabidopsis
thaliana] gi|2129603|pir||S65780 glycine/proline-rich
protein GPRP - Arabidopsis thaliana
gi|1465364|emb|CAA59059.1| GPRP [Arabidopsis thaliana]
gi|9758725|dbj|BAB09163.1| gene_id:MFC19.1~unknown
protein [Arabidopsis thaliana]
gi|15529252|gb|AAK97720.1| AT5g45350/MFC19_1
[Arabidopsis thaliana] gi|16974403|gb|AAL31127.1|
AT5g45350/MFC19_1 [Arabidopsis thaliana]
gi|21592344|gb|AAM64295.1| unknown [Arabidopsis
thaliana]
Length = 177
Score = 93.6 bits (231), Expect = 1e-18
Identities = 51/89 (57%), Positives = 59/89 (65%), Gaps = 2/89 (2%)
Frame = -1
Query: 443 PGGHQGH-GGMGAMLAGGAAAAAAAYGAHG-AHAAHGAHGYAHGGYAQGGHMGHGKFKQH 270
P H GH GG+G M+AG AAAAYGAH AH++HG +G+A G+ G G+G H
Sbjct: 94 PAHHSGHAGGIGGMIAG----AAAAYGAHHVAHSSHGPYGHAAYGHGFGHGHGYGYGHGH 149
Query: 269 GKFKHGKHGKFGKHKHGKFGKHGGFKKWK 183
GKFKHGKHGKF KHG FG G FKKWK
Sbjct: 150 GKFKHGKHGKFKHGKHGMFG-GGKFKKWK 177
>ref|NP_197267.1| glycine/proline-rich protein; protein id: At5g17650.1 [Arabidopsis
thaliana] gi|11357316|pir||T51469 glycine/proline-rich
protein - Arabidopsis thaliana
gi|9755790|emb|CAC01909.1| glycine/proline-rich protein
[Arabidopsis thaliana]
Length = 173
Score = 92.0 bits (227), Expect = 4e-18
Identities = 55/91 (60%), Positives = 60/91 (65%), Gaps = 3/91 (3%)
Frame = -1
Query: 446 SPGGHQGHGGMGAMLAGGAAAAAAAYGAHGAHAAHGAHGYAHG-GYAQGGHMGHGKFKQH 270
S GH HGG+GA++AGG AAAA GAH HG +G+ HG GY G H GHGKFK H
Sbjct: 94 SHSGHH-HGGIGAIIAGGVAAAA---GAHHMSHHHGHYGHHHGHGYGYGYH-GHGKFK-H 147
Query: 269 GKFKHGKHGKFGKHKHGKFGKHGG--FKKWK 183
GKFKHGK G KHG FGKH G FKKWK
Sbjct: 148 GKFKHGKFG-----KHGMFGKHKGKFFKKWK 173
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 503,883,885
Number of Sequences: 1393205
Number of extensions: 12653752
Number of successful extensions: 111339
Number of sequences better than 10.0: 1754
Number of HSP's better than 10.0 without gapping: 59389
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 90568
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 18660035355
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)