Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC020398A_C01 KMC020398A_c01
(577 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_180260.1| unknown protein; protein id: At2g26920.1 [Arabi... 121 6e-27
ref|NP_196773.1| putative protein; protein id: At5g12120.1, supp... 99 4e-20
ref|XP_209582.1| hypothetical protein XP_209582 [Homo sapiens] 40 0.028
ref|NP_650744.1| CG7709-PA [Drosophila melanogaster] gi|7300427|... 39 0.047
dbj|BAB85832.1| cell wall protein Awa1p [Saccharomyces cerevisiae] 37 0.18
>ref|NP_180260.1| unknown protein; protein id: At2g26920.1 [Arabidopsis thaliana]
gi|7485400|pir||T02643 hypothetical protein At2g26920
[imported] - Arabidopsis thaliana
gi|3426036|gb|AAC32235.1| unknown protein [Arabidopsis
thaliana]
Length = 646
Score = 121 bits (304), Expect = 6e-27
Identities = 74/143 (51%), Positives = 94/143 (64%), Gaps = 1/143 (0%)
Frame = -3
Query: 575 SPTLAAAASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLA-SPRSNPL 399
SP ++AA+SLGLFS +GSA TSGASSP+DW +GGSV DYT+IDWSLD+ L+ + R N
Sbjct: 512 SPPISAASSLGLFSAVGSAGTSGASSPIDWISGGSV--DYTNIDWSLDQSLSQNSRVNGN 569
Query: 398 WLGLSPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRD 219
W GL K+S+ YD + + + M +N + V M S GV AA S D
Sbjct: 570 WSGL----KSSSQIYDENMNRYSLNGSMGGRLNNSNGVSMENS-GAGVIVETQQAATSPD 624
Query: 218 WSSPFEGKDLFSLPRQFVSSPSL 150
W+SPFEGKD+FSL RQ+V SPSL
Sbjct: 625 WTSPFEGKDIFSLSRQYV-SPSL 646
>ref|NP_196773.1| putative protein; protein id: At5g12120.1, supported by cDNA:
gi_15983776, supported by cDNA: gi_20260155 [Arabidopsis
thaliana] gi|9759379|dbj|BAB10030.1|
gene_id:MXC9.8~pir||T02643~strong similarity to unknown
protein [Arabidopsis thaliana]
gi|15983777|gb|AAL10485.1| AT5g12120/MXC9_8 [Arabidopsis
thaliana] gi|20260156|gb|AAM12976.1| unknown protein
[Arabidopsis thaliana]
Length = 619
Score = 99.0 bits (245), Expect = 4e-20
Identities = 68/146 (46%), Positives = 89/146 (60%), Gaps = 5/146 (3%)
Frame = -3
Query: 572 PTLAAAASLGLFSGIGSAATSGASSPVDWSTGGSV-QFDYTSIDWSLDRGLASPRSNPLW 396
P +A AASLGLFSG GS A S +SS +DW+T GS+ DY +IDWSLD+GLA R + +
Sbjct: 491 PAVAPAASLGLFSGFGSLAGS-SSSGLDWNTDGSLGHLDYNNIDWSLDKGLACVRPSQQY 549
Query: 395 LGL-SPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAA---G 228
+ SPYS S+ Y++ +G + +NG+ + GVA G AA
Sbjct: 550 VAASSPYSAASS-PYEAHMNGRTRMM------TNGNGM--------GVAMGVQEAALVGN 594
Query: 227 SRDWSSPFEGKDLFSLPRQFVSSPSL 150
R+W+SPFEGKDLFSL RQ+V PSL
Sbjct: 595 GREWTSPFEGKDLFSLSRQYV-PPSL 619
>ref|XP_209582.1| hypothetical protein XP_209582 [Homo sapiens]
Length = 252
Score = 39.7 bits (91), Expect = 0.028
Identities = 39/141 (27%), Positives = 64/141 (44%)
Frame = -3
Query: 575 SPTLAAAASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLW 396
SP+ ++++S S S+++S +SSP S+ S +S +SP S+
Sbjct: 91 SPSSSSSSSSSSPSSSNSSSSSSSSSPSSSSSSSSSSPSSSS---------SSPSSSSSS 141
Query: 395 LGLSPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDW 216
SP S +S+ + SS+S +++ P S PS+ P S + +S +S + S
Sbjct: 142 SSSSPSSSSSSPSSSSSSSSSSSSSPSSSSPSSSG--SSPSSSNSSPSSSSSSPSSSS-- 197
Query: 215 SSPFEGKDLFSLPRQFVSSPS 153
SSP S SSPS
Sbjct: 198 SSPSPRSSSPSSSSSSTSSPS 218
Score = 35.8 bits (81), Expect = 0.40
Identities = 36/135 (26%), Positives = 56/135 (40%), Gaps = 2/135 (1%)
Frame = -3
Query: 551 SLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLWLGLSPYSK 372
S + S +++S +SSP + S ++S S +SP S+ SP S
Sbjct: 47 SSSILSSSIPSSSSSSSSPSSSPSSSSPSSSHSSSSPSSSSSTSSPSSSSSSSSSSPSSS 106
Query: 371 TSAHTYDSSASGAAAQLPMRSVPSNGSIVP--MPGSQDGGVASGDASAAGSRDWSSPFEG 198
S+ + SS+S +++ S PS+ S P S +S +S + S SS
Sbjct: 107 NSSSS-SSSSSPSSSSSSSSSSPSSSSSSPSSSSSSSSSSPSSSSSSPSSSSSSSSSSSS 165
Query: 197 KDLFSLPRQFVSSPS 153
S P SSPS
Sbjct: 166 SPSSSSPSSSGSSPS 180
Score = 32.7 bits (73), Expect = 3.4
Identities = 35/134 (26%), Positives = 56/134 (41%), Gaps = 1/134 (0%)
Frame = -3
Query: 575 SPTLAAAASLGLFSGIGSAATSGASSPVDWSTG-GSVQFDYTSIDWSLDRGLASPRSNPL 399
SP+ + ++S S S+++S +SSP S+ S +S S +S S+
Sbjct: 102 SPSSSNSSSSSSSSSPSSSSSSSSSSPSSSSSSPSSSSSSSSSSPSSSSSSPSSSSSSSS 161
Query: 398 WLGLSPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRD 219
SP S + + + S +S ++ S PS+ S P P S +S S+ S
Sbjct: 162 SSSSSPSSSSPSSSGSSPSSSNSSPSSSSSSPSSSSSSPSPRSSSPSSSSSSTSSPSS-- 219
Query: 218 WSSPFEGKDLFSLP 177
SSP S P
Sbjct: 220 -SSPSSSSPSSSCP 232
>ref|NP_650744.1| CG7709-PA [Drosophila melanogaster] gi|7300427|gb|AAF55584.1|
CG7709-PA [Drosophila melanogaster]
Length = 950
Score = 38.9 bits (89), Expect = 0.047
Identities = 37/137 (27%), Positives = 56/137 (40%)
Frame = -3
Query: 563 AAAASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLWLGLS 384
A +AS SA +S S+P S+ G S +S +P S G
Sbjct: 605 APSASANSGGSYPSAPSSSYSAPSSSSSSGGPYASAPSSSYS------APSSGSNSGGPY 658
Query: 383 PYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDWSSPF 204
P + +S+++ S+++ + P S PS+ P PGS GG S++ S S
Sbjct: 659 PAAPSSSYSAPSASANSGGSYP--SAPSSSYSAPSPGSNSGGPYPAAPSSSYSAPSPSAN 716
Query: 203 EGKDLFSLPRQFVSSPS 153
G S P S+PS
Sbjct: 717 SGGPYASAPSSSYSAPS 733
Score = 34.7 bits (78), Expect = 0.89
Identities = 36/135 (26%), Positives = 50/135 (36%)
Frame = -3
Query: 557 AASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLWLGLSPY 378
AA +S ++A SG S P S+ S SP SN PY
Sbjct: 660 AAPSSSYSAPSASANSGGSYPSAPSSSYSAP---------------SPGSN----SGGPY 700
Query: 377 SKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDWSSPFEG 198
+ +Y + + A + P S PS+ P S GG + S++ S SS G
Sbjct: 701 PAAPSSSYSAPSPSANSGGPYASAPSSSYSAPSSSSNSGGPYAAAPSSSYSAPSSSSSSG 760
Query: 197 KDLFSLPRQFVSSPS 153
S P S+PS
Sbjct: 761 GPYPSAPSSSYSAPS 775
Score = 34.3 bits (77), Expect = 1.2
Identities = 35/137 (25%), Positives = 53/137 (38%)
Frame = -3
Query: 563 AAAASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLWLGLS 384
A +AS SA +S S+P S G S +S SP +N
Sbjct: 668 APSASANSGGSYPSAPSSSYSAPSPGSNSGGPYPAAPSSSYSAP----SPSAN----SGG 719
Query: 383 PYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDWSSPF 204
PY+ + +Y + +S + + P + PS+ P S GG S++ S SS
Sbjct: 720 PYASAPSSSYSAPSSSSNSGGPYAAAPSSSYSAPSSSSSSGGPYPSAPSSSYSAPSSSLS 779
Query: 203 EGKDLFSLPRQFVSSPS 153
G S P ++PS
Sbjct: 780 SGGPYPSAPSSSYAAPS 796
Score = 33.1 bits (74), Expect = 2.6
Identities = 28/106 (26%), Positives = 41/106 (38%)
Frame = -3
Query: 524 SAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLWLGLSPYSKTSAHTYDSS 345
SA +S S+P S G S +S +S PY + +Y +
Sbjct: 723 SAPSSSYSAPSSSSNSGGPYAAAPSSSYSAPSSSSSSGG--------PYPSAPSSSYSAP 774
Query: 344 ASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDWSSP 207
+S ++ P S PS+ P P S G G AA S +S+P
Sbjct: 775 SSSLSSGGPYPSAPSSSYAAPSPSSNSG----GPYPAAPSNSYSAP 816
Score = 32.3 bits (72), Expect = 4.4
Identities = 39/156 (25%), Positives = 57/156 (36%), Gaps = 21/156 (13%)
Frame = -3
Query: 557 AASLGLFSGIGSAATSGASSPVD--------WSTGGSVQFDYTSIDWSLDRGLASPRSNP 402
A S+ F + SA ++ +P +S+G S ++ S G S P
Sbjct: 494 APSVSSFVPLPSAPSTNYGAPSKTQSLGSSGYSSGPSSSYEAPVAPPSSSYGAPSSSFQP 553
Query: 401 LWLGLSPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSI-------------VPMPGSQDG 261
+ S Y S+ + SS S +AA + S PS GS P + G
Sbjct: 554 ISPPSSSYGAPSSGSGSSSGSFSAAPSSLYSAPSKGSSGGSFQSAPSSSYSAPSASANSG 613
Query: 260 GVASGDASAAGSRDWSSPFEGKDLFSLPRQFVSSPS 153
G S++ S SS G S P S+PS
Sbjct: 614 GSYPSAPSSSYSAPSSSSSSGGPYASAPSSSYSAPS 649
>dbj|BAB85832.1| cell wall protein Awa1p [Saccharomyces cerevisiae]
Length = 1713
Score = 37.0 bits (84), Expect = 0.18
Identities = 37/133 (27%), Positives = 55/133 (40%)
Frame = -3
Query: 575 SPTLAAAASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLW 396
S + A+ AS + SG ++ TSG+SS + + S T S+ ++ S
Sbjct: 545 STSSASTASGSITSGTLTSITSGSSSATESGSSVSGSSSATESGSSVSGSTSATES---- 600
Query: 395 LGLSPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDW 216
G S TSA SSASG+++ S S + GS G S S + +
Sbjct: 601 -GSSVSGSTSATESGSSASGSSSATESGSSVSGSTSATESGSSVSGSTSATESGSSASGS 659
Query: 215 SSPFEGKDLFSLP 177
SS E S+P
Sbjct: 660 SSATESGSASSVP 672
Score = 35.8 bits (81), Expect = 0.40
Identities = 36/122 (29%), Positives = 54/122 (43%)
Frame = -3
Query: 575 SPTLAAAASLGLFSGIGSAATSGASSPVDWSTGGSVQFDYTSIDWSLDRGLASPRSNPLW 396
S + A+ AS + SG ++ TSG+SS + + S T S+ ++ S
Sbjct: 735 STSSASTASGSITSGTLTSITSGSSSATESGSSASGSSSATESGSSVSGSTSATES---- 790
Query: 395 LGLSPYSKTSAHTYDSSASGAAAQLPMRSVPSNGSIVPMPGSQDGGVASGDASAAGSRDW 216
G S TSA SSASG+++ S S + ++ G ASG +SA S
Sbjct: 791 -GSSVSGSTSATESGSSASGSSSATESGSSVSGST----SATESGSSASGSSSATESGSA 845
Query: 215 SS 210
SS
Sbjct: 846 SS 847
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 529,459,329
Number of Sequences: 1393205
Number of extensions: 11934374
Number of successful extensions: 47586
Number of sequences better than 10.0: 119
Number of HSP's better than 10.0 without gapping: 42732
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 47126
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21530810025
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)