Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC001054A_C01 KMC001054A_c01
(539 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
sp|P32292|ARG2_PHAAU INDOLE-3-ACETIC ACID INDUCED PROTEIN ARG2 g... 114 6e-25
gb|AAF05766.1|AF192758_1 indole-3-acetic acid induced protein AR... 91 9e-18
pir||T01312 hypothetical protein T14P8.2 - Arabidopsis thaliana ... 88 6e-17
ref|NP_567231.1| coded for by A. thaliana cDNA T46835; protein i... 87 2e-16
pir||T01984 late-embryogenesis protein lea5 - common tobacco gi|... 80 2e-14
>sp|P32292|ARG2_PHAAU INDOLE-3-ACETIC ACID INDUCED PROTEIN ARG2 gi|7488882|pir||T10900
late-embryogenesis protein homolog - mung bean
gi|287564|dbj|BAA03307.1| ORF [Vigna radiata]
Length = 99
Score = 114 bits (286), Expect = 6e-25
Identities = 61/96 (63%), Positives = 72/96 (74%), Gaps = 3/96 (3%)
Frame = -3
Query: 519 MARSFTNIKAISALVAEEFSHSLARRGYAATAP---SAGRVGASMSGKMGSTKSGEEKAA 349
MARSFTN+K +SALVA+ FS++ R G+AA A SA R GAS+ G M KSGEEK
Sbjct: 1 MARSFTNVKVLSALVADGFSNTTTRHGFAAAAAATQSATRGGASIGGNM-VPKSGEEKVR 59
Query: 348 AREKVSWVPDPVTGYYKPENIKEIDVAELRSAVLGK 241
EKVSWVPDPVTGYY+PEN EIDVA++R+ VLGK
Sbjct: 60 GGEKVSWVPDPVTGYYRPENTNEIDVADMRATVLGK 95
>gb|AAF05766.1|AF192758_1 indole-3-acetic acid induced protein ARG-2 homolog [Glycine max]
Length = 86
Score = 90.9 bits (224), Expect = 9e-18
Identities = 50/93 (53%), Positives = 60/93 (63%)
Frame = -3
Query: 519 MARSFTNIKAISALVAEEFSHSLARRGYAATAPSAGRVGASMSGKMGSTKSGEEKAAARE 340
MARS N K SALV + FS RRGY+ +A G + KSGE+K +
Sbjct: 1 MARSIANAKTFSALVLDGFS----RRGYSQSATRGGVASIA-------PKSGEDKGVSSY 49
Query: 339 KVSWVPDPVTGYYKPENIKEIDVAELRSAVLGK 241
KVSWVPDPVTGYYKPENIKE+DVA+LR+ +L K
Sbjct: 50 KVSWVPDPVTGYYKPENIKEVDVADLRATLLRK 82
>pir||T01312 hypothetical protein T14P8.2 - Arabidopsis thaliana
gi|3193289|gb|AAC19273.1| similar to several small
proteins (~100 aa) that are induced by heat, auxin,
ethylene and wounding such as Phaseolus aureus
indole-3-acetic acid induced protein ARG (SW:32292)
[Arabidopsis thaliana] gi|7268998|emb|CAB80731.1| coded
for by A. thaliana cDNA AA041171, coded for by A.
thaliana cDNA R65517, coded for by A. thaliana cDNA
AA042089, coded for by A. thaliana cDNA W43164, coded
for by A. thaliana cDNA H37120, coded for by A. thaliana
cDNA T46835~similarity to similar to several small
proteins (~~100 aa) that are induced by heat, auxin,
ethylene and wounding such as Phaseolus aureus
indole-3-acetic acid induced protein ARG
(SW:32292)~contains EST gb:AI995253.1, AA042089, W43164,
T46835, R65517, H37120, AA041171 [Arabidop>
Length = 206
Score = 88.2 bits (217), Expect = 6e-17
Identities = 52/100 (52%), Positives = 67/100 (67%), Gaps = 6/100 (6%)
Frame = -3
Query: 528 SSPMARSFTNIKAISALVAEEFSHSLARRGYAATA-----PSAGRVGASMSGKMGSTKSG 364
+S MARS +N+K +SA V+ E S+++ RRGYAATA S GR GA S M K G
Sbjct: 107 TSKMARSISNVKIVSAFVSRELSNAIFRRGYAATAAQGSVSSGGRSGAVASAVM--KKKG 164
Query: 363 EEKAAAREKVSWVPDPVTGYYKPE-NIKEIDVAELRSAVL 247
E++ +K+SWVPDP TGYY+PE EID AELR+A+L
Sbjct: 165 VEEST--QKISWVPDPKTGYYRPETGSNEIDAAELRAALL 202
>ref|NP_567231.1| coded for by A. thaliana cDNA T46835; protein id: At4g02380.1,
supported by cDNA: 23194., supported by cDNA:
gi_14517507, supported by cDNA: gi_15294219, supported
by cDNA: gi_15450608, supported by cDNA: gi_15809759
[Arabidopsis thaliana] gi|14517508|gb|AAK62644.1|
AT4g02380/T14P8_2 [Arabidopsis thaliana]
gi|15294220|gb|AAK95287.1|AF410301_1 AT4g02380/T14P8_2
[Arabidopsis thaliana] gi|15450609|gb|AAK96576.1|
AT4g02380/T14P8_2 [Arabidopsis thaliana]
gi|15809760|gb|AAL06808.1| AT4g02380/T14P8_2
[Arabidopsis thaliana] gi|21592389|gb|AAM64340.1| late
embryogenis abundant protein [Arabidopsis thaliana]
Length = 97
Score = 86.7 bits (213), Expect = 2e-16
Identities = 51/97 (52%), Positives = 65/97 (66%), Gaps = 6/97 (6%)
Frame = -3
Query: 519 MARSFTNIKAISALVAEEFSHSLARRGYAATA-----PSAGRVGASMSGKMGSTKSGEEK 355
MARS +N+K +SA V+ E S+++ RRGYAATA S GR GA S M K G E+
Sbjct: 1 MARSISNVKIVSAFVSRELSNAIFRRGYAATAAQGSVSSGGRSGAVASAVM--KKKGVEE 58
Query: 354 AAAREKVSWVPDPVTGYYKPE-NIKEIDVAELRSAVL 247
+ +K+SWVPDP TGYY+PE EID AELR+A+L
Sbjct: 59 ST--QKISWVPDPKTGYYRPETGSNEIDAAELRAALL 93
>pir||T01984 late-embryogenesis protein lea5 - common tobacco
gi|2981167|gb|AAC06242.1| late embryogenis abundant
protein 5 [Nicotiana tabacum]
Length = 97
Score = 79.7 bits (195), Expect = 2e-14
Identities = 46/92 (50%), Positives = 60/92 (65%), Gaps = 1/92 (1%)
Frame = -3
Query: 519 MARSFTNIKAISALVAEEFSHSLARRGYAATAPSAGRVGASMSGKMGSTKSGEEKAAARE 340
MARSF+N K ISA V + S ++RRGYAA + ++ G SG K EE ++++
Sbjct: 1 MARSFSNSKLISAFVVDTVSSFVSRRGYAAASSASVPGGVRGSGVNIMMKKWEE--SSKK 58
Query: 339 KVSWVPDPVTGYYKPE-NIKEIDVAELRSAVL 247
SWVPDPVTGYY+PE + KEID AELR +L
Sbjct: 59 TTSWVPDPVTGYYRPESHAKEIDAAELRQMLL 90
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 496,448,903
Number of Sequences: 1393205
Number of extensions: 11365609
Number of successful extensions: 35219
Number of sequences better than 10.0: 40
Number of HSP's better than 10.0 without gapping: 33656
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 35126
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18462123008
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)