Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000509A_C01 KMC000509A_c01
(671 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_177159.1| hypothetical protein; protein id: At1g70010.1 [... 110 2e-23
ref|NP_174020.1| polyprotein, putative; protein id: At1g26990.1 ... 109 3e-23
gb|AAF79879.1|AC000348_32 T7N9.5 [Arabidopsis thaliana] 109 3e-23
pir||T01956 hypothetical protein T2L5.9 - Arabidopsis thaliana g... 106 3e-22
pir||T01879 hypothetical protein F8M12.17 - Arabidopsis thaliana... 106 3e-22
>ref|NP_177159.1| hypothetical protein; protein id: At1g70010.1 [Arabidopsis thaliana]
gi|25301690|pir||G96722 hypothetical protein F20P5.25
[imported] - Arabidopsis thaliana
gi|2194136|gb|AAB61111.1| Strong similarity to Zea mays
retrotransposon Hopscotch polyprotein (gb|U12626).
[Arabidopsis thaliana]
Length = 1315
Score = 110 bits (275), Expect = 2e-23
Identities = 54/101 (53%), Positives = 72/101 (70%)
Frame = -1
Query: 668 EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAKL 489
EL WLT L++L+VP P+LL+CD+++A HIA N VFHERTKH++ DCH VRE+L L
Sbjct: 1215 ELVWLTNFLKELQVPLSKPTLLFCDNEAAIHIANNHVFHERTKHIESDCHSVRERLLKGL 1274
Query: 488 FHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSSA 366
F L I++ Q AD TKPL F L+SK+G+LNI+ S+
Sbjct: 1275 FELYHINTELQIADPFTKPLYPSHFHRLISKMGLLNIFVSS 1315
>ref|NP_174020.1| polyprotein, putative; protein id: At1g26990.1 [Arabidopsis thaliana]
Length = 1425
Score = 109 bits (273), Expect = 3e-23
Identities = 52/100 (52%), Positives = 70/100 (70%)
Frame = -1
Query: 668 EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAKL 489
EL WL YIL ++PF P+ LYCD+++A HIA N+VFHERTKH++ DCH VRE ++A +
Sbjct: 1324 ELIWLGYILTAFKIPFTHPAYLYCDNEAALHIANNSVFHERTKHIENDCHKVRECIEAGI 1383
Query: 488 FHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSS 369
+ + + +Q AD LTKPL PF SKLG+LNIY +
Sbjct: 1384 LKTIFVRTDNQLADTLTKPLYPKPFRENNSKLGLLNIYEA 1423
>gb|AAF79879.1|AC000348_32 T7N9.5 [Arabidopsis thaliana]
Length = 1436
Score = 109 bits (273), Expect = 3e-23
Identities = 52/100 (52%), Positives = 70/100 (70%)
Frame = -1
Query: 668 EL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAKL 489
EL WL YIL ++PF P+ LYCD+++A HIA N+VFHERTKH++ DCH VRE ++A +
Sbjct: 1335 ELIWLGYILTAFKIPFTHPAYLYCDNEAALHIANNSVFHERTKHIENDCHKVRECIEAGI 1394
Query: 488 FHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSS 369
+ + + +Q AD LTKPL PF SKLG+LNIY +
Sbjct: 1395 LKTIFVRTDNQLADTLTKPLYPKPFRENNSKLGLLNIYEA 1434
>pir||T01956 hypothetical protein T2L5.9 - Arabidopsis thaliana
gi|3695393|gb|AAC62795.1| contains similarity to
retroviral aspartyl proteases (Pfam: rvp.hmm, score:
11.80) [Arabidopsis thaliana]
Length = 1244
Score = 106 bits (264), Expect = 3e-22
Identities = 51/102 (50%), Positives = 74/102 (72%)
Frame = -1
Query: 671 CEL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 492
CE+ WL +L DL++ S +++ DS +A +IATN VFHERTKH++IDCH+VRE+L
Sbjct: 1143 CEMVWLASLLLDLKIITGSVPIVFSDSTAAIYIATNPVFHERTKHIEIDCHLVRERLDKG 1202
Query: 491 LFHLLPISSVDQTADILTKPLESGPFSHLVSKLGVLNIYSSA 366
L +L + + DQ ADILTKPL FS+L+SK+ + NI++S+
Sbjct: 1203 LIRMLHVRTEDQVADILTKPLFPHQFSYLMSKMSLHNIFASS 1244
>pir||T01879 hypothetical protein F8M12.17 - Arabidopsis thaliana
gi|3513747|gb|AAC33963.1| contains similarity to reverse
transcriptases (Pfam; rvt.hmm, score: 11.19) [Arabidopsis
thaliana]
Length = 1633
Score = 106 bits (264), Expect = 3e-22
Identities = 47/93 (50%), Positives = 67/93 (71%)
Frame = -1
Query: 671 CEL*WLTYILQDLRVPFISPSLLYCDSQSARHIATNAVFHERTKHLDIDCHVVREKLQAK 492
CE+ WL +L+DL V P+ L+CD++SA H+ATN VFHERTKH++IDCH VR++++A
Sbjct: 1336 CEIIWLQQLLKDLHVTMTCPAKLFCDNKSALHLATNPVFHERTKHIEIDCHTVRDQIKAG 1395
Query: 491 LFHLLPISSVDQTADILTKPLESGPFSHLVSKL 393
L + + +Q ADILTKPL GPF L+ ++
Sbjct: 1396 KLKTLHVPTGNQLADILTKPLHPGPFHSLLKRI 1428
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 529,377,649
Number of Sequences: 1393205
Number of extensions: 11357165
Number of successful extensions: 42910
Number of sequences better than 10.0: 454
Number of HSP's better than 10.0 without gapping: 35593
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42125
length of database: 448,689,247
effective HSP length: 119
effective length of database: 282,897,852
effective search space used: 29421376608
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)