Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC019900A_C01 KMC019900A_c01
(576 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
gb|AAN04167.1| Putative copia-like retrotransposon polyprotein [... 34 1e-04
ref|NP_192350.1| putative polyprotein [Arabidopsis thaliana] gi|... 44 0.002
ref|NP_194619.1| putative protein; protein id: At4g28900.1 [Arab... 42 0.004
ref|NP_192807.1| retrotransposon like protein; protein id: At4g1... 42 0.007
pir||T02087 gag/pol polyprotein - maize retrotransposon Hopscotc... 42 0.007
>gb|AAN04167.1| Putative copia-like retrotransposon polyprotein [Oryza sativa
(japonica cultivar-group)]
Length = 1042
Score = 33.9 bits (76), Expect(2) = 1e-04
Identities = 16/46 (34%), Positives = 23/46 (49%)
Frame = -3
Query: 529 NWCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIGH 392
NW D+GAT H+T+ DQ+ +G G+ I+ IGH
Sbjct: 218 NWYVDTGATDHITSQLEKLNTREVYKGHDQIHTASGAGMKIKHIGH 263
Score = 33.1 bits (74), Expect(2) = 1e-04
Identities = 26/128 (20%), Positives = 52/128 (40%), Gaps = 23/128 (17%)
Frame = -1
Query: 321 AVSHSPVKASLFQSSLHSQPLAQAGLASQPVGNKASSAIDLPSS--LVNSSVSSSAPCNS 148
A+ H+P + + LH A+ +++ + + S +++ S L+ + S
Sbjct: 264 AIVHTPTRPLHLNNVLHVPQAAKNLISATKLASDNSVFVEIHSKYFLIKDRTTRSTVLKG 323
Query: 147 SSNISIYEL------------------WHNRMGHPHHDVLKQTLSLCNVPV---SSNKSV 31
+Y L WH+R+GHP ++ + +S +P S+ +SV
Sbjct: 324 PRRHGLYPLPSTSSTKQAFAVAPSLERWHSRLGHPSIPIVMKVISSNKLPCLRESNKESV 383
Query: 30 FSFCKKKK 7
C+K K
Sbjct: 384 CDACQKAK 391
>ref|NP_192350.1| putative polyprotein [Arabidopsis thaliana] gi|25407268|pir||G85055
probable polyprotein [imported] - Arabidopsis thaliana
gi|4773895|gb|AAD29768.1|AF076243_15 putative
polyprotein [Arabidopsis thaliana]
gi|7267198|emb|CAB77909.1| putative polyprotein
[Arabidopsis thaliana]
Length = 1017
Score = 43.5 bits (101), Expect = 0.002
Identities = 21/54 (38%), Positives = 33/54 (60%)
Frame = -3
Query: 550 FTAAPQVNWCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIGHL 389
FT+ P DSGA+HH+ NNP + +DN+ A +++ NG +P++ IG L
Sbjct: 321 FTSEPSKTLVIDSGASHHMINNPSL-IDNIKPAL-GNVVIANGDKVPVKEIGEL 372
>ref|NP_194619.1| putative protein; protein id: At4g28900.1 [Arabidopsis thaliana]
gi|7444467|pir||T08945 hypothetical protein F25O24.20 -
Arabidopsis thaliana gi|4972079|emb|CAB43904.1| putative
protein [Arabidopsis thaliana]
gi|7269745|emb|CAB81478.1| putative protein [Arabidopsis
thaliana]
Length = 1415
Score = 42.4 bits (98), Expect = 0.004
Identities = 19/44 (43%), Positives = 25/44 (56%)
Frame = -3
Query: 526 WCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIG 395
W DSGAT H+TN+ P + +D ++VGN LPI IG
Sbjct: 293 WVTDSGATSHITNSTSQLQSAQPYSGEDSVIVGNSDFLPITHIG 336
Score = 35.4 bits (80), Expect = 0.52
Identities = 16/36 (44%), Positives = 23/36 (63%)
Frame = -1
Query: 126 ELWHNRMGHPHHDVLKQTLSLCNVPVSSNKSVFSFC 19
E+WH R+GHP+ DVL+Q L N + +K+ S C
Sbjct: 425 EVWHMRLGHPNQDVLQQLLR--NKAIVISKTSHSLC 458
>ref|NP_192807.1| retrotransposon like protein; protein id: At4g10690.1 [Arabidopsis
thaliana] gi|7444419|pir||T04204 hypothetical protein
T4F9.150 - Arabidopsis thaliana
gi|4539447|emb|CAB40035.1| retrotransposon like protein
[Arabidopsis thaliana] gi|7267767|emb|CAB81170.1|
retrotransposon like protein [Arabidopsis thaliana]
Length = 1515
Score = 41.6 bits (96), Expect = 0.007
Identities = 19/44 (43%), Positives = 25/44 (56%)
Frame = -3
Query: 526 WCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIG 395
W PDS AT H+TN ++ + D ++VGNG LPI IG
Sbjct: 324 WLPDSAATAHITNTTDGLQNSQTYSGDDSVIVGNGDFLPITHIG 367
Score = 31.6 bits (70), Expect = 7.5
Identities = 13/37 (35%), Positives = 23/37 (62%)
Frame = -1
Query: 126 ELWHNRMGHPHHDVLKQTLSLCNVPVSSNKSVFSFCK 16
E+WH R+GHP+ +VL+ + + V NK+ + C+
Sbjct: 456 EVWHQRLGHPNKEVLQHLIKTKAIVV--NKTSSNMCE 490
>pir||T02087 gag/pol polyprotein - maize retrotransposon Hopscotch
gi|531389|gb|AAA57005.1| copia-like retrotransposon
Hopscotch polyprotein
Length = 1439
Score = 41.6 bits (96), Expect = 0.007
Identities = 19/45 (42%), Positives = 25/45 (55%)
Frame = -1
Query: 153 NSSSNISIYELWHNRMGHPHHDVLKQTLSLCNVPVSSNKSVFSFC 19
N SS E WH R+GHP D++ + +S N+P SN S S C
Sbjct: 447 NFSSTRVPLEHWHKRLGHPSRDIVHRVISNNNLPCLSNNSTTSVC 491
Score = 36.6 bits (83), Expect = 0.23
Identities = 18/58 (31%), Positives = 28/58 (48%)
Frame = -3
Query: 565 ASSTQFTAAPQVNWCPDSGATHHVTNNPGIFVDNVPLATQDQLLVGNGQGLPIQSIGH 392
A+S V W D+GAT H+T + + DQ++ NG G+ I +IG+
Sbjct: 310 ANSAAHQNGSNVPWYTDTGATDHITGDLDRLTMHDKYTGTDQIIAANGTGMTISNIGN 367
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 507,856,996
Number of Sequences: 1393205
Number of extensions: 10942540
Number of successful extensions: 42841
Number of sequences better than 10.0: 162
Number of HSP's better than 10.0 without gapping: 37509
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41131
length of database: 448,689,247
effective HSP length: 116
effective length of database: 287,077,467
effective search space used: 21530810025
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)