Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC004073A_C01 KMC004073A_c01
(642 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_566327.1| expressed protein; protein id: At3g08010.1, sup... 226 2e-58
dbj|BAA92865.1| ORF285 [Synechococcus sp. PCC 6301] gi|22002499|... 97 2e-19
ref|NP_488928.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2... 84 2e-15
ref|ZP_00072211.1| hypothetical protein [Trichodesmium erythraeu... 78 9e-14
gb|ZP_00112111.1| hypothetical protein [Nostoc punctiforme] 77 2e-13
>ref|NP_566327.1| expressed protein; protein id: At3g08010.1, supported by cDNA:
35360., supported by cDNA: gi_18252180 [Arabidopsis
thaliana] gi|6648213|gb|AAF21211.1|AC013483_35 unknown
protein [Arabidopsis thaliana]
gi|18252181|gb|AAL61923.1| unknown protein [Arabidopsis
thaliana] gi|24899681|gb|AAN65055.1| unknown protein
[Arabidopsis thaliana]
Length = 374
Score = 226 bits (576), Expect = 2e-58
Identities = 102/150 (68%), Positives = 129/150 (86%)
Frame = -2
Query: 641 EDLFGERWAFVQLPFSAVREELTSLQTNTIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRA 462
E+LFGE+WAFVQLP+SAVREE++ +FG+ LDLDL+GIE+D+ T+IPGL+V +SRA
Sbjct: 225 ENLFGEKWAFVQLPYSAVREEISDFDEKFVFGASLDLDLLGIEVDENTLIPGLSVATSRA 284
Query: 461 TVLSAIMNSFELCTVEADTARGSLILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACG 282
L+A MN E+C++EAD+++G LILSVGI+TRYVYATYKKTP TT EAEAWE+AKK G
Sbjct: 285 KPLAAWMNGLEVCSIEADSSKGCLILSVGIATRYVYATYKKTPVTTDEAEAWESAKKTSG 344
Query: 281 GLHFLAIQQDIESEECAGFWLLLDLPPPPV 192
GLHFLAIQ D++S++C GFWLL+DLPPPPV
Sbjct: 345 GLHFLAIQDDLDSDDCVGFWLLIDLPPPPV 374
>dbj|BAA92865.1| ORF285 [Synechococcus sp. PCC 6301] gi|22002499|gb|AAM82651.1|
unknown [Synechococcus sp. PCC 7942]
Length = 285
Score = 96.7 bits (239), Expect = 2e-19
Identities = 59/144 (40%), Positives = 81/144 (55%)
Frame = -2
Query: 635 LFGERWAFVQLPFSAVREELTSLQTNTIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRATV 456
L G+RWAFV LPF+A+ E + FG L GI++ D+T IPGL + +SRA
Sbjct: 146 LRGDRWAFVDLPFAALAEHG---EWGIDFGEAFPL--AGIDLPDETPIPGLIIFASRAMP 200
Query: 455 LSAIMNSFELCTVEADTARGSLILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACGGL 276
++A ++ E + D+ L+L G S R+ A P EA + AAK+A GL
Sbjct: 201 IAAWLSGLEPAWLTYDSPAKQLLLETGGSERWTLAALN-VPALQQEATQFNAAKQAAKGL 259
Query: 275 HFLAIQQDIESEECAGFWLLLDLP 204
HFLA+Q D S+ AGFWLL +LP
Sbjct: 260 HFLAVQVDPNSDRFAGFWLLRELP 283
>ref|NP_488928.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25365178|pir||AH2416
hypothetical protein alr4888 [imported] - Nostoc sp.
(strain PCC 7120) gi|17134025|dbj|BAB76587.1|
ORF_ID:alr4888~hypothetical protein [Nostoc sp. PCC
7120]
Length = 286
Score = 83.6 bits (205), Expect = 2e-15
Identities = 50/144 (34%), Positives = 79/144 (54%), Gaps = 1/144 (0%)
Frame = -2
Query: 635 LFGERWAFVQLPFSAVREELTSLQTNTIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRATV 456
L G++W FV L +A E+ + F LD +++ +T IPG+ + S RA
Sbjct: 147 LEGQQWVFVSLS-AADLAEMPDWEIG--FSEAFPLDF--VQVSPETRIPGVLIFSPRALP 201
Query: 455 LSAIMNSFELCTVEADTARGS-LILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACGG 279
++ M+ EL + DT++G L+L G + ++ A K PTT EA +E AK+ G
Sbjct: 202 IAGWMSGLELAFLRVDTSQGMRLVLETGATESWILANIKN-PTTVQEARGFEEAKQKANG 260
Query: 278 LHFLAIQQDIESEECAGFWLLLDL 207
+HF+ +Q + E+E AGFWLL +L
Sbjct: 261 VHFIGVQSNPEAESFAGFWLLQEL 284
>ref|ZP_00072211.1| hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 286
Score = 78.2 bits (191), Expect = 9e-14
Identities = 52/144 (36%), Positives = 77/144 (53%), Gaps = 1/144 (0%)
Frame = -2
Query: 635 LFGERWAFVQLPFSAVREELTSLQTNTIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRATV 456
L GERW FV L A E + + FG L +M + + IPGL + SSRA
Sbjct: 147 LIGERWTFVSLEAGAFTE---MSEWDIDFGEAFPLSMMNLA--PLSAIPGLIIYSSRAQA 201
Query: 455 LSAIMNSFELCTVEADTARGS-LILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACGG 279
L+A M+ EL ++ A + L+L+ G + ++ A P+T +EA+ + AK
Sbjct: 202 LAAWMSGLELAFIKFSPASPARLLLNTGGNDCWILANLSN-PSTIAEAKRFSEAKSKAKE 260
Query: 278 LHFLAIQQDIESEECAGFWLLLDL 207
+HFLA+Q + ESE AGFWLL ++
Sbjct: 261 VHFLAVQSNPESESFAGFWLLQEI 284
>gb|ZP_00112111.1| hypothetical protein [Nostoc punctiforme]
Length = 286
Score = 77.0 bits (188), Expect = 2e-13
Identities = 47/144 (32%), Positives = 79/144 (54%), Gaps = 1/144 (0%)
Frame = -2
Query: 635 LFGERWAFVQLPFSAVREELTSLQTNTIFGSGLDLDLMGIEIDDKTMIPGLAVGSSRATV 456
L G++W FV L + + E + FG L+L ++ + IPG+ + S RA
Sbjct: 147 LEGQQWVFVTLDAADLAE---MPEWEIGFGEAFPLELA--KVSPEARIPGILIFSPRALP 201
Query: 455 LSAIMNSFELCTVEADTARGS-LILSVGISTRYVYATYKKTPTTTSEAEAWEAAKKACGG 279
L+ M+ EL + DT+ + L+L G++ ++ A KK P +EA+ +E AK+ G
Sbjct: 202 LAGWMSGLELAFLRFDTSEEARLLLETGVNESWIVANIKK-PQVLAEAKGFEEAKQKANG 260
Query: 278 LHFLAIQQDIESEECAGFWLLLDL 207
+HF+ IQ D +++ AGFWLL ++
Sbjct: 261 VHFIGIQSDPKAQSFAGFWLLQEV 284
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 520,961,590
Number of Sequences: 1393205
Number of extensions: 10580152
Number of successful extensions: 26854
Number of sequences better than 10.0: 30
Number of HSP's better than 10.0 without gapping: 26077
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 26838
length of database: 448,689,247
effective HSP length: 118
effective length of database: 284,291,057
effective search space used: 27007650415
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)