Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC016405A_C01 KMC016405A_c01
(772 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_564667.1| thylakoid lumen 18.3 kDa protein; protein id: A... 322 5e-87
ref|ZP_00072906.1| hypothetical protein [Trichodesmium erythraeu... 109 4e-23
ref|NP_488140.1| hypothetical protein [Nostoc sp. PCC 7120] gi|2... 100 2e-20
ref|NP_681194.1| ORF_ID:tll0404~hypothetical protein [Thermosyne... 99 9e-20
gb|ZP_00110146.1| hypothetical protein [Nostoc punctiforme] 96 6e-19
>ref|NP_564667.1| thylakoid lumen 18.3 kDa protein; protein id: At1g54780.1,
supported by cDNA: 3853., supported by cDNA:
gi_14030682, supported by cDNA: gi_17064781, supported
by cDNA: gi_19698896, supported by cDNA: gi_20259867
[Arabidopsis thaliana] gi|25405770|pir||H96589
hypothetical protein T22H22.19 [imported] - Arabidopsis
thaliana gi|3776572|gb|AAC64889.1| ESTs gb|R65052,
gb|AA712146, gb|H76533, gb|H76282, gb|AA650771,
gb|H76287, gb|AA650887, gb|N37383, gb|Z29721 and
gb|Z29722 come from this gene. [Arabidopsis thaliana]
gi|14030683|gb|AAK53016.1|AF375432_1 At1g54780/T22H22_19
[Arabidopsis thaliana] gi|17064782|gb|AAL32545.1|
Unknown protein [Arabidopsis thaliana]
gi|19698897|gb|AAL91184.1| unknown protein [Arabidopsis
thaliana] gi|20259868|gb|AAM13281.1| unknown protein
[Arabidopsis thaliana] gi|21593390|gb|AAM65339.1|
unknown [Arabidopsis thaliana]
gi|23198362|gb|AAN15708.1| unknown protein [Arabidopsis
thaliana] gi|23505937|gb|AAN28828.1| At1g54780/T22H22_19
[Arabidopsis thaliana]
Length = 285
Score = 322 bits (824), Expect = 5e-87
Identities = 158/185 (85%), Positives = 177/185 (95%)
Frame = -2
Query: 771 VDDAGVLSRVTRSDLKGLLSDLESRKKFHINFITVRKLTSKADAFEYADQVLERWYPSVE 592
VDDAGVLSRVT+SDLK LLSDLE RKK +NFITVRKLTSKADAFEYADQVLE+WYPS+E
Sbjct: 101 VDDAGVLSRVTKSDLKKLLSDLEYRKKLRLNFITVRKLTSKADAFEYADQVLEKWYPSIE 160
Query: 591 EGNDKGIVVLVTSQKEGAVTGGPAFVQAVGENILDATVAENLPVLATDEKYNEAVLSTAK 412
EGN+KGIVVL+TSQKEGA+TGGPAF++AVGENILDATV+ENLPVLATDEKYNEAV S+AK
Sbjct: 161 EGNNKGIVVLITSQKEGAITGGPAFIEAVGENILDATVSENLPVLATDEKYNEAVYSSAK 220
Query: 411 RLVAAIDGLPDPGGPQVKDNKRESNFRTKEETEQKRGQFSLVVGGLLVVAFVVPMLQYYA 232
RLVAAIDG PDPGGP VKD+KRESNF+TKEET++KRGQFSLVVGGLLV+AFVVPM QY+A
Sbjct: 221 RLVAAIDGQPDPGGPTVKDSKRESNFKTKEETDEKRGQFSLVVGGLLVIAFVVPMAQYFA 280
Query: 231 YVAKK 217
YV++K
Sbjct: 281 YVSRK 285
>ref|ZP_00072906.1| hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 242
Score = 109 bits (273), Expect = 4e-23
Identities = 66/185 (35%), Positives = 103/185 (55%), Gaps = 4/185 (2%)
Frame = -2
Query: 771 VDDAGVLSRVTRSDLKGLLSDLESRKKFHINFITVRKLTSKADAFEYADQVLERWYPSVE 592
VDDA VLSRVT++ L L +L + + F+T+R+L A + +++ ++W+P++E
Sbjct: 55 VDDADVLSRVTKNKLNNTLENLANLTGNEVRFVTIRRLDYGETADSFTEKLFDKWFPTLE 114
Query: 591 EGNDKGIVVLVTSQKEGAVTGGPAFVQAVGENILDATVAENLPVLATD-EKYNEAVLSTA 415
++ +VVL T A+ G A + +I + V E + V D KYNEA L+ +
Sbjct: 115 AKANQTLVVLDTLTNNDAIRIGDAVKIFMSNDITQSLVNETIQVPIRDGNKYNEAFLAAS 174
Query: 414 KRLVAAIDGLPDPGGPQVKDN---KRESNFRTKEETEQKRGQFSLVVGGLLVVAFVVPML 244
RL A + G PDPG P +KD + + F++ EET + +VV LLV+A VVPM
Sbjct: 175 DRLTAVLSGEPDPGPPDIKDELSAQVAATFKSAEETNDQSATVLVVV--LLVIATVVPMA 232
Query: 243 QYYAY 229
Y+ Y
Sbjct: 233 TYFWY 237
>ref|NP_488140.1| hypothetical protein [Nostoc sp. PCC 7120] gi|25359462|pir||AE2318
hypothetical protein alr4100 [imported] - Nostoc sp.
(strain PCC 7120) gi|17133235|dbj|BAB75799.1|
ORF_ID:alr4100~hypothetical protein [Nostoc sp. PCC
7120]
Length = 245
Score = 100 bits (249), Expect = 2e-20
Identities = 59/185 (31%), Positives = 93/185 (49%), Gaps = 2/185 (1%)
Frame = -2
Query: 771 VDDAGVLSRVTRSDLKGLLSDLESRKKFHINFITVRKLTSKADAFEYADQVLERWYPSVE 592
+D V+SR+ + L DL + F+T+ +L +A + E+W+PS E
Sbjct: 56 LDQGDVISRINEGAISSSLEDLAKETGKEVRFVTIHRLDYGETPESFAQALFEKWFPSKE 115
Query: 591 EGNDKGIVVLVTSQKEGAVTGGPAFVQAVGENILDATVAENLPVLATD-EKYNEAVLSTA 415
++ ++VL T A+ G + + I ++ E L D KYN+A L +
Sbjct: 116 AQANQILLVLDTVTNGTAIITGDEVKPLLTDTIANSVAEETLAAPLRDGNKYNQAFLDAS 175
Query: 414 KRLVAAIDGLPDPGGPQVKDNKR-ESNFRTKEETEQKRGQFSLVVGGLLVVAFVVPMLQY 238
RLVA + G PDPG PQ+ D + E F+ EET+ +G + V GLL+ A ++PM Y
Sbjct: 176 DRLVAVLSGQPDPGPPQIVDKVQVEGTFKKAEETD--KGNATAWVVGLLIAATIIPMATY 233
Query: 237 YAYVA 223
Y Y+A
Sbjct: 234 YIYLA 238
>ref|NP_681194.1| ORF_ID:tll0404~hypothetical protein [Thermosynechococcus elongatus
BP-1] gi|22294125|dbj|BAC07956.1|
ORF_ID:tll0404~hypothetical protein [Thermosynechococcus
elongatus BP-1]
Length = 228
Score = 98.6 bits (244), Expect = 9e-20
Identities = 53/181 (29%), Positives = 93/181 (51%)
Frame = -2
Query: 771 VDDAGVLSRVTRSDLKGLLSDLESRKKFHINFITVRKLTSKADAFEYADQVLERWYPSVE 592
+D+ VLS VT+ + L DL +++ +T+ +L + D + +W+P E
Sbjct: 46 IDEGNVLSAVTQGSVGRSLQDLSEATGINVHVVTLHRLDYGETPQSFVDDLFSQWFPDPE 105
Query: 591 EGNDKGIVVLVTSQKEGAVTGGPAFVQAVGENILDATVAENLPVLATDEKYNEAVLSTAK 412
++ I+ L T A+ G A + + ++ V E + V + YN+AVL T
Sbjct: 106 SQANQVIIALDTVTNGTAIHYGDAVAERLNPETAESIVQETMRVPLREGNYNQAVLDTVD 165
Query: 411 RLVAAIDGLPDPGGPQVKDNKRESNFRTKEETEQKRGQFSLVVGGLLVVAFVVPMLQYYA 232
RL + G PDPG P V++ E +++KEET+ + +++V LL+ A V+PM+ Y+
Sbjct: 166 RLGKVLKGEPDPGPPVVREVVVEKTYKSKEETDDRSA--TIIVVALLIAATVIPMVTYFM 223
Query: 231 Y 229
Y
Sbjct: 224 Y 224
>gb|ZP_00110146.1| hypothetical protein [Nostoc punctiforme]
Length = 254
Score = 95.9 bits (237), Expect = 6e-19
Identities = 59/186 (31%), Positives = 93/186 (49%), Gaps = 5/186 (2%)
Frame = -2
Query: 771 VDDAGVLSRVTRSDLKGLLSDLESRKKFHINFITVRKLTSKADAFEYADQVLERWYPSVE 592
+D V+SR+ + DL + + +TVR+L + ++ E+W+P+ E
Sbjct: 65 LDQGEVISRLNEGKISSAFEDLAKQTNKEVRIVTVRRLDYGETPESFTKELFEKWFPTKE 124
Query: 591 -EGNDKGIVVLVTSQKEGAVTGG---PAFVQAVGENILDATVAENLPVLATDEKYNEAVL 424
+ N +V+ + +TG P A+ E++ TV+ +P L KYN+A L
Sbjct: 125 AQANQTLLVIDTVTNGTSIITGDEVKPLLTDAIAESVATETVS--VP-LRNGNKYNQAFL 181
Query: 423 STAKRLVAAIDGLPDPGGPQVKDNKR-ESNFRTKEETEQKRGQFSLVVGGLLVVAFVVPM 247
+ RLVA + G DPG PQ+ DN + E ++ EET Q G + V GLL+ A V+PM
Sbjct: 182 DASDRLVAVLSGKADPGPPQITDNVQVEGTYKKAEETNQ--GNATAWVVGLLIAATVIPM 239
Query: 246 LQYYAY 229
YY Y
Sbjct: 240 ATYYIY 245
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 661,854,952
Number of Sequences: 1393205
Number of extensions: 14399839
Number of successful extensions: 42726
Number of sequences better than 10.0: 36
Number of HSP's better than 10.0 without gapping: 40914
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 42613
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37815044670
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)