Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC015624A_C01 KMC015624A_c01
(768 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_564838.1| expressed protein; protein id: At1g64680.1, sup... 191 6e-60
ref|NP_563673.1| Expressed protein; protein id: At1g03055.1, sup... 114 2e-24
ref|NP_682732.1| ORF_ID:tll1942~hypothetical protein [Thermosyne... 83 5e-18
ref|NP_680560.1| unknown protein; protein id: At4g01995.1, suppo... 91 2e-17
gb|AAN65325.1| Hypothetical protein F10G7.9a [Caenorhabditis ele... 32 7.7
>ref|NP_564838.1| expressed protein; protein id: At1g64680.1, supported by cDNA:
101924. [Arabidopsis thaliana] gi|25373196|pir||H96669
protein F1N19.25 [imported] - Arabidopsis thaliana
gi|6633822|gb|AAF19681.1|AC009519_15 F1N19.25
[Arabidopsis thaliana]
Length = 250
Score = 191 bits (485), Expect(2) = 6e-60
Identities = 88/118 (74%), Positives = 100/118 (84%), Gaps = 5/118 (4%)
Frame = +2
Query: 110 MTVPFFHWLVGPSEVVEVEINGVKQKSGVHIKKC-----SGCVGMCVNMCKTPTQDFFTN 274
+TVPFFHWLVGPS+V+EVE+NGVKQ+SGV IKKC SGCVGMCVNMCK PTQDFFTN
Sbjct: 133 LTVPFFHWLVGPSQVIEVEVNGVKQRSGVRIKKCRYLENSGCVGMCVNMCKIPTQDFFTN 192
Query: 275 EFGLPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSVVPQPSTSVCPKLQ 448
EFGLPLTM PN+EDMSCEM+YGQAPP FEED +KQ C A ICS + PS+ +CPKL+
Sbjct: 193 EFGLPLTMNPNYEDMSCEMIYGQAPPAFEEDVATKQPCLADICS-MSNPSSPICPKLE 249
Score = 62.4 bits (150), Expect(2) = 6e-60
Identities = 28/29 (96%), Positives = 28/29 (96%)
Frame = +3
Query: 24 LLSMLPPGAPAQFRKLFPPTKWAAEFNAA 110
LLSMLPPGAP QFRKLFPPTKWAAEFNAA
Sbjct: 104 LLSMLPPGAPEQFRKLFPPTKWAAEFNAA 132
>ref|NP_563673.1| Expressed protein; protein id: At1g03055.1, supported by cDNA:
gi_14488101 [Arabidopsis thaliana]
Length = 264
Score = 114 bits (284), Expect = 2e-24
Identities = 50/94 (53%), Positives = 67/94 (71%), Gaps = 5/94 (5%)
Frame = +2
Query: 125 FHWLVGPSEVVEVEINGVKQKSGVHIKKC-----SGCVGMCVNMCKTPTQDFFTNEFGLP 289
F WLVGPSEV E E+NG K+KS V+I+KC S CVGMC ++CK P+Q F N G+P
Sbjct: 158 FAWLVGPSEVRETEVNGRKEKSVVYIEKCRFLEQSNCVGMCTHICKIPSQIFIKNSLGMP 217
Query: 290 LTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCY 391
+ M P+F D+SC+M++G+ PP E+DP KQ C+
Sbjct: 218 IYMEPDFNDLSCKMMFGREPPEIEDDPAMKQPCF 251
>ref|NP_682732.1| ORF_ID:tll1942~hypothetical protein [Thermosynechococcus elongatus
BP-1] gi|22295668|dbj|BAC09494.1|
ORF_ID:tll1942~hypothetical protein [Thermosynechococcus
elongatus BP-1]
Length = 218
Score = 83.2 bits (204), Expect(2) = 5e-18
Identities = 46/113 (40%), Positives = 61/113 (53%), Gaps = 10/113 (8%)
Frame = +2
Query: 131 WLVGPSEVVEVEINGVKQ-----KSGVHIKKC-----SGCVGMCVNMCKTPTQDFFTNEF 280
WLVG S+ VE+ Q SGV I+KC S C+ +C+N+CK PT+ FF
Sbjct: 113 WLVGASDRYWVEVIPPNQLPQWQHSGVRIQKCRYLAESQCMALCMNLCKKPTEQFFRQRL 172
Query: 281 GLPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSVVPQPSTSVCP 439
G+PLTM PNF+D SCEMV+G P + P+ C+ PS + CP
Sbjct: 173 GIPLTMTPNFKDYSCEMVFGTPAQPIPQPPL--LPCW-------QDPSQTPCP 216
Score = 30.0 bits (66), Expect(2) = 5e-18
Identities = 11/25 (44%), Positives = 16/25 (64%)
Frame = +3
Query: 33 MLPPGAPAQFRKLFPPTKWAAEFNA 107
++PP RKLF P++W E+NA
Sbjct: 80 LIPPMMSTLIRKLFRPSRWVCEWNA 104
>ref|NP_680560.1| unknown protein; protein id: At4g01995.1, supported by cDNA:
gi_17065173, supported by cDNA: gi_20259951 [Arabidopsis
thaliana] gi|17065174|gb|AAL32741.1| Unknown protein
[Arabidopsis thaliana] gi|20259952|gb|AAM13323.1|
unknown protein [Arabidopsis thaliana]
Length = 258
Score = 90.5 bits (223), Expect = 2e-17
Identities = 47/106 (44%), Positives = 67/106 (62%), Gaps = 6/106 (5%)
Frame = +2
Query: 110 MTVPFFHWLVGPSEVVEVEI-NGVKQKSGVHIKKC-----SGCVGMCVNMCKTPTQDFFT 271
+TV WL+GPS+V +++ NG SGV ++KC S CVG+C+N CK PTQ FF
Sbjct: 142 VTVLTCQWLMGPSKVNIIDLPNGESWDSGVFVEKCQYLEESKCVGVCINTCKLPTQTFFK 201
Query: 272 NEFGLPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSV 409
+ G+PL M PNF+D SC+ +G APP E+D + C+ + CS+
Sbjct: 202 DYMGVPLVMEPNFKDYSCQFKFGVAPP--EDDGNVNEPCF-ETCSI 244
>gb|AAN65325.1| Hypothetical protein F10G7.9a [Caenorhabditis elegans]
Length = 727
Score = 32.3 bits (72), Expect = 7.7
Identities = 17/49 (34%), Positives = 25/49 (50%)
Frame = +2
Query: 284 LPLTMIPNFEDMSCEMVYGQAPPPFEEDPVSKQTCYAKICSVVPQPSTS 430
LP + N++++S E+V P P DP K + +VV PSTS
Sbjct: 190 LPASFHDNYDEVSMEVVSPDEPQPSPNDPFIKPPIQIPLEAVVSLPSTS 238
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 721,023,822
Number of Sequences: 1393205
Number of extensions: 16827736
Number of successful extensions: 41919
Number of sequences better than 10.0: 12
Number of HSP's better than 10.0 without gapping: 40310
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 41902
length of database: 448,689,247
effective HSP length: 121
effective length of database: 280,111,442
effective search space used: 37534933228
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)