Nr search
BLASTX 2.2.2 [Dec-14-2001]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Query= KMC000760A_C01 KMC000760A_c01
(543 letters)
Database: nr
1,393,205 sequences; 448,689,247 total letters
Searching..................................................done
Score E
Sequences producing significant alignments: (bits) Value
ref|NP_567819.1| putative protein; protein id: At4g28760.1, supp... 78 8e-14
pir||T04523 hypothetical protein F16A16.130 - Arabidopsis thalia... 78 8e-14
ref|NP_199201.1| putative protein; protein id: At5g43880.1 [Arab... 61 1e-08
pir||T18281 hypothetical protein D1 - slime mold (Dictyostelium ... 43 0.002
dbj|BAB21772.1| KIAA1681 protein [Homo sapiens] 43 0.002
>ref|NP_567819.1| putative protein; protein id: At4g28760.1, supported by cDNA:
gi_15293152, supported by cDNA: gi_21281094 [Arabidopsis
thaliana] gi|15293153|gb|AAK93687.1| unknown protein
[Arabidopsis thaliana] gi|21281095|gb|AAM44897.1| unknown
protein [Arabidopsis thaliana]
Length = 924
Score = 77.8 bits (190), Expect = 8e-14
Identities = 49/137 (35%), Positives = 77/137 (55%), Gaps = 2/137 (1%)
Frame = -2
Query: 539 LHEAKRRQRRSNQKLVFDCVNLTLVEITGYQSESHLMGRLWSGDHRGLQVTEGAAPPLLV 360
+HE KRRQ+RS +KL+FD +N + E T ++ + G+ LV
Sbjct: 806 IHEGKRRQQRSTRKLIFDRINSIVSETTTTRTGN------------------GSLHFDLV 847
Query: 359 DLIVTQMKDLISS--GMRSVWESCGDSNSLVVDSIVRKEVVGKGWVEIMGLEVDILVKEV 186
+ + Q+KD +S R E D+NSL +S+V+ E+VG+ W + +E+D E+
Sbjct: 848 EHVWAQLKDWVSDEPSKRDSGEDM-DANSLAAESLVKDEIVGRTWTHSLQVEIDDFGIEI 906
Query: 185 EGKLLEELVEDAVVDLT 135
E +LL+ELVE+AV+DLT
Sbjct: 907 EKRLLQELVEEAVIDLT 923
>pir||T04523 hypothetical protein F16A16.130 - Arabidopsis thaliana
gi|4218122|emb|CAA22976.1| putative protein [Arabidopsis
thaliana] gi|7269731|emb|CAB81464.1| putative protein
[Arabidopsis thaliana]
Length = 880
Score = 77.8 bits (190), Expect = 8e-14
Identities = 49/137 (35%), Positives = 77/137 (55%), Gaps = 2/137 (1%)
Frame = -2
Query: 539 LHEAKRRQRRSNQKLVFDCVNLTLVEITGYQSESHLMGRLWSGDHRGLQVTEGAAPPLLV 360
+HE KRRQ+RS +KL+FD +N + E T ++ + G+ LV
Sbjct: 762 IHEGKRRQQRSTRKLIFDRINSIVSETTTTRTGN------------------GSLHFDLV 803
Query: 359 DLIVTQMKDLISS--GMRSVWESCGDSNSLVVDSIVRKEVVGKGWVEIMGLEVDILVKEV 186
+ + Q+KD +S R E D+NSL +S+V+ E+VG+ W + +E+D E+
Sbjct: 804 EHVWAQLKDWVSDEPSKRDSGEDM-DANSLAAESLVKDEIVGRTWTHSLQVEIDDFGIEI 862
Query: 185 EGKLLEELVEDAVVDLT 135
E +LL+ELVE+AV+DLT
Sbjct: 863 EKRLLQELVEEAVIDLT 879
>ref|NP_199201.1| putative protein; protein id: At5g43880.1 [Arabidopsis thaliana]
gi|8953749|dbj|BAA98068.1|
gene_id:F6B6.2~pir||T04523~similar to unknown protein
[Arabidopsis thaliana] gi|22136044|gb|AAM91604.1|
putative protein [Arabidopsis thaliana]
Length = 836
Score = 60.8 bits (146), Expect = 1e-08
Identities = 44/134 (32%), Positives = 71/134 (52%), Gaps = 3/134 (2%)
Frame = -2
Query: 527 KRRQRRSNQKLVFDCVNLTLVEITGYQSESHLMGRLWSGDHRGLQVTEGAAPPLLVDLIV 348
++R + + LVFD VN L+E+T + SG G+ V +
Sbjct: 705 QKRLGSNVKNLVFDLVNTLLLELTPSYLGPRSSPMILSGKPLGVYV-------------I 751
Query: 347 TQMKDLISSGMRSV---WESCGDSNSLVVDSIVRKEVVGKGWVEIMGLEVDILVKEVEGK 177
+M++ ++ R W+ GD +SL V+ +VR EV G E + LE+D + +E+E K
Sbjct: 752 NRMQECLTGNGRVEDRWWDEDGDLSSLAVNKVVRIEVAEIGSQESLRLEMDSMGEELELK 811
Query: 176 LLEELVEDAVVDLT 135
LLEELVE+A++DL+
Sbjct: 812 LLEELVEEALMDLS 825
>pir||T18281 hypothetical protein D1 - slime mold (Dictyostelium discoideum)
gi|2702256|gb|AAC18632.1| D1 ORF [Dictyostelium
discoideum]
Length = 1474
Score = 43.1 bits (100), Expect = 0.002
Identities = 27/63 (42%), Positives = 35/63 (54%), Gaps = 1/63 (1%)
Frame = +2
Query: 65 YNLQYCKNKAESHMRDSNQFRPCLSNQPLHPPQVPQATSPPLPSPKYPPPVP*S-QPTLS 241
+N Y K K + + D+N+ SN+P PPQ PQ PPL P+ PP P QPT+
Sbjct: 712 WNYIYLKLKKKYLLNDNNEN----SNRPQPPPQPPQ---PPLQPPQPPPQPPKQPQPTIP 764
Query: 242 PQP 250
PQP
Sbjct: 765 PQP 767
>dbj|BAB21772.1| KIAA1681 protein [Homo sapiens]
Length = 1236
Score = 43.1 bits (100), Expect = 0.002
Identities = 28/71 (39%), Positives = 30/71 (41%), Gaps = 1/71 (1%)
Frame = +2
Query: 83 KNKAESHMRDSNQFRPCLSNQP-LHPPQVPQATSPPLPSPKYPPPVP*SQPTLSPQPLF* 259
K + ES R P LS QP + P SPPLP P PPP P P P PL
Sbjct: 578 KARMESMNRPYTSLVPPLSPQPKIVTPYTASQPSPPLPPPPPPPPPPPPPPPPPPPPLPS 637
Query: 260 QYYPQPGCCCP 292
Q P G P
Sbjct: 638 QSAPSAGSAAP 648
Score = 35.8 bits (81), Expect = 0.35
Identities = 21/56 (37%), Positives = 26/56 (45%), Gaps = 15/56 (26%)
Frame = +2
Query: 128 PCLSNQPLHP------PQVP---------QATSPPLPSPKYPPPVP*SQPTLSPQP 250
P + +QPL P PQ P Q +S P+PSP +PPP P S P P
Sbjct: 862 PAMESQPLKPVPANVAPQSPPAVKAKPKWQPSSIPVPSPDFPPPPPESSLVFPPPP 917
Database: nr
Posted date: Apr 1, 2003 2:05 AM
Number of letters in database: 448,689,247
Number of sequences in database: 1,393,205
Lambda K H
0.318 0.135 0.401
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 542,210,659
Number of Sequences: 1393205
Number of extensions: 14406965
Number of successful extensions: 119970
Number of sequences better than 10.0: 1264
Number of HSP's better than 10.0 without gapping: 63033
Number of HSP's successfully gapped in prelim test: 0
Number of HSP's that attempted gapping in prelim test: 0
Number of HSP's gapped (non-prelim): 99569
length of database: 448,689,247
effective HSP length: 115
effective length of database: 288,470,672
effective search space used: 18750593680
frameshift window, decay const: 50, 0.1
T: 12
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)